TY - JOUR
T1 - ChatSOS
T2 - Vector database augmented generative question answering assistant in safety engineering
AU - Tang, Haiyang
AU - Chen, Dongping
AU - Chu, Qingzhao
AU - Liu, Zhenyi
N1 - Publisher Copyright:
© 2025 Elsevier Ltd
PY - 2025/12/15
Y1 - 2025/12/15
N2 - With the rapid advancement of natural language processing technologies, generative artificial intelligence techniques, represented by large language models (LLMs), are gaining increasing prominence and demonstrating significant potential for applications in safety engineering. However, foundational LLMs face constraints such as limited training data coverage and unreliable responses. This study develops a vector database from 117 explosion accident reports in China spanning 2013 to 2023, employing techniques such as corpus segmenting and vector embedding. By utilizing the vector database, which outperforms the relational database in information retrieval quality, we provide LLMs with richer, more relevant knowledge. Comparative analysis of LLMs demonstrates that ChatSOS significantly enhances reliability, accuracy, and comprehensiveness, improves adaptability and clarification of responses. These results illustrate the effectiveness of supplementing LLMs with an external database, highlighting their potential to handle professional queries in safety engineering and laying a foundation for broader applications.
AB - With the rapid advancement of natural language processing technologies, generative artificial intelligence techniques, represented by large language models (LLMs), are gaining increasing prominence and demonstrating significant potential for applications in safety engineering. However, foundational LLMs face constraints such as limited training data coverage and unreliable responses. This study develops a vector database from 117 explosion accident reports in China spanning 2013 to 2023, employing techniques such as corpus segmenting and vector embedding. By utilizing the vector database, which outperforms the relational database in information retrieval quality, we provide LLMs with richer, more relevant knowledge. Comparative analysis of LLMs demonstrates that ChatSOS significantly enhances reliability, accuracy, and comprehensiveness, improves adaptability and clarification of responses. These results illustrate the effectiveness of supplementing LLMs with an external database, highlighting their potential to handle professional queries in safety engineering and laying a foundation for broader applications.
KW - Accident analysis
KW - Large language models
KW - Prompt engineering
KW - Safety engineering
KW - Vector database
UR - http://www.scopus.com/pages/publications/105009234196
U2 - 10.1016/j.eswa.2025.128644
DO - 10.1016/j.eswa.2025.128644
M3 - Article
AN - SCOPUS:105009234196
SN - 0957-4174
VL - 294
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 128644
ER -