ChatSOS: Vector database augmented generative question answering assistant in safety engineering

Haiyang Tang; Dongping Chen; Qingzhao Chu; Zhenyi Liu

doi:10.1016/j.eswa.2025.128644

ChatSOS: Vector database augmented generative question answering assistant in safety engineering

Haiyang Tang, Dongping Chen^*, Qingzhao Chu, Zhenyi Liu

^*此作品的通讯作者

Beijing Institute of Technology

科研成果: 期刊稿件 › 文章 › 同行评审

摘要

With the rapid advancement of natural language processing technologies, generative artificial intelligence techniques, represented by large language models (LLMs), are gaining increasing prominence and demonstrating significant potential for applications in safety engineering. However, foundational LLMs face constraints such as limited training data coverage and unreliable responses. This study develops a vector database from 117 explosion accident reports in China spanning 2013 to 2023, employing techniques such as corpus segmenting and vector embedding. By utilizing the vector database, which outperforms the relational database in information retrieval quality, we provide LLMs with richer, more relevant knowledge. Comparative analysis of LLMs demonstrates that ChatSOS significantly enhances reliability, accuracy, and comprehensiveness, improves adaptability and clarification of responses. These results illustrate the effectiveness of supplementing LLMs with an external database, highlighting their potential to handle professional queries in safety engineering and laying a foundation for broader applications.

源语言	英语
文章编号	128644
期刊	Expert Systems with Applications
卷	294
DOI	http://doi.org/10.1016/j.eswa.2025.128644
出版状态	已出版 - 15 12月 2025
已对外发布	是

访问文件

10.1016/j.eswa.2025.128644

其它文件与链接

链接到 Scopus 的出版物

引用此

@article{d353f86e6e704f1c81f9571663c210cb,

title = "ChatSOS: Vector database augmented generative question answering assistant in safety engineering",

abstract = "With the rapid advancement of natural language processing technologies, generative artificial intelligence techniques, represented by large language models (LLMs), are gaining increasing prominence and demonstrating significant potential for applications in safety engineering. However, foundational LLMs face constraints such as limited training data coverage and unreliable responses. This study develops a vector database from 117 explosion accident reports in China spanning 2013 to 2023, employing techniques such as corpus segmenting and vector embedding. By utilizing the vector database, which outperforms the relational database in information retrieval quality, we provide LLMs with richer, more relevant knowledge. Comparative analysis of LLMs demonstrates that ChatSOS significantly enhances reliability, accuracy, and comprehensiveness, improves adaptability and clarification of responses. These results illustrate the effectiveness of supplementing LLMs with an external database, highlighting their potential to handle professional queries in safety engineering and laying a foundation for broader applications.",

keywords = "Accident analysis, Large language models, Prompt engineering, Safety engineering, Vector database",

author = "Haiyang Tang and Dongping Chen and Qingzhao Chu and Zhenyi Liu",

note = "Publisher Copyright: {\textcopyright} 2025 Elsevier Ltd",

year = "2025",

month = dec,

day = "15",

doi = "10.1016/j.eswa.2025.128644",

language = "English",

volume = "294",

journal = "Expert Systems with Applications",

issn = "0957-4174",

publisher = "Elsevier Ltd.",

}

TY - JOUR

T1 - ChatSOS

T2 - Vector database augmented generative question answering assistant in safety engineering

AU - Tang, Haiyang

AU - Chen, Dongping

AU - Chu, Qingzhao

AU - Liu, Zhenyi

PY - 2025/12/15

Y1 - 2025/12/15

N2 - With the rapid advancement of natural language processing technologies, generative artificial intelligence techniques, represented by large language models (LLMs), are gaining increasing prominence and demonstrating significant potential for applications in safety engineering. However, foundational LLMs face constraints such as limited training data coverage and unreliable responses. This study develops a vector database from 117 explosion accident reports in China spanning 2013 to 2023, employing techniques such as corpus segmenting and vector embedding. By utilizing the vector database, which outperforms the relational database in information retrieval quality, we provide LLMs with richer, more relevant knowledge. Comparative analysis of LLMs demonstrates that ChatSOS significantly enhances reliability, accuracy, and comprehensiveness, improves adaptability and clarification of responses. These results illustrate the effectiveness of supplementing LLMs with an external database, highlighting their potential to handle professional queries in safety engineering and laying a foundation for broader applications.

AB - With the rapid advancement of natural language processing technologies, generative artificial intelligence techniques, represented by large language models (LLMs), are gaining increasing prominence and demonstrating significant potential for applications in safety engineering. However, foundational LLMs face constraints such as limited training data coverage and unreliable responses. This study develops a vector database from 117 explosion accident reports in China spanning 2013 to 2023, employing techniques such as corpus segmenting and vector embedding. By utilizing the vector database, which outperforms the relational database in information retrieval quality, we provide LLMs with richer, more relevant knowledge. Comparative analysis of LLMs demonstrates that ChatSOS significantly enhances reliability, accuracy, and comprehensiveness, improves adaptability and clarification of responses. These results illustrate the effectiveness of supplementing LLMs with an external database, highlighting their potential to handle professional queries in safety engineering and laying a foundation for broader applications.

KW - Accident analysis

KW - Large language models

KW - Prompt engineering

KW - Safety engineering

KW - Vector database

UR - http://www.scopus.com/pages/publications/105009234196

U2 - 10.1016/j.eswa.2025.128644

DO - 10.1016/j.eswa.2025.128644

M3 - Article

AN - SCOPUS:105009234196

SN - 0957-4174

VL - 294

JO - Expert Systems with Applications

JF - Expert Systems with Applications

M1 - 128644

ER -

ChatSOS: Vector database augmented generative question answering assistant in safety engineering

摘要

访问文件

其它文件与链接

指纹

引用此