Hai Jiali, Wang Run, Yuan Liangzhi, Zhang Kairui, Deng Wenping, Xiao Yong, Zhou Tao, Chang Kai
Online available: 2025-02-11
[Objective]To develop a retrieval-augmented question-answering(QA) system for Traditional Chinese Medicine (TCM) standards, aimed at accurately and effectively delivering high-quality standardized TCM knowledge and practical experience to clinicians and the general public. This system seeks to enhance the research and application of TCM standardization. [Methods]By comparing the performance of existing large language models (such as BaiChuan, Gemma, Tongyi Qianwen, etc.), the GPT-3.5 model was selectively chosen as the foundational model. This was combined with data optimization and retrieval-augmented generation techniques to develop a TCM standards question-answering system with capabilities in semantic analysis, contextual association, and content generation. [Results]The retrieval-augmented TCM standards QA system demonstrated answer relevance with precision, recall, and F1 scores of 87.9%, 83.9%, and 85.7%, respectively, on the TCM literature question generation dataset, and 87.1%, 83.6%, and 85.3%, respectively, on the TCM standards QA dataset. Contextual relevance on the TCM literature question generation dataset showed precision, recall, and F1 scores of 83.8%, 86.9%, and 85.3% respectively.These metrics outperformed the compared models, indicating that this system can more accurately answer questions related to TCM standards.[Limitations]The current system's intent recognition module requires further optimization, and the TCM standards knowledge base needs to be expanded and refined at a more granular level. [Conclusions]This study addresses the practical needs of TCM knowledge services by exploring the construction of a retrieval-augmented TCM standards QA system. This system can answer various questions related to TCM treatment guidelines, herbal medicine standards, information standards, etc., including treatment principles, disease classification, treatment methods, and technical requirements of TCM standards, demonstrating high practicality and feasibility.