[Objective] This paper builds a Q-LDA model to identify topics of online health community, aiming to improve the quality of information generated by the LDA model, as well as its theme representation ability. [Methods] Firstly, we evaluated and weighted the online health information. Then, we constructed a Q-LDA topic mining model based on the LDA model. Finally, we examined the proposed model with real world data. [Results] The Q-LDA model yielded better results than the traditional LDA model. The efficiency of extracting topics was improved by 16%. [Limitations] We only examined the proposed model with textual data from online discussion boards on one disease. [Conclusions] Adding quality of health information to data mining could help us meet the needs of users.
杨磊,王子润,侯贵生. 基于Q-LDA主题模型的网络健康社区主题挖掘研究 *[J]. 数据分析与知识发现, 2019, 3(11): 52-59.
Lei Yang,Zirun Wang,Guisheng Hou. Discovering Topics of Online Health Community with Q-LDA Model. Data Analysis and Knowledge Discovery, 2019, 3(11): 52-59.
Fox S . The Social Life of Health Information[EB/OL]. [ 2017- 10- 29]. http://www.pewresearch.org/fact-tank/2014/01/15/thesocial-life-of-health-information/ .
[2]
Liu Y, Jin J, Ji P , et al. Identifying Helpful Online Reviews: A Product Designer’s Perspective[J]. Computer-Aided Design, 2013,45(2):180-194.
doi: 10.1016/j.cad.2012.07.008
( Qian Minghui, Xu Zhixuan, Wang Shan . Information Service Quality of Online Health Platform Based on User Participation[J]. Journal of the China Society for Scientific and Technical Information, 2019,38(2):132-142.)
( Luo Xiaolan, Han Jingti, Fan Weiguo , et al. Health Information and Health Anxiety in the Internet Age[J]. Information and Documentation Services, 2019,40(2):76-86.)
( Li Yuelin, Zhang Xiu, Wang Shanshan . Health Information Quality in Social Media: An Analysis Based on the Features of Real and Fake Health Information[J]. Journal of the China Society for Scientific and Technical Information, 2018,37(3):294-304.)
[6]
Medical Library Association. The Medical Library Association Task Force on Health Information Literacy [EB/OL]. [2017-02-28]. https://www.mlanet.org/resources/healthlit/define.html.
( Zhang Min, Nie Rui, Luo Meifen . Analysis on the Effect of Health Literacy on Users’ Online Health Information Seeking Behavior[J]. Library and Information Service, 2016,60(7):103-109.)
( Li Yuelin, Cai Wenjuan . A Review of the Studies on Health Information Seeking Behavior Overseas[J]. Library and Information Service, 2012,56(19):128-132.)
( Mu Dongmei, Ju Yuanhong, Dai Wenhao , et al. Knowledge Discovery Strategy and Model of Virtual Health Community Text Data[J]. Library and Information Service, 2018,62(5):125-131.)
( Mo Zuying, Ma Feicheng . Game Analysis of Information Resources Quality Control in the Network Environment[J]. Information Studies: Theory & Application, 2012,35(8):26-30.)
( Song Lirong, Zhang Qun, Qi Na . Problems in Information Quality on Medical and Health Websites in China[J]. China Journal of Medical Library and Information Science, 2014,23(9):1-6.)
[12]
Shahar S, Shirley N, Noah S A . Quality and Accuracy Assessment of Nutrition Information on the Web for Cancer Prevention[J]. Medical Informatics, 2013,38(1):15-26.
doi: 10.3109/17538157.2012.710684
pmid: 22957981
[13]
Bizzi I, Ghezzi P, Paudyal P . Health Information Quality of Websites on Periodontology[J]. Journal of Clinical Periodontology, 2017,44(3):308-314.
doi: 10.1111/jcpe.12668
pmid: 28005268
( Zhao Yusui, Xu Yan, Wu Qingqing , et al. The Development of an Evaluation Index System on Health Information on the Internet Using Delphi Method[J]. Preventive Medicine, 2018,30(2):121-124.)
( Qian Minghui, Xu Zhixuan, Lian Yi . Information Quality Evaluation and Brand Inspiration of Online Health Consultation Platform[J]. Information and Documentation Services, 2018,39(3):57-63.)
( Zhong Le, Liu Wei, Yin Fei . Information Quality Evaluation of Chinese Websites on Attention Deficit Hyperactivity Disorder[J]. Chinese Mental Health Journal, 2010,24(10):780-784.)
[17]
Corcelles R, Daigle C, Talamas H R . Assessment of the Quality of Internet Information on Sleeve Gastrectomy[J]. Surgery for Obesity and Related Diseases, 2015,11(3):539-544.
doi: 10.1016/j.soard.2014.08.014
pmid: 25604832
[18]
Yagci I A, Das S . Measuring Design-Level Information Quality in Online Reviews[J]. Electronic Commerce Research and Applications, 2018,30:102-110.
doi: 10.1016/j.elerap.2018.05.010
[19]
di Sciascio C, Strohmaier D, Errecalde M , et al. WikiLyzer: Interactive Information Quality Assessment in Wikipedia [C]// Proceedings of the 22nd International Conference on Intelligent User Interfaces. ACM, 2017: 377-388.
[20]
Utkin L V . A New Ranking Procedure by Incomplete Pairwise Comparisons Using Preference Subsets[J]. Intelligent Data Analysis, 2009,13(2):229-241.
doi: 10.3233/IDA-2009-0365
[21]
Hullermeier E, Furnkranz J . Ranking by Pairwise Comparison a Note on Risk Minimization [C]// Proceedings of the 2004 IEEE International Conference on Fuzzy Systems. 2004: 97-102.
( Li Hongliu, Wang Xingyuan . The Effect of Online User Reviews on Customer Value Creation: From the Perspective of Price Decision[J]. Price: Theory&Practice, 2018(1):150-152.)
[23]
Schubert J, Hörling P . Preference-based Monte Carlo Weight Assignment for Multiple-criteria Decision Making in Defense Planning [C]// Proceedings of the 17th International Conference on Information Fusion. IEEE, 2014.
( Deng Shengli, Zhao Haiping . Research on the Standard Framework of the Quality and the Content Evaluation of Online Health Information from Users’ Perspective[J]. Library and Information Service, 2017,61(21):30-39.)
[25]
Liu K Y, Haukoos J S, Sasson C . Availability and Quality of Cardiopulmonary Resuscitation Information for Spanish- speaking Population on the Internet[J]. Resuscitation, 2014,85(1):131-137.
doi: 10.1016/j.resuscitation.2013.08.274
( Ruan Guangce . Topic Extraction Research of Net Reviews Based on Latent Dirichlet Allocation[J]. Journal of Intelligence, 2014,33(3):161-164.)
[27]
Lu Y, Wu Y, Liu J . Understanding Health Care Social Media Use from Different Stakeholder Perspectives: A Content Analysis of an Online Health Community[J]. Journal of Medical Internet Research, 2017,19(4):e109.
doi: 10.2196/jmir.7087
pmid: 28389418
( Li Xiangdong, Ding Cong, Gao Fan . The Research of Bibliographic Information Classification Method Based on the Composite Weighted LDA Model[J]. Journal of the China Society for Scientific and Technical Information, 2017,36(4):26-34.)
[29]
Oğuz F, Elif Şengün A . Mystery of the Unknown: Revisiting Tacit Knowledge in the Organizational Literature[J]. Journal of Knowledge Management, 2011,15(3):445-461.
doi: 10.1108/13673271111137420
( Deng Shengli, Zhao Haiping . Quality Evaluation of Foreign Network Health Information: A Review of Indicators, Tools and Results[J]. Information and Documentation Services, 2017,38(1):69-76.)