Research on User Information Requirement in Chinese Network Health Community: Taking Tumor-forum Data of Qiuyi as an Example
Quan Lu1,2,Anqi Zhu1,Jiyue Zhang1,Jing Chen3()
1School of Information Management, Wuhan University, Wuhan 430072, China 2Big Data Research Institute, Wuhan University, Wuhan 430072, China 3School of Information Management, Central China Normal University, Wuhan 430079, China
[Objective] This paper constructs an information demand mining framework of Chinese online health community users adapted to the big data environment, and analyzes the user information needs by taking the data of tumor-forum as an example. [Methods] The Latent Semantic Indexing (LSI) model and MapReduce distributed text clustering technology were used in this framework to mine the user information needs. We use all the Q&A data (24,305 in total) from tumor-forum of Chinese online health community (qiuyi.cn) as the experimental data source. [Results] The proposed framework mines the five information needs and their proportions of the tumor users: treatment (43.3%), pathology and etiology (34.5%), examination (12.1%), postoperative (7.0%), prevention (3.1%), and top 20 keywords of these needs. The analysis shows the growth of each needs, and the significant difference between domestic users and foreign users. Gender differences are also significant, the male need treatment information most, while female need pathological and etiological information most. Age difference is large too, and the information needs of young people are the largest (83.79%), etc. [Limitations] There may be better threshold selection, and the medical thesaurus is not prefect. The analysis of information needs is not multidimensional. [Conclusions] The proposed framework is feasible. The paper found the trend of the demand distribution changes with year and the distribution of users information needs vary with age or gender.
陆泉,朱安琪,张霁月,陈静. 中文网络健康社区中的用户信息需求挖掘研究*——以求医网肿瘤板块数据为例[J]. 数据分析与知识发现, 2019, 3(4): 22-32.
Quan Lu,Anqi Zhu,Jiyue Zhang,Jing Chen. Research on User Information Requirement in Chinese Network Health Community: Taking Tumor-forum Data of Qiuyi as an Example. Data Analysis and Knowledge Discovery, 2019, 3(4): 22-32.
(CNNIC. The 39th Statistical Report on the Development of Internet in China[EB/OL]. [2017-01-22]. http://www.cnnic.cn/hlwfzyj/ hlwxzbg/hlwtjbg/201601/P020160122444930951954.pdf
(National Health and Family Planning Commission. “13th Five-Year Plan”: National Population Health Information Development Plan[R/OL]. [2018-03-21]. http://ghs.ndrc.gov.cn/ghwb/gjjgh/ 201707/t20170720_855014.html
(Zhao Haiping, Deng Shengli.Literature Review of Users' Health Information Behavior in Social Q&A Platform: Research Topic and Method[J]. Journal of Information Resources Management, 2016(4): 19-27.)
[5]
Oh S, Zhang Y, Park M S.Cancer Information Seeking in Social Question and Answer Services: Identifying Health-Related Topics in Cancer Questions on Yahoo! Answers[J]. Information Research, 2016, 21(3). http://www.informationr. net/ir/21-3/paper718.html#.XLlSRvl6enE
[6]
Tsuya A, Sugawara Y, Tanaka A, et al.Do Cancer Patients Tweet? Examining the Twitter Use of Cancer Patients in Japan[J]. Journal of Medical Internet Research, 2014, 16(e5): e137.
[7]
Shaw R J, Johnson C M.Health Information Seeking and Social Media Use on the Internet Among People with Diabetes[J]. Online Journal of Public Health Informatics, 2011, 3(1). DOI:10.5210/ojphi.v3i1.3561.
(Wei Yongting, Chen Ying, Xu Yahong.Investigation and Analysis of the Health Information Needs Among Patients with Cancer During Chemotherapy in Hospital[J]. Nursing Practice and Research, 2013, 10(11): 152-153.)
(Huang Xuewei, Zhang Ying, Wang Xiuli, et al.Information Needs of Cancer Patients: Development and Evaluation of Information Preference Questionnaire for Cancer Patients[J]. Chinese Mental Health Journal, 2003, 17(11): 750-753.)
[10]
Valero-Aguilera B, Bermudez-Tamayo C, Francisco Garcia-Gutierrez J, et al. Information Needs and Internet Use in Urological and Breast Cancer Patients[J]. Supportive Care in Cancer, 2014, 22(2): 545-552.
[11]
Friedemann-Sanchez G, Griffin J M, Partin M R.Gender Differences in Colorectal Cancer Screening Barriers and Information Needs[J]. Health Expectations, 2007, 10(2): 148-160.
(Zhang Xinyao, Cao Jindan.The Analysis of Influence Factors of Health Information Network Users' Requirement[J]. Medicine and Society, 2010, 23(9): 25-27.)
(Wu Yanyan, Jiang Yafang.Investigation of Information Needs of Chemotherapy Inpatients[J]. Chinese Journal of Modern Nursing, 2010, 16(4): 384-387.)
[15]
Oh H J, Lauckner C, Boehmer J, et al.Facebooking for Health: An Examination into the Solicitation and Effects of Health-Related Social Support on Social Networking Sites[J]. Computers in Human Behavior, 2013, 29(5): 2072-2080.
[16]
Ramo D E, Liu H, Prochaska J J.A Mixed-Methods Study of Young Adults' Receptivity to Using Facebook for Smoking Cessation: If You Build It, Will They Come?[J]. American Journal of Health Promotion, 2015, 29(4): e126-e135.
[17]
Bernad V M, Maderuelo F J Á, Moreno G P. Information Needs of the Health and Diseases in Users of Healthcare Services in Primary Care at Salamanca, Spain[J]. Atencion Primaria, 2016, 48(1): 15-24.
[18]
Bowler L, Oh J S, He D, et al.Eating Disorder Questions in Yahoo! Answers: Information, Conversation, or Reflection?[C]// Proceedings of the American Society for Information Science and Technology. 2012.
(Jin Biyi, Xu Xin.Health Information Needs of Diabetics in Social Q&A Community[J]. Chinese Journal of Medical Library and Information Science, 2014, 23(12): 37-42.)
[20]
Stonbraker S, Larson E.Health-information Needs of HIV-positive Adults in Latin America and the Caribbean: An Integrative Review of the Literature[J]. Aids Care, 2016, 28(10): 1223-1229.
[21]
吕英杰. 网络健康社区中的文本挖掘方法研究[D]. 上海: 上海交通大学, 2013.
[21]
(Lv Yingjie.Research on Text Mining in Online Health Community[D]. Shanghai: Shanghai Jiao Tong University, 2013.)
(Li Chongyang, Zhai Shanshan, Zhen Lu.Measurement of Information Demand Characteristics in Online Health Community: An Empirical Analysis Based on Time and Theme Perspective[J]. Digital Library Forum, 2016(9): 34-42.)
(Wu Jiang, Hou Shaoxin, Jin Mengmeng, et al.LDA Feature Selection Based Text Classification and User Clustering in Chinese Online Health Community[J]. Journal of the China Society for Scientific and Technical Information, 2017, 36(11): 1183-1191.)
(Zheng Yingxin.Application of Clustering Analysis Based on Elbow Rule in Data Mining in the Optimization Design of Primary and Secondary School Students' Travel Routes[J]. Electronics World, 2017(9): 146.)
[28]
Kanungo T, Mount D M, Netanyahu N S, et al.An Efficient K-means Clustering Algorithm: Analysis and Implementation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7): 881-892.
[29]
Cho J, Noh H, Ha M H, et al.What Kind of Cancer Information Do Internet Users Need?[J]. Supportive Care in Cancer, 2011, 19(9): 1465-1469.
[30]
Chen W.Cancer Statistics: Updated Cancer Burden in China[J]. Chinese Journal of Cancer Research, 2015, 27(1): 1.