[Objective] This paper proposes a new method to rank the quality of answers from a popular Q&A community in China. [Methods] First, based on the information acceptance model, we established initial quality indicators for the answer’s perceived values. Then, we discretized these indicators with the K-Medoids clustering algorithm. Third, we reduced and weighted the indictors with the help of rough set theory. Finally, we generated the formal rankings with the weighted grey correlation analysis. [Results] We evaluated the proposed method with 2 297 answers for six different types of questions from the Q&A website of “Zhihu”. We found that the answers ranked higher generally included textual message with images. These answers were also more informative than others and involved active members of the Q&A community. [Limitations] The size of our dataset needs to be expanded, and the evaluation method of the proposed model could be optimized. [Conclusions] The proposed method is an effective way to rank the quality of answers from the Q&A community.
易明,张婷婷. 大众性问答社区答案质量排序方法研究*[J]. 数据分析与知识发现, 2019, 3(6): 12-20.
Ming Yi,Tingting Zhang. Ranking Answer Quality of Popular Q&A Community. Data Analysis and Knowledge Discovery, 2019, 3(6): 12-20.
Hosseini M, Moore J, Almaliki M, et al.Wisdom of the Crowd Within Enterprises: Practices and Challenges[J]. Computer Networks, 2015, 90: 121-132.
[2]
Fichman P.A Comparative Assessment of Answer Quality on Four Question Answering Sites[J]. Journal of Information Science, 2011, 37(5): 476-486.
[3]
Zhu Z, Bernhard D, Gurevych I.A Multi-Dimensional Model for Assessing the Quality of Answers in Social Q&A Sites[C]// Proceedings of the 2009 International Conference on Information Quality.2009: 264-265.
[4]
Shah C, Pomerantz J.Evaluating and Predicting Answer Quality in Community QA[C]// Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval.2010: 411-418.
[5]
Yan Z, Zhou J.Optimal Answerer Ranking for New Questions in Community Question Answering[J].Information Processing and Management, 2015, 51(1): 163-178.
[6]
Yang L, Qiu M, Gottipati S, et al.CQArank: Jointly Model Topics and Expertise in Community Question Answering[C]// Proceedings of the 22nd ACM International Conference on Information & Knowledge Management. 2013: 99-108.
(Liu Yu, Yuan Jian.Candidate Answer Sorting Method of Q&A Community Questions Based on RTEM Model[J]. Electronic Science and Technology, 2016, 29(5): 130-134.)
(Zhang Cheng, Qu Mingcheng, Ni Ning, et al.Automatic Answer Selection Based on Probabilistic Latent Semantic Analysis Model[J]. Computer Engineering, 2011, 37(14): 70-72.)
[9]
Guo L, Hu X.Identifying Authoritative and Reliable Contents in Community Question Answering with Domain Knowledge[C]//Proceedings of the 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining. 2013: 133-142.
(Lai Shean, Cai Zhongmin.Question Answering Quality Evaluation for Community Question Answering Based on Similarity[J]. Computer Applications and Software, 2013, 30(2): 266-269.)
(Wang Wei, Ji Yuqiang, Wang Hongwei, et al.Evaluating Chinese Answers’ Quality in the Community QA System: A Case Study of Zhihu[J].Library and Information Service, 2017, 61(22): 36-44.)
[12]
Ginsca A L, Popescu A.User Profiling for Answer Quality Assessment in Q&A Communities[C]//Proceedings of the 2013 Workshop on Data-Driven User Behavioral Modelling and Mining from Social Media.2013: 25-28.
(Kong Weize, Liu Yiqun, Zhang Min, et al.Answer Quality Analysis on Community Question Answering[J]. Journal of Chinese Information Processing, 2011, 25(1): 3-8.)
(Jiang Wen, Xu Xin, Wu Gaofeng.Online Q&A Community Automatically Information Quality Evaluation with Sentiment Feature[J]. Library and Information Service, 2015, 59(4): 100-105.)
[15]
John B M, Chua A Y K, Goh D H L. What Makes a High-Quality User-Generated Answer?[J]. IEEE Internet Computing, 2011, 15(1): 66-71.
(Li Chen, Chao Wenhan, Chen Xiaoming, et al.Quality Evaluation and Prediction for Question and Answer in Chinese Community Question Answering[J]. Computer Science, 2011, 38(6): 230-236.)
[17]
Sussman S W, Siegal W S.Informational Influence in Organizations: An Integrated Approach to Knowledge Adoption[J]. Information Systems Research, 2003, 14(1): 47-65.
(Wang Hongwei, Meng Yuan.Helpful Features Identification of Online Reviews Quality Based on GBDT Feature Contribution[J].Journal of Chinese Information Processing, 2017, 31(3): 109-117.)
[19]
Radev D R, Jing H, Styś M, et al.Centroid-Based Summarization of Multiple Documents[J]. Information Processing & Management, 2004, 40(6): 919-938.
[20]
Joyce E, Kraut R.Predicting Continued Participation in Newsgroups[J]. Journal of Computer-Mediated Communication, 2006, 11(3): 723-747.
(Zhou Zhiyuan, Shen Guchao.Application of Rough Set Theory in Determining the Weight of Intelligence Analysis Index[J]. Information Studies: Theory&Application, 2012, 35(9): 61-65.)
(Zhang Zhengchao, Guan Xin, He You, et al.Rough Sets Data Processing Method and Its Research[J]. Computer Technology and Development, 2010, 20(4): 12-16, 20.)
(Zhang Xueping, Gong Kangli, Zhao Guangcai.Parallel K-Medoids Algorithm Based on MapReduce[J]. Journal of Computer Applications, 2013, 33(4): 1023-1025, 1035.)
[24]
Fahad A, Alshatri N, Tari Z, et al.A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis[J]. IEEE Transactions on Emerging Topics in Computing, 2014, 2(3): 267-279.
[25]
PawlakZ. Rough Set[J]. International Journal of Computer and Information Sciences, 1982, 11(5): 341-356.
(Sun Jingjing.Research on Attribute Reduction and Rule Reduction Algorithm of Decision Table Based on Rough Set Theory[D]. Zhengzhou: Information Engineering University, 2005.)
[27]
邓聚龙. 灰色系统基本方法[M]. 武汉: 华中理工大学出版社, 1987.
[27]
(Deng Julong.Basic Methods of Grey System[M]. Wuhan: Huazhong University of Science & Technology Press, 1987.)
(Yu Liang, Fang Zhigeng, Wu Lifeng, et al.Maximum Entropy Configuration Model of Objective Index Weight Based on Grey Category Characteristics Difference[J]. Systems Engineering- Theory&Practice, 2014, 3(8): 2065-2070.)
[29]
黄涛. 基于灰色关联度分析的模糊群决策方法研究[D].广州: 华南理工大学, 2016.
[29]
(Huang Tao.Research of Fuzzy Multi-Attribute Decision Making Method Based on Grey Correlation Analysis[D]. Guangzhou: South China University of Technology, 2016.)