[Objective] This paper classifies Baidu encyclopedia entries based on users’ information behaviors, aiming to identify entries with high potential values. [Methods] We chose the usage and recognition levels as indicators, and proposed a new entry classification model base on Boston matrix and BP neural network. [Results] We classified the Baidu encyclopedia entries automatically with usage indicators and created development strategies for each category. Our new model correctly identified each entry’s category information. [Limitations] More research is needed to study the newly generated entries and features difficult to quantify. [Conclusions] This research proposed an effective method to automatically classify online encyclopedia entries.
Stvilia B, Twidale M B, Smith L C, et al.Assessing Information Quality of a Community-based Encyclopedia[C]// Proceedings of International Conference on Information Quality, 2005: 442-454.
Warncke-Wang M, Cosley D, Riedl J. Tell Me More: An Actionable Quality Model for Wikipedia[C]// Proceedings of the 9th International Symposium on Open Collaboration. ACM, 2013: Article No.8.
Blumenstock J E.Size Matters: Word Count as a Measure of Quality on Wikipedia[C]// Proceedings of the 17th International Conference on World Wide Web, 2008: 1095-1096.
Dalip D H, Gonçalves M A, Cristo M, et al.Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia[C]// Proceedings of the 9th ACM/IEEE-CS Joint Conference on Digital Libraries. ACM, 2009:295-304.
Wöhner T, Peters R. Assessing the Quality of Wikipedia Articles with Lifecycle Based Metrics[C]// Proceedings of the 5th International Symposium on Wikis and Open Collaboration, Orlando, Florida, USA.2009: Article No.16.
Wang S, Iwaihara M.Quality Evaluation of Wikipedia Articles Through Edit History and Editor Groups[C]// Proceedings of Asia-Pacific Web Conference on Web Technologies and Applications. Springer-Verlag, 2011:188-199.
Xu Y, Luo T.Measuring Article Quality in Wikipedia: Lexical Clue Model[C]// Proceedings of the 2011 3rd Symposium on Web Society. IEEE, 2011:141-146.
Suzuki Y, Yoshikawa M. Mutual Evaluation of Editors and Texts for Assessing Quality of Wikipedia Articles[C]// Proceedings of the 8th Annual International Symposium on Wikis and Open Collaboration. ACM, 2012: Article No.18.
Ferretti E, Fusilier D H, Cabrera R G, et al.On the Use of PU Learning for Quality Flaw Prediction in Wikipedia[C]// Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012.
Dalip D H, Goncalves M A, Cardoso T, et al.A Multi-view Approach for the Quality Assessment of Wiki Articles[J]. Journal of Information & Data Management, 2012, 3(1):73-82.
Flekova L, Ferschke O, Gurevych I.What Makes a Good Biography? Multidimensional Quality Analysis Based on Wikipedia Article Feedback Data[C]// Proceedings of the International Conference on World Wide Web. ACM, 2014:855-866.
Dang Q V, Ignat C L. Quality Assessment of Wikipedia Articles: A Deep Learning Approach by Quang Vinh Dang and Claudia-Lavinia Ignat with Martin Vesely as Coordinator[J]. ACM SIGWEB Newsletter, 2016, 5: Article No.5.
Khairova N, Lewoniewski W, Wecel K.Estimating the Quality of Articles in Russian Wikipedia Using the Logical-Linguistic Model of Fact Extraction[C]// Proceedings of the International Conference on Business Information Systems(BIS 2017). 2017:28-40.
Shen A L, Qi J Z, Baldwin T.A Hybrid Model for Quality Assessment of Wikipedia Articles[C] // Proceedings of Australasian Language Technology Association Workshop, 2017:43-52.
(Qiu Jiangnan, Weng Nan, Xu Shengguo.Research on Evaluation Model for the Information Quality of Wikipedia Articles Based on C4.5[J]. Journal of the China Society for Scientific and Technical Information, 2012, 31(12): 1259-1264.)