1(Linguistics Research Center, Dalian University of Foreign Languages, Dalian 116044, China) 2(Research Center for Multilingual Big Data in Cyberspace, Dalian University of Foreign Languages, Dalian 116044, China)
[Objective] This paper tries to extract topics from book reviews with the help of natural language semantics. [Methods] We proposed a method to retrieve the explicit and implicit topic keywords with the global semantic information from common sense knowledge base. [Results] The sentence coverage rate with the knowledge base method and the lexical diversity of the proposed method were 30.8% and 0.36% higher than those of the Double-Propagation algorithm. Then, based on the extracted topic words, we created a cluster map to identify the topic keywords identified by the nodes cluster centrality. [Limitations] There is no domain knowledge base in the field of book reviews. [Conclusions] The proposed method based on Knowledge Base improves the sentence coverage and lexical diversity of topics extracted from book reviews.
(Liu Jun.On the Implicit Topic of Literature[J]. Documentation, Information and Knowledge, 1996(2): 24-27.)
[2]
Hu M, Liu B.Mining and Summarizing Customer Reviews[C]// Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2004: 168-177.
[3]
Hu M, Liu B.Mining Opinion Features in Customer Reviews[C]// Proceedings of the 19th National Conference on Artificial Intelligence. 2004: 755-760.
[4]
Qiu G, Liu B, Bu J, et al.Opinion Word Expansion and Target Extraction Through Double Propagation[J]. Computational Linguistics, 2011, 37(1): 9-27.
[5]
Poria S, Cambria E, Ku L W, et al.A Rule-based Approach to Aspect Extraction from Product Reviews[C]// Proceedings of the 2nd Workshop on Natural Language Processing for Social Media. 2014: 28-37.
[6]
Su Q, Xu X, Guo H, et al.Hidden Sentiment Association in Chinese Web Opinion Mining[C]// Proceedings of the 17th International Conference on World Wide Web. ACM, 2008: 959-968.
[7]
Jin W, Ho H H.A Novel Lexicalized HMM-based Learning Framework for Web Opinion Mining[C]// Proceedings of the 26th Annual International Conference on Machine Learning. 2009: 465-472.
[8]
Poria S, Cambria E, Gelbukh A.Aspect Extraction for Opinion Mining with a Deep Convolutional Neural Network[J]. Knowledge-Based Systems, 2016, 108: 42-49.
[9]
Cruz I, Gelbukh A, Sidorov G.Implicit Aspect Indicator Extraction for Aspect Based Opinion Mining[J]. International Journal of Computational Linguistics and Applications, 2014, 5(2): 135-152.
[10]
Zhang Y, Zhu W.Extracting Implicit Features in Online Customer Reviews for Opinion Mining[C]// Proceedings of the 22nd International Conference on World Wide Web. 2013: 103-104.
(Feng Shufang, Wang Suge.Automobile Reviews Ontology Knowledge Base Construction Oriented Towards Opinion Mining[J].Computer Applications and Software, 2011, 28(5): 45-47, 105.)
(Wang Suge, Li Dayu, Li Yang.Sentiment Mining of Commodity Reputation Data Based on Joint Model[J]. Journal of Tsinghua University: Science and Technology, 2017, 57(9): 926-931.)
[13]
Zhang P, Gu H, Gartrell M, et al.Group-based Latent Dirichlet Allocation (Group-LDA): Effective Audience Detection for Books in Online Social Media[J]. Knowledge-Based Systems, 2016, 105: 134-146.
[14]
Sohail S S, Siddiqui J, Ali R.Book Recommendation System Using Opinion Mining[C]// Proceedings of the 2013 International Conference on Advances in Computing, Communications and Informatics. 2013: 1609-1614.
[15]
Sohail S S, Siddiqui J, Ali R.Feature Extraction and Analysis of Online Reviews for the Recommendation of Books Using Opinion Mining Technique[J]. Perspectives in Science, 2016, 8: 754-756.
[16]
陈晓美. 网络评论观点知识发现研究[D]. 长春: 吉林大学, 2014.
[16]
(Chen Xiaomei.Study of Knowledge Discovery of Opinions from Web Reviews[D]. Changchun: Jilin University, 2014.)
[17]
Cambria E, Chandra P, Sharma A, et al.Do not Feel the Trolls[C]// Proceedings of the CEUR Workshop. 2010.
[18]
Rajagopal D, Cambria E, Olsher D, et al.A Graph-based Approach to Commonsense Concept Extraction and Semantic Similarity Detection[C]// Proceedings of the 22nd International Conference on World Wide Web. ACM, 2013: 565-570.
(Li Feng.Clustering Analysis Based on Core Keyword—— Concurrently Discuss the Deficiency of Co-word Analysis[J]. Information Science, 2017, 35(8): 68-71, 78.)
(Fu Zhu, Wang Yuefen.A Discussion on Some Questions of Term Collection in Co-Word Analysis[J]. Journal of the China Society for Scientific and Technical Information, 2016, 35(7): 704-713.)
(Hu Changping, Chen Guo.Characteristics of Keywords in Scientific Papers and Their Impact on Co-Word Analysis[J]. Journal of the China Society for Scientific and Technical Information, 2014,33(1): 23-32.)
[22]
Wang Z Y, Li G, Li C Y, et a1. Research on the Semantic-Based Co-Word Analysis[J]. Scientometrics, 2012, 90(3): 855-875.
[23]
Waltman L, Van Eck N J. A Smart Local Moving Algorithm for Large-Scale Modularity-Based Community Detection[J]. The European Physical Journal B, 2013, 86(11): 471.
[24]
Waltman L, Van Eck N J, Noyons E C M. A Unified Approach to Mapping and Clustering of Bibliometric Networks[J]. Journal of Informetrics, 2010, 4(4): 629-635.
[25]
Manning C D.Part-of-Speech Tagging from 97% to 100%: Is It Time for Some Linguistics?[C]//Proceedings of the 2011 International Conference on Intelligent Text Processing and Computational Linguistics. Berlin, Heidelberg: Springer, 2011: 171-189.
[26]
Chris D P.Another Stemmer[J].ACM SIGIR Forum, 1990, 24(3): 56-61.
[27]
SenticNet. Concept Parser[OL]. [2018-01-28]. .
[28]
Van Eck N J, Waltman L. Software Survey: VOSviewer, a Computer Program for Bibliometric Mapping[J]. Scientometrics, 2010, 84(2): 523-538.
(Yang Ying, Cui Lei.Subject Strategic Information Research Based on Visualization of Co-Word Network[J]. Journal of the China Society for Scientific and Technical Information, 2011, 30(3): 325-330.)