[Objective] This paper presents a novel algorithm that combines natural language processing (NLP) techniques with complex network theory to extract product features more effectively. [Methods] First, we constructed a weighted bipartite graph of product features and sentiment words, which represents their relationships clearly and intuitively from a network perspective. Then, we proposed the NodeRank algorithm to rank product features by importance, which improved the precision of feature extraction. [Results] We evaluated the proposed algorithm on data from jd.com, a popular online shopping site in China. The NodeRank algorithm achieved higher precision, recall, and F-score than the HAC, TF-IDF, and TextRank methods. [Limitations] The computational complexity of the new algorithm still needs to be optimized. [Conclusions] The NodeRank algorithm can effectively extract product features, which supports marketing and other business activities.
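The abstract does not give the NodeRank scoring formula, so the following is only a minimal sketch of the general idea: build a weighted bipartite graph from (feature, sentiment word) co-occurrences, then rank features by propagating scores across it. The PageRank-style update used here is a stand-in, not the NodeRank algorithm itself, and the function names `build_bipartite` and `rank_features`, the `damping` parameter, and the sample pairs are all illustrative assumptions.

```python
from collections import defaultdict


def build_bipartite(pairs):
    """Edge weight = number of times a feature co-occurs with a sentiment word."""
    weights = defaultdict(float)
    for feature, sentiment in pairs:
        weights[(feature, sentiment)] += 1.0
    return dict(weights)


def rank_features(weights, damping=0.85, iterations=50):
    """Rank features with a PageRank-style walk (assumed stand-in for NodeRank).

    Each iteration pushes scores feature -> sentiment word -> feature,
    splitting a node's score across its edges in proportion to edge weight.
    """
    features = sorted({f for f, _ in weights})
    sentiments = sorted({s for _, s in weights})

    # Total edge weight attached to each node, used to normalise transitions.
    f_total = defaultdict(float)
    s_total = defaultdict(float)
    for (f, s), w in weights.items():
        f_total[f] += w
        s_total[s] += w

    f_score = {f: 1.0 / len(features) for f in features}
    for _ in range(iterations):
        # Step 1: features pass their score to connected sentiment words.
        s_score = {s: 0.0 for s in sentiments}
        for (f, s), w in weights.items():
            s_score[s] += f_score[f] * w / f_total[f]
        # Step 2: sentiment words pass it back, with uniform teleportation.
        updated = {f: (1.0 - damping) / len(features) for f in features}
        for (f, s), w in weights.items():
            updated[f] += damping * s_score[s] * w / s_total[s]
        f_score = updated

    return sorted(f_score.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical (feature, sentiment word) pairs, as might be extracted from reviews.
sample_pairs = [
    ("screen", "clear"), ("screen", "clear"), ("screen", "bright"),
    ("screen", "good"), ("battery", "good"), ("battery", "durable"),
    ("price", "good"),
]
ranking = rank_features(build_bipartite(sample_pairs))
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

In this toy input, "screen" has the most and heaviest edges, so it ranks first; a real pipeline would first need word segmentation, part-of-speech tagging, and candidate extraction to produce the pairs.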