Data Analysis and Knowledge Discovery  2018, Vol. 2 Issue (4): 90-98    DOI: 10.11925/infotech.2096-3467.2017.1252
Original Article
Extracting Product Features with NodeRank Algorithm
Lixin Zhou, Jie Lin
School of Economics and Management, Tongji University, Shanghai 200092, China
Abstract  

[Objective] This paper presents a novel algorithm based on natural language processing (NLP) techniques and complex network theory, aiming to extract product features more effectively. [Methods] First, we constructed a weighted bipartite graph of product features and sentiment words, which describes their relationships more clearly and intuitively from a network perspective. Then, we proposed the NodeRank algorithm to rank the importance of product features, which improved the precision of feature extraction. [Results] We evaluated the proposed algorithm with data from jd.com, a popular online shopping site in China. The NodeRank algorithm achieved better precision, recall and F-score than the HAC, TF-IDF and TextRank methods. [Limitations] The computational complexity of the new algorithm needs to be optimized. [Conclusions] The NodeRank algorithm can effectively extract product features, which supports marketing and other business activities.
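
The abstract does not spell out how NodeRank is defined, so the following is only an illustrative sketch of the general idea: build a weighted bipartite graph from (feature, sentiment word) pairs, rank the feature nodes by propagating scores across the graph, and score the result with precision, recall and F-score. The co-occurrence edge weights, the HITS-style mutual reinforcement update, the function names and the toy review pairs are all assumptions made for illustration; they are not the authors' NodeRank formulation or data.

```python
from collections import defaultdict


def build_bipartite_graph(pairs):
    """Use co-occurrence counts of (feature, sentiment_word) pairs as edge weights.

    Assumption: co-occurrence counting stands in for whatever weighting the paper uses.
    """
    weights = defaultdict(float)
    for feature, sentiment in pairs:
        weights[(feature, sentiment)] += 1.0
    return dict(weights)


def rank_features(weights, iterations=50, tol=1e-9):
    """Rank feature nodes by HITS-style mutual reinforcement (not the paper's NodeRank)."""
    if not weights:
        return []
    features = sorted({f for f, _ in weights})
    sentiments = sorted({s for _, s in weights})
    f_score = {f: 1.0 / len(features) for f in features}
    for _ in range(iterations):
        # Sentiment words collect score from the features they describe.
        s_score = {s: 0.0 for s in sentiments}
        for (f, s), w in weights.items():
            s_score[s] += w * f_score[f]
        # Features collect score back from their sentiment words.
        new_f = {f: 0.0 for f in features}
        for (f, s), w in weights.items():
            new_f[f] += w * s_score[s]
        total = sum(new_f.values())
        new_f = {f: v / total for f, v in new_f.items()}  # normalize to sum to 1
        delta = max(abs(new_f[f] - f_score[f]) for f in features)
        f_score = new_f
        if delta < tol:
            break
    return sorted(f_score.items(), key=lambda kv: kv[1], reverse=True)


def precision_recall_f1(predicted, gold):
    """Standard set-based precision, recall and F-score for extracted features."""
    predicted, gold = set(predicted), set(gold)
    hits = len(predicted & gold)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


# Hypothetical (feature, sentiment word) pairs, as if extracted from reviews.
pairs = [
    ("screen", "good"), ("screen", "clear"), ("screen", "good"),
    ("battery", "good"), ("price", "cheap"),
]
ranking = rank_features(build_bipartite_graph(pairs))
print(ranking)
print(precision_recall_f1([name for name, _ in ranking[:2]], ["screen", "price"]))
```

In this sketch a feature ranks higher when it is linked, with heavy edges, to sentiment words that are themselves linked to highly ranked features. Keeping only the top-ranked candidates is one plausible way such a ranking could raise extraction precision.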

Key words: Feature Extraction; Bipartite Graph; NodeRank Algorithm; Importance Ranking
Received: 11 December 2017      Published: 11 May 2018

Cite this article:

Lixin Zhou, Jie Lin. Extracting Product Features with NodeRank Algorithm. Data Analysis and Knowledge Discovery, 2018, 2(4): 90-98.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2017.1252     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2018/V2/I4/90
