[Objective] This paper designs multiple word representation methods, aiming to obtain the latent semantic features and extract product properties from reviews.[Methods] First, we used word properties, dependency relationship and embedding techniques to construct three types of word representations, which included basic, structural and category semantic information. Then, we applied conditional random field model to extract product properties with these semantic information.[Results] The accuracy of the proposed method was 3.97% higher than that of the DepREm-CRF.Its F1 value was up to 7.65% better than the popular ones.[Limitations] More research is needed to investigate the relationship between online sentiments and properties.[Conclusions] The proposed method is able to effectively extract properties from product reviews, and lays good foundation for fine-grained sentiment analysis research.
Luo H, Li T, Liu B, et al. Improving Aspect Term Extraction with Bidirectional Dependency Tree Representation[J]. IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP), 2019,27(7):1201-1212.
[2]
Yin Y, Wei F, Dong L , et al. Unsupervised Word and Dependency Path Embeddings for Aspect Term Extraction [C]// Proceedings of the 25th International Joint Conference on Artificial Intelligence. 2016: 2979-2985.
[3]
Hu M, Liu B . Mining and Summarizing Customer Reviews [C]// Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2004: 168-177.
[4]
Liu C L, Hsaio W H, Lee C H, et al. Movie Rating and Review Summarization in Mobile Environment[J]. IEEE Transactions on Systems Man and Cybernetics Part C:Applications and Reviews, 2012,42(3):397-407.
[5]
Ghadery E, Movahedi S, Faili H, et al. An Unsupervised Approach for Aspect Category Detection Using Soft Cosine Similarity Measure[OL]. arXivPreprint, arXiv:1812.03361.
[6]
Zhang J, Chen D, Lu M. Combining Sentiment Analysis with a Fuzzy Kano Model for Product Aspect Preference Recommendation[J]. IEEE Access, 2018,6:59163-59172.
( Guo Bo, Li Shouguang, Wang Hao, et al. Examining Product Reviews with Sentiment Analysis and Opinion Mining[J]. Data Analysis and Knowledge Discovery, 2017,1(12):1-9.)
( Li Weiqing, Wang Weijun. Building Product Feature Dictionary with Large-Scale Review Data[J]. Data Analysis and Knowledge Discovery, 2018,2(1):41-50.)
( Zhang Zhen, Zeng Jin. Extracting Keywords from User Comments: Case Study of Meituan[J]. Data Analysis and Knowledge Discovery, 2019,3(3):36-44.)
[10]
Poria S, Cambria E, Ku L W , et al. A Rule-Based Approach to Aspect Extraction from Product Reviews [C]// Proceedings of the 2nd Workshopon Natural Language Processing for Social Media. 2014: 28-37.
( Peng Yun, Wan Changxuan, Jiang Tengjiao, et al. Extracting Product Aspects and User Opinions Based on Semantic Constrained LDA Model[J]. Journal of Software, 2017,28(3):676-693.)
[12]
Mukherjee A, Liu B . Aspect Extraction ThroughSemi-Supervised Modeling [C]// Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics. 2012,1:339-348.
[13]
Li Y, Qin Z, Xu W, et al. A Holistic Model of Mining Product Aspects and Associated Sentiments from Online Reviews[J]. Multimedia Tools and Applications, 2015,74(23):10177-10194.
[14]
Liu Q, Gao Z, Liu B , et al. Automated Rule Selection for Aspect Extraction in Opinion Mining [C]// Proceedings of the 24th International Joint Conference on Artificial Intelligence. 2015: 1291-1297.
( Zhou Qingqing, Zhang Chengzhi. Fine-grained Aspect Extraction from Online Customer Reviews[J]. Journal of the China Society for Scientific and Technical Information, 2017,36(5):484-493.)
[16]
Peng H, Ma Y, Li Y, et al. Learning Multi-Grained Aspect Target Sequence for Chinese Sentiment Analysis[J]. Knowledge-Based Systems, 2018,148:167-176.
( Zhao Yang, Li Qiqi, Chen Yuhan, et al. Examining Consumer Reviews of Overseas Shopping APP with Sentiment Analysis[J]. Data Analysis and Knowledge Discovery, 2018,2(11):19-27.)
[18]
Xu H, Liu B, Shu L , et al. Double Embeddings and CNN-Based Sequence Labeling for Aspect Extraction [C]// Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics. 2018: 592-598.
[19]
Lafferty J D, McCallum A, Pereira F C N . Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data [C]// Proceedings of the 18th International Conference on Machine Learning. Morgan Kaufmann Publishers Inc., 2001: 282-289.
[20]
Xiang Y, He H, Zheng J. Aspect Term Extraction Based on MFE-CRF[J]. Information, 2018,9(8):198-213.
[21]
Le Q, Mikolov T . Distributed Representations of Sentences and Documents [C]// Proceedings of the 31st International Conference on Machine Learning. 2014: 1188-1196.
[22]
Dhingra B, Zhou Z, Fitzpatrick D , et al. Tweet2Vec: Character-Based Distributed Representations for Social Media [C]// Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. 2016: 269-274.
[23]
Moody C E. Mixing DirichletTopic Models and Word Embeddings to Make LDA2Vec[OL]. arXivPreprint,arXiv:1605.02019.
( Zeng Qingtian, Dai Mingdi, Li Chao, et al. Discovering Important Locations with User Representation and Trace Data[J]. Data Analysis and Knowledge Discovery, 2019,3(6):75-82.)
[25]
MacAvaney S, Zeldes A . A Deeper Look into Dependency-Based Word Embeddings [C]// Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop. 2018: 40-45.
[26]
Ye Z, Zhao H. Syntactic Word Embedding Based on Dependency Syntax and PolysemousAnalysis[J]. Frontiers of Information Technology & Electronic Engineering, 2018,19(4):524-535.
[27]
Levy O, Goldberg Y . Dependency-Based Word Embeddings [C]// Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics. 2014: 302-308.
[28]
Zhao Y, Qin B, Liu T. Encoding Syntactic Representations with a Neural Network for Sentiment Collocation Extraction[J]. Science China-Information Sciences, 2017, 60(11): Article No. 110101.
[29]
Li C, Li J, Song Y , et al. Training and Evaluating Improved Dependency-Based Word Embeddings [C]// Proceedings of the 32nd AAAI Conference on Artificial Intelligence. 2018: 5836-5843.
[30]
Blei D M, Ng A Y, Jordan M I. Latent DirichletAllocation[J]. Journal of Machine Learning Research, 2003,3:993-1022.