Data Analysis and Knowledge Discovery  2017, Vol. 1 Issue (12): 1-9    DOI: 10.11925/infotech.2096-3467.2017.0618
Examining Product Reviews with Sentiment Analysis and Opinion Mining
Guo Bo1, Li Shouguang1, Wang Hao1, Zhang Xiaojun1, Gong Wei1, Yu Zhaojun1, Sun Yu2
1Meizu Telecom Equipment Co., Ltd., Beijing 100872, China
2Computer Science Department, California State Polytechnic University, Pomona 91768, USA
[Objective] This study conducts a comprehensive analysis of the large volume of reviews generated by users of e-commerce websites, aiming to assess marketing strategies. [Methods] We used syntactic parsing, a bag-of-words model, and machine learning techniques to examine real-world datasets from JD and TMall. The proposed method analyzes sentiment and extracts opinions from the reviews automatically. [Results] The accuracy of the sentiment analysis was 90%. We constructed an automatic vocabulary-building mechanism with no dictionary dependency. The F-measure of the new system was 71%. [Limitations] The recall of the opinion extraction needs to be improved. [Conclusions] The proposed system could effectively monitor word-of-mouth issues facing products sold online, and it could be transferred to many online businesses.

Key words: User Review, Sentiment Analysis, Opinion Mining, Machine Learning, Tag Extraction
Received: 29 June 2017      Published: 29 December 2017
ZTFLH:  TP181  

Cite this article:

Guo Bo, Li Shouguang, Wang Hao, Zhang Xiaojun, Gong Wei, Yu Zhaojun, Sun Yu. Examining Product Reviews with Sentiment Analysis and Opinion Mining. Data Analysis and Knowledge Discovery, 2017, 1(12): 1-9.

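Step  Dependency relation   Meaning              Example

      nsubj(VA, NN)         sentence subject     手机(外形)很(漂亮): the phone's (appearance) is very (pretty)
      amod(NN, VA)          modifier relation    很(差)的(手机): a very (bad) (phone)
      amod(NN, JJ)          modifier relation    这个手机有很(漂亮)的(外形): this phone has a very (pretty) (appearance)

      dep(VA, VA)           dependency relation

      conj(NN, NN)          coordination         手机的(拍照)和(摄像)不错: the phone's (photo-taking) and (video recording) are good
      compound:nn(NN, NN)   noun compound        (手机外形)不错: the (phone appearance) is good
      nmod:assmod(NN, NN)   noun phrase          (手机)的(外形)很漂亮: the (phone)'s (appearance) is very pretty

      nsubj(VA, NN)         sentence subject     手机(外形)很(漂亮): the phone's (appearance) is very (pretty)
      amod(NN, VA)          modifier relation    很(差)的(手机): a very (bad) (phone)
      amod(NN, JJ)          modifier relation    这个手机有很(漂亮)的(外形): this phone has a very (pretty) (appearance)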
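
The dependency patterns in the table above can be read as rules for pairing a product feature (a noun) with an opinion word (a predicative adjective or modifier). The following Python sketch shows one possible way to apply them. It is not the authors' implementation: the `Dep` tuple, the `SEED_PATTERNS` mapping, the `extract_pairs` function, and the hand-written triples are all hypothetical, and the use of conj(NN, NN) to share an opinion between coordinated nouns is an assumption about how the table is meant to be used. A real pipeline would take its triples from a dependency parser.

```python
from typing import NamedTuple


class Dep(NamedTuple):
    """One dependency triple: rel(head, dependent), with POS tags (hypothetical layout)."""
    rel: str
    head: str
    head_pos: str
    dep: str
    dep_pos: str


# (relation, head POS, dependent POS) -> which side holds the product feature.
# The three keys are the nsubj/amod patterns listed in the table.
SEED_PATTERNS = {
    ("nsubj", "VA", "NN"): "dep",   # feature is the dependent noun
    ("amod", "NN", "VA"): "head",   # feature is the head noun
    ("amod", "NN", "JJ"): "head",
}


def extract_pairs(deps):
    """Return (feature, opinion) pairs found in a list of Dep triples."""
    pairs = []
    for d in deps:
        side = SEED_PATTERNS.get((d.rel, d.head_pos, d.dep_pos))
        if side == "dep":
            pairs.append((d.dep, d.head))
        elif side == "head":
            pairs.append((d.head, d.dep))

    # Assumption: nouns joined by conj(NN, NN) share the same opinion word.
    # The relation is treated symmetrically, so head/dependent order does not matter.
    for d in deps:
        if d.rel == "conj" and d.head_pos == "NN" and d.dep_pos == "NN":
            for feature, opinion in list(pairs):
                if feature == d.head and (d.dep, opinion) not in pairs:
                    pairs.append((d.dep, opinion))
                elif feature == d.dep and (d.head, opinion) not in pairs:
                    pairs.append((d.head, opinion))
    return pairs


# Hand-written triples for the table's coordination example; not real parser output.
example = [
    Dep("nsubj", "不错", "VA", "拍照", "NN"),
    Dep("conj", "拍照", "NN", "摄像", "NN"),
]
print(extract_pairs(example))
```

On the hand-written triples, the nsubj pattern yields the pair for 拍照 (photo-taking), and the conj step copies the same opinion word to 摄像 (video recording).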

Model                 Algorithm  Precision  Recall  F1     AUC
Baseline model        NB         0.889      0.892   0.890  0.950
Negation-word model   NB         0.892      0.899   0.895  0.953
Syntactic model       NB         0.914      0.908   0.911  0.961
Baseline model        SGD        0.908      0.894   0.901  0.958
Negation-word model   SGD        0.911      0.904   0.907  0.961
Syntactic model       SGD        0.917      0.919   0.918  0.967
Baseline model        SVM        0.902      0.902   0.902  0.959
Negation-word model   SVM        0.912      0.900   0.906  0.960
Syntactic model       SVM        0.916      0.920   0.918  0.966
Baseline model        RF         0.871      0.870   0.871  0.942
Negation-word model   RF         0.875      0.874   0.874  0.945
Syntactic model       RF         0.880      0.880   0.880  0.948
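
The F1 column in the table above is consistent with the harmonic mean of the two columns before it, read here as precision and recall. The short Python check below recomputes F1 for a few rows copied from the table. The `f1_score` helper and the `rows` list are illustrative names, not part of the original work. Because the published inputs are already rounded to three decimals, the recomputed values can differ from the reported ones in the last digit.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (model, algorithm, precision, recall, reported F1), copied from the table.
rows = [
    ("baseline", "NB", 0.889, 0.892, 0.890),
    ("syntactic", "SGD", 0.917, 0.919, 0.918),
    ("syntactic", "SVM", 0.916, 0.920, 0.918),
    ("baseline", "RF", 0.871, 0.870, 0.871),
]

for model, algo, p, r, reported in rows:
    computed = f1_score(p, r)
    print(f"{model:10s} {algo:4s} computed={computed:.4f} reported={reported:.3f}")
```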

Algorithm  50,000  100,000  150,000  200,000
NB         0.23    0.45     0.59     0.98
SGD        0.22    0.39     0.57     0.75
SVM        4       12       17       26
RF         190     400      640      890
