Please wait a minute...
New Technology of Library and Information Service  2011, Vol. 27 Issue (10): 24-28    DOI: 10.11925/infotech.1003-3513.2011.10.05
article Current Issue | Archive | Adv Search |
Research on Automatic Extraction of Web Sentiment Words
Zhang Qingliang, Xu Jian
School of Information Management, Sun Yat-Sen University, Guangzhou 510006, China
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  To improve the efficiency of extracting sentiment words and building sentiment lexicon, the authors propose a method to extract a set of basic sentiment words, and then to calculate both the PMI-IR value between candidate word and the positive basic sentiment word set and the PMI-IR value between candidate words and the negative basic sentiment word set, to judge the orientation of a candidate word.Taking account of frequency, orientation, intensity and definiteness of words, computers are able to finish most of the work. It improves the efficiency and reduces cost of building sentiment lexicon. Experiment is processed on the dataset constituted with 71 061 reviews from 360buy and 1 736 reviews from Joyo. With the dataset, the method achieves a recall rate of 76.36%, a precision of 76.94%,and the precision of sentiment orientation is 62.70%.
Key wordsSentiment analysis      Sentiment orientation      PMI-IR      Sentiment lexicon     
Received: 28 July 2011      Published: 03 December 2011
: 

G250

 

Cite this article:

Zhang Qingliang, Xu Jian. Research on Automatic Extraction of Web Sentiment Words. New Technology of Library and Information Service, 2011, 27(10): 24-28.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2011.10.05     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2011/V27/I10/24

[1] Turney P D, Littman M L. Measuring Praise and Critism: Inference of Semantic Orientation from Association[J]. ACM Tanslations on Information Systems,2003,21(4):315-346.

[2] 王素格. 基于Web的评论文本情感分类问题研究 .上海:上海大学, 2008.

[3] Tian P, Liu Y, Liu M. Research of Product Ranking Technology Based on Opinion Mining .In:Proceedings of the 2nd International Conference on Intelligent Computation Technology and Automation.2009:239-243.

[4] 董振东,董强. 知网 . .http://www.keenage.com/.

[5] Kamps J, Marx M, Mokken R J. Words with Attitude . In:Proceedings of the 1st International Conference on Global WordNet.2002:332-341.

[6] 朱嫣岚,闵锦,周雅倩,等. 基于HowNet的词汇语义倾向计算[J]. 中文信息学报, 2006,20(1):14-20.

[7] 路斌,万小军,杨建武.基于同义词词林的词汇褒贬计算 . 见: 中国计算技术与语言问题研究——第七届中文信息处理国际会议论文集 . 中国,武汉:中国中文信息学会, 2007.

[8] Dave K, Lawrence S, Pennock D M. Mining the Peanut Gallery: Opinion Extraction and Semantic Classification of Product Reviews . In:Proceedings of the 12th International Conference on World Wide Web.2003: 519-528.

[9] 彭学仕,孙春华.面向倾向性分析的基于词聚类的基准词选择方法[J]. 计算机应用研究, 2011,28(1):114-116.

[10] 王素格,李德玉,魏英杰,等. 基于同义词的词汇情感倾向判别方法[J]. 中文信息学报, 2009,23(5): 68-74.

[11] Turney P D. Mining the Web for Synonyms: PMI-IR Versus LSA on TOEFL .In:Proceedings of the 12th European Conference on Machine Learning, Freiburg, Germany.2001:491-502.

[12] 亚马逊公司.卓越网.http://www.amazon.cn/.

[13] 北京当当网信息技术有限公司.当当网.http://www.dangdang.com/.

[14] 北京京东世纪商贸有限公司.京东商城.http://www.360buy.com/.

[15] 阿里巴巴集团.淘宝网.http://www.taobao.com/.

[16] Abulaish M, Jahiruddin, Doja M N, et al. Feature and Opinion Mining for Customer Review Summarization . In:Proceeding of the 3rd International Conference on Pattern Recognition and Machine Intelligence. 2009:219-224.

[17] Hu M, Liu B. Mining Opinion Features in Customer Reviews . In:Proceedings of the 19th National Conference on Artificial Intelligence. 2004:755-760.
[1] Xu Yuemei, Wang Zihou, Wu Zixin. Predicting Stock Trends with CNN-BiLSTM Based Multi-Feature Integration Model[J]. 数据分析与知识发现, 2021, 5(7): 126-138.
[2] Zhong Jiawa,Liu Wei,Wang Sili,Yang Heng. Review of Methods and Applications of Text Sentiment Analysis[J]. 数据分析与知识发现, 2021, 5(6): 1-13.
[3] Liu Tong,Liu Chen,Ni Weijian. A Semi-Supervised Sentiment Analysis Method for Chinese Based on Multi-Level Data Augmentation[J]. 数据分析与知识发现, 2021, 5(5): 51-58.
[4] Wang Yuzhu,Xie Jun,Chen Bo,Xu Xinying. Multi-modal Sentiment Analysis Based on Cross-modal Context-aware Attention[J]. 数据分析与知识发现, 2021, 5(4): 49-59.
[5] Li Feifei,Wu Fan,Wang Zhongqing. Sentiment Analysis with Reviewer Types and Generative Adversarial Network[J]. 数据分析与知识发现, 2021, 5(4): 72-79.
[6] Chang Chengyang,Wang Xiaodong,Zhang Shenglei. Polarity Analysis of Dynamic Political Sentiments from Tweets with Deep Learning Method[J]. 数据分析与知识发现, 2021, 5(3): 121-131.
[7] Zhang Mengyao, Zhu Guangli, Zhang Shunxiang, Zhang Biao. Grouping Microblog Users of Trending Topics Based on Sentiment Analysis[J]. 数据分析与知识发现, 2021, 5(2): 43-49.
[8] Han Pu, Zhang Wei, Zhang Zhanpeng, Wang Yuxin, Fang Haoyu. Sentiment Analysis of Weibo Posts on Public Health Emergency with Feature Fusion and Multi-Channel[J]. 数据分析与知识发现, 2021, 5(11): 68-79.
[9] Lv Huakui,Liu Zhenghao,Qian Yuxing,Hong Xudong. Relationship Between Financial News and Stock Market Fluctuations[J]. 数据分析与知识发现, 2021, 5(1): 99-111.
[10] Xu Hongxia,Yu Qianqian,Qian Li. Studying Content Interaction Data with Topic Model and Sentiment Analysis[J]. 数据分析与知识发现, 2020, 4(7): 110-117.
[11] Jiang Lin,Zhang Qilin. Research on Academic Evaluation Based on Fine-Grain Citation Sentimental Quantification[J]. 数据分析与知识发现, 2020, 4(6): 129-138.
[12] Shi Lei,Wang Yi,Cheng Ying,Wei Ruibin. Review of Attention Mechanism in Natural Language Processing[J]. 数据分析与知识发现, 2020, 4(5): 1-14.
[13] Li Tiejun,Yan Duanwu,Yang Xiongfei. Recommending Microblogs Based on Emotion-Weighted Association Rules[J]. 数据分析与知识发现, 2020, 4(4): 27-33.
[14] Shen Zhuo,Li Yan. Mining User Reviews with PreLM-FT Fine-Grain Sentiment Analysis[J]. 数据分析与知识发现, 2020, 4(4): 63-71.
[15] Xue Fuliang,Liu Lifang. Fine-Grained Sentiment Analysis with CRF and ATAE-LSTM[J]. 数据分析与知识发现, 2020, 4(2/3): 207-213.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn