New Technology of Library and Information Service  2015, Vol. 31 Issue (1): 38-44    DOI: 10.11925/infotech.1003-3513.2015.01.06
Semi-supervised Micro-blog Sentiment Classification Method Combining Active Learning and Co-training
Bi Qiumin1, Li Ming2, Zeng Zhiyong3
1. Faculty of Art and Communication, Kunming University of Science and Technology, Kunming 650093, China;
2. School of Information, Yunnan University of Finance and Economics, Kunming 650221, China;
3. Center of Information Management, Yunnan University of Finance and Economics, Kunming 650221, China
Abstract  

[Objective] To address the scarcity of labeled data and the abundance of unlabeled samples in micro-blog sentiment classification, this paper proposes a novel semi-supervised method. [Methods] Active learning is introduced into co-training: the method selects the most valuable samples from those classified with low confidence, labels them, adds them to the training set, and retrains the classifiers. [Results] Experimental results show that the classifiers perform better with this approach and that accuracy improves noticeably. In particular, when the proportion of labeled data reaches 40%, accuracy increases by about 5%. [Limitations] During co-training, randomly generated feature subspaces cannot produce two strong classifiers, so the assumptions of co-training are not satisfied. [Conclusions] Introducing active learning remedies the shortcomings of co-training and improves the performance and accuracy of the classifiers.
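
The loop described under [Methods] can be illustrated with a minimal sketch. It is not the authors' implementation: the nearest-centroid classifier, the distance-margin confidence score, the reading of "most valuable" as "least confident", the `threshold` and `n_query` parameters, and the `oracle` callback standing in for a human annotator are all assumptions made for illustration, as are every function name and the toy data generator.

```python
import random

def make_data(n, rng, dim=6, gap=4.0):
    """Toy two-class data (assumption): class 1 is shifted by `gap` in every feature."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        X.append([rng.gauss(label * gap, 1.0) for _ in range(dim)])
        y.append(label)
    return X, y

def split_views(n_features, rng):
    """Randomly partition the feature indices into two disjoint subspaces (views)."""
    idx = list(range(n_features))
    rng.shuffle(idx)
    half = n_features // 2
    return sorted(idx[:half]), sorted(idx[half:])

def fit_centroids(X, y, view):
    """Nearest-centroid model on one view: one mean vector per class."""
    cents = {}
    for label in (0, 1):
        rows = [x for x, t in zip(X, y) if t == label]
        cents[label] = [sum(r[j] for r in rows) / len(rows) for j in view]
    return cents

def predict(cents, x, view):
    """Return (label, confidence); confidence is a distance margin in [0, 1]."""
    d = {c: sum((x[j] - m) ** 2 for j, m in zip(view, mu)) ** 0.5
         for c, mu in cents.items()}
    label = min(d, key=d.get)
    conf = abs(d[0] - d[1]) / (d[0] + d[1] + 1e-12)
    return label, conf

def co_train_active(X_l, y_l, X_u, oracle, rng, rounds=5, threshold=0.3, n_query=2):
    """Co-training with an active-learning step for low-confidence samples."""
    X_l, y_l, X_u = list(X_l), list(y_l), list(X_u)
    views = split_views(len(X_l[0]), rng)
    queried = 0
    for _ in range(rounds):
        if not X_u:
            break
        models = [fit_centroids(X_l, y_l, v) for v in views]
        confident, uncertain = [], []
        for i, x in enumerate(X_u):
            (l1, c1), (l2, c2) = (predict(m, x, v) for m, v in zip(models, views))
            if l1 == l2 and min(c1, c2) >= threshold:
                confident.append((i, l1))          # co-training: keep the pseudo-label
            else:
                uncertain.append((min(c1, c2), i))  # candidate for manual labeling
        uncertain.sort()                            # least confident first
        asked = [(i, oracle(X_u[i])) for _, i in uncertain[:n_query]]
        queried += len(asked)
        added = confident + asked
        for i, label in added:
            X_l.append(X_u[i])
            y_l.append(label)
        taken = {i for i, _ in added}
        X_u = [x for i, x in enumerate(X_u) if i not in taken]
    models = [fit_centroids(X_l, y_l, v) for v in views]  # retrain on the final set
    return models, views, queried

def predict_joint(models, views, x):
    """Final label: take the prediction of the more confident view."""
    return max((predict(m, x, v) for m, v in zip(models, views)),
               key=lambda p: p[1])[0]
```

In this sketch, samples on which both views agree with high confidence receive pseudo-labels, while only the `n_query` least confident samples per round are sent to the oracle, which keeps the manual labeling cost bounded by `rounds * n_query`.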

Key words: Co-training, Active learning, Sentiment classification
Received: 20 June 2014      Published: 12 February 2015
CLC number: TP391

Cite this article:

Bi Qiumin, Li Ming, Zeng Zhiyong. Semi-supervised Micro-blog Sentiment Classification Method Combining Active Learning and Co-training. New Technology of Library and Information Service, 2015, 31(1): 38-44.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2015.01.06     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2015/V31/I1/38
