Please wait a minute...
Advanced Search
现代图书情报技术  2013, Vol. Issue (12): 70-73     https://doi.org/10.11925/infotech.1003-3513.2013.12.11
  情报分析与研究 本期目录 | 过刊浏览 | 高级检索 |
中文网络评论中产品特征提取方法研究
王永, 张勤, 杨晓洁
重庆邮电大学经济管理学院 重庆 400065
Research on the Method of Extracting Features from Chinese Product Reviews on the Internet
Wang Yong, Zhang Qin, Yang Xiaojie
School of Economics and Management, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
全文: PDF (531 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 针对中文网络客户评论中产品特征提取问题,提出采用FP增长算法获取候选产品特征集,再根据独立支持度、频繁项名词非特征规则及PMI阈值过滤技术对候选产品特征进行筛选,得到最终产品特征集,从而实现对中文网络客户评论中产品特征信息的自动挖掘。采用数据堂提供的手机评论语料,对该方法进行数据实验,实验结果可以验证该方法的有效性。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
王永
张勤
杨晓洁
关键词 产品特征特征提取关联规则评论挖掘    
Abstract:Aim for better solving the problem of extracting features from Chinese product reviews on the Internet, an approach using FP-growth algorithm is proposed to obtain the set of candidate product features. Then, the candidate product features are filtered according to the rules of p-support, non-features frequent nouns and PMI threshold filtering technology. Finally, the final product features set are obtained. Thus, the automatic mining of product features information from Chinese customer reviews on the Internet is achieved. The proposed method is tested with the cell phone reviews from Datatang and the results show that the presented method is valid and effective.
Key wordsProduct features    Features extracting    Association rules    Review mining
收稿日期: 2013-08-12      出版日期: 2014-01-08
:  TP393  
基金资助:本文系国家社会科学基金项目“差错管理气氛对企业创新行为的影响机理及对策研究”(项目编号:12CGL049)和重庆市自然科学基金项目“基于在线社交网络的舆情演化及社会化协同过滤推荐算法研究”(项目编号:CSTC2011jjA40045)的研究成果之一。
通讯作者: 王永     E-mail: wangyong_cqupt@163.com
引用本文:   
王永, 张勤, 杨晓洁. 中文网络评论中产品特征提取方法研究[J]. 现代图书情报技术, 2013, (12): 70-73.
Wang Yong, Zhang Qin, Yang Xiaojie. Research on the Method of Extracting Features from Chinese Product Reviews on the Internet. New Technology of Library and Information Service, 2013, (12): 70-73.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2013.12.11      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2013/V/I12/70
[1] 翟东升, 徐颖, 黄鲁成, 等.基于产品评论挖掘的竞争产品优势分析[J]. 情报杂志, 2013, 32(2): 45-51.(Zhai Dongsheng, Xu Ying, Huang Lucheng, et al. The Advantage Analysis of Competitive Product Based on Product Reviews Mining[J]. Journal of Intelligence, 2013, 32(2): 45-51.)
[2] Zhuang L, Jing F, Zhu X Y.Movie Review Mining and Summarization[C].In: Proceedings of the 15th ACM International Conference on Information and Knowledge Management (CIKM'06), Arlington, Virginia, USA. New York: ACM, 2006:43-50.
[3] Kobayashi N, Inui K, Matsumoto Y, et al. Collecting Evaluative Expressions for Opinion Extraction[C]. In: Proceedings of the 1st International Joint Conference on Natural Language Processing (IJCNLP'04). Berlin, Heidelberg: Springer-Verlag, 2004:596-605.
[4] 娄德成, 姚天昉.汉语句子语义极性分析和观点抽取方法的研究[J]. 计算机应用, 2006, 26(11):2622-2625.(Lou Decheng, Yao Tianfang. Semantic Polarity Analysis and Opinion Mining on Chinese Review Sentences[J].Journal of Computer Applications, 2006, 26(11):2622-2625.)
[5] Shi B, Chang K. Mining Chinese Reviews[C].In: Proceedings of the 6th IEEE International Conference on Data Mining. Washington D C: IEEE Computer Society, 2006:585-589.
[6] Yi J, Nasukawa T, Bunescu R, et al. Sentiment Analyzer: Extracting Sentiments About a Given Topic Using Natural Language Processing Techniques[C].In: Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM'03). Washington D C: IEEE Computer Society, 2003: 427.
[7] 余传明.从用户评论中挖掘产品属性——基于 SOM 的实现[J]. 现代图书情报技术, 2009(5):61-66.(Yu Chuanming. Mining Product Aspects from User Reviews—— An SOM-based Approach[J]. New Technology of Library and Information Service, 2009(5):61-66.)
[8] Hu M, Liu B. Mining and Summarizing Customer Reviews[C]. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'04).New York: ACM, 2004:168-l77.
[9] 李实, 叶强, 李一军, 等.中文网络客户评论的产品特征挖掘方法研究[J]. 管理科学学报, 2009(2):142-152.(Li Shi, Ye Qiang, Li Yijun, et al. Mining Features of Products from Chinese Customer Online Reviews[J]. Journal of Management Sciences in China, 2009(2):142-152.)
[10] 李实, 叶强, 李一军, 等.挖掘中文网络客户评论的产品特征及情感倾向[J]. 计算机应用研究, 2010, 27(8):3016-3019. (Li Shi, Ye Qiang, Li Yijun, et al. Mining Product Features and Sentiment Orientation from Chinese Customer Reviews[J]. Application Research of Computers, 2010, 27(8):3016-3019.)
[11] Han J, Pei J, Yin Y, et al. Mining Frequent Patterns without Candidate Generation: A Frequent-Pattern Tree Approach[C]. In: Proceedings of the 2000 ACM SIGMOD, Dallas, USA. 2000:1-12.
[12] Church K W, Hanks P. Word Association Norms, Mutual Information and Lexicography[C].In: Proceedings of the 27th Annual Meeting on Association for Computational Linguistics, New Brunswick, NJ, Canada. Stroudsburg: Association for Computational Linguistics, 1989:76-83.
[1] 郑新曼, 董瑜. 基于科技政策文本的程度词典构建研究*[J]. 数据分析与知识发现, 2021, 5(10): 81-93.
[2] 李铁军,颜端武,杨雄飞. 基于情感加权关联规则的微博推荐研究*[J]. 数据分析与知识发现, 2020, 4(4): 27-33.
[3] 沈卓,李艳. 基于PreLM-FT细粒度情感分析的餐饮业用户评论挖掘[J]. 数据分析与知识发现, 2020, 4(4): 63-71.
[4] 魏伟,郭崇慧,邢小宇. 基于语义关联规则的试题知识点标注及试题推荐*[J]. 数据分析与知识发现, 2020, 4(2/3): 182-191.
[5] 蔡婧璇,吴江,王诚坤. 基于深度学习的众测报告有用性预测研究*[J]. 数据分析与知识发现, 2020, 4(11): 102-111.
[6] 李博诚,张云秋,杨铠西. 面向微博商品评论的情感标签抽取研究 *[J]. 数据分析与知识发现, 2019, 3(9): 115-123.
[7] 黄名选,卢守东,徐辉. 基于加权关联模式挖掘与规则后件扩展的跨语言信息检索 *[J]. 数据分析与知识发现, 2019, 3(9): 77-87.
[8] 李纲,周华阳,毛进,陈思菁. 基于机器学习的社交媒体用户分类研究 *[J]. 数据分析与知识发现, 2019, 3(8): 1-9.
[9] 文秀贤,徐健. 基于用户评论的商品特征提取及特征价格研究 *[J]. 数据分析与知识发现, 2019, 3(7): 42-51.
[10] 张勇,李树青,程永上. 基于频次有效长度的加权关联规则挖掘算法研究 *[J]. 数据分析与知识发现, 2019, 3(7): 85-93.
[11] 聂卉. 结合词向量和词图算法的用户兴趣建模研究 *[J]. 数据分析与知识发现, 2019, 3(12): 30-40.
[12] 严娇,马静,房康. 基于融合共现距离的句法网络下文本语义相似度计算 *[J]. 数据分析与知识发现, 2019, 3(12): 93-100.
[13] 钟庆虹,乔晓东,张运良,翁梦娟. 基于LDA2Vec和残差网络的跨媒体融合方法研究 *[J]. 数据分析与知识发现, 2019, 3(10): 78-88.
[14] 杨贵军,徐雪,赵富强. 基于XGBoost算法的用户评分预测模型及应用*[J]. 数据分析与知识发现, 2019, 3(1): 118-126.
[15] 何跃, 丰月, 赵书朋, 马玉凤. 基于知乎问答社区的内容推荐研究——以物流话题为例[J]. 数据分析与知识发现, 2018, 2(9): 42-49.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn