三种文档语义倾向性识别方法的分析与比较*

doi:10.11925/infotech.1003-3513.2007.04.11

现代图书情报技术

2007, Vol. 2

Issue (4): 43-47 https://doi.org/10.11925/infotech.1003-3513.2007.04.11

知识组织与知识管理

本期目录 | 过刊浏览 | 高级检索

三种文档语义倾向性识别方法的分析与比较*

马海兵¹ 刘永丹¹王兰成¹ 李荣陆²

¹(南京政治学院上海分院上海 200433)
²(Autodesk公司上海 200001)

An Analysis and Comparison of Three Methods for Document Semantic Orientation Recognition

Ma Haibing¹ Liu Yongdan¹ Wang Lancheng¹ Li Ronglu²

¹(Shanghai Branch, Nanjing Political Institute, Shanghai 200433, China)
²(Autodesk Incorporation, Shanghai 200001, China)

摘要
参考文献
相关文章
Metrics

全文: PDF (400 KB)
输出: BibTeX | EndNote (RIS)

摘要

研究并实现三种文档倾向性识别的方法：基于情感词加权的方法、基于语义模式分析的方法和基于文本分类的方法。第一种方法利用特征词汇的情感语义倾向性。第二种方法对自然语言的句法结构进行简化，以获取合适粒度的倾向性语义模式。第三种方法则直接利用传统的基于文本分类的方法。通过在网络舆情分析系统中的具体实现，探讨这三种方法各自的不足和优势。

	服务

	把本文推荐给朋友
	加入引用管理器
	E-mail Alert
	RSS
	作者相关文章
	李荣陆
	王兰成
	马海兵
	刘永丹

关键词 ：自然语言处理, 文本分类, 倾向性识别

Abstract：

This paper investigates and implements three methods for document semantic orientation recognition: one is based on the weighted emotional words; another is based on the analysis of the semantic pattern; and the third is based on text classification. The first method is try to use semantic orientation of feature words. The second method is to simplify the structure of the natural language syntax, in order to acquire the appropriate size patterns with semantics tendency. The third method is based on the direct use of traditional text classification method. Through experiments and the application in an analysis system of net-mediated public opinions, shortages and advantages of the three methods are proposed.

Key words： Natural language process Text classification Semantic orientation recognition

收稿日期: 2007-03-06 出版日期: 2007-04-25

TP391

基金资助:

*本文系中国博士后基金资助项目“军队网络舆情分析系统”(项目编号：20060390952)和江苏省博士后科研资助项目（项目编号：0601029）的研究成果之一。

通讯作者: 马海兵 E-mail: martin0721@163.com

作者简介: 马海兵,刘永丹,王兰成,李荣陆

引用本文:

马海兵,刘永丹,王兰成,李荣陆 . 三种文档语义倾向性识别方法的分析与比较*[J]. 现代图书情报技术, 2007, 2(4): 43-47.
Ma Haibing,Liu Yongdan,Wang Lancheng,Li Ronglu . An Analysis and Comparison of Three Methods for Document Semantic Orientation Recognition. New Technology of Library and Information Service, 2007, 2(4): 43-47.

链接本文:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2007.04.11 或 https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2007/V2/I4/43

1Jaap Kamps, Maarten Marx, Robert J. Mokken, Maarten de Rijke. Using WordNet to Measure Semantic Orientations of Adjectives. In: Proceedings of the fourth international conference on Language Resources and Evaluation, 2004，IV:1115-1118
2Turney Peter. Thumbs Up or Thumbs Down ? Semantic Orientation Applied to Unsupervised Classification of Reviews. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics. 2002: 417 -424
3Turney, Peter, Littman Michael. Measuring praise and criticism: Inference of semantic orientation from association. ACM Transactions on Information Systems, 2003, 21(4): 315 - 346
4朱嫣岚,闵锦,周雅倩等. 基于HowNet 的词汇语义倾向计算. 中文信息学报,2006, 20(1): 14-20
5Yi J,Nasukawa T,Bunescu R,et al. Sentiment Analyzer: Extracting Sentiments about a Given Topic Using Natural Language Processing Techniques, In : Proceedings of the Third IEEE International Conference on Data Mining, 2003
6Dave K,Lawrence S,Pennock D. Mining the Peanut Gallery:Opinion Extraction and Semantic Classification of Product Reviews. In: Proceedings of the 22th International World Wide Web Conference, Budapest, Hungary, 2003
7Beineke P,Hastie T，Vaithyanathan S. The Sentimental Factor:Improving Review Classification Via Human-Provided Information. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, 2004
8刘永丹，曾海泉，李荣陆等基于语义分析的倾向性文本过滤.通信学报, 2004, 25(7):78-85
9Bo Pang , Lillian Lee , Shivakumar Vaithyanathan. Thumbs up? Sentiment Classification Using machine Learning Techniques. In: Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing. 2002：79-86
10Bo Pang, Lillian Lee. Seeing Stars: Exploiting Class Relationships for Sentiment Categorization with Respect to Rating Scales. In: Proceedings of the Association for Computational Linguistics (ACL 2005), 2005: 115-124

[1]	陈杰,马静,李晓峰. 融合预训练模型文本特征的短文本分类方法*[J]. 数据分析与知识发现, 2021, 5(9): 21-30.
[2]	周泽聿,王昊,赵梓博,李跃艳,张小琴. 融合关联信息的GCN文本分类模型构建及其应用研究*[J]. 数据分析与知识发现, 2021, 5(9): 31-41.
[3]	王一钒,李博,史话,苗威,姜斌. 古汉语实体关系联合抽取的标注方法*[J]. 数据分析与知识发现, 2021, 5(9): 63-74.
[4]	余本功,朱晓洁,张子薇. 基于多层次特征提取的胶囊网络文本分类研究^*[J]. 数据分析与知识发现, 2021, 5(6): 93-102.
[5]	王艳, 王胡燕, 余本功. 基于多特征融合的中文文本分类研究^*[J]. 数据分析与知识发现, 2021, 5(10): 1-14.
[6]	唐晓波,高和璇. 基于关键词词向量特征扩展的健康问句分类研究 ^*[J]. 数据分析与知识发现, 2020, 4(7): 66-75.
[7]	王思迪,胡广伟,杨巳煜,施云. 基于文本分类的政府网站信箱自动转递方法研究*[J]. 数据分析与知识发现, 2020, 4(6): 51-59.
[8]	徐月梅,刘韫文,蔡连侨. 基于深度融合特征的政务微博转发规模预测模型^*[J]. 数据分析与知识发现, 2020, 4(2/3): 18-28.
[9]	徐彤彤,孙华志,马春梅,姜丽芬,刘逸琛. 基于双向长效注意力特征表达的少样本文本分类模型研究^*[J]. 数据分析与知识发现, 2020, 4(10): 113-123.
[10]	余本功,曹雨蒙,陈杨楠,杨颖. 基于nLD-SVM-RF的短文本分类研究*[J]. 数据分析与知识发现, 2020, 4(1): 111-120.
[11]	聂维民,陈永洲,马静. 融合多粒度信息的文本向量表示模型 ^*[J]. 数据分析与知识发现, 2019, 3(9): 45-52.
[12]	邵云飞,刘东苏. 基于类别特征扩展的短文本分类方法研究 ^*[J]. 数据分析与知识发现, 2019, 3(9): 60-67.
[13]	秦贺然,刘浏,李斌,王东波. 融入实体特征的典籍自动分类研究 ^*[J]. 数据分析与知识发现, 2019, 3(9): 68-76.
[14]	黄名选,卢守东,徐辉. 基于加权关联模式挖掘与规则后件扩展的跨语言信息检索 ^*[J]. 数据分析与知识发现, 2019, 3(9): 77-87.
[15]	陈果,许天祥. 基于主动学习的科技论文句子功能识别研究 ^*[J]. 数据分析与知识发现, 2019, 3(8): 53-61.

Viewed

Full text

Abstract

Cited

Shared

Discussed