|
|
Research on Automatic Classification Based on Term Context Relations |
Guo Shaoyou |
(Department of Information Management, Zhengzhou University, Zhengzhou 450001, China) |
|
|
Abstract In this paper, a term context vector is used to represent the relation between a term and its context terms. Based on term context vectors, class feature vectors of a classifier, and the document vector of the document to be classified are generated, and then the document is classified. The experiment shows that adding term context relations into class feature vector and document vector can improve the classification effect.
|
Received: 03 January 2008
Published: 25 May 2008
|
|
Corresponding Authors:
Guo Shaoyou
E-mail: gsy6@ha.edu.cn
|
About author:: Guo Shaoyou |
[1] Wang Y. Incorporating Semantic and Syntactic Information into Document Representation for Document Clustering[D]. Mississippi:Mississippi State University,2005.
[2] Billhardt H, Borrajo D, Maojo V. Using Term Co-occurrence Data for Document Indexing and Retrieval[C]. In:Proceedings of the BCSIRSG 22nd Annual Colloquium on Information Retrieval Research, 2000:105-117.
[3] 何中市,刘里. 基于上下文关系的文本分类特征描述方法[J]. 计算机科学,2007,34(5):183-186.
[4] 孙晓霞,郑玉明,廖湖声. 一种基于特征词句子环境的文本分类器[J]. 计算机应用研究,2007(2):116-119.
[5] 曾雪强,王明文,陈素芬.一种基于潜在语义结构的文本分类模型[J].华南理工大学学报:自然科学版, 2004,32(z1):99-102.
[6] 郭少友. 以文档为中心的上下文检索[D]. 北京:中国科学院研究生院,2007.
[7] Besancon R, Rajman M, Chappelier J C. Textual Similarities Based on a Distributional Approach[C]. The Tenth International Workshop on Database and Expert Systems Applications.Florence,Italy,1999:180-184.
[8] Cai L J, Hofmann T. Text Categorization by Boosting Automatically Extracted Concepts [EB/OL].[2007-11-22].http://www.iro.umontreal.ca/~kegl/ift3390/2006_1/Lectures/l08_TextCategorizationCaiHofmann.pdf.
[9] 李荣陆. 文本分类系统SVMCLS 2.0[EB/OL]. [2007-11-22]. http://www.nlp.org.cn/docs/docredirect.php?doc-id=1023. |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|