In this paper, a term context vector is used to represent the relation between a term and its context terms. Based on term context vectors, class feature vectors of a classifier, and the document vector of the document to be classified are generated, and then the document is classified. The experiment shows that adding term context relations into class feature vector and document vector can improve the classification effect.
郭少友. 基于词语上下文关系的文本自动分类方法研究[J]. 现代图书情报技术, 2008, 24(5): 44-49.
Guo Shaoyou. Research on Automatic Classification Based on Term Context Relations. New Technology of Library and Information Service, 2008, 24(5): 44-49.
[1] Wang Y. Incorporating Semantic and Syntactic Information into Document Representation for Document Clustering[D]. Mississippi:Mississippi State University,2005.
[2] Billhardt H, Borrajo D, Maojo V. Using Term Co-occurrence Data for Document Indexing and Retrieval[C]. In:Proceedings of the BCSIRSG 22nd Annual Colloquium on Information Retrieval Research, 2000:105-117.
[3] 何中市,刘里. 基于上下文关系的文本分类特征描述方法[J]. 计算机科学,2007,34(5):183-186.
[4] 孙晓霞,郑玉明,廖湖声. 一种基于特征词句子环境的文本分类器[J]. 计算机应用研究,2007(2):116-119.
[5] 曾雪强,王明文,陈素芬.一种基于潜在语义结构的文本分类模型[J].华南理工大学学报:自然科学版, 2004,32(z1):99-102.
[6] 郭少友. 以文档为中心的上下文检索[D]. 北京:中国科学院研究生院,2007.
[7] Besancon R, Rajman M, Chappelier J C. Textual Similarities Based on a Distributional Approach[C]. The Tenth International Workshop on Database and Expert Systems Applications.Florence,Italy,1999:180-184.
[8] Cai L J, Hofmann T. Text Categorization by Boosting Automatically Extracted Concepts [EB/OL].[2007-11-22].http://www.iro.umontreal.ca/~kegl/ift3390/2006_1/Lectures/l08_TextCategorizationCaiHofmann.pdf.
[9] 李荣陆. 文本分类系统SVMCLS 2.0[EB/OL]. [2007-11-22]. http://www.nlp.org.cn/docs/docredirect.php?doc-id=1023.