|
|
An Improved Hierarchical Document Classification Method |
Tan Jinbo |
(Department of Educational Technology, Shandong Normal University, Jinan 250014,China) |
|
|
Abstract On large amount of document category quantity, hierarchical text classification is an effective approach. However, classification methods using the top-down approach suffer from blocking. To address the problem, this paper proposes an improved hierarchical classification method, namely restricted voting method. Our experiments using Rocchio classifiers on the elementary education subjects resource have shown that it can reduce blocking and improve the classification performance.
|
Received: 17 November 2006
Published: 25 February 2007
|
|
Corresponding Authors:
Tan Jinbo
E-mail: yttjb@163.com
|
About author:: Tan Jinbo |
1袁时金,李荣陆,周水庚,胡运法. 层次化中文文档分类. 通信学报,2004(11):55-63
2肖雪,何中市. 基于向量空间模型的中文文本层次分类方法研究. 计算机应用, 2006(5):1125-1126,1133
3朱华宇,孙正兴,张福炎. 一个基于向量空间模型的中文文本自动分类系统. 计算机工程, 2001(2):15-17,63
4高波,赵政. 文本层次分类系统的研究. 计算机工程与应用,2006(11):176-178
5Sun A,Lim E P,Ng W K,Srivastava J. Blocking reduction strategies in hierarchical text classification. IEEE Trans. on Knowledge and Data Eng,2004,16(10): 1305-1308
6Sun A,Lim E P. Hierarchical text classification and evaluation. In Proc. of 1st IEEE ICDM,2001 (11):521-528
7Dumais S T,Chen H. Hierarchical classification of Web content. In Proc. of 23rd ACM SIGIR,2000(7):256-263
8Greiner R,Grove A,Schuurmans D. On learning hierarchical classifications. http://citeseer.nj.nec.com/article/greiner97learning.html (Accessed Mar.5,2005)
9Larkey L S,Croft W B. Combining classifiers in text categorization. In Proc. of 19th ACM SIGIR,1996(8):289-297
10Li Y H,Jian A K. Classification of text documents. The Computer Journal,1998,41(8):537-546
11Sebastiani F. Machine learning in automated text categorization. ACM Computing Surveys,2002,34(1):1-47
12谭金波.基于Web的基础教育资源自动分类技术研究:[学位论文].南京:南京师范大学教育技术学院,2006. |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|