Study on Machine Learning Based Automatic Text Categorization Model
Chen Lifu, Zhou Ning, Li Dan
(Information Management School, Wuhan University, Wuhan 430072, China)
Abstract: This article develops a theoretical model of machine learning based automatic text categorization, an approach that is widely used in text categorization tasks. First, a definition and an architecture model of text categorization are given. Next, the support vector machine (SVM) classifier is chosen as a typical example and analyzed in detail. Finally, the authors report performance results from a Chinese text categorization experiment.
Received: 20 June 2005
Published: 25 October 2005
Corresponding author:
Chen Lifu
E-mail: chinatoby@sina.com
About the authors: Chen Lifu, Zhou Ning, Li Dan