Research and Implementation of Chinese Web-text Clustering
Yang Xueming
(Network Center, Ningbo University, Ningbo 315211, China)
Abstract Automatic text clustering has been proposed and studied in practical applications. This paper proposes a text clustering framework that combines the hierarchical agglomerative clustering (HAC) and K-Means algorithms, and evaluates the framework experimentally.
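The abstract does not say how the two algorithms are combined. One common arrangement, assumed here purely for illustration, is to run HAC first to obtain initial centroids and then refine them with K-Means. The sketch below works on documents that have already been turned into numeric feature vectors; Chinese word segmentation and feature weighting are left out. All function names (`hac_centroids`, `kmeans`, `hac_then_kmeans`) and the choice of centroid linkage with Euclidean distance are hypothetical and are not taken from the paper.

```python
import math

def dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def centroid(points):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]

def hac_centroids(points, k):
    """Merge the two closest clusters (centroid linkage) until k remain.

    Assumption: centroid linkage is used only for brevity; the paper
    may use a different linkage criterion.
    """
    clusters = [[p] for p in points]
    while len(clusters) > k:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = dist(centroid(clusters[i]), centroid(clusters[j]))
                if best is None or d < best[0]:
                    best = (d, i, j)
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return [centroid(c) for c in clusters]

def kmeans(points, seeds, max_iter=100):
    """Standard K-Means iterations starting from the given seed centroids."""
    centroids = [list(c) for c in seeds]  # copy so the seeds are not mutated
    labels = None
    for _ in range(max_iter):
        new_labels = [
            min(range(len(centroids)), key=lambda c: dist(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for c in range(len(centroids)):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = centroid(members)
    return labels, centroids

def hac_then_kmeans(points, k, sample_size=None):
    """Seed K-Means with HAC centroids computed on a leading sample.

    Assumption: HAC is run on a prefix of the data to limit its
    quadratic cost; sample_size=None uses all points.
    """
    sample = points if sample_size is None else points[:sample_size]
    return kmeans(points, hac_centroids(sample, k))

# Toy vectors standing in for document feature vectors.
docs = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels, centers = hac_then_kmeans(docs, 2)
```

In this toy run the first two vectors share one label and the last two share the other. Seeding from HAC removes the dependence on random initialization that plain K-Means has, at the cost of a pairwise merge step that becomes expensive on large collections, which is why the sketch allows HAC to run on a sample only.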
Received: 14 September 2006
Published: 25 December 2006
Corresponding Authors:
Yang Xueming
E-mail: yangxueming@nbu.edu.cn
About author: Yang Xueming