%A Lu Guoli,Wang Xiaohua,Wang Rongbo %T Text Clustering Research on the Max Term Contribution Dimension Reduction and Simulated Annealing Algorithm %0 Journal Article %D 2008 %J Data Analysis and Knowledge Discovery %R 10.11925/infotech.1003-3513.2008.12.08 %P 43-47 %V 24 %N 12 %U {https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/abstract/article_667.shtml} %8 2008-12-25 %X

This paper presents a new algorithm for text character extraction and dimension reduction based on the Max Term Contribution. Its main idea is computing the contribution of each term in the high dimension document-base and extracting the maximum contribution terms to construct a low dimension document-base from the high dimension document-base using the search algorithm. Then a modified K-means clustering method based on the Simulated Annealing (SA) is presented to cluster the low dimension document datum which is obtained by MTC. Finally, some experiments show that the new method can improve the cluster precision.