This paper adopts cluster analysis method to discuss and analyze the features of Chinese characters,in order to discover the internal rules. Based on the clustering feature of Chinese characters,it refines the matching result of string matching,and advances a 2-level similarity model. The experiment result shows that this model can reflect the similarity better.
王静婷. 基于汉字聚类特征的中文字符串相似度计算研究[J]. 现代图书情报技术, 2011, 27(2): 48-53.
Wang Jingting. Research Towards Chinese String Similarity Based on the Clustering Feature of Chinese Characters. New Technology of Library and Information Service, 2011, 27(2): 48-53.
[12] Cohen W W, Ravikumar P, Fienberg S E.A Comparison of String Distance Metrics for Name-Matching Tasks . In: Proceedings of IJCAI-03 Workshop on Information Integration on the Web (IIWeb-03).2003:73-78.