A Chinese Reverse-order Directional Maximum Mathching Segmentation System Design Based Converse Dictionary
Zhang Liyi1,2 Li Yazi1
1 (School of Information Management, Wuhan University, Wuhan 430072,China) 2 (Center for Studies of Information Resources, Wuhan University, Wuhan 430072,China)
This paper introduces normal segmentation algorithms, and based on the improving Chinese converse dictionary and optimizing reverse-order directional maximum matching algorithm, designs a Chinese segmentation system. In the experiment, the speed and accuracy are improved obviously.
张李义,李亚子 . 基于反序词典的中文逆向最大匹配分词系统设计*[J]. 现代图书情报技术, 2006, 1(8): 42-45.
Zhang Liyi,Li Yazi . A Chinese Reverse-order Directional Maximum Mathching Segmentation System Design Based Converse Dictionary. New Technology of Library and Information Service, 2006, 1(8): 42-45.
1刘源. 信息处理用现代汉语分词规范及自动分词方法. 北京: 清华大学出版社, 1994
2Pak-kwong Wong, Chorkin Chan. Chinese Word Segmentation based on Maximum Matching and Word Binding Force.In:International Conference On Computational Linguistics Proceedings of the 16th conference on Computational linguistics.Copenhagen,Denmark,1996: 200-203
3ERIK HATCHER, OTIS GOSPODNETIC. Lucene In Action. America: Manning Publications Co.2005
4刘宏涛. 中文自动分词系统的设计模型. 计算机与数字工程,2005, 33 (4):138-140
5赵艳红,费洪晓. 一个基于改进的反序分词词典的中文分词算法. 深圳职业技术学院学报, 2004, 4 : 28-31
6HongLan Jin, Kam-Fai Wong. A Chinese Dictionary Construction Algorithm for Information Retrieval.In:ACM Transactions on Asian Language Information Processing. ACM Press,2002: 281-296
7刘开瑛. 中文文本自动分词和标注, 北京: 商务印书馆, 2000