Please wait a minute...
New Technology of Library and Information Service  2006, Vol. 1 Issue (8): 42-45    DOI: 10.11925/infotech.1003-3513.2006.08.09
Current Issue | Archive | Adv Search |
A Chinese Reverse-order Directional Maximum Mathching Segmentation System Design Based Converse Dictionary
Zhang Liyi1,2    Li Yazi1
1 (School of Information Management, Wuhan University, Wuhan 430072,China)
2 (Center for Studies of Information Resources, Wuhan University, Wuhan 430072,China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

 This paper introduces normal segmentation algorithms, and based on the improving Chinese converse dictionary and optimizing reverse-order directional maximum matching algorithm, designs a Chinese segmentation system. In the experiment, the speed and accuracy are improved obviously.

Key wordsReverse-order dictionary      Maximum matching      Reverse maximum matching      Auto segmentation     
Received: 25 May 2006      Published: 25 August 2006
: 

G254

 
Corresponding Authors: Zhang Liyi     E-mail: 8982632@163.com
About author:: Zhang Liyi,Li Yazi

Cite this article:

Zhang Liyi,Li Yazi . A Chinese Reverse-order Directional Maximum Mathching Segmentation System Design Based Converse Dictionary. New Technology of Library and Information Service, 2006, 1(8): 42-45.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2006.08.09     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2006/V1/I8/42

1刘源. 信息处理用现代汉语分词规范及自动分词方法. 北京: 清华大学出版社, 1994
2Pak-kwong Wong, Chorkin Chan. Chinese Word Segmentation based on Maximum Matching and Word Binding Force.In:International Conference On Computational Linguistics Proceedings of the 16th conference on Computational linguistics.Copenhagen,Denmark,1996: 200-203
3ERIK HATCHER, OTIS GOSPODNETIC. Lucene In Action. America: Manning Publications Co.2005
4刘宏涛. 中文自动分词系统的设计模型. 计算机与数字工程,2005, 33 (4):138-140
5赵艳红,费洪晓. 一个基于改进的反序分词词典的中文分词算法. 深圳职业技术学院学报, 2004, 4 : 28-31
6HongLan Jin, Kam-Fai Wong. A Chinese Dictionary Construction Algorithm for Information Retrieval.In:ACM Transactions on Asian Language Information Processing. ACM Press,2002: 281-296
7刘开瑛. 中文文本自动分词和标注, 北京: 商务印书馆, 2000

[1] Gu Jun, Wang Hao. Study on Term Extraction on the Basis of Chinese Domain Texts[J]. 现代图书情报技术, 2011, 27(4): 29-34.
[2] Mai Fanjin,Wang Ting. Sense Disambiguation of Chinese Segmentation Based on Bi-direction Matching Method and HMM[J]. 现代图书情报技术, 2008, 24(8): 37-41.
[3] Hua Bolin,Zhao Liang. Nested Vector Segmentation Technique in Knowledge Extraction[J]. 现代图书情报技术, 2007, 2(7): 50-53.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn