New Technology of Library and Information Service  2007, Vol. 2 Issue (4): 52-55    DOI: 10.11925/infotech.1003-3513.2007.04.13
A Design of Algorithm for Chinese Phrase Segmentation
Zhang Haiying
(Networks Center of Xiangfan University, Xiangfan 441053, China)
Abstract  

This paper analyses the shortcomings of existing segmentation algorithms and designs a new algorithm for Chinese phrase segmentation. By building a two-level index over a Chinese thesaurus, we obtain a highly efficient segmentation thesaurus that supports hashing on the first Chinese character of a string, followed by a full binary search. The new segmentation algorithm is built on this thesaurus.
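The two-level lookup described in the abstract might be sketched as follows. This is a minimal illustration, not the paper's implementation: the function names (`build_index`, `contains`, `segment`), the sample word list, the `max_len` limit, and the choice of forward maximum matching as the segmentation strategy are all assumptions, since the abstract does not specify them.

```python
from bisect import bisect_left
from collections import defaultdict


def build_index(words):
    """Level 1: hash on the first character. Level 2: sorted list of word tails."""
    buckets = defaultdict(set)
    for word in words:
        if word:
            buckets[word[0]].add(word[1:])
    return {head: sorted(tails) for head, tails in buckets.items()}


def contains(index, word):
    """Hash lookup on the first character, then binary search among the tails."""
    tails = index.get(word[0]) if word else None
    if not tails:
        return False
    tail = word[1:]
    pos = bisect_left(tails, tail)
    return pos < len(tails) and tails[pos] == tail


def segment(index, text, max_len=4):
    """Forward maximum matching (assumed strategy, not taken from the paper)."""
    result = []
    i = 0
    while i < len(text):
        # Try the longest candidate first and shrink until a word is found.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted as a fallback.
            if size == 1 or contains(index, candidate):
                result.append(candidate)
                i += size
                break
    return result


# Hypothetical word list, for illustration only.
index = build_index(["中文", "分词", "算法", "设计", "中文分词"])
print(segment(index, "中文分词算法设计"))  # ['中文分词', '算法', '设计']
```

Hashing on the first character narrows each lookup to the words sharing that character, so the binary search runs over a short sorted list rather than the whole thesaurus.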

Key words: Segmentation algorithm; Chinese segmentation
Received: 30 January 2007      Published: 25 April 2007
ZTFLH: G252.7; TP391
Corresponding Authors: Zhang Haiying     E-mail: xfu_www@126.com
About author: Zhang Haiying

Cite this article:

Zhang Haiying. A Design of Algorithm for Chinese Phrase Segmentation. New Technology of Library and Information Service, 2007, 2(4): 52-55.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2007.04.13     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2007/V2/I4/52

