New Technology of Library and Information Service  2006, Vol. 1 Issue (5): 47-49    DOI: 10.11925/infotech.1003-3513.2006.05.12
Study of Scheme Automaton for Chinese Word Automatic Segmentation
Wu Shaogen
(Department of Computer Engineering, Guangdong Industry Technical College, Guangzhou 510300, China)
Abstract  

Building on the finite state automaton, this paper proposes a new variant named the Scheme Automaton. Based on this model, a new model for automatic Chinese word segmentation is designed, and its key data structure and construction algorithm are given. The complexity of the algorithm is then analyzed.
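
The abstract does not describe the Scheme Automaton itself, so the following is only a generic illustration of the kind of technique the keywords point to, not the author's model. It is a minimal sketch of dictionary-driven segmentation in which the current prefix plays the role of an automaton state and each transition is checked by binary search over a sorted word list. The function names (`has_prefix`, `is_word`, `segment`), the greedy longest-match strategy, and the toy word list are all assumptions made for this example.

```python
from bisect import bisect_left


def has_prefix(sorted_words, prefix):
    """True if some dictionary word starts with `prefix` (binary search)."""
    i = bisect_left(sorted_words, prefix)
    return i < len(sorted_words) and sorted_words[i].startswith(prefix)


def is_word(sorted_words, text):
    """True if `text` is itself a dictionary word (binary search)."""
    i = bisect_left(sorted_words, text)
    return i < len(sorted_words) and sorted_words[i] == text


def segment(sentence, sorted_words):
    """Greedy longest-match segmentation (an assumed strategy).

    The growing prefix sentence[pos:end] acts like an automaton state:
    it is extended one character at a time while some dictionary word
    still starts with it, and the longest complete word seen is kept.
    Characters that start no dictionary word are emitted on their own.
    """
    result = []
    pos = 0
    while pos < len(sentence):
        end = pos + 1
        longest = pos + 1  # fallback: a single character
        while end <= len(sentence) and has_prefix(sorted_words, sentence[pos:end]):
            if is_word(sorted_words, sentence[pos:end]):
                longest = end
            end += 1
        result.append(sentence[pos:longest])
        pos = longest
    return result


# Toy dictionary; the list must be sorted for bisect to work.
words = sorted(["研究", "研究生", "生命", "命", "起源"])
print(segment("研究生命起源", words))
```

Each state check costs one binary search, O(log n) string comparisons for a dictionary of n words, which is why the sorted-list layout matters in a design like this. Greedy matching can also pick the wrong split on ambiguous input, a limitation of this sketch rather than a claim about the paper's model.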

Key words: Chinese information processing; Chinese word segmentation; Scheme automaton; Binary search
Received: 07 February 2006      Published: 25 May 2006
CLC number: TP391
Corresponding Authors: Wu Shaogen     E-mail: bill3000@126.com
About author: Wu Shaogen

Cite this article:

Wu Shaogen. Study of Scheme Automaton for Chinese Word Automatic Segmentation. New Technology of Library and Information Service, 2006, 1(5): 47-49.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2006.05.12     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2006/V1/I5/47


[1] Feng Guoming, Zhang Xiaodong, Liu Suhui. DBLC Model for Word Segmentation Based on Autonomous Learning[J]. 数据分析与知识发现, 2018, 2(5): 40-47.
[2] Ni Weijian, Sun Haohao, Liu Tong, Zeng Qingtian. An Unsupervised Approach to Optimize Chinese Word Segmentation on Domain Literature[J]. 数据分析与知识发现, 2018, 2(2): 96-104.
[3] Zhang Yue, Wang Dongbo, Zhu Danhao. Segmenting Chinese Words from Food Safety Emergencies[J]. 数据分析与知识发现, 2017, 1(2): 64-72.
[4] Yufeng Duan, Sisi Huang. Information Extraction from Chinese Plant Species Diversity Description Text[J]. 现代图书情报技术, 2016, 32(1): 87-96.
[5] Yu Xincong, Li Honglian, Lv Xueqiang. Research on the Application of Hyponymy in the Enrollment Robot[J]. 现代图书情报技术, 2015, 31(12): 65-71.
[6] Zhang Jie, Zhang Haichao, Zhai Dongsheng. Research of the Word Segmentation for Chinese Patent Claims[J]. 现代图书情报技术, 2014, 30(9): 91-98.
[7] Deng Shasha, Zhang Pengzhu, Li Xinmiao. A Method for Network Opinion Modeling Based on Governmental Public Decision Domain[J]. 现代图书情报技术, 2012, (9): 69-74.
[8] Li Wenjiang, Chen Shiqin. Application of AIMLBot Intelligent Robot in Real-time Virtual Reference Service[J]. 现代图书情报技术, 2012, 28(7): 127-132.
[9] Jiang Hua, Su Xiaoguang. Chinese High-frequency Words Extraction Algorithm Without Thesaurus[J]. 现代图书情报技术, 2012, 28(6): 50-53.
[10] Shi Chongde, Wang Huilin. Research on Chinese Word Segmentation Optimization in Statistical Machine Translation[J]. 现代图书情报技术, 2012, 28(4): 29-34.
[11] Gu Jun, Wang Hao. Study on Term Extraction on the Basis of Chinese Domain Texts[J]. 现代图书情报技术, 2011, 27(4): 29-34.
[12] Xie Hui, Qin Jie, Hu Shuangshuang. The Study on the Duplicated Web Pages Detection Algorithm Based on the Keyword from User’s Submission[J]. 现代图书情报技术, 2008, 24(7): 43-46.
[13] Zhang Jinzhu, Zhang Dong, Wang Huilin. The Research of Character-Position-Based Chinese Word Segmentation[J]. 现代图书情报技术, 2008, 24(5): 39-43.
[14] Yao Xingshan. The Improvement in a Chinese Word Segmentation Based on Hash Algorism[J]. 现代图书情报技术, 2008, 24(3): 78-81.
[15] Zhang Chengzhi, Su Xinning. Recognition Mutually Exclusive Words for Information Retrieval[J]. 现代图书情报技术, 2007, 2(2): 44-48.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn