Design and Implementation of Chinese Words Dictionary Segmentation Module Based on Lucene
Xiang Hui¹, Guo Yiping², Wang Liang¹
¹ (Department of Control Science and Engineering, Huazhong University of Science and Technology, Wuhan 430074, China) ² (Huazhong University of Science and Technology Library, Wuhan 430074, China)
This paper describes the structure of the language analyzer in Lucene, then designs and implements a Chinese word segmentation module that uses the forward maximum matching (FMM) algorithm. The module allows a Lucene-based search engine to process Chinese text accurately and efficiently.
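As an illustration of the forward maximum matching idea named in the abstract, the following is a minimal sketch and not the authors' implementation. It uses no Lucene API; the function name `fmm_segment`, the `max_len` parameter, and the tiny sample dictionary are all assumptions made for the example. At each position the longest dictionary word is taken, and a single character is emitted when nothing matches.

```python
def fmm_segment(text, dictionary, max_len=None):
    """Segment text by forward maximum matching against a set of words.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    if max_len is None:
        # Longest dictionary entry bounds the match window (at least 1).
        max_len = max(1, max((len(w) for w in dictionary), default=1))
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking one character at a time.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            # A single character is always accepted as a fallback token.
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Assumed sample dictionary, chosen to show the greedy behaviour.
words = {"研究", "研究生", "生命", "起源"}
print(fmm_segment("研究生命起源", words))  # ['研究生', '命', '起源']
```

The sample output shows the known weakness of greedy matching: the longer entry "研究生" is preferred over "研究", so "生命" is never found. Dictionary-based segmenters commonly pair FMM with a backward pass or other disambiguation for this reason.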
向晖, 郭一平, 王亮. 基于Lucene的中文字典分词模块的设计与实现[J]. 现代图书情报技术, 2006, 1(8): 46-50.
Xiang Hui, Guo Yiping, Wang Liang. Design and Implementation of Chinese Words Dictionary Segmentation Module Based on Lucene[J]. New Technology of Library and Information Service, 2006, 1(8): 46-50.