本文针对汉语构词的特殊性, 提出了一种单汉字标引的改进算法。该算法在标引上采用了对非检索词词首字的剔除标引, 在检索上, 采取检索词首字查找, 检索词直接匹配的算法。还提出了对检索结果的存储以及构筑后控词典, 以逐步完善单汉字检索系统。以上算法, 在SCIRS (Single Chinese Character Indexing and Retrieval System) 得到初步实现。
In view of specific characteristics of Chinese morphology,the paper proposes an improved algorithm of single Chinese character Indexing.In the indexing Module,it rejects the Character that can not be the first character of a keyword;in the retrieval module,it matches the keyword directly.The algorithm stores theresults of retrieval and constructs the psot-controlled vocabulary.The algorithm was implemented in the SCIRS (Single Chinese Character Indexing and Retrieval System).
收稿日期: 1996-12-23
出版日期: 1997-04-25
通讯作者:
王淼
作者简介: 王淼
引用本文:
王淼. 单汉字标引技术的改进研究[J]. 现代图书情报技术, 1997, 13(2): 48-53.
Wang Miao. Research on Improvement of Single Chinese Character Indexing Technique. New Technology of Library and Information Service, 1997, 13(2): 48-53.