Based on a classed large-scale corpus labeled with key words, we can get words’ subject degree by statistic method according to domain erperts knowledge. The paper constructs a key words auto-indexing system, and introduces detailedly its overall process and functional module.
刘华. 关键词自动标引系统实现[J]. 现代图书情报技术, 2006, 1(2): 88-90.
Liu Hua. Construction of a Key Words Auto-Indexing System. New Technology of Library and Information Service, 2006, 1(2): 88-90.
1Yuen-Hsien Tseng, Fast Keyword Extraction of Chinese Documents in a Web Environment, to appear in Information Retrieval Workshop for Asia Languages - 1997
2王明燕.基于Web页面的关键词与关键概念提取技术,北京:北京工业大学硕士论文,2003
3叶志清等.文献信息计算机全文全自动标引方法.情报学报.2003,22(2):169-172
4杨文峰,李星.基于PAT-TREE统计语言模型与关键词自动提取.计算机工程与应用.2001(15):17-20
5王兰成.基于XMARC信息描述的知识标引与概念检索研究,上海:东华大学博士学位论文,2003
6陈克利.基于大规模真实文本的平衡语料分析与文本分类方法.Advances in Computation of Oriental Languages.北京:清华大学出版社,2003
7丁璇,侯汉清,章成志.中文网页标引源主题表达能力的调查统计.大学图书馆学报.2002(6):70-72