|
|
Chinese Text Keywords Extraction Based on Fuzzy Processing |
Zhang Hongying |
(Adult Education College,Anhui University of Finance and Economics, Bengbu 233000,China) |
|
|
Abstract This article studies algorithms of keywords extraction and analyzes factors that may influence the extraction. Based on the quantification of these factors, this paper proposes the complete framework of a model that includes word segmentation and part-of-speech tagging, text pre-treatment, weighted linear algorithm, generation and filtering of word combination, and combination of candidate keywords.
|
Received: 18 December 2008
Published: 25 May 2009
|
|
Corresponding Authors:
Zhang Hongying
E-mail: zhytsj@sina.com
|
About author:: Zhang Hongying |
[1] Luhn H P.A Statistical Approach to Mechanized Encoding and Searching of Literary Information[J].IBM Journal of Research and Development,1957,1(4):309-317.
[2] 张敏, 耿焕同, 王煦法. 一种利用BC 方法的关键词自动提取算法研究[J]. 小型微型计算机系统, 2007(6):189-192.
[3] 刘华. 基于文本分类中特征提取的领域词语聚类 [J]. 语言文字应用,2007(1):139 - 144.
[4] 方清华. 信息检索加权理论与技术:基于VSM模型的分析[J]. 情报杂志, 2008(6):73-76.
[5] 王灿辉,张敏,马少平,等. 基于相邻词的中文关键词自动抽取[J].广西师范大学学报(自然科学版), 2007(2):161-164.
[6] 索红光, 刘玉树, 曹淑英. 一种基于词汇链的关键词抽取方法[J]. 中文信息学报, 2006(6):25-30.
[7] Li S J,Wang H F,Yu S W,et al.Research on Maximum Entropy Model for Keyword Indexing[J].Chinese Journal of Computers,2004,27(9):1192-1197. |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|