New Technology of Library and Information Service  2009, Vol. 25 Issue (5): 39-43    DOI: 10.11925/infotech.1003-3513.2009.05.08
Chinese Text Keywords Extraction Based on Fuzzy Processing
Zhang Hongying
(Adult Education College, Anhui University of Finance and Economics, Bengbu 233000, China)
Abstract  

This article studies keyword extraction algorithms and analyzes the factors that may influence extraction. After quantifying these factors, it proposes a complete framework for a keyword extraction model that comprises word segmentation and part-of-speech tagging, text preprocessing, a weighted linear algorithm, generation and filtering of word combinations, and combination of candidate keywords.
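
The abstract does not name the quantified factors or their weights, so the following is only a minimal sketch of what a weighted linear scoring step could look like. The factors (relative frequency, occurrence in the title, first position), the weights in `WEIGHTS`, the tag set in `KEEP_POS`, and the function names are all assumptions made for illustration, not the paper's method. The input is assumed to be already segmented and tagged; generation and filtering of word combinations is not covered.

```python
from collections import Counter

# Hypothetical weights; the abstract does not give the paper's values.
WEIGHTS = {"tf": 0.5, "in_title": 0.3, "position": 0.2}
# Assumed tag set: keep nouns ("n") and verbs ("v") as candidates.
KEEP_POS = {"n", "v"}


def score_candidates(tagged_tokens, title_words):
    """Score words with a linear weighted sum of assumed factors.

    tagged_tokens: list of (word, pos) pairs from an upstream segmenter/tagger.
    title_words: set of words that occur in the document title.
    """
    # Preprocessing: drop words whose part of speech is not in KEEP_POS.
    words = [w for w, pos in tagged_tokens if pos in KEEP_POS]
    if not words:
        return {}

    counts = Counter(words)
    max_count = max(counts.values())

    # Index of the first occurrence of each remaining word.
    first_index = {}
    for i, w in enumerate(words):
        first_index.setdefault(w, i)

    scores = {}
    for w, c in counts.items():
        tf = c / max_count                              # relative frequency in [0, 1]
        in_title = 1.0 if w in title_words else 0.0     # title occurrence flag
        position = 1.0 - first_index[w] / len(words)    # earlier words score higher
        scores[w] = (WEIGHTS["tf"] * tf
                     + WEIGHTS["in_title"] * in_title
                     + WEIGHTS["position"] * position)
    return scores


def top_keywords(scores, k=3):
    """Return the k highest-scoring words, ties broken alphabetically."""
    return sorted(scores, key=lambda w: (-scores[w], w))[:k]


if __name__ == "__main__":
    tagged = [("关键词", "n"), ("的", "u"), ("提取", "v"),
              ("关键词", "n"), ("算法", "n")]
    print(top_keywords(score_candidates(tagged, {"关键词"}), k=2))
```

Keeping every factor in the range 0 to 1 and letting the weights sum to 1 keeps the scores comparable across documents; that normalization is a design choice of this sketch, not something stated in the abstract.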

Key words: Text; Keyword; Extraction; Fuzzy processing
Received: 18 December 2008      Published: 25 May 2009
CLC number: TP393
Corresponding author: Zhang Hongying, E-mail: zhytsj@sina.com
About author: Zhang Hongying

Cite this article:

Zhang Hongying. Chinese Text Keywords Extraction Based on Fuzzy Processing. New Technology of Library and Information Service, 2009, 25(5): 39-43.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2009.05.08     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2009/V25/I5/39

