Please wait a minute...
New Technology of Library and Information Service  2010, Vol. 26 Issue (10): 43-48    DOI: 10.11925/infotech.1003-3513.2010.10.07
article Current Issue | Archive | Adv Search |
Research on Extraction of Hot Keywords
Cheng Xiao, Lu Bei, Chen Zhiqun
Institute of Computer Application Technology, Hangzhou Dianzi University, Hangzhou 310018, China
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

According to extraction of hot keywords in the multi-phase candidate keywords, the paper tries mass data process,determines the meaningless words based on the timing of statistical law, and proposes Union Variance (UV) concept. The HK (Hot Keywords) formula is constructed based on multi-feature fusion to achieve the extraction of hot keywords. Experimental results show that this method is efficient in the process of hot subject extraction.

Key wordsOnline      public      opinion      Chinese      word      segmentation      Keywords      Weighting      calculation     
Received: 16 August 2010      Published: 04 January 2011
: 

G353.1

 

Cite this article:

Cheng Xiao, Lu Bei, Chen Zhiqun. Research on Extraction of Hot Keywords. New Technology of Library and Information Service, 2010, 26(10): 43-48.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2010.10.07     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2010/V26/I10/43


[1] CNNIC发布《第26次中国互联网络发展状况统计报告》 . . http://research.cnnic.cn/html/1279173730d2350.html.

[2] 陆蓓,程肖,谌志群.互联网舆情挖掘研究述略
[J]. 情报资料工作 ,2010(2):41-45.

[3] 邱立坤,陶然,龙志炜,等.面向互联网的话题发现技术研究 . 见: 全国网络与信息安全技术研讨会论文集(下册) . 青岛:中国通信学会,2007:373-379.

[4] 李恒训,张华平,秦鹏,等.基于主题词的网络热点话题发现 . 见: 第五届全国信息检索学术会议论文集 .上海:中国中文信息学会,2009:134-143.

[5] Zhang H P, Liu Q, Yu H K, et al.Chinese Name Entity Recognition Using Role Model
[J]. International Journal of Computational Linguistics and Chinese Language Processing, 2003,8(2):29-60.

[6] 化柏林.知识抽取中的停用词处理技术
[J]. 现代图书情报技术 ,2007(8):48-51.

[7] 曾依灵,许洪波,白硕.网络文本主题词的提取与组织研究
[J]. 中文信息学报 ,2008,22(3):64-70,80.

[8] 刘星星,何婷婷,龚海军,等.网络热点事件发现系统的设计
[J]. 中文信息学报 ,2008,22(6):80-85.

[9] 陆蓓,程肖,谌志群.基于改进蚁群聚类的热点主题发现算法研究
[J]. 现代图书情报技术 ,2010(4):66-71.

[10] 丁伟莉,赵华,郑德权,等.中文Bolg热门话题检测与排序技术研究 . 见: 中国中文信息学会二十五周年学术会议论文集 . 北京:中国中文信息学会,2006:282-289.

[1] Wang Hanxue,Cui Wenjuan,Zhou Yuanchun,Du Yi. Identifying Pathogens of Foodborne Diseases with Machine Learning[J]. 数据分析与知识发现, 2021, 5(9): 54-62.
[2] Fan Tao,Wang Hao,Wu Peng. Sentiment Analysis of Online Users' Negative Emotions Based on Graph Convolutional Network and Dependency Parsing[J]. 数据分析与知识发现, 2021, 5(9): 97-106.
[3] Han Hui, Liu Xiuwen. Automatic Scoring for Subjective Questions in Maritime Competency Assessment[J]. 数据分析与知识发现, 2021, 5(8): 113-121.
[4] Zhang Jiandong, Chen Shiji, Xu Xiaoting, Zuo Wenge. Extracting PDF Tables Based on Word Vectors[J]. 数据分析与知识发现, 2021, 5(8): 34-44.
[5] Wang Qinjie, Qin Chunxiu, Ma Xubu, Liu Huailiang, Xu Cunzhen. Recommending Scientific Literature Based on Author Preference and Heterogeneous Information Network[J]. 数据分析与知识发现, 2021, 5(8): 54-64.
[6] Yu Xuehan, He Lin, Xu Jian. Extracting Events from Ancient Books Based on RoBERTa-CRF[J]. 数据分析与知识发现, 2021, 5(7): 26-35.
[7] Huang Mingxuan,Jiang Caoqing,Lu Shoudong. Expanding Queries Based on Word Embedding and Expansion Terms[J]. 数据分析与知识发现, 2021, 5(6): 115-125.
[8] Yin Pengbo,Pan Weimin,Zhang Haijun,Chen Degang. Identifying Clickbait with BERT-BiGA Model[J]. 数据分析与知识发现, 2021, 5(6): 126-134.
[9] Wang Xiwei,Jia Ruonan,Wei Yanan,Zhang Liu. Clustering User Groups of Public Opinion Events from Multi-dimensional Social Network[J]. 数据分析与知识发现, 2021, 5(6): 25-35.
[10] Cao Rui,Liao Bin,Li Min,Sun Ruina. Predicting Prices and Analyzing Features of Online Short-Term Rentals Based on XGBoost[J]. 数据分析与知识发现, 2021, 5(6): 51-65.
[11] Ma Yingxue,Zhao Jichang. Patterns and Evolution of Public Opinion on Weibo During Natural Disasters: Case Study of Typhoons and Rainstorms[J]. 数据分析与知识发现, 2021, 5(6): 66-79.
[12] Wang Nan,Li Hairong,Tan Shuru. Predicting of Public Opinion Reversal with Improved SMOTE Algorithm and Ensemble Learning[J]. 数据分析与知识发现, 2021, 5(4): 37-48.
[13] Yan Qiang,Zhang Xiaoyan,Zhou Simin. Extracting Keywords Based on Sememe Similarity[J]. 数据分析与知识发现, 2021, 5(4): 80-89.
[14] Lin Kerou,Wang Hao,Gong Lijuan,Zhang Baolong. Disambiguation of Chinese Author Names with Multiple Features[J]. 数据分析与知识发现, 2021, 5(4): 90-102.
[15] Zhang Qi,Jiang Chuan,Ji Youshu,Feng Minxuan,Li Bin,Xu Chao,Liu Liu. Unified Model for Word Segmentation and POS Tagging of Multi-Domain Pre-Qin Literature[J]. 数据分析与知识发现, 2021, 5(3): 2-11.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn