|
|
Design and Implementation of Medical Academic Information Automatic Gathering System |
Wu Haidong, He Xiaoyang, Zhang Jingli |
Third Military Medical University Library, Chongqing 400038, China |
|
|
Abstract [Objective] Aiming at Chinese news of medical research literature published on top journals, design an automatic gathering system which can gather news from different medical news websites, extract content and keywords, realize the subject classification and journal navigation. [Context] Provide information source of foreign academic research for active push and subject services. [Methods] Using HttpClient & HtmlParser to build Web-page collector, realize the news list page and content acquisition. Using IK Analyzer 2012 and MeSH to realize medical keywords extraction and subject classification. [Results] The system achieves automatic gathering, keyword extraction and subject classification of specified website news. [Conclusions] Librarians can use this system to provide effective medical academic information push service for medicine researchers.
|
Received: 27 March 2014
Published: 18 December 2014
|
|
[1] 王涛. 基于HTML标记的主题爬行器的设计与实现[D]. 成
都: 电子科技大学, 2009. (Wang Tao. Design and Implementation of Topic Focused Crawler Based on HTML Tags [D]. Chengdu: University of Electronic Science and Technology of China, 2009.)
[2] 贺苏伟. 教育新闻采集系统的设计与实现[D]. 广州: 华南理工大学, 2012.(He Suwei. The Design and Implementation of Education New Collection System [D]. Guangzhou: South China University of Technology, 2012.)
[3] 韩朝阳. 基于Web的动态语料库构建——以中国政治新闻语料库建库为例[J]. 中国教育技术装备, 2013(23): 66-68. (Han Zhaoyang. Construction of Dynamic Corpus Based on Web: An Example of China Political News Corpus [J]. China Educational Technology Equipment, 2013(23): 66-68.)
[4] 张春元, 康耀红, 伍小芹. Web新闻自动采集发布系统的设计与实现[J]. 计算机技术与发展, 2009(9): 250-253. (Zhang Chunyuan, Kang Yaohong, Wu Xiaoqin. Design and Implementation of Web News Automatically Gathering and Publishing System [J]. Computer Technology and Development, 2009(9): 250-253.)
[5] 陈建国. 基于Web结构的网站新闻采集系统的设计与实现[J]. 井冈山大学学报: 自然科学版, 2012, 33(2): 54-57. (Chen Jianguo. Design and Implementation of News Gathering System Based on Web Structure [J]. Journal of Jinggangshan University: Natural Science, 2012, 33(2): 54-57.)
[6] 钱爱兵, 江岚. 基于标题的中文新闻网页自动分类[J]. 现代图书情报技术, 2008(10): 59-68. (Qian Aibing, Jiang Lan. Automatic Classification Based on News Titles for Chinese News Web Pages [J]. New Technology of Library and Information Service, 2008(10): 59-68.) |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|