New Technology of Library and Information Service  2008, Vol. 24 Issue (6): 61-66    DOI: 10.11925/infotech.1003-3513.2008.06.12
Research and Implementation of Related Articles Database Based on Vector Space Model
Yu Xitian  Wan Lili  Hu Tiejun  Li Danya
(Institute of Medical Information, Chinese Academy of Medical Sciences, Beijing 100020, China)
Abstract  

This paper introduces a Vector Space Model (VSM) based on lexicon-driven term extraction, and describes the design and implementation of a related articles database and retrieval system for Chinese biomedical engineering literature. It also reports a comparative study with a suffix-tree-based VSM for the same database.
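
The general idea of a lexicon-based VSM for related articles can be illustrated with a minimal Python sketch. It is not the authors' implementation: the lexicon, the sample texts, the forward maximum matching step, the TF-IDF weighting, and the function names (`extract_terms`, `tfidf_vectors`, `cosine`, `related_articles`) are all assumptions made for illustration, since the abstract does not specify them.

```python
import math
from collections import Counter

# Hypothetical lexicon and documents; the paper's real lexicon and corpus are not given.
LEXICON = {"stent", "coronary artery", "ultrasound", "imaging", "biomaterial"}
DOCS = [
    "drug-eluting stent for coronary artery disease",
    "coronary artery stent restenosis study",
    "ultrasound imaging of soft tissue",
]


def extract_terms(text, lexicon):
    """Forward maximum matching: at each position keep the longest lexicon term."""
    max_len = max(len(t) for t in lexicon)
    terms, i = [], 0
    while i < len(text):
        for n in range(min(max_len, len(text) - i), 0, -1):
            if text[i:i + n] in lexicon:
                terms.append(text[i:i + n])
                i += n
                break
        else:
            i += 1  # no lexicon term starts here; move one character on
    return terms


def tfidf_vectors(docs_terms):
    """Build sparse term vectors weighted by tf * (log(N / df) + 1) (assumed scheme)."""
    n_docs = len(docs_terms)
    df = Counter(t for terms in docs_terms for t in set(terms))
    return [
        {t: tf * (math.log(n_docs / df[t]) + 1.0) for t, tf in Counter(terms).items()}
        for terms in docs_terms
    ]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def related_articles(vectors, idx, top_k=2):
    """Rank every other document by similarity to document idx."""
    scores = [(j, cosine(vectors[idx], v)) for j, v in enumerate(vectors) if j != idx]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_k]


vectors = tfidf_vectors([extract_terms(d, LEXICON) for d in DOCS])
related = related_articles(vectors, 0)
print(related)
```

In this toy corpus the first two documents share the same lexicon terms, so the second document ranks first for the first one, while the ultrasound document shares no terms and scores zero. Character-level matching is used because Chinese text has no word delimiters; a production system would likely precompute and store the ranked lists, which is presumably the role of a related articles database.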

Key words: Related articles database; Term; Vector space model; Biomedical engineering literature
Received: 26 February 2008      Published: 25 June 2008
CLC Number: G354; TP391
Corresponding author: Yu Xitian     E-mail: yuxitian1234@163.com
About authors: Yu Xitian, Wan Lili, Hu Tiejun, Li Danya

Cite this article:

Yu Xitian, Wan Lili, Hu Tiejun, Li Danya. Research and Implementation of Related Articles Database Based on Vector Space Model. New Technology of Library and Information Service, 2008, 24(6): 61-66.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2008.06.12     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2008/V24/I6/61

