Please wait a minute...
New Technology of Library and Information Service  2008, Vol. 24 Issue (6): 61-66    DOI: 10.11925/infotech.1003-3513.2008.06.12
Current Issue | Archive | Adv Search |
Research and Implementation of Related Articles Database Based on Vector Space Model
Yu Xitian  Wan Lili  Hu Tiejun  Li Danya
(Institute of Medical Information, Chinese Academy of Medical Sciences, Beijing 100020, China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

In this paper, a Vector Space Model (VSM) based on terms extraction with lexicon is introduced, and the related articles database and retrieval system of Chinese biomedical engineering literatures is designed and implemented. In addition, a comparison study on VSM based on suffix tree for the database is conducted.

Key wordsRelated articles database      Term      Vector space model      Biomedical engineering literatures     
Received: 26 February 2008      Published: 25 June 2008
: 

G354 

 
  TP391

 
Corresponding Authors: Yu Xitian     E-mail: yuxitian1234@163.com
About author:: Yu Xitian,Wan Lili,Hu Tiejun,Li Danya

Cite this article:

Yu Xitian,Wan Lili,Hu Tiejun,Li Danya. Research and Implementation of Related Articles Database Based on Vector Space Model. New Technology of Library and Information Service, 2008, 24(6): 61-66.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2008.06.12     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2008/V24/I6/61

[1] 王家钺.信息检索中“相关性概念”的研究[J].现代外语,2001,24(2):181-191.
[2] 孙建军,成颖.基于信息检索交互模型的相关性研究[J].中国图书馆学报,2005,31(1):41-45,70.
[3] Cooper W S. A Definition of Relevance for Information Retrieval[J]. Information Storage and Retrieval, 1971,7(1):19-37.
[4] Mizzaro S. Relevance:The Whole History[J]. Journal of the American Society of Information Science, 1997,48(9):810-832.
[5] 赖茂生,赵丹群,韩圣龙,等.计算机情报检索[M].北京:北京大学出版社,1993.
[6] 赖茂生.科技文献检索[M].第2版.北京:北京大学出版社,1994.
[7] 李军莲.PubMed检索系统的文献相关性判定研究及应用设想[D].北京:中国协和医科大学,2001.
[8] 徐莉,胡铁军.建立中国生物医学文献相关性数据库的探讨[D].北京:中国协和医科大学,2002.
[9] 王闰强,胡铁军.中国生物医学文献相关性数据库建设及应用研究[D].北京:中国协和医科大学,2003.
[10] 包金龙.基于向量空间模型的信息检索系统的设计[J].情报杂志,2005,24(7):44-45,49.
[11] 邱宇红,郭继军.向量空间模型在医学文献相关性研究中的应用[J].现代图书情报技术,2007(7):63-67.
[12] 万莉莉,胡铁军.中国生物医学工程文献相关性数据库建设研究[D].北京:中国协和医科大学,2007.
[13] 刘斌,陈桦.向量空间模型信息检索技术讨论[J].情报杂志,2006,25(7):91-93.
[14] 任慧玲,胡铁军,李丹亚,等.中文期刊文献数字对象唯一标识符的研究[J].情报学报,2004,23(4):437-443.
[15] 刘春艳,胡铁军.PubMed生物医学工程文献数据挖掘[D].北京:中国协和医科大学,2005.
[16] Wilbur W J, Yang Y. An Analysis of Statistical Term Strength and Its Use in the Indexing and Retrieval of Molecular Biology Texts[J]. Computers in Biology and Medicine, 1996,26(3): 209-222.
[17] Buckley C,  Lewit A F. Optimization of Inverted Vector Searches[C].In: Proceedings of The 8th Annual International ACMSIGIR Conference on Research and Development in Information Retrieval, Montreal, 1985:97-110.
[18] Lucarella D. A Document Retrieval System Based on Nearest Neighbor Searching[J]. Journal of Information Science, 1988,14(1):25-33.
[19] Salton G, Fox E, Wu H. Extended Boolean Information Retrieval[J]. Communication of the ACM, 1983,26:1022-1036.
[20] 郭莉,张吉,谭建龙.基于后缀树模型的文本实时分类系统的研究和实现[J].中文信息学报,2005,19(5):16-23.

[1] Cao Rui,Liao Bin,Li Min,Sun Ruina. Predicting Prices and Analyzing Features of Online Short-Term Rentals Based on XGBoost[J]. 数据分析与知识发现, 2021, 5(6): 51-65.
[2] Cheng Bin,Shi Shuicai,Du Yuncheng,Xiao Shibin. Keyword Extraction for Journals Based on Part-of-Speech and BiLSTM-CRF Combined Model[J]. 数据分析与知识发现, 2021, 5(3): 101-108.
[3] Li Keyu,Wang Hao,Gong Lijuan,Tang Huihui. Measurement and Distribution of Index Quality in Research Topics from Academic Databases[J]. 数据分析与知识发现, 2020, 4(6): 91-108.
[4] Xiong Xin,Wang Hao,Zhang Haichao,Zhang Baolong. Impacts of Chinese Term Granularity on Measuring Term Discriminative Capacity[J]. 数据分析与知识发现, 2020, 4(2/3): 143-152.
[5] Liu Liu,Qin Tianyun,Wang Dongbo. Automatic Extraction of Traditional Music Terms of Intangible Cultural Heritage[J]. 数据分析与知识发现, 2020, 4(12): 68-75.
[6] Li Jiaquan,Li Baoan,You Xindong,Lü Xueqiang. Computing Similarity of Patent Terms Based on Knowledge Graph[J]. 数据分析与知识发现, 2020, 4(10): 104-112.
[7] Liu Jingru,Song Yang,Jia Rui,Zhang Yipeng,Luo Yong,Ma Jingdong. A BiLSTM-CRF Model for Protected Health Information in Chinese[J]. 数据分析与知识发现, 2020, 4(10): 124-133.
[8] Peiyao Zhang,Dongsu Liu. Topic Evolutionary Analysis of Short Text Based on Word Vector and BTM[J]. 数据分析与知识发现, 2019, 3(3): 95-101.
[9] Zhiyong Tao,Xiaobing Li,Ying Liu,Xiaofang Liu. Classifying Short Texts with Improved-Attention Based Bidirectional Long Memory Network[J]. 数据分析与知识发现, 2019, 3(12): 21-29.
[10] Jia Longjia,Zhang Bangzuo. Classifying Topics of Internet Public Opinion from College Students: Case Study of Sina Weibo[J]. 数据分析与知识发现, 2018, 2(7): 55-62.
[11] Ni Weijian,Sun Haohao,Liu Tong,Zeng Qingtian. An Unsupervised Approach to Optimize Chinese Word Segmentation on Domain Literature[J]. 数据分析与知识发现, 2018, 2(2): 96-104.
[12] Liang Xiaobei,Xu Zhen,Li Jingjing. Impacts of Landlords on Tenants of Short-term Rentals[J]. 数据分析与知识发现, 2018, 2(11): 46-53.
[13] Bai Rujiang,Leng Fuhai,Liao Junhua. An Improved Cosine Text Similarity Computing Method Based on Semantic Chunk Feature[J]. 数据分析与知识发现, 2017, 1(6): 56-64.
[14] Wang Miping,Wang Hao,Deng Sanhong,Wu Zhixiang. Extracting Chinese Metallurgy Patent Terms with Conditional Random Fields[J]. 现代图书情报技术, 2016, 32(6): 28-36.
[15] Jiang Lin,Wang Dongbo. Automatic Extraction of Domain Terms Using Continuous Bag-of-Words Model[J]. 现代图书情报技术, 2016, 32(2): 9-15.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn