New Technology of Library and Information Service  2010, Vol. 26 Issue (4): 41-45    DOI: 10.11925/infotech.1003-3513.2010.04.07
A Table Retrieval Algorithm Based on the Vector Space Model
Wang Kai, Wang Chaofei
(China Defense Science and Technology Information Center, Beijing 100142, China)
Abstract  

Most information institutions provide search services for literature but not for the tables it contains. To address this gap, this paper proposes a table retrieval algorithm based on the Vector Space Model (VSM). It discusses table feature extraction, term weighting, and search result ranking, which together provide a theoretical basis for future table retrieval services.
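The three steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the region names (`caption`, `header`, `cell`), the `REGION_WEIGHTS` values, whitespace tokenization, and the TF-IDF and cosine formulas are all assumptions chosen as a standard VSM baseline, since this record does not state the paper's actual formulas.

```python
import math
from collections import Counter

# Assumed region weights: a term in a caption counts more than one in a cell.
# These values are illustrative only and do not come from the paper.
REGION_WEIGHTS = {"caption": 3.0, "header": 2.0, "cell": 1.0}


def table_term_counts(table):
    """Feature extraction: region-weighted term counts for one table.

    `table` maps a region name to the text found in that region.
    """
    counts = Counter()
    for region, text in table.items():
        weight = REGION_WEIGHTS.get(region, 1.0)
        for term in text.lower().split():
            counts[term] += weight
    return counts


def build_index(tables):
    """Term weighting: one TF-IDF vector per table, plus the IDF map."""
    counts = [table_term_counts(t) for t in tables]
    n = len(tables)
    # Document frequency: number of tables in which each term occurs.
    df = Counter(term for c in counts for term in c)
    idf = {term: math.log(n / d) + 1.0 for term, d in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in c.items()} for c in counts]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def search(query, vectors, idf):
    """Ranking: (table_index, score) pairs, best first, zero scores dropped."""
    q_counts = Counter(query.lower().split())
    q_vec = {t: c * idf[t] for t, c in q_counts.items() if t in idf}
    scored = [(i, cosine(q_vec, v)) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return [(i, s) for i, s in scored if s > 0.0]
```

Calling `build_index` on a list of region-to-text dictionaries and then `search` with a query string returns the matching table indices in descending order of similarity. Weighting by region is one simple way to let a caption match outrank a match buried in a data cell.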

Key words: VSM; Table retrieval; Character region; Character term
Received: 20 January 2010      Published: 25 April 2010
CLC Number: TP319

 
Corresponding Authors: Wang Kai     E-mail: wangkaiabc@163.com

Cite this article:

Wang Kai, Wang Chaofei. A Table Retrieval Algorithm Based on the Vector Space Model. New Technology of Library and Information Service, 2010, 26(4): 41-45.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2010.04.07     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2010/V26/I4/41

