Most information institutions provide retrieval services for literature but not for the tables it contains. To address this gap, this paper proposes a table retrieval algorithm based on the Vector Space Model (VSM). It discusses table feature extraction, term weight assignment, and search result ranking, providing a theoretical basis for future table retrieval services.
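The abstract does not give the paper's feature extraction or weighting formulas, so the following is only a minimal sketch of generic VSM ranking applied to tables, not the authors' algorithm. It assumes each table has already been flattened into a single string (caption, headers, and cell text joined by spaces), uses a standard smoothed TF-IDF weight, and ranks by cosine similarity. The function names (`tokenize`, `build_idf`, `tfidf_vector`, `cosine`, `rank_tables`) and the sample tables are hypothetical, introduced here for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: whitespace tokenization of the flattened table text.
    return text.lower().split()


def build_idf(docs_tokens):
    # Smoothed inverse document frequency over the table collection.
    n = len(docs_tokens)
    df = Counter()
    for tokens in docs_tokens:
        df.update(set(tokens))
    return {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}


def tfidf_vector(tokens, idf):
    # Sparse term -> weight mapping; terms unseen in the collection are dropped.
    tf = Counter(tokens)
    return {t: f * idf[t] for t, f in tf.items() if t in idf}


def cosine(u, v):
    # Cosine similarity between two sparse vectors stored as dicts.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_tables(query, tables):
    # Returns (score, table_index) pairs, best match first.
    docs = [tokenize(t) for t in tables]
    idf = build_idf(docs)
    query_vec = tfidf_vector(tokenize(query), idf)
    scored = [(cosine(query_vec, tfidf_vector(d, idf)), i)
              for i, d in enumerate(docs)]
    return sorted(scored, key=lambda s: (-s[0], s[1]))


# Hypothetical flattened tables (caption + header text).
tables = [
    "population by city year 2009",
    "annual rainfall by region",
    "city population growth rate",
]
ranked = rank_tables("city population", tables)
```

In this sketch a table that shares no terms with the query scores zero, and among matching tables the shorter one ranks higher because cosine similarity normalizes by vector length. A table-specific scheme would likely weight caption, header, and cell terms differently, which this sketch does not attempt.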
Wang Kai, Wang Chaofei. A Table Retrieval Algorithm Based on the Vector Space Model[J]. New Technology of Library and Information Service (现代图书情报技术), 2010, 26(4): 41-45.