Please wait a minute...
New Technology of Library and Information Service  2010, Vol. 26 Issue (5): 50-57    DOI: 10.11925/infotech.1003-3513.2010.05.09
article Current Issue | Archive | Adv Search |
Design and Implementation of Chinese Thesis Retrieval System Based on XML
 Liu Dan
(Department of Information Management, Peking University, Beijing 100871, China)
Download: PDF(865 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  

The paper tries to apply the XML text retrieval methods to long text enviroment,and uses Chinese thesis as a dataset. It designs and implements XML tagging, indexing, keyword retrieval and structural retrieval on Chinese thesis, and finally constructs an XML-based Chinese thesis retrieval system.

Key wordsXML retrieval        Thesis retrieval        Chinese XML indexing        Chinese XML retrieval     
Received: 25 March 2010      Published: 25 May 2010
: 

G354.45

 
Corresponding Authors: Liu Dan     E-mail: liudan1987@gmail.com

Cite this article:

Liu Dan. Design and Implementation of Chinese Thesis Retrieval System Based on XML. New Technology of Library and Information Service, 2010, 26(5): 50-57.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2010.05.09     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2010/V26/I5/50

[1] INEX. INEX 2007[EB/OL].[2009-04-13]. http://inex.is.informatik.uni-duisburg.de/2007/.
[2] SourceForge. TopX Introduction[EB/OL].[2008-09-06]. http://topx.sourceforge.net/.
[3] Theobald M, Schenkel R, Weikum G. TopX and XXL at INEX 2005[C]. In: Proceedings of the 4th Workshop of the Initiative for the Evaluation of XML Retrieval.2005.
[4] Fuhr N, Gvert N, Groβjohann K. HyRex: Hyper-media Retrieval Engine for XML[C]. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2002:449-449.
[5] Fuhr N, Groβjohann K. XIRQL: A Query Language for Information Retrieval in XML Documents[C]. In: Proceedings of the 24th Annual International ACM SIGIR Conference. 2001: 172-180.
[6] Guo L, Shao F, Botev C,et al. XRANK: Ranked Keyword Search over XML Documents[C]. In: Proceedings of the 2003 ACM SIGMOD International Conference on Management of Data.2003: 16-27.
[7] Lu W, Liu D, Fang F,et al. WHU-XML: An XML Based Digital Library System[C].In: Proceedings of the 2008 IEEE International Symposium on IT in Medicine and Education.2008: 380-384.
[8] 刘丹,孔少华,陆伟. XML检索研究综述[J]. 现代图书情报技术,2010(4):24-34.
[9] Kazai G, Doucet A. Book Search 2007: INEX 2007 Book Search Track Overview[C]. In: Proceedings of the 6th Initiative on the Evaluation of XML Retrieval Workshop. Berlin: Springer, 2008: 148-161.
[10] 中国国家图书馆. 博士论文[EB/OL].[2009-04-13]. http://202.96.31.42:9080/doctor/index.htm.
[11] 中国知网. 中国知识资源总库—CNKI系列数据库[EB/OL].[2009-04-13]. http://cnki1.lib.whu.edu.cn/kns50/index.aspx.
[12] Amer-Yahia S, Lalmas M. XML Search: Languages, INEX and Scoring[J]. SIGMOD Record, 2006,35(4): 16-23.
[13] 陆伟. 元素级XML检索模型构建的关键问题与解决方案研究[J].中国图书馆学报,2007,33(6):58-61.
[14] 陆伟. 基于传统文本检索系统的XML索引实现研究[J]. 情报学报,2006,25(6):679-685.
[15] Robertson S, Zaragoza H, Taylor M. Simple BM25 Extension to Multiple Weighted Fields[C]. In: Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 2004: 42-49.
[16] Lu W, Robertson S, Macfarlane A. Field-Weighted XML Retrieval Based on BM25[C]. In: Proceedings of the 4th Initiative on the Evaluation of XML Retrieval Workshop. Berlin: Springer, 2006: 126-137.
[17] 刘丹, 陆伟, 张宓. XML结构化检索研究及实现[J]. 现代图书情报技术,2009(3):52-56.

[1] Liu Dana,Lu Wei,Zhang Mi. Research and Implementation of Structural XML Retrieval[J]. 现代图书情报技术, 2009, 3(3): 52-56.
[2] Huang Shuiqing,Zhu Shumei. Design and Application of Full Text Retrieval Tool on Unified Open Access Resource Platform[J]. 现代图书情报技术, 2008, 24(7): 7-12.
[3] Xing Min,Sun Shupeng. The Development of the Full-text Database on Network with Minisis System ——The Construction of the Full-text Database on Network in the Library of Petroleum University[J]. 现代图书情报技术, 2001, 17(1): 78-79.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn