Please wait a minute...
New Technology of Library and Information Service  2009, Vol. 3 Issue (1): 22-27    DOI: 10.11925/infotech.1003-3513.2009.01.05
Current Issue | Archive | Adv Search |
Analysis of Retrieval System Architecture in Web Archive
Wu ZhenxinXiang Jing1,2
1(National Science Library, Chinese Academy of Science,Beijing 100190,China)
2(Graduate University of Chinese Academy of Sciences,Beijing 100049,China)
Download: PDF (1083 KB)  
Export: BibTeX | EndNote (RIS)      
Abstract  

Based on existing Web Archive projects,this paper finishes a preliminary analysis of retrieval system architecturethat are applied by these projects and how they cope with the challenge which is to search infomation in massive data collection. From the view of system architecture, it discusses archive retrieval system performance and efficiency and wish to provide some references to the relevantinstitutes and researchers.

Key wordsWeb Archive      Retrieval system      System architecture     
Received: 28 September 2008      Published: 25 January 2009
ZTFLH: 

G202

 
Corresponding Authors: Wu Zhenxin     E-mail: wuzx@mail.las.ac.cn
About author:: Wu Zhenxin,Xiang Jing

Cite this article:

Wu Zhenxin,Xiang Jing. Analysis of Retrieval System Architecture in Web Archive. New Technology of Library and Information Service, 2009, 3(1): 22-27.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2009.01.05     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2009/V3/I1/22

[1] Pandora[EB/OL].[2008-07-27].http://pandora.nla.gov.au/.
[2] Government of Canada Web Archive[OB/OL].[2008-08-10].http://www.collectionscanada.gc.ca/whats-new/013-315-e.html.
[3] e-Depot[EB/OL].[2008-04-07].http://www.kb.nl/dnp/e-depot/e-depot-en.html.
[4] Tumba[EB/OL].[2008-07-27].http://www.tumba.pt.
[5] LOCKSS[EB/OL].[2008-06-02].http://www.lockss.org/.
[6] Web Infomall[EB/OL].[2008-09-24].http://www.infomall.cn/.
[7] NWA Toolset[EB/OL].[2008-05-06].http://nwatoolset.sourceforge.net/.
[8] Wayback [EB/OL].[2007-11-03]. http://archive-access.sourceforge.net/projects/wayback/.
[9] WERA[EB/OL].[2007-11-03]. http://archive-access.sourceforge.net/projects/wera/.
[10] Xing[EB/OL].[2007-11-03].http://www.nla.gov.au/xinq/.
[11]赵江华.“天网”高性能分布式检索系统的设计与实现[D].北京:北京大学,2002.
[12] SIDRA:A Flexible Web Search System[R/OL].[2008-07-17]. http://www.di.fc.ul.pt/tech-reports/04-17.pdf.
[13] Gomes D.Web Modelling for Web Warehouse Design[J/OL].[2008-08-17]. http://xldb.fc.ul.pt/daniel/docs/papers/webModellingWebWarehouse.pdf.
[14] Search Engines and Web Dynamics[J/OL]. [2008-07-27].http://www.idi.ntnu.no/~algkon/generelt/se-dynamicweb1.pdf.
[15] Sverre Bang. The Nordic Web Archive[OB/OL].[2008-08-05]. http://www.deflink.dk/upload/doc_filer/doc_alle/1023_SBA.ppt.
[16] Gillian Cantello, Stegenga J. Government Web Content in Canada a National Library Web Archive Perspective [J/OL].[2008-08-10]. Government Web Content in Canada a National Library Web Archive Perspective.
[17] Li X M,Zhu J J.Some Characteristics of Web Data and their Reflection on our Society: an Empirical Approach[R/OL].[2008-09-07]. http://www.law.gmu.edu/nctl/stpp/us_japan_pubs/internet_IICIID.pdf.

[1] Xiong Xin,Wang Hao,Zhang Haichao,Zhang Baolong. Impacts of Chinese Term Granularity on Measuring Term Discriminative Capacity[J]. 数据分析与知识发现, 2020, 4(2/3): 143-152.
[2] Hu Jiying,Wu Zhenxin,Xie Jing,Zhang Zhixiong. A Full-text Indexing System for WARC Files[J]. 现代图书情报技术, 2016, 32(5): 91-98.
[3] Wu Zhenxin, Zhang Zhixiong, Xie Jing, Hu Jiying. Developing Web Archive System of International Institutions Based on IIPC Open Source Software[J]. 现代图书情报技术, 2015, 31(4): 1-9.
[4] Yang Rui, Tang Yijie, Liu Yi, Li Wei. Comprehensive Evaluation of the Ontology Building System in the Web Environment[J]. 现代图书情报技术, 2012, 28(1): 13-18.
[5] Cheng Ying. Relevance Criteria Oriented Academic Information Retrieval System Success Model Construction[J]. 现代图书情报技术, 2011, 27(9): 46-53.
[6] Xian Guojian, Zhao Ruixue. Research and Implementation of Chinese Agricultural Journals’ Abstracts Retrieval System Based on Solr[J]. 现代图书情报技术, 2011, 27(6): 51-58.
[7] Cheng Ying. Empirical Analysis on Relevance Criteria Oriented Academic Information Retrieval System Success Model[J]. 现代图书情报技术, 2011, 27(10): 45-53.
[8] Liu Lan,Wu Zhenxin,Xiang Jing,Sun Zhiru. Review of Open Source Software in Web Archive[J]. 现代图书情报技术, 2009, 25(5): 11-17.
[9] Sun Zhiru,Wu Zhenxin,Qu Yupeng. Analysis of Index Strategies in Web Archive[J]. 现代图书情报技术, 2009, 25(4): 14-18.
[10] Yao Fei,Chen Wu,Zhao Yang. Architecture Design and Implementation of English Website of Tsinghua University Library[J]. 现代图书情报技术, 2009, 3(3): 91-95.
[11] Hu Xiaoqing,Zhang Jianyong. Usability Evaluation Indicators and Empirical Study of Database Retrieval System[J]. 现代图书情报技术, 2009, 3(2): 46-50.
[12] Wu Dan. Design and Implementation of an English-Chinese Interactive Cross-Language Information Retrieval System[J]. 现代图书情报技术, 2009, 3(2): 89-95.
[13] Shen Jinzhi,Kou Wenbo,Tian Chengeng. Web Archive Content Extracted on Feature Orienting and Boarder Forecasting[J]. 现代图书情报技术, 2009, 25(12): 52-56.
[14] Liu Lan,Wu Zhenxin,Zhang Zhixiong,Xu Lin. Study on Harvest Strategy in Web Archive[J]. 现代图书情报技术, 2009, 3(1): 10-15.
[15] Lin Ying,Wu Zhenxin,Zhang Zhixiong. An Analysis of Web Information Archiving Strategies[J]. 现代图书情报技术, 2009, 3(1): 16-21.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn