Please wait a minute...
New Technology of Library and Information Service  2009, Vol. 25 Issue (4): 14-18    DOI: 10.11925/infotech.1003-3513.2009.04.03
Current Issue | Archive | Adv Search |
Analysis of Index Strategies in Web Archive
Sun Zhiru1,Wu ZhenxinQu Yupeng3
1(National Science Library, Chinese Academy of Sciences, Beijing 100190,China)
2(Graduate University of the Chinese Academy Sciences, Beijing 100049,China)
3(National Library of China, Beijing 100081,China)
Download: PDF(556 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  

This article summarizes several typical index strategies through analyzing Web Archive projects with Wayback as access tool, also gives preliminary analysis for the scope of application, merits and faults of each strategy. Thus hopes to give companies of this area some reference.

Key wordsWeb Archive      Wayback      Index Strategy     
Received: 02 April 2009      Published: 25 April 2009
: 

G202

 
Corresponding Authors: Wu Zhenxin     E-mail: wuzx@mail.las.ac.cn
About author:: Sun Zhiru,Wu Zhenxin,Qu Yupeng

Cite this article:

Sun Zhiru,Wu Zhenxin,Qu Yupeng. Analysis of Index Strategies in Web Archive. New Technology of Library and Information Service, 2009, 25(4): 14-18.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2009.04.03     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2009/V25/I4/14

[1] About the Wayback Machine[EB/OL]. [2009-02-26]. http://www.Archive.org/web/web.php.
[2] Tofel B. ‘Wayback’for Accessing Web Archives[C]. In: Proceedings of the 7th International Web Archiving Workshop. Vancouver, Canada, 2007.
[3] Gomes D, Nogueira A, Miranda J, et al. Introducing the Portuguese Web Archive initiative[C]. In: Proceedings of the 8th International Web Archiving Workshop.  Aaarhus, Denmark, 2008.
[4] NutchWAX[EB/OL]. [2008-05-19]. http://Archive-access.sourceforge.net/projects/nutchwax/.
[5] Minor D,Zhu B,Moore R, Cowart C. Archiving, Indexing and Accessing Web Materials: Solutions for Large Amounts of Data[C]. In: Proceedings of the 7th International Web Archiving Workshop. Vancouver, Canada, 2007.
[6] Data Center for Library of Congress Digital Holdings: A Pilot Project[R]. Library of Congress, CACI,San Diego Supercomputer Center and UCSD Libraries, 2007.
[7] 应宏. 网格技术及其应用[J]. 计算机工程与设计, 2004, 25(10):1685-1688,1691.

[1] Hu Jiying,Wu Zhenxin,Xie Jing,Zhang Zhixiong. A Full-text Indexing System for WARC Files[J]. 现代图书情报技术, 2016, 32(5): 91-98.
[2] Wu Zhenxin, Zhang Zhixiong, Xie Jing, Hu Jiying. Developing Web Archive System of International Institutions Based on IIPC Open Source Software[J]. 现代图书情报技术, 2015, 31(4): 1-9.
[3] Liu Lan,Wu Zhenxin,Xiang Jing,Sun Zhiru. Review of Open Source Software in Web Archive[J]. 现代图书情报技术, 2009, 25(5): 11-17.
[4] Shen Jinzhi,Kou Wenbo,Tian Chengeng. Web Archive Content Extracted on Feature Orienting and Boarder Forecasting[J]. 现代图书情报技术, 2009, 25(12): 52-56.
[5] Wu Zhenxin,Xiang Jing. Analysis of Retrieval System Architecture in Web Archive[J]. 现代图书情报技术, 2009, 3(1): 22-27.
[6] Liu Lan,Wu Zhenxin,Zhang Zhixiong,Xu Lin. Study on Harvest Strategy in Web Archive[J]. 现代图书情报技术, 2009, 3(1): 10-15.
[7] Lin Ying,Wu Zhenxin,Zhang Zhixiong. An Analysis of Web Information Archiving Strategies[J]. 现代图书情报技术, 2009, 3(1): 16-21.
[8] Wu Zhenxin,Zhang Zhixiong,Sun Zhiru. An Analysis of the Application of Web Archive Resources Based on Data Mining[J]. 现代图书情报技术, 2009, 3(1): 28-33.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn