Please wait a minute...
New Technology of Library and Information Service  2009, Vol. 3 Issue (1): 16-21    DOI: 10.11925/infotech.1003-3513.2009.01.04
article Current Issue | Archive | Adv Search |
An Analysis of Web Information Archiving Strategies
Lin YingWu ZhenxinZhang Zhixiong2
1 ( BNU Research Center for Digital Library Technology, Beijing 100875,China)
2 (National Science Library,Chinese Academy of Sciences, Beijing 100190,China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

Archiving strategy as the important research area of Web Archive, has been concerned by many projects, this paper selects several typical strategies: compressed archiving with external index, archiving with multi-master, format migration archiving, and characteristic described archiving , and analyzes their preservation context, characteristics ,as well as the application of the strategy to achieve, and provides valuable reference for Web Archive research in China.

Key wordsWeb Archive      Digital preservation      Archiving strategy     
Received: 19 September 2008      Published: 25 January 2009
: 

G250.76

 
Corresponding Authors: Wu Zhenxin     E-mail: wuzx@mail.las.ac.cn
About author:: Lin Ying,Wu Zhenxin,Zhang Zhixiong

Cite this article:

Lin Ying,Wu Zhenxin,Zhang Zhixiong. An Analysis of Web Information Archiving Strategies. New Technology of Library and Information Service, 2009, 3(1): 16-21.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2009.01.04     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2009/V3/I1/16

[1] Netpreserve.org (International Internet Preservation Consortium)[EB/OL]. [2008-07-09]. http://www.netpreserve.org.
[2] PANDORA(Australia’s Web Archive) [EB/OL]. [2008-07-09].  http://pandora.nla.gov.au/about.htmlhttp://www.nla.gov.au/nla/staffpaper/2004/koerbin2.html.
[3]  Large Scale Data Repository: Petabox[EB/OL]. [2008-07-09]. http://www.archive.org/web/petabox.php.
[4] SDSC Chronopolis[EB/OL]. [2008-07-09]. http://chronopolis.sdsc.edu/.
[5] Internet Archive[EB/OL]. [2008-07-09]. http://www.archive.org/about/about.php.
[6] Large Scale Data Repository: Petabox[EB/OL]. [2008-07-09]. http://www.archive.org/web/petabox.php.
[7] Burner M, Kahle B. WWW Archive File Format Specification[EB/OL]. [2008-07-09]. http://pages.alexa.com/company/arcformat.html.
[8] Library of Congress. Data Center for Library of Congress Digital Holdings: A Pilot Project[R/OL]. [2008-07-09].  http://chronopolis.sdsc.edu/assets/docs/SDSC_LC_data-storage_report_2.pdf.
[9] PANDORA(Australia’s Web Archive) [EB/OL]. [2008-07-09].  http://pandora.nla.gov.au/about.html.
[10] Koerbin P. The PANDORA Digital Archiving System (PANDAS): Managing Web Archiving in Australia: A Case Study[EB/OL]. [2008-07-09]. http://www.nla.gov.au/nla/staffpaper/2004/koerbin2.html.
[11] Nordic Web Archive Introduction[R/OL]. [2008-07-09].  http://www.lib.helsinki.fi/tietolinja/0100/nwa.pdf.
[12] Hallgrímsson P,Bang S. Nordic Web Archive. 3rd ECDL Workshop on Web Archives[R/OL]. [2008-07-09]. http://bibnum.bnf.fr/ecdl/2003/proceedings.php?f=hallgrimsson.
[13] Electronic Records Archives (ERA) [EB/OL]. [2008-07-09]. http://www.archives.gov/era/.
[14] National Archive and Records Administration.Electronic Records Archives Program Management Office. Electronic Records Archives: Introduction to Preservation and Access Levels Concepts[R/OL]. [2008-07-09]. http://www.archives.gov/era/pdf/preservation-and-access-levels.pdf.
[15] Lake D. SAA Presentation[R/OL]. [2008-07-09]. http://www.archives.gov/era/pdf/2006-saa-lake.pdf.

[1] Hu Jiying,Wu Zhenxin,Xie Jing,Zhang Zhixiong. A Full-text Indexing System for WARC Files[J]. 现代图书情报技术, 2016, 32(5): 91-98.
[2] Wu Zhenxin, Zhang Zhixiong, Xie Jing, Hu Jiying. Developing Web Archive System of International Institutions Based on IIPC Open Source Software[J]. 现代图书情报技术, 2015, 31(4): 1-9.
[3] Wu Zhenxin, Wang Yuju, Fu Honghu, Li Chunwang, Liu Jianhua. Constructing a Trusted Ingest Workflow of Digital Preservation System[J]. 现代图书情报技术, 2015, 31(3): 1-7.
[4] Wu Zhenxin. Research on Fixity of Digital Object in Digital Preservation[J]. 现代图书情报技术, 2014, 30(11): 1-9.
[5] Zhang Zhixiong,Wu Zhenxin,Liu Jianhua,Guo Hongmei. Analysis of the Difference Between Digital Curation and Digital Preservation[J]. 现代图书情报技术, 2014, 30(1): 4-13.
[6] Ma Ningning, Li Chao, Qu Yunpeng. Design and Implementation of an Automatic Obsolescence Management System for Digital Preservation[J]. 现代图书情报技术, 2013, (4): 69-76.
[7] Gao JianXiu Wu Zhenxin Sun Shuo. Research on the Application of Cloud Storage in Digital Preservation[J]. 现代图书情报技术, 2010, 26(6): 1-6.
[8] Liu Lan,Wu Zhenxin,Xiang Jing,Sun Zhiru. Review of Open Source Software in Web Archive[J]. 现代图书情报技术, 2009, 25(5): 11-17.
[9] Sun Zhiru,Wu Zhenxin,Qu Yupeng. Analysis of Index Strategies in Web Archive[J]. 现代图书情报技术, 2009, 25(4): 14-18.
[10] Shen Jinzhi,Kou Wenbo,Tian Chengeng. Web Archive Content Extracted on Feature Orienting and Boarder Forecasting[J]. 现代图书情报技术, 2009, 25(12): 52-56.
[11] Wu Zhenxin,Yao Fei,Gao Jianxiu,Sun Minjie. A Comprehensive Review of 2009 International Conference on Preservation of Digital Objects——Moving into the Mainstream, Enabling Our Digital Future[J]. 现代图书情报技术, 2009, (10): 1-6.
[12] Wu Zhenxin,Xiang Jing. Analysis of Retrieval System Architecture in Web Archive[J]. 现代图书情报技术, 2009, 3(1): 22-27.
[13] Liu Lan,Wu Zhenxin,Zhang Zhixiong,Xu Lin. Study on Harvest Strategy in Web Archive[J]. 现代图书情报技术, 2009, 3(1): 10-15.
[14] Wu Zhenxin,Zhang Zhixiong,Sun Zhiru. An Analysis of the Application of Web Archive Resources Based on Data Mining[J]. 现代图书情报技术, 2009, 3(1): 28-33.
[15] Shen Yulan,Zhang Aixia. Standard System Frame of Long-term Digital Preservation Systems[J]. 现代图书情报技术, 2008, 24(4): 1-6.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn