New Technology of Library and Information Service  2009, Vol. 3 Issue (1): 10-15     https://doi.org/10.11925/infotech.1003-3513.2009.01.03
Special Topic
Study on Harvest Strategy in Web Archive*
Liu Lan1,2, Wu Zhenxin, Zhang Zhixiong, Xu Lin3
1 (National Science Library, Chinese Academy of Sciences, Beijing 100190, China)
2 (Graduate University of Chinese Academy of Sciences, Beijing 100049, China)
3 (Library of Southwest Jiaotong University, Chengdu 610031, China)
Abstract

This paper summarizes the three harvest strategies commonly used in Web Archive projects internationally: integrity (complete) harvest, selective harvest, and hybrid harvest. It then compares the characteristics, key issues, and representative projects of each strategy. Finally, it analyzes the key factors to consider when choosing a harvest strategy and offers general recommendations.

Key words: Web Archive; Harvest strategy; Integrity harvest; Selective harvest; Hybrid harvest
Received: 2008-09-24      Published: 2009-01-25
CLC number: G250.76

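Funding:

*This paper is one of the research outcomes of the National Social Science Fund of China project "Research on the Theory and Methods of Network Information Resource Preservation" (Project No. 06BTQ025).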

Corresponding author: Liu Lan     E-mail: liulan@mail.las.ac.cn
About the authors: Liu Lan, Wu Zhenxin, Zhang Zhixiong, Xu Lin
Cite this article:
Liu Lan, Wu Zhenxin, Zhang Zhixiong, Xu Lin. Study on Harvest Strategy in Web Archive*[J]. New Technology of Library and Information Service, 2009, 3(1): 10-15.
Link to this article:
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2009.01.03      or      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2009/V3/I1/10


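Copyright © 2015 Editorial Office of Data Analysis and Knowledge Discovery
Address: No. 33 Beisihuan Xilu, Zhongguancun, Haidian District, Beijing   Postal code: 100190
Tel/Fax: (010)82626611-6626, 82624938
E-mail: jishu@mail.las.ac.cn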