An Analysis of the Application of Web Archive Resources Based on Data Mining
Wu Zhenxin1 Zhang Zhixiong1 Sun Zhiru1,2
1(National Science Library, Chinese Academy of SciencesBeijing 100190,China) 2(Graduate University of Chinese Academy of Sciences,Beijing 100049,China)
This article introduced current applications of web archive resources, and then from the perspective of data mining, analyzes and sums up the in-depth applications of web archive resources.
吴振新,张智雄,孙志茹. 基于数据挖掘的Web Archive资源应用分析*[J]. 现代图书情报技术, 2009, 3(1): 28-33.
Wu Zhenxin,Zhang Zhixiong,Sun Zhiru. An Analysis of the Application of Web Archive Resources Based on Data Mining. New Technology of Library and Information Service, 2009, 3(1): 28-33.
[1] Internet Archive[EB/OL]. [2007-12-16].http://www.archive.org/index.php.
[2] 中国Web信息博物馆[EB/OL].[2008-11-2].http://www.infomall.cn/.
[3] Wayback Machine[EB/OL].[2008-11-02].http://www.archive.org/index.php.
[4] Report on the 8th International Workshop on Web Archiving[R/OL].[2008-11-02].http://www.dlib.org/dlib/november08/rauber/11rauber.html.
[5] WEAR[EB/OL].[2008-11-02].http://archive-access.sourceforge.net/projects/wera/.
[6] XTF[EB/OL]. [2008-11-02].http://www.cdlib.org/inside/projects/xtf/.
[7] Xinq[EB/OL].[2008-11-02].http://www.nla.gov.au/xinq/.
[8] Warrick[EB/OL]. [2008-05-28]. http://warrick.cs.odu.edu/.
[9] Lazy Preservation: Reconstructing Websites by Crawling the Crawlers[EB/OL]. [2008-11-02].http://www.cs.odu.edu/~fmccown/pubs/lazyp-widm06.pdf.
[10] WebContinuity[EB/OL]. [2008-11-08].http://www.nationalarchives.gov.uk/webcontinuity/.
[11] 阎宏飞.可扩展Web信息搜集系统的设计、实现与应用初探[D].北京:北京大学,2002.
[12] Rauber A,Aschenbrenner A, Witvoet O. Austrian Online Archive Processing: Analyzing Archives of the World Wide Web[J].Research and Advanced Technology for Digital Libraries: 6th European Conference, ECDL, 2002:16-31.
[13] Rauber A, Aschenbrenner A, Witvoet O,et al. Uncovering Information Hidden in Web Archives[J].D-Lib Magazine, 2002,8(12):1082-9873.
[14] William Y A , Aya S, Dmitriev P,et al. Building a Research Library for the History of the Web[J].Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries, 2006:95-102.
[15] William Y A , Aya S, Dmitriev P, et al. A Research Library Based on the Historical Collections of the Internet Archive[J].D-Lib Magazine, 2006,12(2):1082-9873.
[16] Kitsuregawa M, Tamura T, Toyoda M,et al.Socio-Sense: A System for Analysing the Societal Behavior from Long Term Web Archive[M]. Progress in WWW Research and Development.Heidelberg :Springer Berlin ,2008.
[17] 让社会科学插上信息技术的翅膀[EB/OL]. [2008-11-02].http://cess.grids.cn/ourpdfs/Let%20social%20science%20ride% 20on%20IT%20bullet%20train.pdf.