Please wait a minute...
New Technology of Library and Information Service  2009, Vol. 3 Issue (1): 28-33    DOI: 10.11925/infotech.1003-3513.2009.01.06
article Current Issue | Archive | Adv Search |
An Analysis of the Application of Web Archive Resources Based on Data Mining
Wu ZhenxinZhang ZhixiongSun Zhiru1,2
1(National Science Library, Chinese Academy of SciencesBeijing 100190,China)
2(Graduate University of Chinese Academy of Sciences,Beijing 100049,China)
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  

 This article introduced current applications of web archive resources, and then from the perspective of data mining, analyzes and sums up the in-depth applications of web archive resources.

Key wordsWeb archive      Application analysis      Data mining     
Received: 22 December 2008      Published: 25 January 2009
: 

 

 
  G350

 
Corresponding Authors: Wu Zhenxin     E-mail: wuzx@mail.las.ac.cn
About author:: Wu Zhenxin,Zhang Zhixiong,Sun Zhiru

Cite this article:

Wu Zhenxin,Zhang Zhixiong,Sun Zhiru. An Analysis of the Application of Web Archive Resources Based on Data Mining. New Technology of Library and Information Service, 2009, 3(1): 28-33.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2009.01.06     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2009/V3/I1/28

[1] Internet Archive[EB/OL]. [2007-12-16].http://www.archive.org/index.php.
[2] 中国Web信息博物馆[EB/OL].[2008-11-2].http://www.infomall.cn/.
[3] Wayback Machine[EB/OL].[2008-11-02].http://www.archive.org/index.php.
[4] Report on the 8th International Workshop on Web Archiving[R/OL].[2008-11-02].http://www.dlib.org/dlib/november08/rauber/11rauber.html.
[5] WEAR[EB/OL].[2008-11-02].http://archive-access.sourceforge.net/projects/wera/.
[6] XTF[EB/OL]. [2008-11-02].http://www.cdlib.org/inside/projects/xtf/.
[7] Xinq[EB/OL].[2008-11-02].http://www.nla.gov.au/xinq/.
[8] Warrick[EB/OL]. [2008-05-28]. http://warrick.cs.odu.edu/.
[9] Lazy Preservation: Reconstructing Websites by Crawling the Crawlers[EB/OL]. [2008-11-02].http://www.cs.odu.edu/~fmccown/pubs/lazyp-widm06.pdf.
[10] WebContinuity[EB/OL]. [2008-11-08].http://www.nationalarchives.gov.uk/webcontinuity/.
[11] 阎宏飞.可扩展Web信息搜集系统的设计、实现与应用初探[D].北京:北京大学,2002.
[12] Rauber A,Aschenbrenner A, Witvoet O. Austrian Online Archive Processing: Analyzing Archives of the World Wide Web[J].Research and Advanced Technology for Digital Libraries: 6th European Conference, ECDL, 2002:16-31.
[13] Rauber A, Aschenbrenner A, Witvoet O,et al. Uncovering Information Hidden in Web Archives[J].D-Lib Magazine, 2002,8(12):1082-9873.
[14] William Y A , Aya S, Dmitriev P,et al. Building a Research Library for the History of the Web[J].Proceedings of the 6th ACM/IEEE-CS Joint Conference on Digital Libraries, 2006:95-102.
[15] William Y A , Aya S, Dmitriev P, et al. A Research Library Based on the Historical Collections of the Internet Archive[J].D-Lib Magazine, 2006,12(2):1082-9873.
[16] Kitsuregawa M, Tamura T, Toyoda M,et al.Socio-Sense: A System for Analysing the Societal Behavior from Long Term Web Archive[M]. Progress in WWW Research and Development.Heidelberg :Springer Berlin ,2008.
[17] 让社会科学插上信息技术的翅膀[EB/OL]. [2008-11-02].http://cess.grids.cn/ourpdfs/Let%20social%20science%20ride% 20on%20IT%20bullet%20train.pdf.

[1] Xie Wang, Wang Lizhen, Chen Hongmei, Zeng Lanqing. Identifying Relationship Between Pollution Sources and Cancer Cases with Spatial Ordered Pair Patterns[J]. 数据分析与知识发现, 2021, 5(2): 14-31.
[2] Yong Zhang,Shuqing Li,Yongshang Cheng. Mining Algorithm for Weighted Association Rules Based on Frequency Effective Length[J]. 数据分析与知识发现, 2019, 3(7): 85-93.
[3] Quan Lu,Anqi Zhu,Jiyue Zhang,Jing Chen. Research on User Information Requirement in Chinese Network Health Community: Taking Tumor-forum Data of Qiuyi as an Example[J]. 数据分析与知识发现, 2019, 3(4): 22-32.
[4] Dongmei Mu,Hui Fa,Ping Wang,Jing Sun. Research on Disease Risk Factors on Structural Equation Model[J]. 数据分析与知识发现, 2019, 3(4): 80-89.
[5] Li Yongnan. Using Bayes Theory to Classify Counter Terrorism Intelligence[J]. 数据分析与知识发现, 2018, 2(10): 9-14.
[6] Mu Dongmei,Wang Ping,Zhao Danning. Reducing Data Dimension of Electronic Medical Records: An Empirical Study[J]. 数据分析与知识发现, 2018, 2(1): 88-98.
[7] Hu Zhongyi,Wang Chaoqun,Wu Jiang. Identifying Phishing Websites with Multiple Online Data Sources[J]. 数据分析与知识发现, 2017, 1(6): 47-55.
[8] Jiang Siwei,Xie Zhenping,Chen Meijie,Cai Ming. Self-Explainable Reduction Method for Mixed Feature Data Modeling[J]. 数据分析与知识发现, 2017, 1(12): 92-100.
[9] Mu Dongmei,Ren Ke. Discovering Knowledge from Electronic Medical Records with Three Data Mining Algorithms[J]. 现代图书情报技术, 2016, 32(6): 102-109.
[10] Hu Jiying,Wu Zhenxin,Xie Jing,Zhang Zhixiong. A Full-text Indexing System for WARC Files[J]. 现代图书情报技术, 2016, 32(5): 91-98.
[11] Li Feng,Li Shu’ning,Yu Jing. A Department Oriented Library Usage Data System for Graduates[J]. 现代图书情报技术, 2016, 32(5): 99-103.
[12] Zhao Jingxian. Detect of Internet Fake Public Opinion Based on Decision Tree[J]. 现代图书情报技术, 2015, 31(6): 78-84.
[13] He Jianmin, Wang Zhe. The Pedigree Method to Mine Influential Clusters of Topic Information in Social Network[J]. 现代图书情报技术, 2015, 31(5): 65-72.
[14] Huang Wenbin, Xu Shanchuan, Ma Long, Wang Jun. Analysis of Mobile User Behaviors with Telecommunication Data[J]. 现代图书情报技术, 2015, 31(5): 80-87.
[15] Wu Zhenxin, Zhang Zhixiong, Xie Jing, Hu Jiying. Developing Web Archive System of International Institutions Based on IIPC Open Source Software[J]. 现代图书情报技术, 2015, 31(4): 1-9.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn