[1] Bermark D , Lagoze C, Sbiltyakov A. Focused Crawls, Tunneling, and Digital Libraries[C].In: Proceedings of the 6th European Conferrence on Research Advanced Technology for Digital Libraries, Lecture Notes In Computer Science,2002,2458:91-106.
[2] Luo N,Zuo W L,Yuan F Y. Gray Tunneling Based on Block Relevance for Focused Crawling[EB/OL].[2007-12-30].http://www.atlantis-press.com/php/download_paper?id=1288.
[3] 封化民,刘飚,刘艳敏,等.含有位置坐标树的Web页面分析和内容提取框架[J].清华大学学报,2005,45(S):1767-1771.
[4] Lin S H, Ho J M. Discovering Informative Content Blocks from Web Documents[C]. In: Proceedings of the ACM SIGKDD Int.2002. New York: ACM Press, 2002:588-593.
[5] Kovacevic M, Diligenti M, Gori M, et al.Recognition of Common Area in a Web Page Using Visual Information: A Possible Application in a Page Classification[C]. In: Proceeding of the 10th international Conference on Artifical Intelligence:Methodology, Systems, Application. Varna:Springer,2002:203-212.
[6] 荆涛,左万利. 基于可视布局信息的网页噪音去除算法[J]. 华南理工大学学报(自然科学版),2004, 32(增刊):84-87.
[7] 王知津,贾福新,郑红军.现代信息检索[M]. 北京:机械工业出版社,2006.
[8] Srinivasan P, Menczer F, Pant G. A General Evaluation Framework for Topical Crawlers[J]. Information Retrieval, 2005,8(3):417-447.
[9] 教育信息化技术标准委员会.CELTS-31:教育资源建设技术规范[EB/OL].[2005-12-20].http:// www.edu.cn/html/keyanfz/doc20020210/13.doc. |