New Technology of Library and Information Service  2007, Vol. 2 Issue (11): 45-48    DOI: 10.11925/infotech.1003-3513.2007.11.09
Design and Realization of Weblog Gathering System Based on RSS
Liu Li (1,2), Xiao Shibin (1,2), Wang Tao (1,2), Shi Shuicai (1,2)
1. Chinese Information Processing Research Center, Beijing Information Science and Technology University, Beijing 100101, China
2. Beijing TRS Information Technology Ltd, Beijing 100101, China

This paper addresses how to crawl Weblogs effectively within selected sections of the Web and proposes an RSS-based algorithm for Weblog gathering. The authors design two crawlers: one gathers RSS feeds by performing a breadth-first traversal of the Web, and the other automatically tracks updated Weblogs by performing a vertical search of every RSS feed. A prototype system is also implemented.
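
The abstract does not give the algorithm's details, so the following is only a minimal sketch of the two-crawler idea under stated assumptions: the Web is stood in for by an in-memory link graph, a URL counts as a feed if it ends in `rss.xml`, and feeds are RSS 2.0 documents whose items carry a `<link>`. The names `discover_feeds`, `new_items`, `TOY_WEB`, and `SAMPLE_RSS` are hypothetical and do not come from the paper.

```python
from collections import deque
import xml.etree.ElementTree as ET


def discover_feeds(seed_urls, fetch_links, is_feed, max_pages=100):
    """Crawler 1 (sketch): breadth-first traversal that collects feed URLs.

    fetch_links(url) returns the outgoing links of a page; is_feed(url)
    decides whether a link points at an RSS feed. Both are injected so the
    sketch runs without network access.
    """
    queue = deque(seed_urls)
    visited = set(seed_urls)
    feeds = []
    pages = 0
    while queue and pages < max_pages:
        url = queue.popleft()          # FIFO order gives breadth-first traversal
        pages += 1
        for link in fetch_links(url):
            if link in visited:
                continue
            visited.add(link)
            if is_feed(link):
                feeds.append(link)     # hand over to the second crawler
            else:
                queue.append(link)     # ordinary page: expand it later
    return feeds


def new_items(rss_xml, seen_links):
    """Crawler 2 (sketch): return (title, link) for items not seen before.

    seen_links is updated in place, so calling this again on an unchanged
    feed yields nothing new.
    """
    root = ET.fromstring(rss_xml)
    fresh = []
    for item in root.iter("item"):
        link = (item.findtext("link") or "").strip()
        if link and link not in seen_links:
            seen_links.add(link)
            title = (item.findtext("title") or "").strip()
            fresh.append((title, link))
    return fresh


# Toy data (assumption): a three-page site with two blog feeds.
TOY_WEB = {
    "http://example.org/": ["http://example.org/a", "http://example.org/b"],
    "http://example.org/a": ["http://example.org/a/rss.xml", "http://example.org/"],
    "http://example.org/b": ["http://example.org/b/rss.xml"],
}

SAMPLE_RSS = """<rss version="2.0"><channel><title>Blog A</title>
<item><title>First post</title><link>http://example.org/a/1</link></item>
<item><title>Second post</title><link>http://example.org/a/2</link></item>
</channel></rss>"""

feeds = discover_feeds(
    ["http://example.org/"],
    lambda url: TOY_WEB.get(url, []),
    lambda url: url.endswith("rss.xml"),
)
seen = {"http://example.org/a/1"}      # the first post was gathered earlier
fresh = new_items(SAMPLE_RSS, seen)
```

In this sketch the first crawler only has to find each feed once; afterwards the second crawler can poll the feed and compare item links against what it has already stored, which is cheaper than re-crawling the blog's pages.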

Key words: RSS; Weblog; Information gathering
Received: 14 September 2007      Published: 25 November 2007


Corresponding author: Liu Li
About the authors: Liu Li, Xiao Shibin, Wang Tao, Shi Shuicai

Cite this article:

Liu Li, Xiao Shibin, Wang Tao, Shi Shuicai. Design and Realization of Weblog Gathering System Based on RSS. New Technology of Library and Information Service, 2007, 2(11): 45-48.

