|
|
A Design of Distributed News & Weblog Search Engine Based on RSS |
Liu Feng Shi Shuicai Xiao Shibin Wang Hongwei |
(Chinese Information Processing Research Center, Beijing Information Science & Technology University, Beijing 100101,China) |
|
|
Abstract For the problem of traditional search engine can’t get completed and updated copies of the whole Web in time, especially news and Weblog site with high update frequency, this paper designes a distributed news & Weblog search engine based on RSS syndicated data. Using the pastry protocol, distributed data could be stored and transferred smoothly. This paper also compresses index file with Bloom filter. So the news and Weblog site with high update frequency could be searched in time and the cost of storage could be reduced. The system has a bright future.
|
Received: 11 October 2006
Published: 25 January 2007
|
|
Corresponding Authors:
Liu Feng
E-mail: liu.feng@trs.com.cn
|
About author:: Liu Feng,Shi Shuicai,Xiao Shibin,Wang Hongwei |
1Balakrishnan H, Kaashoek M, Karger D, Morris R, Stoica I. Looking Up Data in P2P Systems.Comm. of the ACM, February 2003
2伍玉伟. RSS:网络信息“聚合”利器.图书情报论坛,2006(1) :72-73
3于忠涛,刘兴伟.Pastry 网络模型的路由机制及改进.西华大学学报自然科学版,2006,25(1) :27-30
4Ripeanu M.Peer-to-peer Architecture Case Study:Gnutella.In Proceedings of International Conference on P2P Computing, 2001
5Bloom Filter.http://www.nist.gov/dads/HTML/bloomFilter.html(Accessed Aug.18,2006)
6池静,方启泉. Bloom filter 的研究和应用.河北建筑科技学院学报,2003,20(4) :59-61 |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|