|
|
Targeted Websites Distributed and Precise Harvest System for Network Monitoring Technology |
Xie Jing, Qu Yunpeng, Liu Jianhua |
National Science Library, Chinese Academy of Sciences, Beijing 100190,China |
|
|
Abstract By analyzing the existing open-source framework collection system, an accurate acquistition system is designed and developed based on Crawler4j. So the system can meet the real-time monitoring of collection of resources and accuracy requirements. And the paper introduces the design and implementation of the system.
|
Received: 05 May 2011
Published: 09 October 2011
|
|
[1] Nutch.http://wiki.apache.org/nutch.[2] Heritrix.http://crawler.archive.org/.[3] Open Source Web Crawler for Java.http://code.google.com/p/crawler4j/.[4] Trail:RMI.http://download.oracle.com/javase/tutorial/rmi/index.html.[5] Cobra: Java HTML Renderer & Parser.http://lobobrowser.org/cobra.jsp.[6] Regular Expression.http://en.wikipedia.org/wiki/Regular_expression. |
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|