New Technology of Library and Information Service, 2010, Vol. 26, Issue 3: 52-57. DOI: 10.11925/infotech.1003-3513.2010.03.09
Overview of Research on Data Collection from Ajax Sites
Xia Tian
(School of Information Resource Management, Renmin University of China, Beijing 100872, China)
This paper reviews recent advances in data collection from Ajax sites in five areas: identification of Ajax link elements, page state identification, controllable page state transitions, content extraction, and duplicate state detection. It summarizes the overall processing flow and the supporting technologies, and discusses emerging research trends. The study is intended to support further research on Ajax data collection.

Key words: Data collection; Ajax crawler; HTML renderer; Web 2.0
Received: 06 March 2010; Published: 25 March 2010


