%A Xia Tian %T Overview of Research on Data Collection from Ajax Sites %0 Journal Article %D 2010 %J Data Analysis and Knowledge Discovery %R 10.11925/infotech.1003-3513.2010.03.09 %P 52-57 %V 26 %N 3 %U {https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/abstract/article_3186.shtml} %8 2010-03-25 %X

This paper introduces the recent advances achieved from five aspects, which include Ajax link elements judgment, page state identification, page state controllable transformation, content extraction and duplicated states detection. The overall processing flow and the relevant supporting technologies are summarized, and the new research trends are discussed. This study will be helpful to promote the further research on Ajax data collection issues.