%A Chen Yanmei,Zhang Bin %T The Research and Realization of Technology Converting HTML to XML %0 Journal Article %D 2003 %J Data Analysis and Knowledge Discovery %R 10.11925/infotech.1003-3513.2003.05.21 %P 66-67 %V 19 %N 5 %U {https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/abstract/article_1836.shtml} %8 2003-10-25 %X

Today, people all over the world can communicate with one another through the web. Most web content is written in HTML, which can neither meet the varied requirements of Internet applications nor describe the data it carries. To address this, information from web sources needs to be accessible in a structured way. XML and its various extensions are a step in this direction. Unfortunately, the web is not yet a well-organized repository of nicely structured XML documents but rather a conglomerate of volatile HTML pages, from which structure has to be extracted. This paper presents the design and implementation of a system for converting HTML to XML.