Research on Ontology-based Automatic Annotation for Deep Web
Zhang Yulian 1, Li Shuai 1, Zhou Xinglin 2
1 (College of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China)
2 (Department of Computer Science, Shanghai Technical Institute of Electronics & Information, Shanghai 201411, China)
Abstract This paper proposes an annotation method based on the visual information of Web pages, combined with an annotation method based on query interface patterns. It then replaces the original label information with ontology terms, which keeps the annotations consistent. The method compensates for the shortcomings of the original approach and effectively improves precision and recall.
Received: 20 July 2009
Published: 25 September 2009
Corresponding Author:
Li Shuai
E-mail: blue_ice_sea@163.com
About authors: Zhang Yulian, Li Shuai, Zhou Xinglin