New Technology of Library and Information Service  2011, Vol. 27 Issue (6): 27-31    DOI: 10.11925/infotech.1003-3513.2011.06.05
Automatic Identify Title of Web Text Resource Based on Rules
Liu Jianhua1, Zhang Zhixiong1, Xie Jing1, Zou Yimin1,2
1. National Science Library, Chinese Academy of Sciences, Beijing 100190, China;
2. Graduate University of Chinese Academy of Sciences, Beijing 100049, China
Abstract  Because the titles of Web resources play an important role in information retrieval, text clustering, and similar tasks, this paper proposes a rule-based method to identify titles automatically and quickly. Like many earlier approaches, the method uses the style information (such as font) and the location information of the text. In addition, it considers the relevance between the title candidates and the text content. Finally, the paper implements a title identification component and reports experiments that show the effectiveness of the method.
Key words: Web text resources; Title identification; Title source; Title feature
Received: 05 May 2011      Published: 15 August 2011
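
The abstract names three kinds of evidence: style, location, and relevance between a title candidate and the text content. The following is a minimal sketch of how such evidence could be combined, not the authors' actual rules. The `Candidate` fields, the word-overlap relevance measure, the bold bonus of 0.5, and the unweighted sum are all assumptions made for illustration, and font sizes are assumed to be positive.

```python
import re
from dataclasses import dataclass


@dataclass
class Candidate:
    """A hypothetical title candidate extracted from a Web page."""
    text: str         # candidate title string
    font_size: float  # rendered font size, assumed positive (style feature)
    bold: bool        # whether the text is bold (style feature)
    position: int     # index of the text block on the page, 0 = top (location feature)


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def relevance(candidate_text, body):
    """Fraction of the candidate's words that also occur in the body text."""
    words = tokenize(candidate_text)
    if not words:
        return 0.0
    return len(words & tokenize(body)) / len(words)


def score(cand, body, max_font, n_blocks):
    """Sum the three kinds of evidence with equal (assumed) weights."""
    style = cand.font_size / max_font + (0.5 if cand.bold else 0.0)
    location = 1.0 - cand.position / n_blocks  # blocks nearer the top score higher
    return style + location + relevance(cand.text, body)


def pick_title(candidates, body):
    """Return the text of the highest-scoring candidate, or None if there are none."""
    if not candidates:
        return None
    max_font = max(c.font_size for c in candidates)
    n_blocks = max(c.position for c in candidates) + 1
    best = max(candidates, key=lambda c: score(c, body, max_font, n_blocks))
    return best.text
```

In this sketch a navigation bar at the very top of a page can still lose to a larger, bold heading below it whose words recur in the body, which is the kind of case the relevance check is meant to handle.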



Liu Jianhua, Zhang Zhixiong, Xie Jing, Zou Yimin. Automatic Identify Title of Web Text Resource Based on Rules. New Technology of Library and Information Service, 2011, 27(6): 27-31.

