Please wait a minute...
New Technology of Library and Information Service  2009, Vol. 3 Issue (3): 38-45    DOI: 10.11925/infotech.1003-3513.2009.03.07
Current Issue | Archive | Adv Search |
Semantic Relation Extraction from Socially-generated Tags:A Methodology for Metadata Generation
Miao Chen  Xiaozhong Liu  Jian Qin
(Syracuse University, USA)
Download: PDF(538 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  

The growing predominance of social semantics in the form of tagging presents the metadata community with both opportunities and challenges as for leveraging this new form of information content representation and for retrieval. One key challenge is the absence of contextual information associated with these tags. This paper presents an experiment working with Flickr tags as an example of utilizing social semantics sources for enriching subject metadata. The procedure included four steps:1) Collecting a sample of Flickr tags, 2) Calculating cooccurrences between tags through mutual information, 3) Tracing contextual information of tag pairs via Google search results,4) Applying natural language processing and machine learning techniques to extract semantic relations between tags. The experiment helped us to build a context sentence collection from the Google search results, which was then processed by natural language processing and machine learning algorithms. This new approach achieved a reasonably good rate of accuracy in assigning semantic relations to tag pairs. This paper also explores the implications of this approach for using social semantics to enrich subject metadata.

Key wordsRelation extraction      Tags      Search engine      Social semantics      Metadata     
Received: 09 February 2009      Published: 25 March 2009
: 

G250

 
Corresponding Authors: Miao Chen     E-mail: mchen14@syr.edu
About author:: Miao Chen,Xiaozhong Liu,Jian Qin