Research of Title Party News Identification Technology Based on Topic Sentence Similarity
Wang Zhichao1, Weng Nan2, Wang Yu3
1. Institute of Information Science & Technology, Shanghai Jiaotong University, Shanghai 200240, China; 2. School of Management & Engineering, Nanjing University, Nanjing 210093, China; 3. School of Management, Dalian University of Technology, Dalian 116024, China
Abstract:Concerning the issues of the more and more title party news in the Web,this paper presents a new algorithm of title party news identification. Firstly, it analyzes the composition of the news page, then puts forward an approach of news title extraction and information extraction based on the features of news page. Secondly, considering the problem of extracting coherent topic sentences from news pages, starting with the relationship matrix of sentences, it puts forward an algorithm of topic sentence extraction. Then, according to the extracted news title and the candidate set of topic sentences, it can compute the similarity value, which is the main basis for judging the title party. Finally, the experiment results show that this method is effective and feasible.
王志超, 翁楠, 王宇. 基于主题句相似度的标题党新闻鉴别技术研究[J]. 现代图书情报技术, 2011, (11): 48-53.
Wang Zhichao, Weng Nan, Wang Yu. Research of Title Party News Identification Technology Based on Topic Sentence Similarity. New Technology of Library and Information Service, 2011, (11): 48-53.