New Technology of Library and Information Service, 2007, Vol. 2, Issue (6): 38-41. DOI: 10.11925/infotech.1003-3513.2007.06.09
Information Extraction Based on Calculation of Sentence Similarity
Lian Zhanjun, Lv Xueqiang, Zhang Yujie 2, Shi Shuicai 1
1 (Chinese Information Processing Research Center, Beijing Information Science and Technology University, Beijing 100101, China)
2 (College of Information Science and Engineering, Dalian Polytechnic University, Dalian 116011, China)
Abstract  

This paper presents a new method of information extraction based on sentence similarity calculation. The topic of each sentence in the test text is labeled by computing its similarity to other sentences. Accuracy is improved by taking into account the probability distribution of sentences within the documents. Using personal information resources from the Internet, the paper reports statistical results.
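
The abstract does not specify the similarity measure or how the probability distribution enters the decision, so the following is only a minimal sketch under stated assumptions: sentences are already tokenized, similarity is bag-of-words cosine similarity, and the "probability distribution" is modeled as a prior over topics given a sentence's position in the document, blended linearly with the similarity score. The names `cosine_similarity`, `label_topic`, `position_prior`, and `weight` are hypothetical and do not come from the paper.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two token lists (bag-of-words counts)."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[t] * cb[t] for t in ca)
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def label_topic(tokens, position, labeled, position_prior, weight=0.5):
    """Assign a topic to a test sentence.

    labeled:        list of (tokens, topic) pairs with known topics
    position_prior: topic -> {position: probability}; an assumed stand-in
                    for the distribution of topics over document positions
    weight:         assumed blending factor between similarity and prior
    """
    # Best similarity to any labeled sentence, kept per topic.
    best_sim = {}
    for example_tokens, topic in labeled:
        sim = cosine_similarity(tokens, example_tokens)
        best_sim[topic] = max(best_sim.get(topic, 0.0), sim)

    # Blend similarity with the positional prior (assumption: linear blend).
    scores = {
        topic: (1 - weight) * sim
        + weight * position_prior.get(topic, {}).get(position, 0.0)
        for topic, sim in best_sim.items()
    }
    return max(scores, key=scores.get), scores


# Toy data, invented purely for illustration.
labeled = [
    (["born", "in", "1980"], "birth"),
    (["graduated", "from", "university"], "education"),
]
position_prior = {
    "birth": {0: 0.8, 1: 0.2},
    "education": {0: 0.2, 1: 0.8},
}

topic_a, _ = label_topic(["he", "was", "born", "in", "beijing"], 0,
                         labeled, position_prior)
topic_b, _ = label_topic(["in", "university"], 1, labeled, position_prior)
print(topic_a, topic_b)
```

In the second call the sentence is equally similar to both labeled examples, so the positional prior alone decides the topic; this is one plausible reading of how distributional information could raise accuracy, not a reconstruction of the authors' procedure.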

Key words: Information extraction; Probability distribution; Topic; Sentence similarity calculation
Received: 10 May 2007      Published: 25 June 2007
CLC number: TP391
Corresponding author: Lian Zhanjun, E-mail: dikk12345678@gmail.com
About the authors: Lian Zhanjun, Lv Xueqiang, Zhang Yujie, Shi Shuicai

Cite this article:

Lian Zhanjun, Lv Xueqiang, Zhang Yujie, Shi Shuicai. Information Extraction Based on Calculation of Sentence Similarity. New Technology of Library and Information Service, 2007, 2(6): 38-41.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2007.06.09     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2007/V2/I6/38

