New Technology of Library and Information Service  2004, Vol. 20 Issue (6): 1-5     https://doi.org/10.11925/infotech.1003-3513.2004.06.01
Digital Library
Information Extraction and Its Functions in the Digital Library
Zhang Zhixiong
(Library of Chinese Academy of Sciences, Beijing 100080, China)
Abstract

Information extraction (IE) is the automatic extraction of pre-specified kinds of information (knowledge) from natural language text. It offers one way to pull the information relevant to a user out of a vast accumulation of text. This paper analyzes the basic concepts of information extraction, the main research activities in the field, the types of information extraction, and the general architecture of an information extraction system. The author argues that, in building digital libraries, information extraction can play an important role in automatic indexing and annotation of digital content, automatic acquisition of metadata, data mining, information analysis, construction of large knowledge bases and numeric databases from free text, and answer generation in digital reference services.

Keywords: Information Extraction (IE); Message Understanding Conference (MUC); Digital library; Natural language processing (NLP)
Received: 2004-03-08      Published: 2004-06-25
Chinese Library Classification (ZTFLH): G250.76
Corresponding author: Zhang Zhixiong     E-mail: zhangzhx@mail.las.ac.cn
About the author: Zhang Zhixiong
Cite this article:
Zhang Zhixiong. Information Extraction and Its Functions in the Digital Library [J]. New Technology of Library and Information Service, 2004, 20(6): 1-5.
Link to this article:
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2004.06.01      or      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2004/V20/I6/1

