Please wait a minute...
Advanced Search
现代图书情报技术  2014, Vol. 30 Issue (10): 42-48     https://doi.org/10.11925/infotech.1003-3513.2014.10.07
  知识组织与知识管理 本期目录 | 过刊浏览 | 高级检索 |
TimeML应用于汉语文本时间关系标注的可行性分析
李路标, 张均胜, 张寅生, 王惠临
中国科学技术信息研究所 北京 100038
On the Feasibility of Applying TimeML to the Annotation of Temporal Relations in Chinese Text
Li Lubiao, Zhang Junsheng, Zhang Yinsheng, Wang Huilin
Institute of Scientific & Technical Information of China, Beijing 100038, China
全文: PDF (439 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 

[目的] 对TimeML应用于汉语文本时间关系标注的可行性进行研究并验证。[方法] 基于TimeML标准及其主要标签, 针对汉语时间表达的特点, 探讨主要标签在汉语中的适用性。[结果] 虽然汉语语言和英语语言在语法结构和句法结构上存在差异, 但是TimeML标准在汉语上的应用是可行的。[局限] 英汉语言结构的不同导致在TimeML五个主要标签中涉及语法结构的属性在英汉对译文本之间不能完全平行实现。[结论] TimeML作为英文时间关系标记语言, 可以有效地应用于汉语文本时间关系的标注。本研究为汉语文本中事件时序推理及深层次的汉语文本时间关系解析研究奠定了基础。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
李路标
张均胜
王惠临
张寅生
关键词 TimeML汉语事件时间表达式时间关系解析    
Abstract

[Objective] This paper carries on the research and experiment on the feasibility of applying TimeML to the annotation of temporal relations in Chinese text. [Methods] According to the characteristics of Chinese temporal expressions, this paper discusses the applicability of the main labels of TimeML in Chinese text based on TimeML and its main labels. [Results] Although there are some differences between Chinese and English in the grammatical structure and syntactic structure, the application of TimeML to the Chinese language is feasible. [Limitations] The main labels of TimeML can't be completely parallel implemented to the English-Chinese text on the grammar structure because of the differences of language structure between Chinese and English. [Conclusions] TimeML, a markup language of temporal relations in English text, can be effectively applied to the annotation of temporal relations in Chinese text. The study lays the foundation for the temporal ordering inference of events and further TRR research in Chinese text.

Key wordsTimeML    Chinese    Event    Temporal expression    TRR
收稿日期: 2014-04-18      出版日期: 2014-11-28
:  G355  
基金资助:

本文系国家国际科技合作专项基金项目"面向科技文献的日汉双向实用型机器翻译合作研究"(项目编号:2014DFA11350)和中国科学技术信息研究所学科建设项目"自然语言处理"(项目编号:XK2014-6)的研究成果之一。

通讯作者: 李路标 E-mail: lilubiao2008@163.com     E-mail: lilubiao2008@163.com
作者简介: 作者贡献声明: 李路标: 设计研究方案, 起草论文及最终版修订; 张均胜: 对实验方法提出改进; 张寅生: 研究命题提出, 并确定研究对象范围; 王惠临: 研究方案可行性调查分析。
引用本文:   
李路标, 张均胜, 张寅生, 王惠临. TimeML应用于汉语文本时间关系标注的可行性分析[J]. 现代图书情报技术, 2014, 30(10): 42-48.
Li Lubiao, Zhang Junsheng, Zhang Yinsheng, Wang Huilin. On the Feasibility of Applying TimeML to the Annotation of Temporal Relations in Chinese Text. New Technology of Library and Information Service, 2014, 30(10): 42-48.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2014.10.07      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2014/V30/I10/42

[1] 陈勇. 语言学研究中的标记理论[J]. 外语研究, 2002(6): 28-32. (Chen Yong. Markedness Theory in the Study of Linguistics [J]. Foreign Languages Research, 2002(6): 28-32.)
[2] Pustejovsky J, Castano J M, Ingria R, et al. TimeML: Robust Specification of Event and Temporal Expressions in Text[C]. In: Proceedings of the 5th International Workshop on Computational Semantics. 2003.
[3] Yoshikawa K, Riedel S, Asahara M, et al. Jointly Identifying Temporal Relations with Markov Logic [C]. In: Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics and the 4th International Joint Conference on Natural Language Processing of the AFNLP. 2009: 405-413.
[4] 王凤玲. 基于条件随机域模型的英语时间表达式识别研究[J]. 电子技术, 2012(5): 8-10. (Wang Fengling. English Temporal Expression Recognition Based on Conditional Random Fields [J]. Electronic Technology, 2012(5): 8-10.)
[5] Caselli T, dell'Orletta F, Prodanof I. TETI: A TimeML Compliant TimEx Tagger for Italian[C]. In: Proceedings of the 2009 International Multiconference on Computer Science and Information Technology. 2009: 185-192.
[6] Bittar A, Amsili P, Denis P, et al. French TimeBank: An ISO-TimeML Annotated Reference Corpus [C]. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics. 2011: 130-134.
[7] Pustejovsky J, Hanks P, Sauri R, et al. The Timebank Corpus [C]. In: Proceedings of Corpus Linguistics. 2003: 647-656.
[8] Saurii R, Littman J, Knippen B, et al. TimeML[EB/OL]. [2014-03-22]. http://www.timeml.org.
[9] Mani I, Schiffman B, Zhang J. Inferring Temporal Ordering of Events in News [C]. In: Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics. 2003: 55-57.
[10] Chambers N, Wang S, Jurafsky D. Classifying Temporal Relations Between Events [C]. In: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics. 2007: 173-176.
[11] Bethard S. ClearTK-TimeML: A Minimalist Approach to TempEval 2013 [C]. In: Proceedings of the 2nd Joint Conference on Lexical and Computational Semantics. 2013: 10-14.
[12] Carpuat M. NRC: A Machine Translation Approach to Cross-Lingual Word Sense Disambiguation (SemEval-2013 Task 10)[C]. In: Proceedings of the 2nd Joint Conference on Lexical and Computational Semantics. 2013: 188-192.
[13] 李铁根. "了"、"着"、"过"与汉语时制的表达[J]. 语言研究, 2002(3): 1-13. (Li Tiegen. The Expression of "Le", "Zhe", "Guo" and Chinese Tense [J]. Sudies in Language and Linguistics, 2002(3): 1-13.)
[14] 李英哲, 郑良伟, Lorry Foster, 等. 实用汉语参考语法[M]. 熊文华译. 北京: 北京语言学院出版社, 1990. (Li Yingzhe, Zheng Liangwei, Lorry Foster, et al. Practical Chinese Reference Grammar [M]. Translated by Xiong Wenhua. Beijing: Beijing Language and Culture University Press, 1990.)
[15] Whorf B L. Science and Linguistics [J]. Technological Review, 1940: 42(6): 229-231.
[16] Linguistic Data Consortium. ACE (Automatic Concept Extraction) Chinese Event Guidelines V5.5.1[EB/OL]. (2005-07-01). [2014-03-22]. https://www.ldc.upenn.edu/sites/www. ldc.upenn.edu/files/chinese-events-guidelines-v5.5.1.pdf.
[17] Egges A, Nijholt A, Nugues P. CarSim: Automatic 3d Scene Generation of a Car Accident Description[R]. Technical Report, University of Twente, 2001.
[18] Boguraev B, Castaño J, Gaizauskas R, et al. TimeML 1.2.1: A Formal Specification Language for Events and Temporal Expressions [EB/OL]. [2014-03-22]. http://timeml.org/site/ publications/timeMLdocs/timeml_1.2.1.html.
[19] 蔡维天. 谈汉语模态词的分布与诠释之对应关系[J]. 中国语文, 2010(3): 208-221. (Tsai W T. On the Syntax-Semantics Correspondences of Chinese Modals [J]. Studies of the Chinese Language, 2010(3): 208-221.)

[1] 陈星月, 倪丽萍, 倪志伟. 基于ELECTRA模型与词性特征的金融事件抽取方法研究*[J]. 数据分析与知识发现, 2021, 5(7): 36-47.
[2] 喻雪寒, 何琳, 徐健. 基于RoBERTa-CRF的古文历史事件抽取方法研究*[J]. 数据分析与知识发现, 2021, 5(7): 26-35.
[3] 胡昊天,吉晋锋,王东波,邓三鸿. 基于深度学习的食品安全事件实体一体化呈现平台构建*[J]. 数据分析与知识发现, 2021, 5(3): 12-24.
[4] 王倩,王东波,李斌,许超. 面向海量典籍文本的深度学习自动断句与标点平台构建研究*[J]. 数据分析与知识发现, 2021, 5(3): 25-34.
[5] 赵天资, 段亮, 岳昆, 乔少杰, 马子娟. 基于Biterm主题模型的新闻线索生成方法 *[J]. 数据分析与知识发现, 2021, 5(2): 1-13.
[6] 程铁军, 王曼, 黄宝凤, 冯兰萍. 基于CEEMDAN-BP模型的突发事件网络舆情预测研究*[J]. 数据分析与知识发现, 2021, 5(11): 59-67.
[7] 纪有书, 王东波, 黄水清. 基于词对齐的古汉语同义词自动抽取研究*——以前四史典籍为例[J]. 数据分析与知识发现, 2021, 5(11): 135-144.
[8] 韩普, 张伟, 张展鹏, 王宇欣, 方浩宇. 基于特征融合和多通道的突发公共卫生事件微博情感分析*[J]. 数据分析与知识发现, 2021, 5(11): 68-79.
[9] 尹浩然,曹金璇,曹鲁喆,王国栋. 扩充语义维度的BiGRU-AM突发事件要素识别研究*[J]. 数据分析与知识发现, 2020, 4(9): 91-99.
[10] 聂磊,傅娟,易成岐,杨道玲. 基于移动终端位置数据的企业线下复工水平测算方法研究 *[J]. 数据分析与知识发现, 2020, 4(7): 38-49.
[11] 余传明,原赛,朱星宇,林虹君,张普亮,安璐. 基于深度学习的热点事件主题表示研究*[J]. 数据分析与知识发现, 2020, 4(4): 1-14.
[12] 邓建高,张璇,傅柱,韦庆明. 基于系统动力学的突发事件网络舆情传播研究:以“江苏响水爆炸事故”为例*[J]. 数据分析与知识发现, 2020, 4(2/3): 110-121.
[13] 梁艳平,安璐,刘静. 同类突发公共卫生事件微博话题共振研究*[J]. 数据分析与知识发现, 2020, 4(2/3): 122-133.
[14] 刘玉文,王凯. 面向地域的网络话题识别方法*[J]. 数据分析与知识发现, 2020, 4(2/3): 173-181.
[15] 徐建民,张丽青,王苗. 基于贝叶斯网络的静态话题追踪模型*[J]. 数据分析与知识发现, 2020, 4(2/3): 200-206.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn