Please wait a minute...
Advanced Search
现代图书情报技术  2010, Vol. 26 Issue (4): 12-17     https://doi.org/10.11925/infotech.1003-3513.2010.04.03
  数字图书馆 本期目录 | 过刊浏览 | 高级检索 |
基于清华汉语树库的有标记联合结构统计分析
王东波,谢靖
(南京大学信息管理系)
Analyzing the Linguistic Features of Coordination with Overt
Wang Dong-Bo,Xie Jing
(Department of Information Management, Nanjing University, Nanjing 210093,China)
全文: PDF (415 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 

详细统计和分析有标记联合结构的内部语言学和外部语言学特征。内部特征方面主要考察该结构的词性序列分布、短语序列分布;外部特征方面主要考察该结构的句法功能分布和左右边界特征词。这些考察一方面为从量化的角度研究该结构提供相对精确的数据,另一方面为计算机自动识别该结构提供语言学知识。

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
王东波
谢靖
关键词 有标记联合结构 内部语言学特征 外部语言学特征 清华汉语树库    
Abstract

The article counts and analyzes the internal and external linguistic features of Coordination with Overt Conjunctions (COC) in detail. It mainly investigates the internal linguistic features including the distribution of Part-Of-Speech(POS) and phrases sequences, as well as the external linguistic features including the distribution of syntactic function and the features of border lexicons. For one thing, the statistical data offers the linguistic knowledge for identifying COC, for another thing, the accurate data is used to investigate the COC.

Key wordsCOC     Internal linguistic features     External linguistic features     Tsinghua Chinese treebank
收稿日期: 2010-03-08      出版日期: 2010-04-25
基金资助:

*本文系教育部人文社会科学研究基金资助项目“基于大规模语料库和WordNet词库的英汉学习型词典设计特征知识获取”(项目编号:09YJAZH042)和国家社会科学基金项目“汉语词语搭配获取与语义特征分析的相互关系研究”(项目编号:07BYY050)的研究成果之一。

通讯作者: 王东波     E-mail: wangdongbo0102@gmail.com
引用本文:   
王东波 谢靖. 基于清华汉语树库的有标记联合结构统计分析[J]. 现代图书情报技术, 2010, 26(4): 12-17.
Wang Dong-Bo,Xie Jing. Analyzing the Linguistic Features of Coordination with Overt. New Technology of Library and Information Service, 2010, 26(4): 12-17.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2010.04.03      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2010/V26/I4/12

[1] 周强.汉语语料库的短语自动划分和标注研究[D].北京:北京大学,2002.
[2] 吴云芳.面向中文信息处理的现代汉语并列结构研究[D].北京:北京大学,2003.
[3] Church K. Astochastic Parts Program and Noun Phrase Parser for Unrestricted Text[C]. In:Proceedings of the 2nd Conference on Applied Natural Language Processing. Austin:Association for Computational Linguistics,1988:136-143.
[4] Agarwal R, Boggess L. A Simple but Useful Approach to Conjunct Identification[C]. In:Proceedings of the 30th Annual Meeting of Asscosiation for Computational Linguistics,Newark, Delaware. Morristown, NJ, USA: Association for Computational Linguistics,1992:15-20.
[5] Akitoshi Okumura,Kazunori Muraki. Symmetric Pattern Matching Analysis for English Coordinate Structures[C].In:Proceedings of the 4th Conference on Applied Natural Language Processing,Stuttgart, Germany. Morristown, NJ, USA:Association for Computational Linguistics,  1994:41-46.
[6] Sadao Kurohashi,Makoto Nagao.A Syntactic Analysis Method of Long Japanese Sentences Based on the Detection of Conjunctive Structures[J]. Computational Linguistics,1994,20(4):507-534.
[7] 邓云华.汉语联合短语的类型和共性[D].长沙:湖南师范大学,2004.
[8] 马清华.并列结构的自组织研究[D].武汉:华中师范大学,2004.
[9] 苗艳军,李军辉,周国栋.统计和规则相结合的并列结构自动识别[J].计算机应用研究,2009,26(9):3404-3406.
[10] 周强.汉语句法树库标注体系[J].中文信息学报,2004,18(3):l-8.
[11] 陈小荷.从自动句法分析角度看汉语词类问题[J].语言教学与研究,1999(3):63-72.
[12] 徐艳华.现代汉语实词语法功能考察及词类体系重构[D].南京:南京师范大学,2006.
[13] 卢俊之,陈小荷,王东波,等. 基于语法功能匹配的汉语句法分析算法[J].计算机工程与应用,2008,44(16):151-153,159.
[14] 吴云芳.并列成分中心词语义相似性考察[J].当代语言学,2005,7(4):305-315.

No related articles found!
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn