New Technology of Library and Information Service  2014, Vol. 30 Issue (3): 57-64    DOI: 10.11925/infotech.1003-3513.2014.03.09
ManGO: Grammar Engineering for Deep Linguistic Processing
Yang Chunlei1, Dan Flickinger2
1 College of English Language and Literature, Shanghai International Studies University, Shanghai 201600, China;
2 The Center for the Study of Language and Information (CSLI), Stanford University, Stanford 94305, USA
Abstract  

[Objective] This article describes the development of ManGO (Mandarin Grammar Online), a computational grammar for deep linguistic processing of Chinese. [Context] ManGO is built on the LKB (Linguistic Knowledge Builder) platform, starting from the Grammar Matrix, within the DELPH-IN (Deep Linguistic Processing with HPSG Initiative) environment. It uses HPSG (Head-driven Phrase Structure Grammar) as its framework for syntactic analysis and MRS (Minimal Recursion Semantics) for semantic analysis. ManGO lays a foundation for further resource grammar development and commercial application. [Methods] First, linguistic knowledge is formalized on the basis of systematic ontological studies. Then, the computational implementation proceeds through grammar customization, creation of a Chinese MRS test suite, lexicon building, and the definition of grammar rules and MRS representations. [Results] ManGO covers nearly all the major Chinese word types and grammatical phenomena, and fully covers the Chinese MRS test suite. [Conclusions] ManGO is one of the earliest medium-sized computational grammars of Chinese. It serves as a bridge between formal grammar theory and computational linguistics and as an effective vehicle for interdisciplinary work across the two.

Key words: Mandarin Grammar Online (ManGO); Grammar engineering; Head-driven Phrase Structure Grammar (HPSG); Natural Language Processing (NLP)
Received: 22 November 2013      Published: 15 April 2014
CLC number: H087

Cite this article:

Yang Chunlei, Dan Flickinger. ManGO: Grammar Engineering for Deep Linguistic Processing. New Technology of Library and Information Service, 2014, 30(3): 57-64.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2014.03.09     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2014/V30/I3/57
