ManGO:Grammar Engineering for Deep Linguistic Processing
Yang Chunlei1, Dan Flickinger2
1 College of English Language and Literature, Shanghai International Studies University, Shanghai 201600, China;
2 The Center for the Study of Language and Information (CSLI), Stanford University, Stanford 94305, USA
[Objective] This article contributes to the development of ManGO (Mandarin Grammar Online) for deep linguistic processing. [Context] On the platform of LKB (the Linguistic Knowledge Builder) and based on Grammar Matrix, ManGO is developed in the environment of DELPH-IN (Deep Linguistic Processing with HPSG Initiative). The frameworks of its syntactic and semantic analysis are HPSG (Head-driven Phrase Structure Grammar) and MRS (Minimal Recursion Semantics) respectively. ManGO lays a solid foundation for further resource grammar development and commercial application. [Methods] First, linguistic knowledge is formalized according to systematic Ontological studies. Then, the computational implementation of ManGO goes through grammar customization, creation of a Chinese MRS test suite, lexicon building, definition of grammar rules and MRS representation. [Results] ManGO covers nearly all the major Chinese word types and grammar phenomona, and fully covers the Chinese MRS test suite. [Conclusions] ManGO is one of the earliest medium-size computational grammars of Chinese. It serves as the bridge and effective carrier of the interdisciplinary studies across formal grammar theory and computational linguistics.
杨春雷, Dan Flickinger. 汉构:面向深层语言处理的语法工程[J]. 现代图书情报技术, 2014, 30(3): 57-64.
Yang Chunlei, Dan Flickinger. ManGO:Grammar Engineering for Deep Linguistic Processing. New Technology of Library and Information Service, 2014, 30(3): 57-64.
[1] Oepen S, Flickinger D, Tsujii J, et al. Collaborative Language Engineering: A Case Study in Efficient Grammar-based Processing [M]. Stanford: CSLI Publications, 2002.
[2] Bender E M. Grammar Engineering for Linguistic Hypothesis Testing [C]. In: Proceedings of the Texas Linguistics Society X Conference: Computational Linguistics for Less-Studied Languages. Stanford: CSLI Publications Online, 2008: 16-36.
[3] Bender E M, Drellishak S, Fokkens A, et al. Grammar Customization [J]. Research on Language & Computation, 2010, 8 (1):23-72.
[4] 陆俭明. 汉语言文字应用面面观 [J]. 语言文字应用, 2000(2): 4-8. (Lu Jianming. Aspects of Language Use in China [J]. Applied Linguistics, 2000(2): 4-8.)
[5] Pollard C J, Sag I A. Head-driven Phrase Structure Grammar [M]. Chicago: The University of Chicago Press, 1994.
[6] Sag I A, Wasow T, Bender E M. Syntactic Theory: A Formal Introduction [M]. Stanford: CSLI Publications, 2003.
[7] Boas H C, Sag I A. Sign-Based Construction Grammar [M]. Stanford: CSLI Publications, 2012.
[8] Zhang Y, Wang R, Chen Y. Joint Grammar and TreeBank Development for Mandarin Chinese with HPSG [C]. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC'2012), Istanbul, Turkey. 2012:1868-1873.
[9] 范子衿, 王惠临, 张均胜. 中心语驱动短语结构语法研究综述 [J].现代图书情报技术, 2013(4): 40-47. (Fan Zijin, Wang Huilin, Zhang Junsheng. Review of Head-driven Phrase Structure Grammar [J]. New Technology of Library and Information Service, 2013(4): 40-47.)
[10] Hutchins J. Latest Developments in Machine Translation Technology [C]. In: Proceedings of MT Summit IV, Kobe, Japan.1993:11-34.
[11] Kay M. Collected Papers of Martin Kay: A Half Century of Computational Linguistics [M]. Stanford: CSLI Publications, 2010.
[12] 冯志伟. 自然语言处理的学科定位 [J]. 解放军外国语学院学报, 2005, 28(3): 1-8. (Feng Zhiwei. Academic Position of Natural Language Processing [J]. Journal of PLA University of Foreign Languages, 2005,28(3):1-8.)
[13] 方立, 吴平. 中心语驱动短语结构语法评介 [J]. 语言教学与研究, 2003(5): 31-43. (Fang Li, Wu Ping. A Review of Head-driven Phrase Structure Grammar [J]. Language Teaching and Linguistic Studies, 2003(5): 31-43.)
[14] 陆俭明. 句法语义接口问题 [J]. 外国语, 2006(3): 30-35. (Lu Jianming. On Interface between Syntax and Semantics [J]. Journal of Foreign Languages, 2006(3):30-35.)
[15] Backofen R, Becker T, Calder J, et al. The EAGLES Formalisms Working Group-Final Report [R]. Saarbrücken: German Research Center for Artificial Intelligence (DFKI), 1996.
[16] Bender E M, Flickinger D, Oepen S. The Grammar Matrix: An Open-Source Starter-Kit for the Rapid Development of Cross-Linguistically Consistent Broad-Coverage Precision Grammars [C]. In: Proceedings of the Workshop on Grammar Engineering and Evaluation at the 19th International Conference on Computational Linguistics, Taipei, Taiwan, China.2002: 8-14.
[17] Copestake A, Flickinger D, Pollard C, et al. Minimal Recursion Semantics: An Introduction [J]. Research on Language and Computation, 2005, 3(2-3):281-332.
[18] 曾少勤, 王惠临, 张寅生.汉语文本的最小递归语义表示研究——以名词性量化短语为例 [J].现代图书情报技术, 2012 (10): 35-41. (Zeng Shaoqin, Wang Huilin, Zhang Yinsheng. Mandarin Text Representation Based on Minimal Recursion Semantics——Illustrated by Quantitative Noun Phrases [J]. New Technology of Library and Information Service, 2012(10): 35-41.)
[19] Flickinger D, Yang J C. ManGO: Mandarin Grammar Online [C]. In: Proceedings of DELPH-IN Summit 2011, Seattle, Suquamish, USA.2011.
[20] 杨春雷. 兼语式的深层语言处理: 从语言学设计到计算实现 [J]. 外国语,2013,36(3): 50-59. (Yang Chunlei. Deep Linguistic Processing of Pivotal Construction: From Linguistic Design to Implementation [J]. Journal of Foreign Languages, 2013,36(3): 50-59.)
[21] Fokkens A, Avgustinova T, Zhang Y. CLIMB Grammars: Three Projects Using Metagrammar Engineering [C]. In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC'12), Instanbul, Turkey. 2012:1672-1679.