[Objective] This paper compares the development of natural language processing research in China and abroad from multiple angles. [Methods] The literature is drawn from CNKI (5 582), Web of Science (10 348) and major international conferences on natural language processing (5 573). Word frequency statistics and co-occurrence analysis are the main research methods, and knowledge maps are used to present the statistical results. [Results] The results show that natural language processing research in China and abroad is highly similar. Both bodies of research focus on information extraction, artificial intelligence, information retrieval, machine translation, machine learning and related areas. [Limitations] The study has limitations, including the choice of subject terms and errors introduced by subjective decisions during data cleaning. [Conclusions] Based on the results, several recommendations are made for the development of natural language processing.
Qiu Junping, Fang Guoping. The Comparative Analysis of Natural Language Processing Research at Home and Abroad Based on Knowledge Mapping [J]. New Technology of Library and Information Service, 2014, 30(12): 51-61.
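The two methods named in the abstract, word frequency statistics and co-occurrence analysis, can be illustrated with a minimal sketch. It assumes each bibliographic record has already been reduced to a list of keywords; the function name `keyword_statistics` and the toy records are hypothetical and are not taken from the paper's data or tooling.

```python
from collections import Counter
from itertools import combinations


def keyword_statistics(records):
    """Return (term frequency, pair co-occurrence) counters for keyword lists."""
    freq = Counter()
    cooc = Counter()
    for keywords in records:
        # Normalise case and deduplicate so a term counts once per record.
        terms = sorted({k.strip().lower() for k in keywords if k.strip()})
        freq.update(terms)
        # Every unordered pair of terms in the same record co-occurs once.
        cooc.update(combinations(terms, 2))
    return freq, cooc


# Hypothetical toy records, not data from the paper.
records = [
    ["machine translation", "machine learning"],
    ["information retrieval", "machine learning"],
    ["Machine Translation", "machine learning", "information extraction"],
]

freq, cooc = keyword_statistics(records)
print(freq.most_common(2))
print(cooc[("machine learning", "machine translation")])
```

In a sketch like this, the pair counts could serve as edge weights when drawing a keyword network, which is one common way a knowledge map is built; the paper's actual software and cleaning steps are not specified here.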