Please wait a minute...
New Technology of Library and Information Service  2014, Vol. 30 Issue (12): 51-61    DOI: 10.11925/infotech.1003-3513.2014.12.07
Current Issue | Archive | Adv Search |
The Comparative Analysis of Natural Language Processing Research at Home and Abroad Based on Knowledge Mapping
Qiu Junping, Fang Guoping
Research Center for Chinese Science Evaluation, Wuhan University, Wuhan 430072, China
Download: PDF(848 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper makes a comparative analysis to the development of natural language processing at home and abroad from multi-angle. [Methods] The literatures are from CNKI (5 582), Web of Science (10 348) and major international conferences on natural language processing (5 573). Use word frequency statistics and co-occurrence analysis as main research methods and use knowledge maps to show statistical results. [Results] The result shows that the study of natural language processing performance at home and abroad has a great similarity. Their research focuses on the domains of information extraction, artificial intelligence, information retrieval, machine translation, machine learning and so on. [Limitations] There are some limitations in this paper, such as the choice of subject term, the error resulting from the subjectivity to data cleaning. [Conclusions] According to the results, several recommendations are made on the development of natural language processing.

Key wordsNatural language processing      Knowledge mapping      Information retrieval      Machine learning     
Received: 17 March 2014      Published: 20 January 2015
:  G250  

Cite this article:

Qiu Junping, Fang Guoping. The Comparative Analysis of Natural Language Processing Research at Home and Abroad Based on Knowledge Mapping. New Technology of Library and Information Service, 2014, 30(12): 51-61.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2014.12.07     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2014/V30/I12/51

[1] Allen J. 自然语言理解[M]. 第二版. 刘群, 张华平, 骆卫华, 等译. 北京: 电子工业出版社, 2005. (Allen J. Natural Language Understanding [M]. The 2nd Edition. Translated by Liu Qun, Zhang Huaping, Luo Weihua, et al. Beijing: Publishing House of Electronics Industry, 2005.)
[2] 杨国文. 自然语言理解[J]. 外语教学与研究, 1987(3) : 28-31, 81. (Yang Guowen. On Understanding Natural Language [J]. Foreign Language Teaching and Research, 1987(3): 28-31, 81.)
[3] 冯志伟. 自然语言处理的学科定位[J]. 解放军外国语学院学报, 2005, 28(3): 1-8. (Feng Zhiwei. Academic Position of Natural Language Processing [J]. Journal of PLA University of Foreign Languages, 2005, 28(3): 1-8.)
[4] 冯志伟. 自然语言处理的历史与现状[J]. 中国外语, 2008, 5(1): 14-22. (Feng Zhiwei. The Past and Present of Natural Language Processing [J]. Foreign Languages in China, 2008, 5(1): 14-22.)
[5] 曹佩. 论自然语言处理[J]. 信息与电脑, 2010(5): 187. (Cao Pei. On the Natural Language Processing [J]. China Computer and Communication, 2010(5): 187.)
[6] 殷杰, 董佳蓉. 论自然语言处理的发展趋势[J]. 自然辩证法研究, 2008, 24(3): 31-37. (Yin Jie, Dong Jiarong. The Development Trend of the Natural Language Processing [J]. Studies in Dialectics of Nature, 2008, 24(3): 31-37.)
[7] 祝清松. 我国自然语言处理研究的文献计量分析[J]. 情报杂志, 2009, 28(S2): 32-34. (Zhu Qingsong. Bibliometric Analysis of Natural Language Processing in China [J]. Journal of Information, 2009, 28(S2): 32-34.)
[8] 李阳, 许培扬. 我国自然语言处理研究文献计量分析[J]. 中华医学图书情报杂志, 2012, 21(2): 65-70. (Li Yang, Xu Peiyang. Research on Natural Language Processing in China: A Bibliometric Analysis [J]. Chinese Journal of Medical Library and Information Science, 2012, 21(2): 65-70.)
[9] 田瑛. 运用语义分析解决自然语言处理中的英语歧义问题[J]. 语文学刊(外语教育与教学), 2009(5): 14-15. (Tian Ying. The Use of Natural Language Processing Semantic Analysis to Resolve Ambiguity in English [J]. Journal of Language and Literature Studies, 2009(5):14-15.)
[10] 吴巧玲. 中文分词算法在自然语言处理技术中的研究及应用[J]. 信息与电脑, 2011(12): 39-40. (Wu Qiaoling. Chinese Word Segmentation Algorithm and Its Application in Natural Language Processing Techniques [J]. China Computer & Communication, 2011(12): 39-40.)
[11] 许坤, 冯岩松, 赵东岩, 等.面向知识库的中文自然语言问句的语义理解[J]. 北京大学学报: 自然科学版, 2014, 50(1): 85-92. (Xu Kun, Feng Yansong, Zhao Dongyan, et al. Automatic Understanding of Natural Language Questions for Querying Chinese Knowledge Bases [J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2014, 50(1): 85-92.)
[12] 才让加. 面向自然语言处理的大规模汉藏(藏汉)双语语料库构建技术研究[J]. 中文信息学报, 2011, 25(6): 157-161. (Tse Ring'rgyal. Research on Large-scale Sino-Tibetan Bilingual Corpus Construction for Natural Language Processing [J]. Journal of Chinese Information Processing, 2011, 25(6): 157-161.)
[13] 孟维娟. 自然语言处理中的歧义[J]. 上海电机学院学报, 2006, 9(S1): 16-19. (Meng Weijuan. Simple Analysis of Ambiguity in Natural Language Processing [J]. Journal of Shanghai Dianji University, 2006, 9(S1): 16-19.)
[14] 邱均平. 信息计量学[M]. 武汉: 武汉大学出版社, 2007. (Qiu Junping. Informetrics [M]. Wuhan: Wuda Publishing House, 2007.)
[15] 化柏林. 用VBA实现文献计量分析研究中的数据预处理技术[J]. 现代图书情报技术, 2007(3): 69-72. (Hua Bolin. Implementation of Preprocess Technology in Bibliometric and Analytic Research via VBA [J]. New Technology of Library and Information Service, 2007(3): 69-72.)
[16] 中国科学报: 《让机器会"说"多种语言》[EB/OL]. [2014-01-03]. http://www.ia.cas.cn/xwzx/mtsm/201401/t20140103_ 4009934.html. (Chinese Science News: Let the Machine "Say" in Many Languages [EB/OL]. [2014-01-03]. http://www.ia.cas.cn/ xwzx/mtsm/201401/t20140103_4009934.html.)
[17] 寇继虹, 楼雯. 基于知识图谱的E-learning研究的可视化分析[J]. 电化教育研究, 2010(9): 20-25. (Kou Jihong, Lou Wen. Visual Analysis of E-learning Based on Knowledge Map [J]. E-education Research, 2010(9): 20-25.)
[18] 杨皓东, 江凌, 李国俊. 国内自然语言处理研究热点分析——基于共词分析[J].图书情报工作, 2011, 55(10): 112-117. (Yang Haodong, Jiang Ling, Li Guojun. The Hotspot of Natural Language Processing in China: Based on Co-word Analysis [J]. Library and Information Service, 2011, 55(10): 112-117.)
[19] Wong K, Li W, Xu R, et al. Book Review: Introduction to Chinese Natural Language Processing [J]. Computational Linguistics, 2010, 36(4): 777-780.
[20] 李生. 自然语言处理的研究与发展[J]. 燕山大学学报, 2013, 37(5): 377-384. (Li Sheng. Research and Development of Natural Language Processing [J]. Journal of Yanshan University, 2013, 37(5): 377-384.)
[21] 王献昌, 史晓东, 陈火旺. 机器翻译与自然语言处理的现状与趋势[J]. 计算机科学, 1992, 19(3): 1-3. (Wang Xianchang, Shi Xiaodong, Chen Huowang. The Current Situation and Trend of Machine Learning and Natural Language Processing [J]. Computer Science, 1992, 19(3): 1-3.)
[22] Liu J, Wang Q, Lin C, et al. Question Difficulty Estimation in Community Question Answering Services [C]. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA. Association for Computational Linguistics, 2013: 85-90.
[23] 孙镇, 王惠临. 命名实体识别研究进展综述[J]. 现代图书情报技术, 2010(6): 42-47. (Sun Zhen, Wang Huilin. Overview on the Advance of the Research on Named Entity Recognition [J]. New Technology of Library and Information Service, 2010(6): 42-47.
[24] 崔新华. 自然语言处理在信息检索中的应用研究[J]. 贵阳学院学报: 自然科学版, 2012, 7(3): 37-40. (Cui Xinhua. Natural Language Processing Applications in Formation Retrieval Research [J]. Journal of Guiyang College: Natural Science, 2012, 7(3): 37-40.)
[25] 左远清, 周洞汝, 王波. 自然语言处理在搜索引擎信息检索中的应用[J].现代计算机, 2002(7): 28-29, 44. (Zuo Yuanqing, Zhou Dongru, Wang Bo. Application of Natural Language Processing in Information Retrieve by Search Engineer [J]. Modem Computer, 2002(7): 28-29, 44.)
[26] 于志敏, 张文德. 基于自然语言处理的信息检索[J]. 山东科技大学学报: 自然科学版, 2006, 25(1): 122-124. (Yu Zhimin, Zhang Wende. Information Retrieval Based on Natural Language Processing [J]. Journal of Shandong University of Science and Technology: Natural Science, 2006, 25(1): 122-124.)
[27] 蔡霞, 张森. 自然语言理解在Web数据挖掘中的应用[J]. 计算机工程与设计, 2003, 24(11): 1-3. (Cai Xia, Zhang Sen. Practice of Web Mining Based on Nature Language Understanding [J]. Computer Engineering and Design, 2003, 24(11): 1-3.)
[28] Lewis D D, Jones K S. Natural Language Processing for Information Retrieval [J]. Communications of the ACM, 1996, 39(1): 92-101.
[29] Voorhees E M. Natural Language Processing and Information Retrieval [A].// Information Extraction [M]. Springer Berlin Heidelberg, 1999: 1-17.
[30] Doszkocs T E. Natural Language Processing in Information Retrieval [J]. Journal of the American Society for Information Science, 1986, 37(4): 191-196.
[31] 黄敏. 自然语言处理与信息检索[J]. 图书情报工作, 2001, 45(4): 41-44, 65. (Huang Min. Natural Language Processing and Information Retrieval [J]. Library and Information Service, 2001, 45(4): 41-44, 65.)
[32] 蔡艳婧, 程显毅, 潘燕. 面向自然语言处理的人工智能框架[J]. 微电子学与计算机, 2011, 28(10): 173-176, 180. (Cai Yanjing, Cheng Xianyi, Pan Yan. A Framework of Artificial Intelligence Oriented Natural Language Processing [J]. Microelectronics & Computer, 2011, 28(10): 173-176, 180.)
[33] Obermeier K K.Natural Language Processing Technologies in Artificial-Intelligence -The Science and Industry Perspective [M]. Ellis Horwood, 1989.
[34] Costantino M, Morgan R G, Collingham R J, et al. Natural Language Processing and Information Extraction: Qualitative Analysis of Financial News Articles [C]. In: Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering. IEEE, 1997: 116-122.
[35] Coulet A, Cohen K B, Altman R B. The State of the Art in Text Mining and Natural Language Processing for Pharmacogenomics [J]. Journal of Biomedical Informatics, 2012, 45(5): 825-826.
[36] Zhou G, Liu F, Liu Y, et al. Statistical Machine Translation Improves Question Retrieval in Community Question Answering via Matrix Factorization [C]. In: Proceedings of Annual Meeting of the Association of Computational Linguistics. 2013.

[1] Jiahui Hu,An Fang,Wanqing Zhao,Chenliu Yang,Huiling Ren. Annotating Chinese E-Medical Record for Knowledge Discovery[J]. 数据分析与知识发现, 2019, 3(7): 123-132.
[2] Jinzhu Zhang,Yiming Hu. Extracting Titles from Scientific References in Patents with Fusion of Representation Learning and Machine Learning[J]. 数据分析与知识发现, 2019, 3(5): 68-76.
[3] Zhiqiang Liu,Yuncheng Du,Shuicai Shi. Extraction of Key Information in Web News Based on Improved Hidden Markov Model[J]. 数据分析与知识发现, 2019, 3(3): 120-128.
[4] Hongxia Xu,Chunwang Li. Review of Knowledge Extraction of Scientific Literature[J]. 数据分析与知识发现, 2019, 3(3): 14-24.
[5] Zixuan Zhang,Hao Wang,Liping Zhu,Sanhong eng. Identifying Risks of HS Codes by China Customs[J]. 数据分析与知识发现, 2019, 3(1): 72-84.
[6] Lina Liu,Jiayin Qi,Zhenping Zhang,Dan Zeng. Analyzing Impacts of Brand Reputation on Online Sales Based on Massive Commodity Reviews and Brand[J]. 数据分析与知识发现, 2018, 2(9): 10-21.
[7] Haixia Sun,Lei Wang,Yingjie Wu,Weina Hua,Junlian Li. Matching Strategies for Institution Names in Literature Database[J]. 数据分析与知识发现, 2018, 2(8): 88-97.
[8] Longjia Jia,Bangzuo Zhang. Classifying Topics of Internet Public Opinion from College Students: Case Study of Sina Weibo[J]. 数据分析与知识发现, 2018, 2(7): 55-62.
[9] Wei Lu,Mengqi Luo,Heng Ding,Xin Li. Image Annotation Tags by Deep Learning and Real Users: A Comparative Study[J]. 数据分析与知识发现, 2018, 2(5): 1-10.
[10] Li Wang,Lixue Zou,Xiwen Liu. Visualizing Document Correlation Based on LDA Model[J]. 数据分析与知识发现, 2018, 2(3): 98-106.
[11] Xinyue Fan,Lei Cui. Predicting Antineoplastic Drug Targets Based on Network Properties[J]. 数据分析与知识发现, 2018, 2(12): 98-108.
[12] Yang Zhao,Xini Yuan,Yawen Chen,Liqiang Wu. Predicting Conversion Rate of APP Advertising with Machine Learning[J]. 数据分析与知识发现, 2018, 2(11): 2-9.
[13] Xin Wang,Wen’gang Feng. Review of Techniques Detecting Online Extremism and Radicalization[J]. 数据分析与知识发现, 2018, 2(10): 2-8.
[14] Chaofan Yang,Zhonghua Deng,Xin Peng,Bin Liu. Review of Information Retrieval Research: Case Study of Conference Papers[J]. 数据分析与知识发现, 2017, 1(7): 35-43.
[15] Zhongyi Hu,Chaoqun Wang,Jiang Wu. Identifying Phishing Websites with Multiple Online Data Sources[J]. 数据分析与知识发现, 2017, 1(6): 47-55.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn