Please wait a minute...
Advanced Search
现代图书情报技术  2014, Vol. 30 Issue (12): 51-61     https://doi.org/10.11925/infotech.1003-3513.2014.12.07
  知识组织与知识管理 本期目录 | 过刊浏览 | 高级检索 |
基于知识图谱的中外自然语言处理研究的对比分析
邱均平, 方国平
武汉大学中国科学评价研究中心 武汉 430072
The Comparative Analysis of Natural Language Processing Research at Home and Abroad Based on Knowledge Mapping
Qiu Junping, Fang Guoping
Research Center for Chinese Science Evaluation, Wuhan University, Wuhan 430072, China
全文: PDF (848 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 

[目的]从多角度对中外自然语言处理的发展进行对比分析.[方法]对5 582篇来自CNKI、10 348篇来自Web of Science、5 573篇来自与自然语言处理相关的重大国际会议文献, 采用词频统计法、共现分析法相结合的方法, 利用知识图谱呈现统计结果.[结果]统计结果表明, 中外对自然语言处理的研究表现出极大的相似性, 研究内容都集中在信息抽取、人工智能、信息检索、机器翻译、机器学习等领域.[局限]检索主题词的选取、数据清洗时的主观性给研究带来误差.[结论]对国内自然语言处理的发展提出建议.

服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
方国平
邱均平
关键词 自然语言处理知识图谱信息检索机器学习    
Abstract

[Objective] This paper makes a comparative analysis to the development of natural language processing at home and abroad from multi-angle. [Methods] The literatures are from CNKI (5 582), Web of Science (10 348) and major international conferences on natural language processing (5 573). Use word frequency statistics and co-occurrence analysis as main research methods and use knowledge maps to show statistical results. [Results] The result shows that the study of natural language processing performance at home and abroad has a great similarity. Their research focuses on the domains of information extraction, artificial intelligence, information retrieval, machine translation, machine learning and so on. [Limitations] There are some limitations in this paper, such as the choice of subject term, the error resulting from the subjectivity to data cleaning. [Conclusions] According to the results, several recommendations are made on the development of natural language processing.

Key wordsNatural language processing    Knowledge mapping    Information retrieval    Machine learning
收稿日期: 2014-03-17      出版日期: 2015-01-20
:  G250  
基金资助:

本文系国家社会科学基金项目"基于语义的馆藏资源深度聚合与可视化展示研究"(项目编号:11&ZD152)的研究成果之一.

通讯作者: 方国平 E-mail: 1259297235@qq.com     E-mail: 1259297235@qq.com
作者简介: 作者贡献声明: 邱均平: 提出研究命题, 设计研究思路及研究方法, 论文起草, 最终版本修订; 方国平: 论文修改, 文献调研, 原始数据获取、清洗、分析.
引用本文:   
邱均平, 方国平. 基于知识图谱的中外自然语言处理研究的对比分析[J]. 现代图书情报技术, 2014, 30(12): 51-61.
Qiu Junping, Fang Guoping. The Comparative Analysis of Natural Language Processing Research at Home and Abroad Based on Knowledge Mapping. New Technology of Library and Information Service, 2014, 30(12): 51-61.
链接本文:  
https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2014.12.07      或      https://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/Y2014/V30/I12/51

[1] Allen J. 自然语言理解[M]. 第二版. 刘群, 张华平, 骆卫华, 等译. 北京: 电子工业出版社, 2005. (Allen J. Natural Language Understanding [M]. The 2nd Edition. Translated by Liu Qun, Zhang Huaping, Luo Weihua, et al. Beijing: Publishing House of Electronics Industry, 2005.)
[2] 杨国文. 自然语言理解[J]. 外语教学与研究, 1987(3) : 28-31, 81. (Yang Guowen. On Understanding Natural Language [J]. Foreign Language Teaching and Research, 1987(3): 28-31, 81.)
[3] 冯志伟. 自然语言处理的学科定位[J]. 解放军外国语学院学报, 2005, 28(3): 1-8. (Feng Zhiwei. Academic Position of Natural Language Processing [J]. Journal of PLA University of Foreign Languages, 2005, 28(3): 1-8.)
[4] 冯志伟. 自然语言处理的历史与现状[J]. 中国外语, 2008, 5(1): 14-22. (Feng Zhiwei. The Past and Present of Natural Language Processing [J]. Foreign Languages in China, 2008, 5(1): 14-22.)
[5] 曹佩. 论自然语言处理[J]. 信息与电脑, 2010(5): 187. (Cao Pei. On the Natural Language Processing [J]. China Computer and Communication, 2010(5): 187.)
[6] 殷杰, 董佳蓉. 论自然语言处理的发展趋势[J]. 自然辩证法研究, 2008, 24(3): 31-37. (Yin Jie, Dong Jiarong. The Development Trend of the Natural Language Processing [J]. Studies in Dialectics of Nature, 2008, 24(3): 31-37.)
[7] 祝清松. 我国自然语言处理研究的文献计量分析[J]. 情报杂志, 2009, 28(S2): 32-34. (Zhu Qingsong. Bibliometric Analysis of Natural Language Processing in China [J]. Journal of Information, 2009, 28(S2): 32-34.)
[8] 李阳, 许培扬. 我国自然语言处理研究文献计量分析[J]. 中华医学图书情报杂志, 2012, 21(2): 65-70. (Li Yang, Xu Peiyang. Research on Natural Language Processing in China: A Bibliometric Analysis [J]. Chinese Journal of Medical Library and Information Science, 2012, 21(2): 65-70.)
[9] 田瑛. 运用语义分析解决自然语言处理中的英语歧义问题[J]. 语文学刊(外语教育与教学), 2009(5): 14-15. (Tian Ying. The Use of Natural Language Processing Semantic Analysis to Resolve Ambiguity in English [J]. Journal of Language and Literature Studies, 2009(5):14-15.)
[10] 吴巧玲. 中文分词算法在自然语言处理技术中的研究及应用[J]. 信息与电脑, 2011(12): 39-40. (Wu Qiaoling. Chinese Word Segmentation Algorithm and Its Application in Natural Language Processing Techniques [J]. China Computer & Communication, 2011(12): 39-40.)
[11] 许坤, 冯岩松, 赵东岩, 等.面向知识库的中文自然语言问句的语义理解[J]. 北京大学学报: 自然科学版, 2014, 50(1): 85-92. (Xu Kun, Feng Yansong, Zhao Dongyan, et al. Automatic Understanding of Natural Language Questions for Querying Chinese Knowledge Bases [J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2014, 50(1): 85-92.)
[12] 才让加. 面向自然语言处理的大规模汉藏(藏汉)双语语料库构建技术研究[J]. 中文信息学报, 2011, 25(6): 157-161. (Tse Ring'rgyal. Research on Large-scale Sino-Tibetan Bilingual Corpus Construction for Natural Language Processing [J]. Journal of Chinese Information Processing, 2011, 25(6): 157-161.)
[13] 孟维娟. 自然语言处理中的歧义[J]. 上海电机学院学报, 2006, 9(S1): 16-19. (Meng Weijuan. Simple Analysis of Ambiguity in Natural Language Processing [J]. Journal of Shanghai Dianji University, 2006, 9(S1): 16-19.)
[14] 邱均平. 信息计量学[M]. 武汉: 武汉大学出版社, 2007. (Qiu Junping. Informetrics [M]. Wuhan: Wuda Publishing House, 2007.)
[15] 化柏林. 用VBA实现文献计量分析研究中的数据预处理技术[J]. 现代图书情报技术, 2007(3): 69-72. (Hua Bolin. Implementation of Preprocess Technology in Bibliometric and Analytic Research via VBA [J]. New Technology of Library and Information Service, 2007(3): 69-72.)
[16] 中国科学报: 《让机器会"说"多种语言》[EB/OL]. [2014-01-03]. http://www.ia.cas.cn/xwzx/mtsm/201401/t20140103_ 4009934.html. (Chinese Science News: Let the Machine "Say" in Many Languages [EB/OL]. [2014-01-03]. http://www.ia.cas.cn/ xwzx/mtsm/201401/t20140103_4009934.html.)
[17] 寇继虹, 楼雯. 基于知识图谱的E-learning研究的可视化分析[J]. 电化教育研究, 2010(9): 20-25. (Kou Jihong, Lou Wen. Visual Analysis of E-learning Based on Knowledge Map [J]. E-education Research, 2010(9): 20-25.)
[18] 杨皓东, 江凌, 李国俊. 国内自然语言处理研究热点分析——基于共词分析[J].图书情报工作, 2011, 55(10): 112-117. (Yang Haodong, Jiang Ling, Li Guojun. The Hotspot of Natural Language Processing in China: Based on Co-word Analysis [J]. Library and Information Service, 2011, 55(10): 112-117.)
[19] Wong K, Li W, Xu R, et al. Book Review: Introduction to Chinese Natural Language Processing [J]. Computational Linguistics, 2010, 36(4): 777-780.
[20] 李生. 自然语言处理的研究与发展[J]. 燕山大学学报, 2013, 37(5): 377-384. (Li Sheng. Research and Development of Natural Language Processing [J]. Journal of Yanshan University, 2013, 37(5): 377-384.)
[21] 王献昌, 史晓东, 陈火旺. 机器翻译与自然语言处理的现状与趋势[J]. 计算机科学, 1992, 19(3): 1-3. (Wang Xianchang, Shi Xiaodong, Chen Huowang. The Current Situation and Trend of Machine Learning and Natural Language Processing [J]. Computer Science, 1992, 19(3): 1-3.)
[22] Liu J, Wang Q, Lin C, et al. Question Difficulty Estimation in Community Question Answering Services [C]. In: Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, Washington, USA. Association for Computational Linguistics, 2013: 85-90.
[23] 孙镇, 王惠临. 命名实体识别研究进展综述[J]. 现代图书情报技术, 2010(6): 42-47. (Sun Zhen, Wang Huilin. Overview on the Advance of the Research on Named Entity Recognition [J]. New Technology of Library and Information Service, 2010(6): 42-47.
[24] 崔新华. 自然语言处理在信息检索中的应用研究[J]. 贵阳学院学报: 自然科学版, 2012, 7(3): 37-40. (Cui Xinhua. Natural Language Processing Applications in Formation Retrieval Research [J]. Journal of Guiyang College: Natural Science, 2012, 7(3): 37-40.)
[25] 左远清, 周洞汝, 王波. 自然语言处理在搜索引擎信息检索中的应用[J].现代计算机, 2002(7): 28-29, 44. (Zuo Yuanqing, Zhou Dongru, Wang Bo. Application of Natural Language Processing in Information Retrieve by Search Engineer [J]. Modem Computer, 2002(7): 28-29, 44.)
[26] 于志敏, 张文德. 基于自然语言处理的信息检索[J]. 山东科技大学学报: 自然科学版, 2006, 25(1): 122-124. (Yu Zhimin, Zhang Wende. Information Retrieval Based on Natural Language Processing [J]. Journal of Shandong University of Science and Technology: Natural Science, 2006, 25(1): 122-124.)
[27] 蔡霞, 张森. 自然语言理解在Web数据挖掘中的应用[J]. 计算机工程与设计, 2003, 24(11): 1-3. (Cai Xia, Zhang Sen. Practice of Web Mining Based on Nature Language Understanding [J]. Computer Engineering and Design, 2003, 24(11): 1-3.)
[28] Lewis D D, Jones K S. Natural Language Processing for Information Retrieval [J]. Communications of the ACM, 1996, 39(1): 92-101.
[29] Voorhees E M. Natural Language Processing and Information Retrieval [A].// Information Extraction [M]. Springer Berlin Heidelberg, 1999: 1-17.
[30] Doszkocs T E. Natural Language Processing in Information Retrieval [J]. Journal of the American Society for Information Science, 1986, 37(4): 191-196.
[31] 黄敏. 自然语言处理与信息检索[J]. 图书情报工作, 2001, 45(4): 41-44, 65. (Huang Min. Natural Language Processing and Information Retrieval [J]. Library and Information Service, 2001, 45(4): 41-44, 65.)
[32] 蔡艳婧, 程显毅, 潘燕. 面向自然语言处理的人工智能框架[J]. 微电子学与计算机, 2011, 28(10): 173-176, 180. (Cai Yanjing, Cheng Xianyi, Pan Yan. A Framework of Artificial Intelligence Oriented Natural Language Processing [J]. Microelectronics & Computer, 2011, 28(10): 173-176, 180.)
[33] Obermeier K K.Natural Language Processing Technologies in Artificial-Intelligence -The Science and Industry Perspective [M]. Ellis Horwood, 1989.
[34] Costantino M, Morgan R G, Collingham R J, et al. Natural Language Processing and Information Extraction: Qualitative Analysis of Financial News Articles [C]. In: Proceedings of the IEEE/IAFE 1997 Computational Intelligence for Financial Engineering. IEEE, 1997: 116-122.
[35] Coulet A, Cohen K B, Altman R B. The State of the Art in Text Mining and Natural Language Processing for Pharmacogenomics [J]. Journal of Biomedical Informatics, 2012, 45(5): 825-826.
[36] Zhou G, Liu F, Liu Y, et al. Statistical Machine Translation Improves Question Retrieval in Community Question Answering via Matrix Factorization [C]. In: Proceedings of Annual Meeting of the Association of Computational Linguistics. 2013.

[1] 王寒雪,崔文娟,周园春,杜一. 基于机器学习的食源性疾病致病菌识别方法*[J]. 数据分析与知识发现, 2021, 5(9): 54-62.
[2] 陈东华,赵红梅,尚小溥,张润彤. 数据驱动的大型医院手术室运营预测与优化方法研究*[J]. 数据分析与知识发现, 2021, 5(9): 115-128.
[3] 车宏鑫,王桐,王伟. 前列腺癌预测模型对比研究*[J]. 数据分析与知识发现, 2021, 5(9): 107-114.
[4] 周阳,李学俊,王冬磊,陈方,彭莉娟. 炸药配方设计知识图谱的构建与可视分析方法研究*[J]. 数据分析与知识发现, 2021, 5(9): 42-53.
[5] 王一钒,李博,史话,苗威,姜斌. 古汉语实体关系联合抽取的标注方法*[J]. 数据分析与知识发现, 2021, 5(9): 63-74.
[6] 苏强, 侯校理, 邹妮. 基于机器学习组合优化方法的术后感染预测模型研究*[J]. 数据分析与知识发现, 2021, 5(8): 65-75.
[7] 沈科杰, 黄焕婷, 化柏林. 基于公开履历数据的人物知识图谱构建*[J]. 数据分析与知识发现, 2021, 5(7): 81-90.
[8] 黄名选,蒋曹清,卢守东. 基于词嵌入与扩展词交集的查询扩展*[J]. 数据分析与知识发现, 2021, 5(6): 115-125.
[9] 曹睿,廖彬,李敏,孙瑞娜. 基于XGBoost的在线短租市场价格预测及特征分析模型*[J]. 数据分析与知识发现, 2021, 5(6): 51-65.
[10] 钟佳娃,刘巍,王思丽,杨恒. 文本情感分析方法及应用综述*[J]. 数据分析与知识发现, 2021, 5(6): 1-13.
[11] 阮小芸,廖健斌,李祥,杨阳,李岱峰. 基于人才知识图谱推理的强化学习可解释推荐研究*[J]. 数据分析与知识发现, 2021, 5(6): 36-50.
[12] 孟镇,王昊,虞为,邓三鸿,张宝隆. 基于特征融合的声乐分类研究*[J]. 数据分析与知识发现, 2021, 5(5): 59-70.
[13] 李贺,刘嘉宇,李世钰,吴迪,金帅岐. 基于疾病知识图谱的自动问答系统优化研究*[J]. 数据分析与知识发现, 2021, 5(5): 115-126.
[14] 向卓元,刘志聪,吴玉. 基于用户行为自适应推荐模型研究 *[J]. 数据分析与知识发现, 2021, 5(4): 103-114.
[15] 李跃艳,王昊,邓三鸿,王伟. 近十年信息检索领域的研究热点与演化趋势研究——基于SIGIR会议论文的分析[J]. 数据分析与知识发现, 2021, 5(4): 13-24.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn