Please wait a minute...
Advanced Search
现代图书情报技术  2013, Vol. Issue (5): 40-45    DOI: 10.11925/infotech.1003-3513.2013.05.05
  知识组织与知识管理 本期目录 | 过刊浏览 | 高级检索 |
跨语言术语同义关系推荐方法及其实证
宋培彦1, 李静静1, 赵星2
1. 中国科学技术信息研究所 北京 100038;
2. 北京师范大学图书馆 北京 100875
Recommended Method for Cross-language Term Synonymous Relationship and Its Empirical Research
Song Peiyan1, Li Jingjing1, Zhao Xing2
1. Institute of Scientific & Technical Information of China, Beijing 100038, China;
2. Beijing Normal University Library, Beijing 100875, China
全文: PDF(613 KB)   HTML  
输出: BibTeX | EndNote (RIS)      
摘要 同义关系是术语词间关系的基本类型。以英汉对齐词典为知识库、以等值翻译词对为知识表示形式,提出对中文术语和英文翻译进行双向推导的方法,建立树形拓扑结构并进行遍历操作,通过加权优选模型对同义词推荐结果进行优化。实验证明该方法对术语同义关系识别的准确率较高,在知识组织工具构建、百科知识服务、信息检索等领域有一定应用价值。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
宋培彦
李静静
赵星
关键词 术语同义词知识组织英汉对齐词典术语计算    
Abstract:Synonym of terminology is a basic type of relationships between terms. Based on the English-Chinese alignment dictionary, this paper proposes recommended method of term synonyms, deriving of Chinese and English translation in pairs, and optimizes synonyms results by weighted optimization measure. The experiments indicate that both the accuracy and efficiency of synonym recognition are improved by the method, which is beneficial to the construction of knowledge organization systems, encyclopedias and information retrieval.
Key wordsTerminology    Synonym    Knowledge organization    English-Chinese alignment dictionary    Terminology computing
收稿日期: 2013-04-15     
:  G254  
  TP391  
基金资助:本文系国家社会科学基金青年项目“基于知识组织的术语服务研究”(项目编号:11CTQ018)和国家“十二五”科技支撑计划子项目“《汉语主题词表》(工程技术版)与英文超级科技词表的映射研究”(项目编号:2011BAH10B07)的研究成果之一。
通讯作者: 宋培彦     E-mail: songpy@istic.ac.cn
引用本文:   
宋培彦, 李静静, 赵星. 跨语言术语同义关系推荐方法及其实证[J]. 现代图书情报技术, 2013, (5): 40-45.
Song Peiyan, Li Jingjing, Zhao Xing. Recommended Method for Cross-language Term Synonymous Relationship and Its Empirical Research. New Technology of Library and Information Service, DOI:10.11925/infotech.1003-3513.2013.05.05.
链接本文:  
http://manu44.magtech.com.cn/Jwk_infotech_wk3/CN/10.11925/infotech.1003-3513.2013.05.05
[1] Wei X, Peng F C, Tseng H. Search with Synonyms: Problems and Solutions[C]. In: Proceedings of the International Conference on Computational Linguistics (COLING). 2010:1318-1326.
[2] Gnoli C. Ten Long-term Research Questions in Knowledge Organization[J]. Knowledge Organization, 2008, 35(2-3): 137-149.
[3] Bourigault D, Jacquemin C, L’Homme M C. Recent Advances in Computational Terminology[M]. Amsterdam:John Benjamins Publishing, 2001.
[4] Falk I, Gardent C, Jacquey E, et al. Grouping Synonyms by Definitions[C]. In: Proceedings of Recent Advances in Natural Language Processing (RANLP), Borovets, Bulgaria.2009:76-81.
[5] 刘群,李素健.基于《知网》的词汇语义相似度计算[OL].[2013-05-16].http://www.keenage.com/html/c_index.html (Liu Qun,Li Sujian. lexical semantic similarity computation based on Hownet[OL].[2013-05-16].http://www.keenage.com/html/c_index.html)
[6] 张瑞霞,朱贵良,杨国增. 基于知识图的汉语词汇语义相似度计算[J]. 中文信息学报 , 2009,23(3):116-120.(Zhang Ruixia, Zhu Guiliang, Yang Guozeng. A New Measure of Semantic Similarity Between Chinese Words Based on Knowledge Graphs[J]. Journal of Chinese Information Processing, 2009,23(3):116-120.)
[7] Blondel V D, Senellart P P. Automatic Extraction of Synonyms in a Dictionary[C].In: Proceedings of the SIAM Text Mining Workshop.2002.
[8] Jannink J. Thesaurus Entry Extraction from an On-line Dictionary[OL]. [2013-04-12].http://infolab.stanford.edu/SKC/publications/webster.pdf.
[9] 聂卉,龙朝辉. 结合语义相似度与相关度的概念扩展[J]. 情报学报 ,2007,26(5):728-732.(Nie Hui, Long Zhaohui. Conceptual Retrieval Based on Semantic Similarity Integrated with Semantic Relativity[J]. Journal of the China Society for Scientific and Technical Information,2007,26(5):728-732.)
[10] 滕广青,毕强. 基于概念格的跨本体映射中概念相似度计算方法[J]. 情报学报 , 2012,31(4):390-397.(Teng Guangqing, Bi Qiang. An Approach to Concept Similarity Computation in Cross-Ontology Mapping Based on Concept Lattice[J]. Journal of the China Society for Scientific and Technical Information, 2012,31(4):390-397.)
[11] 赖院根,吴雯娜. 基于叙词表的概念语义相似度计算[J]. 图书情报工作 ,2009,53(8):21-24,33.(Lai Yuangen,Wu Wenna. Research on Concept Semantic Similarity Computation Based on Thesaurus[J]. Library and Information Service, 2009,53(8):21-24,33.)
[12] 王兰成,李超. 结合两种相似度计算的主题信息检索方法研究[J]. 现代图书情报技术 ,2009(11):53-58.(Wang Lancheng, Li Chao. Research on Information Retrieval Methods Based on Two Similarity Calculation[J]. New Technology of Library and Information Service, 2009(11):53-58.)
[13] Van der Plas L, Tiedemann J. Finding Synonyms Using Automatic Word Alignment and Measures of Distributional Similarity[C]. In: Proceedings of the COLING/ACL on Main Conference Poster Sessions. 2006:866-873.
[14] Van der Plas L, Tiedemann J, Manguin J L. Automatic Acquisition of Synonyms for French Using Parallel Corpora[C]. In: Proceedings of the 4th International Workshop on Distributed Agent-Based Retrieval Tools. 2010.
[15] Meusel R, Niepert M, Eckert K, et al. Thesaurus Extension Using Web Search Engines[C].In: Proceedings of the Role of Digital Libraries in a Time of Global Change, and 12th International Conference on Asia-Pacific Digital Libraries. 2010:198-207.
[16] Muller P, Langlais P. Comparing Distributional and Mirror Translation Similarities for Extracting Synonyms[C]. In: Proceedings of the 24th Canadian Conference on Advances in Artificial Intelligence. Berlin, Heidelberg: Springer-Verlag, 2011:323-334.
[17] 陆勇,侯汉清. 基于模式匹配的汉语同义词自动识别[J]. 情报学报 ,2006,25(6):720-724.(Lu Yong, Hou Hanqing. Automatic Recognition of Chinese Synonyms Based on Pattern Matching Algorithm[J]. Journal of the China Society for Scientific and Technical Information, 2006,25(6):720-724.)
[18] 陆勇,章成志,侯汉清. 基于百科资源的多策略中文同义词自动抽取研究[J]. 中国图书馆学报 ,2010,36(1):56-62.(Lu Yong,Zhang Chengzhi,Hou Hanqing. Research on the Multi-Strategies of Chinese Synonyms Automatic Extraction from Encyclopaedia[J]. Journal of Library Science in China,2010,36(1):56-62. )
[19] 章成志.基于多层特征的字符串相似度计算模型[J]. 情报学报, 2005,24(6):696-701.(Zhang Chengzhi. A Model for Chinese String Similarity Based on Multi-Level Features[J]. Journal of the China Society for Scientific and Technical Information, 2005,24(6):696-701.)
[20] 徐硕,朱礼军,乔晓东,等.基于双序列比对的中文术语语义相似度计算的新方法[J]. 情报学报 ,2010,29(4):701-708.(Xu Shuo, Zhu Lijun, Qiao Xiaodong, et al. A Novel Approach to Chinese Terms Semantic Similarity Calculation Based on Pairwise Sequence Alignment[J]. Journal of the China Society for Scientific and Technical Information, 2010,29(4):701-708.)
[21] 常春, 赖院根. 基于文献标题词汇共现获取词间关系研究[J]. 图书情报工作 ,2009,53(8):17-20.(Chang Chun, Lai Yuangen. Research on Concept Relations Based on Keywords Co-ocurrence in Document Titles[J]. Library and Information Service, 2009,53(8):17-20. )
[22] 刘华梅,侯汉清.基于情报检索的汉语同义词识别初探[J]. 情报理论与实践 , 2005,28(4):373-375,382.(Liu Huamei,Hou Hanqing. Information Retrieval Based on the Recognition of Chinese Synonyms[J]. Information Studies:Theory & Application, 2005,28(4):373-375,382.)
[23] 吴云芳,石静,金澎. 基于图的同义词集自动获取方法[J]. 计算机研究与发展 , 2011,48(4):610-616.(Wu Yunfang, Shi Jing, Jin Peng. Graph-based Automatic Acquisition of Semantic Classes[J]. Journal of Computer Research and Development, 2011,48(4):610-616.)
[24] GB13190-1991, 汉语叙词表编制规则[S]. 北京:中国标准出版社,1991.(GB13190-1991, Chinese Thesaurus Compiling Rules[S]. Beijing: China Standard Press,1991.)
[25] 宋培彦.术语知识表示模型研究[J]. 情报理论与实践 ,2012,35(8):111-114.(Song Peiyan. Model of Terminology Knowledge Representation[J]. Information Studies: Theory & Application, 2012,35(8):111-114.)
[1] 谢靖,王敬东,吴振新,张智雄,王颖,叶志飞. 科技文献检索系统语义丰富化框架的设计与实践*[J]. 数据分析与知识发现, 2017, 1(4): 84-93.
[2] 陈果,肖璐. 网络社区中的知识元链接体系构建研究*[J]. 数据分析与知识发现, 2017, 1(11): 75-83.
[3] 丁恒,陆伟. 标准文献知识服务系统设计与实现*[J]. 现代图书情报技术, 2016, 32(7-8): 120-128.
[4] 王密平,王昊,邓三鸿,吴志祥. 基于CRFs的冶金领域中文专利术语抽取研究*[J]. 现代图书情报技术, 2016, 32(6): 28-36.
[5] 姜霖,王东波. 采用连续词袋模型(CBOW)的领域术语自动抽取研究*[J]. 现代图书情报技术, 2016, 32(2): 9-15.
[6] 刘彤,倪维健,柳梅. 面向搜索引擎查询日志的领域术语自动识别方法*[J]. 现代图书情报技术, 2016, 32(2): 25-33.
[7] 毕强, 刘健, 鲍玉来. 基于语义相似度的文本聚类研究*[J]. 数据分析与知识发现, 2016, 32(12): 9-16.
[8] 朱惠,杨建林,王昊. 中文领域专业术语层次关系构建研究*[J]. 现代图书情报技术, 2016, 32(1): 73-80.
[9] 付鸿鹄, 张智雄, 刘建华, 钱力, 王颖. 构建STKOS术语发布与共享服务平台[J]. 现代图书情报技术, 2015, 31(9): 76-81.
[10] 刘伟, 王星, 宋培彦. 同义词抽取结果的噪音清洗方法研究[J]. 现代图书情报技术, 2015, 31(6): 64-70.
[11] 刘丹军, 付鸿鹄, 文奕, 胡正银, 杨宁, 向彬, 钱力, 刘春江. 科技知识组织体系版本管理系统设计与实践应用[J]. 现代图书情报技术, 2015, 31(4): 79-86.
[12] 范雪雪, 王志荣, 徐晤, 梁银, 马小虎. 基于医学本体的术语相似度算法研究[J]. 现代图书情报技术, 2015, 31(12): 57-64.
[13] 何宇, 吕学强, 徐丽萍. 新能源汽车领域中文术语抽取方法[J]. 现代图书情报技术, 2015, 31(10): 88-94.
[14] 王颖, 张智雄, 李传席, 刘毅, 汤怡洁, 周子健, 钱力, 付鸿鹄. 科技知识组织体系开放引擎系统的设计与实现[J]. 现代图书情报技术, 2015, 31(10): 95-101.
[15] 侯婷, 吕学强, 李卓. 专利术语抽取的层次过滤方法[J]. 现代图书情报技术, 2015, 31(1): 24-30.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
版权所有 © 2015 《数据分析与知识发现》编辑部
地址:北京市海淀区中关村北四环西路33号 邮编:100190
电话/传真:(010)82626611-6626,82624938
E-mail:jishu@mail.las.ac.cn