New Technology of Library and Information Service, 2012, Vol. 28, Issue 5: 48-54. DOI: 10.11925/infotech.1003-3513.2012.05.07
Research on Example-based Machine Translation for Oracle Bone Inscriptions
Yuan Dong¹, Xiong Jing², Liu Yongge²
1. Department of Computer Science and Technology, Ocean University of China, Qingdao 266100, China;
2. School of Computer and Information Engineering, Anyang Normal University, Anyang 455002, China
Abstract  This paper presents an example-based machine translation (EBMT) method for oracle bone inscriptions (OBI). It proposes the key techniques for building the example library, a similarity algorithm for example sentences, and an example retrieval algorithm. A machine translation system was developed to verify the proposed method. Experimental results show that the target sentences it produces can meet the needs of OBI researchers.
Key words: Oracle bone inscriptions; EBMT; Bilingual parallel corpus; Sentence similarity
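The abstract names a sentence similarity algorithm and an example retrieval algorithm but does not specify them. As a rough illustration of how the retrieval step of an EBMT system can work in general, the sketch below scores a segmented query sentence against every source sentence in a small example library and returns the best match above a threshold. The Dice coefficient, the threshold of 0.5, the function names, and the placeholder tokens (`A`, `B`, ...) are all assumptions made for this example; they are not the authors' method or data.

```python
from typing import List, Optional, Tuple

# One library entry: (segmented source sentence, target-language translation).
Example = Tuple[List[str], str]

# Hypothetical example library; tokens and translations are placeholders.
LIBRARY: List[Example] = [
    (["A", "B", "C"], "translation-1"),
    (["A", "D"], "translation-2"),
]


def dice_similarity(a: List[str], b: List[str]) -> float:
    """Dice coefficient over the sets of word tokens (an assumed measure)."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 1.0  # two empty sentences are treated as identical
    return 2 * len(set_a & set_b) / (len(set_a) + len(set_b))


def retrieve(
    query: List[str],
    library: List[Example],
    threshold: float = 0.5,
) -> Optional[Tuple[Example, float]]:
    """Return the most similar example and its score, or None if no example
    reaches the (assumed) similarity threshold."""
    if not library:
        return None
    # Score every example against the query and keep the highest-scoring one.
    scored = [(dice_similarity(query, src), (src, tgt)) for src, tgt in library]
    score, example = max(scored, key=lambda pair: pair[0])
    return (example, score) if score >= threshold else None
```

Calling `retrieve(["A", "B", "E"], LIBRARY)` selects the first entry, since it shares two of three tokens with the query. A real system would then adapt the retrieved target sentence to the parts of the query that differ, which this sketch leaves out.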
Received: 13 March 2012      Published: 24 July 2012
CLC number: TP391.2

Cite this article:

Yuan Dong, Xiong Jing, Liu Yongge. Research on Example-based Machine Translation for Oracle Bone Inscriptions. New Technology of Library and Information Service, 2012, 28(5): 48-54.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2012.05.07     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2012/V28/I5/48
