Abstract:This paper presents a method for identifying candidate reference control gene based on text mining from PubMed database. It integrates several approaches such as pattern matching, subject recognition and information extraction to find candidate gene and its experiment environment for biology domain specialists. Experiment results show that the method not only has good performance on mining of candidate reference control gene and its environments, but also saves much time and reduces cost.
何琳, 何娟, 沈耕宇, 杨波, 黄水清. 一种通过文本挖掘发现实时定量聚合酶链式反应实验内参基因的方法研究[J]. 现代图书情报技术, 2012, 28(7): 109-114.
He Lin, He Juan, Shen Gengyu, Yang Bo, Huang Shuiqing. An Approach to Discovery of Reference Control Gene for qRT-PCR Experiment Based on Texting Mining. New Technology of Library and Information Service, 2012, 28(7): 109-114.
[1] Czechowski T, Stitt M, Altmann T, et al. Genome-Wide Identification and Testing of Superior Reference Genes for Transcript Normalization in Arabidopsis[J]. Plant Physiology, 2005, 139(1):5-17.[2] Libault M, Thibivilliers S, Bilgin D D, et al. Identification of Four Soybean Reference Genes for Gene Expression Normalization [J]. The Plant Genome, 2008, 1(1):44-54.[3] 胡瑞波,范成明,傅永福. 植物实时荧光定量PCR内参的选择[J]. 中国农业科技导报 , 2009, 11(6):30-36. (Hu Ruibo, Fan Chengming, Fu Yongfu. Reference Gene Selection in Plant Real-time Quantitative Reverse Transcription PCR(qRT-PCR)[J]. Journal of Agricultural Science and Technology, 2009, 11(6):30-36.)[4] Faccioli P, Ciceri G P, Provero P, et al. A Combined Strategy of “in Silico” Transcriptome Analysis and Web Search Engine Optimization Allows an Agile Identification of Reference Genes Suitable for Normalization in Gene Expression Studies[J]. Plant Molecular Biology, 2007, 63(5):679-688.[5] Coker J S, Davis E. Selection of Candidate Housekeeping Controls in Tomato Plants Using EST Data[J]. BioTechniques, 2003, 35(4):740-748.[6] 丁效,宋凡,秦兵,等. 音乐领域典型事件抽取方法研究[J]. 中文信息学报 , 2011, 25(2):15-20. (Ding Xiao, Song Fan, Qin Bing, et al. Research on Typical Event Extraction Method in the Field of Music[J]. Journal of Chinese Information Processing, 2011, 25(2):15-20.)[7] 许旭阳,李弼程,张先飞,等. 基于事件实例驱动的新闻文本事件抽取[J]. 计算机科学 , 2011,38(8):232-235. (Xu Xuyang, Li Bicheng, Zhang Xianfei, et al. News Text Event Extraction Driven by Event Sample[J]. Computer Science, 2011, 38(8):232-235.)[8] 郑家恒, 菅小艳. 农作物信息抽取系统的设计与实现[J]. 计算机工程 , 2006, 32(7):197-198. (Zheng Jiaheng, Jian Xiaoyan. Design and Realization of the System of Farm Crop Information Extraction[J]. Computer Engineering, 2006, 32(7):197-198.)[9] 高文利. 基于本体的军备情报抽取系统的设计与实现[J]. 现代图书情报技术 , 2010(1):83-87. (Gao Wenli. The System of Arms Information Extraction Based on Ontology[J]. New Technology of Library and Information Service, 2010(1):83-87.)[10] The Stanford Parser: A Statistical Parser[EB/OL].[2011-12-18].http://nlp.stanford.edu/software/lex-parser.shtml.[11] Ashburner M, Ball C A, Blake J A, et al. Gene Ontology: Tool for the Unification of Biology. The Gene Ontology Consortium [J]. Nature Genetics, 2000, 25(1):25-29.[12] Morris J, Hirst G. Lexical Cohesion Computed by Thesaural Relations as an Indicator of the Structure of Text [J]. Computational Linguistics, 1991, 17 (1):21-48.[13] Jain M, Nijhawan A, Tyagi A K, et al. Validation of Housekeeping Genes as Internal Control for Studying Gene Expression in Rice by Quantitative Real-time PCR[J]. Biochemical and Biophysical Research Communications, 2006,345(2):646-651.