Please wait a minute...
New Technology of Library and Information Service  2013, Vol. Issue (6): 30-35    DOI: 10.11925/infotech.1003-3513.2013.06.05
Current Issue | Archive | Adv Search |
Comparative Analysis of Centrality Indices in Extracting Concepts from Semantic Predication Network——Based on Disease Treatment Research
Zhang Han, Liu Shuangmei
Department of Medical Informatics, China Medical University, Shenyang 110001, China
Download: PDF(578 KB)   HTML  
Export: BibTeX | EndNote (RIS)      
Abstract  The aim of the study is to compare the validity of four node centrality indices in extracting crucial nodes from semantic predication network. Depending on Unified Medical Language System (UMLS) and SemRep, this paper first constructs a semantic predication network for biomedical literature, in which nodes represent UMLS concepts and edges semantic relations between nodes. Relying on the semantic type of the concepts and the semantic relations, schemas related to disease treatment are defined and used to extract disease treatment related predications. Then four centrality indices including degree centrality, betweenness centrality, closeness centrality and eigenvector centrality are used to extract crucial concepts related to four aspects of disease treatment (therapeutic drugs, therapeutic procedures, body location of the disease and disease comorbidities). The extracted concepts are compared to a reference standard produced by domain experts. The results show that centrality combined with semantic schema can effectively extract crucial nodes of the users interest. Among four centrality indices, degree centrality performs best (F-score is 0.72) and eigenvector centrality performs secondly best (F-score is 0.66).
Key wordsInformation extraction      Semantic predication network      Semantic schema      Node centrality     
Received: 28 April 2013      Published: 24 July 2013
:  TP391.1  

Cite this article:

Zhang Han, Liu Shuangmei. Comparative Analysis of Centrality Indices in Extracting Concepts from Semantic Predication Network——Based on Disease Treatment Research. New Technology of Library and Information Service, 2013, (6): 30-35.

URL:

http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2013.06.05     OR     http://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2013/V/I6/30

[1] 赵辉,刘怀亮,范云杰. 复杂网络理论在中文文本特征选择中的应用研究[J]. 现代图书情报技术,2012(9):23-28.(Zhao Hui, Liu Huailiang, Fan Yunjie. Study on the Application of Complex Network Theory in Chinese Text Feature Selection[J].New Technology of Library and Information Service,2012(9):23-28.)
[2] Erkan G, Radev D R. LexRank: Graph-based Lexical Centrality as Salience in Text Summarization[J]. Journal of Artificial Intelligence Research,2004,22(1):457-479.
[3] Zhang X, Cheng G,Qu Y Z. Ontology Summarization Based on RDF Sentence Graph[C].In: Proceedings of the 16th International Conference on World Wide Web. 2007:707-716.
[4] Unified Medical Language System (UMLS)[EB/OL].[2013-03-11]. http://www.nlm.nih.gov/research/umls/.
[5] Aronson A R, Lang F M. An Overview of MetaMap: Historical Perspective and Recent Advances [J]. Journal of the American Medical Informatics Association, 2010,17(3):229-236.
[6] Kilicoglu H, Fiszman M, Rodriguez A, et al. Semantic MEDLINE: A Web Application to Manage the Results of PubMed Searches[C].In: Proceedings of the 3rd International Symposium on Semantic Mining in Biomedicine. 2008:69-76.
[7] Fiszman M, Demner-Fushman D, Kilicoglu H, et al. Automatic Summarization of MEDLINE Citations for Evidence-based Medical Treatment: A Topic-oriented Evaluation[J]. Journal of Biomedical Informatics,2009,42(5):801-813.
[8] Workman E T, Hurdle J F. Dynamic Summarization of Bibliographic-based Data[J]. BMC Medical Informatics & Decision Making, 2011,11(6). doi:10.1186/1472-6947-11-6.
[9] 商玥,王鸿飞,杨志豪. 利用语义关系抽取生成生物医学文摘的算法[J]. 计算机科学与探索, 2011,5(11):1027-1035.(Shang Yue, Wang Hongfei, Yang Zhihao. Automatic Summarization Algorithm for Biomedical Literature Based on Semantic Relation Extraction[J]. Journal of Frontiers of Computer Science & Technology, 2011,5(11):1027-1035.)
[10] Zhang H, Fiszman M, Shin D, et al. Degree Centrality for Semantic Abstraction Summarization of Theraputic Studies[J]. Journal of Biomedical Informatics,2011,44(5):830-838.
[11] de Nooy W, Mrvar A, Batagelj V.Appendix 1: Getting Started with Pajek[A].//Exploratory Social Network Analysis with Pajek[M].New York:Cambridge University Press,2010.
[12] Freeman L C. Centrality in Social Networks: Conceptual Clarification[J]. Social Networks, 1979,1(3):215-239.
[13] 高小强,赵星,陶乃航. 网络中心度用于期刊引文评价的有效性研究[J]. 大学图书馆学报,2009,27(5):61-64.(Gao Xiaoqiang, Zhao Xing, Tao Naihang. Validity of Journals Citation Evaluation with Centrality Indexes of Networks[J].Journal of Academic Libraries, 2009,27(5):61-64.)
[14] McCray A T, Burgun A, Bodenreider O. Aggregating UMLS Semantic Types for Reducing Conceptual Complexity[J].Studies in Health Technology and Informatics,2001,84(1):216-220.
[1] Zhiqiang Liu,Yuncheng Du,Shuicai Shi. Extraction of Key Information in Web News Based on Improved Hidden Markov Model[J]. 数据分析与知识发现, 2019, 3(3): 120-128.
[2] Dongmei Mu,Shan Jin,Yuanhong Ju. Finding Association Between Diseases and Genes from Literature Abstracts[J]. 数据分析与知识发现, 2018, 2(8): 98-106.
[3] Xiaowei Chen,Yutian Shi. Identifying Key Nodes in Social Network with Improved PageRank Algorithm[J]. 数据分析与知识发现, 2017, 1(8): 68-75.
[4] Yufeng Duan,Sisi Huang. Information Extraction from Chinese Plant Species Diversity Description Text[J]. 现代图书情报技术, 2016, 32(1): 87-96.
[5] Liu Wei, Wang Xing, Song Peiyan. A Noise Cleaning Method for Synonym Extraction Results[J]. 现代图书情报技术, 2015, 31(6): 64-70.
[6] Jiang Chuntao. Automatic Annotation of Bibliographical References in Chinese Patent Documents[J]. 现代图书情报技术, 2015, 31(10): 81-87.
[7] Li Xiangdong, Huo Yayong, Huang Li. Study of Book Pages Automatic Identification and Bibliographic Information Extraction[J]. 现代图书情报技术, 2014, 30(4): 71-77.
[8] Liu Yajing, Wang Yanxi, Hao Dan, Zhou Jinhui. Study on the Methods of Institutional Repository Supporting Research Services[J]. 现代图书情报技术, 2014, 30(3): 1-7.
[9] Huang Xun, You Hongliang, Yu Yang. A Review of Relation Extraction[J]. 现代图书情报技术, 2013, 29(11): 30-39.
[10] He Lin, He Juan, Shen Gengyu, Yang Bo, Huang Shuiqing. An Approach to Discovery of Reference Control Gene for qRT-PCR Experiment Based on Texting Mining[J]. 现代图书情报技术, 2012, 28(7): 109-114.
[11] Gao Qiang, You Hongliang. Study on Named Entity Recognition Based on Cascaded Model for Field of Defense[J]. 现代图书情报技术, 2012, (11): 47-52.
[12] Wang Xiuyan, Cui Lei. Overview of Semantic Relations Extraction Between Biomedical Entities by Key Verbs[J]. 现代图书情报技术, 2011, 27(9): 21-27.
[13] Zhou Hong, Zhang Bei, Jiang Airong, Zhang Chengyu. Design and Implementation of Library Bibliography Information Self SMS Push Service[J]. 现代图书情报技术, 2011, 27(7/8): 127-131.
[14] Wang Zhichao, Weng Nan, Wang Yu. Research of Title Party News Identification Technology Based on Topic Sentence Similarity[J]. 现代图书情报技术, 2011, (11): 48-53.
[15] Lu Wanhui, Ma Jianxia. Research on Complex Time Information Extraction Based on CRF Model[J]. 现代图书情报技术, 2011, 27(10): 29-33.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn