Please wait a minute...
New Technology of Library and Information Service  2013, Vol. Issue (6): 30-35    DOI: 10.11925/infotech.1003-3513.2013.06.05
Current Issue | Archive | Adv Search |
Comparative Analysis of Centrality Indices in Extracting Concepts from Semantic Predication Network——Based on Disease Treatment Research
Zhang Han, Liu Shuangmei
Department of Medical Informatics, China Medical University, Shenyang 110001, China
Download:
Export: BibTeX | EndNote (RIS)      
Abstract  The aim of the study is to compare the validity of four node centrality indices in extracting crucial nodes from semantic predication network. Depending on Unified Medical Language System (UMLS) and SemRep, this paper first constructs a semantic predication network for biomedical literature, in which nodes represent UMLS concepts and edges semantic relations between nodes. Relying on the semantic type of the concepts and the semantic relations, schemas related to disease treatment are defined and used to extract disease treatment related predications. Then four centrality indices including degree centrality, betweenness centrality, closeness centrality and eigenvector centrality are used to extract crucial concepts related to four aspects of disease treatment (therapeutic drugs, therapeutic procedures, body location of the disease and disease comorbidities). The extracted concepts are compared to a reference standard produced by domain experts. The results show that centrality combined with semantic schema can effectively extract crucial nodes of the users interest. Among four centrality indices, degree centrality performs best (F-score is 0.72) and eigenvector centrality performs secondly best (F-score is 0.66).
Key wordsInformation extraction      Semantic predication network      Semantic schema      Node centrality     
Received: 28 April 2013      Published: 24 July 2013
:  TP391.1  

Cite this article:

Zhang Han, Liu Shuangmei. Comparative Analysis of Centrality Indices in Extracting Concepts from Semantic Predication Network——Based on Disease Treatment Research. New Technology of Library and Information Service, 2013, (6): 30-35.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.1003-3513.2013.06.05     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2013/V/I6/30

[1] 赵辉,刘怀亮,范云杰. 复杂网络理论在中文文本特征选择中的应用研究[J]. 现代图书情报技术,2012(9):23-28.(Zhao Hui, Liu Huailiang, Fan Yunjie. Study on the Application of Complex Network Theory in Chinese Text Feature Selection[J].New Technology of Library and Information Service,2012(9):23-28.)
[2] Erkan G, Radev D R. LexRank: Graph-based Lexical Centrality as Salience in Text Summarization[J]. Journal of Artificial Intelligence Research,2004,22(1):457-479.
[3] Zhang X, Cheng G,Qu Y Z. Ontology Summarization Based on RDF Sentence Graph[C].In: Proceedings of the 16th International Conference on World Wide Web. 2007:707-716.
[4] Unified Medical Language System (UMLS)[EB/OL].[2013-03-11]. http://www.nlm.nih.gov/research/umls/.
[5] Aronson A R, Lang F M. An Overview of MetaMap: Historical Perspective and Recent Advances [J]. Journal of the American Medical Informatics Association, 2010,17(3):229-236.
[6] Kilicoglu H, Fiszman M, Rodriguez A, et al. Semantic MEDLINE: A Web Application to Manage the Results of PubMed Searches[C].In: Proceedings of the 3rd International Symposium on Semantic Mining in Biomedicine. 2008:69-76.
[7] Fiszman M, Demner-Fushman D, Kilicoglu H, et al. Automatic Summarization of MEDLINE Citations for Evidence-based Medical Treatment: A Topic-oriented Evaluation[J]. Journal of Biomedical Informatics,2009,42(5):801-813.
[8] Workman E T, Hurdle J F. Dynamic Summarization of Bibliographic-based Data[J]. BMC Medical Informatics & Decision Making, 2011,11(6). doi:10.1186/1472-6947-11-6.
[9] 商玥,王鸿飞,杨志豪. 利用语义关系抽取生成生物医学文摘的算法[J]. 计算机科学与探索, 2011,5(11):1027-1035.(Shang Yue, Wang Hongfei, Yang Zhihao. Automatic Summarization Algorithm for Biomedical Literature Based on Semantic Relation Extraction[J]. Journal of Frontiers of Computer Science & Technology, 2011,5(11):1027-1035.)
[10] Zhang H, Fiszman M, Shin D, et al. Degree Centrality for Semantic Abstraction Summarization of Theraputic Studies[J]. Journal of Biomedical Informatics,2011,44(5):830-838.
[11] de Nooy W, Mrvar A, Batagelj V.Appendix 1: Getting Started with Pajek[A].//Exploratory Social Network Analysis with Pajek[M].New York:Cambridge University Press,2010.
[12] Freeman L C. Centrality in Social Networks: Conceptual Clarification[J]. Social Networks, 1979,1(3):215-239.
[13] 高小强,赵星,陶乃航. 网络中心度用于期刊引文评价的有效性研究[J]. 大学图书馆学报,2009,27(5):61-64.(Gao Xiaoqiang, Zhao Xing, Tao Naihang. Validity of Journals Citation Evaluation with Centrality Indexes of Networks[J].Journal of Academic Libraries, 2009,27(5):61-64.)
[14] McCray A T, Burgun A, Bodenreider O. Aggregating UMLS Semantic Types for Reducing Conceptual Complexity[J].Studies in Health Technology and Informatics,2001,84(1):216-220.
[1] Tan Ying, Tang Yifei. Extracting Citation Contents with Coreference Resolution[J]. 数据分析与知识发现, 2021, 5(8): 25-33.
[2] Wang Yi,Shen Zhe,Yao Yifan,Cheng Ying. Domain-Specific Event Graph Construction Methods:A Review[J]. 数据分析与知识发现, 2020, 4(10): 1-13.
[3] Tao Yue,Yu Li,Zhang Runjie. Active Learning Strategies for Extracting Phrase-Level Topics from Scientific Literature[J]. 数据分析与知识发现, 2020, 4(10): 134-143.
[4] Zhiqiang Liu,Yuncheng Du,Shuicai Shi. Extraction of Key Information in Web News Based on Improved Hidden Markov Model[J]. 数据分析与知识发现, 2019, 3(3): 120-128.
[5] Chengzhi Zhang,Zheng Li. Extracting Sentences of Research Originality from Full Text Academic Articles[J]. 数据分析与知识发现, 2019, 3(10): 12-18.
[6] Mu Dongmei,Jin Shan,Ju Yuanhong. Finding Association Between Diseases and Genes from Literature Abstracts[J]. 数据分析与知识发现, 2018, 2(8): 98-106.
[7] Chen Xiaowei,Shi Yutian. Identifying Key Nodes in Social Network with Improved PageRank Algorithm[J]. 数据分析与知识发现, 2017, 1(8): 68-75.
[8] Yufeng Duan,Sisi Huang. Information Extraction from Chinese Plant Species Diversity Description Text[J]. 现代图书情报技术, 2016, 32(1): 87-96.
[9] Liu Wei, Wang Xing, Song Peiyan. A Noise Cleaning Method for Synonym Extraction Results[J]. 现代图书情报技术, 2015, 31(6): 64-70.
[10] Jiang Chuntao. Automatic Annotation of Bibliographical References in Chinese Patent Documents[J]. 现代图书情报技术, 2015, 31(10): 81-87.
[11] Li Xiangdong, Huo Yayong, Huang Li. Study of Book Pages Automatic Identification and Bibliographic Information Extraction[J]. 现代图书情报技术, 2014, 30(4): 71-77.
[12] Liu Yajing, Wang Yanxi, Hao Dan, Zhou Jinhui. Study on the Methods of Institutional Repository Supporting Research Services[J]. 现代图书情报技术, 2014, 30(3): 1-7.
[13] Huang Xun, You Hongliang, Yu Yang. A Review of Relation Extraction[J]. 现代图书情报技术, 2013, 29(11): 30-39.
[14] He Lin, He Juan, Shen Gengyu, Yang Bo, Huang Shuiqing. An Approach to Discovery of Reference Control Gene for qRT-PCR Experiment Based on Texting Mining[J]. 现代图书情报技术, 2012, 28(7): 109-114.
[15] Gao Qiang, You Hongliang. Study on Named Entity Recognition Based on Cascaded Model for Field of Defense[J]. 现代图书情报技术, 2012, (11): 47-52.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn