Constructing Knowledge Graph with Public Resumes
Shen Kejie, Huang Huanting, Hua Bolin
Department of Information Management, Peking University, Beijing 100871, China
Abstract [Objective] This paper constructs a knowledge graph from public resume data using natural language processing, providing a new tool to complement traditional data analysis. [Context] The proposed method automatically extracts professional backgrounds and job information from resumes, and then derives work-experience and colleague relationships within organizations. The visualized knowledge graph can support decisions on talent selection and on personnel appointment and removal in enterprises and institutions. [Methods] First, we used a web crawler to collect the resume data and a BERT-BiLSTM-CRF model to recognize entities. Then, we established relationships between entities by defining rules and integrating external domain knowledge. Finally, we used the Neo4j graph database to store and visualize the data. [Results] The BERT-BiLSTM-CRF model achieved an accuracy of 84.85% on the entity recognition task. The constructed knowledge graph, which covers the resumes of 561 people, 8,174 entities in 3 categories, and 20,162 relationships in 5 categories, supports multi-angle queries and data mining. [Conclusions] The proposed model explores the internal relationships among resumes and provides a novel way to analyze them. However, precise entity alignment and the establishment of relationships among institution entities remain limited.
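The rule-based relationship step described under [Methods] can be illustrated with a minimal sketch. The abstract does not spell out the actual rules or the graph schema, so everything below is an assumption made for illustration: the `Stint` record layout, the "same organization and overlapping years" colleague rule, the relationship names `WORKED_AT` and `COLLEAGUE_OF`, and the node labels `Person` and `Organization`. The entity recognition step is taken as already done, and the Cypher statements are only printed, not sent to a database.

```python
from collections import defaultdict
from itertools import combinations
from typing import NamedTuple


class Stint(NamedTuple):
    """One resume entry after entity recognition (hypothetical schema)."""
    person: str
    organization: str
    position: str
    start_year: int
    end_year: int  # inclusive


def overlaps(a: Stint, b: Stint) -> bool:
    """True when two stints share at least one calendar year."""
    return a.start_year <= b.end_year and b.start_year <= a.end_year


def derive_triples(stints):
    """Apply two illustrative rules; return sorted (head, relation, tail) triples."""
    triples = set()
    by_org = defaultdict(list)
    for s in stints:
        # Rule 1 (assumed): every stint links a person to an organization.
        triples.add((s.person, "WORKED_AT", s.organization))
        by_org[s.organization].append(s)
    for org_stints in by_org.values():
        # Rule 2 (assumed): same organization + overlapping years => colleagues.
        for a, b in combinations(org_stints, 2):
            if a.person != b.person and overlaps(a, b):
                head, tail = sorted((a.person, b.person))
                triples.add((head, "COLLEAGUE_OF", tail))
    return sorted(triples)


def to_cypher(triple):
    """Render one triple as a Cypher MERGE statement (labels are assumptions)."""
    head, relation, tail = triple
    tail_label = "Organization" if relation == "WORKED_AT" else "Person"
    return (
        f'MERGE (h:Person {{name: "{head}"}}) '
        f'MERGE (t:{tail_label} {{name: "{tail}"}}) '
        f'MERGE (h)-[:{relation}]->(t)'
    )


# Placeholder data, not taken from the paper's dataset.
stints = [
    Stint("Person A", "Bureau X", "Director", 2010, 2014),
    Stint("Person B", "Bureau X", "Deputy Director", 2013, 2016),
    Stint("Person C", "Bureau X", "Director", 2017, 2019),
]
triples = derive_triples(stints)
for t in triples:
    print(to_cypher(t))
```

In this toy input, Person A and Person B overlap at Bureau X in 2013 and 2014, so they are linked as colleagues, while Person C joins later and is not. For real data, passing names as query parameters through a Neo4j driver would be safer than interpolating them into the statement text.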
Received: 11 February 2021
Published: 11 August 2021
Fund: National Social Science Fund of China (17BTQ066)
Corresponding Authors:
Hua Bolin,ORCID:0000-0001-9248-6455
E-mail: huabolin@pku.edu.cn