Please wait a minute...
Data Analysis and Knowledge Discovery  2023, Vol. 7 Issue (6): 50-60    DOI: 10.11925/infotech.2096-3467.2022.0542
Current Issue | Archive | Adv Search |
Identifying Academic Expertise of Researchers Based on Iceberg Model
Song Peiyan,Long Chenxiang(),Li Yiran,Ni Xuening
Management School, Tianjin Normal University, Tianjin 300382, China
Download: PDF (1139 KB)   HTML ( 13
Export: BibTeX | EndNote (RIS)      
Abstract  

[Objective] This paper aims to automatically identify the academic expertise of researchers, which improves the research project evaluation and talent assessment. [Methods] Firstly, we adopted the Iceberg Model to describe the academic expertise of researchers. The visible part of the “iceberg” reveals the researchers’ areas of expertise and specialization, which identify their core competencies and main research directions. The lower part of the “iceberg” indicates the “comparative advantages” of researchers’ expertise. Then, we used labels to represent researchers’ expertise and utilized machine learning techniques such as LDA and BERT to extract, cluster, and generate matrices of academic labels. Finally, we proposed the self-focus and the peer-relative indexes to identify the researchers’ main areas and relative position in the scientific community. [Results] Using a sample of 20 researchers, we generated 8,985 sets of label words and their weights and described researchers’ expertise at a fine-grained level. And then, the “Self-Focus Index” and the “Peer-relative Index” were calculated based on the domain-researcher matrix (40×20). We found the proposed method can accurately reflect researchers’ expertise in specific research areas and relative positions within the scientific community. [Limitations] Future work should consider incorporating the temporal factor to capture the temporal evolution characteristics of researchers’ academic expertise. [Conclusions] The advantages of the proposed method are twofold. Firstly, the iceberg model effectively explains what researchers do and how well they do it. The model provides a theoretical basis for label extraction, index design, and enhancing interpretability. Secondly, in addition to quantifiable comparative expertise index calculations, the method achieves fine-grained, precise, and dynamic talent expertise profiling.

Key wordsPortrait of Talent      Expertise Identification      Iceberg Model      Self-Focus Index      Peer-Relative Index     
Received: 28 May 2022      Published: 09 August 2023
ZTFLH:  G353  
  TP391  
Fund:National Social Science Fund of China(21BTQ061)
Corresponding Authors: Long Chenxiang,ORCID:0000-0002-1549-6542,E-mail:l1769069529@126.com。   

Cite this article:

Song Peiyan, Long Chenxiang, Li Yiran, Ni Xuening. Identifying Academic Expertise of Researchers Based on Iceberg Model. Data Analysis and Knowledge Discovery, 2023, 7(6): 50-60.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2022.0542     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2023/V7/I6/50

Iceberg Model of Personnel Competency
The Process of Identifying the Academic Expertise of Researchers
标签词 绝对权值 标签词 绝对权值 标签词 绝对权值
社会化搜索 3.128 533 信息交流 1.263 936 数据集 1.012 890
知识图谱 2.301 209 健康信息 1.232 692 信息偶遇 1.011 500
关联数据 2.146 272 元分析 1.228 293 行动研究 1.011 320
扎根理论 1.885 615 用户体验 1.221 989 图博档 1.006 950
活动理论 1.863 091 众包抄录平台 1.173 928 语义网 0.997 070
移动互联 1.773 432 信息搜索行为 1.158 969 社会技术系统理论 0.984 090
虚拟社区 1.484 011 用户生成内容 1.111 415 冲突性健康信息 0.979 500
文献计量学 1.479 637 个体认知 1.092 402 移动视觉搜索 0.977 300
古籍目录 1.440 918 IIIF 1.076 175 目录学 0.920 940
数字人文 1.397 298 用户信息行为 1.064 919 研究热点 0.405 770
智能分析工具 1.371 669 在线问答社区 1.043 680 SSCI 0.381 180
开放数据 1.368 774 文化遗产 1.027 962 文献学 0.301 700
Some Experts’ Labels & Weights
学者

领域
领域1 领域2 领域3 领域4 领域5 领域6 领域7 领域8 领域9 领域10 领域11 领域12 领域13 领域14 领域15
学者1 6.778 1.281 26.276 0.764 0.646 0.000 0.000 0.508 0.000 0.055 0.291 9.069 0.866 3.543 0.688
学者2 0.560 4.953 0.731 0.065 0.791 0.172 0.999 11.865 0.404 2.032 1.400 0.098 0.139 2.033 1.421
学者3 0.119 0.382 0.169 1.234 0.145 2.679 0.488 0.332 0.032 0.650 0.000 0.390 0.413 0.401 4.229
学者4 0.125 0.101 0.020 0.000 0.123 0.586 0.000 0.000 0.090 0.000 0.000 0.168 0.000 0.418 0.151
学者5 0.000 0.213 0.000 18.877 0.036 0.000 2.856 0.056 0.270 0.264 0.129 0.252 0.019 8.740 0.000
学者6 0.481 2.065 0.769 0.255 0.896 0.213 0.133 0.218 0.385 0.980 0.057 0.043 0.238 0.594 1.121
学者7 0.000 0.215 0.042 3.846 0.264 1.786 0.091 0.047 0.474 0.312 0.284 0.063 1.220 0.106 0.118
学者8 0.421 4.373 1.644 0.000 0.853 0.077 0.569 10.469 0.325 1.765 0.398 0.426 0.055 0.000 1.270
学者9 0.738 0.748 0.736 0.000 0.397 0.178 0.526 0.606 0.000 0.134 9.363 0.425 0.000 0.462 0.353
学者10 0.033 0.192 0.000 0.045 0.000 2.101 0.042 0.239 0.375 0.000 0.111 0.000 0.154 0.292 0.219
学者11 3.127 0.071 12.478 0.000 0.107 0.000 0.056 0.353 0.000 0.093 0.455 4.575 0.108 1.390 0.489
学者12 0.424 0.270 1.332 0.450 0.809 0.314 0.116 0.266 0.000 0.658 0.182 0.402 0.115 0.981 0.734
学者13 0.482 0.000 1.194 0.000 0.088 0.269 0.098 5.255 0.439 0.286 0.523 0.391 0.098 0.455 0.748
学者14 0.108 0.294 0.318 0.212 0.119 2.966 3.933 0.342 3.300 1.743 0.566 0.150 1.057 0.000 0.106
学者15 0.108 0.294 0.318 0.212 0.119 2.966 3.925 0.342 3.254 1.721 0.566 0.150 1.057 0.000 0.106
Domain-Expert Matrix (Partial)
来源 学者研究专长描述
学校人员库 研究领域:用户信息行为;社会化媒体;健康信息学
中国知网-学者库 科研领域:图书情报与数字图书馆;新闻与传媒;计算机软件及计算机应用
研究方向:信息政策与战略分析;网络信息资源管理
万方数据-学者知识脉络 研究兴趣:社会化媒体3;标签3;对中国的启示4;群体协作3;演化规律3;系统3;政策研究4;China4;改进4;实验3;用户体验3;研究成果3;Based4;基础6;日本5; Paper3;内容3;Information 8;借鉴3;研究现状3;特征4;国内5;互联网5;信息用户3;信息通信 4;Communication4;研究对象3;文章3;技术接受3;实证研究3;社会化标注3(注:标签词后的数字为发文指数)
AMiner 研究兴趣:信息交流;数字图书馆;社交网络分析;德尔福法;中国
本文方法 研究专长:社会化搜索×3.128 533;知识图谱×2.301 209;关联数据×2.146 272;扎根理论×1.885 615;活动理论×1.863 091;移动互联×1.773 432;虚拟社区×1.484 011;文献计量学×1.479 637;古籍目录×1.440 918;数字人文×
1.397 298;智能分析工具×1.371 669;开放数据×1.368 774;信息交流×1.263 936;健康信息×1.232 692;元分析×1.228 293;用户体验×1.221 989;众包抄录平台×1.173 928;信息搜索行为×1.158 969;用户生成内容×1.111 415;个体认知×1.092 402;IIIF×1.076 175;用户信息行为×1.064 919;在线问答社区×1.043 680;文化遗产×1.027 962;数据集×1.012 890;信息偶遇×1.011 500;行动研究×1.011 320;图博档×1.006 950;语义网×0.997 070;社会技术系统理论×0.984 090;冲突性健康信息×0.979 500;移动视觉搜索×0.977 300;目录学×0.920 940;研究热点×0.405 770;SSCI×0.381 180;文献学×0.301 700…(共639词)
Comparison of Specialty Descriptions
Self-Focus Index
Peer-Relative Index
[1] 熊回香, 叶佳鑫, 丁玲, 等. 基于改进的h指数的学者评价研究[J]. 情报学报, 2019, 38(10): 1022-1029.
[1] (Xiong Huixiang, Ye Jiaxin, Ding Ling, et al. Scholar Evaluation Research Based on an Improved h-Index[J]. Journal of the China Society for Scientific and Technical Information, 2019, 38(10): 1022-1029.)
[2] 王林, 潘陈益, 朱文静. 基于h指数、g指数和p指数的微博影响力评价对比研究[J]. 现代情报, 2018, 38(6): 11-18, 61.
doi: 10.3969/j.issn.1008-0821.2018.06.002
[2] (Wang Lin, Pan Chenyi, Zhu Wenjing. Comparative Research on the Evaluation of Microblogs’ Impact Based on h-Index, g-Index and p-Index[J]. Journal of Modern Information, 2018, 38(6): 11-18, 61.)
doi: 10.3969/j.issn.1008-0821.2018.06.002
[3] 隋桂玲. p指数和h指数学术影响力评价对比的理论和实证研究[J]. 情报杂志, 2020, 39(4): 153-160.
[3] (Sui Guiling. Theoretical and Empirical Research on the Comparison of Academic Influence Evaluation of p-Index and h-Index[J]. Journal of Intelligence, 2020, 39(4): 153-160.)
[4] 宋培彦, 程志强. 肿瘤领域专家学术影响力评价方法及其实证研究[J]. 情报工程, 2018, 4(3): 48-57.
[4] (Song Peiyan, Cheng Zhiqiang. Method of Experts Academic Influence Evaluation in the Field of Oncology: An Empirical Study[J]. Technology Intelligence Engineering, 2018, 4(3): 48-57.)
[5] 袁国华, 寇晶晶, 张建勇, 等. 基于开放同行评议的学者影响力评价研究——以F1000为例[J]. 图书情报工作, 2018, 62(13): 37-44.
doi: 10.13266/j.issn.0252-3116.2018.13.006
[5] (Yuan Guohua, Kou Jingjing, Zhang Jianyong, et al. The Research of Scholar Influence Evaluation Based on Open Peer Review: Take the F1000 as an Example[J]. Library and Information Service, 2018, 62(13): 37-44.)
doi: 10.13266/j.issn.0252-3116.2018.13.006
[6] 王炎, 魏瑞斌. 基于多数据源的专家学术网络构建研究[J]. 情报杂志, 2016, 35(12): 121-126, 138.
[6] (Wang Yan, Wei Ruibin. To Build the Expert Academic Network Based on the Multi-data Source[J]. Journal of Intelligence, 2016, 35(12): 121-126, 138.)
[7] McClelland D C. Testing for Competence Rather Than for Intelligence.[J]. American Psychologist, 1973, 28(1): 1-14.
doi: 10.1037/h0034092 pmid: 4684069
[8] 徐曾旭林, 谢靖, 于倩倩. 人才多元评价模型设计方法研究[J]. 数据分析与知识发现, 2021, 5(8): 122-131.
[8] (Xu Zengxulin, Xie Jing, Yu Qianqian. Research on Design Method of Multi-Evaluation Model for Talents[J]. Data Analysis and Knowledge Discovery, 2021, 5(8): 122-131.)
[9] 宋雪雁, 李溪萌, 邓君. 数字时代档案文献编纂人员胜任力模型研究[J]. 图书情报工作, 2020, 64(3): 32-41.
doi: 10.13266/j.issn.0252-3116.2020.03.004
[9] (Song Xueyan, Li Ximeng, Deng Jun. Research on Competency Model of Archival Document Compilers in the Digital Age[J]. Library and Information Service, 2020, 64(3): 32-41.)
doi: 10.13266/j.issn.0252-3116.2020.03.004
[10] 邹凯, 徐萍萍, 郭一航, 等. 大数据背景下高校信息管理类人才胜任力素质模型构建[J]. 情报理论与实践, 2021, 44(12): 55-64, 18.
[10] (Zou Kai, Xu Pingping, Guo Yihang, et al. Construction of Competency Model of Information Management Talents in Universities under the Background of Big Data[J]. Information Studies: Theory & Application, 2021, 44(12): 55-64, 18.)
[11] 宋新平, 李慧, 熊强, 等. 大数据下企业竞争情报人员胜任力模型研究[J]. 现代情报, 2020, 40(5): 88-95.
doi: 10.3969/j.issn.1008-0821.2020.05.011
[11] (Song Xinping, Li Hui, Xiong Qiang, et al. Research on Competency Model of Enterprise Competitive Intelligence Personnel under Big Data[J]. Journal of Modern Information, 2020, 40(5): 88-95.)
doi: 10.3969/j.issn.1008-0821.2020.05.011
[12] Hu W, Ding K, Gu L, et al. Research on the Competency Model of Chancellors in Charge of Scientific Research in Chinese Research-Oriented Universities[J]. Journal of Scientometric Research, 2014, 3(3): 104-110.
doi: 10.4103/2320-0057.153552
[13] Klendauer R, Berkovich M, Gelvin R, et al. Towards a Competency Model for Requirements Analysts[J]. Information Systems Journal, 2012, 22(6): 475-503.
doi: 10.1111/isj.2012.22.issue-6
[14] 庆海涛, 陈媛媛, 关琳, 等. 智库专家胜任力模型构建[J]. 图书馆论坛, 2016, 36(5): 34-39.
[14] (Qing Haitao, Chen Yuanyuan, Guan Lin, et al. Competency Model of Think-Tank Experts[J]. Library Tribune, 2016, 36(5): 34-39.)
[15] 万健, 罗园晶, 茆意宏. 图书馆员知识咨询胜任力模型构建[J]. 图书情报工作, 2016, 60(20): 27-35.
doi: 10.13266/j.issn.0252-3116.2016.20.004
[15] (Wan Jian, Luo Yuanjing, Mao Yihong. The Construction of the Competency Model of Librarians’ Knowledge Consultation[J]. Library and Information Service, 2016, 60(20): 27-35.)
doi: 10.13266/j.issn.0252-3116.2016.20.004
[16] 中华人民共和国国家质量监督检验检疫总局, 中国国家标准化管理委员会. 科技人才元数据元素集: GB/T 35397—2017[S]. 北京: 中国标准出版社, 2017.
[16] (General Administration of Quality Supervision, Inspection and Quarantine of the People’s Republic of China, Standardization Administration of the People’s Republic of China. Research and Development Talent Metadata Element Set: GB/T 35397—2017[S]. Beijing: Standards Press of China, 2017.)
[17] 贾君枝, 崔西燕. 人物本体词表之间的互操作及分类体系构建[J]. 情报学报, 2019, 38(7): 731-741.
[17] (Jia Junzhi, Cui Xiyan. Interoperability Between Ontological Word Lists of Persons and Construction of Classification Systems[J]. Journal of the China Society for Scientific and Technical Information, 2019, 38(7): 731-741.)
[18] 陆伟, 刘杰, 秦喜艳. 基于专长词表的图情领域专家检索与评价[J]. 中国图书馆学报, 2010, 36(2): 70-76.
[18] (Lu Wei, Liu Jie, Qin Xiyan. Expert Search and Evaluation Based on Expertise Vocabulary in the Field of Library and Information Science[J]. Journal of Library Science in China, 2010, 36(2): 70-76.)
[19] 胡月红, 刘萍. 基于本体概念的专长表示研究[J]. 图书情报工作, 2012, 56(4): 17-21, 40.
[19] (Hu Yuehong, Liu Ping. An Ontology Based Approach for Expertise Representation[J]. Library and Information Service, 2012, 56(4): 17-21, 40.)
[20] 陈翀, 李楠, 梁冰, 等. 基于成果特征的学者学术专长识别方法[J]. 图书情报工作, 2019, 63(20): 96-103.
doi: 10.13266/j.issn.0252-3116.2019.20.011
[20] (Chen Chong, Li Nan, Liang Bing, et al. Identifying Expertise Tags of Scholars by Multiple Features of Academic Publications[J]. Library and Information Service, 2019, 63(20): 96-103.)
doi: 10.13266/j.issn.0252-3116.2019.20.011
[21] 刘萍, 周梦欢. 基于共词网络的专家专长挖掘[J]. 情报科学, 2012, 30(12): 1815-1819.
[21] (Liu Ping, Zhou Menghuan. Expertise Identification Based on Co-word Network[J]. Information Science, 2012, 30(12): 1815-1819.)
[22] 刘晓豫, 朱东华, 汪雪锋, 等. 多专长专家识别方法研究——以大数据领域为例[J]. 图书情报工作, 2018, 62(3): 55-63.
doi: 10.13266/j.issn.0252-3116.2018.03.007
[22] (Liu Xiaoyu, Zhu Donghua, Wang Xuefeng, et al. Multi-expertise Researcher Identification: A Case Study of the Big Data[J]. Library and Information Service, 2018, 62(3): 55-63.)
doi: 10.13266/j.issn.0252-3116.2018.03.007
[23] 张晓娟, 陆伟, 程齐凯. PLSA在图情领域专家专长识别中的应用[J]. 现代图书情报技术, 2012(2): 76-81.
[23] (Zhang Xiaojuan, Lu Wei, Cheng Qikai. Application of PLSA on Expertise Identifying in the Field of Library and Information Science[J]. New Technology of Library and Information Service, 2012(2): 76-81.)
[24] 赵辉, 化柏林, 何鸿魏. 科技情报用户画像标签生成与推荐[J]. 情报学报, 2020, 39(11): 1214-1222.
[24] (Zhao Hui, Hua Bolin, He Hongwei. User Profile Tag Generation and Information Recommendations for Science and Tencnology Intelligence[J]. Journal of the China Society for Scientific and Technical Information, 2020, 39(11): 1214-1222.)
[25] 聂卉. 结合词向量和词图算法的用户兴趣建模研究[J]. 数据分析与知识发现, 2019, 3(12): 30-40.
[25] (Nie Hui. Modeling Users with Word Vector and Term-Graph Algorithm[J]. Data Analysis and Knowledge Discovery, 2019, 3(12): 30-40.)
[26] 夏立新, 曾杰妍, 毕崇武, 等. 基于LDA主题模型的用户兴趣层级演化研究[J]. 数据分析与知识发现, 2019, 3(7): 1-13.
[26] (Xia Lixin, Zeng Jieyan, Bi Chongwu, et al. Identifying Hierarchy Evolution of User Interests with LDA Topic Model[J]. Data Analysis and Knowledge Discovery, 2019, 3(7): 1-13.)
[27] 范晓玉, 窦永香, 赵捧未, 等. 融合多源数据的科研人员画像构建方法研究[J]. 图书情报工作, 2018, 62(15): 31-40.
doi: 10.13266/j.issn.0252-3116.2018.15.004
[27] (Fan Xiaoyu, Dou Yongxiang, Zhao Pengwei, et al. Study for the Construction Method of Scientist Profile with Multi-Source Data Fusion[J]. Library and Information Service, 2018, 62(15): 31-40.)
doi: 10.13266/j.issn.0252-3116.2018.15.004
[28] Jeong Y S, Lee S H, Gweon G. Discovery of Research Interests of Authors over Time Using a Topic Model[C]// Proceedings of the International Conference on Big Data and Smart Computing. 2016: 24-31.
[29] Kang S, Cheng N C. Internet-Based Researcher Interest Mining[C]// Proceedings of the 6th International Conference on Dependable Systems and Their Applications. 2020: 1-12.
[30] Daud A. Using Time Topic Modeling for Semantics-Based Dynamic Research Interest Finding[J]. Knowledge-Based Systems, 2012, 26: 154-163.
doi: 10.1016/j.knosys.2011.07.015
[31] Dehghan M, Biabani M, Abin A A. Temporal Expert Profiling: With an Application to T-shaped Expert Finding[J]. Information Processing & Management, 2019, 56(3): 1067-1079.
doi: 10.1016/j.ipm.2019.02.017
[32] de Campos L M, Fernández -Luna J M, Huete J F, et al. LDA-Based Term Profiles for Expert Finding in a Political Setting[J]. Journal of Intelligent Information Systems, 2021, 56(3): 529-559.
doi: 10.1007/s10844-021-00636-x
[33] Jr Spencer L M, Spencer S M. Competence at Work: Models for Superior Performance[M]. New York: Wiley, 1993.
[34] 闫淑敏, 杨小丽. 基于扎根理论的高校科研人员创新动力研究[J]. 科技管理研究, 2019, 39(1): 39-45.
[34] (Yan Shumin, Yang Xiaoli. Research on Innovation Motivation of Scientific Research Personnel in Colleges and Universities Based on Grounded Theory[J]. Science and Technology Management Research, 2019, 39(1): 39-45.)
[1] Xie Zhen, Ma Jianxia, Hu Wenjing. Mapping and Analyzing Personal Academic Trajectory from Multiple Dimensions[J]. 数据分析与知识发现, 2023, 7(2): 129-140.
[2] Zhang Zhengang, Yu Chuanming. Knowledge Graph Completion Model Based on Entity and Relation Fusion[J]. 数据分析与知识发现, 2023, 7(2): 15-25.
[3] Yuan Yue, Pang Na, Li Guangjian. Automatically Extracting Technical Indicators from U.S. Commerce Control List[J]. 数据分析与知识发现, 2023, 7(1): 35-48.
[4] Nie Weimin, Ou Shiyan. A Modified Hybrid Method to Identify Cited Spans[J]. 数据分析与知识发现, 2023, 7(1): 113-127.
[5] Cao Zhe, Guo Huilan, Wu Jiang, Hu Zhongyi. The Ideal and Reality of Metaverse: User Perception of VR Products Based on Review Mining[J]. 数据分析与知识发现, 2023, 7(1): 49-62.
[6] Hu Zhongyi,Zhang Shuoguo,Wu Jiang. Identifying Phishing Websites Based on URL Multi-Granularity Feature Fusion[J]. 数据分析与知识发现, 2022, 6(11): 103-110.
[7] Chen Wen, Chen Wei. Predicting Popularity of Emerging Topics with Multivariable LSTM and Bibliometric Indicators[J]. 数据分析与知识发现, 2022, 6(10): 35-45.
[8] Zhang Wanshu, Yao Haitao, Wang Xuefeng. Analyzing Characteristics of ESI Discipline Distribution in China, U.S. and U.K. with Sub-Disciplines and Text Contents[J]. 数据分析与知识发现, 2022, 6(9): 14-26.
[9] Zhang Han, An Xinyu, Liu Chunhe. Building Multi-Source Semantic Knowledge Graph for Drug Repositioning[J]. 数据分析与知识发现, 2022, 6(7): 87-98.
[10] Nie Hui, Wu Xiaoyan, Lin Yun. Clustering and Characterizing Depression Patients Based on Online Medical Records[J]. 数据分析与知识发现, 2022, 6(2/3): 222-232.
[11] Wang Nan, Li Hairong, Tan Shuru. Predicting Public Opinion Reversal Based on Evolution Analysis of Events and Improved KE-SMOTE Algorithm[J]. 数据分析与知识发现, 2022, 6(2/3): 396-408.
[12] Zhou Zhichao. Review of Automatic Citation Classification Based on Machine Learning[J]. 数据分析与知识发现, 2021, 5(12): 14-24.
[13] Wu Shengnan, Pu Hongjun, Tian Ruonan, Liang Wenqi, Yu Qi. Network Structure’s Impacts on Link Prediction Algorithm from Meta-Analysis Perspective[J]. 数据分析与知识发现, 2021, 5(11): 102-113.
[14] Ji Youshu, Wang Dongbo, Huang Shuiqing. Automatically Extracting Ancient Chinese Synonyms with Word Alignment——Case Study of Pre-Four-History Corpus[J]. 数据分析与知识发现, 2021, 5(11): 135-144.
[15] Wang Nan,Li Hairong,Tan Shuru. Predicting of Public Opinion Reversal with Improved SMOTE Algorithm and Ensemble Learning[J]. 数据分析与知识发现, 2021, 5(4): 37-48.
  Copyright © 2016 Data Analysis and Knowledge Discovery   Tel/Fax:(010)82626611-6626,82624938   E-mail:jishu@mail.las.ac.cn