Data Analysis and Knowledge Discovery  2020, Vol. 4 Issue (2/3): 231-238    DOI: 10.11925/infotech.2096-3467.2019.0600
Developing Modularity Scientometrics System with Distributed Technology
Shi Hongbo1,2, Guo Hongmei1, Yue Ting1,2, Qian Li1,2, Huang Dingyu1, Chang Zhijun1
1National Science Library, Chinese Academy of Sciences, Beijing 100190, China
2Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China
Abstract  

[Objective] This paper designs and develops a modular scientometrics system that meets researchers' needs, including their real-time processing tasks. [Context] Relational database systems cannot manage the vast volume of literature resources, whereas distributed technology provides highly efficient computing capacity for scientometric data. [Methods] We designed a general indicator model and a standard task workflow, then built the proposed system on ES (Elasticsearch), Redis, and a modular indicator design. [Results] The platform provides a standard workflow with which users can run scientometric tasks and receive results in almost real time. [Conclusions] Distributed technology and modular design can help build highly efficient, general-purpose scientometrics and decision-making systems.
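
The modular indicator design and queued task workflow mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The indicator names, the record fields (`year`, `citations`), the `INDICATORS` registry, and `run_task` are all hypothetical; an in-memory `deque` stands in for a Redis queue, and a plain Python list stands in for an ES index.

```python
from collections import Counter, deque
from typing import Callable, Dict, List

Record = Dict[str, int]
IndicatorFn = Callable[[List[Record]], object]

# Hypothetical registry: each indicator is an independent, pluggable module.
INDICATORS: Dict[str, IndicatorFn] = {}


def indicator(name: str):
    """Register a function as a named indicator module."""
    def register(fn: IndicatorFn) -> IndicatorFn:
        INDICATORS[name] = fn
        return fn
    return register


@indicator("paper_count")
def paper_count(records: List[Record]) -> int:
    return len(records)


@indicator("total_citations")
def total_citations(records: List[Record]) -> int:
    return sum(r["citations"] for r in records)


@indicator("papers_per_year")
def papers_per_year(records: List[Record]) -> Dict[int, int]:
    # Groups records by year, similar in spirit to a terms aggregation.
    return dict(Counter(r["year"] for r in records))


def run_task(records: List[Record], names: List[str]) -> Dict[str, object]:
    """Process the selected indicators one by one from a FIFO queue."""
    queue = deque(names)  # assumption: stand-in for a Redis-backed queue
    results: Dict[str, object] = {}
    while queue:
        name = queue.popleft()
        results[name] = INDICATORS[name](records)
    return results


# Made-up sample records, for illustration only.
SAMPLE: List[Record] = [
    {"year": 2018, "citations": 3},
    {"year": 2019, "citations": 5},
    {"year": 2019, "citations": 0},
]

if __name__ == "__main__":
    print(run_task(SAMPLE, ["paper_count", "total_citations", "papers_per_year"]))
```

Under this sketch, adding an indicator only requires registering one more function, and the task workflow itself stays unchanged.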

Key words: Distributed Technology; Modularity Analysis; Scientometrics
Received: 03 June 2019      Published: 26 April 2020
CLC Number: TP391
Corresponding Authors: Hongbo Shi     E-mail: shihb@mail.las.ac.cn

Cite this article:

Shi Hongbo, Guo Hongmei, Yue Ting, Qian Li, Huang Dingyu, Chang Zhijun. Developing Modularity Scientometrics System with Distributed Technology. Data Analysis and Knowledge Discovery, 2020, 4(2/3): 231-238.

URL:

https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/10.11925/infotech.2096-3467.2019.0600     OR     https://manu44.magtech.com.cn/Jwk_infotech_wk3/EN/Y2020/V4/I2/3/231

System Logical Architecture
Task Indicators Management Scheme
Elastic Expansion for Indicator Calculation
Different Queues for Calculation
Comparison of Calculation Process Between ES and Relational Database
Demonstration of Collection Indicator Calculation
Demonstration of the Storage and Utilization of the JSON Results
Selection of Data Source and Statistical Caliber
Selection Process of the Target and Indicators
Final Confirmation of the Task
Indicator Result and Graphical Display
Number of Indicators | Calculation Time | Average Time per Indicator
15 | 64 s | 4.3 s
Indicator Calculation Efficiency
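
The per-indicator average in the table follows from dividing the total calculation time by the number of indicators. The symbols below are introduced here for illustration only and are not the authors' notation:

```latex
\bar{t} = \frac{T_{\text{total}}}{n} = \frac{64\ \text{s}}{15} \approx 4.27\ \text{s} \approx 4.3\ \text{s}
```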