|
|
Graph Databases for Complex Network Analysis |
Liu Chunjiang1,2(),Li Shuying1,Hu Hanlin3,Fang Shu1,2 |
1Chengdu Library and Information Center, Chinese Academy of Sciences, Chengdu 610041, China 2Department of Library, Information and Archives Management, School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100190, China 3School of Public Administration, Sichuan School of Economics and Management, University, Chengdu 610065, China |
|
|
Abstract [Objective] This paper systematically reviews the progress and trends of graph database research and applications for complex network analysis. [Coverage] We searched the Web of Science, Scopus, and CNKI database for Chinese and English literature. A total of 15 graph databases and open-source packages, 21 practical cases, and 14 research papers were retrieved. [Methods] First, we compared the mainstream graph database products from China and abroad. Then, we explored the latest solutions for complex network analysis, including algorithms (such as centrality, path finding, link prediction, and community detection), graph visualization, performance and related applications. [Results] The graph database has become an important analysis tool and research method for complex network analysis and big data mining. They also work closely with graph computing engines for complex network analysis. [Limitations] This paper only examined a few representative cases. [Conclusions] The graph database could effectively query, represent and analyze complex network data for their patterns or structures. Their presentation of multi-dimensional data is crucial for mining implicit relationships.
|
Received: 14 October 2021
Published: 31 December 2021
|
|
Fund:National Social Science Fund of China(19BTQ088);2020 Western Light Talent Project of the Chinese Academy of Sciences(E1C0000201) |
Corresponding Authors:
Liu Chunjiang,ORCID:0000-0001-8934-339X
E-mail: liucj@clas.ac.cn
|
[1] |
Alhussien I, Cambria E, Zhang N S. Semantically Enhanced Models for Commonsense Knowledge Acquisition[C]// Proceedings of the 2018 IEEE International Conference on Data Mining Workshops. IEEE, 2018: 1014-1021.
|
[2] |
Paulheim H. Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods[J]. Semantic Web, 2016, 8: 489-508.
doi: 10.3233/SW-160218
|
[3] |
沈志宏, 赵子豪, 王海波. 以图为中心的新型大数据技术栈研究[J]. 数据分析与知识发现, 2020, 4(7): 50-65.
|
[3] |
( Shen Zhihong, Zhao Zihao, Wang Haibo. Big Data Technology Stack Shifting: From SQL Centric to Graph Centric[J]. Data Analysis and Knowledge Discovery, 2020, 4(7): 50-65.)
|
[4] |
DBMS Popularity Broken Down by Database Model[EB/OL]. [2021-09-02]. https://db-engines.com/en/ranking_categories.
|
[5] |
Šestak M, Heričko M, Družovec T W, et al. Applying K-Vertex Cardinality Constraints on a Neo4j Graph Database[J]. Future Generation Computer Systems, 2021, 115: 459-474.
doi: 10.1016/j.future.2020.09.036
|
[6] |
Gutfraind A, Genkin M. A Graph Database Framework for Covert Network Analysis: An Application to the Islamic State Network in Europe[J]. Social Networks, 2017, 51: 178-188.
doi: 10.1016/j.socnet.2016.10.004
|
[7] |
Breiger R L, Schoon E, Melamed D, et al. Comparative Configurational Analysis as a Two-Mode Network Problem: A Study of Terrorist Group Engagement in the Drug Trade[J]. Social Networks, 2014, 36: 23-39.
doi: 10.1016/j.socnet.2013.04.002
|
[8] |
Chu Z, Yu J, Hamdulla A. A Novel Deep Learning Method for Query Task Execution Time Prediction in Graph Database[J]. Future Generation Computer Systems, 2020, 112: 534-548.
doi: 10.1016/j.future.2020.06.006
|
[9] |
Gutfraind A, Genkin M. A Graph Database Framework for Covert Network Analysis: An Application to the Islamic State Network in Europe[J]. Social Networks, 2017, 51: 178-188.
doi: 10.1016/j.socnet.2016.10.004
|
[10] |
图数据库GDB-帮助中心-阿里云[EB/OL]. [2021-10-10]. https://help.aliyun.com/product/102714.html.
|
[10] |
(Graph Database GDB-Help Center-Alibaba Cloud[EB/OL]. [2021-10-10]. https://help.aliyun.com/product/102714.html. )
|
[11] |
HugeGraph[EB/OL]. [2021-10-10]. https://hugegraph.github.io/hugegraph-doc/.
|
[12] |
图数据库BGraph-百度AI开放平台[EB/OL]. [2021-10-10]. https://ai.baidu.com/tech/kg/bgraph.
|
[12] |
(Graph Database BGraph-Baidu AI Open Platform[EB/OL]. [2021-10-10]. https://ai.baidu.com/tech/kg/bgraph. )
|
[13] |
图数据库Graph Database | 腾讯云[EB/OL]. [2021-10-10]. https://intl.cloud.tencent.com/zh/products/konisgraph?lang=zh.
|
[13] |
(Graph Database | Tencent Cloud[EB/OL]. [2021-10-10]. https://intl.cloud.tencent.com/zh/products/konisgraph?lang=zh. )
|
[14] |
蚂蚁金服金融科技-分布式图数据库[EB/OL]. [2021-10-10]. https://tech.antfin.com/products/GEABASE.
|
[14] |
(Ant Financial Technology-Distributed Graph Database[EB/OL]. [2021-10-10]. https://tech.antfin.com/products/GEABASE. )
|
[15] |
Nebula Graph开源分布式图数据库[EB/OL]. [2021-10-10]. https://www.nebula-graph.com.cn/.
|
[15] |
(Nebula Graph Open Source Distributed Graph Database[EB/OL]. [2021-10-10]. https://www.nebula-graph.com.cn/. )
|
[16] |
费马TuGraph图数据库[EB/OL]. [2021-10-10]. https://fma-ai.cn/product/.
|
[16] |
(Feima TuGraph Graph Database[EB/OL]. [2021-10-10]. https://fma-ai.cn/product/. )
|
[17] |
Native Graph Database|Neo4j Graph Database Platform[EB/OL]. [2021-10-10]. https://neo4j.com/product/neo4j-graph-database/.
|
[18] |
Graph Database|Graph Analytics Platform | TigerGraph[EB/OL]. [2021-10-10]. https://www.tigergraph.com/.
|
[19] |
Getting Started with Graph Databases-Amazon Neptune[EB/OL]. [2021-10-10]. https://docs.aws.amazon.com/neptune/latest/userguide/graph-get-started.html.
|
[20] |
JanusGraph[EB/OL]. [2021-10-10]. https://janusgraph.org.
|
[21] |
ArangoDB, the Multi-Model Database for Graph and Beyond[EB/OL]. [2021-10-10]. https://www.arangodb.com/.
|
[22] |
AllegroGraph[EB/OL]. [2021-10-10]. https://allegrograph.com/.
|
[23] |
Home | OrientDB Community Edition[EB/OL]. [2021-10-10]. https://orientdb.org/.
|
[24] |
覃悦. 基于中心性的算法在复杂网络分析中的应用及对比研究——以文本网络为例[D]. 天津: 天津财经大学, 2020.
|
[24] |
( Qin Yue. Application and Contrast Research of Centrality-Based Algorithms in Complex Network Analysis—Take Text Network as an Example[D]. Tianjin: Tianjin University of Finance & Economics, 2020.)
|
[25] |
Needham M, Hodler A E. Graph Algorithms: Practical Examples in Apache Spark and Neo4j[M]. O’Reilly Media, 2019.
|
[26] |
Lü L Y, Zhou T. Link Prediction in Complex Networks: A Survey[J]. Physica A: Statistical Mechanics and Its Applications, 2011, 390(6): 1150-1170.
doi: 10.1016/j.physa.2010.11.027
|
[27] |
赵卫绩, 张凤斌, 刘井莲. 复杂网络社区发现研究进展[J]. 计算机科学, 2020, 47(2): 10-20.
|
[27] |
( Zhao Weiji, Zhang Fengbin, Liu Jinglian. Review on Community Detection in Complex Networks[J]. Computer Science, 2020, 47(2): 10-20.)
|
[28] |
Shneiderman B, Aris A. Network Visualization by Semantic Substrates[J]. IEEE Transactions on Visualization and Computer Graphics, 2006, 12(5): 733-740.
pmid: 17080794
|
[29] |
周晓楠, 黄磊, 王飞跃, 等. 图数据库在识别重大疾病保险团伙式欺诈中的应用研究[J]. 保险研究, 2020(9): 92-104.
|
[29] |
( Zhou Xiaonan, Huang Lei, Wang Feiyue, et al. On the Application of Graph Database on Identifying Critical Illness Insurance Group Fraud[J]. Insurance Studies, 2020(9): 92-104.)
|
[30] |
王猛. 面向科技与能力网络的关联分析系统[D]. 大连: 大连理工大学, 2020.
|
[30] |
( Wang Meng. Association Analysis System for Technology and Ability Network[D]. Dalian: Dalian University of Technology, 2020.)
|
[31] |
Kolomeets M, Chechulin A, Kotenko I. Social Networks Analysis by Graph Algorithms on the Example of the VKontakte Social Network[J]. Journal of Wireless Mobile Networks, Ubiquitous Computing, and Dependable Applications, 2019, 10: 55-75.
|
[32] |
Giordani I, Archetti F, Candelieri A, et al. Graph Data Base: An Enabling Technology for Drug Prescription Patterns Analysis[J]. Statistica Applicata-Italian Journal of Applied Statistics, 2020 (2): 181-192.
|
[33] |
Henderson R. Using Graph Databases to Detect Financial Fraud[J]. Computer Fraud & Security, 2020, 2020(7): 6-10.
|
[34] |
王一苇, 胡林麟, 罗程, 等. 集成图数据库算法和模式匹配的城市高压电网转供电方案生成[J]. 电测与仪表, 2022, 59(4): 169-177.
|
[34] |
( Wang Yiwei, Hu Linlin, Luo Cheng, et al. Generation of Urban High-Voltage Power Grid Transfer Power Supply Scheme Based on Integrated Graph Database Algorithm and Pattern Matching[J]. Electrical Measurement & Instrumentation, 2022, 59(4): 169-177.)
|
[35] |
范国栋. 模糊决策与图数据库在服务组合中的研究与应用[D]. 淄博: 山东理工大学, 2020.
|
[35] |
( Fan Guodong. Research and Application of Fuzzy Decision and Graph Database in Service Composition[D]. Zibo: Shandong University of Technology, 2020.)
|
[36] |
Permana S D H, Bintoro K B Y, Arifitama B, et al. Comparative Analysis of Pathfinding Algorithms A*, Dijkstra, and BFS on Maze Runner Game[J]. International Journal of Information Systems & Technology, 2018, 1(2): 1-8.
|
[37] |
Lysenko A, Roznovăţ I A, Saqi M, et al. Representing and Querying Disease Networks Using Graph Databases[J]. BioData Mining, 2016, 9: 23.
doi: 10.1186/s13040-016-0102-8
pmid: 27462371
|
[38] |
郭坤铭. 基于异构网络的关系推理与预测方法研究[D]. 太原: 太原理工大学, 2017.
|
[38] |
( Guo Kunming. Research on Relation Inference and Prediction in Heterogeneous Network[D]. Taiyuan: Taiyuan University of Technology, 2017.)
|
[39] |
胡正银, 刘蕾蕾, 代冰, 等. 基于领域知识图谱的生命医学学科知识发现探析[J]. 数据分析与知识发现, 2020, 4(11): 1-14.
|
[39] |
( Hu Zhengyin, Liu Leilei, Dai Bing, et al. Discovering Subject Knowledge in Life and Medical Sciences with Knowledge Graph[J]. Data Analysis and Knowledge Discovery, 2020, 4(11): 1-14.)
|
[40] |
Lim M, Abdullah A, Jhanjhi N. Performance Optimization of Criminal Network Hidden Link Prediction Model with Deep Reinforcement Learning[J]. Journal of King Saud University-Computer and Information Sciences, 2021, 33(10): 1202-1210.
doi: 10.1016/j.jksuci.2019.07.010
|
[41] |
Franzoni V, Lepri M, Milani A. Topological and Semantic Graph-Based Author Disambiguation on DBLP Data in Neo4j[OL]. arXiv Preprint, arXiv: 1901.08977.
|
[42] |
施朝浩. 基于图特征的欺诈检测方法研究与应用[D]. 杭州: 浙江大学, 2019.
|
[42] |
( Shi Chaohao. Research and Application of Fraud Detection Method Based on Graph Features[D]. Hangzhou: Zhejiang University, 2019.)
|
[43] |
杜伟静, 李翀, 王宇宸, 等. Web of Science科研社区挖掘算法研究[J]. 小型微型计算机系统, 2020, 41(12): 2465-2469.
|
[43] |
( Du Weijing, Li Chong, Wang Yuchen, et al. Research on Web of Science Academic Community Mining Algorithm[J]. Journal of Chinese Computer Systems, 2020, 41(12): 2465-2469.)
|
[44] |
Drakopoulos G, Gourgaris P, Kanavos A. Graph Communities in Neo4j[J]. Evolving Systems, 2020, 11(3): 397-407.
doi: 10.1007/s12530-018-9244-x
|
[45] |
Rashmi R, Champawat S, Teja G V, et al. Analysis of Road Networks Using the Louvain Community Detection Algorithm[A]//Soft Computing for Problem Solving[M]. Springer, 2020: 749-757.
|
[46] |
任磊, 杜一, 马帅, 等. 大数据可视分析综述[J]. 软件学报, 2014, 25(9): 1909-1936.
|
[46] |
( Ren Lei, Du Yi, Ma Shuai, et al. Visual Analytics Towards Big Data[J]. Journal of Software, 2014, 25(9): 1909-1936.)
|
[47] |
马雨萌, 王昉, 黄金霞, 等. 基于文献知识抽取的专题知识库构建研究——以中药活血化瘀专题知识库为例[J]. 情报学报, 2019, 38(5): 482-491.
|
[47] |
( Ma Yumeng, Wang Fang, Huang Jinxia, et al. Research on Construction of a Subject Knowledge Base Based on Literature Knowledge Extraction: Using the Knowledge Base of Activating Blood Circulation and Removing Stasis as the Object[J]. Journal of the China Society for Scientific and Technical Information, 2019, 38(5): 482-491.)
|
[48] |
马欢欢. 基于电子病历的癫痫医学知识图谱构建的研究[D]. 曲阜: 曲阜师范大学, 2020.
|
[48] |
( Ma Huanhuan. Research on the Construction of Epilepsy Medical Knowledge Graph Based on Electronic Medical Records[D]. Qufu: Qufu Normal University, 2020.)
|
[49] |
Messina A, Fiannaca A la Paglia L, et al. BioGraph: A Web Application and a Graph Database for Querying and Analyzing Bioinformatics Resources[J]. BMC Systems Biology, 2018, 12(S5): 75-89.
doi: 10.1186/s12918-018-0597-3
|
[50] |
Allen D, Hodler A, Hunger M, et al. Understanding Trolls with Efficient Analytics of Large Graphs in Neo4j[C]// Proceedings of the 2019 Datenbanksysteme für Business, Technologie und Web, 2019: 377-396.
|
|
Viewed |
|
|
|
Full text
|
|
|
|
|
Abstract
|
|
|
|
|
Cited |
|
|
|
|
|
Shared |
|
|
|
|
|
Discussed |
|
|
|
|