Importing Relationships into a Running Graph Database Using Parallel Processing
May 05, 2020 · Entered Twilight · arXiv.org
"Last commit was 5.0 years ago (≥5 year threshold)"
Evidence collected by the PWNC Scanner
Repo contents: BinSet.java, GroupIterator.java, IterateRelationship.java, LICENSE, README.md, erdosRenyi.jar, insert_nodes_rels.cql, iterateRelationship.md, iterateRelationship_latest.jar
Authors
Joshua Porter, Aleks Ontman
arXiv ID
2005.04093
Category
cs.DC: Distributed Computing
Cross-listed
cs.PF
Citations
1
Venue
arXiv.org
Repository
https://github.com/Lnofeisone/graph-iterateRelationship
★ 3
Last Checked
2 months ago
Abstract
Importing relationships into a running graph database using multiple threads running concurrently is a difficult task, as multiple threads cannot write information to the same node at the same time. Here we present an algorithm in which relationships are sorted into bins, then imported such that no two threads ever access the same node concurrently. When this algorithm was implemented as a procedure to run on the Neo4j graph database, it reduced the time to import relationships by up to 69% when 32 threads were used.
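The abstract describes the binning idea only at a high level, so the following is a minimal sketch of one way it could work, not the authors' implementation (their code lives in the repository files listed above). It assumes a hypothetical grouping rule (node id modulo a group count), bins each relationship by the unordered pair of groups its two endpoints fall into, and greedily packs bins into rounds so that bins in the same round touch disjoint node groups. Each round is then run on a thread pool. The class name `BinnedImportSketch`, the method names, and the `writer` callback (a stand-in for the actual database write) are all illustrative assumptions.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Consumer;

class BinnedImportSketch {

    /** Hypothetical grouping rule: a node belongs to group (id mod groups). */
    static int groupOf(long nodeId, int groups) {
        return (int) Math.floorMod(nodeId, (long) groups);
    }

    /** Sort relationships {startId, endId} into bins keyed by the unordered pair of node groups. */
    static Map<Integer, List<long[]>> bin(List<long[]> rels, int groups) {
        Map<Integer, List<long[]>> bins = new TreeMap<>();
        for (long[] r : rels) {
            int a = groupOf(r[0], groups);
            int b = groupOf(r[1], groups);
            int key = Math.min(a, b) * groups + Math.max(a, b);
            bins.computeIfAbsent(key, k -> new ArrayList<>()).add(r);
        }
        return bins;
    }

    /** Greedily pack bins into rounds so that bins in one round touch disjoint node groups. */
    static List<List<List<long[]>>> schedule(Map<Integer, List<long[]>> bins, int groups) {
        List<List<List<long[]>>> rounds = new ArrayList<>();
        List<Set<Integer>> used = new ArrayList<>(); // node groups already claimed in each round
        for (Map.Entry<Integer, List<long[]>> e : bins.entrySet()) {
            int lo = e.getKey() / groups;
            int hi = e.getKey() % groups;
            int i = 0;
            while (i < rounds.size() && (used.get(i).contains(lo) || used.get(i).contains(hi))) {
                i++;
            }
            if (i == rounds.size()) {
                rounds.add(new ArrayList<>());
                used.add(new HashSet<>());
            }
            rounds.get(i).add(e.getValue());
            used.get(i).add(lo);
            used.get(i).add(hi);
        }
        return rounds;
    }

    /** Run the rounds one after another; within a round, each bin is one task on the pool. */
    static void importAll(List<List<List<long[]>>> rounds, int threads, Consumer<long[]> writer)
            throws InterruptedException, ExecutionException {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            for (List<List<long[]>> round : rounds) {
                List<Callable<Void>> tasks = new ArrayList<>();
                for (List<long[]> binRels : round) {
                    tasks.add(() -> { binRels.forEach(writer); return null; });
                }
                // invokeAll blocks until every bin in this round has finished;
                // get() rethrows any failure from a task.
                for (Future<Void> f : pool.invokeAll(tasks)) {
                    f.get();
                }
            }
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        List<long[]> rels = Arrays.asList(
                new long[]{0, 1}, new long[]{2, 3}, new long[]{1, 2}, new long[]{4, 5});
        int groups = 4;
        List<List<List<long[]>>> rounds = schedule(bin(rels, groups), groups);
        // The writer here only prints; a real one would create the relationship in the database.
        importAll(rounds, 2, r -> System.out.println(r[0] + " -> " + r[1]));
    }
}
```

Because every node maps to exactly one group and no group appears twice within a round, two tasks running at the same time never write to the same node. The greedy packing is chosen for brevity and does not try to balance bin sizes or minimise the number of rounds.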
Similar Papers
In the same crypt: Distributed Computing
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
Hyperledger Fabric: A Distributed Operating System for Permissioned Blockchains
Reproducing GW150914: the first observation of gravitational waves from a binary black hole merger
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems