drop() slow performance
I need to drop thousands of vertices by id from the graph, and .drop().iterate() takes about 1 minute per vertex.
At that rate my task will take ages to complete.
Is there a quicker way to do this, such as a bulk drop operation?
2 Replies
You can use Spark to drop them in parallel. Example: https://github.com/Citegraph/citegraph/blob/main/backend/src/main/java/io/citegraph/data/spark/loader/EdgePropertyPurger.java
This one drops a property from all edges. You could easily modify it to drop vertices instead.
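If you don't have a Spark cluster handy, the same "drop in parallel" idea can be roughly sketched with a plain Java thread pool instead (this is a stand-in, not what the linked example does). The sketch below splits the ids into batches and hands each batch to a worker. The class name `BatchDropSketch`, its methods, and the batch and thread sizes are all made up for illustration, and the "graph" is just an in-memory set so the code runs on its own. Against a real graph, the assumption is that each worker would run something like `g.V(batch.toArray()).drop().iterate()` with its own traversal source `g` and then commit. Whether this actually speeds things up depends on your storage backend.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Consumer;

// Hypothetical helper: batches ids and runs a drop action per batch in parallel.
class BatchDropSketch {

    // Split the id list into consecutive batches of at most batchSize elements.
    static <T> List<List<T>> partition(List<T> ids, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<List<T>> batches = new ArrayList<>();
        for (int i = 0; i < ids.size(); i += batchSize) {
            int end = Math.min(i + batchSize, ids.size());
            batches.add(new ArrayList<>(ids.subList(i, end)));
        }
        return batches;
    }

    // Run dropBatch once per batch on a fixed-size pool.
    // Returns the number of batches processed; fails if any batch fails.
    static <T> int dropInParallel(List<T> ids, int batchSize, int threads,
                                  Consumer<List<T>> dropBatch) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<?>> pending = new ArrayList<>();
            for (List<T> batch : partition(ids, batchSize)) {
                pending.add(pool.submit(() -> dropBatch.accept(batch)));
            }
            for (Future<?> f : pending) {
                f.get(); // wait for the batch and surface its failure, if any
            }
            return pending.size();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new RuntimeException("interrupted while dropping", e);
        } catch (ExecutionException e) {
            throw new RuntimeException("a batch failed", e.getCause());
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        // Stand-in for the graph: a concurrent set of vertex ids.
        Set<Long> fakeGraph = ConcurrentHashMap.newKeySet();
        List<Long> ids = new ArrayList<>();
        for (long i = 0; i < 1000; i++) {
            ids.add(i);
            fakeGraph.add(i);
        }

        // Assumption: with a real graph the lambda body would instead be
        //   g.V(batch.toArray()).drop().iterate();  followed by a commit
        int batches = dropInParallel(ids, 100, 4, batch -> fakeGraph.removeAll(batch));
        System.out.println(batches + " batches, " + fakeGraph.size() + " vertices left");
    }
}
```

Dropping a whole batch per traversal also saves one round trip per vertex, which may matter as much as the parallelism. Batch size and thread count here are arbitrary, so tune them for your setup.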