JanusGraph

J

JanusGraph

JanusGraph - Distributed, open source, massively scalable graph database.

Join

Check status of StandardScannerExecutor job

How can I check the status of the below job?
No description

Problem with mixed index query : All property & hasId

Hello, i have a problem during a query using mixed index in order to call ElasticSearch "all" property. Everything works fine when i only use "has(label, key, value)" alone: index is called, but when i used "has(label, key, value).hasId(within(ids))" GraphCentricQuery is correctly built but it doesn't retrieve my vertex. Any idea why this doesn't work ? (I use or step in order to use JanusGraphStepStrategy to fold has container) thanks you for your help...
No description

Impact of ID Pool Initialisation on Query Performance

Greetings everyone, We're currently operating a JanusGraph setup with cluster.max-partitions set at 1024 and ids.num-partition at 10. Our primary goal is to ensure high availability for the cluster instances. However, we've noticed that the initialization of the ID pool is causing disruptions, during server restarts. The root cause seems to be the initialisation of ID pool threads for each partition until every partition has a ID pool. Upon server restart, the ID pool is initialised based on write operations. Unfortunately, this process has been negatively impacting the performance of query execution. To mitigate this challenge, we're exploring the possibility of implementing an eager initialisation approach for the ID pool threads. Is there a way to achieve it?...

JanusGraph fails on initial startup (cql-es)

Hi all, I have a problem that's been popping up once in a while regarding only the initial startup of Cassandra/ES/JanusGraph. Sometimes it fails with "table edgestore doesn't exist", or "table janusgraph_ids doesn't exist". However, if I restart the JG pod, it completes. To me, it sounds like some sort of race condition....

Exhaust all paths between two or more vertices

Is there some way to go on a path finding mission with repeat().until() type construct that avoids the early stop observed while the until condition is first met? That is, without skipping .until() for .times().

Standard way of restructuring of existing data

Hi folks we have usecase where we would like to restructure data present in a graph, for example we have existing graph with structure like this: <Node: User A> --<Edge--Pays>--<Node: Payment 1>--<Edge--Receives>--<Node: User B>, and using this existing data, we would like to add an direct edge from User A to User B for faster queries. Is there a standard way to achieve this?

Find all paths between two vertices

Not sure this makes any sense, but here goes. Could someone please provide a solution to how one would avoid dead-end paths to be traversed, i.e., how can one avoid multistep traversals following .bothE().otherV() to return to self? Here's my non-working query `g.V().has('lbl', 'DexpiElement').has('tagname', 'tagA') ...

Is there a way to access the management from gremlin remote console?

mostly i need to be able to create vertex and edge labels dynamically from gremlin, is it posible?

What configuration i can use for the root.schema.default?

i know it's supposed to use "None" so it will require a schema, but in my use case i need a schema maker, i cant find a single example with the schema maker turned on.

JanusGraph Instance startup failure due to id block allocation

Hi Everyone, I'm trying to startup janusgraph instance after setting up the properties for authentication db but the server is terminating with an error Could not acquire new ID block from storage. Please see the full stack trace in the image attached, not sure why this is happening I'm trying to spawn up a single instance (There are few other instances which are already running)
No description

Storing large temporal data in janusgraph

Hi folks, we have a requirement to store around 100M events per day for last 1 year, mainly temporal data. What are the best practices we can follow to avoid supernodes, minimum impact on query performance.

Data Storage wit TTL

Hi Everyone, We have a requirement in which we have to store around ~80 million records daily in our graph storage and we should have a TTL of 90 (~7 Billion) days for this data but the issue we are having is that we can have TTL on static vertex only and we don't want to do that as that restrict us from further updates on that vertex (correct me if I'm wrong). Please suggest some way in which we can store this data so that we can have TTL also. We are using bigtable backend, so will it be possible for us if we directly can have gc policy(90 days) on Bigtable column families?...

ERROR org.apache.tinkerpop.gremlin.server.util.ServerGremlinExecutor - Could not invoke constructor

Could not invoke constructor on class org.janusgraph.graphdb.management.JanusGraphManager (defined by the 'graphManager' setting) with one argument of class Settings.

Multiple Graphs with Embedded JanusGraph

Hello!! I am using embedded Janusgraph (embedded JanusGraph as a library inside my Java Application). I am trying to figure out if I can write two different named graphs to the same database. I am trying to use https://docs.janusgraph.org/operations/configured-graph-factory/ to create multiple graphs. I set the name of the graph in graph.graphname in my JanusGraph configuration. But I get the following error while opening the graph...

Drop an Graph From the Database

Hello!! I want to delete the graph stored in my database so that I can recreate the new graph. I am running the following command to delete the graph...

Concurrency challenges and the solution

Are there any known concurrency challenges with Janusgraph gremlin queries and what's the solution. I am planning to use either cassndra or ScyllaDB as storage backend.

Caused by: java.lang.SecurityException: class "org.apache.commons.logging.impl.NoOpLog"'

Gettting below exception when i try deploing janusgraph in tomcat Caused by: java.lang.SecurityException: class "org.apache.commons.logging.impl.NoOpLog"'s signer information does not match signer information of other classes in the same package...

Changing the max number of parameters for parameterized queries

Hi, we are using parameterized queries for graph ingestion and we reached the limit of parameters in a query (default is 16) hence our ingestion is failing. We want to update those settings following this guide https://tinkerpop.apache.org/docs/3.2.5/upgrade/#_default_maximum_parameters. Doing this in the gremlin-server yml is straightforward but we would like to do it via environment variables. Any suggestions? I tried some combinations and this is the closer I got: docker run --rm -it -e gremlinserver.processors[+].className[org.apache.tinkerpop.gremlin.server.op.session.SessionOpProcessor].config.maxParameters=256 docker.io/janusgraph/janusgraph:latest janusgraph show-config which generated something like: ...

Need help with injecting data into janusgraph databse using python

I am new to janusGraph and need help in injecting the following data into janus graph using python mainly but any other language can also work. I also am not sure wether to use cassandra, hbase,berkeley DB, etc. as I am unable to understand how to handle the following data : https://snap.stanford.edu/data/amazon0601.html https://snap.stanford.edu/data/amazon-meta.html...

Janusgraph with bigtable with olap+oltp

HI folks, we have recently setup janusgraph with bigtable. This is just used for oltp usecases, but given it's nature we can't run full graph queries like pagerank etc. We would like to know if we can setup OLAP janusgraph as well, which basically has the same schema as the OLTP db but runs against longer time period i.e. for OLAP usecases. Can someone guide us in achieving this? I was thinking to run both of systems together depending upon the query window. Was thinking if we can leverage ingesiton pipeline of OLTP for creating graph in OLAP....