llvmpipe is being used instead of GPU
1s delay between execution done and Finished message
Serverless is Broken
EU-RO-1 region severless H100 gpu not available ....
Workers wrongfully reported as "idle"
"Throttled" and re-"Initializing" workers everywhere today
how to run flux+lora on 24 GB Gpu through code
Queue waiting 5+ minutes with dozens of idle workers
Serverless H200?
using compression encoding for serverless requests
Throttled ECR Download?
Need some help to troubleshoot a configuration of a Serverless
Do Webhook Request Responses have a retry mechanism?
Incorrect billing
ed0rivbjvv0x0u
and pzfz3xhwa86raj
Request getting stuck
Serverles endpoint status and runsync not returning data anymore in request body (request not found)
I want to increase/decrease workers by code, can you help?
1 active worker
in the actual time when we expect the traffic throughout the day, and at night when no one is using the application we make active workers 0
to avoid any charges. And then the next day, we make active workers 1
manually from runpod dashboard.
We are willing to do that automatically. I know there is a GraphQL
but I am not able to find relevant code to do that.
Can anyone please help?...Support for https://huggingface.co/deepseek-ai/DeepSeek-V3?
Serverless Idle Timeout is not working