RunPod · 10mo ago
Andrey

Number of requests per second

Hello! Can you clarify the number of requests per second? Here (https://docs.runpod.io/serverless/references/operations) you write "Rate limit: 2000 per second", but here (https://docs.runpod.io/serverless/endpoints/invoke-jobs), under Rate Limits, it says "/runsync: 2000 requests every 10 seconds."
5 Replies
ashleyk · 10mo ago
Oh wow, nice find. @flash-singh can you confirm?
flash-singh · 10mo ago
It's 2k requests per 10 seconds.
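
Editor's note: for anyone who wants to stay under the confirmed limit on the client side, here is a minimal sketch of a sliding-window limiter. The class name `SlidingWindowLimiter` and its methods are hypothetical and not part of any RunPod SDK. The defaults (2000 requests, 10-second window) come from the reply above; how the server actually counts the window is not stated in this thread, so treat it as an approximation.

```python
import time
from collections import deque


class SlidingWindowLimiter:
    """Client-side limiter: at most `max_requests` per `window_seconds`.

    Hypothetical helper for illustration; not a RunPod API.
    """

    def __init__(self, max_requests=2000, window_seconds=10.0, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock
        self._sent = deque()  # timestamps of requests still inside the window

    def _evict_expired(self, now):
        # Drop timestamps that have aged out of the window.
        while self._sent and now - self._sent[0] >= self.window_seconds:
            self._sent.popleft()

    def try_acquire(self):
        """Record a request and return True if it fits in the window, else False."""
        now = self.clock()
        self._evict_expired(now)
        if len(self._sent) < self.max_requests:
            self._sent.append(now)
            return True
        return False

    def wait_time(self):
        """Seconds until the next request would be allowed (0.0 if allowed now)."""
        now = self.clock()
        self._evict_expired(now)
        if len(self._sent) < self.max_requests:
            return 0.0
        return self.window_seconds - (now - self._sent[0])
```

A caller would check `try_acquire()` before each request and sleep for `wait_time()` when it returns False.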
Andrey (OP) · 10mo ago
And can this limit be raised? We need about 1250 requests per second.
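
Editor's note: a quick back-of-the-envelope check of how far the requested rate is from the confirmed limit. The numbers are taken from the messages in this thread; the variable names are only illustrative, and the sketch assumes the 2000-per-10-seconds budget applies to all of the traffic in question.

```python
LIMIT_REQUESTS = 2000   # confirmed limit per window
WINDOW_SECONDS = 10     # window length in seconds
NEEDED_RPS = 1250       # rate requested in this thread

# Average rate the limit allows, in requests per second.
allowed_rps = LIMIT_REQUESTS / WINDOW_SECONDS

# Requests the target rate would send within one window.
needed_per_window = NEEDED_RPS * WINDOW_SECONDS

# How many times larger the target rate is than the allowed average.
shortfall_factor = NEEDED_RPS / allowed_rps

print(allowed_rps, needed_per_window, shortfall_factor)
```

So the limit averages 200 requests per second, while 1250 requests per second means 12,500 requests per 10-second window, 6.25 times the confirmed limit.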
ashleyk · 10mo ago
Does @PatrickR need to fix the doc that's incorrect?
flash-singh · 10mo ago
What kind of requests are these? LLM? Text-to-image? You can PM me the details and we can talk further.