pazanchick
RRunPod
•Created by pazanchick on 4/22/2024 in #⚡|serverless
No active workers after deploying New Release
34 replies
RRunPod
•Created by pazanchick on 4/17/2024 in #⚡|serverless
'Connection reset by peer' after job finishes.
Previous logs indicate that the handler works correctly. Happened multiple times now and returns failure response. Any input on this issue?
@Papa Madiator
5 replies
RRunPod
•Created by pazanchick on 2/17/2024 in #⚡|serverless
llama.cpp serverless endpoint
https://github.com/ggerganov/llama.cpp
llama.cpp is afak the only setup that supports llava-1.6 quantized, that's why i use it. On some workers the docker image works, on others "illegal instruction" error and crash. https://github.com/ggerganov/llama.cpp/issues/537 I wonder if someone already tried it out and if there's a better fix to this issue other than building and stuffing multiple binaries with the correct instruction sets into one image that will work anywhere. (i already tried building with LLAMA_NATIVE=0) appreciate any insights, thanks!
llama.cpp is afak the only setup that supports llava-1.6 quantized, that's why i use it. On some workers the docker image works, on others "illegal instruction" error and crash. https://github.com/ggerganov/llama.cpp/issues/537 I wonder if someone already tried it out and if there's a better fix to this issue other than building and stuffing multiple binaries with the correct instruction sets into one image that will work anywhere. (i already tried building with LLAMA_NATIVE=0) appreciate any insights, thanks!
8 replies
RRunPod
•Created by pazanchick on 2/7/2024 in #⚡|serverless
GraphQL: How to get the runtime of a serverless pod through the api stateless?
3 replies
GraphQL: Query specific Endpoints and getting running worker amount
My goal is to adjust the amount of Active Workers for serverless endpoints dynamically.
1. Is there a way to query specific endpoints, instead of all?
https://docs.runpod.io/graphql/manage-endpoints
2. Is checking the
PodTelemetry.state
for all endpoint pods
the most reliable way to for counting how many pods are running?
https://graphql-spec.runpod.io/#definition-Endpoint
I found that a health check with https://api.runpod.ai/v2/<id>/health
seems to be more delayed.
Appreciate any insights and thanks for your time!4 replies