DannyB
RRunPod
•Created by DannyB on 12/9/2024 in #⚡|serverless
Structure of "job" JSON
I understand that at the very least there is
job["id"]
and job["input"]
and we utilize it.
It will help me a great deal if I could send additional information like job["source"]
or other metadata to the handler function.
It seems like no matter how I structure the JSON, only id
and input
end up in the job JSON to the handler.
Is this indeed the case?
I prefer not to force additional metadata inside the input
field ... I want a separation between the model's inputs and the handler controllers.10 replies
RRunPod
•Created by DannyB on 11/18/2024 in #⚡|serverless
Unable to fetch docker images
During worker initialization I am seeing errors such as:
error pulling image: Error response from daemon: Get "https://registry-1.docker.io/v2/": context deadline exceeded
2024-11-18T18:10:47Z error pulling image: Error response from daemon: Get "https://registry-1.docker.io/v2/": net/http: request canceled while waiting for connection (Client.Timeout exceeded while awaiting headers)
image pull: {repo}/{tag}: pending
failed to pull image: open /var/lib/docker/165536.165536/tmp/GetImageBlob3971008098: no such file or directory
10 replies
RRunPod
•Created by DannyB on 11/11/2024 in #⚡|serverless
🚨 All 30 H100 workers are throttled
What's going on ?
4 replies
RRunPod
•Created by DannyB on 4/26/2024 in #⚡|serverless
A way to know if worker is persistent ("active") or not
It would be a great help if there was some way from inside the code (e.g. environment variable) to know if the model is running on a persistent worker or not.
Example use-case - If the worker is persistent, I can compile the model, it takes ~20 minutes but it is worth it so my users can get an almost 50% latency reduction. However, you can understand it is impossible to do for an ephemeral worker as it will take too long to initialize.
Is there any way to do this?
3 replies