Nikita
RRunPod
•Created by sidou on 11/20/2024 in #⚡|serverless
"Failed to return job results. | Connection timeout to host https://api.runpod.ai/v2/91gr..."
I've noticed that some of my A6000/A40 workers have been falling into throttling mode very often lately
10 replies
RRunPod
•Created by sidou on 11/20/2024 in #⚡|serverless
"Failed to return job results. | Connection timeout to host https://api.runpod.ai/v2/91gr..."
anyway if your images doesnt exceed the limit of 10 mbs of a payload and it works on some machines, i would suggest you to try redeploying it :\
10 replies
RRunPod
•Created by sidou on 11/20/2024 in #⚡|serverless
"Failed to return job results. | Connection timeout to host https://api.runpod.ai/v2/91gr..."
or it might be related to volume attached to it
10 replies
RRunPod
•Created by sidou on 11/20/2024 in #⚡|serverless
"Failed to return job results. | Connection timeout to host https://api.runpod.ai/v2/91gr..."
and i assume the code is fine as you have workers that are okay
when did you deploy on your a100 pcie for the last time? I guess it might be related to the recent price drop for bigger GPUs followed by availability decrease
10 replies
RRunPod
•Created by sidou on 11/20/2024 in #⚡|serverless
"Failed to return job results. | Connection timeout to host https://api.runpod.ai/v2/91gr..."
info about the SDK version is usually in your dockerfile or requirements.txt of your repository
10 replies
RRunPod
•Created by sidou on 11/20/2024 in #⚡|serverless
"Failed to return job results. | Connection timeout to host https://api.runpod.ai/v2/91gr..."
What is your SDK version?
10 replies