nathaniel Comments - Answer Overflow

nathaniel

•Created by vlad.pustovarov on 4/7/2025 in #⛅｜pods

500 Response when creating pod using API

looking it up, it seems some people's perspective is that since the error happened at the business logic layer rather than the http layer, I should return 200 but with some error struct in the response body. To that I say, nah. As a consumer I like being able to assume that 200 / 2xx actually means everything went fine and my error handling code doesn't need to parse any further unless response is not 2xx

16 replies

RRunPod

•Created by vlad.pustovarov on 4/7/2025 in #⛅｜pods

500 Response when creating pod using API

i looked for a 400 response code that covered this kind of situation, namely that nothing went wrong on the server but the request did not succeed. 409 sounds the closest but not exactly it?

16 replies

RRunPod

•Created by vlad.pustovarov on 4/7/2025 in #⛅｜pods

500 Response when creating pod using API

if you try in future maybe we do have that combination of settings and the input does work, so problem is on our end

16 replies

RRunPod

•Created by vlad.pustovarov on 4/7/2025 in #⛅｜pods

500 Response when creating pod using API

debatable, you as the user didn't do anything wrong by submitting this set of parameters, we just didn't have the infrastructure on the backend to support it

16 replies

RRunPod

•Created by vlad.pustovarov on 4/7/2025 in #⛅｜pods

500 Response when creating pod using API

another option for us on the backend is to run the sql query with EXPLAIN and look at how many rows were left after each WHERE clause, and report back the ones that led to a sufficiently high % reduction of the search space. also pretty sophisticated but not as expensive since we're running the query anyway

16 replies

RRunPod

•Created by vlad.pustovarov on 4/7/2025 in #⛅｜pods

500 Response when creating pod using API

In future, we might spend some extra time to get potential machines with slightly altered versions of your input, and suggest to you "hey, your input works if you get rid of AllowedCudaVersions so maybe try that". For now that is a bit sophisticated

16 replies

RRunPod

•Created by vlad.pustovarov on 4/7/2025 in #⛅｜pods

500 Response when creating pod using API

unfortunately this is one of the places where we can't give you super great error messages on how to fix it. This means for the set of parameters you specified, no machines match all the constraints. There are too many possible causes of this for us to narrow one down to suggest. Try removing all params you don't actually care about - they are all optional, we have defaults for them on the backend. Once you get to a configuration that works, add things in one by one to see if that was a reason machines were unavailable

16 replies

RRunPod

•Created by Shehriyar on 3/31/2025 in #⛅｜pods

GraphQL secretCreate Mutation

@zacksparrow how do scopes for secret mutations work?

26 replies

RRunPod

•Created by Jamb on 3/25/2025 in #⛅｜pods

Runpod occasionally fails to pull from ECR

will update you when fix is found

11 replies

RRunPod

•Created by Jamb on 3/25/2025 in #⛅｜pods

Runpod occasionally fails to pull from ECR

debugging this now

11 replies

RRunPod

•Created by ajgeiss0702 on 3/24/2025 in #⛅｜pods

Requests using RUNPOD_API_KEY fail with 403 unauthorized.

that makes sense. will get back to you on this in next few days

18 replies

RRunPod

•Created by ajgeiss0702 on 3/24/2025 in #⛅｜pods

Requests using RUNPOD_API_KEY fail with 403 unauthorized.

if it usually works with things like runpodctl stop which I think I’ve seen people do, then it’s an issue with how rest api parses the scopes

18 replies

RRunPod

•Created by ajgeiss0702 on 3/24/2025 in #⛅｜pods

Requests using RUNPOD_API_KEY fail with 403 unauthorized.

yes, I believe this is a scope issue with the api key that gets generated for the pod. let me check how it’s supposed to work in code

18 replies

RRunPod

•Created by Cemal on 3/18/2025 in #⚡｜serverless

Terminate worker

ok, the update I was talking about is now released, set includeWorkers=true for get and list endpoints to show current workers

13 replies

RRunPod

•Created by Cemal on 3/18/2025 in #⚡｜serverless

Terminate worker

https://rest.runpod.io/v1/docs#tag/pods/DELETE/pods/{podId}

13 replies

RRunPod

•Created by Cemal on 3/18/2025 in #⚡｜serverless

Terminate worker

it sounds like you are polling through separate means for the error which will tell you the pod id, so yeah you can just use the rest api's delete pod endpoint

13 replies

RRunPod

•Created by Cemal on 3/18/2025 in #⚡｜serverless

Terminate worker

will keep you updated on that first part, the second part can already be done if you know the worker's pod id by hitting DELETE https://rest.runpod.io/v1/pods/thePodId

13 replies

RRunPod

•Created by Cemal on 3/18/2025 in #⚡｜serverless

Terminate worker

that's not the same as terminating a pod/worker by id. I am shortly adding an update to rest api which will let you pull the list of workers for your endpoint, and once you have the ids of those you can use the delete/terminate pod endpoint just as you can for normal pods

13 replies

RRunPod

•Created by ozzie on 3/18/2025 in #⚡｜serverless

Delete Serverless Endpoint via the API?

the python sdk was created long before rest api existed, which is why it hits graphql all over the place

11 replies

RRunPod

•Created by ozzie on 3/18/2025 in #⚡｜serverless

Delete Serverless Endpoint via the API?

as for why the code for gql was not working before, idk :shrug: but since rest api is just doing the same thing on the backend it was probably an error in calling it. those opaque error messages from gql are unfortunate and someday we should figure out how to expose more info to users in them without revealing a bunch of internals. As devs, they are annoying to us too, but at least we can go look at the logs to see what actually went wrong

11 replies

Gaming

Programming