nathaniel
nathaniel
RRunPod
Created by vlad.pustovarov on 4/7/2025 in #⛅|pods
500 Response when creating pod using API
looking it up, it seems some people's perspective is that since the error happened at the business logic layer rather than the http layer, I should return 200 but with some error struct in the response body. To that I say, nah. As a consumer I like being able to assume that 200 / 2xx actually means everything went fine and my error handling code doesn't need to parse any further unless response is not 2xx
16 replies
RRunPod
Created by vlad.pustovarov on 4/7/2025 in #⛅|pods
500 Response when creating pod using API
i looked for a 400 response code that covered this kind of situation, namely that nothing went wrong on the server but the request did not succeed. 409 sounds the closest but not exactly it?
16 replies
RRunPod
Created by vlad.pustovarov on 4/7/2025 in #⛅|pods
500 Response when creating pod using API
if you try in future maybe we do have that combination of settings and the input does work, so problem is on our end
16 replies
RRunPod
Created by vlad.pustovarov on 4/7/2025 in #⛅|pods
500 Response when creating pod using API
debatable, you as the user didn't do anything wrong by submitting this set of parameters, we just didn't have the infrastructure on the backend to support it
16 replies
RRunPod
Created by vlad.pustovarov on 4/7/2025 in #⛅|pods
500 Response when creating pod using API
another option for us on the backend is to run the sql query with EXPLAIN and look at how many rows were left after each WHERE clause, and report back the ones that led to a sufficiently high % reduction of the search space. also pretty sophisticated but not as expensive since we're running the query anyway
16 replies
RRunPod
Created by vlad.pustovarov on 4/7/2025 in #⛅|pods
500 Response when creating pod using API
In future, we might spend some extra time to get potential machines with slightly altered versions of your input, and suggest to you "hey, your input works if you get rid of AllowedCudaVersions so maybe try that". For now that is a bit sophisticated
16 replies
RRunPod
Created by vlad.pustovarov on 4/7/2025 in #⛅|pods
500 Response when creating pod using API
unfortunately this is one of the places where we can't give you super great error messages on how to fix it. This means for the set of parameters you specified, no machines match all the constraints. There are too many possible causes of this for us to narrow one down to suggest. Try removing all params you don't actually care about - they are all optional, we have defaults for them on the backend. Once you get to a configuration that works, add things in one by one to see if that was a reason machines were unavailable
16 replies
RRunPod
Created by Shehriyar on 3/31/2025 in #⛅|pods
GraphQL secretCreate Mutation
@zacksparrow how do scopes for secret mutations work?
26 replies
RRunPod
Created by Jamb on 3/25/2025 in #⛅|pods
Runpod occasionally fails to pull from ECR
will update you when fix is found
11 replies
RRunPod
Created by Jamb on 3/25/2025 in #⛅|pods
Runpod occasionally fails to pull from ECR
debugging this now
11 replies
RRunPod
Created by ajgeiss0702 on 3/24/2025 in #⛅|pods
Requests using RUNPOD_API_KEY fail with 403 unauthorized.
that makes sense. will get back to you on this in next few days
18 replies
RRunPod
Created by ajgeiss0702 on 3/24/2025 in #⛅|pods
Requests using RUNPOD_API_KEY fail with 403 unauthorized.
if it usually works with things like runpodctl stop which I think I’ve seen people do, then it’s an issue with how rest api parses the scopes
18 replies
RRunPod
Created by ajgeiss0702 on 3/24/2025 in #⛅|pods
Requests using RUNPOD_API_KEY fail with 403 unauthorized.
yes, I believe this is a scope issue with the api key that gets generated for the pod. let me check how it’s supposed to work in code
18 replies
RRunPod
Created by Cemal on 3/18/2025 in #⚡|serverless
Terminate worker
ok, the update I was talking about is now released, set includeWorkers=true for get and list endpoints to show current workers
13 replies
RRunPod
Created by Cemal on 3/18/2025 in #⚡|serverless
Terminate worker
13 replies
RRunPod
Created by Cemal on 3/18/2025 in #⚡|serverless
Terminate worker
it sounds like you are polling through separate means for the error which will tell you the pod id, so yeah you can just use the rest api's delete pod endpoint
13 replies
RRunPod
Created by Cemal on 3/18/2025 in #⚡|serverless
Terminate worker
will keep you updated on that first part, the second part can already be done if you know the worker's pod id by hitting DELETE https://rest.runpod.io/v1/pods/thePodId
13 replies
RRunPod
Created by Cemal on 3/18/2025 in #⚡|serverless
Terminate worker
that's not the same as terminating a pod/worker by id. I am shortly adding an update to rest api which will let you pull the list of workers for your endpoint, and once you have the ids of those you can use the delete/terminate pod endpoint just as you can for normal pods
13 replies
RRunPod
Created by ozzie on 3/18/2025 in #⚡|serverless
Delete Serverless Endpoint via the API?
the python sdk was created long before rest api existed, which is why it hits graphql all over the place
11 replies
RRunPod
Created by ozzie on 3/18/2025 in #⚡|serverless
Delete Serverless Endpoint via the API?
as for why the code for gql was not working before, idk :shrug: but since rest api is just doing the same thing on the backend it was probably an error in calling it. those opaque error messages from gql are unfortunate and someday we should figure out how to expose more info to users in them without revealing a bunch of internals. As devs, they are annoying to us too, but at least we can go look at the logs to see what actually went wrong
11 replies