jojje
Runpod API documentation
Great. Your AI helpbot revealed the GraphQL API is documented here: https://graphql-spec.runpod.io/ (for anyone else looking)
So that means the question can be narrowed to just the REST API.
4 replies
any way to control the restart policy of pods?
Haven't found one. So in order to avoid crash-loops, I just wrap all my containers in an init script that just execs into a "wait" process and launches all its actual work in sub-processes. That way I can see any errors in the logs, debug and fix stuff that is broken without the frigging container vanishing in a puff a smoke a second after an error happens.
The always-restart policy is only really useful for stable production workloads. Not for R&D or experimental setups, which is all I'm using runpod for.
7 replies