Container keeps restarting
Hello, after I start GPU pod, the container keeps restarting:
2024-02-16T20:18:35Z start container
in infinite loop. when I SSH I get:
Error response from daemon: Container 7a57de0dfdcb653575e97a592e6c72fa2807bd3238065065b700b177dfd65b94 is not running
What could be the problem?
7 Replies
Can u share ur template?
is this a custom template or a runpod one?
It might be i sometimes see ppl forget a :tag, if its custom
or if it is a runpod one, terminate and make a new one
oops, I saw suspicious commas in the http ports
restarting
Commas are normal, they aren't causing the issue
not working, I saw runpod template axolotl
trying this
OK great, works.
@justin @ashleyk thanks for the responses!
Yeah from what i could tell ur docker image is wrong:
Forgetting the -runpod part
https://hub.docker.com/layers/winglian/axolotl-runpod/main-py3.10-cu118-2.1.2/images/sha256-69f8a6fed7e8ec75508af8b195cf1e29d4b9fd39f49341c12e65139ccc2738c1?context=explore
for all these containers if u click into it u can see the name
also look at logs, might have more info
also if it keeps saying start container, then your container is not running a process that stays alive, in container command you can type somethng like
bash -c 'sleep infinity'