Stuck pod instance
I have a problem with community pod (id: xbyhioflerw8pz), it is not accessible for a really long time and stuck at launch. It infinitely tries to deploy without updating the status, only shows "Waiting for logs". Live chat support is silent. I appreciate any help.
28 Replies
Can u share ur template
I have an assumption but id love if u can screenshot the template and setup
@justin it uses runpod/pytorch:3.10-2.0.0-117
ur using a default template?
Or ur using one u typed in urself?
Have u terminated and started it again?
else is by default
I could be wrong but did u type that docker image name? or u just did it through the edit button
The name of image was available sometime ago. I used the pod, then saw network outage problem. Since that I cannot launch the machine. Terminate is not the option as I need access to the data inside
@ashleyk could you take a look?
He does not work at RunPod
By "He" you refer to the mentioned image tag? How can I make the pod alive again?
cause you tagged Ashleyk
Apologize, got confused
Could you help resolve the issue?
I do not have permissions to check hardware level stuff
To whom I can address with the problem?
forwarded it to team
@Papa Madiator any news? I am still paying for the storage of the pod the whole time it is not available
Use the Web Chat functionality, on the bottom right of runpod, to try to get in contact with customer support for a refund btw if needed / Also @Papa Madiator
Ill leave papa madiator to respond if he has any ideas for anything else.
@justin [Not Staff] unfortunately, there has been no answer from the web chat for a month. It's the reason I created the post
btw where data was saved on volume network or container storage?
@Papa Madiator it is community pod, hence data is stored in container storage in /workspace dir
@Papa Madiator any updates?
pod id?
xbyhioflerw8pz
any news?
Hello
It is same for me, the gpu was working yesterday. I am waiting for logs from hrs today
What is the pod id?
ID: 4smb1047x6p4qs$
@Papa Madiator to whom can I contact to understand whether working on issue is in progress, time estimates, etc?
Use web chat, Discord is for community help, web chat has proper ticket tracking etc.
asked team again
Server is experiencing hardware issues, and we've requested the data center team to resolve them. It may be back online tomorrow; otherwise, it's likely to be Monday. Once the server is back online, you can retrieve the data. After retrieving the data, please create a new pod, after which we will proceed to shut down the server.
To prevent this type of issues, please consider creating a network volume.