What the fuck is going on again with US - 1 x H100 80GB SXM5?
"We have detected a critical error on this machine which may affect some pods. We are looking into the root cause and apologize for any inconvenience. We would recommend backing up your data and creating a new pod in the meantime."
I have been using RunPod and every fucking day something is wrong!?
ID: x1vidmyoiu3a06
ID: 0qt99lcnw9026q
ID: tg8zumezw3rt9e
WHY!?!?!?!?!?
HOW can I run my business HERE?
Three pods like this ...
@flash-singh @Zeen I have to agree that this is not acceptable for such expensive GPUs.
We are looking into that; it's the same HGX server running into network issues.
Everything is working fine, thanks
This is a NIGHTMARE! Everything stopped working today. I have to shut down my H100 servers. I can't run my business like this.
@ashleyk @flash-singh @Zeen
Same Pods:
ID: x1vidmyoiu3a06
ID: 0qt99lcnw9026q
ID: tg8zumezw3rt9e
I have other servers with the message:
"This server has recently suffered a network outage and may have spotty network connectivity. We aim to restore connectivity soon, but you may have connection issues until it is resolved. You will not be charged during any network downtime."
ID: g0htfaz7oe0lht
ID: brr2em0266otas
ID: 2xpbv4mg8ka0v7
@ashleyk
Sorry, I can't help. I am just a community member, not RunPod staff. Unfortunately, you will have to wait for RunPod staff to look into it for you, but they are in the US, so they will probably only come online in a few hours' time.
ok thanks
Issue with the server again:
The following pods were impacted.
2xpbv4mg8ka0v7
brr2em0266otas
g0htfaz7oe0lht
Can somebody look at it please?
@flash-singh @Zeen
"This server has recently suffered a network outage and may have spotty network connectivity. We aim to restore connectivity soon, but you may have connection issues until it is resolved. You will not be charged during any network downtime."