Daniel T.
We have detected a critical error on this machine which may affect some pods.
runpodctl send
times out/fails for large files when transferring data. This is a relatively common problem for approaches that do not have enough retries or networking instability. rsync
and wget
work due to robust retries. Given the inability to obtain a pod in the given geographical region, I'm giving up on the approach of transferring data via two connected pods and will pay the cloud provider fees.27 replies
We have detected a critical error on this machine which may affect some pods.
Thank you for the update and for crediting the account. Right now it seems like two GPUs are in the error state. Do you have any clue regarding the timing for the tech to fix the issue? Would you recommend spinning up a new instance, or waiting for the issue to be fixed by the tech? For reference, it takes ~24 hours to transfer data and egress costs are substantial.
27 replies