GENGHIS
Networking on my pod has been shit for last 3 days. please fix. US region. RTX 6000 Ada
network storage doesnt seem to be useful unless you have gpu availability within same region. generally i stay on a machine if it's available and if it's not then usually there's no same region machines to transfer data to anyway.
9 replies
Better solution for 0 GPU stranded volumes
I was able to reduce the resource requirements for data upload by modifying my aws cli config. however, my main point still stands which is that better data escape valves would be appreciated and I would be willing to pay for more cpu to get my data off faster.
21 replies
Kernel version discrepancy between Pods.
Also having issue with outdated kernel. My process is hanging
accelerator = Accelerator() Detected kernel version 5.4.0, which is below the recommended minimum of 5.5.0; this can cause the process to hang. It is recommended to upgrade the kernel to the minimum version or higher.
13 replies