bghira
GPU errored, machine dead
2024-09-04T11:12:09Z stop container
2024-09-04T11:12:44Z remove container
2024-09-04T11:12:51Z create container runpod/pytorch:2.4.0-py3.11-cuda12.4.1-devel-ubuntu22.04
2024-09-04T11:12:52Z 2.4.0-py3.11-cuda12.4.1-devel-ubuntu22.04 Pulling from runpod/pytorch
2024-09-04T11:12:52Z Digest: sha256:a931abe272a5156aab1b4fd52a6d3c599a5bf283b6e6d11d1765336e22b1037c
2024-09-04T11:12:52Z Status: Image is up to date for runpod/pytorch:2.4.0-py3.11-cuda12.4.1-devel-ubuntu22.04
2024-09-04T11:12:52Z error creating container: nvidia-smi: exit status 255\n
---------stdout------
Unable to determine the device handle for GPU0000:04:00.0: Unknown Error
---------stderr------
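For anyone hitting the same thing: the log above shows container creation failing because `nvidia-smi` exited with status 255. A rough sketch of the same kind of check, run from Python, is below. The `gpu_health` helper and its parameters are made up for illustration; this is not what the platform actually runs, just a way to reproduce the exit-status check on a host where `nvidia-smi` is installed.

```python
import subprocess


def gpu_health(cmd=("nvidia-smi",), timeout=30):
    """Run a GPU probe command and return (healthy, exit_code, output).

    Hypothetical helper: treats any non-zero exit status as unhealthy,
    which is an assumption about how the failure in the log was detected.
    """
    try:
        proc = subprocess.run(
            list(cmd), capture_output=True, text=True, timeout=timeout
        )
    except FileNotFoundError:
        # The probe binary is not installed or not on PATH.
        return False, None, "probe command not found"
    except subprocess.TimeoutExpired:
        # A hung driver can make the probe block indefinitely.
        return False, None, "probe command timed out"
    output = (proc.stdout + proc.stderr).strip()
    return proc.returncode == 0, proc.returncode, output


if __name__ == "__main__":
    healthy, code, output = gpu_health()
    print(f"healthy={healthy} exit_code={code}")
    if not healthy:
        print(output)
```

If this reports a non-zero exit code with an "Unable to determine the device handle" message, the problem is on the host or GPU side rather than in the container image, so recreating the container will likely not help.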