Vectris Posts - Answer Overflow

Vectris

•Created by Vectris on 11/27/2024 in #⛅｜pods-clusters

Unable to boot mi300x

Getting the following error: error starting container: Error response from daemon: error gathering device information while adding custom device "/dev/dri/renderD136": no such file or directory Pod ID: kptjoa8hkns744

15 replies

RRunPod

•Created by Vectris on 11/26/2024 in #⛅｜pods-clusters

Mi300x HIP error: no ROCm-capable device is detected

I'm using the Mi300x and getting a RuntimeError: HIP error: no ROCm-capable device is detected using RunPod Pytorch 2.4.0 ROCm 6.1 template, how can I resolve this?

3 replies

RRunPod

•Created by Vectris on 11/13/2024 in #⛅｜pods-clusters

How to migrate serverless endpoint to a pod?

I have a strange use case in which I have a functional serverless endpoint that must run on AMD hardware (for none technical reasons) Everything is setup and working currently running on NVIDIA hardware. AMD hardware is not yet available for serverless, can I recreate the serverless behaviour using a pod? I understand the two services are very different, I just need an API that generates an output using AMD hardware.

5 replies

Gaming

Programming