Vectris
RRunPod
•Created by Vectris on 11/27/2024 in #⛅|pods-clusters
Unable to boot mi300x
Getting the following error:
error starting container: Error response from daemon: error gathering device information while adding custom device "/dev/dri/renderD136": no such file or directory
Pod ID: kptjoa8hkns744
15 replies
RRunPod
•Created by Vectris on 11/26/2024 in #⛅|pods-clusters
Mi300x HIP error: no ROCm-capable device is detected
I'm using the Mi300x and getting a
RuntimeError: HIP error: no ROCm-capable device is detected
using RunPod Pytorch 2.4.0 ROCm 6.1 template, how can I resolve this?3 replies
RRunPod
•Created by Vectris on 11/13/2024 in #⛅|pods-clusters
How to migrate serverless endpoint to a pod?
I have a strange use case in which I have a functional serverless endpoint that must run on AMD hardware (for none technical reasons)
Everything is setup and working currently running on NVIDIA hardware.
AMD hardware is not yet available for serverless, can I recreate the serverless behaviour using a pod?
I understand the two services are very different, I just need an API that generates an output using AMD hardware.
5 replies