rewbs
RRunPod
•Created by rewbs on 3/22/2024 in #⚡|serverless
Docker image using headless OpenGL (EGL, surfaceless plaform) OK locally, fails to CPU in Runpod
Hi all, I'm wondering if anyone can educate me on what would be causing this difference in behaviour when running a container locally versus in Runpod, and whether there is a solution.
In summary I'm trying to run a headless OpenGL program in a docker container, by using ELG with the surfaceless platform (https://registry.khronos.org/EGL/extensions/MESA/EGL_MESA_platform_surfaceless.txt). I was able to get the program working as intended in a container outside of Runpod. But once deployed to Runpod, it falls back to CPU processing.
As a minimal testcase, it's sufficient to simply run
eglinfo
, a utility which tells you what EGL devices are available. Outside of runpod multiple are available, but in Runpod none are. The testcase and example outputs are available here: https://github.com/rewbs/egldockertest .
Any ideas very much appreciated!
(As an aside, I should note I'm by no means an OpenGL expert so I might be getting confused, or at very least getting the terminology wrong.)26 replies
RRunPod
•Created by rewbs on 2/14/2024 in #⚡|serverless
Is there a way to access worker ID & job ID from a handler? Would be good for logging + debugging
If Runpod could drop worker ID and job ID into environment variables made available to handlers that would be ace. Would save me a lot of time manually mapping things when trying to debug issues.
6 replies
RRunPod
•Created by rewbs on 12/21/2023 in #⚡|serverless
Jobs suddenly queuing up: only 1 worker active, 9 jobs queued
5 replies