otakuhero
RunPod
•Created by otakuhero on 7/12/2024 in #⚡|serverless
Some worker can't find file "libEGL_nvidia.so.0"
Thanks for the reply, I've sent the email. The problem occurs on both pods and serverless, at random. It seems to occur only on containers running with an older driver version or one specific driver version.
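For anyone hitting the same thing, a quick way to confirm whether a given worker is affected is to ask the dynamic loader for the library at startup. This is only a minimal sketch, assuming Python is available inside the container; `can_load` is a hypothetical helper name, not part of any RunPod or NVIDIA API.

```python
import ctypes


def can_load(lib_name: str) -> bool:
    """Return True if the dynamic loader can open lib_name, else False."""
    try:
        ctypes.CDLL(lib_name)
        return True
    except OSError:
        # Raised when the shared object is missing or cannot be loaded.
        return False


if __name__ == "__main__":
    # Library name taken from the thread title above.
    name = "libEGL_nvidia.so.0"
    status = "found" if can_load(name) else "NOT found"
    print(f"{name}: {status}")
```

Running this at the top of a handler makes it possible to log which workers are missing the library instead of failing later in the rendering code.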
5 replies
Pod unable to read environment variables set in templates caused a loss
I initially submitted a ticket on the web but haven't received a response yet. I also submitted a ticket online previously and never received an email, so I'm trying to provide feedback here 😳
10 replies
Container fails to start randomly
@Papa Madiator hi, I encountered the same problem again: two pods failed to start (pod IDs: wqaz2xufma32pt and eqyabu82t6l3y9). I'm not sure whether it's caused by my custom images or by infrastructure such as the physical machines.
24 replies
Container fails to start randomly
Hi, I've encountered a new issue. When I create an RTX 4090 pod, it fails to launch. 😔 The system log is as follows. When I delete the pod and rebuild it, everything returns to normal. What could be the reason for this?
I use Secure Cloud and a custom image.
2024-02-27T08:13:42Z start container
2024-02-27T08:13:43Z error starting container: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
Inconsistency detected by ld.so: ../sysdeps/x86_64/dl-machine.h: 534: elf_machine_rela_relative: Assertion `ELFW(R_TYPE) (reloc->r_info) == R_X86_64_RELATIVE' failed!
nvidia-container-cli: detection error: driver rpc error: failed to process request: unknown
2024-02-27T08:13:58Z start container
24 replies
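Since deleting and recreating the pod cleared the problem, it can help to detect this failure from the system log automatically. The following is a minimal sketch, assuming the log is available as plain text. The function names are hypothetical, the two signature strings are copied from the log above, and treating them as a marker of this particular failure is an assumption rather than documented behavior.

```python
import re
from typing import List

# Substrings copied from the system log in the post above.
SIGNATURES = (
    "nvidia-container-cli: detection error",
    "Inconsistency detected by ld.so",
)

# Matches lines such as "2024-02-27T08:13:42Z start container".
TS_LINE = re.compile(r"^(\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}Z)\s+(.*)$")


def find_signatures(log_text: str) -> List[str]:
    """Return the known error signatures that appear in the log text."""
    return [sig for sig in SIGNATURES if sig in log_text]


def count_start_attempts(log_text: str) -> int:
    """Count timestamped 'start container' lines, i.e. launch attempts."""
    attempts = 0
    for line in log_text.splitlines():
        match = TS_LINE.match(line.strip())
        if match and match.group(2) == "start container":
            attempts += 1
    return attempts
```

A pod whose log shows repeated start attempts together with one of these signatures is a candidate for deleting and recreating, and the pod ID is worth including in a support ticket.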