ethan
ethan
RRunPod
Created by ethan on 6/4/2024 in #⛅|pods
Strange unix and/or user perms issue with command in dockerfile/replacement command
I have a bash script in my pod which, as part of its last command, executes mpirun with some target process. When running this command using bash <script> as the dockerfile's entrypoint, or using runpod's replacement command, the following issue occurs:
2024-06-04T00:27:41.763661289Z Per request, Open MPI attempted to set a system resource
2024-06-04T00:27:41.763672184Z limit to a given value:
2024-06-04T00:27:41.763682241Z
2024-06-04T00:27:41.763690901Z Resource: openfiles
2024-06-04T00:27:41.763700120Z Limit: max
2024-06-04T00:27:41.763708781Z
2024-06-04T00:27:41.763718000Z The system has refused to allow this operation. This is likely
2024-06-04T00:27:41.763726660Z due to a permission limitation, or specifying an unsupported
2024-06-04T00:27:41.763735320Z value. Please check the system or remove the request and try
2024-06-04T00:27:41.763744539Z again.
2024-06-04T00:27:41.763753200Z --------------------------------------------------------------------------
2024-06-04T00:27:41.763766609Z
2024-06-04T00:27:41.763784489Z Failed initializing opal: -43
2024-06-04T00:27:41.763661289Z Per request, Open MPI attempted to set a system resource
2024-06-04T00:27:41.763672184Z limit to a given value:
2024-06-04T00:27:41.763682241Z
2024-06-04T00:27:41.763690901Z Resource: openfiles
2024-06-04T00:27:41.763700120Z Limit: max
2024-06-04T00:27:41.763708781Z
2024-06-04T00:27:41.763718000Z The system has refused to allow this operation. This is likely
2024-06-04T00:27:41.763726660Z due to a permission limitation, or specifying an unsupported
2024-06-04T00:27:41.763735320Z value. Please check the system or remove the request and try
2024-06-04T00:27:41.763744539Z again.
2024-06-04T00:27:41.763753200Z --------------------------------------------------------------------------
2024-06-04T00:27:41.763766609Z
2024-06-04T00:27:41.763784489Z Failed initializing opal: -43
However, if I simply enter the container and run the exact same script (same command, bash <script>), it runs without issue. In both cases, I am acting as the root user
22 replies
RRunPod
Created by ethan on 5/28/2024 in #⛅|pods
Container Files Missing in Workspace On Pod Launch
When launching pods (a40) on both community and server cloud, using a custom image that populates /workspace as a volume, the expected files and directories don't show up. This worked as of last Friday, and the image has not changes on its github container repo. There is more than enough space on both the network and disk volumes to contain these files
19 replies
RRunPod
Created by ethan on 3/21/2024 in #⛅|pods
Shell asks for a password when I try to ssh to a secure cloud pod (with correct public key set)
I have a correctly formatted public key set, I have ssh enabled. Still asks for a password when I ssh in
2 replies