DreamGen
RRunPod
Created by DreamGen on 5/24/2024 in #⛅|pods
Network issue ETA?
They are back up; it took >1 hour. I received a generic "There seems to have been a possible issue with the server that one or more of your pods is hosted on." message (before the servers were actually back up). Not a great experience.
5 replies
RRunPod
Created by DreamGen on 5/24/2024 in #⛅|pods
Network issue ETA?
No description
5 replies
RRunPod
Created by DreamGen on 5/19/2024 in #⛅|pods
Feature Request: `runpodctl send` TO specific machine & folder (ala SCP)
Thanks, appreciate it! Sorry for posting in the wrong channel 😄
17 replies
RRunPod
Created by DreamGen on 5/19/2024 in #⛅|pods
Feature Request: `runpodctl send` TO specific machine & folder (ala SCP)
You made a mistake reading the post. This was a feature request. I mentioned that this functionality can be replicated by using `runpodctl send foo` + `ssh machine 'runpodctl receive ...'`
17 replies
RRunPod
Created by DreamGen on 5/19/2024 in #⛅|pods
Feature Request: `runpodctl send` TO specific machine & folder (ala SCP)
And, you are wrong! 😄 If you have a public IP, then the scheme above works perfectly fine -- the problem is that servers without a public IP also don't have a proper SSH setup (it won't let you execute commands remotely)
17 replies
RRunPod
Created by DreamGen on 5/19/2024 in #⛅|pods
Feature Request: `runpodctl send` TO specific machine & folder (ala SCP)
many server types aren't available with public IP
17 replies
RRunPod
Created by DreamGen on 5/19/2024 in #⛅|pods
Feature Request: `runpodctl send` TO specific machine & folder (ala SCP)
SCP does not work on servers without public IP
17 replies
RRunPod
Created by DreamGen on 5/19/2024 in #⛅|pods
Feature Request: `runpodctl send` TO specific machine & folder (ala SCP)
Actually `ssh machine 'cd /workspace && runpodctl receive ...'` will not work on machines without a public IP 😕
17 replies
RRunPod
Created by DreamGen on 4/17/2024 in #⛅|pods
A6000 price change based on # GPUS?
No description
3 replies
RRunPod
Created by Alexm on 4/1/2024 in #⛅|pods
l40s "no ressources available"
Same
91 replies
RRunPod
Created by DreamGen on 3/16/2024 in #⛅|pods
UserWarning: CUDA initialization: Unexpected error from cudaGetDeviceCount(). Did you run some cuda
I switched to a 12.3 machine and that worked in this case. In other cases it was the opposite 😄
5 replies
RRunPod
Created by DreamGen on 2/25/2024 in #⛅|pods
Broken CUDA / PyTorch on H100
Thanks for sharing! I don't think I can do much about the installed drivers on the machine, and there were no machines with other drivers.
26 replies
RRunPod
Created by DreamGen on 2/25/2024 in #⛅|pods
Broken CUDA / PyTorch on H100
I already removed the pod, it's >$15/hour and I did not want to just waste money, sorry -- will try back some other time
26 replies
RRunPod
Created by DreamGen on 2/25/2024 in #⛅|pods
Broken CUDA / PyTorch on H100
tried re-creating, tried reinstalling several times, did not work, gave up
26 replies
RRunPod
Created by DreamGen on 2/25/2024 in #⛅|pods
Broken CUDA / PyTorch on H100
It was SXM
26 replies
RRunPod
Created by DreamGen on 2/25/2024 in #⛅|pods
Broken CUDA / PyTorch on H100
nvcc --version
root@2583eec93fb6:/workspace/axolotl# nvcc --version
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2022 NVIDIA Corporation
Built on Wed_Sep_21_10:33:58_PDT_2022
Cuda compilation tools, release 11.8, V11.8.89
Build cuda_11.8.r11.8/compiler.31833905_0
26 replies
RRunPod
Created by DreamGen on 2/25/2024 in #⛅|pods
Broken CUDA / PyTorch on H100
26 replies
RRunPod
Created by DreamGen on 2/25/2024 in #⛅|pods
Broken CUDA / PyTorch on H100
nvidia-smi
26 replies