David Mack
David Mack
RRunPod
Created by David Mack on 6/12/2024 in #⛅|pods
n00b multi gpu question
anyway, i think i'm good now, thank you 🙂
17 replies
RRunPod
Created by David Mack on 6/12/2024 in #⛅|pods
n00b multi gpu question
!pip install "unsloth[cu121-ampere-torch220] @ git+https://github.com/unslothai/unsloth.git" !pip install --no-deps xformers trl peft accelerate bitsandbytes datasets
17 replies
RRunPod
Created by David Mack on 6/12/2024 in #⛅|pods
n00b multi gpu question
initially installing hugging face and unsloth
17 replies
RRunPod
Created by David Mack on 6/12/2024 in #⛅|pods
n00b multi gpu question
training LLMs via hugging face DPO trainer
17 replies
RRunPod
Created by David Mack on 6/12/2024 in #⛅|pods
n00b multi gpu question
My current money is on one of the pip installs (hugging face, unsloth) re-installed pytorch and broke the pod's setup
17 replies
RRunPod
Created by David Mack on 6/12/2024 in #⛅|pods
n00b multi gpu question
I'll update this thread if i see flakiness
17 replies
RRunPod
Created by David Mack on 6/12/2024 in #⛅|pods
n00b multi gpu question
Alright so, I restarted the pod (with the env var you suggested) and CUDA reported zero gpus Then I removed the env var, restarted, and CUDA now reports four GPUS. no change from previous code/config Either: - somehow the pip install commands messed up CUDA, and restarting fixed that - runpod is flakey on if the gpus get attached or not
17 replies
RRunPod
Created by David Mack on 6/12/2024 in #⛅|pods
n00b multi gpu question
Thanks!!!!
17 replies