digigoblin
RRunPod
•Created by BadNoise on 7/5/2024 in #⚡|serverless
Pipeline is not using gpu on serverless
Yep, the GPU utilization telemetry always confuses people because its not real-time
70 replies
RRunPod
•Created by BadNoise on 7/5/2024 in #⚡|serverless
Pipeline is not using gpu on serverless
check nvidia-smi
70 replies
RRunPod
•Created by BadNoise on 7/5/2024 in #⚡|serverless
Pipeline is not using gpu on serverless
That telemetry is not real time and not reliable
70 replies
RRunPod
•Created by BadNoise on 7/5/2024 in #⚡|serverless
Pipeline is not using gpu on serverless
Its using GPU if the GPU memory is showing as used
70 replies
RRunPod
•Created by BadNoise on 7/5/2024 in #⚡|serverless
Pipeline is not using gpu on serverless
Assming your base image is CUDA 12.1
70 replies
RRunPod
•Created by smoke on 7/6/2024 in #⚡|serverless
CUDA out of memory (80GB GPU)
Must be something wrong with your implementation. Nobody can help you unless you share the code.
8 replies
RRunPod
•Created by smoke on 7/6/2024 in #⚡|serverless
CUDA out of memory (80GB GPU)
Thats ridiculous for training a LoRA, you can do full Dreambooth training with 24GB
8 replies
RRunPod
•Created by blabbercrab on 7/5/2024 in #⚡|serverless
Serverless is timing out before full load
Why do you need to load 30 Lora anyway?
21 replies
RRunPod
•Created by blabbercrab on 7/5/2024 in #⚡|serverless
Serverless is timing out before full load
You can't really fix this, RunPod expects the handler to kick in within a certain time period.
21 replies
RRunPod
•Created by DDV-PYRMD on 7/3/2024 in #⚡|serverless
serverless
It doesn't have xformers etc, use the link I sent, it works well.
9 replies
RRunPod
•Created by BadNoise on 7/5/2024 in #⚡|serverless
Pipeline is not using gpu on serverless
its already doing that
70 replies
RRunPod
•Created by BadNoise on 7/5/2024 in #⚡|serverless
Pipeline is not using gpu on serverless
Yep, will be useful for us to help you test it
70 replies
RRunPod
•Created by ssssteven on 7/3/2024 in #⚡|serverless
network connections are very slow, Failed to return job results.
Problem seems to be a connection timeout to api.runpod.ai, I've also seen this in my endpoint logs from time to time. The connection shouldn't time out.
6 replies
RRunPod
•Created by BadNoise on 7/5/2024 in #⚡|serverless
Pipeline is not using gpu on serverless
How are you running the model?
70 replies