R
RunPod9mo ago
Bryan

Why are secure cloud pods so slow?

I'm pretty sure I just wasted a few hours of time trying to find a decent pod that isn't being bottlenecked by it's other hardware. I only managed to find 1 pod a few days ago that was giving me 3 it/s while training a model and it was a community pod.
8 Replies
ashleyk
ashleyk9mo ago
What kind of model are you training? Are you using Kohya_ss or something else? What kind of GPU are you using and which region of secure cloud are you using?
Bryan
BryanOP9mo ago
been trying to train a LORA for SDXL, been trying 3090's, 4090's, tried an a100 havent been picking a region, any suggestions?
ashleyk
ashleyk9mo ago
Do you know which region you're getting when you're auto assigned one?
Bryan
BryanOP9mo ago
says CZ most of the time
ashleyk
ashleyk9mo ago
Okay, I'll run some tests, not sure whether its the slow disk causing it to be slow.
Bryan
BryanOP9mo ago
someone suggested in another thread it could be old CPU's but they get away with it because they only show vCPU count and nothing else
ashleyk
ashleyk9mo ago
Yeah CPU can have some impact but once everything is loaded and the training starts, it should be mostly using GPU not CPU. If I check the CPU usage while training is in progress, its very low, while GPU utilization is basically maxed out.
Bryan
BryanOP9mo ago
one thing I noticed about the fast pod I received once was the beginning wheel for Kohya loaded FAST
Want results from more Discord servers?
Add your server