Very Slow Mapping
Hello! I am trying to run dataset.map(), and it takes only a few minutes when I run it on Colab. However, when I run it on any machine on RunPod, the progress estimate shows several hours remaining. I reported this to Support, but there is no solution yet. Has anyone faced a similar issue, and how did you solve it? The code below is for pre-processing an audio dataset for Whisper fine-tuning. Thanks!
5 Replies
The same code and dataset size?
And where do you store the dataset?
What library do you use in that code?
Transformers
yeah
Same everything.
RunPod "network volume"
Ooh hmm, a network volume is a bit slower than usual
But I haven't done fine tuning for that yet
Maybe select NVMe when creating pods and filter for 9 vCPUs
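If it helps to check the two guesses above (slow storage and few vCPUs) before changing pods, here is a minimal sketch using only the Python standard library. It prints how many CPUs Python can see and roughly measures write speed in a directory. The helper name write_throughput_mb_s and the /workspace path are assumptions for illustration; point it at wherever your network volume and local disk are actually mounted. It is a rough estimate, not a proper benchmark.

```python
import os
import tempfile
import time


def write_throughput_mb_s(directory, size_mb=64):
    """Write size_mb MiB into a temp file in `directory`; return rough MiB/s."""
    chunk = os.urandom(1024 * 1024)  # 1 MiB of random bytes
    with tempfile.NamedTemporaryFile(dir=directory) as f:
        start = time.perf_counter()
        for _ in range(size_mb):
            f.write(chunk)
        f.flush()
        os.fsync(f.fileno())  # push the data to the storage device
        elapsed = time.perf_counter() - start
    # The temp file is deleted automatically when the block exits.
    return size_mb / elapsed


if __name__ == "__main__":
    print("CPUs visible to Python:", os.cpu_count())
    # Assumed paths: "/workspace" stands in for the network volume mount,
    # and the system temp dir stands in for local disk. Adjust both.
    for path in ("/workspace", tempfile.gettempdir()):
        if os.path.isdir(path) and os.access(path, os.W_OK):
            rate = write_throughput_mb_s(path, size_mb=64)
            print(f"{path}: about {rate:.1f} MiB/s write")
```

If the network volume turns out to be much slower than local disk, keeping the dataset and its cache on local storage during preprocessing may help. If the pod has several CPUs, dataset.map() also accepts a num_proc argument to spread the work across processes.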