Cajoek
RunPod
Created by Cajoek on 4/11/2024 in #⛅|pods
ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memor
Can't train anything with a batch size larger than 16 😦 I just get Killed now
11 replies
@Papa Madiator Is shm the same for all pod types?
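One way to find out for a given pod is to measure it from inside the container. A minimal sketch, assuming a Linux container where shared memory is mounted at `/dev/shm`; the helper name `shm_usage_gib` is made up for illustration:

```python
import os
import shutil


def shm_usage_gib(path="/dev/shm"):
    """Return (total, used, free) in GiB for the given mount, or None if it is missing."""
    if not os.path.isdir(path):
        return None
    usage = shutil.disk_usage(path)
    gib = 1024 ** 3
    # Round for readability; the raw byte counts are in `usage` if needed.
    return tuple(round(v / gib, 2) for v in (usage.total, usage.used, usage.free))


if __name__ == "__main__":
    # Prints None on systems without /dev/shm (for example, non-Linux hosts).
    print(shm_usage_gib())
```

Running this on two different pod types and comparing the totals would show whether the shm allocation differs between them.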
Sorry, I'm training an image transformer model with PyTorch, using the RunPod PyTorch 2.2.0 image.
But now I'm only utilizing 20% of GPU memory
Also, changing the number of dataloader workers seems to have an effect
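That would fit how PyTorch's `DataLoader` works: worker processes hand batches to the main process through shared memory, and each worker loads `prefetch_factor` batches in advance (2 by default). A rough back-of-the-envelope sketch, assuming float32 image batches and ignoring labels and other overhead; the function names and the 3x224x224 image shape are illustrative assumptions, not details from this thread:

```python
def batch_bytes(batch_size, channels=3, height=224, width=224, bytes_per_value=4):
    """Approximate size of one image batch in bytes (float32 by default)."""
    return batch_size * channels * height * width * bytes_per_value


def estimated_shm_bytes(batch_size, num_workers, prefetch_factor=2, **image_kwargs):
    """Rough upper bound on shared memory held by prefetched batches."""
    return num_workers * prefetch_factor * batch_bytes(batch_size, **image_kwargs)


def max_workers_for(shm_free_bytes, batch_size, prefetch_factor=2, **image_kwargs):
    """Largest worker count whose prefetched batches fit in the given shm budget."""
    per_worker = prefetch_factor * batch_bytes(batch_size, **image_kwargs)
    return shm_free_bytes // per_worker


if __name__ == "__main__":
    mib = 1024 ** 2
    # 64 MiB is Docker's default --shm-size; the actual pod value may differ.
    print(estimated_shm_bytes(batch_size=32, num_workers=8) / mib, "MiB needed")
    print(max_workers_for(64 * mib, batch_size=32), "worker(s) fit in 64 MiB")
```

If the estimate is larger than the free space in `/dev/shm`, lowering `num_workers` or the batch size are the usual levers, which would match the behaviour described here.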
Okay, do you know how to avoid the problem?