Igor Gulamov
RunPod
Created by Igor Gulamov on 2/11/2025 in #⛅|pods
deepseek-r is loading for >1h into vram.
16 replies
The function that reads the .tensor file and loads it onto the GPU takes an extremely long time. mmap maps the file into memory so the data can go from SSD to VRAM without allocating a separate buffer in RAM.
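To make the mmap point concrete, here is a minimal sketch using only Python's standard `mmap` module. It is not the loader's actual code: `read_slice_via_mmap` and `make_demo_file` are hypothetical helpers, and the demo file stands in for a checkpoint shard. Pages are read from disk only when the mapped slice is touched, which is why slow storage shows up as slow loading.

```python
import mmap
import os
import tempfile


def read_slice_via_mmap(path, offset, length):
    """Return `length` bytes starting at `offset`, read through a memory map.

    The file is mapped read-only; the OS pulls pages from storage lazily,
    only when the slice below is actually accessed.
    """
    with open(path, "rb") as f:
        with mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ) as mm:
            return bytes(mm[offset:offset + length])


def make_demo_file(payload):
    """Write `payload` to a temporary file and return its path (demo only)."""
    fd, path = tempfile.mkstemp(suffix=".bin")
    with os.fdopen(fd, "wb") as f:
        f.write(payload)
    return path
```

Usage: `path = make_demo_file(b"0123456789")` followed by `read_slice_via_mmap(path, 2, 4)` reads four bytes from the middle of the file without reading the rest.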
I read on GitHub that this is often related to mmap over network drives, but I'm not sure.
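One way to check the network-drive theory is to see which filesystem the model directory lives on. The sketch below parses text in the Linux `/proc/mounts` format; `fs_type_for`, `is_network_fs`, and the `NETWORK_FS` set are hypothetical names, and the set is a non-exhaustive assumption about which types count as network filesystems.

```python
# Filesystem types treated as "network" here; an assumption, not a full list.
NETWORK_FS = {"nfs", "nfs4", "cifs", "smb3", "ceph", "lustre",
              "fuse.sshfs", "fuse.glusterfs"}


def fs_type_for(path, mounts_text):
    """Return (mount_point, fs_type) for the longest mount point containing path.

    `mounts_text` is expected in /proc/mounts format:
    device mount_point fs_type options dump pass
    """
    best = ("", "")
    target = path.rstrip("/") + "/"
    for line in mounts_text.splitlines():
        parts = line.split()
        if len(parts) < 3:
            continue
        mount_point, fs_type = parts[1], parts[2]
        prefix = mount_point.rstrip("/") + "/"
        # Later entries win on ties, matching how stacked mounts shadow earlier ones.
        if target.startswith(prefix) and len(mount_point) >= len(best[0]):
            best = (mount_point, fs_type)
    return best


def is_network_fs(fs_type):
    """True if the filesystem type is in the assumed network set."""
    return fs_type in NETWORK_FS

# Example on a Linux host (the model path is a placeholder):
# with open("/proc/mounts") as f:
#     print(fs_type_for("/path/to/model", f.read()))
```

If the reported type is a network filesystem, the mmap page faults go over the network, which would be consistent with the slow shard loading described here.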
No OOM; this is 8x MI300X.
It has been loading safetensors checkpoint shards for 1 hour.
I use my own Docker container with SGLang inside. For ROCm you only have PyTorch, no vLLM or SGLang. I use a model that is already downloaded to the disk. There is no space to load a second copy: it is 670 GB in total, and downloading it takes 4 hours, not 1.
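For scale, the figures in this thread imply rough average throughputs. This is a back-of-the-envelope sketch, assuming decimal units (1 GB = 1000 MB) and that the full 670 GB is transferred in the stated time; `implied_throughput_mb_s` is a hypothetical helper. Since loading had not finished after an hour, the real load rate is below the 1-hour figure.

```python
def implied_throughput_mb_s(size_gb, hours):
    """Average throughput in MB/s needed to move `size_gb` in `hours`."""
    return size_gb * 1000 / (hours * 3600)


# 670 GB in 1 hour (upper bound on the observed load rate)
print(round(implied_throughput_mb_s(670, 1), 1))  # 186.1

# 670 GB in 4 hours (the download time mentioned above)
print(round(implied_throughput_mb_s(670, 4), 1))  # 46.5
```

Both numbers are well below what a local NVMe SSD normally sustains for sequential reads, which fits the suspicion that the storage path is the bottleneck rather than the GPUs.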