Baking model into Dockerimage
Hello, im trying to bake or rather, downlaod the model via vllm directly while building so the image contains the model. I havent found any kind of simple "vllm download" command sadly. The onlny was is either by running vllm and afterwards adding the file to the image which would be too big to host on my registry or let runpod serverless build the image for me with its doing it while building
4 Replies
Why via vllm? You can do this step in the Dockerfile
wdym? I am not sure how i can prewarm vllm viwthout using vllm to download it as its doing more than just downloding it from huggingface. Im not sure if it needs more but im also unsure how exaxctly to do otherwise
If you want to bake a model into the container image you have to do it while building it. You can't do it afterwards without rebuilding it again, which can't be done on RunPod.
To download or copy a model and bake it into the image you can do:
or
If you want to bake a model into the container image you have to do it while building it. You can't do it afterwards without rebuilding it again, which can't be done on RunPod.Exactly thats what i mean, my first thought would simply be using vvlm as its building the neccessary fodler structure but i think you can just skip this by using --model and the local path with that you sent right? will test it out thanks ^^