Can you now run gemma 3 in the vllm container?
In serverless it seems I'm getting an error, any help with this?
18 Replies
can you send the error
I deleted it, but it seems that because Gemma 3 is a new model, the transformers version is relatively outdated, afaik?
hmm did you use vLLM? or only transformers?
yeah maybe it's a good thing to check compatibility first
I used the preset vLLM; llama 3.2b worked but the new Gemma 3 didn't
vLLM needs to publish an update first unfortunately
You can use vLLM directly from the main branch, but that's not super easy if you're using our vLLM template iirc
I think we can easily update and build our own vLLM template from that vllm-worker repo on GitHub
Just update the requirements.txt, or wherever vLLM gets installed
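A minimal sketch of what that update could look like: bump the pinned vLLM version in the worker repo's requirements file before rebuilding the image. The file name, pin format, and target version here are assumptions, not the actual repo contents.

```python
# Hypothetical sketch: bump the vLLM pin in a cloned vllm-worker checkout.
# Assumes the repo pins vLLM in requirements.txt as "vllm==X.Y.Z".
import re
from pathlib import Path

def bump_vllm_pin(requirements: Path, new_version: str) -> str:
    """Rewrite the vllm pin in-place and return the new file contents."""
    text = requirements.read_text()
    # Replace any existing "vllm==..." line with the new pinned version.
    text = re.sub(r"^vllm==.*$", f"vllm=={new_version}", text, flags=re.M)
    requirements.write_text(text)
    return text

# Simulate the repo's requirements file, then bump to a Gemma-3-capable release.
req = Path("requirements.txt")
req.write_text("vllm==0.7.3\ntransformers>=4.48\n")
print(bump_vllm_pin(req, "0.8.0"))
# Afterwards you would rebuild and push the worker image, e.g.:
#   docker build -t my-vllm-worker .   (image name is an example)
```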
Looks like vLLM v0.8.0 added Gemma 3 support. Will the serverless vLLM be updated soon?
usually it's delayed, so probably a few days or weeks late
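A quick way to check whether the environment you're on is new enough: compare the installed vLLM version against v0.8.0, the release the thread says added Gemma 3 support. The version floor and the simple tuple parsing below are assumptions for illustration.

```python
# Sketch: check whether the installed vLLM is recent enough for Gemma 3.
from importlib.metadata import version, PackageNotFoundError

GEMMA3_MIN = (0, 8, 0)  # per the thread, Gemma 3 support landed in v0.8.0

def parse(v: str) -> tuple:
    """Naive version parse: take the leading numeric dotted parts."""
    return tuple(int(p) for p in v.split(".")[:3] if p.isdigit())

try:
    installed = parse(version("vllm"))
    if installed >= GEMMA3_MIN:
        print("vLLM should support Gemma 3")
    else:
        print("vLLM too old for Gemma 3, upgrade to >= 0.8.0")
except PackageNotFoundError:
    print("vLLM is not installed")
```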
Hi, I have the same issue. Have you resolved it? If so, please help me out with it too
I used Ollama
Okay
yes I think vLLM is updated already
I deployed an endpoint to try to call gemma3:4b, but nothing happens when I call it. Has anybody managed?
Yes it works
I've just tried it for you!
you need access granted on HF plus an HF token to access it
use vLLM to configure it, and check the "allow remote code" option in the config (in the RunPod menu when configuring vLLM)
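Once the endpoint is configured with the HF token and remote-code option, calling it could look roughly like this. The `/runsync` URL shape and the `input`/`prompt` payload follow RunPod's serverless convention as I understand it, but the endpoint ID, key, and sampling parameters are placeholders, so treat this as a hedged sketch rather than the exact API.

```python
# Sketch: build a request to a RunPod serverless vLLM endpoint.
# Endpoint ID, API key, and payload fields below are placeholder assumptions.
import json
import urllib.request

def build_request(endpoint_id: str, api_key: str, prompt: str) -> urllib.request.Request:
    """Assemble the /runsync request without sending it."""
    url = f"https://api.runpod.ai/v2/{endpoint_id}/runsync"
    body = json.dumps({
        "input": {
            "prompt": prompt,
            "sampling_params": {"max_tokens": 128},  # example parameter
        }
    }).encode()
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers)

req = build_request("YOUR_ENDPOINT_ID", "YOUR_RUNPOD_API_KEY", "Hello from Gemma 3")
print(req.full_url)
# To actually send it (needs network access and a valid key):
#   with urllib.request.urlopen(req) as resp:
#       print(json.load(resp))
```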