Runpod · 9mo ago
Ale

Serverless vLLM changing engine arguments

Hi, I got the vLLM Serverless worker up and running, but I want to change one engine argument that cannot be overridden through environment variables, specifically --limit-mm-per-prompt. How could I do that with your custom image runpod/worker-v1-vllm:v2.3.0stable-cuda12.1.0 that the endpoints use? Thanks