Ale
Ale
RRunPod
Created by Ale on 4/15/2025 in #⚡|serverless
Serverless vLLM changing engine arguments
Hi, I got vLLM Serverless worker up and running, but want to change one engine argument (which is not overridable through environment variables), specifically --limit-mm-per-prompt , how could I do that with your custom image runpod/worker-v1-vllm:v2.3.0stable-cuda12.1.0 that endpoints use? Thanks
10 replies