Why is 125M from facebook loading into VLLM quickdeploy even though another model is specified?

Specified a qwen variant - get facebook opt125m deployed instead.

1 Reply

sslan•4mo ago

MODEL_PATH is not being passed properly. Doesn't matter which version of VLLM I use or which model from HF is used

We're a community of enthusiasts, engineers, and enterprises, all sharing insights on AI, Machine Learning and GPUs!

15KMembers