Why is 125M from facebook loading into VLLM quickdeploy even though another model is specified?

Specified a qwen variant - get facebook opt125m deployed instead.
1 Reply
sslan
sslan4w ago
MODEL_PATH is not being passed properly. Doesn't matter which version of VLLM I use or which model from HF is used
Want results from more Discord servers?
Add your server