Why is 125M from facebook loading into VLLM quickdeploy even though another model is specified?
Specified a qwen variant - get facebook opt125m deployed instead.
1 Reply
MODEL_PATH is not being passed properly. Doesn't matter which version of VLLM I use or which model from HF is used