bradfox2
bradfox2
RRunPod
Created by bradfox2 on 11/6/2024 in #⚡|serverless
Why is 125M from facebook loading into VLLM quickdeploy even though another model is specified?
Specified a qwen variant - get facebook opt125m deployed instead.
2 replies