bradfox2 Posts - Answer Overflow

Topics

bradfox2

•Created by bradfox2 on 11/6/2024 in #⚡｜serverless

Why is 125M from facebook loading into VLLM quickdeploy even though another model is specified?

Specified a qwen variant - get facebook opt125m deployed instead.

2 replies