xxxyyy
RRunPod
•Created by xxxyyy on 11/11/2024 in #⚡|serverless
Chat completion (template) not working with VLLM 0.6.3 + Serverless
There isn't any reported error on the Qwen Github regarding the chat template (it uses the SAME template as a model that was released months ago), so i suspect this is a runpod specific error?
4 replies
RRunPod
•Created by xxxyyy on 11/11/2024 in #⚡|serverless
Chat completion (template) not working with VLLM 0.6.3 + Serverless
Here's a partial error from server-end:
4 replies
RRunPod
•Created by xxxyyy on 11/11/2024 in #⚡|serverless
Chat completion (template) not working with VLLM 0.6.3 + Serverless
This request runs fine without error:
But this request give me error:
4 replies