R
RunPod4mo ago
Jason

Serverless vllm running but still downloading?

Title says it all and this shouldn't happen
No description
No description
No description
5 Replies
Jason
JasonOP4mo ago
still
No description
Jason
JasonOP4mo ago
anyways, how to replicate: create a vllm endpoint, then create a network storage, then attach ONLY after its created and all ready Then send requests when 1 worker is ready in the latest workers, to openAI endpoint @yhlong00000
Poddy
Poddy4mo ago
@nerdylive
Escalated To Zendesk
The thread has been escalated to Zendesk!
Jason
JasonOP4mo ago
i'll try to
River
River4mo ago
Hi there. This is not optimized behavior, but it is expected behavior., Since when you switch to using a network volume, all the running containers need to be removed, and need to be changed to be from the same data center the network volume is in I'll speak to the team about optimizing this behavior, but this is an edge case do we likely will take some time to do this. Nonetheless, the downloading new data occurs because we need to change all the Workers to be only from the data center connected to the network volume. Yup that should work, Keep it in the same data centre as uh the network volume would you just be able to always use the network volume hre?

Did you find this page helpful?