morrow
morrow
RRunPod
Created by morrow on 2/14/2025 in #⚡|serverless
my serverless worker is downloading models to `/runpod-volume/.cache/huggingface` by itself
Hello, I don't use any network volume so I don't understand why /runpod-volume exist at all, but also I have a HF_HOME env var that point somewhere else and it seem huggingface is targeting /runpod-volume without explanation. Did I miss something ? Is that related to the new caching feature I was told about a few weeks ago ?
2 replies
RRunPod
Created by morrow on 1/2/2025 in #⚡|serverless
Best way to cache models with serverless ?
Hello, I'm using serverless endpoint to do image generation with flux dev. The model is 22gb which is quite long to download, especially since some workers seem to be faster than some others. I've been using a network volume as a cache which greatly improve start up time. However, doing this lock me in a particular region which I believe make some GPUs like the A100 very rarely available. Is there a way to have a global huggingface cache with serverless endpoint ? (like with pods) Thanks
6 replies
RRunPod
Created by morrow on 12/8/2024 in #⚡|serverless
Serverless github endpoint stuck at uploading phase
No description
11 replies