morrow Posts - Answer Overflow

morrow

•Created by morrow on 2/14/2025 in #⚡｜serverless

my serverless worker is downloading models to `/runpod-volume/.cache/huggingface` by itself

Hello, I don't use any network volume so I don't understand why /runpod-volume exist at all, but also I have a HF_HOME env var that point somewhere else and it seem huggingface is targeting /runpod-volume without explanation. Did I miss something ? Is that related to the new caching feature I was told about a few weeks ago ?

2 replies

RRunPod

•Created by morrow on 1/2/2025 in #⚡｜serverless

Best way to cache models with serverless ?

Hello, I'm using serverless endpoint to do image generation with flux dev. The model is 22gb which is quite long to download, especially since some workers seem to be faster than some others. I've been using a network volume as a cache which greatly improve start up time. However, doing this lock me in a particular region which I believe make some GPUs like the A100 very rarely available. Is there a way to have a global huggingface cache with serverless endpoint ? (like with pods) Thanks

6 replies

RRunPod

•Created by morrow on 12/8/2024 in #⚡｜serverless

Serverless github endpoint stuck at uploading phase

11 replies

Gaming

Programming