JohnDoe
RRunPod
•Created by JohnDoe on 10/3/2024 in #⚡|serverless
Flux.1 Schnell Serverless Speeds
I currently have a diffusers only pipeline so that would work. However, how do I load it on start up? Do I load it before it hits the inference function? As it seems to .to('cuda') part is where I'm coming unstuck
4 replies
RRunPod
•Created by JohnDoe on 5/22/2024 in #⚡|serverless
Mixed Delay Times
How does flashboot work?
11 replies
RRunPod
•Created by JohnDoe on 5/22/2024 in #⚡|serverless
Mixed Delay Times
I don't think workers are an issue
11 replies
RRunPod
•Created by JohnDoe on 5/22/2024 in #⚡|serverless
Mixed Delay Times
I'm guessing I can't control the cold start time?
11 replies