pg2571
RRunPod
•Created by pg2571 on 11/16/2024 in #⚡|serverless
Not able to connect to the local test API server
I am running the container on an EC2 instance.
I keep getting errors like:
I've hit it as localhost, 0.0.0.0, 127.0.0.1. I've tried from inside the container too. It still throws this error....
4 replies
RRunPod
•Created by pg2571 on 11/16/2024 in #⚡|serverless
Network volume vs baking in model into docker
I want to run a serverless worker that can get called anywhere from once per hour to 300-400/hour.
I want to optimize for cold starts when the occasional request comes in.
it runs SDXL, a checkpoint, a few controlnets, etc. About 15-20GB in total.
Am I better off using a network volume or putting the images directly into the docker image?
For the initial pod setup on the first request, does it download the entire docker image each time? And thats gonna be slower than reading the network volume?
I saw this earlier: https://www.answeroverflow.com/m/1194344244563943604
3 replies