RunPod queue not processing

Hey, I'm using Kandinsky 2.1 deployed as a serverless application. I hit the run endpoint and the request was queued. When I check the status by ID, it is still in_queue. Can anyone help resolve this issue?
77 Replies
ashleyk • 8mo ago
Works for me. Did you deploy it yourself, or are you using the RunPod managed one?
rafael21@ • 8mo ago
He is using 3.0, not 2.1, and he deployed it himself.
ashleyk • 8mo ago
You guys work together?
rafael21@ • 8mo ago
Yes
ashleyk • 8mo ago
Are all your workers throttled?
rafael21@ • 8mo ago
Look:
[image attached]
ashleyk • 8mo ago
Why do you only have 1 worker? You should leave it at the default of 3. Your 1 worker is throttled, so that's why your requests are stuck in the queue.
ashleyk • 8mo ago
A worker can only accept requests from the queue when it shows as Ready, like this:
[image attached]
rafael21@ • 8mo ago
Changed it here to 3.
rafael21@ • 8mo ago
[image attached]
rafael21@ • 8mo ago
Maybe the problem is the worker code...
rafael21@ • 8mo ago
[image attached]
ashleyk • 8mo ago
Are you using network storage? It's not a problem with the worker code; all your workers are throttled again. Try changing to a different GPU tier instead of 24GB, such as 24GB PRO or 48GB. You need to scale your workers down to zero and back up again for the change to take effect.
rafael21@ • 8mo ago
Look, bro:
[image attached]
ashleyk • 8mo ago
Is that your own image or someone else's? It has the same name as yours, so I assume it's yours. If you made the repository private on Docker Hub, you need to add the registry credentials to RunPod as well as to your endpoint template.
ashleyk • 8mo ago
[image attached]
ashleyk • 8mo ago
Then select the credentials here in your serverless template.
[image attached]
ashleyk • 8mo ago
Then scale max workers down to zero and back up again for the changes to take effect.
rafael21@ • 8mo ago
It is my image, and the repo is currently public. The AMD64 label says: "Image may have poor performance, or fail, if run via emulation"
ashleyk • 8mo ago
Didn't you build it for Linux? How did you build it?
rafael21@ • 8mo ago
My computer runs macOS.
ashleyk • 8mo ago
Also, don't use the latest tag; that is bad practice for Serverless. It's fine for Pods, but not for Serverless. For Serverless you should use version tags and releases.
rafael21@ • 8mo ago
Sure, thanks
ashleyk • 8mo ago
Different techniques to build docker / OCI containers for k8s deployments on Apple M1 and later architectures or any architecture different than your target cluster's architecture
ashleyk • 8mo ago
Basically use buildx and --platform linux/amd64.
rafael21@ • 8mo ago
Should I set the tag to something like version1?
ashleyk • 8mo ago
Semantic Versioning 2.0.0: Semantic Versioning spec and website
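As a rough illustration of the version-tag advice above, the sketch below checks whether a tag has the plain MAJOR.MINOR.PATCH shape that Semantic Versioning describes. The helper name `is_semver_tag` is made up for this example, and the check is deliberately simplified: it does not accept the pre-release or build-metadata suffixes that the full spec allows.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeeds when a tag looks like MAJOR.MINOR.PATCH.
# Simplified on purpose: pre-release and build-metadata suffixes from the
# full Semantic Versioning spec are not accepted here.
is_semver_tag() {
  printf '%s\n' "$1" | grep -Eq '^(0|[1-9][0-9]*)\.(0|[1-9][0-9]*)\.(0|[1-9][0-9]*)$'
}

# Try a few candidate tags and report which ones fit the pattern.
for tag in 1.0.0 version1 latest; do
  if is_semver_tag "$tag"; then
    echo "$tag: ok"
  else
    echo "$tag: not MAJOR.MINOR.PATCH"
  fi
done
```

Docker itself accepts tags such as version1, so this is only a convention check, not something the registry enforces.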
rafael21@ • 8mo ago
Bro, do you know if I can do the git clone before that? I used these commands when I created the image:
git clone https://github.com/rafaelvmfranco/repo
cd repo
docker build . -t rafaelfranco21/kandinsky:latest
docker push rafaelfranco21/kandinsky:latest
ashleyk • 8mo ago
Oh yeah, your docker command is wrong for Mac.
docker buildx build . --platform linux/amd64 -t rafaelfranco21/kandinsky:1.0.0
Something like that should work. The git command is not really relevant, just the docker build command.
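A minimal sketch of the full build-and-push step on a Mac, assuming Docker with buildx is installed and you are already logged in to Docker Hub. The `--push` flag is an addition not in the command above: it uploads the image straight after the build, replacing the separate `docker push`. The variable names `IMAGE_REPO`, `IMAGE_VERSION`, and `RUN_BUILD` and the helper `build_command` are made up for this example.

```shell
#!/usr/bin/env bash
# Placeholders: swap in your own Docker Hub repository and version tag.
IMAGE_REPO="${IMAGE_REPO:-rafaelfranco21/kandinsky}"
IMAGE_VERSION="${IMAGE_VERSION:-1.0.0}"

# Hypothetical helper: prints the build command for a repository and version.
build_command() {
  printf 'docker buildx build . --platform linux/amd64 -t %s:%s --push\n' "$1" "$2"
}

# Dry run by default: only print the command.
# Set RUN_BUILD=1 to actually build for linux/amd64 and push the image.
if [ "${RUN_BUILD:-0}" = "1" ]; then
  docker buildx build . --platform linux/amd64 -t "${IMAGE_REPO}:${IMAGE_VERSION}" --push
else
  build_command "$IMAGE_REPO" "$IMAGE_VERSION"
fi
```

Run it from the directory that contains the Dockerfile. Printing the command first makes it easy to spot a mistyped tag before a long build starts.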
rafael21@ • 8mo ago
I got this error:
ERROR: failed to solve: process "/bin/bash -c python /cache_models.py" did not complete successfully: exit code: 1
rafael21@ • 8mo ago
[image attached]
rafael21@ • 8mo ago
[image attached]
rafael21@ • 8mo ago
[image attached]
ashleyk • 8mo ago
Your disk is out of space.
rafael21@ • 8mo ago
My computer or Docker?
ashleyk • 8mo ago
[image attached]
ashleyk • 8mo ago
Your computer, probably.
rafael21@ • 8mo ago
😅
ashleyk • 8mo ago
df -h
Should show it, or you can also check in Finder.
rafael21@ • 8mo ago
[image attached]
ashleyk • 8mo ago
The bottom one is full. Actually, I don't understand it, because it says the size is 0Bi. But google the docker prune commands; you can probably prune to free up space.
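The disk check and prune suggestion above could look roughly like this. It is a sketch, not a diagnosis of this particular build failure: the helper name `disk_use_percent` is made up, and the prune commands are left commented out because they delete data.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prints the percentage of space used on the
# filesystem that holds the given path (defaults to /).
disk_use_percent() {
  df -P "${1:-/}" | awk 'NR == 2 { gsub("%", "", $5); print $5 }'
}

echo "Root filesystem is $(disk_use_percent /)% full"

# Only query Docker when the CLI is installed and the daemon is reachable.
if command -v docker >/dev/null 2>&1 && docker info >/dev/null 2>&1; then
  # Show how much space images, containers, volumes, and build cache use.
  docker system df
  # Uncomment to reclaim space. Both commands delete data.
  # docker builder prune   # remove dangling build cache
  # docker system prune    # remove stopped containers, unused networks, dangling images
fi
```

On a Mac, Docker Desktop keeps images and build cache inside its own virtual disk with a configurable size limit, so Docker can run out of space even when `df -h` shows room on the Mac itself.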
rafael21@ • 8mo ago
I will delete some apps.
ashleyk • 8mo ago
Looks like the / disk is fine, though. Maybe your buildx has a size limit or something, I don't know.
rafael21@ • 8mo ago
Crazy thing! I'm not even a developer, hahah
ashleyk • 8mo ago
Me neither, at least not anymore.
rafael21@ • 8mo ago
What are you now? I have zero code knowledge... I'm just trying like crazy with your help.
ashleyk • 8mo ago
By the way, you can also try using https://depot.dev to build your image. They used to have a free plan, but apparently it's been removed and you have to pay now. It may be worth paying to save yourself a lot of headaches and time; as they say, "time is money".
Depot: The fastest way to build Docker images.
ashleyk • 8mo ago
They probably had to make it paid because of the large number of RunPod people who started using it 😆
rafael21@ • 8mo ago
I don't think I would know how to use it, but thanks! Bro, I created the image! I already created the new RunPod endpoint using the new image. Now I will see if it gets the Ready status.
rafael21@ • 8mo ago
Still initializing.
[image attached]
rafael21@ • 8mo ago
Too good to be true 😅
ashleyk • 8mo ago
Click on one to view the logs.
rafael21@ • 8mo ago
[image attached]
rafael21@ • 8mo ago
There is an error.
ashleyk • 8mo ago
What is the image set to in your template? Does it have a tag?
rafael21@ • 8mo ago
This: rafaelfranco21/kandinsky3.0:1:0:0
Maybe it is because the Docker repo is private, but I already created the credentials like you said.
rafael21@ • 8mo ago
[image attached]
rafael21@ • 8mo ago
The mistake is here.
ashleyk • 8mo ago
Yeah, the version should probably use dots, not colons.
rafael21@ • 8mo ago
Yes. Now I got another error:
2024-03-01T18:16:35Z error pulling image: Error response from daemon: Head "https://registry-1.docker.io/v2/rafaelfranco21/kandinsky3.0/manifests/1.0.0": unauthorized: incorrect username or password
ashleyk • 8mo ago
Looks like your registry credentials are incorrect. Double check them, or make your image public.
rafael21@ • 8mo ago
When I run docker login in the terminal, it says: Login Succeeded 😅
ashleyk • 8mo ago
Yeah, but that's probably with a username and password. You should use an access token for pulling your images and make it read-only: https://hub.docker.com/settings/security
ashleyk • 8mo ago
Like this:
[image attached]
rafael21@ • 8mo ago
There is already one of these here, so I need to create one just for RunPod? I generated it. What do I do with it, bro?
ashleyk • 8mo ago
Edit your Docker auth credentials on RunPod and put it in there.
ashleyk • 8mo ago
[image attached]
rafael21@ • 8mo ago
The username is the Docker username
[image attached]
rafael21@ • 8mo ago
and the password is the token that I just generated?
ashleyk • 8mo ago
Yes
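Before pasting the token into RunPod, it may help to confirm locally that the username and token can actually read the private image. The sketch below is one way to do that, assuming the Docker CLI is installed. The function name `verify_registry_access` and the `DOCKERHUB_TOKEN` environment variable are made up for this example.

```shell
#!/usr/bin/env bash
# Hypothetical helper: logs in with an access token taken from the
# DOCKERHUB_TOKEN environment variable (a name chosen for this sketch),
# then asks the registry for the image manifest to confirm read access.
verify_registry_access() {
  local user="$1" image="$2"
  if [ -z "${DOCKERHUB_TOKEN:-}" ]; then
    echo "DOCKERHUB_TOKEN is not set" >&2
    return 1
  fi
  # --password-stdin keeps the token out of the process list and shell history.
  printf '%s' "$DOCKERHUB_TOKEN" | docker login --username "$user" --password-stdin || return 1
  # Fetching the manifest fails with "unauthorized" if the token cannot read the image.
  docker manifest inspect "$image" >/dev/null
}

# Usage (placeholders, substitute your own values):
# DOCKERHUB_TOKEN=... verify_registry_access rafaelfranco21 rafaelfranco21/kandinsky3.0:1.0.0
```

Note that `docker login` replaces the credentials stored on your machine for that registry, so you may need to log in again with your usual account afterwards if you want to push.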
rafael21@ • 8mo ago
Ok, thanks. I will now scale workers to zero to reset it. Now it says:
2024-03-01T18:41:52Z 14ef900132c4 Downloading [=============================================> ] 18.45GB/20.07GB
There are several lines of this, so it seems that it is working.
ashleyk • 8mo ago
Yeah, it's pulling the image.
rafael21@ • 8mo ago
These extra workers are ready.
[image attached]
rafael21@ • 8mo ago
The latest workers are not.
ashleyk • 8mo ago
What do they say when you click on them?
rafael21@ • 8mo ago
Now they all say the worker is ready, but the status is initializing. Now, ready!
ashleyk • 8mo ago
Yeah, it takes a while for all the workers to become ready the first time and when you do a new release.