Cannot set TCP-Port 3000 for Dreambooth
Hi all, i want to use Dreambooth, and i tryed to set the Port via Fuser and in the Configuration of the pod, but its notpossible to open the port. Fuser throws permission denied (fuser -k 3000/tcp). Any suggestions for me, please?? 😦
A40 availability
There are a couple of >1 month old posts about this but it seems to be an issue again, A40s have become pretty much entirely unavailable other than at weird times (~7am GMT) and it's been like this for about a week now, what's going on? Availability seems unusually poor, I've never known it like this, I've got quite a lot of credit that I can't use.
SGLANG load LLM Model
I am trying to load LLM model using Pods by using sglang template.
Here is my config:
When I start the pod, it did not loading the model instead the container log keep showing cuda things(license,version). May I know what is the reason?...
Unable to start pod using GraphQL
I am trying to create a pod using the GraphQL endpoint but I am getting 400 status response, here are the request and response for the same. Please let me know how to get this working.
```
Sending GraphQL query:
mutation {...
Differentiating between the pod state, "starting" vs "stopping"
When I start a pod and fetch it's details through the grapQL api, the "runtime" is None but when I stop it, the "runtime" is None as well. Is there a way to differentiate between these two states ?
Building and deploying dockerfile from Pod
Has anyone figured out how to properly build and push a dockerfile in a runpod pod? https://docs.runpod.io/tutorials/pods/build-docker-images
I'm trying to do this for my custom serverless worker bc my personal pc has some docker issues that i havent figured out over multiple days of debugging (i think wsl got corrupted somehow but that's a different issue). But every time i try to run
bazel run //:push_custom_image
, it shows the following error:
```
WARNING: Target pattern parsing failed....Still waiting for logs but I can Console in?
My container is still in Waiting For Logs state, but I can access it through the web console, run services, and access them through http.
Even after doing this, it is still showing waiting for logs.
The docker entrypoint script does not seem to have completed as the services it should run are not started.
What logs does the container logs on runpod ui actually look in? Is it what the container prints to the screen or a log file?...
Assistance with Deploying AI App on RunPod
I recently purchased RunPod to deploy my AI app, but I could use some guidance on implementing my end-to-end project.
I have a project folder, "X," that includes my custom models (in both ONNX and PyTorch formats) and Flask APIs. It’s working well locally, but I'm a bit confused about transitioning to RunPod. Specifically, I’m unsure about how to best leverage Pods, serverless options, and templates to set it up on your platform.
I've explored the documentation but still have questions on structuring and deploying it effectively. Could you or someone from your team provide guidance or resources to help me set up and run my project on RunPod?...
cant ssh to runpod
-- RUNPOD.IO --
Enjoy your Pod #1r3czjoca6n3zh ^_^
Error response from daemon: Container b7b51b7b9a1b7e03f346032b3339de15d9465317e632972e2b5be6b3584d8759 is not running...
How fast are network volumes?
Hey, this kind of belongs into here, and also into serverless. For a client we're currently architecting some stuff, and the question we're having is, just exactly how fast are network volumes.
My limited benchmarks make it feel like they add about a minute for an SDXL Model to the execution time, because the Model needs to be loaded to RAM. This seems to be much faster with local storage.
What are your experiences, any pieces of advice? Any gotchas?
Thank you so much 🙂
...
Please help me.
I have to deploy backend that built using flask on VPS with GPU.
The backend performs object detection using YOLO.
How to do it on RunPod?
And what is this error?...
very slow network storage
I deploy pod with network storage on US-KS-2 and its extremally slow (storage disk)
...
pyton -m venv venv
pyton -m venv venv
Pod not starting up properly anymore
When I deploy a pod with the "RunPod Stable Diffusion" template on demand, its not starting up properly, even if I wait for an hour. I can not launch jupyterlab or the sd webui. Did something change with the platform?
This used to be a very easy and straightforward process withouth issues....
Putty for SSH? Any clues?
I'm trying to connect with putty. I think I converted the ed25519 to .ppk correctly. I know how to use it to authenticate.
I can connect and get prompted to "login as:", so I just use "root" because I can't find a username in documentation. Which username do we use? Am I doing this right?...
Unable to Connect AWS to RunPod
I am encountering an issue while attempting to connect AWS to RunPod. Despite multiple attempts, the connection fails, and we have been unable to establish a successful link between the two services. Any guidance or troubleshooting steps to fix this issue would be greatly appreciated. Thank you in advance for your support!
External IP Ranges (for an AWS VPC Security Group
I've been using an RDS database to collate the results of the work my pods are doing, and that's been fine so far with one or two running, but we're about to scale to a lot more.
This kind of means that I can no longer log into a pod and ping something to get it's IP address to add to my RDS VPC Inbound Whitelist.
I was looking at maybe AWS PrivateLink or mTLS, but neither seem to be supported.
...
how can i access network volume from jupyterlab notebook ?
I am currently running a test on some LLM models, and currently trying to setup a network volume so that I am download and use some of the larger models while also working on some other embbeding models as well (not able to download both llm and embedding model into the defualt volume at the same time)
Would like to ask how can I move the model to the network volume so that I wont have running out of volume error. Thanks!...
I need to reinstall the pip requirements for comfyui everytime I start a pod.
I need to reinstall the pip requirements for comfyui everytime I start a pod. I understand there is a difference between disk and pod volume. I assume I have to update the correct one. So which one do I need to update to permanentely update comfyui and how do I do that? I already tried solving this with the ask-ai bot without success. Thank you.
Solution:
that template is brokwn switch to https://peer.madiator.cloud/w/mFxwXpgQyLT3yBkGv1ZWkx