Mandragora.ai
Mandragora.ai
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
Thanks BTW, the little bit of back and forth here really helped keep me sane. I would have become increasingly frustrated if I was just faced with silence.
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
Nah, it occured to me to try this before i fully deployed skypilot. I will get back to skypilot in the coming days, but its no longer an emergency, and I have more pressing matters to attend to.
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
The first instance is still buggered.
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
I worked around it by creating a second instance of the pod and updating my app to point to that. Same cloud, same region, same spec
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
Bummer, thats a dealbreaker for us if it pans out that way.
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
No description
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
yeah, will probably use DO, i've already got a couple apps running on App Platform. Cant use Runpod to host the gateway to get around runpod issues, not reliable enough 😉
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
will do, so far its been very intuitive to set up. still need to figure out where to host the gateway, but running locally its already working, was trivial; about ~15 minutes to connect my app to it.
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
I'm in the process of setting up skypilot now to hopefully have some better stability
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
When you say "which cloud", i assume that means secure vs community? I'm on secure cloud. Canada region
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
I hope so
28 replies
RRunPod
Created by Mandragora.ai on 8/24/2024 in #⛅|pods
Production pod suddenly unreachable, how long can I expect this to last for? (Please provide ETA)
Thanks. I have created a ticket. I have emailed support. This is the third hours long outage in three months. The first lasted a day and a half. This can't keep happening with mission critical infrastructure.
28 replies
RRunPod
Created by Mandragora.ai on 5/10/2024 in #⚡|serverless
Serverless broke for me overnight, I can't get inference to run at all.
thanks for the heads up, i'll keep an eye out for anyone else having similar problems too
101 replies
RRunPod
Created by Mandragora.ai on 5/10/2024 in #⚡|serverless
Serverless broke for me overnight, I can't get inference to run at all.
No description
101 replies
RRunPod
Created by Mandragora.ai on 5/10/2024 in #⚡|serverless
Serverless broke for me overnight, I can't get inference to run at all.
thanks for the heads up
101 replies
RRunPod
Created by Mandragora.ai on 5/10/2024 in #⚡|serverless
Serverless broke for me overnight, I can't get inference to run at all.
no idea, i'll see if i can replicate it. i haven't seen that issue myself, we have a letsencrypt ssl cert.
101 replies
RRunPod
Created by Mandragora.ai on 5/10/2024 in #⚡|serverless
Serverless broke for me overnight, I can't get inference to run at all.
@Alpay Ariyak thanks for your hard work with this 🎉
101 replies
RRunPod
Created by Mandragora.ai on 5/10/2024 in #⚡|serverless
Serverless broke for me overnight, I can't get inference to run at all.
my app is back up and running! Only 26 hours of downtime and 186 new signups hit with "Sorry we're down"
101 replies
RRunPod
Created by Mandragora.ai on 5/10/2024 in #⚡|serverless
Serverless broke for me overnight, I can't get inference to run at all.
YES! I finally got some inference output!
101 replies
RRunPod
Created by Mandragora.ai on 5/10/2024 in #⚡|serverless
Serverless broke for me overnight, I can't get inference to run at all.
is there anything i might need to do at my end to get it running again? I just activated a worker on the endpoint, and it did actually load the model into memory. Which is way further than I got at any point yesterday. but it still not running inference; the requests are still stuck at IN_QUEUE. I'm about to start playing with my environment variables again in case they're in an invalid state
101 replies