Exposing HTTP ports on serverless
There's no way to expose HTTP ports on serverless, is there? When I create a new template and flip the template type from Pod to Serverless, that option goes away.
I see stuff in the codebase about
RUNPOD_REALTIME_PORT
but I'm not sure what the use case is for that, and I haven't found documentation for it. Also, I'd like to expose two ports to my serverless instances.
Why would you want to expose HTTP ports on serverless? It's not designed for that. Use pods instead.
I'm creating a version of this https://www.instagram.com/p/C8CzGuOubKp/ that anybody can use without setup. Sending frames over a websocket connection at 24fps.
I have it working with pods, but I'd rather not manage standing up and tearing down the instances for each user
wow that seems cool
That's what I think! But I can't find anyone else interested in real-time AI or AI vtubing
Yeah use pods for now...
24fps, isn't that really high? you're gonna use a bunch of high-end GPUs hahah
it all runs on one 3090
Wow, 24 images/s?
Yup. sdxl turbo https://github.com/GenDJ/GenDJ/blob/main/diffusion_processor.py
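For context, a 1-2 step sdxl-turbo img2img call in diffusers is only a few lines. This is a minimal sketch of that kind of loop, not the actual GenDJ code; it assumes diffusers is installed and a CUDA GPU is available:
```python
import torch
from diffusers import AutoPipelineForImage2Image
from PIL import Image

# Minimal sdxl-turbo img2img sketch, not the actual GenDJ code.
pipe = AutoPipelineForImage2Image.from_pretrained(
    "stabilityai/sdxl-turbo", torch_dtype=torch.float16, variant="fp16"
).to("cuda")

def process_frame(frame: Image.Image, prompt: str) -> Image.Image:
    # Turbo models run with guidance disabled and very few steps;
    # num_inference_steps * strength must be >= 1 for img2img.
    return pipe(
        prompt,
        image=frame,
        num_inference_steps=2,
        strength=0.5,
        guidance_scale=0.0,
    ).images[0]
```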
Oooo
But you can initiate websockets to external servers from serverless... if I'm not wrong
is there any off-the-shelf or open source solution for managing a bunch of pods? Ideally I'd like to have some sitting idle so users can start instantly, without waiting a few minutes for the server to stand up and the start script to run, set up the project, etc.
well there are some scripts that can help you get started; other options are things like Pulumi or SkyPilot
how? That would be amazing. I'm also serving a web server hosting the frontend from the pod but I can probably figure out how to decouple that from the websocket and processing
kind of infrastructure as code
ah so it's in that land
use the graphql api if you want to
for the start script I think you can make custom templates
the pod version of this is already a custom template. Works amazingly well
Woop, no direct connections to the frontend imo, make a backend for this
well it's handy for the pod version
I think the non-pod version will have to be a pretty different architecture with a whole webapp
i think for "instant use or faster loading" use some active workers serverless or pods (longer loading if you wanna match the serverless thing )
yes unless you got the same architecture in pods and serverless lol
yeah that's why I was hoping there'd be some solution out there for managing the fleet of pods. Obviously for cost reasons I wanna have as few active ones waiting for people as possible, so it'd have to be dynamic, with pretty complex logic for when to stand new ones up and tear old ones down
make your own hahah, use the graphql api
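If you go the GraphQL route, managing pods is just HTTP POSTs against the API. A rough sketch with requests; the myself/pods fields here are assumptions from memory, so check RunPod's GraphQL docs:
```python
import os
import requests

# Sketch of hitting RunPod's GraphQL API directly; the myself/pods fields
# are assumptions; verify them against the API docs.
RUNPOD_API_KEY = os.environ["RUNPOD_API_KEY"]

query = """
query Pods {
  myself {
    pods { id name desiredStatus }
  }
}
"""

resp = requests.post(
    f"https://api.runpod.io/graphql?api_key={RUNPOD_API_KEY}",
    json={"query": query},
    timeout=30,
)
print(resp.json())
```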
side note: currently I'm stuffing the actual models inside the container, which makes my Docker images like 20 gigs. Takes forever to upload and is a terrible developer experience. How are people doing it?
why graphql api instead of python sdk?
I was gonna use this https://github.com/runpod/runpod-python
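The SDK wraps the same GraphQL API, so a warm-pool manager could look something like this. A sketch assuming runpod-python's get_pods / create_pod / terminate_pod helpers; the image and GPU type are placeholders, and the exact create_pod signature should be checked against the SDK:
```python
import os
import runpod

runpod.api_key = os.environ["RUNPOD_API_KEY"]

def ensure_warm_pool(target: int) -> None:
    """Sketch of a warm-pool manager; image and GPU type are placeholders,
    and the exact create_pod signature should be checked against the SDK."""
    warm = [p for p in runpod.get_pods() if p["name"].startswith("gendj-warm")]
    for i in range(len(warm), target):  # scale up to the target size
        runpod.create_pod(
            name=f"gendj-warm-{i}",
            image_name="myuser/gendj:latest",       # placeholder image
            gpu_type_id="NVIDIA GeForce RTX 3090",  # placeholder GPU type
        )
    for pod in warm[target:]:  # scale down any extras
        runpod.terminate_pod(pod["id"])
```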
well there is an alternative, you can offload the model into network storage
can I auto mount a network drive on all new pods I stand up?
yes, it's automatically mounted at /workspace
if you choose it in ui or start from graphql, both can be done
but you're limited to a region that has the network volume
ahh dealbreaker
dw tho, it's still a huge pool of GPUs, you just need to pick the right region that has the GPU model you wanna use
but it's also slower than container disk
well I've noticed some regions aren't good at maintaining the websocket connection, it keeps dropping. And regions run out of GPUs quite frequently in my experience. I can't couple myself to one region, especially not when my current images are working fine, they're just huge and annoying
tradeoff not worth it
wish there were more gpus in usa regions
which regions?
most lol
RO is working fine
ooof
and us?
never had the gpus I want available in US
ever lol
yeah well, GPUs are limited by the stock in the DCs
DCs?
datacenters
do you work at runpod? you're helpful btw, thanks for answering my random questions
I've felt like I'm wandering through a dark forest lol
hahaha well not officially working
feel free to explore the docs, or find some code on Google or GitHub
so I guess my next step is to build the whole dang pod management thing
also if you're building for serverless you can test out on pods too
sure goodluck with that!
but geez, the serverless stuff all works great, I just would love to use it with the ports exposed
spin up a job when a user wants to do the live thing, spin it down when they're done
why not initiate it from internal to external
that seems more fit to pods workload
not sure what you mean
like I said before, initiate the WS from the serverless worker instead of exposing a port and connecting from outside
I think you need the port exposed to make a websocket connection, no?
hold on spinning up a pod without the ws port open to confirm
Yes, but instead expose the port on your server and connect to it from RunPod
RunPod's outbound firewall is all open, if I'm not wrong
that's not possible if you're connecting from your server to RunPod, but it works the other way around if you've got open ports on your own server
oh snap
well nah it needs to go both ways
well wait no that might work
I'm gonna quickly spin something up locally, ngrok into it, and see if that works
Yeah that might work, try it out on pods if you want
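For reference, the outbound idea on a serverless worker might look roughly like this. A sketch using the websockets package; the handler shape follows runpod-python's serverless pattern, while the relay_url input field and the frame protocol are made up for illustration:
```python
import runpod
import websockets

# Sketch: the worker dials OUT to your relay server instead of exposing a port.
# The relay_url input field and the echo-style frame loop are hypothetical.
async def handler(job):
    relay_url = job["input"]["relay_url"]  # e.g. wss://my-server.example.com/worker
    async with websockets.connect(relay_url) as ws:
        async for frame_jpg in ws:      # JPEG frame pushed from your server
            processed = frame_jpg       # placeholder: run the diffusion step here
            await ws.send(processed)    # warped frame back out
    return {"status": "session closed"}

runpod.serverless.start({"handler": handler})
```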
ur a genius
the only annoying thing is now I need to maintain double the websocket connections
double?
browser -><- my server -><- runpod
instead of just sending the frames directly between the runpod instance and the browser
in fact, for the server -><- runpod connection, maybe I want to find a better way to do it than websockets. Some messaging queue or something
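The relay in the middle doesn't have to be much. A minimal asyncio sketch, assuming one browser and one worker per session, with no auth or error handling; the role:session handshake is hypothetical:
```python
import asyncio
import websockets

# Minimal relay: pairs one browser and one worker per session and forwards
# frames both ways. The role:session handshake is hypothetical.
sessions: dict[str, dict] = {}  # session_id -> {"browser": ws, "worker": ws}

async def pump(src, dst):
    async for msg in src:
        await dst.send(msg)

async def handle(ws):
    role, session_id = (await ws.recv()).split(":", 1)  # e.g. "browser:abc123"
    peers = sessions.setdefault(session_id, {})
    peers[role] = ws
    if len(peers) == 2:  # both sides connected, start forwarding
        await asyncio.gather(
            pump(peers["browser"], peers["worker"]),
            pump(peers["worker"], peers["browser"]),
        )
    else:
        await ws.wait_closed()  # first arrival just waits for its peer

async def main():
    async with websockets.serve(handle, "0.0.0.0", 8765):
        await asyncio.Future()  # run forever

asyncio.run(main())
```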
sure
but that would add some more delay ig
if not sockets
the second one, hmm, that would work on pods yeah, but it's not really secure if you have more ports exposed or security flaws
well that's how it works currently
wanna try it out lol
hmm sure
dming the link 1 sec
With GenDJ would it be possible to have it process video from a file rather than live? Take in mp4 return mp4?
it would totally be possible, just haven't built it yet
currently it just turns the frames of your webcam into JPEGs and sends them to the server, you'd do the same thing with the video as it plays
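For the mp4 case, the client loop would be almost identical to the webcam one. A sketch with OpenCV and websockets; the server URL and wire format are assumptions:
```python
import asyncio
import cv2
import websockets

# Sketch: stream an mp4's frames as JPEGs over a websocket, mirroring the
# webcam client. The URL and wire format are hypothetical.
async def stream_video(path: str, url: str) -> None:
    cap = cv2.VideoCapture(path)
    async with websockets.connect(url) as ws:
        while True:
            ok, frame = cap.read()
            if not ok:  # end of file
                break
            _, jpg = cv2.imencode(".jpg", frame)
            await ws.send(jpg.tobytes())
            processed = await ws.recv()  # warped frame back from the worker
            # collect `processed` frames into an output mp4, e.g. cv2.VideoWriter
    cap.release()

asyncio.run(stream_video("input.mp4", "wss://my-worker.example.com/ws"))
```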
I'll take a look at the source, might be able to adapt.
PRs welcome lol
Also, for my server -><- runpod you are not limited to websockets. If it were me I would have serverless connect to my_server via OpenVPN. Really easy to set up and automate.
interesting. I tried Cloudflare Tunnel but couldn't figure it out. hadn't thought of a VPN
would that scale? Like, could many people use it at the same time?
and how would I send the frames?
Assuming you don't care about hyper-focusing on encryption, there is very little overhead. You could send frames via SCP, or you could mount a disk from the server... If you want to keep it off disk then you could receive it via a webhook... really any way you can over the network.
With that said, I am in the process of building out a frontend that acts as websocket server and client... sometimes it is the best way to go.
tbh might want to flip back to the original i2i-realtime and use zmq https://github.com/kylemcdonald/i2i-realtime/blob/main/worker_app.py
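On the worker side, zmq would roughly look like this. A generic PUSH/PULL sketch with pyzmq; the endpoints are placeholders, and this is not necessarily the topology worker_app.py actually uses:
```python
import zmq

# Generic PUSH/PULL frame pipeline; endpoints are placeholders, and this is
# not necessarily the topology worker_app.py actually uses.
ctx = zmq.Context()
pull = ctx.socket(zmq.PULL)   # frames in from the server
pull.connect("tcp://my-server.example.com:5555")
push = ctx.socket(zmq.PUSH)   # processed frames back out
push.connect("tcp://my-server.example.com:5556")

while True:
    frame_jpg = pull.recv()
    processed = frame_jpg  # placeholder: run the diffusion step here
    push.send(processed)
```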
Yes, combine that with an OpenVPN connection and you're in business.
RunPod has teased adding backend networks that serverless can connect to, along with your external servers. Until that happens, OpenVPN is pretty close to that.
quick question: do we pay for pod startup time?
pods, I guess yes, you pay for them