Can we run aphrodite-engine on Serverless?
aphrodite-engine is a fork of vLLM and also supports the exl2 format, which gives it a big advantage. Are there any plans to support aphrodite-engine on RunPod's serverless offering in the future?
I believe currently aphrodite-engine is only supported as a single server on RunPod.
Thanks
8 Replies
I mean, nothing stops you from building your own worker 🙂
How? lol
All the workers are open source, so you can have a look at the code and build your own. Some coding, then packaging a Docker image.
Seriously, I would be interested in building my own queue and hosting aphrodite-engine on a single pod instead.
You mean this worker: https://github.com/runpod-workers/worker-vllm
Or any other you could recommend?
I do not know what aphrodite-engine is, so I can't say, but yes, if it's a fork of vLLM that will be a good starting point.
Yes, find the worker's source, study it, and that will help you understand. The triggers for RunPod serverless are quite simple.
They are all documented in the RunPod docs.
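For anyone curious what "the triggers" look like, here is a minimal sketch of a custom serverless worker, based on the pattern used by worker-vllm. It assumes aphrodite-engine is already running inside the same container and exposes an OpenAI-compatible completions endpoint locally; the port, URL, and model name below are placeholders, not confirmed defaults.

```python
# Minimal RunPod serverless handler sketch.
# Assumptions: aphrodite-engine runs in the same container and serves an
# OpenAI-compatible API on localhost:2242 (port and model id are placeholders).
import requests
import runpod

APHRODITE_URL = "http://127.0.0.1:2242/v1/completions"  # assumed local endpoint

def handler(job):
    """Forward a serverless job's input to the local aphrodite-engine server."""
    job_input = job["input"]
    payload = {
        "model": job_input.get("model", "my-exl2-model"),  # placeholder model id
        "prompt": job_input["prompt"],
        "max_tokens": job_input.get("max_tokens", 256),
        "temperature": job_input.get("temperature", 0.7),
    }
    resp = requests.post(APHRODITE_URL, json=payload, timeout=300)
    resp.raise_for_status()
    return resp.json()

# Registers the handler with RunPod's serverless runtime; this is what pulls
# jobs off the endpoint's queue and invokes your code.
runpod.serverless.start({"handler": handler})
```

Package that (plus aphrodite-engine and a startup script that launches its API server) into a Docker image, point a serverless endpoint at the image, and jobs sent to the endpoint get routed through `handler`.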