Cyber | Senpai
Cyber | Senpai
Explore posts from servers
RRunPod
Created by Cyber | Senpai on 8/22/2024 in #⚡|serverless
Implement RAG with vllm API
Is it possible to implement RAG with the given API of vllm and our deployed model on the serverless endpoint.
7 replies