riverfog7
RRunPod
•Created by Kalpak on 2/22/2025 in #⚡|serverless
Help with deploying WhisperX ($35 bounty)
It says
huggingface_access_token
on the Readme10 replies
RRunPod
•Created by Lattus on 1/22/2025 in #⚡|serverless
Serverless deepseek-ai/DeepSeek-R1 setup?
Is the model you are trying to run a GGUF quant? You'll need a custom script for GGUF quants or if there is multiple models in a single repo
41 replies
RRunPod
•Created by Bj9000 on 1/27/2025 in #⚡|serverless
Serveless quants
change tensor-parallel-size to gpu count
10 replies
RRunPod
•Created by Bj9000 on 1/27/2025 in #⚡|serverless
Serveless quants
install_requirements.sh
10 replies
RRunPod
•Created by Bj9000 on 1/27/2025 in #⚡|serverless
Serveless quants
I do have a running script
10 replies
RRunPod
•Created by Stewette on 2/8/2025 in #⚡|serverless
The default steps on the website for serverless create broken containers that I am charged for.
8xMI300X should work with 1.5 terabytes of vram
6 replies
RRunPod
•Created by Justin on 2/17/2025 in #⚡|serverless
Baking model into Dockerimage
vllm --model /path/to/model
does not work.
You have to do vllm /path/to/model
6 replies
RRunPod
•Created by 자베르 on 2/14/2025 in #⚡|serverless
Github Serverless building takes too much
Github actions is good. Its free for some extent and the image gets host a docker repo (ghcr.io), and never gets queued for 1hour
9 replies