Volko
RunPod
•Created by Volko on 4/17/2024 in #⚡|serverless
Why is my endpoint running? I don't have any requests and the idle timeout is set to 1 sec

3 replies
RunPod
•Created by Volko on 4/17/2024 in #⛅|pods-clusters
Is AWQ faster than GGUF?
How do AWQ, GGUF, GPTQ, QAT, and EXL2 rank by inference speed, from fastest to slowest?
10 replies
RunPod
•Created by Volko on 4/11/2024 in #⛅|pods-clusters
Will 2 GPUs fine-tune 2 times faster than 1 GPU on axolotl?
23 replies
RunPod
•Created by Volko on 4/10/2024 in #⛅|pods-clusters
Download Mixtral from HuggingFace

15 replies