annasuhstuff
annasuhstuff
RRunPod
Created by annasuhstuff on 8/7/2024 in #⚡|serverless
"IN QUEUE" and nothing happeneds
No description
7 replies
RRunPod
Created by annasuhstuff on 8/5/2024 in #⚡|serverless
HF_TOKEN question
No description
26 replies
RRunPod
Created by annasuhstuff on 8/1/2024 in #⚡|serverless
Head size 160 is not supported by PagedAttention
Hello, I hope everyone is doing great! I am stuck with this error: ValueError: Head size 160 is not supported by PagedAttention. Supported head sizes are: [64, 80, 96, 112, 128, 256] Does it mean I have to RETRAIN MY MODEL? Full logs are in attachment
3 replies
RRunPod
Created by annasuhstuff on 6/27/2024 in #⚡|serverless
Quantization method
No description
9 replies
RRunPod
Created by annasuhstuff on 6/26/2024 in #⚡|serverless
LoRA adapter on Runpod.io (using vLLM Worker)
No description
21 replies
RRunPod
Created by annasuhstuff on 6/25/2024 in #⚡|serverless
No config error /
No description
5 replies