How does the soft check on workers limit work?
Stuck in the initialization
cannot stream openai compatible response out
[URGENT] Failed to return results
Is there an equivalent of flash boot for CPU-only serverless?
Why the available GPUs are only 1?
Faster-Whisper worker template is not fully up-to-date
1.0.2
, whereas the Runpod template is still on 0.10.0
.
There are a few changes that have been introduced in Faster-Whisper (now using CUDA 12) since, that we would like to benefit from, especially the language_detection_threshold
setting, since it seems like most of our transcriptions done by people with British accent are being transcribed into Welsh (with a language detection confidence of around 0.51
to 0.55
) - which could be circumvented by increasing the threshold....Slow IO speeds on serverless
How to download models for Stable Diffusion XL on serverless?
2) I created a Stable Diffusion XL endpoint on serverless, but couldn't attach the network storage.
3) After the deployment succeeded, I clicked on edit endpoint and attached that network storage to it. So far so good I believe. But how do I exactly download various SDXL models into my network storage, so that I could use them via Postman?...
0% GPU utilization and 100% CPU utilization on Faster Whisper quick deploy endpoint
![No description](https://answer-overflow-discord-attachments.s3.us-east-1.amazonaws.com/1253370999089664073/Screenshot_2024-06-20_at_8.02.53_AM.png)
Loading models from network volume cache is taking too long.
Are webhooks fired from Digital Ocean?
AWS#AWSManagedRulesBotControlRuleSet#SignalKnownBotDataCenter
. The IP address in these requests seems to be a Digital Ocean Data Center. I have disabled the WAF for my ALB for my RunPod webhooks temporarily, but hoping that someone can confirm whether these are legitimate requests or not, because I was under the impression that RunPod uses AWS and not Digital Ocean.best architecture opinion
Cancelling job resets flashboot
RUNPOD_API_KEY and MAX_CONTEXT_LEN_TO_CAPTURE
Do I need to allocate extra container space for Flashboot?
Thanks...
When servless is used, does the machine reboot if it is executed consecutively? Currently seeing iss
Slow I/O