R
RunPod•11mo ago
joshmohrer

Insanely Fast Whisper

I am trying to get this to work on Runpod to try and eek out some more speed over your faster-whisper which I currently use and love. It seems like this effort was started? https://github.com/runpod-workers/worker-insanely-fast-whisper/tree/main @Justin Merrell @Marut
or perhaps its done already and ready to go? I tried rolling my own with the help of an upwork guy https://hub.docker.com/r/joshmohrer/insanely-faster-whisper but its not really working, and somewhat incomplete. I'll be putting growing and hopefully meaningful (to runpod) volume through this, so any help would be greatly apprecaited 🙂
GitHub
GitHub - runpod-workers/worker-insanely-fast-whisper
Contribute to runpod-workers/worker-insanely-fast-whisper development by creating an account on GitHub.
10 Replies
wiki
wiki•11mo ago
hey @joshmohrer , Sure. How can I help? Are you facing any issues with this worker ? Is that docker image is built using this worker?
joshmohrer
joshmohrerOP•11mo ago
Hey @Marut thanks for the reply. My docker was a clean attempt starting from the insanely-fast-whisper github but its a bit borked. Is the runpod-workers version linked above something that can be deployed to runpod as-is?
wiki
wiki•11mo ago
Yeah, You can deploy this worker.
joshmohrer
joshmohrerOP•11mo ago
"dt":"2024-02-05 18:21:39.791830" "endpointid":"3rrxzpnnvgyfyg" "level":"info" "message":" "error_message": "Multiple languages detected when trying to predict the most likely target language for transcription. It is currently not supported to transcribe to different languages in a single batch. Please make sure to either force a single language by passing language='...' or make sure all input audio is of the same language."," "workerId":"5ohdo2lip7bl7d" } your faster-whisper handles this no problem and unfortuntaely its something I need. Perhaps limitation of insanely-faster-whiper
flash-singh
flash-singh•11mo ago
thats from your insanely-faster-whiper?
joshmohrer
joshmohrerOP•11mo ago
Yeah
wiki
wiki•11mo ago
I can check once.
joshmohrer
joshmohrerOP•11mo ago
i got a bit spoiled by your faster-whisper template which has been really perfect. im partially trying to get diarized output, and partially looking for slightly more speed. my transcription usecase is time sensitive for my app's experience.
wiki
wiki•11mo ago
I understand, Let me take a look on it. @joshmohrer made minor changes to be in sync with upstream. You can try with multi language by specifying, now. Let me know if you face any issues.
joshmohrer
joshmohrerOP•11mo ago
thanks!!
Want results from more Discord servers?
Add your server