Good improvements for many languages, numbers here<p><a href="https://github.com/openai/whisper/blob/main/language-breakdown.svg">https://github.com/openai/whisper/blob/main/language-breakdo...</a>
Does anyone know of a nice UI wrapper for something like whisper.cpp?<p>I need to write a lot of long texts for work and some good dictation software would be great. I know there's Dragon, but somehow I have not been able to find something that fits my need and is free.
This seems like the best free voice recognition in general.<p>Is there a model that is the best at wake word detection? The last that I looked, it seemed like this was fairly lacking.
Still doesn't look like it can do real-time unfortunately.<p>Edit: I understand that you can use small samples and approximate something like streaming, but the limitation here is you wind up without context for the samples, increasing WER. It would be nice if there was some streaming option.
This is great, but I hope in the future there would be a speech-to-text model with a focus on low-resource languages, probably by balancing the dataset similar to No Language Left Behind (NLLB) released by Meta, it's a translation model that works really well even with low-resource languages, it would be really cool something similar for speech transcription.
did they break the api?<p>from openai import OpenAI<p>Traceback (most recent call last):
File "<stdin>", line 1, in <module>
ImportError: cannot import name 'OpenAI' from 'openai'<p>If so where is the current documentation?