For people who want simple, out of the box stuff (not necessarily in Python) for just getting phonemes I can also recommend [0]. Not amazing recognition quality, but dead simple setup, and it is possible to integrate a language model as well (I never needed one for my task). The author showed it as well in [1], but kind of skimmed right by - but to me if you want to know speech recognition in detail, pocketsphinx-python is one of the best ways. Customizing the language model is a <i>huge</i> boost in domain specific recognition.<p>Large company APIs will usually be better at generic speaker, generic language recognition - but if you can do speaker adaptation and customize the language model, there are some insane gains possible since you prune out a lot of uncertainty and complexity.<p>If you are more interested in recognition and alignment to a script, "gentle" is great [2][3]. The guts also have raw Kaldi recognition, which is pretty good for a generic speech recognizer but you would need to do some coding to pull out that part on its own.<p>For a decent performing deep model, check into Mozilla's version of Baidu's DeepSpeech [4].<p>If doing full-on development, my colleague has been using a bridge between PyTorch (for training) and Kaldi (to use their decoders) to good success [5].<p>[0] how I use pocketsphinx to get phonemes, <a href="https://github.com/kastnerkyle/ez-phones" rel="nofollow">https://github.com/kastnerkyle/ez-phones</a><p>[1] <a href="https://github.com/cmusphinx/pocketsphinx-python" rel="nofollow">https://github.com/cmusphinx/pocketsphinx-python</a><p>[2] <a href="https://github.com/lowerquality/gentle" rel="nofollow">https://github.com/lowerquality/gentle</a><p>[3] how I use gentle for foreced alignment, <a href="https://github.com/kastnerkyle/raw_voice_cleanup/tree/master/alignment" rel="nofollow">https://github.com/kastnerkyle/raw_voice_cleanup/tree/master...</a><p>[4] <a href="https://github.com/mozilla/DeepSpeech" rel="nofollow">https://github.com/mozilla/DeepSpeech</a><p>[5] <a href="https://github.com/mravanelli/pytorch-kaldi" rel="nofollow">https://github.com/mravanelli/pytorch-kaldi</a>