Annoyingly Tedious Speech-to-Text Inference Steps

Have you ever wondered what it takes to produce Banafo Offline Speech to Text transcripts?

This is how we do it behind the scenes:


While the results for English were pretty good, we were unable to keep up with training the monkeys for other languages, so here is how we do it nowadays (please note that we omit some proprietary parts that we consider our trade secrets 😉):

Way too many parts are needed and, as experience tells us, when you try it with enough voices, accents, and compression formats, something will eventually go wrong in every single one of them.
