This looks pretty cool, especially since it is offline and free! The only caveat...

fxtentacle · on Aug 10, 2022

> I've had the idea for a while to build a voice assistant which can switch modes or datasets while you are speaking. If you say "computer, play ...." for example it would load a recognizer that is specialized for song names.

I would say by now the generic recognizers are so good that this kind of specialization is becoming less and less useful. For example, this tool handles made-up German words quite well.

That said, the tool has a "--data_folder_path" parameter where you can specify a different acoustic and language model.
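
If you still want to try the mode-switching idea, here is a rough Python sketch of how it could be wired up around that parameter. Everything except the "--data_folder_path" flag itself is an assumption for illustration: the folder names, the trigger phrases, and the executable name you pass in are all hypothetical, and I'm assuming the flag takes its value as a separate argument.

```python
# Hypothetical mapping from a spoken prefix to a model folder.
# The phrases and folder names are made up for illustration.
MODE_FOLDERS = {
    "computer play": "models/song_names",
    "computer call": "models/contacts",
}
DEFAULT_FOLDER = "models/generic"


def pick_data_folder(partial_transcript):
    """Return the model folder matching the start of what was said so far."""
    # Normalize: lowercase, drop commas, collapse repeated whitespace.
    text = " ".join(partial_transcript.lower().replace(",", " ").split())
    for prefix, folder in MODE_FOLDERS.items():
        # Match on whole words so "computer playback" does not trigger "play".
        if text == prefix or text.startswith(prefix + " "):
            return folder
    return DEFAULT_FOLDER


def build_args(executable, partial_transcript):
    """Build an argument list for a second, specialized recognition pass."""
    # Assumption: the value follows the flag as a separate argument.
    return [executable, "--data_folder_path", pick_data_folder(partial_transcript)]
```

The idea would be to run the generic model first, look at the first couple of words, and only then re-run the rest of the audio with the specialized folder. Whether that beats a single generic pass is exactly what I'm doubtful about.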

BTW, I also want to build an offline voice assistant :)

That's how I got started on this journey. You might be interested in my next project, where I try to do offline real-time English recognition with a WebRTC API to make it easy for developers to connect my AI module with their own task logic. Here's the waiting list: https://madmimi.com/signups/f0da3b13840d40ce9e061cafea6280d5...