Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This looks pretty cool, especially since it is offline and free! The only caveat is that it is probably not feasible to train yourself, right?

I should play with this if I have some time. I've had the idea for a while to build a voice assistent which can switch modes or datasets while you are speaking. If you say "computer, play ...." for example it would load a recognizer that is specialized on song names. The idea is that you can mix English song names in a German prompt, and it will not be confused. Every voice assistent I know gets confused, presumably because they convert speech to plain text and only then act on the text.



> I've had the idea for a while to build a voice assistent which can switch modes or datasets while you are speaking. If you say "computer, play ...." for example it would load a recognizer that is specialized on song names.

I would say by now the generic recognizers are so good that this is becoming less and less useful. For example, this tool handles non-existing German words quite well.

That said, the tool has a "--data_folder_path" parameter where you can specify a different acoustic and language model.

BTW, I also want to build an offline voice assistant :)

That's how I got started on this journey. You might be interested in my next project, where I try to do offline real-time English recognition with a WebRTC API to make it easy for developers to connect my AI module with their own task logic. Here's the waiting list: https://madmimi.com/signups/f0da3b13840d40ce9e061cafea6280d5...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: