Are you talking about this website? It seems rather sparse on details. Can you elaborate a bit on what the tool is, who it is for (only power users?), what you use the tool for, how well it works, limitations, etc.?
https://talon.wiki/
what it is: command-focused voice recognition for computer control and especially programming by voice. freeware written in rust by a voice-only dev funded by patreon for use by other voice-only devs.
who is it for: anyone who wants to control their computer by voice or do dictation, but especially those who want to input a lot of symbols and control commands.
what I use it for: general computer control. search when doing something else. sometimes switching applications. dictation when my hands hurt. coding when my hands hurt.
how well it works: better than google voice keyboard, which is quite good. the core is closed source freeware, the configs are in python and a simple custom language and are generally open source. the config api is really nicely done. you can make custom voice commands easily. I have it set up so saying “computer, <command>” does the command from sleep mode, and “computer, wake” wakes, “computer, sleep” sleeps. e.g., “computer, google search lesswrong voice text ai”.
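To make the sleep/wake behaviour concrete, here is a rough sketch of the dispatch logic in plain Python. This is not Talon's config API or its custom language; `VoiceDispatcher`, `register`, `handle`, and `google_search_url` are hypothetical names made up for illustration, and the handler only builds a search URL instead of opening a browser.

```python
from urllib.parse import quote_plus

WAKE_WORD = "computer"  # assumed prefix, taken from the example phrases above


def google_search_url(query):
    """Hypothetical handler: build a search URL rather than opening a browser."""
    return "https://www.google.com/search?q=" + quote_plus(query)


class VoiceDispatcher:
    """Illustrative only: routes recognized text to handlers, with a sleep mode."""

    def __init__(self):
        self.asleep = True
        self.commands = {}  # command phrase -> handler taking the remaining text

    def register(self, phrase, handler):
        self.commands[phrase] = handler

    def handle(self, utterance):
        text = utterance.lower().strip()
        head, _, rest = text.partition(" ")
        prefixed = head.rstrip(",") == WAKE_WORD
        if prefixed:
            text = rest.strip()
        # "computer, wake" and "computer, sleep" toggle the mode.
        if prefixed and text == "wake":
            self.asleep = False
            return "awake"
        if prefixed and text == "sleep":
            self.asleep = True
            return "asleep"
        # While asleep, only prefixed utterances are acted on.
        if self.asleep and not prefixed:
            return None
        for phrase, handler in self.commands.items():
            if text == phrase or text.startswith(phrase + " "):
                return handler(text[len(phrase):].strip())
        return None


dispatcher = VoiceDispatcher()
dispatcher.register("google search", google_search_url)
print(dispatcher.handle("computer, google search lesswrong voice text ai"))
```

The point of the design is that the prefix works as a one-off override: a prefixed command runs without changing the sleep state, so ordinary speech keeps being ignored afterwards.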
I am not using it now; my hands are still faster when they work. however, it’s head and shoulders better than dragon, which was for a long time the best command voice recognition. (edit: this part is prerelease only apparently, but) since it integrated whisper for dictation voice recognition I think its place as best option is uncontested.
I downloaded it and selected the W2L Conformer engine. On https://talon.wiki/speech_engines/ it does not say anything about using Whisper. It seems much worse than what ChatGPT does.
Did you load another engine to get Whisper to work?
oh hmm, I might have a pre-release version. Sorry to mislead. It exists and it’ll be out eventually, but I likely only have it because I have the patreon version.
The changelog indeed mentions Whisper as a “0.4.0 beta-only feature”.