I don’t use the openai capabilities. I use talon voice recognition. I can control my computer well enough to code by voice, and {edit: in a prerelease patreon version,] the dictation mode is now based on whisper. It’s pretty amazing.
Are you talking about this website? It seems rather sparse on details. Can you elaborate a bit on what the tool is, who it is for (only power users?), what you use the tool for, how well it works, limitations, etc.?
what it is: command-focsed voice recognition for computer control and especially programming by voice. freeware written in rust by a voice-only dev funded by patreon for use by other voice-only devs.
who is it for: anyone who wants to control their computer by voice or do dictation, but especially ones who want to input a lot of symbols and control commands.
what I use it for: general computer control. search when doing something else. sometimes switching applications. dictation when my hands hurt. coding when my hands hurt.
how well it works: better than google voice keyboard, which is quite good. the core is closed source freeware, the configs are in python and a simple custom language and are generally open source. the config api is really nicely done. you can make custom voice commands easily. I have it set up so saying “computer, <command>” does the command from sleep mode, and “computer, wake” wakes, “computer, sleep” sleeps. eg, “computer, google search lesswrong voice text ai”.
I am not using it now; my hands are still faster when they work. however, it’s head and shoulders better than dragon, which was for a long time the best command voice recognition. (edit: this part is prerelease only apparently, but) since it integrated whisper for dictation voice recognition I think its place as best option is uncontested.
I downloaded it and selected the W2L Conformer engine. On https://talon.wiki/speech_engines/ it does not say anything about using Whisper. It seems much worse than what ChatGPT does.
Did you load another engine to get Whisper to work?
I don’t use the openai capabilities. I use talon voice recognition. I can control my computer well enough to code by voice, and {edit: in a prerelease patreon version,] the dictation mode is now based on whisper. It’s pretty amazing.
Are you talking about this website? It seems rather sparse on details. Can you elaborate a bit on what the tool is, who it is for (only power users?), what you use the tool for, how well it works, limitations, etc.?
https://talon.wiki/
what it is: command-focsed voice recognition for computer control and especially programming by voice. freeware written in rust by a voice-only dev funded by patreon for use by other voice-only devs.
who is it for: anyone who wants to control their computer by voice or do dictation, but especially ones who want to input a lot of symbols and control commands.
what I use it for: general computer control. search when doing something else. sometimes switching applications. dictation when my hands hurt. coding when my hands hurt.
how well it works: better than google voice keyboard, which is quite good. the core is closed source freeware, the configs are in python and a simple custom language and are generally open source. the config api is really nicely done. you can make custom voice commands easily. I have it set up so saying “computer, <command>” does the command from sleep mode, and “computer, wake” wakes, “computer, sleep” sleeps. eg, “computer, google search lesswrong voice text ai”.
I am not using it now; my hands are still faster when they work. however, it’s head and shoulders better than dragon, which was for a long time the best command voice recognition. (edit: this part is prerelease only apparently, but) since it integrated whisper for dictation voice recognition I think its place as best option is uncontested.
I downloaded it and selected the W2L Conformer engine. On https://talon.wiki/speech_engines/ it does not say anything about using Whisper. It seems much worse than what ChatGPT does.
Did you load another engine to get Whisper to work?
oh hmm, I might have a pre-release version. Sorry to mislead. It’ll be out eventually, and exists, but it’s likely because I have the patreon version.
The changelog indeed mentions Whisper as a “0.4.0 beta-only feature”.