I have dystonia, which often stiffens my arms in a way that makes it impossible for me to type on a keyboard. Dictation (STT) apps like SuperWhisper have proven to be very helpful for me in such situations. I am hoping to get a similar experience out of "Handy" (very apt naming from my perspective).
I do, however, wonder if there is a way all these STT tools can get to the next level. The generated text shouldn't just be a verbatim copy of what I said; depending on the context, it should elaborate. For example, if my cursor is inside an editor/IDE with some code, my coding-related verbal prompts should actually generate the right/desired code in that IDE.
Perhaps this is a bit of combining STT with computer-use.
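Roughly what I have in mind, as a toy sketch in Python: check which window is focused and only then decide whether the dictation should be rewritten by a model. The xdotool call, the IDE list, and the ask_llm helper are all placeholders of mine, not anything Handy actually does.

```python
# Toy sketch (my own assumptions, not Handy's behavior): only rewrite the
# dictated text when the focused window looks like an editor/IDE.
import subprocess

IDE_HINTS = ("Visual Studio Code", "PyCharm", "IntelliJ", "Neovim")

def focused_window_title() -> str:
    # Linux/X11 example using xdotool; macOS/Windows need their own APIs.
    result = subprocess.run(
        ["xdotool", "getactivewindow", "getwindowname"],
        capture_output=True, text=True,
    )
    return result.stdout.strip()

def ask_llm(prompt: str) -> str:
    # Hypothetical helper: call whatever model you like here.
    raise NotImplementedError

def route_transcript(transcript: str) -> str:
    if any(hint in focused_window_title() for hint in IDE_HINTS):
        # Turn the spoken prompt into code instead of pasting it verbatim.
        return ask_llm(f"Write the code this describes: {transcript}")
    return transcript  # everywhere else, plain dictation
```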
I use it all the time with coding agents, especially if I'm running multiple terminals. It's way faster to talk than type. The only problem is that it looks awkward if there are others around.
Part of my job is giving feedback to people via Word comments. With STT it's been a breeze; the time savings really are great. Thing is, I only do this when working at home with no one around, so really only when WFH.
This looks great! What's missing for me to switch from something like Wispr Flow is the ability to provide a dictionary for commonly mistaken words (names of your company, people, code libraries).
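Even a dumb post-processing pass would go a long way. Something like this, where the corrections are just made-up examples of mine, not anything Handy or Wispr Flow ships:

```python
# Sketch of a user-supplied correction dictionary applied after transcription.
# The entries below are illustrative examples only.
import re

CORRECTIONS = {
    "wisper flow": "Wispr Flow",
    "pie torch": "PyTorch",
    "post gress": "Postgres",
}

def apply_dictionary(transcript: str) -> str:
    for wrong, right in CORRECTIONS.items():
        transcript = re.sub(re.escape(wrong), right, transcript, flags=re.IGNORECASE)
    return transcript

print(apply_dictionary("I pushed the pie torch fix to post gress"))
# -> "I pushed the PyTorch fix to Postgres"
```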
I dig that some models can report how sure they are of each word. Manually entering a bunch of special words is OK, but I want to be able to review the output and see which words the model was less sure of, so I can go find out what I might need to add.
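For what it's worth, faster-whisper already exposes per-word probabilities, so a review view like this seems doable. The model name and the 0.6 threshold are just my picks:

```python
# Sketch: flag low-confidence words for review using faster-whisper's
# word-level probabilities (word_timestamps=True). Threshold is arbitrary.
from faster_whisper import WhisperModel

model = WhisperModel("base.en")
segments, _ = model.transcribe("dictation.wav", word_timestamps=True)

UNSURE = 0.6
for segment in segments:
    for word in segment.words:
        marker = "  <-- check this" if word.probability < UNSURE else ""
        print(f"{word.word.strip():20s} {word.probability:.2f}{marker}")
```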
I just set this up today. I had the Whispering app set up on my Windows computer, but it really wasn't working well on the Ubuntu machine I just built. I found Handy randomly; it was the last app I needed to go Linux full-time. Thank you!
On an M4 MacBook Air, there was enough lag to make it unusable for me. I'd hit the shortcut and start speaking, but there was always a 1-2 second delay before it would actually start transcribing, even though the icon was displayed.
Curious if you were using AirPods or other Bluetooth headphones for this?
If so, there should be a "keep microphone on" or similar setting in the config that may help with this. Alternatively, I set my microphone to my MacBook's built-in mic so that my headphones aren't involved at all, and there is much less latency on activation.
Which one? I did a quick search but that didn't turn up anything so perhaps it's a partial word overlap or something.
I did find the project's "user-facing" home page [1], which was nice. I found it rather hard to find a link from there to the code on GitHub, which was surprising.
Ah, that was a typo: you meant "GPU" (Graphics Processing Unit), not "GUI" (Graphical User Interface), since that is what's listed in the system requirements. An existing comment already explains it implicitly, thanks!
I hear a CLI request? There are tons of CLI speech-to-text tools, by the way. Really glad to see this, since the excellent competitors (Superwhisper, MacWhisper, etc.) are closed/paid.
How have your computing habits changed as a result of having this? When do you typically use this instead of typing on the keyboard?
Handy's first release was June 2025; OpenWhispr's was a month later. Handy has ~11k GitHub stars, OpenWhispr has ~730.
[1]: https://handy.computer/