Voicepad
November 3, 2025 ยท View on GitHub
This repo is (initially) a few loosely organised thoughts for a macropad that I've been thinking about for some time and which (in lieu of having built it!) is replaced with keyboard macros and slightly inconveniently placed shortcut buttons.
Other approaches I have used and which are "good, but could be better" - a cheap USB HID device as a transcription start/stop button (mapping: F13). This can also be a surrogate PTT button. I also have a side button on my vertical mouse set up for the same purpose. And a couple of keys assigned on my keyboard. I also tried a foot mouse and ... wasn't a fan.
While such hardware was of very peripheral interest before (get it?!) with the advent of Whisper, STT has dislodged text as my default mode of entering text into a computer. And as I do that quite a bit, I find myself looking for something more
Why a voice pad?
If you use STT to do a substantial amount of typing then you know why voice related peripherals exist.
But the question of what functionalities are probably still somewhat workflow specific: I need stop and start buttons but if you're a professional transcriptionist ... you probably are well-served by existing hardware.
So this voice pad reflects my needs. Which are...
Bindings / Shortcuts Plans
Start And Stop
Press a button to start STT and press a button to end the record stream / input.
This can be one button, two or three.
- 1 button is start/stop
- 2 buttons are start and stop
- 3 buttons is start, pause stop - but most STT apps dont support pause.
Current preference: stick with 1.
Voice Note
Besides STT
Voice Recording
Design assumption/idea: separate buttons for real time and async recording
Design Notes
Context constriction is key: if the recording keys bind to F13 and that in turn binds to software level functions we will end up with overlapping text inputs