Linux Voice Tech

March 29, 2026

An index of voice technology tools accessible to Linux users

Star counts and last commit dates are shown via shields.io badges and update dynamically.
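As a rough illustration of how such badges are typically built, the sketch below assembles shields.io URLs for a repository's star count and last commit date. The `stars` and `last-commit` GitHub endpoints are real shields.io routes; the helper names and the `example-owner/example-repo` slug are hypothetical, and this is not necessarily how this repo generates its own badges.

```python
def badge_urls(owner: str, repo: str) -> dict:
    """Return shields.io badge URLs for a GitHub repository (hypothetical helper)."""
    base = "https://img.shields.io/github"
    return {
        "stars": f"{base}/stars/{owner}/{repo}",
        "last_commit": f"{base}/last-commit/{owner}/{repo}",
    }


def badge_markdown(owner: str, repo: str) -> str:
    """Render both badges as Markdown image syntax, separated by a space."""
    urls = badge_urls(owner, repo)
    return f"![Stars]({urls['stars']}) ![Last commit]({urls['last_commit']})"


if __name__ == "__main__":
    # "example-owner/example-repo" is a placeholder, not a real project.
    print(badge_markdown("example-owner", "example-repo"))
```

Because the images are rendered by shields.io at view time, the numbers stay current without edits to the list itself.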

For background, notes on how the repo is organized, and inclusion criteria, see notes.md. For a getting started guide, see starting-points.md.

Keywords

Automatic speech recognition (ASR)
Speech-to-text (STT)
Text-to-speech (TTS)
Linux voice typing
Linux dictation
Linux TTS
Voice control
Transcription

Section	What's in it
Wayland-Compatible STT	Tools with explicit Wayland virtual input support
Voice Typing — GUIs	Desktop apps for dictation and transcription
Voice Typing — CLIs	Command-line dictation and transcription tools
Voice Notes & AI-Enhanced	Note-taking with AI post-processing
Real-Time Streaming STT	Low-latency, live transcription libraries
Self-Hosted / Web UI	Docker/web-based transcription services
Cloud STT / API-Based	Tools using OpenAI, Deepgram, or other cloud APIs
Voice Assistants	Voice-controlled assistant applications
Voice Commands & Automation	Voice-to-action, voice-to-MCP, computer control
Toolkits & Frameworks	Developer libraries for building voice apps
Whisper Variants & Optimizations	Faster/smaller/better Whisper implementations
Complementary Tools	VAD, diarization, noise suppression
Text-to-Speech (TTS)	TTS tools and frameworks
MCP Servers	Model Context Protocol voice servers
Awesome Lists	Other curated voice tech lists
Community Resources	GitHub topics, subreddits

STT Tools with Wayland Support

Projects with explicit Wayland support. These are particularly valuable on modern Linux desktops (GNOME or KDE Plasma on Wayland, Hyprland, Sway, niri, etc.), where X11 virtual input tools such as xdotool don't work.

Repository	Stars	Last Updated	Description
dictation-tools			Dictation tools with Wayland support
freespeak			Voice dictation with Wayland support
hyprvoice			Voice dictation for Hyprland
hyprwhspr			Whisper-based voice input for Hyprland
local-dictation-assistant			Local dictation assistant with Wayland support
niri-transcribe			Transcription tool for niri compositor
swictation			Voice dictation for Sway/Wayland
TalkType (ronb1964)			Privacy-first voice dictation for Linux Wayland. Press key to talk, release to type. Whisper AI, 100% offline
vocalinux			Offline voice dictation for Linux. Whisper.cpp, Whisper & VOSK engines, GPU-accelerated, X11 + Wayland
voice-typing-linux			Voice typing for Linux with Wayland support
wayland-voice-dictation			Voice dictation designed for Wayland
whisper-wayland			Whisper integration for Wayland
whispy			STT tool with Wayland support
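To make the X11 versus Wayland input problem concrete, here is a minimal sketch of how a dictation tool might pick a typing backend. It assumes the session type can be read from the `XDG_SESSION_TYPE`, `WAYLAND_DISPLAY`, and `DISPLAY` environment variables, and that one of the `wtype`, `ydotool`, or `xdotool` command-line tools is installed. The function names are hypothetical, and none of the projects above is claimed to work exactly this way.

```python
import os
import shutil


def session_type(env=None) -> str:
    """Guess the display server: 'wayland', 'x11', or 'unknown'."""
    env = os.environ if env is None else env
    declared = env.get("XDG_SESSION_TYPE", "").lower()
    if declared in ("wayland", "x11"):
        return declared
    # Fall back to the display variables when the session type is not set.
    if env.get("WAYLAND_DISPLAY"):
        return "wayland"
    if env.get("DISPLAY"):
        return "x11"
    return "unknown"


def typing_command(text, env=None, which=shutil.which):
    """Return an argv list that would type `text`, or None if no tool fits."""
    candidates = {
        "wayland": ["wtype", "ydotool"],  # Wayland-capable tools
        "x11": ["xdotool"],               # X11-only tool
    }.get(session_type(env), [])
    for tool in candidates:
        if which(tool) is None:
            continue
        if tool == "wtype":
            return ["wtype", text]
        return [tool, "type", text]  # same shape for ydotool and xdotool
    return None


if __name__ == "__main__":
    # Only prints the command; pass it to subprocess.run() to actually type.
    print(typing_command("hello from dictation"))
```

Two caveats apply: `wtype` depends on the compositor supporting the virtual keyboard protocol, and `ydotool` needs its `ydotoold` daemon and access to uinput, so a real tool would need further checks and error handling.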

Voice Typing — GUIs

Desktop applications for dictation and transcription with graphical interfaces.

Repository	Stars	Last Updated	Description
AI-Typer-V2			Voice dictation with multimodal AI cleanup — speak naturally, get polished text
aTrain			GUI tool for offline transcription of speech recordings
audiov			Speech-to-text, voice-typing, dictation software for Linux distributions
Buzz			Offline audio transcription and translation. Supports Whisper, Whisper.cpp, Faster-Whisper. Available via Flatpak/Snap. Vulkan GPU support
dsnote			Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation
LinuxWhisper			Whisper for Linux
maVoice-Linux			Voice control for Linux
mint-whisper			Whisper for Linux Mint
murmure			Fully local, private, cross-platform STT with LLM post-processing
OpenFlow			Local speech-to-text app for Linux
OpenWispr			Open source Whisper-based voice assistant
Parakeet-Type-Ubuntu			On-device voice typing for Linux using Parakeet and NeMo ASR models via sherpa-onnx. No cloud, no GPU required
sotto			Local speech-to-text transcription app for Linux using Whisper models
soundvibes			Speech-to-text for Linux that just works
TalkType (zyk42)			Cross-platform Electron voice writing assistant. ASR + LLM for understanding, polishing, and Q&A
TranscriptionSuite			Fully local, private STT app with speaker diarization, Audio Notebook mode, LM Studio integration, longform and live transcription
VoiceType			Fork of Deepgram's Linux starter. Turns the CLI into a GUI and adds hotkey support, API key editing, and cost tracking
WhisperNow			Real-time Whisper transcription
whisper-to-input-desktop			Desktop app using OpenAI's Whisper to transcribe audio and input it as text
whisper-ui			Whisper UI interface
whisperer			Whisper-based transcription tool
whisply			A simple GUI for OpenAI Whisper
wisper			Voice dictation app for Linux. Type directly at cursor with AI-powered transcription
wispr-lite			Lightweight Whisper-based transcription tool

Voice Typing — CLIs

Command-line dictation and transcription tools.

Repository	Stars	Last Updated	Description
BlahST			Offline, real-time, streaming speech-to-text transcription using OpenAI Whisper
blurt			Whisper.cpp-based STT tool
dicti			Dictation tool
froshine			Voice recognition tool
linux-stt-input			Linux STT input method
linux-voice-to-text-ai			Linux voice to text AI
Linux-Dictation-Project			Linux dictation project
sonori			Voice recognition tool
speak-to-ai			Speak to AI assistant
speech-assistant			Faster Whisper-based speech assistant
speedofsound			Voice typing for the Linux desktop
stt-linux (afif)			STT for Linux
STT-Assistant-linux			STT assistant for Linux
super-stt			Enhanced STT tool
talktype			Push-to-talk voice typing for the terminal. Local Whisper, cross-platform
TermlAi			Terminal AI assistant with voice
transcribeAnywhere			Universal transcription tool
VocalFLow			Voice flow dictation tool
voice-type			Linux-first, system-wide dictation tool. Free; the project claims high accuracy and speed
voicekeyboard			Voice keyboard implementation
voxd			Voice input daemon
whisp-away			Whisper-based dictation tool
whisper-dictation (ananjiani)			Whisper-based dictation tool
whisper-hotkey-linux			Whisper hotkey for Linux
whisper-toggle			Toggle-based Whisper control
whisper-transcribe			Whisper transcription tool
whisperd			A daemon for OpenAI Whisper
WhisperVoice			Whisper voice processing tool
WhisperVoiceInput			Whisper voice input tool
whispertrigger			Trigger OpenAI Whisper with a hotkey
whispertux			A simple CLI wrapper for OpenAI's Whisper speech-to-text model
wvcr			Wave voice control recorder

Voice Notes & AI-Enhanced Transcription

Tools focused on capturing voice notes with AI post-processing (LLM cleanup, formatting, summarization).

Repository	Stars	Last Updated	Description
handsfree			Hands-free computing
notesGPT			Voice notes with GPT processing
obsidian-scribe			Obsidian voice note transcription
ScribeWizard			Transcription wizard tool
Thought-Pad			Thought capture with STT
whisper-notes			Whisper-powered note processing
whisper-notes-pro			Professional whisper notes application
Whisper-Notepad-For-Linux			Whisper notepad with post-processing
Whisper-Notepad-Simple			Simplified Whisper notepad using OpenAI API

Real-Time Streaming STT

Libraries and tools for low-latency, live transcription.

Repository	Stars	Last Updated	Description
RealtimeSTT			Low-latency STT library with VAD and wake word activation. Uses WebRTCVAD + SileroVAD + Faster-Whisper
whisper_real_time			Real-time transcription with OpenAI Whisper
whisper_streaming			Real-time streaming Whisper with self-adaptive latency using local agreement policy
WhisperLive			Real-time Whisper transcription from Collabora. OpenVINO support, browser extensions, iOS client
WhisperLiveKit			2025 SOTA streaming STT with speaker diarization. Simul-Whisper for ultra-low latency
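The "local agreement policy" mentioned for whisper_streaming can be illustrated with a small sketch. The idea is that a streaming transcriber re-decodes a growing audio buffer and only commits words on which two consecutive hypotheses agree, that is, their longest common prefix. The class below is a simplified illustration under that assumption (n = 2, word lists only, no timestamps or buffer trimming); the names are hypothetical and this is not the project's actual code.

```python
def common_prefix(a, b):
    """Longest common prefix of two word lists."""
    prefix = []
    for x, y in zip(a, b):
        if x != y:
            break
        prefix.append(x)
    return prefix


class LocalAgreement:
    """Commit only the words confirmed by two consecutive hypotheses."""

    def __init__(self):
        self.committed = []  # words already emitted; never revised
        self.previous = []   # uncommitted tail of the last hypothesis

    def update(self, hypothesis):
        """Feed the full hypothesis for the current buffer; return new words.

        Assumes each hypothesis starts with the already committed words.
        """
        tail = hypothesis[len(self.committed):]
        agreed = common_prefix(self.previous, tail)
        self.committed.extend(agreed)
        self.previous = tail[len(agreed):]
        return agreed


if __name__ == "__main__":
    policy = LocalAgreement()
    for hyp in (["hello", "word"],
                ["hello", "world", "how"],
                ["hello", "world", "how", "are"]):
        print(policy.update(hyp))
```

In this example "word" is never emitted, because the next hypothesis revises it to "world" before the two agree. The trade-off is latency: a word is delayed by at least one extra decoding pass in exchange for output that does not flicker.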

Self-Hosted / Web UI

Docker-deployed tools and web interfaces for self-hosted STT.

Repository	Stars	Last Updated	Description
meeting-minutes			Self-hostable meeting transcription and minutes generation
Scriberr			Voice transcription tool
Whisper-WebUI			A Gradio-based browser interface for Whisper. Easy subtitle generation
whisper-fastapi			Whisper FastAPI service

Cloud STT / API-Based Tools

Projects that use cloud STT APIs for transcription.

OpenAI Whisper API

Repository	Stars	Last Updated	Description
speech2keys			Speech to keystrokes using OpenAI Whisper API

Deepgram API

Repository	Stars	Last Updated	Description
Deepgram-Voice-Keyboard-Ubuntu			STT project using Deepgram API for Ubuntu
fortuna			Deepgram Fortuna project
voice-keyboard-linux			Deepgram voice keyboard for Linux

Hugging Face ASR Models

Resource	Description
ASR Models (Trending)	Trending automatic speech recognition models on Hugging Face
Whisper on Hugging Face	OpenAI Whisper on Hugging Face

Voice Assistants

Privacy-Focused

Open source voice assistants emphasizing local processing and privacy.

Repository	Stars	Last Updated	Description
Neon AI			Privacy-first voice assistant. Offline-capable, customizable. Maintains Mycroft community forums
OpenVoiceOS			Community-driven voice assistant framework. Local processing, privacy-focused. Continuation of Mycroft
Project Alice			Modular smart assistant, fully offline. Built around Snips, guarantees privacy
SEPIA Framework			Self-hosted, privacy-compliant voice assistant ecosystem

General

Repository	Stars	Last Updated	Description
jarvis_linux			Jarvis for Linux
linux-voice-control			Linux voice control system
LinuxVoiceAssistant			Linux voice assistant
Local-Voice			Local voice assistant
Personal-Voice-Assistent			Personal voice assistant
tempest			Voice assistant framework
vosk-cli-dictation			Vosk CLI dictation

Voice Commands & Automation

Tools that translate voice into actions — computer control, voice-to-commands, voice-to-JSON, etc.

Repository	Stars	Last Updated	Description
Handy			Voice-controlled computer interface - handy.computer
home-assistant-assist-desktop			Home Assistant desktop client
JustSayIt.jl			Offline, low-latency translation of speech to computer commands or text. Julia-based
numen	N/A	N/A	Voice-controlled interface (hosted on SourceHut)
voice2json			Voice to JSON converter

Voice Operating Systems

Repository	Stars	Last Updated	Description
ovos-buildroot			OpenVoiceOS - A minimalistic Linux OS bringing the open source voice assistant to IoT and embedded devices

Subtitle Generation

Repository	Stars	Last Updated	Description
auto-subs			Automatic subtitle generation
whisper-subs			Whisper subtitle generation

Service-Specific Voice Tools

Repository	Stars	Last Updated	Description
deepin-voice-note			Deepin voice note application
overlayed			Voice overlay for Discord on Linux
whatsapp_voice_transcription			WhatsApp voice message transcription

Voice Biometrics

Repository	Stars	Last Updated	Description
voiceprint			Voice biometric authentication for Linux

Developer Tools

Repository	Stars	Last Updated	Description
mt_stt			C wrapper for speech-to-text
whisper.cpp-cli			Whisper.cpp CLI wrapper
whisper (Nutlope)			Whisper implementation

Proof of Concepts

Repository	Stars	Last Updated	Description
stt-linux (samcole8)			STT Linux proof of concept
whisperai			Whisper AI proof of concept

Complementary Tools

Tools that aren't STT themselves, but help make the most of voice workflows.

Noise Suppression & Audio Processing

Repository	Stars	Last Updated	Description
easyeffects			Audio effects for PipeWire applications - noise reduction, equalization, and more
NoiseTorch			Real-time microphone noise suppression on Linux

Voice Activity Detection (VAD) & Diarization

Repository	Stars	Last Updated	Description
pyannote-audio			Neural building blocks for speaker diarization: speech activity detection, speaker embedding, clustering
Silero VAD			Enterprise-grade Voice Activity Detector. MIT license, <1ms per chunk on CPU
WebRTC VAD			Python interface to WebRTC Voice Activity Detector
wyoming-openwakeword			Custom wake word detection for Home Assistant

Toolkits & Frameworks

ASR/STT toolkits and frameworks for building voice applications. These are developer libraries rather than end-user applications.

Repository	Stars	Last Updated	Description
Coqui STT			Deep learning STT toolkit (continuation of Mozilla DeepSpeech). Custom model training
fairseq			Meta's sequence modeling toolkit. Includes Wav2Vec 2.0 for self-supervised ASR
FunASR			End-to-end speech recognition toolkit from Alibaba. Industrial-grade models
NVIDIA NeMo			Enterprise ASR toolkit with Conformer/Parakeet models. GPU-accelerated training and inference
sherpa-onnx			STT, TTS, speaker diarization, VAD using next-gen Kaldi with ONNX Runtime. Offline, 12 programming languages
sherpa-onnx-go			Go package for sherpa-onnx speech recognition without network access
SpeechBrain			PyTorch-based speech toolkit for ASR, speaker recognition, speech enhancement
Vosk			Offline speech recognition API. Lightweight, 20+ languages, works on Raspberry Pi

Whisper Variants & Optimizations

Optimized implementations and variants of OpenAI's Whisper model.

Repository	Stars	Last Updated	Description
distil-whisper			Hugging Face's distilled Whisper. 6x faster, 49% smaller, within 1% WER of the original model
faster-whisper			CTranslate2 reimplementation. Up to 4x faster, less memory, 8-bit quantization support
insanely-fast-whisper			CLI for fastest Whisper inference. Batching, flash attention, distil-whisper support
whisper.cpp			C/C++ port of Whisper. CPU inference, minimal dependencies, runs on edge devices
whisper-plus			Advanced Whisper pipelines with diarization, translation, and video transcription support
wyoming-faster-whisper			Wyoming protocol server for faster-whisper. Home Assistant integration
wyoming-whisper-api-client			Wyoming protocol client for Whisper APIs. Centralizes STT for Home Assistant

Text-to-Speech (TTS)

Repository	Stars	Last Updated	Description
claude-tts			TTS plugin for Claude Code — multi-provider support (ElevenLabs, OpenAI, Google, Amazon Polly, Azure, local system TTS)

MCP Servers

MCP (Model Context Protocol) servers that provide STT capabilities.

Repository	Stars	Last Updated	Description
stt-mcp-server-linux			Local speech-to-text MCP server for tmux on Linux (for use with Claude Code and other MCP clients)

Awesome Lists

Repository	Stars	Last Updated	Description
awesome-voice-typing			Curated list of open-source STT and voice typing tools for Linux, macOS, Windows, Android, and iOS
Voice-Apps-Index			Index for STT and dictation apps and WIPs

Ideas & Specifications

Projects at the concept or specification stage.

Repository	Stars	Last Updated	Description
VoiceBox			Idea for a speech tech solution — specced out by Claude

Archived Projects

Notable projects that are no longer actively maintained.

Repository	Stars	Last Updated	Description
AI-Transcription-Notepad			Voice note taking utility using cloud audio multimodal models for single-pass transcription and text cleanup (archived)

Community Resources

GitHub Topics

Topic	Description
asr	Automatic speech recognition
dictation	Dictation tools and applications
speech-to-text	General speech-to-text projects
transcription	Audio/video transcription tools
voice	General voice technology projects
voice-assistant	Voice assistant applications
voice-commands	Voice command implementations
voice-control	Voice control tools
voice-dictation	Voice dictation specific projects
voice-recognition	Voice recognition systems

Subreddits

Subreddit	Focus
r/accessibility	Accessibility tools including voice control
r/LocalLLaMA	Local LLMs (frequently covers voice topics)
r/opensource	Open source projects including voice tools
r/speechrecognition	Speech recognition systems and discussion
r/TextToSpeech	TTS technology (complementary to STT)
r/VoiceTech	Voice technology and applications