Linux Voice Tech

March 29, 2026 · View on GitHub

alt text

An index of voice technology tools accessible to Linux users

Last Updated Repository Type Resources License

Star counts and last commit dates are shown via shields.io badges and update dynamically.

For background, notes on how the repo is organized, and inclusion criteria, see notes.md. For a getting started guide, see starting-points.md.

Keywords

  • Automatic speech recognition (ASR)
  • Speech-to-text (STT)
  • Text-to-speech (TTS)
  • Linux voice typing
  • Linux dictation
  • Linux TTS
  • Voice control
  • Transcription

Quick Navigation

SectionWhat's in it
Wayland-Compatible STTTools with explicit Wayland virtual input support
Voice Typing — GUIsDesktop apps for dictation and transcription
Voice Typing — CLIsCommand-line dictation and transcription tools
Voice Notes & AI-EnhancedNote-taking with AI post-processing
Real-Time Streaming STTLow-latency, live transcription libraries
Self-Hosted / Web UIDocker/web-based transcription services
Cloud STT / API-BasedTools using OpenAI, Deepgram, or other cloud APIs
Voice AssistantsVoice-controlled assistant applications
Voice Commands & AutomationVoice-to-action, voice-to-MCP, computer control
Toolkits & FrameworksDeveloper libraries for building voice apps
Whisper Variants & OptimizationsFaster/smaller/better Whisper implementations
Complementary ToolsVAD, diarization, noise suppression
Text-to-Speech (TTS)TTS tools and frameworks
MCP ServersModel Context Protocol voice servers
Awesome ListsOther curated voice tech lists
Community ResourcesGitHub topics, subreddits

STT Tools with Wayland Support

Projects with explicit Wayland support. Particularly valuable for users on modern Linux desktops (GNOME, KDE Plasma on Wayland, Hyprland, Sway, niri, etc.) where X11 virtual input methods don't work.

RepositoryStarsLast UpdatedDescription
dictation-toolsStarsLast CommitDictation tools with Wayland support
freespeakStarsLast CommitVoice dictation with Wayland support
hyprvoiceStarsLast CommitVoice dictation for Hyprland
hyprwhsprStarsLast CommitWhisper-based voice input for Hyprland
local-dictation-assistantStarsLast CommitLocal dictation assistant with Wayland support
niri-transcribeStarsLast CommitTranscription tool for niri compositor
swictationStarsLast CommitVoice dictation for Sway/Wayland
TalkType (ronb1964)StarsLast CommitPrivacy-first voice dictation for Linux Wayland. Press key to talk, release to type. Whisper AI, 100% offline
vocalinuxStarsLast CommitOffline voice dictation for Linux. Whisper.cpp, Whisper & VOSK engines, GPU-accelerated, X11 + Wayland
voice-typing-linuxStarsLast CommitVoice typing for Linux with Wayland support
wayland-voice-dictationStarsLast CommitVoice dictation designed for Wayland
whisper-waylandStarsLast CommitWhisper integration for Wayland
whispyStarsLast CommitSTT tool with Wayland support

Voice Typing — GUIs

Desktop applications for dictation and transcription with graphical interfaces.

RepositoryStarsLast UpdatedDescription
AI-Typer-V2StarsLast CommitVoice dictation with multimodal AI cleanup — speak naturally, get polished text
aTrainStarsLast CommitAudio transcription training tool
audiovStarsLast CommitSpeech-to-text, voice-typing, dictation software for Linux distributions
BuzzStarsLast CommitOffline audio transcription and translation. Supports Whisper, Whisper.cpp, Faster-Whisper. Available via Flatpak/Snap. Vulkan GPU support
dsnoteStarsLast CommitSpeech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation
LinuxWhisperStarsLast CommitWhisper for Linux
maVoice-LinuxStarsLast CommitVoice control for Linux
mint-whisperStarsLast CommitWhisper for Linux Mint
murmureStarsLast CommitFully local, private, cross-platform STT with LLM post-processing
OpenFlowStarsLast CommitLocal speech-to-text app for Linux
OpenWisprStarsLast CommitOpen source Whisper-based voice assistant
Parakeet-Type-UbuntuStarsLast CommitOn-device voice typing for Linux using Parakeet and NeMo ASR models via sherpa-onnx. No cloud, no GPU required
sottoStarsLast CommitLocal speech-to-text transcription app for Linux using Whisper models
soundvibesStarsLast CommitSpeech-to-text for Linux that just works
TalkType (zyk42)StarsLast CommitCross-platform Electron voice writing assistant. ASR + LLM for understanding, polishing, and Q&A
TranscriptionSuiteStarsLast CommitFully local, private STT app with speaker diarization, Audio Notebook mode, LM Studio integration, longform and live transcription
VoiceTypeStarsLast CommitFork of Deepgram's Linux starter. CLI to GUI + hotkey support, API key editing, cost tracking
WhisperNowStarsLast CommitReal-time Whisper transcription
whisper-to-input-desktopStarsLast CommitDesktop app using OpenAI's Whisper to transcribe audio and input it as text
whisper-uiStarsLast CommitWhisper UI interface
whispererStarsLast CommitWhisper-based transcription tool
whisplyStarsLast CommitA simple GUI for OpenAI Whisper
wisperStarsLast CommitVoice dictation app for Linux. Type directly at cursor with AI-powered transcription
wispr-liteStarsLast CommitLightweight Whisper-based transcription tool

Voice Typing — CLIs

Command-line dictation and transcription tools.

RepositoryStarsLast UpdatedDescription
BlahSTStarsLast CommitOffline, real-time, streaming speech-to-text transcription using OpenAI Whisper
blurtStarsLast CommitWhisper.cpp-based STT tool
dictiStarsLast CommitDictation tool
froshineStarsLast CommitVoice recognition tool
linux-stt-inputStarsLast CommitLinux STT input method
linux-voice-to-text-aiStarsLast CommitLinux voice to text AI
Linux-Dictation-ProjectStarsLast CommitLinux dictation project
sonoriStarsLast CommitVoice recognition tool
speak-to-aiStarsLast CommitSpeak to AI assistant
speech-assistantStarsLast CommitFaster Whisper-based speech assistant
speedofsoundStarsLast CommitVoice typing for the Linux desktop
stt-linux (afif)StarsLast CommitSTT for Linux
STT-Assistant-linuxStarsLast CommitSTT assistant for Linux
super-sttStarsLast CommitEnhanced STT tool
talktypeStarsLast CommitPush-to-talk voice typing for the terminal. Local Whisper, cross-platform
TermlAiStarsLast CommitTerminal AI assistant with voice
transcribeAnywhereStarsLast CommitUniversal transcription tool
VocalFLowStarsLast CommitVoice flow dictation tool
voice-typeStarsLast CommitLinux-first system-wide dictation tool. Unmatched accuracy and speed, totally free
voicekeyboardStarsLast CommitVoice keyboard implementation
voxdStarsLast CommitVoice input daemon
whisp-awayStarsLast CommitWhisper-based dictation tool
whisper-dictation (ananjiani)StarsLast CommitWhisper-based dictation tool
whisper-hotkey-linuxStarsLast CommitWhisper hotkey for Linux
whisper-toggleStarsLast CommitToggle-based Whisper control
whisper-transcribeStarsLast CommitWhisper transcription tool
whisperdStarsLast CommitA daemon for OpenAI Whisper
WhisperVoiceStarsLast CommitWhisper voice processing tool
WhisperVoiceInputStarsLast CommitWhisper voice input tool
whispertriggerStarsLast CommitTrigger OpenAI Whisper with a hotkey
whispertuxStarsLast CommitA simple CLI wrapper for OpenAI's Whisper speech-to-text model
wvcrStarsLast CommitWave voice control recorder

Voice Notes & AI-Enhanced Transcription

Tools focused on capturing voice notes with AI post-processing (LLM cleanup, formatting, summarization).

RepositoryStarsLast UpdatedDescription
handsfreeStarsLast CommitHands-free computing
notesGPTStarsLast CommitVoice notes with GPT processing
obsidian-scribeStarsLast CommitObsidian voice note transcription
ScribeWizardStarsLast CommitTranscription wizard tool
Thought-PadStarsLast CommitThought capture with STT
whisper-notesStarsLast CommitWhisper-powered note processing
whisper-notes-proStarsLast CommitProfessional whisper notes application
Whisper-Notepad-For-LinuxStarsLast CommitWhisper notepad with post-processing
Whisper-Notepad-SimpleStarsLast CommitSimplified Whisper notepad using OpenAI API

Real-Time Streaming STT

Libraries and tools for low-latency, live transcription.

RepositoryStarsLast UpdatedDescription
RealtimeSTTStarsLast CommitLow-latency STT library with VAD, wake word activation. Uses WebRTCVAD + SileroVAD + Faster-Whisper
whisper_real_timeStarsLast CommitReal-time transcription with OpenAI Whisper
whisper_streamingStarsLast CommitReal-time streaming Whisper with self-adaptive latency using local agreement policy
WhisperLiveStarsLast CommitReal-time Whisper transcription from Collabora. OpenVINO support, browser extensions, iOS client
WhisperLiveKitStarsLast Commit2025 SOTA streaming STT with speaker diarization. Simul-Whisper for ultra-low latency

Self-Hosted / Web UI

Docker-deployed tools and web interfaces for self-hosted STT.

RepositoryStarsLast UpdatedDescription
meeting-minutesStarsLast CommitSelf-hostable meeting transcription and minutes generation
ScriberrStarsLast CommitVoice transcription tool
Whisper-WebUIStarsLast CommitA Gradio-based browser interface for Whisper. Easy subtitle generation
whisper-fastapiStarsLast CommitWhisper FastAPI service

Cloud STT / API-Based Tools

Projects that use cloud STT APIs for transcription.

OpenAI Whisper API

RepositoryStarsLast UpdatedDescription
speech2keysStarsLast CommitSpeech to keystrokes using OpenAI Whisper API

Deepgram API

RepositoryStarsLast UpdatedDescription
Deepgram-Voice-Keyboard-UbuntuStarsLast CommitSTT project using Deepgram API for Ubuntu
fortunaStarsLast CommitDeepgram Fortuna project
voice-keyboard-linuxStarsLast CommitDeepgram voice keyboard for Linux

Hugging Face ASR Models

ResourceDescription
ASR Models (Trending)Trending automatic speech recognition models on Hugging Face
Whisper on Hugging FaceOpenAI Whisper on Hugging Face

Voice Assistants

Privacy-Focused

Open source voice assistants emphasizing local processing and privacy.

RepositoryStarsLast UpdatedDescription
Neon AIStarsLast CommitPrivacy-first voice assistant. Offline-capable, customizable. Maintains Mycroft community forums
OpenVoiceOSStarsLast CommitCommunity-driven voice assistant framework. Local processing, privacy-focused. Continuation of Mycroft
Project AliceStarsLast CommitModular smart assistant, fully offline. Built around Snips, guarantees privacy
SEPIA FrameworkStarsLast CommitSelf-hosted, privacy-compliant voice assistant ecosystem

General

RepositoryStarsLast UpdatedDescription
jarvis_linuxStarsLast CommitJarvis for Linux
linux-voice-controlStarsLast CommitLinux voice control system
LinuxVoiceAssistantStarsLast CommitLinux voice assistant
Local-VoiceStarsLast CommitLocal voice assistant
Personal-Voice-AssistentStarsLast CommitPersonal voice assistant
tempestStarsLast CommitVoice assistant framework
vosk-cli-dictationStarsLast CommitVosk CLI dictation

Voice Commands & Automation

Tools that translate voice into actions — computer control, voice-to-commands, voice-to-JSON, etc.

RepositoryStarsLast UpdatedDescription
HandyStarsLast CommitVoice-controlled computer interface - handy.computer
home-assistant-assist-desktopStarsLast CommitHome Assistant desktop client
JustSayIt.jlStarsLast CommitOffline, low-latency translation of speech to computer commands or text. Julia-based
numenN/AN/AVoice-controlled interface (hosted on SourceHut)
voice2jsonStarsLast CommitVoice to JSON converter

Voice Operating Systems

RepositoryStarsLast UpdatedDescription
ovos-buildrootStarsLast CommitOpenVoiceOS - A minimalistic Linux OS bringing the open source voice assistant to IoT and embedded devices

Subtitle Generation

RepositoryStarsLast UpdatedDescription
auto-subsStarsLast CommitAutomatic subtitle generation
whisper-subsStarsLast CommitWhisper subtitle generation

Service-Specific Voice Tools

RepositoryStarsLast UpdatedDescription
deepin-voice-noteStarsLast CommitDeepin voice note application
overlayedStarsLast CommitVoice overlay for Discord on Linux
whatsapp_voice_transcriptionStarsLast CommitWhatsApp voice message transcription

Voice Biometrics

RepositoryStarsLast UpdatedDescription
voiceprintStarsLast CommitVoice biometric authentication for Linux

Developer Tools

RepositoryStarsLast UpdatedDescription
mt_sttStarsLast CommitC wrapper for speech-to-text
whisper.cpp-cliStarsLast CommitWhisper.cpp CLI wrapper
whisper (Nutlope)StarsLast CommitWhisper implementation

Proof of Concepts

RepositoryStarsLast UpdatedDescription
stt-linux (samcole8)StarsLast CommitSTT Linux proof of concept
whisperaiStarsLast CommitWhisper AI proof of concept

Complementary Tools

Tools that aren't STT themselves, but help make the most of voice workflows.

Noise Suppression & Audio Processing

RepositoryStarsLast UpdatedDescription
easyeffectsStarsLast CommitAudio effects for PipeWire applications - noise reduction, equalization, and more
NoiseTorchStarsLast CommitReal-time microphone noise suppression on Linux

Voice Activity Detection (VAD) & Diarisation

RepositoryStarsLast UpdatedDescription
pyannote-audioStarsLast CommitNeural building blocks for speaker diarization: speech activity detection, speaker embedding, clustering
Silero VADStarsLast CommitEnterprise-grade Voice Activity Detector. MIT license, <1ms per chunk on CPU
WebRTC VADStarsLast CommitPython interface to WebRTC Voice Activity Detector
wyoming-openwakewordStarsLast CommitCustom wake word detection for Home Assistant

Toolkits & Frameworks

ASR/STT toolkits and frameworks for building voice applications. Developer libraries rather than end-user applications.

RepositoryStarsLast UpdatedDescription
Coqui STTStarsLast CommitDeep learning STT toolkit (continuation of Mozilla DeepSpeech). Custom model training
fairseqStarsLast CommitMeta's sequence modeling toolkit. Includes Wav2Vec 2.0 for self-supervised ASR
FunASRStarsLast CommitEnd-to-end speech recognition toolkit from Alibaba. Industrial-grade models
NVIDIA NeMoStarsLast CommitEnterprise ASR toolkit with Conformer/Parakeet models. GPU-accelerated training and inference
sherpa-onnxStarsLast CommitSTT, TTS, speaker diarization, VAD using next-gen Kaldi with ONNX Runtime. Offline, 12 programming languages
sherpa-onnx-goStarsLast CommitGo package for sherpa-onnx speech recognition without network access
SpeechBrainStarsLast CommitPyTorch-based speech toolkit for ASR, speaker recognition, speech enhancement
VoskStarsLast CommitOffline speech recognition API. Lightweight, 20+ languages, works on Raspberry Pi

Whisper Variants & Optimizations

Optimized implementations and variants of OpenAI's Whisper model.

RepositoryStarsLast UpdatedDescription
distil-whisperStarsLast CommitHuggingFace's distilled Whisper. 6x faster, 49% smaller, within 1% WER
faster-whisperStarsLast CommitCTranslate2 reimplementation. 4x faster, less memory, 8-bit quantization support
insanely-fast-whisperStarsLast CommitCLI for fastest Whisper inference. Batching, flash attention, distil-whisper support
whisper.cppStarsLast CommitC/C++ port of Whisper. CPU inference, minimal dependencies, runs on edge devices
whisper-plusStarsLast CommitAdvanced Whisper pipelines with diarization, translation, and video transcription support
wyoming-faster-whisperStarsLast CommitWyoming protocol server for faster-whisper. Home Assistant integration
wyoming-whisper-api-clientStarsLast CommitWyoming protocol client for Whisper APIs. Centralizes STT for Home Assistant

Text-to-Speech (TTS)

RepositoryStarsLast UpdatedDescription
claude-ttsStarsLast CommitTTS plugin for Claude Code — multi-provider support (ElevenLabs, OpenAI, Google, Amazon Polly, Azure, local system TTS)

MCP Servers

MCP (Model Context Protocol) servers that provide STT capabilities.

RepositoryStarsLast UpdatedDescription
stt-mcp-server-linuxStarsLast CommitLocal speech-to-text MCP server for Tmux on Linux (for use with Claude Code and other MCP clients)

Awesome Lists

RepositoryStarsLast UpdatedDescription
awesome-voice-typingStarsLast CommitCurated list of open-source STT and voice typing tools for Linux, macOS, Windows, Android, and iOS
Voice-Apps-IndexStarsLast CommitIndex for STT and dictation apps and WIPs

Ideas & Specifications

Projects at the concept or specification stage.

RepositoryStarsLast UpdatedDescription
VoiceBoxStarsLast CommitIdea for a speech tech solution — specced out by Claude

Archived Projects

Notable projects that are no longer actively maintained.

RepositoryStarsLast UpdatedDescription
AI-Transcription-NotepadStarsLast CommitVoice note taking utility using cloud audio multimodal models for single-pass transcription and text cleanup (archived)

Community Resources

GitHub Topics

TopicDescription
asrAutomatic speech recognition
dictationDictation tools and applications
speech-to-textGeneral speech-to-text projects
transcriptionAudio/video transcription tools
voiceGeneral voice technology projects
voice-assistantVoice assistant applications
voice-commandsVoice command implementations
voice-controlVoice control tools
voice-dictationVoice dictation specific projects
voice-recognitionVoice recognition systems

Subreddits

SubredditFocus
r/accessibilityAccessibility tools including voice control
r/LocalLLaMALocal LLMs (frequently covers voice topics)
r/opensourceOpen source projects including voice tools
r/speechrecognitionSpeech recognition systems and discussion
r/TextToSpeechTTS technology (complementary to STT)
r/VoiceTechVoice technology and applications