README.md
May 5, 2026 · View on GitHub
EchoNote
Private, on-device meeting transcription & AI summaries.
No cloud. No bots. No subscriptions.
Quick Start
- Download the installer for your OS → Latest Release
- Install & open the app — a setup wizard walks you through permissions and model downloads.
- Hit Record — live transcription starts instantly. When you're done, click Summarize.
That's it. Everything runs on your machine.
Download
| Platform | File | Install |
|---|---|---|
| macOS (Apple Silicon) | EchoNote_x.x.x_macOS-AppleSilicon.dmg | Open .dmg → drag to Applications |
| macOS (Intel) | EchoNote_x.x.x_macOS-Intel.dmg | Open .dmg → drag to Applications |
| Windows | EchoNote_x.x.x_Windows-x64.exe | Run the installer |
| Linux | EchoNote_x.x.x_Linux-x64.AppImage | chmod +x && ./EchoNote_*.AppImage |
| Linux (Debian/Ubuntu) | EchoNote_x.x.x_Linux-x64.deb | sudo dpkg -i EchoNote_*.deb |
⚠️ Important — Read Before First Launch
The app is not code-signed yet, so your OS will show a security warning the first time you open it. This is expected for open-source apps — the full source code is available in this repository.
🍎 macOSmacOS will say the app is "damaged" or "can't be opened". Run this once in Terminal:
Then open the app normally. |
🪟 WindowsWindows SmartScreen will show an "unrecognized app" warning. Click "More info" → "Run anyway". |
🐧 LinuxMake the AppImage executable first:
|
Why EchoNote?
Most meeting tools send your audio to the cloud. EchoNote doesn't.
| EchoNote | Cloud-based tools | |
|---|---|---|
| Data stays on your machine | ✅ | ❌ |
| Works offline | ✅ | ❌ |
| No bot joins your call | ✅ | ❌ |
| No subscription / free forever | ✅ | ❌ |
| Open source & auditable | ✅ | ❌ |
Features
Core
- Live transcription — Real-time speech-to-text powered by Whisper, supporting 90+ languages.
- Speaker identification — Automatically detects and labels different speakers using neural embeddings.
- AI summaries — One-click meeting summaries generated by a local LLM. Tokens stream in real-time.
- Conversational chat — Ask follow-up questions about any meeting ("What did Maria say about the deadline?").
- Meeting notes — Take timestamped notes alongside the live transcript; include them in AI context.
- Full-text search — Instantly search across all past meetings.
Customization
- Custom summary templates — Create prompt templates for any format: 1:1s, sprint reviews, sales calls.
- Model selection — Download and switch between multiple ASR and LLM models at runtime — no restart needed.
- Hardware-aware recommendations — EchoNote detects your RAM and suggests the optimal models automatically.
- Theme switcher — Light, dark, and system-follow modes.
Integrations
- MCP server — Expose your meeting transcripts, notes, and summaries to AI coding assistants (VS Code, Cursor, Windsurf, Claude Desktop) via Model Context Protocol. One-click install from Settings.
Reliability
- Works with Teams, Zoom, etc. — Captures audio at the device's native sample rate, so it coexists with apps that use exclusive audio formats.
- Sleep prevention — Automatically prevents OS sleep during recording.
- Auto-updates — The app checks for new versions on launch.
- Setup wizard — Guided onboarding: permissions, hardware check, model downloads, and test recording.
How It Works
🎤 Microphone → Silero VAD → Whisper → Speaker ID → Live Transcript
↓
Qwen 3 LLM → Summary / Chat
All processing happens locally using open-source models:
| Model | Purpose | Size |
|---|---|---|
| Whisper | Speech-to-text | 148 MB – 1.6 GB |
| Qwen 3 | Summaries & chat | 2.6 – 9.2 GB |
| Silero VAD | Voice activity detection | ~1.2 MB |
| 3D-Speaker | Speaker embeddings (ERes2Net / CAM++) | ~26 MB |
| pyannote | Speaker segmentation | ~17 MB |
Models are downloaded on-demand from the built-in model manager (Settings → Models).
Which models should I choose?
ASR (Speech-to-Text)
| Model | Size | RAM | Best for |
|---|---|---|---|
| Whisper Base | ~148 MB | 1 GB | Quick testing, low-end machines |
| Whisper Small | ~488 MB | 2 GB | Decent quality on older hardware |
| Whisper Fast (distil-large-v3) | ~756 MB | 3 GB | Distilled — faster inference, good quality |
| Whisper Turbo Lite (q5_0) | ~574 MB | 2 GB | Quantized — lighter, nearly same quality |
| Whisper Turbo ⭐ | ~1.6 GB | 4 GB | Recommended — best speed/quality, 90+ languages |
LLM (Summaries & Chat)
| Model | Size | RAM | Best for |
|---|---|---|---|
| Qwen 3 Lite (4B) | ~2.6 GB | 6–8 GB | Machines with <8 GB RAM |
| Qwen 3 Medium (8B) | ~5.2 GB | 8–12 GB | Laptops with 8–16 GB RAM |
| Qwen 3 Large (14B) ⭐ | ~9.2 GB | 14–18 GB | Recommended — best quality for 16 GB+ |
Quick Pick
| Your RAM | ASR | LLM | Disk needed |
|---|---|---|---|
| 8 GB | Whisper Turbo Lite | Qwen 3 Lite | ~3.2 GB |
| 16 GB | Whisper Turbo | Qwen 3 Large | ~10.8 GB |
| 32 GB+ | Whisper Turbo | Qwen 3 Large | ~10.8 GB |
System Requirements
| Minimum | Recommended | |
|---|---|---|
| macOS | 12.3+ (Monterey) | Apple Silicon, 16 GB RAM |
| Windows | 10/11 (64-bit) | 16 GB RAM |
| Linux | Ubuntu 22.04+ / AppImage-compatible | 16 GB RAM |
| Disk | ~3 GB (app + base models) | ~12 GB (with full LLM) |
Privacy & Security
- Zero network access during meetings — all AI runs locally.
- No telemetry. No usage data, analytics, or crash reports.
- Local storage only. Meetings live in a SQLite database on your machine.
- Open source. Audit every line of code in this repository.
Roadmap
- Live transcription with Whisper (90+ languages)
- Speaker identification (diarization)
- Meeting persistence and full-text search
- Local LLM summaries with streaming
- Conversational chat with meeting context
- Custom summary templates
- Runtime model selection
- Timestamped meeting notes
- Hardware-aware model recommendations
- Setup wizard & onboarding
- Light / dark / system themes
- MCP server for AI coding assistants
- System audio capture (transcribe the other side of the call)
- Encrypted local storage (SQLCipher)
Built With
| Layer | Technology |
|---|---|
| Desktop shell | Tauri 2 (Rust + native webview) |
| Backend | Rust 1.88+ |
| Frontend | React 18, TypeScript, Tailwind CSS, i18n (en / es) |
| Speech-to-text | whisper.cpp |
| Summaries & chat | llama.cpp (Qwen 3) |
| Voice activity | Silero VAD via ONNX |
| Speaker ID | 3D-Speaker + pyannote via ONNX |
| Storage | SQLite with FTS5 full-text search |
Contributing
Contributions are welcome — see CONTRIBUTING.md for guidelines.
PolyForm Noncommercial 1.0.0 © 2026 Luis MC