README.md

May 5, 2026 · View on GitHub

EchoNote logo

EchoNote

Private, on-device meeting transcription & AI summaries.
No cloud. No bots. No subscriptions.

Quick Start

Download the installer for your OS → Latest Release
Install & open the app — a setup wizard walks you through permissions and model downloads.
Hit Record — live transcription starts instantly. When you're done, click Summarize.

That's it. Everything runs on your machine.

Download

Platform	File	Install
macOS (Apple Silicon)	`EchoNote_x.x.x_macOS-AppleSilicon.dmg`	Open `.dmg` → drag to Applications
macOS (Intel)	`EchoNote_x.x.x_macOS-Intel.dmg`	Open `.dmg` → drag to Applications
Windows	`EchoNote_x.x.x_Windows-x64.exe`	Run the installer
Linux	`EchoNote_x.x.x_Linux-x64.AppImage`	`chmod +x && ./EchoNote_*.AppImage`
Linux (Debian/Ubuntu)	`EchoNote_x.x.x_Linux-x64.deb`	`sudo dpkg -i EchoNote_*.deb`

⚠️ Important — Read Before First Launch

The app is not code-signed yet, so your OS will show a security warning the first time you open it. This is expected for open-source apps — the full source code is available in this repository.

🍎 macOS

macOS will say the app is "damaged" or "can't be opened". Run this once in Terminal:

xattr -cr /Applications/EchoNote.app

Then open the app normally.

🪟 Windows

Windows SmartScreen will show an "unrecognized app" warning.

Click "More info" → "Run anyway".

🐧 Linux

Make the AppImage executable first:

chmod +x EchoNote_*.AppImage
./EchoNote_*.AppImage

Why EchoNote?

Most meeting tools send your audio to the cloud. EchoNote doesn't.

	EchoNote	Cloud-based tools
Data stays on your machine	✅	❌
Works offline	✅	❌
No bot joins your call	✅	❌
No subscription / free forever	✅	❌
Open source & auditable	✅	❌

Features

Core

Live transcription — Real-time speech-to-text powered by Whisper, supporting 90+ languages.
Speaker identification — Automatically detects and labels different speakers using neural embeddings.
AI summaries — One-click meeting summaries generated by a local LLM. Tokens stream in real-time.
Conversational chat — Ask follow-up questions about any meeting ("What did Maria say about the deadline?").
Meeting notes — Take timestamped notes alongside the live transcript; include them in AI context.
Full-text search — Instantly search across all past meetings.

Customization

Custom summary templates — Create prompt templates for any format: 1:1s, sprint reviews, sales calls.
Model selection — Download and switch between multiple ASR and LLM models at runtime — no restart needed.
Hardware-aware recommendations — EchoNote detects your RAM and suggests the optimal models automatically.
Theme switcher — Light, dark, and system-follow modes.

Integrations

MCP server — Expose your meeting transcripts, notes, and summaries to AI coding assistants (VS Code, Cursor, Windsurf, Claude Desktop) via Model Context Protocol. One-click install from Settings.

Reliability

Works with Teams, Zoom, etc. — Captures audio at the device's native sample rate, so it coexists with apps that use exclusive audio formats.
Sleep prevention — Automatically prevents OS sleep during recording.
Auto-updates — The app checks for new versions on launch.
Setup wizard — Guided onboarding: permissions, hardware check, model downloads, and test recording.

How It Works

🎤 Microphone → Silero VAD → Whisper → Speaker ID → Live Transcript
                                                          ↓
                                              Qwen 3 LLM → Summary / Chat

All processing happens locally using open-source models:

Model	Purpose	Size
Whisper	Speech-to-text	148 MB – 1.6 GB
Qwen 3	Summaries & chat	2.6 – 9.2 GB
Silero VAD	Voice activity detection	~1.2 MB
3D-Speaker	Speaker embeddings (ERes2Net / CAM++)	~26 MB
pyannote	Speaker segmentation	~17 MB

Models are downloaded on-demand from the built-in model manager (Settings → Models).

Which models should I choose?

ASR (Speech-to-Text)

Model	Size	RAM	Best for
Whisper Base	~148 MB	1 GB	Quick testing, low-end machines
Whisper Small	~488 MB	2 GB	Decent quality on older hardware
Whisper Fast (distil-large-v3)	~756 MB	3 GB	Distilled — faster inference, good quality
Whisper Turbo Lite (q5_0)	~574 MB	2 GB	Quantized — lighter, nearly same quality
Whisper Turbo ⭐	~1.6 GB	4 GB	Recommended — best speed/quality, 90+ languages

LLM (Summaries & Chat)

Model	Size	RAM	Best for
Qwen 3 Lite (4B)	~2.6 GB	6–8 GB	Machines with <8 GB RAM
Qwen 3 Medium (8B)	~5.2 GB	8–12 GB	Laptops with 8–16 GB RAM
Qwen 3 Large (14B) ⭐	~9.2 GB	14–18 GB	Recommended — best quality for 16 GB+

Quick Pick

Your RAM	ASR	LLM	Disk needed
8 GB	Whisper Turbo Lite	Qwen 3 Lite	~3.2 GB
16 GB	Whisper Turbo	Qwen 3 Large	~10.8 GB
32 GB+	Whisper Turbo	Qwen 3 Large	~10.8 GB

System Requirements

	Minimum	Recommended
macOS	12.3+ (Monterey)	Apple Silicon, 16 GB RAM
Windows	10/11 (64-bit)	16 GB RAM
Linux	Ubuntu 22.04+ / AppImage-compatible	16 GB RAM
Disk	~3 GB (app + base models)	~12 GB (with full LLM)

Privacy & Security

Zero network access during meetings — all AI runs locally.
No telemetry. No usage data, analytics, or crash reports.
Local storage only. Meetings live in a SQLite database on your machine.
Open source. Audit every line of code in this repository.

Roadmap

Built With

Layer	Technology
Desktop shell	Tauri 2 (Rust + native webview)
Backend	Rust 1.88+
Frontend	React 18, TypeScript, Tailwind CSS, i18n (en / es)
Speech-to-text	whisper.cpp
Summaries & chat	llama.cpp (Qwen 3)
Voice activity	Silero VAD via ONNX
Speaker ID	3D-Speaker + pyannote via ONNX
Storage	SQLite with FTS5 full-text search

Contributing

Contributions are welcome — see CONTRIBUTING.md for guidelines.