DVR Transcriber
January 26, 2026
Status: Work in Progress
A CLI utility and workspace template for importing, organizing, and transcribing audio files from Digital Voice Recorders (DVRs).
The Problem
Digital voice recorders produce flat directories of MP3 files with minimal metadata. Managing, organizing, and transcribing these recordings is tedious and time-consuming.
The Solution
A streamlined workflow that:
- Imports recordings from a mounted DVR
- Organizes files into date-based folders (DDMM format)
- Classifies recordings using AI, working from short audio samples rather than full files
- Transcribes selected recordings with speaker identification
- Exports transcripts and summaries to cloud storage
Planned Features
Import & Organization
- CLI for syncing from DVR mount point
- Move (default) or copy modes
- Auto-cleanup of accidental recordings (shorter than 10 seconds)
- Date-based folder organization
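Since the CLI is not implemented yet, the following is only a minimal sketch of how the import step might look. The function name `import_recordings`, its parameters, and the use of each file's modification time as the recording date are all assumptions made for illustration, not part of the project.

```python
import shutil
from datetime import datetime
from pathlib import Path


def import_recordings(source: Path, dest: Path, copy: bool = False) -> list[Path]:
    """Place each MP3 found in `source` into a DDMM folder under `dest`.

    Hypothetical helper: moves files by default, copies when `copy` is True.
    """
    imported = []
    for item in sorted(source.iterdir()):
        # Match the extension case-insensitively (recorders often write .MP3).
        if not item.is_file() or item.suffix.lower() != ".mp3":
            continue
        # Assumption: the modification time approximates the recording date.
        recorded = datetime.fromtimestamp(item.stat().st_mtime)
        folder = dest / recorded.strftime("%d%m")  # e.g. 2601 for 26 January
        folder.mkdir(parents=True, exist_ok=True)
        target = folder / item.name
        if copy:
            shutil.copy2(item, target)  # copy2 also preserves timestamps
        else:
            shutil.move(str(item), str(target))
        imported.append(target)
    return imported
```

A call such as `import_recordings(Path("/media/dvr"), Path("recordings"))` would empty the recorder; passing `copy=True` would leave the originals in place. The mount path here is a placeholder.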
AI-Powered Classification
- Extract 30-second samples from recordings
- Send to Gemini for title and summary generation
- Avoid processing full multi-hour recordings just for metadata
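The sampling step above could be sketched as follows. This assumes `ffmpeg` is installed and is the tool used for cutting audio, which the project has not stated; `sample_command` and `extract_sample` are hypothetical names. Sending the sample to Gemini is left out because that part of the design is not yet specified.

```python
import subprocess
from pathlib import Path

SAMPLE_SECONDS = 30  # sample length named in the feature list


def sample_command(recording: Path, sample: Path,
                   seconds: int = SAMPLE_SECONDS) -> list[str]:
    """Build an ffmpeg command that keeps only the first `seconds` of audio."""
    return [
        "ffmpeg",
        "-y",                   # overwrite an existing sample file
        "-i", str(recording),   # input recording
        "-t", str(seconds),     # limit the output duration
        "-c", "copy",           # stream copy, no re-encoding
        str(sample),
    ]


def extract_sample(recording: Path, sample: Path,
                   seconds: int = SAMPLE_SECONDS) -> None:
    """Run ffmpeg; raises CalledProcessError if ffmpeg exits with an error."""
    subprocess.run(sample_command(recording, sample, seconds), check=True)
```

Stream copying keeps the step fast even for multi-hour files, since only the first 30 seconds are read and nothing is re-encoded.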
Transcription
- Selective full transcription via Gemini API
- Voice sample-based speaker diarization
- Accurate speaker labels without manual intervention
Evidence Mode (Optional)
For recordings that may serve as legal evidence:
- No file modifications or renaming
- SHA256 checksum calculation
- Cloud backup (Google Drive, S3, WORM storage)
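For the checksum step, a minimal sketch using Python's standard `hashlib` module might look like this. The function name `sha256_of` and the chunk size are illustrative assumptions; the file is opened read-only, in keeping with the no-modification rule.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1024 * 1024) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks.

    Chunked reading keeps memory use flat for multi-hour recordings.
    """
    digest = hashlib.sha256()
    with path.open("rb") as handle:  # read-only: the recording is never altered
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Storing the digest alongside the backup lets anyone later confirm that the archived file matches the original recording.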
Use Cases
- Meeting notes and discussions
- Personal voice memos
- Research interviews
- Property viewing notes
- Any scenario where voice capture beats typing
Requirements (Planned)
- Linux
- Python 3.10+
- Gemini API key
- Mounted DVR (USB mass storage)
Installation
Coming soon
Usage
Coming soon
License
TBD