DVR Transcriber

January 26, 2026 · View on GitHub

Status: Work in Progress

A CLI utility and workspace template for importing, organizing, and transcribing audio files from Digital Voice Recorders (DVRs).

The Problem

Digital voice recorders produce flat directories of MP3 files with minimal metadata. Managing, organizing, and transcribing these recordings is tedious and time-consuming.

The Solution

A streamlined workflow that:

Imports recordings from a mounted DVR
Organizes files into date-based folders (DDMM format)
Classifies recordings using AI (via short audio samples)
Transcribes selected recordings with speaker identification
Exports transcripts and summaries to cloud storage

Planned Features

Import & Organization

CLI for syncing from DVR mount point
Move (default) or copy modes
Auto-cleanup of accidental recordings (<10 seconds)
Date-based folder organization

AI-Powered Classification

Extract 30-second samples from recordings
Send to Gemini for title and summary generation
Avoid processing full multi-hour recordings just for metadata

Transcription

Selective full transcription via Gemini API
Voice sample-based speaker diarization
Accurate speaker labels without manual intervention

Evidence Mode (Optional)

For recordings that may serve as legal evidence:

No file modifications or renaming
SHA256 checksum calculation
Cloud backup (Google Drive, S3, WORM storage)

Use Cases

Meeting notes and discussions
Personal voice memos
Research interviews
Property viewing notes
Any scenario where voice capture beats typing

Requirements (Planned)

Linux
Python 3.10+
Gemini API key
Mounted DVR (USB mass storage)

Installation

Coming soon

Usage

Coming soon

License

TBD