๐๏ธ dictationmd
May 5, 2026 ยท View on GitHub
A standard for defining a speech-to-text correction prompt โ so your repo's jargon survives the mic.
๐ The Spec
DICTATION.md is an optional, repository-level prompt file that corrects transcription text to match repository-specific jargon.
- A speech-enabled editor SHOULD automatically detect
DICTATION.mdand use it as a post-processing step after STT. - A speech-enabled editor MUST ask the user for permission before using
DICTATION.mdfor the first time. - Users should be able to choose the LLM model used for correction.
DICTATION.mdshould be tuned for small/local models.
๐ Frontmatter
DICTATION.md supports optional YAML frontmatter fields:
| Field | Type | Description |
|---|---|---|
description | string | A short description of the dictation prompt, shown to the user when asking for permission. |
model | string | The preferred LLM model identifier to use for correction (e.g. gpt-4o-mini, phi3). The editor MAY use this as a default, but users SHOULD be able to override it. |
Example:
---
description: Corrects speech-to-text for the dictationmd project vocabulary.
model: gpt-4o-mini
---
๐ค Why
Speech-to-text is designed for general-purpose language and commonly mistranscribes repo-specific keywords and acronyms. By providing an LLM-powered mapping and correction layer, transcribed prompts have a much better success rate. It's all about making your code โ and your tools โ actually understand what you said.
๐ How
You can reuse this dictation prompt generator agentic workflow in your own repo: