๐ŸŽ™๏ธ dictationmd

May 5, 2026 ยท View on GitHub

A standard for defining a speech-to-text correction prompt โ€” so your repo's jargon survives the mic.

๐Ÿ“„ The Spec

DICTATION.md is an optional, repository-level prompt file that corrects transcription text to match repository-specific jargon.

  • A speech-enabled editor SHOULD automatically detect DICTATION.md and use it as a post-processing step after STT.
  • A speech-enabled editor MUST ask the user for permission before using DICTATION.md for the first time.
  • Users should be able to choose the LLM model used for correction. DICTATION.md should be tuned for small/local models.

๐Ÿ“‹ Frontmatter

DICTATION.md supports optional YAML frontmatter fields:

FieldTypeDescription
descriptionstringA short description of the dictation prompt, shown to the user when asking for permission.
modelstringThe preferred LLM model identifier to use for correction (e.g. gpt-4o-mini, phi3). The editor MAY use this as a default, but users SHOULD be able to override it.

Example:

---
description: Corrects speech-to-text for the dictationmd project vocabulary.
model: gpt-4o-mini
---

๐Ÿค” Why

Speech-to-text is designed for general-purpose language and commonly mistranscribes repo-specific keywords and acronyms. By providing an LLM-powered mapping and correction layer, transcribed prompts have a much better success rate. It's all about making your code โ€” and your tools โ€” actually understand what you said.

๐Ÿš€ How

You can reuse this dictation prompt generator agentic workflow in your own repo:

๐ŸŒ Examples