Code Edit Format Benchmark

February 27, 2026 ยท View on GitHub

Benchmark guide for comparing edit format performance and render behavior.


Purpose

Use this benchmark to compare edit-format throughput and output quality for Codex-focused editing workloads.


Quick Start

npm run bench:edit-formats

Smoke run:

npm run bench:edit-formats:smoke

Render dashboard output:

npm run bench:edit-formats:render

Output Files

OutputLocation
Benchmark report JSON.tmp/edit-format-benchmark-*.json
Render preview artifacts.tmp/edit-format-benchmark-render-*.txt

(Temporary benchmark artifacts are not source files.)


Common Presets

PresetGoal
codex-coreBaseline Codex-oriented evaluation
smokeFast sanity check for CI/local validation

Interpretation Checklist

  1. Compare latency per format.
  2. Measure token/size overhead.
  3. Review success/error rates.
  4. Validate output consistency.
  5. Confirm no regressions in editing fidelity.

Cleanup

Bash:

rm -rf .tmp

PowerShell:

Remove-Item ".tmp" -Recurse -Force -ErrorAction SilentlyContinue