README.md

April 2, 2026 · View on GitHub

⚠️ Deprecated: Big Iron was an experimental prototype and is no longer maintained. This functionality has been rebuilt and shipped as supermodel factory in the supermodel CLI. Please use that instead.


        _________
       /  _____  \
      /___________\        ____  ___ ____   ___ ____  ___  _  _
      |  (o)  (o) |       | __ )| |/ ___|  |_ _|  _ \/ _ \| \| |
      |     ^     |       |  _ \| | |  _    | || |_) |   /| .` |
      |   [---]   |       | |_) | | |_| |   | ||  _ <| |\ \| |\  |
      |___________|       |____/|_|\____|  |___|_| \_\_| \_|_| \_|
      /|         |\
     (_)         (_)      AI-Native SDLC — Ride the graph. Ship clean iron.

An AI-native software development system that integrates Hermes Agent with the Supermodel Code Graph API to make structural code intelligence a first-class citizen at every phase of the SDLC.

The graph replaces file loading for context. The agent stays architecturally aligned across sessions. Phase gates enforce quality without human gatekeeping. The system improves itself with every completed task.


Quick Start

# 1. Set required env vars
export SUPERMODEL_API_KEY=your_key_here
export GITHUB_TOKEN=your_token_here   # optional — for PR automation

# 2. Run setup (installs Supermodel MCP, Hermes config, and all skills)
./scripts/setup.sh

# 3. Launch Hermes
hermes

# 4. In Hermes, reload MCP connections
/reload-mcp

# 5. Point the factory at a goal
factory run ./my-project "Add rate limiting to the order API"

The factory CLI

The top-level entry point. Three modes:

# Run the full 8-phase SDLC cycle autonomously on a goal
factory run <codebase> "<goal>"

# Read-only health check — structural metrics and risk report
factory health <codebase>

# Continuous improvement pass — health + refactor + dead code sweep
factory improve <codebase>

Examples:

factory run ./demo "Add discount system to orders"
factory health ./demo
factory improve ./demo

Hermes is launched with all relevant skills pre-loaded and runs autonomously through the phases, using the graph at each step.


Demo

A complete layered Python app is included in demo/ as a factory target. Run the full 8-phase cycle on it:

./scripts/demo_run.sh ./demo "Add rate limiting to the order API"

Or just run the demo tests standalone:

cd demo && python3 -m pytest tests/ -v

The demo app has:

  • Domain layerUser, Order entities and domain services
  • Infrastructure layer — in-memory repositories
  • Application layerUserService, OrderService use cases
  • Orchestration layerSDLCRunner, the phase gate coordinator
  • 36 tests organized in dependency order

How It Works

See docs/ARCHITECTURE.md for the full design. The short version:

Supermodel ships as an MCP server that Hermes consumes natively. It maintains a persistent multi-layered graph of the codebase — call graph, dependency graph, domain graph, AST. The agent queries this graph instead of loading files.

Hermes runs a skill at each SDLC phase. Each skill knows how to use the graph API for that specific phase: blast radius before planning, arch validation before coding, topological test ordering, graph-diff quality gates.

The result: the agent always knows what's connected to what, never hallucinates signatures, and can't accidentally violate the architecture without it being caught at a gate.


The 8 Phases

PhaseSkillWhat it does
1planningGraph-grounded scoping: blast radius, domain ownership, implementation checklist
2arch_checkPre-code arch review: circular deps, domain layering, coupling thresholds
3codegenGraph-aware code generation: exact signatures, resolved imports, no duplicate impls
4quality_gatesPost-write graph diff: validate new edges, dead code check, coupling delta
5test_orderDependency-ordered test run: topological sort, blast-radius scoping
6code_reviewGraph-enriched review: caller annotations, domain impact, risk levels
7refactorGraph-guided refactoring: leaf-first sequencing, debt scoring, pattern capture
8health_cronNightly autonomous health check: drift detection, auto-scheduled remediation

Phases run in order. No phase advances without passing its graph-based gate.


CI Integration

graph_gate.sh wraps the Supermodel REST API for use in git hooks, GitHub Actions, or any CI pipeline:

# Architecture gate (pre-code)
./scripts/graph_gate.sh arch ./my-project

# Quality gate (post-write)
./scripts/graph_gate.sh quality ./my-project

# Test coverage gate
./scripts/graph_gate.sh coverage ./my-project

# Health gate (nightly)
./scripts/graph_gate.sh health ./my-project

# Blast radius for a specific function
./scripts/graph_gate.sh impact ./my-project app/service.py:create_order

Exit code 0 = PASS, 1 = FAIL. Drop into any CI step.


File Layout

big-iron/
├── factory                 ← main entry point (run/health/improve)
├── README.md
├── CLAUDE.md               ← Claude Code configuration
├── AGENTS.md               ← Agent operating rules
├── docs/
│   └── ARCHITECTURE.md     ← full system design
├── skills/                 ← SDLC phase skill files (source of truth)
│   ├── guardrails.md       ← architectural constraints (all phases)
│   ├── planning.md
│   ├── arch_check.md
│   ├── codegen.md
│   ├── quality_gates.md
│   ├── test_order.md
│   ├── code_review.md
│   ├── refactor.md
│   └── health_cron.md
├── config/
│   └── hermes-config.yaml  ← Hermes + Supermodel MCP config
├── scripts/
│   ├── setup.sh            ← first-time setup
│   ├── install_skills.sh   ← sync skills to ~/.hermes/skills/
│   ├── graph_gate.sh       ← shell phase gate (CI-ready)
│   ├── supermodel.sh       ← Supermodel API client library
│   └── demo_run.sh         ← interactive 8-phase demo runner
└── demo/                   ← target codebase for demos
    ├── pyproject.toml
    ├── app/
    │   ├── domain/         ← Layer 2: User, Order entities
    │   ├── infrastructure/ ← Layer 3: in-memory repositories
    │   ├── application/    ← Layer 1: UserService, OrderService
    │   └── orchestration/  ← Layer 0: SDLCRunner
    └── tests/

Prerequisites


References