Agents

May 19, 2026 · View on GitHub

Nine agents own the chain from a user request to an approved revision. The chain is strictly ordered. Each agent has a single purpose, a documented input, a documented output, and a list of forbidden behaviors. Prompts for each agent live under prompts/.

Agent chain

Template Orchestrator Agent
        ↓
Version + Skill Resolver Agent
        ↓
Skill Validator Agent
        ↓
Visual Analyzer Agent
        ↓
Architecture Mapper Agent
        ↓
Asset Resolver Agent
        ↓
Template Coder Agent
        ↓
Test + Render Agent
        ↓
Visual Review Agent
        ↓
Revision Manager Agent
        ↓
User Approval / Rollback
        ↓
Template Publisher Agent (on APPROVE only)
        ↓
templates/<template-id>/

The original simple plan ended at the visual review. The improved chain adds explicit version detection, skill resolution, skill validation, an Asset Resolver between architecture planning and code generation so icons and fonts are downloaded before the Template Coder runs, and a separate revision manager so that no agent except the Revision Manager can change project metadata. A Template Publisher runs after APPROVAL and turns the frozen revision into a publish-quality, end-user-friendly bundle under templates/.

Relational geometry across the chain

A cross-cutting rule encoded in every agent's shared rules block and in prompts/master-prompt.md: layout dimensions must be DERIVED from a small set of base constants — page size, margins, column gaps, weights — not hand-tuned per region.

Visual Analyzer describes layout in RATIOS (e.g. "sidebar ≈ 31% of usable width") and proportions, never in raw pixel widths.
Architecture Mapper captures those ratios as named weight constants in the plan.
Template Coder emits derived widths (SIDEBAR_WIDTH = USABLE_WIDTH * SIDEBAR_WEIGHT, GRID_COLUMN_WIDTH = MAIN_WIDTH / 2.0, ...). The same SIDEBAR_WEIGHT constant feeds every row.weights(...) AND the derived SIDEBAR_WIDTH so changing one base constant recomputes every dependent dimension in a single place.

Hardcoded pixel values are reserved for genuinely independent dimensions (icon size, line marker height, fixed paddings). When a number can be derived, it MUST be derived. See prompts/template-coder-agent.md under "Relational geometry over pixel constants" for the canonical pattern.

Anchors and alignment

The relational principle extends to element-to-element positioning. GraphCompose's anchor + alignment primitives encode the relationship directly so the engine resolves the actual coordinates at render time:

Element-to-element relationship	Engine primitive
Paragraph horizontal alignment	`TextAlign.LEFT / .CENTER / .RIGHT`
Inline image vs surrounding text baseline	`InlineImageAlignment.BASELINE / .CENTER / .TEXT_TOP`
Layer in a stack (badge, watermark, overlay)	`LayerAlign.TOP_LEFT / TOP_CENTER / ... / BOTTOM_RIGHT`
Cell text in a table	`DocumentTableTextAnchor.{TOP,CENTER,BOTTOM}_{LEFT,CENTER,RIGHT}`
Column proportions in a row	`RowBuilder.weights(...)`
Positioned overlay with offset from anchor	`LayerStackBuilder.position(node, offsetX, offsetY, LayerAlign.X)`
Custom canvas anchors	`HAnchor.{LEFT,CENTER,RIGHT}` + `VAnchor.{TOP,MIDDLE,BOTTOM}`

Visual Analyzer describes the relationship in prose ("icon centered on the label baseline"), Architecture Mapper picks the named anchor that expresses it, Template Coder reaches for that anchor in code. Hand-rolled pixel offsets only appear when the anchor set genuinely doesn't cover the case (rare custom decoration).

Template Orchestrator Agent

Purpose: main coordinator. Decides whether the user request is a new generation, a revision, an approval, an undo, a revert to approved, or a selective rollback, and routes the work to the correct specialized agent.

Inputs: user request, project metadata, current approved revision, current draft revision, reference image, available skill packs, GraphCompose version.

Outputs: orchestration-decision.md.

Responsibilities:

detect whether the request is new generation, revision, approval, rollback, or selective rollback
select the correct base revision
decide whether full visual analysis is needed
decide whether to patch an existing template or regenerate
ensure every change creates a new revision
route work to the correct specialized agent
produce the final user-facing status
prevent direct overwrite of approved revisions

Forbidden: editing project metadata directly (the Revision Manager owns that), writing code (the Template Coder owns that), and approving revisions without a complete visual review.

Prompt: prompts/orchestrator-agent.md.

Version + Skill Resolver Agent

Purpose: prevent the AI from mixing APIs from different GraphCompose versions.

Inputs: pom.xml, build.gradle, project config, skill-manifest.json, user request.

Outputs: version-resolution.md.

Responsibilities:

detect the target GraphCompose version
read skill-manifest.json
select the compatible skill pack
detect unsupported versions
prevent the use of stale skills
report uncertainty before code generation

Required rules (verbatim):

The agent must identify the target GraphCompose version before writing code.

The agent must only use skills marked as compatible with that GraphCompose version.

If the version is unknown, the agent must use conservative verified primitives only.

Prompt: prompts/version-skill-resolver-agent.md.

Skill Validator Agent

Purpose: verify that the loaded skill pack still matches the real GraphCompose API. Skills are versioned contracts, not static documentation, and they must be tested against real library behavior.

Inputs: selected skill pack, GraphCompose version, verified examples, fixture projects, build output, render output.

Outputs: skill-validation-report.md, and skill-fix-report.md when drift is detected.

Responsibilities:

verify that skills match the selected GraphCompose version
check that documented examples compile
check that documented examples render
detect stale or wrong API instructions
create skill-fix reports when drift is found

Core rule (verbatim):

If GraphCompose behavior differs from the skill documentation, GraphCompose is the source of truth.
The skill must be fixed.

Forbidden: silently working around incorrect skills, or marking a skill as active when its examples fail to compile or render.

Prompt: prompts/skill-validator-agent.md.

See skill-validation.md for the planned validation discipline.

Visual Analyzer Agent

Purpose: analyze the visual reference. Does not write code.

Inputs: reference.png, optional reference.pdf, optional user notes.

Outputs: visual-analysis.md.

Responsibilities:

identify page format and orientation
identify the layout grid
identify major regions
describe visual hierarchy
identify repeated components
identify typography
identify icons and decorative symbols; when replacements are needed, use Iconify as the search/source reference
identify colors
identify spacing
identify tables
identify cards, panels, badges, and decorations
identify uncertain or ambiguous parts

Forbidden: emitting Java code, naming specific GraphCompose builders, or assuming a particular library version. The analyzer describes what is visible, not how to implement it.

Prompt: prompts/visual-analyzer-agent.md.

Architecture Mapper Agent

Purpose: convert visual analysis into a GraphCompose architecture plan. Does not write final Java code.

Inputs: visual-analysis.md, selected skills, GraphCompose version, reference image.

Outputs: architecture-plan.md.

Responsibilities:

map visual regions to GraphCompose DSL primitives
decide reusable private render methods
identify theme tokens
identify icon and font assets, including Iconify icon set/name and Google Fonts family/weights when relevant
identify data model assumptions
define a testing plan
identify visual risks
define fallback strategies

Forbidden: writing the final template Java, inventing GraphCompose APIs not present in the loaded skill pack, and recommending CanvasLayer for anything other than last-resort decoration. The mapper must not introduce icon or font substitutions without documenting their source and fallback.

Prompt: prompts/architecture-mapper-agent.md.

Asset Resolver Agent

Purpose: acquire the icons and fonts the Architecture Mapper declared in asset-request.json so the Template Coder has every external asset on disk before it writes any Java. Runs between the Architecture Mapper and the Template Coder.

Inputs: asset-request.json, architecture-plan.md (read-only), selected skill pack, revision folder.

Outputs: assets-manifest.json, assets/icons/<token>.png, assets/fonts/<family>-<weight>.ttf (when downloads land).

Responsibilities:

read and validate the request schema
resolve every icon via the Iconify HTTP API, preferring the request's preferredSets priority list (default mdi → tabler → lucide → material-symbols → ph) before falling back to broad search
when an icon entry is marked "visual": true and Playwright is available, use the visual fallback to suggest an icon set
download icons as PNGs into <revision>/assets/icons/
resolve every font role against standard14, graphcompose-bundled (DefaultFonts.googleFamilies()), or google-fonts (marks the role manual_drop_required for non-bundled families)
write assets-manifest.json as the single source of truth the Template Coder reads

Forbidden: inventing icon sets or font families, silently substituting one icon for another, marking a non-bundled Google font as ok, touching generated Java, sharing assets across revisions implicitly, and skipping the manifest when the request is empty.

Prompt: prompts/asset-resolver-agent.md. Tool surface: tools/asset-resolver/README.md.

Template Coder Agent

Purpose: write the Java template and tests, using only the loaded skill pack and GraphCompose APIs valid for the selected version.

Inputs: architecture-plan.md, assets-manifest.json, selected skill pack, GraphCompose version, base revision when applicable.

Outputs: generated-template.java, generated-test.java, patch.diff, changed-components.md.

Responsibilities:

generate maintainable Java code
use the GraphCompose semantic DSL
avoid raw PDFBox usage
avoid coordinate soup
keep code componentized
use selected skills only
use GraphCompose APIs valid for the selected version
use only sourced icon/font assets recorded in the architecture plan
create tests
track changed components

Rules (verbatim):

Do not write one huge method.

Do not import PDFBox directly.

Do not invent GraphCompose API.

Use CanvasLayer only as last resort.

Every visible component should map to a named method or named layout block.

Icons should come from Iconify when a replacement is needed. Custom fonts should default to Google Fonts when licensing permits, and font loading must use verified GraphCompose APIs.

Prompt: prompts/template-coder-agent.md.

Test + Render Agent

Purpose: verify that generated code compiles, runs, and produces the expected artifacts.

Inputs: generated-template.java, generated-test.java, reference image, project config.

Outputs: output.pdf, output.png, output-page-N.png, output-debug.pdf, output-debug.png, output-debug-page-N.png, layout-snapshot.json, test-result.md, build.log, render.log. The debug PDF + previews come from a second render pass through GraphCompose.document(...).guideLines(true).create(); layout geometry and the layout snapshot are unchanged between the two passes — only the convenience PDF gets the guide-line overlay.

Responsibilities:

compile generated code
run the template test
render the PDF
generate the preview PNG
generate the layout snapshot
report failure clearly
preserve failed revision artifacts

Minimum checks:

template compiles
PDF file is generated
PDF file is not empty
preview image is generated
layout snapshot is generated
render does not throw unexpected exceptions

Forbidden: deleting artifacts on failure, mutating prior revisions, or marking a revision successful when any minimum check fails.

Prompt: prompts/test-render-agent.md.

Visual Review Agent

Purpose: compare the rendered output against the reference and classify mismatches.

Inputs: reference.png, output.png, previous-output.png when available, layout-snapshot.json, visual-analysis.md, architecture-plan.md.

Outputs: visual-review.md.

Responsibilities:

compare reference image with output preview
compare current output with previous output when available
identify visual mismatches
classify mismatches
recommend next revision actions
decide whether the revision is acceptable

Forbidden: approving the revision (the Revision Manager owns approval), editing the template, or suppressing visible mismatches as "close enough" without a documented classification. See visual-accuracy-contract.md for the canonical mismatch labels and visual-review-loop.md for the review document format.

Prompt: prompts/visual-review-agent.md.

Revision Manager Agent

Purpose: the key safety layer. Owns project metadata and the revision folder structure.

Inputs: revision artifacts, user decision, project metadata.

Outputs: revision.json, status.md, and updates to template-project.json.

Responsibilities:

create revision
track parent revision
track approved revision
track draft revision
save artifacts
approve revision
reject revision
undo last change
revert to approved
selective rollback
preserve failed revisions
prevent destructive overwrite

Core rule (verbatim):

Never overwrite the approved revision directly.
Every change creates a new revision.

Prompt: prompts/revision-manager-agent.md.

See revision-model.md and rollback.md for the data model and the rollback flows the Revision Manager implements. The shared master prompt that bundles all agents lives at prompts/master-prompt.md.

Template Publisher Agent

Purpose: promote an APPROVED revision into a publish-quality, end-user-friendly bundle under templates/<template-id>/. Runs only after the Revision Manager Agent flips a revision to APPROVED. The bundle is the single artifact a downstream consumer copies into their project.

Inputs: template-project.json, the approved revision's folder (examples/<project>/revisions/<approved>/), and the runner-side spec sources (examples/<project>/render-runner/src/main/java/...).

Outputs: templates/<template-id>/ containing the polished template class (with full Javadoc and renamed class), the spec record and provider copied verbatim, an example data file, the asset request + icons folder, the preview PDF + per-page PNGs, a README.md, and a template.json manifest pointing at the source revision and commit.

Responsibilities:

read the approved revision and template-project.json#displayName
rename the revision's Generated<X>Template class to the PascalCase form of the display name (e.g. "Mint Editorial CV" → MintEditorialCvTemplate)
add comprehensive Javadoc that an end user can read without context: class header + per-method docs + "Customization point" callouts on theme tokens, sizes, fonts, and grid widths
copy the spec record and provider as-is from the runner; their Javadoc is owned by the Template Coder
copy the data file as <doc-kind>-data.example.json
copy the asset request + assets/icons/*.png so the bundle is renderable without an Iconify round-trip
copy output.pdf and per-page preview PNGs into preview/
write README.md (purpose, copy-into-project instructions, swap data instructions) and template.json (id, displayName, sourceProject, sourceRevision, sourceCommit, schemaVersion, dependencies)

Forbidden: inventing template content, mutating the source revision folder, publishing a non-APPROVED revision, and shipping a partial bundle.

Prompt: prompts/template-publisher-agent.md. Helper: scripts/publish-template.mjs is the deterministic copy step; the Javadoc polish is the agent's editorial work on top.