GAAI Skills

May 8, 2026

Skills are pure execution units. They perform a single, well-defined operation and produce explicit outputs. They never reason about intent or strategy.

Agents decide. Skills execute.


Core Skills — Week 1

These are the 7 skills you will encounter in a normal workflow. You do not invoke skills directly — agents invoke them on your behalf. This list tells you what is happening under the hood.

| Skill | When it runs |
|---|---|
| generate-epics | Discovery: translate product intent into outcome-driven Epics |
| generate-stories | Discovery: translate an Epic into User Stories with acceptance criteria |
| evaluate-story | Delivery: assess complexity and determine delivery tier |
| implement | Delivery: write code that satisfies acceptance criteria |
| qa-review | Delivery: validate implementation against acceptance criteria (the quality gate) |
| memory-retrieve | Any agent: load the minimum relevant memory before context-building |
| decision-extraction | After QA PASS: formalize durable decisions into long-term memory |

The full catalog (48 skills) is below. Skills outside this list are loaded on demand for advanced workflows.


Structure

Each skill lives in its own directory with a SKILL.md file:

skills/
├── discovery/   ← produce artefacts (PRD, Epics, Stories, validation)
├── delivery/    ← orchestrate and execute (planning, implementation, QA)
├── cross/       ← memory, context, governance, analysis — usable by any agent
└── domains/     ← domain-specific skill packs (mirrors memory/domains/)
    └── content-production/

The source of truth for available skills is the directory itself. The skills-index.yaml file in this directory is a derived cache (core skills only) — generated by build-skills-index, never edited manually. Project skills have their own index at .gaai/project/skills/skills-index.yaml.
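Because the directory is the source of truth, the catalog can always be rebuilt by walking it. The sketch below shows one way to do that; it is not the actual build-skills-index implementation. The function name `discover_skills` and the rule "a skill is any directory at least one level below a category that contains SKILL.md" are assumptions made for illustration.

```python
from pathlib import Path


def discover_skills(skills_root: Path) -> dict[str, list[str]]:
    """Map each top-level category directory to the skill names found beneath it.

    Hypothetical helper: a skill is taken to be any directory, at least one
    level below a category directory, that directly contains a SKILL.md file.
    """
    catalog: dict[str, list[str]] = {}
    for skill_file in sorted(skills_root.rglob("SKILL.md")):
        relative = skill_file.parent.relative_to(skills_root)
        if len(relative.parts) < 2:
            # SKILL.md at the root or directly inside a category: not a skill directory
            continue
        category = relative.parts[0]  # e.g. discovery, delivery, cross, domains
        catalog.setdefault(category, []).append(relative.parts[-1])
    return catalog
```

Calling `discover_skills(Path("skills"))` on the layout above would return one list of skill names per category. Skills nested inside a domain pack such as domains/content-production/ are grouped under `domains`.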


Discovery Protocol

Skills are loaded in three phases — load the minimum needed, not everything:

Phase 1 — Filter by frontmatter (read SKILL.md frontmatter only)

Read skills-index.yaml for a fast aggregated view, or scan SKILL.md frontmatter directly if the index is absent or stale.
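The text does not define "stale". One plausible reading, sketched here, treats the index as stale when it is missing or older than any SKILL.md. The function name `index_is_stale` and the use of file modification times are assumptions, not part of GAAI.

```python
from pathlib import Path


def index_is_stale(index_path: Path, skills_root: Path) -> bool:
    """Return True when the index is missing or older than any SKILL.md.

    Assumption: staleness is judged by file modification time only.
    """
    if not index_path.is_file():
        return True  # absent index: fall back to scanning frontmatter
    index_mtime = index_path.stat().st_mtime
    return any(
        skill_file.stat().st_mtime > index_mtime
        for skill_file in skills_root.rglob("SKILL.md")
    )
```

When this returns True, an agent would scan SKILL.md frontmatter directly rather than trust the cached index.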

Frontmatter fields available for filtering:

| Field | Values | Use for |
|---|---|---|
| metadata.category | discovery, delivery, or cross | Narrow by agent track |
| metadata.track | discovery, delivery, or cross-cutting | Secondary filter |
| metadata.tags | list of keywords | Cross-cutting capability search |
| description | one sentence | Primary relevance signal: what it does and when to activate |
| metadata.id | SKILL-{CAT}-{NNN} | Exact lookup by ID |
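For exact lookup it can help to check that an ID is well formed first. The sketch below is a hypothetical validator, not a GAAI utility. It assumes NNN means exactly three digits and allows multi-segment category parts, since IDs such as SKILL-DELIVERY-CI-WATCH-001 appear in the catalog alongside shorter ones like SKILL-DEL-009.

```python
import re

# Assumption: "SKILL-", one or more uppercase segments joined by hyphens,
# then a three-digit sequence number.
SKILL_ID_PATTERN = re.compile(r"SKILL-[A-Z]+(?:-[A-Z]+)*-\d{3}")


def is_valid_skill_id(skill_id: str) -> bool:
    """Return True if skill_id matches the assumed SKILL-{CAT}-{NNN} shape."""
    return SKILL_ID_PATTERN.fullmatch(skill_id) is not None
```

Under these assumptions, `is_valid_skill_id("SKILL-CRS-025")` holds while `is_valid_skill_id("SKILL-DEL-9")` does not.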

Directive:

  1. Use metadata.category to scope to the right directory first
  2. Read description to confirm relevance — it is the single most important field
  3. Use metadata.tags for cross-cutting capabilities (e.g. memory, governance, qa)
  4. Do not load all skills — load only SKILL.md files whose description matches your task
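The directive can be sketched as a filter over already-parsed frontmatter. Everything here is illustrative: the entry shape (a `name` key next to `description` and `metadata`), the sample tags, and plain substring matching as a stand-in for the agent's relevance judgement are all assumptions.

```python
from typing import Iterable

# Hypothetical index entries; descriptions are adapted from this document, tags are invented.
SAMPLE_INDEX = [
    {"name": "generate-stories",
     "description": "Translate an Epic into User Stories with acceptance criteria",
     "metadata": {"category": "discovery", "tags": ["stories"]}},
    {"name": "qa-review",
     "description": "Validate implemented code against acceptance criteria",
     "metadata": {"category": "delivery", "tags": ["qa"]}},
    {"name": "memory-retrieve",
     "description": "Load the minimum relevant memory before context-building",
     "metadata": {"category": "cross", "tags": ["memory"]}},
]


def select_skills(index: list[dict], category: str,
                  keywords: Iterable[str], tags: Iterable[str] = ()) -> list[str]:
    """Return names of skills worth loading in Phase 2."""
    wanted_tags = {t.lower() for t in tags}
    wanted_words = [k.lower() for k in keywords]
    selected = []
    for entry in index:
        meta = entry.get("metadata", {})
        # Step 1: scope by category. Step 3: tags may pull in cross-cutting skills.
        in_scope = meta.get("category") == category
        has_tag = bool(wanted_tags & {t.lower() for t in meta.get("tags", [])})
        if not (in_scope or has_tag):
            continue
        # Step 2: the description is the deciding signal.
        description = entry.get("description", "").lower()
        if any(word in description for word in wanted_words):
            selected.append(entry["name"])  # Step 4: only matches get loaded
    return selected
```

With the sample data, `select_skills(SAMPLE_INDEX, "delivery", ["acceptance criteria"])` returns `["qa-review"]`. generate-stories mentions the same phrase but sits outside the delivery category, so it is not selected.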

Phase 2 — Load full SKILL.md

Load the full SKILL.md body only for skills confirmed relevant in Phase 1. Budget: <5,000 tokens per skill, <500 lines.
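A budget check along these lines could run before a skill is loaded. This is a rough sketch: the four-characters-per-token estimate is a common heuristic and an assumption here, since the document does not say how tokens are counted, and `within_budget` is a hypothetical name.

```python
MAX_TOKENS = 5_000   # budget stated above
MAX_LINES = 500      # budget stated above
CHARS_PER_TOKEN = 4  # assumption: crude estimate, not a real tokenizer


def within_budget(skill_md_text: str) -> bool:
    """Return True if a SKILL.md body is under both the line and token budgets."""
    line_count = len(skill_md_text.splitlines())
    estimated_tokens = len(skill_md_text) / CHARS_PER_TOKEN
    return line_count < MAX_LINES and estimated_tokens < MAX_TOKENS
```

A real check would swap the estimate for the tokenizer of whichever model the agent runs on.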

Phase 3 — Load references/ (execution only)

Load files in references/ or assets/ only during execution, never at discovery time.


Invocation Rules

  1. Skills are never invoked implicitly — an agent always selects and invokes explicitly
  2. Skills execute in isolated context windows — no shared state between skills
  3. Skills never chain other skills — only agents orchestrate
  4. Skills never access memory autonomously — context is always provided by the agent
  5. Skills never make product or architectural decisions — they execute only
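These rules can be pictured as a calling convention. The toy sketch below is not GAAI's runtime. It models a skill as a plain function from context to outputs, uses a deep copy to stand in for an isolated context window, and keeps all orchestration on the agent side. All names (`invoke`, `generate_stories`, `agent_run`) and the context keys are hypothetical.

```python
from copy import deepcopy
from typing import Callable

Skill = Callable[[dict], dict]  # hypothetical shape: context in, explicit outputs out


def invoke(skill: Skill, context: dict) -> dict:
    """Agent-side helper: run one skill on a private copy of the context (rule 2)."""
    return skill(deepcopy(context))


def generate_stories(context: dict) -> dict:
    """Toy skill: uses only what it was handed, calls no other skill (rules 3 and 4)."""
    context["scratch"] = "local only"  # mutation stays inside the skill's copy
    return {"stories": [f"Story for {context['epic']}"]}


def agent_run(epic: str) -> dict:
    """The agent selects the skill, supplies memory, and invokes explicitly (rule 1)."""
    context = {"epic": epic, "memory": ["relevant decision notes"]}
    return invoke(generate_stories, context)
```

Because `invoke` passes a copy, nothing a skill writes to its context leaks back to the agent or to another skill. Only the returned outputs cross the boundary.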

Skill Authoring Guidance

When to add a skill vs. when to add a rule

| You want to... | Use... |
|---|---|
| Add a new execution capability (something an agent will do) | create-skill |
| Add a new constraint (something an agent must not do, or a standard it must follow) | rules-normalize |

Skills are procedural: they perform a defined operation and produce outputs. They live in .gaai/core/skills/.

Rules are declarative: they state constraints, policies, and governance boundaries. They live in .gaai/core/contexts/rules/.

Decision test

Ask: "Does this describe how to do something, or whether something is allowed?"

  • "How to generate a Story from an Epic" → skill (generate-stories)
  • "Stories must have acceptance criteria before entering the backlog" → rule (orchestration.rules.md)
  • "How to compact memory when it exceeds a size threshold" → skill (memory-compact)
  • "How to archive a superseded DEC out of the active index" → skill (memory-archive-superseded)
  • "How to validate memory index discoverability invariants" → skill (memory-index-lint)
  • "Memory must never be auto-loaded by a skill" → rule (orchestration.rules.md)

If your answer is "how to do something" → skill. If your answer is "a constraint the system must enforce" → rule.

How to create each

  • New skill: invoke create-skill (.gaai/core/skills/cross/create-skill/SKILL.md). Step 6 of create-skill invokes build-skills-index to regenerate skills-index.yaml.
  • New rule: invoke rules-normalize (.gaai/core/skills/cross/rules-normalize/SKILL.md)

Maintaining the Index

skills-index.yaml is generated and must never be edited manually.

To regenerate it after adding or modifying a skill, invoke build-skills-index (.gaai/core/skills/cross/build-skills-index/SKILL.md).


Final Rule

If a skill appears to "think", it is wrongly designed.


Delivery Skills

| Skill | ID | Description |
|---|---|---|
| browser-journey-test | | Browser-based journey testing for UI validation |
| ci-watch-and-fix | SKILL-DELIVERY-CI-WATCH-001 | Watch GitHub Actions CI after PR creation, detect failures, extract logs, apply minimal fixes, and re-push, up to 3 cycles before escalating |
| compose-team | | Read specialists registry and select the sub-agent team for a Story |
| coordinate-handoffs | SKILL-DEL-009 | Validate sub-agent handoff artefacts, sequence phase transitions, manage retry and escalation logic |
| delivery-high-level-plan | SKILL-DELIVERY-HIGH-LEVEL-PLAN-001 | Transform validated Stories into a clear, minimal, governed execution plan |
| evaluate-story | | Assess complexity, identify domains, determine delivery tier |
| frontend-design | | Domain-specific skill for frontend UI implementation |
| implement | | Implement a Story from an execution plan |
| prepare-execution-plan | | File-level decomposition of a high-level plan for Tier 2/3 Stories |
| qa-review | SKILL-QA-REVIEW-001 | Validate implemented code against acceptance criteria (the hard quality gate) |
| remediate-failures | SKILL-REMEDIATE-FAILURES-001 | Correct failures detected during QA review |

Cross Skills (selected)

| Skill | ID | Description |
|---|---|---|
| eval-run | SKILL-CRS-025 | Evaluate any output file against a structured evals.yaml assertions file and produce a score report with per-assertion pass/fail results. Activate when the Discovery Agent runs the Skill Optimize protocol to measure output quality or detect regressions. |
| skill-optimize | SKILL-CRS-026 | Run a structured evaluate-analyze-improve cycle on any GAAI skill to measure quality, detect regressions, and propose targeted improvements. Activate when a skill needs baseline evaluation, after SKILL.md modifications, or when friction-retrospective flags a skill. |
| pattern-transfer | SKILL-CRS-027 | Discover structurally similar patterns across domains, assess transfer viability via structural invariant checking, and propose domain adaptations with risk gates. Activate when Discovery identifies a problem that may have been solved in another domain. |
| memory-delta-triage | SKILL-MEMORY-DELTA-TRIAGE-001 | Apply three deterministic heuristics to a single memory-delta and produce a structured triage verdict. Activate when Discovery processes raw memory-deltas from contexts/artefacts/memory-deltas/. |

→ discovery/: skills that produce artefacts
→ delivery/: skills that orchestrate and execute
→ cross/: skills for memory, context, governance
→ domains/: domain-specific skill packs
→ Back to GAAI.md