roam-code

May 22, 2026 · View on GitHub

roam-code

The local codebase intelligence layer that lets AI coding agents earn the right to change code — and prove they did.

PyPI version GitHub stars CI Python 3.10+ License: Apache 2.0

Credential-free · 100% local by default (opt-in metrics-push is the only outbound surface) · tamper-evident ChangeEvidence packets · Apache 2.0 · runs entirely on your machine

241 commands · 227 MCP tools (57 in the default core preset) · 28 languages

roam terminal demo


Why Roam is different

Cursor, Cody, Aider, and Windsurf are human-first IDE surfaces that log a session. Roam is an agent-first CLI surface that gates the change and emits proof. Four properties no competitor combines today:

  • Credential-free. No account, no API key, no cloud login. pip install and run.
  • 100% local by default. Source code never leaves the machine; air-gapped repos work like cloud repos. The single outbound surface (roam metrics-push) is opt-in, summary-only, and prints its exact payload under --dry-run.
  • Tamper-evident ChangeEvidence packets. Each AI-assisted change compiles into one portable packet — HMAC-chained run ledger + signed Code Graph Attestation + signed PR bundle — answering eight questions: who acted, what authority existed, what context was read, what changed, what could break, what policy applied, what verified it, who accepted risk. PR Replay answers 7 of 8 today; the remaining approvals question surfaces as producer_not_available, never silently dropped. Cursor logs the run; Roam proves the change.
  • MCP runtime security at the wrapper boundary. Every MCP response is scrubbed for secrets on egress, gated against the active mode (read_only / safe_edit / migration / autonomous_pr) with a closed-enum policy_decision, and each decision receipt is HMAC-linked into the signed run ledger. Inside-server controls; the gateway layer (Interlock / Lasso / Portkey) composes on top — see dev/MCP-SECURITY-POSTURE.md.

Underneath sits a SQLite-backed graph of symbols, calls, imports, layers, git history, runtime traces, smells, clones, security flows, and algorithmic patterns across 28 languages — the same local facts queried before, during, and after a change.

Dependency-aware, not string-based. Roam knows Flask has 47 dependents and 31 affected tests; grep knows it appears 847 times. One command replaces 5-10 tool calls — <0.5s per query, plain-ASCII output, --json and --sarif envelopes for agents and CI.

Without RoamWith Roam
Tool calls81
Wall time~11s<0.5s
Tokens consumed~15,000~3,000

Illustrative — a typical agent workflow on a 200-file Python project (Flask). Reproducible smoke transcript in docs/fresh-install-smoke.md; full indexing-rate harness in benchmarks/. Exact numbers vary with repo size, agent prompt, and model.


Install + first four commands

Ten minutes from pip install to a verdict on whether your next edit is safe.

pip install "roam-code[mcp]"          # 1. install with MCP server for Claude Code / Cursor / Continue
cd /path/to/your/repo
roam init                             # 2. index the repo into .roam/index.db (one-time, ~30s on most repos)
roam health                           # 3. composite 0-100 score: complexity, cycles, dark-matter coupling, dead code
roam preflight <symbol>               # 4. blast radius + tests + complexity + architecture rules before you edit

Python 3.10+. pipx install roam-code and uv tool install roam-code work too. Drop [mcp] for CLI-only. See docs/fresh-install-smoke.md for a verbatim transcript of these four commands against a clean venv.

Step 4 is the payoff — roam preflight on a hot symbol returns a verdict before you touch it:

$ roam preflight open_db
VERDICT: Significant risk — CRITICAL, 1847 symbols in blast radius

Pre-flight check for `open_db (src/roam/db/connection.py:799)`:

  Blast radius:     1847 symbols in 382 files                [CRITICAL]
  Affected tests:   617 direct, 962 transitive               [OK]
  Complexity:       cc=30, nest=4                            [CRITICAL]
  Coupling:         2 files often change together            [MEDIUM]
  Conventions:      no violations                            [OK]

  Overall risk: CRITICAL
  Risk driver:  complexity (cc=30, CRITICAL)

An agent sees the blast radius before it edits — not after the tests fail.

Alternate install methods + Docker
pipx install roam-code                                   # isolated environment (recommended)
uv tool install roam-code                                # uv-managed tool
pip install git+https://github.com/Cranot/roam-code.git  # from source

# Docker (alpine-based)
docker build -t roam-code .
docker run --rm -v "$PWD:/workspace" roam-code index
docker run --rm -v "$PWD:/workspace" roam-code health

Works on Linux, macOS, and Windows. Windows: if roam is not found after installing with uv, run uv tool update-shell and restart your terminal.


What's New

v13.4 (released 2026-05-21) — Perf wave + Pattern-1 stabilisation + assurance hardening. Major detector speed-ups (clones 43.8s → 13.1s, intent 66s → 12s, doc-staleness 93s → 19s, sbom 30s → 9s — all byte-identical output), 17 commands now emit isError/status on error envelopes + 11 commands route their argless --json path through a proper envelope (Pattern-1C drift-guards added), a persisted per-snapshot spectral gap powering a real roam forecast failure budget, MCP prompt-injection marker scan on tool-call egress, release supply-chain hardening (PEP 740 attestations, tag-bound artifacts), and large false-positive cuts in feature-envy / shotgun-surgery / god-components. Full diff in CHANGELOG.md.

Earlier releases (v13.3 / v13.2 / v13.1 / v13.0)

v13.3 (released 2026-05-19) — MCP runtime security + UX polish

  • Egress secret-redaction at the MCP wrapper boundary, 4-mode policy_decision enforcement with shadow-mode (ROAM_MODE_DRY_RUN), HMAC-linked McpDecisionReceipt + receipt_integrity verdict on roam runs verify.
  • 3 new persisting detectors (boundary, test-hermeticity, compatibility), roam doctor advisory-vs-blocking split, and --json warnings-channel discipline.

v13.2 (released 2026-05-16) — Evidence freshness + resolution disclosure

  • Canonical unresolved-path envelopes across impact / preflight / trace / test-map / context / safe-delete / split / why — one explicit "not found" shape in JSON mode.
  • Evidence freshness stamped at the producer. Runs record hashes for .roam-rules.yml, .roam/constitution.yml, .roam/control-map.yml.
  • PR Replay evidence coverage improved. Replay path answers 7 of the 8 evidence questions completely; remaining approvals question marked producer_not_available instead of silently omitted.

v13.1 (released 2026-05-15) — Pattern-2 propagation + shared YAML helper + 3 flagship silent-fallback seals

  • 3 flagship silent-fallback seals. cmd_taint, cmd_health, cmd_doctor now emit state="empty_corpus" + partial_success=True on unanalyzed repos instead of false Healthy 100/100 / No taint findings / all checks passed verdicts.
  • Shared YAML config-loader helper (load_yaml_with_warnings). 5 of 7 surveyed loaders migrated; ~125 LOC removed.
  • 5 new live smell detectors. type-switch, speculative-generality, empty-catch, cross-layer-clone, parallel-hierarchyroam smells now ships 24 deterministic detectors.
  • 30+ behavioral Pattern-2 fixes + empty-corpus smoke sweep across 25+ detectors.

v13.0 (released 2026-05-13) — Agent-OS substrate + Laravel idioms + Vue SFC

  • Agent-OS control plane. Repo-local substrates under .roam/: constitution, HMAC-chained run ledger, multi-agent leases, portable agent memory, 4 cumulative modes (read_onlysafe_editmigrationautonomous_pr).
  • World-model classifiers (R28). roam side-effects, roam idempotency, roam causal-graph, roam tx-boundaries.
  • Laravel dynamic-dispatch idioms. 7 of 8 implicit-edge idioms (Route closures, Eloquent scopes, Policy resolution, Observer registration, Job/Queue/Artisan dispatch).
  • Vue SFC import graph. .vue template/script/style blocks parsed; component registrations resolved across the SFC boundary.
  • ~20 new CLI commands (brief, next, mode, constitution, laws, memory, lease, runs, replay, agent-score, agents-md, …) and schema bump (USER_VERSION 12 → 13).

Full release notes in CHANGELOG.md.

Best for

  • Agent-assisted coding — structured answers that cut tokens vs raw file exploration
  • Large codebases (100+ files) — graph queries beat linear search at scale
  • Architecture governance — health scores, CI quality gates, budget enforcement, fitness functions
  • Safe refactoring — blast radius, affected tests, pre-change safety checks, graph-level editing
  • Multi-agent orchestration — partition codebases for parallel agents with conflict-aware planning
  • Security analysis — vulnerability reachability, auth gaps, CVE path tracing, taint analysis
  • Algorithm optimization — detect O(n²) loops, N+1 queries, and 32 other anti-patterns with suggested fixes

When NOT to use Roam

  • Real-time type checking — use an LSP (pyright, gopls, tsserver). Roam is static and offline.
  • Small scripts (<10 files) — read the files directly.
  • Pure text search — ripgrep is faster for raw string matching.

Core commands

Lead with the 5 verbs. The 5 core commands cover ~80% of agent workflows: understand, context, retrieve, preflight, critique. The remaining ~236 commands are detail surface for specialised workflows (taint, fleet, cga, oracle, eval, …) — they're called by agents on demand, not memorised. This is intentional design; under the hood the canonical surface is 241 commands (234 canonical + 7 aliases) organised into 7 categories (aliases for muscle memory: mathalgo, churnweather, digest / snapshot / trendtrends, onboardunderstand, refsuses), but you don't need to know that to start.

VerbWhat it does
roam understandFull codebase briefing: stack, architecture, key abstractions, health, conventions, entry points
roam context <symbol>AI-optimized context: definition + callers + callees + files-to-read with line ranges
roam retrieve <task>Graph-aware context for free-form tasks ("trace login flow", "where is the n+1?") — FTS5 + structural rerank within a token budget
roam preflight <symbol>Pre-change safety gate: blast radius + tests + complexity + coupling + fitness
roam critiqueVerify a patch against the graph: clones-not-edited + blast radius + intent vs semantic-diff. Pipe git diff in; exit 5 on high severity

The full surface spans 7 categories — Getting Started, Daily Workflow, Codebase Health, Architecture, Exploration, Reports & CI, and Multi-Repo Workspace. Run roam --help for the 5-verb core, roam --help-all for every command name, and roam surface --json for the machine-readable inventory. Every command accepts roam --json <cmd> for structured output and roam --sarif <cmd> for CI integration (SARIF 2.1.0, honoured by 37 commands).

Full command reference — canonical command list (all 234)

The complete, always-current list with flags and examples lives in the Command Reference.

A few representative commands beyond the core five:

  • Health & architecture: roam health (0-100 score), roam weather (churn × complexity hotspots), roam smells (24 deterministic detectors), roam algo (34-task anti-pattern catalog), roam clusters / roam layers / roam cycles.
  • Change safety: roam impact <symbol> (blast radius), roam diff (uncommitted-change blast radius), roam pr-risk (0-100 PR risk), roam diagnose <symbol> (root-cause ranking).
  • Backend quality: roam n1 (N+1 queries), roam auth-gaps, roam missing-index, roam over-fetch, roam taint (graph-reach taint, 10 rule packs).
  • Index-aware search: roam search <pattern>, roam grep <pattern> (grep + reachability + PageRank), roam uses <name> (graph-precise references, no string-literal false positives).
  • Multi-agent: roam orchestrate --agents N (conflict-aware partitioning), roam fleet plan, roam lease (parallel-agent coordination).

Walkthrough

10-command walkthrough investigating Flask (click to expand)

How you'd use Roam to understand a project you've never seen before, using Flask as an example.

$ roam understand
Tech stack: Python (flask, jinja2, werkzeug)
Architecture: Monolithic — 3 layers, 5 clusters
Key abstractions: Flask, Blueprint, Request, Response
Health: 78/100 — 1 god component (Flask)
Entry points: src/flask/__init__.py, src/flask/cli.py

$ roam file src/flask/app.py            # file skeleton: definitions + signatures + health
$ roam deps src/flask/app.py            # what imports this file
$ roam weather                          # hotspots ranked by churn × complexity
$ roam health                           # composite 0-100 + god components / cycles / layer violations
$ roam context Flask                    # AI-ready context: files to read with line ranges
$ roam preflight Flask                  # pre-change gate: blast radius + tests + complexity + fitness
$ roam split src/flask/app.py           # internal symbol groups + extraction suggestions
$ roam why Flask url_for Blueprint      # role classification (Hub/Bridge/Core) + reach + risk
$ roam health --gate                    # CI quality gate (exit 5 on failure)

Ten commands. Complete picture: structure, dependencies, hotspots, health, context, safety checks, decomposition, and CI gates.

Integration with AI coding tools

Roam is designed to be called by coding agents. Instead of repeatedly grepping and reading files, the agent runs one roam command and gets a verdict-first envelope. roam preflight (above) replaces grep+read+test-impact+complexity+fitness in one ~3KB call; roam health rolls the whole codebase into one score:

$ roam health
VERDICT: Fair codebase (75/100) — 47 critical, 9 warnings, focus: god_components

Health Score: 75/100  |  Tangle: 0.0% (7/33395 symbols in cycles)
Propagation Cost: 0.1%  |  Algebraic Connectivity: 0.0074

Health: 67 issues — 47 CRITICAL, 9 WARNING, 19 INFO
  Breakdown: cycles [1 CRITICAL, 1 WARNING], god [31 CRITICAL, 8 WARNING, 11 INFO], bottlenecks [15 CRITICAL]

Top CRITICAL issues (run `roam --detail health` for the full breakdown):
  cycle (5 symbols): _COMMANDS, complete, _reconstruct_command
  god component: path (prop, degree=2408)

The verdict line works alone — an agent that reads nothing else still knows where to look. Pipe --json for the structured envelope your agent consumes.

Fastest setup: point your agent at Roam by writing instructions into its config file.

roam describe --write               # auto-detects CLAUDE.md, AGENTS.md, .cursor/rules, etc.
roam describe --agent-prompt        # compact ~500-token prompt — copy-paste into an existing config
roam minimap --update               # inject/refresh an annotated codebase minimap (won't touch other content)

This teaches the agent which command fits each situation: roam preflight before changes, roam context for files to read, roam diagnose for debugging.

Where to put agent instructions for each tool
ToolConfig file
Claude CodeCLAUDE.md in your project root
OpenAI Codex CLIAGENTS.md in your project root
Gemini CLIGEMINI.md in your project root
Cursor.cursor/rules/roam.mdc (add alwaysApply: true frontmatter)
Windsurf.windsurf/rules/roam.md (add trigger: always_on frontmatter)
GitHub Copilot.github/copilot-instructions.md
AiderCONVENTIONS.md
Continue.devconfig.yaml rules
Cline.clinerules/ directory

MCP Server

Roam includes a Model Context Protocol server for direct integration with MCP-aware tools.

pip install "roam-code[mcp]"
roam mcp

Default preset: core (58 tools: 57 core + roam_expand_toolset meta-tool).

227 tools span seven presets (core, review, refactor, debug, architecture, compliance, full); core stays narrow to keep the prompt tight. Most tools are read-only index queries; side-effect tools are explicitly annotated. Set ROAM_MCP_PRESET=full roam mcp for the complete toolset.

Cold-start envelope. Any wrapper that can't complete normally — missing index, stale index, partial failure — returns one canonical structured envelope (status, error_code, summary.verdict, hint, next_command) instead of hanging or emitting empty output. Agents always get an actionable signal, never a silent failure.

MCP runtime security. Three controls run at the wrapper boundary inside the server, protecting every client even with no gateway present: egress secret-redaction, mode-gated policy_decision enforcement (opt-in shadow-mode via ROAM_MODE_DRY_RUN), and HMAC-linked decision receipts bound into the signed run ledger. Gateway integrators: see dev/MCP-SECURITY-POSTURE.md.

See Using Roam via MCP for the first-run flow and canonical agent sequence.

MCP tool list (all 227)
ToolDescription
roam_adrsDiscover Architecture Decision Records (ADRs) and link them to code modules. Scans well-known ADR directories (docs/adr/ / architecture/decisions/ / ...) for markdown files matching ADR naming patterns, parses each ADR's title / status / date / file refs, then cross-references mentioned files against the symbol index. Different from roam_doc_staleness (inline docstring drift) -- this is the prose-decision-document discoverer.
roam_adversarialFrame architectural issues in changed files as challenges the developer must defend: CRITICAL (new cyclic dependencies), HIGH (layer violations, high-confidence anti-patterns), WARNING (cross-cluster coupling, high fan-out), INFO (orphaned symbols). Composes cycles + clusters + layers + catalog + dead + complexity. Different from roam_diff (blast-radius facts) -- this is the architecture-review framing for code-review agents.
roam_adversarial_reviewAdversarial architecture review: challenges about cycles, anti-patterns, coupling.
roam_affectedMonorepo impact analysis: find all affected packages/modules from changes.
roam_affected_testsList the tests you actually need to run after editing a symbol or file. Use when user asks 'which tests do I run?', 'what tests cover X?', or after Edit/Write. Walks reverse-dependencies with hop distance — closer hops run first. For a full pre-commit check (blast radius + fitness + tests), use roam_prepare_change.
roam_agent_contextExtract a single agent's partition from the full agent plan: write scope, read-only dependencies, interface contracts, coordination instructions, and key symbols. Different from roam_agent_plan (full multi-agent view) and roam_orchestrate (operational dispatch with merge order) -- this is the focused per-worker packet for one agent.
roam_agent_exportGenerate AI agent context file (CLAUDE.md/AGENTS.md/.cursorrules) from index.
roam_agent_planDecompose partitions into dependency-ordered multi-agent tasks: per-task write scope, read-only dependencies, interface contracts, phase schedule, and merge sequencing. Supports plain / json / claude-teams output formats. Different from roam_partition (raw analytical manifest) and roam_orchestrate (operational dispatch) -- this is the dependency-ordered phase schedule.
roam_agent_scoreAggregate runs from the local ledger and score each agent on a 0..100 composite (run completion, gate adherence, preflight compliance, blast accuracy, replay survival). Empty state (no runs / no matching runs) returns a clean envelope with state: "no_data" -- never empty stdout, never a crash. Different from roam_runs_verify (HMAC tamper-detection) -- this is the per-agent quality score across runs.
roam_ai_ratioEstimate AI-generated code percentage from git commit heuristics.
roam_ai_readinessAI readiness score (0-100): how effectively AI agents can work on this codebase.
roam_alertsActive health alerts: thresholds breached on tangle, complexity, churn, or coverage.
roam_algoDetect suboptimal algorithms with better alternatives and complexity analysis.
roam_annotate_symbolAdd persistent annotation to a symbol/file for future agent sessions.
roam_apiList the public API surface — exported public symbols with signatures and docs.
roam_api_changesDetect breaking and non-breaking API changes vs a git ref.
roam_api_driftMismatches between backend models and frontend interfaces.
roam_architecture_driftCompute per-week growth rates for symbols / edges / cycles across a sliding window of persisted .roam/snapshots/ and classify overall direction as improving / degrading / stable. Different from roam_graph_diff (point-in-time delta between two commits) and roam_trends (metric-level time series) -- this is the snapshot-based architectural-trajectory report.
roam_article_12_checkRun a 6-item EU AI Act Article 12 readiness checklist over the indexed repo: audit-trail directory, audit-trail records, retention policy doc, technical docs, attestation surface, high-risk classification heuristic. Emits a structured envelope mapping each item to its Article (12, 18, 19) or Annex (III). Different from roam_audit_trail_conformance_check (per-record chain integrity) -- this is the repo-level governance-readiness assessment. Per the agentic-assurance guardrails: 'maps to' / 'supports evidence for', never 'certifies' / 'makes compliant'.
roam_askRoute a natural-language codebase question to the right roam recipe. Use when user asks 'is it safe to delete X?', 'where does login validate?', 'what just broke?', 'who owns module Y?', 'trace the n+1 in checkout'. Maps intent to one of 25 composed recipes (onboard / trace-task / find-bug / verify-patch / explore-impact / security-audit / who-owns / what-changed / explore-tests / dependency-update / ...). Try this BEFORE falling back to Grep+Read — the dispatcher covers most workflows in one tool call. Even low-confidence results contain signal.
roam_attestProof-carrying PR attestation: evidence bundle + merge verdict.
roam_auditRun a one-shot codebase architecture audit: bundles health, debt, dead-code, risk, test-pyramid, coverage, and API-surface signals into a single envelope. Designed as the structured artifact a written audit report attaches. Different from roam_health (single 0-100 score) and roam_report (preset-driven Markdown report) -- this is the verdict-first audit packet for governance and onboarding.
roam_audit_trail_conformance_checkScore the audit trail against an EU AI Act Article 12 checklist.
roam_audit_trail_exportExport the audit trail as markdown / json / csv for procurement review.
roam_audit_trail_verifyVerify SHA-256 chain integrity of a roam audit trail.
roam_auth_gapsEndpoints missing authentication or authorization checks.
roam_batch_getGet details for up to 50 symbols in one call. Replaces 50 sequential roam_symbol calls.
roam_batch_searchSearch up to 10 patterns in one call. Replaces 10 sequential roam_search_symbol calls.
roam_bisect_blameFind snapshots that caused architectural degradation, ranked by impact.
roam_boundarySurface public-by-accident exports + changed-range layer violations. Two closed-enum kinds: public_by_accident (warning, _-prefixed name in all) and wrong_direction_import (high, lower-layer module imports from higher-layer caller).
roam_breaking_changesDetect breaking API changes between git refs: removed exports, changed signatures.
roam_briefCompose a one-page agent briefing covering five sections: next (what roam next would recommend), highlights (stack / top danger zones / top mined laws from roam agents-md), pr_bundle (current PR-bundle status on the active branch), mode (active agent mode and its allow-list size), and runs (the N most-recent runs from the ledger). Designed as the FIRST command an agent runs when joining a roam-indexed repo. Different from roam_next (single-command router) -- this is the verdict-first session kickoff packet.
roam_budget_checkCheck changes against architectural budgets (cycles, health floor, complexity).
roam_bus_factorScore knowledge-concentration risk per directory: Shannon entropy over unique authors, primary-author share, last activity, and a staleness factor. Flags CRITICAL / HIGH / MEDIUM / LOW per module. Different from roam_owner (per-file blame) and roam_congestion (too-many-authors merge-conflict risk) -- this measures knowledge-loss risk.
roam_capsule_exportSanitized structural graph export without code bodies (privacy-safe).
roam_catalogReturn the full machine-readable list of every roam MCP tool currently registered, including title, description, and capability flags (core / read_only / destructive). Use this once at session start to discover what's available without enumerating tools.
roam_causal_graphBuild per-symbol causal graphs: edges from inputs (parameters / globals / env reads) to sinks (side-effecting calls / return / raise / mutation). Six causal kinds: param_to_effect, param_to_return, global_to_effect, global_to_mutation, env_to_effect, param_to_raise. Heuristic line-level text scan -- false negatives expected. Different from roam_taint (cross-symbol taint propagation) -- this is intra-symbol dataflow only.
roam_cga_emitEmit a Code Graph Attestation — in-toto v1 statement with predicate type https://roam-code.com/spec/CodeGraph/v1 (or https://roam-code.com/spec/CodeGraph-AIBOM/v1 with --aibom). Merkle root over symbol fingerprints + edge-bundle digest. Optional cosign keyless or offline signing.
roam_cga_verifyVerify a Code Graph Attestation — re-derives the Merkle root + edge-bundle digest from the live DB and compares to the bundled predicate, AND verifies the cosign signature on the sibling .bundle. Fails closed (exit 5) when no bundle is present unless no_cosign=True is passed to acknowledge predicate-only verification.
roam_changelogList commits since last tag, optionally formatted as a markdown CHANGELOG draft.
roam_check_rulesRun 10 built-in structural rules: cycles, fan-out, complexity, tests, god classes, layer violations.
roam_cleanRemove orphaned index entries (files deleted from disk) without full rebuild.
roam_clonesDetect near-duplicate code via AST structural hashing (Type-2 clones).
roam_closureMinimal set of changes needed for rename/delete/modify (exact files + lines).
roam_clustersShow Louvain code clusters and directory mismatches. Returns per-cluster size, cohesion, conductance, modularity Q, mega-cluster sub-group breakdowns, and inter-cluster coupling. Different from roam_layers (dependency-layer violations) -- this groups by community detection, not by topological depth.
roam_codeownersCODEOWNERS coverage, ownership distribution, unowned files, drift detection.
roam_compareDiff two roam indices structurally: reports symbols added/removed/moved, per-file complexity deltas above a threshold, language counts, and a one-line health verdict (improved / regressed / sideways). Different from roam_graph_diff (commit-range graph delta from one index) -- this is the cross-index structural delta for release-vs-release comparisons.
roam_compatibilityDetect outbound surface regressions vs a baseline snapshot. Closed-enum verdicts: no regressions / surface additions / surface drift / breaking changes. Compares commands, flags, envelope summary fields, MCP tools, and preset counts. Capture the baseline via CLI: roam compatibility --write-baseline PATH.
roam_completePrefix completion for symbols / file paths / commands. Faster than search; returns just names.
roam_complexity_reportFunctions ranked by cognitive complexity above threshold.
roam_congestionDetect developer congestion: files with too many concurrent authors within a sliding time window. Combines author count, churn intensity, and complexity into a congestion score that predicts merge conflicts and coordination failures. Different from roam_bus_factor (knowledge-loss risk) and roam_owner (per-file blame breakdown) -- this measures too-many-cooks contention.
roam_contextGet the minimum files + line ranges needed to understand or modify a symbol. Use when user says 'show me X', 'I need to change Y', 'how does Z work?'. Returns targeted reads ranked by PageRank — cheaper than Read'ing whole files. For pre-change safety (blast radius + tests + effects), use roam_prepare_change instead.
roam_conventionsAuto-detect codebase naming, file, import, and export conventions with outliers.
roam_couplingShow temporal coupling: file pairs that change together. Reads git history to find files with high co-change frequency. Different from roam_fan (structural connectivity) and roam_dark_matter (hidden co-change) -- this measures file-level temporal coupling.
roam_coverage_gapsFind unprotected entry points: top-level exported functions / methods that have no call-graph path to a required gate symbol (auth / permission / validation). Supports exact gate names, regex patterns, framework presets (python / javascript / go / java-maven / rust), and a .roam-gates.yml sidecar config. Different from roam_auth_gaps (PHP/Laravel source analysis) and roam_test_gaps (untested symbols in changed files) -- this walks the call graph to verify every entry reaches a required gate.
roam_critiqueVerify a unified diff against the codebase graph BEFORE merge. Use when user asks 'review my patch', 'is this PR safe?', or after generating any non-trivial diff. Catches clones-not-edited (missed duplicates the agent should have updated together) + blast-radius hop count. Pass git diff output as diff_text. Grounded against the indexed graph, not vibes.
roam_cutFind fragile domain boundaries via minimum-cut analysis. Computes the thinnest edge cuts between architectural clusters and the highest-impact 'leak edges' whose removal would best improve domain isolation. Different from roam_split (decomposes a single file) -- this finds boundaries between clusters.
roam_cut_analysisMinimum cut analysis: fragile domain boundaries, highest-impact leak edges.
roam_dark_matterFile pairs that co-change without structural links (hidden coupling).
roam_dashboardUnified single-screen codebase status: health, hotspots, bus factor, dead code, AI rot.
roam_dead_codeUnreferenced exported symbols (dead code candidates).
roam_debtPrioritized tech debt with SQALE remediation cost estimates.
roam_delete_checkGate the diff (working / staged / PR / HEAD) on surviving references to deleted symbols and files. Per-deletion verdict: SAFE (no surviving references), LIKELY-SAFE (survivors only in tests / docs / unreachable code), or BREAK-RISK (survivors in reachable code). Different from roam_critique (PR-wide diff review) -- this targets the deletion surface specifically with CI-gate semantics (overall BREAK-RISK trips the gate).
roam_depsFile-level imports and importers (what depends on this file).
roam_describeAuto-generate a project description for AI coding agents: multi-section Markdown report covering overview, directories, entry points, key abstractions, architecture, and testing. Different from roam_understand (compact codebase overview) -- this is the comprehensive prose description for CLAUDE.md / AGENTS.md / .cursor/rules. The wrapper emits to stdout; on-disk writes are deferred to the CLI (roam describe --write) so the MCP surface stays read-only.
roam_dev_profileDeveloper behavioral profiling: commit time patterns, change scatter (Gini), burst detection.
roam_diagnoseRoot cause analysis: upstream/downstream suspects ranked by composite risk.
roam_diagnose_issueFind root-cause suspects for a failing symbol. Use when user reports 'X is broken', 'test Y fails', 'why does Z return null?', 'this exception traces to ...'. Ranks upstream / downstream callers by composite risk and lists side effects + transactional boundaries. Replaces manual Grep+Read traversal of the call graph. Pass the suspect symbol.
roam_diffShow the blast radius of your edits BEFORE you commit. Run after Edit/Write tools to see affected symbols, files, tests, plus coupling and fitness warnings. Use when user asks 'what did my change break?', 'safe to commit?'. Replaces ad-hoc git diff --stat inspection with graph-aware impact data. For PR-level risk verdict, use roam_pr_risk.
roam_disambiguateList every symbol matching a name with file/line/kind/signature/PageRank — pick the right overload.
roam_doc_intentLink documentation to code: find drift, dead refs, undocumented symbols.
roam_doc_stalenessDetect stale docstrings: docs whose body has drifted since the comment was written. Uses git blame to compare docstring timestamps against code body timestamps. Different from roam_docs_coverage (missing docs ranked by PageRank) and roam_stale_refs (dangling doc links) -- this audits what existing docs SAY.
roam_docs_coverageDoc coverage + stale-doc drift with PageRank-ranked missing docs.
roam_doctorSetup diagnostics: Python version, tree-sitter, git, index existence, freshness, SQLite.
roam_dogfoodOne-shot full-stack run: audit + pr-analyze + audit-trail + conformance.
roam_dogfood_aggregateTriage view over the dogfood eval corpus: totals, per-command findings count, by-status / by-severity / by-type breakdowns. Reads internal/dogfood/evals/ (or an override path). Useful for agents auditing roam-code itself; mostly a no-op on consumer repos that have no dogfood corpus.
roam_driftOwnership drift detection: declared CODEOWNERS vs actual time-decayed contributors.
roam_duplicatesDetect semantically duplicate functions via structural similarity.
roam_effectsSide effects of functions: DB writes, network, filesystem (direct + transitive).
roam_endpointsList all REST/GraphQL/gRPC endpoints with handlers, methods, and locations.
roam_entry_pointsCatalog every entry point into the codebase: HTTP routes, CLI commands, scheduled jobs, event handlers, message consumers, main functions, and exports. Reports per-entry reachability coverage -- what fraction of symbols each entry transitively reaches through the call graph.
roam_eval_retrieveRun the retrieval eval harness over a labeled task set. Reports recall@K, mean reciprocal rank, and per-task diagnostics. Supports a weight sweep and CodeRAG-Bench / BEIR emit formats for public leaderboard submission.
roam_evidence_diffDiff two ChangeEvidence packets: shows hash drift, schema drift, added/removed refs, missing evidence, and changed verdicts. Useful for reviewing PR re-runs, comparing replay windows, or auditing whether a fresh evidence packet has improved or regressed against a stored baseline. Different from roam_compare (two-index structural delta) -- this is the two-packet evidence delta.
roam_evidence_doctorDiagnose a ChangeEvidence packet's health: schema validity, closed-enum conformance, content_hash integrity, completeness banner tier (STRONG / PARTIAL / INSUFFICIENT), declared redactions, and actionable next steps for partial / missing evidence questions. Read-only.
roam_evidence_oscalEmit an OSCAL v1.2 document. Default kind='control-mapping' compiles the roam control map (maps roam evidence to EU AI Act, ISO/IEC 42001, NIST AI RMF, NIST AI 600-1, NIST SP 800-218A, SOC 2, internal AI-change policy). kind='assessment-results' compiles a per-run AR document from a ChangeEvidence packet (requires evidence_path); AR mandates an Assessment Plan reference — pass import_ap_ref for an external AP or omit it to inline a synthesized stub AP. Supports evidence for the listed frameworks — does not certify compliance. Two roam-specific concepts (authority_refs, redactions) surface as OSCAL prop extensions under the urn:roam:oscal:v1 namespace.
roam_expand_toolsetList available tool presets or show contents of a preset. Presets: core (57), review (70), refactor (70), debug (69), architecture (71), compliance (13), full (227).
roam_exploreCodebase exploration bundle: understand overview + optional symbol deep-dive in one call.
roam_fanShow fan-in / fan-out: the most-connected symbols or files. Flags hub / spreader / HIGH-RISK structural hotspots based on cross-file import / call edges. Different from coupling (co-change frequency) -- this measures structural connectivity.
roam_fetch_handleFetch all or part of a large payload by handle — supports byte slice, section pick, jq projection.
roam_file_infoFile skeleton: all symbols with signatures, kinds, line ranges.
roam_findings_countShow per-detector finding counts. Useful for spotting which detectors have migrated to the central registry vs which are still only emitting to their detector-specific tables.
roam_findings_listList rows from the central findings registry, optionally filtered by detector or subject. Cross-detector view -- every migrated detector (clones, dead, complexity, smells, n1, missing-index, ...) emits here behind one schema.
roam_findings_showShow full detail for a single finding by its stable finding_id_str. Returns the detector version, subject, confidence tier, claim, evidence JSON, and any suppressions.
roam_fingerprintTopology fingerprint for cross-repo comparison or structural drift tracking.
roam_fitnessRun architectural fitness functions from .roam/fitness.yaml: dependency constraints, layer enforcement, metric thresholds, naming conventions, and trend regression guards. Different from roam_preflight (compound 6-signal pre-edit gate) -- this is the dedicated fitness surface with per-rule output, baseline / delta mode, and trend regression guards.
roam_flag_deadDetect potentially stale feature-flag code: flags referenced only once, flags always checked with the same boolean default, and flags clustered in a single file. Recognises LaunchDarkly, Unleash, Split, generic feature_flag(...) calls, and FEATURE_* env-var patterns. Different from roam_dead_code (graph-unreachable symbols) -- this targets code that is alive in the graph but gated behind flags that may never fire.
roam_fleet_planPlan a multi-agent fleet for a goal — graph-aware partition (Louvain + co-change) emits .roam-fleet.json for Composio / Copilot CLI / raw.
roam_fn_couplingShow function-level temporal coupling: symbol pairs that change together across commits. Different from roam_coupling (file-level pairs) -- this drills into co-changing symbols inside and across files, with optional structural-edge filtering.
roam_for_bug_fixCompound: diagnose + affected_tests + diff + context for a symbol you're about to debug.
roam_for_new_featureCompound: understand + search + context + complexity for an area you're about to add code to.
roam_for_refactorCompound: preflight + impact + complexity_report + clones for a symbol you're about to refactor.
roam_for_security_reviewCompound: taint + vuln + critique + adversarial for a security review pass.
roam_forecastPredict when metrics will exceed thresholds (Theil-Sen regression).
roam_generate_planStructured execution plan for code modification: read order, invariants, tests.
roam_get_annotationsRead annotations for symbols, files, or project. Filter by tag/date.
roam_get_invariantsImplicit contracts for symbols: signature stability, usage spread, breaking risk.
roam_graph_diffShow the structural graph delta between two snapshots. Surfaces new / removed symbols, edge churn, degree shifts, new cycles, layer migrations, and likely renames. Reads persisted snapshots from .roam/snapshots/ -- capture one with --save-snapshot.
roam_graph_statsReport graph-level invariants: density, connected components, average in/out degree, top in-degree symbols, and approximate diameter. One overview number for 'how dense, connected, and cyclic is this codebase'.
roam_grepRun index-aware grep across the codebase. Returns matches with their enclosing symbol, reachability badge, PageRank, clone-class, and bridge annotations. Supports multi-pattern, source-only / test-only filters, reachable-from / unreachable filters, co-occurrence across patterns, and rank-by importance.
roam_guardCheck breaking-change risk for a symbol before editing: 0..100 risk score with component breakdown (blast radius, complexity, centrality, test gap, layer analysis) plus caller / callee lists and covering tests -- all within a ~2K-token budget. Different from roam_preflight (file / staged / coupling / convention / fitness composite) -- this is the per-symbol quantified risk score for sub-agent dispatch.
roam_healthCodebase health score (0-100) with issue breakdown, cycles, bottlenecks.
roam_history_grepRun git pickaxe (-S / -G) through commit history. Returns commits that introduced or removed the literal string, with author, date, short SHA, and summary per commit.
roam_hotspotsShow runtime hotspots: symbols ranked by static analysis vs real production traces (requires roam ingest-trace to have populated runtime_stats). Each row is tagged UPGRADE (runtime-critical but statically safe), CONFIRMED (both agree), or DOWNGRADE (statically risky but low traffic). Different from roam_why_slow (top-N by latency alone) -- this classifies static vs runtime mismatch.
roam_hoverOne-line architectural summary for a symbol — kind, location, blast-radius bucket, top caller, top callee.
roam_idempotencyClassify symbols by retry safety: idempotent (pure, read-only I/O, write-with-check patterns like mkdir(exist_ok=True) / INSERT OR IGNORE / UPSERT / if not exists: create), non_idempotent (naive writes, mutations, appends), or unknown (process spawn / unreadable body). Composes on top of roam_side_effects. Different from roam_tx_boundaries (transaction correctness) -- this answers is it safe to retry?.
roam_impactBlast radius for 'is it safe to change?' — symbols + files affected, in 5 lines. Compact decision-support output. Round 4 / S: the right default tool for safety-checks; preflight is heavier.
roam_ingest_traceIngest runtime traces (OTel/Jaeger/Zipkin), match spans to symbols.
roam_initInitialize roam and build the first index. Task-mode for non-blocking setup.
roam_intentLink documentation to code: find which docs mention which symbols, and detect doc-to-code drift (references to non-existent symbols). Different from roam_docs_coverage (PageRank-ranked missing-docstring hotlist) and roam_doc_staleness (stale docstring content) -- this is the prose-doc-to-symbol linker plus drift detector.
roam_invariantsDiscover implicit contracts for a symbol or the public API surface: signature shape, parameter count and ordering, usage spread across files, dependency set. Different from roam_check_rules (explicit governance rules) -- this is the AUTO-discovered implicit-contract surface so agents know what must stay stable when modifying a symbol.
roam_layersShow topological dependency layers and violations. Returns each layer's symbol count, directory breakdown, and any back-edges that violate the topological order. Different from roam_clusters (community detection) -- this measures dependency depth.
roam_llm_smellsRun LLM-API integration linter over indexed files: detects unpinned model versions, missing max_tokens, prompt injection via user-input concatenation, unvalidated json.loads on LLM output, and missing temperature. Different from roam_vibe_check (AI-generated code shape) and roam_smells (structural anti-patterns) -- this is the production gate for human-authored LLM-using code.
roam_mapShow project skeleton: directory tree, entry points, top symbols by PageRank, language counts. Different from roam_describe (prose description) and roam_minimap (sentinel-block one-pager for CLAUDE.md) -- this is the structured skeleton with directories, entry points, and ranked symbols for agent onboarding.
roam_metricsShow unified per-file or per-symbol metrics: cognitive complexity, fan-in / fan-out, SNA centrality vector (PageRank / betweenness / closeness / eigenvector / clustering coefficient), composite debt score, churn, test coverage, and comprehension difficulty in a single view.
roam_metrics_pushPush metrics-only summary to Roam Cloud Lite. Default is dry-run.
roam_migration_planGenerate an ordered migration plan with risk + blast-radius per step from a target-architecture YAML spec or inline --move SYMBOL=path/to/new/file directives. Each step is annotated with caller count and a derived risk score so agents can decide where to stop or insert tests. Stops at the first step exceeding max_risk. Different from roam_simulate (counterfactual single-move analysis) -- this is the ordered multi-step plan with a risk gate.
roam_migration_safetyNon-idempotent database migrations (unsafe for re-run).
roam_minimapGenerate a compact ~20-line codebase minimap for CLAUDE.md injection: tech stack, annotated directory tree, key symbols by PageRank, high-fan-in symbols to avoid, hotspots, detected conventions. Different from roam_describe (long-form prose) and roam_map (structured skeleton) -- this is the sentinel-block one-pager. The wrapper emits to stdout; on-disk updates are deferred to the CLI (roam minimap --update / --init-notes) so the MCP surface stays read-only.
roam_missing_indexQueries on non-indexed columns (slow query risk).
roam_moduleShow directory contents: exported symbols, signatures, external imports / importers, internal cohesion percentage, and API surface ratio. Different from roam_describe (project-wide) -- this analyses a single directory.
roam_mutateAgentic editing: move/rename/add-call/extract symbols with auto-import rewrite.
roam_n1Detect N+1 I/O patterns in ORM code (Laravel/Django/Rails/SQLAlchemy/JPA).
roam_nextSuggest the next roam command based on cheap repo-state signals: index presence, staleness, working-tree dirtiness, recent envelope, and recent memory. Emits one imperative recommendation in <200ms. Different from roam_brief (multi-section session kickoff) and roam_workflow (curated multi-step recipes) -- this is the single-command router.
roam_onboardGenerate a new-developer onboarding guide for the codebase.
roam_oracle_batchRun multiple oracle queries in one call. Items: [{name, oracle, max_hops?}, ...] where oracle is one of symbol-exists, route-exists, is-test-only, is-reachable-from-entry, is-clone-of.
roam_oracle_is_clone_ofAnswer the boolean oracle question: does this symbol have persisted clone siblings in the clone_pairs table? Returns a yes/no verdict envelope with the matched clone class size. Different from roam_clones (full clone-pair enumeration) -- this is the cheap boolean lookup for one symbol's clone status.
roam_oracle_is_reachable_from_entryAnswer the boolean oracle question: is the symbol reachable from any entry point via the call graph (BFS up to max_hops depth)? Useful for sniffing orphans and production-vs-tooling code. Different from roam_dead_code (broad dead-symbol detection) and roam_entry_points (entry-point enumeration) -- this is the cheap boolean lookup for one symbol's reachability.
roam_oracle_is_test_onlyAnswer the boolean oracle question: are ALL callers of this symbol in test files? Useful for sniffing test fixtures and dead-but-test-only helpers. Different from roam_dead_code (broad dead-symbol detection) -- this is the cheap boolean lookup for one symbol's test-only status.
roam_oracle_route_existsAnswer the boolean oracle question: does a route handler match this URL path? Returns a yes/no verdict envelope with the matched handler's file + kind when found. Different from roam_endpoints (full endpoint enumeration) -- this is the cheap boolean lookup for one route precondition check.
roam_oracle_symbol_existsAnswer the boolean oracle question: does a symbol with this name exist in the index? Returns a yes/no verdict envelope with the matched symbol's file + kind when found. Different from roam_search_symbol (top-N ranked hits) -- this is the cheap boolean lookup for agent precondition checks.
roam_oracle_test_onlyAlias of roam_oracle_is_test_only — preserves the shorter name agents sometimes guess.
roam_orchestratePartition codebase for parallel multi-agent work with exclusive write zones.
roam_orphan_importsList imports that don't resolve to any indexed module or installed package -- catches typo'd local imports, missing packages, and dangling relative imports. Covers Python (default), JavaScript / TypeScript, and Go. Different from roam_dead_code (unused symbols) -- this targets import-statement orphans.
roam_orphan_routesBackend routes with no frontend consumer (dead endpoints).
roam_over_fetchModels serializing too many fields (data over-exposure risk).
roam_ownerShow code ownership computed from git blame: per-author line counts, percentages, last-active dates, and a fragmentation index. Works on a file or a directory prefix. Different from roam_codeowners (which reads the CODEOWNERS file) -- this measures actual ownership.
roam_partitionMulti-agent work partitioning: split codebase into independent work zones.
roam_path_coverageCritical call paths with zero test protection, ranked by risk.
roam_patternsDetect positive architectural patterns: Singleton, Factory, Observer, Repository, Middleware, Strategy, and Decorator. Different from roam_smells (negative anti-patterns) -- this discovers intentional design patterns.
roam_planGenerate a structured execution plan for modifying code: read-order (call-graph BFS), invariants (mined contracts), blast-radius preview, and per-task heuristics. Five task types: refactor / debug / extend / review / understand. Different from roam_plan_refactor (refactoring-specific simulation) and roam_preflight (blast-radius gate) -- this is the general-purpose work plan for any task type.
roam_plan_refactorBuild an ordered refactor plan for one symbol using risk/test/simulation context.
roam_postmortemReplay current detectors against past commits: walks a git commit range, runs roam critique against each commit's diff, and reports which findings would have surfaced pre-merge. Useful for retrospective replay -- 'would today's detector set have caught the incidents already in history?' Different from roam_pr_replay (one PR replay) -- this is the range-replay over historical commits.
roam_pr_analyzeAgent-aware PR risk verdict — INTENTIONAL / SAFE / REVIEW / BLOCK.
roam_pr_comment_renderRender a markdown PR comment from a pr-analyze JSON envelope.
roam_pr_diffStructural graph delta of code changes: metric deltas, layer violations.
roam_pr_prepOne-shot pre-PR fitness check: bundles diff blast radius + critique + pr-risk into a single envelope with a ready_to_open verdict. Different from roam_pr_risk (composite risk score alone) and roam_critique (clones-not-edited + blast-radius alone) -- this is the three-section pre-PR rollup with the go/no-go verdict.
roam_pr_riskRisk score (0-100) for pending changes with per-file breakdown.
roam_preflightPre-change safety check: blast radius, tests, complexity, fitness. Call BEFORE modifying code.
roam_prepare_changeBundle everything needed before editing a symbol: blast radius, affected tests, files to read, side effects, fitness gates. Call BEFORE Edit/Write on non-trivial code. Use when user asks 'is it safe to change X?', 'what do I need to know to refactor Y?', 'show me what depends on Z'. Replaces manual preflight + context + effects sequence.
roam_py_modernPython modernisation signal: walrus, match, PEP 604/585, f-strings vs legacy.
roam_py_typesPython type-annotation health: % public fns fully typed, Any usage, legacy typing.
roam_pytest_fixturespytest fixture chain: top fixtures by dependent count, or per-symbol dependency walk.
roam_recommendSurface symbols related to a given symbol via three signal sources combined: call-graph neighbours (1-hop in + out), git co-change (other symbols whose files changed in the same commits), and persisted clone siblings (when roam clones --persist was run). Each candidate gets a score that's the normalised sum of the three contributions. Different from roam_impact (transitive blast radius) and roam_neighbours (graph-only 1-hop neighbours) -- this fuses co-change + clones into the ranking.
roam_refs_textAudit literal strings across the project and emit a per-string verdict: SAFE-TO-REMOVE / REVIEW / LOAD-BEARING. Groups every reference by surface (code, test, docs, config, generated, vendored) and annotates reachability for code hits.
roam_reindexIncremental or force reindex. Task-mode + elicited confirmation for force runs.
roam_relateHow symbols connect: shared deps, call chains, conflicts, cohesion score.
roam_repo_mapCompact project skeleton with key symbols per file, by PageRank.
roam_reportRun a compound report preset (built-ins: first-contact, security, pre-pr, refactor, guardian) that orchestrates multiple analysis commands into one rendered report. Different from roam_audit (single fixed bundle) -- this is the preset-driven multi-command roll-up with optional Markdown output and strict exit-code gating.
roam_resetDelete index DB and rebuild from scratch. Requires force=True. Recovery for corrupted indexes.
roam_retrieveGraph-aware context for free-form tasks: FTS5 + structural rerank (PageRank + clones) + token budget.
roam_review_changeChange review bundle: pr-risk + breaking changes + structural diff in one call.
roam_riskRank symbols by domain-weighted risk: combines static risk (fan-in + fan-out + betweenness) with domain criticality weights so financial / auth / data-integrity symbols rank higher than UI symbols. Different from roam_fan (raw fan-in/out degree) and roam_hotspots (runtime hotspot classification) -- this is the semantic-domain-weighted risk heatmap.
roam_rules_checkEvaluate custom governance rules from .roam/rules/ YAML files.
roam_rules_validateLint a .roam/rules.yml for shippability before customers see it.
roam_runtime_hotspotsRuntime hotspots where static and runtime rankings disagree (UPGRADE/DOWNGRADE).
roam_safe_deleteFuse dead-code, blast-radius, and test-coverage signals into a single deletion verdict: SAFE / REVIEW / UNSAFE. Reports direct callers (non-test), transitive dependents, affected files, and a public-API bump that flips SAFE -> REVIEW for exported symbols whose name matches a common public-API prefix. Different from roam_dead_code (all unreferenced symbols) and roam_impact (transitive blast radius) -- this is the single go/no-go gate.
roam_safe_zonesClassify the refactor containment zone around a symbol or file: ISOLATED (no external connections), CONTAINED (<=5 boundary symbols), or EXPOSED (>5). Reports strictly-internal vs boundary symbols and external caller / callee counts per boundary. Different from roam_impact (unbounded reverse blast radius) and roam_closure (exact locations needing modification) -- this maps the bounded zone where it is safe to refactor freely.
roam_sbomEmit a Software Bill of Materials (CycloneDX 1.7 by default, or SPDX 2.3) enriched with call-graph reachability — distinguishes phantom dependencies from those actually exercised. Pair with --aibom for the AIBOM extension required by EU AI Act Art. 50.
roam_search_semanticFind symbols by natural language query (hybrid BM25 + vector + framework packs).
roam_search_symbolLook up a function / class / method by partial name. Use when user mentions a symbol ('the login handler', 'AuthService.refresh', 'handleSave') and you need the file path, line, kind, and qualified name. Replaces Bash: grep -n 'def name' src/ + Read. Returns PageRank-ranked results — most-important match first. Do NOT use for finding references — that's roam_uses. For 3+ patterns at once use roam_batch_search.
roam_secretsScan for hardcoded secrets, API keys, tokens, passwords (24 patterns).
roam_semantic_diffStructural change summary: what symbols were added/removed/modified.
roam_session_metricsLocal-only telemetry: per-tool invocation counts grouped by outcome (success / rate_limited / error). Helps answer "which tools are agents actually using?" and "are 90 of the 227 tools dead weight?". Never phones home — counters live in the MCP server process and reset on restart.
roam_side_effectsClassify symbols by side-effect bucket: none (pure), io_read (disk / network / DB read), io_write (disk / network / DB write), mutation (global / module state mutation), process (subprocess / thread / async), or unknown. Coarse five-bucket taxonomy designed for agent decisions. Different from roam_effects (finer 11-kind taxonomy + transitive propagation) -- this is the agent's go/no-go classifier for can I retry this safely?.
roam_simulatePredict metric deltas from move/extract/merge/delete operations.
roam_simulate_departureSimulate knowledge loss if a developer leaves the team.
roam_sketchRender a compact structural skeleton of a directory: every file's exported symbols with kind, signature, line range, and first-line docstring. Different from roam_understand (broader project overview) and roam_file_info (one-file skeleton) -- this is the directory-level API surface in a single view, with optional full=True to include private symbols.
roam_smellsRun 24 deterministic code-smell detectors over the indexed codebase: brain methods, god classes, deep nesting, shotgun surgery, feature envy, long parameter lists, large classes, dead params, low cohesion, message chains, data clumps, type switches, cross-layer clones, parallel hierarchies, and more. Different from roam_vibe_check (AI-rot pattern regex) and roam_patterns (positive design patterns) -- this surfaces negative structural anti-patterns from DB queries.
roam_spectralSpectral bisection: Fiedler vector partition tree and modularity gap.
roam_splitAnalyse a file's internal call / reference graph and propose natural decomposition groups via Louvain community detection. Reports per-group isolation %, internal vs cross-group edges, and ranked extraction candidates (groups with >=3 symbols and >=50% isolation). Different from roam_clusters (repo-wide module partitioning) -- this analyses ONE file's internal seams.
roam_stale_refsFind dangling file references — markdown links / HTML href-src / backtick paths whose target is missing. v12.48 adds anchor validation, confidence-tagged hints, --diff branch filter, --fix preview/apply, and --sort-by ranking. Set enrich_with_llm=True for LLM-sampled hints on findings the deterministic providers couldn't resolve.
roam_statsAggregate high-level statistics: language / role / kind counts plus a recent-commit activity counter over a configurable window. Different from roam_metrics (per-symbol static-metric report) and roam_graph_stats (graph-wide topology stats) -- this is the language-and-role inventory snapshot.
roam_suggest_refactoringRank proactive refactoring candidates using complexity/coupling/churn/smells.
roam_suggest_reviewersSuggest optimal code reviewers for changed files.
roam_supply_chainDependency risk dashboard: pin coverage, risk scoring, supply-chain health.
roam_symbolSymbol definition, callers, callees, PageRank, fan-in/out metrics.
roam_syntax_checkTree-sitter syntax validation. Finds ERROR/MISSING AST nodes. No index needed.
roam_taintGraph-reach taint analysis. Returns OpenVEX-shaped findings (spec-legal status + justification — never code_not_reachable). 10 starter rule packs: sqli, xss, ssrf, path-traversal, command-injection, deserialization, open-redirect, urllib, socketio, fileupload. Pair with --ci to gate on findings (exit 5).
roam_taint_classifyRun roam taint then ask the agent's own LLM (via MCP sampling) to classify each reachable finding as IDOR/AUTHZ/SQLI/XSS/CMD_INJECTION/etc. with confidence + reasoning. Counter to Semgrep Multimodal — same LLM-reasoning narrative without a hosted API key.
roam_test_gapsFind changed symbols missing test coverage, ranked by severity.
roam_test_hermeticityDetect non-hermetic test patterns that cause CI flakiness. Six closed-enum kinds: network, time, random, filesystem, env, subprocess. AST-driven (not regex) with module-level suppression for monkeypatch / freezegun / responses / random.seed.
roam_test_impactTests transitively reachable from changed symbols — sharper scope than affected_tests.
roam_test_mapMap a symbol or file to its current test coverage: direct test edges (test file calls the symbol), file-level importers (test file imports the symbol's module), and convention-based matches (Salesforce <Name>Test / <Name>_Test classes). Different from roam_test_gaps (untested symbols in changed files) and roam_affected_tests (forward trace from changes to affected tests) -- this is the lookup for what currently exercises a given symbol.
roam_test_pyramidCount indexed test files by kind (unit / integration / e2e / smoke / unknown) using path and name conventions, and flag inverted pyramids (when e2e + integration > unit). Different from roam_test_gaps (missing coverage) -- this measures the shape of the existing test suite for slow-CI risk.
roam_test_scaffoldGenerate a test-file skeleton for a source file or symbol (functions, classes, methods) with the right imports and per-symbol stub blocks. Supports pytest / unittest (Python), jest / mocha / vitest (JS/TS), Go testing, JUnit4 / JUnit5 (Java), and RSpec / Minitest (Ruby). Dry-run by default; pair with roam_test_map first to confirm no existing coverage. Skips symbols that already have tests in the target file.
roam_timelineChronological commits that touched the file owning a symbol — author, date, lines added/removed.
roam_tourCodebase onboarding guide: reading order, entry points, architecture roles.
roam_traceShortest dependency path between two symbols with hop details.
roam_trendsHistorical metric tracking: record and query health metric trends over time.
roam_tx_boundariesClassify functions by transactional safety: transactional (begin matched by commit/rollback, all mutations inside scope), partial_transactional (mutations both inside AND outside scope), unsafe_mutation (mutations OUTSIDE any transaction wrapper -- latent bug), unmatched_begin (begin without commit/rollback -- leak), unmatched_commit, non_transactional, or unknown. Composes on top of roam_side_effects. Different from roam_idempotency (retry safety) -- this gates transaction correctness.
roam_understandBrief Claude on an unfamiliar codebase in one call. Use when user asks 'what is this repo?', 'where do I start?', 'give me the lay of the land'. Returns stack, architecture layers, entry points, hotspots, conventions (~2-4K tokens). Do NOT explore with Glob/Grep first — start here.
roam_usesList every caller, importer, and subclass of a symbol — structured by edge type. Use when user asks 'where is X used?', 'who calls Y?', 'what breaks if I rename Z?'. Graph-precise: no comment / string-literal false positives that multi-shape Grep produces. Do NOT use Bash:grep for references — this is the right tool. For 3+ symbols call roam_batch_get (one round-trip) instead.
roam_validate_planPre-apply validator for a multi-step change plan. Returns blockers, warnings, advice per operation.
roam_verifyCheck changed files for naming, import, error-handling, and duplicate issues.
roam_verify_importsHallucination firewall: validate import statements resolve to indexed symbols.
roam_vibe_checkAI rot score (0-100): 8-pattern taxonomy of AI code anti-patterns.
roam_visualizeGenerate Mermaid/DOT architecture diagram with smart filtering.
roam_vuln_mapIngest vulnerability scanner reports (npm/pip/trivy/osv), match to symbols.
roam_vuln_reachVulnerability reachability through call graph: paths, hops, blast radius.
roam_weatherChurn x complexity hotspot ranking: highest-leverage refactoring targets.
roam_whyExplain why a symbol matters: role classification (Hub/Bridge/Leaf), transitive reach, critical-path membership, cluster cohesion, and a one-line verdict. Accepts multiple symbol names for batch triage. Different from roam_fan (raw connectivity ranking) and roam_preflight (blast-radius gate before edit) -- this is the per-symbol role explainer for triage and onboarding.
roam_why_failTriage a failing test/symbol: recently-changed symbols transitively reachable from it.
roam_why_slowRank runtime hotspots by cost = log10(call_count + 1) * p99_latency_ms. Reads runtime_stats populated by roam ingest-trace. Optionally restricts to symbols in changed files vs a base ref. Different from roam_hotspots (static-vs-runtime classification) -- this is the pure latency-weighted ranking.
roam_workflowInspect a workflow recipe DAG, list available recipes, or suggest what to run next given a prior command. Useful as an agent navigation aid: 'I just ran roam impact -- what should I run next?' Different from the heavyweight analytical recipes -- this is the metadata-only recipe browser.
roam_ws_contextCross-repo augmented context for a symbol spanning multiple repos.
roam_ws_understandMulti-repo workspace overview: per-repo stats, cross-repo connections.
roam_x_langShow cross-language symbol bridges: Protobuf .proto -> generated Go/Java/Python stubs, Salesforce Apex -> Aura/LWC/Visualforce, REST API frontend -> backend route, template variable -> source, and env-var read -> .env definition. Call this tool to list every registered bridge type.

Core preset tools: roam_affected_tests, roam_alerts, roam_ask, roam_audit_trail_conformance_check, roam_audit_trail_export, roam_audit_trail_verify, roam_batch_get, roam_batch_search, roam_catalog, roam_complete, roam_complexity_report, roam_context, roam_critique, roam_dead_code, roam_deps, roam_diagnose, roam_diagnose_issue, roam_diff, roam_disambiguate, roam_dogfood, roam_explore, roam_fetch_handle, roam_file_info, roam_fleet_plan, roam_for_bug_fix, roam_for_new_feature, roam_for_refactor, roam_for_security_review, roam_health, roam_impact, roam_metrics_push, roam_oracle_is_clone_of, roam_oracle_is_reachable_from_entry, roam_oracle_is_test_only, roam_oracle_route_exists, roam_oracle_symbol_exists, roam_pr_analyze, roam_pr_comment_render, roam_pr_risk, roam_preflight, roam_prepare_change, roam_py_modern, roam_py_types, roam_retrieve, roam_review_change, roam_rules_validate, roam_search_symbol, roam_session_metrics, roam_syntax_check, roam_taint_classify, roam_test_impact, roam_timeline, roam_trace, roam_understand, roam_uses, roam_validate_plan, roam_why_fail.

MCP client setup (Claude Code / Claude Desktop / Cursor / VS Code)

Claude Code: claude mcp add roam-code -- roam mcp, or add to .mcp.json:

{ "mcpServers": { "roam-code": { "command": "roam", "args": ["mcp"] } } }

Claude Desktop — add to claude_desktop_config.json (include "cwd": "/path/to/your/project").

Cursor — add the same mcpServers block to .cursor/mcp.json.

VS Code + Copilot — add to .vscode/mcp.json under a servers key with "type": "stdio".

Go deeper

Pick the path that matches your role:

  • 5-min demo (CTO/CISO/dev-tools-lead): The Canonical Demo — install → health → preflight → critique → signed ChangeEvidence packet, five commands, no laptop egress.
  • Developer tutorial (15 min): Getting Started — install, index, query, ship.
  • Agent integration: roam mcp-setup claude-code (or cursor, continue) — then Using Roam via MCP for the cold-start envelope and canonical agent loop.
  • Full surface: Command Reference — every command, flag, and JSON envelope.
  • Architecture: How it fits together — graph, findings registry, run ledger, evidence compiler.

CI/CD integration

All you need is Python 3.10+ and pip install roam-code.

# .github/workflows/roam.yml
name: Roam Analysis
on: [pull_request]

jobs:
  roam:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0
      - uses: Cranot/roam-code@main
        with:
          commands: health
          gate: "score>=70"
          sarif: true
          comment: true

roam init auto-generates this workflow. The Action accepts commands, gate (quality-gate expression, exit 5 on failure), sarif (upload to GitHub Code Scanning), comment (sticky PR comment), cache, and changed-only (incremental mode).

SARIF output. 37 commands honour the global --sarif flag (health, complexity, dead, smells, clones, vulns, taint, secrets, n1, …). Minimal upload:

- run: roam --sarif health > roam-health.sarif
- uses: github/codeql-action/upload-sarif@v3
  with:
    sarif_file: roam-health.sarif

For GitLab / Jenkins / Azure / Bitbucket templates, severity gates, and upload guardrails, see docs/ci-integration.md.

The CLI is Apache 2.0, fully local, zero-API-key, and never expires. Three optional paid layers build on the same engine:

  • Roam Review — hosted PR bot for AI-generated changes, built on roam pr-analyze. CodeRabbit/Greptile review PR semantics; Roam Review reads the graph (who calls the changed symbol, which layer it sits in) and emits a portable ChangeEvidence packet. The CLI engine is a working CI gate today: git diff main..HEAD | roam pr-analyze --gate (exit 5 on BLOCK).
  • Roam Cloud — opt-in metrics history with no source upload. roam metrics-push sends a summary-only payload (numerical metrics, paths or SHA-256 hashes, identifier names) — never source-code bodies. Inspect the exact payload with --dry-run.
  • PR Replay — one-shot paid audit of your last 30/90 merged PRs: a written structural-review report plus a founder walk-through. Free DIY sample via roam pr-replay --tier sample.

Early access — email hello@roam-code.com. Full pricing at https://roam-code.com/pricing.

Language Support

Tier 1 — Full extraction (dedicated parsers)

LanguageExtensionsSymbolsReferencesInheritance
Python.py .pyiclasses, functions, methods, decorators, variablesimports, calls, inheritanceextends, __all__ exports
JavaScript.js .jsx .mjs .cjsclasses, functions, arrow functions, CJS exportsimports, require(), callsextends
TypeScript.ts .tsx .mts .ctsinterfaces, type aliases, enums + all JSimports, calls, type refsextends, implements
Java.javaclasses, interfaces, enums, constructors, fieldsimports, callsextends, implements
Go.gostructs, interfaces, functions, methods, fieldsimports, callsembedded structs
Rust.rsstructs, traits, impls, enums, functionsuse, callsimpl Trait for Struct
C / C++.c .h .cpp .hpp .ccstructs, classes, functions, namespaces, templatesincludes, callsextends
C#.csclasses, interfaces, structs, enums, records, methods, properties, delegates, eventsusing directives, calls, new, attributesextends, implements
PHP.phpclasses, interfaces, traits, enums, methods, propertiesnamespace use, calls, static calls, newextends, implements, use (traits)
Ruby.rbclasses, modules, methods, singleton methods, constantsrequire, require_relative, include/extend, callsclass inheritance
Kotlin.kt .ktsclasses, interfaces, enums, objects, functions, methods, propertiesimports, calls, type refsextends, implements
Scala.scala .scclasses, traits, objects, case classes, functions, val/var, type aliasesimports, calls, newextends, with (trait mixins)
Swift.swiftclasses, structs, enums, protocols, functions, methods, propertiesimports, calls, type refsextends, conforms
Dart.dartclasses, mixins, extensions, enums, type aliases, functions, methods, constructorsimports, calls, type refsextends, implements, with
Visual FoxPro.prgfunctions, procedures, classes, methods, properties, constantsDO, SET PROCEDURE/CLASSLIB, CREATEOBJECT, obj.method()DEFINE CLASS ... AS
SQL (DDL).sqltables, columns, views, functions, triggers, schemas, types, sequencesforeign keys, view table deps, trigger refs
YAML (CI/CD).yml .yamlGitLab CI jobs/anchors, GitHub Actions workflows/jobs, generic top-level keysextends:, needs:, !reference, uses:
HCL / Terraform.tf .tfvars .hclresource, data, variable, output, module, provider, localsvar.*, module.*, data.*, local.*
Vue / Svelte.vue .sveltevia <script> block extraction (TS/JS)imports, calls, type refsextends, implements
Salesforce ecosystem (Tier 1)
LanguageExtensionsSymbolsReferences
Apex.cls .triggerclasses, triggers, SOQL, annotationsimports, calls, System.Label, generic type refs
Aura.cmp .app .evt .intf .designcomponents, attributes, methods, eventscontroller refs, component refs
LWC (JavaScript).js (in LWC dirs)anonymous class from filename@salesforce/apex/, @salesforce/schema/, @salesforce/label/
Visualforce.page .componentpages, componentscontroller/extensions, merge fields, includes
SF Metadata XML*-meta.xmlobjects, fields, rules, layoutsApex class refs, formula field refs, Flow actionCalls

Cross-language edges mean roam impact AccountService shows blast radius across Apex, LWC, Aura, Visualforce, and Flows.

Tier 2 languages (and .jsonc / .mdx) get basic symbol extraction via a generic tree-sitter walker.

Performance

MetricValue
Index 200 files~3-5s
Index 3,000 files~2 min
Incremental (no changes)<1s
Any query command<0.5s

After the first full index, roam index only re-processes changed files (mtime + SHA-256 hash). Detailed indexing benchmarks across Express / Axios / Vue / Laravel / Svelte live in benchmarks/.

How It Works

Codebase
    |
[1] Discovery ──── git ls-files (respects .gitignore + .roamignore)
[2] Parse ──────── tree-sitter AST per file (28 languages)
[3] Extract ────── symbols + references (calls, imports, inheritance)
[4] Resolve ────── match references to definitions → edges
[5] Metrics ────── adaptive PageRank, betweenness, cognitive complexity, Halstead
[6] Algorithms ── 34-task anti-pattern catalog (O(n^2) loops, N+1, recursion, async)
[7] Git ────────── churn, co-change matrix, authorship, Renyi entropy
[8] Clusters ───── Louvain community detection
[9] Health ─────── per-file scores (7-factor) + composite score (0-100)
[10] Store ─────── .roam/index.db (SQLite, WAL mode)

Exclude paths with a .roamignore file (full gitignore syntax) or roam config --exclude "*.proto". For the graph algorithms (Personalized PageRank for blast radius, Tarjan SCC, Louvain, Fiedler bisection, Mann-Kendall trend detection, …) and the weighted-geometric-mean health score, see the Architecture guide.

How Roam Compares

roam-code is the only tool that combines graph algorithms (PageRank, Tarjan SCC, Louvain clustering), git archaeology, architecture simulation, and multi-agent partitioning in a single local CLI with zero API keys.

Capabilityroam-codeAI IDEs (Cursor, Windsurf)AI Agents (Claude Code, Codex)SAST (SonarQube, CodeQL)
Persistent local indexSQLiteCloud embeddingsNonePer-scan
Call graph analysisYesNoNoYes (CodeQL)
PageRank / centralityYesNoNoNo
Cycle detection (Tarjan)YesNoNoDeprecated (SonarQube)
Community detection (Louvain)YesNoNoNo
Git churn / co-changeYesNoNoNo
Architecture simulationYesNoNoNo
Multi-agent partitioningYesNoNoNo
MCP tools for agents227 (57 in default core preset)Client onlyClient only34 (SonarQube)
Languages2870+50+12-42
100% local, zero API keysYesNoNoPartial
Open sourceApache 2.0NoPartialPartial
Interprocedural taint depthshallow (OpenVEX-shaped)n/an/adeep (CodeQL)
Built-in rule packs10 taint packs, 10 governance rulesn/an/a2,000+ (Semgrep community)
Cross-repo at GitHub scaleworkspace overlay (sibling repos)n/an/anative (Sourcegraph)

Key Differentiators

  • vs AI IDEs (Cursor, Windsurf, Augment): roam-code provides deterministic structural analysis. AI IDEs use probabilistic embeddings that can't guarantee reproducible results.
  • vs AI Agents (Claude Code, Codex CLI, Gemini CLI): these agents read files one at a time. roam-code pre-computes relationships so agents get instant answers about architecture, blast radius, and dependencies.
  • vs SAST Tools (SonarQube, CodeQL, Semgrep): SAST tools find bugs and vulnerabilities. roam-code understands architecture — how code is structured, where it's coupled, and what breaks when you change it. Complementary, not competitive.
  • vs Code Search (Sourcegraph/Amp, Greptile): text search finds where code is. roam-code understands why code matters — which functions are central, which modules are tangled, which files are high-risk.
For teams — cost comparison
ToolAnnual cost (20-dev team)InfrastructureSetup time
SonarQube Server (paid tier)$15,000-$45,000Self-hosted serverDays
CodeScene$20,000-$60,000SaaS or on-premHours
Code Climate$12,000-$36,000SaaSHours
Roam (free CLI)$0 (Apache 2.0)None (local)5 minutes

The comparison is against the paid tiers a 20-dev team usually buys, not free Community editions. Roam complements either tier — pipe its SARIF output into the same Code Scanning surface. Rollout: pilot on one repo, add roam health --gate to CI as non-blocking, then tighten thresholds and track trajectory with roam trends.

FAQ

Does Roam send any data externally? No by default — zero telemetry, zero analytics, zero update checks. The single outbound surface is roam metrics-push: opt-in, summary metrics only, prints its exact payload locally under --dry-run. Source-code bodies never leave the machine.

Can Roam run in air-gapped environments? Yes. Once installed, no internet access is required.

Does Roam modify my source code? Read-only by default. Creates .roam/ with an index database. roam mutate (move/rename/extract) defaults to --dry-run; pass --apply explicitly to write changes.

How does Roam handle monorepos and multi-repo projects? Monorepos: indexes from the root; batched SQL handles 100k+ symbols. Multi-repo: roam ws init <repo1> <repo2> builds a workspace overlay DB for cross-repo API edges, then roam ws resolve / ws context / ws trace work across repos.

Is Roam compatible with SonarQube / CodeScene? Yes — they coexist in the same CI pipeline. SARIF output uploads to GitHub Code Scanning.

Does Roam satisfy SOC 2 / ISO 42001 / EU AI Act on its own? No. Roam maps to controls and produces supporting evidence — the signed ChangeEvidence packet, HMAC-chained run ledger, and audit-trail records answer the eight evidence questions a reviewer asks after an AI-assisted change. Roam does not certify; your auditor still owns that step.

What's the difference between the free CLI and Roam Review / Cloud / PR Replay? The CLI is Apache 2.0, fully local, and never expires. Roam Review is a hosted PR bot, Roam Cloud is opt-in metrics history with no source upload, PR Replay is a one-shot paid audit. All three are layers on top of the same engine.

Limitations

  • Static analysis primarily — can't trace dynamic dispatch, reflection, or eval'd code. Runtime trace ingestion (roam ingest-trace) adds production data but requires external trace export.
  • Import resolution is heuristic — complex re-exports or conditional imports may not resolve.
  • Limited cross-language edges — Salesforce, Protobuf, REST API, and multi-repo edges are supported, but not arbitrary FFI.
  • Tier 2 languages get basic symbol extraction only via the generic tree-sitter walker.
  • Large monorepos (100k+ files) may have slow initial indexing.

Troubleshooting

ProblemSolution
roam: command not foundEnsure install location is on PATH. For uv: uv tool update-shell
Another indexing process is runningDelete .roam/index.lock and retry
database is lockedroam index --force to rebuild
Unicode errors on Windowschcp 65001 for UTF-8
Symbol resolves to wrong fileUse file:symbol syntax: roam symbol myfile:MyFunction
Health score seems wrongroam --json health for factor breakdown
Index stale after git pullroam index (incremental). After major refactors: roam index --force

Update / Uninstall

# Update
pipx upgrade roam-code        # or: uv tool upgrade roam-code / pip install --upgrade roam-code

# Uninstall
pipx uninstall roam-code      # or: uv tool uninstall roam-code / pip uninstall roam-code

Delete .roam/ from your project root to clean up local data.

Contributing

git clone https://github.com/Cranot/roam-code.git
cd roam-code
pip install -e ".[dev]"   # includes pytest, ruff
pytest tests/              # all test cases must pass

Good first contributions: add a Tier 1 language (see go_lang.py or php_lang.py as templates), improve reference resolution, add benchmark repos, extend SARIF converters, add MCP tools. Please open an issue first to discuss larger changes.

License

Apache 2.0