iOS Simulator Skill for Claude Code

June 18, 2026 · View on GitHub

Ask DeepWiki

iOS Simulator Skill for Claude Code

Production-ready skill for building, testing, and automating iOS apps. 27 scripts optimized for both human developers and AI agents.

(If you'd prefer an MCP, XC-MCP)

Xcode Build + Simulator Automation

This skill covers both sides of iOS development:

  • Xcode builds via xcodebuild — compile, test, and parse results with progressive error disclosure
  • Simulator interaction via xcrun simctl and idb — semantic UI navigation, accessibility testing, device lifecycle

If you only need Xcode build tooling without the simulator scripts, see the plugin version: xclaude-plugin

Installation

In Claude Code:

/plugin marketplace add conorluddy/ios-simulator-skill
/plugin install ios-simulator-skill@conorluddy

Via Git Clone

# Personal installation
git clone https://github.com/conorluddy/ios-simulator-skill.git ~/.claude/skills/ios-simulator-skill

# Project installation
git clone https://github.com/conorluddy/ios-simulator-skill.git .claude/skills/ios-simulator-skill

Restart Claude Code. The skill loads automatically.

Prerequisites

  • macOS 12+
  • Xcode Command Line Tools (xcode-select --install)
  • Python 3
  • IDB (optional, for interactive features: brew tap facebook/fb && brew install idb-companion)
  • Pillow (optional, for visual diffs: pip3 install pillow)

Features

Xcode Build with Progressive Disclosure

The build_and_test.py script wraps xcodebuild with token-efficient output. A build returns a single summary line with an xcresult ID:

Build: SUCCESS (0 errors, 3 warnings) [xcresult-20251018-143052]

Then drill into details on demand:

python scripts/build_and_test.py --get-errors xcresult-20251018-143052
python scripts/build_and_test.py --get-warnings xcresult-20251018-143052
python scripts/build_and_test.py --get-log xcresult-20251018-143052

This keeps agent conversations focused — no walls of build output unless you ask for them.

Simulator Navigation via Accessibility

Instead of fragile pixel-coordinate tapping, all navigation uses iOS accessibility APIs to find elements by meaning:

# Fragile — breaks if UI changes
idb ui tap 320 400

# Robust — finds by meaning
python scripts/navigator.py --find-text "Login" --tap

The accessibility tree gives structured data (element types, labels, frames, tap targets) at ~10 tokens default output vs 1,600-6,300 tokens for a screenshot. See AI-Accessible Apps for more on why accessibility-first navigation matters for AI agents.

Screenshot Token Optimization

When screenshots are needed (visual verification, bug reports, diffs), the skill automatically resizes and compresses them to minimize token cost. Default output across all 27 scripts is 3-5 lines — 96% reduction vs raw tool output.

TaskRaw ToolsThis SkillSavings
Screen analysis200+ lines5 lines97.5%
Find & tap button100+ lines1 line99%
Login flow400+ lines15 lines96%

All 27 Scripts

Every script supports --help and --json. See SKILL.md for the complete reference.

Build & Development

ScriptWhat it doesKey flags
build_and_test.pyBuild Xcode projects, run tests, parse xcresult bundles--project, --scheme, --test, --get-errors, --get-warnings
log_monitor.pyReal-time log monitoring with severity filtering--app, --severity, --follow, --duration

Device State

ScriptWhat it doesKey flags
appearance.pySwitch dark mode, dynamic type, locale, region--theme, --text-size, --locale, --region, --reset
location.pySimulate GPS coordinates and run built-in scenarios--lat, --lng, --city, --gpx, --list-scenarios, --clear
ScriptWhat it doesKey flags
screen_mapper.pyAnalyze current screen, list interactive elements--verbose, --hints
navigator.pyFind and interact with elements semantically--find-text, --find-type, --find-id, --tap, --enter-text
gesture.pySwipes, scrolls, pinches, long press, pull to refresh--swipe, --scroll, --pinch, --long-press, --refresh
keyboard.pyText input and hardware button control--type, --key, --button, --clear, --dismiss
app_launcher.pyLaunch, terminate, install, deep link apps--launch, --terminate, --install, --open-url, --list

Testing & Analysis

ScriptWhat it doesKey flags
accessibility_audit.pyWCAG compliance checking on current screen--verbose, --output
visual_diff.pyCompare two screenshots for visual changes--threshold, --output, --details
test_recorder.pyAutomated test documentation with screenshots--test-name, --output
app_state_capture.pyDebugging snapshots (screenshot, hierarchy, logs)--app-bundle-id, --output, --log-lines
sim_health_check.shVerify environment (Xcode, simctl, IDB, Python)
model_inspector.pyInspect Core Data / SwiftData models from project files--project-path, --raw, --show-versions
container.pyInspect app sandbox: list, cat, UserDefaults, Core Data, export--ls, --cat, --userdefaults, --core-data-path, --export
hang_watcher.py (HangBuster)Record + summarise os_log hang events with progressive disclosure (session mode + raw NDJSON + legacy stream); auto-restart on stream death, automatic disk-cap cleanup--start [--raw-capture --max-size-mb N --no-gzip], --stop, --get-details, --list-sessions, --diff, --budget-tokens, --auto-sample (legacy: --watch, --since)
localization_audit.pyAudit .xcstrings catalogs for missing keys, unused keys, placeholder mismatches--catalog, --source, --strict

Permissions & Environment

ScriptWhat it doesKey flags
clipboard.pyCopy text to simulator clipboard for paste testing--copy, --test-name
status_bar.pyOverride status bar (time, battery, network)--preset, --time, --battery-level, --clear
push_notification.pySend simulated push notifications--bundle-id, --title, --body, --payload
privacy_manager.pyGrant, revoke, reset app permissions (13 services)--bundle-id, --grant, --revoke, --reset

Device Lifecycle

ScriptWhat it doesKey flags
simctl_boot.pyBoot simulators with readiness verification--name, --wait-ready, --timeout, --all, --type
simctl_shutdown.pyGracefully shutdown simulators--name, --verify, --all, --type
simctl_create.pyCreate simulators by device type and OS version--device, --runtime, --list-devices
simctl_delete.pyDelete simulators with safety confirmation--name, --yes, --all, --old
simctl_erase.pyFactory reset without deletion--name, --verify, --all, --booted

Configuration

Every operational limit — timeouts, output caps, polling intervals, cache size, post-action delays — is tunable via an IOS_SIM_* environment variable. Defaults are tuned for local development on Apple Silicon. Raise them on slow CI runners, large monorepos, or accessibility audits over complex screens. Lower them when you need faster failure or tighter token budgets.

There's a universal tradeoff to keep in mind:

  • Higher caps / longer timeouts → fewer false failures, more complete diagnostics, more tokens consumed by AI agents and slower failures when something is genuinely broken.
  • Lower caps / shorter timeouts → faster feedback, tighter token usage, risk of silently dropped errors or premature timeouts on legitimately slow operations.

Boot & lifecycle timeouts

How long to wait on xcrun simctl operations.

VariableDefaultTradeoff
IOS_SIM_BOOT_TIMEOUT300 (s)Wait for simulator readiness after boot. Lower → faster failure on broken sims. Higher → survives cold-start on slow CI runners (GitHub-hosted macOS can need 4–6 min).
IOS_SIM_BOOT_SUBPROCESS_TIMEOUT60 (s)Timeout for the simctl boot call itself (before readiness polling starts). Rarely needs changing; bump only if you see Boot command timed out on resource-starved CI.
IOS_SIM_ERASE_TIMEOUT90 (s)Wait for factory-reset verification. Larger simulators (lots of installed apps + data) can need more than the old 30s.
IOS_SIM_POLL_INTERVAL0.5 (s)How often to re-check boot/erase state. Lower → more responsive (more CPU). Higher → quieter on slow CI but adds latency to “ready” detection.
IOS_SIM_STATE_SUBPROCESS_TIMEOUT15 (s)Per-subprocess timeout in app_state_capture.py. Bump for apps with very large accessibility trees.

Build & test output caps

build_and_test.py returns counts by default and full details via xcresult ID; these caps govern what's surfaced in human/JSON output before progressive disclosure kicks in.

VariableDefaultTradeoff
IOS_SIM_BUILD_SUMMARY_CAP15Errors / failed tests in the default text summary. Lower → terser default output. Higher → less need to chase xcresult IDs for context.
IOS_SIM_BUILD_VERBOSE_CAP100Errors / warnings in --verbose mode. Mostly relevant for monorepos or first builds with many fixable warnings.
IOS_SIM_BUILD_JSON_CAP50Max errors / failed tests in --json output. Raise for CI dashboards that need exhaustive lists.
IOS_SIM_BUILD_LOG_PREVIEW4000 (chars)Chars of build log included in default output. Higher → more context for failures, more tokens.
IOS_SIM_BUILD_TIMEOUT1800 (s)Hard cap on a single xcodebuild build invocation. Default of 30 min covers most clean builds of large apps; raise for very large monorepos, lower to fail fast in CI when builds are expected to take seconds. Without this, a hung xcodebuild would block forever.
IOS_SIM_TEST_TIMEOUT2700 (s)Hard cap on xcodebuild test. Tests can take significantly longer than builds (45 min default) because of simulator boot + animation delays.
IOS_SIM_INTROSPECT_TIMEOUT60 (s)Timeout for xcodebuild -list and xcrun simctl list introspection calls. These should normally complete in <1s; 60s catches Xcode-toolchain hangs without disrupting cold-start.

Log monitor output

log_monitor.py aggregates os_log output; these caps shape both the text summary and the structured JSON.

VariableDefaultTradeoff
IOS_SIM_LOG_TEXT_SUMMARY15Errors / warnings shown in the text summary. The default surfaces enough for most debugging without flooding terminal output.
IOS_SIM_LOG_LINE_MAX300 (chars)Per-line truncation. Crash messages with full Swift symbol mangling can exceed 200 chars; raise if you see “…” cutting off the actionable bit.
IOS_SIM_LOG_TAIL200 (lines)Recent log lines shown in verbose mode and JSON sample_logs. Also used by xcode log excerpt. Lower for tighter context, higher for richer post-mortems.
IOS_SIM_LOG_JSON_CAP100Max errors / warnings in JSON output. Raise if you're piping into a dashboard that needs the full picture.
IOS_SIM_HANG_PREDICATE(default)Override the os_log predicate used by hang_watcher.py. The default catches RunningBoard watchdog kills, explicit "Hang detected" messages, and main-thread hang annotations. Hang events originate from system daemons (RunningBoard, SpringBoard, watchdog) — not the target app's process — so the predicate intentionally stays simulator-global. --bundle-id is applied post-parse against the event payload, never ANDed into the predicate.
IOS_SIM_HANG_MIN_MS250HangBuster threshold: events below this duration never reach disk.
IOS_SIM_HANG_SESSION_TTL_HOURS24HangBuster session prune age. Pruning runs on every --start.
IOS_SIM_HANG_DEFAULT_TOP_N3Default top-N clusters in --stop L1 output.
IOS_SIM_HANG_BUDGET_TOKENS(unset)Default token budget for --stop (picks L0/L1/L2 to fit).
IOS_SIM_HANG_MAX_RESTARTS3HangBuster worker: bounded log stream respawn attempts on EOF/subprocess death before the session is marked crashed. Set to 0 to disable auto-restart.
IOS_SIM_HANG_TOTAL_CAP_MB100HangBuster aggregate disk cap. When total session-state exceeds this on --start, oldest sessions are dropped first. Set to 0 to disable.

UI navigation & screen mapping

These shape what AI agents see when exploring a screen. They're the biggest direct lever on token consumption per navigation step.

VariableDefaultTradeoff
IOS_SIM_MAX_ELEMENTS25Tappable elements listed by navigator.py. Default is enough for most screens; raise to 100+ for dense Settings-style screens or audit workflows. High token impact at high values.
IOS_SIM_SCREEN_BUTTONS_PREVIEW15Button names in screen_mapper.py summary.
IOS_SIM_SCREEN_SECTION_ITEMS10Items per section in screen_mapper.py summary.
IOS_SIM_APPS_PREVIEW30Installed apps listed by app_launcher.py before truncation.
IOS_SIM_TAP_SETTLE_MS500 (ms)Delay after a tap before reading new state. Lower → faster navigation on snappy apps. Higher → safer on apps with heavy animations or async loading; raises end-to-end test runtime linearly.
IOS_SIM_RELAUNCH_DELAY_MS1000 (ms)Delay between terminate and re-launch in app_launcher.py --restart. Raise if you see relaunches catching the previous process still tearing down.

Accessibility audit

VariableDefaultTradeoff
IOS_SIM_A11Y_TOP_ISSUES10Top issues surfaced per audit. Default of 3 in older versions was almost always too aggressive for real-world apps. Raise for first-pass audits, lower for regression checks.
IOS_SIM_A11Y_LABEL_MAX80 (chars)Max chars of AXLabel retained in audit output. Custom localized labels often exceed 30 chars; 80 catches almost all.

Progressive disclosure cache

ProgressiveCache stores large outputs (build results, log dumps) keyed by short IDs so the default output stays minimal.

VariableDefaultTradeoff
IOS_SIM_CACHE_TTL_HOURS1How long cache entries remain valid. Lower → fresher data on next retrieve, more re-runs. Higher → faster reruns in long CI pipelines, risk of stale results if simulator state changed.
IOS_SIM_CACHE_MAX_ENTRIES500Hard cap; oldest entries (by mtime) are evicted on every save(). Prevents unbounded growth of ~/.ios-simulator-skill/cache/ in long-running environments. Raise only if you frequently need to retrieve entries older than ~500 saves.

Examples

# Slow GitHub Actions macOS runner — give boot up to 10 minutes
IOS_SIM_BOOT_TIMEOUT=600 python scripts/simctl_boot.py --wait-ready

# Monorepo with hundreds of warnings — see them all in verbose mode
IOS_SIM_BUILD_VERBOSE_CAP=500 python scripts/build_and_test.py --verbose

# Complex Settings-style screen — return more tappable elements
IOS_SIM_MAX_ELEMENTS=100 python scripts/navigator.py --list-tappable

# Snappy app — cut tap-settle delay in half for faster E2E runs
IOS_SIM_TAP_SETTLE_MS=250 python scripts/navigator.py --find-text "Login" --tap

# Long CI pipeline — keep cache entries valid for the whole job
IOS_SIM_CACHE_TTL_HOURS=8 python scripts/build_and_test.py --project MyApp.xcodeproj

You can also export the vars once for the whole session:

export IOS_SIM_BOOT_TIMEOUT=600
export IOS_SIM_LOG_TAIL=500
export IOS_SIM_MAX_ELEMENTS=50
# … all subsequent scripts honor them

Parse errors fall back to the documented default with a warning on stderr — no scripts will crash on a malformed env var.

Evaluation

Tested using Claude Code evals:

ConditionPass Rate
With skill100% (3/3)
Without skill46% (~1.4/3)
claude evals run evals/evals.json --skill ios-simulator-skill

License

MIT