๐Ÿš€ Parallel multi-platform testing

April 15, 2026 ยท View on GitHub

flutter-skill CLI demo โ€” nav, tap, type, screenshot across websites

flutter-skill

Give any AI agent eyes and hands inside any running app.
10 platforms. Zero test code. One MCP server.

Stars pub.dev npm CI License

Demo โ€ข Quick Start โ€ข AI Platforms โ€ข Platforms โ€ข vs Others โ€ข Docs

๐Ÿš€ Zero config. Zero test code. Just talk to your AI.

If this saves you time, please consider starring the repo โญ โ€” it helps others find it!


30-Second Demo

https://github.com/user-attachments/assets/d4617c73-043f-424c-9a9a-1a61d4c2d3c6

One prompt. 28 AI-driven actions. Zero test code. The AI explores a TikTok clone, navigates tabs, scrolls feeds, tests search, fills forms โ€” all autonomously.


Why This Exists

Writing E2E tests is painful. Maintaining them is worse. flutter-skill takes a different approach:

  • ๐Ÿ”Œ Connects any AI agent (Claude, Cursor, Windsurf, Copilot, OpenClaw) directly to your running app via MCP
  • ๐Ÿ‘€ The agent sees your screen โ€” taps buttons, types text, scrolls, navigates โ€” like a human tester who never sleeps
  • โœ… Zero test code โ€” no Page Objects, no XPath, no brittle selectors. Just plain English
  • โšก Zero config โ€” 2 lines of code, works on all 10 platforms
You: "Test the checkout flow with an empty cart, then add 3 items and complete purchase"

Your AI agent handles the rest โ€” screenshots, taps, text entry, assertions, navigation.
No Page Objects. No XPath. No brittle selectors. Just plain English.

Quick Start

1. Install (30 seconds)

npm install -g flutter-skill

2. Add to your AI (copy-paste into MCP config)

{
  "mcpServers": {
    "flutter-skill": {
      "command": "flutter-skill",
      "args": ["server"]
    }
  }
}

Works with Claude Desktop, Cursor, Windsurf, Copilot, Cline, OpenClaw โ€” any MCP-compatible agent.

3. Add to your app (2 lines for Flutter)

import 'package:flutter_skill/flutter_skill.dart';

void main() {
  if (kDebugMode) FlutterSkillBinding.ensureInitialized();
  runApp(MyApp());
}

4. Test โ€” just talk to your AI:

"Launch my app, explore every screen, and report any bugs"

That's it. Zero configuration. Zero test code. Works in under 60 seconds.

๐Ÿ“ฆ More install methods (Homebrew, Scoop, Docker, IDE, Agent Skill)
MethodCommand
npmnpm install -g flutter-skill
Homebrewbrew install ai-dashboad/flutter-skill/flutter-skill
Scoopscoop install flutter-skill
Dockerdocker pull ghcr.io/ai-dashboad/flutter-skill
pub.devdart pub global activate flutter_skill
VSCodeExtensions โ†’ "Flutter Skill"
JetBrainsPlugins โ†’ "Flutter Skill"
Agent Skillnpx skills add ai-dashboad/flutter-skill
Zero-configflutter-skill init (auto-detects & patches your app)

Use with AI Platforms

MCP Server Mode (IDE Integration)

Works with any MCP-compatible AI tool. One config line:

{
  "mcpServers": {
    "flutter-skill": {
      "command": "flutter-skill",
      "args": ["server"]
    }
  }
}
PlatformConfig FileStatus
Cursor.cursor/mcp.jsonโœ…
Claude Desktopclaude_desktop_config.jsonโœ…
Windsurf~/.codeium/windsurf/mcp_config.jsonโœ…
VSCode Copilot.vscode/mcp.jsonโœ…
ClineVSCode Settings โ†’ Cline โ†’ MCPโœ…
OpenClawSkill or MCP configโœ…
Continue.dev.continue/config.jsonโœ…

HTTP Serve Mode (CLI & Automation)

For standalone browser automation, CI/CD pipelines, or remote access:

# Start server
flutter-skill serve https://your-app.com

# Use CLI client commands
flutter-skill nav https://google.com
flutter-skill snap                    # Accessibility tree (99% fewer tokens)
flutter-skill screenshot /tmp/ss.jpg
flutter-skill tap "Login"
flutter-skill type "hello@example.com"
flutter-skill eval "document.title"
flutter-skill tools                   # List all available tools
CommandDescription
nav <url>Navigate to URL
snapAccessibility tree snapshot
screenshot [path]Take screenshot
tap <text|ref|x y>Tap element
type <text>Type via keyboard
key <key> [mod]Press key
eval <js>Execute JavaScript
titleGet page title
textGet visible text
hover <text>Hover element
upload <sel> <file>Upload file
toolsList tools
call <tool> [json]Call any tool

Supports --port=N, --host=H flags and FS_PORT/FS_HOST env vars.

Two Modes Compared

server (MCP stdio)serve (HTTP)
Use caseIDE / AI agent integrationCLI / automation / CI/CD
ProtocolMCP (JSON-RPC over stdio)HTTP REST
Tools253 (dynamic per page)246 (generic)
BrowserAuto-launches ChromeConnects to existing Chrome
Best forCursor, Claude, VSCodeOpenClaw, scripts, pipelines

Full CLI client reference: docs/CLI_CLIENT.md


10 Platforms, One Tool

Most testing tools work on 1-2 platforms. flutter-skill works on 10.

PlatformSDKTest Score
Flutter (iOS/Android/Web)flutter_skillโœ… 188/195
React Nativesdks/react-nativeโœ… 75/75
Electronsdks/electronโœ… 75/75
Tauri (Rust)sdks/tauriโœ… 75/75
Android (Kotlin)sdks/androidโœ… 74/75
KMP Desktopsdks/kmpโœ… 75/75
.NET MAUIsdks/dotnet-mauiโœ… 75/75
iOS (Swift/UIKit)sdks/iosโœ… 19/19
Web (any website)sdks/webโœ…
Web CDP (zero-config)No SDK neededโœ… 141/156

Total: 656/664 tests passing (98.8%) โ€” each platform tested against a complex social media app with 50+ elements.


โšก Performance

Real benchmarks from automated test runs against a complex social media app:

OperationWeb (CDP)ElectronAndroid
connect93 ms55 ms103 ms
tap1 ms1 ms2 ms
enter_text1 ms1 ms2 ms
inspect3 ms12 ms10 ms
snapshot2 ms8 ms29 ms
screenshot31 ms80 ms88 ms
eval1 msโ€”โ€”

Token efficiency: snapshot() returns a structured element tree instead of an image โ€” 87โ€“99% fewer tokens than sending screenshots to your AI agent.

How fast is that? A tap takes 1โ€“2 ms end-to-end. Browser automation tools like Playwright and Selenium typically take 50โ€“100 ms for the same operation. That's 50โ€“100ร— faster, because flutter-skill talks directly to the app runtime instead of going through WebDriver or CDP indirection.

Heavy DOM Sites (Real-World)

Tested 15 MCP tools against production websites โ€” 75/75 passed, zero timeouts:

SiteToolsTotal Timesnapshotscreenshotcount_elements
YouTube15/15 โœ…6.9s43 ms30 ms4 ms
Amazon15/15 โœ…14.2s1 ms5 ms2 ms
Reddit15/15 โœ…17.9s6 ms32 ms51 ms
Hacker News15/15 โœ…4.8s53 ms188 ms1 ms
Wikipedia15/15 โœ…7.8s15 ms336 ms1 ms

Total time includes page load. Tool execution is consistently sub-100ms even on heavy DOM sites.


Why Not Playwright / Appium / Detox?

flutter-skillPlaywright MCPAppiumDetox
MCP tools253~33โŒโŒ
Platforms101 (web)MobileReact Native
Setup time30 secMinutesHoursHours
Test code neededโŒ Noneโœ… Yesโœ… Yesโœ… Yes
AI-native (MCP)โœ…โœ…โŒโŒ
Self-healing testsโœ…โŒโŒโŒ
Monkey/fuzz testingโœ…โŒโŒโŒ
Visual regressionโœ…โŒโŒโŒ
Network mock/replayโœ…โŒโŒโŒ
API + UI testingโœ…โŒโŒโŒ
Multi-device syncโœ…โŒPartialโŒ
Accessibility auditโœ…โŒโŒโŒ
i18n testingโœ…โŒโŒโŒ
Performance monitoringโœ…โŒโŒโŒ
Natural languageโœ…โŒโŒโŒ
Flutter supportโœ… NativePartialPartialโŒ
Desktop appsโœ…โœ…โŒโŒ

| AI page understanding | โœ… AX Tree | โŒ Screenshots | โŒ | โŒ | | Boundary/security test | โœ… 13 payloads | โŒ | โŒ | โŒ | | Batch actions | โœ… 5+/call | 1/call | 1/call | 1/call |

flutter-skill is the only AI-native E2E testing tool that works across mobile, web, and desktop โ€” with 7ร— more tools than the nearest competitor.


CLI Commands

# ๐Ÿค– AI autonomous exploration โ€” finds bugs automatically
flutter-skill explore https://my-app.com --depth=3

# ๐Ÿ’ Monkey/fuzz testing โ€” random actions, crash detection
flutter-skill monkey https://my-app.com --actions=100 --seed=42

# ๐Ÿš€ Parallel multi-platform testing
flutter-skill test --url https://my-app.com --platforms web,electron,android

# ๐ŸŒ Zero-config WebMCP server โ€” any website becomes testable
flutter-skill serve https://my-app.com

๐Ÿง  AI-Native: 95% Fewer Tokens

Most AI testing tools send screenshots to the LLM โ€” each one costs ~4,000 tokens.

flutter-skill uses Chrome's Accessibility Tree to give your AI a compact semantic summary of any page:

// page_summary โ†’ ~200 tokens (vs ~4,000 for a screenshot)
{
  "title": "Shopping Cart",
  "nav": ["Home", "Products", "Cart", "Account"],
  "forms": [{"input:Coupon Code": "text"}],
  "buttons": ["Apply", "Checkout", "Continue Shopping"],
  "features": {"search": true, "pagination": true},
  "links": 47, "inputs": 3
}

Then batch multiple actions in one call:

// explore_actions โ†’ 5 actions per call (vs 5 separate tool calls)
{"actions": [
  {"type": "fill", "target": "input:Coupon Code", "value": "SAVE20"},
  {"type": "tap", "target": "button:Apply"},
  {"type": "tap", "target": "button:Checkout"},
  {"type": "fill", "target": "input:Email", "value": "test@example.com"},
  {"type": "tap", "target": "button:Continue"}
]}

Result: Your AI agent tests faster, costs less, and understands pages better than screenshot-based tools.

flutter-skillScreenshot-based tools
Tokens per page~200~4,000
Actions per call5+1
Understands semanticsโœ… roles, names, stateโŒ pixels only
Works with Shadow DOMโœ…โŒ

What It Can Do

๐Ÿ‘€ See

  • screenshot โ€” capture the screen
  • inspect_interactive โ€” all tappable/typeable elements with semantic refs
  • find_element / wait_for_element
  • get_elements โ€” full element tree

๐Ÿ‘† Interact

  • tap / long_press / swipe / drag
  • enter_text / set_text / clear_text
  • scroll โ€” all directions
  • go_back / press_key

๐Ÿ” Inspect (v0.8.0)

  • Semantic refs: button:Login, input:Email
  • Stable across UI changes
  • tap(ref: "button:Submit")
  • 7 roles: button, input, toggle, slider, select, link, item

๐Ÿš€ Control

  • launch_app โ€” launch with flavors
  • hot_reload / hot_restart
  • get_logs / get_errors
  • scan_and_connect โ€” auto-find apps
253 tools โ€” full reference

AI Explore: page_summary, explore_actions, boundary_test, explore_report

Launch & Connect: launch_app, scan_and_connect, connect_cdp, hot_reload, hot_restart, list_sessions, switch_session, close_session, disconnect, stop_app

Screen: screenshot, screenshot_region, screenshot_element, native_screenshot, inspect, inspect_interactive, snapshot, get_widget_tree, find_by_type, get_text_content, get_visible_text

Interaction: tap, double_tap, long_press, enter_text, set_text, clear_text, swipe, scroll_to, drag, go_back, press_key, type_text, hover, fill, select_option, set_checkbox, focus, blur, native_tap, native_input_text, native_swipe

Smart Testing: smart_tap, smart_enter_text, smart_assert (self-healing with fuzzy match)

Assertions: assert_text, assert_visible, assert_not_visible, assert_element_count, assert_batch, wait_for_element, wait_for_gone, wait_for_idle, wait_for_stable, wait_for_url, wait_for_text, wait_for_element_count

Visual Regression: visual_baseline_save, visual_baseline_compare, visual_baseline_update, visual_regression_report, visual_verify, visual_diff, compare_screenshot

Network Mock: mock_api, mock_clear, record_network, replay_network, intercept_requests, clear_interceptions, block_urls, http_request

API Testing: api_request, api_assert

Coverage & Reliability: coverage_start, coverage_stop, coverage_report, coverage_gaps, retry_on_fail, stability_check

Data-Driven: test_with_data, generate_test_data

Multi-Device: multi_connect, multi_action, multi_compare, multi_disconnect, parallel_snapshot, parallel_tap

Accessibility: accessibility_audit, a11y_full_audit, a11y_tab_order, a11y_color_contrast, a11y_screen_reader

i18n: set_locale, verify_translations, i18n_snapshot

Performance: perf_start, perf_stop, perf_report, get_performance, get_frame_stats, get_memory_stats

Session: save_session, restore_session, session_diff

Recording & Export: record_start, record_stop, record_export (Playwright, Cypress, XCUITest, Espresso, Detox, Maestro, +5 more), video_start, video_stop

Auth: auth_inject_session, auth_biometric, auth_otp, auth_deeplink

CDP Browser: navigate, reload, go_forward, get_title, get_page_source, eval, get_tabs, new_tab, switch_tab, close_tab, get_cookies, set_cookie, clear_cookies, get_local_storage, set_local_storage, clear_local_storage, generate_pdf, set_viewport, emulate_device, throttle_network, go_offline, set_geolocation, set_timezone, set_color_scheme

Debug: get_logs, get_errors, get_console_messages, get_network_requests, diagnose, diagnose_project, reset_app


Platform Setup

Flutter (iOS / Android / Web)
dependencies:
  flutter_skill: ^0.9.36
import 'package:flutter_skill/flutter_skill.dart';

void main() {
  if (kDebugMode) FlutterSkillBinding.ensureInitialized();
  runApp(MyApp());
}
React Native
npm install flutter-skill-react-native
import FlutterSkill from 'flutter-skill-react-native';
FlutterSkill.start();
Electron
npm install flutter-skill-electron
const { FlutterSkillBridge } = require('flutter-skill-electron');
FlutterSkillBridge.start(mainWindow);
iOS (Swift)
// Swift Package Manager: FlutterSkillSDK
import FlutterSkill
FlutterSkillBridge.shared.start()

Text("Hello").flutterSkillId("greeting")
Android (Kotlin)
implementation("com.flutterskill:flutter-skill:0.8.0")

FlutterSkillBridge.start(this)
Tauri (Rust)
[dependencies]
flutter-skill-tauri = "0.8.0"
KMP Desktop

Add Gradle dependency โ€” see sdks/kmp for details.

.NET MAUI

Add NuGet package โ€” see sdks/dotnet-maui for details.


Example Prompts

Just tell your AI what to test:

PromptWhat happens
"Test login with wrong password"Screenshots โ†’ enters creds โ†’ taps login โ†’ verifies error
"Explore every screen and report bugs"Systematically navigates all screens, tests all elements
"Fill registration with edge cases"Tests emoji ๐ŸŒ, long strings, empty fields, special chars
"Compare checkout flow on iOS and Android"Runs same test on both platforms, compares screenshots
"Take screenshots of all 5 tabs"Taps each tab, captures state

Contributing

See CONTRIBUTING.md for guidelines.

git clone https://github.com/ai-dashboad/flutter-skill
cd flutter-skill
dart pub get
dart run bin/flutter_skill.dart server  # Start MCP server

๐Ÿ“ฆ pub.dev๐Ÿงฉ VSCode
๐Ÿ“ฆ npm๐Ÿงฉ JetBrains
๐Ÿบ Homebrew๐Ÿ“– Docs
๐Ÿค– Agent Skill๐Ÿ“‹ Changelog

โญ If flutter-skill saves you time, star it so others can find it too!

MIT License ยฉ 2025