SkillFortify

April 24, 2026 · View on GitHub

Supply chain security scanner for AI agent skills -- supports 22 frameworks.

SkillFortifyBench Now Available

540-skill, 3-format benchmark for evaluating AI agent skill supply chain security scanners. 270 malicious (13 attack types) + 270 benign (5 categories) across Claude, MCP, and OpenClaw formats. Deterministic generation from seed=42. 100% precision, 91.5% recall with Wilson 95% CI containing the paper point estimate. View Benchmark | Standalone Repo


PyPI version Tests License: Elastic-2.0 Python 3.11+

Website | PyPI | Paper (arXiv) | Wiki | @varunPbhardwaj


One Command. Every Framework.

pip install skillfortify
skillfortify scan                # Auto-discovers all AI tools on your system
skillfortify scan ./my-project   # Scan a specific project
skillfortify dashboard           # Generate HTML security report

SkillFortify formally analyzes agent skill safety using sound static analysis. If SkillFortify reports no violations, the capability bounds in the formal model are assured. Unlike heuristic scanners where absence of findings does not mean absence of risk, SkillFortify provides mathematically grounded security guarantees.


Supported Frameworks (22)

#FrameworkDetection
1Claude Code Skills.claude/ directory
2MCP Serversmcp.json, mcp_config.json, deep server scan
3OpenClaw Skills.claw/ directory
4LangChain Toolslangchain imports, BaseTool, @tool
5CrewAI Toolscrew.yaml, crewai imports
6AutoGen Toolsautogen imports, register_for_llm
7OpenAI Agents SDKopenai-agents configurations
8Google ADKgoogle-adk configurations
9DifyDify workflow and plugin definitions
10ComposioComposio tool integrations
11Semantic KernelMicrosoft Semantic Kernel plugins
12LlamaIndexLlamaIndex tool abstractions
13n8nn8n workflow node definitions
14FlowiseFlowise chatflow configurations
15MastraMastra agent tool definitions
16PydanticAIPydanticAI tool decorators
17AgnoAgno agent configurations
18CAMEL-AICAMEL-AI tool integrations
19MetaGPTMetaGPT action and tool definitions
20HaystackHaystack component definitions
21Anthropic Agent SDKAnthropic agent tool configurations
22Custom SkillsUser-defined skill manifests (YAML/JSON)

All frameworks are parsed into a unified representation for consistent analysis, trust scoring, and SBOM generation.


Quick Start

Install

pip install skillfortify                 # Core scanner
pip install skillfortify[registry]       # + marketplace scanning
pip install skillfortify[all]            # Everything

System-Wide Scan

Run skillfortify scan with no arguments to automatically discover every AI agent tool installed on your system -- Claude Code, Cursor, VS Code extensions, Windsurf, and more:

skillfortify scan
Discovering AI tools on this system...
  Found: Claude Code skills       (12 skills in ~/.claude/skills/)
  Found: MCP servers              (8 servers in ~/.cursor/mcp.json)
  Found: VS Code MCP configs      (3 servers in ~/.vscode/mcp.json)
  Found: Windsurf MCP configs     (2 servers)

Scanning 25 skills across 4 locations...

+----------------------+--------+-----------+----------+--------------+
|       Skill          | Source |  Status   | Findings | Max Severity |
+----------------------+--------+-----------+----------+--------------+
| deploy-automation    | Claude |   SAFE    |        0 | -            |
| data-export          | Claude |  UNSAFE   |        2 | HIGH         |
| postgres-server      | MCP    |   SAFE    |        0 | -            |
| file-manager         | MCP    |  WARNING  |        1 | MEDIUM       |
+----------------------+--------+-----------+----------+--------------+
25 skills scanned | 22 safe | 2 unsafe | 1 warning | 5 total findings

Project Scan

skillfortify scan ./my-agent-project
skillfortify scan ./my-agent-project --format json
skillfortify scan ./my-agent-project --severity-threshold high

HTML Dashboard

Generate a standalone HTML security report with interactive filtering, a capabilities matrix, and severity breakdown:

skillfortify dashboard
skillfortify dashboard --output security-report.html

Open the generated file in any browser -- no server or dependencies required.


Features

  • Formal threat model (DY-Skill) -- mathematically grounded attack taxonomy for the agent skill supply chain
  • Sound static analysis -- formal capability verification, not heuristic pattern matching
  • Capability-based access control -- POLA compliance checks for every skill
  • Agent Dependency Graph -- constraint-based resolution with conflict detection
  • Lockfile generation -- deterministic skill-lock.json for reproducible agent configurations
  • Trust score algebra -- multi-signal trust with propagation through dependency chains
  • ASBOM generation -- CycloneDX 1.6 Agent Skill Bill of Materials for compliance reporting
  • Registry scanning -- scan MCP registries, PyPI, and npm for known vulnerabilities
  • HTML dashboard -- standalone interactive security report
  • System auto-discovery -- finds every AI tool on your machine automatically
  • 22 framework support -- broadest coverage of any agent security scanner

CLI Commands

CommandDescription
skillfortify scan [path]Discover and analyze skills. No path = system-wide scan
skillfortify verify <skill>Deep formal verification of a single skill file
skillfortify lock <path>Generate deterministic skill-lock.json lockfile
skillfortify trust <skill>Compute multi-signal trust score with graduated levels
skillfortify sbom <path>Generate CycloneDX 1.6 ASBOM for compliance
skillfortify frameworksList all 22 supported frameworks and detection methods
skillfortify dashboardGenerate standalone HTML security report
skillfortify registry-scan <source>Scan MCP, PyPI, or npm registries for threats

Exit Codes

CodeMeaning
0All checks passed
1Security findings detected
2No skills found or parse error

Benchmark Results

Evaluated on SkillFortifyBench -- 540 agent skills (clean and malicious samples from documented real-world incidents):

MetricValue
Precision100% (zero false positives)
Recall94.12%
F1 Score96.95%
Average scan time2.55 ms per skill

Trust Levels

Graduated trust levels inspired by the SLSA framework:

LevelThresholdMeaning
FORMALLY_VERIFIED>= 0.75Highest assurance. Formal analysis passed, strong provenance
COMMUNITY_VERIFIED>= 0.50Community reviewed, usage history, behavioral checks passed
SIGNED>= 0.25Basic provenance. Author signed, limited verification
UNSIGNED< 0.25No verification. Treat with extreme caution

CI/CD Integration

GitHub Actions

name: Skill Security Scan
on: [push, pull_request]

jobs:
  skillfortify-scan:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - run: pip install skillfortify
      - run: skillfortify scan . --format json
      - run: skillfortify lock . --output /tmp/fresh-lock.json

Requirements

  • Python 3.11 or later
  • No external services required -- runs entirely offline
  • Works on Linux, macOS, and Windows

Academic Paper

"Formal Analysis and Supply Chain Security for Agentic AI Skills"

Backed by peer-reviewed research with five formal theorems and full proofs, formalizing the agent skill supply chain threat model, capability verification, trust algebra, and dependency resolution.

Read the paper on arXiv | Zenodo | DOI: 10.5281/zenodo.18787663


Contributing

Contributions welcome. See CONTRIBUTING.md for setup instructions, coding standards, and submission guidelines.


Author

Varun Pratap Bhardwaj -- Solution Architect with 15+ years in enterprise technology. Dual qualifications in technology and law (LL.B.), with a focus on formal methods for AI safety.


License

Elastic License 2.0. See LICENSE.

Copyright (c) 2026 Varun Pratap Bhardwaj / Qualixar.


Citation

@article{bhardwaj2026skillfortify,
  author    = {Bhardwaj, Varun Pratap},
  title     = {Formal Analysis and Supply Chain Security for Agentic AI Skills},
  journal   = {arXiv preprint arXiv:2603.00195},
  year      = {2026},
  doi       = {10.5281/zenodo.18787663},
  url       = {https://arxiv.org/abs/2603.00195}
}

⭐ Support This Project

If this project solves a real problem for you, please star the repo — it helps other developers discover Qualixar and signals that the AI agent reliability community is growing. Every star matters.

Star History Chart


Part of the Qualixar AI Agent Reliability Platform

Qualixar is building the open-source infrastructure for AI agent reliability engineering. Seven products, seven peer-reviewed papers, one coherent platform. Each tool solves one reliability pillar:

ProductPurposeInstallPaper
SuperLocalMemoryPersistent memory + learning for AI agentsnpx superlocalmemoryarXiv:2604.04514
Qualixar OSUniversal agent runtime (13 execution topologies)npx qualixar-osarXiv:2604.06392
SLM MeshP2P coordination across AI agent sessionsnpm i slm-mesh
SLM MCP HubFederate 430+ MCP tools through one gatewaypip install slm-mcp-hub
AgentAssayToken-efficient AI agent testingpip install agentassayarXiv:2603.02601
AgentAssertBehavioral contracts + drift detectionpip install agentassert-abcarXiv:2602.22302
SkillFortifyFormal verification for AI agent skillspip install skillfortifyarXiv:2603.00195

Zero cloud dependency. Local-first. EU AI Act compliant.

Start here → qualixar.com · All papers on Qualixar HuggingFace