Skill Scanner Architecture

February 26, 2026 · View on GitHub

Tip

What you'll find here

This section covers the internals of Skill Scanner: how scans execute, what each analyzer detects, how findings map to threat frameworks, and how to extend the system. If you're looking to use the scanner, start with the User Guide instead.

Skill Scanner is a modular security scanner for agent skill packages. It combines deterministic scanning with optional LLM-assisted analysis through a central orchestrator and pluggable analyzers.

Explore by Topic

Scanning Pipeline -- How scans execute end-to-end: two-phase analysis, post-processing, and output.
Analyzers -- The 10 detection engines — capability matrix, deep dives, and selection guide.
Threat Model -- Cisco AI Security Framework taxonomy, binary handling, and risk classification.
Extending -- Write custom YAML signatures, YARA rules, and Python checks. Add new analyzers.

High-Level Layout

graph TB
    subgraph Entry[Entry Points]
        CLI["CLI<br>skill-scanner"]
        API["FastAPI<br>skill-scanner-api"]
        HOOK[Pre-commit Hook]
    end

    subgraph Core[Core Engine]
        FACTORY["Analyzer Factory<br>build_core_analyzers / build_analyzers"]
        SCANNER["SkillScanner<br>core/scanner.py"]
        LOADER["SkillLoader<br>core/loader.py"]
        EXTRACT["ContentExtractor<br>archive/document extraction"]
        POLICY["ScanPolicy<br>core/scan_policy.py"]
    end

    subgraph Analyzers[Analyzers]
        STATIC[Static Analyzer]
        BYTECODE[Bytecode Analyzer]
        PIPELINE[Pipeline Analyzer]
        BEHAVIORAL[Behavioral Analyzer]
        LLM[LLM Analyzer]
        META[Meta Analyzer]
        VT[VirusTotal Analyzer]
        AID[AI Defense Analyzer]
        TRIGGER[Trigger Analyzer]
        CROSS[Cross-Skill Scanner]
    end

    subgraph Data[Data]
        PACKS["data/packs/core<br>signatures + yara + python checks"]
        PROMPTS["data/prompts<br>LLM schemas/prompts"]
        TAXONOMY["Threat mapping<br>AITech taxonomy"]
        PRESETS["data/*_policy.yaml<br>strict/balanced/permissive"]
    end

    subgraph Output[Output]
        SUMMARY[Summary]
        JSON[JSON]
        MARKDOWN[Markdown]
        TABLE[Table]
        SARIF[SARIF]
        HTML[HTML]
    end

    CLI --> FACTORY
    API --> FACTORY
    HOOK --> FACTORY
    FACTORY --> SCANNER
    POLICY --> FACTORY
    SCANNER --> LOADER
    SCANNER --> EXTRACT
    SCANNER --> Analyzers
    STATIC --> PACKS
    LLM --> PROMPTS
    Analyzers --> TAXONOMY
    SCANNER --> Output
    PRESETS --> POLICY

Core Components

Data Models

Source: skill_scanner/core/models.py

Primary data structures:

SkillManifest, SkillFile, Skill
Finding, ScanResult, Report
enums: Severity, ThreatCategory

ScanResult and Report expose computed safety/severity summaries and JSON-serializable output helpers.

Loader

Source: skill_scanner/core/loader.py

SkillLoader is responsible for:

Validating skill directory and SKILL.md
Parsing frontmatter (name, description, optional metadata)
Discovering files recursively (excluding only .git internals)
Basic file-type classification (python/bash/markdown/binary/other)
Extracting referenced file hints from instruction content

Analyzer Construction

Source: skill_scanner/core/analyzer_factory.py

This is the single source of truth for analyzer assembly:

build_core_analyzers(policy, ...)
- static, bytecode, pipeline (gated by policy.analyzers.*)
build_analyzers(policy, ..., use_behavioral/use_llm/...)
- adds optional analyzers based on flags/params

CLI, API, pre-commit hook, eval runners, and fallback scanner paths all rely on this factory for parity.

Scanner Orchestrator

Source: skill_scanner/core/scanner.py

SkillScanner runs a two-phase scan pipeline for each skill:

Load + preprocess -- load skill, extract archives/embedded content via ContentExtractor
Phase 1 (non-LLM) -- all non-LLM analyzers (static, bytecode, pipeline, behavioral, VirusTotal, AI Defense, trigger)
Phase 2 (LLM/meta) -- LLM and meta analyzers receive enrichment context built from Phase 1 findings
Post-processing -- disabled rules enforcement, severity overrides, analyzability scoring, finding normalization/dedup, co-occurrence metadata, policy fingerprint attachment
Cleanup -- temporary extraction artifacts are removed; ScanResult is returned

For directory scans, scan_directory(...) iterates skill packages and optionally adds:

description overlap findings (TRIGGER_OVERLAP_*)
cross-skill findings via CrossSkillScanner

See Scanning Pipeline for the full stage-by-stage breakdown.

Analyzability Scoring

Source: skill_scanner/core/analyzability.py

compute_analyzability() evaluates how much of a skill's content the scanner could actually inspect. This implements a fail-closed posture: content the scanner cannot analyze is flagged rather than trusted.

Per-file UNANALYZABLE_BINARY findings for opaque non-inert binaries
Aggregate LOW_ANALYZABILITY findings when the overall score drops below policy thresholds
Score and details are included in ScanResult metadata

Supporting Modules

Module	Purpose
`command_safety.py`	Command safety tier classification for pipeline analysis
`file_magic.py`	File magic number detection and extension/content mismatch checking
`rule_registry.py`	Central registry for rule metadata and validation
`strict_structure.py`	Skill package structure validation

Analyzer Inventory

Core (policy-driven)

static_analyzer: YAML signatures + YARA + inventory checks
bytecode_analyzer: Python bytecode/source consistency checks
pipeline_analyzer: shell pipeline taint analysis and command-risk checks

Optional (flag/API-driven)

behavioral_analyzer: static dataflow + cross-file/script correlation
llm_analyzer: semantic threat analysis with structured schema output
meta_analyzer: second-pass LLM validation/filtering (requires prior findings)
virustotal_analyzer: binary hash lookup (+ optional upload)
aidefense_analyzer: Cisco AI Defense cloud inspection
trigger_analyzer: overly broad trigger/description checks

Policy System

ScanPolicy (skill_scanner/core/scan_policy.py) centralizes:

file limits and thresholds
rule scoping and docs-path behavior
command safety tiers
hidden file allowlists
severity overrides and disabled rules
output dedupe and metadata behavior
core analyzer toggles

Built-in presets:

strict
balanced (default)
permissive

Entry Points

CLI

Source: skill_scanner/cli/cli.py

Commands:

scan
scan-all
list-analyzers
validate-rules
generate-policy
configure-policy
interactive

Output formats: summary, json, markdown, table, sarif, html.

API

Source: skill_scanner/api/router.py

Current endpoints:

GET /
GET /health
POST /scan
POST /scan-upload
POST /scan-batch
GET /scan-batch/{scan_id}
GET /analyzers

Add analyzer class inheriting BaseAnalyzer
Register construction path in analyzer_factory.py
Add policy knobs (if needed) in scan_policy.py
Add/adjust tests under tests/
Document CLI/API toggles in docs

For rule-based detection updates, prefer extending skill_scanner/data/packs/core/ (signatures/YARA/python checks) before adding analyzer-level bespoke logic.