Coverage Analysis

April 26, 2026 · View on GitHub

Honest assessment of what these skills cover vs. what real practitioners need.

Headline number

~85–90% coverage of what an experienced practitioner would reach for during the OSINT/recon phase of an authorized external red-team engagement.

~35–45% coverage of what a full external red-team operator does in their job (because most red-team work is exploitation + post-exploit, intentionally out of scope).

By practitioner archetype

ArchetypeCoverage of their needsWhy
Pure OSINT analyst~90%Skills are built for this.
External attack-surface analyst (CyCognito-style)~85–90%Direct overlap with the methodology.
Bug bounty hunter~75–80%Strong on recon; thin on exploit techniques.
Threat intel investigator~70%RU/CN pivots, attribution discipline, malware basics — but no infrastructure-tracking-over-time.
External red teamer (recon phase)~85–90%The OSINT phase is well-covered.
External red teamer (full engagement)~35–45%Recon is ~30–40% of a full engagement; rest (exploitation, post-exploit, lateral, reporting) is mostly out of scope.
Internal red teamer (assumed-breach)~10%Almost entirely out of scope.
Adversary emulation / TTP-driven~25%Threat-actor section exists; specific TTP playbooks per APT don't.
Physical pentester~25%Sat imagery + LinkedIn intel cover scouting; physical execution doesn't.
Social engineer~50%Pretext development covered; payload crafting + voice tradecraft not.
Purple teamer~30%No SOC-coordination guidance.

By engagement phase

PhaseCoverage
Pre-engagement (RoE, scoping, NDAs, SOW, pricing)~10%
External OSINT / passive recon~85–90%
External active recon (light probing)~75–85%
Phishing payload crafting + delivery0% (out of scope)
Initial access (exploit execution)~5% (we identify, don't exploit)
Foothold / persistence0% (out of scope)
Privilege escalation (local + AD)0% (out of scope)
Lateral movement0% (out of scope)
C2 infrastructure0% (out of scope)
AV/EDR evasion0% (out of scope)
Domain dominance0% (out of scope)
Data exfiltration tradecraft0% (out of scope)
Cleanup / artifact removal0% (out of scope)
Reporting (technical + exec)~75%
Disclosure / vendor coordination~60%
Re-test / continuous monitoring~30%
Purple-team / SOC-coordination0%
Lessons-learned / engagement retrospective~20%

What's deliberately out of scope (and why)

  • Active exploitation, post-exploitation, malware — operational tradecraft, different domain, safety posture concerns.
  • C2 frameworks, AV/EDR evasion — operational tradecraft, large body of separate knowledge.
  • AD attacks, BloodHound, Kerberos — internal recon, not external.
  • Specific client-portal report formats — too company-specific to template usefully.
  • Pricing, NDA, SOW templates — business operations, not technical.
  • Real PII / breach corpus content — privacy + opsec.

Smoke-test results (32 prompts)

The repo ships 32 self-test prompts (tests/smoke-test-prompts.md) covering the major capability areas.

RunPASSPARTIALFAILGrade
v2.0 (initial)1922C
v2.1 (current)3110A

The single PARTIAL is Test 5 (cloud-bucket combinatorial generation) — acceptable; the inputs + technique are documented, runtime synthesis is appropriate.

Caveats

The smoke-test number (96.9% PASS) is Claude grading itself on tests Claude designed. It's a useful signal for tracking gaps but not an objective measure of real-world coverage. A real practitioner would find more gaps. Treat it as "the skills now answer the obvious questions"; non-obvious questions may need a follow-on iteration.

What experienced practitioners would say is still missing (within OSINT scope)

If a senior offensive consultant reviewed v2.1 and stayed within OSINT scope, here's what they'd flag as still missing:

  1. Specific tool-chaining recipes — "use spiderfoot → export CSV → maltego transforms → asset graph" workflows. We name tools; we don't compose them step-by-step.
  2. Recon-ng / SpiderFoot / Maltego module-by-module configuration — these are full ecosystems; we treat them as pointers.
  3. Custom Burp Suite / OWASP ZAP setup for engagements — the "configure your active proxy for an engagement" guide.
  4. OPSEC infrastructure as code — Terraform/Ansible to spin up clean engagement infrastructure (proxy stacks, redirectors).
  5. Sector-specific deep dives — §47 is a starting point, not a deep dive (real healthcare RT specialists know HL7 trafficking like a second language).
  6. Adversary-emulation playbooks per APT — "to simulate APT29's external recon, use these specific tools/techniques."
  7. Continuous-monitoring orchestration — daily diff scripts, alert pipelines, false-positive tuning.
  8. Multi-tenant engagement workflow — how an MSSP runs 30 concurrent ASM engagements without crossing wires.
  9. Client-specific report styling — every Big-4 consultancy has their own template.
  10. Tool failure recovery — when Shodan rate-limits during a critical phase, what's plan B/C/D?

These would push coverage to ~95% of OSINT-phase work. Each would add 200–500 lines and approach the limits of what a single skill can usefully encode.

Roadmap

PhaseStatusDescription
v1.0✅ DoneOriginal framework
v2.0✅ DoneExternal-red-team posture rewrite
v2.1✅ CurrentComprehensive expansion (this version)
v2.2🔜Continuous-monitoring playbook + multi-tenant workflow + Burp extension recipes
v3.0🔜Plugin manifest for one-click Claude Code install + optional MCP server companion

Bottom line

For "external OSINT for authorized red-team operations": ~85–90% coverage of what an experienced practitioner reaches for. For "everything a full red-team operator does in their job": ~35–45% — the gap is mostly intentional (out of scope).

The skills are production-ready for OSINT-phase work. They are not a replacement for a senior red teamer on a full engagement.