Awesome GenAI Security

June 27, 2026 · View on GitHub

A curated list of links, references, books, videos, tutorials (Free or Paid), Exploit, CTFs, Hacking Practices, etc., which are related to GenAI, LLM, RAG, MCP, Agents, and Agentic AI security.

GenAI Security Papers & Standards
AI Security Books
AI Security Videos
Online Tutorials / Blogs / Presentations
Online Courses (Paid/Free)
AI Security Certifications
Tools of Trade
Security Practices and CTFs
AI Red Teaming
GenAI Security Attacks, Breaches & Incidents
Regulatory Frameworks & Governance
Newsletters & Communities
Contributors

GenAI Security Papers & Standards

Important papers, standards, and checklists from organizations like OWASP, NIST, and others.

OWASP Top 10 for LLM Applications 2025
OWASP LLM AI Security and Governance Checklist
OWASP Agentic AI Top 10
NIST AI RMF Playbook
NIST AI Risk Management Framework (AI RMF)
NIST Adversarial Machine Learning
Microsoft Failure Models in Machine Learning
Microsoft Threat Modeling AI/ML
OWASP GenAI Security Project
MITRE ATLAS (Adversarial Threat Landscape for AI Systems)
Google Secure AI Framework (SAIF)
Anthropic Responsible Scaling Policy
ENISA Multilayer Framework for Good Cybersecurity Practices for AI
Databricks AI Security Framework (DASF) 2.0 - Practical controls mapped to AI system components and risks.
OWASP Securing Agentic Applications Guide 1.0 - Reference architecture and controls for building secure agentic apps.
CSA MAESTRO - Agentic AI Threat Modeling Framework - Seven-layer threat modeling for agentic AI systems.
Microsoft: Zero Trust for AI (Tools & Guidance) - Applying Zero Trust principles to AI agents and workloads.

AI Security Books

AI Security Videos

Online Tutorials / Blogs / Presentations

Articles and guides covering LLM, RAG, and general GenAI security.

RAG Security

MCP & Agent Security

LLM Attacks

Online Courses (Paid/Free)

Stanford CS-324: Large Language Models
Princeton COS 597G: Understanding Large Language Models
Coursera: GenAI with LLM
Coursera: Generative AI Engineering with LLMs Specialization
Coursera: Generative AI for Cybersecurity Professionals (IBM)
Coursera: AI for Cybersecurity (JHU)
AttackIQ: The foundation of AI Security
Microsoft AI Red Teaming 101 (Microsoft Learn) - Free training on GenAI vulnerabilities, single/multi-turn attacks, spotlighting defenses, and PyRIT automation.
SANS SEC545: GenAI and LLM Application Security - RAG/vector-DB security, prompt injection, MLOps hardening, and agentic AI security (maps to GIAC GAIPS).

AI Security Certifications

Certified AI Security Professional (CAISP) by Practical DevSecOps – Securing AI systems, models, and pipelines against adversarial threats, LLM vulnerabilities, AI supply chain risks, data poisoning, and AI-specific security frameworks. Hands-on, practitioner-level skills.
GIAC AI Platform Security (GAIPS) - Hands-on (CyberLive) certification for auditing and securing GenAI applications, LLM pipelines, and agentic AI systems.
GIAC AI Security Automation Engineer (GASAE) - Validates using AI/automation across offensive, defensive, and cloud security operations.
ISC2 AI Security Certificate - On-demand course bundle covering AI fundamentals, AI risk management, secure-by-design, and global AI regulation.

Tools of Trade

Tools for defending, scanning, and auditing GenAI systems.

Defensive / Scanning

LLM Guard - Information extraction and security for LLMs.
Model Scan - Scanning models for serialization attacks.
Rebuff - Prompt injection detection.
NB Defense - Notebook security.
Protect AI's OSS Portfolio
LLM Guard Playground
Cisco MCP Scanner - Scans MCP servers/tools for poisoning and prompt injection (YARA + LLM-as-judge).
Snyk Agent Scan - Inventories and scans AI agents, MCP servers, and skills for 15+ risks (successor to Invariant mcp-scan).
Fickling (Trail of Bits) - Decompiler, static analyzer, and safety scanner for malicious pickle/PyTorch model files.
ModelAudit - Static scanner detecting malicious code/backdoors across 40+ ML model file formats.
AIsbom - CLI that scans model files for malware and generates CycloneDX/SPDX AI SBOMs.

Offensive / Red Teaming

AI/ML Exploits
Garak - LLM Vulnerability Scanner
PyRIT - Python Risk Identification Toolkit for GenAI (Microsoft)
Counterfit - AI Security Testing (Microsoft)
ART - Adversarial Robustness Toolbox (IBM)
promptmap - Prompt Injection Testing
DeepTeam - LLM & AI-Agent Red Teaming Framework - 50+ vulnerability types and 20+ attack methods mapped to OWASP/NIST/MITRE.
Promptfoo - LLM Testing & Red Teaming - Generates adversarial inputs to find prompt injection, jailbreaks, and data leakage, with CI/CD integration.

Guardrails & Firewalls

Guardrails AI - Input/output validation for LLMs.
NeMo Guardrails (NVIDIA) - Programmable guardrails for LLM applications.
Vigil - LLM Prompt Injection Detection
Lakera Guard - Real-time AI security for prompt injection and data leakage.
Trylon Gateway - Self-hosted open-source AI firewall/proxy applying custom guardrails (prompt-injection defense, PII redaction).
Bifrost AI Gateway - High-performance open-source AI gateway unifying 20+ LLM providers with governance and policy enforcement.

Security Practices and CTFs

Practice your skills with these vulnerable applications and challenges.

Gandalf - Lakera AI - LLM security challenge.
Prompt Airlines - AI security challenges, CTF style.
Certified AI/ML Pentester (C-AI/MLPen) Exam - The SecOps Group
Damn Vulnerable MCP Server - Deliberately vulnerable MCP implementation.
Vulnerable MCP Servers Lab - Collection of vulnerable servers.
FinBot Agentic AI CTF - Agentic Security CTF.
OWASP WrongSecrets - Includes an LLM/AI secrets-leakage challenge.
Huntr.com - World’s first bug bounty platform for AI/ML.
HackAPrompt - Prompt hacking competition.
Crucible by Dreadnode - AI/ML security challenges and CTFs.
AI Goat - Vulnerable LLM CTF built on AWS.
Microsoft AI Red Teaming Playground Labs - Hands-on red-teaming challenges (prompt injection, indirect injection, guardrail bypass) with Docker/Kubernetes deployment.
PortSwigger Web Security Academy: Web LLM Attacks - Free official hands-on labs on exploiting LLM APIs, excessive agency, and prompt injection.

AI Red Teaming

Resources and methodologies for red teaming AI/GenAI systems.

Microsoft AI Red Team
Anthropic Red Teaming Research
MITRE ATLAS - Tools, Data & Case Studies (GitHub)
AI Red Teaming Guide - Humane Intelligence
Google DeepMind: Evaluating Frontier Models for Dangerous Capabilities
OWASP GenAI Red Teaming Guide - Methodology for red teaming GenAI/LLM applications.
CSA Agentic AI Red Teaming Guide - Red teaming approach tailored to autonomous agents.

GenAI Security Attacks, Breaches & Incidents

Notable real-world incidents involving GenAI and LLM security.

Anthropic Disrupts First AI-Orchestrated Cyber Espionage Campaign (Nov 2025) - A Chinese state-sponsored group jailbroke Claude Code to autonomously execute ~80-90% of an espionage campaign against ~30 global targets; the first documented largely AI-run cyberattack.
CamoLeak: GitHub Copilot Chat Private Source-Code Exfiltration (Oct 2025) - CVSS 9.6 prompt injection hidden in PRs leaks private code and secrets via GitHub's own Camo image proxy.
Check Point Researchers Expose Critical Claude Code Flaws - CVE-2025-59536 and CVE-2026-21852: Enabling Remote Command Execution and API Key Theft.
Anthropic: "Vibe-Hacking" Extortion Using Claude Code (Aug 2025) - A criminal used Claude Code to automate intrusions and craft targeted extortion against 17+ organizations, alongside AI-built ransomware-as-a-service.
PromptLock: First Known AI-Powered Ransomware (Aug 2025, PoC) - ESET found ransomware that uses a local LLM via the Ollama API to generate malicious scripts on the fly (later assessed as an academic proof-of-concept).
Replit AI Agent Deletes a Production Database (Jul 2025) - Replit's AI coding agent wiped a live production database during a code freeze, then fabricated data and falsely claimed rollback was impossible.
Amazon Q VS Code Extension Compromised with Data-Wiping Prompt (Jul 2025) - A malicious prompt to wipe files and delete AWS resources was slipped into the official Amazon Q extension (964k+ installs).
CVE-2025-6514: Critical RCE in mcp-remote (Jul 2025) - A malicious MCP server could achieve full remote code execution (CVSS 9.6) on clients running mcp-remote; the first real-world RCE against an MCP client.
EchoLeak (CVE-2025-32711): Zero-Click Data Theft in Microsoft 365 Copilot (Jun 2025) - A single crafted email could silently exfiltrate organizational data from M365 Copilot with no user interaction; the first known zero-click exploit against an AI agent.
Policy Puppetry: Universal Jailbreak Bypassing All Major LLMs (Apr 2025) - HiddenLayer disclosed a single transferable prompt that bypasses safety guardrails across OpenAI, Google, Anthropic, Meta, DeepSeek, and others.
Singapore Finance Director Scammed via Deepfake Executive Video Call (Mar 2025) - A finance director transferred ~US$499,000 after a Zoom call in which company executives were AI-generated deepfakes.
nullifAI: Malicious ML Models on Hugging Face Evade Picklescan (Feb 2025) - ReversingLabs found malicious Pickle-based models using broken/7z-wrapped pickles to bypass scanners and deliver a reverse shell (PoC).
Anthropic: Chinese AI Firms Created 24,000 Fraudulent Accounts For 'Distillation Attacks'
DeepSeek Exposed Database Leaking Chat History and API Keys (Jan 2025) - Wiz found a publicly accessible, unauthenticated DeepSeek ClickHouse database exposing 1M+ log lines including plaintext chats and secrets.
LiteLLM on PyPI Was Compromised, What the Attack Changed and What Defenders Should Do Now
The Day Chevrolet's AI Chatbot Tried to Sell a $70,000 SUV for $1
Air Canada Chatbot Provides Wrong Info (2024) - Airline held liable for chatbot hallucinating refund policy.
Samsung bans use of generative AI tools like ChatGPT after April internal data leak (2023)
AI-powered Bing Chat spills its secrets via prompt injection attack (2023)
ChatGPT Data Leak Bug (2023) - Bug exposed chat history titles and payment info of other users.
GitHub Copilot Leaking Secrets (2023) - AI code assistant reproducing secrets from training data.
Microsoft Tay Bot Manipulation (2016) - Twitter chatbot manipulated into generating offensive content.

Regulatory Frameworks & Governance

EU AI Act - EU regulation on artificial intelligence.
NIST AI Risk Management Framework (AI RMF) - NIST's voluntary framework for managing AI risks.
ISO/IEC 42001:2023 - AI Management System Standard
U.S. AI Executive Order (2025): Removing Barriers to American Leadership in AI - The Trump administration's Jan 2025 EO; it revoked Biden's EO 14110 (2023) on Safe, Secure & Trustworthy AI.
India AI Governance Guidelines (IndiaAI / MeitY) - India's national AI strategy and governance guidance.
Singapore Model AI Governance Framework

Newsletters & Communities

OWASP GenAI Slack Channel - Join #project-top10-for-llm channel.
AI Village (DEF CON) - Community focused on AI security research.
MLSecOps Community - Community for ML security operations.
The AI Security Newsletter by Ken Huang
Protect AI Blog (now part of Palo Alto Networks)

Contributors

Sanjeev Jaiswal