Awesome AI Security

A curated list of AI security resources, tools, research papers, and more.

Focused on LLM security, prompt injection, jailbreaks, AI agents, and RAG systems.

📚 Research Papers

🛠️ Tools

Offensive Tools

garak - LLM vulnerability scanner with extensive probe library
PyRIT - Microsoft's Python Risk Identification Toolkit for generative AI
Promptmap - Automatic prompt injection testing
LLM-Attacks - GCG adversarial attack implementation
TextAttack - NLP adversarial attack framework
Adversarial Robustness Toolbox - IBM's ML security library
llm-security-payloads - Curated LLM attack payload collection

Defensive Tools

NeMo Guardrails - NVIDIA's programmable guardrails for LLMs
Guardrails AI - Input/output validation for LLMs
LLM Guard - Security toolkit for LLM interactions
Rebuff - Prompt injection detection
Vigil - LLM prompt injection scanner
Lakera Guard - Commercial prompt injection protection

Scanners & Platforms

AgentAudit - Automated AI security testing platform
Protect AI - ML/AI security platform
HiddenLayer - AI security monitoring
CalypsoAI - LLM security scanning

📖 Articles & Blogs

OWASP Top 10 for LLM Applications - Essential LLM security reference
Prompt Injection: What's the worst that can happen? - Simon Willison's prompt injection overview
The AI Attack Surface Map v1.0 - Daniel Miessler's attack surface taxonomy
Securing LLM Systems Against Prompt Injection - NVIDIA's defense guide
Anthropic's Responsible Disclosure Policy - AI safety disclosure practices
Google's Secure AI Framework (SAIF) - Enterprise AI security framework
Red Teaming Language Models with Language Models - DeepMind's automated red teaming
Lessons Learned on LLM Safety - OpenAI GPT-4 system card
Embrace The Red: LLM Security - Johann Rehberger's AI security blog
Hacking Auto-GPT and LangChain - Agent exploitation walkthrough
Jailbreaking GPT-4's Code Interpreter - Code interpreter bypass
LLM Security: Prompt Injection & Data Exfiltration - Cobalt's security analysis
The Dual LLM Pattern for Building AI Assistants - Architectural defense pattern

🎓 Courses & Training

Free

NVIDIA: Securing LLM Applications - Free LLM security course
Lakera Prompt Injection Course - Free prompt injection fundamentals
HackAPrompt Competition - Learn by competing
Damn Vulnerable LLM Agent - Hands-on vulnerable agent

Paid

SANS SEC595: Applied Data Science and AI/ML for Cybersecurity - Comprehensive AI security
Offensive AI (OffSec) - Offensive security with AI focus
AI Red Team Professional - AI red teaming certification

🏆 CTF & Challenges

Gandalf by Lakera - Progressive prompt injection challenge
GPT Prompt Attack - Prompt injection CTF
Prompt Airlines - Interactive jailbreak game
HackAPrompt - Large-scale prompt injection competition
TensorTrust - PvP prompt injection game
Crucible by Dreadnode - AI security CTF platform
AI Village CTF - DEF CON AI security challenges
Prompt Injection Playground - Practice environment

📺 Videos & Talks

Conference Talks

DEF CON 31 - Compromising LLMs: The Advent of AI Malware - AI malware and exploitation
Black Hat 2023 - Hacking AI: Security Implications of ML Models - ML model security
DEF CON 31 AI Village - Indirect Prompt Injection - Kai Greshake on indirect injection
BSides SF 2024 - LLM Security Deep Dive - Latest LLM security talks

YouTube Channels & Videos

John Hammond - ChatGPT Jailbreaks - Popular jailbreak demos
LiveOverflow - Hacking AI - Technical AI exploitation
David Bombal - AI Security - AI security interviews
NVIDIA AI Enterprise - LLM Security - Enterprise LLM security

🔬 Vulnerability Databases

AI Incident Database - Real-world AI failure database
AVID (AI Vulnerability Database) - ML vulnerability taxonomy
MITRE ATLAS - Adversarial Threat Landscape for AI Systems
NIST AI Risk Management Framework - AI risk standards
CVE - AI Related - Traditional CVEs for AI systems

💼 Companies & Services

AI Security Focused

XSource_Sec - AI red teaming and AgentAudit platform
Lakera - Prompt injection protection
Protect AI - ML security platform
HiddenLayer - AI threat detection
CalypsoAI - LLM security scanning
Robust Intelligence - AI validation platform
Adversa AI - AI red teaming
Preamble - AI guardrails

Big Tech AI Safety Teams

Anthropic Safety - Constitutional AI research
OpenAI Red Teaming - GPT safety and red teaming
Google DeepMind Safety - AI safety research
Microsoft Responsible AI - Azure AI security

🐦 People to Follow

Name	Handle	Focus
Simon Willison	@simonw	Prompt injection research
Johann Rehberger	@waborel	AI red teaming
Kai Greshake	@kai_greshake	Indirect prompt injection
Daniel Miessler	@danielmiessler	AI security frameworks
Sander Schulhoff	@SSchulhworthy	HackAPrompt organizer
Rich Harang	@richharang	NVIDIA AI security
Pliny the Prompter	@elder_plinius	Jailbreak research
Jailbreak Chat	@jailbreakchat	Jailbreak aggregation

Contributing

Contributions welcome! Please read the Contributing Guide first.

Add new resources via Pull Request
Ensure links are working and relevant
Follow the existing format

License

Maintained by XSource_Sec

If you find this useful, please ⭐ star the repository!

🚀 Try AgentAudit - Automated AI Security Testing

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome AI Security

Contents

📚 Research Papers

Prompt Injection

Jailbreaking LLMs

RAG Security

Agent Security

🛠️ Tools

Offensive Tools

Defensive Tools

Scanners & Platforms

📖 Articles & Blogs

🎓 Courses & Training

Free

Paid

🏆 CTF & Challenges

📺 Videos & Talks

Conference Talks

YouTube Channels & Videos

🔬 Vulnerability Databases

💼 Companies & Services

AI Security Focused

Big Tech AI Safety Teams

🐦 People to Follow

Contributing

License

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

License

XSource-Sec/awesome-ai-security

Folders and files

Latest commit

History

Repository files navigation

Awesome AI Security

Contents

📚 Research Papers

Prompt Injection

Jailbreaking LLMs

RAG Security

Agent Security

🛠️ Tools

Offensive Tools

Defensive Tools

Scanners & Platforms

📖 Articles & Blogs

🎓 Courses & Training

Free

Paid

🏆 CTF & Challenges

📺 Videos & Talks

Conference Talks

YouTube Channels & Videos

🔬 Vulnerability Databases

💼 Companies & Services

AI Security Focused

Big Tech AI Safety Teams

🐦 People to Follow

Contributing

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Packages