Model Arena

Open-source adversarial robustness benchmarking for every AI model.

Submit your model. Survive FGSM, PGD, prompt injection, jailbreaks, and 15+ more industry-standard attacks. Climb the public leaderboard. Contribute new attacks via GitHub PR.

Live Site · Leaderboard · Attacks · Contributing

What is Model Arena?

Model Arena is a community-driven benchmarking platform where:

Developers submit AI models (image classifiers, LLMs, object detectors, tabular/audio models) to be evaluated against adversarial attacks
Researchers contribute novel attacks via GitHub PRs — once merged, they run against every model on the platform
Everyone can see the public leaderboard and full evaluation results

The platform includes an AutoResearch agent that continuously scans ArXiv, uses Claude to synthesize novel attacks from recent papers, and auto-submits promising ones for maintainer review.

Attack library

Category	Weight	Attacks
Gradient-Based	30%	FGSM, PGD, AutoAttack (APGD-CE+DLR+FAB+Square), C&W L2, DeepFool
Black-Box	25%	Square Attack, HopSkipJump, Adversarial Patch
Prompt Injection	25%	Direct injection, GCG, Cipher Attacks (8 variants)
Jailbreak	20%	Jailbreak Templates, PAIR, TAP, AutoDAN, Many-Shot
Multimodal	—	Typographic Attack
Tabular	—	Tabular PGD
Audio	—	Universal Audio Perturbation

Community attacks are added via Pull Request (see Contributing).

Stack

Layer	Technology
Backend	FastAPI · SQLAlchemy (async) · PostgreSQL · Celery · Redis
Frontend	Next.js 14 · TailwindCSS · TanStack Query · Recharts · WebSockets
AI	Anthropic Claude (PAIR attacker, TAP evaluator, AutoResearch agent)
Infra	Docker Compose (Postgres + Redis)

Getting started (local development)

Prerequisites

Python 3.11+
Node.js 18+
PostgreSQL 14+
Redis 7+

Backend

cd backend

# Create virtual environment
python -m venv .venv && source .venv/bin/activate

# Install dependencies
pip install -r requirements.txt
pip install "pydantic[email]"

# Configure environment
cp .env.example .env
# Edit .env — add ANTHROPIC_API_KEY for LLM attacks + AutoResearch

# Create database
psql -c "CREATE USER arena WITH PASSWORD 'arena';"
psql -c "CREATE DATABASE model_arena OWNER arena;"

# Run migrations
alembic upgrade head

# Seed built-in attacks
python scripts/seed.py

# Start API server
uvicorn app.main:app --reload --port 8000

# Start Celery worker (new terminal)
celery -A app.workers.celery_app worker --loglevel=info

Frontend

cd frontend
npm install --legacy-peer-deps
npm run dev

Open http://localhost:3000.

Environment variables

Variable	Required	Description
`DATABASE_URL`	Yes	PostgreSQL async URL
`REDIS_URL`	Yes	Redis URL
`SECRET_KEY`	Yes	JWT signing key
`ANTHROPIC_API_KEY`	Optional	Enables PAIR, TAP, AutoResearch

Contributing

Model Arena is open source and welcomes contributions of all kinds.

Contributing a new attack (most common)

See CONTRIBUTING.md for the full guide. The short version:

Fork this repo
Copy community_attacks/_template.py → community_attacks/your_attack.py
Implement the BaseAttack interface
Open a Pull Request — CI validates automatically

Other contributions

Bug reports → GitHub Issues
Feature requests → GitHub Discussions
Frontend / backend improvements → PR against main

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.github		.github
backend		backend
community_attacks		community_attacks
frontend		frontend
nginx		nginx
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
docker-compose.prod.yml		docker-compose.prod.yml
docker-compose.yml		docker-compose.yml
setup-prod.sh		setup-prod.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Model Arena

What is Model Arena?

Attack library

Stack

Getting started (local development)

Prerequisites

Backend

Frontend

Environment variables

Contributing

Contributing a new attack (most common)

Other contributions

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Model Arena

What is Model Arena?

Attack library

Stack

Getting started (local development)

Prerequisites

Backend

Frontend

Environment variables

Contributing

Contributing a new attack (most common)

Other contributions

License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages