#

adversarial-prompts

Here are 6 public repositories matching this topic...

nuclide-research / VisorCorpus

Adversarial prompt corpus toolkit for LLM/RAG safety: 10 attack categories, 8 Forge mutators, 6 result statuses, CI-ready JSON output.

Updated Jun 8, 2026
Go

nuclide-research / VisorAgent

Go injection benchmark: delivers adversarial prompts through real tool-use paths and scores HIT/MISS per detection signal. Controlled targets only.

Updated Jun 8, 2026
Go

koswadi / prompt-injection-detection-dataset

A structured NLP dataset for detecting prompt injection attacks, jailbreak attempts, and malicious instruction manipulation in Large Language Models (LLMs). Includes annotated threat categories, risk classifications, and validation-ready samples for AI safety training, security evaluation, and adversarial robustness research.

Updated May 21, 2026

jrajath94 / adversarial-prompt-suite

Systematic red-teaming framework for adversarial prompt evaluation — jailbreak detection, injection classification, attack surface coverage metrics

python machine-learning ai-safety red-teaming prompt-injection llm-safety adversarial-prompts

Updated May 22, 2026
Python

mintmas / triple-arbiter

x402 settlement facilitator + EAS-compatible threat-intel attestation issuer on Base mainnet

Updated Apr 22, 2026

nuclide-research / VisorPlus

Go CLI for AI/LLM infrastructure assessment: hunt, fingerprint, enumerate, passive-recon, and adversarial-corpus generation in one binary.

Updated Jun 8, 2026
Go

Improve this page

Add a description, image, and links to the adversarial-prompts topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the adversarial-prompts topic, visit your repo's landing page and select "manage topics."