Open benchmark dataset for agentic AI governance tools. 794 labelled examples of malicious and benign agent skills, MCP manifests, traces, and rule files.
benchmark mcp evaluation dataset ai-governance prompt-injection llm-security agentic-ai eu-ai-act agent-skills clawhavoc toxicskills
-
Updated
May 27, 2026 - Python