Benchmarking schema-valid false tool observations and defense baselines for tool-using LLM agents.
benchmark mcp ai-safety tool-use prompt-injection llm-agents agent-security rag-security agentdojo toolsandbox tool-output-spoofing
-
Updated
Jun 8, 2026 - Python