Show and Tell: OpenAgentAudit — audit evidence layer for AI agents (spec + profiles now open) #4
telleroutlook
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Show and Tell: OpenAgentAudit — audit evidence layer for AI agents (spec + profiles now open)
Hey everyone,
I've been working on a recurring problem in production AI agent deployments: the gap between what LLM observability gives you and what an actual audit requires.
The short version: agent logs are not audit evidence. They're built to answer engineering questions (latency, cost, prompt version). Audit requires different things — tamper-evident records, signed policy decisions captured before tool execution, rubric-referenced severity, reproducible findings with stable IDs.
I just open-sourced the specification and tooling that fills this gap:
→ WasmAgent/open-agent-audit
What's in the repo
spec/versions/v0.1/SPEC.md) — hash-chained, versioned, adapter-agnosticschemas/v0.1/Why the "audit ≠ observability" framing
Four concrete gaps I keep running into:
+10ppclaim collapse to-1ppafter fixing protocol consistency and running McNemar. The eval dashboard showed a pretty bar chart.Status
The spec is in release-candidate shape. The TypeScript implementation packages are deliberate skeletons — code lands after the canonical model has held shape for a release-candidate window.
The production deployment is at trustavo.com — running on Cloudflare Workers.
Questions for this community
I'm most interested in hearing from people who have been on the receiving end of an AI deployment audit:
Open an issue, comment here, or email me. Star the repo if you want to follow along — it's also useful signal for whether to keep building.
Beta Was this translation helpful? Give feedback.
All reactions