Toolguard test cases are flaky across test runs. See https://github.com/AgentToolkit/agent-lifecycle-toolkit/actions/runs/19456997883/job/55672760165, https://github.com/AgentToolkit/agent-lifecycle-toolkit/actions/runs/19456997883/job/55672760181 and https://github.com/AgentToolkit/agent-lifecycle-toolkit/actions/runs/19456997883/job/55672760169 for more details.