Skip to content

Add multi-model eval support and results for Sonnet 4.5, GPT-5.2, and…

21b605a
Select commit
Loading
Failed to load commit list.
Open

Improve evaluation harness: concurrent execution, robust error handling, and CLI model configuration #22

Add multi-model eval support and results for Sonnet 4.5, GPT-5.2, and…
21b605a
Select commit
Loading
Failed to load commit list.
Microsoft GitHub Policy Service / license/cla Started 2026-02-08 20:57:38 ago

Contributor License Agreement is not agreed yet.

This check verifies that the author has agreed to a CLA with Microsoft.