Pull requests: vercel-labs/agent-eval

Pull requests list

Skip missing validation scripts
#92 opened Mar 17, 2026 by gaojude
[wip] add bub agent support
#91 opened Mar 7, 2026 by CorrectRoadH Draft
Add timings for phases
#88 opened Feb 25, 2026 by jeffsee55
Add ability to choose which eval --smoke runs
#84 opened Feb 20, 2026 by jeffsee55