Skip missing validation scripts #92

Open
gaojude wants to merge 1 commit into main from jude/skip-missing-scripts-codex-provider

gaojude (Collaborator) commented Mar 17, 2026

When the experiment config specifies scripts: ["build"], runValidation in shared.ts unconditionally runs npm run build in the sandbox. If the fixture's package.json doesn't define that script, or if there is no package.json at all (as in evals where the agent scaffolds the project from scratch), the command fails and is counted as a validation failure even though the missing script isn't the agent's fault.

getAvailableScripts() now reads package.json from the sandbox and returns the set of defined script names. If the file is missing or invalid, it returns an empty set. runValidation checks this before running each script and skips any that aren't defined.
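The behavior described above can be sketched roughly as follows. This is not the code from the PR: parseAvailableScripts and partitionScripts are hypothetical names, and the package.json contents are passed in as a string (or null when the file is missing) because the sandbox's file-reading API is not shown here.

```typescript
// Hypothetical stand-in for getAvailableScripts(): takes the raw package.json
// text (null when the file does not exist) instead of reading from the sandbox.
function parseAvailableScripts(packageJsonText: string | null): Set<string> {
  if (packageJsonText === null) return new Set();
  try {
    const parsed: unknown = JSON.parse(packageJsonText);
    if (typeof parsed !== "object" || parsed === null) return new Set();
    const scripts = (parsed as { scripts?: unknown }).scripts;
    if (typeof scripts !== "object" || scripts === null || Array.isArray(scripts)) {
      return new Set();
    }
    return new Set(Object.keys(scripts));
  } catch {
    // Invalid JSON is treated the same as a missing file: no scripts available.
    return new Set();
  }
}

// Hypothetical helper: splits the configured script names into those that are
// defined (and should be run) and those that should be skipped.
function partitionScripts(
  configured: string[],
  available: Set<string>,
): { toRun: string[]; skipped: string[] } {
  const toRun: string[] = [];
  const skipped: string[] = [];
  for (const name of configured) {
    (available.has(name) ? toRun : skipped).push(name);
  }
  return { toRun, skipped };
}

// Example: "build" is configured but only "test" is defined.
const exampleAvailable = parseAvailableScripts('{"scripts": {"test": "vitest"}}');
const exampleResult = partitionScripts(["build", "test"], exampleAvailable);
// exampleResult is { toRun: ["test"], skipped: ["build"] }
```

Under these assumptions, a skipped script is simply never executed, so it cannot be recorded as a validation failure.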

Skip configured validation scripts when not defined in the sandbox's package.json.
gaojude changed the title from "Skip missing validation scripts and use built-in Codex OpenAI provider" to "Skip missing validation scripts" on Mar 19, 2026
gaojude force-pushed the jude/skip-missing-scripts-codex-provider branch from 159cf1a to 4ed561a on March 19, 2026 14:54