Skip to content

distill-v2: sample from staged (private) task set instead of public completed set#532

Open
catoneone wants to merge 1 commit into
mainfrom
distill-v2/private-release-set
Open

distill-v2: sample from staged (private) task set instead of public completed set#532
catoneone wants to merge 1 commit into
mainfrom
distill-v2/private-release-set

Conversation

@catoneone

Copy link
Copy Markdown
Collaborator

Why

Same rationale as the SWE change (#529). DISTILL-V2 sampled from the most recently completed (public) tasks, which gives miners a window to overfit to the freshly exposed set within a day and win on that basis rather than on genuine capability.

Switch the DISTILL-V2 dataset range source from the public tasks.completed_up_to to the staged tasks.staged_up_to (private) release set. Tasks are promoted to the public set after a one-day delay, preserving transparency on the usual cadence while removing the overfit window.

Change

  • affine/database/system_config.json: DISTILL-V2 dataset_range_source.field tasks.completed_up_totasks.staged_up_to

…ompleted set

Same rationale as SWE: sampling from the most recently completed (public)
tasks lets miners overfit to the exposed set within a day. Switch the
DISTILL-V2 dataset range source to the staged (private) release set; tasks are
promoted to the public set after a one-day delay.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant