feat(field-observation): ingest the 42 + 38 attributed @nvst18 lots #20
Open
waitdeadai wants to merge 2 commits into
… (held pending sign-off)

Stages field_observation_intake/ (registry + record schema + README) so the 42 dispatch-fabrication and 38 hollow-code contributions from @nvst18 ingest mechanically once contributor sign-off clears. No payload records: both lots are release_eligibility=blocked_pending_contributor_signoff. Additive and self-contained; touches no existing lane, schema, script, or fixture.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Both contributor gates cleared 2026-05-31 (nvst18 sign-off on ianymu/recognition-without-arrest#2): the 42 redaction approved clean, the 38 internal class/method names generalized to role-preserving surrogates.

- dispatch_fabrication_42: 42 records, verbatim from signed-off redacted candidate
- hollow_code_38: 38 records, identifiers generalized (signal preserved, zero original proprietary token survives), prompt_hash null
- derived_fixture_manifest.jsonl: 80 rows; label_final=null pending adjudication
- registry/README/LANE_SCHEMA updated; lane registered in DATASET_CARD + CLAIM_LEDGER

Supersedes the #19 scaffold.

Validation: validate_corpus clean, secret-scan exit 0, pytest 20 passed, reject-scan clean on both sub-batches.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Ingest the two attributed @nvst18 field-observation lots (42 + 38)

Supersedes the scaffold #19. Both contributor gates cleared on 2026-05-31 (@nvst18 sign-off on ianymu/recognition-without-arrest#2: "The 42 (redaction): Approved… The 38: Generalize the internal class/method names… before ingest. Both lots good to ingest once the placeholders are in."), so this materializes the payloads the scaffold was holding for.
What lands

Two separately-tagged sub-batches in one attributed lane (distinct source_id, sub-batch dir, and manifest transform; provenance never merges):

- dispatch_fabrication_42 (sycophancy): 42 records, verbatim from the signed-off redacted candidate.
- hollow_code_38 (hollow_code): 38 records, identifiers generalized (e.g. NaomiGuardrail→ContentGuardrail, getCrisisText→getSafetyText, ScorePtgiWeekly→ScoreWeekly, ptgi_weekly→score_weekly).

Plus a derived_fixture_manifest.jsonl (80 rows, one per record, hashing each original source row), registry/README/LANE_SCHEMA updates, and lane registration in DATASET_CARD.md + CLAIM_LEDGER.md.

Generalization approach (the 38)
Type-preserving pseudonymization: ~30 distinct identifiers are mapped to neutral surrogates that keep the code syntactically valid (so the fixtures stay usable for detector training) and preserve the failure signal; e.g. the safety_prompt_bypass ->body vs getSystemPrompt() contrast survives intact.

A hard residual gate asserts that zero original proprietary tokens (Naomi, PTGI, Crisis, Therapist, Externalization, NarrativeLetter, Reflection, Disengagement, and every original class name) survive anywhere.

Two notes for your review @nvst18

- fix_description is a short code-fix note (e.g. "Captured $validated and used $validated['rating']"), not a SHA. (The 27af4ec ref lives only in the registry prose as a fix example.)
- … surfaced only items inside your sign-off: short session-id hex (e.g. a2c8f421 = "Session a2c8f421") and a generic github.com/topics link. I did not alter your signed-off content; flagging in case you'd like the session-id prefixes stripped before this merges.
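For reviewers who want to see the shape of the generalization step for the 38, here is a minimal sketch of type-preserving pseudonymization followed by a residual gate. It is not the script used in this PR. The four mapping pairs and the token list are the ones quoted in this description; the function names (generalize, residual_hits, generalize_or_fail) and the case-insensitive substring check are assumptions made for illustration, and the real map covers ~30 identifiers plus every original class name.

```python
import re

# The four pairs quoted in this PR; the full ~30-entry map is not reproduced here.
SURROGATES = {
    "NaomiGuardrail": "ContentGuardrail",
    "getCrisisText": "getSafetyText",
    "ScorePtgiWeekly": "ScoreWeekly",
    "ptgi_weekly": "score_weekly",
}

# Tokens this PR lists as ones that must not survive (the original class
# names are also gated in the PR but are not listed here).
RESIDUAL_TOKENS = [
    "Naomi", "PTGI", "Crisis", "Therapist", "Externalization",
    "NarrativeLetter", "Reflection", "Disengagement",
]


def generalize(code, mapping=SURROGATES):
    """Replace whole identifiers only, so the code stays syntactically valid."""
    # Longest names first, so a longer identifier wins over a shorter one
    # that happens to be its prefix.
    names = sorted(mapping, key=len, reverse=True)
    pattern = re.compile(
        r"(?<![A-Za-z0-9_])("
        + "|".join(re.escape(n) for n in names)
        + r")(?![A-Za-z0-9_])"
    )
    return pattern.sub(lambda m: mapping[m.group(1)], code)


def residual_hits(text, tokens=RESIDUAL_TOKENS):
    """Return every gated token that still appears anywhere (case-insensitive)."""
    lowered = text.lower()
    return [t for t in tokens if t.lower() in lowered]


def generalize_or_fail(code):
    """Hypothetical hard gate: refuse to emit a record with any residual token."""
    out = generalize(code)
    hits = residual_hits(out)
    if hits:
        raise ValueError(f"residual proprietary tokens survive: {hits}")
    return out


sample = "class NaomiGuardrail { function getCrisisText() { return $this->ptgi_weekly; } }"
print(generalize_or_fail(sample))
```

The substring check is deliberately stricter than the replacement step: an unmapped identifier such as a class that merely contains a gated token fails the gate instead of passing through silently, which forces the map to be extended rather than the record to leak.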
Status / safety
- label_final = null on every record (candidate only; not a released data/ lane until two annotation passes + adjudication).
- validate_corpus.py clean; release_workflow_safety_check.py --scan-secrets exit 0 / errors []; pytest 20 passed; reject-to-quarantine scan clean on both sub-batches.
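The manifest invariants stated in this PR (one row per record, a hash of each original source row, 42 + 38 = 80 rows, label_final null everywhere) can be checked mechanically. The sketch below is an illustration only and does not reflect what validate_corpus.py actually does. The field names record_index and source_row_sha256, the use of the sub-batch names as source_id values, and the canonical-JSON hashing are all assumptions; only source_id and label_final are names taken from this description.

```python
import hashlib
import json


def manifest_row(source_id, record_index, source_row):
    """Build one manifest row that hashes the original source row."""
    # Canonical JSON (sorted keys, fixed separators) so key order does not
    # change the digest. This canonicalization is an assumption.
    canonical = json.dumps(
        source_row, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return {
        "source_id": source_id,
        "record_index": record_index,  # hypothetical field name
        "source_row_sha256": hashlib.sha256(canonical.encode("utf-8")).hexdigest(),
        "label_final": None,  # candidate only, pending adjudication
    }


def check_manifest(rows, expected_counts):
    """Return a list of human-readable problems; an empty list means clean."""
    problems = []
    counts = {}
    for i, row in enumerate(rows):
        counts[row["source_id"]] = counts.get(row["source_id"], 0) + 1
        if row["label_final"] is not None:
            problems.append(f"row {i}: label_final is set before adjudication")
    for source_id, expected in expected_counts.items():
        actual = counts.get(source_id, 0)
        if actual != expected:
            problems.append(f"{source_id}: expected {expected} rows, found {actual}")
    for source_id in sorted(set(counts) - set(expected_counts)):
        problems.append(f"unexpected source_id: {source_id}")
    return problems


# Placeholder rows standing in for the real records.
rows = [manifest_row("dispatch_fabrication_42", i, {"text": f"row {i}"}) for i in range(42)]
rows += [manifest_row("hollow_code_38", i, {"code": f"row {i}"}) for i in range(38)]

expected = {"dispatch_fabrication_42": 42, "hollow_code_38": 38}
print(len(rows), check_manifest(rows, expected))
```

Keeping the per-source counts separate, rather than checking only the total of 80, matches the intent that provenance for the two sub-batches never merges.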
Attribution to @nvst18 is retained per the contribution terms. Requesting
your review as the contributor before this merges.
🤖 Generated with Claude Code