Skip to content

TAS-182: Article demo and benchmark equivalence gate #305

@AtlantisPleb

Description

@AtlantisPleb

Summary

  • combine Hungarian and Sudoku closure into one gate

Roadmap Position

  • Phase: Phase 6. Close demos and benchmark claims
  • Sequence: article-gap closure after TAS-156A
  • Owned-route boundary: any canonical article route referenced here means the psionic-transformer-based route established by TAS-156A / TAS-160, not a reintroduced mixed implementation spread across model or runtime crates.
  • Claim posture: necessary for article-equivalent closure, but not sufficient by itself to widen public capability claims

Description

  • publish one checker that turns green only when:
    • Hungarian demo parity is green
    • Arto parity is green
    • benchmark-wide Sudoku parity is green

Why This Matters

  • the final article audit needs one unified demo/benchmark surface

Primary Surfaces

  • psionic-transformer (canonical owned article-route base for this issue)
  • psionic-eval
  • psionic-provider

Validation

  • checker tests
  • negative tests on any missing demo or benchmark row

Claim Discipline

  • Keep machine-legible boundaries explicit; partial progress must not be presented as full article equivalence.
  • Preserve typed refusal, fallback, blocked, and suppressed posture where closure is still incomplete.
  • Keep public language bounded to the declared article envelope rather than implying generic arbitrary-program closure from a partial green result.

Done When

  • one article demo/benchmark gate exists and is green

Metadata

Metadata

Assignees

No one assigned

    Labels

    tassadarPsionic Tassadar roadmap work

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions