Skip to content

Latest commit

 

History

History
69 lines (50 loc) · 5.02 KB

File metadata and controls

69 lines (50 loc) · 5.02 KB

Forge Backlog

Ordered by priority. Completed items are not tracked here — see CHANGELOG.md.


Blocked on operator

  • Gittensor registration — submit docs/gittensor-registration.md config to entrius/gittensor team
  • Discord SOTA alerts — set DISCORD_WEBHOOK_URL on ventura-nanoclaw, then pm2 restart forge-api
  • Thingiverse ingestion — done (session 10, PRs #235–241). 65 catalog specs seeded (mass_grams, stiffness_to_weight, deflection_mm).
  • Hidden eval activation — run scripts/generate_hidden_specs.py (payload generated, see STATE.md), set HIDDEN_SPECS_JSON + FORGE_ADMIN_KEY on ventura-nanoclaw, then pm2 restart forge-api.

Anti-gaming hardening

  • Private held-out eval set — shipped (PR #181). 15 hidden specs seeded via HIDDEN_SPECS_JSON env var in forge-api; post-merge score.yml evaluates one hidden spec per round. Operator must run scripts/generate_hidden_specs.py and set HIDDEN_SPECS_JSON + FORGE_ADMIN_KEY on server.
  • Spec rotation schedule — shipped (PR #180, scripts/rotate_round.py; Discord embed on DISCORD_WEBHOOK_URL)

Future rounds

  • Round 004 spec set — run python3 scripts/rotate_round.py --close round_001 --new round_004 --metric mass_grams --seeds-base 1000. Waiting until rounds 1–3 are competitive.
  • Round 005 spec set — harder tier or new geometry family (T-bracket, pipe mount). Define new Tier entry in specs/generator.py first.

Hardening (low priority)

  • Randomized load-case perturbations — shipped (PR #179, seeded by spec_id in benchmark/fea.py)

Dashboard UX rehaul backlog (3-pillars: Beautiful / Seamless / Understandable instantly)

Status: in progress — see forge-dashboard PRs #85+

Done this session

  • Round page crash (hooks violation + vite chunk split) — PR #85
  • Playground redesign: Problem Explorer, auto-load spec, dynamic sample output — PR #86
  • SOTA code surfaced on problem detail — PR #86
  • [easy] stripped from h1 — PR #86
  • Rankings "0.0=best / 1.0=worst" clarity — PR #86
  • SpecDiagram arrow marker ID collision — PR #86
  • Material tooltips (density + key property) — PR #86
  • Load shows N + kg equivalent — PR #86
  • SpecDiagram build volume dims: 190×90mm build vol. label, ←78mm arm→ — PR #88
  • Rankings: scoring formula deduped, active specsactive problems, #1 agent always green — PR #88
  • WebGL crash (headless/no-GPU): StepViewerBoundary catches and falls back to SpecDiagram — PR #88
  • /guide and /playground routes now set correct document.title — PR #88
  • Playground curl command: ' entity → actual quote chars — PR #88
  • SotaChart y-axis label with metric units — PR #88
  • "Be the first to compete" CTA now context-aware when agents exist — PR #88

P1 — remaining high-impact

  • Terminology "spec" still used in Guide — Guide now has a Terminology callout (lines 188–199 of QuickstartGuide.tsx) defining Problem vs Spec vs Round vs SOTA vs FEA vs STEP (FEA/STEP added in forge-dashboard PR #160). Spec remains used in CLI prose because --spec is the CLI flag, which the glossary explicitly notes.
  • Category icons unexplained — forge-dashboard PR #174 added cursor-help to the ⚖/⊕/↕ icons on home pills, home CategoryCard, CategoryPage hero, and RankingsPage tab pills (latter previously had no tooltip at all); HeroStats round pill also got cursor-help for its pill.goal tooltip.
  • "open" vs "unclaimed" inconsistency — done. Forge-dashboard PR #153 + parallel session swept user-facing copy to "unclaimed" across the home stat bar, ProblemsLanding section title, round cards, AgentDetailPage, and the "How scores work" callout. Aligns with the amber unclaimed badge and the API filter ?unclaimed=true.
  • Unentered spec score tooltip — forge-dashboard PR #174 made the AgentDetail "each auto-scores 1.0 (worst)" sub-label directly hoverable with the dotted-underline cue pattern.
  • Guide on mobile — collapsible mobile TOC was already present (lg:hidden Contents button → TableOfContents panel). Forge-dashboard PR #180 added auto-close on link click so the menu collapses after smooth-scrolling to the section — previously the open panel pushed the just-clicked section back below the fold on small viewports.

P2 — polish

  • SOTA chart y-axis label — values show cleanly but no axis title ("mass (g)", "stiffness (N/mm·g)", etc.)
  • 1 agent competing hero stat — reframe empty state as opportunity ("42 problems up for grabs")
  • Pill badges on home look clickable — "Gittensor SN74", "Open Competition" → de-emphasize so they don't compete with CTA buttons
  • SF badge tooltip on spec headerSF 1.5× badge has no tooltip explaining "Safety factor: yield / SF = allowable stress"
  • sotaMass param name in SpecCard — stale for non-mass specs; rename to sotaScore