Skip to content

refactor: remove dead-end GPU experiments and clean up exec plans#115

Merged
ChrisLundquist merged 1 commit intomasterfrom
claude/happy-hodgkin
Mar 10, 2026
Merged

refactor: remove dead-end GPU experiments and clean up exec plans#115
ChrisLundquist merged 1 commit intomasterfrom
claude/happy-hodgkin

Conversation

@ChrisLundquist
Copy link
Copy Markdown
Owner

Summary

  • Delete lz77_hash GPU kernel and all Rust references — fundamental SIMT atomic ordering problem produces 6.25% match ratio vs 99.61% CPU
  • Delete parlz module, pipeline variant (ID 11 reserved), WebGPU integration, and CLI flag — 37.6% ratio gap vs serial greedy, confirmed NO-GO
  • Disable gpu_fused_span() routing entropy to GPU for Lzr/LzSeqR — GPU rANS is 0.54–0.77x CPU after 29+ optimization iterations
  • Update exec plan statuses: PLAN-p0a-gpu-rans → CLOSED; interleaved-rans, unified-scheduler-north-star, huffman-sync-decode, agent-harness → PARKED
  • Add benchmark caveat to CLAUDE.md documenting ~18x gap between bench.sh and Criterion

29 files changed, +53 / -1,596 lines. All removed code was confirmed dead-end with documented performance data.

Test plan

  • cargo fmt --check passes clean
  • cargo clippy --all-targets passes clean (zero warnings)
  • cargo test --no-default-features — 571 passed, 0 failed
  • Pre-commit hook (fmt + clippy + test) passes

🤖 Generated with Claude Code

Remove confirmed dead-end code that wastes future agent time:

- Delete lz77_hash.wgsl GPU hash-table kernel and all Rust references
  (fundamental SIMT atomic ordering problem, 6.25% match ratio vs 99.61% CPU)
- Delete parlz module, pipeline variant, WebGPU integration, and CLI flag
  (37.6% ratio gap vs serial greedy, NO-GO for fully-parallel GPU LZ)
- Disable gpu_fused_span() routing entropy to GPU for Lzr/LzSeqR
  (GPU rANS is 0.54-0.77x CPU after 29+ optimization iterations)

Update exec plan statuses:
- PLAN-p0a-gpu-rans-vertical-slice: CLOSED (perf gate failed)
- PLAN-interleaved-rans: PARKED (Phase D cancelled, GPU rANS dead end)
- PLAN-unified-scheduler-north-star: PARKED (Phases 2+5 blocked)
- TODO-huffman-sync-decode: PARKED (zero implementation progress)
- agent-harness-implementation: PARKED (Phase 1 done, rest deferred)

Add benchmark caveat to CLAUDE.md documenting ~18x gap between
bench.sh (per-process, includes wgpu init) and Criterion (pre-allocated).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@ChrisLundquist ChrisLundquist merged commit 327c122 into master Mar 10, 2026
4 checks passed
@ChrisLundquist ChrisLundquist deleted the claude/happy-hodgkin branch March 10, 2026 09:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant