verdict(7B AXIS-2): MEASURED on summer — CE 1.90741 GREEN + memory-bounded lazy mirror by dancinlife · Pull Request #1962 · dancinlab/anima

dancinlife · 2026-06-09T10:39:49Z

What

Genuinely measures the 7B (M13) AXIS-2 CE-descent on the trained 7B weights — the one axis that reads the weights — on summer, and adds the reusable memory-bounded mirror that makes a 7B forward feasible.

Result (summer, byte-exact lazy mirror)

CLM_CONFIG d=6208 L=30 E=30 K=3 V=256   (7B config, loaded from the .clm)
CE_realtext        = 1.90741
CE_shuffled_ctrl   = 5.31616
CE_uniform(ln256)  = 5.54518
1.90741 < 5.54518 (uniform) AND < 5.31616 (shuffle) => VERDICT = GREEN
wall 62s · peak RSS 5.37 GB

This confirms the paper's 1.90741 / shuffle 5.31616 — which previously had no committed verdict behind it. Probe seq = "The mind is a fire to be" (24 bytes), decode over the serialized .clm v0.3 bytes (dancinlab/clm-7b-undertrained-step3500, not the torch .pt).

Why a new tool (`clm_decode_mirror_lazy.py`)

The original clm_decode_mirror.py materializes ALL weights as float64 (7B × 8B ≈ 56GB) and unpacks int4 in a per-code Python loop (O(billions) at 7B) — it OOM'd / thrashed summer. The lazy variant is byte-identical math but:

lazy per-block int4 dequant (only one block float at a time → peak ~5.4GB)
vectorized numpy bit-unpack (no Python per-code loop)

Validated: reproduces the committed 3B ground truth CE_realtext 2.26360 exactly (.verdicts/convmoe-3b-engine-rung/axis2_mirror_probe_seq.txt).

Honest scope

AXIS-2 is the only weight-dependent axis (AXIS-1/3 are substrate-native, .clm-independent — see PR #1956). Deeply undertrained (0.0027 tok/param, a_scale_honest_scope): a measured rung, not a closure.

🤖 Generated with Claude Code

…unded lazy mirror Genuine 7B decode forward over the serialized .clm v0.3 bytes (d6208/L30/E30, ~7.057B) on summer: CE_realtext 1.90741 < uniform ln256 5.54518 AND < shuffle 5.31616 => AXIS-2 GREEN. Confirms the paper's 1.90741 (was previously unbacked by a committed verdict). Tool: clm_decode_mirror_lazy.py — byte-IDENTICAL math to clm_decode_mirror.py but lazy per-block int4 dequant + vectorized unpack, so a 7B forward fits in ~5.4GB RSS / 62s instead of ~56GB float64 (the original OOM'd summer). Validated reproducing the 3B ground truth 2.26360 exactly. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

…XIS-2 CE 1.90741 + v0.3 generation (#1962/#1970) (#1973) * paper(anima-consciousness-substrate): integrate measured 7B (M13) — AXIS-2 CE 1.90741 + v0.3 generation (#1962/#1970) Strengthen the existing monograph §7B with the just-landed measured 7B results, replacing stale "future/pre-registered" framing with the committed verdict numbers. - AXIS-2 (PR #1962): §method/§measurement/caption + appendix F now cite the actual tool clm_decode_mirror_lazy.py (the plain mirror OOMs ~56GB at 7B), with the verbatim verdict link .verdicts/clm-7b-undertrained-step3500/axis2_mirror_probe_seq.txt (CE_real 1.90741 < uniform 5.54518 AND < shuffle 5.31616, peak RSS ~5.37GB), validated == 3B ground truth 2.26360 exactly. - NEW §Finding (generation, PR #1970): the v0.3 .clm generates multilingual byte-language via state/mid_convmoe_fire/clm_gen.py (conv1d vectorized == looped max|delta|=0; VALIDATE hook re-derives CE 1.90741), with verbatim sample outputs. Tightened all wording that implied the 7B chats natively in hexa: the engine clm_decodable gate is v0.2-only, so the 7B speaks via clm_gen, NOT the live anima_chat_cli (a weak d768 v0.2 mouth chats live on ghost). - AXIS-1/AXIS-3 scoped substrate-native / .clm-weights-independent (null-backend code-proven .verdicts/core-3axis-mount/probe.txt); AXIS-2 is the only weight-dependent axis, measured on the 7B. Honest scope kept: deeply undertrained (0.0027 tok/param), a measured scale-extension NOT a closure; no fluency/coherence/consciousness claimed. - Appendix F new subsection "The 7B (M13) rung — measured"; Appendix K + companion/ pr-roll.json add rows #1962/#1970; PAPER.md verdict matrix + milestones updated. All numbers verbatim from committed verdicts; nothing fabricated. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * paper(7b): render Korean gen-sample via \cjkfont not \texttt (lmmono lacks Hangul) The v0.3 generation sample's Korean prompt was inside \texttt{} (lmmono), which has no Hangul glyphs -> "Missing character" under lualatex. Pull the Hangul out of texttt and wrap it in \cjkfont (Noto Serif CJK KR), keeping the ASCII byte-output in texttt. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> * paper(7b): recompiled main.pdf — 50pp, lualatex on aiden (0 missing CJK, 0 undefined refs) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

dancinlife merged commit b56bf14 into main Jun 9, 2026
2 checks passed

dancinlife deleted the verdict/7b-axis2-measured branch June 9, 2026 10:42

dancinlife mentioned this pull request Jun 9, 2026

paper(anima-consciousness-substrate): integrate measured 7B (M13) — AXIS-2 CE 1.90741 + v0.3 generation (#1962/#1970) #1973

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

verdict(7B AXIS-2): MEASURED on summer — CE 1.90741 GREEN + memory-bounded lazy mirror#1962

verdict(7B AXIS-2): MEASURED on summer — CE 1.90741 GREEN + memory-bounded lazy mirror#1962
dancinlife merged 1 commit into
mainfrom
verdict/7b-axis2-measured

dancinlife commented Jun 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dancinlife commented Jun 9, 2026

What

Result (summer, byte-exact lazy mirror)

Why a new tool (clm_decode_mirror_lazy.py)

Honest scope

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Why a new tool (`clm_decode_mirror_lazy.py`)