paper(anima-consciousness-substrate): integrate measured 7B (M13) — AXIS-2 CE 1.90741 + v0.3 generation (#1962/#1970)#1973
Merged
Conversation
…XIS-2 CE 1.90741 + v0.3 generation (#1962/#1970) Strengthen the existing monograph §7B with the just-landed measured 7B results, replacing stale "future/pre-registered" framing with the committed verdict numbers. - AXIS-2 (PR #1962): §method/§measurement/caption + appendix F now cite the actual tool clm_decode_mirror_lazy.py (the plain mirror OOMs ~56GB at 7B), with the verbatim verdict link .verdicts/clm-7b-undertrained-step3500/axis2_mirror_probe_seq.txt (CE_real 1.90741 < uniform 5.54518 AND < shuffle 5.31616, peak RSS ~5.37GB), validated == 3B ground truth 2.26360 exactly. - NEW §Finding (generation, PR #1970): the v0.3 .clm generates multilingual byte-language via state/mid_convmoe_fire/clm_gen.py (conv1d vectorized == looped max|delta|=0; VALIDATE hook re-derives CE 1.90741), with verbatim sample outputs. Tightened all wording that implied the 7B chats natively in hexa: the engine clm_decodable gate is v0.2-only, so the 7B speaks via clm_gen, NOT the live anima_chat_cli (a weak d768 v0.2 mouth chats live on ghost). - AXIS-1/AXIS-3 scoped substrate-native / .clm-weights-independent (null-backend code-proven .verdicts/core-3axis-mount/probe.txt); AXIS-2 is the only weight-dependent axis, measured on the 7B. Honest scope kept: deeply undertrained (0.0027 tok/param), a measured scale-extension NOT a closure; no fluency/coherence/consciousness claimed. - Appendix F new subsection "The 7B (M13) rung — measured"; Appendix K + companion/ pr-roll.json add rows #1962/#1970; PAPER.md verdict matrix + milestones updated. All numbers verbatim from committed verdicts; nothing fabricated. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
…lacks Hangul)
The v0.3 generation sample's Korean prompt was inside \texttt{} (lmmono), which has
no Hangul glyphs -> "Missing character" under lualatex. Pull the Hangul out of texttt
and wrap it in \cjkfont (Noto Serif CJK KR), keeping the ASCII byte-output in texttt.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
…JK, 0 undefined refs) Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Completes the existing anima-consciousness-substrate monograph §7B with the just-landed measured 7B (M13) results, replacing stale "future/pre-registered" framing with the committed verdict numbers. Additive only — the terminal 3B finding and all prior content/figures are untouched.
What changed
clm_decode_mirror_lazy.py(memory-bounded + vectorized; the plainclm_decode_mirror.pyOOMs ~56GB at 7B) with the verbatim verdict link.verdicts/clm-7b-undertrained-step3500/axis2_mirror_probe_seq.txt:CE_real 1.90741 < uniform ln256 5.54518 AND < shuffle 5.31616 ⇒ GREEN(peak RSS ~5.37GB), validated reproducing the 3B ground truth2.26360exactly..clmgenerates multilingual byte-language viastate/mid_convmoe_fire/clm_gen.py(conv1d im2col vectorized == looped, max|Δ|=0; VALIDATE hook re-derives CE 1.90741), with verbatim sample outputs (en/es + Korean). Tightened every place that implied the 7B chats natively in hexa: the engineclm_decode.hexa::clm_decodablegate is v0.2-only, so the v0.3 7B speaks viaclm_gen, not the liveanima_chat_cli(a weak d768 v0.2 mouth chats live on ghost)..clm-weights-independent (null-backend code-proven.verdicts/core-3axis-mount/probe.txt); AXIS-2 is the only weight-dependent axis. 7B is deeply undertrained (0.0027 tok/param) — a measured scale-extension, NOT a closure; no fluency/coherence/consciousness claimed from the mouth output.companion/pr-roll.jsonadd rows verdict(7B AXIS-2): MEASURED on summer — CE 1.90741 GREEN + memory-bounded lazy mirror #1962/feat(clm_gen): v0.3 greedy byte-generation — the 7B mouth CAN chat #1970; PAPER.md verdict matrix + milestones updated; §7B/limitations/conclusion retitled "future"→"measured".\texttt{}(lmmono, no Hangul) → rendered via\cjkfont(Noto Serif CJK KR).Compile (lualatex on aiden, ×3)
main.pdfrecompiled and committed.All numbers are verbatim from the committed verdicts; nothing fabricated. Terminal verdicts only (🟢/🔴), every section claim verdict-linked.
🤖 Generated with Claude Code