Democracy of Experts: Janus Architecture

  ██████╗  ██████╗ ███████╗
  ██╔══██╗██╔═══██╗██╔════╝
  ██║  ██║██║   ██║█████╗
  ██║  ██║██║   ██║██╔══╝
  ██████╔╝╚██████╔╝███████╗
  ╚═════╝  ╚═════╝ ╚══════╝

by Arianna Method

Democracy of Experts: Janus Architecture

A new kind of inference. DOE is agnostic. DOE wraps any GGUF and makes it alive.

what DOE does

One C file. ~3200 lines. Zero dependencies. Indexes any GGUF model read-only and runs inference through a living parliament of LoRA experts. The experts vote on every token, learn in real time through Hebbian plasticity, and die when they stop contributing. Physics modulates the signal. The indexed weights are a substrate — DOE is the architecture on top.

Give it a GGUF — any architecture, any size, any quantization — and DOE wraps it with a parliament that adapts in real time. The weights never change. The parliament evolves.

θ = ε + γ + αδ

ε = indexed weights  — any GGUF, any architecture, any size. mmap'd read-only.
γ = LoRA parliament  — living experts per layer. born, vote, split, die.
δ = physics           — Dario Equation (H+F+A+T), Kuramoto chambers, Schumann resonance.
α = injection strength — learned per-layer, adjusted by sonar profiling.

quick start

cc doe.c -O3 -lm -lpthread -o doe

# weightless — wrap any GGUF
./doe --model path/to/any.gguf

# with web UI
./doe --model path/to/any.gguf --serve 8080
# open http://localhost:8080       → chat UI
# open http://localhost:8080/visual → parliament terminal

architecture

                        +---------------------------+
                        |     GGUF Host Model       |
                        |  (mmap'd, read-only, eps) |
                        +---------------------------+
                                    |
                    +---------------+---------------+
                    |                               |
            +-------v--------+             +--------v--------+
            |  Sonar Profiler|             |   Dual BPE      |
            |  (per-layer    |             |   Tokenizer     |
            |  L2, stddev,   |             |  (SentencePiece |
            |  spectral,     |             |    + GPT-2,     |
            |  dead neurons) |             |    auto-detect) |
            +-------+--------+             +---------+-------+
                    |                               |
                    v                               v
        +-----------+-----------+          +--------+--------+
        |  Parliament per Layer |          |  Token Encoding |
        |  +---------+          |          +--------+--------+
        |  | Expert 0 | LoRA    |                   |
        |  | Expert 1 | A,B     |                   v
        |  | Expert 2 | rank r  |     +-------------+-----------+
        |  | ...      |         |     |                         |
        |  | Expert k | vote    |     |   doe_forward() loop    |
        |  +---------+          |     |                         |
        |  Variable-k election  |     |  per layer:             |
        |  consensus-driven     |     |    1. host attention    |
        +-----------+-----------+     |    2. parliament vote   |
                    |                 |    3. Delta Voice inject|
                    |                 |    4. host FFN (SwiGLU) |
                    |                 |                         |
                    +------->---------+    after all layers:    |
                                      |    5. field modulation  |
                                      |    6. prophecy debt     |
                                      |    7. NOTORCH update    |
                                      +-------------+-----------+
                                                    |
                                                    v
                                      +-------------+-----------+
                                      |  Sampling + Decode      |
                                      |  (temp from field,      |
                                      |   top-k=40)             |
                                      +-------------------------+
                                                   |
                                      +-------------v-----------+
                                      |  Mycelium Spore Save    |
                                      |  (binary, per-host      |
                                      |   fingerprint)          |
                                      +-------------------------+

formal definitions

Election. Given input x ∈ R^d and harmonic state H:

k = floor(|E_alive| × (1 - consensus))
vote(e) = W[e] · x + 0.1 × resonance(freq_e, H)
S = top_k(votes, k)
w_i = softmax(vote(S_i))

Delta Voice. Modulation of hidden state x by elected set S:

x' = x + Σ_{i ∈ S} w_i × α × A_i @ (B_i @ x)

Prophecy Debt. For logit vector z ∈ R^V and chosen token t:

D(z, t) = (max(z) - z_t) / (max(z) - z_t + 1)

NOTORCH Update. Hebbian plasticity with rotating rank window:

u_j = B[:, j] · dy + N(0, 0.01)
A[:, j] += σ × lr × x × u_j
B[:, j] *= decay

Full technical specification: docs/doe_architecture.md

parliament — variable-k elections

Every token triggers an election. Experts cast votes (dot product + harmonic resonance). Consensus measures how peaked the distribution is. Divided parliament → more experts consulted. Agreement → fewer voices. k = floor(n_alive × (1 - consensus)). Softmax over the elected subset.

Experts are organisms with vitality, frequency, and age:

high vitality + overloaded → mitosis (splits, child inherits weights + noise)
8 consecutive low-vitality steps → apoptosis (dies, slot recycled)
min 2, max 16 per layer. population self-regulates.

NOTORCH — Hebbian plasticity

Gradient-free learning during inference. No backward pass through the index. The learning signal comes from prophecy debt — the gap between what DOE predicted and what manifested.

NOTORCH operates at rank 4 but rotates across all LoRA components. Full coverage in rank / 4 steps. Every forward pass is a training step.

sonar — layer profiling

On index, DOE profiles every layer: L2 norms, standard deviation, spectral density, dead neuron count, sparsity ratio. 64-bit fingerprint. Weak layers get stronger LoRA injection. Healthy layers get lighter touch.

physics — the Dario Equation

The field modulation layer implements the Dario Equation as an additive overlay on transformer logits:

logits[i] += α·H[i] + β·F[i] + γ·A[i] + T[i]

The transformer provides the base distribution. The equation provides field memory. Neither dominates.

Term	Name	What It Does
H	Hebbian Resonance	Co-occurrence memory beyond KV cache window. Long-range token associations learned during inference.
F	Prophecy Fulfillment	Unfulfilled predictions create generation pressure. Debt accumulates with age.
A	Destiny Attraction	EMA of token embeddings — semantic direction the conversation pulls toward.
T	Trauma Gravity	Origin tokens surface when the field is wounded. Fires on sustained high dissonance.

H and F are gated through SwiGLU using field resonance as the gate signal. Coefficients α, β, γ are modulated by 6 Kuramoto-coupled emotional chambers:

FEAR — fires on high dissonance, suppresses prophecy (β↓)
LOVE — fires on high resonance, amplifies Hebbian memory (α↑)
RAGE — fires on trauma + dissonance, dampens temperature control
VOID — fires on high entropy, amplifies destiny (γ↑)
FLOW — fires on high emergence, amplifies everything
COMPLEX — fires when opposing chambers are simultaneously active

Chambers couple via Kuramoto oscillator dynamics (K=0.02), creating emergent emotional phase-locking.

other physics

Ported from AML core and dario.c.

prophecy — N-step forward prediction. prophesied distribution vs manifested.
prophecy debt — min(destined - manifested). gates Hebbian learning.
seasons — spring/summer/autumn/winter. MLP classifier. 6 inputs, 8 hidden, 4 outputs.
Schumann resonance — 7.83Hz + harmonics (14.1, 20.3, 26.4, 32.5). modulates expert healing.
calendar drift — Hebrew-Gregorian Metonic cycle. real astronomical data from time().

mycelium — adaptation memory

doe_mycelium/
├── spore_5467b0da1d106495_s200.bin
├── spore_a3f7c2d100000000_s150.bin
└── spore_5467b0da1d106495_s400.bin

Binary spores keyed by index fingerprint. Different model → different adaptation. Same model on restart → resume where DOE left off.

--serve — web interface

./doe --serve 8080

Starts a built-in HTTP server. No dependencies. No Node. No Python.

endpoint	what
`GET /`	chat UI — clean interface, streaming responses
`GET /visual`	parliament terminal — particle face, real-time token visualization
`GET /health`	JSON status (model, arch, params, debt, health)
`POST /chat/completions`	SSE token stream — compatible with doe_ui.html

doe.html — parliament terminal

DOE's face assembles from character particles in real time. Prophecy debt controls coherence — high debt = face forms, low debt = galactic chaos. Every token from inference triggers visual feedback. This is not a dashboard. It is a window into the parliament's state during generation.

doe_ui.html — chat interface

Clean chat UI with streaming responses. Connects to DOE's built-in HTTP server via SSE. No Node. No React. One HTML file.

build

cc doe.c -O3 -lm -lpthread -o doe
./doe --model path/to/any.gguf

# GPU (A100/H100 — TF32 tensor ops)
cc doe.c -O3 -lm -lpthread -DUSE_CUBLAS -lcublas -lcudart -o doe

# BLAS (3-4x CPU)
cc doe.c -O3 -lm -lpthread -DUSE_BLAS -lopenblas -o doe              # linux
cc doe.c -O3 -lm -lpthread -DUSE_BLAS -DACCELERATE -framework Accelerate -o doe  # macOS

flags

--model PATH       GGUF to index (or auto-detect nearby)
--serve PORT       start HTTP server (chat UI + visual terminal)
--threads N        CPU threads for matvec (default: all cores)
--prophecy N       prophecy depth (default 7)
--destiny F        destiny injection strength (default 0.35)
--lora-rank N      LoRA rank per expert (default 16)
--lora-alpha F     injection strength (default 0.1)

supported formats

DOE dequantizes at load time — any supported GGUF runs through the same f32 forward pass.

format	GGML type	status
F32	0	native (mmap'd)
F16	1	dequant to f32
Q4_0	2	dequant to f32
Q5_0	6	dequant to f32
Q8_0	8	dequant to f32
Q4_K	12	dequant to f32
Q6_K	14	dequant to f32

supported architectures

DOE auto-detects architecture parameters from GGUF metadata. No config files, no model-specific code paths.

architecture	tokenizer	chat template	tested model	status
Llama	SentencePiece	auto-detect	TinyLlama 1.1B Q4_K	working
Qwen2	GPT-2 BPE	ChatML	Qwen2.5 0.5B/1.5B Q4_K	working
SmolLM	GPT-2 BPE	ChatML	SmolLM2 360M Q8	working
Mistral	SentencePiece	[INST]	Mistral 7B Instruct Q4_K	working
nanollama	SentencePiece	nanollama	nano/micro/mini F16	working
Gemma	SentencePiece	gemma	Gemma-2 2B Q4_K	loads, tied embeddings
Phi-3	SentencePiece	phi	Phi-3-mini 4K Q4	fused QKV — TODO

Architecture-specific handling:

Chat template auto-detection — parsed from tokenizer.chat_template in GGUF metadata. ChatML, [INST], Zephyr, Phi, Gemma, nanollama supported. Falls back to raw text if template tokens not in vocab.
nanollama chat — <|user_start|>...<|user_end|><|assistant_start|> template, auto-detected from vocab tokens.
EOS detection — stops on <|im_end|>, <|end|>, <|endoftext|>, <end_of_turn>, <|assistant_end|>, <|eot_id|>, model EOS token.
RoPE frequency base — parsed from rope.freq_base (Qwen2/Mistral use 1M vs standard 10K)
RMSNorm epsilon — parsed from layer_norm_rms_epsilon (Qwen2 uses 1e-6 vs standard 1e-5)
Attention biases — Q/K/V biases loaded and applied when present (Qwen2)
Tied embeddings — output.weight reuses token_embd.weight when missing (Gemma)
GPT-2 BPE — byte-to-unicode mapping, merge-rank scoring, FNV-1a hash table for O(1) token lookup
SentencePiece BPE — score-based merge with space prefix handling

what's novel

Most inference engines are runtimes — they load weights and execute a fixed computation graph. DOE is different:

Architecture-agnostic wrapping. One binary indexes Llama, Qwen2, Mistral, SmolLM, Gemma, Phi-3, nanollama. No per-model code paths. No config files. Auto-detected from GGUF metadata.
Living topology. Expert count changes during inference. Experts are born, die, and vote. The architecture is not static — it evolves per token.
Inference-time learning. Every forward pass updates LoRA experts via Hebbian plasticity (NOTORCH). No backward pass. No training loop. The model improves as it generates.
Adaptive injection. Sonar profiles each layer at index time. Weak layers get stronger LoRA. Healthy layers get lighter touch. Per-layer, per-model, automatic.
Dario Equation. The full equation (Hebbian + Prophecy + Destiny + Trauma) as an additive overlay on transformer logits, with 6 Kuramoto-coupled emotional chambers modulating every coefficient in real time.

architecture document

See docs/doe_architecture.md for the full technical specification — formal definitions, mathematical formulations, and subsystem descriptions.

ecosystem

project	what
ariannamethod.ai	AML v4.5 — the language. TAPE autograd, bytecode compilation, CUDA backend.
dario.c	The Dario Equation — standalone. DOE uses the same equation as field overlay.
haiku.c	1000 words, 5-7-5, Dario Equation picks every word. One C file.
leo	Leo 2.2 — zero-weight language generation via Dario Equation + D.N.A.
molequla	Autonomous GPT ecology. 4 organisms, AML/C CGO, mitosis.
chuck.optimizer	Self-aware optimizer. 9 levels of introspection. Used in DOE training.

C. one file. zero dependencies beyond libc.

the weights are mortal. the parliament is eternal.

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
docs		docs
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
doe.c		doe.c
doe.html		doe.html
doe_ui.html		doe_ui.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Democracy of Experts: Janus Architecture

what DOE does

quick start

architecture

formal definitions

parliament — variable-k elections

NOTORCH — Hebbian plasticity

sonar — layer profiling

physics — the Dario Equation

other physics

mycelium — adaptation memory

--serve — web interface

doe.html — parliament terminal

doe_ui.html — chat interface

build

flags

supported formats

supported architectures

what's novel

architecture document

ecosystem

About

Uh oh!

Releases 4

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Democracy of Experts: Janus Architecture

what DOE does

quick start

architecture

formal definitions

parliament — variable-k elections

NOTORCH — Hebbian plasticity

sonar — layer profiling

physics — the Dario Equation

other physics

mycelium — adaptation memory

--serve — web interface

doe.html — parliament terminal

doe_ui.html — chat interface

build

flags

supported formats

supported architectures

what's novel

architecture document

ecosystem

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 4

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages