-
Notifications
You must be signed in to change notification settings - Fork 1.5k
Pull requests: openai/parameter-golf
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
LongContext 4096 + Full SOTA Stack & QAT Int4 → 16 Layers
#347
opened Mar 21, 2026 by
FlashyFlash3011
•
Draft
2 of 4 tasks
Non-record: DART - Differential Attention Recurrent Transformer (Student submission, Kerala)
#345
opened Mar 21, 2026 by
anandks2006
Loading…
Non-record: Autoresearch Heads4 + Step-based LR + Sliding Window (1xH100)
#344
opened Mar 21, 2026 by
aryanbhosale
Loading…
Non-record: MLX-Optimized 12L 416d with SmearGate + BigramHash (val_bpb=1.9011, Mac)
#342
opened Mar 21, 2026 by
adhyaay-karnwal
Loading…
Add Hybrid Depth-Recurrent Transformer submission
#341
opened Mar 21, 2026 by
tobiascanavesi
Loading…
V2 Prototype: SwiGLU + Dropout + MuonWD + MidLayerLoop
#340
opened Mar 21, 2026 by
starfly-web
Loading…
Record: 11L XSA+EMA+TTT, sliding val_bpb=1.1254 (3-seed mean 1.1256)
#338
opened Mar 21, 2026 by
alertcat
Loading…
Non-record: 11L PartialRoPE + LNScale + EMA + SWA + TTT (1xH100 80min, val_bpb=1.2108)
#334
opened Mar 21, 2026 by
nathon-lee
Loading…
4 tasks done
11L XSA4 + SmearGate + BigramHash + SWA + RoPE50K (mean val_bpb=1.1565, 3 seeds)
#333
opened Mar 21, 2026 by
mahsumaktas
Loading…
4 tasks done
Record: 12L Gradient-Guided Quant + Partial RoPE + LN Scale + EMA + XSA4 (val_bpb: 1.1320)
#332
opened Mar 21, 2026 by
saml212
Loading…
10L MLP3x + BigramHash(2048) + SWA + Stride-32: 1.1487 BPB
#331
opened Mar 21, 2026 by
Rhodrium
Loading…
3 tasks
Non-record: 11L Int6 + Online Logit Bias (val_bpb=1.1609)
#330
opened Mar 21, 2026 by
bopmite
Loading…
Non-record: MLX prototyping harness with validated technique stack (val_bpb=1.9588, Mac)
#328
opened Mar 21, 2026 by
kingjulio8238
Loading…
4 of 5 tasks
Submission TrigramHash + PartialRoPE + HeadTemp + stride32 (val_bpb: 1.1450)and
#327
opened Mar 21, 2026 by
Ananddna
Loading…
[Non-Record] QAT + NTK-4096 Eval + Cosine Warmdown + Aggressive SWA
#326
opened Mar 21, 2026 by
crony-io
Loading…
Add Looped Transformer Design non-record submission (non tuned)
#325
opened Mar 21, 2026 by
Aum08Desai
Loading…
Minimal recurrent motif (sb1 rs2 g0.18) – non-record submission
#323
opened Mar 21, 2026 by
megnat05-tmm
Loading…
11L SmearGate + BigramHash(10240) + Causal TTT + Mixed Int5/Int6 + SWA
#322
opened Mar 21, 2026 by
romainsantoli-web
•
Draft
Add record: Optimizer Tuning + Sliding Window Eval (val_bpb=1.1864)
#321
opened Mar 21, 2026 by
andreanjos
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.