[ATOM-SGL][Attn refrac] Separate model-specific MLA from SGL full attention backend by ZhiweiYan-96 · Pull Request #28 · zejunchen-zejun/ATOM

ZhiweiYan-96 · 2026-05-12T08:36:37Z

Motivation

The SGLang plugin has three components:

Attention: GDN and full attention
Model forward patches, such as the DeepSeek MLA forward and the Qwen3.5 forward
SGLang runtime management: managing forward-batch-related information
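
The separation named in the PR title can be pictured with a small, self-contained sketch. Everything here is hypothetical and is not taken from the ATOM or SGLang code: `ForwardBatch`, `FullAttentionBackend`, `register_forward_patch`, and `mla_toy_forward` are illustrative names, and the "MLA" step is a toy stand-in (it just doubles the latent rows to form values), not real multi-head latent attention. The only point is that the backend stays model-agnostic while the model-specific forward lives in a separate patch that calls into it.

```python
import math
from dataclasses import dataclass
from typing import Callable, Dict, List

Vector = List[float]


@dataclass
class ForwardBatch:
    """Hypothetical stand-in for runtime-managed per-batch info."""
    seq_lens: List[int]


class FullAttentionBackend:
    """Hypothetical model-agnostic backend: plain softmax attention, no MLA logic."""

    def forward(self, q: Vector, keys: List[Vector], values: List[Vector],
                batch: ForwardBatch) -> Vector:
        scale = 1.0 / math.sqrt(len(q))
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in keys]
        top = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - top) for s in scores]
        total = sum(weights)
        dim = len(values[0])
        return [sum(w * v[i] for w, v in zip(weights, values)) / total
                for i in range(dim)]


# Hypothetical registry of model-specific forward patches, keyed by model name.
MODEL_FORWARD_PATCHES: Dict[str, Callable[..., Vector]] = {}


def register_forward_patch(name: str):
    def decorator(fn):
        MODEL_FORWARD_PATCHES[name] = fn
        return fn
    return decorator


@register_forward_patch("mla_toy")
def mla_toy_forward(backend: FullAttentionBackend, q: Vector,
                    latent_kv: List[Vector], batch: ForwardBatch) -> Vector:
    # Model-specific step (toy, NOT real MLA): derive keys and values
    # from a shared latent, then delegate to the generic backend.
    keys = latent_kv
    values = [[2.0 * x for x in row] for row in latent_kv]
    return backend.forward(q, keys, values, batch)
```

With this layout, adding another model's forward would mean registering another patch, leaving `FullAttentionBackend` untouched.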


ZhiweiYan-96 changed the title ~~Zhiwei/attn model decouple~~ [ATOM-SGL][Attn refrac] Separate model-specific MLA from SGL full attention backend May 12, 2026

ZhiweiYan-96 added 3 commits May 19, 2026 11:01

add work log

29408d1

[ATOM-SGL][Attn refrac] Separate model-specific MLA from SGL full attention backend

e1d06c7

remove work log

3598254

ZhiweiYan-96 force-pushed the zhiwei/attn_model_decouple branch from 67d67b9 to 3598254 Compare May 19, 2026 11:43

ZhiweiYan-96 mentioned this pull request May 21, 2026

[ATOM SGLang] SGL plugin Attention Refractory ROCm/ATOM#863

Open

ZhiweiYan-96 wants to merge 3 commits into main from zhiwei/attn_model_decouple
