[ATOM-SGL][Attn refrac] Route DeepSeek MLA through an SGLang wrapper by ZhiweiYan-96 · Pull Request #29 · zejunchen-zejun/ATOM

ZhiweiYan-96 · 2026-05-13T07:22:05Z

Move the SGLang DeepSeek MLA runtime entry point out of the legacy forward glue and into SGLangDeepseekMLAAttention, while keeping RadixAttention and the full-attention backend as the host/backend layers. Shrink deepseek_mla_forward.py into a helper module and clarify the naming of the absorbed and non-absorbed paths.
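The layering described above can be pictured with a minimal, framework-free sketch. This is not the code in this PR: `MLAWrapperSketch`, `HostAttentionSketch`, `recording_backend`, and the `use_absorbed` flag are hypothetical names chosen for illustration, and plain Python lists stand in for tensors. It only shows the shape of the idea, where a wrapper owns the runtime entry point and selects an explicitly named path while the host layer and backend stay separate.

```python
from typing import Callable, List

Tensor = List[float]  # stand-in for a real tensor type (assumption)


class HostAttentionSketch:
    """Hypothetical stand-in for the host layer; it only forwards to a backend."""

    def __init__(self, backend: Callable[[str, Tensor], Tensor]) -> None:
        self.backend = backend

    def __call__(self, path: str, hidden: Tensor) -> Tensor:
        return self.backend(path, hidden)


class MLAWrapperSketch:
    """Hypothetical wrapper that owns the entry point and picks a named path."""

    def __init__(self, host: HostAttentionSketch) -> None:
        self.host = host

    def forward(self, hidden: Tensor, use_absorbed: bool) -> Tensor:
        # The caller decides which path to take; how the real code decides
        # is not stated in the PR description, so it is a parameter here.
        if use_absorbed:
            return self._forward_absorbed(hidden)
        return self._forward_non_absorbed(hidden)

    def _forward_absorbed(self, hidden: Tensor) -> Tensor:
        return self.host("absorbed", hidden)

    def _forward_non_absorbed(self, hidden: Tensor) -> Tensor:
        return self.host("non_absorbed", hidden)


def recording_backend(log: List[str]) -> Callable[[str, Tensor], Tensor]:
    """Build a fake backend that records the chosen path and echoes its input."""

    def backend(path: str, hidden: Tensor) -> Tensor:
        log.append(path)
        return list(hidden)  # placeholder: a real backend would run attention

    return backend


log: List[str] = []
wrapper = MLAWrapperSketch(HostAttentionSketch(recording_backend(log)))
out = wrapper.forward([1.0, 2.0], use_absorbed=True)
wrapper.forward([3.0], use_absorbed=False)
```

In this sketch the wrapper is the only place that knows about path names, which is one way to read the goal of turning the legacy forward module into a set of helpers.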

ZhiweiYan-96 force-pushed the zhiwei/attn_model_decouple branch from 67d67b9 to 3598254 on May 19, 2026 11:43

ZhiweiYan-96 force-pushed the zhiwei/attn_refrac_share_model branch from 2a3dd89 to a1f88ae on May 19, 2026 11:43

ZhiweiYan-96 mentioned this pull request May 21, 2026: [ATOM SGLang] SGL plugin Attention Refractory, ROCm/ATOM#863 (Open)

ZhiweiYan-96 wants to merge 1 commit into zhiwei/attn_model_decouple from zhiwei/attn_refrac_share_model
