Skip to content

[ATOM-SGL][Attn refrac] Route DeepSeek MLA through an SGLang wrapper#29

Open
ZhiweiYan-96 wants to merge 1 commit into
zhiwei/attn_model_decouplefrom
zhiwei/attn_refrac_share_model
Open

[ATOM-SGL][Attn refrac] Route DeepSeek MLA through an SGLang wrapper#29
ZhiweiYan-96 wants to merge 1 commit into
zhiwei/attn_model_decouplefrom
zhiwei/attn_refrac_share_model

Conversation

@ZhiweiYan-96
Copy link
Copy Markdown
Collaborator

Move the SGLang DeepSeek MLA runtime entry from legacy forward glue into SGLangDeepseekMLAAttention while keeping RadixAttention and the full-attention backend as the host/backend layers. Shrink deepseek_mla_forward.py into a helper module and clarify absorbed vs non-absorbed path naming.

Move the SGLang DeepSeek MLA runtime entry from legacy forward glue into
SGLangDeepseekMLAAttention while keeping RadixAttention and the full-attention
backend as the host/backend layers. Shrink deepseek_mla_forward.py into a
helper module and clarify absorbed vs non-absorbed path naming.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant