Pull requests: ROCm/ATOM
Pull requests list
[vLLM-ATOM] Fix GLM-4.7 MTP in vLLM plugin
#805
opened May 15, 2026 by
kliuae
Contributor
1 task
Add inferencex-sync skill for ATOM benchmark comparison and InferenceX PR creation
#802
opened May 15, 2026 by
seungrokj
Contributor
[Plugin][MLA] Tolerate rotary_emb=None for NoPE-only MLA models (Kimi-Linear)
#792
opened May 14, 2026 by
ChuanLi1101
Collaborator
1 of 4 tasks
[fix](gpt-oss): fix quark quantized model in moe bias
#787
opened May 14, 2026 by
PerryZhang01
Contributor
Add DSR1-MXFP4 recipe for MI355X (Team Jons contest submission, 2840/3000)
#786
opened May 14, 2026 by
j0ons
ci(benchmark): upgrade Kimi K2.5 to K2.6
#781
opened May 14, 2026 by
carlushuang
Contributor
1 of 2 tasks
[codex] DeepSeek FP4 MTP decode safeguards and MLA hooks
#779
opened May 13, 2026 by
josusanmartin
Draft
feat(server): add Anthropic Messages API endpoint (/v1/messages)
#778
opened May 13, 2026 by
carlushuang
Contributor
4 of 5 tasks
Add mooncake dockerfile build
#771
opened May 13, 2026 by
ZhangLirong-amd
Contributor
1 task
[MoE] adapt to triton_kernels matmul_ogs -> matmul rename
#763
opened May 12, 2026 by
Liang-jianhao97
1 task done
[feat][ATOM-vLLM][Attention Refactor] Reconstruct the Attention Arch
#750
opened May 11, 2026 by
zejunchen-zejun
Collaborator
Draft
Add Mistral-3-8B + Qwen3-8B-FP8 + native triton attention backend for gfx1201 (RDNA4 / RX 9070 XT)
#749
opened May 11, 2026 by
carlushuang
Contributor
[feat][breaking] Enable prefix caching by default
#741
opened May 11, 2026 by
functionstackx
Contributor
3 of 6 tasks
perf: optimize GDN decode with SGLang fused recurrent kernel
#727
opened May 9, 2026 by
zovonoir
Contributor
1 of 2 tasks
docs: deploy compressor page with docs workflow
#715
opened May 7, 2026 by
gyohuangxin
Member
perf: fused Triton kernels for Qwen3.5 RMSNorm and MRoPE
#708
opened May 7, 2026 by
zovonoir
Contributor
1 of 2 tasks
[ci] add Qwen3.5 Dense/MoE models accuracy validation and benchmark tests for atom-plugined sglang
#700
opened May 6, 2026 by
wanzhenchn
Contributor
[vLLM-ATOM benchmark] add GLM-4.7 and Minimax-2.5 to vLLM-ATOM benchmark
#695
opened May 6, 2026 by
gbyu-amd
Contributor
1 task
Enable Cohere Command-R (CohereForCausalLM / Cohere2ForCausalLM) on ATOM
#675
opened Apr 30, 2026 by
jatseng-ai