[FLYDSL] [TRITON] Attention backward mxfp8 gfx950 by lburzawa · Pull Request #3094 · ROCm/aiter

lburzawa · 2026-05-08T22:58:36Z

Motivation

Support mxfp8 attention backward in FlyDSL on gfx950.

Technical Details

Main attn bwd kernel in FlyDSL
Bwd preprocess kernel in Triton
Quant kernels in Triton

Test Plan

Correctness tests for each kernel.

Test Result

Tests pass.

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

github-actions · 2026-05-08T22:58:54Z

🏷️ CI Guide

Runs automatically on every PR:

✅ Pre-checks (submodule verification, code formatting)
✅ Aiter op tests (gfx942 + gfx950)
✅ Triton tests on MI35X (only when aiter/ops/triton/** or related paths are changed)

Extended tests (opt-in via labels):

Label	Tests
`ci:triton-300x`	Run an additional Triton test job on MI300X in PRs; main branch always runs both MI35X and MI300X
`ci:sglang`	SGLang integration tests
`ci:atom`	ATOM benchmark (DeepSeek-R1 + GPT-OSS)
`ci:vllm`	vLLM benchmark
`ci:all`	All of the above

Add labels via the sidebar or gh pr edit 3094 --add-label <label>

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

github-actions · 2026-05-16T19:54:21Z

+    kv_load_bytes = 16
+
+    bytes_per_tile_qo_scale = (int(tile_m) * int(tile_head)) // 32
+    bytes_per_thread_qo_scale = max(1, bytes_per_tile_qo_scale // total_threads)


⚠️ [ruff] <F841> _{reported by reviewdog 🐶}
Local variable bytes_per_thread_qo_scale is assigned to but never used

Suggested change

bytes_per_thread_qo_scale = max(1, bytes_per_tile_qo_scale // total_threads)

max(1, bytes_per_tile_qo_scale // total_threads)

github-actions · 2026-05-16T19:54:22Z

+    )
+    non_torch_memory_before = cuda_memory_before - torch_memory_before
+
+    data = func(*args, **kwargs)


⚠️ [ruff] <F841> _{reported by reviewdog 🐶}
Local variable data is assigned to but never used

Suggested change

data = func(*args, **kwargs)

func(*args, **kwargs)

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

lburzawa added 2 commits May 8, 2026 17:07

add mxfp8 quants and bwd preprocess

c894f91

add attn bwd main kernel in flydsl

10caa55

lburzawa requested a review from a team May 8, 2026 22:58

lburzawa requested a review from vgokhale May 8, 2026 23:03

Update aiter/ops/flydsl/kernels/attn_bwd_mxfp8_gfx950.py

d6fd3a2

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

lburzawa requested a review from coderfeli May 8, 2026 23:07

add gqa support

0fc8327

vgokhale previously approved these changes May 13, 2026

View reviewed changes

support uneven sequences

b88b194

lburzawa dismissed vgokhale’s stale review via b88b194 May 14, 2026 00:27

lburzawa added 3 commits May 14, 2026 00:32

reformat

125345e

Merge branch 'main' into attn_bwd_mxfp8_gfx950

5fe0ad6

supports 2d quant

74adeec

github-actions Bot reviewed May 16, 2026

View reviewed changes

Update aiter/ops/flydsl/kernels/attn_bwd_mxfp8_gfx950.py

f4c9101

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FLYDSL] [TRITON] Attention backward mxfp8 gfx950#3094

[FLYDSL] [TRITON] Attention backward mxfp8 gfx950#3094
lburzawa wants to merge 9 commits into
mainfrom
attn_bwd_mxfp8_gfx950

lburzawa commented May 8, 2026 •

edited

Loading

Uh oh!

github-actions Bot commented May 8, 2026

Uh oh!

github-actions Bot May 16, 2026

Uh oh!

github-actions Bot May 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	bytes_per_thread_qo_scale = max(1, bytes_per_tile_qo_scale // total_threads)
	max(1, bytes_per_tile_qo_scale // total_threads)

Conversation

lburzawa commented May 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

github-actions Bot commented May 8, 2026

🏷️ CI Guide

Uh oh!

github-actions Bot May 16, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot May 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lburzawa commented May 8, 2026 •

edited

Loading