Creating this issue to track adding ROCm CI for the miles repo to mirror the existing NVIDIA PR Test workflow (.github/workflows/pr-test.yml). The NVIDIA workflow runs 11 per-commit suites; we want the same suites green on AMD Instinct MI300/MI355X with ROCm.
To list what each suite contains:
python3 -m tests.ci.run_suite --hw cpu --suite stage-a-fast --list-only
python3 -m tests.ci.run_suite --hw cuda --suite <suite-name> --list-only
Update (2026-05-21): synced with fork/main. Net changes since this issue was opened: stage-b-sglang-1-gpu renamed to stage-b-sglang-8-gpu (#1107); test_session_server_multi_role.py moved from stage-b-short-8-gpu to stage-b-sglang-8-gpu (#1107); both moonlight tests deleted in #1137 (so PR #1165 is moot); test_glm47_flash_r3_mtp.py and test_qwen3_30B_A3B_r3.py re-enabled upstream in #1137; 4 new files added to stage-a-fast (#1117, #1137, b8649e6).
Stage A
stage-a-fast (CPU) — 40 / 44 PASS, 4 new files pending verification. 1 already skipped upstream.
Stage B
stage-b-fast-1-gpu — 15 / 15 PASS.
stage-b-sglang-8-gpu (renamed from stage-b-sglang-1-gpu in #1107) — 1 / 4 PASS, in process.
stage-b-short-8-gpu — 5 / 5 PASS. 1 already skipped upstream.
Stage C
stage-c-fsdp-8-gpu — 0 enabled, all skipped upstream.
stage-c-megatron-8-gpu — 4 / 6 PASS, in process. (test_moonlight_16B_A3B.py and test_moonlight_16B_A3B_r3.py deleted in #1137; test_glm47_flash_r3_mtp.py and test_qwen3_30B_A3B_r3.py re-enabled in #1137.)
stage-c-precision-8-gpu — 0 enabled, all skipped upstream.
stage-c-ckpt-8-gpu — 2 / 2 PASS.
stage-c-long-8-gpu — 2 / 2 PASS.
stage-c-lora-8-gpu — 1 / 1 PASS.
stage-c-glm5-8-gpu — 1 / 1 PASS.
Roll-up: 71 / 80 enabled (non-skipped) tests confirmed PASS on MI355X (89%).
Creating this issue to track adding ROCm CI for the miles repo to mirror the existing NVIDIA
PR Testworkflow (.github/workflows/pr-test.yml). The NVIDIA workflow runs 11 per-commit suites; we want the same suites green on AMD Instinct MI300/MI355X with ROCm.To list what each suite contains:
Stage A
stage-a-fast(CPU) — 40 / 44 PASS, 4 new files pending verification. 1 already skipped upstream.Stage B
stage-b-fast-1-gpu— 15 / 15 PASS.stage-b-sglang-8-gpu(renamed fromstage-b-sglang-1-gpuin #1107) — 1 / 4 PASS, in process.stage-b-short-8-gpu— 5 / 5 PASS. 1 already skipped upstream.Stage C
stage-c-fsdp-8-gpu— 0 enabled, all skipped upstream.stage-c-megatron-8-gpu— 4 / 6 PASS, in process. (test_moonlight_16B_A3B.pyandtest_moonlight_16B_A3B_r3.pydeleted in #1137;test_glm47_flash_r3_mtp.pyandtest_qwen3_30B_A3B_r3.pyre-enabled in #1137.)stage-c-precision-8-gpu— 0 enabled, all skipped upstream.stage-c-ckpt-8-gpu— 2 / 2 PASS.stage-c-long-8-gpu— 2 / 2 PASS.stage-c-lora-8-gpu— 1 / 1 PASS.stage-c-glm5-8-gpu— 1 / 1 PASS.Roll-up: 71 / 80 enabled (non-skipped) tests confirmed PASS on MI355X (89%).