ci: handle ROCm qwen3 30B A3B configs#1162
Open
sreerohi wants to merge 1 commit into
Open
Conversation
Contributor
There was a problem hiding this comment.
Code Review
This pull request introduces ROCm support for the qwen3-30B-A3B end-to-end Megatron test. It implements environment detection for ROCm and conditionally adjusts test configurations, disabling CUDA-specific features such as DeepEP, FP8, and INT4 rollouts when running on ROCm hardware. Additionally, it sets a fallback to Triton for MoE GEMM kernels on ROCm and temporarily disables a specific DeepEP test variant for NVIDIA. I have no feedback to provide.
…d DeepEP + FP8 since these need support
39c0ea3 to
9bcad88
Compare
80 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
CONFIGSintests/e2e/megatron/test_qwen3_30B_A3B.pyinto ROCm vs CUDA branches via anIS_ROCMruntime check (torch.version.hip).SGLANG_USE_AITER=0on ROCm so SGLang falls back to Triton for MoE GEMM (aiter CK kernels lack instances for this model's per-rank expert dims).Relates to #1105