Cherry pick commits to fix mxfp4 perf #897

Open
zhanglx13 wants to merge 1 commit into pytorch/rocm7.1_internal_testing from fix_mxfp4
@zhanglx13

| Cherry-picked commit | Commit message | TFLOPS | Note |
| --- | --- | --- | --- |
| 5366920 | Current baseline | 800 | 81 spills |
| e8f5420 | [GEMM] Add combine dot_scaled and addF | 3500 | Eliminates all spills |
