Fix FP8 dequantization on MPS by falling back to CPU by snwchd71 · Pull Request #34 · Comfy-Org/comfy-kitchen

snwchd71 · 2026-04-04T19:48:03Z

Summary

MPS (Apple Silicon) does not support FP8 dtype conversion, causing dequantize_per_tensor_fp8() to crash with:

TypeError: Trying to convert Float8_e4m3fn to the MPS backend but it does not have support for that dtype.

This adds an explicit device + dtype guard to dequantize on CPU and transfer only the final result back to MPS. Includes a one-time logger.warning for visibility.

No hot-path overhead: the check is a string compare + frozenset membership, only evaluated when the tensor is on MPS with an FP8 dtype
Bit-identical results: every FP8 value is exactly representable in bfloat16/float16, verified with torch.equal
Auto-disables: the dtype guard means this becomes a no-op if MPS ever gains FP8 support

Prior work

Fix FP8 tensor support on MPS backend for Apple Silicon Macs #23 by @AoiYamada identified the same fix. This PR adds tests, a one-time warning, and the dtype guard.
fix: CPU fallback for FP8 quantization on MPS (Apple Silicon) ComfyUI#12378 by @tashiscool fixes the complementary quantization path in ComfyUI core (stochastic_rounding, quant_ops.py). Both fixes are needed for end-to-end FP8 on MPS, confirmed by @vSnake87 on M3 Max.

Test plan

test_dequantize_fp8_cpu_fallback_correctness — verifies fallback math is bit-identical to standard path (runs in CI, parametrized float16/bfloat16)
test_dequantize_fp8_on_mps_device — end-to-end on actual MPS hardware (skipped in CI, parametrized float16/bfloat16)
Full test suite: 103 passed, 203 skipped (CUDA/Triton), 0 failures
ruff check clean

MPS (Apple Silicon) does not support FP8 dtype conversion, causing dequantize_per_tensor_fp8() to crash. Add an explicit device check with dtype guard to dequantize on CPU and transfer only the final result back to MPS. Includes a one-time logger.warning for visibility. Signed-off-by: spn <snwchd71@users.noreply.github.com>

snwchd71 force-pushed the fix/mps-fp8-dequantize branch from 636c256 to 2d427c4 Compare April 4, 2026 20:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix FP8 dequantization on MPS by falling back to CPU#34

Fix FP8 dequantization on MPS by falling back to CPU#34
snwchd71 wants to merge 1 commit intoComfy-Org:mainfrom
snwchd71:fix/mps-fp8-dequantize

snwchd71 commented Apr 4, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

snwchd71 commented Apr 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Prior work

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

snwchd71 commented Apr 4, 2026 •

edited

Loading