[fp8] Select SGLang FP8 block quant kernel to match inference by yueming-yuan · Pull Request #1182 · radixark/miles

yueming-yuan · 2026-05-22T21:22:57Z

No description provided.

gemini-code-assist

Code Review

This pull request integrates the per_block_cast_to_fp8 function from the sglang library into the FP8 quantization workflow. It introduces a new internal helper, _blockwise_cast_to_fp8, which conditionally utilizes the sglang implementation when the weight block size is (128, 128) and falls back to the existing Triton-based kernel otherwise. Additionally, the sglang utility is safely imported with a fallback to None to ensure compatibility. I have no feedback to provide as there were no review comments to evaluate.

yueming-yuan requested review from Zhichenzzz, fzyzcjy, maocheng23 and yushengsu-thu as code owners May 22, 2026 21:22

yueming-yuan force-pushed the fp8-quant-kernel-selection branch from bcd5394 to 4aee1fc Compare May 22, 2026 21:25

Select SGLang FP8 block quant kernel

b6ccb11

yueming-yuan force-pushed the fp8-quant-kernel-selection branch from 4aee1fc to b6ccb11 Compare May 22, 2026 21:26

yueming-yuan changed the title ~~Select SGLang FP8 block quant kernel~~ [fp8] Select SGLang FP8 block quant kernel to match inference May 22, 2026

gemini-code-assist Bot reviewed May 22, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[fp8] Select SGLang FP8 block quant kernel to match inference#1182

[fp8] Select SGLang FP8 block quant kernel to match inference#1182
yueming-yuan wants to merge 1 commit into
radixark:mainfrom
yueming-yuan:fp8-quant-kernel-selection

yueming-yuan commented May 22, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

yueming-yuan commented May 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

yueming-yuan commented May 22, 2026 •

edited

Loading