[Diffusion] Intel XPU: runtime integration for distributed and stream management by xiangyuT · Pull Request #2 · analytics-zoo/sglang-diffusion

xiangyuT · 2026-04-16T07:08:04Z

Motivation

This PR builds on the XPU platform foundation merged in sgl-project#17920, adding the runtime-level changes needed to actually run diffusion inference on Intel XPU (Arc Pro B-series, etc.) with tensor parallelism support.

sgl-project#17920 added the platform detection (XpuPlatform), attention backend (xpu_backend.py), platform plugin registration, and basic sgl_kernel integration. This PR addresses the remaining gaps discovered during end-to-end testing on Intel Arc Pro B60 GPUs.

Test Results

Tested on Intel Arc Pro B60 with Z-Image-Turbo (BF16, 9-step turbo schedule, prompt="A golden retriever in the snow"):

TP=1:

TP=2:

Notes

All changes are gated behind current_platform.is_xpu() checks, so the CUDA/ROCm/NPU paths are unaffected.
The XCCL workarounds (AVG, all-to-all, batch P2P) address known issues that are being tracked upstream in PyTorch; they can be removed once those issues are fixed.
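
To illustrate the kind of gated workaround the notes above refer to, here is a minimal sketch of emulating an AVG reduction with SUM followed by a division by the world size. It is a pure-Python simulation, not the PR's code: the function names, the `is_xpu` flag, and the list-of-lists representation of per-rank data are all hypothetical, and whether the PR implements its AVG workaround this way is an assumption.

```python
from typing import List

Ranks = List[List[float]]  # hypothetical: one list of values per simulated rank


def all_reduce_sum(per_rank: Ranks) -> Ranks:
    """Simulate all-reduce SUM: every rank receives the elementwise sum."""
    total = [sum(vals) for vals in zip(*per_rank)]
    return [list(total) for _ in per_rank]


def all_reduce_avg_native(per_rank: Ranks) -> Ranks:
    """Stand-in for a backend that supports AVG directly."""
    world_size = len(per_rank)
    mean = [sum(vals) / world_size for vals in zip(*per_rank)]
    return [list(mean) for _ in per_rank]


def all_reduce_avg(per_rank: Ranks, is_xpu: bool) -> Ranks:
    """Gate the workaround behind a platform flag (assumed, for illustration)."""
    if not is_xpu:
        # Other platforms keep their existing path untouched.
        return all_reduce_avg_native(per_rank)
    # Workaround path: SUM across ranks, then divide locally by world size.
    world_size = len(per_rank)
    summed = all_reduce_sum(per_rank)
    return [[v / world_size for v in rank_vals] for rank_vals in summed]


data = [[1.0, 2.0], [3.0, 6.0]]  # two simulated ranks
result_xpu = all_reduce_avg(data, is_xpu=True)
result_other = all_reduce_avg(data, is_xpu=False)
```

Both paths return the same averaged values on every rank, which is what makes the gated branch easy to delete once the backend supports AVG natively.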

The all_to_all_4D method calls self._maybe_wait() on the output of ft_c.all_to_all_single, but _maybe_wait was never defined, causing an AttributeError at runtime during multi-GPU inference.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
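
A helper of the kind this commit message describes could look roughly like the sketch below. It is not the PR's implementation: `maybe_wait` and `FakeAsyncResult` are hypothetical names, and the assumption is that an asynchronous collective result exposes a `wait()` method returning the finished value, while an already-materialized result does not.

```python
from typing import Any


class FakeAsyncResult:
    """Hypothetical stand-in for an async collective output with a wait() method."""

    def __init__(self, value: Any) -> None:
        self._value = value
        self.waited = False

    def wait(self) -> Any:
        # Mark completion and hand back the underlying value.
        self.waited = True
        return self._value


def maybe_wait(result: Any) -> Any:
    """Return the completed value, waiting only if the object supports it."""
    wait = getattr(result, "wait", None)
    if callable(wait):
        return wait()
    # Already a plain value: nothing to synchronize.
    return result


pending = FakeAsyncResult([1, 2, 3])
resolved = maybe_wait(pending)
passthrough = maybe_wait(42)
```

Using `getattr` with a default keeps the call safe for both asynchronous and plain results, so a caller such as all_to_all_4D would not need to know which kind it received.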

xiangyuT force-pushed the xpu_dev branch 2 times, most recently from c4dd99c to 370cf47 Compare May 14, 2026 00:57

xiangyuT and others added 3 commits May 14, 2026 01:04

XPU diffusion support: squashed xpu_0122 changes

f0f21bb

fix: remove duplicate xpu_platform_plugin (already in upstream)

40216f2

upstream main already has xpu_platform_plugin function and "xpu" entry in builtin_platform_plugins dict since PR sgl-project#17920.

xiangyuT wants to merge 3 commits into main from xpu_dev
