Skip to content

update vllm patch to v0.14.0-b8.3 and pin transformers==5.8.0#430

Open
gc-fu wants to merge 1 commit into
mainfrom
update-vllm-patch-b8.3-0525
Open

update vllm patch to v0.14.0-b8.3 and pin transformers==5.8.0#430
gc-fu wants to merge 1 commit into
mainfrom
update-vllm-patch-b8.3-0525

Conversation

@gc-fu
Copy link
Copy Markdown
Contributor

@gc-fu gc-fu commented May 25, 2026

Summary

  • Regenerate vllm_for_multi_arc.patch from intel-sandbox/llm-scaler-vllm-xpu branch v0.14.0-b8.3 (commit d03ae58)
  • Pin transformers==5.8.0 in Dockerfile instead of installing from git main, to avoid version incompatibility issues

Test plan

  • Build Docker image with updated patch and verify it completes successfully
  • Run vLLM serving with the new image and verify model loading works
  • Confirm transformers version is 5.8.0 inside container

🤖 Generated with Claude Code

- Regenerate vllm_for_multi_arc.patch from intel-sandbox/llm-scaler-vllm-xpu
  branch v0.14.0-b8.3 (commit d03ae58)
- Pin transformers to 5.8.0 instead of installing from git main to avoid
  version incompatibility issues

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant