
This repo hosts community-maintained recipes for running vLLM, answering the question: how do I run model X on hardware Y for task Z?

## Guides

- Arcee AI
- DeepSeek
- Ernie
- GLM
- Google
- inclusionAI
- InternVL
- InternLM
- Jina AI
- Llama
- Microsoft
- MiniMax
- Xiaomi MiMo
- Mistral AI
- Moonshotai
- NVIDIA
- OpenAI
- PaddlePaddle
- Qwen
- Seed
- StepFun
- Tencent-Hunyuan

## Contributing

New recipes live as structured YAML at models/<hf_org>/<hf_repo>.yaml and render on recipes.vllm.ai. See CONTRIBUTING.md for the full schema, VRAM formula, and validation steps.
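As a rough illustration, a recipe file might look like the sketch below. The field names (`model`, `hardware`, `serve`, `notes`) are assumptions for illustration only; the authoritative schema and the VRAM formula are defined in CONTRIBUTING.md.

```yaml
# Hypothetical sketch of models/<hf_org>/<hf_repo>.yaml.
# Field names are illustrative, not the actual schema -- see CONTRIBUTING.md.
model: meta-llama/Llama-3.1-8B-Instruct
hardware: 1x NVIDIA H100 (80 GB)
task: text-generation
serve:
  command: >
    vllm serve meta-llama/Llama-3.1-8B-Instruct
    --max-model-len 8192
notes: |
  Budget VRAM for weights plus KV cache using the formula in CONTRIBUTING.md.
```

Running `node scripts/build-recipes-api.mjs` validates every YAML file against the real schema, so any deviation from it is caught before the site rebuilds.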

Quick loop:

```shell
pnpm install
pnpm dev                              # http://localhost:3000
node scripts/build-recipes-api.mjs    # validates every YAML + rebuilds the JSON API
```

## Legacy MkDocs guides

The top-level Markdown directories (DeepSeek/, Qwen/, etc.) are the historical MkDocs site, kept as a reference during the YAML migration. To preview them:

```shell
uv venv
source .venv/bin/activate
uv pip install -r requirements.txt
uv run mkdocs serve --dev-addr 127.0.0.1:8001
```

## License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.