xPyD-hub

Lightweight, rapidly deployable Prefill-Decode (PD) proxy for LLM serving — minimal setup, minimal maintenance.

Projects

xPyD-proxy Rapidly deployable PD proxy server — scheduling, health monitoring, and dynamic instance management for prefill/decode instances.
xPyD-bench Benchmarking tool — measure throughput, latency, and TTFT against xPyD proxy.
xPyD-plan PD ratio planner — recommend optimal Prefill:Decode node allocation from benchmark data or dataset analysis.

About

xPyD-proxy implements a two-phase serving pattern:

Prefill — KV cache preparation on dedicated prefill nodes
Decode — autoregressive token generation on decode nodes

The proxy handles round-robin / load-balanced scheduling, health checks, and hot-reload of instance configs. Designed for local dev and lightweight deployment.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xPyD-hub

xPyD-hub

Projects

About

Pinned Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!