Ground PoC of ComputeFollowsPower (IDEA-001): dispatch latency-tolerant jobs to the cheapest-power node within their latency budget and verdict whether the power arbitrage beats the movement cost.
-
Updated
Jun 7, 2026 - Python
Ground PoC of ComputeFollowsPower (IDEA-001): dispatch latency-tolerant jobs to the cheapest-power node within their latency budget and verdict whether the power arbitrage beats the movement cost.
Production-grade AI latency budgeting and reactive scaling framework for LLM inference systems. Covers p50/p95/p99 modeling, SLO design, Kubernetes (K8s) HPA patterns, and distributed AI infrastructure. By Vipin Kumar
Add a description, image, and links to the latency-budget topic page so that developers can more easily learn about it.
To associate your repository with the latency-budget topic, visit your repo's landing page and select "manage topics."