You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Sample code for two-tier LLM inference (cascading) on Amazon Bedrock, using a single Bedrock Converse API surface. NVIDIA Nemotron Nano handles routine support-ticket classification on every request; Anthropic Claude Sonnet handles cases the routing logic flags as harder. Includes a bake-off harness, sample data, and tests. Next.js + TypeScript.