Skip to content
#

fault-tolerant-systems

Here are 4 public repositories matching this topic...

Language: All
Filter by language

InfoAgent is a production-ready Python AI agent for real-time weather and news. Built for resilience, it features structured logging, retry logic, and a modular design—essential for scalable, observable, and fault-tolerant AI systems.

  • Updated Mar 21, 2026
  • Python

Real-time budget monitoring system built on a PostgreSQL state machine. Tracks project burn rates, detects HEALTHY → WARNING → CRITICAL transitions, and fires webhook alerts only on state changes eliminating duplicate alert noise. Idempotent writes, DLQ on API failure.

  • Updated Apr 25, 2026

Build a production-style AI system that ingests logs and metrics, detects anomalies, and uses an LLM to summarize incidents and suggest likely root causes, with observability and reliability in mind.

  • Updated Apr 22, 2026
  • Java

Improve this page

Add a description, image, and links to the fault-tolerant-systems topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the fault-tolerant-systems topic, visit your repo's landing page and select "manage topics."

Learn more