A tool for recovering and replaying missed events in an event-driven system, ensuring accurate recalculations without relying on traditional databases.
-
Updated
Feb 16, 2025 - Python
A tool for recovering and replaying missed events in an event-driven system, ensuring accurate recalculations without relying on traditional databases.
Automated chaos testing tool for evaluating cloud infrastructure reliability, network fault tolerance, and application security.
As a ServiceNow Admin and Jr Developer at Netflix, I built a semi-automated incident response system to help the DevOps engineer team quickly remediate failing AWS EC2 instances, protecting streaming quality for millions of viewers.
This repo demonstrates chaos engineering methodologies in modern distributed system using chaos monkey tool kit and spring boot dependency
Add a description, image, and links to the system-resilience topic page so that developers can more easily learn about it.
To associate your repository with the system-resilience topic, visit your repo's landing page and select "manage topics."