Skip to content

aws-samples/sample-aws-mcp-rca

AWS Root Cause Analysis (RCA) with MCP Integration

Overview

This project provides an intelligent AWS Root Cause Analysis (RCA) system that leverages Amazon Bedrock's AI capabilities to automatically analyze AWS incidents and infrastructure issues. The system integrates with Kiro IDE through the Model Context Protocol (MCP), enabling developers to perform sophisticated incident analysis directly from their development environment.

The solution combines real-time AWS service monitoring, intelligent log analysis, and AI-powered root cause identification to help teams quickly diagnose and resolve infrastructure problems. By integrating with AWS services like Amazon CloudWatch Logs, AWS CloudTrail, and AWS Health, the system provides comprehensive visibility into your AWS environment while offering actionable insights for incident resolution.

Key features include:

  • Automated Incident Analysis: AI-powered analysis of AWS incidents using Amazon Bedrock
  • Multi-Service Integration: Seamless integration with Amazon CloudWatch Logs, AWS CloudTrail, and AWS Health
  • Real-time Monitoring: Continuous monitoring of AWS resources and services
  • MCP Integration: Direct integration with Kiro IDE for streamlined developer workflows
  • Secure API Gateway: Production-ready Amazon API Gateway with AWS Identity and Access Management (IAM) authentication
  • Scalable Architecture: Serverless AWS Lambda-based architecture for cost-effective scaling

Architecture

Architecture Diagram

The system follows a serverless, event-driven architecture designed for scalability and reliability:

Core Components

  1. MCP Client (Kiro IDE Integration)

    • Provides seamless integration with Kiro IDE
    • Handles authentication and request routing
    • Offers intuitive tools for incident analysis
  2. Amazon API Gateway

    • Secure entry point with AWS Identity and Access Management (IAM) authentication
    • Request validation and rate limiting
    • Routes requests to appropriate AWS Lambda functions
  3. AWS Lambda Functions

    • Serverless compute for incident analysis
    • Integrates with multiple AWS services
    • Powered by Amazon Bedrock for AI analysis
  4. AWS Service Integration

    • Amazon CloudWatch Logs: Real-time log collection and analysis
    • AWS CloudTrail: API call tracking and audit trails
    • AWS Health: Service health and incident notifications
    • Amazon Bedrock: AI-powered analysis and recommendations
  5. Data Flow

    • Incident data collection from multiple AWS sources
    • Real-time processing and correlation
    • AI analysis using Amazon Bedrock
    • Structured response delivery to Kiro IDE

The architecture helps achieve high availability, security, and cost-effectiveness while providing rapid incident analysis capabilities.

Solution Deployment

For complete deployment instructions, including prerequisites, configuration, and step-by-step setup, please refer to:

Deployment Guide

The deployment guide covers:

  • Environment setup and prerequisites
  • AWS infrastructure deployment using CDK
  • MCP client configuration for Kiro IDE
  • Testing and validation procedures
  • Troubleshooting common issues

Conclusion

This AWS Root Cause Analysis system represents a significant advancement in infrastructure monitoring and incident response capabilities. By combining the power of Amazon Bedrock's AI with comprehensive AWS service integration, the solution enables teams to:

  • Reduce Mean Time to Resolution (MTTR) through automated analysis and intelligent recommendations
  • Improve Operational Efficiency by streamlining the incident investigation process
  • Enhance Developer Experience with seamless Kiro IDE integration
  • Maintain Security Standards through proper AWS Identity and Access Management (IAM) authentication and secure API design
  • Scale Cost-Effectively using serverless architecture that adapts to demand

The MCP integration delivers this powerful analysis capability readily available within the developer's existing workflow, making incident response faster and more effective. As AWS environments continue to grow in complexity, this solution provides the intelligent automation needed to maintain operational excellence.

For support, additional documentation, or to contribute to the project, please refer to the individual component documentation in their respective directories.

About

AWS Root Cause Analysis (RCA) with MCP Integration

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published