Skip to content
View swapnildahiphale's full-sized avatar

Block or report swapnildahiphale

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
swapnildahiphale/README.md

Swapnil Dahiphale

SRE & AI Engineer building resilient systems at scale. Creator of OpenSRE — an AI-powered SRE platform with episodic memory and knowledge graph for automated incident investigation.

About

I'm an SRE & AI Engineer with 10+ years of experience in infrastructure, platform engineering, and production reliability. I build systems that combine site reliability engineering with AI to automate what SRE teams do manually — from incident investigation to root cause analysis.

Currently building OpenSRE, an open-source AI SRE platform that learns from every production incident using episodic memory and Neo4j knowledge graphs.

Projects

  • OpenSRE — AI SRE platform with episodic memory and knowledge graph. Investigates production incidents, correlates alerts, analyzes logs, and finds root causes automatically. 46 production skills, multi-provider LLM support, Slack/Teams integration. Website | Live Demo

  • Portfolio — Personal portfolio and blog

Expertise

  • Site Reliability Engineering — production incident response, observability, SLO/SLI design, on-call operations
  • AI Engineering — LLM agents, LangGraph orchestration, episodic memory systems, RAG pipelines
  • Platform Engineering — Kubernetes, Terraform, ArgoCD, CI/CD pipelines
  • Cloud Infrastructure — AWS, GCP, infrastructure as code, containerization
  • Observability — Prometheus, Grafana, Elasticsearch, Datadog, PagerDuty

Connect

Pinned Loading

  1. OpenSRE OpenSRE Public

    Open-source AI SRE agent that investigates production incidents using episodic memory and Neo4j knowledge graph. 46 production skills. Self-hosted.

    Python 24 4

  2. first-principles-thinking-skill first-principles-thinking-skill Public

    First principles thinking for coding agents. Challenge assumptions, find ground truths, build solutions from bedrock. Works with Claude Code, Cursor, Codex, OpenCode, Gemini CLI.

    1

  3. claude-code-jarvis-hooks claude-code-jarvis-hooks Public

    Claude Code hooks that talk

    Python 5

  4. portfolio-website portfolio-website Public

    Source code for my portfolio website: swapnil.one

    TypeScript

  5. liquibase-argocd-demo liquibase-argocd-demo Public

    Argo CD to automate Database changes workflow with Liquibase

  6. atlas-argocd-demo atlas-argocd-demo Public

    Forked from rotemtam/atlas-argocd-demo

    A demo repo for deploying Atlas Operator + ArgoCD apps