Siva010/README.md


LinkedIn GitHub Gmail


🧠 About Me

class SaiSivaSrinivas:
    def __init__(self):
        self.name   = "Sai Siva Srinivas Munduru"
        self.role   = ["AI Engineer", "Full Stack Developer", "Java Backend Developer"]
        self.focus  = [
            "Production-grade LLM applications & RAG pipelines",
            "Hybrid search • Vector DBs • RAGAS-driven evaluation",
            "Real-time SaaS platforms with WebSocket & Redis",
            "Distributed Java microservices with Kafka & Spring Boot"
        ]

    def current_stack(self):
        return {
            "ai_ml"   : ["LangChain", "LlamaIndex", "LangGraph", "CrewAI", "AutoGen", "RAGAS"],
            "backend" : ["FastAPI", "Spring Boot", "Node.js", "Express.js"],
            "frontend": ["Next.js", "React", "TypeScript", "Tailwind CSS", "ShadCN UI"],
            "infra"   : ["Docker", "AWS", "Redis", "Kafka", "Pinecone", "PostgreSQL"],
        }

    def fun_fact(self):
        return "My RAG pipeline hit faithfulness ≥ 0.91 before I hit my morning coffee ☕"

🚀 Featured Projects

⚡ Production RAG Platform

Python FastAPI Pinecone Redis RAGAS Docker

- Hybrid search RAG backend (vector + BM25) with ~38% improvement in context recall
- Redis caching cut latency from ~1.2 s → 180 ms (85% reduction)
- RAGAS scores: faithfulness ≥ 0.91 · answer relevancy ≥ 0.88
- Deployed on AWS EC2 with Docker Compose · JWT-ready · streaming responses
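
How the vector and BM25 results are merged is not stated here. One common option is reciprocal rank fusion (RRF), sketched below as an illustration only. The function name, the document IDs, and the constant `k=60` are assumptions, not details taken from the project.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document earns 1 / (k + rank) from every list it appears in;
    k dampens the influence of top ranks (60 is a commonly used default).
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so the output is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from the two retrievers (best match first).
vector_hits = ["doc_a", "doc_b", "doc_c"]
bm25_hits = ["doc_b", "doc_d", "doc_a"]

fused = reciprocal_rank_fusion([vector_hits, bm25_hits])
print(fused)
```

Documents that rank well in both lists (`doc_b`, `doc_a`) rise above documents that only one retriever found, which is the behaviour a hybrid search aims for.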


🤝 Multi-User Collaboration App

Next.js TypeScript FastAPI WebSockets Redis

- Real-time SaaS supporting 50+ concurrent users · live updates in < 50 ms
- PostgreSQL schema optimisation → query times reduced by ~45%
- RBAC (Owner/Editor/Viewer) · CI/CD via GitHub Actions · < 5 min build-to-deploy
- 🌐 Live Demo
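
A minimal sketch of how a three-level RBAC check (Owner/Editor/Viewer) could look. The role names come from the list above; the action names and the `REQUIRED_ROLE` mapping are hypothetical and only illustrate the idea of ordered roles.

```python
from enum import IntEnum


class Role(IntEnum):
    """Roles ordered by privilege, so a higher role includes the lower ones."""
    VIEWER = 1
    EDITOR = 2
    OWNER = 3


# Hypothetical mapping from an action to the minimum role that may perform it.
REQUIRED_ROLE = {
    "read": Role.VIEWER,
    "edit": Role.EDITOR,
    "delete_workspace": Role.OWNER,
}


def is_allowed(role, action):
    """Return True if `role` meets the minimum role required for `action`."""
    required = REQUIRED_ROLE.get(action)
    if required is None:
        return False  # deny by default for unknown actions
    return role >= required


print(is_allowed(Role.EDITOR, "edit"))              # editor may edit
print(is_allowed(Role.VIEWER, "delete_workspace"))  # viewer may not delete
```

Denying unknown actions by default keeps a forgotten mapping entry from silently granting access.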


⚙️ Microservices Rate Limiting Platform

Java Spring Boot Kafka Redis Docker

- Distributed rate limiter handling 10,000+ req/s with < 2 ms overhead
- Token bucket + sliding-window · circuit-breaker cut downstream failures by ~60%
- 85%+ JUnit 5 + Mockito test coverage · Kafka violation streaming · Docker Compose
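
For readers unfamiliar with the token bucket algorithm named above, here is a minimal single-process sketch. It is written in Python for consistency with the snippet in About Me, while the project itself is listed as Java. It is not the project's implementation: a distributed limiter would keep this state in a shared store such as Redis. The class name, parameters, and injectable `clock` are assumptions made for illustration.

```python
import time


class TokenBucket:
    """Allow bursts up to `capacity` requests, refilled at `refill_rate` tokens/second."""

    def __init__(self, capacity, refill_rate, clock=time.monotonic):
        self.capacity = float(capacity)
        self.refill_rate = float(refill_rate)
        self.clock = clock  # injectable so the behaviour can be tested without sleeping
        self.tokens = float(capacity)  # start with a full bucket
        self.last_refill = clock()

    def allow(self, cost=1.0):
        """Consume `cost` tokens if available and report whether the request may proceed."""
        now = self.clock()
        elapsed = now - self.last_refill
        # Add the tokens earned since the last call, never exceeding capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_rate)
        self.last_refill = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


# Usage with a fake clock: two requests fit the burst, the third is rejected.
fake_now = [0.0]
bucket = TokenBucket(capacity=2, refill_rate=1.0, clock=lambda: fake_now[0])
results = [bucket.allow(), bucket.allow(), bucket.allow()]
fake_now[0] = 1.0  # one second later, one token has been refilled
results.append(bucket.allow())
print(results)
```

The bucket tolerates short bursts up to `capacity` while holding the long-run rate to `refill_rate`.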


🛠️ Tech Stack

🤖 AI / ML

LangChain LlamaIndex LangGraph CrewAI PyTorch Hugging Face

🌐 Frontend

Next.js React TypeScript Tailwind

⚙️ Backend

FastAPI Spring Boot Node.js Java Python

🗄️ Databases & Infra

PostgreSQL MongoDB Redis Pinecone Kafka Docker AWS


🏅 Contribution Graph


🏆 Certifications

| 🏅 Certification | 🏢 Issuer | 📅 Year |
| --- | --- | --- |
| IBM Generative AI Engineering Specialization | IBM | 2026 |
| Oracle Cloud Infrastructure — Generative AI Professional | Oracle | 2025 |
| Google Cloud Digital Leader | Google Cloud | 2026 |

💬 Ask me about

LLM Applications · RAG Pipelines · Hybrid Search · RAGAS Evaluation · FastAPI · Spring Boot · Real-time Systems · Distributed Architecture

📫 Reach me

LinkedIn GitHub Email



"First, solve the problem. Then, write the code." — John Johnson

Pinned

  1. Intelligent-RAG-ChatBot-for-creators (Public)

    Precisely designed production-grade RAG (Retrieval-Augmented Generation) application built to compare social media video transcripts and performance metrics, isolate viral hook psychology, and supp…

    Python 1

  2. collab-app (Public)

    TypeScript

  3. rate-limiter (Public)

    Python