Skip to content
View PranavShashidhara's full-sized avatar

Block or report PranavShashidhara

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
PranavShashidhara/README.md

👋 Hi, I'm Pranav

🎓 MS in Data Science · 🤖 Building agentic LLM systems · ⚙️ GPU kernels to production pipelines

I build systems that go from data → model → deployment → real-world use. Currently finishing my MS at UMD (GPA 3.97) while shipping agentic AI systems at MTech Ventures — AutoGen reasoning loops, MCP tool servers, RAG pipelines, and LLM infrastructure for real users.

🧠 What I Work On

  • Agentic LLM Systems — AutoGen, MCP servers, tool orchestration, reasoning loops
  • LLM Fine-tuning & Inference — QLoRA, PEFT, vLLM, multi-GPU deployment
  • RAG Pipelines — grounded, production-ready retrieval systems
  • CUDA & GPU Systems — kernel optimization, memory hierarchy, edge GPU benchmarking
  • Generative Models — diffusion models, segmentation-guided synthesis
  • MLOps & Infrastructure — Docker, Kubernetes, ONNX, CI/CD for ML

🚀 What You'll Find Here

  • Agentic AI systems with real users in the loop
  • LLM fine-tuning and distributed inference pipelines
  • CUDA kernel benchmarks and GPU systems experiments
  • RAG architectures and multimodal LLM apps
  • Dockerized, cloud-deployed, production-ready ML systems

🛠 Tech I Use

  • ML / AI · Python · PyTorch · HuggingFace · LangChain · AutoGen · vLLM · PEFT
  • Systems · CUDA · C++ · Docker · Kubernetes · AWS · Terraform · GitHub Actions
  • Data · SQL · PySpark · Pandas · Pinecone · MLflow

📊 GitHub Activity

📫 Connect

LinkedIn

Pinned Loading

  1. Movie-Recommendation-system Movie-Recommendation-system Public

    This project focuses on developing a recommendation system utilizing various learning techniques, including collaborative filtering, matrix factorization, and restricted Boltzmann machines (RBMs).

    Jupyter Notebook 1

  2. MediAssist_AI MediAssist_AI Public

    Offline-capable, multilingual voice-based medical assistant using Claude 3.5, BioGPT, Whisper, and RAG. Built for reliability in low-connectivity settings.

    Python

  3. Seg_diffusion Seg_diffusion Public

    Segmentation-guided diffusion models for controllable brain MRI synthesis using BraTS 2021. Supports counterfactual generation via Mask-Ablated Training (MAT) to modify or remove tumors, evaluated …

    Jupyter Notebook 1 1

  4. llm-orchestration-stack llm-orchestration-stack Public

    End-to-end platform for fine-tuning and deploying Llama-3.1 8B-Instruct on SQL generation using QLoRA, 4-bit quantization, and multi-GPU inference. Focuses on memory-efficient, high-throughput LLM …

    Jupyter Notebook

  5. multilingual_toxicity_classification multilingual_toxicity_classification Public

    A multilingual NLP system using XLM-RoBERTa-Large for multi-task classification that detects toxic content and harmful intent across 15+ languages with binary toxicity and multi-label intent heads.…

    Jupyter Notebook 1

  6. cuda-memory-hierarchy-benchmarks cuda-memory-hierarchy-benchmarks Public

    Roofline-driven CUDA microbenchmarking framework for studying memory-bound vs compute-bound transitions on edge GPUs (Jetson Orin Nano, SM87), comparing hand-written GEMM kernels, Tensor Core imple…

    Cuda