Skip to content
View dhruvthakur2000's full-sized avatar
๐Ÿ 
building from the mountains
๐Ÿ 
building from the mountains

Block or report dhruvthakur2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please donโ€™t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
dhruvthakur2000/README.md

Typing SVG


๐ŸŒŒ About Me

I publicly engineer real-world AI systems from scratch โ€” documenting decisions, failures, tradeoffs, and mental models along the way.


๐Ÿš€ Flagship Project โ€” SaleTech

Real-time AI voice sales agent. 40+ concurrent calls. Sub-600ms end-to-end latency. Zero vendor lock-in.

Component Technology Latency
Voice Activity Detection Silero VAD v5 + WebRTC (dual-fusion) 6ms
Speech Recognition Faster-Whisper Large-v3 (streaming) 150ms
Language Model Qwen-2.5-7B-Instruct + KV-cache reuse 320ms
Text-to-Speech Piper TTS (streaming, sentence-by-sentence) 100ms
End-to-End P95 ~580ms

What makes it different:

  • ๐Ÿ” KV-cache reuse across turns โ€” 40% LLM latency reduction by serializing attention tensors to Redis
  • ๐ŸŽค Adaptive end-of-turn detection โ€” learns speaking pace per session; 420msโ€“720ms dynamic silence threshold
  • โšก Barge-in handling โ€” multi-signal fusion detects interruptions in <200ms, stops TTS mid-sentence
  • ๐Ÿงฑ Redis-backed session persistence โ€” full state, KV-cache, metrics per session; horizontal scaling ready
  • โ˜ธ๏ธ Kubernetes-ready โ€” StatefulSet, HPA auto-scaling, PSTN via Asterisk/FreeSWITCH

Stack: Python FastAPI AsyncIO WebSockets Silero Faster-Whisper PyTorch Redis Docker Kubernetes

โ†’ github.com/dhruvthakur2000/SaleTech


๐Ÿ”ญ Projects

Project What it does Stack
SaleTech Production real-time AI voice sales agent โ€” 40+ concurrent calls, <600ms latency, full open-source pipeline Python ยท FastAPI ยท Silero ยท Faster-Whisper ยท Qwen ยท Piper ยท Redis ยท K8s
linux_driver_eval CLI framework to benchmark how well LLMs write Linux kernel device driver code. Two pipelines: generation + evaluation. Weighted scoring across correctness, security, quality, performance Python ยท GCC ยท Together API ยท Static Analysis
virtual-voicebot Streamlit voice assistant with persona-aware responses โ€” the project that started my obsession with real-time audio pipelines Python ยท Streamlit ยท Groq ยท Whisper ยท LLaMA ยท TTS
๐Ÿ”œ HomeAssist (planned) Smarter Alexa โ€” always-on edge voice assistant using SaleTech's VAD + ASR + buffer layers. Wake-word detection, local LLM, zero cloud dependency SaleTech core ยท Edge inference
๐Ÿ”œ SaleTech Analytics (planned) Call intelligence layer โ€” real-time sentiment, objection detection, sales stage classification per turn SaleTech core ยท NLP ยท Classification

๐Ÿง  Skills & Tech Stack

โš™๏ธ Languages & Tools

Python C Git Linux Bash VSCode

๐Ÿ“Š ML & Data Science

NumPy Pandas Scikit-learn Seaborn Matplotlib PyTorch

๐Ÿค– GenAI & LLM Stack

LangChain LangGraph Hugging Face ChromaDB QLoRA RAG AI Agents Multi-Agent Prompt Engineering Function Calling Fine-tuning

๐ŸŽ™๏ธ Voice AI (Specialist Stack)

Silero VAD Faster Whisper Piper TTS WebRTC VAD Streaming ASR

๐ŸŒ Backend & APIs

FastAPI REST API WebSockets Redis AsyncIO

โ˜๏ธ Infrastructure & DevOps

Docker Kubernetes GitHub Actions MLOps


๐Ÿ“ Pins & Badges

An image of @dhruvthakur2000's Holopin badges, which is a link to view their full Holopin profile


โœ๏ธ Writing

I write about real engineering decisions โ€” not tutorials copied from docs.

Post Platform
๐Ÿ“ก VAD: Voice Activity Detection โ€” how it actually works Hashnode
๐Ÿ” Understanding the Attention Mechanism: The Heart of the Transformer Revolution Medium
๐Ÿชต Structured Logging in Python: A Practical Guide for Production Systems Medium

๐Ÿ“Š GitHub Stats

Dhruv's GitHub Stats

Top Languages

GitHub Streak


๐Ÿค Connect With Me

LinkedIn X / Twitter Medium Hashnode Email




Open to ML Engineer / AI Engineer / Backend Python roles โ€” remote or on-site.

Popular repositories Loading

  1. DDoS-Attack-Detection-and-Mitigation DDoS-Attack-Detection-and-Mitigation Public

    Forked from chiragbiradar/DDoS-Attack-Detection-and-Mitigation

    SDN networks (Software Defined Networking ) are exposed to new security threats and attacks, especially Distributed Denial of Service (DDoS) attacks. For this aim, we have proposed a model able to โ€ฆ

    Python 1

  2. SaleTech SaleTech Public

    **A multi-user, low-latency, AI-powered voice sales agent** capable of handling **20+ concurrent real-time conversations** using a **central LLM (Quen) on an L40 GPU**, deployable by **real organizโ€ฆ

    Python 1

  3. content-engine content-engine Public

    โ€œIโ€™m building a production-grade multi-agent AI system that converts engineering work into structured contentโ€”with evals and guardrails.โ€

    Python 1

  4. dhruvthakur2000 dhruvthakur2000 Public

    Config files for my GitHub profile.

  5. salary_proj salary_proj Public

    repository for my first project i.e salary prediction of a data scientist

  6. RTA RTA Public

    Jupyter Notebook