Heman10x-NGU/README.md

Hey, I'm Hemant

Backend / distributed systems engineer at BNY Mellon, working on high-throughput event pipelines, financial data systems, and reliable workflow-driven services.

I build around three themes:

  • LLM reliability and agent guardrails - evals, calibration, logprobs, structured outputs, MCP-style workflows
  • Distributed systems and backend infrastructure - Kafka, gRPC, Redis, queues, caches, concurrency, observability
  • Developer tools - small, practical tools that help engineers validate, debug, or ship faster

Currently exploring: LLM evaluation, hallucination risk scoring, agent preflight checks, MCP integrations, and production-grade distributed systems.


Featured Projects

Hallucination Sentinel - LLM Output Risk Scoring

Open-source Python toolkit that scores LLM output risk using calibrated token entropy. It implements the Calibrated Entropy Score (CES) method from arXiv:2605.28264 and is designed for developers building RAG, agent, and batch QA systems.

  • CLI + Python API for scoring entropy sequences and provider outputs
  • Calibration workflow, thresholding, evaluation harness, and provider logprob smoke tests
  • Agent/RAG guardrail examples with clear limitations: this is a risk signal, not a truth oracle
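
To make the idea of an entropy-based risk signal concrete, here is a minimal sketch in Python. It is not the CES method from the cited paper and not the toolkit's API: it computes plain Shannon entropy over each token's top-k logprobs and compares the sequence mean against a threshold. The function names, the renormalisation over top-k, and the threshold value are all assumptions made for illustration.

```python
import math


def token_entropy(top_logprobs):
    """Shannon entropy (in nats) of one token position from its top-k logprobs.

    Assumption: probabilities are renormalised over the top-k candidates,
    so any probability mass outside the top-k is ignored.
    """
    probs = [math.exp(lp) for lp in top_logprobs]
    total = sum(probs)
    return -sum((p / total) * math.log(p / total) for p in probs if p > 0)


def mean_entropy(sequence):
    """Average per-token entropy over a generated sequence."""
    if not sequence:
        raise ValueError("sequence must contain at least one token")
    return sum(token_entropy(tok) for tok in sequence) / len(sequence)


def is_risky(sequence, threshold):
    """Flag a sequence whose mean entropy exceeds a (hypothetical) threshold."""
    return mean_entropy(sequence) > threshold


# Two candidate tokens per position: one confident step, one coin flip.
confident = [math.log(0.99), math.log(0.01)]
uncertain = [math.log(0.5), math.log(0.5)]
print(is_risky([confident, confident], threshold=0.3))  # low entropy
print(is_risky([uncertain, uncertain], threshold=0.3))  # high entropy
```

In practice the threshold would come from a calibration set rather than being hand-picked, and a high score only says the model was uncertain, not that the output is wrong.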

ThreadGraph - Goroutine Leak & Deadlock Detector for Go

Static + dynamic analysis tool for finding goroutine leaks, deadlocks, and lock bugs in Go programs using execution traces.

  • Detects goroutine leaks and deadlock patterns without application instrumentation
  • Goroutine provenance tree, Tarjan SCC deadlock detection, go/ssa static analysis, CI baseline flags
  • Built for debugging real concurrency failures, not just toy examples
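
The Tarjan SCC step can be sketched as follows. This is a Python illustration, not ThreadGraph's Go code: it assumes a hypothetical wait-for graph already extracted from a trace, where an edge g1 -> g2 means goroutine g1 is blocked on something g2 holds. Any strongly connected component with more than one node, or a node waiting on itself, is reported as a deadlock candidate.

```python
def tarjan_scc(graph):
    """Return the strongly connected components of a directed graph.

    `graph` maps each node to an iterable of successor nodes.
    """
    index, lowlink = {}, {}
    stack, on_stack = [], set()
    sccs = []
    counter = 0

    def visit(v):
        nonlocal counter
        index[v] = lowlink[v] = counter
        counter += 1
        stack.append(v)
        on_stack.add(v)
        for w in graph.get(v, ()):
            if w not in index:
                visit(w)
                lowlink[v] = min(lowlink[v], lowlink[w])
            elif w in on_stack:
                lowlink[v] = min(lowlink[v], index[w])
        if lowlink[v] == index[v]:
            # v is the root of a component: pop it off the stack.
            component = []
            while True:
                w = stack.pop()
                on_stack.discard(w)
                component.append(w)
                if w == v:
                    break
            sccs.append(component)

    for v in list(graph):
        if v not in index:
            visit(v)
    return sccs


def deadlock_cycles(wait_for):
    """Keep only components that contain a cycle (size > 1 or a self-loop)."""
    return [
        sorted(c)
        for c in tarjan_scc(wait_for)
        if len(c) > 1 or c[0] in wait_for.get(c[0], ())
    ]


# Hypothetical wait-for graph: g1, g2, g3 wait on each other; g4 waits on g1.
wait_for = {"g1": ["g2"], "g2": ["g3"], "g3": ["g1"], "g4": ["g1"]}
print(deadlock_cycles(wait_for))
```

Here g4 is blocked too, but only as a victim of the cycle, so it is not part of the reported component. The recursive form is fine for small graphs; a large trace would want an iterative version to avoid Python's recursion limit.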

NexusCache - Distributed Cache in Go

Distributed caching system with etcd service discovery, gRPC communication, consistent hashing, singleflight deduplication, and Prometheus monitoring.

  • Thread-safe LRU cache with concurrent access
  • Consistent hashing and hot data replication across nodes
  • Benchmarked locally on a 3-node Docker cluster at 23K+ ops/sec with 713µs P50 latency
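
For readers unfamiliar with consistent hashing, the following Python sketch shows the core idea with virtual nodes. It is not NexusCache's Go implementation: the class name, the SHA-256 based hash, and the default of 100 virtual nodes per server are assumptions chosen for illustration.

```python
import bisect
import hashlib


class HashRing:
    """Consistent hash ring with virtual nodes (illustrative sketch)."""

    def __init__(self, nodes=(), vnodes=100):
        self.vnodes = vnodes
        self._keys = []    # sorted ring positions
        self._owners = {}  # ring position -> node name
        for node in nodes:
            self.add(node)

    @staticmethod
    def _hash(value):
        # First 8 bytes of SHA-256 as a 64-bit ring position.
        return int.from_bytes(hashlib.sha256(value.encode()).digest()[:8], "big")

    def add(self, node):
        for i in range(self.vnodes):
            pos = self._hash(f"{node}#{i}")
            if pos in self._owners:
                continue  # extremely unlikely collision: first owner wins
            bisect.insort(self._keys, pos)
            self._owners[pos] = node

    def remove(self, node):
        self._keys = [k for k in self._keys if self._owners[k] != node]
        self._owners = {k: v for k, v in self._owners.items() if v != node}

    def get(self, key):
        """Return the node owning `key`: first ring position clockwise."""
        if not self._keys:
            raise LookupError("ring is empty")
        i = bisect.bisect_right(self._keys, self._hash(key)) % len(self._keys)
        return self._owners[self._keys[i]]


ring = HashRing(["cache-a", "cache-b", "cache-c"])
print(ring.get("user:42"))
```

The property that matters for a cache is that removing a node only remaps the keys that node owned; keys on the surviving nodes stay where they were, so most of the cache remains warm.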

TitanQueue - Distributed Task Queue in Go

Task queue built with Redis and Go, designed around reliability and at-least-once delivery semantics.

  • Concurrent worker pool with lease-based ownership and automatic recovery
  • Priority queues, retries with exponential backoff, graceful shutdown
  • Web dashboard for queue and worker monitoring
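
Retries with exponential backoff can be sketched in a few lines of Python. This is not TitanQueue's Go code, and the base delay, growth factor, cap, and optional full jitter are assumed values for illustration rather than the project's actual policy.

```python
import random
import time


def backoff_delay(attempt, base=0.5, factor=2.0, cap=30.0):
    """Delay in seconds before retry number `attempt` (0-based), capped at `cap`."""
    return min(cap, base * factor ** attempt)


def run_with_retries(task, max_attempts=5, sleep=time.sleep, jitter=True):
    """Call `task` until it succeeds or `max_attempts` is exhausted.

    With jitter enabled, each wait is drawn uniformly from [0, delay]
    so that many workers retrying at once do not stay synchronised.
    """
    for attempt in range(max_attempts):
        try:
            return task()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the last error
            delay = backoff_delay(attempt)
            if jitter:
                delay = random.uniform(0, delay)
            sleep(delay)


# Usage: a task that fails twice, then succeeds. Sleeps are recorded, not waited.
state = {"calls": 0}


def flaky_task():
    state["calls"] += 1
    if state["calls"] < 3:
        raise RuntimeError("transient failure")
    return "done"


waits = []
print(run_with_retries(flaky_task, sleep=waits.append, jitter=False), waits)
```

Because delivery is at-least-once, a task may run more than once after a retry or a lease expiry, so handlers in such a system generally need to be idempotent.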

PhronAi - Voice-Powered AI Diagramming

Voice-controlled AI diagramming app that turns spoken architecture descriptions into system diagrams.

  • Django + React + Groq + Deepgram
  • Structured output validation with Pydantic / Instructor
  • Built around real-time AI workflow UX, not just chat

ListFix - Marketplace Listing Optimizer

Open-source tool for improving Facebook Marketplace listings from raw item descriptions.

  • Generates clearer titles, descriptions, and pricing suggestions
  • Practical AI utility for sellers and resellers
  • Small product-shaped project focused on immediate usefulness

Tech Stack

Languages: Go · Java · Python · C++ · TypeScript

Systems & infra: Kafka · gRPC · Redis · etcd · Temporal · Postgres · Elasticsearch · Docker

AI & LLM: OpenAI · Claude · Groq · Deepgram · Instructor · Pydantic · MCP · evals · logprobs

Patterns: Consistent hashing · Singleflight · At-least-once delivery · Lease-based ownership · Exponential backoff · Goroutines · Worker pools · Structured outputs


Building LLM reliability tools, AI agents, or real-time backend systems? Let's talk.


Connect

LinkedIn · Email



Pinned

  1. NexusCache (Public)

    High-performance distributed cache in Go with consistent hashing, singleflight, etcd discovery, and gRPC. 23K ops/sec, 713µs P50 latency (3-node Docker cluster).

    Go

  2. PhronAi (Public)

    AI SPEECH TO WHITEBOARD 🎤 Voice-powered AI diagramming platform. Speak naturally, get professional system diagrams instantly. Built with Django, React, Groq LLaMA 3.3, and Deepgram.

    TypeScript

  3. TitanQueue (Public)

    A production-ready distributed task queue built in Go with Redis, designed for reliability through at-least-once delivery semantics.

    Go

  4. hallucination-sentinel (Public)

    Single-pass CES entropy risk scoring for LLM outputs with calibration, provider logprobs, CLI, and agent/RAG guardrails.

    Python