Skip to content
View MadsDoodle's full-sized avatar

Block or report MadsDoodle

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MadsDoodle/README.md

Madhav S. Baidya

AI Researcher | LLM Systems | Multimodal Reasoning | Knowledge Graphs

Madhav Banner


About

Final-year undergraduate at IIT (BHU), Varanasi working at the intersection of LLMs, multimodal reasoning, and knowledge-grounded systems. My work focuses on building epistemically reliable AI systems, combining structured representations (knowledge graphs) with retrieval and generation pipelines.

I am particularly interested in:

  • Decision-aware retrieval and generation (Decision RAG)
  • Multi-agent LLM systems for reasoning and control
  • Multimodal information extraction and grounding
  • Robustness, evaluation, and failure analysis of LLM systems

Research Interests

  • Large Language Models (LLMs) and Agentic Systems
  • Multimodal Learning (Text, Vision, Audio)
  • Knowledge Graphs and Neuro-Symbolic AI
  • Retrieval-Augmented Generation (RAG) and Decision RAG
  • AI Robustness, Hallucination Mitigation, and Evaluation

Research & Publications

  • PassiveQA: A Three-Action Framework for Epistemically Calibrated Question Answering via Supervised Finetuning
    arXiv

    • Introduced a planner-driven multi-agent system with ASK / ANSWER / ABSTAIN routing
    • Designed a decision RAG mechanism over a knowledge graph with query-guided edge weighting
    • Incorporated explicit variable injection (?var) to model missing information and enable structured multi-hop reasoning
    • Fine-tuned a Mistral-7B planner (LoRA) on a graph-grounded dataset for improved abstention and reduced hallucination
  • Detecting the Machine: A Comprehensive Benchmark of AI-Generated Text Detectors Across Architectures, Domains, and Adversarial Conditions
    arXiv

    • Built a large-scale evaluation framework across transformers, CNNs, stylometric models, and LLM-based detectors
    • Evaluated under domain shift, cross-LLM generalization, and adversarial humanization
    • Identified systemic limitations such as detector–generator coupling and robustness failure

Selected Work

  • Multimodal Knowledge Graph System
    RDF-based graph construction (RDFlib + SPARQL) from unstructured documents with hybrid retrieval (symbolic + vector) for multi-hop QA

  • Agentic Research Assistant (LangGraph)
    Multi-agent pipeline for automated paper retrieval, parsing, and synthesis across scientific sources

  • SPAWN: Spoken Environment World Modeling
    Benchmark framework for evaluating spatial reasoning in multimodal LLMs from spoken/textual inputs, including tasks such as map reconstruction, relational inference, and navigation planning under noisy and ambiguous conditions

  • SQLPilot (NL → SQL Compiler)
    Schema-aware query generation with validation and execution across MySQL + SQLite backends

  • Multimodal PDF RAG System
    CLIP-based image embeddings + LLM summarization for joint text-image retrieval


Technical Stack

AI / ML PyTorch • TensorFlow • Representation Learning • Multimodal Learning

LLMs & Agents LangChain • LangGraph • OpenAI • Mistral • Tool-Augmented Agents

Retrieval & Knowledge Systems Qdrant • Knowledge Graphs (RDF, SPARQL, RDFlib) • Sentence Transformers

Systems FastAPI • Docker • Async Systems • Microservices • DVC


Activity


Links


Note

Focused on building systems that know when they know, and when they do not.

Pinned Loading

  1. Tabular-RAG-from-scratch Tabular-RAG-from-scratch Public

    RAG implemented from scratch without using LangChain and LangGraph - designed specifically for processing and querying PDF documents with advanced support for visual content like tables, charts, an…

    Python 10

  2. Gen-Sites-No-code-website-builder-using-LangGraph Gen-Sites-No-code-website-builder-using-LangGraph Public

    A no-code website builder using LangGraph nodes and LangChain tools with a beautiful frontend implementation.. (still in development)

    TypeScript 8 1

  3. Detecting-the-Machine-A-Comprehensive-Benchmark-of-AI-Generated-Text-Detectors-Across-Architectures Detecting-the-Machine-A-Comprehensive-Benchmark-of-AI-Generated-Text-Detectors-Across-Architectures Public

    This project aims to address this gap by conducting a systematic, controlled study of human versus LLM-generated text detectability using paired question–answer datasets. Rather than proposing a no…

    Jupyter Notebook 5

  4. PassiveQA PassiveQA Public

    PassiveQA is a novel question-answering framework that teaches language models when to answer, ask for clarification, or refuse to answer - instead of always attempting to generate responses.

    Jupyter Notebook 5

  5. Knowledge-Graph-using-RDF-LLM Knowledge-Graph-using-RDF-LLM Public

    The Knowledge Graph System is a comprehensive AI-powered pipeline for extracting entities and relationships from unstructured documents (PDF, TXT, DOCX) to build structured knowledge graphs. It lev…

    TypeScript 1

  6. GreenCRM GreenCRM Public

    GreenCRM is a next-generation, AI-driven CRM platform designed to automate the heavy lifting of sales and lead management. By placing Artificial Intelligence at the core of the workflow, GreenCRM t…

    TypeScript 1