A powerful Retrieval-Augmented Generation (RAG) chatbot that enables intelligent conversations with your PDF documents. Built with FastAPI, ChromaDB, and Groq's lightning-fast LLM inference, this application extracts context from PDFs and provides accurate, source-backed answers to your questions.
- 📄 PDF Document Upload - Upload and process PDF files
- 🔍 Intelligent Text Extraction - Extracts and chunks text from PDFs using PyMuPDF
- 🧠 Vector Embeddings - Stores document chunks as embeddings in ChromaDB
- ⚡ Lightning-Fast Responses - Powered by Groq's high-performance LLM (Llama 3.1)
- 🎯 Context-Aware Answers - Retrieves relevant document chunks before generating responses
- 🔗 Source Citations - Returns the source chunks used to generate each answer
- 🚀 RESTful API - Easy-to-use FastAPI endpoints for integration
```
┌─────────────┐      ┌──────────────┐      ┌─────────────┐
│  PDF File   │─────▶│ Text Extract │─────▶│  Chunking   │
└─────────────┘      └──────────────┘      └─────────────┘
                                                  │
                                                  ▼
┌─────────────┐      ┌──────────────┐      ┌─────────────┐
│   Answer    │◀─────│   Groq LLM   │◀─────│  ChromaDB   │
└─────────────┘      └──────────────┘      └─────────────┘
                            ▲                     ▲
                            │                     │
                     ┌──────┴─────────────────────┘
                     │  Context Retrieval
                     │
              ┌──────┴──────┐
              │  Question   │
              └─────────────┘
```
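The flow above can be sketched as plain Python. This is a dependency-free stand-in, not the project's actual code: real retrieval uses ChromaDB with sentence-transformer embeddings, and the assembled prompt is sent to Groq's LLM rather than printed. Here chunk relevance is approximated with simple word overlap so the example runs on its own.

```python
def score(chunk: str, question: str) -> int:
    # Relevance proxy: count words shared between chunk and question.
    return len(set(question.lower().split()) & set(chunk.lower().split()))

def retrieve(chunks: list[str], question: str, top_k: int = 2) -> list[str]:
    # Rank all chunks by relevance and keep the best top_k.
    return sorted(chunks, key=lambda c: score(c, question), reverse=True)[:top_k]

def build_prompt(context: list[str], question: str) -> str:
    # Context first, then the question: what the LLM would receive.
    return "Context:\n" + "\n".join(context) + f"\n\nQuestion: {question}"

chunks = [
    "ChromaDB stores vector embeddings of document chunks.",
    "Groq serves Llama 3.1 with very low latency.",
    "PyMuPDF extracts text from PDF pages.",
]
question = "How are embeddings stored?"
print(build_prompt(retrieve(chunks, question), question))
```

The key design point is that the LLM never sees the whole document, only the few chunks most relevant to the question, which keeps prompts short and answers grounded in the retrieved sources.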
- Python 3.8 or higher
- A Groq API key (create one in the Groq console)
```bash
git clone https://github.com/Rafay-AABS/rag-chatbot-groq.git
cd rag-chatbot-groq
```

```bash
# Windows
python -m venv venv
venv\Scripts\activate

# Linux/Mac
python3 -m venv venv
source venv/bin/activate
```

```bash
pip install -r requirements.txt
```

Create a `.env` file in the root directory:

```
GROQ_API_KEY=your_groq_api_key_here
```

```bash
uvicorn main:app --reload
```

The API will be available at http://localhost:8000
Once the server is running, visit:
- Interactive API Docs: http://localhost:8000/docs
- Alternative Docs: http://localhost:8000/redoc
`POST /upload`

Upload a PDF file to be processed and stored.

Request:

```bash
curl -X POST "http://localhost:8000/upload" \
  -H "Content-Type: multipart/form-data" \
  -F "file=@your_document.pdf"
```

Response:

```json
{
  "document_id": "uuid-string",
  "pages_extracted": 10,
  "chunks_created": 45,
  "chunks_stored": 45
}
```

`POST /chat`

Ask questions about an uploaded document.

Request:

```bash
curl -X POST "http://localhost:8000/chat" \
  -H "Content-Type: application/json" \
  -d '{
    "question": "What is the main topic of the document?",
    "document_id": "uuid-from-upload"
  }'
```

Response:

```json
{
  "answer": "The main topic of the document is...",
  "sources": [
    "chunk of text 1...",
    "chunk of text 2...",
    "chunk of text 3..."
  ]
}
```

| Component | Technology |
|---|---|
| Web Framework | FastAPI |
| LLM Provider | Groq (Llama 3.1) |
| Vector Database | ChromaDB |
| PDF Processing | PyMuPDF (fitz) |
| Embeddings | Sentence Transformers |
| Language | Python 3.8+ |
```
rag-chatbot-groq/
├── main.py                # FastAPI application entry point
├── requirements.txt       # Python dependencies
├── .env                   # Environment variables (create this)
├── README.md              # Project documentation
├── data/
│   ├── uploads/           # Uploaded PDF files storage
│   └── chroma_db/         # ChromaDB vector database
└── utils/
    ├── pdf_utils.py       # PDF text extraction and chunking
    ├── embedding_utils.py # Vector embedding and storage
    └── rag_pipeline.py    # RAG query and response generation
```
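To illustrate the role `utils/embedding_utils.py` plays, here is a toy in-memory vector store. This is a schematic stand-in only: the project itself uses ChromaDB for persistent storage and Sentence Transformers to produce the vectors, whereas the vectors below are hand-made.

```python
import math

# Maps chunk id -> embedding vector (hand-made here for illustration).
store: dict[str, list[float]] = {}

def add_chunk(chunk_id: str, vector: list[float]) -> None:
    # Store a chunk's embedding under its id.
    store[chunk_id] = vector

def cosine(a: list[float], b: list[float]) -> float:
    # Cosine similarity: dot product over the product of magnitudes.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def query(vector: list[float], top_k: int = 1) -> list[str]:
    # Return the ids of the top_k most similar stored chunks.
    ranked = sorted(store, key=lambda cid: cosine(store[cid], vector), reverse=True)
    return ranked[:top_k]

add_chunk("chunk-1", [1.0, 0.0])
add_chunk("chunk-2", [0.0, 1.0])
print(query([0.9, 0.1]))  # ['chunk-1']
```

ChromaDB performs the same similarity search at scale and persists the index to disk (the `data/chroma_db/` folder above).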
Modify in `utils/pdf_utils.py`:

```python
def chunk_text(text: str, max_length=1000, overlap=100):
    # max_length: maximum characters per chunk
    # overlap: overlapping characters between chunks
```

Modify in `utils/rag_pipeline.py`:

```python
def ask_question(question: str, document_id: str, top_k=3):
    # top_k: number of relevant chunks to retrieve
```

Change the Groq model in `utils/rag_pipeline.py`:

```python
response = client.chat.completions.create(
    model="llama-3.1-8b-instant",  # Change model here
    messages=[{"role": "user", "content": prompt}]
)
```

Available models: `llama-3.1-8b-instant`, `llama-3.1-70b-versatile`, `mixtral-8x7b-32768`, etc.
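For intuition about how `max_length` and `overlap` interact, here is one plausible character-based implementation of `chunk_text` (the repository's actual implementation may differ): each chunk starts `max_length - overlap` characters after the previous one, so the last `overlap` characters of a chunk reappear at the start of the next, preserving context across chunk boundaries.

```python
def chunk_text(text: str, max_length: int = 1000, overlap: int = 100) -> list[str]:
    """Split text into chunks of at most max_length characters,
    with `overlap` characters shared between consecutive chunks."""
    if max_length <= overlap:
        raise ValueError("max_length must be greater than overlap")
    step = max_length - overlap  # how far each new chunk advances
    chunks = []
    for start in range(0, len(text), step):
        chunk = text[start:start + max_length]
        if chunk:
            chunks.append(chunk)
    return chunks

# A 2500-character text with the defaults yields chunks starting at
# positions 0, 900, and 1800.
print(len(chunk_text("a" * 2500)))  # 3
```

Larger `overlap` values reduce the chance that a sentence relevant to a query is split across a chunk boundary, at the cost of storing (and embedding) more redundant text.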
```python
import requests

# Upload PDF
with open("document.pdf", "rb") as f:
    response = requests.post(
        "http://localhost:8000/upload",
        files={"file": f}
    )
doc_id = response.json()["document_id"]

# Ask question
response = requests.post(
    "http://localhost:8000/chat",
    json={
        "question": "What are the key findings?",
        "document_id": doc_id
    }
)
print(response.json()["answer"])
```

- Ensure the `document_id` from upload matches the one used in chat
- Check that the PDF contains extractable text (not just images)
- Verify your API key in `.env`
- Check your API rate limits on the Groq console
- Ensure you have internet connectivity
- Delete the `data/chroma_db` folder and restart
- Ensure sufficient disk space
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (`git checkout -b feature/AmazingFeature`)
- Commit your changes (`git commit -m 'Add some AmazingFeature'`)
- Push to the branch (`git push origin feature/AmazingFeature`)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Groq for providing ultra-fast LLM inference
- ChromaDB for the vector database
- FastAPI for the excellent web framework
- PyMuPDF for PDF processing
Rafay AABS - GitHub
Project Link: https://github.com/Rafay-AABS/rag-chatbot-groq
⭐ If you find this project helpful, please give it a star!