Hybrid RAG System

This project is a Retrieval-Augmented Generation (RAG) system implemented using Python, LangChain, and the DeepSeek R1 model. It combines traditional retrieval techniques (BM25) with modern dense embeddings (FAISS) to build a highly efficient document retrieval and question-answering system.

Features

Current Features

Hybrid Retrieval: Combines BM25 and FAISS for robust and accurate document retrieval.
Multi-PDF Support: Users can upload multiple PDF files for processing, which are stored in a dedicated data/ directory.
Streamlit UI: A user-friendly interface for uploading files, asking questions, and viewing results.
Tracing and Analytics: Integrated tracing with LangSmith to analyze performance and monitor usage.
Custom LLM Integration: Uses the DeepSeek R1 model (via Ollama) for question answering.
Dynamic Context Handling: Automatically handles and prepares context for queries.

Planned Features

Enhanced Memory Features

Context-Aware Memory: Implement dynamic context retention to remember the flow of conversations.
- Framework: LangChain's updated ConversationBufferMemory or EntityMemory.
- Use Case: Retaining context across multiple user queries for more coherent interactions.
User-Specific Memory: Allow memory reset or persistence for different users.
- Framework: Redis or PostgreSQL for memory persistence across sessions.

File Management

Support for Multiple File Formats:
- Additional Formats: Microsoft Word, CSV, and image files (via Tesseract or Amazon Textract for OCR).
- Library: python-docx for Word, pandas for CSV, and pytesseract or AWS Textract for images.
File Listing UI:
- Feature: A sidebar UI for managing uploaded files (view/delete).
- Library: Streamlit components (st.sidebar and st.selectbox).

Advanced Tracing and Analytics

Usage Analytics:
- Framework: LangSmith or OpenTelemetry.
- Metrics: Number of queries, response times, and user feedback.
Error Logging:
- Framework: Sentry or Python's built-in logging library.
- Storage: Centralized logs for troubleshooting.

API Integration

REST API:
- Framework: FastAPI for building a RESTful API.
- Use Case: Exposing functionalities for external applications.

Security Enhancements

Access Control:
- Framework: FastAPI Users for authentication and role-based access.
- Feature: Secure endpoints for API access.
Data Encryption:
- Library: cryptography for encrypting files and query results.

Enhanced Retrieval

Advanced Retrieval Techniques:
- Framework: DPR (Dense Passage Retrieval) using Hugging Face models.
- Improvement: Replace FAISS with Weaviate or Milvus for better vector storage and search.
Semantic Clustering:
- Library: Scikit-learn for clustering similar documents.

Performance Optimization

Parallel Processing:
- Framework: concurrent.futures or multiprocessing.
- Use Case: Faster processing of large files.
Caching System:
- Library: Redis or Memcached for caching embeddings and document chunks.

Feedback System

User Feedback Loop:
- Framework: Streamlit widgets for rating responses.
- Use Case: Improve system accuracy with user feedback.
Interactive Debugging:
- Feature: Flag incorrect answers directly from the UI.

Testing and CI/CD

Automated Testing:
- Framework: Pytest for unit and integration tests.
- CI Tool: GitHub Actions for continuous integration.
Continuous Deployment:
- Tool: Docker and AWS CodePipeline for seamless updates.

Community Engagement

Knowledge Base:
- Platform: GitBook or ReadTheDocs for user and developer documentation.
- Content: Guides, FAQs, and tutorials.
Open Source Contribution:
- Platform: GitHub for hosting and collaboration.
- Feature: Contributor guidelines and issues for community involvement.

Installation

Prerequisites

Python 3.9+
Pip
Ollama installed (installation guide).

Steps

Clone the repository:

git clone https://github.com/your-repo/hybrid-rag.git
cd hybrid-rag

Create and activate a virtual environment:

python -m venv rag_env
source rag_env/bin/activate  # On Windows: rag_env\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```

Install Ollama and pull the DeepSeek R1 model:

# Install Ollama
brew install ollama  # macOS

# Pull the DeepSeek R1 model
ollama pull deepseek-r1:1.5b

Run the Streamlit app:
```
streamlit run app.py
```

Usage

Upload one or more PDF files via the Streamlit UI.
Ask questions based on the uploaded documents.
View responses and source documents.

Contributing

We welcome contributions! Please check the Contributing Guidelines.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
app.py		app.py
callback_handler.py		callback_handler.py
model.py		model.py
requirements.txt		requirements.txt
retriever.py		retriever.py
test_retriever.py		test_retriever.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hybrid RAG System

Features

Current Features

Planned Features

Enhanced Memory Features

File Management

Advanced Tracing and Analytics

API Integration

Security Enhancements

Enhanced Retrieval

Performance Optimization

Feedback System

Testing and CI/CD

Community Engagement

Installation

Prerequisites

Steps

Usage

Contributing

License

References

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Hybrid RAG System

Features

Current Features

Planned Features

Enhanced Memory Features

File Management

Advanced Tracing and Analytics

API Integration

Security Enhancements

Enhanced Retrieval

Performance Optimization

Feedback System

Testing and CI/CD

Community Engagement

Installation

Prerequisites

Steps

Usage

Contributing

License

References

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages