ai-rag-cli (accuracy-first, self-hosted RAG — CLI-first)

Local, offline-friendly Retrieval-Augmented Generation over PDFs/Excel/CSV with:

Hybrid retrieval (BM25 + dense embeddings) + cross-encoder reranker
Page-level citations and structured JSON/table answers
Confidence gating + "unknown" path on low evidence
Evaluation harness for exact-match/citation checks
CLI-first; optional REST API to support a future desktop app

Stack: OpenSearch (BM25 + k-NN vectors), sentence-transformers (BGE embeddings + reranker), FastAPI (optional API), Ollama (local LLM).

Quick start

Start infra (OpenSearch + Ollama) with Docker (see below).
rag create-index
Put files into input_docs/ then rag ingest input_docs/
Ask a question: rag query "Show Q1 revenue by product; include citations." --format table
Evaluate: add questions to eval/questions.yaml then rag eval eval/questions.yaml

Why CLI first?

Focus on accuracy and determinism. Same core powers a desktop UI later (Electron/Tauri calling the local REST API).

License

MIT

Running locally

Infra

docker compose up -d

Windows PowerShell:

Invoke-WebRequest -Uri "http://localhost:11434/api/pull" -Method POST -Body '{"name":"llama3.2:1b"}' -ContentType "application/json"

Linux/macOS/bash:

curl http://localhost:11434/api/pull -d '{"name":"llama3.2:1b"}'

Install & init

python -m pip install -U pip
pip install -e .
rag init
rag create-index

Ingest & query

# put docs into input_docs/ first
rag ingest input_docs/
rag query "Show Q1 revenue by product; include citations." --format table

Evaluate

# edit eval/questions.yaml to your dataset
rag eval eval/questions.yaml

REST API (for future desktop UI)

rag serve  # then POST to http://127.0.0.1:8000/query

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
app_api		app_api
docker		docker
eval		eval
rag_cli		rag_cli
rag_core		rag_core
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
QUICK_START.md		QUICK_START.md
README.md		README.md
SETUP.md		SETUP.md
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
test.Dockerfile		test.Dockerfile

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ai-rag-cli (accuracy-first, self-hosted RAG — CLI-first)

Quick start

Why CLI first?

License

Running locally

Infra

Install & init

Ingest & query

Evaluate

REST API (for future desktop UI)

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ai-rag-cli (accuracy-first, self-hosted RAG — CLI-first)

Quick start

Why CLI first?

License

Running locally

Infra

Install & init

Ingest & query

Evaluate

REST API (for future desktop UI)

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages