ReceiptAgent

AI-Powered Receipt Processing & Expense Management

What it does

I built this because manually logging receipts after business trips is genuinely painful. You upload a photo of any receipt, the AI reads it, pulls out the merchant name, total amount, date and every line item, scores it for fraud risk, and stores it all in a structured expense report. No typing required.

Live Demo

1. API is live and healthy

2. Create an expense report

3. Upload a receipt image

4. AI extracts everything from the image

The AI read the receipt image and returned:

Merchant: COFFEE HOUSE
Date: 2026-01-29
Total: $20.52 (Tax: $1.52)
Items: Latte $4.50, Muffin x2 $6.00, Sandwich $8.50
Fraud Score: 50/100 — Medium risk
Confidence: 95%

All saved to PostgreSQL automatically.

Architecture

                         +-------------+
  Upload receipt image   |   Django    |   POST /api/receipts/
  ---------------------->|   REST API  |----------------------.
                         +------+------+                      |
                                |                             v
                                |                     +-------+--------+
                                |  Celery task        |  Redis Broker  |
                                +-------------------->|                |
                                                      +-------+--------+
                                                              |
                                                              v
                                                   +----------+----------+
                                                   |  LangGraph Pipeline |
                                                   |                     |
                                                   |  load_image         |
                                                   |    -> extract_data  |  Llama 4 Scout (vision)
                                                   |    -> validate      |
                                                   |    -> fraud_check   |  Llama 3.3 70B
                                                   |    -> finalize      |
                                                   +----------+----------+
                                                              |
                                                              v
                                                      +-------+------+
                                                      |  PostgreSQL  |
                                                      +--------------+

How it works

Upload a receipt -> the system runs it through an AI pipeline:

Load Image -- reads and encodes the receipt photo
Extract Data -- Llama 4 vision model reads the image and returns structured JSON with merchant, items, totals
Validate -- checks amounts add up, date is valid, required fields exist
Fraud Check -- Llama 3 scores the receipt for suspicious patterns (round numbers, unusual merchants, missing info)
Save -- everything stored in PostgreSQL, report total updated

Routing is conditional: if validation has too many errors, the receipt goes to manual review. If the fraud score hits 70+, it gets flagged automatically. All processing runs in the background via Celery so the API responds instantly.

Run it locally

You need Docker. That's it.

git clone https://github.com/khansalman12/receipt-agent.git
cd receipt-agent

# Add your free Groq API key (get one at console.groq.com)
cp .env.example .env

# Start everything -- migrations run automatically
docker-compose up

Hit http://localhost:8000/api/health/ -- if you get {"status": "ok"} you're good.

Testing

The test suite covers models, API endpoints, serializer validation, pipeline routing, Celery tasks, edge cases, and the CLI. Tests never call real LLMs -- all AI calls are mocked.

# Run full suite
pytest api/tests/ -v

# Run with coverage
pytest api/tests/ --cov=api --cov-report=term-missing

# Run a specific file
pytest api/tests/test_tasks.py -v

Test files:

File	What it covers
`test_models.py`	Field defaults, UUID PKs, relationships, cascade delete
`test_views.py`	CRUD endpoints, status transitions, filtering, 404s
`test_serializers.py`	Image validation (size, type), read-only enforcement
`test_pipeline.py`	Graph routing logic, validation rules, boundary scores
`test_tasks.py`	Celery task lifecycle, mocked LLM, report flagging
`test_edge_cases.py`	Double approvals, empty DBs, invalid uploads, GIF rejection
`test_management.py`	CLI command output, filters, empty DB handling
`test_utils.py`	Multi-format date parsing, garbage input handling

CI runs automatically on every push via GitHub Actions (lint + test with coverage gate).

CLI

# View receipt processing stats
python manage.py receiptstats

# Filter by status
python manage.py receiptstats --status FLAGGED

# Limit to last 7 days
python manage.py receiptstats --days 7

API Endpoints

Expense Reports

Method	Endpoint	What it does
GET	`/api/reports/`	List all reports
POST	`/api/reports/`	Create a new report
GET	`/api/reports/{id}/`	Get one report with all receipts
POST	`/api/reports/{id}/approve/`	Approve it
POST	`/api/reports/{id}/reject/`	Reject it
POST	`/api/reports/{id}/flag/`	Flag for manual review
GET	`/api/reports/pending/`	Get all pending reports

Receipts

Method	Endpoint	What it does
GET	`/api/receipts/`	List all receipts
POST	`/api/receipts/`	Upload a receipt image
GET	`/api/receipts/{id}/`	Get receipt + AI extracted data
DELETE	`/api/receipts/{id}/`	Delete it

Tech Stack

Layer	What I used
API	Django 5, Django REST Framework
AI Pipeline	LangGraph, LangChain
LLM	Groq -- Llama 4 (vision) + Llama 3 (fraud)
Background Jobs	Celery + Redis
Database	PostgreSQL
Container	Docker, Docker Compose
Static Files	WhiteNoise
CI	GitHub Actions (lint + test + coverage)

Project Structure

receipt-agent/
├── api/
│   ├── models.py          -- ExpenseReport and Receipt models
│   ├── views.py           -- REST endpoints
│   ├── serializers.py     -- request/response shapes
│   ├── tasks.py           -- Celery async tasks
│   ├── ai/
│   │   ├── graph.py       -- LangGraph workflow
│   │   ├── nodes.py       -- extract, validate, fraud nodes
│   │   └── state.py       -- shared pipeline state
│   ├── management/
│   │   └── commands/
│   │       └── receiptstats.py  -- CLI reporting tool
│   └── tests/             -- 8 test files, 80+ test cases
├── config/                -- Django settings, Celery config
├── .github/workflows/     -- CI pipeline
├── docker-compose.yml     -- local dev stack
├── Dockerfile             -- production container
└── requirements.txt

Environment Variables

Variable	Description
`GROQ_API_KEY`	Free at console.groq.com
`SECRET_KEY`	Any random string in production
`DATABASE_URL`	PostgreSQL connection string
`REDIS_URL`	Redis connection string
`DEBUG`	Set to `0` in production

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
.github/workflows		.github/workflows
api		api
config		config
docs		docs
postman/globals		postman/globals
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
build.sh		build.sh
docker-compose.yml		docker-compose.yml
manage.py		manage.py
pytest.ini		pytest.ini
requirements.txt		requirements.txt
start.sh		start.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ReceiptAgent

What it does

Live Demo

1. API is live and healthy

2. Create an expense report

3. Upload a receipt image

4. AI extracts everything from the image

Architecture

How it works

Run it locally

Testing

CLI

API Endpoints

Expense Reports

Receipts

Tech Stack

Project Structure

Environment Variables

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ReceiptAgent

What it does

Live Demo

1. API is live and healthy

2. Create an expense report

3. Upload a receipt image

4. AI extracts everything from the image

Architecture

How it works

Run it locally

Testing

CLI

API Endpoints

Expense Reports

Receipts

Tech Stack

Project Structure

Environment Variables

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages