AI-Odyssey: Blockchain Forensics & Token Analysis System

An advanced blockchain forensics platform for detecting suspicious token behaviors, mixer usage, wash trading, and Ponzi schemes on the Ethereum network in real-time.

🎯 Features

🔍 Token Analysis: Real-time ERC-20 token forensics with BitQuery integration
📊 Graph Visualization: Interactive force-directed network graphs with 50+ influential nodes
⚠️ Pattern Detection: Simultaneous detection of:
- Mixer/privacy pool usage (Tornado Cash patterns)
- Wash trading rings and circular transactions
- Ponzi scheme hierarchies
🎯 Risk Scoring: Comprehensive 40/40/10/10 weighted heuristic model
- 40% Fan-in analysis (incoming transactions)
- 40% Fan-out analysis (outgoing transactions)
- 10% Uniform denomination detection (Tornado Cash)
- 10% Temporal randomness analysis
📈 Real-time Status: Track analysis progress (0-100%) with live updates
💾 Export Results: Download analysis as CSV or JSON
🚀 Production Ready: <30 second analysis time, 99%+ uptime target

🚀 Quick Start

Prerequisites

Python 3.8+
Node.js 18+
Docker & Docker Compose (optional, for containerized deployment)
Git

Local Development (5 minutes)

1. Clone Repository

git clone https://github.com/ouemnaa/ai-odyssey.git
cd ai-odyssey

2. Backend Setup

cd backend

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Copy environment file
cp .env.example .env

# Edit .env with your BitQuery API key
# BITQUERY_API_KEY=your_api_key_here

# Run server
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

Backend will be available at http://localhost:8000

3. Frontend Setup

cd frontend

# Install dependencies
npm install

# Start development server
npm run dev

Frontend will be available at http://localhost:5173

4. Using Docker Compose (Alternative)

# From project root
docker-compose up

# Backend: http://localhost:8000
# Frontend: http://localhost:3000

🏗️ Architecture

High-Level Overview

┌─────────────────────────────────────────┐
│      Frontend (React + TypeScript)      │
│     • Interactive Graph Visualization   │
│     • Risk Dashboard & Metrics          │
│     • Token Input & Search              │
└────────────────┬────────────────────────┘
                 │ HTTP/JSON
                 ▼
┌─────────────────────────────────────────┐
│     Backend API (FastAPI + Python)      │
│  • POST /api/v1/analyze                 │
│  • GET /api/v1/analysis/{id}/status     │
│  • GET /api/v1/analysis/{id}            │
│  • GET /api/v1/analysis/{id}/export     │
└────────────────┬────────────────────────┘
                 │
     ┌─────────── ───────────┐
     ▼                       ▼
  ┌──────┐                ┌──────────┐
  │      │                │ Neo4j    │
  │Cache │                │Database│
  └──────┘                └──────────┘
     │
     ▼
┌─────────────────────────────────────────┐
│        Agent Layer (Python)             │
│  • First Flow: Mixer Detection Agent    │
│  • Second Flow: General Forensics Agent │
│  • Louvain Community Detection          │
│  • Risk Metrics Calculation             │
└────────────────┬────────────────────────┘
                 │
                 ▼
          ┌─────────────────┐
          │  BitQuery API   │
          │ (Ethereum Data) │
          └─────────────────┘

Component Details

Frontend (`frontend/`)

Framework: React 18 + TypeScript + Vite
UI Library: Radix UI + TailwindCSS
Visualization: Custom graph renderer with Framer Motion animations
State Management: React Hooks + Context API
API Client: Axios with polling for async operations

Key Components:

SearchSection.tsx - Token input and analysis submission
GraphVisualization.tsx - Interactive graph with zoom/pan
RiskDashboard.tsx - Risk metrics and statistics
NodeDetailsModal.tsx - Detailed wallet information

Backend (`backend/`)

Framework: FastAPI + Uvicorn (Python 3.8+)
Database: Neo4j for persistence and graph manipulation
Validation: Pydantic models
Async: AsyncIO for non-blocking operations

Key Modules:

api/routes/analysis.py - Main analysis endpoints
services/analysis_service.py - Orchestrates agent execution
utils/graph_converter.py - Converts agent output to frontend format
schemas/ - Data models (Pydantic)

Agents (`agent/`)

First Flow: Mixer Detection (`first-flow/mixer_mcp_tool.py`)

# Specialized mixer detection using behavioral heuristics
- detect_direct_mixer_addresses()     # Known mixer detection
- calculate_fan_in_score()             # Incoming tx analysis (40% weight)
- calculate_fan_out_score()            # Outgoing tx analysis (40% weight)
- calculate_uniform_denominations()   # Tornado denomination pattern (10%)
- calculate_temporal_randomness()     # Timing analysis (10%)

Tornado Cash Denominations Detected:

0.1 ETH
1 ETH
10 ETH
100 ETH

Second Flow: General Forensics (`second-flow/work.py`)

# Comprehensive token forensics and pattern detection
- fetch_real_transactions()            # BitQuery integration
- fetch_real_internal_transactions()  # Smart contract calls
- build_graph_from_real_data()        # NetworkX graph construction
- detect_all_clusters_real()          # Louvain community detection
- calculate_advanced_risk_metrics()   # Gini coefficient, PageRank, etc.

Pattern Detection:

Mixer clusters (fan-in/fan-out spikes)
Wash trading rings (circular transactions)
Ponzi hierarchies (centralized fund flows)

📁 Project Structure

ai-odyssey/
├── README.md                           # This file
├── PROJECT_REPORT.md                   # Comprehensive technical documentation
├── MVP_DEPLOYMENT.md                   # 4-week deployment guide
├── DEPLOYMENT_SUMMARY.md               # Quick reference
├── docker-compose.yml                  # Local development stack
│
├── backend/                            # FastAPI application
│   ├── app/
│   │   ├── main.py                    # Application entry point
│   │   ├── config.py                  # Configuration management
│   │   ├── api/
│   │   │   └── routes/
│   │   │       ├── analysis.py        # Analysis endpoints
│   │   │       └── health.py          # Health check
│   │   ├── models/
│   │   │   └── analysis.py            # SQLAlchemy models
│   │   ├── schemas/
│   │   │   ├── analysis.py            # Request/response schemas
│   │   │   ├── graph.py               # Graph data models
│   │   │   └── status.py              # Status schemas
│   │   ├── services/
│   │   │   ├── analysis_service.py    # Core analysis orchestration
│   │   │   └── export_service.py      # Export to CSV/JSON
│   │   └── utils/
│   │       └── graph_converter.py     # Agent output transformation
│   ├── requirements.txt                # Python dependencies
│   ├── Dockerfile                      # Container image
│   └── README.md                       # Backend documentation
│
├── frontend/                           # React application
│   ├── client/
│   │   ├── index.html
│   │   ├── src/
│   │   │   ├── App.tsx                # Root component
│   │   │   ├── main.tsx               # Entry point
│   │   │   ├── pages/
│   │   │   │   └── Home.tsx           # Main analysis page
│   │   │   ├── components/
│   │   │   │   ├── SearchSection.tsx
│   │   │   │   ├── GraphVisualization.tsx
│   │   │   │   ├── RiskDashboard.tsx
│   │   │   │   ├── NodeDetailsModal.tsx
│   │   │   │   └── AnalysisResults.tsx
│   │   │   ├── services/
│   │   │   │   └── analysisService.ts # API client
│   │   │   ├── contexts/
│   │   │   │   └── ThemeContext.tsx   # Dark/light mode
│   │   │   └── ui/                    # Radix UI components
│   │   └── public/
│   ├── package.json
│   ├── vite.config.ts
│   └── tsconfig.json
│
├── agent/                              # ML agents for forensics
│   ├── first-flow/
│   │   ├── mixer_mcp_tool.py          # Mixer detection agent (1830 lines)
│   │   └── queries.py                 # BitQuery GraphQL queries
│   └── second-flow/
│       ├── work.py                    # General forensics agent (1544 lines)
│       ├── work.md                    # Agent documentation
│       └── forensic_token_*.{csv,json} # Sample outputs
│
└── .github/
    └── workflows/
        └── deploy.yml                 # CI/CD pipeline (GitHub Actions)

💻 Development

Requirements

# Backend
python-3.8+
fastapi==0.104.1
uvicorn==0.24.0
pydantic==2.5.0
networkx==3.2
community-python==1.0.0
requests==2.31.0
redis==5.0.1
psycopg2-binary==2.9.9
sqlalchemy==2.0.23

# Frontend
node-18.x
npm-10.x
react==18.2.0
typescript==5.3.3
vite==5.0.8
tailwindcss==3.4.1

Running Tests

Backend

# Run all tests
pytest backend/

# Run specific test
pytest backend/test_graph_converter.py

# With coverage
pytest backend/ --cov=app

Frontend

# Type checking
npm run check

# Build check
npm run build

# Format code
npm run format

Code Style

Backend: PEP 8 (Python)

# Format
black backend/

# Lint
flake8 backend/

Frontend: Prettier + TypeScript

# Format
npm run format

# Type check
npm run check

🚀 Deployment

Local Development

# Using docker-compose (recommended)
docker-compose up

# Manually
# Terminal 1: Backend
cd backend && uvicorn app.main:app --reload

# Terminal 2: Frontend
cd frontend && npm run dev

# Terminal 3: Agents (optional, for manual testing)
cd agent/second-flow && python work.py

Production Deployment (MVP)

See MVP_DEPLOYMENT.md for complete 4-week deployment guide:

Week 1: Infrastructure (PostgreSQL, Redis, container registry)
Week 2: Backend (ECS Fargate, 3x FastAPI, 2x workers)
Week 3: Frontend (S3 + CloudFront CDN)
Week 4: Monitoring & go-live

Quick Cost Summary:

Monthly: ~$265
Setup: 4 weeks
Concurrent Users: 100-1,000
Uptime: 99%+

Environment Variables

Backend (.env):

# API Configuration
DEBUG=false
LOG_LEVEL=info
PORT=8000

# Database
Neo4j

# API Keys
BITQUERY_API_KEY=your_bitquery_key_here

# Analysis Settings
MAX_ANALYSIS_TIME=30s
MAX_TRANSACTIONS_PER_ANALYSIS=10000

Frontend (.env.local):

VITE_API_URL=http://localhost:8000
VITE_API_TIMEOUT=300000

📡 API Documentation

Base URL

Development: http://localhost:8000
Production: https://api.yourdomain.com

Authentication

Currently MVP (no auth required). Enterprise version will support JWT/OAuth2.

Main Endpoints

1. Submit Analysis

POST /api/v1/analyze
Content-Type: application/json

{
  "tokenAddress": "0x6982508145454ce325ddbe47a25d4ec3d2311933",
  "daysBack": 7,
  "sampleSize": 5000
}

Response (HTTP 202 Accepted):

{
  "analysisId": "550e8400-e29b-41d4-a716-446655440000",
  "status": "processing",
  "timestamp": "2025-12-06T10:30:00Z"
}

2. Check Status

GET /api/v1/analysis/{analysisId}/status

Response:

{
  "analysisId": "550e8400-e29b-41d4-a716-446655440000",
  "status": "detecting_patterns",
  "progress": 75,
  "currentStep": "Detecting wash trading patterns...",
  "startedAt": "2025-12-06T10:30:05Z"
}

Status Values: queued, fetching_data, building_graph, detecting_patterns, completed, failed

3. Get Results

GET /api/v1/analysis/{analysisId}

Response:

{
  "nodes": [
    {
      "id": "0x1234...abcd",
      "type": "wallet",
      "riskLevel": "high",
      "holdings": 1500000,
      "txCount": 425,
      "degree": 50
    }
  ],
  "links": [
    {
      "source": "0x1234...abcd",
      "target": "0x5678...efgh",
      "value": 50000,
      "txCount": 12
    }
  ],
  "riskScore": 78.5,
  "metrics": {
    "giniCoefficient": 0.82,
    "avgClusteringCoefficient": 0.34,
    "networkDensity": 0.12
  },
  "topInfluentialWallets": [...],
  "detectedCommunities": [...],
  "redFlags": [...]
}

4. Export Results

GET /api/v1/analysis/{analysisId}/export?format=csv
GET /api/v1/analysis/{analysisId}/export?format=json

5. Health Check

GET /health

Response:

{
  "status": "online",
  "timestamp": "2025-12-06T10:30:00Z"
}

Full API Documentation

Interactive API docs available at:

Swagger UI: http://localhost:8000/docs
ReDoc: http://localhost:8000/redoc

🧠 How It Works

Analysis Pipeline

1. User submits token address
              ↓
2. Backend validates input
              ↓
3. First Agent (Mixer Detection)
   ├─ Fetch 24h transactions
   ├─ Detect known mixer addresses
   ├─ Calculate heuristic scores
   └─ Return mixer confidence
              ↓
4. Second Agent (General Forensics)
   ├─ Fetch 7-day transaction history
   ├─ Fetch internal transactions
   ├─ Fetch token holders
   ├─ Build NetworkX directed graph
   ├─ Community detection (Louvain)
   ├─ Pattern detection (wash trading, Ponzi)
   └─ Calculate risk metrics
              ↓
5. Graph Converter transforms output
              ↓
6.      Results stored 
              ↓
7. Frontend polls and displays visualization

Forensic Heuristics

Risk Score Calculation (0-100)

risk_score = (
    0.40 * fan_in_score +
    0.40 * fan_out_score +
    0.10 * uniform_denominations_score +
    0.10 * temporal_randomness_score
) * 100

if tornado_denominations_detected:
    risk_score = min(100, risk_score + 20)

Risk Categories

Low (0-30): Normal trading patterns
Medium (30-60): Some suspicious indicators
High (60-80): Multiple red flags
Critical (80-100): Strong illicit activity indicators

Graph Metrics

Fan-In: Number of unique senders to address
Fan-Out: Number of unique receivers from address
Gini Coefficient: Wealth concentration measure
PageRank: Node influence in network
Clustering Coefficient: Local network density
Modularity: Community structure quality (>0.6 is strong)

📊 Data Sources

BitQuery GraphQL API: Real-time Ethereum transaction data
ERC-20 Token Transfers: Via standard transfer events
Internal Transactions: Smart contract interactions
Token Holders: Distribution analysis
Known Mixer List: Hardcoded Tornado Cash addresses

🔒 Security

Current State (MVP)

No authentication required (public API)
Rate limiting not implemented
Data stored in memory/local cache

Production Recommendations

Add JWT/OAuth2 authentication
Implement rate limiting (1000 req/min per IP)
Use HTTPS only
Encrypt sensitive data in transit
Rotate API keys regularly
Add CORS restrictions
Implement request signing

📈 Performance

Typical Analysis Times

Transaction Count	Analysis Time	Pattern Accuracy
1,000	5-8s	90%
5,000	12-18s	92%
10,000	20-28s	95%

Scalability Targets

MVP: 100-1,000 concurrent users
Scale 1: 5,000+ concurrent users (add read replicas, more workers)
Scale 2: 50,000+ concurrent users (multi-region deployment, K8s)

🐛 Troubleshooting

Backend Won't Start

# Check port 8000 is free
lsof -i :8000  # macOS/Linux
netstat -ano | findstr :8000  # Windows

# Clear Python cache
find . -type d -name __pycache__ -exec rm -r {} +
find . -name "*.pyc" -delete

# Reinstall dependencies
pip install --upgrade -r requirements.txt

Frontend Build Issues

# Clear node modules and cache
rm -rf node_modules pnpm-lock.yaml
npm install

# Clear Vite cache
rm -rf dist .vite
npm run dev

Redis Connection Failed

# Check Redis is running
redis-cli ping
# Should return: PONG

# Or with Docker
docker-compose up redis

BitQuery API Rate Limit

Error: "Rate limit exceeded"

Solution:
- Add exponential backoff (implemented in code)
- Wait 60 seconds before retry
- Consider upgrading BitQuery plan

📚 Documentation

PROJECT_REPORT.md: Comprehensive technical documentation
MVP_DEPLOYMENT.md: Week-by-week deployment guide
backend/README.md: Backend-specific documentation
API Docs: /docs (Swagger) or /redoc (ReDoc) endpoints

🤝 Contributing

Development Workflow

Create feature branch

git checkout -b feature/your-feature

Make changes and test

# Backend testing
cd backend && pytest

# Frontend testing
cd frontend && npm run check

Commit with clear messages

git commit -m "feat: add mixer detection improvement"

Push and create pull request

git push origin feature/your-feature

Commit Message Format

feat: add new feature
fix: fix bug
docs: update documentation
test: add tests
refactor: refactor code
perf: improve performance
ci: update CI/CD

📄 License

MIT License - see LICENSE file for details

👥 Authors

Project: AI-Odyssey Blockchain Forensics Team
Repository: https://github.com/ouemnaa/ai-odyssey

🙏 Acknowledgments

BitQuery for Ethereum data API
NetworkX for graph analysis
FastAPI for async framework
React community for UI components

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
agent		agent
backend		backend
frontend		frontend
README.md		README.md
docker-compose.yml		docker-compose.yml
run_first_flow_agent.bat		run_first_flow_agent.bat
test_mixer_api.py		test_mixer_api.py
test_mixer_backend.py		test_mixer_backend.py

Folders and files

Latest commit

History

Repository files navigation

AI-Odyssey: Blockchain Forensics & Token Analysis System

🎯 Features

📋 Table of Contents

🚀 Quick Start

Prerequisites

Local Development (5 minutes)

1. Clone Repository

2. Backend Setup

3. Frontend Setup

4. Using Docker Compose (Alternative)

🏗️ Architecture

High-Level Overview

Component Details

Frontend (frontend/)

Backend (backend/)

Agents (agent/)

First Flow: Mixer Detection (first-flow/mixer_mcp_tool.py)

Second Flow: General Forensics (second-flow/work.py)

📁 Project Structure

💻 Development

Requirements

Running Tests

Backend

Frontend

Code Style

🚀 Deployment

Local Development

Production Deployment (MVP)

Environment Variables

📡 API Documentation

Base URL

Authentication

Main Endpoints

1. Submit Analysis

2. Check Status

3. Get Results

4. Export Results

5. Health Check

Full API Documentation

🧠 How It Works

Analysis Pipeline

Forensic Heuristics

Risk Score Calculation (0-100)

Risk Categories

Graph Metrics

📊 Data Sources

🔒 Security

Current State (MVP)

Production Recommendations

📈 Performance

Typical Analysis Times

Scalability Targets

🐛 Troubleshooting

Backend Won't Start

Frontend Build Issues

Redis Connection Failed

BitQuery API Rate Limit

📚 Documentation

🤝 Contributing

Development Workflow

Commit Message Format

📄 License

👥 Authors

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Frontend (`frontend/`)

Backend (`backend/`)

Agents (`agent/`)

First Flow: Mixer Detection (`first-flow/mixer_mcp_tool.py`)

Second Flow: General Forensics (`second-flow/work.py`)

Packages