Sovereign Engine

Self-contained local AI inference platform. A Rust reverse proxy manages llama.cpp backend containers, provides OIDC authentication, fair-use scheduling, and a React dashboard — all in a single Docker image.

Version: 1.8.0

Features

OpenAI-compatible API — drop-in replacement at /v1/* for any client that speaks OpenAI
Multi-model support — load and manage multiple models with GPU memory-aware scheduling
Multimodal / vision — auto-detects and loads companion mmproj projectors for vision-capable GGUFs (e.g. Gemma 4)
Backend flexibility — llama.cpp with NVIDIA CUDA, AMD ROCm, or CPU-only
OIDC authentication — connect any identity provider; bootstrap mode for initial setup
API tokens — per-user, SHA-256 hashed, manageable via dashboard or API
Fair-use scheduler — per-user request queuing to prevent resource monopolisation
GPU reservation system — exclusive-use time windows with admin approval workflow
React dashboard — model management, user admin, usage metrics, HuggingFace model search
Open WebUI integration — trusted-header SSO, no separate auth configuration needed
TLS — manual certs or automatic provisioning via Let's Encrypt (ACME TLS-ALPN-01)
Network isolation — dual Docker network architecture keeps backends unreachable from the host

Architecture

                     ┌─────────────────────────────────────────────┐
                     │              sovereign-public               │
                     │            (host-facing bridge)             │
                     └────────────────────┬────────────────────────┘
                                          │
                Host ─────────────────► :3000 / :443
                                          │
                             ┌────────────┴────────────┐
                             │     Proxy (axum)        │
                             │                         │
                             │  api.* →                │
                             │    /v1/*    → OpenAI API│
                             │    /api/*   → Admin/User│
                             │    /portal/* → React SPA│
                             │  chat.* →               │
                             │    /*       → Open WebUI│
                             │                         │
                             │  [SQLite DB]            │
                             │  [React SPA static]     │
                             └────────────┬────────────┘
                                          │
                     ┌────────────────────┴────────────────────────┐
                     │            sovereign-internal               │
                     │        (isolated, backends only)            │
                     └───┬──────────────┬──────────────┬───────────┘
                         │              │              │
                  [backend :8080] [backend :8080] [backend :8080]

The proxy sits on both networks. Backend containers sit only on sovereign-internal and are never directly reachable from the host.

Quick Start (Docker)

Prerequisites

Docker + Docker Compose
GPU: NVIDIA (with nvidia-container-toolkit), AMD ROCm, or CPU-only

Docker Compose

Create a docker-compose.yml:

services:
  proxy:
    image: dragonhold2024/sovereign-engine:1.0.0
    ports:
      - "3000:3000"
    networks:
      - sovereign-public
      - sovereign-internal
    volumes:
      - /var/run/docker.sock:/var/run/docker.sock
      - ./data:/config
      - ./models:/models
    environment:
      - LISTEN_ADDR=0.0.0.0:3000
      - DATABASE_URL=sqlite:///config/sovereign.db
      - BOOTSTRAP_USER=admin
      - BOOTSTRAP_PASSWORD=changeme
      - BREAK_GLASS=true
      - MODEL_PATH=/models
      - BACKEND_NETWORK=sovereign-internal
    restart: unless-stopped

networks:
  sovereign-public:
    name: sovereign-public
    driver: bridge
  sovereign-internal:
    name: sovereign-internal
    driver: bridge
    internal: true

Then:

docker compose up -d

Open http://localhost:3000 for Open WebUI, or http://localhost:3000/portal for the admin dashboard. On the login page, use the "Break-glass / Admin emergency login" form (visible because BREAK_GLASS=true is set) to log in with admin / changeme — this issues a session cookie.

Production note: Set BREAK_GLASS=false after configuring an OIDC identity provider. Bootstrap credentials are intended for initial setup only.

Verify Installation

curl -s http://localhost:3000/api/user/health | jq .
# Expected: {"status":"ok"}

Next Steps

Download a model via the dashboard (HuggingFace search built in) or place GGUF files in ./models/
Load a model from the Models page
Configure an Identity Provider via the admin panel or /api/admin/idp
Create API tokens for programmatic access
Point any OpenAI-compatible client at http://localhost:3000/v1

Environment Variables

Variable	Default	Description
`LISTEN_ADDR`	`0.0.0.0:443`	Bind address for the proxy
`DATABASE_URL`	`sqlite:///config/sovereign.db`	SQLite database URL
`TLS_CERT_PATH`	(none)	Path to TLS certificate PEM file
`TLS_KEY_PATH`	(none)	Path to TLS private key PEM file
`ACME_CONTACT`	(none)	Contact email for ACME; enables Let's Encrypt provisioning for both hostnames
`ACME_STAGING`	`false`	Use Let's Encrypt staging environment
`BOOTSTRAP_USER`	(none)	Bootstrap admin username. Set with `BREAK_GLASS=true` to enable the break-glass login form.
`BOOTSTRAP_PASSWORD`	(none)	Bootstrap admin password. Set with `BREAK_GLASS=true` to enable the break-glass login form.
`BREAK_GLASS`	`false`	When `true` (with `BOOTSTRAP_USER` + `BOOTSTRAP_PASSWORD` set), enables a one-shot break-glass login form on the portal login page (`POST /auth/bootstrap-login`). Not HTTP Basic auth — credentials are submitted once via the form and a session cookie is issued. Disable after OIDC is configured.
`DOCKER_HOST`	`unix:///var/run/docker.sock`	Docker socket path
`MODEL_PATH`	`/models`	Model storage path (inside the container)
`MODEL_HOST_PATH`	(same as MODEL_PATH)	Host-side path for model bind mounts into child containers
`UI_PATH`	`/app/ui`	Path to static UI files
`API_HOSTNAME`	`localhost`	API subdomain hostname (e.g. `api.example.com`)
`CHAT_HOSTNAME`	`localhost`	Chat subdomain hostname (e.g. `chat.example.com`)
`COOKIE_DOMAIN`	(none)	Cookie domain for cross-subdomain sessions (e.g. `.example.com`)
`BACKEND_NETWORK`	`sovereign-internal`	Docker network for backend container isolation
`WEBUI_BACKEND_URL`	`http://open-webui:8080`	Open WebUI backend URL (internal)
`WEBUI_API_KEY`	(none)	Pre-shared key for Open WebUI → proxy `/v1` calls
`DB_ENCRYPTION_KEY`	(none)	High-entropy random key for AES-256-GCM encryption of IdP client secrets at rest (e.g. `openssl rand -hex 32`; not a passphrase)
`SECURE_COOKIES`	`true`	Set `Secure` flag on session cookies (set `false` for HTTP dev)
`QUEUE_TIMEOUT_SECS`	`30`	Max seconds to hold a queued request before returning 429
`RUST_LOG`	`sovereign_engine=info,tower_http=info`	Log level (tracing EnvFilter)

Volumes

Mount Point	Purpose
`/config`	SQLite database, ACME cert cache
`/models`	Model files (GGUF, etc.) — shared with backend containers
`/var/run/docker.sock`	Docker socket (required for backend container management)

TLS Configuration

Option A: Automatic (Let's Encrypt)

Set API_HOSTNAME, CHAT_HOSTNAME, COOKIE_DOMAIN, ACME_CONTACT, and LISTEN_ADDR=0.0.0.0:443. Port 443 must be directly reachable from the internet. Certs are cached in /config/acme/ and cover both hostnames.

Option B: Manual certs

Set TLS_CERT_PATH and TLS_KEY_PATH to PEM files mounted into the container.

Option C: No TLS (development)

Set LISTEN_ADDR=0.0.0.0:3000 and omit TLS variables. Not recommended for production.

Development

Prerequisites

Rust 1.84+
Node.js 22+
Docker

Build from Source

# Clone
git clone https://github.com/rauxon/SovereignEngine.git
cd SovereignEngine

# Rust proxy
cd proxy && cargo build --release

# React UI
cd ui && npm install && npm run build

# Full Docker image
docker build -t sovereign-engine .

Local Development

Copy .env.example to .env and set LISTEN_ADDR=0.0.0.0:31000
Run the proxy: cd proxy && cargo run
Run the UI dev server: cd ui && npm run dev

The Vite dev server proxies /api, /auth, and /v1 to http://localhost:31000.

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
docs		docs
live		live
proxy		proxy
scripts		scripts
tests		tests
ui		ui
.env.example		.env.example
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
SECURITY_ASSESSMENT.md		SECURITY_ASSESSMENT.md
docker-compose.runtime.yml		docker-compose.runtime.yml
docker-compose.yml		docker-compose.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sovereign Engine

Features

Architecture

Quick Start (Docker)

Prerequisites

Docker Compose

Verify Installation

Next Steps

Environment Variables

Volumes

TLS Configuration

Development

Prerequisites

Build from Source

Local Development

Documentation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sovereign Engine

Features

Architecture

Quick Start (Docker)

Prerequisites

Docker Compose

Verify Installation

Next Steps

Environment Variables

Volumes

TLS Configuration

Development

Prerequisites

Build from Source

Local Development

Documentation

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages