AGENT-33 is a local-first AI agent orchestration platform for teams that want real workflows, explicit governance, and a usable control plane instead of a pile of disconnected scripts.
AGENT-33 combines an API runtime, workflow engine, memory stack, review/release controls, and a first-party frontend so you can run guarded automation from one system.
- Local-first runtime: FastAPI backend, Docker Compose bootstrap, Ollama-friendly model routing
- Contained Agent OS: optional Linux operator workspace with first-party tools, state, and stack connectivity
- Guardrailed automation: scopes, approvals, autonomy budgets, and review/release workflows
- Agent + workflow orchestration: invoke agents directly or compose repeatable workflows
- Operational visibility: health, dashboard surfaces, traces, evaluations, and rollout telemetry
- Extensible platform: packs, tools, memory, webhook intake, and improvement loops
engine/: FastAPI runtime, orchestration services, API routes, tests, Docker Compose stackfrontend/: AGENT-33 control plane UI served athttp://localhost:3000core/: orchestration specs, policy packs, protocol references, workflow materialsdocs/: canonical operator, setup, onboarding, and release-readiness documentation
- Docker Desktop or Docker Engine with Compose
- Python 3.11+
curl- Ollama reachable from the stack (
http://host.docker.internal:11434by default), or use the bundled/local override paths documented in the setup guides
cd engine
cp .env.example .env
docker compose up -d
curl http://localhost:8000/healthIf you reuse an Ollama container from another Compose project:
docker compose -f docker-compose.yml -f docker-compose.shared-ollama.yml up -d- Frontend:
http://localhost:3000 - API docs:
http://localhost:8000/docs
Default local credentials from .env.example:
- username:
admin - password:
admin
docker compose exec -T api python -c "import os,time,jwt; now=int(time.time()); payload={'sub':'local-admin','scopes':['admin','agents:read','agents:write','agents:invoke','workflows:read','workflows:write','workflows:execute','tools:execute'],'iat':now,'exp':now+3600}; print(jwt.encode(payload, os.getenv('JWT_SECRET','change-me-in-production'), algorithm=os.getenv('JWT_ALGORITHM','HS256')))"Set the token in your shell:
export TOKEN="<paste-token-here>"PowerShell:
$env:TOKEN = "<paste-token-here>"List agents:
curl http://localhost:8000/v1/agents/ \
-H "Authorization: Bearer $TOKEN"Invoke the orchestrator:
curl -X POST http://localhost:8000/v1/agents/orchestrator/invoke \
-H "Authorization: Bearer $TOKEN" \
-H "Content-Type: application/json" \
-d '{
"inputs": {
"task": "Create a short rollout plan for adding cache metrics"
},
"model": "llama3.2",
"temperature": 0.2
}'- Start the stack and confirm
/health - Sign in to
http://localhost:3000 - Mint a local JWT or use the UI token flow
- List agents with
GET /v1/agents/ - Invoke an agent or execute a minimal workflow
- Explore the dashboard, traces, reviews, evaluations, and autonomy surfaces from the UI
For a fuller beginner path, use:
Bootstrap auth is for local development only. Do not expose AGENT-33 publicly with default credentials or default secrets.
Before any shared, VPS, or production deployment:
- set
AUTH_BOOTSTRAP_ENABLED=false - rotate
API_SECRET_KEY - rotate
JWT_SECRET - rotate
ENCRYPTION_KEY - review SECURITY.md
- work through the Release Checklist
- Getting Started
- Operator Onboarding
- Setup Guide
- Walkthroughs
- Use Cases
- Agent OS Runtime
- API Surface
- Release Checklist
- Documentation Index
- Functionality and Workflows
- Production Deployment Runbook
- Operator Verification Runbook
- Horizontal Scaling Architecture
- Incident Response Playbooks
- Operators who need a guarded local or self-hosted AI control plane
- Platform teams building approval-aware automation and workflow execution
- Engineering teams running review, release, evaluation, and autonomy gates in one runtime
- Researchers and builders experimenting with packs, memory, training, and improvement loops
The POST-4 roadmap is complete through POST-4.5, including the P-PACK v3 A/B harness and behavior rollout. The next roadmap wave is public launch preparation and broader ecosystem work under POST-CLUSTER.
Latest merged implementation PR:
#406—POST-4.5: apply P-PACK v3 behavior rollout
MIT. See LICENSE.