Skip to content

maltzsama/dagster-authkit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

38 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿ›ก๏ธ Dagster AuthKit

Community authentication wrapper for self-hosted Dagster OSS.

Authentication, RBAC, and Audit logs for Dagster without touching internal code.


๐ŸŽฏ What is this?

Dagster OSS has no auth. If you run it in a VPC or locally, anyone with the URL has full admin access.

AuthKit solves this by wrapping the dagster-webserver command to add:

  • โœ… Login Interface: Simple username/password flow.
  • โœ… RBAC (4 Levels): Granular control over who can do what.
  • โœ… Audit Logs: JSON logs for monitoring who is doing what.
  • โœ… Multi-Backend: Works with SQLite, Postgres, MySQL (via Peewee ORM) and Redis.

No code changes required. You don't touch your repository.py or dagster.yaml.


โœจ What's New in v0.3.0

๐Ÿ” Proxy Authentication Mode

Delegate authentication to enterprise identity providers via reverse proxy:

  • Authelia integration with complete examples
  • Caddy reverse proxy with built-in forward_auth directive
  • Traefik forward auth support
  • Header-based user extraction (Remote-User, Remote-Groups)
  • Smart group parser that handles JSON, LDAP DNs, CSV, and mixed formats

๐Ÿš€ Kubernetes Deployment

Full example stack for Minikube including:

  • OpenLDAP with pre-seeded users and RBAC groups
  • Authelia configured with LDAP backend
  • Caddy as reverse proxy with TLS termination
  • Dagster-AuthKit in proxy mode
  • Step-by-step Makefile with minikube tunnel support

๐Ÿ—๏ธ Core Improvements

  • GraphQL parsing: Replaced fragile regex with official AST parser (graphql-core)
  • Redis hardening: Atomic operations, proper session revocation, URL validation
  • Code organization: All UI templates centralized in utils/templates.py
  • Observability: RBAC decision tracking via metrics endpoint

๐Ÿ“‚ Ready-to-Run Examples

We provide ready-to-use stacks for different scenarios in the examples/ directory:

examples
โ”œโ”€โ”€ authelia              # NEW! Authelia + Caddy + LDAP SSO (Docker)
โ”‚   โ”œโ”€โ”€ Makefile
โ”‚   โ”œโ”€โ”€ docker-compose.yml
โ”‚   โ”œโ”€โ”€ Caddyfile
โ”‚   โ””โ”€โ”€ authelia/
โ”œโ”€โ”€ kubernetes            # NEW! Minikube deployment
โ”‚   โ”œโ”€โ”€ Makefile
โ”‚   โ””โ”€โ”€ k8s/
โ”œโ”€โ”€ ldap                  # Active Directory integration (**Experimental**)
โ”‚   โ”œโ”€โ”€ Makefile
โ”‚   โ”œโ”€โ”€ docker-compose.yml
โ”‚   โ””โ”€โ”€ ldap-bootstrap.ldif
โ”œโ”€โ”€ postgresql_redis      # Recommended production setup
โ”‚   โ”œโ”€โ”€ Makefile
โ”‚   โ””โ”€โ”€ docker-compose.yml
โ””โ”€โ”€ quickstart-sqlite     # Simple local testing
    โ”œโ”€โ”€ Makefile
    โ””โ”€โ”€ docker-compose.yml

How to run

Pick a scenario, go into the folder, and check the Makefile.

1. Authelia SSO (Docker) Complete SSO with Authelia, Caddy, and OpenLDAP:

cd examples/authelia
make up
# Access: https://auth.company.com (admin/password123)
# Then:   https://dagster.company.com

2. Kubernetes (Minikube) Same stack running on Kubernetes:

cd examples/kubernetes
make build  # Build the Docker image inside Minikube
make up     # Deploy everything
# In another terminal: make connect (runs minikube tunnel)
# Add to /etc/hosts: $(minikube ip) auth.company.com dagster.company.com

3. Standard Setup (Postgres + Redis)

cd examples/postgresql_redis
make up

4. Local Quickstart (SQLite)

cd examples/quickstart-sqlite
make up

5. LDAP/AD Testing โš ๏ธ EXPERIMENTAL

cd examples/ldap
make up

๐Ÿš€ Manual Installation (Python)

If you aren't using Docker, you can install via pip.

# For local testing (SQLite)
pip install dagster-authkit[sqlite]

# For server usage (Postgres + Redis recommended)
pip install dagster-authkit[postgresql,redis]

# For LDAP/Active Directory integration (**Experimental**)
pip install dagster-authkit[ldap]

Usage:

# Initialize the database and create the first admin
dagster-authkit init-db --with-admin

# Run Dagster (replaces the standard 'dagster-webserver' command)
dagster-authkit -f your_pipeline.py -h 0.0.0.0 -p 3000

# For proxy mode (Authelia/OAuth2 Proxy)
export DAGSTER_AUTH_BACKEND=proxy
export DAGSTER_AUTH_PROXY_LOGIN_URL=https://auth.yourcompany.com
dagster-authkit -f your_pipeline.py -h 0.0.0.0 -p 3000

๐Ÿ” Roles (RBAC)

We provide 4 levels of access. Permissions are enforced via GraphQL query analysis.

Role Description
Admin Full access. Can manage users, settings, and all pipelines.
Editor Can modify assets and codebase (if allowed) and manage runs.
Launcher Can launch runs and re-execute jobs, but cannot modify code/assets.
Viewer Read-only. Can view runs and assets. GraphQL mutations are blocked.

How it works: AuthKit analyzes GraphQL queries using the official GraphQL parser to accurately identify mutations and block unauthorized actions.


๐Ÿ“ฆ Backends

Backend Implementation Status Use Case
SQLite Peewee ORM Stable Local / Simple. Single instance only.
PostgreSQL Peewee + psycopg2 Stable Production. Recommended for Docker/K8s.
MySQL/MariaDB Peewee + mysql-connector Stable Production.
Redis Native redis Stable Session Storage + Distributed Rate Limiting.
LDAP ldap3 library Experimental Active Directory / OpenLDAP. Community maintained.
Proxy Header-based Stable Authelia, OAuth2 Proxy, Traefik, Caddy.
OpenID Connect Header-based Experimental AuthKit supports OIDC providers (Google, GitHub, Okta, Keycloak) via Authelia

๐Ÿ› ๏ธ CLI Management

Manage users directly from the shell. Useful for CI/CD or admin tasks.

# Create a new launcher
dagster-authkit add-user bob --role launcher

# Reset password
dagster-authkit change-password bob

# List everyone
dagster-authkit list-users

# View RBAC permissions matrix
dagster-authkit list-permissions

๐Ÿ”ฎ Roadmap

Current (v0.3.0)

  • โœ… Username/password auth (bcrypt)
  • โœ… 4-level RBAC (ADMIN/EDITOR/LAUNCHER/VIEWER)
  • โœ… SQLite, PostgreSQL, MySQL, Redis support
  • โœ… GraphQL mutation blocking with official AST parser
  • โœ… LDAP backend (experimental)
  • โœ… Proxy authentication (Authelia, Caddy, Traefik)
  • โœ… Kubernetes example with full SSO stack
  • โœ… Redis session revocation and rate limiting
  • โœ… Centralized UI templates

Next

  • ๐Ÿ”„ Improved GraphQL query analysis
  • ๐Ÿ”„ Helm chart for Kubernetes deployments
  • ๐Ÿ”„ OpenID Connect support (via proxy mode)

What we will NOT do:

  • โŒ Inject React code into Dagster UI (too brittle)
  • โŒ Complex enterprise features (that's what Dagster+ is for)

๐Ÿค Contributing

Found a bug? Want to add a feature? Open a PR. If it works and keeps things simple, we'll merge it.

Especially needed:

  • People with Active Directory experience to validate the LDAP backend
  • Testing on different Dagster versions
  • Helm chart contributions

๐Ÿ“„ License

Apache 2.0 - see LICENSE


๐Ÿ™ Credits

Built by Demetrius Albuquerque because self-hosting Dagster shouldn't mean no auth.

Inspired by the community's need for a middle ground between "no auth" and "pay for Dagster+".

About

Community Auth System for self-hosted Dagster OSS - simple RBAC, Audit-log, and Session Management

Topics

Resources

License

Stars

Watchers

Forks

Packages