This project is an AutoML platform that lets a user quickly train models and end up with a deployed, inference-ready model.
Before running the project, ensure you have the following installed:
- uv - Python package manager (https://github.com/astral-sh/uv)
- Node.js & npm - For the Next.js dashboard
- Redis - Message queue for job management (run: `docker run -d -p 6379:6379 redis`)
- Minikube - Local Kubernetes cluster
- kubectl - Kubernetes command-line tool
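To sanity-check that the prerequisites are available (assuming `redis-cli` is installed locally; otherwise exec into the Redis container instead), you can run:

```bash
uv --version
node --version && npm --version
redis-cli ping            # should print PONG if Redis is reachable
minikube version
kubectl version --client
```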
Install Python dependencies:
```bash
uv sync
```

The platform can be started using the launcher script:

```bash
./launcher.sh
```

This will start all services:
- MLflow Server - Model tracking and registry (http://localhost:5001)
- API Server - FastAPI backend (http://localhost:8000)
- Job Worker - Background worker for training orchestration
- Dashboard - Next.js frontend (http://localhost:3000)
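Once the launcher finishes, you can check that each service is reachable. The URLs below use the default ports listed above; the `/docs` path is FastAPI's default interactive docs page, so adjust if the backend exposes different routes:

```bash
curl -s http://localhost:5001        # MLflow UI (returns HTML)
curl -s http://localhost:8000/docs   # FastAPI interactive docs
curl -s http://localhost:3000        # Next.js dashboard
```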
To stop all services:
```bash
./launcher.sh --stop
```

Optional configuration:

- IP_ADDR - Host IP address for K3s pods to connect to MLflow (auto-detected if not set)
- DASHBOARD_PORT - Dashboard port (default: 3000)
- NEXT_PUBLIC_API_BASE_URL - API base URL (default: http://localhost:8000)
- REDIS_HOST - Redis host (default: localhost)
- REDIS_PORT - Redis port (default: 6379)
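For example, to run the dashboard on a different port and point the worker at another Redis instance (the values below are placeholders):

```bash
export DASHBOARD_PORT=3100
export REDIS_HOST=redis.internal.example
export REDIS_PORT=6379
./launcher.sh
```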
Note: The IP_ADDR is automatically detected (macOS: en0 interface, Linux: default route). You can override it by setting it manually if needed:
```bash
export IP_ADDR=192.168.1.100
./launcher.sh
```

The first MVP implements the following workflow:

```mermaid
flowchart TD
A[Upload Dataset via API] --> B[Job Manager]
B --> C1[K3s Training Pod 1]
B --> C2[K3s Training Pod 2]
B --> C3[K3s Training Pod N]
C1 & C2 & C3 --> D[MLflow Tracking]
D --> E[Select Best Model]
E --> F[Deploy Model with KServe or BentoML]
F --> G[Inference API Endpoint]
```
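The workflow starts with a dataset upload to the FastAPI backend. The exact route and form field are not documented here, so the call below is only a sketch with a hypothetical `/datasets/upload` endpoint; check http://localhost:8000/docs for the real routes:

```bash
# Hypothetical endpoint and form field name; see the FastAPI docs for the actual ones
curl -X POST http://localhost:8000/datasets/upload \
  -F "file=@./data/train.csv"
```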
The ultimate goal is to have the following architecture working:

```mermaid
flowchart TD
A[Upload Dataset and Prompt] --> B[Data Profiler: schema, types, stats]
B --> C[Intelligent Agent: LLM + Rules to plan pipelines]
C --> D[Training Orchestrator: launch K3s Jobs]
subgraph K3s Cluster
D --> P1[Training Pod 1: Model A]
D --> P2[Training Pod 2: Model B]
D --> P3[Training Pod 3: Model C]
end
P1 & P2 & P3 --> E[MLflow Tracking: metrics, params, artifacts]
E --> F[Model Selector: evaluate best model]
F --> G[Model Deployer: KServe or BentoML]
G --> H[Monitoring Dashboard: Prometheus, Grafana, Evidently]
H --> C
G --> I[Inference API Endpoint]
```
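Once a model is deployed through KServe, the inference endpoint would typically follow KServe's V1 inference protocol. The host and model name below are placeholders for illustration only:

```bash
# Placeholder host and model name; request body follows the KServe V1 predict protocol
curl -X POST http://<inference-host>/v1/models/best-model:predict \
  -H "Content-Type: application/json" \
  -d '{"instances": [[5.1, 3.5, 1.4, 0.2]]}'
```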