🚀 Distributed Image Classifier on Kubernetes

Project 20 for the Distributed Network Programming course.

A high-performance, autoscaling microservice deployed on Kubernetes that accepts image uploads, performs classification using a pre-trained Deep Learning model, and exposes Prometheus metrics for monitoring.

🏗 Architecture & Design Choices

API Framework (FastAPI): Chosen for its high performance, native async support, and automatic OpenAPI documentation. Heavy ML inference is offloaded to a separate threadpool to prevent event loop blocking.
Machine Learning (PyTorch & ResNet18): ResNet18 is a small pre-trained network that offers a good balance between inference speed and accuracy, which suits a responsive microservice.
Package Management (uv): Used for extremely fast dependency resolution and strict lockfile generation.
Automation (go-task): Used instead of complex Bash scripts. The Taskfile detects the active K8s context and automatically loads local images into minikube or kind, so no local registry is needed.
Autoscaling (HPA): Horizontal Pod Autoscaler monitors CPU utilization and dynamically scales the classification pods under load.
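
The threadpool offloading mentioned above can be illustrated with the standard library alone. This is a minimal sketch, not the project's actual handler: `classify`, `predict`, and `inference_pool` are hypothetical names, and the sleep stands in for a blocking model call. In a FastAPI app the same idea is commonly expressed by declaring the endpoint with plain `def` or by calling `starlette.concurrency.run_in_threadpool`.

```python
import asyncio
import time
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for a blocking model call (e.g. ResNet18 inference).
def classify(image_bytes: bytes) -> str:
    time.sleep(0.1)  # simulate blocking work
    return f"label-for-{len(image_bytes)}-bytes"

# Dedicated pool so inference never runs on the event loop thread.
inference_pool = ThreadPoolExecutor(max_workers=2)

async def predict(image_bytes: bytes) -> str:
    """Run the blocking classifier in the pool and await its result."""
    loop = asyncio.get_running_loop()
    return await loop.run_in_executor(inference_pool, classify, image_bytes)

async def heartbeat(ticks: list) -> None:
    """Other coroutine work that keeps running while inference is in flight."""
    for _ in range(5):
        ticks.append(time.monotonic())
        await asyncio.sleep(0.01)

async def demo():
    ticks = []
    label, _ = await asyncio.gather(predict(b"\x00" * 1024), heartbeat(ticks))
    return label, len(ticks)
```

Calling `asyncio.run(demo())` runs the simulated inference and the heartbeat concurrently; without the executor, the `time.sleep` inside `classify` would stall every other coroutine for its full duration.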

🛠 Step 0: Prerequisites (Crucial)

To ensure a smooth deployment, you must have the following installed on your host machine:

1. Python 3.10+ (required to parse deployment scripts and run Locust).
2. A local Kubernetes cluster (Docker Desktop, Minikube, or Kind).

⚠️ MINIKUBE USERS - CRITICAL STEP: The Autoscaler (HPA) requires metrics to function. You must enable the metrics server before deploying:
minikube addons enable metrics-server

3. Install go-task (the task runner). This project uses task to automate everything. You must install it first:

# Windows (via built-in winget)
winget install Task.Task

# macOS (Homebrew)
brew install go-task/tap/go-task

# Linux (Installs globally to /usr/local/bin)
sudo sh -c "$(curl --location https://taskfile.dev/install.sh)" -- -d -b /usr/local/bin

(Note: When you run it, task automatically verifies that uv, docker, kubectl, and helm are installed.)

🚀 Step 1: One-Click Deployment

Deploy the entire architecture (App Build, K8s Deployment, Helm Monitoring Stack) with a single command:

task start

What this does automatically:

Verifies system requirements.
Calculates a hash of the source code to generate a unique Docker image tag.
Builds the image and loads it into the detected K8s environment (for example, minikube or kind).
Installs the kube-prometheus-stack via Helm into the monitoring namespace.
Applies Kubernetes manifests, waits for the rollout, and prints the Grafana admin password.
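
The tagging and image-loading steps above could look roughly like the following. This is a sketch under assumptions, not the logic in Taskfile.yml: the function names, the file patterns, and the 12-character tag length are hypothetical. It relies on the default context names (`minikube`, `kind-<cluster>`) and on the existing `minikube image load` and `kind load docker-image` commands.

```python
import hashlib
from pathlib import Path
from typing import List, Optional

def source_tag(root: str, patterns=("*.py", "Dockerfile", "uv.lock")) -> str:
    """Return a short content hash of the matching files under root."""
    digest = hashlib.sha256()
    files = sorted({p for pat in patterns for p in Path(root).rglob(pat) if p.is_file()})
    for path in files:
        # Hash the relative path too, so renames change the tag.
        digest.update(path.relative_to(root).as_posix().encode())
        digest.update(b"\0")
        digest.update(path.read_bytes())
    return digest.hexdigest()[:12]

def load_command(context: str, image: str) -> Optional[List[str]]:
    """Map a kubectl context name to an image-load command, if one is needed."""
    if context == "minikube":
        return ["minikube", "image", "load", image]
    if context.startswith("kind-"):
        cluster = context[len("kind-"):]
        return ["kind", "load", "docker-image", image, "--name", cluster]
    # Assumption: other contexts (e.g. docker-desktop) can already see local images.
    return None
```

The context name itself can be read with `kubectl config current-context`. Because the tag is derived from file contents, an unchanged source tree yields the same tag, and any edit produces a new one.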

🌐 Accessing the API

Once deployed, the FastAPI Swagger UI is available at: 👉 http://localhost/docs

⚠️ MINIKUBE USERS: In Minikube, LoadBalancer Services are not exposed on localhost automatically. You must run the following in a separate terminal and keep it open:
minikube tunnel

📊 Step 2: Load Testing & Autoscaling

To validate throughput, latency, and HPA behavior, use the built-in test suite. This will set up the necessary port-forwards to Grafana and start the Locust load testing tool.

task test

(You can override default load parameters: task test USERS=50 RATE=5)
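
The load test sends random image data to the /predict endpoint. One way to build such a payload with only the standard library is sketched below. The project's locustfile.py may generate its images differently, so `random_png` and its default size are assumptions made for illustration.

```python
import os
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"

def _chunk(kind: bytes, data: bytes) -> bytes:
    """Encode one PNG chunk: length, type, data, CRC over type + data."""
    crc = zlib.crc32(kind + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + kind + data + struct.pack(">I", crc)

def random_png(width: int = 32, height: int = 32) -> bytes:
    """Return a valid 8-bit RGB PNG filled with random pixels."""
    # Each scanline starts with filter byte 0 (no filtering), then RGB triples.
    rows = b"".join(b"\x00" + os.urandom(width * 3) for _ in range(height))
    # IHDR: width, height, bit depth 8, color type 2 (RGB), no interlace.
    ihdr = struct.pack(">IIBBBBB", width, height, 8, 2, 0, 0, 0)
    return (
        PNG_SIGNATURE
        + _chunk(b"IHDR", ihdr)
        + _chunk(b"IDAT", zlib.compress(rows))
        + _chunk(b"IEND", b"")
    )
```

A Locust task could post the returned bytes as a multipart upload. The form field name depends on the API and is not stated here.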

🔍 How to Monitor the Test:

Start the Load (Locust): Open http://localhost:8089. The test will automatically start hitting the /predict endpoint with random image data. Observe RPS and Latency here.
Watch the Autoscaler (HPA): Open two new terminals (each command below blocks while it watches) and watch Kubernetes spawn new pods as the CPU load increases:
```
kubectl get hpa -w
kubectl get pods -w
```
View Prometheus Metrics (Grafana): Open http://localhost:3000. Log in with the username admin and the password printed at the end of task start.
- Navigate to: Dashboards -> Kubernetes / Compute Resources / Namespace (Workloads).
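
To interpret the replica counts shown by `kubectl get hpa -w`, it helps to know the scaling rule documented for the Horizontal Pod Autoscaler: desired replicas = ceil(current replicas × current metric / target metric), with no change when the ratio is within a tolerance (0.1 by default). The sketch below applies that rule. The 80% CPU target and the replica bounds are assumed values, not read from this project's manifests, and the function ignores stabilization windows and unready pods.

```python
import math

def desired_replicas(
    current_replicas: int,
    current_cpu_pct: float,
    target_cpu_pct: float = 80.0,  # assumed target, not from the k8s manifests
    min_replicas: int = 1,         # assumed bounds
    max_replicas: int = 10,
    tolerance: float = 0.1,
) -> int:
    """Apply the documented HPA formula to average CPU utilization."""
    ratio = current_cpu_pct / target_cpu_pct
    if abs(ratio - 1.0) <= tolerance:
        return current_replicas  # close enough to target: no scaling
    desired = math.ceil(current_replicas * ratio)
    return max(min_replicas, min(max_replicas, desired))

scale_up = desired_replicas(2, 160.0)    # 4: utilization is double the target
steady = desired_replicas(3, 85.0)       # 3: within the 10% tolerance
scale_down = desired_replicas(4, 20.0)   # 1: load dropped to a quarter of target
```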

💻 Local Development (Without K8s)

You can run and test the application locally without full cluster deployment:

task run       # Starts the FastAPI server locally on port 4123
task lint      # Runs Ruff to format and lint code

🧹 Step 3: Cleanup

Manage your cluster state easily with these commands to free up resources:

# Remove Application only (keeps monitoring stack and Grafana data)
task down

# NUCLEAR Clean: Delete everything (App, Monitoring Stack, Namespace)
task teardown

# Clean local caches (Python __pycache__, Ruff, uv, Docker dangling images)
task clean
