An MLOps workflow for training, inference, experiment tracking, model registry, and deployment.
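The description above covers training, experiment tracking, a model registry, and deployment in one workflow. As a hedged illustration only (not this repository's actual code), the sketch below shows one common way to wire those pieces together with MLflow; the tracking URI, experiment name, and registered model name are placeholder assumptions.

```python
# Minimal sketch of an experiment-tracking + model-registry loop with MLflow.
# The tracking server URI and model names are assumptions, not from the repo.
import mlflow
import mlflow.sklearn
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

mlflow.set_tracking_uri("http://localhost:5000")  # assumed local tracking server
mlflow.set_experiment("demo-training")

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

with mlflow.start_run():
    model = LogisticRegression(max_iter=200).fit(X_train, y_train)
    acc = accuracy_score(y_test, model.predict(X_test))
    mlflow.log_param("max_iter", 200)
    mlflow.log_metric("accuracy", acc)
    # Logging with registered_model_name also creates a model-registry entry.
    mlflow.sklearn.log_model(model, artifact_path="model",
                             registered_model_name="demo-classifier")
```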
A comprehensive .NET MAUI plugin for ML inference with ONNX Runtime, CoreML, and platform-native acceleration support
gRPC server for Machine Learning (ML) Model Inference in Rust.
EcoChain-ML is a hybrid energy-aware ML framework integrating a lightweight PoS blockchain layer and renewable-aware scheduling. Built to simulate green computing strategies on a single PC, it evaluates energy, latency, and sustainability trade-offs.
[TPDS 2025] EdgeAIBus: AI-driven Joint Container Management and Model Selection Framework for Heterogeneous Edge Computing
Dockerized Django application for handwritten math expression recognition using a CNN model, with end-to-end ML pipeline and cloud-ready deployment.
ML service for cats that actually learn stuff. PPO brains, personality drift, mood system.
PoC demonstrating distributed workload orchestration using Ray as the primary compute framework with Prefect for workflow orchestration, supporting cloud-native deployments (Kubernetes)
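To make the Ray-plus-Prefect split concrete, here is a minimal hedged sketch of that pattern under assumed names (the functions, data, and cluster setup are illustrative, not taken from the PoC): Prefect defines the workflow, Ray fans the compute out.

```python
# Hedged sketch: Prefect orchestrates the workflow, Ray runs the distributed work.
# score_chunk, the chunk data, and cluster settings are illustrative assumptions.
import ray
from prefect import flow, task

@ray.remote
def score_chunk(chunk):
    # Placeholder for a real inference call on one shard of data.
    return sum(chunk)

@task
def run_distributed_scoring(chunks):
    # Submit one Ray task per chunk and gather the results.
    refs = [score_chunk.remote(c) for c in chunks]
    return ray.get(refs)

@flow
def scoring_pipeline():
    ray.init(ignore_reinit_error=True)  # attaches to a cluster if one is configured
    chunks = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    print(run_distributed_scoring(chunks))

if __name__ == "__main__":
    scoring_pipeline()
```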
High-performance C++20 neural network framework powered by Intel oneAPI MKL 2025.2. Optimized for CPU-based deep learning inference and training.
A lightweight, framework-agnostic middleware that dynamically batches inference requests in real time to maximize GPU/TPU utilization.
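Dynamic batching is the key idea in that description: queue incoming requests and flush them to the model either when the batch is full or a small time budget expires, trading a few milliseconds of latency for much better accelerator utilization. The sketch below is a generic asyncio illustration of that technique under assumed size and latency limits, not this middleware's actual API.

```python
# Minimal sketch of real-time dynamic batching: requests are queued and flushed
# when the batch fills or the wait budget expires. model_fn, max_batch_size, and
# max_wait_ms are illustrative assumptions.
import asyncio

class DynamicBatcher:
    def __init__(self, model_fn, max_batch_size=32, max_wait_ms=10):
        self.model_fn = model_fn
        self.max_batch_size = max_batch_size
        self.max_wait = max_wait_ms / 1000
        self.queue = asyncio.Queue()

    async def predict(self, item):
        fut = asyncio.get_running_loop().create_future()
        await self.queue.put((item, fut))
        return await fut

    async def run(self):
        while True:
            item, fut = await self.queue.get()
            batch, futures = [item], [fut]
            deadline = asyncio.get_running_loop().time() + self.max_wait
            # Keep pulling requests until the batch fills or the deadline passes.
            while len(batch) < self.max_batch_size:
                timeout = deadline - asyncio.get_running_loop().time()
                if timeout <= 0:
                    break
                try:
                    item, fut = await asyncio.wait_for(self.queue.get(), timeout)
                    batch.append(item)
                    futures.append(fut)
                except asyncio.TimeoutError:
                    break
            for f, out in zip(futures, self.model_fn(batch)):
                f.set_result(out)

async def main():
    batcher = DynamicBatcher(model_fn=lambda xs: [x * 2 for x in xs])
    asyncio.create_task(batcher.run())
    results = await asyncio.gather(*(batcher.predict(i) for i in range(8)))
    print(results)  # the eight requests are served from one or two batches

asyncio.run(main())
```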
Client-side React + Vite web app to record and process voice audio and send features to an API for automated stress prediction (speech-based stress detection).
Project submission.
Enterprise Data Warehouse & ML Platform - High-performance platform processing 24B records with <60s latency and 100K records/sec throughput, featuring 32 fact tables, 128 dimensions, and automated ML pipelines achieving 91.2% accuracy. Real-time ML inference serving 300K+ predictions/hour with ensemble models.
Microservice to digitize a chess scoresheet
Production-ready ML model serving with FastAPI, TensorFlow, Docker, Kubernetes, and Prometheus. Features CI/CD, health checks, and scalable inference.
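For context on the serving layer named above, here is a hedged, minimal FastAPI sketch with a health-check and a predict endpoint; the TensorFlow SavedModel path and input schema are assumptions, not this project's actual interface.

```python
# Hedged sketch of a FastAPI serving app with /health and /predict endpoints.
# "model/" (a SavedModel directory) and the flat feature vector are assumptions.
import numpy as np
import tensorflow as tf
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()
model = tf.keras.models.load_model("model/")  # assumed model location

class PredictRequest(BaseModel):
    features: list[float]

@app.get("/health")
def health():
    # Liveness/readiness probe target for Kubernetes.
    return {"status": "ok"}

@app.post("/predict")
def predict(req: PredictRequest):
    x = np.array([req.features], dtype=np.float32)
    y = model.predict(x)
    return {"prediction": y.tolist()}
```

Run with `uvicorn app:app` (or behind a process manager in a container) and point the Kubernetes liveness probe at `/health`.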
QuantTradingOS is a collection of AI-powered trading agents, frameworks, and analytics tools for research, execution, and portfolio management.
Scripts for benchmarking vLLM using a Llama 8B model on an NVIDIA RTX 4090 GPU.
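As a rough illustration of that kind of benchmark (not the repository's scripts), the sketch below times offline generation with vLLM and reports generated tokens per second; the checkpoint name, prompt set, and sampling parameters are assumptions and require a GPU with enough memory for an 8B model.

```python
# Hedged sketch of a vLLM throughput benchmark. Model name, prompts, and
# sampling settings are illustrative assumptions.
import time
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct")  # assumed 8B checkpoint
params = SamplingParams(max_tokens=128, temperature=0.8)
prompts = ["Explain dynamic batching in one paragraph."] * 64

start = time.perf_counter()
outputs = llm.generate(prompts, params)
elapsed = time.perf_counter() - start

generated = sum(len(o.outputs[0].token_ids) for o in outputs)
print(f"{generated / elapsed:.1f} generated tokens/sec over {elapsed:.1f}s")
```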
Production-style Responsible AI income prediction system with deterministic inference, p50/p95 latency tracking, rate limiting, and fairness audit.
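The p50/p95 latency tracking mentioned above can be illustrated with a small rolling-window percentile tracker; the sketch below is a generic example under assumed window size and a stand-in `predict_fn`, not this system's implementation.

```python
# Minimal sketch of p50/p95 latency tracking around an inference call.
# The window size and predict_fn are illustrative assumptions.
import time
from collections import deque
from statistics import quantiles

class LatencyTracker:
    def __init__(self, window=1000):
        self.samples = deque(maxlen=window)  # rolling window of latencies in ms

    def record(self, ms):
        self.samples.append(ms)

    def percentiles(self):
        if len(self.samples) < 2:
            return None
        cuts = quantiles(self.samples, n=100)  # 99 cut points at 1%..99%
        return {"p50_ms": cuts[49], "p95_ms": cuts[94]}

tracker = LatencyTracker()

def timed_predict(predict_fn, payload):
    start = time.perf_counter()
    result = predict_fn(payload)
    tracker.record((time.perf_counter() - start) * 1000)
    return result

for i in range(100):
    timed_predict(lambda x: x * 2, i)  # stand-in for a real model call
print(tracker.percentiles())
```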
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
🐱 Create a living cat AI that exhibits emotions, reactions, and realistic behavior for an engaging and interactive experience.