Rust-native single-binary MLX inference + conversion backend for Apple Silicon
rust metal inference affine quantization mlx kv-cache apple-silicon llm safetensors tiered-cache paroquant turboquant rotorquant gemma4 isoquant qwen3-6 ternary-bonsai mxfp planarquant
-
Updated
Jun 17, 2026 - Rust