Reinforcement learning for text generation on MLX (Apple Silicon)
Updated Feb 15, 2026 - Python
Rust-native inference runtime for Qwen3 / Qwen3.5 — OpenAI-compatible serving + integrated agent, train, and self-evolution workflows. CUDA + Metal, no PyTorch on the hot path.
OPSG-based test refinement for Java: a stable RL approach to generating maintainable, high-quality unit tests, with 98.6% compilation success.