BlockBlast reimplementation + RL agents (DQN, PPO, PPO+Action Masking, DQN+Action Masking, Random)
-
Updated
Apr 25, 2025 - Python
BlockBlast reimplementation + RL agents (DQN, PPO, PPO+Action Masking, DQN+Action Masking, Random)
This repository contains code for the paper "Multi-agent reinforcement learning with action masking for uav-enabled mobile communications" which has been accepted for publishing in IEEE Transactions on Machine Learning in Communications and Networking.
Source code of “Agile Reinforcement Learning for Real-Time Task Scheduling in Edge Computing” (CAIAC 2025)
My MSc project on applying, tuning and modifying the PPO and A2C algorithms to Pettingzoo MARL library two player poker game
Code for hierarchical signal coordination using hybrid model-based and RL approach.
Bot Pokémon Gen 1 (Random Battle Showdown) entraîné en RL avec MaskablePPO, imitation + curriculum + self-play ; suivi MLflow.
Unofficial Stable-Baselines3 extension package with MaskableRecurrentPPO.
숫자 퍼즐 ‘사과 게임’을 pygame으로 구현하고, Gymnasium 환경에서 PPO·DQN·Monte Carlo 에이전트를 학습·실행하며 사람 vs PPO 대결까지 지원합니다.
Safe reinforcement learning with PPO-based action masking and penalty methods in MiniGrid Lava environments.
Add a description, image, and links to the action-masking topic page so that developers can more easily learn about it.
To associate your repository with the action-masking topic, visit your repo's landing page and select "manage topics."