This branch extends Evoman's main branch, allowing the agent to update its policy (and receive partial rewards) after each time step, as opposed to only at the end of the game. Therefore, it suports RL algorithms. Use the demo data_gatherer_PPO.py to experiment with Evoman using the algorithm PPO.
dowelt/evoman
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|