Skip to content
View garrett4wade's full-sized avatar
  • Tsinghua University
  • Beijing, China

Block or report garrett4wade

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. inclusionAI/AReaL inclusionAI/AReaL Public

    The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

    Python 5k 463

  2. openpsi-project/ReaLHF openpsi-project/ReaLHF Public archive

    Super-Efficient RLHF Training of LLMs with Parameter Reallocation

    Python 335 22

  3. revisiting_marl revisiting_marl Public

    Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)

    Python 23

  4. cugae cugae Public

    CUDA implementation of Generalized Advantage Estimation (GAE)

    Python 4

  5. scaling_marl scaling_marl Public

    Python