HiFAR: Multi-Stage Curriculum Learning for High-Dynamics Humanoid Fall Recovery

This is the official implementation of the IROS paper "HiFAR: Multi-Stage Curriculum Learning for High-Dynamics Humanoid Fall Recovery".

This project builds upon the Booster Gym project, a reinforcement learning (RL) framework designed for humanoid robot locomotion developed by Booster Robotics.

Installation

Follow these steps to set up your environment:

Install Miniconda

Miniconda is a lightweight tool for managing packages and environments.

mkdir -p ~/miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh
bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3
rm ~/miniconda3/miniconda.sh
vim ~/.bashrc  # Add the following line
source ~/miniconda3/bin/activate

Create a Python 3.8 environment:

conda create --name <env_name> python=3.8

Install PyTorch

Activate the environment and install PyTorch with CUDA support:

conda activate <env_name>
conda install numpy=1.21.6 pytorch=2.0 pytorch-cuda=11.8 -c pytorch -c nvidia

Install Isaac Gym

Download Isaac Gym from NVIDIA’s website.

Extract and install:

tar -xzvf IsaacGym_Preview_4_Package.tar.gz
cd isaacgym/python
pip install -e .

Configure the environment for shared libraries:

cd $CONDA_PREFIX
mkdir -p ./etc/conda/activate.d
vim ./etc/conda/activate.d/env_vars.sh  # Add the following lines
export OLD_LD_LIBRARY_PATH=${LD_LIBRARY_PATH}
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:$CONDA_PREFIX/lib
mkdir -p ./etc/conda/deactivate.d
vim ./etc/conda/deactivate.d/env_vars.sh  # Add the following lines
export LD_LIBRARY_PATH=${OLD_LD_LIBRARY_PATH}
unset OLD_LD_LIBRARY_PATH

Install Additional Requirements

Install the required Python dependencies:
```
pip install -r requirements.txt
```

Usage

Configurations

Configurations are loaded from envs/<task>.yaml. You can override config values using command-line arguments.

Command-Line Arguments:

--checkpoint: Path to the model checkpoint.
--num_envs: Number of environments to create.
--headless: Run without a viewer window.
--sim_device: Device for physics simulation (e.g., cuda:0, cpu).
--rl_device: Device for the RL algorithm (e.g., cuda:0, cpu).
--seed: Random seed.
--max_iterations: Maximum training iterations.

Stage 1 Training

To train a basic fall recovery policy, run:

python train.py --task=T1FallRecovery

This trains a policy for the T1FallRecovery task using the default configuration. Example configurations are available in envs/example_cfgs/T1FallRecovery_cfg1.yaml.

Stage 2 Training

After completing the basic policy training, proceed to train a more complex policy. Replace envs/T1FallRecovery.yaml with envs/example_cfgs/T1FallRecovery_cfg2.yaml, and set the checkpoint argument to the path of the first-stage trained model.

python train.py --task=T1FallRecovery

To enable network expansion, use the convert_fall_recovery.py script. After expanding the model, train the policy with additional control DoFs by specifying the checkpoint argument with the converted model path:

python train.py --task=T1FallRecoveryRandom

Example configurations are available in envs/example_cfgs/T1FallRecoveryRandom_cfg.yaml. You can add more initial states by modifying the init_state list in the configuration file to enhance the robustness of the policy.

Testing

To test a trained policy in Isaac Gym, run:

python play.py --task=TASK_NAME --checkpoint=logs/<date-time>/nn/<checkpoint_name>.pth

Videos are saved in videos/<date-time>.mp4 by default. Disable video recording in the config file if needed.

For simulation-to-simulation testing in Mujoco, use:

python play_mujoco.py --task=T1FallRecovery

or

python play_mujoco_extended.py --task=T1FallRecoveryRandom

Citation

If you find this project useful, please cite our paper:

@article{hifar2025,
      title={HiFAR: Multi-Stage Curriculum Learning for High-Dynamics Humanoid Fall Recovery},
      author={Chen, Penghui and Wang, Yushi and Luo, Changsheng and Cai, Wenhan and Zhao, Mingguo},
      journal={arXiv preprint arXiv:2502.20061},
      year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
envs		envs
resources/T1		resources/T1
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
convert_fall_recovery_model.py		convert_fall_recovery_model.py
play.py		play.py
play_mujoco.py		play_mujoco.py
play_mujoco_extended.py		play_mujoco_extended.py
requirements.txt		requirements.txt
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HiFAR: Multi-Stage Curriculum Learning for High-Dynamics Humanoid Fall Recovery

Installation

Usage

Configurations

Stage 1 Training

Stage 2 Training

Testing

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

HiFAR: Multi-Stage Curriculum Learning for High-Dynamics Humanoid Fall Recovery

Installation

Usage

Configurations

Stage 1 Training

Stage 2 Training

Testing

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages