Need help with training priviledged teach policy

Hi Tairan,

Thanks for sharing the code.

I tried to reproduce the results of the privileged teacher policy with the following scripts:

```
python legged_gym/scripts/train_hydra.py \
  --config-name=config_teleop \
  task=h1:teleop run_name=OmniH2O_TEACHER \
  env.num_observations=913 \
  env.num_privileged_obs=990 \
  motion.teleop_obs_version=v-teleop-extend-max-full \
  motion=motion_full \
  motion.extend_head=True \
  num_envs=4096 \
  asset.zero_out_far=False \
  asset.termination_scales.max_ref_motion_distance=1.5 \
  sim_device=cuda:0 \
  motion.motion_file=resources/motions/h1/amass_phc_filtered.pkl \
  rewards=rewards_teleop_omnih2o_teacher \
  rewards.penalty_curriculum=True \
  rewards.penalty_scale=0.5
```

After training about 200K steps, I still observed very bad results when playing the teach policy, nearly 0 success rate:

```
Loaded 1 motions with a total length of 9.300s and 280 frames.                                              | 0/1 [00:00<?, ?it/s]
Terminated: 0 | max frames: 467 | steps 12 | Start: 62 | Succ rate: 0.016 | Mpjpe: 252.459
```

The video results:

https://github.com/user-attachments/assets/17322605-77e9-4dcf-9941-eb423b539685

Here is my training curve:

<img width="3469" height="1260" alt="Image" src="https://github.com/user-attachments/assets/cbed65dc-0b9b-4db2-a985-29eab173e012" />

 What might I be doing wrong?

Thank you.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Need help with training priviledged teach policy #73

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Need help with training priviledged teach policy #73

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions