This repository is a framework that uses a VLA model as a backbone to improve success rates with RL.
# 1) Conda env
conda create -n vla_lab python=3.11
conda activate vla_lab
# 2) Clone the repo
cd $HOME
git clone https://github.com/chohh7391/vla_lab.git
cd vla_lab
# 3) Isaac Sim SDK (v5.1.0)
pip install --upgrade pip
pip install "isaacsim[all,extscache]==5.1.0" --extra-index-url https://pypi.nvidia.com
# 4) Pytorch (CUDA 12.8)
pip install -U torch==2.7.0 torchvision==0.22.0 --index-url https://download.pytorch.org/whl/cu128
# 5) Isaac Lab (v2.3.0)
git clone -b v2.3.0 git@github.com:isaac-sim/IsaacLab.git _isaaclab
sudo apt install -y cmake build-essential
./_isaaclab/isaaclab.sh --install
# 6) Smoke test (headless sim)
python _isaaclab/scripts/tutorials/00_sim/create_empty.py --headless
# 7) Dev install
python -m pip install -e source/vla_lab
# 8) Extra dependencies
python -m pip install zmq scikit-learn pyarrow fastparquet av-
Verify that the extension is correctly installed by:
-
Listing the available tasks:
Note: It the task name changes, it may be necessary to update the search pattern
"VlaLab-"(in thescripts/reinforcement_learning/list_envs.pyfile) so that it can be listed.# use 'FULL_PATH_TO_isaaclab.sh|bat -p' instead of 'python' if Isaac Lab is not installed in Python venv or conda python scripts/reinforcement_learning/list_envs.py -
Running a task:
# use 'FULL_PATH_TO_isaaclab.sh|bat -p' instead of 'python' if Isaac Lab is not installed in Python venv or conda python scripts/reinforcement_learning/<RL_LIBRARY>/train.py --task=<TASK_NAME>
-
Running a task with dummy agents:
These include dummy agents that output zero or random agents. They are useful to ensure that the environments are configured correctly.
-
Zero-action agent
# use 'FULL_PATH_TO_isaaclab.sh|bat -p' instead of 'python' if Isaac Lab is not installed in Python venv or conda python scripts/reinforcement_learning/zero_agent.py --task=<TASK_NAME>
-
Random-action agent
# use 'FULL_PATH_TO_isaaclab.sh|bat -p' instead of 'python' if Isaac Lab is not installed in Python venv or conda python scripts/reinforcement_learning/random_agent.py --task=<TASK_NAME>
-
-
To setup the IDE, please follow these instructions:
- Run VSCode Tasks, by pressing
Ctrl+Shift+P, selectingTasks: Run Taskand running thesetup_python_envin the drop down menu. When running this task, you will be prompted to add the absolute path to your Isaac Sim installation.
If everything executes correctly, it should create a file .python.env in the .vscode directory.
The file contains the python paths to all the extensions provided by Isaac Sim and Omniverse.
This helps in indexing all the python modules for intelligent suggestions while writing code.
We provide an example UI extension that will load upon enabling your extension defined in source/vla_lab/vla_lab/ui_extension_example.py.
To enable your extension, follow these steps:
-
Add the search path of this project/repository to the extension manager:
- Navigate to the extension manager using
Window->Extensions. - Click on the Hamburger Icon, then go to
Settings. - In the
Extension Search Paths, enter the absolute path to thesourcedirectory of this project/repository. - If not already present, in the
Extension Search Paths, enter the path that leads to Isaac Lab's extension directory directory (IsaacLab/source) - Click on the Hamburger Icon, then click
Refresh.
- Navigate to the extension manager using
-
Search and enable your extension:
- Find your extension under the
Third Partycategory. - Toggle it to enable your extension.
- Find your extension under the
We have a pre-commit template to automatically format your code. To install pre-commit:
pip install pre-commitThen you can run pre-commit with:
pre-commit run --all-filesIn some VsCode versions, the indexing of part of the extensions is missing.
In this case, add the path to your extension in .vscode/settings.json under the key "python.analysis.extraPaths".
{
"python.analysis.extraPaths": [
"<path-to-ext-repo>/source/vla_lab"
]
}If you encounter a crash in pylance, it is probable that too many files are indexed and you run out of memory.
A possible solution is to exclude some of omniverse packages that are not used in your project.
To do so, modify .vscode/settings.json and comment out packages under the key "python.analysis.extraPaths"
Some examples of packages that can likely be excluded are:
"<path-to-isaac-sim>/extscache/omni.anim.*" // Animation packages
"<path-to-isaac-sim>/extscache/omni.kit.*" // Kit UI tools
"<path-to-isaac-sim>/extscache/omni.graph.*" // Graph UI tools
"<path-to-isaac-sim>/extscache/omni.services.*" // Services tools
...-
You can download pretrained base line policy here
-
The model trained in base_line must exist to start.
-
Edit dataset_path in {}_gr00t_env_cfg.py
- PegInsert Demo Save
python scripts/reinforcement_learning/rl_games/play.py --task=VlaLab-VLA-Gr00t-Forge-PegInsert-Demo-Save-Direct-v0 --headless --enable_cameras --checkpoint=<BASE_LINE_CHECKPOINT_PATH>- GearMesh Demo Save
python scripts/reinforcement_learning/rl_games/play.py --task=VlaLab-VLA-Gr00t-Forge-GearMesh-Demo-Save-Direct-v0 --headless --enable_cameras --checkpoint=<BASE_LINE_CHECKPOINT_PATH>- NutThread Demo Save
python scripts/reinforcement_learning/rl_games/play.py --task=VlaLab-VLA-Gr00t-Forge-NutThread-Demo-Save-Direct-v0 --headless --enable_cameras --checkpoint=<BASE_LINE_CHECKPOINT_PATH>- Automate Demo Save
python source/vla_lab/vla_lab/tasks/direct/base_line/automate/run_w_id.py --assembly_id=<Assembly_ID> --train- You can use hugging face
- add
--huggingfacein terminal
- add
- Install Isaac-GR00T
cd ~/
git clone https://github.com/NVIDIA/Isaac-GR00T.git- install dependencies
cd Isaac-GR00T
conda create -n gr00t python=3.10
conda activate gr00t
pip install --upgrade setuptools
pip install -e .[base]
pip install --no-build-isolation flash-attn==2.7.1.post4 -
Replace the files at
Isaac-GR00T/gr00t/data/embodiment_tags.pyandIsaac-GR00T/gr00t/experiment/data_config.pywith the files from the gr00t folder within the repository. -
change meta folder in dataset like
~/vla_lab/gr00t/meta -
take care three things
- 1st: In
episodes.jsonl, edit "tasks" except "valid". And edit "length" to your total_frames - 2nd: In
info.json, edit "total_episodes" and "total_frames" - 3rd: Change stats.json file using
vla_lab/gr00t/utils/get_stats.py - 4th: In
tasks.jsonl, edit "task" at "task_index": 0
- 1st: In
-
Train gr00t model
python scripts/gr00t_finetune.py \
--dataset-path /home/hyunho_RCI/datasets/gr00t-rl/<TASK_NAME>/ \
--num-gpus 1 \
--batch-size 32 \
--output-dir /home/hyunho_RCI/Isaac-GR00T/checkpoints/<TASK_NAME> \
--max-steps 20000 \
--embodiment-tag franka \
--data-config franka_triple_cam \
--video-backend torchvision_av \
--push_to_hub \
--hub_model_id bhe1004/<TASK_NAME>- Eval gr00t model
python scripts/eval_policy.py \
--plot \
--embodiment_tag franka \
--model_path bhe1004/<TASK_NAME> \
--data_config franka_triple_cam \
--embodiment_tag franka \
--dataset_path /home/hyunho_RCI/datasets/gr00t-rl/<TASK_NAME>/ \
--video_backend decord \
--modality_keys eef_position_delta eef_rotation_delta gripper_close \
--save_plot_path /home/hyunho_RCI/Isaac-GR00T/plots/<TASK_NAME>/eef_pose.png- run gr00t server
cd ~/Isaac-GR00T
python scripts/inference_service.py --server --model_path <GR00T_MODEL_PATH> --embodiment-tag franka --data-config franka_triple_cam --denoising-steps 4 --port {PORT_ID}-
change port id to {PORT_ID} in source/vla_lab/vla_lab/envs/direct_rl_gr00t_env.py
-
Train vla-rl policy
python scripts/reinforcement_learning/rl_games/train.py --task={TASK_NAME} --headless --enable_cameras --wandb-entity={YOUR_ENTITY} --wandb-project-name={WANDB_PROJECT_NAME} --wandb-name={RUN_NAME} --huggingface --repo_id={REPOSITORY_ID} --track- If your task is 'Automate', run this terminal input
python source/isaaclab_tasks/isaaclab_tasks/direct/automate/run_w_id.py --assembly_id=ASSEMBLY_ID \
--train --headless(e.g)
python scripts/reinforcement_learning/rl_games/train.py --task=VlaLab-VLA-Gr00t-Forge-PegInsert-Direct-v1 --headless --enable_cameras --wandb-entity=chohh7391-kyung-hee-university --wandb-project-name=VLA_RL-VLA_RL-gr00t --wandb-name=forge-peg_insert --huggingface --repo_id=bhe1004/VLA_RL-VLA_RL-gr00t-forge-peg_insert- transfer isaaclab demos to gr00t dataset
python datasets/gr00t-rl/utils/hdf5_to_gr00t_demo.py- record demos
python scripts/tools/record_demos.py --task VlaLab-BaseLine-Stack-IK-Rel-Gr00t-v0 --device cpu --teleop_device keyboard --dataset_file ./datasets/dataset.hdf5 --num_demos 10 --enable_cameras --low_pass_filter- replay demos
python scripts/tools/replay_demos.py --task VlaLab-BaseLine-Stack-IK-Rel-Gr00t-v0 --device cpu --dataset_file ./datasets/dataset.hdf5 --enable_cameras- augment dataset
# annotate
python scripts/imitation_learning/isaaclab_mimic/annotate_demos.py --device cpu --enable_cameras --task VlaLab-BaseLine-Stack-IK-Rel-Gr00t-Mimic-v0 --auto --input_file ./datasets/dataset.hdf5 --output_file ./datasets/annotated_dataset.hdf5 --headless
# augment
python scripts/imitation_learning/isaaclab_mimic/generate_dataset.py \
--device cpu --enable_cameras --headless --num_envs 10 --generation_num_trials 1000 \
--input_file ./datasets/annotated_dataset.hdf5 --output_file ./datasets/generated_dataset.hdf5