BAGEL World Model

This repository contains the BAGEL-World-Model code used by VLA-MBPO.

Installation

This repository is intended to use an independent uv environment from the VLARLKit root environment. You need to ensure VLARLKit has already installed.

cd VLARLKit/third_party/
git submodule update --init "BAGEL"
uv sync
uv pip install flash_attn==2.5.8 --no-build-isolation

If you encounter any compilation errors when installing flash_attn, you can install a precompiled wheel instead of building from source.

Go to the official release page:
https://github.com/Dao-AILab/flash-attention/releases
Download the wheel that matches your environment, typically:
- CUDA 12.4
- PyTorch 2.5
- Python 3.10 (cp310)
After downloading the wheel, install it with:
```
uv pip install <wheel_file>
```

Required Artifacts

The model checkpoints and datasets are not bundled in this repository yet. Before running the full training/inference workflow, prepare the following local paths:

Artifact	Description	Download Link
Bagel-WM-ckpt	Fine-tuned world model checkpoints	Coming soon
Datasets for finetuning BAGEL	LIBERO and LeRobot datasets for finetuning BAGEL as WMs	Coming soon
Datasets for branch rollouts	Datasets for performing branch rollouts to train MBRL	Coming soon

World-Model Training

World-model training scripts are under scripts/:

bash scripts/train_libero.sh

Training VLAs with World Models (VLA-MBPO)

First, set each loading path in config file VLARLKit/examples/configs/libero_goal_vla_mbpo.yaml,

branch_dataset_root: <your download data dir>

model:
  model_path: "<your download path>/RLinf-Pi05-LIBERO-SFT"
  data:
    assets_dir: "<your download path>/RLinf-Pi05-LIBERO-SFT"

world_model:
  load_model_path: <your download world model ckpt path>

Now, you can lanuch the script to run!

# assume you are at VLARLKit root path
bash examples/run_vla_mbpo.sh

Generation Demos

Citation

If you find this code useful, please cite:

@article{zhang2026vlambpo,
  title={Towards Practical World Model-based Reinforcement Learning for Vision-Language-Action Models},
  author={Zhang, Zhilong and Ren, Haoxiang and Sun, Yihao and Sheng, Yifei and Wang, Haonan and Lin, Haoxin and Wu, Zhichao and Bacon, Pierre-Luc and Yu, Yang},
  journal={arXiv preprint arXiv:2603.20607},
  year={2026}
}

Acknowledgements

This codebase builds on Bagel, UniPlan stack. We thank the authors and maintainers of these projects.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
assets		assets
bagel		bagel
scripts		scripts
train		train
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BAGEL World Model

Installation

Required Artifacts

World-Model Training

Training VLAs with World Models (VLA-MBPO)

Generation Demos

Citation

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BAGEL World Model

Installation

Required Artifacts

World-Model Training

Training VLAs with World Models (VLA-MBPO)

Generation Demos

Citation

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages