uDIAMOND Agent Training Notebook

This notebook sets up and runs the training loop for both DIAMOND or uDIAMOND model as part of our project. The goal is to evaluate how noise conditioning affects policy learning, visual fidelity, and computational efficiency in imagined rollouts.

What This Notebook Does

Clones the uDIAMOND repository:
- Repository: https://github.com/BillKG-exe/uDIAMOND
Installs required dependencies, including:
- torch, gym, gymnasium[atari], wandb, hydra-core, torcheval
Log in to Weights & Biases (wandb) for experiment tracking. The training information such as losses of the diffrent models involved as well as console logs are stored on wandb.
Runs DIAMOND training:
- Environment: BreakoutNoFrameskip-v4
- Noise conditioning parameter: noise_conditioning=true to setup DIAMOND architecture with noise conditioning or false to activate uDIAMOND architecture which removes noise conditioning.

How to Run

To run the uDIAMOND version (without noise conditioning), run the command as shown below:

!python3 src/main.py \
  "env.train.id=BreakoutNoFrameskip-v4" \
  "agent.denoiser.noise_conditioning=false"

Computation

Training both DIAMOND and uDIAMOND models requires substantial compute resources. Due to the use of diffusion models and the iterative denoising process involved in generating frame predictions, training is computationally intensive, therefore heavily relies on GPUs.

Running this notebook on a CPU-only environment is not recommended and will result in extremely slow training. We used Kaggle to run our notebook using their GPU T4 x 2. The training resulted in about 1 hour for 140. Training it on the orginal number of epochs as DIAMOND(1000 epochs) will take couple days.

Note

uDIAMOND is a variation of the DIAMOND model from Diffusion for World Modeling:
Visual Details Matter in Atari. The uDiamond model implements DIAMOND without noise conditioning.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
config		config
results/data		results/data
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
diamond.ipynb		diamond.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

uDIAMOND Agent Training Notebook

What This Notebook Does

How to Run

Computation

Note

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

uDIAMOND Agent Training Notebook

What This Notebook Does

How to Run

Computation

Note

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages