
Conversation

@samanklesaria (Collaborator) commented on Nov 28, 2025

What does this PR do?

This PR adds a guide that shows some common techniques for working with Flax models during optimization. These include the following (minimal sketches of each appear after this description):

  • Calculating exponential moving averages (EMA) of parameters
  • Optimizing only a low-rank addition to certain weights (LoRA)
  • Using different learning rates for different parameters to implement the maximal update parameterization (muP)
  • Using second-order optimizers such as L-BFGS
  • Specifying sharding for optimizer state that differs from the sharding of the parameters
  • Accumulating gradients over multiple micro-batches

This is a work in progress: the guide will be fleshed out much further over time.

The guide emphasizes a style as close to pure JAX as possible: to that end, it shows how the Flax version of each technique requires only minor deviations from the often more intuitive pure-JAX version.
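
For instance, here is a minimal EMA sketch in that pure-JAX style (the toy linear model, `loss_fn`, and the decay value are all illustrative):

```python
import jax
import jax.numpy as jnp
import optax

# Toy setup (illustrative): a linear model trained with Adam.
def loss_fn(params, batch):
    x, y = batch
    pred = x @ params["w"] + params["b"]
    return jnp.mean((pred - y) ** 2)

tx = optax.adam(1e-3)
params = {"w": jnp.zeros((3, 1)), "b": jnp.zeros((1,))}
opt_state = tx.init(params)
ema_params = params  # start the EMA at the initial parameters
decay = 0.999

@jax.jit
def train_step(params, ema_params, opt_state, batch):
    grads = jax.grad(loss_fn)(params, batch)
    updates, opt_state = tx.update(grads, opt_state, params)
    params = optax.apply_updates(params, updates)
    # Blend the EMA toward the fresh parameters, leaf by leaf.
    ema_params = jax.tree.map(
        lambda e, p: decay * e + (1 - decay) * p, ema_params, params)
    return params, ema_params, opt_state
```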
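
A LoRA sketch in the same spirit: the pretrained weight `W` is closed over and frozen, and gradients flow only through the rank-`r` factors (all shapes and names are hypothetical):

```python
import jax
import jax.numpy as jnp

d, r = 512, 8
W = jax.random.normal(jax.random.key(0), (d, d))  # frozen pretrained weight

# Standard LoRA init: B starts at zero, so W + A @ B == W initially.
lora = {
    "A": 0.01 * jax.random.normal(jax.random.key(1), (d, r)),
    "B": jnp.zeros((r, d)),
}

def apply(lora, x):
    # Only the low-rank factors are trainable; W is closed over and frozen.
    return x @ (W + lora["A"] @ lora["B"])

x = jnp.ones((4, d))
grads = jax.grad(lambda l: jnp.sum(apply(l, x)))(lora)  # grads only for A, B
```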
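
For per-parameter learning rates, `optax.multi_transform` routes each parameter leaf to its own optimizer. The labeling rule below is a stand-in for illustration, not a complete muP recipe:

```python
import jax
import optax

width = 512  # hypothetical hidden width

def label_fn(params):
    # Label each leaf by its path: kernels get the width-scaled rate.
    return jax.tree_util.tree_map_with_path(
        lambda path, _: "hidden" if "kernel" in jax.tree_util.keystr(path)
        else "rest",
        params)

tx = optax.multi_transform(
    {"hidden": optax.adam(1e-3 / width),  # width-scaled learning rate
     "rest": optax.adam(1e-3)},
    label_fn)
```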
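
L-BFGS needs the loss value and a line search at each step, so its `update` takes extra arguments. This sketch follows the pattern documented for recent Optax releases and may need adapting to your version:

```python
import jax.numpy as jnp
import optax

def loss_fn(params):
    return jnp.sum((params - 3.0) ** 2)  # toy objective

params = jnp.zeros((5,))
tx = optax.lbfgs()
opt_state = tx.init(params)
value_and_grad = optax.value_and_grad_from_state(loss_fn)

for _ in range(10):
    value, grad = value_and_grad(params, state=opt_state)
    # L-BFGS consumes the value and gradient, and re-evaluates loss_fn
    # internally during its line search.
    updates, opt_state = tx.update(
        grad, opt_state, params, value=value, grad=grad, value_fn=loss_fn)
    params = optax.apply_updates(params, updates)
```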
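
For optimizer state whose sharding differs from the parameters', one option is to place the state explicitly after initialization. A sketch, assuming the leading dimensions of the state leaves divide the device count (the toy parameters are illustrative):

```python
import jax
import jax.numpy as jnp
import numpy as np
import optax
from jax.sharding import Mesh, NamedSharding, PartitionSpec as P

mesh = Mesh(np.array(jax.devices()), axis_names=("devices",))
replicated = NamedSharding(mesh, P())
row_sharded = NamedSharding(mesh, P("devices"))  # shard the leading axis

params = {"w": jnp.ones((8, 4))}             # toy params, kept replicated
params = jax.device_put(params, replicated)

tx = optax.adam(1e-3)

def place(leaf):
    # Shard non-scalar state (the Adam moments) along the leading axis;
    # scalar leaves such as the step count stay replicated.
    return jax.device_put(
        leaf, row_sharded if jnp.ndim(leaf) > 0 else replicated)

opt_state = jax.tree.map(place, tx.init(params))
```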
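
Finally, gradient accumulation drops in as a wrapper around any Optax optimizer via `optax.MultiSteps`:

```python
import jax.numpy as jnp
import optax

params = {"w": jnp.zeros((3,))}

# Accumulate gradients over 4 micro-batches, then apply one real update.
tx = optax.MultiSteps(optax.adamw(1e-3), every_k_schedule=4)
opt_state = tx.init(params)
# tx.update has the usual GradientTransformation interface; on 3 of every
# 4 calls it returns zero updates while accumulating, and on the 4th it
# returns the update computed from the averaged gradients.
```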

@review-notebook-app commented

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.


samanklesaria force-pushed the opt_cookbook branch 3 times, most recently from c495dc1 to b929529 on December 1, 2025 at 23:53
samanklesaria force-pushed the opt_cookbook branch 4 times, most recently from d3f39f9 to 34d7c20 on December 9, 2025 at 22:26