Skip to content

Use sharded_state_dict_default in MLP.sharded_state_dict#4693

Open
gdengk wants to merge 2 commits into
NVIDIA:mainfrom
gdengk:gdeng/main-pr-4325-activation-mlp-state-dict
Open

Use sharded_state_dict_default in MLP.sharded_state_dict#4693
gdengk wants to merge 2 commits into
NVIDIA:mainfrom
gdengk:gdeng/main-pr-4325-activation-mlp-state-dict

Commits

Commits on May 8, 2026

Commits on May 12, 2026