fix(trainer): add mixed_precision_dtype parameter to DefaultTrainer by chuyaowang · Pull Request #718 · constantinpape/torch-em

chuyaowang · 2026-06-11T14:08:37Z

The PRs mentioned in the issue

Allows callers to select the autocast dtype for mixed precision training instead of the hardcoded torch.float16. Defaults to torch.float16 to preserve existing behaviour.

- Add `mixed_precision_dtype: torch.dtype = torch.float16` parameter to `DefaultTrainer.__init__` with corresponding docstring entry
- Store `self.mixed_precision_dtype` as an instance attribute so the Serializer can round-trip it through checkpoints
- Create GradScaler only when dtype is float16; bfloat16 has fp32 range and does not require gradient scaling
- Pass `dtype=self.mixed_precision_dtype` to the autocast context in `_train_epoch_mixed` and `_validate_mixed`
- Select `_backprop` (no scaler) vs `_backprop_mixed` (with scaler) based on whether `self.scaler` is None, driven by the chosen dtype
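
For reviewers, the dtype-driven scaler selection described above can be sketched roughly as follows. This is a minimal standalone illustration, not the code in this PR: `make_scaler` and `train_step` are hypothetical helper names, and the use of `torch.cuda.amp.GradScaler` is an assumption about which scaler class the trainer relies on.

```python
import torch


def make_scaler(mixed_precision_dtype):
    """Hypothetical helper: create a GradScaler only for float16.

    bfloat16 shares the fp32 exponent range, so loss scaling is skipped.
    """
    if mixed_precision_dtype == torch.float16:
        # Assumption: the CUDA GradScaler is used. On a machine without
        # CUDA it warns and disables itself.
        return torch.cuda.amp.GradScaler()
    return None


def train_step(model, optimizer, loss_fn, x, y,
               mixed_precision_dtype, scaler, device_type="cuda"):
    """Hypothetical single training step with a selectable autocast dtype."""
    optimizer.zero_grad()
    # Forward pass and loss run under autocast with the chosen dtype.
    with torch.autocast(device_type=device_type, dtype=mixed_precision_dtype):
        loss = loss_fn(model(x), y)
    if scaler is None:
        # Plain backprop (the `_backprop` path in the description).
        loss.backward()
        optimizer.step()
    else:
        # Scaled backprop (the `_backprop_mixed` path in the description).
        scaler.scale(loss).backward()
        scaler.step(optimizer)
        scaler.update()
    return loss.item()
```

With `torch.bfloat16` the scaler is `None`, so the step takes the unscaled path. With the default `torch.float16` the existing scaled path is kept.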

This was referenced Jun 11, 2026

fix(training): auto-select bfloat16 on GPUs without Tensor Cores computational-cell-analytics/micro-sam#1234

Open

Finetuning microsam AIS (continued) computational-cell-analytics/micro-sam#1214

Open

chuyaowang wants to merge 1 commit into constantinpape:main from chuyaowang:fix/mixed-precision-dtype
