
Separating the function to loop over data from the one to create batches for training #92

Merged

anagainaru merged 5 commits into main from loop-batch on May 9, 2026

Conversation

anagainaru (Collaborator) commented on Mar 10, 2026:

Summary

The current behavior is to have a single function, get_cur_data_loaders, that returns the dataloaders used both for looping during inference and for creating batches for training when doing continual learning. This PR separates these into two separate functions.

Motivation

Having two functions allows us to control the granularity of the drift detectors during streaming (e.g. looking at the data element by element) without impacting the training (which should use the same batch size as the original training).

API / CLI Changes

!! This PR changes our current API !!

```diff
diff --git a/.claude/skills/new-harness/SKILL.md b/.claude/skills/new-harness/SKILL.md
index 0940025..1a4b9c9 100644
@@ -64,11 +64,14 @@ class <Name>Harness(BaseModelHarness):
         # Rebuild data loaders with new transforms
         # Track augmentation history for replay

-    def get_cur_data_loaders(self) -> Tuple[DataLoader, DataLoader]:
+    def get_stream_dataloader(self) -> DataLoader:
+        # Return data_loader for current data
+
+    def get_train_dataloaders(self) -> Tuple[DataLoader, DataLoader]:
         # Return (train_loader, val_loader) for current data
         # Call _dispose_current_loaders() first if loaders exist

-    def get_hist_data_loaders(self) -> Tuple[Optional[DataLoader], Optional[DataLoader]]:
+    def get_hist_dataloaders(self) -> Tuple[Optional[DataLoader], Optional[DataLoader]]:
         # Return historical data loaders for CL replay
         # Return (None, None) when task_counter == 1
```
Introduced a new function def get_stream_dataloader(self) that returns the data loader over the stream of data, using a different batch size than the training (model harnesses can use the data side of the configuration; by default this is set to 1, since each item in the stream should be analyzed in isolation). A minimal sketch of a harness implementation is shown below.
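
As a rough illustration only, an implementation in a harness might look like this; `self.config` and `self.stream_dataset` are assumed attribute names for the sketch, not part of the actual API:

```python
from torch.utils.data import DataLoader

class MNISTHarness(BaseModelHarness):  # BaseModelHarness as in SKILL.md above
    def get_stream_dataloader(self) -> DataLoader:
        # Batch size comes from the data side of the configuration;
        # it defaults to 1 so each stream element is analyzed in isolation.
        # self.config / self.stream_dataset are illustrative names.
        batch_size = self.config.get("data", {}).get("batch_size", 1)
        return DataLoader(self.stream_dataset,
                          batch_size=batch_size,
                          shuffle=False)
```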

The self.get_cur_data_loaders() is renamed to def get_train_dataloaders(self) and, as before, returns the training and validation loaders used by the continual learning. It uses the batch_size specified in the training side of the configuration.

The get_hist_data_loaders function is unchanged but renamed to get_hist_dataloaders to be consistent with the others.

Usage

The MNIST example has been updated to show how to use the new function.

ImageNet and CIFAR have only minimal changes to update to the new API, but the functionality did not change (the training and streaming loaders are exactly the same).
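
For context, a hedged sketch of one plausible pattern for using the two loaders together; `drift_detector` and `train_step` are placeholders, not names from this repository:

```python
# Illustrative driver loop: iterate the stream one element at a time and
# only build training batches when drift is detected.
stream_loader = harness.get_stream_dataloader()        # e.g. batch_size=1
for x, y in stream_loader:
    if drift_detector.update(x):                       # placeholder detector
        train_loader, val_loader = harness.get_train_dataloaders()
        train_step(model, train_loader, val_loader)    # placeholder trainer
```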

anagainaru requested a review from ScSteffen on Mar 10, 2026 at 16:45
ScSteffen (Collaborator) commented:

I think this is a necessary change, but very intrusive in the sense that all examples need to be adapted.

Do you have an example that runs currently, so I can validate the changes locally?

ScSteffen (Collaborator) commented:

I think with this change we can further simplify:

==> the loop loader only needs a val_loader, since we are not training on this data

anagainaru (Collaborator, Author) commented:

> I think this is a necessary change, but very intrusive in the sense that all examples need to be adapted.
>
> Do you have an example that runs currently, so I can validate the changes locally?

You do not need to change the examples: if this second function is not implemented, we keep the current behavior and loop over the data using the training batch size.
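
One way such a fallback could be implemented (a sketch only, assuming BaseModelHarness provides a default get_stream_dataloader; the PR does not show the actual dispatch code):

```python
def _resolve_stream_loader(harness):
    # If the harness does not override get_stream_dataloader, fall back to
    # the old behavior: loop over the data with the training loader.
    if type(harness).get_stream_dataloader is BaseModelHarness.get_stream_dataloader:
        train_loader, _ = harness.get_train_dataloaders()
        return train_loader  # training batch size, as before
    return harness.get_stream_dataloader()
```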

anagainaru (Collaborator, Author) commented:

> I think with this change we can further simplify:
>
> ==> the loop loader only needs a val_loader, since we are not training on this data

Agreed, I will make the change and update MNIST so you can run with different batch sizes if you specify a data batch size in the toml.
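
A sketch of what that toml might look like; the section and key names here are assumptions, not confirmed by this PR:

```toml
[data]
batch_size = 1    # streaming granularity for the drift detectors

[training]
batch_size = 64   # batch size used to create continual-learning batches
```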

ScSteffen (Collaborator) left a review:


Functionality checked manually. Changes look good. Approved

anagainaru merged commit 6c48614 into main on May 9, 2026.
3 checks passed.
anagainaru deleted the loop-batch branch on May 9, 2026 at 01:11.