fix: prevent duplicate data when using multiple DataLoader workers by AmSach · Pull Request #169 · NTMC-Community/MatchZoo-py

AmSach · 2026-05-11T00:46:51Z

Fixed the bug described in issue #150.

What was wrong

When using num_workers > 0 in DataLoader, each worker process iterates over ALL batches instead of a distinct subset. This causes the model to train on duplicate data multiple times per epoch (e.g., with 30 workers, each sample is processed 30 times per epoch instead of once).

How I fixed it

Dataset.init: Added num_workers parameter and _worker_id tracking
DataLoader.init: Creates a worker_init_fn that sets the worker_id on the Dataset for each subprocess
Dataset.iter: Now partitions batches across workers so each worker processes only 1/num_workers of the batches

Testing

Syntax checks pass on both modified files
Import tests pass

Closes #150

When using num_workers > 0 in DataLoader, each worker iterates over ALL batches instead of a distinct subset, causing the model to train on duplicate data multiple times per epoch. Fix: - Dataset now accepts num_workers parameter to partition batches - DataLoader passes num_workers to Dataset and creates worker_init_fn that sets worker_id on the Dataset for each subprocess - Dataset.__iter__ now partitions batches across workers so each worker processes a distinct subset This ensures each worker handles 1/num_workers of the batches, eliminating duplicate training data.

AmSach requested review from Chriskuei and caiyinqiong as code owners May 11, 2026 00:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: prevent duplicate data when using multiple DataLoader workers#169

fix: prevent duplicate data when using multiple DataLoader workers#169
AmSach wants to merge 1 commit into
NTMC-Community:masterfrom
AmSach:fix/dataloader-duplicate-data-with-num-workers

AmSach commented May 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

AmSach commented May 11, 2026

What was wrong

How I fixed it

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant