Synthetic benchmark by ArcaneEmergence · Pull Request #62 · SchubertLab/DextraDemixer

ArcaneEmergence · 2026-06-19T10:06:08Z

Major changes in simulation and the model:

Model changes

size factor
clonotype median aggregation
memory efficient credible interval FDR control
Save and load model using pickle (issues with version compatibility of jax)
extreme outlier removal
kmeans outlier handling now simply takes the three highest points, if not already clustered

Simulator

Component variance now sampled in relationship to mean
Assure sampled fraction of binder, and outlier proportion is close to user specified parameter

Snakemake

Improved snakemake pipeline

Figure notebook

Added notebooks to reproduce figures
Added minimal set of csv files to recreate Figure 2

… due to computation and memory problems.

…ve and show figure optional, 3. legend frameon off and add titles, 4. changed indentation to spaces

…timized for HPC to avoid spawning many small jobs, instead uses multiprocessing to run in parallel. 2. Additional logging of figs and metrics 3. Changed folder structure depending on scenario and configuration in a yaml

…eriment/synth_bench

…rsion

…hip to mean

…d outlier mask vector to final mdata

…atio

- size factor calculation - alpha offset - incorporating negative control in a more flexible way - exponential lr decay - flag to log performance during training - various clonotype info incorporations - small improvements of plotting

…ab/DextraDemixer into experiment/synth_bench

… training data if zvalue > 100, but keep it for prediction

….scan, minor cleanup

…cpu_model`

…les for a minimal and full setup.

Copilot

Pull request overview

This PR introduces a revamped “synthetic benchmark” workflow (Snakemake + Slurm wrappers) and updates core simulation/model utilities to support new benchmarking requirements (e.g., variance sampling tied to mean, outlier handling, clonotype aggregation, and model pickling).

Changes:

Replace legacy synthetic benchmark snakefiles/slurm scripts with a scenario-driven snakefile_benchmark.smk + slurm_benchmark.sh and per-scenario YAML configs.
Add new simulator behavior (variance-from-mean model; tighter control of binder/outlier proportions) and new benchmark runner scripts (simulate_data.py, updated run_dextrademixer.py, new run_beam.py).
Extend core library utilities and the DextraDemixer model (metrics aggregation helpers, new preprocessing knobs, saving/loading via pickle, and updates to posterior/FDR logic).

Reviewed changes

Copilot reviewed 35 out of 41 changed files in this pull request and generated 16 comments.

Show a summary per file

File	Description
experiments/synthetic_benchmark/snakefile_run_timing	Removed legacy timing workflow snakefile
experiments/synthetic_benchmark/snakefile_run_simulation	Removed legacy simulation workflow snakefile
experiments/synthetic_benchmark/snakefile_benchmark.smk	New scenario-driven benchmark workflow (simulate → run tools → aggregate)
experiments/synthetic_benchmark/slurm_run_timing.sh	Removed legacy Slurm wrapper
experiments/synthetic_benchmark/slurm_run_simulation.sh	Removed legacy Slurm wrapper
experiments/synthetic_benchmark/slurm_benchmark.sh	New unified Slurm submission wrapper for benchmark scenarios
experiments/synthetic_benchmark/simulate_data.py	New CLI entrypoint for simulation generation
experiments/synthetic_benchmark/run_dextramixerkmeans.py	Removed legacy runner
experiments/synthetic_benchmark/run_dextramixer.py	Removed legacy runner
experiments/synthetic_benchmark/run_dextrademixer.py	Updated runner to new model/config + richer metric logging
experiments/synthetic_benchmark/run_beamt.py	Removed legacy runner
experiments/synthetic_benchmark/run_beam.py	New BEAM runner producing benchmark CSV outputs
experiments/synthetic_benchmark/environment.yaml	Removed old per-experiment conda environment
experiments/synthetic_benchmark/create_data_mean_variance_fold_increase.py	Removed old simulation script
experiments/synthetic_benchmark/benchmarks/synth_benchmark/config.yaml	New benchmark scenario configuration
experiments/synthetic_benchmark/benchmarks/scaling/config.yaml	New scaling scenario configuration
experiments/synthetic_benchmark/benchmarks/dropout/config.yaml	New dropout scenario configuration
experiments/synthetic_benchmark/aggregate_results.py	Simplified aggregation via shared `aggregate_csv` utility
experiments/hyperparameter_tuning/snakefile_run_optuna_multi_at_once	Removed legacy optuna workflow
experiments/hyperparameter_tuning/snakefile_run_optuna	Removed legacy optuna workflow
experiments/hyperparameter_tuning/slurm_run_optuna_multi_at_once.sh	Removed legacy optuna Slurm wrapper
experiments/hyperparameter_tuning/optuna_dextrademixer.py	Removed legacy optuna driver
experiments/hyperparameter_tuning/environment_optuna.yaml	Removed legacy optuna environment
experiments/hyperparameter_tuning/create_data_mean_variance_fold_increase.py	Removed legacy simulation script
experiments/hyperparameter_tuning/aggregate_results.py	Removed legacy aggregation script
experiments/.slurm/config.yaml	Updated Slurm profile submission template/resources
experiments/.slurm_one_node/status.py	New status checker script for one-node profile
experiments/.slurm_one_node/config.yaml	New one-node Slurm profile
environment_minimal.yaml	New pinned “minimal” environment for benchmarks/tests
environment_full.yaml	New pinned “full” environment export
env_minimal.def	New Apptainer definition for minimal env
env_full.def	New Apptainer definition for full env
dextrademixer/utils/utils.py	Added shared aggregation + metrics + Slurm helpers
dextrademixer/utils/simulation.py	Updated simulator: variance sampling + binder/outlier proportion control
dextrademixer/model/Dextrademixer.py	Model enhancements: size factors, outlier filtering, save/load, posterior logic changes
.gitignore	Updated ignore patterns for experiment outputs/assets

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+    output:
+        protected(
+            "benchmarks/{scenario}/csv/Dextra{model_config}-"  # wildcard cannot be empty, therefore have to use a small hack here
+            "{N},0.4,{po},{p},False,{mean_inc},None,{i}.csv",
+        )
+    params:
+        neg_ctrl_key=lambda wc: "neg_control" if wc.model_config == "Demixer+neg." else "None"
+    resources:


+    resources:
+        c=1,
+        mem="8000M",
+        node="",
+        qos="cpu_preemptible" if PREEMPTIBLE else "cpu_normal",


        clone = None if c is None else jnp.array(c, dtype=INT_DTYPE)
-        self.data = {"x": jnp.array(x, dtype=INT_DTYPE),
-                     "s": None if s is None else jnp.array(s, dtype=FLOAT_DTYPE),
-                     "x_neg": None if neg_cont is None else jnp.array(neg_cont, dtype=FLOAT_DTYPE),
-                     "clone": clone,
-                     # If clone is not contiuous, then there will be problems with indexing
-                     "clone_continuous": None if clone is None else jnp.searchsorted(jnp.unique(clone), clone),
-                     "sigma": None if sigma is None else jnp.array(sigma, dtype=FLOAT_DTYPE),
+        zscore = jnp.abs((x - jnp.mean(x)) / jnp.std(x))
+        outlier_threshold = 100 # TODO Hardcoded
+        # With outliers
+        self.data_full = {"x": jnp.array(x, dtype=INT_DTYPE),
+                          "s": None if s is None else jnp.array(s, dtype=FLOAT_DTYPE),
+                          "x_neg": None if neg_cont is None else jnp.array(neg_cont, dtype=FLOAT_DTYPE),
+                          "clone": clone,
+                          # If clone is not contiuous, then there will be problems with indexing
+                          "clone_continuous": None if clone is None else jnp.searchsorted(jnp.unique(clone), clone),
+                          "sigma": None if sigma is None else jnp.array(sigma, dtype=FLOAT_DTYPE),
+                          }
+        # Without outliers
+        self.data = {"x": jnp.array(x[jnp.where(zscore < outlier_threshold)], dtype=INT_DTYPE),
+                     "s": jnp.array(s[jnp.where(zscore < outlier_threshold)], dtype=FLOAT_DTYPE) if s is not None else None,
+                     "x_neg": jnp.array(neg_cont[jnp.where(zscore < outlier_threshold)], dtype=FLOAT_DTYPE) if neg_cont is not None else None,
+                     "clone": jnp.array(clone[jnp.where(zscore < outlier_threshold)], dtype=INT_DTYPE) if clone is not None else None,
+                     "clone_continuous": None if clone is None else jnp.searchsorted(jnp.unique(clone), clone[jnp.where(zscore < outlier_threshold)]),
+                     "sigma": None if sigma is None else jnp.array(sigma, dtype=FLOAT_DTYPE)[jnp.where(zscore < outlier_threshold)],
                     }


        super().preprocess_model_data(x=x, s=s, neg_cont=neg_cont, c=c, sigma=sigma, mode=mode,
                                      alpha_model=alpha_model, **kwargs)


+    def fit_svi(self, guide='normal', svi_config: Dict[str, Union[int, float]] = None,
                nof_inits: int = 100, use_minimal_loss: bool = True, rng_key: int = 998777,
-                return_loss: bool = False) \
-            -> az.InferenceData:
+                y_true: Array = None) \
+                -> az.InferenceData:


+    """
+    Sample a realistic variance given a mean using the fitted power-law model:
+        log(var) = a + b*log(mean) + Normal(0, resid_std^2)
+
+    Args:
+        mean : float or np.ndarray
+            Mean(s) at which to sample the variance. Must be > 0; broadcasting allowed.
+        a : float, default 2.0221541172111164
+            Proportionality constant (exp(intercept) from log–log OLS).
+        b : float, default 1.6969075027280063
+            Scaling exponent (slope from log–log OLS).
+        resid_std : float, default 0.31049623532404225
+            Residual standard deviation on the *log-variance* scale (σ from OLS residuals).
+        rng : int | np.random.RandomState, default 42
+            Source of randomness. If int, used as the seed. If None, uses SciPy/Numpy default RNG.
+    Returns:
+        float or np.ndarray
+            A sample of variance values with the same broadcasted shape as `mean`.
+    """


+        if use_size_factor:
+            pmhc_list = use_size_factor if isinstance(use_size_factor, list) else mdata[gex_key].var_names.tolist()
+            x_plus = jnp.array(gex[:, pmhc_list].X.toarray(),
+                               dtype=FLOAT_DTYPE)  # only used for size factor calculation
+            s = self.calculate_size_factors(x_plus)
+            del x_plus
+        else:
+            s = jnp.ones(x.shape[0], dtype=FLOAT_DTYPE)
+
        self._check_parameters(x, x_neg, c, sigma)
-        self.model.preprocess_model_data(x=x, neg_cont=x_neg, c=c, sigma=sigma, mode=self.mode,
-                                         alpha_model=self.alpha_model, **kwargs)
+        self.model.preprocess_model_data(x=x, s=s, neg_cont=x_neg, c=c, sigma=sigma, mode=self.mode,
+                                         alpha_model=self.alpha_model, outlier_threshold=outlier_threshold, **kwargs)


ArcaneEmergence added 30 commits October 14, 2025 13:10

cleanup unused files

5217200

feature: Use AutoNormal guide instead of AutoMultivariateNormal guide…

642b983

… due to computation and memory problems.

refactor: renaming variables of posterior bfdr thresholding

a3d72f9

figures: small improvements on figures: 1. shorter titles, 2. make sa…

a3878f0

…ve and show figure optional, 3. legend frameon off and add titles, 4. changed indentation to spaces

refactor: cleanup unused files

2dee937

update cluster params

5f25647

feature: return mean_over_cell posterior parameters

b227af7

add estimate_sim_params.ipynb

d1bb1ad

refactor: Remove duplicate line

1c53378

feature: if y_true is None, create dummy zero values

ea58ad4

feature: save and load models

e69d361

feature: save and load models

bcc96dd

Merge remote-tracking branch 'origin/experiment/synth_bench' into exp…

31f0a4c

…eriment/synth_bench

feature: parallelize BEAMT

e610cac

refactor: rename mixer to model

8f579ec

feature: sample variance based on mean, instead of sampling overdispe…

3d20156

…rsion

fix jax, jaxlib and numpyro versions

aa7ae48

scenario config: Update to version with variance sampled in relations…

ff646cd

…hip to mean

feature: add flags to sbatch script and snakemake file

36a39a8

feature: resample so that sampled N_binder / N_total ~ p_binding_ratio

7b0472d

bugfix: Use noise mean for binder outliers instead of binder mean. Ad…

43a951b

…d outlier mask vector to final mdata

feature: ensure real outlier ratio roughly matches the specified ratio

211ce03

feature: resample binding assignment to get close to target binding r…

30a4181

…atio

feature: Add MCC

eb04360

feature: Multiple enhancements

2a66d7c

- size factor calculation - alpha offset - incorporating negative control in a more flexible way - exponential lr decay - flag to log performance during training - various clonotype info incorporations - small improvements of plotting

update .gitignore

0f6c8a7

Merge branch 'experiment/synth_bench' of https://github.com/SchubertL…

1e1878b

…ab/DextraDemixer into experiment/synth_bench

feature: remove kmeans outlier threshold, instead remove outlier from…

915d50b

… training data if zvalue > 100, but keep it for prediction

feature: float_or_none for argparse

b1bb15a

ArcaneEmergence added 11 commits March 16, 2026 17:49

experiment: Gemuend 2025 CMV data

a2950de

feature: save BEAMT results to csv

61854be

Update Figure design

9fa73ef

feature: clonotype median aggregation model

155e8d5

feature: memory-efficient _predict_posterior_class_dist through lax…

3eae588

….scan, minor cleanup

feature: add to utils.py mean_ci_t_interval, aggregate_csv, `get_…

3596302

…cpu_model`

cleanup: remove hyperparameter tuning

a7b6fca

cleanup: synthetic_benchmark unused files

f42668b

feature: new way of running synthetic benchmark by using apptainer

30b1f27

feature: update slurm config

269ab9f

Figure: Add notebooks for figure plotting and add environment.yaml fi…

331c37e

…les for a minimal and full setup.

ArcaneEmergence requested a review from Copilot June 19, 2026 10:06

Copilot started reviewing on behalf of ArcaneEmergence June 19, 2026 10:06 View session

Copilot AI reviewed Jun 19, 2026

View reviewed changes

ArcaneEmergence marked this pull request as ready for review June 19, 2026 11:05

ArcaneEmergence merged commit 355267a into main Jun 19, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Synthetic benchmark#62

Synthetic benchmark#62
ArcaneEmergence merged 41 commits into
mainfrom
experiment/synth_bench

ArcaneEmergence commented Jun 19, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		super().preprocess_model_data(x=x, s=s, neg_cont=neg_cont, c=c, sigma=sigma, mode=mode,
		alpha_model=alpha_model, **kwargs)

Uh oh!

Conversation

ArcaneEmergence commented Jun 19, 2026

Model changes

Simulator

Snakemake

Figure notebook

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants