FANS: Flow-based Analysis of Noise Shift

Overview

FANS (Flow-based Analysis of Noise Shift) is a framework for detecting and analyzing distributional shifts in causal systems using normalizing flows. Built on the foundation of Causal Normalizing Flows, FANS extends the methodology to identify whether observed distribution changes are due to functional shifts (changes in causal mechanisms) or noise shifts (changes in noise distributions).

Key Features

Shift Detection: Automatically identifies which variables have undergone distributional shifts between environments
Shift Type Classification: Distinguishes between function shifts and noise shifts

Installation

Prerequisites

Create a new conda environment with Python 3.9.12:

conda create --name fans python=3.9.12 --no-default-packages

Activate the conda environment:

conda activate fans

Install Dependencies

Install PyTorch and related packages:

pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 --extra-index-url https://download.pytorch.org/whl/cu117

Install additional requirements:

pip install -r requirements.txt

Quick Start

Train a FANS Model

Train a FANS model on a synthetic dataset with 10 nodes and Erdős-Rényi (ER) graph structure:

CUDA_VISIBLE_DEVICES=0 python main.py \
    --config_file causal_nf/configs/data_small/nodes_10/ER/causal_nf_nodes_10_ER_adj_1.yaml \
    --wandb_mode offline \
    --project causal_nf

What this does:

Trains a causal normalizing flow on environment 1 data
Evaluates shift detection performance on environment 2 data
Saves results to results/ directory
Generates visualizations of detected shifts

FANS Method

The FANS (Flow-based Analysis of Noise Shift) method leverages trained causal normalizing flows to detect and classify distributional shifts between two environments.

Core Methodology

Training: Learn a causal normalizing flow on environment 1 data that maps observations X to noise variables Z following the causal graph structure
Shift Detection: Transform environment 2 data through the learned flow and test for independence violations in the noise space
Statistical Testing: Use distance correlation and independence tests to identify shifted variables
Visualization: Generate comparative plots showing distributional differences

Data Structure

Synthetic Data

Synthetic datasets are organized by node count and graph type:

data/data_small/
├── nodes_10/
│   ├── ER/          # Erdős-Rényi random graphs
│   │   ├── adj_1.npy           # Adjacency matrix
│   │   ├── data_env1_1.npy     # Environment 1 data
│   │   ├── data_env2_1.npy     # Environment 2 data
│   │   └── metadata_1.json     # Shift information
│   └── SF/          # Scale-free graphs
├── nodes_20/
├── nodes_30/
├── nodes_40/
└── nodes_50/

Real Datasets

Morpho-MNIST: Located in data/morpho_mnist/
Sachs: Located in data/sachs/

Running Experiments

Training FANS Model

Basic Usage

python main.py \
    --config_file <CONFIG_PATH> \
    --wandb_mode <MODE> \
    --project <PROJECT_NAME>

Example: Train on 30-node scale-free graph

CUDA_VISIBLE_DEVICES=1 python main.py \
    --config_file causal_nf/configs/data_small/nodes_30/SF/causal_nf_nodes_30_SF_adj_5.yaml \
    --wandb_mode online \
    --project fans_experiments

Running Baseline Methods

Run baseline shift detection methods for comparison:

python experiments/experiment_script.py --model <MODEL_NAME> [OPTIONS]

Available Models:

splitkci: Kernel Conditional Independence Test
prediter: PreDITEr method
iscan: Independence-based shift detection
linearccp: Linear CCP
gpr: Gaussian Process Regression

Options:

Option	Description	Default
`--nodes`	Node counts to process (space-separated)	10 20 30 40 50
`--gpu`	GPU device ID (-1 for CPU)	-1
`--output_dir`	Results directory	auto-generated
`--config_type`	Graph type filter (ER, SF, all)	all
`--dataset_indices`	Dataset range (e.g., "1-30")	all

Examples

Run SplitKCI on all node sizes, only first 5 datasets:

python experiments/experiment_script.py \
    --model splitkci \
    --dataset_indices "1-5" \
    --gpu 0

Run ISCAN on CPU for SF graphs:

python experiments/experiment_script.py \
    --model iscan \
    --config_type SF \
    --gpu -1

Results and Analysis

Analysis

Generate unified results CSV:

python experiments/analysis/analysis.py

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
causal_nf		causal_nf
data		data
experiments		experiments
torchlikelihoods		torchlikelihoods
zuko		zuko
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
auto_queue_experiments.sh		auto_queue_experiments.sh
fans.py		fans.py
fans_ablation.py		fans_ablation.py
main.py		main.py
main_ablation.py		main_ablation.py
poster.ipynb		poster.ipynb
requirements.txt		requirements.txt
run_experiments.sh		run_experiments.sh
run_sweep.sh		run_sweep.sh
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

FANS: Flow-based Analysis of Noise Shift

Overview

Key Features

Installation

Prerequisites

Install Dependencies

Quick Start

Train a FANS Model

FANS Method

Core Methodology

Data Structure

Synthetic Data

Real Datasets

Running Experiments

Training FANS Model

Basic Usage

Example: Train on 30-node scale-free graph

Running Baseline Methods

Examples

Results and Analysis

Analysis

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

FANS: Flow-based Analysis of Noise Shift

Overview

Key Features

Installation

Prerequisites

Install Dependencies

Quick Start

Train a FANS Model

FANS Method

Core Methodology

Data Structure

Synthetic Data

Real Datasets

Running Experiments

Training FANS Model

Basic Usage

Example: Train on 30-node scale-free graph

Running Baseline Methods

Examples

Results and Analysis

Analysis

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages