SpatialTranscriptFormer Framework

Warning

Work in Progress: This project is under active development. Core architectures, CLI flags, and data formats are subject to major changes.

Tip

Framework Release: SpatialTranscriptFormer has been restructured from a research codebase into a robust framework. You can now use the Python API to train on your own spatial transcriptomics data with custom backbones and architectures.

SpatialTranscriptFormer is a modular deep learning framework designed to bridge histology and biological pathways. It leverages transformer architectures to model the interplay between morphological features and gene expression signatures, providing interpretable mapping of the tissue microenvironment.

Python API: Quick Start

The framework is designed to be integrated programmatically into your scanpy/AnnData workflows:

from spatial_transcript_former import SpatialTranscriptFormer, Predictor, FeatureExtractor
from spatial_transcript_former.predict import inject_predictions

# 1. Load model and create feature extractor
model = SpatialTranscriptFormer.from_pretrained("./checkpoints/stf_small/")
extractor = FeatureExtractor(backbone="phikon", device="cuda")
predictor = Predictor(model, device="cuda")

# 2. Extract features from image patches
#    image_patches: (N, 3, 224, 224) float tensor in [0, 1]
#    coords:        (N, 2) tensor of spatial coordinates (from your WSI tiling)
features = extractor.extract_batch(image_patches, batch_size=64)  # → (N, 768)

# 3. Predict gene expression from extracted features
predictions = predictor.predict_wsi(features, coords)  # → (1, G)

# 4. Integrate with Scanpy
inject_predictions(adata, coords, predictions[0], gene_names=model.gene_names)

For more details, see the Python API Reference.

Key Technical Pillars

Modular Architecture: Decoupled backbones, interaction modules, and output heads.
Quad-Flow Interaction: Configurable attention between Pathways and Histology patches (p2p, p2h, h2p, h2h).
Pathway Bottleneck: Interpretable gene expression prediction via 50 MSigDB Hallmark tokens.
Spatial Pattern Coherence: Optimized using a composite MSE + PCC (Pearson Correlation) loss.
Foundation Model Ready: Native support for CTransPath, Phikon, Hibou, and GigaPath.

License

This project is protected by a Proprietary Source Code License. See the LICENSE file for full details.

✅ Permitted: Evaluation for employment, Academic Research, and Non-Profit use.
🤝 For-Profit Use: Permitted only under a negotiated agreement with the author.
- Note on Foundation Models: This architecture is backbone-agnostic. Any negotiated commercial agreement covers only the SpatialTranscriptFormer source code and IP. It does not grant commercial rights to use restricted third-party weights (e.g., CTransPath, Phikon). To use this system commercially, you must select a foundation model with a compatible open or commercial license (e.g., Hibou, Virchow, or H-Optimus-0).
- Note on HEST-1k Dataset: The benchmark data used in this project is sourced from the HEST-1k dataset (Mahmood Lab), which is licensed under CC BY-NC-SA 4.0. This data is strictly for non-commercial research and cannot be used for commercial training or clinical deployment without explicit permission from the original authors.
- Note on MSigDB: This project uses data from the Molecular Signatures Database (MSigDB) (versions v6.0–v7.5.1, and v2022.1+). The contents are protected by copyright © 2004–2025 Broad Institute, Inc., MIT, and Regents of the University of California, and are distributed under the CC BY 4.0 license. While this allows for commercial use, users must provide appropriate attribution. Note that individual gene sets within MSigDB may be subject to additional terms from third-party sources (e.g., KEGG).
❌ Prohibited: Redistribution and unauthorized commercial exploitation.

Intellectual Property

The core architectural innovations, including the SpatialTranscriptFormer interaction logic and spatial masking strategies, are the unique Intellectual Property of the author. For a detailed breakdown, see the IP Statement.

Installation

This project requires Conda.

Clone the repository.
Run the automated setup script:
- On Windows: .\setup.ps1
- On Linux/HPC: bash setup.sh

Exemplar Recipe: HEST-1k Benchmark

The SpatialTranscriptFormer repository includes a complete, out-of-the-box CLI pipeline as an exemplar for reproducing our benchmarks on the HEST-1k dataset.

1. Dataset Access & Preprocessing

# Download a specific subset
stf-download --organ Breast --disease Cancer --tech Visium --local_dir hest_data

2. Training with Presets

# Recommended: Run the Interaction model (Small)
python scripts/run_preset.py --preset stf_small

3. Inference & Visualization

stf-predict --data-dir A:\hest_data --sample-id MEND29 --model-path checkpoints/best_model.pth --model-type interaction

Visualization plots and spatial expression maps will be saved to the ./results directory. For the full guide, see the HEST Recipe Docs.

Documentation

Framework APIs & Usage

Python API Reference: Full documentation for Trainer, Predictor, and SpatialDataset.
Bring Your Own Data Guide: Templates and examples for training on your own non-HEST spatial transcriptomics data.
HEST Recipe Docs: Detailed documentation for the included HEST-1k dataset recipe.
Training Guide: Complete list of configuration flags and preset configurations for HEST models.

Theory & Interpretability

Models & Architecture: Deep dive into the quad-flow interaction logic and network scaling.
Pathway Mapping: Clinical interpretability, pathway bottleneck design, and MSigDB integration.
Gene Analysis: Modeling strategies for mapping morphology to high-dimensional gene spaces.
Data Structure: Detailed breakdown of the HEST data structure on disk, metadata conventions, and preprocessing invariants.
Single-cell Best Practices: Gap analysis and roadmap for alignment with industry standard recommendations.

Development

Running Tests

# Run all tests (Pytest wrapper)
.\test.ps1

Future Directions & Clinical Collaborations

A major future direction for SpatialTranscriptFormer is to integrate this architecture into an end-to-end pipeline for patient risk assessment and prognosis tracking. By leveraging the model's predicted expression and pathway activations, we aim to build a downstream risk prediction module that allows users to directly evaluate how spatially-resolved expression relates to patient survival.

Note

Call for Collaborators: Rigorous risk assessment models require vast datasets of clinical metadata and survival outcomes, which we currently lack access to. We are open to investigating any disease of interest! If you have access to large clinical cohorts and are interested in exploring how spatial pathway activation correlates with patient prognosis, we would love to partner with you.

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details on our coding standards and the process for submitting pull requests. Note that this project is under a proprietary license; contributions involve an assignment of rights for non-academic use.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
.github		.github
docs		docs
inspection		inspection
scripts		scripts
src/spatial_transcript_former		src/spatial_transcript_former
tests		tests
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
global_genes.json		global_genes.json
pyproject.toml		pyproject.toml
setup.ps1		setup.ps1
setup.sh		setup.sh
test.ps1		test.ps1
test.sh		test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpatialTranscriptFormer Framework

Python API: Quick Start

Key Technical Pillars

License

Intellectual Property

The core architectural innovations, including the SpatialTranscriptFormer interaction logic and spatial masking strategies, are the unique Intellectual Property of the author. For a detailed breakdown, see the IP Statement.

Installation

Exemplar Recipe: HEST-1k Benchmark

1. Dataset Access & Preprocessing

2. Training with Presets

3. Inference & Visualization

Documentation

Framework APIs & Usage

Theory & Interpretability

Development

Running Tests

Future Directions & Clinical Collaborations

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SpatialTranscriptFormer Framework

Python API: Quick Start

Key Technical Pillars

License

Intellectual Property

The core architectural innovations, including the SpatialTranscriptFormer interaction logic and spatial masking strategies, are the unique Intellectual Property of the author. For a detailed breakdown, see the IP Statement.

Installation

Exemplar Recipe: HEST-1k Benchmark

1. Dataset Access & Preprocessing

2. Training with Presets

3. Inference & Visualization

Documentation

Framework APIs & Usage

Theory & Interpretability

Development

Running Tests

Future Directions & Clinical Collaborations

Contributing

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages