Unified Supervision for Vision-Language modeling in 3D computed tomography

Official Code Release for ICCV 2025 3DVLM Workshop Paper

Title: Unified Supervision for Vision-Language modeling in 3D computed tomography
Conference: ICCV 2025, Vision-Language Modeling in 3D Medical Imaging (VLM3D) Workshop

Overview

Uniferum is a volumetric vision-language model designed for radiology. Uniferum integrates classification labels and segmentation masks into a single unified training framework.

Harmonizes classification and segmentation across multiple CT datasets.
Improves State-of-the-Art Results on the CT-RATE benchmark by +7% compared to CLIP-based models.
Robust out-of-distribution performance
zero-shot capabilities on RAD-CHEST and INSPECT datasets.

Citation

If you find this code useful for your research, please consider citing our work:

@inproceedings{iccv2025uniferum,
  title={Unified Supervision for Vision-Language modeling in 3D computed tomography},
  author={Hao-Chih Lee, Zelong Liu, Hamza Ahmed, Spencer Kim, Sean Huver,
Vishwesh Nath, Zahi A. Fayad, Timothy Deyer, Xueyan Mei},
  booktitle={ICCV VLM3D Workshop},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
assets		assets
bin		bin
configs		configs
data_utils		data_utils
img_utils		img_utils
models		models
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Unified Supervision for Vision-Language modeling in 3D computed tomography

Overview

Citation

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Unified Supervision for Vision-Language modeling in 3D computed tomography

Overview

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages