Skip to content

ieellee/uLLSAM

Repository files navigation

uLLSAM

Official Implement of uLLSAM, our paper is available at https://arxiv.org/abs/2505.10769.

News 🚀🚀🚀

2025/04/21: First version of uLLSAM has been released, we also make model weights available in README.md. Currently we support InternLM-1.8B.

2025/12/15: We now support Qwen3-2B as LLM.

1. Installation guaidance

# clone the repository to your disk
cd ./uLLSAM
conda env create -f environment.yml
conda activate ullsam

If your encounter some unexpected errors, you can also refer to InvernVL and SAM to install your own environment.

2. Download checkpoints

Please follow README.md in checkpoints folder.

3. Launch app server

python app.py

You can visit the application at localhost:9996 in your browser, chrome is recommended。 demo

4. Train and Finetune uLLSAM

If you want to reproduce uLLSAM, just use the ./data/train_seg_all.jsonl to train the model, you need to prepare 9 datasets.

You can refer to torch_em to prepare and download datasets.

bash ./scripts/train_all_joint_v2.sh

If you want to finetune your custom data, follow the data structure in ./data/train_seg_all.jsonl

Specifically, each line in jsonl is structured as {"image_path": "...", "conversation": [{"role": "user", "content": "Describe the image in detail\n"}, {"role": "assistant", "content": ""}]}

5. Acknowledgement

Our uLLSAM is heavily inspired by many outstanding prior works, including

Thank the authors of above projects for open-sourcing their assets!

About

Official Implement of the paper "Unifying Segment Anything in Microscopy with Multimodal Large Language Model"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors