Skip to content

johnkimdy/Multicam-MASt3R-SLAM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Cognitive-Unburdening Surveillance: Real-Time 3D Reconstruction for Distributed Spatial Awareness

John DongYoon Kim* · Rocky Kim* · Jinwoo Park · Jihoon Park · Beomgeun Seo

(* Equal Contribution)

teaser3x


Getting Started

Installation

conda create -n multicam-mast3r-slam python=3.11
conda activate multicam-mast3r-slam

Check the system's CUDA version with nvcc

nvcc --version

Install pytorch with matching CUDA version following:

# CUDA 11.8
conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1  pytorch-cuda=11.8 -c pytorch -c nvidia
# CUDA 12.1
conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 pytorch-cuda=12.1 -c pytorch -c nvidia
# CUDA 12.4
conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 pytorch-cuda=12.4 -c pytorch -c nvidia

Clone the repo and install the dependencies.

git clone https://github.com/rmurai0610/MASt3R-SLAM.git --recursive
cd MASt3R-SLAM/

# if you've clone the repo without --recursive run
# git submodule update --init --recursive

pip install -e thirdparty/mast3r
pip install -e thirdparty/in3d
pip install --no-build-isolation -e .
 

Setup the checkpoints for MASt3R and retrieval. The license for the checkpoints and more information on the datasets used is written here.

mkdir -p checkpoints/
wget https://download.europe.naverlabs.com/ComputerVision/MASt3R/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric.pth -P checkpoints/
wget https://download.europe.naverlabs.com/ComputerVision/MASt3R/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric_retrieval_trainingfree.pth -P checkpoints/
wget https://download.europe.naverlabs.com/ComputerVision/MASt3R/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric_retrieval_codebook.pkl -P checkpoints/

Examples

Live Demo

Connect a realsense camera to the PC and run

python main.py --dataset realsense --config config/base.yaml

Running on a multicam (iPhone, Android, Drones, etc..)

You would need to edit config/multicam.yaml. For any additional camera input, add the following under dataset:

- id: "iphone"
  type: "iphone"
  path: "your/ip/path"
  camera_id: n

For iPhone camera input, download Droid Cam from the App Store then input the IP address displayed from that app. For Android camera input, download IP Webcam then input the IP address displayed from that app. For a DJI drone camera input,

python main.py --config config/multicam.yaml

If the calibration parameters are known, you can specify them in intrinsics.yaml

python main.py --config config/multicam.yaml --calib config/intrinsics.yaml

Downloading a Munji Dataset

bash ./scripts/download_munji.sh

Running on Munji Multi-cam benchmark dataset

chmod +x scripts/download_munji.sh && ./scripts/download_munji.sh
python main.py --config config/multicam_munji_tables.yaml

Reproducibility

There might be minor differences between the released version and the results in the paper after developing this multi-processing version. We run all our experiments on an RTX 3090, and the performance may differ when running with a different GPU.

Acknowledgement

We sincerely thank the developers and contributors of the many open-source projects that our code is built upon.

Citation

If you found this code/work to be useful in your own research, please considering citing the following:

@inproceedings{kim2025cognitive,
    title={Cognitive-Unburdening Surveillance: Real-Time {3D} Reconstruction for Distributed Spatial Awareness},
    author={Kim, Dong Yoon and Kim, Rocky and Park, Jinwoo and Park, Jihoon and Seo, Beomgeun},
    booktitle={The 38th Annual ACM Symposium on User Interface Software and Technology Adjunct Proceedings},
    series={UIST Adjunct '25},
    year={2025},
    month={September},
    location={Busan, Republic of Korea},
    publisher={ACM},
    address={New York, NY, USA},
    doi={10.1145/3746058.3758351},
    isbn={979-8-4007-2036-9/2025/09}
}

About

[ACM UIST 2025] Cognitive-Unburdening Surveillance: Real-Time 3D Reconstruction for Distributed Spatial Awareness

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors