Cognitive-Unburdening Surveillance: Real-Time 3D Reconstruction for Distributed Spatial Awareness

John DongYoon Kim* · Rocky Kim* · Jinwoo Park · Jihoon Park · Beomgeun Seo

(* Equal Contribution)

Paper | Video | Project Page

Getting Started

Installation

conda create -n multicam-mast3r-slam python=3.11
conda activate multicam-mast3r-slam

Check the system's CUDA version with nvcc

nvcc --version

Install PyTorch with the CUDA version that matches the nvcc output:

# CUDA 11.8
conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 pytorch-cuda=11.8 -c pytorch -c nvidia
# CUDA 12.1
conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 pytorch-cuda=12.1 -c pytorch -c nvidia
# CUDA 12.4
conda install pytorch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 pytorch-cuda=12.4 -c pytorch -c nvidia

Clone the repo and install the dependencies.

git clone https://github.com/rmurai0610/MASt3R-SLAM.git --recursive
cd MASt3R-SLAM/

# if you've cloned the repo without --recursive, run
# git submodule update --init --recursive

pip install -e thirdparty/mast3r
pip install -e thirdparty/in3d
pip install --no-build-isolation -e .

Set up the checkpoints for MASt3R and retrieval. The license for the checkpoints and more information on the datasets used are given here.

mkdir -p checkpoints/
wget https://download.europe.naverlabs.com/ComputerVision/MASt3R/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric.pth -P checkpoints/
wget https://download.europe.naverlabs.com/ComputerVision/MASt3R/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric_retrieval_trainingfree.pth -P checkpoints/
wget https://download.europe.naverlabs.com/ComputerVision/MASt3R/MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric_retrieval_codebook.pkl -P checkpoints/
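
A quick way to confirm that the three downloads landed in place is sketched below. The file names are taken from the wget URLs above; the helper `missing_checkpoints` is a hypothetical name and is not part of the repository.

```python
from pathlib import Path

# File names taken from the wget commands above.
CHECKPOINT_NAMES = (
    "MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric.pth",
    "MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric_retrieval_trainingfree.pth",
    "MASt3R_ViTLarge_BaseDecoder_512_catmlpdpt_metric_retrieval_codebook.pkl",
)


def missing_checkpoints(directory: str = "checkpoints") -> list[str]:
    """Return the expected checkpoint file names that are absent from `directory`."""
    root = Path(directory)
    return [name for name in CHECKPOINT_NAMES if not (root / name).is_file()]


if __name__ == "__main__":
    missing = missing_checkpoints()
    if missing:
        print("Missing checkpoints:")
        for name in missing:
            print(f"  {name}")
    else:
        print("All checkpoints found.")
```

The check only tests for the presence of each file; it does not verify that a download completed or that the contents are valid.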

Examples

Live Demo

Connect a RealSense camera to the PC and run:

python main.py --dataset realsense --config config/base.yaml

Running on a multicam (iPhone, Android, drones, etc.)

Edit config/multicam.yaml. For each additional camera input, add an entry like the following under dataset:

- id: "iphone"
  type: "iphone"
  path: "your/ip/path"
  camera_id: n
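
Before launching, camera entries can be sanity-checked with a short script. The sketch below assumes each entry carries the four keys shown above and that `id` values should be unique; neither requirement is confirmed by the repository, and `check_camera_entries` is a hypothetical helper. Entries are plain dicts here, such as a YAML parser would produce.

```python
# Keys shown in the example entry above (assumed to be required).
REQUIRED_KEYS = ("id", "type", "path", "camera_id")


def check_camera_entries(entries: list[dict]) -> list[str]:
    """Return human-readable problems found in the entries; an empty list means none."""
    problems = []
    seen_ids = set()
    for index, entry in enumerate(entries):
        # Report any key from the example entry that is absent.
        missing = [key for key in REQUIRED_KEYS if key not in entry]
        if missing:
            problems.append(f"entry {index}: missing {', '.join(missing)}")
        # Assumption: two cameras should not share the same id.
        cam_id = entry.get("id")
        if cam_id is not None:
            if cam_id in seen_ids:
                problems.append(f"entry {index}: duplicate id {cam_id!r}")
            seen_ids.add(cam_id)
    return problems


if __name__ == "__main__":
    example = [
        {"id": "iphone", "type": "iphone", "path": "your/ip/path", "camera_id": 0},
        {"id": "iphone", "type": "android"},
    ]
    for problem in check_camera_entries(example):
        print(problem)
```

To check a real config, load the file with a YAML parser such as PyYAML's `yaml.safe_load` and pass the list under dataset to the function.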

For iPhone camera input, download Droid Cam from the App Store, then enter the IP address displayed in that app. For Android camera input, download IP Webcam, then enter the IP address displayed in that app. For a DJI drone camera input,

python main.py --config config/multicam.yaml

If the calibration parameters are known, you can specify them in config/intrinsics.yaml:

python main.py --config config/multicam.yaml --calib config/intrinsics.yaml

Downloading the Munji Dataset

bash ./scripts/download_munji.sh

Running on Munji Multi-cam benchmark dataset

chmod +x scripts/download_munji.sh && ./scripts/download_munji.sh
python main.py --config config/multicam_munji_tables.yaml

Reproducibility

The released multi-processing version may produce results that differ slightly from those in the paper. We ran all our experiments on an RTX 3090, and performance may differ on other GPUs.

Acknowledgement

We sincerely thank the developers and contributors of the many open-source projects that our code is built upon.

Citation

If you find this code or work useful in your own research, please consider citing the following:

@inproceedings{kim2025cognitive,
    title={Cognitive-Unburdening Surveillance: Real-Time {3D} Reconstruction for Distributed Spatial Awareness},
    author={Kim, Dong Yoon and Kim, Rocky and Park, Jinwoo and Park, Jihoon and Seo, Beomgeun},
    booktitle={The 38th Annual ACM Symposium on User Interface Software and Technology Adjunct Proceedings},
    series={UIST Adjunct '25},
    year={2025},
    month={September},
    location={Busan, Republic of Korea},
    publisher={ACM},
    address={New York, NY, USA},
    doi={10.1145/3746058.3758351},
    isbn={979-8-4007-2036-9}
}
