Classifies tactile images (from YCB-Sight) into 4 material categories using UniTouch encoder:
- plastic
- paper-box
- wood
- metal
docker pull timttu/unitouch:v3The checkpoint is already wrapped in the workspace.
To get into the container and explore files:
docker run --gpus all -it \
-v <your-path-where-repo-is>:/pers_vol/YCB-material \
timttu/unitouch:v3 /bin/bashOnce inside, you can check files, verify paths, or run commands manually.
First, navigate to the repository directory and run the preprocessing script:
cd /pers_vol/YCB-material/unitouch_ycbsight
bash preprocessing.shThis will download and extract the YCB-Sight datasets into the data/ folder.
python3 inference_classifier.py \
--config configs/inference.yaml \
--checkpoint workspace/best_model.pth \
--input /path/to/your/tactile/images/ \
--output results.json \
--batch-size 64pip install torch torchvision tqdm pyyaml tensorboard pillow easydictYour tactile images should be organized like this:
data/
├── plastic/
│ ├── image_001.jpg
│ └── ...
├── paper-box/
├── wood/
└── metal/
python train_classifier.py --config configs/simple_classifier.yaml --gpu 0@inproceedings{yang2024binding,
title={Binding touch to everything: Learning unified multimodal tactile representations},
author={Yang, Fengyu and Feng, Chao and Chen, Ziyang and others},
booktitle={CVPR},
year={2024}
}