This project compares videos using Vision Transformer (ViT) models. It extracts embeddings from sampled video frames, aggregates them into a per-video embedding, and computes the cosine similarity between video embeddings to measure how visually similar two videos are.
- Extract frame embeddings from videos using Vision Transformer models
- Compare videos based on visual similarity
- Process multiple videos in batch
- Generate similarity reports in JSON format
- Python 3.6+
- PyTorch
- OpenCV
- NumPy
- timm (PyTorch Image Models)
- Clone this repository:

```bash
git clone https://github.com/yourusername/ViT-model.git
cd ViT-model
```

- Install the required dependencies:

```bash
pip install torch torchvision opencv-python numpy timm
```
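Optionally, you can confirm that the dependencies import correctly. This check is only a suggestion and not part of the project's own setup steps:

```python
# Quick sanity check that the required packages are installed and importable.
import torch, cv2, numpy, timm
print("torch", torch.__version__, "| opencv", cv2.__version__,
      "| numpy", numpy.__version__, "| timm", timm.__version__)
```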
- Place videos to be monitored in the `monitored` folder
- Place videos to be compared against in the `watched` folder
- Run the main script:

```bash
python test/main.py
```

- Results will be saved to `similarity_results.json`
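The exact schema of the results file is not documented here; as a hedged illustration, you can inspect it from Python after a run completes:

```python
# Load and pretty-print the similarity results written by test/main.py.
# The structure of the JSON (e.g. how video pairs and scores are keyed) is
# not specified in this README, so inspect the output to see what it contains.
import json

with open("similarity_results.json") as f:
    results = json.load(f)

print(json.dumps(results, indent=2))
```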
- The tool samples frames from each video at a specified rate (default: 1 frame per second)
- Each frame is preprocessed and normalized
- A Vision Transformer model extracts embeddings from the frames
- Frame embeddings are aggregated to create a video-level embedding
- Cosine similarity is computed between video embeddings
- Results are sorted by similarity and saved to a JSON file (a sketch of the full pipeline follows below)
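The snippet below is a minimal sketch of this pipeline, not the actual code in `test/main.py`. The model name (`vit_base_patch16_224`), the mean-pooling aggregation, and the helper names are assumptions for illustration:

```python
# Sketch: sample frames, embed them with a timm ViT, mean-pool into a
# video-level embedding, and compare two videos with cosine similarity.
import cv2
import torch
import timm
from timm.data import resolve_data_config, create_transform
from PIL import Image

# Load a pretrained ViT as a feature extractor (num_classes=0 drops the head)
# along with the preprocessing transform that matches its training setup.
model = timm.create_model("vit_base_patch16_224", pretrained=True, num_classes=0)
model.eval()
transform = create_transform(**resolve_data_config({}, model=model))

def video_embedding(path, frames_per_second=1.0):
    """Sample frames at roughly `frames_per_second`, embed each frame, and mean-pool."""
    cap = cv2.VideoCapture(path)
    fps = cap.get(cv2.CAP_PROP_FPS) or 30.0
    step = max(int(round(fps / frames_per_second)), 1)
    embeddings, idx = [], 0
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        if idx % step == 0:
            # OpenCV returns BGR frames; convert to RGB before the ViT transform.
            rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
            tensor = transform(Image.fromarray(rgb)).unsqueeze(0)
            with torch.no_grad():
                embeddings.append(model(tensor).squeeze(0))
        idx += 1
    cap.release()
    return torch.stack(embeddings).mean(dim=0)

def cosine_similarity(a, b):
    return torch.nn.functional.cosine_similarity(a, b, dim=0).item()
```

With two such embeddings, `cosine_similarity(video_embedding("monitored/a.mp4"), video_embedding("watched/b.mp4"))` returns a score in [-1, 1], where values close to 1 indicate visually similar videos.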
- `test/main.py`: Main script for video comparison
- `monitored/`: Directory for videos to be monitored
- `watched/`: Directory for videos to be compared against
- `similarity_results.json`: Output file with similarity results
This project uses the timm library for Vision Transformer models.