MNIST Vision Transformer in JAX

This project is a Vision Transformer (ViT), written purely in JAX, that classifies handwritten digits from the MNIST dataset.

Reasoning

The project was built in JAX to expose all the math that goes on behind the scenes when training a model like a ViT. All computation uses only JAX functions, and attention is implemented manually rather than taken from a higher-level library.
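To give a sense of what "manually implemented" attention can look like, here is a minimal sketch of single-head scaled dot-product attention using only jax.numpy. It is an illustration, not the code from this repository: the function name, the shapes (16 tokens, 8 dimensions per head), and the random inputs are all assumptions.

```python
import jax
import jax.numpy as jnp


def scaled_dot_product_attention(q, k, v):
    """Single-head attention (illustrative sketch, not the repo's code).

    q, k, v: arrays of shape (seq_len, d_head).
    Returns the attended values (seq_len, d_head) and the
    attention weights (seq_len, seq_len).
    """
    d_head = q.shape[-1]
    # Similarity of every query with every key, scaled by sqrt(d_head).
    scores = q @ k.T / jnp.sqrt(d_head)
    # Softmax written out by hand; subtracting the row max keeps exp() stable.
    scores = scores - jnp.max(scores, axis=-1, keepdims=True)
    weights = jnp.exp(scores)
    weights = weights / jnp.sum(weights, axis=-1, keepdims=True)
    # Each output row is a weighted average of the value rows.
    return weights @ v, weights


# Hypothetical inputs: 16 tokens with 8 features each.
key = jax.random.PRNGKey(0)
kq, kk, kv = jax.random.split(key, 3)
q = jax.random.normal(kq, (16, 8))
k = jax.random.normal(kk, (16, 8))
v = jax.random.normal(kv, (16, 8))

out, weights = scaled_dot_product_attention(q, k, v)
print(out.shape, weights.shape)
```

Each row of the weights matrix sums to 1, so every output token is a convex combination of the value vectors.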

Features

Lightweight ViT architecture
Built purely with JAX
Many usable augmentations
Extendable to other image-based datasets
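A ViT treats an image as a sequence of patches. The sketch below shows one way to split a batch of 28x28 MNIST-sized images into flattened patches with plain jax.numpy reshapes. The function name patchify and the patch size of 7 are assumptions chosen for illustration; the repository may use a different patch size or layout.

```python
import jax.numpy as jnp


def patchify(images, patch_size):
    """Split images into non-overlapping flattened patches (illustrative sketch).

    images: (batch, height, width), with height and width divisible by patch_size.
    Returns: (batch, num_patches, patch_size * patch_size).
    """
    b, h, w = images.shape
    gh, gw = h // patch_size, w // patch_size  # patches per column / per row
    # Break each spatial axis into (grid index, offset inside the patch).
    x = images.reshape(b, gh, patch_size, gw, patch_size)
    # Bring the two grid axes together, then the two in-patch axes.
    x = x.transpose(0, 1, 3, 2, 4)  # (b, gh, gw, patch_size, patch_size)
    return x.reshape(b, gh * gw, patch_size * patch_size)


# Hypothetical batch of two 28x28 images filled with increasing values.
images = jnp.arange(2 * 28 * 28, dtype=jnp.float32).reshape(2, 28, 28)
patches = patchify(images, patch_size=7)
print(patches.shape)
```

Because the function only depends on the image height, width, and patch size, the same idea carries over to other single-channel image datasets whose dimensions are divisible by the patch size.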

Requirements

Python 3.9 to 3.12
A CUDA-enabled jaxlib installation if running on an NVIDIA GPU

Installation

# Clone the repo
git clone https://github.com/Advaith-Hello
cd TransformerTest1

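# Set up a virtual environment and install dependencies
python3.12 -m venv .venv          # on Windows: py -3.12 -m venv .venv
source .venv/bin/activate         # on Windows: .venv\Scripts\activate
python -m pip install --upgrade pip
python -m pip install -r requirements.txt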
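
After installing, it can help to confirm that JAX imports cleanly and to see which backend it picked up. The short check below is a suggestion, not part of the repository; it should print "cpu" as the backend on a CPU-only install, and a GPU backend if a CUDA-enabled jaxlib was installed correctly.

```python
import jax
import jax.numpy as jnp

# Report the installed version and the devices JAX can see.
print("JAX version:", jax.__version__)
print("Default backend:", jax.default_backend())
print("Devices:", jax.devices())

# Run a tiny computation to confirm the backend actually works.
x = jnp.ones((2, 2))
total = float((x @ x).sum())
print("Sanity check (expected 8.0):", total)
```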

