An implementation of ViT (Vision Transformer) in multiple languages and frameworks.
- PyTorch
- C++ (CPU)
- C++ CUDA
- C++ SYCL (Intel GPUs)
- Mojo
Note
PyTorch is the only implementation I have finished. The other implementations are works in progress.
Model | Pre-training data | Top-1 IN-1k | Link |
---|---|---|---|
ViT-S/32 | IN-1k | 67.7%* | gdrive |
ViT-S/32 | IN-21k | DOING | TODO |
* Yes, this model gets outperformed by a ResNet-50 on IN-1k. However, it uses IN-1k as its pre-training dataset, hence the low Top-1 for a ViT. ViTs require a very large pre-training dataset to reach good performance, because unlike CNNs they have no built-in inductive bias for images.
I followed the original ViT's hyper-parameters. This includes:
- LR:
  - Linear warmup for 30 epochs, i.e. the LR increases linearly from eps to the base LR
  - Cosine decay to 0 for the remaining epochs. Note that the decay covers only half a cosine period, i.e. the LR never climbs back up to its original value. A sketch of this schedule follows the list.
- Hyper-parameters for the model architecture
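A minimal PyTorch sketch of the warmup + cosine schedule described above (the base LR, eps, and epoch counts below are illustrative assumptions, not the exact values from my runs):

```python
import math
import torch

# Illustrative values only; the real base LR and epoch counts depend on the config.
base_lr = 3e-3
eps = 1e-8          # LR at the start of warmup
warmup_epochs = 30
total_epochs = 300

model = torch.nn.Linear(8, 8)   # stand-in for the ViT
optimizer = torch.optim.AdamW(model.parameters(), lr=base_lr)

def lr_lambda(epoch: int) -> float:
    """Linear warmup from ~eps to base_lr, then half-period cosine decay to 0."""
    if epoch < warmup_epochs:
        start = eps / base_lr
        return start + (1.0 - start) * epoch / warmup_epochs
    progress = (epoch - warmup_epochs) / max(1, total_epochs - warmup_epochs)
    return 0.5 * (1.0 + math.cos(math.pi * progress))  # half a cosine period, never rises again

scheduler = torch.optim.lr_scheduler.LambdaLR(optimizer, lr_lambda)

for epoch in range(total_epochs):
    # ... one training epoch ...
    scheduler.step()
```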
Training a ViT from scratch has its challenges due to compute requirements and hyper-parameter tweaking.
- When training from scratch:
  - Warming up the LR is very important; training a ViT from scratch is not like training a CNN.
  - Use a huge pre-training dataset, e.g. IN-21k.
  - Evaluate with multiple crops ("Inception-style") and average the output logits; I observed a ~10-15% boost in top-1 performance when doing this. A sketch is shown below.
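A minimal sketch of multi-crop evaluation with averaged logits, using torchvision's TenCrop (the resize/crop sizes and the ten-crop scheme are assumptions for illustration; the exact crop strategy behind the numbers above may differ):

```python
import torch
from torchvision import transforms

# Assumed crop scheme: resize, then 4 corner + centre crops and their horizontal flips.
multi_crop = transforms.Compose([
    transforms.Resize(256),
    transforms.TenCrop(224),
    transforms.Lambda(lambda crops: torch.stack(
        [transforms.ToTensor()(c) for c in crops])),   # (n_crops, C, H, W)
])

@torch.no_grad()
def predict_multicrop(model: torch.nn.Module, pil_image) -> torch.Tensor:
    """Run the model on every crop of one image and average the logits before argmax."""
    crops = multi_crop(pil_image)   # (n_crops, C, H, W)
    logits = model(crops)           # (n_crops, num_classes)
    return logits.mean(dim=0)       # averaged logits for this image
```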