simple Vision Transformer implementation with PyTorch

ItaiPemp/Simple_ViT

Vision Transformer Implementation

Overview

This repository contains a simple PyTorch implementation of the Vision Transformer (ViT) model described in the paper "An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale" by Alexey Dosovitskiy et al. ViT approaches image classification by splitting an image into fixed-size patches, treating those patches as a sequence of tokens, and learning representations with a standard Transformer encoder.
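
To make the "image as a sequence of patches" idea concrete, here is a minimal, framework-free sketch of the patch-splitting step. It is an illustration only and not this repository's code: the function name `image_to_patches` and the nested-list image layout (height x width x channels) are assumptions chosen for readability.

```python
def image_to_patches(image, patch_size):
    """Split an H x W x C image (nested lists) into flattened, non-overlapping patches.

    Hypothetical helper for illustration; not taken from this repository.
    """
    height = len(image)
    width = len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image height and width must be divisible by patch_size")

    patches = []
    # Walk the patch grid row by row, left to right (raster order).
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            patch = []
            for row in range(top, top + patch_size):
                for col in range(left, left + patch_size):
                    # Append all channel values of one pixel.
                    patch.extend(image[row][col])
            patches.append(patch)
    return patches


# Toy 4x4 RGB image whose pixel values encode their own (row, col) position.
image = [[[row, col, 0] for col in range(4)] for row in range(4)]
patches = image_to_patches(image, patch_size=2)

# Number of patches is (H / P) * (W / P); each patch holds P * P * C values.
print(len(patches), len(patches[0]))
```

With the paper's 16x16 patches, a 224x224 image yields 14 x 14 = 196 patches. Each flattened patch is then linearly projected to the model dimension and combined with position embeddings before entering the Transformer encoder. In PyTorch the split and projection are commonly fused into a single `nn.Conv2d` whose kernel size and stride both equal the patch size, though whether this repository does so is not stated here.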

Citation
