Skip to content

A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad

License

Notifications You must be signed in to change notification settings

EthanBnntt/tinygrad-vit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A Simple Vision Transformer (ViT) Implementation in Tinygrad

This repository contains a minimalist implementation of the Vision Transformer (ViT) model using tinygrad.

What is a ViT?

The Vision Transformer (ViT) is a model introduced by Google Research that applies transformer architecture to image classification tasks. Unlike traditional convolutional neural networks (CNNs), ViT divides an image into patches and processes them as sequences, similar to words in natural language processing.

Installation

To get started, clone this repository and install the required dependencies:

git clone https://github.com/EthanBnntt/tinygrad-vit.git
cd tinygrad-vit
pip install -r requirements.txt

Usage

python train.py

About

A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages