Vision Transformer (ViT) for CIFAR-10 Classification

This repository contains code to train and evaluate a Vision Transformer (ViT) model using PyTorch on the CIFAR-10 dataset. CIFAR-10 is a popular benchmark dataset for image classification tasks.

Overview

This project demonstrates how to use a Vision Transformer (ViT) model, originally designed for natural language processing tasks, for image classification. The model is trained on the CIFAR-10 dataset, which consists of 60,000 32x32 color images in 10 classes, with 6,000 images per class.

Requirements

Python 3
PyTorch
torchvision
CUDA-enabled GPU (optional but recommended for faster training)

Installation

Clone the repository:

git clone https://github.com/your_username/your_repository.git
cd your_repository

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
__pycache__		__pycache__
data/cifar-10		data/cifar-10
README.md		README.md
Visual_transformers.ipynb		Visual_transformers.ipynb
fineTune_Visual_transformers.ipynb		fineTune_Visual_transformers.ipynb
model.py		model.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vision Transformer (ViT) for CIFAR-10 Classification

Overview

Requirements

Installation

About

Releases

Packages

Languages

Movazed/Vision-Transformer-ViT-for-CIFAR-10-Classification

Folders and files

Latest commit

History

Repository files navigation

Vision Transformer (ViT) for CIFAR-10 Classification

Overview

Requirements

Installation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages