CIFAR-10 Image Classification Tutorial

This repository demonstrates deep learning techniques for image classification using the CIFAR-10 benchmark dataset. Through a series of progressively more sophisticated models, it shows how different neural network architectures and training techniques improve classification performance.

The tools developed for this project are available via PyPI for use in your own projects.

PyPi package: image-classification-tools
Documentation: gperdrizet.github.io/CIFAR10
Models: Pre-trained models are available on HuggingFace

About the dataset

CIFAR-10 consists of 60,000 32×32 color images across 10 classes:

Training set: 50,000 images (40,000 for training, 10,000 for validation)
Test set: 10,000 images
Classes: airplane, automobile, bird, cat, deer, dog, frog, horse, ship, truck

The models

01-DNN.ipynb - Baseline fully-connected deep neural network on grayscale images
02-CNN.ipynb - Convolutional neural network to exploit spatial relationships
03-RGB-CNN.ipynb - CNN modified to process full RGB color information
04-architecture_optimization.ipynb - Optuna optimization of CNN architecture (conv blocks, filters, dropout)
05-training-optimization.ipynb - Optuna optimization of training hyperparameters (optimizer, learning rate, weight decay)
06-augmented-CNN.ipynb - Optimized architecture + training with data augmentation
07-resnet50.ipynb - Transfer learning with pre-trained ResNet50
08-results.ipynb - Model comparison and performance analysis

Results

Getting started

Prerequisites

Docker
Visual Studio Code
Dev Containers extension
Optional: NVIDIA GPU with drivers ≥545 (for GPU configuration)

Setup

Clone this repository:

git clone https://github.com/gperdrizet/CIFAR10.git

Open the repository directory in VS Code
When prompted, choose your configuration:
- DeepLearning GPU: For Linux/WSL2 with NVIDIA GPU (recommended for training)
- DeepLearning CPU: For Mac or machines without GPU (suitable for code development and testing)
Click "Reopen in Container" or use the command palette (Ctrl+Shift+P / Cmd+Shift+P) and select "Dev Containers: Reopen in Container"
Wait for the container to build and initialize (first time only). The devcontainer will:
- Set up the Python environment
- Install all dependencies (GPU or CPU version of PyTorch)
- Download the CIFAR-10 dataset

Note: The CPU configuration works on any machine but training will be significantly slower (10-50x). It's perfect for following along with the material without a GPU. See .devcontainer/README.md for more details.

Running the notebooks

Once the container environment loads, you can start running the notebooks!

Training control: Each notebook has parameters to control whether to train or load existing models:

retrain_model = True (default): Train a new model
retrain_model = False: Load an existing model (from local disk or Hugging Face Hub)
model_source = 'local' or 'huggingface': Choose where to load from

Note: Run notebooks in order - some depend on output generated by earlier notebooks.

Optional: Upload models to Hugging Face Hub

Each notebook includes optional Hugging Face Hub integration to share your trained models:

Copy .env.template to .env and add your repository details
Create a Hugging Face account at huggingface.co
Run huggingface-cli login in the terminal and enter your access token
Set upload_to_hf = True in the upload cells

Note: Students can download pre-trained models from the default repository (gperdrizet/CIFAR10) without any setup by setting retrain_model=False and model_source='huggingface'.

See HUGGINGFACE.md for detailed setup instructions.

Environment

This project uses Dev Containers with two configurations to support both GPU and CPU-only machines:

GPU Configuration (recommended for training):

Based on deeplearning-GPU template
Pre-installed PyTorch with CUDA 12.6 support
Works with any NVIDIA GPU from GTX 1050 to RTX 5090 (Pascal architecture and newer)
Requires: Linux/WSL2, NVIDIA drivers ≥545, Docker with GPU support

CPU Configuration (for Mac and non-GPU machines):

Based on Microsoft's Python 3.10 devcontainer
PyTorch CPU-only (auto-installed)
Works on any machine (Mac, Windows, Linux)
Good for code development, testing, and learning without GPU

Why use this setup?

Zero configuration: CUDA, cuDNN, and ML frameworks are pre-installed
Cross-platform: Identical environment across different machines
Reproducible: Everyone gets the exact same dependencies
Flexible: Choose GPU or CPU based on your hardware

The code automatically detects your hardware, so both configurations work seamlessly. Training on CPU will be slower but still functional.

Resource	Link
Dev Container Guide	.devcontainer/README.md
GPU Template Repository	gperdrizet/deeplearning-GPU
Docker Image	gperdrizet/deeplearning-gpu

Dependencies

Core requirements:

python ≥3.10, <3.13
torch ≥2.0
torchvision ≥0.15
numpy ≥1.24, <2.0
matplotlib ≥3.7
optuna ≥3.0
scikit-learn ≥1.3

Citation

If you find this project helpful for your learning or work, feel free to reference it:

@software{cifar10_tutorial,
  author = {Perdrizet, George},
  title = {CIFAR-10 Image Classification Tutorial},
  year = {2024},
  url = {https://github.com/gperdrizet/CIFAR10}
}

License

This project is licensed under the GPLv3 License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 169 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
data/pytorch		data/pytorch
docs		docs
notebooks		notebooks
src/image_classification_tools		src/image_classification_tools
.env.template		.env.template
.gitignore		.gitignore
HUGGINGFACE.md		HUGGINGFACE.md
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CIFAR-10 Image Classification Tutorial

About the dataset

The models

Results

Getting started

Prerequisites

Setup

Running the notebooks

Optional: Upload models to Hugging Face Hub

Environment

Dependencies

Citation

License

About

Uh oh!

Releases 14

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CIFAR-10 Image Classification Tutorial

About the dataset

The models

Results

Getting started

Prerequisites

Setup

Running the notebooks

Optional: Upload models to Hugging Face Hub

Environment

Dependencies

Citation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 14

Contributors

Uh oh!

Languages