Author: Daniel Puente Viejo
A hands-on guide to fine-tuning TinyLlama-1.1B and Llama-3.1-8B-Instruct with low-rank adapters (LoRA). The repository bundles Colab, macOS, and Windows/Linux workflows with curated datasets and fully reproducible experiments, so you can adapt compact open models to your own domain with minimal compute.
- Workflows covering google_colab/fine-tuning.ipynb, mac/fine-tuning.ipynb, and windows-linux/fine-tuning.ipynb
- Ready-to-use basketball dataset in data plus raw corpus in data/data.txt
- Three training setups: Unsloth on Colab, Hugging Face's SFTTrainer on macOS (Apple Silicon), and PEFT with CUDA on Windows/Linux
- Logging-ready setup (Accelerate, bitsandbytes, safetensors) for efficient experimentation on consumer GPUs or Colab T4s
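LoRA keeps the base weights frozen and trains only two small matrices per targeted layer, which is why these runs fit on consumer GPUs or Colab T4s. A minimal sketch of the parameter arithmetic (the layer shape below is illustrative, not taken from either model's config):

```python
def lora_params(d_out: int, d_in: int, r: int) -> int:
    """Trainable parameters LoRA adds to one (d_out x d_in) weight:
    a down-projection A (r x d_in) plus an up-projection B (d_out x r)."""
    return r * (d_in + d_out)

# Hypothetical attention projection of a small transformer.
d = 2048                       # hidden size (illustrative)
full = d * d                   # params if the weight were fine-tuned directly
lora = lora_params(d, d, r=8)  # params with rank-8 adapters

print(f"full: {full:,}  lora: {lora:,}  ratio: {lora / full:.2%}")
# → full: 4,194,304  lora: 32,768  ratio: 0.78%
```

At rank 8 the adapters touch well under 1% of the layer's parameters, which is why adapter checkpoints are small enough to version alongside the notebooks.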
├── data/ # Supervised fine-tuning corpora (JSON + raw text)
├── google_colab/
│ ├── fine-tuning.ipynb # Colab workflow (setup + training + evaluation)
│ └── imgs/ # Notebook figures and configuration screenshots
├── mac/
│ ├── fine-tuning.ipynb # Local M-series workflow with Accelerate + LoRA
│ └── requirements.txt # Exact Python dependencies for local runs
├── windows-linux/
│ ├── fine-tuning.ipynb # Local Windows/Linux workflow with PEFT + CUDA
│ └── requirements.txt # Exact Python dependencies for local runs
└── README.md # You are here
- Open google_colab/fine-tuning.ipynb and connect to a GPU runtime (T4 or better)
- Run the setup cells to install Transformers, TRL, PEFT, bitsandbytes and Unsloth.
- Load datasets directly from data
- Create a Python 3.10+ environment (uv, conda, or venv) and activate it
- Install dependencies from mac/requirements.txt
```bash
python -m venv .venv
source .venv/bin/activate
pip install -r mac/requirements.txt
```
- Launch mac/fine-tuning.ipynb and run the cells sequentially
- Create and activate a Python 3.10+ virtual environment (Command Prompt shown below)
```bash
python -m venv .venv
.venv\Scripts\activate
pip install -r windows-linux/requirements.txt
```
- Ensure CUDA drivers are installed and a compatible GPU is available
- Open windows-linux/fine-tuning.ipynb in VS Code or Jupyter
- google_colab/fine-tuning.ipynb: Designed for quick iteration on Colab with Unsloth, showcasing the full fine-tuning loop and evaluation on the basketball dataset
- mac/fine-tuning.ipynb: Optimized for Apple Silicon with 4-bit loading, Accelerate configuration, and local inference tests. CPU-only execution is possible if you don't have an M-series chip, but it will be noticeably slower.
- windows-linux/fine-tuning.ipynb: CUDA-ready notebook using PEFT + Accelerate for GPUs on Windows, Linux, or WSL. CPU fallback is available but not recommended for large models.
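The local notebooks rely on 4-bit loading to fit the base weights in memory. A back-of-the-envelope estimate of why that matters (parameter counts rounded; real usage also includes activations, optimizer state, and the KV cache):

```python
def weight_gib(n_params: float, bits: int) -> float:
    """Approximate size of the model weights alone at a given precision."""
    return n_params * bits / 8 / 1024**3

# Rounded parameter counts for the two models used in this repo.
for name, n in [("TinyLlama-1.1B", 1.1e9), ("Llama-3.1-8B", 8.0e9)]:
    print(f"{name}: fp16 ~ {weight_gib(n, 16):.1f} GiB, "
          f"4-bit ~ {weight_gib(n, 4):.1f} GiB")
```

At 4 bits the 8B model's weights drop from roughly 15 GiB to under 4 GiB, which is the difference between fitting on a Colab T4 (16 GB) and not.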
You can train the model with two different types of data:
- Raw context text for language-model warm-up resides in data/data.txt. Provide one or more paragraphs separated by blank lines.
- Supervised pairs live in data/atomic_train.json. Each entry follows the schema below:
```json
{
  "question": "What type of sport is basketball?",
  "answer": "Basketball is a team sport played on a rectangular court."
}
```
- Review the Colab notebook to understand the end-to-end flow
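Once atomic_train.json follows that schema, turning each pair into a single training string is straightforward. A minimal stdlib-only sketch (the prompt template and in-memory sample here are illustrative; the notebooks apply their own chat templates):

```python
import json

# Illustrative sample in the same schema as data/atomic_train.json.
raw = json.loads("""[
  {"question": "What type of sport is basketball?",
   "answer": "Basketball is a team sport played on a rectangular court."}
]""")

def to_prompt(pair: dict) -> str:
    """Format one question/answer pair as a supervised training example."""
    return f"### Question:\n{pair['question']}\n\n### Answer:\n{pair['answer']}"

examples = [to_prompt(p) for p in raw]
print(examples[0])
```

Swapping the template for the target model's chat format (e.g. via the tokenizer's `apply_chat_template`) is the only change needed to feed the same JSON into any of the three workflows.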
- Inspect the dataset examples in data.
- Enjoy the notebooks!
This project is released under the MIT License. See LICENSE for details.
