Milk Quality Classification with Semi-Supervised Learning

📌 Project Overview

This project, developed by Nowa Analytics, focuses on building semi-supervised classification models to assess milk quality. We were contracted by a dairy industry to help guarantee the quality of milk used in their products.

Using machine learning, we classify milk samples into three categories:

Low Quality
Medium Quality
High Quality

Since the dataset contains both labeled and unlabeled samples, we applied semi-supervised learning techniques such as:

✅ Self-Training with labeled + unlabeled data
✅ Label Propagation (transductive learning)
✅ Supervised baselines for performance comparison

📊 Dataset

The dataset was provided in a CSV file named qualidade_leite.csv, containing 1,059 entries with the following features:

Column	Description
pH	pH level of the milk (continuous)
Temperature	Temperature of the sample (°C)
Taste	Taste quality score
Odor	Odor quality score
Fat	Fat content
Turbidity	Milk turbidity
Color	Visual color quality
Quality	Target variable (Low, Medium, High) – partially labeled (424 samples)

📌 Note: Only a portion of the dataset has labels, which makes it ideal for semi-supervised approaches.

⚙️ Project Objectives

Train classification models using labeled data.
Understand and apply semi-supervised learning concepts.
Generate pseudo-labels for unlabeled samples.
Apply Self-Training strategy with labeled + unlabeled data.
Explore transductive learning using Label Propagation.
Compare the results against fully supervised models.

🛠️ Technologies Used

Python 3.11+
Pandas & NumPy – Data processing
Scikit-learn – Classification, Self-Training, Label Propagation
Matplotlib & Seaborn – Data visualization
Jupyter Notebook – Experiment tracking

📈 Expected Results

Improved classification performance using semi-supervised approaches.
Demonstration of how unlabeled data can boost model accuracy.
Comparison between Self-Training, Label Propagation, and supervised baselines.

👨‍💻 Authors

Project developed by Nowa Analytics 🚀 Data Science Consulting | Machine Learning Solutions

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data		data
model		model
Final_Project.ipynb		Final_Project.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Milk Quality Classification with Semi-Supervised Learning

📌 Project Overview

📊 Dataset

⚙️ Project Objectives

🛠️ Technologies Used

📈 Expected Results

👨‍💻 Authors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Milk Quality Classification with Semi-Supervised Learning

📌 Project Overview

📊 Dataset

⚙️ Project Objectives

🛠️ Technologies Used

📈 Expected Results

👨‍💻 Authors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages