📩 SMS Spam Classifier

A simple spam detection project using PyTorch, Scikit-learn, and the Hugging Face sms_spam dataset. The model is trained to classify SMS messages as spam or not spam using a logistic regression-based neural network (CircleModelV0).

📁 Project Structure

Sms-Spam/
├── main.ipynb          # Training notebook
├── inference.py        # Inference script
├── full_model.pth      # Trained PyTorch model (full)
├── vectorizer.pkl      # CountVectorizer object (for text preprocessing)
└── README.md

🧠 Model Architecture (`CircleModelV0`)

A simple feed-forward neural network:

Input Layer: 8713 features (from CountVectorizer)
Hidden Layer: 64 neurons + ReLU
Output Layer: 1 neuron (sigmoid for binary classification)

🚀 Getting Started

1. Clone the repository

git clone https://github.com/milliyin/sms-spam-model-train.git
cd Sms-Spam

2. Install dependencies

pip install torch scikit-learn datasets huggingface_hub

3. Run Inference

python inference.py

🧪 Example Output

Input: you have won a free prize
Output: Spam

🗃️ Dataset

The model uses the public dataset ucirvine/sms_spam from Hugging Face. It contains 5,574 SMS messages labeled as spam or ham.

🧠 My Learning Journey

This project is part of my learning process on how to train machine learning models using my own code. Here’s what I’ve learned and implemented:

✅ Installed and configured libraries like torch, scikit-learn, and datasets.
✅ Downloaded and loaded the dataset using the Hugging Face Datasets library.
✅ Preprocessed the raw SMS data using CountVectorizer to convert text into numerical feature vectors.
✅ Converted those vectors to PyTorch tensors.
✅ Split the data into training and testing sets using train_test_split.
✅ Assigned input features (X: SMS prompts) and labels (y: 0 for ham, 1 for spam).
✅ Created a custom model class CircleModelV0 using PyTorch's nn.Module.
✅ Wrote the training loop including forward pass, loss computation, backpropagation, and optimization.
✅ Evaluated the model using accuracy and tested it on new inputs.
✅ Saved and reloaded the model and vectorizer for inference.

🔐 Notes on PyTorch 2.6+

If you're using PyTorch 2.6+, the torch.load() call needs to allow loading full models with custom classes:

model = torch.load("full_model.pth", weights_only=False)

Make sure the class CircleModelV0 is defined or imported in your script before calling torch.load.

📬 Contact

Feel free to open issues or suggest improvements!
Made with ❤️ for NLP enthusiasts.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📩 SMS Spam Classifier

📁 Project Structure

🧠 Model Architecture (`CircleModelV0`)

🚀 Getting Started

1. Clone the repository

2. Install dependencies

3. Run Inference

🧪 Example Output

🗃️ Dataset

🧠 My Learning Journey

🔐 Notes on PyTorch 2.6+

📬 Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
full_model.pth		full_model.pth
inference.py		inference.py
main.ipynb		main.ipynb
vectorizer.pkl		vectorizer.pkl

Folders and files

Latest commit

History

Repository files navigation

📩 SMS Spam Classifier

📁 Project Structure

🧠 Model Architecture (CircleModelV0)

🚀 Getting Started

1. Clone the repository

2. Install dependencies

3. Run Inference

🧪 Example Output

🗃️ Dataset

🧠 My Learning Journey

🔐 Notes on PyTorch 2.6+

📬 Contact

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

🧠 Model Architecture (`CircleModelV0`)

Packages