Skip to content

Utkarsh2108/sms-spam-detection

Repository files navigation

📩 SMS Spam Detection

A machine learning project to classify SMS messages as spam or ham (not spam) using natural language processing (NLP) techniques.


👥 Collaborators


📌 Project Overview

This project aims to build an SMS spam classifier using traditional NLP techniques and machine learning algorithms. The model learns to distinguish spam messages from legitimate ones using a labeled dataset of SMS messages.


📂 Dataset


⚙️ Tech Stack & Tools

  • Language: Python
  • Libraries: pandas, scikit-learn, nltk, matplotlib, seaborn
  • Modeling: Naive Bayes, Logistic Regression, SVM, etc.
  • Notebook: Jupyter Notebook (sms_spam_detection.ipynb)

🧠 Approach

🔹 Data Cleaning & Preprocessing

  • Lowercasing
  • Punctuation removal
  • Stopword filtering
  • Stemming

🔹 Exploratory Data Analysis (EDA)

  • Spam vs Ham distribution
  • Common word frequencies

🔹 Text Vectorization

  • Using TF-IDF and CountVectorizer

🔹 Model Building

  • Tested models: Naive Bayes, Logistic Regression, SVM

🔹 Model Evaluation

  • Accuracy, Precision, Recall, F1-score
  • Confusion Matrix

🤝 Let's Collaborate!

We're always open to feedback, suggestions, and collaboration on similar NLP or machine learning projects.

Connect with us on LinkedIn:

Feel free to reach out — let’s build something cool together! 🚀

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages