📩 SMS Spam Detection

A machine learning project to classify SMS messages as spam or ham (not spam) using natural language processing (NLP) techniques.

👥 Collaborators

📌 Project Overview

This project aims to build an SMS spam classifier using traditional NLP techniques and machine learning algorithms. The model learns to distinguish spam messages from legitimate ones using a labeled dataset of SMS messages.

📂 Dataset

Source: UCI SMS Spam Collection Dataset
Format: CSV file with two columns:
- label: spam or ham
- message: text content of the SMS

⚙️ Tech Stack & Tools

Language: Python
Libraries: pandas, scikit-learn, nltk, matplotlib, seaborn
Modeling: Naive Bayes, Logistic Regression, SVM, etc.
Notebook: Jupyter Notebook (sms_spam_detection.ipynb)

🧠 Approach

🔹 Data Cleaning & Preprocessing

Lowercasing
Punctuation removal
Stopword filtering
Stemming

🔹 Exploratory Data Analysis (EDA)

Spam vs Ham distribution
Common word frequencies

🔹 Text Vectorization

Using TF-IDF and CountVectorizer

🔹 Model Building

Tested models: Naive Bayes, Logistic Regression, SVM

🔹 Model Evaluation

Accuracy, Precision, Recall, F1-score
Confusion Matrix

🤝 Let's Collaborate!

We're always open to feedback, suggestions, and collaboration on similar NLP or machine learning projects.

Connect with us on LinkedIn:

Feel free to reach out — let’s build something cool together! 🚀

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
,gitignore		,gitignore
Procfile		Procfile
README.md		README.md
app.py		app.py
model.pkl		model.pkl
nltk.text		nltk.text
requirements.text		requirements.text
setup.sh		setup.sh
sms_spam_detection.ipynb		sms_spam_detection.ipynb
spam.csv		spam.csv
vectorizer.pkl		vectorizer.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📩 SMS Spam Detection

👥 Collaborators

📌 Project Overview

📂 Dataset

⚙️ Tech Stack & Tools

🧠 Approach

🔹 Data Cleaning & Preprocessing

🔹 Exploratory Data Analysis (EDA)

🔹 Text Vectorization

🔹 Model Building

🔹 Model Evaluation

🤝 Let's Collaborate!

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📩 SMS Spam Detection

👥 Collaborators

📌 Project Overview

📂 Dataset

⚙️ Tech Stack & Tools

🧠 Approach

🔹 Data Cleaning & Preprocessing

🔹 Exploratory Data Analysis (EDA)

🔹 Text Vectorization

🔹 Model Building

🔹 Model Evaluation

🤝 Let's Collaborate!

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages