Email-Spam-Filtering

This project is a spam classification system that uses Multinomial Naive Bayes algorithm. The project involves pre-processing an adaptive dataset of emails using Natural Language Processing techniques such as tokenization, stop-word removal, and stemming. The pre-processed data is then used to train a logistic regression model. The model is based on bag-of-words approach, where each email is represented as a vector of word frequencies. The project uses Python programming language and its libraries including pandas, scikit-learn, and matplotlib for data processing, modeling, and visualization respectively. The evaluation of model accuracy is based on precision, recall, and F1-score metrics. The project includes data visualization of the most frequently used words in spam and ham emails.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
spam.csv		spam.csv
spam_filtering.py		spam_filtering.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Email-Spam-Filtering

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Email-Spam-Filtering

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages