spam-detection

This project focuses on building a robust spam detection model using machine learning techniques. The model is designed to classify emails or messages as "spam" or "not spam" based on their content.This project aims to achieve high accuracy in distinguishing spam from legitimate messages.

Data Preprocessing Removed unnecessary characters, punctuation, and stop words. Converted all text to lowercase. Tokenized the text into words. Applied lemmatization to reduce words to their root form.
Feature Engineering Bag-of-Words (BoW): Converted text data into numerical vectors representing word frequency. TF-IDF (Term Frequency-Inverse Document Frequency): Highlighted important words in the dataset. N-grams: Captured word sequences to add more context to the features.
Model Training Implemented and trained various machine learning models: Naive Bayes Random Forest
Evaluation The models were evaluated using the following metrics: Accuracy Precision Recall F1-score
Results The best-performing model achieved: Accuracy: 98% Precision: 97% Recall: 96% F1-Score: 96.5%

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitignore		.gitignore
README.md		README.md
spam detection (2).ipynb		spam detection (2).ipynb
spam.csv		spam.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

spam-detection

About

Releases

Packages

Languages

Krishnasoni200320/spam-detection-

Folders and files

Latest commit

History

Repository files navigation

spam-detection

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages