AAS SPAM Detector

The objective is to develop a SPAM classifier capable of reaching at least 70% accuracy. You can, and should, use all that was presented in the theoretical notebooks.

The dataset is different from the toy one used in the class, instead the work will be done on the Enron SPAM dataset. The Enron-Spam dataset is a fantastic ressource collected by V. Metsis, I. Androutsopoulos and G. Paliouras and described in their publication "Spam Filtering with Naive Bayes - Which Naive Bayes?". The dataset contains a total of 17.171 spam and 16.545 non-spam ("ham") e-mail messages (33.716 e-mails total). The original dataset and documentation can be found here.

Setup

Follow the instructions bellow:

python3 -m venv venv
sourve venv/bin/activate
pip install -r requirements.txt

Authors

Mário Antunes - mariolpantunes

License

This project is licensed under the MIT License - see the LICENSE file for details

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

AAS SPAM Detector

Setup

Authors

License

Files

README.md

Latest commit

History

README.md

File metadata and controls

AAS SPAM Detector

Setup

Authors

License