phishing-website-detection

Final master's degree project.

I used the Phishing Websites dataset from the UCI Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets/phishing+websites) and got good results with XGBoost and a simple perceptron.
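As a rough illustration of the perceptron idea, here is a minimal sketch in plain Python. It assumes the classic single-layer perceptron learning rule with labels in {-1, 1}; the project's actual model, features, and hyperparameters are not shown in this README. The function names and the toy data are hypothetical.

```python
from typing import List, Sequence, Tuple

Sample = Tuple[Sequence[float], int]  # (feature vector, label in {-1, 1})


def predict(weights: Sequence[float], bias: float, features: Sequence[float]) -> int:
    """Return 1 if the weighted sum is positive, otherwise -1."""
    score = sum(w * x for w, x in zip(weights, features)) + bias
    return 1 if score > 0 else -1


def train_perceptron(samples: List[Sample], epochs: int = 20) -> Tuple[List[float], float]:
    """Classic perceptron rule: update the weights only on misclassified samples."""
    weights = [0.0] * len(samples[0][0])
    bias = 0.0
    for _ in range(epochs):
        mistakes = 0
        for features, label in samples:
            if predict(weights, bias, features) != label:
                # Move the decision boundary toward the misclassified sample.
                weights = [w + label * x for w, x in zip(weights, features)]
                bias += label
                mistakes += 1
        if mistakes == 0:  # every sample is classified correctly, so stop early
            break
    return weights, bias


# Hypothetical toy data, not taken from the real dataset.
toy_samples: List[Sample] = [
    ([1, 1], 1),
    ([1, 0], 1),
    ([-1, -1], -1),
    ([-1, 0], -1),
]

weights, bias = train_perceptron(toy_samples)
predictions = [predict(weights, bias, features) for features, _ in toy_samples]
```

The perceptron rule only converges when the classes are linearly separable, which is the case for this toy data but is not guaranteed for real data.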

Undersampling techniques such as One-Sided Selection and Condensed Nearest Neighbor were also used. They massively reduced the training time of the SVM models and improved the accuracy, precision, and F1 score of the SVM with an RBF kernel.
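To give a feel for why this kind of undersampling shrinks the training set, here is a minimal sketch of Condensed Nearest Neighbor in plain Python. It is an illustration under stated assumptions, not the code used in this project: it keeps every minority sample, seeds the store with the first majority sample (implementations often pick one at random), and uses a 1-nearest-neighbor check with squared Euclidean distance. The function names and the toy data are hypothetical.

```python
from typing import List, Sequence, Tuple

Sample = Tuple[Sequence[float], int]  # (feature vector, class label)


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest_label(store: List[Sample], features: Sequence[float]) -> int:
    """Label of the stored sample closest to `features` (1-nearest neighbor)."""
    return min(store, key=lambda s: squared_distance(s[0], features))[1]


def condensed_nearest_neighbor(samples: List[Sample], majority_label: int) -> List[Sample]:
    """Keep all minority samples plus only the majority samples that the
    current store would misclassify with a 1-NN rule."""
    minority = [s for s in samples if s[1] != majority_label]
    majority = [s for s in samples if s[1] == majority_label]
    if not majority:
        return list(minority)

    store = minority + [majority[0]]  # assumption: deterministic seed sample
    pending = majority[1:]
    changed = True
    while changed:  # repeat passes until no majority sample is added
        changed = False
        still_pending = []
        for sample in pending:
            if nearest_label(store, sample[0]) != sample[1]:
                store.append(sample)  # misclassified, so it carries information
                changed = True
            else:
                still_pending.append(sample)  # redundant for now
        pending = still_pending
    return store


# Hypothetical toy data: label 1 is the majority class here.
toy: List[Sample] = [
    ([-1, -1], -1), ([-1, 0], -1),
    ([1, 1], 1), ([1, 0], 1), ([0, 1], 1), ([1, 1], 1), ([0, 0], 1),
]
reduced = condensed_nearest_neighbor(toy, majority_label=1)
```

Because SVM training cost grows quickly with the number of samples, dropping majority samples that a nearest-neighbor rule already classifies correctly leaves a smaller set concentrated near the class boundary.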

Creators of the dataset used: Mohammad, Rami, Thabtah, Fadi Abdeljaber and McCluskey, T.L. (2015) Phishing Websites Dataset. [Dataset] (Unpublished)