BERT-based models for phishing detection

Songailaitė, Milita; Kankevičiūtė, Eglė; Zhyhun, Bohdan; Mandravickaitė, Justina

Use this url to cite publication: https://hdl.handle.net/20.500.12259/259788

BERT-based models for phishing detection

Type of publication

Straipsnis konferencijos medžiagoje Scopus duomenų bazėje / Article in conference proceedings in Scopus database (P1a2)

Author(s)

Author	Affiliation
Songailaitė, Milita	Gamtos ir tech.mokslų tyr.institut / Research Institute of Natural and Technological Sciences	Centre for Applied Research and Development (CARD)
Kankevičiūtė, Eglė	Gamtos ir tech.mokslų tyr.institut / Research Institute of Natural and Technological Sciences	Centre for Applied Research and Development (CARD)
Zhyhun, Bohdan	Taikomosios informatikos katedra / Department of Applied Informatics	Centre for Applied Research and Development (CARD)
Mandravickaitė, Justina	Gamtos ir tech.mokslų tyr.institut / GTMTI Gamtos ir tech.mokslų tyr.institut	Centre for Applied Research and Development (CARD)

Title

BERT-based models for phishing detection

[en]

Is part of

CEUR Workshop proceedings : IVUS 2023 : Proceedings of the 28th international conference on Information Society and University Studies , Kaunas, Lithuania, May 12, 2023.

Date Issued

Date	Volume	Start Page	End Page
2023	3575	34	44

Publisher

Aachen : CEUR-WS

Is Referenced by

Scopus

URI

URI
https://ceur-ws.org/Vol-3575/Paper4.pdf
https://hdl.handle.net/20.500.12259/259788

Field of Science

OECD Classification

Keywords (en)

Abstract (en)

In this paper we report the application of BERT-based models for phishing detection in emails. We fine-tuned 3 BERT-based models (DistilBERT, TinyBERT and RoBERTa) for the task.All the fine-tuned models attained scores above 0.985 for each metric (accuracy, precision,recall and F1-score). Nevertheless, the RoBERTa model demonstrated the highest classification scores across all metrics, indicating that it can classify the selected phishing data with the utmost accuracy. The models from each BERT architecture have then been assessed more deeply via using them in pseudo-real-life situation. For this purpose, we created an entirely new dataset from the actual phishing emails and used text augmentation techniques to increase their quantity. DistilBERT and RoBERTa models produced very similar outcomes, i.e., most of the emails were classified correctly. However, as DistilBERT uses fewer resources and performs better than the RoBERTa model, it has been regarded as the best model for detecting phishing emails in our case. The TinyBERT variant had the worst results as its size was insufficient for learning to categorize emails and detect phishing.