Towards Robust SEA Detection: Leveraging Model Diversity and Randomization Against Adversarial Attacks

Insaf Kraidia*, Samir Brahim Belhaouari

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Deep Learning-based Social Engineering Attack (SEA) detectors have shown promising results in identifying malicious content. However, these models remain vulnerable to adversarial attacks, where carefully crafted inputs can significantly impact their performance. This paper presents a novel defense mechanism that leverages model diversity and randomization to counter black-box adversarial attacks. Unlike traditional approaches that rely on a single model with fixed parameters, our method automatically utilizes multiple base model variants during inference, each trained on different data distributions with distinct settings. We use a random selection method from these variants for the final predictions, adding an element of unpredictability that increases the difficulty for attackers in creating successful adversarial examples. Our experimental results across various transformer-based architectures demonstrate that our approach maintains robust performance while effectively defending against 12 different types of black-box attacks targeting words, sentences, and characters. The proposed method achieves an average performance improvement of 2-3% over existing defense mechanisms on clean data and a significant 17% improvement in resistance to adversarial settings. It outperforms existing approaches by 13-44% in accuracy under adversarial attacks, all without substantially increasing computational complexity or compromising clean data performance. This work enhances the reliability and trustworthiness of SEA detection systems while addressing practical challenges in deploying robust defense mechanisms.

Original languageEnglish
Title of host publicationProceeding - 12th International Conference on Information Technology
Subtitle of host publicationInnovation Technologies, ICIT 2025
EditorsKhalid Mohammad Jaber
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages84-90
Number of pages7
ISBN (Electronic)9798331508944
DOIs
Publication statusPublished - May 2025
Event12th International Scientific Conference on Information Technology, ICIT 2025 - Amman, Jordan
Duration: 27 May 202530 May 2025

Publication series

NameProceeding - 12th International Conference on Information Technology: Innovation Technologies, ICIT 2025

Conference

Conference12th International Scientific Conference on Information Technology, ICIT 2025
Country/TerritoryJordan
CityAmman
Period27/05/2530/05/25

Keywords

  • Adversarial Attack
  • Black-box attacks
  • Deep Learning
  • Malicious Content
  • Random Selection
  • Social Engineering Attacks

Fingerprint

Dive into the research topics of 'Towards Robust SEA Detection: Leveraging Model Diversity and Randomization Against Adversarial Attacks'. Together they form a unique fingerprint.

Cite this