Image-Based Phishing URL Classification Using Convolutional Neural Networks
gholamreza ahmadi
1
(
)
Hamed Monkaresi
2
(
دانشگاه رازی
)
Keywords: Phishing Detection, URL Classification, Convolutional Neural Networks (CNNs), Deep Learning, Image-Based Classification, Cybersecurity,
Abstract :
Phishing attacks continue to pose a significant threat to online security, with attackers increasingly leveraging deceptive URLs to steal sensitive information. Traditional phishing detection methods often rely on URL analysis or manual feature extraction, which can be time-consuming and less effective against evolving attack techniques. To address these limitations, more adaptive and intelligent detection mechanisms are increasingly required to keep pace with modern attack strategies. In this paper, we propose an image-based approach for phishing URL classification using Convolutional Neural Networks (CNNs). By transforming URLs into visual representations based on their features, we leverage the power of deep learning to automatically extract discriminative features for classification. We conduct a comprehensive comparison of various deep learning models, including different CNN architectures (both basic and pre-trained/fine-tuned), to evaluate their performance in terms of accuracy, computational efficiency, and training time. Our experiments demonstrate that image-based classification using CNNs achieves competitive accuracy while offering potential robustness against adversarial variations in phishing URLs. Additionally, we analyze the trade-offs between model complexity and inference time, providing insights into the practical deployment of such systems. The results highlight the potential of image-based deep learning models as an effective tool for phishing detection, paving the way for further research in this domain.
1. Vahid Shahrivari, Mohammad Mahdi Darabi, Mohammad Izadi. Phishing detection using machine learning techniques. arXiv preprint arXiv:200911116. 2020;
2. Aljofey A, Jiang Q, Rasool A, Chen H, Liu W, Qu Q, et al. An effective detection approach for phishing websites using URL and HTML features. Sci Rep. 2022 May 25;12(1):8842.
3. Akinyelu AA. Machine Learning and Nature Inspired Based Phishing Detection: A Literature Survey. International Journal on Artificial Intelligence Tools. 2019 Aug 29;28(05):1930002.
4. Goenka R, Chawla M, Tiwari N. A comprehensive survey of phishing: mediums, intended targets, attack and defence techniques and a novel taxonomy. Int J Inf Secur. 2024 Apr 19;23(2):819–48.
5. Sarker IH. Deep Cybersecurity: A Comprehensive Overview from Neural Network and Deep Learning Perspective. SN Comput Sci. 2021 May 20;2(3):154.
6. Khan A, Sohail A, Zahoora U, Qureshi AS. A survey of the recent architectures of deep convolutional neural networks. Artif Intell Rev. 2020 Dec 21;53(8):5455–516.
7. Prakash P, Kumar M, Kompella RR, Gupta M. PhishNet: Predictive Blacklisting to Detect Phishing Attacks. In: 2010 Proceedings IEEE INFOCOM. IEEE; 2010. p. 1–5.
8. Zhang N, Tan Y an, Yang C, Li Y. Deep learning feature exploration for Android malware detection. Appl Soft Comput. 2021 Apr;102:107069.
9. Khanan A, Abdelgadir Mohamed Y, Mohamed AHHM, Bashir M. From Bytes to Insights: A Systematic Literature Review on Unraveling IDS Datasets for Enhanced Cybersecurity Understanding. IEEE Access. 2024;12:59289–317.
10. Arivukarasi M, Antonidoss A. Performance Analysis of Malicious URL Detection by using RNN and LSTM. In: 2020 Fourth International Conference on Computing Methodologies and Communication (ICCMC). IEEE; 2020. p. 454–8.
11. Shrivastava GK, Pateriya RK, Kaushik P. An efficient focused crawler using LSTM-CNN based deep learning. International Journal of System Assurance Engineering and Management. 2023 Feb 19;14(1):391–407.
12. Abdali S, Gurav R, Menon S, Fonseca D, Entezari N, Shah N, et al. Identifying Misinformation from Website Screenshots. Proceedings of the International AAAI Conference on Web and Social Media. 2021 May 22;15:2–13. Costa H V., Ribeiro AGR, Souza VMA. Fusion of Image Representations for Time Series Classification with Deep Learning. In 2024. p. 235–50.
14. Jang S, Li S, Sung Y. FastText-Based Local Feature Visualization Algorithm for Merged Image-Based Malware Classification Framework for Cyber Security and Cyber Defense. Mathematics. 2020 Mar 24;8(3):460.
15. Adebowale MA, Lwin KT, Sánchez E, Hossain MA. Intelligent web-phishing detection and protection scheme using integrated features of Images, frames and text. Expert Syst Appl. 2019 Jan;115:300–13.
16. Nguyen G, Dlugolinsky S, Bobák M, Tran V, López García Á, Heredia I, et al. Machine Learning and Deep Learning frameworks and libraries for large-scale data mining: a survey. Artif Intell Rev. 2019 Jun 19;52(1):77–124.
17. Gupta P, Bagchi A. Introduction to Pandas. In 2024. p. 161–96.
18. Zhang Z. Improved Adam Optimizer for Deep Neural Networks. In: 2018 IEEE/ACM 26th International Symposium on Quality of Service (IWQoS). IEEE; 2018. p. 1–2.
19. Muhammad MB, Yeasin M. Eigen-CAM: Class Activation Map using Principal Components. In: 2020 International Joint Conference on Neural Networks (IJCNN). IEEE; 2020. p. 1–7.