A Review of Methods for Phishing Website Detection from Legal websites

Document Type : Original Article

Authors

1 Engineering Department, Imam Khomeini International University, Qazvin, Iran

2 Engineering department, imam khomeini international university, qazvin, iran

Abstract

The increasing popularity of the Internet has led to the dramatic growth of e-commerce. However, such activities have significant security challenges, mainly due to cyber fraud and identity theft. Therefore, checking the legitimacy of visited web pages is a very important task to secure the identity of customers and prevent phishing attacks. The use of machine learning methods, and deep learning is widely recognized as a promising solution. Research is full of studies that use machine learning and deep learning methods to detect website phishing. However, their findings depend on the data set and are far from generalizable. The two main reasons for the lack of generalization are impractical replication and lack of appropriate benchmark data sets for fair evaluation of systems. Furthermore, phishing methods are constantly evolving and the proposed models do not keep up with the rapid changes. In this article, we review the methods of identifying phishing sites from legal sites and finally reach the final conclusion.

Keywords

Main Subjects


 [1] Hannousse, A., & Yahiouche, S. (2021). Towards benchmark datasets for machine learning based website phishing detection: An experimental study. Engineering Applications of Artificial Intelligence, 104, 104347.
[2] Anti-Phishing Working Group. (2006). Phishing Activity Trends Report-May, 2006. http://www. anti-phishing. org/reports/apwg_report_May2006. pdf.
[3] Wandera (2020). Mobile Threat Landscape 2020: Understanding the  key trends in mobile enterprise security in 2020. Technical Report. https://www.wandera.com/mobile-threat-landscape/.
[4] Safi, A., & Singh, S. (2023). A Systematic Literature Review on Phishing Website Detection Techniques. Journal of King Saud University-Computer and Information Sciences.
[5] Yang, L., Zhang, J., Wang, X., Li, Z., Li, Z., & He, Y. (2021). An improved ELM-based and data preprocessing integrated approach for phishing detection considering comprehensive features. Expert Systems with Applications, 165, 113863.
[6] Jain, A. K., & Gupta, B. B. (2018). PHISH-SAFE: URL features-based phishing detection system using machine learning. In Cyber Security: Proceedings of CSI 2015 (pp. 467-474). Springer Singapore.
[7] Jain, A. K., & Gupta, B. B. (2018). Two-level authentication approach to protect from phishing attacks in real time. Journal of Ambient Intelligence and Humanized Computing, 9, 1783-1796.
[8] Sindhu, S., Patil, S. P., Sreevalsan, A., Rahman, F., & AN, M. S. (2020, October). Phishing detection using random forest, SVM and neural network with backpropagation. In 2020 International Conference on Smart Technologies in Computing, Electrical and Electronics (ICSTCEE) (pp. 391-394). IEEE.
[9] Zhu, E., Ju, Y., Chen, Z., Liu, F., Fang, X. (2020). DTOF-ANN: an artificial neural network phishing detection model based on decision tree and optimal features. Appl. Soft Comput. J. 95,. https://doi.org/10.1016/j.asoc.2020.106505 106505.
[10] Alkawaz, M. H., Steven, S. J., Hajamydeen, A. I., & Ramli, R. (2021, April). A comprehensive survey on identification and analysis of phishing website based on machine learning methods. In 2021 IEEE 11th IEEE Symposium on Computer Applications & Industrial Electronics (ISCAIE) (pp. 82-87). IEEE.
[11] Basit, A., Zafar, M., Javed, A. R., & Jalil, Z. (2020, November). A novel ensemble machine learning method to detect phishing attack. In 2020 IEEE 23rd International Multitopic Conference (INMIC) (pp. 1-5). IEEE.