Open Access Open Access  Restricted Access Subscription Access

Email Spam Classifications Based on Support Vector Machine and Recurrent Neural Network

Satyendra Shukla, Pawanesh Singh

Abstract


In recent times, e-mail has become one of the fastest and the utmost economical process of communication. Spam emails have dramatically increased over the past few years as a result of the growth in email subscribers. In this growing world, most of the transactions, business, study materials are taking place through emails. But due to the social networks and advertising, some of the emails contain undesirable information known as spam. Reliable anti-spam filters have emerged as a result of the surge in the volume of unsolicited bulk email messages, or spam. Various email spam messages are marketable in nature but might similarly encompass covered links that seem to be for conversant websites but actually lead to phishing web sites or sites that are holding malware. A spammer naturally sends an email to number of email addresses. Spamming is on the rise, thanks to the ease with which it is possible to send a flood of emails at no cost. Spam mails disturb the one calmness. Even though numerous algorithms have been created for the classification of spam emails, none of them consistently produce reliable results. In this study, we are classifying the ham and the spam emails by doing comparative study of the RNN and SVM algorithm. This classification shows 97 and 79% accuracy and 0.97 and 0.79% Kappa coefficient. As a result, this study reveals that the email spam classification of Kaggle datasets has a high level of accuracy.


Keywords


Email classification, machine learning, Kaggle dataset, RNN, Kappa coefficient, email spams, support vector machines

Full Text:

PDF

References


Alghoul Ahmed, et al. Email Classification Using Artificial Neural Network. Int J Acad Dev. 2018; 2(11): 8–14.

Dada Emmanuel Gbenga, et al. Machine learning for email spam filtering: review, approaches and open research problems. Heliyon. 2019; 5(6): e01802.

Awad WA, ELseuofi SM. Machine learning methods for spam e-mail classification. Int J Comput Sci Inf Technol (IJCSIT). 2011; 3(1): 173–184.

Torabi ZS, Nadimi-Shahraki MH, Nabiollahi A. Efficient support vector machines for spam detection: a survey. Int J Comput Sci Inf Secur (IJCSIS). 2015; 13(1): 11–28.

Kumaresan T, Palanisamy C. E-mail spam classification using S-cuckoo search and support vector machine. Int J Bio-Inspired Comput. 2017; 9(3): 142–156.

Awad M, Foqaha M. Email spam classification using hybrisd approach of RBF neural network and particle swarm optimization. Int J Netw Secur Appl. 2016; 8(4): 17–28.

Guzella TS, Caminhas WM. A review of machine learning approaches to Spam filtering. Expert Syst Appl. 2009; 36(7): 10206–10222.

Yuchun Tang, Sven Krasser, Yuanchen He, Weilai Yang, Dmitri Alperovitch. Support Vector Machines and Random Forests Modeling for Spam Senders Behavior Analysis. IEEE GLOBECOM 2008; 2008 IEEE Global Telecommunications Conference. 2008; 1–5.

Marsono Muhammad N, Watheq El-Kharashi M, Fayez Gebali. Targeting spam control on middleboxes: Spam detection based on layer-3 e-mail content classification. Elsevier Comput Netw. 2009; 53(6): 835–848.

Karthika R, Visalakshi P. A hybrid ACO based feature selection method for email spam classification. WSEAS Trans Comput. 2015; 14: 171–177.


Refbacks

  • There are currently no refbacks.


Copyright (c) 2022 Journal of Network Security

  • eISSN: 2395–6739
  • ISSN: 2321–8517