Skip to main content

Detection of SMS Spam Using Machine-Learning Algorithms

  • Conference paper
  • First Online:
Embedded Systems and Artificial Intelligence

Abstract

Short message service (SMS) is considered as one of the most popular means of communication, it allows to the mobile phone users to exchange a short text message with a low cost. Its growing popularity and its dependence on mobile phone has increased the number of attacks, caused by sending an unsolicited message like SMS spam. In this paper, we address a comparative study, between multilayer perceptron (MLP), support vector machine (SVM), random forest and k-nearest neighbors (KNN). For extracting the feature vectors the bag-of-words (BOW) and the TF-IDF methods are applied. These feature vectors are used as input for training and testing the different machine-learning classifiers mentioned above. The results of different machine-learning classifiers, based on their accuracy, precision, recall, F-measure, and ROC (receiver operating characteristic) curve have shown that the MLP outperforms SVM, random forest and KNN in SMS spam detection. Although the MLP has achieved the highest accuracy by using the BOW, than by using the TF-IDF method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Gupta, M., Bakliwal, A., Agarwal, S., Mehndiratta, P.: A comparative study of spam SMS detection using machine learning classifiers. In: 2018 Eleventh International Conference on Contemporary Computing (IC3), pp. 287–293

    Google Scholar 

  2. SMS Spam Dataset ‘Collection V.1’. Available online at http://www.dt.fee.unicamp.br/~tiago/smsspamcollection/

  3. Spam SMS Dataset 2011–12. Available online on request at http://precog.iiitd.edu.in/requester.php?dataset=smsspam

  4. Choudhary, N., Jain, A.K.: Towards filtering of SMS spam messages using machine learning based technique. In: Advanced Informatics for Computing Research: First International Conference, ICAICR 2017, Jalandhar, pp. 18–30, 17–18 Mar 2017 (Revised selected papers)

    Google Scholar 

  5. Uysal, A.K. Gunal, S. Ergin, S. Gunal, E.S.: The impact of feature extraction and selection on SMS spam filtering. 2013 Elektronika Ir Elektrotechnika, 67–72

    Google Scholar 

  6. Popovac, M., Karanovic, M., Sladojevic, S., Arsenovic, M., Anderla, A.: Convolutional neural network based SMS spam detection. In: 26th Telecommunications Forum TELFOR 2018, pp. 807–810

    Google Scholar 

  7. Goh, K.L., Lim, K.H., Singh, A.K.: Multilayer perceptrons neural network based Weh spam detection application. In:2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP), Beijing, China, pp. 636–640

    Google Scholar 

  8. Kingma, D.P., Ba, J.L.: Adam: a method for stochastic optimization. In: ICLR 2015 Conference, pp 34–48

    Google Scholar 

  9. Kim, J., Kim, B., Savarese, S.: Comparing image classification methods: K-nearest-neighbor and support-vector-machines. In: American-Math’12/CEA’12 Proceedings of the 6th WSEAS International Conference on Computer Engineering and Applications, and Proceedings of the 2012 American conference on Applied Mathematics, pp. 133–138

    Google Scholar 

  10. Hsu, C.-W., et al.: A practical guide to support vector classification. Available at https://www.csie.ntu.edu.tw/~cjlin (2016)

  11. Sedhai, S., Sun, A.: Semi-supervised spam detection in twitter stream. IEEE Trans. Comput. Soc. Syst., 169–175 (2018)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fatima Zohra El Hlouli .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

El Hlouli, F.Z., Riffi, J., Mahraz, M.A., El Yahyaouy, A., Tairi, H. (2020). Detection of SMS Spam Using Machine-Learning Algorithms. In: Bhateja, V., Satapathy, S., Satori, H. (eds) Embedded Systems and Artificial Intelligence. Advances in Intelligent Systems and Computing, vol 1076. Springer, Singapore. https://doi.org/10.1007/978-981-15-0947-6_41

Download citation

Publish with us

Policies and ethics