Skip to main content

Fake News Detection Using Ensemble Learning and Machine Learning Algorithms

  • Chapter
  • First Online:
Combating Fake News with Computational Intelligence Techniques

Part of the book series: Studies in Computational Intelligence ((SCI,volume 1001))

Abstract

Digital news becomes widely accessible to a large community of users with the advancement of several channels of communication and the progression of technology and thus, contributes to the increase of spreading of fake news. The current study experiments and investigates machine learning models that classify news as either fake or real. Five classifiers were implemented using Random Forest, Support Vector Machine, Gradient Boosting, Logistic Regression, and Naïve Bayes algorithms. Models were trained using merged open-source datasets extracted from online sources covering different domains. Text lemmatization, vectorization, and tokenization were applied to extract useful information from news text and to improve the generalization capabilities and the performance of fake news classification models. The impact of the voting strategy on the performance of ensemble learning models were explored. The performance of the five classifiers was evaluated using the accuracy, the F1-Score, the recall, and the precision. The attained results are promising. The ensemble classifier trained using random forest algorithm and gradient boosting algorithm outperform the other classifiers and thus it might be used effectively against fake news spreading.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Elyassami, S., & Kaddour, A. (2021). Implementation of an incremental deep learning model for survival prediction of cardiovascular patients. IAES International Journal of Artificial Intelligence. 10(1), 101–109. ISSN 2252–8938

    Google Scholar 

  2. Elyassami, S., Hamid, Y., & Habuza, T.: Road crashes analysis and prediction using gradient boosted and random forest trees. In 2020 6th IEEE Congress on Information Science and Technology (CiSt), Agadir—Essaouira, Morocco (pp. 520–525). https://doi.org/10.1109/CiSt49399.2021.9357298

  3. Conroy, N. K., Rubin, V. L., & Chen, Y. (2015). Automatic deception detection: Methods for finding fake news. Proceedings of the Association for Information Science and Technology, 52(1), 1–4.

    Article  Google Scholar 

  4. Pradhan, & Ajay, M. (2020). Fake news detection methods: Machine learning approach. International Journal for Research in Applied Science and Engineering Technology, 8(7), 971–975. https://doi.org/10.22214/ijraset.2020.29630

  5. Maurice, V. (2018). Incorrect, fake, and false. journalists’ perceived online source credibility and verification behavior. Observatorio (OBS*) 12.1 (2018): n. pag. Web.

    Google Scholar 

  6. Kuldeep, N. (2018). New social media and the impact of fake news on society. In ICSSM Proceedings, July (pp. 77–96).

    Google Scholar 

  7. Álvaro Ibrain, R., & Lloret Iglesias, L. (2019). Fake news detection using deep learning.

    Google Scholar 

  8. Federico, M et al. (2019). Fake news detection on social media using geometric deep learning.

    Google Scholar 

  9. Lyu, S., & Lo, D.C.-T. (2020). Fake news detection by decision tree. SoutheastCon, 2020, 1–2. https://doi.org/10.1109/SoutheastCon44009.2020.9249688

    Article  Google Scholar 

  10. Natali, R et al. (2020). A hybrid deep model for fake news detection. CSI, 4(4). Accessed 27 Sept 2020.

    Google Scholar 

  11. Alao, A. (2020). How artificial intelligence tools are deployed in the fight against fake news. The Nation 4(4)

    Google Scholar 

  12. Nikhil, S. (2020). Fake news detection using machine learning. International Journal Of Trend In Scientific Research And Development (IJTSRD),4(4)

    Google Scholar 

  13. Kaggle. (2021). Fake news dataset 1. https://www.kaggle.com/clmentbisaillon/fake-and-real-news-dataset. Last Accessed 01 July 2021

  14. Kaggle. (2021). Fake news dataset 2. https://www.kaggle.com/c/fake-news/data. Last Accessed 01 July 2021

  15. Kaggle. (2021). Fake news dataset 3. https://www.kaggle.com/jruvika/fake-news-detection. Last Accessed 01 July 2021

  16. Smelyakov, K., Karachevtsev, D., Kulemza, D., Samoilenko, Y., Patlan, O., & Chupryna, A. (2020). Effectiveness of preprocessing algorithms for natural language processing applications, In 2020 IEEE International Conference on Problems of Infocommunications. Science and Technology (PIC S&T) (pp. 187–191). https://doi.org/10.1109/PICST51311.2020.9467919

  17. Shah, F. P., & Patel, V. (2016) A review on feature selection and feature extraction for text classification. In 2016 International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET) (pp. 2264–2268). https://doi.org/10.1109/WiSPNET.2016.7566545

  18. Shrivastava, P., & Shukla, M. (2015). Comparative analysis of bagging, stacking and random subspace algorithms. In 2015 International Conference on Green Computing and Internet of Things (ICGCIoT) (pp. 511‒516). https://doi.org/10.1109/ICGCIoT.2015.7380518

  19. Kecman, V. (2005). Support vector machines-an introduction in “Support vector machines: Theory and applications.” Springer.

    Google Scholar 

  20. Hofmann, T., Schölkopf, B., & Smola, A. J. (2008). Kernel methods in machine learning. The Annals of Statistics, 36(3), 1171–1220.

    Article  MathSciNet  Google Scholar 

  21. Saba Abdul-baqi, S. et al. (2018) A new model for iris classification based on naïve bayes grid parameters optimization. International Journal of Sciences: Basic and Applied Research (IJSBAR) 40.2, 150–155.

    Google Scholar 

  22. Singh, G., Kumar, B., Gaur, L., & Tyagi, A. Comparison between multinomial and bernoulli naïve bayes for text classification. In 2019 International Conference on Automation, Computational and Technology Management (ICACTM) (pp. 593–596). https://doi.org/10.1109/ICACTM.2019.8776800.

  23. Zhenhai, C., & Wei, L. (2012) Logistic regression model and its application. Journal of Yanbian University(Natural Science Edition), 38(01), 28–32.

    Google Scholar 

  24. Baldi, P. (1995). Gradient descent learning algorithm overview: A general dynamical systems perspective. IEEE Transactions on Neural Networks, 6(1), 182–195. https://doi.org/10.1109/72.363438

    Article  Google Scholar 

  25. Friedman, J. H. (2001) Greedy function approximation: A gradient boosting machine. Annual Statistics, 29 (5), 1189–1232.

    Google Scholar 

  26. Galdi, P., & Tagliaferri, R. (2018) Data mining: Accuracy and error measures for classification and prediction. Encyclopedia of Bioinformatics and Computational Biology 431–436

    Google Scholar 

  27. Powers, D. (2020). Evaluation: from precision, recall and Fmeasure to ROC, informedness, markedness and correlation. arXiv preprint arXiv:2010.16061

  28. Lever, J., Krzywinski, M., & Altman, N. (2016). Classification evaluation. Nature Methods, 13, 603–604. https://doi.org/10.1038/nmeth.3945

    Article  Google Scholar 

  29. Hofmann, M., & Klinkenberg, R. (2013) RapidMiner: Data mining use cases and business analytics applications.

    Google Scholar 

  30. Agarwal, A., Mittal, M., Pathak, A., et al. (2020). Fake news detection using a blend of neural networks: An application of deep learning. SN Computer Science, 1, 143.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sanaa Elyassami .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Elyassami, S., Alseiari, S., ALZaabi, M., Hashem, A., Aljahoori, N. (2022). Fake News Detection Using Ensemble Learning and Machine Learning Algorithms. In: Lahby, M., Pathan, AS.K., Maleh, Y., Yafooz, W.M.S. (eds) Combating Fake News with Computational Intelligence Techniques. Studies in Computational Intelligence, vol 1001. Springer, Cham. https://doi.org/10.1007/978-3-030-90087-8_7

Download citation

Publish with us

Policies and ethics