Skip to main content

Fake News Identification on Social Media Using Machine Learning Techniques

  • Conference paper
  • First Online:
Proceedings of International Conference on Information Technology and Applications

Abstract

The devastating effect of spreading fake news related to politics, health, and customer reviews cannot be neglected over social media on the decision-making approach of an individual. The problem of fake news needs the attention of social media administrators, law enforcement agencies, and academic researchers. To handle this issue, researchers suggested various artificial intelligence techniques. However, most of the studies used only a specific type of news that leads to dataset biases. This study used three different standard datasets collected from Kaggle and GitHub. Preprocessed the datasets to remove unwanted text. Then these preprocessed datasets are applied on three classifiers: passive aggressive, machine learning, and naïve Bayes of 30–70, 40–60, 50–50, 60–40, and 70–30, respectively. To evaluate the performance accuracy, precision and recall are used. Results clearly show that this study outperforms the state-of-the-art techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Aggarwal A, Rajadesingan A, Kumaraguru P (2012) PhishAri: automatic realtime phishing detection on twitter. eCrime Researchers Summit (eCrime)

    Google Scholar 

  2. Ajao O, Bhowmik D, Zargari S (2018) Fake news identification on twitter with hybrid cnn and rnn models. Proceedings of the 9th International Conference on Social Media and Society

    Google Scholar 

  3. Al-garadi MA, Varathan KD, Ravana SD (2016) Cybercrime detection in online communications: the experimental case of cyberbullying detection in the Twitter network. Comput Hum Behav 63:433–443

    Article  Google Scholar 

  4. Bessi A, Ferrara E (2016) Social bots distort the 2016 US Presidential election online discussion

    Google Scholar 

  5. Bessi A, Ferrara E (2016) Social bots distort the 2016 US Presidential election online discussion. First Monday 21(11–7)

    Google Scholar 

  6. Boididou C, Papadopoulos S, Zampoglou M, Apostolidis L, Papadopoulou O, Kompatsiaris Y (2018) Detection and visualization of misleading content on Twitter. Int J Multimedia Inf Retrieval 7(1):71–86. https://doi.org/10.1007/s13735-017-0143-x

    Article  Google Scholar 

  7. Buczak AL, Baugher B, Guven E, Ramac-Thomas LC, Elbert Y, Babin SM, Lewis SH (2015) Fuzzy association rule mining and classification for the prediction of malaria in South Korea. BMC Med Inform Decis Mak 15(1):47

    Article  Google Scholar 

  8. Ding X, Liu B, Yu PS (2008) A holistic lexicon-based approach to opinion mining. Proceedings of the 2008 international conference on web search and data mining

    Google Scholar 

  9. Domonoske C (2016) Students have ‘dismaying’inability to tell fake news from real, study finds. National Public Radio 23

    Google Scholar 

  10. Edkins B (2016) Americans Believe ey Can Detect Fake News. Studies Show ey Can’t.(December 2016). In

    Google Scholar 

  11. Eysenbach G (2008) Credibility of health information and digital media: new perspectives and implications for youth. MacArthur foundation digital media and learning initiative

    Google Scholar 

  12. Eysenbach G (2008) Credibility of health information and digital media: new perspectives and implications for youth. Digital Media Youth Credibility 123–154

    Google Scholar 

  13. Fernández-Luque L, Bau T (2015) Health and social media: perfect storm of information. Healthcare Inf Res 21(2):67–73

    Article  Google Scholar 

  14. Ferrara E (2017) Disinformation and social bot operations in the run up to the 2017 French presidential election

    Google Scholar 

  15. Girgis S, Amer E, Gadallah M (2018) Deep learning algorithms for detecting fake news in online text. 2018 13th international conference on computer engineering and systems (ICCES)

    Google Scholar 

  16. Heydari A, Ali Tavakoli M, Salim N, Heydari Z (2015) Detection of review spam: a survey. Expert Syst Appl 42(7):3634–3642

    Article  Google Scholar 

  17. Howard PN, Kollanyi B (2016) Bots,# StrongerIn, and# Brexit: computational propaganda during the UK-EU referendum. Available at SSRN 2798311

    Google Scholar 

  18. Howard, P. N., & Kollanyi, B. (2016). Bots,# strongerin, and# brexit: Computational propaganda during the uk-eu referendum. Browser Download This Paper.

    Google Scholar 

  19. Ishtiaq U, Kareem SA, Abdullah ERMF, Mujtaba G, Jahangir R, Ghafoor HY (2019) Diabetic retinopathy detection through artificial intelligent techniques: a review and open issues. Multimedia Tools Appl 1–44

    Google Scholar 

  20. Ivanitskaya L, Boyle IO, Casey AM (2006) Health information literacy and competencies of information age students: results from the interactive online Research Readiness Self-Assessment (RRSA). J Med Internet Res 8(2):e6

    Google Scholar 

  21. Kannan S, Gurusamy V (2014) Preprocessing techniques for text mining. Conference Paper. India

    Google Scholar 

  22. Lauw H, Shafer JC, Agrawal R, Ntoulas A (2010) Homophily in the digital world: A LiveJournal case study. IEEE Internet Comput 14(2):15–23

    Article  Google Scholar 

  23. Liu, B. (2007). Web data mining: exploring hyperlinks, contents, and usage data. Springer Science & Business Media.

    Google Scholar 

  24. McCallum A, Nigam K (1998) A comparison of event models for naive bayes text classification. AAAI-98 workshop on learning for text categorization

    Google Scholar 

  25. Ramasubramanian C, Ramya R (2013) Effective pre-processing activities in text mining using improved porter’s stemming algorithm. Int J Adv Res Comput Commun Eng 2(12):4536–4538

    Google Scholar 

  26. Ruchansky N, Seo S, Liu Y (2017) Csi: a hybrid deep model for fake news detection. Proceedings of the 2017 ACM on conference on information and knowledge management

    Google Scholar 

  27. Scott J (2017) Social network analysis. Sage

    Book  Google Scholar 

  28. Shu K, Sliva A, Wang S, Tang J, Liu H (2017) Fake news detection on social media: a data mining perspective. ACM SIGKDD Explorations Newsl 19(1):22–36

    Article  Google Scholar 

  29. Tucker J, Guess A, Barberá P, Vaccari C, Siegel A, Sanovich S, Stukal D, Nyhan B (2018) Social media, political polarization, and political disinformation: a review of the scientific literature

    Google Scholar 

  30. Vijayarani S, Ilamathi MJ, Nithya M (2015) Preprocessing techniques for text mining-an overview. Int J Comput Sci Commun Netw 5(1):7–16

    Google Scholar 

  31. Vijayarani S, Janani R (2016) Text mining: open source tokenization tools-an analysis. Adv Comput Intell: Int J (ACII) 3(1):37–47

    Google Scholar 

  32. Viviani M, Pasi G (2017) Credibility in social media: opinions, news, and health information—a survey. Wiley Interdisciplinary Rev: Data Mining and Knowl Discovery 7(5)

    Google Scholar 

  33. Vosoughi S, Roy D, Aral S (2018) The spread of true and false news online. Science 359(6380):1146–1151

    Article  Google Scholar 

  34. Yang C, Harkreader R, Zhang J, Shin S, Gu G (2012) Analyzing spammers’ social networks for fun and profit: a case study of cyber criminal ecosystem on twitter. Proceedings of the 21st international conference on World Wide Web

    Google Scholar 

  35. Yardi S, Romero D, Schoenebeck G (2009) Detecting spam in a twitter network. First Monday 15(1)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ghafoor, H.Y., Jaffar, A., Jahangir, R., Iqbal, M.W., Abbas, M.Z. (2022). Fake News Identification on Social Media Using Machine Learning Techniques. In: Ullah, A., Anwar, S., Rocha, Á., Gill, S. (eds) Proceedings of International Conference on Information Technology and Applications. Lecture Notes in Networks and Systems, vol 350. Springer, Singapore. https://doi.org/10.1007/978-981-16-7618-5_8

Download citation

Publish with us

Policies and ethics