Abstract
In this digital era, the dissemination of news is increasingly done online among the exploding population of Internet users. Due to the low cost involved and vanishing journalistic integrity, the spreading of unverified news, also called “fake news,” has become commonplace, often with long-lasting consequences to society. In these times of “digital deceit,” identifying fake news has become very important. This paper proposes a two-step approach using linguistic techniques in a deep learning framework to tackle the problem of fake news. Fake news may comprise fabricated news stories as well as “click baits.” Our approach involves identifying both these fake categories through a cascaded approach employing stance detection and entailment classification by building a complex model involving multiple deep neural networks. We use Term Frequency (TF), Cosine Similarity, and Word2Vec as features at different stages in the model. The datasets from Fake News Challenge 1 (FNC 1) and the Sentences Involving Compositional Logic (SICK) are used for stance detection and entailment classification, respectively. The focus of this work is, given an article, not to establish its credibility by identifying whether the article is counterfeit or not but to identify and eliminate the spread of fake news articles with respect to any particular spurious source.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Allcott H, Gentzkow M (2017) Social media and fake news in the 2016 election. J Econ Perspect 31(2):211–236
Conroy NK, Rubin VL, Chen Y (2015) Automatic deception detection: methods for finding fake news. Proc Assoc Inf Sci Technol 52(1):1–4
Chen Y, Conroy NJ, Rubin VL (2015) Misleading online content: recognizing click bait as “false news”. In: Proceedings of the 2015 ACM on workshop on multimodal deception detection, pp 15–19. (Nov 2015)
Granik M, Mesyura V (2017). Fake news detection using naive Bayes classifier. In: 2017 IEEE first Ukraine conference on electrical and computer engineering (UKRCON), pp 900–903. IEEE. (May 2017)
Bajaj S (2017) The pope has a new baby! Fake news detection using deep learning
Kaggle, Getting real about fake news. https://www.kaggle.com/mrisdal/fake-news/. Accessed 2020
Corney D, Albakour D, Martinez-Alvarez M, Moussa S (2016) What do a million news articles look like? In NewsIR@ ECIR, pp 42–47. (March 2016)
Feng VW, Hirst G (2013) Detecting deceptive opinions with profile compatibility. In: Proceedings of the sixth international joint conference on natural language processing, pp 338–346. (Oct 2013)
Baird S, Sibley D, Pan Y(2017) Talos targets disinformation with fake news challenge victory. https://blog.talosintelligence.com/2017/06/talos-fake-news-challenge.html
Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, pp 3111–3119
Hanselowski A, Avinesh PVS, Schiller B, Caspelherr F (2017) Team Athene on the fake news challenge. https://medium.com/@andre134679/team-athene-on-the-fake-news-challenge-28a5cf5e017b
Riedel B, Augenstein I, Spithourakis GP, Riedel S (2017) A simple but tough-to-beat baseline for the Fake News Challenge stance detection task. arXiv:1707.03264
Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning, pp 1188–1196. (Jan 2014)
Graves A (2012) Supervised sequence labelling. In: Supervised sequence labelling with recurrent neural networks. Springer, Berlin, Heidelberg, pp 5–13
Yih WT, Toutanova K, Platt JC, Meek C (2011) Learning discriminative projections for text similarity measures. In: Proceedings of the fifteenth conference on computational natural language learning, pp 247–256. (June 2011)
Mueller J, Thyagarajan A (2016) Siamese recurrent architectures for learning sentence similarity. In: Thirtieth AAAI conference on artificial intelligence. (Mar 2016)
Marelli M, Menini S, Baroni M, Bentivogli L, Bernardi R, Zamparelli R (2014) A SICK cure for the evaluation of compositional distributional semantic models. In: LREC, pp 216–223. (May 2014)
Tai KS, Socher R, Manning CD (2015) Improved semantic representations from tree-structured long short-term memory networks. arXiv:1503.00075
Kingma DP, Ba J (2014) Adam: a method for stochastic optimization. arXiv:1412.6980
Miller GA (1995) WordNet: a lexical database for English. Commun ACM 38(11):39–41
Author information
Authors and Affiliations
Contributions
All authors discussed the contents of the manuscript and contributed to its preparation. While E. A. W and K. M designed the model and implemented the system, H. M and S. N helped with the manuscript. S. A and N. K. A provided critical feedback and helped shape the research, analysis, and final version of the manuscript.
Corresponding author
Editor information
Editors and Affiliations
Ethics declarations
Conflict of Interest
The authors declare no conflict of interest.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Williams, E.A., Karthik, M., Shahina, A., Murali, H., Safiyyah, N., Nayeemulla Khan, A. (2023). Snooping for Fake News: A Cascaded Approach Using Stance Detection and Entailment Classification. In: Venkataraman, N., Wang, L., Fernando, X., Zobaa, A.F. (eds) Big Data and Cloud Computing. ICBCC 2022. Lecture Notes in Electrical Engineering, vol 1021. Springer, Singapore. https://doi.org/10.1007/978-981-99-1051-9_11
Download citation
DOI: https://doi.org/10.1007/978-981-99-1051-9_11
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-1050-2
Online ISBN: 978-981-99-1051-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)