Abstract
Fake news may have different meaning to different individuals. For the purpose of this paper, we will go by the definition of fake news as those reports that are bogus: The story itself is created, with no relation to realities, sources or statements. In this research on fake news detection through machine learning algorithms, we are implementing two feature selection approaches toward the problem: Bag of words model and TF-IDF vectorization model and are using four classifiers namely, logistic regression classifier, naive Bayes classifier, random forest classifier and passive aggressive classifier for classification purpose. This research is being conducted on two separate datasets, among which for bag of words model along with logistic regression classifier yields average F1 Score of 92.16% and for TF-IDF vectorization, logistic regression classifier yields average F1 Score of 93.47%. Also, passive aggressive classifier works well with high volume of data along with TF-IDF as can be seen by highest increase in F1 Score.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Hunt Allcott, H., Gentzkow, M.: Social media and fake news in the 2016 election. J. Econ. Perspect. 31(2) (2017)
Reis, J.C., Correia, A., Murai, F., Veloso, A., Benevenuto, F., Cambria, E.: Supervised learning for fake news detection. IEEE Intell. Syst. 34(2), 76–81 (2019)
Akaichi, J.: Social networks’ facebook’statutes updates mining for sentiment classification. In: 2013 International Conference on Social Computing (pp. 886–891). IEEE (2013, Sept)
Bozkır, A.S., Mazman, S.G., Sezer, E.A.: Identification of user patterns in social networks by data mining techniques: facebook case. In: International Symposium on Information Management in a Changing World, pp. 145–153. Springer, Berlin, Heidelberg (2010, Sept)
Savyan, P.V., Bhanu, S.M.S.: Behaviour profiling of reactions in facebook posts for anomaly detection. In: 2017 Ninth International Conference on Advanced Computing (ICoAC), pp. 220–226. IEEE (2017, Dec)
Ahmed, H., Traore, I., Saad, S.: Detection of online fake news using N-gram analysis and machine learning techniques. In: International Conference on Intelligent, Secure, and Dependable Systems in Distributed and Cloud Environments, pp. 127–138. Springer, Cham (2017, Oct)
Kaliyar, R.K., Goswami, A. Narang, P.: Multiclass fake news detection using ensemble machine learning. In: 2019 IEEE 9th International Conference on Advanced Computing (IACC), pp. 103–107. Tiruchirappalli, India (2019)
Ma, C., Huang, J.-B., Yang, X., Yang, M.-H.: Hierarchical convolutional features for visual tracking
Diaz-Garcia, J., Fernandez-Basso, C., Ruiz, M.D., Martin-Bautista, M.J.: Mining text patterns over fake and real tweets (2020). https://doi.org/10.1007/978-3-030-50143-3_51
Ibrishimova, M.D., Li, K.F.: A machine learning approach to fake news detection using knowledge verification and natural language processing (2020). https://doi.org/10.1007/978-3-030-29035-1_22
Zhang, J., Cui, L., Fu, Y., Gouza, F.B.: Fake news detection with deep diffusive network model (2018)
Cook, R.D., Weisberg, S.: Criticism and influence analysis in regression. Sociological Methodology, vol.13, pp. 313–361 (1982)
Suppe, F.: Classification. In: Barnouw, E. (ed.) International encyclopedia of communications. vol. 1, pp. 292–296. Oxford University Press, Oxford, UK (1989)
Tryon, R.C.: Cluster analysis: correlation profile and orthometric (factor) analysis for the isolation of unities in mind and personality. Edwards Brothers (1939)
Goldberg, Y.: A primer on neural network models for natural language processing. J. Artific. Intell. Res. 57, 345–420 (2016)
Library of Congress: The subject headings manual. Library of Congress, Policy and Standards Division, Washington, DC (2008)
McTear, M., Callejas, Z., Griol, D.: Spoken language understanding. In: The Conversational Interface. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-32967-3_8
Rajaraman, A., Ullman, J.D.: Data mining (PDF). Mining of Massive Datasets. pp. 1–1 (2011)
Tolles, J., Meurer, W.J.: Logistic regression relating patient characteristics to outcomes. JAMA 316(5), 533–534 (2016)
Ho, T.K.: In: Proceedings of the 3rd International Conference on Document Analysis and Recognition, pp. 278–282. Montreal, QC, 14–16 Aug 1995
Rennie, J., Shih, L., Teevan, J., Karger, D.: Tackling the poor assumptions of naïve bayes classifiers (2003)
Online Algorithms and Stochastic Approximations. Online Learning and Neural Networks. Cambridge University Press. ISBN 978-0-521-65263-6
Suri, B., Taneja, S., Aggarwal, S., Sharma, V.R.: Fake news detection tool (FNDT): shield against sentimental deception. J. Inf. Optim. Sci. (2020)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shrivastava, S., Singh, R., Jain, C., Kaushal, S. (2022). A Research on Fake News Detection Using Machine Learning Algorithm. In: Somani, A.K., Mundra, A., Doss, R., Bhattacharya, S. (eds) Smart Systems: Innovations in Computing. Smart Innovation, Systems and Technologies, vol 235. Springer, Singapore. https://doi.org/10.1007/978-981-16-2877-1_25
Download citation
DOI: https://doi.org/10.1007/978-981-16-2877-1_25
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2876-4
Online ISBN: 978-981-16-2877-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)