Abstract
The Internet has shaped how people gather knowledge, learn from their surroundings, form their individual opinions, and deal with socially relevant topics. In a time of polarization, when the news that we see is twisted according to one’s view, extremely one-sided views aim to conquer the internet. In such a case, it is of utmost importance to devise an algorithm that can outperform and overcome such biases. We propose to build a convolutional neural network by utilizing sentence embeddings from language representation models like BERT, RoBERTa, DistilBERT, and XLNet, which would be able to classify whether an article displays a hyper-partisan narrative or not. We analyze the writing style of the author rather than depending on fact verification to prove an article’s underlying bias. Our model gives an accuracy up to 88% with BERTweet-base. Such a model can actively prevent the spread of political propaganda through news outlets and can lead to the public consuming unbiased and accurate information.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Alabdulkarim A, Alhindi T (2019) Spider-Jerusalem at SemEval-2019 task 4: hyperpartisan news detection. In: Proceedings of the 13th International workshop on semantic evaluation. Association for Computational Linguistics, Minneapolis, Minnesota, USA, pp 985–989. https://doi.org/10.18653/v1/S19-2170. https://www.aclweb.org/anthology/S19-2170
Barbieri F, Camacho-Collados J, Espinosa-Anke L, Neves L (2020) TweetEval:Unified benchmark and comparative evaluation for tweet classification. In: Proceedings of findings of EMNLP
Jiang Y, Petrak J, Song X, Bontcheva K, Maynard D (2019) Team bertha von suttner at SemEval-2019 task 4: hyperpartisan news detection using ELMo sentence representation convolutional network. In: Proceedings of the 13th International workshop on semantic evaluation. Association for Computational Linguistics, Minneapolis, Minnesota, USA, pp 840–844. https://doi.org/10.18653/v1/S19-2146. https://www.aclweb.org/anthology/S19-2146
Kiesel J, Mestre M, Shukla R, Vincent E, Adineh P, Corney D, Stein B, Potthast M (2019) SemEval-2019 task 4: Hyperpartisan news detection. In: Proceedings of the 13th International workshop on semantic evaluation. Association for Computational Linguistics, Minneapolis, Minnesota, USA, pp 829–839. https://doi.org/10.18653/v1/S19-2145. https://www.aclweb.org/anthology/S19-2145
Nguyen DQ, Vu T, Nguyen AT (2020) BERTweet: a pre-trained language model for English Tweets. In: Proceedings of the 2020 Conference on empirical methods in natural language processing: system demonstrations, pp 9–14 (2020)
Potthast M, Kiesel J, Reinartz K, Bevendorff J, Stein B (2018) A stylometric inquiry into hyperpartisan and fake news. In: Proceedings of the 56th Annual meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Melbourne, Australia, pp 231–240. https://doi.org/10.18653/v1/P18-1022. https://www.aclweb.org/anthology/P18-1022
Reimers N, Gurevych I (2019) Sentence-bert: sentence embeddings using siamese bert-networks. In: Proceedings of the 2019 Conference on empirical methods in natural language processing. Association for Computational Linguistics. https://arxiv.org/abs/1908.10084
Socher R, Perelygin A, Wu J, Chuang J, Manning CD, Ng A, Potts C (2013) Recursive deep models for semantic compositionality over a sentiment treebank. In: Proceedings of the 2013 Conference on empirical methods in natural language processing. Association for Computational Linguistics, Seattle, Washington, USA, pp 1631–1642. https://www.aclweb.org/anthology/D13-1170
Wolf T, Debut L, Sanh V, Chaumond J, Delangue C, Moi A, Cistac P, Rault T, Louf R, Funtowicz M, Brew J (2019) Huggingface’s transformers: state-of-the-art natural language processing. CoRR abs/1910.03771 (2019). http://arxiv.org/abs/1910.03771
Acknowledgements
We are grateful to the Department of Computer Science and Engineering Delhi Technological University for providing us the labs and resources for performing our study. We are thankful to all other faculty members of our department for their guidance, and our parents for their encouragement.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumar, A., Tyagi, U., Grover, T., Ghosh, A. (2022). Fighting Media Hyper-partisanship with Modern Language Representation Models. In: Gupta, D., Polkowski, Z., Khanna, A., Bhattacharyya, S., Castillo, O. (eds) Proceedings of Data Analytics and Management . Lecture Notes on Data Engineering and Communications Technologies, vol 90. Springer, Singapore. https://doi.org/10.1007/978-981-16-6289-8_6
Download citation
DOI: https://doi.org/10.1007/978-981-16-6289-8_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-6288-1
Online ISBN: 978-981-16-6289-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)