
Fighting Media Hyper-partisanship with Modern Language Representation Models

  • Conference paper
Proceedings of Data Analytics and Management

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies (LNDECT, volume 90)

Abstract

The Internet has shaped how people gather knowledge, learn from their surroundings, form their individual opinions, and deal with socially relevant topics. In a time of polarization, when news is slanted to match the reader's views, extremely one-sided narratives compete to dominate the Internet. It is therefore of utmost importance to devise an algorithm that can detect and counter such biases. We propose a convolutional neural network built on sentence embeddings from language representation models such as BERT, RoBERTa, DistilBERT, and XLNet, which classifies whether an article displays a hyper-partisan narrative. Rather than depending on fact verification, we analyze the author's writing style to expose an article's underlying bias. Our model achieves an accuracy of up to 88% with BERTweet-base. Such a model could help curb the spread of political propaganda through news outlets and help the public consume unbiased and accurate information.
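
The general idea of a convolutional classifier over sentence embeddings can be sketched in a few lines. The sketch below is illustrative only and is not the authors' implementation: the sentence vectors are random placeholders standing in for embeddings from a model such as BERT, the weights are untrained, and the filter widths, filter count, embedding size, and the names `conv1d_max_pool`, `classify`, and `init_params` are all assumptions made for this example.

```python
import math
import random


def conv1d_max_pool(sentences, filters, bias, width):
    """Slide each filter over windows of `width` consecutive sentence vectors,
    apply ReLU, and keep the largest activation per filter (max-over-time)."""
    pooled = []
    for f, b in zip(filters, bias):
        best = 0.0  # ReLU floor; also the result if the article is shorter than `width`
        for start in range(len(sentences) - width + 1):
            # Flatten the window of sentence embeddings into one vector.
            window = [x for vec in sentences[start:start + width] for x in vec]
            act = sum(w * x for w, x in zip(f, window)) + b
            best = max(best, act)
        pooled.append(best)
    return pooled


def classify(sentences, params):
    """Concatenate pooled features from every filter width, then apply a
    logistic output unit to get a probability for the positive class."""
    features = []
    for width, (filters, bias) in sorted(params["conv"].items()):
        features.extend(conv1d_max_pool(sentences, filters, bias, width))
    logit = sum(w * x for w, x in zip(params["out_w"], features)) + params["out_b"]
    return 1.0 / (1.0 + math.exp(-logit))


def init_params(dim, widths=(2, 3), n_filters=4, seed=0):
    """Small random weights (hypothetical sizes; a real model would be trained)."""
    rng = random.Random(seed)
    conv = {}
    for w in widths:
        filters = [[rng.uniform(-0.1, 0.1) for _ in range(w * dim)]
                   for _ in range(n_filters)]
        conv[w] = (filters, [0.0] * n_filters)
    out_w = [rng.uniform(-0.1, 0.1) for _ in range(n_filters * len(widths))]
    return {"conv": conv, "out_w": out_w, "out_b": 0.0}


# Placeholder "article": 5 sentences, each an 8-dimensional embedding.
rng = random.Random(1)
article = [[rng.uniform(-1, 1) for _ in range(8)] for _ in range(5)]
params = init_params(dim=8)
prob = classify(article, params)
print(f"P(hyper-partisan) = {prob:.3f}")
```

Because the weights are random, the printed probability carries no meaning; the point is the data flow: one embedding per sentence, convolution filters spanning neighboring sentences, max pooling over the article, and a single output unit for the hyper-partisan label.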

Acknowledgements

We are grateful to the Department of Computer Science and Engineering, Delhi Technological University, for providing the labs and resources for our study. We thank the faculty members of our department for their guidance and our parents for their encouragement.

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Kumar, A., Tyagi, U., Grover, T., Ghosh, A. (2022). Fighting Media Hyper-partisanship with Modern Language Representation Models. In: Gupta, D., Polkowski, Z., Khanna, A., Bhattacharyya, S., Castillo, O. (eds) Proceedings of Data Analytics and Management. Lecture Notes on Data Engineering and Communications Technologies, vol 90. Springer, Singapore. https://doi.org/10.1007/978-981-16-6289-8_6
