Using KNN and SVM Based One-Class Classifier for Detecting Online Radicalization on Twitter

Agarwal, Swati; Sureka, Ashish

doi:10.1007/978-3-319-14977-6_47

Swati Agarwal¹⁸ &
Ashish Sureka¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8956))

Included in the following conference series:

International Conference on Distributed Computing and Internet Technology

2553 Accesses
55 Citations
1 Altmetric

Abstract

Twitter is the largest and most popular micro-blogging website on Internet. Due to low publication barrier, anonymity and wide penetration, Twitter has become an easy target or platform for extremists to disseminate their ideologies and opinions by posting hate and extremism promoting tweets. Millions of tweets are posted on Twitter everyday and it is practically impossible for Twitter moderators or an intelligence and security analyst to manually identify such tweets, users and communities. However, automatic classification of tweets into pre-defined categories is a non-trivial problem problem due to short text of the tweet (the maximum length of a tweet can be 140 characters) and noisy content (incorrect grammar, spelling mistakes, presence of standard and non-standard abbreviations and slang). We frame the problem of hate and extremism promoting tweet detection as a one-class or unary-class categorization problem by learning a statistical model from a training set containing only the objects of one class . We propose several linguistic features such as presence of war, religious, negative emotions and offensive terms to discriminate hate and extremism promoting tweets from other tweets. We employ a single-class SVM and KNN algorithm for one-class classification task. We conduct a case-study on Jihad, perform a characterization study of the tweets and measure the precision and recall of the machine-learning based classifier. Experimental results on large and real-world dataset demonstrate that the proposed approach is effective with F-score of 0.60 and 0.83 for the KNN and SVM classifier respectively.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Supervised Classifiers to Identify Hate Speech on English and Spanish Tweets

Classification of multi-lingual tweets, into multi-class model using Naïve Bayes and semi-supervised learning

Article 29 August 2020

Hate Speech and Offensive Language Detection in Twitter Data Using Machine Learning Classifiers

Keywords

References

Agrawal, S., Sureka, A.: Learning to classify hate and extremism promoting tweets. JISIC (2014)
Google Scholar
Berger, J., Strathearn, B.: Who matters online: Measuring influence, evaluating content and countering violent extremism in online social networks. The international centre for the study of radicalization and political violence (2013)
Google Scholar
Chang, C.C., Lin, C.J.: LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2, 27:1–27:27 (2011)
Google Scholar
Kwok, I., Wang, Y.: Locate the hate: Detecting tweets against blacks. In: Twenty-Seventh AAAI Conference on Artificial Intelligence (2013)
Google Scholar
Li, R., Wang, S., Chang, K.C.C.: Towards social data platform: automatic topic-focused monitor for twitter stream. Proceedings of the VLDB Endowment 6(14), 1966–1977 (2013)
Article Google Scholar
Li, R., Wang, S., Deng, H., Wang, R., Chang, K.C.C.: Towards social user profiling: unified and discriminative influence model for inferring home locations. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1023–1031. ACM (2012)
Google Scholar
Liebrecht, C., Kunneman, F., van den Bosch, A.: The perfect solution for detecting sarcasm in tweets# not. Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (2013)
Google Scholar
Martinez-Romo, J., Araujo, L.: Detecting malicious tweets in trending topics using a statistical analysis of language. Expert Systems with Applications 40(8), 2992–3000 (2013)
Article Google Scholar
O’Callaghan, D., Greene, D., Conway, M., Carthy, J., Cunningham, P.: Uncovering the wider structure of extreme right communities spanning popular online networks. In: Web Science Conference, pp. 276–285 (2013)
Google Scholar
Reyes, A., Rosso, P., Buscaldi, D.: From humor recognition to irony detection: The figurative language of social media. Data & Knowledge Engineering 74, 1–12 (2012)
Article Google Scholar
Wadhwa, P., Bhatia, M.P.S.: Tracking on-line radicalization using investigative data mining. In: NCC, pp. 1–5 (2013)
Google Scholar
Xiang, G., Fan, B., Wang, L., Hong, J., Rose, C.: Detecting offensive tweets via topical feature discovery over a large scale twitter corpus. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 1980–1984. ACM (2012)
Google Scholar
Yang, M.C., Lee, J.T., Lee, S.W., Rim, H.C.: Finding interesting posts in twitter based on retweet graph analysis. In: SIGIR, pp. 1073–1074 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Indraprastha Institute of Information Technology, Delhi (IIIT-D), New Delhi, India
Swati Agarwal & Ashish Sureka

Authors

Swati Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Ashish Sureka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Technology and Computer Science, Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, 400005, Mumbai, India
Raja Natarajan
Department of Computer Science and Engineering, Indian Institute of Technology Guwahati, 781039, Guwahati, India
Gautam Barua
Department of Computer Science,, Berhampur University, 760007, Berhampur, Odisha, India
Manas Ranjan Patra

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Agarwal, S., Sureka, A. (2015). Using KNN and SVM Based One-Class Classifier for Detecting Online Radicalization on Twitter. In: Natarajan, R., Barua, G., Patra, M.R. (eds) Distributed Computing and Internet Technology. ICDCIT 2015. Lecture Notes in Computer Science, vol 8956. Springer, Cham. https://doi.org/10.1007/978-3-319-14977-6_47

Download citation

DOI: https://doi.org/10.1007/978-3-319-14977-6_47
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14976-9
Online ISBN: 978-3-319-14977-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Using KNN and SVM Based One-Class Classifier for Detecting Online Radicalization on Twitter

Abstract

Chapter PDF

Similar content being viewed by others

Supervised Classifiers to Identify Hate Speech on English and Spanish Tweets

Classification of multi-lingual tweets, into multi-class model using Naïve Bayes and semi-supervised learning

Hate Speech and Offensive Language Detection in Twitter Data Using Machine Learning Classifiers

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Using KNN and SVM Based One-Class Classifier for Detecting Online Radicalization on Twitter

Abstract

Chapter PDF

Similar content being viewed by others

Supervised Classifiers to Identify Hate Speech on English and Spanish Tweets

Classification of multi-lingual tweets, into multi-class model using Naïve Bayes and semi-supervised learning

Hate Speech and Offensive Language Detection in Twitter Data Using Machine Learning Classifiers

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation