Abstract
A copypasta is a piece of text that is copied and pasted in online forums and social networking sites (SNSs) repeatedly, usually for a humorous or mocking purpose. In recent years, copypasta is also used to spread rumors and false information, which damages not only the reputation of individuals or organizations but also misleads many netizens. This paper presents a tool for Hong Kong netizens to detect text messages that are copypasta or their variants (by transforming an existing copypasta with new subjects and events). We exploit the Encyclopedia of Virtual Communities in Hong Kong (EVCHK), which contains a database of 315 commonly occurred copypasta in Hong Kong, and a CNN model to determine whether a text message is a copypasta or its variant with an accuracy rate of around 98%. We also showed a prototype of a Google Chrome browser extension that provides a user-friendly interface for netizens to identify copypasta and their variants on a selected text message directly (e.g., in an online forum or SNS). This tool can show the source of the corresponding copypasta and highlight their differences (if it is a variant). From a survey, users agreed that our tool can effectively help them to identify copypasta and hence help stop the spreading of this kind of online rumor.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Leung, L.: Generational differences in content generation in social media: the roles of the gratifications sought and of narcissism. Comput. Hum. Behav. 29(3), 997–1006 (2013)
Sahoo, S.R., et al.: Security issues and challenges in online social networks (OSNs) based on user perspective. In: Computer and Cyber Security, pp. 591–606 (2018)
Sahoo, S.R., et al.: Spammer detection approaches in online social network (OSNs): a survey. In: Sustainable Management of Manufacturing Systems in Industry 4.0, pp. 159–180. Springer, Cham (2022)
Chen, J., Wu, Z., Yang, Z., Xie, H., Wang, F.L., Liu, W.: Multimodal fusion network with contrary latent topic memory for rumor detection. IEEE Multimed. 29(1), 104–113 (2022)
Riddick, S., Shivener, R.: Affective spamming on Twitch: rhetorics of an emote-only audience in a presidential inauguration livestream. Comput. Compos. 64, 102711 (2022)
Lam, C.: How digital platforms facilitate parody: online humour in the construction of Hong Kong identity. Comedy Stud. 13(1), 101–114 (2022)
Zannettou, S., Sirivianos, M., Blackburn, J., Kourtellis, N.: The web of false information: rumors, fake news, hoaxes, clickbait, and various other shenanigans. J. Data Inf. Qual. 11(3), 1–37 (2019)
Tembhurne, J.V., Almin, M.M., Diwan, T.: Mc-DNN: fake news detection using multi-channel deep neural networks. Int. J. Semant. Web Inf. Syst. (IJSWIS) 18(1), 1–20 (2022)
Facebook Page of the Hong Kong Chief Secretary for Administration’s Office. https://www.facebook.com/CSOGOV/posts/420752431929437. Accessed 2022/07/01
Srinivasan, S., Dhinesh Babu, L.D.: A parallel neural network approach for faster rumor identification in online social networks. Int. J. Semant. Web Inf. Syst. (IJSWIS) 15(4), 69–89 (2019)
Avery, D.: Twitter updates security policy to combat spam tweets and ‘copypasta’. https://www.cnet.com/news/social-media/twitter-updates-security-policy-to-combat-spam-tweets-and-copypasta/. Accessed 2022/07/01
Chiang, T.A., Che, Z.H., Huang, Y.L., Tsai, C.Y.: Using an ontology-based neural network and DEA to discover deficiencies of hotel services. Int. J. Semant. Web Inf. Syst. (IJSWIS) 18(1), 1–19 (2022)
Fung, Y.C., Lee, L.K.: A chatbot for promoting cybersecurity awareness. In: Agrawal, D.P., Nedjah, N., Gupta, B.B., Martinez Perez, G. (eds.) Cyber Security, Privacy and Networking. LNNS, vol. 370, pp. 379–387. Springer, Singapore (2022)
Sahoo, S.R., et al.: Hybrid approach for detection of malicious profiles in twitter. Comput. Electr. Eng. 76, 65–81 (2019). ISSN 0045-7906. https://doi.org/10.1016/j.compeleceng.2019.03.003
Meel, P., Vishwakarma, D.K.: Fake news, rumor, information pollution in social media and web: a contemporary survey of state-of-the-arts, challenges and opportunities. Expert Syst. Appl. 153, 112986 (2020)
Rani, N., Das, P., Bhardwaj, A.K.: Rumor, misinformation among web: a contemporary review of rumor detection techniques during different web waves. Concurr. Comput. Pract. Exp. 34(1), e6479 (2022)
Fung, Y.C., Lee, L.K., Chui, K.T., Cheung, G.H.K., Tang, C.H., Wong, S.M.: Sentiment analysis and summarization of Facebook posts on news media. In: Data Mining Approaches for Big Data and Sentiment Analysis in Social Media, pp. 142–154. IGI Global (2022)
Lee, L.K., Chui, K.T., Wang, J., Fung, Y.C., Tan, Z.: An improved cross-domain sentiment analysis based on a semi-supervised convolutional neural network. In: Data Mining Approaches for Big Data and Sentiment Analysis in Social Media, pp. 155–170. IGI Global (2022)
Liu, Y., Liu, H., Wong, L.P., Lee, L.K., Zhang, H., Hao, T.: A hybrid neural network RBERT-C based on pre-trained RoBERTa and CNN for user intent classification. In: International Conference on Neural Computing for Advanced Applications, pp. 306–319. Springer, Singapore (2020)
Liu, H., Liu, Y., Wong, L.P., Lee, L.K., Hao, T.: A hybrid neural network BERT-cap based on pre-trained language model and capsule network for user intent classification. Complexity 2020, 8858852 (2020)
Appati, J.K., Nartey, P.K., Yaokumah, W., Abdulai, J.D.: A systematic review of fingerprint recognition system development. Int. J. Softw. Sci. Comput. Intell. (IJSSCI) 14(1), 1–17 (2022)
Lee, L.K., Fung, Y.C., Pun, Y.W., Wong, K.K., Yu, M.T.Y., Wu, N.I.: Using a multiplatform chatbot as an online tutor in a university course. In: 2020 International Symposium on Educational Technology (ISET), pp. 53–56. IEEE (2020)
Gunti, P., et al.: Data mining approaches for sentiment analysis in online social networks (OSNs). In: Data Mining Approaches for Big Data and Sentiment Analysis in Social Media, pp. 116–141. IGI Global (2022)
Pei, Z., Sun, Z., Xu, Y.: Slang detection and identification. In: Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), pp. 881–889 (2019)
Gupta, A., Kumaraguru, P., Castillo, C., Meier, P.: TweetCred: real-time credibility assessment of content on Twitter. In: International Conference on Social Informatics 2014, pp. 228–243. Springer, Cham (2014)
Lee, J.L., Chen, L., Lam, C., Lau, C.M., Tsui, T.H.: PyCantonese: Cantonese linguistics and NLP in python. In: Proceedings of the 13th Language Resources and Evaluation Conference, pp. 6607–6611. European Language Resources Association (2022)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Fung, YC. et al. (2023). Detecting Rumors Transformed from Hong Kong Copypasta. In: Nedjah, N., Martínez Pérez, G., Gupta, B.B. (eds) International Conference on Cyber Security, Privacy and Networking (ICSPN 2022). ICSPN 2021. Lecture Notes in Networks and Systems, vol 599. Springer, Cham. https://doi.org/10.1007/978-3-031-22018-0_2
Download citation
DOI: https://doi.org/10.1007/978-3-031-22018-0_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-22017-3
Online ISBN: 978-3-031-22018-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)