Investigating the Impact of Utilizing the ChatGPT for Arabic Sentiment Analysis

Al-Gaphari, Ghaleb; AL-Hagree, Salah; Al-Helali, Baligh

doi:10.1007/978-3-031-59711-4_9

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies (LNDECT, volume 210)

Included in the following conference series:

International Conference of Reliable Information and Communication Technology

Abstract

In the field of artificial intelligence (AI), the emergence of large language models (LLMs) that are fine-tuned to follow human instructions has been a remarkable breakthrough. One such model is OpenAI's ChatGPT (Chat Generative Pre-trained Transformer), which has proven to be a highly capable tool for various tasks, including question answering, code debugging, and dialogue generation. However, while these models are touted for their multilingual proficiency, their ability to accurately analyze sentiment, particularly in the Arabic language, has not been extensively investigated. To address this gap, we conduct a comprehensive evaluation of ChatGPT's sentiment analysis capabilities for Arabic text. We investigate the impact of utilizing ChatGPT variants for Arabic sentiment analysis (ASA) and propose a new active labeling method for ChatGPT. We evaluate the performance of four machine learning (ML) techniques, namely Naive Bayes (NB), K-Nearest Neighbors (K-NN), Support Vector Machine (SVM), and Random Forest (RF), using accuracy, recall, precision, and F-score. We also compare six methods of labeling the data for ASA: manual labeling by humans, labeling using ChatGPT by Assistant-Poe, labeling using ChatGPT by Bing-Edge, labeling using ChatGPT by Assistant-Poe with humans, labeling using ChatGPT by Bing-Edge with humans, and labeling using ChatGPT by Assistant-Poe with Bing-Edge. Our experimental results show that the NB technique performed best, achieving an accuracy of 91.22%, recall of 89.62%, precision of 88.90%, and F-score of 89.26% when using multiple Bing-Edge models for ASA. Moreover, our proposed active labeling method with ChatGPT achieved higher accuracy than the other labeling methods. Our study suggests that the NB technique with multiple Bing-Edge models and our proposed active labeling method are effective approaches for ASA using ChatGPT. It contributes to the advancement of sentiment analysis in Arabic text and offers valuable insights into effective approaches for this task.
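The relationship between the reported precision, recall, and F-score can be checked with a short calculation. The sketch below assumes the F-score is the standard F1 measure, the harmonic mean of precision and recall. The abstract does not state the averaging scheme, so this is an assumption, and the function name `f_score` is purely illustrative.

```python
def f_score(precision: float, recall: float) -> float:
    """Return the F1 score, i.e. the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Figures reported in the abstract for the NB classifier, as fractions.
reported_precision = 0.8890
reported_recall = 0.8962

# Assumption: the reported F-score is a plain F1 over these two values.
f1 = f_score(reported_precision, reported_recall)
print(f"F-score: {f1:.2%}")  # prints "F-score: 89.26%"
```

Under this assumption, the computed value agrees with the 89.26% F-score given in the abstract.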

Author information

Authors and Affiliations

Computer Science Department, Faculty of Computer and Information Technology, Sana’a University, Sana’a, Yemen
Ghaleb Al-Gaphari & Salah AL-Hagree
Department of Computer Sciences and Information Technology, Ibb University, Ibb, Yemen
Salah AL-Hagree & Baligh Al-Helali

Corresponding author

Correspondence to Ghaleb Al-Gaphari.

Editor information

Editors and Affiliations

School of Computing and Digital Technology, Birmingham City University, Birmingham, UK
Faisal Saeed
Department of Business Analytics, Sunway Business School, Sunway University, Selangor, Malaysia
Fathey Mohammed
Department of Computer Sciences and Electrical Engineering, Marshall University, Huntington, WV, USA
Yousef Fazea

About this paper

Cite this paper

Al-Gaphari, G., AL-Hagree, S., Al-Helali, B. (2024). Investigating the Impact of Utilizing the ChatGPT for Arabic Sentiment Analysis. In: Saeed, F., Mohammed, F., Fazea, Y. (eds) Advances in Intelligent Computing Techniques and Applications. IRICT 2023. Lecture Notes on Data Engineering and Communications Technologies, vol 210. Springer, Cham. https://doi.org/10.1007/978-3-031-59711-4_9

DOI: https://doi.org/10.1007/978-3-031-59711-4_9
Published: 30 June 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-59710-7
Online ISBN: 978-3-031-59711-4
eBook Packages: Intelligent Technologies and Robotics (R0)
