Abstract
The emergence of Web 2.0 technology has generated a huge amount of raw data by enabling Internet users to post their opinions and reviews on the web. This data plays an important role in decision making for many people and organizations. One example of the valuable insights that can be extracted from users' posts is their opinions and sentiments regarding topics, events, services, products, etc. The English language has been the subject of extensive research on sentiment analysis. The proposed solutions are largely dominated by two main approaches: machine learning techniques and the lexicon-based approach. This work focuses on the lexicon-based approach to analyze the sentiments expressed in Moroccan tweets written in Arabic, both Standard Arabic (SA) and Moroccan Dialect (MD), and proposes a new method for extracting features and representing data. The main idea of this method is to represent the text as a vector of sentiment weights. Due to the lack of resources (databases and lexicon dictionaries) for the Arabic language, especially for the Moroccan dialect, this work starts with the construction of a corpus of 18,000 valid tweets drawn from 36,114 collected tweets that are manually tagged and classified as MD or SA. It then describes the steps in the construction of the Moroccan Senti-lexicon, a dictionary of 30,000 words labeled as positive, negative or neutral. The results of this study are superior to those obtained by other comparable state-of-the-art approaches.
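The idea of representing a text as a vector of sentiment weights can be illustrated with a minimal sketch. The abstract does not specify the weighting scheme, so this example assumes the simplest one: the share of lexicon-matched tokens carrying each label. The lexicon `TOY_LEXICON`, the function `sentiment_weight_vector`, and the whitespace tokenizer are all hypothetical stand-ins, not the authors' Moroccan Senti-lexicon or method.

```python
from collections import Counter

# Hypothetical toy lexicon (English words for readability); it stands in for a
# sentiment lexicon whose entries are labeled positive, negative or neutral.
TOY_LEXICON = {
    "good": "positive",
    "great": "positive",
    "bad": "negative",
    "awful": "negative",
    "today": "neutral",
}

# Fixed label order so every text maps to a vector of the same shape.
LABELS = ("positive", "negative", "neutral")


def sentiment_weight_vector(text, lexicon=TOY_LEXICON):
    """Return the share of lexicon-matched tokens per label, ordered as LABELS.

    Assumption: weights are simple proportions over matched tokens; the
    weighting used in the paper may differ.
    """
    tokens = text.lower().split()  # naive whitespace tokenization
    counts = Counter(lexicon[t] for t in tokens if t in lexicon)
    matched = sum(counts.values())
    if matched == 0:
        # No lexicon word found: return an all-zero vector.
        return [0.0, 0.0, 0.0]
    return [counts[label] / matched for label in LABELS]


vector = sentiment_weight_vector("great phone but awful battery today")
```

A vector of this kind could then be passed to any classifier or thresholded directly; which of these the paper does is not stated in the abstract.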
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Garouani, M., Kharroubi, J. (2022). Towards a New Lexicon-Based Features Vector for Sentiment Analysis: Application to Moroccan Arabic Tweets. In: Maleh, Y., Alazab, M., Gherabi, N., Tawalbeh, L., Abd El-Latif, A.A. (eds) Advances in Information, Communication and Cybersecurity. ICI2C 2021. Lecture Notes in Networks and Systems, vol 357. Springer, Cham. https://doi.org/10.1007/978-3-030-91738-8_7
DOI: https://doi.org/10.1007/978-3-030-91738-8_7
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-91737-1
Online ISBN: 978-3-030-91738-8
eBook Packages: Intelligent Technologies and Robotics (R0)