Skip to main content

Towards a New Lexicon-Based Features Vector for Sentiment Analysis: Application to Moroccan Arabic Tweets

  • Conference paper
  • First Online:
Advances in Information, Communication and Cybersecurity (ICI2C 2021)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 357))

Abstract

The emergence of the Web 2.0 technology generated a huge amount of raw data by enabling Internet users to post their opinions and reviews on the web. This data plays an important role in decision making for many peoples and organizations. An example of valuable insights that can be extracted from user’s posts is their opinions and sentiments regarding topics, events, services, products, etc. The English language has been the subject of extensive research on sentiment analysis. The proposed solutions are largely dominated by the use of two main analysis approaches based on machine learning techniques and the lexical approach. This work focuses on the second one to analyze the sentiments expressed in Moroccan tweets written in Arabic language : Standard Arabic (SA) and Moroccan Dialect (MD), and proposes a new method for extracting characteristics and representing data. The main idea of this method is to represent the text as a weight vector of feelings. Due to the lack of resources (databases and lexicon dictionaries) for the Arabic language, especially for the Moroccan one, this work starts with the construction of a corpus of 18.000 valid tweets based on 36 114 collected tweets that are manually tagged and classified as MD or SA. Then describes the steps of the construction of the Moroccan Senti-lexicon, a dictionary of 30.000 words labeled as positive, negative or neutral. The results of this study prove to be superior to those obtained by other comparable state of the art approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

Notes

  1. 1.

    https://developer.twitter.com.

  2. 2.

    https://socialbakers.com/statistics/twitter.

  3. 3.

    https://www.trendsmap.com/local/morocco.

References

  1. Bakshi, R.K., Kaur, N., Kaur, R., Kaur, G.: Opinion mining and sentiment analysis. In: 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom), pp. 452–455 (2016)

    Google Scholar 

  2. Cambria, E., Poria, S., Gelbukh, A., Thelwall, M.: Sentiment analysis is a big suitcase. IEEE Intell. Syst. 32(06), 74–80 (2017). https://doi.org/10.1109/MIS.2017.4531228

    Article  Google Scholar 

  3. Garouani, M., Chrita, H., Kharroubi, J.: Sentiment analysis of Moroccan tweets using text mining. In: Motahhir, S., Bossoufi, B. (eds.) ICDTA 2021. LNNS, vol. 211, pp. 597–608. Springer, Cham (2021). https://doi.org/10.1007/978-3-030-73882-2_54

    Chapter  Google Scholar 

  4. Albraheem, L., Al-Khalifa, H.S.: Exploring the problems of sentiment analysis in informal Arabic. In: ACM International Conference Proceeding Series, pp. 415–418 (2012) . https://doi.org/10.1145/2428736.2428813

  5. Abdulla, N.A., Ahmed, N.A., Shehab, M.A., Al-Ayyoub, M.: Arabic sentiment analysis: Lexicon-based and corpus-based. In: 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT) (2013). https://doi.org/10.1109/AEECT.2013.6716448

  6. Alsolamy, A.A., Siddiqui, M.A., Khan, I.H.: A corpus based approach to build Arabic sentiment lexicon. Int. J. Inf. Eng. Electron. Bus. 11, 16–23 (2019). https://doi.org/10.5815/IJIEEB.2019.06.03

    Article  Google Scholar 

  7. Mohammad, S., Salameh, M., Kiritchenko, S.: Sentiment lexicons for Arabic social media. In: Proceedings of LREC’16, pp. 33–37 (2016)

    Google Scholar 

  8. Eskander, R., Rambow, O.: SLSA: a sentiment lexicon for standard Arabic. In: Conference Proceedings - EMNLP 2015 Conference on Empirical Methods in Natural Language Processing, pp. 2545–2550 (2015). https://doi.org/10.18653/V1/D15-1304

  9. Abdulla, N.A., Al-Ayyoub, M., Al-Kabi, M.N.: An extended analytical study of Arabic sentiments. Int. J. Big Data Intell. 1, 103–113 (2014). https://doi.org/10.1504/IJBDI.2014.063845

    Article  Google Scholar 

  10. Al-Ayyoub, M., Essa, S.B., Alsmadi, I.: Lexicon-based sentiment analysis of Arabic tweets. Int. J. Soc. Netw. Min. 2, 101–114 (2015). https://doi.org/10.1504/IJBDI.2014.063845

    Article  Google Scholar 

  11. Itani, M.M., Zantout, R.N., Hamandi, L., Elkabani, I.: Classifying sentiment in Arabic social networks: Naïve search versus Naïve Bayes. In: 2012 2nd International Conference on Advances in Computational Tools for Engineering Applications ACTEA 2012, pp. 192–197 (2012). https://doi.org/10.1109/ICTEA.2012.6462864

  12. UNGEGN - United Nations Group of Experts on Geographical Names. https://unstats.un.org/unsd/geoinfo/ungegn/wg1.html. Accessed 02 July 2021

  13. Abdulla, N.A., Ahmed, N.A., Shehab, M.A., Al-Ayyoub, M., Al-Kabi, M.N., Al-rifai, S.: Towards improving the lexicon-based approach for Arabic sentiment analysis. IJITWE 9(3), 55–71 (2014). https://doi.org/10.4018/ijitwe.2014070104

    Article  Google Scholar 

  14. El Abdouli, A., Hassouni, L., Anoun, H.: A new practical approach to automatically generate the trending topics in Morroccan society using the social network twitter. Revue Méditerranéenne des Télécommunications (2017)

    Google Scholar 

  15. Almatarneh, S., Gamallo, P.: Automatic construction of domain-specific sentiment lexicons for polarity classification. In: De la Prieta, F., et al. (eds.) PAAMS 2017. AISC, vol. 619, pp. 175–182. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-61578-3_17

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Moncef Garouani .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Garouani, M., Kharroubi, J. (2022). Towards a New Lexicon-Based Features Vector for Sentiment Analysis: Application to Moroccan Arabic Tweets. In: Maleh, Y., Alazab, M., Gherabi, N., Tawalbeh, L., Abd El-Latif, A.A. (eds) Advances in Information, Communication and Cybersecurity. ICI2C 2021. Lecture Notes in Networks and Systems, vol 357. Springer, Cham. https://doi.org/10.1007/978-3-030-91738-8_7

Download citation

Publish with us

Policies and ethics