Towards the Construction of an Emotion Analysis Model in University Students Using Images Taken in Classrooms

Atehortúa Zapata, Jader Daniel; Cano Duque, Santiago; Forero Hincapié, Santiago; Hernández-Leal, Emilcy

doi:10.1007/978-3-031-47372-2_25

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1924))

Included in the following conference series:

Colombian Conference on Computing

178 Accesses

Abstract

Data mining is used in various fields, image processing is one of them, a particular application is the identification and classification of emotions expressed by students in the classroom. However, this creates challenges, such as the subjective interpretation of facial expressions and the need for extensive data sets to train and validate the models, for the former it is required to go to other allied research fields, and for the latter, a possibility is glimpsed in the transfer of learning. This work seeks to review and compare different classifiers for the construction of a model that allows the analysis of the emotions of university students from images extracted from recordings of face-to-face classes stored in an educational support platform. For this, the KDD (Knowledge Discovery in Databases) methodology was followed, and experiments were proposed with different configurations of hyperparameters and generation of models from classifiers such as Nearby Neighbors-KNN, Convolutional Neural Networks-CNN, and Random Forest. The performance of each one is contrasted based on precision, recall, F1, Accuracy, and ROC curve. Additionally, an approximation to a learning transfer process was carried out using an open-use data set (taken from the Kaggle repository) for the classification of emotions for the training of the models and validating with the data extracted from the source of the case study. The results support the utility and potential of applying these techniques in scenarios where image-based emotion analysis is required, with CNN being the classifier with the best accuracy and obtaining significant value from knowledge transfer that motivates further deepening of the approach for the treatment of this problem.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Machine Learning Based Emotion Recognition in a Digital Learning Environment

Online learners’ engagement detection via facial emotion recognition in online learning context using hybrid classification model

Article 21 February 2024

Deep Learning Based Audio-Visual Emotion Recognition in a Smart Learning Environment

Keywords

1 Introduction

The emotional well-being of students holds significant relevance in the educational context. Recognizing and understanding the emotions experienced by students is not only crucial for their personal development but can also have a direct impact on their academic performance and overall success in the educational environment [2]. However, the task of detecting and assessing students’ emotions can be complex and subjective for educators and education professionals. In response to this challenge, image processing supported by artificial intelligence approaches offers a promising tool for objectively and accurately analyzing students’ emotions [5]. This has already been tested in virtual education processes, where such analysis can even occur in real-time, enabling early interventions in the educational methodologies and resources employed.

Image processing enables the extraction of key visual features from an image, such as facial expressions, gestures, and body postures, which are closely related to human emotions. By combining computer vision and machine learning techniques, it is possible to develop algorithms capable of identifying and classifying the emotions expressed by students in various educational situations and contexts [1].

This work focuses on exploring the possibility of classifying and analyzing emotions in university students based on images extracted from recordings of in-person classes stored on an educational support platform. It is worth noting that this work represents a preliminary approach to building an emotion analysis model, making it an experimental effort involving classification techniques. At this stage, it does not yet propose a fully implementable tool in the classroom. Additionally, the advantages of transfer learning [13] are examined as a means of domain adaptation, using a validation dataset separate from the model’s training dataset. The goal is to fine-tune the pre-trained model to the target data by transferring prior knowledge and adapting to the new data. However, it is important to consider certain factors, such as image quality and the interpretation of emotions, which can be influenced by factors like lighting or the presence of facial obstructions. External factors, such as students’ mood or personal situations, may also affect facial emotional expression [8].

Throughout the article, various studies and practical applications that have used image processing to analyze students’ emotions in educational settings will be presented in Sect. 2. Section 3 will cover the methodology, including data selection and extraction, as well as the experiment setup. The results and their discussion will be presented in Sect. 4, while the conclusions and future work will conclude the document in Sect. 5.

2 Literature Review

There are several studies that have proven the benefits of image processing for the detection of emotions in educational environments. The state of mind can influence the learning process [7], so having knowledge of the emotions that a student is experiencing in a class can help the teacher to propose methodological strategies and didactic mediations.

In [4], a significant experimental benchmark for research on students’ emotion recognition and graphical visualization of facial expressions in a virtual learning environment is successfully proposed. This paper presents an exploration of speech and images by comparing the performance of several deep learning neural network algorithms and an improved long term bidirectional memory convolution neural network algorithm is proposed, which achieves satisfactory performance for the addressed case study.

On the other hand, in [10] a framework that combines a facial expression recognition (FER) algorithm with online course platforms is proposed. Students’ faces pictures are taken through the cameras of the devices they use to attend classes and the expressions are analyzed and classified into 8 types of emotions (anger, contempt, disgust, fear, happiness, neutrality, sadness, and surprise). The authors used a course with 27 students conducted on the Tencent Meeting platform and the results obtained show that the model based on Convolutional Neural Networks (CNN), demonstrates robustness in various environments. This suggests that facial expression recognition could be an effective tool for understanding students’ emotions during online classes.

As for [9], it reports the analysis of behavior and the search for patterns in the oral presentations of a group of students by applying sequential pattern mining techniques. The analysis allowed segmenting into three different groups according to their body postures, sequential pattern mining provided a complementary perspective for data evaluation and helped to observe the most frequent postural sequences of the students.

Another interesting work is presented in [6], which proposes the integration of two models, one for emotion recognition and the other for attention analysis, to facilitate monitoring during a student’s interaction in virtual environments. This integration was carried out on a web platform and the results indicate that the platform could be used by teachers as knowledge mediators, since they could understand the behavior of students in virtual environments, whether synchronous or asynchronous, and take actions to improve the learning experience of students.

In relation to transfer learning, the study [3] proposes a two-phase approach to develop an emotion recognition model and face the challenge of data scarcity in this field. Experimental results evidence a significant improvement in performance when applying the transfer learning strategy through implementation in a Convolutional Neural Network (CNN). This resulted in a remarkable increase in recognition efficiency from 86.38% to 95,89%.

Other works, such as those presented in [12] and [11] also recognize the usefulness and benefits of knowledge transfer in the representation of facial expressions and emotion recognition, being in a process of consolidation of this approach, as well as the libraries, algorithm implementations and tools that support it.

A review of previous studies highlights the importance of understanding students’ emotions in educational environments. Existing approaches have demonstrated efficacy in emotion recognition through image processing and facial expression analysis. However, they present limitations in terms of generalization, personalization, and data availability. The approach proposed in this manuscript seeks to overcome these limitations by improving performance and adaptability in a technology-assisted face-to-face educational environment.

3 Methodology

A data mining process was carried out following the traditional steps of a KDD (Knowledge Discovery in Databases) methodology, in which experiments with different hyperparameter configurations and model generation from classifiers such as KNN, CCN and Random Forest were proposed, contrasting the performance of each one based on accuracy, recall and F1 metrics.

By using these classifiers, we seek to perform a comparative analysis to determine whether the Convolutional Neural Network (CNN), being specifically designed for image and video processing, could perform greater efficiency in comparison to the other evaluated models.

Additionally, an approach to a learning transfer process was made by using an open-use dataset for emotion classification to train the models and validating with the data extracted from the case study source.

An applied research is developed with a case study corresponding to the analysis of images obtained in a subject of the Systems Engineering program of the University of Medellin in the semester 2022-2, which includes videos captured in the classroom, from which a relevant set of images were extracted for our research. These videos were obtained from the cameras installed in the classrooms, thus guaranteeing the quality of the data. Regarding data privacy, it is important to note that these classes are uploaded to the u-virtual platform, and due to a previously established agreement at the time of enrollment at the university, they are accessible to the university community. Two strategies were proposed for data collection, one for model training data, which were obtained from a free external repository, and one for test data, which were collected from the group described above as a case study.

Figure 1 shows the general scheme of the steps carried out in the process, where the flow between the data obtained for the training of the models, the collection of the test data, the training process that generates results and the pre-trained model that is subsequently used to perform the transfer with the test data extracted from the real case mentioned above is identified.

4 Results

The results at the end of the application of the techniques show that the pre-trained models from a generic dataset of emotions identified in facial expressions demonstrate a medium level of effectiveness in identifying the predominant emotion in students from images of moments extracted from recordings of in-person classes. Together, these results support the utility and potential of applying these techniques in scenarios where image-based emotional analysis is required. Below are the performance metrics obtained for the three implemented models (see Table 1).

Table 1. Evaluation metrics.

Full size table

On the other hand, the ROC curve is shown in Fig. 2, which illustrates the relationship between the true positive rate and the false positive rate. These curves allow an effective evaluation and comparison of the performance of the models in classification problems. Likewise, the value of the area under the ROC curve will be highlighted, a metric that quantifies the quality of the models in terms of their classification capacity.

In the implementation of the three classification algorithms, it was observed that the CNN model demonstrated superior performance compared to the Random Forest and KNN algorithms. Likewise, after applying Transfer Learning, an improvement in the efficiency of the CNN model was evident. On the other hand, the KNN algorithm also experienced an increase in its efficiency but remained below the CNN. These results suggest that the use of Transfer Learning can enhance the performance of classification models for the processing of emotional images.

These results support the usefulness of the applied techniques and their potential for various applications. For example, in educational settings, the developed models can be used to identify learning situations that generate positive or negative emotions in students, allowing timely intervention to improve the learning experience.

When performing a comparative analysis of the metrics presented in the table (see Table 1) of models, it is clearly highlighted that the CNN model exhibits the greatest efficiency, even when applying transfer learning, reaffirming its position as the most suitable model for the case study. These results coincide with the evidence found in previous studies [8] and [11], where it was also shown that a Convolutional Neural Network (CNN) obtained superior results in efficiency. However, it is important to highlight that, despite these achievements, we must continue to focus on further improving the efficiency of our case study, considering that the results reported in the literature review exceed 85%.

5 Conclusions

In the processes of classifying educational data, aspects such as the observation of features, that is, exploratory analysis, and the verification of transformation needs are fundamental, as well as the verification of data balance. For this work, the interpretation of emotions present in an image can be subjective and vary among different observers. In image analysis, the consistency and reliability of results can be affected, in addition to the aforementioned factors, by other aspects such as the selection of the partitioning method for the training stage, the definition and execution of experiments, and the strategy for evaluating the results.

The use of transfer learning in image analysis yields favorable results, proving to be an effective way to enhance model performance, especially in situations where there is a limited dataset available or rapid adaptation to new tasks or applications is required. For the experiments outlined in this work, where a comparison of various classification techniques was carried out, it is found that transfer learning is a viable option for training models in problems where obtaining a significant dataset is challenging.

As a future work, the goal is to expand the test dataset taken from the virtual learning environment that supports in-person teaching processes. Additionally, more detailed tracking of a specific subject, along with the visual records, is planned, and these records may be accompanied by sociodemographic characterization data of the student group and the didactic activities carried out at the time of video and image capture. Furthermore, the intention is to incorporate other relevant factors, such as precise emotion interpretation, to achieve more efficient results in the analysis.

References

Barrionuevo, C., Ierache, J.S., Sattolo, I.I.: Reconocimiento de emociones a través de expresiones faciales con el empleo de aprendizaje supervisado aplicando regresión logística, pp. 491–500 (2020). http://sedici.unlp.edu.ar/handle/10915/114089
Cañero-Pérez, M., Mónaco-Gerónimo, E., Montoya-Castilla, I.: La inteligencia emocional y la empatía como factores predictores del bienestar subjetivo en estudiantes universitarios. EJIHPE: Eur. J. Invest. Health Psychol. Educ. 9(1), 19–29 (2019). ISSN-e: 2254-9625. ISSN: 2174-8144. https://doi.org/10.30552/ejihpe.v9i1.313
Hung, J.C., Lin, K.C., Lai, N.X.: Recognizing learning emotion based on convolutional neural networks and transfer learning. Appl. Soft Comput. 84, 105724 (2019). https://doi.org/10.1016/J.ASOC.2019.105724
Article Google Scholar
Lu, X.: Deep learning based emotion recognition and visualization of figural representation. Front. Psychol. 12, 818833 (2022). https://doi.org/10.3389/FPSYG.2021.818833
Article Google Scholar
Masias, E.J.F., Segovia, J.H.L., Casique, A.G., Díaz, M.E.D.: Análisis de sentimientos con inteligencia artificial para mejorar el proceso enseñanza-aprendizaje en el aula virtual. Publicaciones 53, 185–216 (2023). https://doi.org/10.30827/PUBLICACIONES.V53I2.26825. https://revistaseug.ugr.es/index.php/publicaciones/article/view/26825
Piedrahíta-Carvajal, A., Rodríguez-Marín, P.A., Terraza-Arciniegas, D.F., Amaya-Gómez, M., Duque-Muñoz, L., Martínez-Vargas, J.D.: Aplicación web para el análisis de emociones y atención de estudiantes. TecnoLógicas 24, 62–76 (2021). https://doi.org/10.22430/22565337.1821
Puertas-Molero, P., Zurita-Ortega, F., Chacón-Cuberos, R., Castro-Sánchez, M., Ramírez-Granizo, I., González-Valero, G.: La inteligencia emocional en el ámbito educativo: un meta-análisis. Anales de Psicología 36, 84–91 (2020). https://doi.org/10.6018/ANALESPS.36.1.345901
Article Google Scholar
Saxena, A., Khanna, A., Gupta, D.: Emotion recognition and detection methods: a comprehensive survey. J. Artif. Intell. Syst. 2, 53–79 (2020). https://doi.org/10.33969/AIS.2020.21005. https://iecscience.org/jpapers/46. https://iecscience.org/jpapers/46abstract
Vieira, F., et al.: A learning analytics framework to analyze corporal postures in students presentations. Sensors 21, 1525 (2021). https://doi.org/10.3390/S21041525. https://www.mdpi.com/1424-8220/21/4/1525/htm. https://www.mdpi.com/1424-8220/21/4/1525
Wang, W., Xu, K., Niu, H., Miao, X.: Emotion recognition of students based on facial expressions in online education based on the perspective of computer simulation. Complexity 2020 (2020). https://doi.org/10.1155/2020/4065207
Wu, D., Han, X., Yang, Z., Wang, R.: Exploiting transfer learning for emotion recognition under cloud-edge-client collaborations. J. Sel. Areas Commun. 39, 479–490 (2021). https://doi.org/10.1109/JSAC.2020.3020677
Article Google Scholar
Xue, F., Wang, Q., Guo, G.: TransFER: learning relation-aware facial expression representations with transformers, pp. 3601–3610 (2021)
Google Scholar
Zhuang, F., et al.: A comprehensive survey on transfer learning. Proc. IEEE 109, 43–76 (2021). https://doi.org/10.1109/JPROC.2020.3004555
Article Google Scholar

Download references

Author information

Authors and Affiliations

Universidad de Medellín, Cra 87 # 30-65, Medellín, Colombia
Jader Daniel Atehortúa Zapata, Santiago Cano Duque, Santiago Forero Hincapié & Emilcy Hernández-Leal

Authors

Jader Daniel Atehortúa Zapata
View author publications
You can also search for this author in PubMed Google Scholar
Santiago Cano Duque
View author publications
You can also search for this author in PubMed Google Scholar
Santiago Forero Hincapié
View author publications
You can also search for this author in PubMed Google Scholar
Emilcy Hernández-Leal
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Emilcy Hernández-Leal .

Editor information

Editors and Affiliations

Universidad EAFIT, Medellín, Colombia
Marta Tabares
Universidad EAFIT, Medellín, Colombia
Paola Vallejo
Universidad EAFIT, Medellín, Colombia
Biviana Suarez
Pedagogical and Technological University, Tunja, Colombia
Marco Suarez
Universidad EAFIT, Medellín, Colombia
Oscar Ruiz
Universidad EAFIT, Medellín, Colombia
Jose Aguilar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Atehortúa Zapata, J.D., Cano Duque, S., Forero Hincapié, S., Hernández-Leal, E. (2024). Towards the Construction of an Emotion Analysis Model in University Students Using Images Taken in Classrooms. In: Tabares, M., Vallejo, P., Suarez, B., Suarez, M., Ruiz, O., Aguilar, J. (eds) Advances in Computing. CCC 2023. Communications in Computer and Information Science, vol 1924. Springer, Cham. https://doi.org/10.1007/978-3-031-47372-2_25

Download citation

DOI: https://doi.org/10.1007/978-3-031-47372-2_25
Published: 14 November 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-47371-5
Online ISBN: 978-3-031-47372-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Towards the Construction of an Emotion Analysis Model in University Students Using Images Taken in Classrooms

Abstract

Similar content being viewed by others

Machine Learning Based Emotion Recognition in a Digital Learning Environment

Online learners’ engagement detection via facial emotion recognition in online learning context using hybrid classification model

Deep Learning Based Audio-Visual Emotion Recognition in a Smart Learning Environment

Keywords

1 Introduction

2 Literature Review

3 Methodology

4 Results

5 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Towards the Construction of an Emotion Analysis Model in University Students Using Images Taken in Classrooms

Abstract

Similar content being viewed by others

Machine Learning Based Emotion Recognition in a Digital Learning Environment

Online learners’ engagement detection via facial emotion recognition in online learning context using hybrid classification model

Deep Learning Based Audio-Visual Emotion Recognition in a Smart Learning Environment

Keywords

1 Introduction

2 Literature Review

3 Methodology

4 Results

5 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation