Analyzing Students Reviews of Teacher Performance Using Support Vector Machines by a Proposed Model

Gutiérrez, G.; Ponce, J.; Ochoa, A.; Álvarez, M.

doi:10.1007/978-3-319-76261-6_9

G. Gutiérrez¹¹,
J. Ponce¹²,
A. Ochoa¹³ &
…
M. Álvarez¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 820))

Included in the following conference series:

International Symposium on Intelligent Computing Systems

867 Accesses
10 Citations

Abstract

Sentiment Analysis arises from areas such as natural language processing and data mining, it has become a key area for society because it is possible to identify emotions expressed in texts. Teacher evaluation is considered an important process in higher education institutions to measure teacher performance and implement constructive strategies to benefit students in their education. This paper describes the design and development of a Model with the purpose of analyzing the students reviews of teacher performance. The collection of comments was done in two ways, the first is that comments were collected from a teacher evaluation conducted in 2016 and the second was made on Twitter through the participation of students of the Universidad Politécnica de Aguascalientes. In this work we applied Support Vector Machines with three kernels: linear, radial and polynomial, to predict a classification of comments in positive, negative or neutral and we calculated evaluation measures.

Access provided by CONRICYT-eBooks. Download conference paper PDF

A Sentiment Analysis Model to Analyze Students Reviews of Teacher Performance Using Support Vector Machines

YouTube Sentiment Analysis: Performance Model Evaluation

A Sentiment Analysis Based Approach for Exploring Student Feedback

Keywords

1 Introduction

In the teaching process is crucial to evaluate the teaching performance [1]. This evaluation is one of the most complex processes in any university, since various factors and criteria like: planning of classes, schedules, delivery of evaluation evidence, attendance to courses for the improvement of teaching and teaching styles, among others, should be met to be concentrated in order to provide a final assessment for the professor. Teacher evaluation can be performed by an observation guide or a rubric. However, when teacher performance is evaluated by students, varied opinions are collected from the same established criteria. Therefore Education is one of the areas that in recent years has shown interest in analyzing the comments of students in order that teachers improve their teaching techniques promoting appropriate learning in students. This is possible by Sentiment Analysis [2] an application of natural language processing, text mining and computational linguistics, to identify information from the text.

In his research, Binali [3] ensures that students represents their emotions in comments, so it is a way to learn about various aspects of the student. Wen [4] applies Sentiment Analysis on feedback from students about their teachers, enrolled in online courses in order to know their opinion and determine whether there is a connection between emotions and dropout rates. Students feedback on quality and standards of learning is considered as a strategy to improve the teaching process [4] and can be collected through a variety of social networks, blogs and surveys.

In this paper, we presented a model called SocialMining to support the Teacher Performance Assessment applying Support Vector Machines (SVM). We selected SVM as a classifier due to its high performance in classification applications [5, 6]. Further experiments with other machine learning algorithms will follow.

This paper is organized as follows. Section 2 presents related work. Section 3 shows the SocialMining model architecture. Section 4 describes data and methods used and experimental design. Section 5 includes the results. Finally, the conclusions of the work are presented in Sect. 6.

2 Related Work

The Table 1 shows an overview of some related work. All these works have obtained good results in their different combinations of methods and algorithms. This table is not exhaustive.

Table 1. List of features to analyze

Full size table

From Table 1 we can see that most of previous research has focused on particular aspects like: know the student emotional state, analyze the terms and phrases from opinions of students, detect the feelings of students on some topics and know the user opinions of the E-learning systems. In this work we proposed a model to evaluate teacher performance considering spanish reviews from students and applying machine learning algorithms to classify them as positive, negative and neutral. The results of this work may help to improve the classification process of comments and suggest courses to teachers.

3 SocialMining Model Architecture

The SocialMining model is composed of three phases: a comments extraction process (feedback from students about their teachers) and cleaning, a feature selection process, and classification of comments into positive, negative and neutral, applying SVM. The last phase includes an evaluation process of SVM results in kernels.

Phase 1: Comments Extraction and Cleaning Process.

In this phase we extracted feedback from students about their teachers to generate a corpus of comments. Then we do a labeling process to classify the comments into positive and negative considering a numeric range. The numeric range varied from: −2 to −0.2 is used for negative comments, −2 value express very negative comments. Values between +0.2 to +2 apply to positive comments, +2 is used as a positive comments. Likewise, those comments labeled with the number 1 are considered as neutral (Fig. 1).

In this cleaning process, the stop words and nouns that appear in most of the comments are deleted (e.g. teacher, university, class, subject, school and others). In addition, punctuation marks are removed and capitalized words are converted to lowercase. The output in this phase is the corpus of comments.

Phase 2: Feature Selection.

Once finished the cleaning process, we performed a feature selection process, removing repetitive terms and applying some functions to select the required terms or features, this process is like a filtering. So the input in this phase is the corpus of comments and the output are the features.

A feature in Sentiment Analysis is a term or phrase that helps to express a positive or negative opinion. There are several methods used in feature selection, where some of them are based on the syntactic word position, based in information gain, using a importance variable calculated by genetic algorithms [15] and trees like the variable importance measures for random forest [16]. In this phase is necessary to know the importance of each feature, by their weight. So the Term Frequency - Inverse Document Frequency (TF-IDF) is applied (Fig. 2).

Phase 3: Comments Classification Process.

In this phase, the corpus of comments and features (matrixCF) is partitioned into two independent datasets. The first dataset is dedicated to training process (train) and is used in classification to find patterns or relationships among data; the second dataset is considered for the testing process (test) in order to adjust the model performance. In this work two thirds of the matrixCF are used for training dataset and one-third for test dataset. Then the cross-validation method of k iterations is applied to control the tuning and training of SVM. In this method matrixCF is divided into K subsets. One of the subsets is used as test data and the remaining (K−1) as training data. The cross-validation process is repeated for K iterations, with each of the possible subsets of test data, resulting in a confusion matrix with average values. Once the K iterations have been completed, cross validation accuracy is obtained. In this research, K is equal to 10.

The tuning process in SVM allows adjusting the parameters of each kernel (linear, radial basis and polynomial). Then is performed a training process, through which is identified whether the value of the parameters vary or remain constant.

Finally, the implementation of SVM is performed presenting as a result the confusion matrix and accuracy values as well as the metric Receiver operating characteristic (ROC) curve.

4 Materials and Methods

4.1 Data

The dataset used in this work comprises 1040 comments in Spanish of three groups of systems engineering students at Universidad Politécnica de Aguascalientes. They evaluated 21 teachers in the first scholar grade (2016). For this study we considered only those comments free from noise or spam (characterized in this study as texts with strange characters, empty spaces, no opinion or comments unrelated to teacher evaluation). In this work we identified a set of 99 features. An extract of the features are listed in Table 2.

Table 2. List of features

Full size table

4.2 Performance Measures

We used typical performance measures in machine learning such as:

Accuracy, primary measure to evaluate the performance of a predictive model.
Balanced accuracy, a better estimate of a classifier performance when a unequal distribution.
Sensitivity, which measures the proportion of true positives.
Specificity, measures the proportion of true negatives.
ROC curve, measures the performance of a classifier through graphical representation [17, 18].

4.3 Classifiers

SVM is an algorithm introduced by Vapnik [19] for the classification of both linear and nonlinear data, it has been known for its quality in text classification [20]. There are kernels that can be used in SVM, such as: linear, polynomial, radial basis function (RBF) and sigmoid. Each of these kernels has particular parameters and they must be tuned in order to achieve the best performance. In this work we selected the first three kernels to classify comments; this is mainly because of their good performance in text classification [5, 6]. Table 3 shows the parameters of each kernel used in this study.

Table 3. Kernel parameters.

Full size table

C is the parameter for the soft margin cost function, it determines a tradeoff between a wide margin and classifier error. A very small value of C cause a larger margin separating hyperplane and the model get fit tighter to data, however a large value of C reduce the margin and this may cause more error on the training set.
Sigma determines the width for Gaussian distribution in Radial basis kernel.
Degree control the flexibility of the resulting classifier in Polynomial Kernel.

4.4 Experimental Design

We created a dataset containing 1040 comments and 99 features associated with teacher performance assessment. We used train-test evaluation, two-thirds (2/3) for training, and (1/3) one-third for testing, then there were performed 30 runs applying SVM with polynomial, radial basis function (RBF) and linear kernel. For each run performance measures are computed. In each run we set a different seed to ensure different splits of training and testing sets, all kernels use the same seed at the same run.

Each kernel requires tuning different parameters (see Table 3). A simple and effective method of tuning parameters of SVM has been proposed by Hsu [21], the grid search. The C values used for the kernels, range from 0.1 to 2, the value of sigma (σ) varied from 0.01 to 2, the degree value parameter range from 2 to 10, and values between 0 and 1 are assigned for coef parameter. We performed 30 train-test runs using different seeds and calculated the accuracy and balanced accuracy for each run.

5 Discussion and Results

In this section, we present the results with three kernels in SVM. The first step is to determine the parameters of each kernel of SVM, so we first load the data and create a partition of corpus of comments, then divided it into training and testing datasets, then use a train control in R [22] to set the training method. We use the Hsu [21] methodology to specify the search space in each kernel parameter. ROC is the performance criterion used to select the optimal kernels parameters of SVM.

Setting the seed to 1 in the process of optimization parameters, we generated paired samples according to Hothorn [23] and compare models using a resampling technique. Table 4 shows the summary of resampling results using R [22], the performance metrics are: ROC, sensibility and specificity. In the Fig. 3 we can see the plot of summary resampling results, in this case, the polynomial kernel apparently has a better performance than linear and RBF (radial).

Table 4. Summary resampling results of parameters optimization

Full size table

Once obtained optimized parameters for each kernel, the execution of each SVM model is performed.

The Table 5 shows the average results of each kernel of SVM across 30 runs. Also the standard deviation of each metric is presented.

Table 5. Average results across 30 runs in three kernels of SVM

Full size table

The linear kernel obtained a balanced accuracy above 0.80, this is an indicator that the classifier is feasible to use in comments classification. Values obtained in Sensitivity were much higher than those obtained in specificity in all kernels, which indicates that the classifier can detect the negative comments of the teachers. The kernel polynomial (SVM Poly) had the lowest performance in all metrics except in sensitivity. The three kernels resulted more sensitive than specific.

6 Conclusions

In computer science is attractive the use of this type of machine learnings models to automate processes, save time and contribute to decision-making. The SocialMining model supports the analysis of the behavior from unstructured data provided by students. The sentiment analysis is based on the analysis of texts and the SocialMining can provide a feasible solution to the problem of analysis of teacher evaluation comments. Further experiments will be conducted in this ongoing research project.

It is important to point out that is necessary reduce the number of features through a depth analysis to identify the most relevant features of teacher performance assessment, in order to improve the results of comments classification process. Also we considered important having a corpus of balanced comments (positive and negative comments in equal quantity) for testing and training process.

In addition to conduct a deeper analysis for relevant features selection, we considered necessary to implement other machine learning algorithms in order to measure the performance of each algorithm in the classification of comments and select the optimal with high accuracy results.

Based on the adequate results that have been obtained by the SocialMining model applying Naïve Bayes and a corpus of subjectivity [24], we considered that with the implementation of other algorithms of machine learning well-known for their good performance in classification process.

About how SocialMining model support the improvement of teaching in the first instance it allows a quicker analysis of student comments, identifying which teachers have mostly negative comments which allows interventions with the teacher in order to support it through teacher improvement courses. Each school period, courses are offered to teachers, however the comments of students are not considered among the criteria to recommend a certain course to the teacher. For this reason it is believed that the Model presented in this work will support the improvement of teaching.

References

Careaga, A.: La evaluación como herramienta de transformación de la práctica docente. Educere 5(15), 345–352 (2001)
Google Scholar
Bing, L.: Sentiment Analysis and Opinion Mining (Synthesis Lectures on Human Language Technologies). Morgan & Claypool Publishers, San Rafael (2012)
Google Scholar
Binali, H.H., Wu, C., Potdar, V.: A new significant area: emotion detection in E-learning using opinion mining techniques. In: 3rd IEEE International Conference on Digital Ecosystems and Technologies (DEST 2009). IEEE (2009)
Google Scholar
Wen, M., Yang, D., Penstein Rosé, C.: Sentiment analysis in MOOC discussion forums: what does it tell us? In: Proceedings of Educational Data Mining, pp. 1–8 (2014). http://goo.gl/fViyBH
Altrabsheh, N., Cocea, M., Fallahkhair, S.: Learning sentiment from students’ feedback for real-time interventions in classrooms. In: Bouchachia, A. (ed.) Adaptive and Intelligent Systems. LNCS, vol. 8779, pp. 40–49. Springer, Cham (2014). http://dx.doi.org/10.1007/978-3-319-11298-5_5
Manning, C.D., Raghavan, P., Schütze, H.: Support vector machines and machine learning on documents. In: Introduction to Information Retrieval, pp. 319–348 (1998)
Google Scholar
Sarkar, A., et al.: Text Classification using Support Vector Machine (2015)
Google Scholar
Ortigosa, A., Martín, J., Carro, R.: Sentiment analysis in Facebook and its application to e-learning. Comput. Hum. Behav. 31, 527–541 (2014). http://dx.doi.org/10.1016/j.chb.2013.05.024
Pong-Inwong, C.R., Rungworawut, W.S.: Teaching senti-lexicon for automated sentiment polarity definition in teaching evaluation. In: 10th International Conference on Semantics, Knowledge and Grids (SKG), Beijing, pp. 84–91. IEEE (2014)
Google Scholar
Kaewyong, P., Sukprasert, A., Salim, N., Phang, A.: The possibility of students’ comments automatic interpret using lexicon based sentiment analysis to teacher evaluation. In: The 3rd International Conference on Artificial Intelligence and Computer Science 2015, Penang, Malaysia (2015)
Google Scholar
Francesco, C., de Santo, M., Greco, L.: SAFE: a sentiment analysis framework for e-learning. Int. J. Emerg. Technol. Learn. 9(6), 37 (2014)
Google Scholar
Pang, B., Lee, L.: Thumbs up? Sentiment classification using machine learning techniques. In: Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86 (2002). https://doi.org/10.3115/1118693.1118704
Bharathisindhu, P., Brunda, S.: Identifying e-learner’s opinion using automated sentiment analysis in e-learning. Int. J. Res. Eng. Technol. 3(1) (2014). http://dx.doi.org/10.15623/ijret.2014.0319086
Kirubakaran, E.: M-learning sentiment analysis with data mining techniques. Int. J. Comput. Sci. Telecommun. (2012)
Google Scholar
Witten, I.H., Frank, E., Hall, M.A.: Data Mining: Practical Machine Learning Tools and Techniques, 3rd edn. Morgan Kaufmann (2011)
Google Scholar
Punch III, W.F., et al.: Further research on feature selection and classification using genetic algorithms. In: ICGA, pp. 557–564 (1993)
Google Scholar
Strobl, C., et al.: Conditional variable importance for random forests. BMC Bioinform. 9(1), 307 (2008)
Google Scholar
Han, J., Kamber, M., Pei, J.: Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco (2012)
Book MATH Google Scholar
Vapnick, V.: Statistical Learning Theory. Wiley, New York (1998)
Google Scholar
Esuli, A., Sebastiani, F., SENTIWORDNET: a publicly available lexical resource for opinion mining. In: Proceedings of the 5th Conference on Language Resources and Evaluation (LREC 2006), Genoa, Italy, pp. 417–422 (2006)
Google Scholar
Hsu, C.-W., Chang, C.-C., Lin, C.-J.: A practical guide to support vector classification. Technical report, Department of Computer Science, National Taiwan University (2003)
Google Scholar
R-Core-Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2013). http://goo.gl/e40yiU
Hothorn, T., Leisch, F., Hornik, K., Zeileis, A.: The design and analysis of benchmark experiments. J. Comput. Graph. Stat. 14(3), 675–699 (2005)
Article MathSciNet Google Scholar
Gutiérrez, G., Padilla, A., Canul-Reich, J., De-Luna, A., Ponce, J.: Proposal of a sentiment analysis model in tweets for improvement of the teaching - learning process in the classroom using a corpus of subjectivity. Int. J. Comb. Optim. Probl. Inform. 7(2), 22–34 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Universidad Politécnica de Aguascalientes, Aguascalientes, Mexico
G. Gutiérrez & M. Álvarez
Universidad Autónoma de Aguascalientes, Aguascalientes, Mexico
J. Ponce
Universidad Autónoma de Ciudad Juárez, Ciudad Juárez, Mexico
A. Ochoa

Authors

G. Gutiérrez
View author publications
You can also search for this author in PubMed Google Scholar
J. Ponce
View author publications
You can also search for this author in PubMed Google Scholar
A. Ochoa
View author publications
You can also search for this author in PubMed Google Scholar
M. Álvarez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to G. Gutiérrez .

Editor information

Editors and Affiliations

Autonomous University of Yucatán, Merida, Mexico
Carlos Brito-Loeza
Autonomous University of Yucatán, Merida, Mexico
Arturo Espinosa-Romero

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gutiérrez, G., Ponce, J., Ochoa, A., Álvarez, M. (2018). Analyzing Students Reviews of Teacher Performance Using Support Vector Machines by a Proposed Model. In: Brito-Loeza, C., Espinosa-Romero, A. (eds) Intelligent Computing Systems. ISICS 2018. Communications in Computer and Information Science, vol 820. Springer, Cham. https://doi.org/10.1007/978-3-319-76261-6_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-76261-6_9
Published: 17 February 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-76260-9
Online ISBN: 978-3-319-76261-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Analyzing Students Reviews of Teacher Performance Using Support Vector Machines by a Proposed Model

Abstract

Similar content being viewed by others

A Sentiment Analysis Model to Analyze Students Reviews of Teacher Performance Using Support Vector Machines

YouTube Sentiment Analysis: Performance Model Evaluation

A Sentiment Analysis Based Approach for Exploring Student Feedback

Keywords

1 Introduction

2 Related Work

3 SocialMining Model Architecture