1 Introduction

It is often said that we are drowning in data but starving for knowledge [1]. Extracting information from data facilitates knowledge building. Information, which can be termed a subset of data, stimulates action in an entity, whereas knowledge defines the action of an entity in a particular setting [2]. A number of researchers have classified knowledge on different bases, sometimes by the manner of its codification and occurrence [3], sometimes by its know-what, know-how, know-why and know-when aspects [4]. Some have even mapped knowledge in diverse domains [5].

There are a variety of ways to represent knowledge [6]. Production rules written in the form of IF-THEN rules are one of the most popular approaches to knowledge representation [7]. IF-THEN rules adopt a modular approach, each rule defining a principally independent and relatively small piece of knowledge. A rule-based system includes general rules and facts about the knowledge domain it covers.

Knowledge building in the education domain can be achieved by adopting procedures that optimize its functioning.

This research work is undertaken with the objective of deducing a relevance index for inferred knowledge. The education sector is taken as a particular case, inferring knowledge related to students' academic performance in a technical subject at the level of higher education. Deducing a relevance index for inferred knowledge is important because the index gives a clear depiction of the existing system and further helps in decision making.

The paper is organized as follows. Section 2 elaborates the methodology adopted for rule induction and further rule evaluation in the higher education set-up. Finally, the conclusions are drawn and presented in Sect. 3.

2 Adopted Methodology in Higher Education Scenario

Educational organizations strive to achieve higher academic output for their students, and many researchers have worked to predict the factors affecting students' academic results [8,9,10,11,12]. Identification of such critical parameters, which could improve the academic attainment of students, supports effective academic planning.

When the individuals of a population can be separated into different classes, a classification rule provides a system by which each individual of the population is allocated to one class or another.

In this study, knowledge is represented through classification rules [13], which exist in the form of IF-THEN rules. The work starts by identifying the variables and collecting data in the context of these variables. The values of the attributes are then encoded on an 8-level scale. Rule induction is performed with JRip, which implements the propositional rule learner RIPPER (repeated incremental pruning to produce error reduction). The rules are then evaluated with the Net Benefit metric, which takes into account both the classification and the misclassification witnessed by each knowledge rule.

2.1 Variable Identification and Data Collection

This dataset has 5000 records and five independent attributes, all of which are categorical. The independent attribute names in the dataset are as follows: ContinuousEvaluationMarks, SGPA_II, Practical_orient, Attendance, Base_Sub_Marks.

The independent attributes affect the dependent attribute of End_Term_Marks and are reflected in Table 1.

Table 1 Attributes of the study

The attributes were encoded on the 8-level scale, depicted in Table 2.

Table 2 Encoding of the attributes
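The encoding onto the 8-level scale can be sketched as follows. The 3-bit codes mirror those appearing in the induced rules, but the bin cut points below are assumed for illustration only; the actual boundaries are those of Table 2.

```python
def encode_8_level(value, boundaries):
    """Map a numeric attribute value to a 3-bit code ('000'..'111').

    boundaries: 7 ascending cut points separating the 8 levels.
    """
    level = sum(value > b for b in boundaries)  # bin index 0..7
    return format(level, '03b')                 # e.g. 4 -> '100'

# Hypothetical cut points for a marks attribute on a 0-100 scale:
cuts = [10, 20, 30, 40, 50, 60, 70]
print(encode_8_level(45, cuts))  # 45 exceeds four cut points -> '100'
```

Each categorical attribute of Table 1 would be passed through such a mapping before rule induction.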

2.2 Rule Induction

In 1995, Cohen proposed RIPPER (repeated incremental pruning to produce error reduction), a propositional rule learner [14]; JRip is its implementation.

JRip achieves error reduction through incremental pruning, examining the classes in increasing order of their size. The initial ruleset is generated on the basis of incremental reduced error. JRip (RIPPER) treats all instances of the training dataset belonging to a particular judgment as one class and deduces a ruleset that covers all the members of that class. The procedure is then repeated for all the classes.

Initialization

Initialize RS = {}, and proceed class by class, from the least frequent to the most frequent.

Repeat

{

  1. Building phase: repeat the grow and prune phases below until there are no positive instances left or the error rate increases beyond 50%.

     1.1 Grow phase: follow a greedy approach, adding conditions to the rule until its accuracy reaches 100%.

     1.2 Prune phase: incrementally prune each rule. The pruning metric is 2p/(p + n) − 1, where p is the number of positive instances and n the number of negative instances covered by the rule.

  2. Optimization phase: once the initial ruleset {R_i} has been generated, two variants of each rule are generated and pruned from randomized data using the Grow and Prune procedures. The first variant is grown from an empty rule; the second is created by greedily adding conditions to the original rule. The description length (DL) metric is computed for each variant, and the rule with the minimal DL is kept in the final ruleset. After all the rules in {R_i} have been examined, the building phase is used again to generate more rules if residual positives remain.

  3. Rules that increase the DL of the complete ruleset are deleted, and the final ruleset is added to RS.

}
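The pruning metric used in the prune phase above can be sketched in Python; this is a minimal illustration of the formula, not Weka's implementation.

```python
def pruning_metric(p, n):
    """RIPPER's pruning metric, 2p/(p + n) - 1, for a rule covering
    p positive and n negative instances on the pruning set.
    Ranges from -1 (all negatives) to +1 (all positives); conditions
    are deleted from a rule while this value does not worsen."""
    if p + n == 0:
        return -1.0  # a rule covering nothing is worthless
    return 2 * p / (p + n) - 1

# A rule covering 8 positives and 2 negatives on the pruning set:
print(pruning_metric(8, 2))  # 2*8/10 - 1 = 0.6
```

Note that 2p/(p + n) − 1 = (p − n)/(p + n), i.e. it rewards rules whose coverage is dominated by positive instances.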

In the study, JRip was implemented using Weka 3.8.0, and a ruleset of 87 rules was generated. A snapshot of the rules and the output achieved is shown in Fig. 1.

Fig. 1 Weka implementation

2.3 Rule Analysis and Interpretation

For each of the 87 rules acquired by implementing JRip on the dataset, the values of classification (true positives, TP) and misclassification (false positives, FP) were recorded [15].

  • True positive (TP): the number of examples satisfying both A and C

  • False positive (FP): the number of examples satisfying A but not C

where A is the antecedent of the rule and C is its consequent.
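Counting TP and FP for a single rule can be sketched as follows; the records and the predicates A and C below are hypothetical, with field names mirroring the study's attributes.

```python
def rule_tp_fp(records, antecedent, consequent):
    """TP: records satisfying both A and C; FP: records satisfying A but not C."""
    tp = sum(1 for r in records if antecedent(r) and consequent(r))
    fp = sum(1 for r in records if antecedent(r) and not consequent(r))
    return tp, fp

# Hypothetical encoded records:
data = [
    {"Attendance": "001", "End_Term_Marks": "011"},
    {"Attendance": "001", "End_Term_Marks": "110"},
    {"Attendance": "111", "End_Term_Marks": "011"},
]
A = lambda r: r["Attendance"] == "001"       # antecedent
C = lambda r: r["End_Term_Marks"] == "011"   # consequent
print(rule_tp_fp(data, A, C))  # (1, 1)
```

Records that do not satisfy the antecedent at all (the third record above) contribute to neither count.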

The rules were further evaluated on the basis of Net Benefit [16], considering a range of thresholds and calculating the NB at each. The results were then plotted as Net Benefit against rule number. For each threshold P_t, the Net Benefit was calculated as per Eq. 1:

$$ {\text{Net Benefit}}\;({\text{NB}}) = \frac{\text{TP}}{N} - \frac{\text{FP}}{N}\left( \frac{P_{t}}{1 - P_{t}} \right) $$
(1)
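Eq. 1 can be computed directly. In the sketch below, N = 5000 matches the dataset size stated earlier, while the TP and FP counts are hypothetical values for a single rule.

```python
def net_benefit(tp, fp, n, p_t):
    """Eq. 1: NB = TP/N - (FP/N) * (p_t / (1 - p_t)) for threshold p_t."""
    return tp / n - (fp / n) * (p_t / (1 - p_t))

N = 5000          # records in the dataset
tp, fp = 15, 12   # hypothetical counts for one rule
for p_t in (0.1, 0.5, 0.6):
    print(p_t, round(net_benefit(tp, fp, N, p_t), 6))
```

The factor P_t/(1 − P_t) controls how heavily a false positive is penalized: it is 1/9 at P_t = 0.1, 1 at P_t = 0.5, and 1.5 at P_t = 0.6, which is why a rule's NB can turn negative at higher thresholds.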

On evaluating the rules' Net Benefit for different values of P_t, the following observations were made; they are depicted in Fig. 2.

Fig. 2 Consolidated plot across P_t values (0.1, 0.5, 0.6)

On cross-tabulating the rule count for P_t = 0.1–0.6, the NB values for all 87 rules can be seen in Table 3.

Table 3 Analysing NB for the rules

The consolidated plot in Fig. 2, which depicts the NB values for all 87 rules across thresholds (P_t = 0.1, 0.5, 0.6), shows that the maximum Net Benefit achieved across the threshold values of P_t (P_t = 0.1–0.6) decreases as the threshold value P_t increases from 0.1 to 0.6. In fact, at P_t = 0.6 some of the rules exhibit a negative NB.

P_t = 0.5 signifies that FP and TP are weighted equally, since the weighting factor P_t/(1 − P_t) equals 1. Maintaining P_t = 0.1 therefore assigns more weight to classification, i.e. true positives (TP), than to misclassification, i.e. false positives (FP).

The study selects P_t = 0.1, at which the maximum NB and the most distinct peaks are achieved. It is also observed that the NB value decreases as P_t moves from 0.1 to 0.6 and becomes negative for some rules at P_t = 0.6, a threshold that assigns more weight to misclassification than to classification.

For P_t = 0.1, the rule that acquires the highest Net Benefit is:

Base_Sub_Marks = 010 and Attendance = 001 and ContinuousEvaluationMarks = 101 => End_Term_Marks = 011

On decoding the rule, it can be stated as:

Base_Sub_Marks is between 41 and 50, Attendance is between 75.1 and 77%, and ContinuousEvaluationMarks is between 20 and 23 => End_Term_Marks is between 51 and 60.
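As an illustration, the decoded antecedent of this top rule can be checked against a single student record; the record below is hypothetical.

```python
def top_rule_fires(r):
    """Antecedent of the decoded top rule; when True, the rule predicts
    End_Term_Marks between 51 and 60."""
    return (41 <= r["Base_Sub_Marks"] <= 50
            and 75.1 <= r["Attendance"] <= 77.0
            and 20 <= r["ContinuousEvaluationMarks"] <= 23)

student = {"Base_Sub_Marks": 45, "Attendance": 76.0,
           "ContinuousEvaluationMarks": 22}
print(top_rule_fires(student))  # True
```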

The relevance index assigned to a knowledge rule is based on its Net Benefit (NB), taking into account the classification and misclassification done by the rule. The Net Benefit of the above rule at the threshold value P_t = 0.1 is 0.002689.

The reasons for using Net Benefit (NB) to assign a relevance index to inferred knowledge are:

  1. The prediction model incorporates consequences and hence can be used to infer a decision on the usage of the given model.

  2. It can be applied directly to the validation set and does not need any additional information.

  3. The evaluation method is applicable whether the model outcome is binary or continuous.

3 Conclusion

Rule induction can deduce the relationships existing between the various attributes, and the influence of the independent variables on the dependent variable can be observed. Rules with a higher relevance index are better suited to the system and can be used for appropriate syllabus planning, designing structured lesson plans, structuring criteria for the evaluation of students' performance and adopting a suitable teaching pedagogy to improve the overall academic performance of students. The knowledge derived in the form of rules bears relevance in the context of the domain and can hence be added to the knowledge set that supplements the process of decision making in a knowledge-based environment.