1 Introduction

An idea is the origin point of innovation and research. The key initiative towards realizing these ideas is to build and enhance cost-effective and efficacious methods for converting documented knowledge into a corresponding electronic format that can be processed by computers and distributed over the Internet. With the rapid expansion of internet users in recent years, the dissemination and exchange of information is increasingly done via digital mediums [38]. The categorization and recognition of mathematical expressions (MEs) have become a fascinating and stimulating study area of pattern recognition with endless real-world ramifications, because MEs represent an essential part of the engineering and scientific literature [1]. In addition, recognizing handwritten mathematical expressions (HMEs) is a difficult classification task that requires real-time identification of all the symbols in the input as well as the intricate 2D relationships that exist between subexpressions and symbols [119]. These subexpressions can be nested and contain Greek and Latin letters, special symbols, and characters. The task of HME recognition becomes even more onerous because it involves complicated sub-tasks such as structure analysis, symbol recognition, and interpretation of the context of MEs, all of which demand additional computational effort. Therefore, recognizing MEs is unquestionably a difficult and laborious task, particularly when attempting to recognize them from a handwritten source of information.

1.1 Review gaps and importance

  a. Although researchers have carried out plenty of work in the arena of recognizing MEs and symbols, there is a need to systematically collect, compile, and consolidate the recent works in this field. Other reviews [98] and surveys [221] present the works on HME well, yet none of them systematically reports the studies or judiciously covers all significant empirical instances of the literature available on MEs.

  b. Unlike traditional review methods, which present past works by producing a summarized form of several studies in the concerned area, the objective of this SLR is to provide as complete a list as possible of all studies related to the subject area, viewed from different research aspects.

  c. To the best of our knowledge, no systematic review focuses on the extensive identification and classification of techniques used for ME recognition. This is the first-ever SLR that aims to be as fair as possible by being auditable [79] and providing transparency to researchers about what is left unexplored and unmined in this area.

  d. The entire research methodology used for drafting this study has been presented in detail. Every step involved in the review process has been kept transparent to the readers; for instance, data collection, data selection, and answer extraction design have each been vividly depicted.

  e. The uniqueness of the study lies in the research methodology involved. Apart from strictly adhering to the SLR guidelines, the authors have endeavored to experiment creatively with inter-disciplinary concepts during data synthesis and answer design.

  f. The intended effect of the study is to establish a synthesis of the research questions through the use of grounded theory, a qualitative research methodology. This method has been deployed for the first time in conjunction with a systematic literature analysis.

  g. In addition, the sub-processes that make up the recognition process have been dissected in great detail.

A better approach to summarizing the studies and research performed over a period of time, in order to direct researchers and future aspirants interested in the topic, is a real need of the hour. So, this review is planned, conducted, and reported to broadly specify which techniques outperform the rest and where there is a genuine requirement for implementing a new method for recognizing MEs.

1.2 Research objectives

This paper conscientiously reviews all the studies published between January 2000 and June 2021. To perform the review, the authors have collated several techniques that belong to multiple computer science domains, such as computer vision, digital image processing, and artificial intelligence. This SLR endeavours to summarise, analyse, and methodically assess the empirical evidence regarding:

  a) Identification of the methods and techniques used in the recognition of HMEs.

  b) Extraction of the kinds of HMEs used in recognition studies.

  c) Listing of the datasets most frequently used in the research and for empirical analysis.

  d) Analysis of the accuracy measures and evaluation of the accuracy values of the several methods used.

  e) Focus on the pure or hybrid techniques implemented in the sub-processes, and analysis of the performance and capability of the applied techniques for recognizing HMEs.

  f) Comparison of the performance accuracy of contrasting ML techniques to establish which method outperforms the other techniques belonging to the same header.

  g) Analysis of the actualization and capability of the applied techniques for recognizing HMEs, i.e., ML versus non-ML or conventional statistical methods.

  h) Summary of the set of journals publishing the research on this stimulating research area.

1.3 Motivation

The several motivating factors for carrying out research in this domain are listed in Table 1.

Table 1 Motivation factors

Apart from the listed factors, the primary factor that motivated the writing of this review is the lack of a systematic survey that compiles and extracts all the pivotal attributes and determinants, such as recognition models, datasets, sub-processes, performance metrics, and other metadata analysis, while keeping the selection and synthesis process transparent at every stage. This facilitates better readership and provides crisper insights from the bulk of literature present on MEs.

1.4 Focus of the study

The focus of the study is on the following points:

  • To acquire a deeper understanding of the evolving domain of ME recognition.

  • To provide a reference framework for future research projects by identifying gaps in the domain of mathematical expression recognition.

  • To conclude and formulate facts from the metadata and evidence present in the literature.

Thus, the primary focus of this SLR is to provide a comprehensive and unbiased analysis based on the evidence captured from the literature and all the metadata-related information extracted in this process. The facts and findings of the study will direct future research in the domain of ME recognition.

1.5 Research questions

The research questions addressed by this study are tabulated in Table 2.

Table 2 Research Questions

1.6 Paper organization

The organization of the rest of this paper is as follows. Section 2 briefs about all the critical questions that a beginner needs answered to understand the domain of HMER. In Section 3, the entire research methodology is presented. Section 4 presents the statistical analysis of the HMER-related studies. Section 5 contains the results and discussions around the formulated research questions. The subsequent sections discuss a variety of results and highlight the facts extracted from the findings under the heading Summary and Findings, present the limitations, and state the conclusions drawn from this study. All references, bibliometric content, and appendices are provided at the end. To enhance the readability of the study, the entire roadmap of the paper organization is presented in Fig. 1.

Fig. 1
figure 1

Roadmap for Paper Organization

2 Background

The earliest research work on MEs dates back to 1965 [15], where Anderson worked on syntax-directed recognition of printed 2D MEs. With the advancement of technology, the interest of researchers shifted from printed MEs to handwritten mathematical symbols and expressions. The idea is to recognize HMEs and handwritten equations from scanned images or pen-based computing technologies such as electronic tablets [38], digital pens, and other gadgets. Before embarking on the review process, the authors have tried to briefly answer all possible prerequisite questions needed to understand the objective of the research topic: what HMEs are, their types, why recognition is necessary, the types of recognition, the stepwise recognition procedure, the challenges involved, the types of inputs to a mathematical expression recognition system, and the problems related to math recognition systems.

2.1 Defining handwritten mathematical expressions

The term “mathematical expressions” (MEs) refers to a finite set of symbols ordered in accordance with a rule or experimental study connected to some context, most commonly science and research. These MEs consist of symbols such as numerals, operators, constants, variables, functions, parentheses, special characters, and letters (Greek and Latin) arranged in a well-formed order in accordance with formal propositional norms. As a result, MEs make use of symbols, letters, and notations to deliberately depict various mathematical, scientific, and technological laws or formulas. In addition, an ME is not simply a collection of symbols arranged in a random fashion; rather, it possesses a well-organized structure that is subject to the rules of the system of mathematical notation [188]. These MEs, when considered in handwritten form, constitute the HMEs.
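To make the notion of a "well-formed order" concrete, the following minimal sketch (our own illustration, not drawn from any reviewed study) checks a toy linear grammar of operands, binary operators, and balanced parentheses; real recognition systems, of course, must handle far richer two-dimensional structure.

```python
# Minimal illustrative sketch: a toy well-formedness check showing that an ME is not
# an arbitrary symbol sequence but must follow formal notational rules.
import re

TOKEN = re.compile(r"\d+|[a-zA-Z]|[+\-*/^()=]")

def is_well_formed(expr: str) -> bool:
    """Accept simple linear MEs of the form operand (operator operand)*,
    with balanced parentheses; reject randomly ordered symbol strings."""
    compact = expr.replace(" ", "")
    tokens = TOKEN.findall(compact)
    if "".join(tokens) != compact:
        return False                      # unknown symbol present
    depth, expect_operand = 0, True
    for tok in tokens:
        if tok == "(":
            depth += 1
        elif tok == ")":
            depth -= 1
            if depth < 0 or expect_operand:
                return False
        elif tok in "+-*/^=":
            if expect_operand:
                return False              # two operators in a row
            expect_operand = True
        else:                             # number or variable
            if not expect_operand:
                return False              # two operands in a row (no implicit operators in this toy)
            expect_operand = False
    return depth == 0 and not expect_operand

print(is_well_formed("2*(x+1)=y"))        # True  -> well-formed
print(is_well_formed(")x++2("))           # False -> random symbol arrangement
```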

2.2 Types of handwritten mathematical expressions

When handwritten, MEs can be categorized as offline or online based on the input mode. According to [145], handwritten data must be converted to digital form through different modes, such as scanning or writing with special pens on an electronic surface, for example a digitizer combined with a liquid crystal display. In the former case, the writing is examined on paper and constitutes offline handwriting; writing produced by a finger or a digital pen on an electronic device such as a tablet comprises online handwriting. Accordingly, HMEs are divided into the sets of offline and online HMEs. The differences between online and offline HMEs are tabulated in Table 3.

Table 3 Comparative Analysis of types of Handwritten Mathematical Expression

2.3 Characteristics of mathematical expressions

MEs integrate writing letters with drawing a variety of signs and related symbols. The following are the traits of the MEs, according to [172]:

  • Two-Dimensional in nature: Although they frequently convey implicit meanings, the two-dimensional relationships between symbols are crucial. The two-dimensional relationships of mathematical expressions are examined using TeX's math environment commands and the current MathML standard; the main layout classes are listed below (a brief LaTeX sketch of these layouts is given after this list).

  • Inline (Example: 2x)

  • Subscript (Example: x₂)

  • Superscript (Example: x²)

  • Prescript (Example: yF)

  • Enclosed (Example: √x)

The entirety of the MEs is comprised of symbols that exhibit two-dimensional relationships. A fundamental understanding of mathematics is essential in identifying and comprehending two-dimensional connections.

  • Implicit Semantics: Certain symbols and letters possess inherent semantic meaning. The interpretation of expressions with implicit operators heavily relies on the identification of the symbols involved. Two examples are f(y-1) and x(y-1). In the f(y-1) scenario, it is generally accepted that f denotes a mathematical function, while (y-1) serves as the input or argument of the declared function. In the x(y-1) scenario, x is not being used as a function; rather, an implicit multiplication operator is assumed to exist between x and (y-1). Thus, the symbols used in equations carry different semantics in different contexts of use.

  • Arbitrary Associations: Considering that there are numerous potential two-dimensional links between symbols and expressions, only a few associations are permitted depending on the nature of the symbols and the expressions themselves. Mathematicians who are well-versed in the subject are aware of the proper associations and ultimately put an interpretation on the terms. Whether implicit or not, operators have the ability to link symbols and subexpressions. Examples of the several types of operators are linear prefix (−x), infix (x + y), postfix (x!), bounding ([,]), vertical (x over y, as in a fraction), implicit (x², 2x), tabular (matrices), and enclosing (√x). Operators can be further divided into unary (x!), binary (x²), and N-ary (x + y + z) operators, since some operators accept only a particular number of arguments. The rules set a limit on the number of arguments and the positions of those arguments. Prescripts, for instance, are so uncommon in general that only certain symbols might be associated with them.

  • Conventional Dependency: Conventions regulate how mathematical symbols are used correctly. The conventions to be followed depend on the mathematical specialty and the text's point of origin, and the conventions of several branches of mathematics may be followed. As a result, different fields may have different ways of expressing the same mathematical concept. For instance, the imaginary number √−1 is denoted by the letter "i" in calculus texts and "j" in engineering. Different countries may also adopt different conventions. These conventions and the 2D relationships between symbols are the primary cause of the inherent ambiguities in math expressions.

  • Variant Scales: The different scales of mathematical symbols refer to the sizes of the symbols and special operators employed in mathematical notation. The varying scales of handwritten math symbols used in the creation of HMEs are one of the fundamental qualities that contribute to ambiguity and issues in expression recognition. The scale of a symbol alters its semantics and ultimately differentiates the meaning of the expression.
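To make these layout relationships concrete, the brief LaTeX fragment below (our own illustrative sketch, not taken from the reviewed studies) encodes the layout classes mentioned above; MathML provides analogous structural markup.

```latex
% Illustrative LaTeX encodings of the layout classes discussed above (requires amsmath)
\documentclass{article}
\usepackage{amsmath}
\begin{document}
Inline:      $2x$                      \par
Subscript:   $x_{2}$                   \par
Superscript: $x^{2}$                   \par
Prescript:   ${}_{y}F$                 \par
Enclosed:    $\sqrt{x}$                \par
Vertical:    $\frac{x}{y}$             \par
Tabular:     $\begin{pmatrix} a & b \\ c & d \end{pmatrix}$
\end{document}
```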

2.4 Challenges caused by the inherent properties of ME

Several challenges are encountered while recognizing mathematical symbols and expressions. These challenges are numerous enough that a substantial list of them can be maintained, and the reason for such an ample number of challenges is the inherent properties of the symbols and expressions. HME recognition is also challenging due to the various writing styles and ME formats [198]. The authors have tried to list the challenges belonging to the recognition process of HMEs, along with the properties that cause these challenges. One of the primary reasons for these challenges is the two-dimensional structure and the spatial relationships among the symbols used in MEs. Table 4 adds clarity about the challenges residing in recognizing handwritten mathematical symbols and expressions due to their inherent properties, which are majorly responsible for ambiguity during recognition.

Table 4 List of challenges caused due to properties of HME

2.4.1 Other challenges and CROHME

One of the particular challenges involved in the recognition of HMEs is the lack of sufficient training and testing data, especially for academic recognition systems [111]. Due to the lack of a sizeable common dataset of online or offline HMEs usable in the recognition process, there was essentially no central benchmark available for comparing the research works of different researchers. This non-availability of an open dataset of HMEs forced researchers to develop their own datasets of MEs, consisting either of images of expressions handwritten by several writers or volunteers, or sometimes of a corpus of expressions recovered from the prior work of Raman [152] and from the mathematical expression base of Aster. These collected datasets had a limitation, as they tended to cover only a subset of math expressions or expressions limited to a specific domain of mathematics. Thus, it was impossible to compare the performance accuracy of different systems, as no standard evaluation measure was available until the CROHME series of competitions was first conducted in 2011, a remarkable milestone in the research history of handwritten mathematical symbols and expressions.

Moreover, CROHME is the dataset used by the majority of researchers in the field of HME recognition. The studies of different researchers show that significantly less work has been done in math recognition with standard encodings, benchmark datasets, or evaluation tools [131]. To make work on handwritten math recognition easier to start, the CROHME competition was organized in 2011, allowing systems to be meaningfully compared using the common publicly available dataset provided by the competition [128].

3 Review methodology

This study comprises the planning, execution, and description of the result analysis for the research area, i.e., the recognition of HMEs. According to the guidelines of [96], the authors designed the planning phase, in which they apply a review protocol comprising seven stages: (1) defining and framing research questions, (2) planning the search methodology, (3) the search process and criteria, (4) selection of relevant studies, (5) data collection and extraction, (6) data analysis, and (7) evaluation of results and conclusions.

3.1 Review protocol and criteria

In an SLR, critical importance is given to the review protocol [196]. The protocol has been developed according to the guidelines set out by [96] so as to reduce researcher bias and ensure rigor in the SLR. After considering the principles, philosophy, and measures of an SLR, a comprehensive review protocol has been developed. It mainly focuses on the review background, research questions, search strategy, data extraction, quality assessment criteria for the research studies, and data analysis [126]. The review protocol plays an important role in differentiating an SLR from a traditional or narrative literature review. It enhances evaluation consistency and reduces researcher bias, since the researchers have to present the search strategy and the criteria for the inclusion or exclusion of any study in the review [96, 126]. The connected series of steps performed in this process is illustrated in Fig. 2.

Fig. 2
figure 2

Systematic review methodology

3.2 Research questions

According to our review protocol, in the first phase the authors set up a few research questions related to the objectives of our study. These research questions are fundamental tools for digging out valuable information from the literature already available, so their definition and framing must be done very carefully and critically. The direction of this review is entirely based on the framework designed through these research questions. While formulating the research questions, our goal is to assess the empirical evidence resulting from various studies on recognizing MEs using multiple techniques and methods. The authors have selected the research questions to cover all aspects investigated by [39] in their previous survey, to investigate further issues, and to perform a better meta-analysis on the chosen studies.

To our knowledge, no SLR has been performed in the area of ME recognition to date that could give future researchers a perspicuous idea of the complete analysis of this challenging research area. The surveys performed by Tapia [178] and Zhang [213], and even the recent survey by [222], focused only on online HMEs. Another recent short review by [98] compiles the studies broadly but is not purely evidential of a systematic analysis. At the same time, the authors could find no complete review after 2000 [38] that targeted a systematic research analysis of this recognition process. That is why the time range selected for this review is from 2000 to the present date.

3.2.1 Answer extraction design

Figure 3 outlines our search strategy and selection procedure. It focuses mainly on the research design for extracting data from specific research questions in order to answer the other research questions of this study. According to the mode and kind of extraction used, the authors divided the extraction criterion into three categories: direct extraction, indirect interpretation, and synthesis-based indirect interpretation.

  • In direct extraction, the authors retrieved the raw data from the study and used the processed information to answer our questions. This criterion is applied to answer the basic research questions, whose answers could be extracted directly from the chosen study, thus justifying the name of this criterion.

  • Indirect interpretation works by using the information retrieved for previous questions. It makes use of this information, draws an interpretation, and answers the formulated research questions. This criterion is so named because we try to make the pre-fetched information more meaningful and valuable in our study: it uses the answers to one research question to formulate and answer other research questions.

  • Synthesis-based indirect interpretation is an extended version of indirect interpretation, with a slight difference: it uses the answers or information extracted from two or more questions to answer another research question of this review study.

Fig. 3
figure 3

Research design for extracting answers to research questions. 1 denotes direct extraction, 2 indicates indirect interpretation, 3 denotes synthesis-based indirect interpretation.

Correlating the criteria with our research questions, we can thoroughly read a study and directly extract the answer to RQ1 (identifying the ML/non-ML technique); this is an instance of direct extraction. Similarly, the extracted information of RQ1 can be used to answer RQ5 and RQ6 (concentrating on the approach used in the processes and sub-processes involved). The answers to these questions come from interpreting the response to RQ1, where the technique used in the recognition process is analyzed. The answers to RQ7 and RQ8 can be fetched by synthesis-based indirect interpretation, where the results of RQ1 and RQ4 are combined to reach conclusions for both of these RQs. Here, we aim solely to identify the best possible technique that yields sufficient accuracy; so the method and the corresponding accuracy values can answer RQ7 and RQ8 (whether ML outperforms other methods or not).
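The mapping just described can be summarized as a small lookup structure. The sketch below is purely illustrative (the RQ labels follow the text; the structure and helper are our own):

```python
# Illustrative summary of the answer-extraction design described above.
# Keys are extraction modes; values map a research question to the sources feeding it.
EXTRACTION_DESIGN = {
    "direct_extraction":              {"RQ1": ["full text of the study"]},
    "indirect_interpretation":        {"RQ5": ["RQ1"], "RQ6": ["RQ1"]},
    "synthesis_based_interpretation": {"RQ7": ["RQ1", "RQ4"], "RQ8": ["RQ1", "RQ4"]},
}

def sources_for(rq):
    """Return the earlier answers (or raw study text) that feed a given research question."""
    for mapping in EXTRACTION_DESIGN.values():
        if rq in mapping:
            return mapping[rq]
    raise KeyError(f"unknown research question: {rq}")

print(sources_for("RQ7"))   # ['RQ1', 'RQ4']
```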

3.3 Search design and strategy

The design of the search strategy encompasses the search terms, the sources of literature, and the search process. For the search terms, the authors extracted the significant keywords and key terms essential for formulating the search strings. The core idea behind the search string is to extract all candidate papers of interest in one go. After considering the research questions, in the second phase of the SLR a search string is designed to fetch all the relevant literature that is good enough to answer the research questions.

To search for the papers, this step of defining the search terms and constructing a search string is crucial. The creation of a search string to filter the relevant literature of interest works iteratively and provides an unbiased strategy for searching for appropriate studies. The second step is selecting digital libraries and using the data retrieval settings of these libraries to extract the papers. In the third step, inclusion and exclusion criteria are defined and applied to the search results [32].

3.3.1 Search terms

The following steps are used to construct the search terms [29]: (a) extract key terms by analyzing the research questions; (b) identify synonyms and alternative spellings of the acknowledged key terms; (c) look up keywords in the research literature; (d) use Boolean OR in the search string to incorporate synonyms and alternative spellings; (e) use Boolean AND to integrate the essential terms. The Boolean operators used in the search string thus each have a specific role in the searching process. The primary key terms from the questions are: handwritten, MEs, techniques, ML, recognition, process, online, offline, and non-ML. By identifying synonyms and alternative spellings, the authors added further key search words to the list, i.e., 'mathematical expressions' and 'prediction' (used in the sense of recognition). Hence, after mustering the essential words in the pool of necessary keywords, the authors formulated the search string using Boolean OR and AND. The final resultant search string after this procedure is: (handwritten) AND ("mathematical expression" OR "mathematical expressions") AND (classification OR prediction OR recognition).
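As a simple illustration of steps (d) and (e), the following sketch (our own helper, not part of any reviewed tool) assembles the final search string above from the term groups by OR-ing synonyms within a group and AND-ing the groups together:

```python
# Minimal sketch: build the Boolean search string from grouped key terms.
def build_search_string(term_groups):
    """Each inner list is OR-ed; the groups are then AND-ed together."""
    def or_group(terms):
        quoted = [f'"{t}"' if " " in t else t for t in terms]
        return "(" + " OR ".join(quoted) + ")"
    return " AND ".join(or_group(g) for g in term_groups)

groups = [
    ["handwritten"],
    ["mathematical expression", "mathematical expressions"],
    ["classification", "prediction", "recognition"],
]
print(build_search_string(groups))
# (handwritten) AND ("mathematical expression" OR "mathematical expressions")
#   AND (classification OR prediction OR recognition)
```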

3.3.2 Data retrieval and literature sources

To perform an automatic search, data retrieval has been carried out on the following digital databases: 1) IEEE Xplore, 2) ACM Digital Library, 3) ScienceDirect, 4) Springer, 5) Wiley Online Library, and 6) Scopus. The entire data retrieval process consisted of executing the appropriate search string on each of the mentioned digital libraries and properly retrieving the relevant studies and papers. Though an adequate amount of data was retrieved from the above digital sources, the authors did not include other literature such as magazines, books, and general articles in the review, because the content in these sources is not peer-reviewed and thus its quality cannot be reliably corroborated.

After the proper execution of the automatic search for data retrieval, a manual search, our secondary search phase, was performed to ensure that nothing worthwhile was missed. To fulfill this aim, the authors performed the manual search by forward and backward referencing, iteratively following references from the retrieved studies to extract more relevant works from the past. This iterative process applied in the secondary search phase is called snowballing. After snowballing, the extracted studies were added to a Mendeley library, which further helped in making suitable references in this study. The studies focused on range from 2000 to 2021.

3.4 Screening of papers and the process of filtration of studies

When the constructed search string is run on the digital libraries, it fetches a different number of studies from each database.

In screening the papers, appropriate studies are selected, and this selection is based on well-defined criteria and research themes. The well-defined selection criteria include four stages, as per the headings of Table 5. The first filter applied here is year-wise filtration, with the constraint of considering only studies from the year 2000 onwards. The second filter focuses on the removal of duplicate studies. The chance of redundant papers arose owing to the fact that articles were extracted from the major digital databases as well as from Scopus, which indexes almost all articles from IEEE, ACM, Springer, and Wiley. The reason Scopus was searched in addition is that the authors did not want to miss any quality publication published outside the mentioned digital databases. The criteria used for removing duplicates are "Exact Match" (where the titles of all the gathered studies were compared and any exact duplicates were removed) and "Cross Checking" (where the authors, publication dates, and other bibliographic information were cross-verified to identify potential duplicates from different sources or variations in title wording). The third filter is removal after reading the title: the search string targets studies containing the mentioned keywords, so some retrieved studies match the keywords without being relevant.

Table 5 Count of studies at different stages of the refining process

One such example is the study titled “Strategy and Tools for Collecting and Annotating Handwritten Descriptive Answers for Developing Automatic and Semi-Automatic Marking - An Initial Effort to Math”.

This study is extracted because it contains the keywords 'handwritten' and 'math'. The paper was published in the 2019 International Conference on Document Analysis and Recognition Workshops, so the study was also extracted because of the keyword 'recognition'. On reading the title itself, however, the study is realized not to be relevant to the research topic and is hence removed by the title filter. The criteria for title-based removal thus include "Relevance" (where titles that appeared to be directly related to the topic or research question were considered) and "Focus" (where titles that indicated a clear focus on the specific aspects or associated research variables were retained). This helped to narrow the studies down to those most aligned with the research objectives. Similarly, in the case of the fourth filter, a study is removed after analysing the relevance of its abstract. The criteria for abstract-based removal include "Study Objective" (where abstracts that clearly outlined the objective or purpose of the research, with a direct mapping to our research concerns, were considered), "Methodology" (where abstracts that briefly described the methodology employed in the study, ensuring that it aligned with the research interest of this study, were prioritized), and "Findings" (where abstracts that summarized the main findings or results obtained from the study were analysed to assess the potential relevance and contribution of the research to this study).
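A compact sketch of these four filters is given below; the record fields and topic terms are placeholders of our own, intended only to illustrate the order in which the filters described above are applied:

```python
# Illustrative screening pipeline (our own sketch): year filter, duplicate removal by
# exact title match and bibliographic cross-checking, then a coarse keyword-based
# relevance check on the title and abstract.
def screen(records, year_from=2000, topic_terms=("handwritten", "mathematical expression")):
    seen_titles, seen_biblio, selected = set(), set(), []
    for rec in records:  # rec: dict with "title", "year", "authors", "abstract"
        if rec["year"] < year_from:
            continue                                   # filter 1: publication year
        title_key = rec["title"].strip().lower()
        biblio_key = (tuple(sorted(a.lower() for a in rec["authors"])), rec["year"])
        if title_key in seen_titles or biblio_key in seen_biblio:
            continue                                   # filter 2: exact-match / cross-check duplicates
        seen_titles.add(title_key)
        seen_biblio.add(biblio_key)                    # coarse heuristic; borderline cases are checked by hand
        text = (rec["title"] + " " + rec["abstract"]).lower()
        if not all(term in text for term in topic_terms):
            continue                                   # filters 3-4: title/abstract relevance
        selected.append(rec)
    return selected
```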

On analysing the contribution of each digital library to the final count of selected studies, the authors noticed that most of the quality papers come from two of the digital databases involved, i.e., IEEE Xplore and Scopus, with Scopus ranking highest in terms of its contribution to the final set of selected papers. Figure 4 below shows the contribution of each digital library to the definitive collection of selected studies.

Fig. 4
figure 4

Analysis of the contribution of each digital library to the selected database. Relevance ratio = (number of selected studies) / (total number of retrieved studies) = NS : NR

This analysis of the selected studies based on the relevance ratio and relevance percentage allows us to identify which sources retrieved the most relevant results when the search string was executed. The estimated relevance percentages show that the IEEE Xplore and Scopus digital databases have higher relevance percentages than the others. The conclusion drawn is that Scopus had the highest relevance among all the digital databases compared, whereas Wiley had the least relevance, as per the calculated values given in Table 6.
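The calculation behind Table 6 can be sketched as follows; the per-library counts used here are placeholders, not the actual values reported in the table:

```python
# Sketch of the relevance measure quoted in Fig. 4; the (NS, NR) pairs are hypothetical.
def relevance(selected, retrieved):
    ratio = selected / retrieved                 # RELEVANCE RATIO = NS : NR
    return ratio, 100 * ratio                    # ratio and relevance percentage

libraries = {"IEEE Xplore": (40, 120), "Scopus": (45, 110), "Wiley": (2, 60)}
for name, (ns, nr) in libraries.items():
    ratio, pct = relevance(ns, nr)
    print(f"{name}: {ns}:{nr} -> ratio {ratio:.2f}, relevance {pct:.1f}%")
```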

Table 6 Representation of Relevance Ratio and Relevance Percentage of each study

3.5 Classification of HMER related studies

This section presents the review findings and conclusions, consisting of the results obtained by analyzing the selected studies for each formulated research question. The authors have discussed the conclusions and answered the research questions in separate subsections by considering sufficient empirical evidence collected from the data. In the discussion, the findings are placed in a broader perspective that is nonetheless closely tied to the study issues.

At the end of this entire screening process, 28.94% of the total initial papers are identified, which, when added to the studies selected from the manual search, results in 202 articles. These 202 studies are collected and their abstracts are read, allowing the authors to classify the entire assembled cluster and allocate the studies to empirical categories according to the research approach. The nominated headers of the classification are presented in Table 7 as (a) generic, (b) technique based, (c) application-specific, (d) sub-process concentrated, (e) survey-based study, (f) particular script oriented, (g) CROHME winning studies, and (h) other evaluation and problem-addressing papers. The studies of interest mainly include articles selected from categories (a), (b), and (g). The count of studies under each classified category is vividly depicted in Fig. 5.

Table 7 Defining the nominated headers of classification of studies
Fig. 5
figure 5

Classification of papers based on the defined meta-data categories

3.5.1 Inclusion and exclusion criteria: (Primary Selection Phase)

Inclusion-exclusion principles are the significant filters for selecting potential studies from the candidate studies retrieved after the screening process. Since many of the candidate papers lack support for addressing the research questions raised by the current study, further filtration is needed to boost the relevance of the studies collected for our review, and that is precisely what the selection process aims to do. The inclusion-exclusion parameters used in the primary selection phase are tabulated in Table 8.

Table 8 Inclusion Criteria (Primary Selection Phase)

These inclusion/exclusion criteria parameters are selected after numerous meetings between the authors, finalized with mutual consent.

3.5.2 Quality assessment criterion: (Secondary Selection Phase)

This is a pure quality-check measure on the selected studies, aiming to satisfy the defined quality standard for the chosen studies. Some quality assessment questions are formulated to weigh the candidate studies so that the final selection of studies can be made, as mentioned in Appendix 2 Table 23. Note that studies with low quality, i.e., those whose quality scores fall below the satisfaction threshold, are excluded from the cluster of selected studies. A fuzzy linguistics idea is adopted from an SLR study by [4]: rather than assigning scores on a binary scale of 0 and 1, fuzzy linguistic variables are used to measure the score (relevance) of each study with respect to the aforementioned assessment questions more appropriately.

The score chart for assessing the quality of the studies is shown in Table 9. It depicts the score allocation to the studies under the quality assessment criteria. In particular, the authors have organized the scores for checking and validating the relevance of each study against the question list formulated for this review. A rating of 0 indicates that the study has no significance with respect to the objectives of this review. Scores between 0.1 and 0.9 scale from 'rarely' (less relevant), through 'partially' (mediocre relevance), to 'mainly' (highly significant). The highest score of 1 indicates that the study answers all the quality-check queries designed for assessing the quality of the research. Since eight questions have been formulated for assessing quality, the summed score for a single study is shown in Table 10. Table 10 was shaped after numerous rounds of discussion and attentive observation of previous studies and works connected to quality evaluation criteria; both authors agreed on basing the assessment score exclusively on the fuzzy linguistic variables.

Table 9 Score allocation according to relevance
Table 10 Total score for each study with relevance scale

The whole concept is concentrated on assessing the quality of the different selected studies. The studies with extremely low scores (mainly with relevance 'No' and 'Rarely') are excluded in this secondary selection phase of the review methodology; the apparent reason for this exclusion is their low scores against the quality assessment questions. For instance, if a study receives scores of 0.2 for Q1, 0.4 for Q2, 0.2 for Q3, 0.3 for Q4, 0.6 for Q5, 0.4 for Q6, 0.1 for Q7, and 0.1 for Q8, the total score is 2.3, which lies in the range of 'rarely' (refer to Table 10). This implies that the study does not pass the quality assessment procedure, so it cannot be included in the selected studies.

On the contrary, if a study satisfies the defined quality criteria, scores more than 3.5, and reaches the scale interval named 'Partly' according to the relevance measure, the candidate study is confirmed to join the selected list of studies for consideration and review. After this extensive quality assessment procedure, 98 studies were chosen, and the rest were rejected as per the selection protocol. It should be noted that the studies rated with the relevance measures 'Partly' and 'Yes' are selected for further review, as these scores indicate that the studies are of high relevance and quality.
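The decision rule can be sketched as below; the per-question scores are hypothetical and only illustrate the 3.5 cut-off described above (the actual score bands are given in Table 10):

```python
# Minimal sketch of the secondary (quality-assessment) decision: eight per-question
# fuzzy scores in [0.0, 1.0] are summed and a study is retained only if the total
# exceeds the 3.5 cut-off. The example scores below are hypothetical.
def passes_quality_check(scores, cutoff=3.5):
    total = round(sum(scores.values()), 1)
    return total, total > cutoff

example = {f"Q{i}": s for i, s in enumerate([0.3, 0.4, 0.2, 0.5, 0.6, 0.4, 0.2, 0.1], start=1)}
print(passes_quality_check(example))   # (2.7, False) -> excluded in the secondary phase
```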

While analyzing recent studies, one matter of concern is that the most recent studies naturally carry fewer citations, even though there is a decent possibility that such research is relevant according to the other assessment questions. Considering this exception, publications from the years 2017-2021 are exempted from Q8, i.e., the citation-count check. In such cases, a study is evaluated on the other relevance measures and on the h-index and impact factor of the publishing journal. Appendix 1 Table 22 shows the studies grouped according to the quality assessment labels.

The quality assessment criterion is applied to 198 of the 209 studies selected from the previous selection phase; eleven of the considered studies turned out to be redundant when the entire database of chosen studies was compared. Each of these 198 studies is evaluated against the designed quality assessment questions to decide whether it proceeds to the full review. The qualifying range in the quality assessment lies between 3.6 and 8.0; any study scoring less than 3.5 is not considered in the final database of selected studies. In this way, the potential studies are chosen for the complete review by applying the quality assessment criteria. Additionally, the authors analyzed the count of studies for each relevance measure and segregated the studies according to the citation rate observed during the selection process.

Approximately 49% of the studies are selected, constituting 98 studies out of 202. Overall, 97% of the studies had at least one citation (note that studies from 2017 to 2021 are not considered in the citation check). As per the analysis, 32.32% of the studies (64) are rejected outright ('No'), 19.6% (39 studies) are rarely relevant, and 14.64% are of average relevance. Almost 33.33% of the studies (66) are highly relevant, with the relevance measure 'Yes'. In this way, the entire criterion of the secondary phase is accomplished while leveraging the studies as per their relevance.

3.6 Data collection and extraction

Extracting quality information, in sufficient quantity, from the selected database is one of the prime objectives of this systematic study. After compiling the studies chosen in the well-defined selection phase, the authors decided to extract metadata from the database of the final selected studies of this review. Table 11 shows the framework of the review analysis of the chosen studies. The authors have tried to classify the investigations on several grounds so as not to miss any critical parameter or vital perspective of the review that could help in understanding the present status of the research literature produced to date.

Table 11 Framework of review analysis (investigating each study against the formulated research questions)

3.7 Data analysis and synthesis

In this section, data analysis and synthesis are performed. The principal goal of this phase is to identify, select, aggregate, and analyze the pieces of evidence collected from the chosen papers in order to answer the formulated research questions. A single piece of evidence might have little evidential force, but the aggregation of many pieces can make a point stronger [139]. Thus, evidence collection and synthesis become a truly indispensable part, as they help draw the conclusions that are the tangible outcome of a good review paper. Table 12 shows the review analysis framework.

Table 12 The framework of Review Analysis

To extract answers to the research questions, the selected studies are analyzed and synthesized quantitatively (e.g., the estimated accuracy of different recognition techniques) and qualitatively (e.g., differentiating ML and non-ML methods, recording their accuracy metrics, and evaluating their strengths and weaknesses). The authors employed various strategies to synthesize the extracted data associated with the different kinds of research questions.

The different synthesis strategies conforming to the research questions are depicted in Fig. 6.

Fig. 6
figure 6

Different synthesis strategies

The narrative synthesis method is applied to the data pertaining to RQ1, RQ2, RQ3, and RQ4a. That is, the data are tabulated in a regular and organized way consistent with the formulated research questions. To represent the extracted information, visualization tools including bar charts, pie charts, and other graphs are used. These graphical representations enrich the presentation of the distribution of ML techniques, the work on offline/online HMEs, the frequently used datasets, and their estimation accuracy data. The survey studies conducted to date are also analyzed and synthesized using this strategy.

For the data pertinent to RQ2, RQ3, RQ4a, and RQ9, which focus on comparing the different techniques, their estimation accuracy, the datasets used, and the kinds of expressions used, the vote-counting method is applied. Suppose we want to synthesize comparisons of the types of math expressions primarily used by researchers for recognition purposes: we can compare the number of studies using offline expressions with the number using online ones, and thereby obtain a brief idea of which kind of math glyphs is used more frequently. The same voting synthesis can be used to estimate which accuracy metrics are more regularly used by different techniques, to compare them, and to analyze and compare the ML models used for the recognition task.
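A minimal sketch of this vote-counting synthesis is shown below; the study records are placeholders, not the actual extracted data:

```python
# Sketch of vote-counting: each selected study "votes" for the expression type,
# dataset, or technique it uses, and the votes are tallied.
from collections import Counter

studies = [
    {"id": "S1", "expression_type": "online",  "technique": "SVM"},
    {"id": "S2", "expression_type": "offline", "technique": "CNN"},
    {"id": "S3", "expression_type": "online",  "technique": "CNN"},
]

votes_by_type = Counter(s["expression_type"] for s in studies)
votes_by_technique = Counter(s["technique"] for s in studies)
print(votes_by_type.most_common())        # e.g. [('online', 2), ('offline', 1)]
print(votes_by_technique.most_common())
```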

For the data pertaining to RQ5 and RQ6, the concept of grounded theory (GT) is used; this is one of the systematic methodologies of the social sciences, involving the construction of theories through the methodical gathering and analysis of data. We have tried to use it in an engineering-science review to analyze the qualitative data on the different approaches used to recognize math expressions. The basic idea of the GT approach is to read, revise, and review the textual data (such as the evidence on the approaches used by different techniques) and to label the variables (called groups, concepts, and properties) and their interrelationships. Grounded theory has three phases, explained and illustrated in Fig. 7.

Fig. 7
figure 7

Conclusion construction of SLR using Grounded Theory

3.8 Synthesis with grounded theory

Open coding

The essential part of the data analysis phase is open coding. It uses techniques such as identifying, categorizing, naming, and describing the approaches found in the text; that is, from the initial data collection, the researchers categorize the information about the incidents [30, 48, 50]. The authors have applied open coding to classify the different methods and approaches used in the recognition process and its sub-processes, so as to consolidate all the techniques used. Overall, 98 open codes were found in this SLR, related to the several approaches used for the task of recognizing HMEs.

Axial coding

Axial coding is the procedure of developing inter-relationships between the open codes (properties and groups) via a combination of deductive and inductive thinking. It involves collecting the open codes together, with similar ones confined to distinct axial coding groups. This mechanism is not just time-saving but also lessens the overhead of searching for and establishing every relation in the set [30, 48, 50, 73]. The reviewers categorized all the open codes into eight principal axial codes, also known as concepts, and then built the interconnections among the collected open codes; in axial coding, the inter-group relationships between the chosen open codes are defined. The solid-line arrows in Fig. 8 depict direct connections between the categorized open codes, whereas the broken-line arrows differentiate indirect relationships from direct ones, and the double-headed arrows represent attributes identified through open coding that can be derived from each other.

Fig. 8
figure 8

Implementing phases of Grounded theory on the attributes of the study

Selective coding

Selective coding is the process of selecting one core category and then relating all the other groups to that core category [30, 48]. The initial idea is to develop a single core category around which everything else is organized. The authors segregated the appropriate axial code chains from the prefabricated axial code chains to synthesize the research questions. Selective coding thus refines the synthesis procedure and assists in the consistent framing of relevant code chains, which helps in qualitatively analyzing the research questions and extracting satisfactory answers. Figure 8 illustrates the refining process involved in selective coding.
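For readers unfamiliar with the three phases, the toy structure below sketches how open codes can be grouped under axial codes and how selective coding then follows the chain related to one core category; all code labels here are hypothetical and are not the actual codes extracted in this review:

```python
# Hypothetical illustration of the three GT phases described above; the labels are
# not the actual codes from this review.
OPEN_TO_AXIAL = {                       # axial code (concept) -> open codes grouped under it
    "symbol segmentation":   ["stroke grouping", "connected components"],
    "symbol classification": ["SVM classifier", "CNN classifier"],
    "structural analysis":   ["2D grammar parsing", "spatial relation trees"],
}

AXIAL_RELATIONS = {                     # directed links produced during axial coding
    "structural analysis":   ["symbol classification"],
    "symbol classification": ["symbol segmentation"],
}

def selective_coding(core_category, axial_relations):
    """Return the chain of axial codes reachable from the chosen core category."""
    chain, stack = [], [core_category]
    while stack:
        node = stack.pop()
        if node not in chain:
            chain.append(node)
            stack.extend(axial_relations.get(node, []))
    return chain

print(selective_coding("structural analysis", AXIAL_RELATIONS))
# ['structural analysis', 'symbol classification', 'symbol segmentation']
```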

3.9 Threats to validity

3.9.1 Limitations of search string

Though sincere efforts have been made to formulate the search string, its creation is subject to certain restrictions owing to the number of keywords allowed in a search string. Most digital libraries do not support lengthy search strings containing too many keywords; in other words, there is a limitation on the number of keywords that can be used. For example, a string like [(offline OR handwritten) AND ("mathematical expression" OR "mathematical expressions") AND (regression OR "machine learning" OR classification OR "Bayesian network" OR "neural network" OR "decision tree" OR "support vector machine" OR "genetic algorithms" OR "random forest" OR "deep learning")] would not execute well.

3.9.2 Selection bias

The search string was created and used by the authors to select the studies for this SLR, and they tried their best to choose entirely appropriate phrases for it. It must be noted that the keywords for the search string were picked according to the research questions initially formulated. It is therefore possible that the authors missed some relevant studies, since some studies may have omitted the chosen primary keywords from the sections that are searched, namely the title, abstract, and keywords. Though rigorous efforts were made to avoid such a setback, including the decision to add manual search criteria and to refer to the bibliographies of the various studies in order to pick up further research of interest, there remains a possibility that some substantial studies were missed, which could be interpreted as posing a risk to validity.

4 Statistical analysis of the selected HMER related studies

In this section, the statistical results of the selected studies will be presented concerning their publication type, publication year, geographical distribution over years, authors, and keyword count status.

4.1 Extracted metadata fields from selected studies

The list of the extracted fields/ heads for metadata analysis of the selected studies is mentioned in Table 13.

Table 13 Metadata themes with description

4.2 Publication type overview

The authors have tried to identify the sources that contributed most significantly to the collection of studies. The purpose of conducting this analysis is to identify the foundation of the pertinent studies, allowing these high-ranking sources to be shortlisted and preferred as secondary sources of literature for the subject matter of this review article and any further implementations. Additionally, this analysis gives future researchers a priority order for identifying reference sources and an outstanding supply of literature, functioning as a secondary database for comprehending this research subject. Figure 9 shows the publications with respect to publication type. It is observed that the majority of the selected studies belong to journals and conferences, about 77 of the 95 selected studies in total. The other studies include papers from workshop proceedings, a well-framed thesis, and organized symposiums.

Fig. 9
figure 9

Distribution of selected studies according to publication type

4.3 Temporal view of research over the years

The count of selected studies is analyzed on a yearly scale, depicting the number of publications produced per year. It is noticed that the average count of papers published per year before 2011 is about 3 (an average of three publications per year), while in the latter half of the period under review, i.e., from 2011 to 2019 (excluding the few studies taken from 2020), it is about 6 (an average of six publications per year), as shown in Fig. 10. This estimated average signifies the role of CROHME, a well-established annual competition conducted to accelerate the growth of research in this challenging domain, in doubling the number of useful publications per year. After the start of the competition, interest in handwritten mathematical symbol and expression recognition has escalated, with on average double the number of studies produced per year after 2011. This is not surprising, since the concept of handwritten mathematical symbol and expression recognition is attracting more researchers' attention because of advances in ML, deep learning, and computer vision. Moreover, given the need to input MEs directly into systems, research on high-accuracy recognition processes and systems is expected to rise exponentially, opening new ways and methods to recognize and retrieve mathematical expressions.

Fig. 10
figure 10

Distribution of selected studies according to the year of publication

4.4 Geographical distribution of research studies over the years

The count of papers is also analyzed from the perspective of their geographical distribution and year of publication. It is observed from the figure that the maximum number of research articles comes from China and Japan, with the former having 17 papers and the latter 16. The contribution of different countries in different years can be visualized in Fig. 11.

Fig. 11
figure 11

Geographical distribution of research studies per year

4.5 Distribution of publications by authors

Figure 12 below shows the distribution of the authors' total research publication counts. The authors highlighted in this figure have made significant contributions to the field of HME recognition over the course of several years by publishing high-quality articles. Researchers who place highly on this graph of study distributions likely have a keen interest in, and considerable competence with, the research topic of recognizing mathematical expressions. In addition to the metadata gathered, the analysis presented here is likely to be of great assistance to the domain experts who have dedicated themselves to this line of inquiry and developed the relevant recognition algorithms.

Fig. 12
figure 12

Distribution of publication according to the authors of the publications

On analyzing Fig. 12, it can be noticed that the author Anh Duc Le has contributed the most to the collection of selected studies, followed by the authors H. Mouchere, M. Nakagawa, Viard-Gaudin, and Richard Zanibbi. These are the authors leading the research on recognizing handwritten mathematical symbols and expressions.

4.6 Frequency of keywords

The authors also tried to analyze the keyword counts in the retrieved publications, because these keywords are responsible for the high relevance ratio of the studies with respect to the search string. It has been observed that 'mathematical expression recognition' is the most common keyword found in the selected studies; this keyword has the greatest relevance to the topic of the review objective.

Other specific keywords like 'mathematical expression', 'online handwritten mathematical expression', 'handwritten mathematical expression', and 'online recognition' are among the top entries when tabulating the keyword counts in the studies retained after the entire selection procedure. These frequently used keywords help contextualize the keywords of our own review study, as our study revolves around the selected papers that carry the above-mentioned keywords in their abstract and keywords sections. The graphical analysis of the keywords against the count of publications is presented in Fig. 13.

Fig. 13
figure 13

Distribution of publications according to the frequency of keywords used

5 Results and discussion

The discussions around the extracted answers to the research questions are elaborated in this section.

  • RQ1. Which ML/non-ML techniques are used in the studies for recognition?

Before analyzing the types of ML and non-ML techniques used to recognize HMEs, the authors segregated the selected studies based on the approach used. It is found that, of the total selected studies, 60 used ML algorithms for the task of handwritten math recognition, whereas 38 studies implemented a non-ML approach. As the popularity of ML and deep learning algorithms grew rather late, by the end of 2010, most studies before that used conventional, non-ML-based methods. An endeavor has been made to analyze the trend and popularity of the ML techniques used for recognition over the years. Figure 14 depicts the rising trend of machine learning techniques from the year 2000 to the present time. At the onset, not a single study used ML methods of recognition in the year 2000, and ML usage remained limited until 2011. After that, the pace of implementation of ML algorithms ([45, 116, 143]) increased, with almost every study using ML models by the year 2021. The stacked bars colored in red depict the studies using ML techniques for recognition, and the stacks colored in grey represent the studies implementing non-ML approaches; this graphical representation shows the rising trend of ML techniques over the years. Of the 60 studies found to use ML models for recognition tasks, the authors further analyzed the different ML techniques used across all the studies selected by the SLR. The list of ML techniques used for the recognition of HMEs is as follows:

  • Support Vector Machine (SVM)

  • Artificial Neural Network (ANN)

  • Convolutional Neural Network (CNN)

  • Recurrent Neural Network (RNN)

  • Bidirectional Long Short Term Memory (BLSTM)

  • K-means neighbors (K-means)

  • Decision Tree and Random Forest (DT + RF)

  • Generative Adversarial Networks (GANs)

  • Graph Neural Network (GNN)

Fig. 14 Year-wise distribution of studies based on ML/non-ML technique

Figures 15 and 16 show, for each technique, the number of studies that applied it. It can be noted that the most frequently used ML technique overall is SVM. However, if the trend after 2013 is observed, ANN and CNN are applied more frequently to recognition systems when the experimental parts of the papers are scrutinized closely. As per our observations, about eleven papers used ANN and thirteen used CNN to classify and recognize math expressions. Overall, SVM and CNN are the most frequently used techniques, with about 18 and 13 studies, respectively. In addition, SVM competed closely with ANN, and even after 2013 a considerable number of papers still used SVM for recognition. Generalizing, it can be concluded that SVM, ANN, and CNN are the techniques primarily used to recognize HMEs and symbols.

When the number of papers associated with each method is broken down, SVM is found to be the most frequently used ML method, appearing in approximately 18 studies, or about 29% of all studies. CNN is the second most commonly used technique, explored in approximately 13 (about 21%) distinct papers. Several varieties of neural networks (NNs), such as backpropagation networks [167] and recurrent neural networks, have been utilized in this research [11, 216, 219], along with fuzzy NNs [69, 90, 118]. ANN has been investigated in 11 studies (approximately 18%), while BLSTM has been employed in 6 (approximately 11%) and RNN has been implemented in four different studies. Decision trees and random forests have been used in two and one studies, respectively. K-means neighbor has been used in three selected studies, and other techniques like Naïve Bayes and Generative Adversarial Networks have been scarcely used, with a single entry each. The history of the dominant ML models is shown in Table 14, whereas Fig. 15 shows the algorithms used by the selected studies and Fig. 16 shows the distribution of HMER-based selected studies according to the ML techniques. It is also observed that the three dominating ML techniques were applied in this field only several years after they were first introduced. For instance, the SVM algorithm originated in 1963, its modern formulation was published in 1995, and the selected studies started implementing the technique in 2003 [179]. The second most frequently used ML technique is CNN. The ‘Neocognitron,’ the origin of the CNN architecture, was introduced by Kunihiko Fukushima in 1980; owing to the pioneering work of Yann LeCun, one of the first convolutional neural networks, LeNet, moved the field of deep learning forward in 1989, and after many successful iterations the LeNet-5 architecture followed in 1998. Among the selected studies, the very first study employing CNN appeared in 2015. ANN, whose perceptron model was first developed in 1958 by psychologist Frank Rosenblatt, is the third most used machine learning technique. Many real-world institutes began applying ANNs to a variety of applications only in the late 1980s, and the studies included in this SLR did not start making use of this approach until 2006 [153].
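To make the dominant technique concrete, the following minimal sketch (in Python, using scikit-learn) illustrates how an SVM classifier is typically applied to isolated handwritten symbol images. Scikit-learn's digits dataset stands in here for a handwritten math symbol set, and the kernel and hyperparameter choices are illustrative assumptions rather than settings taken from any reviewed study.

```python
# Minimal sketch: SVM-based symbol classification, as commonly reported in the
# reviewed studies. The digits dataset stands in for a handwritten math symbol
# dataset (e.g., isolated CROHME symbols), which is an assumption for the demo.
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

digits = load_digits()                                   # 8x8 grayscale glyph images
X = digits.images.reshape(len(digits.images), -1)        # flatten images into feature vectors
y = digits.target

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

# RBF-kernel SVM; in practice, features such as directional histograms or
# stroke-based descriptors would replace raw pixel intensities.
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=10, gamma="scale"))
clf.fit(X_train, y_train)
print(f"Symbol classification accuracy: {clf.score(X_test, y_test):.3f}")
```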

Table 14 History and year of implementation of dominant ML models in the studies
Fig. 15 Machine learning methods used by the studies

Fig. 16 Distribution of studies according to the ML technique implemented

Delving into the non-ML recognition techniques, a blend of trends is witnessed in the studies before the advent of the machine learning era. The major non-ML recognition approaches identified are grammar-based approaches such as graph grammars [56, 75, 89], stochastic context-free grammars [7, 108, 134, 201], probabilistic context-free grammars [35, 171], definite clause grammars [39], and other algorithmic approaches [141, 142, 210]. There have also been studies concentrating on parsing [41, 108, 206, 208], fuzzy methodologies [58, 60, 70, 90, 102, 117, 118], and other methods based on relational grammars [120]. Though the 38 studies that are purely based on non-ML approaches account for roughly 39% of the chosen studies, the inclination towards these methods cannot be neglected, as the early research on HMSER relied heavily on them. But, undoubtedly, ML techniques have come to dominate the recognition trends in HMSER overall.
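As a concrete illustration of the grammar-based family, the following minimal sketch parses an already-recognized, linear symbol sequence with a handful of production rules. Real systems in the reviewed studies parse 2D symbol layouts, so this 1D recursive-descent parser, its token format, and its rule set are simplifying assumptions made purely for illustration.

```python
# Minimal sketch of a grammar-driven structural analysis step, assuming symbol
# recognition has already produced a linear token sequence. The rules roughly
# encode  Expr -> Term (('+'|'-') Term)*  with a superscript relation.
from dataclasses import dataclass
from typing import List, Union

@dataclass
class Node:
    op: str
    children: List[Union["Node", str]]

def parse_expression(tokens: List[str]) -> Node:
    pos = 0

    def peek():
        return tokens[pos] if pos < len(tokens) else None

    def eat():
        nonlocal pos
        tok = tokens[pos]
        pos += 1
        return tok

    def factor():
        base = eat()                      # a symbol such as 'x' or '2'
        if peek() == "^":                 # superscript relation -> exponent subtree
            eat()
            return Node("sup", [base, factor()])
        return Node("sym", [base])

    def term():
        node = factor()
        while peek() == "*":
            eat()
            node = Node("*", [node, factor()])
        return node

    def expr():
        node = term()
        while peek() in ("+", "-"):
            node = Node(eat(), [node, term()])
        return node

    return expr()

# Example: the recognized symbol sequence for "x^2 + 3*y"
print(parse_expression(["x", "^", "2", "+", "3", "*", "y"]))
```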

Pros and cons of using ML/non-ML approaches for HMER

Non-ML Techniques:

Pros:

  • Non-ML methods, such as rule-based and grammar-based approaches, template-matching algorithms, and other parsing techniques, can be more efficient and require less computational capacity than ML methods.

  • These techniques can be useful for recognizing simple mathematical expressions or symbols with well-defined patterns and structures.

Cons:

  • Non-ML techniques are less precise when it comes to recognizing complex mathematical expressions or symbols, where handwriting, size, and orientation variations may exist.

  • Non-ML techniques necessitate more manual intervention and specialized knowledge because they rely on predefined rules or templates that must be created and maintained.

ML Techniques:

Pros:

  • ML techniques, such as deep learning algorithms, can be highly accurate at recognizing complex mathematical expressions because they learn to recognize patterns and characteristics from large datasets of HMEs.

  • The ML techniques can accommodate variations in handwriting, size, and orientation because they can learn to recognize the mathematical expression’s underlying structure rather than its individual symbols.

  • ML techniques are also capable of automatically adapting to new handwriting patterns and symbols.

Cons:

  • To train and optimize ML models, large amounts of data and computational capacity are required, which can be time-consuming and costly.

  • ML techniques can also be susceptible to overfitting, which occurs when models learn to recognize specific patterns in the training data but cannot generalize to new data.

  • ML techniques may necessitate more specialized knowledge for model development and optimization.

To summarize, non-ML techniques may be effective for basic mathematical expressions or symbols, but they are less precise for complex expressions. ML techniques can accomplish greater accuracy and can handle complex expressions, but their development and optimization require more data, computational power, and expertise.

  • RQ2. Which datasets are frequently used in the studies?

Almost ten datasets have been used in the selected studies; any dataset used at least once in a selected study has been considered. The listing of all datasets used is given in Table 15, along with the count of studies that used each dataset.

Table 15 Count of papers using different datasets

It has been observed that the most widely used dataset in the SLR, employed in almost 38% of the studies, is CROHME. This dataset is provided by the CROHME series of competitions. CROHME is a competition that was first held in 2011 in Beijing as part of the International Conference on Document Analysis and Recognition (ICDAR), and it has also contributed to the mathematical formula symbol library. One of its main goals is to encourage research in the area of HME recognition.

In addition to supplying the dataset, CROHME also provides academics with a platform on which they can test and analyze their methods and ultimately stimulate further development in this area. Prior to CROHME, only a modest amount of research on math recognition had been carried out, without the benefit of benchmark datasets, standard encodings, or evaluation tools. Thanks to the CROHME competition, researchers are able to effectively evaluate different systems and work on improving handwritten math recognition. Self-created datasets are the second entry in the table above. This type of dataset is extremely adaptable, as several authors have generated unique and diversified datasets of their own; for example, some authors included written expressions from conventional math books, while others recruited volunteer writers to compose HMEs. It has been noted that the majority of the studies conducted before 2011 relied on datasets produced by the researchers themselves because a common standard dataset was not readily available. After the launch of the CROHME series, however, most studies mainly used the CROHME dataset.

As the majority of the studies used the CROHME dataset, the authors decided to analyze the sub-datasets released as part of this series of competitions. CROHME was held in 2011, 2012, 2013, 2014, and 2016, and each edition provided a different dataset of handwritten mathematical symbols and expressions. The authors have investigated the count of studies using the datasets released in the different years. Figure 17 clarifies the trend of the different CROHME datasets used by the selected studies.

  • RQ3. What type of handwritten symbols are used (online/offline)?

Fig. 17 Distribution of studies according to CROHME sub-dataset

Recognition of handwritten symbols written on paper or other non-digital media is called offline recognition, whereas recognition of symbols written on a digital platform, where pen or fingertip movements are recorded, is called online recognition. Because online input offers additional information about how the writing is produced, the accuracy of online recognition is typically higher than that of offline recognition. Despite this, offline handwriting recognition is employed in large-scale real-world systems, such as reading the monetary values on bank cheques or deciphering handwritten postal addresses [145]. In online expression recognition, the input to the system is composed of a set of strokes that contain geometric and temporal information, and the system can make use of the temporal information contained in the online input. In offline mode, the input image carries no such temporal component; this mode of input is consequently harder to recognize and is utilized less frequently in the recognition literature. Table 3 illustrates the primary distinction between the two categories of HMEs, and Fig. 18 illustrates the percentage breakdown.
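For illustration, the following minimal sketch shows what the online representation carries and what is lost when it is rendered into an offline image. The (x, y, t) stroke format, the grid size, and the example strokes are assumptions made for the demo, not a format prescribed by any reviewed study.

```python
# Minimal sketch contrasting online and offline input, under the simplifying
# assumption that a stroke is a list of (x, y, t) samples from a pen trace.
# Rendering strokes onto a pixel grid discards the temporal (t) and stroke-order
# information, which is exactly what an offline recognizer no longer sees.
from typing import List, Tuple

Stroke = List[Tuple[float, float, float]]   # (x, y, timestamp)

def rasterize(strokes: List[Stroke], width: int = 32, height: int = 32) -> List[List[int]]:
    """Project online strokes onto a binary image grid (the offline view)."""
    image = [[0] * width for _ in range(height)]
    xs = [x for s in strokes for (x, _, _) in s]
    ys = [y for s in strokes for (_, y, _) in s]
    min_x, min_y = min(xs), min(ys)
    span_x = max(max(xs) - min_x, 1e-6)
    span_y = max(max(ys) - min_y, 1e-6)
    for stroke in strokes:
        for (x, y, _t) in stroke:           # the timestamp is dropped here
            col = int((x - min_x) / span_x * (width - 1))
            row = int((y - min_y) / span_y * (height - 1))
            image[row][col] = 1
    return image

# Two strokes forming a rough '+': the temporal order is available online,
# but only the resulting pixels survive in the offline image.
plus_sign = [
    [(0.0, 5.0, 0.00), (5.0, 5.0, 0.10), (10.0, 5.0, 0.20)],   # horizontal bar
    [(5.0, 0.0, 0.50), (5.0, 5.0, 0.60), (5.0, 10.0, 0.70)],   # vertical bar
]
offline_image = rasterize(plus_sign)
print(sum(map(sum, offline_image)), "foreground pixels")
```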

Fig. 18 Distribution of papers based on HME type

It can be observed that most of the papers from the selected database of studies worked on online handwritten math glyphs rather than offline ones. The reason could be, as mentioned above, that online data contains considerably more spatial and temporal information; thus, most researchers prefer working on online HMEs.

  • RQ4. What metric is used to measure accuracy, how much accuracy has been achieved, and by which techniques?

Accuracy measures are essential components of this SLR; they are used to highlight how reliable a particular proposed model or system is when it comes to predicting and recognizing HMEs. Several accuracy measures have been applied across the selected 95 studies. All the frequently used accuracy measures are defined in Table 16.

Table 16 Description of Metrics/ Accuracy measures used by the reviewed studies

On investigating the trend of accuracy measures used, it is noticed that the studies published after 2010 are more likely to give comparable results, because well-formed datasets came into use only after that year. Before 2010, almost 90% of the studies used self-created datasets, for which it is hard to list and compare the accuracy metrics used; this is the reason for leaving out a portion of the studies published before 2010 from the comparison. Table 17 lists the studies taken into consideration while comparing the accuracy metrics used for experimentation and implementation of the recognition techniques for HMEs. The reviewers compared the accuracy measures used by all these studies and noted the dataset on which each experiment was carried out. Among these measures, ExpRate, the expression recognition rate, has been used in almost 43% of the studies.

Table 17 Accuracy analysis of the reviewed studies
  • RQ5. What approach is followed in the study, or what is the proposed system?

In this research question, the authors have tried to understand the kind of approach followed by the selected studies for recognizing math expressions in handwritten form. On investigating and generalizing the approaches, it is found that the recognition systems following the non-ML approach chiefly followed the conventional pipeline of symbol segmentation, symbol recognition, structural analysis, and expression recognition. The last phase (expression recognition) is common to the studies that used ML techniques, for which the proposed approaches generally followed the steps of preprocessing, segmentation, feature extraction, classification, and expression recognition. The generalized approach is shown in Fig. 19 and sketched in code after the figure caption below.

Fig. 19 Generalized approach used by different ML and non-ML methods in the selected studies
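The following schematic sketch mirrors the generalized ML pipeline above (preprocessing, segmentation, feature extraction, classification, expression recognition). Every stage body is a toy placeholder standing in for the concrete techniques surveyed in RQ6, not an implementation from any specific study.

```python
# Schematic sketch of the generalized ML pipeline: each stage is a placeholder.
from typing import List

def preprocess(strokes: List[list]) -> List[list]:
    # e.g., resampling, smoothing, size normalization
    return strokes

def segment(strokes: List[list]) -> List[List[list]]:
    # e.g., group strokes into candidate symbols
    return [[s] for s in strokes]

def extract_features(symbol_strokes: List[list]) -> List[float]:
    # e.g., directional histograms, geometric descriptors
    return [float(len(symbol_strokes))]

def classify(features: List[float]) -> str:
    # e.g., an SVM / CNN / BLSTM classifier; here a dummy label
    return "x"

def recognize_expression(symbols: List[str]) -> str:
    # e.g., grammar-based or attention-based structural analysis into LaTeX
    return " ".join(symbols)

def pipeline(strokes: List[list]) -> str:
    symbols = [classify(extract_features(s)) for s in segment(preprocess(strokes))]
    return recognize_expression(symbols)

print(pipeline([[(0, 0)], [(1, 1)]]))   # -> "x x"
```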

  • RQ6. What kind of techniques is/are used in the sub-process?

It has also been observed that specific procedures or specialized techniques are followed to execute the phases mentioned above efficiently in both approaches. This research question focuses on highlighting the most frequently used specialized procedures and techniques for executing each stage of the problem in both approaches. The authors noticed that some studies used several different methods for preprocessing, segmentation, classification, and so on. A summary of the experimental techniques used in the sub-processes of recognition is presented in Table 18. Note that only papers in which some special technique is used in a sub-process have been considered here.

Table 18 Sub-processes analysis concerning the reviewed studies
  • RQ7. Which ML Technique outperforms other ML techniques?

About 60% of the selected studies used ML techniques, which are compared here with the other ML methods. To reach conclusions for this research question, the performances of different ML techniques have been compared based on the accuracy measure/metric used. By comparing the accuracy results, an appropriate answer to this question can be retrieved and analyzed; still, it would be unfair to compare values belonging to different accuracy metrics. So, the authors decided to identify the most frequently used measure, which could be taken as a standard metric for accuracy analysis, and by comparing the corresponding accuracy values the authors could determine which technique outperforms the rest. The steps involved in reaching conclusions for RQ7 are as follows:

  1. Construct a table detailing, for each study, the technique used, the dataset, the accuracy measure/metric, and the accuracy value.

  2. Identify the accuracy metric that has been used most often.

  3. Compare the accuracy values corresponding to the identified metric (from step 2) across the different ML techniques used.

Note: this comparison procedure is designed because different studies used different datasets and various accuracy measures, so it would not be justified to compare accuracy values irrespective of the accuracy metric used. The authors believe that the conclusions would be more robust if performance could be evaluated on the same datasets with the same accuracy measure, but the lack of standardization forced us to assess the results using the procedure defined above.
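A minimal sketch of this three-step procedure is given below. The records other than S15 and S23 (whose ExpRate values appear later in this section) are invented placeholders, and the table only illustrates the filtering-and-comparison logic, not the actual contents of Table 17.

```python
# Sketch of the three-step comparison procedure for RQ7 on illustrative records;
# only the S15 and S23 values come from the text, the rest are made up.
from collections import Counter

records = [
    {"study": "S15", "technique": "SVM", "metric": "ExpRate", "value": 68.07},
    {"study": "S23", "technique": "SCFG+SVM", "metric": "ExpRate", "value": 68.07},
    {"study": "S40", "technique": "CNN", "metric": "ExpRate", "value": 52.80},        # placeholder
    {"study": "S41", "technique": "BLSTM", "metric": "symbol accuracy", "value": 92.10},  # placeholder
]

# Step 2: identify the most frequently used accuracy metric.
most_common_metric, _ = Counter(r["metric"] for r in records).most_common(1)[0]

# Step 3: compare techniques only on studies reporting that metric.
comparable = [r for r in records if r["metric"] == most_common_metric]
best = max(comparable, key=lambda r: r["value"])
print(f"Most used metric: {most_common_metric}")
print(f"Best on that metric: {best['technique']} ({best['value']}%) in {best['study']}")
```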

Table 17 allows for an easy analysis that reveals ExpRate to be the accuracy metric utilized most of the time. The expression recognition rate is the proportion of correctly recognized expressions to the total number of expressions. On analyzing the values reported against ExpRate, the highest accuracy value observed is 68.07%. This ExpRate is obtained by applying SVM on the Handsmath dataset (using an augmented incremental approach) (refer to S15). Thus, the highest accuracy rate is found as the outcome of SVM, leading to the conclusion that SVM outperformed all other ML models used by the different studies.
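In other words, the metric can be written as

$$\mathrm{ExpRate} = \frac{\text{number of correctly recognized expressions}}{\text{total number of expressions}} \times 100\%.$$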

NOTE: research articles that only summarize the winning models of the CROHME competitions have been excluded from the comparison; independent studies have been taken for performance analysis and comparison.

  • RQ8. Which ML techniques outperform other non-ML methods?

As for non-ML approaches, about 40% of the selected studies used non-ML techniques. The non-ML methods usually followed the recognition steps of symbol segmentation, symbol recognition, and structural analysis. Fewer clear trends are observed in the kinds of non-ML techniques used, so when comparing their performance with ML techniques, only the studies that evaluated performance using the identified accuracy metric are considered. Among the accuracy values reported against the expression recognition rate, the leading ExpRate is 64.9%, obtained by implementing a Gaussian model. This accuracy rate is lower than the rate produced by the SVM model. Thus, it can be concluded that SVM (an ML model) outperformed the non-ML techniques.

It must be noted that a study abbreviated as S23 reported an ExpRate of 68.07%, the same as that reported by study S15. In S23, the Cocke–Younger–Kasami (CYK) algorithm is used to parse the two-dimensional (2D) structures of online handwritten MEs, with the MEs encoded in the form of a stochastic context-free grammar (SCFG); however, this research also utilized SVM classifiers. This ExpRate is evaluated on the Handsmath dataset. Thus, the accuracy rate equivalent to that of S15 (the study that holds the maximum expression recognition rate) is achieved by a hybrid approach, i.e., by applying both ML and non-ML models. Such combined approaches are out of the scope of this review.
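For readers unfamiliar with the parsing machinery, the following minimal sketch runs probabilistic CYK over a toy SCFG in Chomsky normal form. The grammar, its probabilities, and the 1D token input are invented simplifications; systems such as S23 parse 2D stroke layouts rather than linear sequences.

```python
# Minimal probabilistic CYK sketch over a toy SCFG in Chomsky normal form.
from collections import defaultdict

# Terminal rules: nonterminal -> (terminal, probability)
lexical = {
    "E": [("a", 0.3), ("b", 0.3)],
    "P": [("+", 1.0)],
}
# Binary rules: nonterminal -> (left child, right child, probability)
binary = {
    "E": [("E", "R", 0.4)],   # E -> E R   ("expression, then plus-rest")
    "R": [("P", "E", 1.0)],   # R -> P E
}

def cyk(tokens, start="E"):
    n = len(tokens)
    # table[i][j] maps nonterminal -> best probability of deriving tokens[i:j]
    table = [[defaultdict(float) for _ in range(n + 1)] for _ in range(n + 1)]
    for i, tok in enumerate(tokens):
        for lhs, rules in lexical.items():
            for term, p in rules:
                if term == tok:
                    table[i][i + 1][lhs] = max(table[i][i + 1][lhs], p)
    for span in range(2, n + 1):
        for i in range(0, n - span + 1):
            j = i + span
            for k in range(i + 1, j):
                for lhs, rules in binary.items():
                    for left, right, p in rules:
                        score = p * table[i][k][left] * table[k][j][right]
                        table[i][j][lhs] = max(table[i][j][lhs], score)
    return table[0][n][start]

print(cyk(["a", "+", "b"]))   # probability of the best parse, ~0.036 here
```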

  • RQ9. Which are the dominant journals/conference proceedings for papers analyzing HMEs recognition?

We identified 98 studies in the field of recognition of HMEs, published during the period 2000–2021. Among these selected studies, 45 (about 45%) papers were published in conference proceedings, 34 (approximately 35%) articles appeared in journals, 8 (8%) articles are from workshops, and the rest are taken from symposiums, forums, meeting proceedings, technical reports, and theses. The analysis of the studies selected from journals is presented in Fig. 20.

Fig. 20 Distribution of studies by different journals

The studies selected for the review are taken from 14 different journals, presented in Fig. 20. It is observed that the dominant journal in this research domain of HMEs is the International Journal on Document Analysis and Recognition (IJDAR), which contributed about 22% of the studies selected from journals, followed by Pattern Recognition Letters and Pattern Recognition, which ranked equally according to the figure and together contributed 42% of the journal-sourced studies.

The authors have also analyzed the selected studies based on the conference proceedings to which they belong. The specified set of papers is taken from 22 different conferences. The dominant conference in this category is the International Conference on Frontiers in Handwriting Recognition, followed by ICDAR.

A summary of the dominant journals, conferences, and workshops in this domain is presented in Table 19. The International Journal on Document Analysis and Recognition (IJDAR) has been the most dominant journal; the International Association for Pattern Recognition is the sponsoring agency for this reputed journal. The goal of this journal is to publish articles related to document analysis and document recognition. It invites articles of four different types: ‘original research papers’, ‘system descriptions’, ‘correspondence’, and ‘overviews and summaries’. It also focuses on special issues targeting active areas of research.

Table 19 List of dominating journals, conferences, and others used by reviewed studies

6 Summary of findings

The authors have thoroughly reviewed and presented the results of the systematic review analysis performed on recognition techniques for HMEs. We have followed the guidelines designed by [96] and applied inclusion-exclusion criteria to the studies retrieved by running a formulated search string on digital libraries such as Scopus, IEEE Xplore, Science Direct, Wiley, Springer, and the ACM Digital Library. The period of studies considered for this review is from January 2000 to June 2021. The last detailed report on this research topic was published in 2012 [204], and the aspects of analysis considered in that study are entirely different from the systematic approach used by this review. It should also be noted that the authors have chosen the potential studies out of the candidate studies based on how well each study can answer the formulated research questions; citation counts were given the least attention, and quality assessment scores were strictly followed when choosing a study for the review.

The authors also believe that such comprehensive and systematic work has not been performed before in this research domain of recognition of HMEs. In the initial stage of screening papers, before performing the quality assessment, the number of candidate studies was as high as 202; these studies were collected, analyzed, and stored in the screening database for reference purposes. It is found that very few review studies have been performed to date on this research theme. We analyzed these reviews to summarize which aspects and review concepts have been covered, which helped give a better direction before framing this study. During this analysis, it was also observed that no existing review study has used systematic literature review guidelines and frameworks to review this challenging research domain. Thus, there is a need for a new analysis that covers the gap of 8 or 9 years and also adds a new systematic dimension to the analysis of this topic.

Indeed, the idea behind conducting this SLR is to summarize the work done in this field and present several review perspectives to present and future researchers, helping them generalize and revise the prerequisites required to know about this subject area. Moreover, it covers the techniques used and analyzes the metadata related to the selected studies. Based on the quality assessment criteria, the reviewers chose 98 studies out of the 202 retrieved studies; these studies were selected based on their relevance and quality score according to the quality assessment questions. The 98 articles are published in 14 leading journals, 22 premier conferences, eight established workshops, and other sources such as symposiums, technical reports, and theses.

The authors have analyzed all possible aspects required for reaching conclusions and finding answers to the research questions. The data from the selected studies are analyzed qualitatively and quantitatively using narrative, vote counting, and grounded theory methods. The review aims to segregate the studies based on the technique used (ML/non-ML) to recognize handwritten mathematics. The focus is not limited to analyzing the methods and comparing their accuracies; the authors also analyzed the datasets used and the kind of handwritten expressions used in those datasets for experimentation. The primary findings of this study are summarized in Table 20.

Table 20 List of findings as per the formulated research goals

Apart from the primary results and findings drawn from the selected studies of the SLR, the authors have extended the findings by analyzing other details of the research publications associated with this research area. The summary of findings from the metadata extracted from the studies included in the SLR is tabulated in Table 21.

Table 21 Meta-analysis listing and findings from the reviewed studies

The above summarizes the significant findings from the research studies referred to and reviewed while conducting this SLR. Although much more could be added about the results, we have tried to summarize the research findings clearly and crisply by tabulating and listing the highlights and key conclusions extracted from the study. Finally, in a nutshell, we want to highlight that the primary purpose of this review is to analyze the trend of techniques used to recognize HMEs. Though the research questions are formulated especially targeting the ML-based studies published between 2000 and 2019, the SLR has also analyzed the studies using non-ML approaches as well as deep learning models, thereby extending the scope to a broader scale of review. This SLR follows an entirely different approach than the other review studies performed since 2000, and to the best of our knowledge it is the first SLR ever performed on this research domain of recognition of HMEs. This SLR fulfills all the described objectives by determining:

  • the new priorities of researchers regarding the recognition techniques, datasets, types of handwritten expressions, accuracy metrics, approaches, and sub-techniques employed in their studies;

  • which technique outperforms the other methods employed for recognition, based on a comparison of the reported experimentation results;

  • how the studies are distributed across several metadata aspects such as recognition technique, publication year, authors, country, journals, conferences, keywords, and affiliations.

7 Limitations of this study

Taking into account the above findings, we have noticed certain limitations of this study. In places, the scope could be made broader by formulating more research questions that analyze in greater depth the techniques used for recognition over a larger set of MEs. Researchers could also explore, in an extended version of this study, a categorical analysis of the features extracted in the ML-based studies; this study has restricted its scope to examining the techniques based on the ML/non-ML methods used. Another dimension could be to analyze the studies based on combined approaches, which use some ML classifiers together with some non-ML methods for recognition.

Further, this review is constrained because it restricts its scope to generic HMEs, with no examination of the identification and recognition of MEs written in different scripts and languages such as Arabic, Chinese, Gurmukhi, Devanagari, and so on. There is thus a need for a broader, more comprehensive review dealing with HMEs written using different scripts and languages. A detailed study of all features of written math expressions could also be undertaken, as this review discusses features less thoroughly. We sum up these limitations here and leave them to be resolved in future studies.

8 Conclusions and future scope

  • Many of the datasets used in the studies lack a large expression corpus. There is a need for a standard dataset, a complete corpus built using expressions of all forms, ranging from high school equations to complex scientific expressions.

  • Although many of the offered methods and approaches have been successful in achieving high levels of accuracy in their outcomes, there is still a lack of unified procedures in this field to evaluate the effectiveness of those methods.

  • Considering there was no freely accessible public dataset of HMEs before 2011, researchers were compelled to collect and construct sets of MEs on their own, which tend to be restricted to a subset of expressions or particular domains. As a result, the comparative examination of the various methods is made more difficult, and a side-by-side comparison of the respective performance levels of different recognition models and systems is often impractical.

  • No standard accuracy measurements have been developed for the recognition tasks in this domain. This absence is what causes the inconsistent choice of metrics used in the research to evaluate performance and carry out the experiments, and it makes direct comparisons between accuracy values produced with different metrics difficult. Therefore, there is a need for a well-defined standard accuracy metric to be adopted and experimented with.

  • There are several ways of representing MEs. Many systems build trees to represent the expressions resulting from structural analysis, while others use parsing techniques and express the grammar using context-free grammar rules. Other representations include binary trees and baseline structure trees, the latter capturing the hierarchical structure of baselines in a mathematical expression. Several procedures have been proposed and implemented using representations like the bounding box, body box, and hidden writing area (HWA), even for stroke recognition. The variety of representations used in the recognition process, together with the different classifiers and feature extractors, broadens the research zone, and there is a need for a separate comparative study analyzing recognition systems based on the kind of representation method used (a minimal sketch of one such tree representation follows this list).

  • As the field of recognition of HMEs has been advancing considerably over the past decade, this review concludes that there is an urgent need to standardize the different evaluation measures (accuracy metrics) and to make use of standard, commonly used datasets so that better conclusions can be drawn about the performance of different recognition systems. A public benchmarking system should be developed to ease and facilitate the comparative analysis of the achievements of the various recognition systems.

  • It is observed that different classification techniques perform differently when combined with different models and other ML/non-ML methods and applied to different datasets; the outputs vary and are highly incomparable in these cases. So, standardized datasets, accuracy measures, and a benchmark system need to be established after thoroughly analyzing, revising, and examining the implementation of the pattern recognition techniques used in this research domain.
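Relating to the bullet on representations above, the following minimal sketch shows one way a tree-based representation of an ME might look, with symbols linked by spatial relations and converted to LaTeX. The relation names and the conversion rules are illustrative choices, not a standard taken from any specific reviewed study.

```python
# Minimal sketch of a layout-tree representation of a math expression.
# Nodes carry a symbol plus spatially related children ('sup' for superscript,
# 'sub' for subscript, 'right' for the next baseline symbol).
from dataclasses import dataclass, field
from typing import Dict

@dataclass
class SymbolNode:
    symbol: str
    relations: Dict[str, "SymbolNode"] = field(default_factory=dict)

def to_latex(node: SymbolNode) -> str:
    out = node.symbol
    if "sub" in node.relations:
        out += "_{" + to_latex(node.relations["sub"]) + "}"
    if "sup" in node.relations:
        out += "^{" + to_latex(node.relations["sup"]) + "}"
    if "right" in node.relations:
        out += " " + to_latex(node.relations["right"])
    return out

# x^2 + 1 as a layout tree: 'x' has a superscript child '2' and a baseline
# successor '+', which in turn is followed by '1'.
tree = SymbolNode("x", {
    "sup": SymbolNode("2"),
    "right": SymbolNode("+", {"right": SymbolNode("1")}),
})
print(to_latex(tree))   # -> x^{2} + 1
```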

In the end, the authors draw the attention of future researchers to the implementation and accuracy of the various recognition techniques and to the need for a more systematic review of deep learning methods, which have proven to produce significantly enhanced performance. Also, as already mentioned, a standard model for the comparison of different techniques is a requirement. Another central point is that different classification techniques are evaluated under different experimental conditions, on different datasets, and with varying accuracy metrics. Thus, standard accuracy measures should be adopted for better comparisons in the future.