Machine Learning: Towards an Unified Classification Criteria

Burbano, Clara; Reveló, David; Mejía, Julio; Soto, Daniel

doi:10.1007/978-3-030-70416-2_6

Clara Burbano¹⁵,
David Reveló¹⁶,
Julio Mejía¹⁶ &
…
Daniel Soto¹⁷

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1346))

797 Accesses

Abstract

In a broad sense, Machine Learning (ML) is the performance optimization in a certain task through computational means, following a certain criterion and using referential data and/or past results from previous iterations. ML is a subset of Artificial Intelligence (AI) and has attracted a substantial amount of research during the last decades. This blooming subject led to the statement of different definitions for classifications, criteria, algorithms and so on. This paper summarizes these different definitions and proposes a homologation between them, providing an unified vision for each definition.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Defining Machine Learning

Machine Learning Techniques: A Survey

Fundamental Concepts of Machine Learning

Keywords

1 Introduction

1.1 On the Growing Popularity of Artificial Intelligence and Machine Learning

During the last years, Artificial Intelligence (AI) related projects, as well as Machine Learning (ML) related projects grew in numbers, both at scientific and at industry level. While any developing research field sparks scientific interest on its own because of much knowledge is yet to be explored – with a quick Internet search, you can even find websites proposing research ideas [1,2,3,4], AI and ML proved to also be very attractive to different markets because of how much value companies get (and are forecasted to keep getting) thanks to AI and ML related products, proven by the growing marked size for these kind of technologies, predicted to be as big as US$160 Bn by 2026 [5], as well as by how the stock markets for these fields have been positively behaving (and predicted to keep doing so) during recent years [6].

Solutions like customer behavior analysis, chatbots, business forecasting tools, behavior-based cybersecurity systems, to name a few, turned out to be highly profitable business [7, 8], which in turn made investment on these areas to grow as now is possible to even find crowdfunding websites for these kind of technologies [9].

This strong trend led to the involvement of many of the world’s largest companies, to active development AI and ML technologies. Interestingly enough, many of the companies leading in AI and ML markets, are also active players in the Cloud Computing industry, often developing hybrid solutions using AI and ML technologies on cloud services, promoting even more investing in these fields [10].

Current market analysis forecast that AI and ML could lead to an weighted average of 1.7% across 16 different industries as well as to increase the economic output of those industries up to US$4 trillion by 2035. What’s more, when analyzed at country-level, it is forecasted that AI and ML could double the economic growth rates, among 12 countries sampled [11, 12]. It is also noteworthy that AI and ML are already creating new jobs, with industries requiring workers such as AI engineer, machine learning scientist, AI developer, among others [13].

The AI and ML have a well justified popularity both in the scientific and industrial communities. That being said, it is important to clarify and understand the difference between both, which is explained in the next Sect. 6.1.2. Later in Sect. 6.2 this paper dives into the different classification criteria used for ML algorithms, and then the Unified Vision Proposal is provided and explained in Sect. 6.3, followed up by conclusions and future work suggestions.

1.2 Defining Artificial Intelligence and Machine Learning

While the previous section mentioned AI and ML together, these are two different – but closely related – terms.

Although it is possible to – extremely – simplify this by stating that ML is a subset inside the AI field, brief but proper definitions are provided and referenced as follows:

Artificial Intelligence (AI). AI represents a set of complex edge technologies capable of interacting with its environment by means of simulating human intelligence [14] and is considered the core of the so-called “Fourth Industrial Revolution” [15].
Machine Learning (ML). Is the performance optimization in a certain task through computational means, following a certain criterion and using referential data and/or past results from previous iterations [16]. ML comes from the need to tackle problems beyond the reach of traditional, hardcoded IA solutions, being technically a specialized subset of AI, focused on real world knowledge applied to machines capable of making “subjective” decisions [17].

In a broad sense, the basic machine learning process involves building a ML based by “training” the machine using referential data [18].

2 Classifications and Selection Criteria in Machine Learning

2.1 Machine Learning Algorithm Classifications

Many authors concur in classifying ML algorithms based on a cognitive criterion, meaning that each ML algorithm belongs to a certain group depending on how it “learns”. This approach identifies three main categories: Supervised learning, unsupervised learning and reinforced learning [18, 19], although some authors reduce these categories to the first two [17, 20].

Supervised Learning. As the name suggests, ML algorithms are “guided”. This guidance takes the form of referential data, usually called “target” data, so the algorithm knows that it must identify that kind of data. The algorithm then is trained used that target data, so when it is ready, it can identify whenever it is shown the target data or something else. This kind of algorithm is usually seen in tasks which require identifying what kind of input data (an image, for example) is being presented to the algorithm.
Unsupervised Learning. Unlike the previous ML type, those algorithms belonging to the unsupervised learning classification do not have the help of target data, so they rely on identifying patterns and structures on the input data they have to work with. In Example, an unsupervised ML algorithm will be given a set of pictures of pencils, apples and cars, so after iterating over that info, it will eventually be able to separate the pencils from the apples and the cars.
Reinforced Learning. These kinds of ML algorithms work similarly to its counterpart in psychology. The result of a task will be awarded or penalized depending on whether the answer is right or wrong, so the algorithm will learn from its previous experiences to answer right. Instead of using a target data set, it works with goals it aims to achieve. Some video games provide a very nice example of this kind of learning, when a character needs to go from point A to point B, while having many possible paths, by only one is the optimal one [21].

Other classification methods are based on the type of problem to be solved, or the type of data needed to be handled, or even in the type of statistical procedure required to achieve a solution. The strategies are closer to the decision criteria used to decide when to use a given algorithm, so we’ll cover those in the next section.

2.2 Criteria for Choosing the Right ML Algorithm

While ML algorithms can be very flexible, some are more suited to certain scenarios than others.

Current literature states that the following variables are to be considered when deciding which algorithm is to be used:

Data size: As some algorithms can have higher execution times, for very large datasets this can discard some options.
Data quality: Algorithms relying heavily on the accuracy of the data presented to them (like in the case of supervised learning types), when the available data isn’t reliable enough, it should be preferred to use algorithms which don’t have this heavy dependency.
Available time: Closely related to the data size variable, when confronted to a short deadline, some algorithms can be an actual obstacle to the research.
Data type: Discrete and continuous data are to be approached differently, thus the algorithm must be chosen with this variable in mind as well.

3 Classification Filtering and Unified Vision Proposal

As previously stated, current literature provides a myriad of classification names to ML algorithms, often being redundant by giving a similar name to something already classified as a different name.

We have gathered all the definitions we found in the aforementioned literature, then filtered repeated results, and sorted them as a unified vision. Given the large number of concepts and relationships involved, the full scheme is presented in four parts, as shown in Figs. 6.1, 6.2, 6.3 and 6.4. Then we sorted those relationships in a cleaner, shorter version, which is shown in Fig. 6.5.

4 Conclusions and Future Work

This research went through different views when it comes to how to classify ML algorithms, as well as views on which factors include to decide which algorithm is the best option for a given scenario. Then we found some common ground among various views, from which we graphically described how each view integrates into a greater scheme of things. It was possible to filter “repeated” views so as a result, we produced a refined version of this graphic perspective, in the form of a conceptual map, which we propose as a tool to contribute to a better understanding of ML in a more structured way.

Nevertheless, by the time out research finished, new literature added even more views [22] up to 14 different ML algorithm types [23], so this proposal still has room for improvement. Future work should review those new classification proposals, in order to find a way to integrate them into this new, greater classification scheme.

Of course, as a relatively young and unexplored field, ML might lead to new algorithms and classifications currently unexplored, which may or may not integrate seamlessly into this scheme, thus our proposal might (or might not) require a deep reforming.

References

Geeks for Geeks Website, https://www.geeksforgeeks.org/8-best-topics-for-research-and-thesis-in-artificial-intelligence/. Last accessed 2020/05/07
1 Red Drop Blog, https://1reddrop.com/2020/05/14/15-artificial-intelligence-research-paper-topics-for-writing/. Last accessed 2020/05/07
Tech Sparks Blog, https://www.techsparks.co.in/artificial-intelligence-as-an-m-tech-thesis-topic-for-cse/. Last accessed 2020/05/07
Data Flair Blog, https://data-flair.training/blogs/machine-learning-project-ideas/. Last accessed 2020/05/07
Market Watch Press Release, https://www.marketwatch.com/press-release/artificial-intelligence-ai-market-value-growing-at-a-us-160-bn-by-2026-2020-04-13. Last accessed 2020/05/07
The Motley Fool Website, https://www.fool.com/investing/stock-market/market-sectors/information-technology/ai-stocks/. Last accessed 2020/05/07
Towards Data Science Website, https://towardsdatascience.com/value-investing-with-machine-learning-e41867156108?gi=1840c5a0962c. Last accessed 2020/05/07
T.A. Borges, R.F. Neves, Ensemble of machine learning algorithms for cryptocurrency investment with different data resampling methods. Appl. Soft Comput. 90, 106187 (2020)
Article Google Scholar
C.C. Chen, C.H. Chen, T.Y. Liu, Investment performance of machine learning: Analysis of S&P 500 index. Int. J. Econ. Financ. Issues 10(1), 59–66 (2020)
Article Google Scholar
Datamation Website, https://www.datamation.com/artificial-intelligence/top-artificial-intelligence-companies.html. Last accessed 2020/05/07
Accenture Website: How AI boosts industry profits and innovation, https://www.accenture.com/_acnmedia/Accenture/next-gen-5/insight-ai-industry-growth/pdf/Accenture-AI-Industry-Growth-Full-Report.pdf?la=en. Last accessed 2020/05/07
Accenture Website: Industry spotlights: How AI boosts industry profits and innovation, https://www.accenture.com/_acnmedia/Accenture/next-gen-5/insight-ai-industry-growth/pdf/Accenture-AI-Industry-Growth-Industry-Report.pdf?la=en. Last accessed 2020/05/07
Datamation Website, https://www.datamation.com/artificial-intelligence/artificial-intelligence-jobs.html. Last accessed 2020/05/07
E. Glikson, A.W. Woolley, Human trust in artificial intelligence: Review of empirical research. Acad. Manag. Ann. 14(2), 627–660 (2020)
Article Google Scholar
K. Schwab, The Fourth Industrial Revolution (Crown Business, New York, 2017)
Google Scholar
E. Alpaydin, Introduction to Machine Learning (MIT Press, 2020)
MATH Google Scholar
I. Goodfellow, Y. Bengio, A. Courville, Deep Learning (MIT Press, 2016)
MATH Google Scholar
Edureka! Blog, https://www.edureka.co/blog/introduction-to-machine-learning/. Last accessed 2020/05/07
Medium Article: Different types of machine learning and their types, https://medium.com/deep-math-machine-learning-ai/different-types-of-machine-learning-and-their-types-34760b9128a2. Last accessed 2020/05/07
Hunter Heidenreich Blog: Machine learning for the average person: What are the types of machine learning?, http://hunterheidenreich.com/blog/breaking_down_ml_for_the_average_person/. Last accessed 2020/05/07
M. Kubat, B. Ivan, M. Ryszard, A review of machine learning methods, in Machine Learning and Data Mining: Methods and Applications, (Wiley, Chichester, 1998), pp. 3–69
Google Scholar
A. Dey, Machine learning algorithms: A review. Int. J. Comput. Sci. Inf. Technol. 7(3), 1174–1179 (2016)
Google Scholar
Machine Learning Mastery Blog, https://machinelearningmastery.com/types-of-learning-in-machine-learning

Download references

Author information

Authors and Affiliations

Institución Universitaria Antonio José Camacho, Cali, Colombia
Clara Burbano
Grupo de Investigación en Sistemas Inteligentes (GISI), Corporación Universitaria Comfacauca, Popayán, Colombia
David Reveló & Julio Mejía
Centro de Investigación de la Universidad Mayor CICS, Universidad Mayor, Santiago, Chile
Daniel Soto

Authors

Clara Burbano
View author publications
You can also search for this author in PubMed Google Scholar
David Reveló
View author publications
You can also search for this author in PubMed Google Scholar
Julio Mejía
View author publications
You can also search for this author in PubMed Google Scholar
Daniel Soto
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Clara Burbano .

Editor information

Editors and Affiliations

Department of Electrical and Computer Engineering, University of Nevada, Las Vegas, NV, USA
Shahram Latifi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Burbano, C., Reveló, D., Mejía, J., Soto, D. (2021). Machine Learning: Towards an Unified Classification Criteria. In: Latifi, S. (eds) ITNG 2021 18th International Conference on Information Technology-New Generations. Advances in Intelligent Systems and Computing, vol 1346. Springer, Cham. https://doi.org/10.1007/978-3-030-70416-2_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-70416-2_6
Published: 05 June 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-70415-5
Online ISBN: 978-3-030-70416-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Machine Learning: Towards an Unified Classification Criteria

Abstract