From Zero to Hero: A Process Mining Tutorial

Janes, Andrea; Maggi, Fabrizio Maria; Marrella, Andrea; Montali, Marco

doi:10.1007/978-3-319-69926-4_55

Andrea Janes¹⁹,
Fabrizio Maria Maggi²⁰,
Andrea Marrella²¹ &
…
Marco Montali¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 10611))

Included in the following conference series:

International Conference on Product-Focused Software Process Improvement

3771 Accesses

Abstract

Process mining is an emerging area that synergically combines model-based and data-oriented analysis techniques to obtain useful insights on how business processes are executed within an organization. This tutorial aims at providing an introduction to the key analysis techniques in process mining that allow decision makers to discover process models from data, compare expected and actual behaviors, and enrich models with key information about the actual process executions. In addition, the tutorial will present concrete tools and will provide practical skills for applying process mining in a variety of application domains, including the one of software development.

Access provided by CONRICYT-eBooks. Download conference paper PDF

Process mining: software comparison, trends, and challenges

Article 30 December 2022

Academic View: Development of the Process Mining Discipline

Process Mining: A 360 Degree Overview

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

Process mining [2] is a recent research discipline that sits between computational intelligence and data mining on the one hand, and process modeling and analysis on the other hand. Through process mining, decision makers can discover process models from data, compare expected and actual behaviors, and enrich models with information retrieved from data. This, in turn, provides the basis for understanding, maintaining, and enhancing processes based on reality.

Since process mining has many applications, this tutorial aims at encouraging participants to apply it in new fields, in which it has not been applied so far. In particular, first, we will introduce the process mining framework, the main process mining techniques and tools, and the different phases of event data analysis through process mining. Second, we will carry out an hands-on session using concrete process mining tools, considering business use cases, as well as the particular scenario of software processes. Finally, we will discuss common pitfalls and critical issues and will give suggestions on how to mitigate them.

2 Process Mining Framework

The reference framework for process mining is depicted in Fig. 1. On the one hand, process mining considers conceptual models describing processes, organizational structures, and the corresponding relevant data. On the other hand, it focuses on the real execution of processes, as reflected by the footprint of reality logged and stored by the software systems in use within an enterprise. For process mining to be applicable, such information has to be structured in the form of explicit event logs. In fact, all process mining techniques assume that it is possible to record the sequencing of relevant events occurred within an enterprise, such that each event refers to an activity (i.e., a well-defined step in some process) and is related to a particular case [1]. Events may have additional information stored in event logs. In fact, whenever possible, process mining techniques use extra information such as the exact timestamp at which the event has been recorded, the resource (i.e., person or device) that generated the event, the event type in the context of the activity transactional lifecycle (e.g., whether the activity has been started, canceled, or completed), or data elements recorded with the event (e.g., the size of an order).

The three main types of process mining techniques are marked by the three, thick red arrows in the bottom part of Fig. 1. Discovery starts from an event log and automatically produces a process model that explains the different behaviors observed in the log, without assuming any prior knowledge on the process. The vast majority of process discovery algorithms focus on the discovery of the process control-flow, towards generating a model that indicates the allowed sequences of activities according to the log. Conformance checking compares an existing process model and an event log for the same process, with the aim of understanding the presence and nature of deviations. Conformance checking techniques take as inputs an event log and a process model, and return indications related to the adherence of the behaviors contained in the log to the prescriptions contained in the model. Enhancement improves an existing process model using information recorded in an event log for that process. The inputs of enhancement techniques are a process model and an event log, and the output is a new process model that incorporates and reflects new information extracted from the data. The first important class of enhancement techniques is that of extension, where the input process model is not altered in its structure, but is extended with additional perspectives, using information present in the log. A second important class of enhancement techniques is that of repair, where deviations detected by checking the conformance of the input event log to the input process model are resolved by suitably modifying the process model.

The presented tutorial will demonstrate that process mining can also be used in the context of software development [9], trying to focus on aspects that occur with a certain regularity and with a finite number of possible activities so that repeating cases within an actual “process” can be discovered. One example will be the user behavior analysis [9], i.e., the study on how the users interact with software applications. In particular, we will study an example that can be found in [3], in which the authors applied process mining to study and to improve the user interface of an ERP software.

2.1 Process Mining Tools

A plethora of process mining techniques and technologies have been developed and successfully employed in several application domains^{Footnote 1}. The process mining solutions that will be used in the tutorial are ProM (Process Mining framework)^{Footnote 2} and Disco^{Footnote 3}.

ProM (Process Mining framework) is an Open Source framework for process mining algorithms [10] based on JAVA. It provides a plug-in based integration platform [4] that users and developers can exploit to run and deploy process techniques. This pluggable architecture currently hosts a very large amount of plug-ins covering all the different aspects of process mining, from data import to discovery, conformance checking, and enhancement along different perspectives [2].
Disco is a commercial, stand-alone and lightweight process mining tool. It supports various file formats as input, in particular providing native support for importing CSV files, which can be annotated with case and event information prior to the import. Disco has usability, fidelity, and performance as design priorities, and makes process mining easy and fast [7].

2.2 The XES Standard

In recent years, the XES (eXtensible Event Stream) format emerged as the main reference format for the storage, interchange, and analysis of event logs. XES, which is based on XML, appeared for the first time in 2009 [6] as the successor of the MXML format [5]. It quickly became the de-facto standard adopted by the IEEE Task Force on Process Mining^{Footnote 4}, eventually becoming an official IEEE standard in 2016 [8]. This standard will be introduced in the tutorial.

3 Agenda

The tutorial is structured as follows:

Introduction to the process mining framework
Designing process models and collecting event logs
- Process modeling: basics of Petri nets and BPMN
- The XES standard and the OpenXES reference implementation
Mining event logs, discovering processes
- ProM, the open-source jack of all trades
- Choosing a mining algorithm: Alpha, Heuristics miner, Fuzzy miner, or Multi-phase miner?
- Disco, the user friendly tool
Interpreting the mined models: discovery, conformance checking, and enhancement
- Deciding from which perspective to look at the data: choosing a case ID
- On the representational bias of process mining
Hands-on session
- Process mining success stories for business processes and software processes
- Walk-through of two process mining examples examining a business process and a software process, discussing strategies to collect, filter, analyze, and interpret data
- Discussion on the differences between mining business and software processes.

Notes

References

van der Aalst, W., et al.: Process mining manifesto. In: Proceedings of BPM International Workshops, vol. 99, pp. 169–194 (2012)
Google Scholar
van der Aalst, W.M.P.: Process Mining - Data Science in Action, 2nd edn. Springer, Heidelberg (2016). doi:10.1007/978-3-662-49851-4
Book Google Scholar
Astromskis, S., Janes, A., Mairegger, M.: A process mining approach to measure how users interact with software: an industrial case study. In: ICSSP 2015 (2015)
Google Scholar
van Dongen, B.F., de Medeiros, A.K.A., Verbeek, H.M.W., Weijters, A., van der Aalst, W.M.P.: The ProM framework: a new era in process mining tool support, pp. 444–454 (2005)
Google Scholar
van Dongen, B.F., van der Aalst, W.M.P.: A meta model for process mining data. In: Proceedings of EMOI - INTEROP, vol. 160. CEUR-WS.org (2005)
Google Scholar
Gunther, C.W.: XES standard definition version 1.0. Technical report, Fluxicon Process Laboratories. http://www.xes-standard.org
Gunther, C.W., Rozinat, A.: Disco: Discover your processes. In: Proceedings of the Demo Track of BPM, vol. 940, pp. 40–44 (2012)
Google Scholar
IEEE Computational Intelligence Society: IEEE standard for eXtensible Event Stream (XES) for achieving interoperability in event logs and event streams. IEEE Std. 1849–2016, p. i-50 (2016)
Google Scholar
Rubin, V.A., Mitsyuk, A.A., Lomazova, I.A., van der Aalst, W.M.P.: Process mining can be applied to software tool. In: Proceedings of the ESEM 2014. ACM (2014)
Google Scholar
Verbeek, H.M.W., Buijs, J.C.A.M., van Dongen, B.F., van der Aalst, W.M.P.: XES, XESame, and ProM 6. In: Soffer, P., Proper, E. (eds.) CAiSE Forum 2010. LNBIP, vol. 72, pp. 60–75. Springer, Heidelberg (2011). doi:10.1007/978-3-642-17722-4_5
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Free University of Bozen-Bolzano, Bolzano, Italy
Andrea Janes & Marco Montali
University of Tartu, Tartu, Estonia
Fabrizio Maria Maggi
Sapienza University, Rome, Italy
Andrea Marrella

Authors

Andrea Janes
View author publications
You can also search for this author in PubMed Google Scholar
Fabrizio Maria Maggi
View author publications
You can also search for this author in PubMed Google Scholar
Andrea Marrella
View author publications
You can also search for this author in PubMed Google Scholar
Marco Montali
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fabrizio Maria Maggi .

Editor information

Editors and Affiliations

University of Innsbruck, Innsbruck, Austria
Michael Felderer
Technical University Munich, Garching, Germany
Daniel Méndez Fernández
Brunel University London, Uxbridge, United Kingdom
Burak Turhan
Pontifical Catholic University of Rio de, Rio de Janeiro, Brazil
Marcos Kalinowski
University College London, London, United Kingdom
Federica Sarro
Vienna University of Technology, Vienna, Austria
Dietmar Winkler

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Janes, A., Maggi, F.M., Marrella, A., Montali, M. (2017). From Zero to Hero: A Process Mining Tutorial. In: Felderer, M., Méndez Fernández, D., Turhan, B., Kalinowski, M., Sarro, F., Winkler, D. (eds) Product-Focused Software Process Improvement. PROFES 2017. Lecture Notes in Computer Science(), vol 10611. Springer, Cham. https://doi.org/10.1007/978-3-319-69926-4_55

Download citation

DOI: https://doi.org/10.1007/978-3-319-69926-4_55
Published: 28 October 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-69925-7
Online ISBN: 978-3-319-69926-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

From Zero to Hero: A Process Mining Tutorial

Abstract

Similar content being viewed by others

Process mining: software comparison, trends, and challenges

Academic View: Development of the Process Mining Discipline

Process Mining: A 360 Degree Overview

Keywords

1 Introduction

2 Process Mining Framework

2.1 Process Mining Tools

2.2 The XES Standard

3 Agenda

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

From Zero to Hero: A Process Mining Tutorial

Abstract

Similar content being viewed by others

Process mining: software comparison, trends, and challenges

Academic View: Development of the Process Mining Discipline

Process Mining: A 360 Degree Overview

Keywords

1 Introduction

2 Process Mining Framework

2.1 Process Mining Tools

2.2 The XES Standard

3 Agenda

Notes

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation