Keywords

1 Introduction and Motivation

What is the particular challenge offered by natural data, which could suggest the need of topology, and in particular of persistence? Simply said, it’s quality instead of quantity. This is especially evident with images.

If one has to analyze, classify, retrieve images of mechanical pieces, vehicles, rigid objects, then geometry fulfills all needs. On the images themselves, matrix theory provides the transformations for superimposing a picture to a template. More often, pictures are represented by feature vectors, whose components are geometric measures (shape descriptors). Then recognition, defect detection, retrieval etc. can be performed on the feature vectors.

The scene changes if the depicted objects are of natural origin: the rigidity of geometry becomes an obstacle. Recognizing the resemblance between a sitting and a standing man is difficult. The challenge is even harder when it comes to biomedical data and when the context is essential for the understanding of data [34, 51].

It’s here that topology comes into play: the standing and sitting men are homeomorphic, i.e. there is a topological transformation which superimposes one to the other, whereas no matrix will ever be able to do that. It is generally difficult to discover whether two objects are homeomorphic; then algebraic topology turns helpful: It associates invariants — e.g. Betti numbers — to topological spaces, such that objects which are homeomorphic have identical invariants (the converse does not hold, unfortunately).

(Algebraic) topology seems then to be the right environment for formalizing qualitative aspects in a computable way, as is nicely expressed in [35, Sect. 5.1]. There is a problem: if geometry is too rigid, topology is too free. This is the reason why persistent topology can offer new topological descriptors (e.g. Persistent Betti Numbers, Persistence Diagrams) which preserve some selected geometric features through filtering functions. Classical references on persistence are [8, 12, 22, 50].

Persistent topology has been experimented in the image context, particularly in the biomedical domain, but also in fields where data are not pictures, e.g. in geology, music and linguistics, as will be shown in this survey.

2 Glossary and Basic Notions

It is out of the scope of this survey to give a working introduction to homology and persistence; we limit ourselves to an intuitive description of the concepts, and recommend to profit of the technical references, without which a real understanding of the results is impossible. An essential (and avoidable) technical description of a particular homology is reported in Sect. 2.1.

Homology. There is a well-structured way (technically a set of functors) to associate homology vector spaces (more generally modules) \(H_k(X)\) to a simplicial complex or to a topological space X, and linear transformations to maps [33, Chap. 2] and [23, Chap. 4].

Betti numbers. The k-th Betti number \(\beta _k(X)\) is the dimension of the k-th homology vector space \(H_k(X)\), i.e. the number of independent generators (homology classes of k-cycles) of this space. Intuitively, \(\beta _0(X)\) counts the number of path-connected components (i.e. the separate pieces) of which X is composed; \(\beta _1(X)\) counts the holes of the type of a circle (like the one of a doughnut); \(\beta _2(X)\) counts the 2-dimensional voids (like the ones of gruyere or of an air chamber).

Homeomorphism. Given topological spaces X and Y, a homeomorphism from X to Y is a continuous map with continuous inverse. If one exists, the two spaces are said to be homeomorphic. This is the typical equivalence relation between topological spaces. Homology vector spaces and Betti numbers are invariant under homeomorphisms.

Remark 1

As hinted in the Introduction, geometry is too rigid, but topology is too free. In particular, homeomorphic spaces can be very different from an intuitive viewpoint: the joke by which “for a topologist a mug and a doughnut are the same” is actually true; the two objects are homeomorphic! Persistent topology then tries to overcome this difficulty by studying not just topological spaces but pairs, once called size pairs, (Xf) where f is generally a continuous function, called measuring or filtering function, from X to \(\mathbb {R}\) (to \(\mathbb {R}^n\) in multidimensional persistence) which conveys the idea of shape, the viewpoint of the observer. Shape similarity is actually very much dependent on the context. The Betti numbers of the sublevel sets then make it possible to distinguish the two objects although they are homeomorphic: see Fig. 1.

Fig. 1.
figure 1

Sublevel sets of mug and doughnut.

Sublevel Sets. Given a pair (Xf), with \(f:X \rightarrow \mathbb {R}\) continuous, given \(u \in \mathbb {R}\), the sublevel set under u is the set \(X_u = \{x\in X \,| \, f(x) \le u\}\).

Persistent Betti Numbers. For all \(u, v \in \mathbb {R},\ \ \ u<v\), the inclusion map \(\iota ^{u, v}: X_u \rightarrow X_v\) is continuous and induces, at each degree k, a linear transformation \(\iota ^{u, v}_*: H_k(X_u) \rightarrow H_k(X_v)\). The k-Persistent Betti Number (k-PBN) function assigns to the pair (uv) the number dim Im \(\iota ^{u, v}_*\), i.e. the number of classes of k-cycles of \(H_k(X_u)\) which “survive” in \(H_k(X_v)\). See Fig. 2 (left) for the 1-PBN functions of mug and doughnut. Note that a pitcher, and more generally any open container with a handle, will have very similar PBNs to the ones of the mug; this is precisely what we want for a functional search and not for a strictly geometrical one.

Fig. 2.
figure 2

From left to right: 1-PBN functions of mug and of doughnut, 1-PDs of mug and of doughnut.

Persistence Diagrams. The k-PBN functions are wholly determined by the position of some discontinuity points and lines, called cornerpoints and cornerlines (or cornerpoints at infinity) The coordinates (uv) of a cornerpoint represent the levels of “birth” and “death” respectively of a generator; the abscissa of a cornerline is the level of birth of a generator which never dies. The persistence of a cornerpoint is the difference \(v-u\) of its coordinates. Cornerpoints and cornerlines form the k-Persistence Diagram (k-PD). Figure 2 (right) depicts the 1-PDs of mug and of doughnut. For the sake of simplicity, we are here neglecting the fact that cornerpoints and cornerlines may have multiplicities.

Remark 2

Sometimes it is important to distinguish even objects for which there exists a rigid movement superimposing one to the other — so also geometrically equivalent — as in the case of some letters: context may be essential! See Fig. 3, where ordinate plays the role of filtering function.

Fig. 3.
figure 3

Above: the objects “M” and “W”. Below, from left to right: 0-PBN functions of M and of W, 0-PDs of M and of W.

Matching distance. Given the k-PDs \(\mathcal{D}_{X,f}, \mathcal{D}_{Y,g}\) of two pairs (Xf), (Yg), match the cornerpoints of \(\mathcal{D}_{X,f}\) either with cornerpoints of \(\mathcal{D}_{Y,g}\) or with their own projections on the diagonal \(u=v\); the weight of this matching is the sup of the \(L_\infty \)-distances of matching points. The matching distance (or bottleneck distance) of \(\mathcal{D}_{X,f}\) and \(\mathcal{D}_{Y,g}\) is the inf of such weights among all possible such matchings.

Natural pseudodistance. Given two pairs (Xf), (Yg), with XY homeomorphic, the weight of a given homeomorphism \(\varphi :X \rightarrow Y\) is sup\(_{x\in X}|g(\varphi (x))-f(x)|\). The natural pseudodistance of (Xf) and (Yg) is the inf of these weights among all possible homeomorphisms. If we are given the k-PDs of the two pairs, their matching distance is a lower bound for the natural pseudodistance of the two pairs, and it is the best possible obtainable from the two k-PDs. Much is known on this dissimilarity measure [19,20,21].

2.1 A Brief Technical Description of Homology

There are several homologies. The classical and most descriptive one, at least for compact spaces, is singular homology with coefficients in \(\mathbb {Z}\); we refer to [33, Chap. 2] for a thorough exposition of it. Anyway, the homology used in most applications is the simplicial one, of which (with coefficients in \(\mathbb {Z}_2\)) we now give a very short introduction following [23, Chap. 4].

Simplices. A p-simplex \(\sigma \) is the convex hull, in a Euclidean space, of a set of \(p+1\) points, called vertices of the simplex, not contained in a Euclidean \((p-1)\)-dimensional subspace; the simplex is said to be generated by its vertices. A face of a simplex \(\sigma \) is the simplex generated by a nonempty set of vertices of \(\sigma \).

Simplicial complexes. A finite collection K of simplices of a given Euclidean space is a simplicial complex if (1) for any \(\sigma \in K\), all faces of \(\sigma \) belong to K, (2) the intersection of two simplices of K is either empty or a common face. The space of the complex K is the topological subspace of Euclidean space |K| formed by the union of all simplices of K.

Simplicial homology with \(\mathbb {Z}_2\) coefficients. Given a (finite) simplicial complex K, call p-chain any formal linear combination of p-simplices with coefficients in \(\mathbb {Z}_2\) (i.e. either 1 or 0, with \(1+1=0\)). p-chains form a \(\mathbb {Z}_2\)-vector space \(C_p\). Note that each p-chain actually identifies a set of p-simplices of K and that the sum of two p-chains is just the symmetric difference (Xor) of the corresponding sets. We now introduce a linear transformation \(\partial _p: C_p \rightarrow C_{p-1}\) (called boundary operator) for any \(p\in \mathbb {Z}\). We just need to define it on generators, i.e. on p-simplices, and then extend by linearity. Writing \(\sigma = [u_0, u_1, \ldots , u_p]\), we denote by \([u_0, \ldots , \hat{u}_j, \ldots , u_p]\) its face generated by all of its vertices except \(u_j\) (\(j=0, \ldots , p\)). Then we define

$$\partial _p(\sigma ) = \sum _{j=0}^n[u_0, \ldots , \hat{u}_j, \ldots , u_p]$$
Fig. 4.
figure 4

Cycles.

It is possible to prove that \(\partial _{p}\partial _{p+1} = 0\), so that \(B_p=\) Im\(\partial _{p+1}\) is contained in \(Z_p=\) Ker\(\partial _p\). Elements of \(B_p\) are called p-boundaries; elements of \(Z_p\) are called p-cycles. The p-homology vector space is defined as the quotient \(H_p(K) = Z_p/B_p\). Homology classes are represented by cycles which are not boundaries. Two cycles are homologous is their difference is a boundary. In Fig. 4, representing the simplicial complex K formed by the shaded triangles and their faces, the blue chain b is a 1-cycle which is also a boundary; the red chain c and the green one \(c'\) are 1-cycles which are not boundaries; c and \(c'\) are homologous.

3 State-of-the-Art

The application of persistence to shape analysis and classification has a long story, since it started in the 90’s when it still had the name of Size Theory [50]. In the last few years it has taken various, very interesting forms. The constant aspect is always the presence of qualitative features which are difficult to capture and formalize within other frames of mind.

3.1 Leukocytes

Leukocytes, or white blood cells, belong to five different classes: lymphocyte; monocyte; neutrophile, eosinophile, basophile granulocytes. Eosinophile and neutrophile granulocytes are generally difficult to be distinguished, so they were considered in a single classification class in an early research by the Bologna team [26].

Fig. 5.
figure 5

A radius along which the three filtering functions are computed.

Fig. 6.
figure 6

Persistent Betti Number functions relative to the sum of grey tones (different colors represent different values).

As a space, the boundary of the starlike hull of the cell is assumed. The images are converted to grey tones.

Three filtering functions are put to work, all computed along radii from the center of mass of the cell (Fig. 5):

  • Sum of grey tones

  • Maximum variation

  • Sum of variations pixel to pixel.

Classification (with very good hit ratios for that time) is performed by measuring distance from the average PBN function of each class.

3.2 Handwritten Letters and Monograms

Again in Bologna we faced recognition of handwritten letters with time information; our goal was to recognize both the alphabet letter and the writer [25].

The space on which the filtering functions are defined is the time interval of the writing. The filtering functions are computed in the 3D “plane-time”:

  • Distance of points from the letter axis

  • Speed

  • Curvature

  • Torsion

  • Distance from center of mass (in plane projection).

Fig. 7.
figure 7

A monogram with its outline (above) and the directions along which the filtering functions are computed (below).

Classification comes from fuzzy characteristic functions, obtained from normalized inverse of distance. Cooperation of the characteristic functions coming from the single filtering functions is given by their rough arithmetic average.

A later experiment, which was even repeated live at a conference, concerned the recognition of monograms for personal identification, without time information [24].

Two topological spaces are used. The first is the outline of the monogram and the filtering function is the distance from the center of mass (see upper Fig. 7).

The second space is a horizontal segment placed at the base of the monogram image. Filtering functions:

  • Number of black pixels along segments (3 directions) (see lower Fig. 7)

  • Number of pixel-pixel black-white jumps (3 directions).

Classification is performed by a weighted average of fuzzy characteristic functions.

3.3 Sign Alphabet

Automatic recognition of the symbols expressed by the hands in the sign language is a task which was of interest for different teams. The first one was the group led by Alessandro Verri in Genova [49]. The signs were performed with a white glove on a black background; translation into common letters was done in real time in a live demo at a conference.

The domain space is a horizontal segment; the filtering functions assign to each point of the segment the maximum distance of a contour point within a strip of fixed width, with 24 different strip orientations.

Fig. 8.
figure 8

Four filtering functions and the corresponding 0-Persistent Betti Number functions.

The choice of S. Wang in Sherbrooke, instead, is to use a part of the contour, determined by principal component analysis, as a domain and distance from center of mass as filtering function [32].

The team of D.Kelly in Maynooth uses the whole contour as domain, and distances from four lines as filtering functions [36] (see Fig. 8).

3.4 Human Gait

Personal identification and surveillance are the aim of a research by the Cuban team of L. Lamar-León, together with the Sevilla group of computational topology [37].

Fig. 9.
figure 9

Four filtering functions on silhouette stacks for gait identification.

Considering a stack of silhouettes as a 3D object, and using four different filtering functions, makes 0- and 1-degree persistent homology a tool for identifying people through their gait (Fig. 9).

3.5 Tropical Cyclones

S. Banerjee in Kolkata makes use of persistence on sequences of satellite images of cloud systems (Fig. 10), in order to evaluate risk and intensity of forming hurricanes [2].

Fig. 10.
figure 10

Time evolution of cyclones.

Time interval is the domain of two filtering functions which are common characteristic measures of cyclones:

  • Central Feature portion

  • Outer Banding Feature

3.6 Galaxies

Again S. Banerjee [3] applies similar methods to another type of spirals: galaxies.

Various filtering functions are used. One is defined as a function of distance from galaxy center, and is the ratio between major and minor axis of the corresponding isophote. Another one is a “pitch” parameter defined by Ringermacher and Mead [45]. A third filtering function is a compound based on color.

The classification results agree with the literature.

3.7 Bones

In [48] a powerful construction (the Persistent Homology Transform) is introduced. It consists in gathering the “height” filtering functions according to all possible directions. The paper shows that the transform is injective for objects homeomorphic to spheres. By using the transform it is possible to define an effective distance between surfaces. An application is shown by classifying heel bones of different species; the comparison with the ground truth produced by using placement of landmarks on the surfaces is very good.

3.8 Melanocytic Lesions

A very important part of natural shape analysis is the detection of malignant cells and lesions, since there generally are no templates for them. As far as we know, the first attempt through persistence (called size theory at that time) is the ADAM EU Project, by the Bologna team together with CINECA and with I. Stanganelli, a dermatologist of the Romagna Oncology Institute [17, 27, 47]. The analysis is mainly based on asymmetry of boundary, masses and color distribution: the lesion is split into two halves by 45 equally spaced lines, and the difference between the two halves is measured by the matching distance of the corresponding Persistence Diagrams.

Fig. 11.
figure 11

One of the 45 splittings of a melanocytic lesion, and the whole A-curve corresponding to the filtering function luminance.

The three functions (A-curves) relating these distances to the splitting line angles give parameters which are then fed into a Support Vector Machine classifier.

The same team is presently involved with a biomedical firm in the realization of a machine for smart retrieval of dermatological images [28].

3.9 Tumor Mouth Cells

A morphological classification of normal and tumor cells of the epithelial tissue of the mouth is proposed in [40, 41]: the filtering function is distance from the center of mass; the discrimination is statistically based on the distribution of cornerpoints (see Fig. 12).

Fig. 12.
figure 12

Distribution of cornerpoints in the diagrams of normal and tumor mouth cells.

3.10 Hepatic Lesions

The advantages of a multidimensional range for the filtering functions are shown in [1], where several classification experiments are performed on the images of hepatic cells (see Fig. 13). The domain space is the part of image occupied by the lesion; the two components of the filtering function are the greyscale of each pixel and the distance from the lesion boundary.

Fig. 13.
figure 13

Various types of hepatic lesions.

3.11 Genetic Pathways

So far we have seen applications of persistence to images of natural origin. But the modularity of the method opens the possibility to deal with data of very different nature. A first example is given by [43], where persistence is used on the Vietoris-Rips complex in a space where points are complex phenotypes related together by the Jaccard distance. This made it possible to find systematic associations among metabolic syndrome variates that show distinctive genetic association profiles.

3.12 Oil and Gas Reservoirs

Researchers in Ufa and Novosibirsk need to get a reliable geological and hydrodynamical model of gas and oil reservoirs out of noisy data; the model has to be robust under small perturbations. The authors have found an answer in persistent 0-, 1- and 2-cycles. The domain space is the 3D reservoir bed, and the filtering function is permeability, obtained as a decreasing function of radioactivity [4] (Russian; translated and completed in this same volume).

3.13 Brain Connections

A complex research on brain connections and their modification under the assumption of a psychoactive substance (psilocybine) is performed in [42] and extended in [39]. The construction starts with a complete graph whose vertices are cortical or subcortical regions; these, and their functional connectivity (expressed as weights on the edges) come from an elaborate processing of functional MRI data. Then the simplicial complex is built, whose simplices are the cliques (complete subgraphs) of the graph.

The filtering function on each simplex is minus the highest weight of its building edges. A difference between treated and control subjects already appears in the comparison of the 1-Persistence Diagrams (see Fig. 14). Then more information is obtained from secondary graphs (called homological scaffolds), whose vertices are the homology generators weighted by their persistence.

Fig. 14.
figure 14

Probability densities for \(H_1\) generators: placebo (left) and psilocybin (right) treated.

There are other applications of persistence to brain research: evaluation of cortical thickness in autism [16]; study of unexpected connections between subcortex, frontal cortex and parietal cortex in the form of 1- and 2-dimensional persistent cycles [31, 46].

3.14 Music

Among other mathematical applications to music, M.G. Bergomi in Lisbon collaborates with various researchers in exploring musical genres by persistence [6]. As a space they adopt a modified version of Euler’s Tonnetz [9]. The filtering function is the total duration of each note in a given track. Classification can be performed at different detail levels: experimentation is reported on tonal and atonal classical music of several authors (an example is in Fig. 15), on pop music and on different interpretation of the same jazz piece.

Fig. 15.
figure 15

0- and 1-persistence diagrams for three classical pieces.

A blend of persistence and deep learning is the central idea of a research by the team of I.-H. Yang in Taiwan [38]. They input audio signals to a Convolutional Neural Network (CNN); after a first convolution layer, a middle layer processes the output of the first in two different complementary ways: one is a classical CNN; the other computes the persistence landscape (an information piece derivable from the persistence diagram [10]) of the same output. Whereas the persistence layer by itself does not perform any better than the normal CNN, their combination gives very good results in terms of music tagging.

3.15 Languages

An interdisciplinary team at Caltech investigates the metric spaces built by 79 Indo-European and 49 Niger-Congo languages [44]. These appear as points in a Euclidean space of syntactic parameters; on them a Vietoris-Rips complex [23, Sect. III.2] is built and Euclidean distance is assumed as filtering function. The Indo-European family reveals one 1-dimensional and two 0-dimensonal persistent cycles, the Niger-Congo respectively none and one. The interpretation of these differences and of the link with phylogenetic and historical facts is still under way.

4 Open Problems

There is a number of open problems in persistence, whose solution will affect applications to natural data analysis, and to which only partial answers have been given so far:

  • Optimal choice of the foliations along which to perform the 1D reduction of multidimensional persistence [13]

  • Study of the discontinuities in multidimensional persistence [11, 15]

  • Understanding the monodromy around multiple cornerpoints [14]

  • Restricting the group of homeomorphisms of interest by considering the invariance required by the observer [29]

  • Modulation of the impact of different filtering functions for search engines with relevance feedback [30]

  • Use of advanced tools of algebraic topology [5]

  • Use of persistence in the wider context of concrete categories, not necessarily passing through homology of complexes or of topological spaces [7].

5 Future Outlook

There are at least two ways in which persistence will interact with machine learning, and this is likely to enormously boost the qualitative processing of natural data [18]:

  • Feeding a neural network with Persistence Diagrams instead of raw data will convey the needs and viewpoints of the user

  • Deep learning might yield a quantum leap in persistence, by automatically finding the best filtering functions for a given problem.