1 Introduction

Overview. Optical Coherence Tomography (OCT) is a non-invasive imaging technique that measures the intensity response of backscattered light from millimeter penetration depths. We focus specifically on the application of OCT in ophthalmology for the acquisition of high-resolution volume scans of the human retina. These scans provide in vivo information about retinal tissue structure and thereby about the functioning of the human eye. OCT devices record multiple two-dimensional B-scans in rapid succession and combine them into a single volume in a subsequent alignment step. Taking an OCT scan requires only minutes and can help detect symptoms of pathological conditions such as glaucoma, diabetes, multiple sclerosis or age-related macular degeneration.

The relative ease of data acquisition also makes it possible to use multiple OCT volume scans of a single patient over time to track the progression of a pathology or to quantify the success of therapeutic treatment. To better leverage the availability of raw OCT data in both clinical settings and empirical studies, much work has focused on the automatic extraction of relevant information, in particular automatic cell layer segmentation, detection of fluid and reconstruction of vascular structures. The difficulty of these tasks lies in the challenging signal-to-noise ratio, which is influenced by multiple factors including mechanical eye movement during acquisition and the presence of speckle.

Related Work. Effective segmentation of OCT volumes is a very active area of research. Several methods for segmenting the human retina which rely on graphical models were proposed in [11, 12, 16] and [2]. To increase robustness, the retina segmentation approaches proposed in [23] and [9] employ shape priors in the form of soft constraints. In [19], Rathke first introduced a parallelizable segmentation method based on probabilistic graphical models with a global low-rank shape prior. Variational approaches given in [8, 26] and [18] model retina layers by zero level sets, with properly chosen functionals including soft constraints. Much recent work has focused on the use of deep learning to address the task of cell layer segmentation in a purely data-driven way. The methods presented in [17, 21] rely on the U-net architecture [20], which yields good predictive performance in settings with limited availability of training data. To enforce the global order of cell layers along a spatial axis and to add further regularization, local predictions have been tied together through graph-based methods [10] or through a second machine learning component [14]. However, if global context is already used during feature extraction, the risk of overfitting remains and unseen pathologies may lead to unpredictable behavior.

Approach. Our segmentation approach is a smooth image labeling algorithm based on geometric numerical integration on an elementary statistical manifold. It can work with input data from any metric space, which makes it agnostic to the choice of feature extraction and suitable as a plug-in replacement in diverse pipelines. In addition to respecting the natural order of cell layers, our segmentation process exhibits a high degree of built-in parallelism, so that modern graphics acceleration hardware can easily be leveraged. We evaluate the effectiveness of our novel approach for a selection of input features ranging from traditional covariance descriptors to convolutional neural networks.

Contribution. We propose a geometric assignment approach that extends the approach introduced by [4] to retinal layer segmentation with the following novel characteristics:

  1. (i) By leveraging a continuous characterization of layer ordering, our method simultaneously performs local regularization and incorporates the global topological ordering constraint in a single smooth labeling process. The segmentation is computed from a distance matrix containing the pairwise distances between the data observed at each voxel and prototypical data for each layer in some feature space. This highlights the ability to extract features from raw OCT data in a variety of different ways and to use the proposed segmentation as a plug-in replacement for other graph-based methods.

  2. (ii) Computationally fast and high-quality cell layer segmentations of OCT volumes are obtained using only local features for each voxel. This is in contrast to competing deep learning approaches, which commonly use information from an entire B-scan as input. In addition, the exclusive use of local features mitigates the bias introduced by limited training data and enables the incorporation of three-dimensional information without compromising runtime scalability.

  3. (iii) The highly parallelizable segmentation approach with global voxel interaction enables robust cell layer segmentation of entire OCT volumes without using any prior knowledge other than local regularity and the order of cell layers. In particular, no global shape prior is used, in contrast to segmentation methods relying on graphical models, e.g. [19]. Figure 1 shows a typical result obtained with our novel approach after segmenting healthy retina tissue, with the layer labels specified in Fig. 2.

Organization. Our paper is organized as follows. The assignment flow approach is briefly summarized in Sect. 2 and extended in Sect. 4 in order to take into account the order of layers as a global constraint. In Sect. 3, we consider the Riemannian manifold \(\mathcal {P}_{d}\) of positive definite matrices as a suitable feature space for local OCT data descriptors. The resulting features are subsequently compared to local features extracted by a convolutional network in Sect. 5. The performance of our novel approach for OCT segmentation is evaluated in Sect. 5 and compared to the state-of-the-art method given in [16].

Fig. 1.

From left to right: 3D OCT volume scan of dimension \(512 \times 512 \times 256\) of a healthy human retina with ambiguous locations of layer boundaries; the resulting segmentation into 11 layers, illustrating the order-preserving labeling of the proposed approach; illustration of the boundary surfaces between different segmented cell layers.

2 Assignment Flow

We summarize the assignment flow approach introduced by [4] and refer to the recent survey [22] for more background and a review of recent related work.

Assignment Manifold. Let \((\mathcal {F},d_{\mathcal {F}})\) be a metric space and \(\mathcal {F}_{n} = \{f_{i} \in \mathcal {F} :i \in \mathcal {I}\},\,|\mathcal {I}|=n\) given data. Assume that a predefined set of prototypes \(\mathcal {F}_{*} = \{f^{*}_{j} \in \mathcal {F} :j \in \mathcal {J}\},\, |\mathcal {J}|=c\) is given. Data labeling denotes assignments \(j \rightarrow i,\; f_{j}^{*} \rightarrow f_{i}\) to be determined in a spatially regularized fashion. The assignments at each pixel \(i\in \mathcal {I}\) are encoded by assignment vectors \(W_{i} = (W_{i1},\dotsc ,W_{ic})^{\top } \in \mathcal {S}\) in the relative interior \(\mathcal {S}={{\,\mathrm{rint}\,}}\varDelta _{c}\) of the probability simplex, which becomes a Riemannian manifold \((\mathcal {S},g)\) when endowed with the Fisher-Rao metric g from information geometry. The assignment manifold \((\mathcal {W},g)\), \(\mathcal {W}=\mathcal {S}\times \cdots \times \mathcal {S}\) (\(n=|\mathcal {I}|\) factors), is the product manifold whose points encode the label assignments at all pixels.

Assignment Flow. Based on the given data and prototypes, the distance vector field \(D_{\mathcal {F};i} = \big (d_{\mathcal {F}}(f_{i},f_{1}^{*}),\dotsc ,d_{\mathcal {F}}(f_{i},f_{c}^{*})\big )^{\top },\; i \in \mathcal {I}\) is well defined. This data representation is lifted to the assignment manifold by the likelihood map and the likelihood vectors, respectively,

$$\begin{aligned} L_{i} :\mathcal {S} \rightarrow \mathcal {S},\quad L_{i}(W_{i}) = \frac{W_{i} e^{-\frac{1}{\rho } D_{\mathcal {F};i}}}{\langle W_{i},e^{-\frac{1}{\rho } D_{\mathcal {F};i}} \rangle },\quad i \in \mathcal {I}. \end{aligned}$$
(2.1)

This map is based on the affine e-connection of information geometry, and the scaling parameter \(\rho > 0\) is used for normalizing the a priori unknown scale of the components of \(D_{\mathcal {F};i}\), which depends on the specific application at hand. The likelihood vectors are spatially regularized by the similarity map and the similarity vectors, respectively,

$$\begin{aligned} S_{i} :\mathcal {W} \rightarrow \mathcal {S},\quad S_{i}(W) = {{\,\mathrm{Exp}\,}}_{W_{i}}\Big (\sum _{k \in \mathcal {N}_{i}} w_{ik} {{\,\mathrm{Exp}\,}}_{W_{i}}^{-1}\big (L_{k}(W_{k})\big )\Big ),\quad i \in \mathcal {I}, \end{aligned}$$
(2.2)

where \({{\,\mathrm{Exp}\,}}_{p}(v) = \frac{p e^{v/p}}{\langle p,e^{v/p}\rangle }\) is the exponential map corresponding to the e-connection, and the positive weights \(w_{ik}\), \(k\in \mathcal {N}_{i}\), which sum up to 1 on every patch around pixel i indexed by \(\mathcal {N}_{i}\), determine the regularization properties.

The assignment flow is induced on the assignment manifold \(\mathcal {W}\) by the locally coupled system of nonlinear ODEs

$$\begin{aligned} \dot{W}_{i} = R_{W_{i}} S_{i}(W),\qquad W_{i}(0)=\mathbbm {1}_{\mathcal {S}},\quad i \in \mathcal {I}, \end{aligned}$$
(2.3)

where the map \(R_{p}={{\,\mathrm{Diag}\,}}(p)-p p^{\top }\), \(p\in \mathcal {S}\), turns the right-hand side into a tangent vector field, \(\mathbbm {1}_{\mathcal {S}}\) denotes the barycenter of \(\mathcal {S}\) and, accordingly, \(\mathbbm {1}_{\mathcal {W}} \in \mathcal {W}\) denotes the barycenter of the assignment manifold \(\mathcal {W}\). The solution \(W(t)\in \mathcal {W}\) is numerically computed by geometric integration [27] and determines a labeling W(T) for sufficiently large T after a trivial rounding operation. Convergence and stability of the assignment flow have been studied in [28].
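The following numpy sketch (our illustration under simplifying assumptions, not the integrator of [27]) implements the likelihood map (2.1), the similarity map (2.2) with uniform weights \(w_{ik}=1/|\mathcal {N}_{i}|\), and geometric Euler steps for (2.3) on a toy one-dimensional problem; the step size, \(\rho \) and the neighborhood structure are placeholder choices.

```python
import numpy as np

def lift(W, Z):
    """Row-wise lifting map: exp_{W_i}(z) = W_i * e^z / <W_i, e^z>."""
    U = W * np.exp(Z)
    return U / U.sum(axis=1, keepdims=True)

def likelihood(W, D, rho):
    """Likelihood map (2.1): lift the scaled negative distance vectors."""
    return lift(W, -D / rho)

def exp_map(p, v):
    """Exponential map Exp_p(v) of the e-connection, cf. (2.2)."""
    q = p * np.exp(v / p)
    return q / q.sum()

def exp_map_inv(p, q):
    """Inverse map Exp_p^{-1}(q) onto the tangent space at p."""
    r = np.log(q / p)
    return p * (r - np.dot(p, r))

def similarity(W, L, neighbors):
    """Similarity map (2.2), here with uniform weights w_ik = 1/|N_i|."""
    S = np.empty_like(W)
    for i, Ni in enumerate(neighbors):
        v = np.mean([exp_map_inv(W[i], L[k]) for k in Ni], axis=0)
        S[i] = exp_map(W[i], v)
    return S

def assignment_flow(D, neighbors, rho=0.1, h=0.5, steps=50):
    """Geometric Euler steps W <- exp_W(h * S(W)) for the flow (2.3)."""
    n, c = D.shape
    W = np.full((n, c), 1.0 / c)             # barycenter initialization
    for _ in range(steps):
        W = lift(W, h * similarity(W, likelihood(W, D, rho), neighbors))
    return W.argmax(axis=1)                  # trivial rounding to labels

# toy example: 6 pixels on a chain, 3 labels, random distance matrix
rng = np.random.default_rng(0)
D = rng.random((6, 3))
neighbors = [[max(i - 1, 0), i, min(i + 1, 5)] for i in range(6)]
print(assignment_flow(D, neighbors))
```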

3 OCT Data Representation by Covariance Descriptors

In this section, we briefly sketch the basic geometric notions for representing OCT data in terms of covariance descriptors \(f_{i}\in \mathcal {F}_{n}\) [25] and identify the metric data space \((\mathcal {F},d_{\mathcal {F}})\) underlying (2.1).

The Manifold \(\mathcal {P}_{d}\). The Riemannian manifold \((\mathcal {P}_{d},g)\) of positive definite matrices has dimension \(\frac{d(d+1)}{2}\); the manifold and the Riemannian metric are given by

$$\begin{aligned} \mathcal {P}_{d}&= \{S \in \mathbb {R}^{d \times d} :S = S^{\top },\; S \succ 0 \},\end{aligned}$$
(3.1a)
$$\begin{aligned} g_{S}(U,V)&= {{\,\mathrm{tr}\,}}(S^{-1} U S^{-1} V),\qquad U,V \in T_{S}\mathcal {P}_{d} = \{U \in \mathbb {R}^{d \times d} :U = U^{\top }\}. \end{aligned}$$
(3.1b)

The Riemannian distance is given by

$$\begin{aligned} d_{\mathcal {P}_{d}}(S,T) = \Big (\sum _{i \in [d]}\big (\log \lambda _{i}(S,T)\big )^{2}\Big )^{1/2}, \end{aligned}$$
(3.2)

where \(\lambda _{i}(S,T)\), \(i\in [d]\), denote the generalized eigenvalues of the pair (S, T), whereas the globally defined exponential map reads

$$\begin{aligned} \exp _{S}(U) = S^{\frac{1}{2}}{{\,\mathrm{expm}\,}}(S^{-\frac{1}{2}}US^{-\frac{1}{2}})S^{\frac{1}{2}}, \end{aligned}$$
(3.3)

with \({{\,\mathrm{expm}\,}}(\cdot )\) denoting the matrix exponential. Given a smooth objective function \(J :\mathcal {P}_{d} \rightarrow \mathbb {R}\), the Riemannian gradient is given by

$$\begin{aligned} {{\,\mathrm{grad}\,}}J(S) = S\big (\partial J(S)\big )S \in T_{S}\mathcal {P}_{d}, \end{aligned}$$
(3.4)

where the symmetric matrix \(\partial J(S)\) is the Euclidean gradient of J at S.
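As a concrete illustration (our sketch, not part of the original pipeline), the distance (3.2) and the conversion (3.4) can be realized with standard routines, reading \(\lambda _{i}(S,T)\) as the generalized eigenvalues of the pair (S, T):

```python
import numpy as np
from scipy.linalg import eigh

def spd_distance(S, T):
    """Riemannian distance (3.2): root of the summed squared logs of the
    generalized eigenvalues lambda_i(S, T)."""
    lam = eigh(S, T, eigvals_only=True)       # solves S v = lam T v
    return np.sqrt(np.sum(np.log(lam) ** 2))

def riemannian_gradient(S, euclidean_grad):
    """Riemannian gradient (3.4): grad J(S) = S dJ(S) S."""
    return S @ euclidean_grad @ S

# sanity check: the distance is invariant under congruence S -> M S M^T
rng = np.random.default_rng(1)
M = rng.random((3, 3))
S = M @ M.T + np.eye(3)
T = np.eye(3)
print(spd_distance(S, T), spd_distance(M @ S @ M.T, M @ T @ M.T))
```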

Region Covariance Descriptors. To apply the introduced geometric framework, we model each OCT volume by a mapping \(I:\mathcal {D} \rightarrow \mathbb {R}_+\) where \(\mathcal {D} \subset \mathbb {R}^3\) is an underlying spatial domain.

To each voxel \(v \in \mathcal {D}\), we associate a local feature vector f(v) via the map \(f :\mathcal {D} \rightarrow \mathbb {R}^{10}\),

$$\begin{aligned} f(v) := ( I(v), \nabla _x I(v), \nabla _y I(v), \nabla _z I(v), \sqrt{2}\nabla _{xy} I(v), \dots , \nabla _{zz} I(v) )^\top \end{aligned}$$
(3.5)

assembled from the raw intensity value I(v) as well as first- and second-order responses of derivative filters capturing information from larger scales, following [13]. By introducing a suitable geometric graph spanning \(\mathcal {D}\), we can associate a neighborhood \(\mathcal {N}(i)\) of fixed size with each voxel \(i\in \mathcal {I}\), as in (2.2). For each neighborhood, we define the regularized region covariance descriptor \(S_i\) as

$$\begin{aligned} S_i := \sum _{j \in \mathcal {N}(i)}\theta _{ij} (f_j-\overline{f_i})(f_j-\overline{f_i})^T + \epsilon I, \quad \overline{f_i} = \sum _{k \in \mathcal {N}(i)}\theta _{ik}f_k, \end{aligned}$$
(3.6)

as a weighted empirical covariance matrix with respect to the feature vectors \(f_{j}\). The small value \(0 < \epsilon \ll 1\) acts as a regularization parameter enforcing positive definiteness of \(S_i\). In the following, we use the shorthand notation \([n] = \{1,\dotsc ,n\}\) for natural numbers n.
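A minimal sketch of how a descriptor (3.6) could be assembled for a single voxel, assuming uniform weights \(\theta _{ij}\) and replacing the derivative filters of [13] by plain finite differences (so only 4 of the 10 feature channels of (3.5) are reproduced):

```python
import numpy as np

def feature_volume(I):
    """Stack the raw intensity with first-order finite differences as a
    simplified stand-in for the feature map (3.5)."""
    gx, gy, gz = np.gradient(I)
    return np.stack([I, gx, gy, gz], axis=-1)        # shape (X, Y, Z, 4)

def covariance_descriptor(F, center, radius=2, eps=1e-6):
    """Regularized region covariance (3.6) over a cubic neighborhood with
    uniform weights theta_ij = 1/|N(i)|."""
    x, y, z = center
    patch = F[x - radius:x + radius + 1,
              y - radius:y + radius + 1,
              z - radius:z + radius + 1].reshape(-1, F.shape[-1])
    diff = patch - patch.mean(axis=0)
    return diff.T @ diff / patch.shape[0] + eps * np.eye(F.shape[-1])

I = np.random.rand(16, 16, 16)
S_i = covariance_descriptor(feature_volume(I), center=(8, 8, 8))
print(np.linalg.eigvalsh(S_i).min() > 0)             # positive definite
```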

Computing Prototypical Covariance Descriptors. Given a set of covariance descriptors

$$\begin{aligned} \mathcal {S}_{N} = \{(S_1,\omega _{1}),\dots , (S_{N},\omega _{N})\} \subset \mathcal {P}_d \end{aligned}$$
(3.7)

together with positive weights \(\omega _{i}\), we next focus on the solution of the problem

$$\begin{aligned} \overline{S} = \arg \min _{S \in \mathcal {P}_{d}} J(S;\mathcal {S}_{N}),\qquad J(S;\mathcal {S}_{N}) = \sum _{i \in [N]} \omega _{i}d_{\mathcal {P}_{d}}^{2}(S,S_{i}), \end{aligned}$$
(3.8)

with the distance \(d_{\mathcal {P}_{d}}\) given by (3.2). From (3.3), we deduce

$$\begin{aligned} U = \exp _{S}^{-1} \circ \exp _{S}(U) = S^{\frac{1}{2}} {{\,\mathrm{logm}\,}}\big (S^{-\frac{1}{2}}\exp _{S}(U)S^{-\frac{1}{2}}\big ) S^{\frac{1}{2}} \end{aligned}$$
(3.9)

with the matrix logarithm \({{\,\mathrm{logm}\,}}={{\,\mathrm{expm}\,}}^{-1}\) [15, Section 11]. Efficient mean retrieval for (3.8) requires evaluating (3.2), which involves a nontrivial matrix decomposition that has to be applied multiple times to every voxel (vertex) of a 3D grid graph. This results in an overall quite expensive approach, in particular for large data sets. We therefore reduce the computational costs by approximating the Riemannian mean using surrogate metrics and distances introduced below.
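To make the computational burden explicit, note that by (3.9) the Riemannian gradient of the objective in (3.8) can be written as (a standard identity for squared-distance objectives, stated here for illustration)

$$\begin{aligned} {{\,\mathrm{grad}\,}}J(S;\mathcal {S}_{N}) = -2\sum _{i\in [N]}\omega _{i}\exp _{S}^{-1}(S_{i}) = -2\sum _{i\in [N]}\omega _{i}\, S^{\frac{1}{2}}{{\,\mathrm{logm}\,}}\big (S^{-\frac{1}{2}}S_{i}S^{-\frac{1}{2}}\big )S^{\frac{1}{2}}, \end{aligned}$$

so that every gradient or fixed-point iteration for (3.8) requires a matrix square root of S and one matrix logarithm per descriptor \(S_{i}\).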

Log-Euclidean Distance and Means. A computationally cheap approach was proposed by [3] (among several other ones). Based on the operations

$$\begin{aligned} S_{1} \odot S_{2}&= {{\,\mathrm{expm}\,}}\big ({{\,\mathrm{logm}\,}}(S_{1})+{{\,\mathrm{logm}\,}}(S_{2})\big ),\end{aligned}$$
(3.10a)
$$\begin{aligned} \lambda \cdot S&= {{\,\mathrm{expm}\,}}\big (\lambda {{\,\mathrm{logm}\,}}(S)\big ), \end{aligned}$$
(3.10b)

the set \((\mathcal {P}_{d},\odot ,\cdot )\) becomes a vector space that is isomorphic, via \({{\,\mathrm{logm}\,}}\), to the vector space of symmetric matrices, with \(\odot \) playing the role of addition. Consequently, the mean of the data \(\mathcal {S}_{N}\) given by (3.7) is defined analogously to the arithmetic mean by

$$\begin{aligned} \overline{S} = {{\,\mathrm{expm}\,}}\Big (\sum _{i\in [N]}\omega _{i}{{\,\mathrm{logm}\,}}(S_{i})\Big ). \end{aligned}$$
(3.11)
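A direct sketch of (3.11), computing the matrix exponential and logarithm via eigendecomposition (our illustration):

```python
import numpy as np

def spd_logm(S):
    """Matrix logarithm of an SPD matrix via eigendecomposition."""
    lam, V = np.linalg.eigh(S)
    return (V * np.log(lam)) @ V.T

def spd_expm(X):
    """Matrix exponential of a symmetric matrix via eigendecomposition."""
    lam, V = np.linalg.eigh(X)
    return (V * np.exp(lam)) @ V.T

def log_euclidean_mean(matrices, weights):
    """Log-Euclidean mean (3.11): expm of the weighted average of logm(S_i)."""
    return spd_expm(sum(w * spd_logm(S) for w, S in zip(weights, matrices)))

S1 = np.array([[2.0, 0.3], [0.3, 1.0]])
S2 = np.array([[1.0, -0.2], [-0.2, 3.0]])
print(log_euclidean_mean([S1, S2], [0.5, 0.5]))
```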

While computing this mean is considerably cheaper than integrating the flow induced by (3.4) for the objective (3.8), the geometry (curved structure) of the manifold \(\mathcal {P}_{d}\) is ignored. Therefore, we additionally consider below another approximation of the Riemannian mean that better respects the underlying geometry but can still be evaluated efficiently.

S-Divergence and Means. For an approximation of the objective function (3.8), we replace the squared Riemannian distance \(d_{\mathcal {P}_{d}}^{2}\) by the Stein divergence proposed by Sra [24],

$$\begin{aligned} D_{s}(S_{1},S_{2}) = \log \det \Big (\frac{S_{1}+S_{2}}{2}\Big ) - \frac{1}{2}\log \det (S_{1} S_{2}),\qquad S_{1},S_{2} \in \mathcal {P}_{d}, \end{aligned}$$
(3.12)

and avoid the involved generalized eigenvalue problem required to evaluate (3.2) by replacing (3.8) with

$$\begin{aligned} \overline{S} = \arg \min _{S\in \mathcal {P}_{d}} J_{s}(S;\mathcal {S}_{N}),\qquad J_{s}(S;\mathcal {S}_{N}) = \sum _{i\in [N]}\omega _{i} D_{s}(S,S_{i}). \end{aligned}$$
(3.13)

We refer to, e.g., [5, 6] for a more complete exposition of divergence functions. The Riemannian gradient flow for this specific problem reads

$$\begin{aligned} \dot{S}&= -{{\,\mathrm{grad}\,}}J_{s}(S;\mathcal {S}_{N}) \overset{(3.4)}{=} -S\, \partial J_{s}(S;\mathcal {S}_{N})\, S \end{aligned}$$
(3.14a)
$$\begin{aligned}&= -\frac{1}{2}\big (S R(S;\mathcal {S}_{N}) S - S\big ),\qquad R(S;\mathcal {S}_{N}) = \sum _{i\in [N]}\omega _{i}\Big (\frac{S+S_{i}}{2}\Big )^{-1}. \end{aligned}$$
(3.14b)

Discretizing the flow using the geometric explicit Euler scheme with step size h yields,

$$\begin{aligned} S_{(t+1)}&= \exp _{S_{(t)}}\big ({-}h {{\,\mathrm{grad}\,}}J_{s}(S_{(t)};\mathcal {S}_{N})\big ) \end{aligned}$$
(3.15a)
$$\begin{aligned}&\overset{(3.3)}{=} S_{(t)}^{\frac{1}{2}} {{\,\mathrm{expm}\,}}\Big (\frac{h}{2}\big (I-S_{(t)}^{\frac{1}{2}}R(S_{(t)};\mathcal {S}_{N})S_{(t)}^{\frac{1}{2}}\big )\Big ) S_{(t)}^{\frac{1}{2}}. \end{aligned}$$
(3.15b)

Using as initial point \(S_{(0)}\) the log-Euclidean mean (3.11) defines the following algorithm that we use for mean retrieval throughout the present paper.

Algorithm 1. Riemannian mean retrieval: initialize \(S_{(0)}\) with the log-Euclidean mean (3.11) and iterate the geometric Euler update (3.15b) until convergence.
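A compact sketch of this mean retrieval scheme, reusing log_euclidean_mean and spd_expm from the sketch after (3.11); the step size h and the fixed iteration budget are assumptions, not taken from the paper:

```python
import numpy as np

def spd_sqrt(S):
    """Square root S^{1/2} of an SPD matrix via eigendecomposition."""
    lam, V = np.linalg.eigh(S)
    return (V * np.sqrt(lam)) @ V.T

def stein_mean(matrices, weights, h=1.0, iters=50):
    """Algorithm 1 sketch: start from the log-Euclidean mean (3.11) and
    iterate the geometric Euler update (3.15b) for the objective (3.13)."""
    S = log_euclidean_mean(matrices, weights)     # see the sketch after (3.11)
    d = S.shape[0]
    for _ in range(iters):
        # R(S; S_N) from (3.14b)
        R = sum(w * np.linalg.inv((S + Si) / 2.0)
                for w, Si in zip(weights, matrices))
        S_half = spd_sqrt(S)
        A = 0.5 * h * (np.eye(d) - S_half @ R @ S_half)
        S = S_half @ spd_expm(A) @ S_half         # update (3.15b)
    return S
```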

4 Ordered Layer Segmentation

In this section, we work out an extension of the assignment flow (Sect. 2) which is able to respect the order of cell layers as a global constraint while remaining in the same smooth geometric setting. In particular, existing schemes for numerical integration still apply to the novel variant.

4.1 Ordering Constraint

With regard to segmenting OCT data volumes, the order of cell layers is crucial prior knowledge. Figure 2 illustrates a schematic OCT volume acquisition with 11 retina layers and 3 separating membranes (ILM, ELM, BM), together with the typical scan notation used throughout the paper. To incorporate this knowledge into the geometric setting of Sect. 2, we require a smooth notion of ordering which allows two probability distributions to be compared. In the following, we assume the prototypes \(f^{*}_{j} \in \mathcal {F}\), \(j \in [c]\), in some feature space \(\mathcal {F}\) to be indexed such that ascending label indices reflect the physiological order of cell layers.

Fig. 2.

OCT volume acquisition: a single A-scan (marked yellow) is recorded along the depth axis. Multiple A-scans taken in rapid succession form a two-dimensional B-scan (marked blue), and the complete OCT volume is obtained by repeating this procedure along the remaining lateral axis. A list of the retina layers and membranes we expect to find in every A-scan is shown on the left. (Color figure online)

Definition 1 (Ordered Assignment Vectors)

A pair of voxel assignments \((w_i, w_j)\in \mathcal {S}^2\), \(i < j\), within a single A-scan is called ordered, if \(w_j - w_i \in K = \{ By:y\in \mathbb {R}^c_+ \}\), which is equivalent to \(Q(w_j - w_i) \in \mathbb {R}^c_+\), with the matrices

$$\begin{aligned} B_{ij} := {\left\{ \begin{array}{ll} -1 \quad &{}\text {if } i = j\\ 1 \quad &{}\text {if } i-j = 1\\ 0 \quad &{}\text {else} \end{array}\right. }, \quad Q_{i,j} = {\left\{ \begin{array}{ll}1 &{} \text {if } i\ge j\\ 0 &{} \text {else}\end{array}\right. } \end{aligned}$$
(4.1)
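For concreteness, the matrices in (4.1) and the ordering test of Definition 1 read as follows (a sketch that follows the definition literally; the example assignment vectors are hypothetical):

```python
import numpy as np

def ordering_matrices(c):
    """Matrices B and Q from (4.1) for c labels."""
    B = -np.eye(c) + np.eye(c, k=-1)     # -1 on the diagonal, 1 on the subdiagonal
    Q = np.tril(np.ones((c, c)))         # Q_ij = 1 if i >= j, else 0
    return B, Q

def is_ordered(w_i, w_j, Q):
    """Definition 1: the pair (w_i, w_j), i < j, is ordered iff all
    components of Q (w_j - w_i) are nonnegative."""
    return bool(np.all(Q @ (w_j - w_i) >= 0))

B, Q = ordering_matrices(c=4)
w_a = np.array([0.7, 0.1, 0.1, 0.1])     # mass concentrated on label 1
w_b = np.array([0.1, 0.1, 0.1, 0.7])     # mass concentrated on label 4
# the test is directional: here exactly one orientation of the pair passes
print(is_ordered(w_a, w_b, Q), is_ordered(w_b, w_a, Q))
```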

4.2 Ordered Assignment Flow

Likelihoods as defined in (2.1) emerge by lifting \(-\frac{1}{\rho }D_{\mathcal {F}}\), regarded as the Euclidean gradient of \(-\frac{1}{\rho }\langle D_{\mathcal {F}}, W \rangle \), to the assignment manifold. Our goal is to encode order preservation into a generalized likelihood matrix \(L_\text {ord}(W)\). To this end, consider the assignment matrix \(W\in \mathcal {S}^N\) of a single A-scan consisting of N voxels. We define the related matrix \(Y(W)\in \mathbb {R}^{N(N-1)\times c}\) with rows indexed by pairs \((i,j)\in [N]^2\), \(i\ne j\), in fixed but arbitrary order. Let the rows of Y be given by

$$\begin{aligned} Y_{(i,j)}(W) = {\left\{ \begin{array}{ll}Q(w_j-w_i) &{} \text {if }i > j\\ Q(w_i-w_j) &{} \text {if }i < j\end{array}\right. }\ . \end{aligned}$$
(4.2)

By construction, an A-scan assignment W is ordered exactly if all entries of the corresponding Y(W) are nonnegative. This makes it possible to express the ordering constraint on a single A-scan in terms of the energy objective

$$\begin{aligned} E_\text {ord}(W) \;= \sum _{(i,j)\in [N]^2,\;i\ne j} \phi (Y_{(i,j)}(W))\ . \end{aligned}$$
(4.3)

where \(\phi :\mathbb {R}^c \rightarrow \mathbb {R}\) denotes a smooth approximation of \(\delta _{\mathbb {R}^c_+}\). In our numerical experiments, we choose

$$\begin{aligned} \phi (y) = \left\langle \gamma \exp \left( -\frac{1}{\gamma } y \right) , \mathbbm {1}\right\rangle \end{aligned}$$
(4.4)

with a constant \(\gamma > 0\). Suppose a full OCT volume assignment matrix \(W\in \mathcal {W}\) is given and denote the set of submatrices for each A-scan by C(W). Then order preserving assignments consistent with given distance data \(D_{\mathcal {F}}\) in the feature space \(\mathcal {F}\) are found by minimizing the energy objective

$$\begin{aligned} E(W) = \langle D_{\mathcal {F}}, W \rangle + \sum _{W_A\in C(W)} E_\text {ord}(W_A)\ . \end{aligned}$$
(4.5)
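A sketch of the ordering penalty (4.3)–(4.4) for a single A-scan, summing over pairs \(i<j\) (i.e., up to the factor 2 from the symmetric double counting in (4.3)) and using the sign convention of Definition 1; the value of \(\gamma \) is a placeholder:

```python
import numpy as np

def phi(y, gamma):
    """Smooth orthant penalty (4.4): small for y >> 0, large for negative y."""
    return gamma * np.sum(np.exp(-y / gamma))

def ordering_energy(W, Q, gamma=0.1):
    """Ordering energy of one A-scan, cf. (4.3); W has one assignment vector
    per voxel (shape (N, c)).  Pairs i < j are penalized whenever components
    of Q (w_j - w_i) become negative, following Definition 1."""
    N = W.shape[0]
    return sum(phi(Q @ (W[j] - W[i]), gamma)
               for i in range(N) for j in range(i + 1, N))

Q = np.tril(np.ones((3, 3)))
W_asc = np.array([[0.8, 0.1, 0.1], [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]])
W_desc = W_asc[::-1]                      # same A-scan, reversed layer order
# the two orientations receive very different penalties
print(ordering_energy(W_asc, Q), ordering_energy(W_desc, Q))
```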

We consequently define the generalized likelihood map by lifting the Euclidean gradient of the energy (4.5), i.e.,

$$\begin{aligned} L_\text {ord}:\mathcal {W}\rightarrow \mathcal {W},\qquad L_{\text {ord},i}(W) = \frac{W_{i}\, e^{-\frac{1}{\rho }\partial _{W_{i}} E(W)}}{\big \langle W_{i},\, e^{-\frac{1}{\rho }\partial _{W_{i}} E(W)}\big \rangle },\quad i \in \mathcal {I}, \end{aligned}$$
(4.6)

which reduces to (2.1) if the ordering terms in (4.5) are omitted,

and specify a corresponding assignment flow variant.

Definition 2 (Ordered Assignment Flow)

The dynamical system

$$\begin{aligned} \dot{W} = R_WS(L_\text {ord}(W)),\qquad W(0) = \mathbbm {1}_\mathcal {W} \end{aligned}$$
(4.7)

evolving on \(\mathcal {W}\) is called the ordered assignment flow.

By applying known numerical schemes [27] for approximately integrating the flow (4.7), we find a class of discrete-time image labeling algorithms which respect the physiological cell layer ordering in OCT data. In Sect. 5, we benchmark the simplest instance of this class, emerging from the choice of geometric Euler integration.
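In the notation of Sect. 2, this simplest instance can be written as the update (stated here for illustration, with step size \(h>0\) and \(\exp _{W}\) denoting the row-wise lifting map \(W_{i}\,e^{Z_{i}}/\langle W_{i},e^{Z_{i}}\rangle \) underlying (2.1))

$$\begin{aligned} W^{(t+1)} = \exp _{W^{(t)}}\Big (h\, S\big (L_\text {ord}(W^{(t)})\big )\Big ),\qquad W^{(0)} = \mathbbm {1}_{\mathcal {W}}, \end{aligned}$$

which coincides with the geometric Euler scheme for (2.3) whenever the ordering terms in (4.5) are dropped.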

5 Experimental Results and Discussion

OCT-Data. In the following sections, we describe experiments performed on a set of volumes with annotated OCT B-scans acquired with a spectral-domain OCT device (Heidelberg Engineering, Germany). Further, we always assume the OCT volume in question to consist of \(N_B\) B-scans, each comprising \(N_A\) A-scans with N voxels.

While raw OCT volume data has become relatively plentiful in clinical settings, large volume datasets with high-quality gold-standard segmentation are not widely available at the time of writing. It is our hypothesis that extracting features which represent a given OCT scan locally, as opposed to incorporating global context at every stage, leads to superior generalization in the face of limited data availability. This is expected most of all for pathological cases in which the global shape of cell layers may deviate drastically from the examples seen in the training data. Our approach consequently differs from common deep learning methods which explicitly aim to incorporate global context into the feature extraction process. Utilization of a shape prior limits a method's ability to generalize to unseen data if large deviations from the expected global shape seen in training are present.

Prototypes on \(\mathcal {P}_{d}\). For applying the framework introduced in Sect. 2, we interpret the covariance features (3.6) as data points \(f_{i}\in \mathcal {F}_{n}\) living on the natural metric space (3.1a) and model each retina tissue class indexed by \(l \in \{1, \dots ,C\}\) with a random variable \(S_l\) taking values in \(\{S_l^k \}_{k = 1}^{N_l}\). To generalize retina layer detection to multiple OCT data sets, instead of just using a single prototype per class obtained via (3.13), we partition the samples \(\{S_l^k \}_{k = 1}^{N_l}\) into \(K_l\) disjoint sets \(\{S^l_1,\dots , S^l_{K_l}\}\) with representatives \(\{ \tilde{S}^1_l, \dots , \tilde{S}^{K_l}_l \}\). These serve as prototypes \(f_{j}^{*},\,j\in \mathcal {J}\), which are determined offline for each \(l\in \{ 1,\dots , 14 \}\) by minimizing the expected loss measured by the Stein divergence (3.12) according to the K-means-like functional

$$\begin{aligned} E_{p_l}(S^{l}) = \sum ^{K_l}_{j = 1} p(j) \sum _{ S_i \in S_j }\frac{p(i|j)}{p(j)} D_S(S_l^i,\tilde{S}^{j}_l), \quad p(i,j) = \frac{1}{N_l},\;p(j) = \frac{N_j}{N_l}, \end{aligned}$$
(5.1)

with marginals \(p_l(j) = \sum _{i = 1}^{N_j} p_l(j|S_l^i)\), using Algorithm 1 for mean retrieval. The experimental results discussed next illustrate the relative influence of the covariance descriptors and of the regularization property of the ordered assignment flow, respectively. Throughout, we fixed the grid connectivity \(\mathcal {N}_{i}\) for each voxel \(i\in \mathcal {I}\) to \(3 \times 5 \times 5\). Figure 3, second row, illustrates a typical result of nearest neighbor assignment and the volume segmentation without ordering constraints. As this row shows, the high texture similarity between the choroid and the GCL layer yields wrong predictions, resulting in violations of the biological retina ordering throughout the whole volume which cannot be resolved with the basic assignment flow approach of Sect. 2. In the third row of Fig. 3, we plot the ordered volume segmentation for different values of the parameter \(\gamma \) defined in (4.4), which controls the ordering regularization by means of the novel generalized likelihood matrix (4.6). The direct comparison with the ground truth shows how the ordered labelings evolve on the assignment manifold while simultaneously giving an accurate data-driven detection of the RNFL, OPL, INL and ONL layers. For the remaining critical inner layers, the local prototypes extracted by (5.1) fail to segment the retina properly, due to the presence of vertical shadow regions originating from the scanning process of the OCT data.
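Returning to the prototype computation (5.1): it amounts to a Lloyd-type K-means in which the Stein divergence (3.12) serves as distortion measure and Algorithm 1 recomputes the representatives. A sketch, reusing the stein_mean function from the Algorithm 1 sketch; cluster count and iteration budget are placeholders:

```python
import numpy as np

def stein_divergence(S1, S2):
    """Stein divergence (3.12) between two SPD matrices."""
    _, ld_avg = np.linalg.slogdet((S1 + S2) / 2.0)
    _, ld1 = np.linalg.slogdet(S1)
    _, ld2 = np.linalg.slogdet(S2)
    return ld_avg - 0.5 * (ld1 + ld2)

def stein_kmeans(samples, K, iters=10, seed=0):
    """Lloyd iterations: assign each descriptor to its closest prototype in
    Stein divergence, then recompute each prototype with stein_mean."""
    rng = np.random.default_rng(seed)
    protos = [samples[k] for k in rng.choice(len(samples), K, replace=False)]
    for _ in range(iters):
        labels = [int(np.argmin([stein_divergence(S, P) for P in protos]))
                  for S in samples]
        for k in range(K):
            members = [S for S, l in zip(samples, labels) if l == k]
            if members:
                w = np.full(len(members), 1.0 / len(members))
                protos[k] = stein_mean(members, w)   # Algorithm 1 sketch
    return protos
```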

Fig. 3.

From top to bottom. 1st row: one B-scan from an OCT volume showing the shadow effects, with annotated ground truth on the right. 2nd row: nearest neighbor assignment based on prototypes computed with the Stein divergence (left) and the segmentation returned by the basic assignment flow (Sect. 2) on the right. 3rd row: the proposed layer-ordered volume segmentation based on covariance descriptors, for \(\gamma = 0.5\) on the left and \(\gamma = 0.1\) on the right (cf. Eq. (4.4)). 4th row: local rounding result extracted from the ResNet features (left) and the result of the ordered flow on the right.

CNN Features. In addition to the covariance features of Sect. 3, we compare a second approach to local feature extraction based on a convolutional neural network architecture. For each node \(i\in [n]\), we trained the network to directly predict the correct class in [c] using the raw intensity values in \(\mathcal {N}_i\) as input. As output, we obtain a score for each layer which can directly be transformed into a distance vector suitable as input to the ordered assignment flow (4.7) via (4.6). The specific network used in our experiments has a ResNet architecture comprising four residually connected blocks of 3D convolutions and ReLU activations. The model size was hand-tuned for different sizes of input neighborhoods, adjusting the number of convolutions per block as well as the corresponding channel dimensions. In particular, labeling accuracy is increased for the detection of the RPE and PR2 layers, as illustrated in the last row of Fig. 3.
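Since the exact architecture was hand-tuned, the following PyTorch snippet is only a hypothetical minimal variant of such a network: four residual blocks of 3D convolutions with ReLU activations, mapping the raw intensity neighborhood of one voxel to one score per layer; all channel widths and kernel sizes are assumptions.

```python
import torch
import torch.nn as nn

class ResBlock3d(nn.Module):
    """Residual block: two 3D convolutions with a skip connection."""
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv3d(channels, channels, kernel_size=3, padding=1)
        self.conv2 = nn.Conv3d(channels, channels, kernel_size=3, padding=1)
        self.relu = nn.ReLU()

    def forward(self, x):
        return self.relu(x + self.conv2(self.relu(self.conv1(x))))

class LocalOCTNet(nn.Module):
    """Maps the intensity neighborhood of a single voxel to c layer scores."""
    def __init__(self, c, channels=16, blocks=4):
        super().__init__()
        self.stem = nn.Conv3d(1, channels, kernel_size=3, padding=1)
        self.blocks = nn.Sequential(*[ResBlock3d(channels) for _ in range(blocks)])
        self.head = nn.Sequential(nn.AdaptiveAvgPool3d(1), nn.Flatten(),
                                  nn.Linear(channels, c))

    def forward(self, x):              # x: (batch, 1, 3, 5, 5) neighborhood
        return self.head(self.blocks(self.stem(x)))

scores = LocalOCTNet(c=14)(torch.randn(2, 1, 3, 5, 5))
print(scores.shape)                    # torch.Size([2, 14])
```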

Evaluation. To assess the segmentation performance of our proposed approach, we compare to the state-of-the-art graph-based method for segmenting 10 intra-retinal layers developed by the Retinal Image Analysis Laboratory at the Iowa Institute for Biomedical Imaging [1, 11, 16], also referred to as the IOWA Reference Algorithm. We quantify the region agreement with manual segmentation regarded as gold standard. Specifically, we calculate the DICE similarity coefficient [7] and the mean absolute error of each segmented cell layer, in units of the pixel size of 3.87 \(\upmu \)m, compared to the human grader on an OCT volume consisting of 61 B-scans; the results are reported in Table 1. To allow a direct comparison to the proposed segmentation method, the evaluation was performed on the layers summarized in Table 1. We point out that, in general, our method is not limited to a fixed number of segmented layers as long as ground truth is available; further performance evaluations, including an additional comparison with the method proposed in [19], will be included in the complete report of the proposed approach, which is beyond the scope of this paper. The OCT volumes were imported into OCTExplorer 3.8.0 and segmented using the predefined Macular-OCT IOWA software.

Both methods detect the RNFL layer with high accuracy, whereas for the underlying retina tissues the automated segmentation with the ordered assignment flow yields the smallest mean absolute error and the highest Dice similarity index, underpinning the superior accuracy of order-preserving labeling.

Fig. 4.

Visualization of segmented intraretinal surfaces. Left: IOWA layer detection of 10 boundaries. Middle: proposed labeling result based on local feature extraction. Right: ground truth. For a quantitative comparison, see Table 1.

Table 1. Mean absolute error measures are given in pixels (1 pixel = 3.87 \(\upmu \)m). Left: Dice and mean absolute error of the proposed approach. Right: Resulting metrics achieved by IOWA reference algorithm. All numbers demonstrate the superior performance of our novel order-preserving labeling approach.

6 Conclusion

In this paper we presented a novel, fully automated and purely data-driven approach for retina segmentation in OCT volumes. Compared to the methods [9, 16] and [19], which have proven particularly effective for tissue classification with a priori known retina shape orientation, our ansatz merely relies on local features and yields ordered labelings which are directly enforced through the underlying geometry of the statistical manifold. Consequently, by building on the flexible concept of spatially regularized assignment [22], the ordered assignment flow (Definition 2) possesses the potential to be extended towards the detection of pathological retina changes and vascular vessel structures, which is the objective of our current research.