Introduction

A current challenge in prostate cancer is the development of new strategies to improve the diagnosis of significant cancers while excluding insignificant ones. Presently, a volume threshold is used, and cancers more than 0.5 cc are considered as significant. Systematic transrectal ultrasound biopsies (TRUS-B) remain the gold standard for prostate cancer diagnosis; however, these biopsies are invasive and miss a significant percentage of cancers. In contrast, prostate magnetic resonance imaging (MRI) can be used noninvasively for reliable characterization of prostate tissues/lesions, by taking advantage of multiparametric protocols [1].

A typical prostate multiparametric MR study includes a morphological T2-weighted (T2-W) series and a functional series, including diffusion-weighted imaging (DWI) sequences and perfusion T1-weighted sequences, acquired before, during and after contrast agent administration (also called dynamic contrast enhanced [DCE] MRI). In some cases, these studies also include spectroscopic analysis. For multiparametric analysis, several DWI sequences with different diffusion gradients are acquired and used to compute the apparent diffusion coefficient (ADC) map. The DCE images are used to analyse the tissue enhancement, either visually or using time/intensity curves and semi-quantitative parameters, such as the area under the enhancement curve, wash in and wash out rates and time to peak. Computation of pharmacokinetic parameters can also be used to characterize tissue microcirculation, for example, using Tofts’ model, where \({K}_\mathrm{trans}\) is the transfer constant between the blood plasma and the extra-vascular extra-cellular spaces (EES), \({K}_\mathrm{ep}\) is the exchange constant rate between the EES and the blood plasma, and \({V}_\mathrm{e}\) is the extra-vascular extra-cellular volume [2, 3] (Fig. 1).

Fig. 1
figure 1

Examples of multiparametric magnetic resonance images: a T2-weighted sequence, b diffusion-weighted images (DWI), c T1-weighted dynamic contrast enhance (DCE) images, d the apparent diffusion coefficient map computed from the diffusion-weighted sequences and e, f pharmacokinetics parameters computed from the DCE sequences. The transfer constant between the blood plasma and the extra-vascular extra-cellular spaces \(({K}_\mathrm{trans})\) and the exchange constant rate between the extra-vascular extra-cellular spaces and the blood plasma \(({K}_\mathrm{ep})\) were determined in e and f, respectively

In this context, and because MRI involves an enormous number of data to interpret, many authors have investigated automatic classification algorithms to design MRI-based computer-aided diagnosis (CAD) tools and software for cancer characterization. The current standard paradigm for CAD systems is as a second reader. After the radiologist has evaluated multiple imaging sets, CAD is used to indicate the likelihood that a given suspicious region is malignant. Most of the employed methods are based on supervised classification techniques, in which a set of preinterpreted patient data must be used as a learning step (Table 1). These approaches have presented two main limitations. First, the performance of the algorithm depends on the quality and volume of the training data. Second, the learning step requires retrospective data, which might be either obsolete or centre specific.

Table 1 State of the art of classification and clustering methods applied for prostate magnetic resonance image analysis

In this study, we began with the assumption that a multiparametric approach would be optimal for all of the images. We then proposed an unsupervised method for prostate tissue classification from multiparametric MR images. This method was not designed for the detection of cancer or suspicious lesions but rather was intended to be used as an analysis tool, allowing us to merge information from all available sources to address conflict and redundancy and to provide a unique homogeneous pattern (tissues) map.

We investigated the use of evidence theory. This approach, also known as belief functions theory, is becoming widely used in multisource data analysis. It provides an advanced modelling of fusion, conflicts among sources and outliers [4]. Several applications of this reasoning in medical image analysis and CAD can be found (e.g. in brain MRI segmentation and tumour detection) [5, 6]. Our proposed approach is an adaptation of the evidential C-means classifier (ECM) introduced by Masson and Denoeux [7]. We successfully applied this method to multiparametric prostate MR data in a previous work, in which the aim was to segment and separate the prostate into two compartments: peripheral and transition zones [8].

Methods

Figure 2 depicts the pipeline of the method.

Fig. 2
figure 2

The pipeline of the method. The multiparametric data are first spatially registered. After feature extraction, evidential theory-based clustering is used to reduce the data into homogeneous classes

Data preparation: spatial registration and feature extraction

The multiparametric MR sequences could not be processed directly for two reasons. First, most of the images were acquired from different fields of view and thus had different spatial resolutions. Second, there were patient movements between acquisitions. Therefore, the first step was spatial registration and normalization. Because T2W images are regarded as the cornerstone for prostate morphology evaluation, these images were considered the reference space, and the other sequences were strictly registered to this space using standard affine registration algorithms based on maximization of mutual information.

Only a simplest affine registration was considered due to the fact that all the images are acquired during the same MR exam, and thus, we can consider that morphological changes due to rectum and bladder pressures are negligible and therefore no need to the application of a more sophisticated method to compensate the deformations.

Following this spatial matching, resampling and interpolation were used to match the other sequences with the T2W images in terms of resolution.

After this preprocessing, each prostate voxel was described by a feature vector containing the image features as grey levels from the T2W images, the ADC and the pharmacokinetic parameters \(({K}_\mathrm{trans}\), \({K}_\mathrm{ep}\) and \({V}_\mathrm{e})\).

A previous study [23] demonstrated that the local parameters, computed using fractal geometry, allowed for better detection of heterogeneities in prostate tissues from T2W MR images than from the native grey level values. Fractal geometry is a powerful tool for texture analysis that can be used to process medical imaging data efficiently. The fractal geometry can be measured using the fractal dimensions (FDs). We computed the local 3D FDs for each \(7\times 7\times 3\) region of interest (ROI) using the variance method [24]. Thus, instead of considering the T2W image grey levels, a local fractal dimension was estimated for each voxel.

By combing all of the MR data, a feature set including the local fractal dimension, the ADC value and the pharmacokinetic parameters \({K}_\mathrm{trans}\), \({V}_\mathrm{e}\) was associated to each voxel as follows:

$$\begin{aligned} \hbox {Voxel} ({V}_{i} )=\left\{ {\hbox {FD},\hbox {ADC}, {K}_{\mathrm{trans}},{V}_\mathrm{e}} \right\} \end{aligned}$$
(1)

After processing, the data were normalized and standardized for the range of different features. For each feature, the mean and the standard deviation were computed. For each feature value, the mean was then subtracted, and the result was divided by the standard deviation.

Source modelling and clustering

Initialization

Our method was designed to assist radiologists in analysing large numbers of images by fusing the sources into a single map. The voxels were described by their features (Eq. 1) and were grouped into homogeneous patterns. Class initialization is a key step in any clustering process for addressing this issue. Our proposed approach was inspired by radiologists’ behaviour when cognitively analysing MR images. Generally, the most sensitive images, typically pharmacokinetic and ADC images, are analysed first to detect and highlight the suspicious regions. To refine the analysis, several cross-analyses are performed using all of the image sequences.

In this regard, the \({K}_\mathrm{trans}\) map could be used to define the initial classes map. First, the image histogram was constructed, and a mode recognition algorithm [25] was applied to detect the class number. Then, the K-means algorithm was used for clustering. The results were applied to the remaining image sources to obtain the configuration of the first classes.

Evidential modelling

Evidential reasoning associates a data source or sensor, S, with a set of propositions, also known as the “frame of discernment”. In a classification context, the frame of discernment, denoted as \(\varOmega \), is the set of classes. If \(\omega _1 ,\ldots ,\omega _k \) denote these classes, then \(\varOmega =\left\{ {{\omega }_1 ,\ldots ,{\omega }_{k}} \right\} \).

Let \({P}=\left\{ {{P}_1 ,\ldots , {P}_{N}} \right\} \) be the set of patterns/objects to be assigned to one of the \(\varOmega \) classes. Evidential reasoning allows for the extraction of the partial knowledge of this assignment, called the “basic belief assignment” (bba). A bba is a function that takes values in the range [0, 1] and defines the \(2^{\varOmega }\) subsets of \( \varOmega (\left\{ {\emptyset ,{\omega }_1 ,\ldots ,{\omega }_{k} ,{\upomega }_1 \mathop \cup \nolimits {\omega }_2,\ldots ,} \right\} )\). For each pattern \(P_{i} \in P\), a bba (denoted as \(m_{i}\) ) allows us to measure the assignment to each subset \(A\) of \(\varOmega \) such that the following relationship is true:

$$\begin{aligned} \mathop \sum \limits _{{A}\subseteq \varOmega } {m}_{i} ({A})=1 \end{aligned}$$
(2)

The higher the value of \(m_{i} \left( A \right) \) is, the stronger the belief for assigning \(P_{i} \) to A. For instance, \(m_{i} \left( {\left\{ {\omega _{j}} \right\} } \right) =1\) implies that \(\left( {\forall A\subseteq \varOmega ,A\ne \left\{ {\omega _{j}} \right\} } \right) ,m_{i} \left( A \right) =0\). This means that \(P_{i} \) is assigned to \(\left\{ {\omega _{j}} \right\} \). However, if \(m_{i} \left( {\left\{ {\omega _{j}} \right\} } \right) =0.5\), \(m_{i} \left( {\left\{ {\omega _l } \right\} } \right) =0.2\) and \(m_{i} \left( {\left\{ {\omega _{j} ,\omega _l } \right\} } \right) =0.3\), there is a stronger belief for assigning \(P_{i} \) to \(\left\{ {\omega _{j}} \right\} \) than to \(\left\{ {\omega _l } \right\} \). It also highlights that there is a 20 % belief for assigning \(P_{i} \) to the union of the 2 classes, which can be interpreted as a doubt regarding pattern membership. In contrast to the fuzzy set model, evidential reasoning can extend the partial membership concept by assigning beliefs not only to classes but also to unions of classes. This feature is particularly useful in cases in which the classes overlap.

Using this model, Denoeux and Masson [26] introduced a new type of data partition called the “credal partition”. This partition can be seen as an extension of the fuzzy partition, with bba functions replacing the fuzzy membership functions. The authors later proposed an evidential version of the C-means classifier (the ECM) that used a credal partition, which was inspired by the fuzzy C-means (FCM) algorithm [7]. This algorithm classifies \(N\) patterns into \(k\) classes of \(\varOmega \), based on the class centres and the minimization of a cost function. As is the case for fuzzy partitions in FCM, the credal partition (in which each line is a \(\text {bba} m_{i} \) associated with a pattern \(P_{i}\)) is optimized iteratively. Further details on this generic classification method can be found in [7].

Spatial relaxation

The ECM model, as previously described, extracts and optimizes partial knowledge for pattern assignments. The ECM model can be used to classify voxels directly as independent data objects. However, the voxel neighbourhoods, defined by the connexity system, provide valuable information. In fact, the image segmentation supposes that the image regions share common features. Connexity and neighbourhood systems model this assumption using region-oriented segmentation methods, such as growing regions or hidden Markov field models.

In a homogeneous region or class, a bba not only provides information on a pattern but also on its connected neighbours. Corrupted information, extracted from outliers/noise patterns, can be relaxed using the information from neighbours. Thus, introducing neighbourhood information in ECM modelling can provide the following advantages:

  • Modelling of contextual region information by extracting information from the patterns/voxels;

  • Reduced corrupted information related to outliers and noise; and

  • Assimilation of the ECM classifiers into region-based segmentation processes.

The \({bbam}_{i}\) of pattern \({P}_{i}\) (associated with voxel \({V}_{i})\) was relaxed by combining it with bba functions from spatially connected neighbours. Spatial connection was defined by a 3D connexity system (26 in this study). A combination was performed using a conjunctive bba combination operator [27].

Let \(m_{i} \) be the bba of the pattern \(P_{i} \) and \(\left\{ {m_{i1} ,\cdots ,m_{i26}} \right\} \) be the bba functions from \(P_{i} \)’s 26 connected neighbours \(\left\{ {P_{i1} ,\cdots ,P_{i26}} \right\} \). We denote as \(m_{i}^{\prime }\) , the result of combining \(m_{i} \) and \(\left\{ {m_{i1} ,\cdots ,m_{i26}} \right\} \)

$$\begin{aligned} {\begin{array}{l} {\forall {A}\subseteq \varOmega } \\ {m_{i}^{\prime } \left( {A} \right) =\mathop {\mathop {\sum }\limits _{{A}_1 \cap \cdots \cap {A}_\mathrm{f} \!=\!A}}\limits _{{A}_1 \cdots {A}_\mathrm{f} \subseteq \varOmega } m_{i} \left( {{A}_1 } \right) \cdots m_{i26} \left( {{A}_\mathrm{f}} \right) } \\ \end{array}} \end{aligned}$$
(3)

However, the contribution of each neighbour to this combination should be weighted by its distance from the considered voxel. This combination is particularly relevant for prostate MRI, in which the voxels are significantly anisotropic. Based on this reasoning, the neighbouring bba functions were weighted as follows.

Let \(m_{ij}^{{j}}\) be the result of weighting \(m_{ij}\)

$$\begin{aligned} \begin{array}{l} \forall {A}\subseteq \varOmega \\ \left\{ {{\begin{array}{l} {m_{ij}^{{\alpha _{j}}} \left( A \right) =\alpha _{j} .m_{ij} (A)} \\ {m_{ij}^{{\alpha _{j}}} \left( \varOmega \right) =1-\alpha _{j} +\alpha _{j} .m_{ij} (\varOmega )} \\ \end{array}}} \right. \\ \end{array} \end{aligned}$$
(4)

where

$$\begin{aligned} {\alpha }_\mathrm{j} =\frac{{\gamma }}{\hbox {d}_{\mathrm{ij}}^2 } \end{aligned}$$
(5)

\( 0\le \gamma \le 1\) is a parameter, and \(d_{ij}\) is a normalized Euclidean distance between voxel \(v_{i} \) and its spatial neighbour \(v_{ij}\) . Using Eq. (3) and replacing the neighbouring bba functions with the weighted ones (4), we define \(m_{i}^{\prime }\) as follows.

$$\begin{aligned} {\begin{array}{l} {\forall {A}\subseteq \varOmega } \\ {m_{i}^{\prime } \left( {A} \right) \!=\!\mathop {\mathop {\sum }\limits _{{A}_1 \cap \cdots \cap {A}_\mathrm{f} \!=\!A}}\limits _{{A}_1 \cdots {A}_\mathrm{f} \subseteq \varOmega } \!\!m_{i} \left( {{A}_1 } \right) \cdot m_{i1}^{\alpha _{1}} \left( {{A}_2 } \right) \cdots m_{i26}^{{\alpha _{26}}} \left( {{A}_\mathrm{f}} \right) } \\ \end{array}} \end{aligned}$$
(6)

This combination is used as a relaxation step that allows for correction of the evidential assignment of a voxel based on the information from its neighbours.

Decision making

At the level of bba function extraction and optimization, we measure belief for the membership of each voxel to one of the classes \(\omega \in \). However, we also measure belief for the empty set and subsets \(A\), which could be interpreted as “doubt” regarding the membership of the voxel. A decision still must be made to classify the voxels into one of the classes. This decision is reached by transforming the \({\text {bba}}m_{i} \) into a probability measurement, as calculated by

$$\begin{aligned} {\begin{array}{l} {\forall {\omega }_{i} \in \varOmega } \\ {\hbox {Prob}\left( {{\omega }_{i}} \right) =\frac{1}{1-m_{i} (\upphi )}} \\ \end{array}}{\mathop {\mathop {\sum }\limits _{\omega \in A}}\limits _{A\subseteq \varOmega }} \frac{m_{i} ({A})}{\left| {A} \right| } \end{aligned}$$
(7)

where \(\left| A \right| \) denotes the number of elements in A and \(\upphi \) is the emptyset.

Finally, we define the decision rule \(R\) that associates the pattern \(P_{i} \) with one of the classes of as follows.

$$\begin{aligned} {R}\left( {P_{i}} \right) =\mathop {\hbox {argmax}}\limits _{{\omega }\in \varOmega } \left( {\hbox {Prob}\left( {{\omega }_{i}} \right) } \right) \end{aligned}$$
(8)

Results presentation

For the presentation of the results, two levels are proposed. The first level displays a single 3D class map with a colour associated with each class. This map labels all of the voxels with one of the \(\left( {\left\{ {\omega _1 ,\ldots ,\omega _k } \right\} } \right) \) classes. It also renders the tissue distribution homogeneous. For interpretation purposes, the map can be merged with any of the original image sources.

The second display level provides a more detailed description of the clustering results. The different probability maps resulting from the belief membership conversion (Eq. 7) are displayed using colour maps. These maps could be very useful in addressing conflict regions. Indeed, they include the empty set probabilistic distribution. The empty set is associated with the reject class, grouping all of the voxels that could not be clearly associated with one of the classes (i.e. the probability of the empty set was the greatest).

The maps also include all of the other subset cases.

For instance, for a typical case with 2 classes, \(\omega _1 =\hbox { healthy tissue}\) and \(\omega _2 =\hbox { suspicious tissue}\), the clustering produced 4 maps: a probability map for each class and a probability map for the 2 class unions in which a voxel has the same probability of being in either class. The last map is the empty set map, which groups the outliers.

The first level display creates the homogeneous tissue map by combining the 2 results of classes \(\omega _1\) and \(\omega _2.\)

Experiments and results

In this section, we report the results of the method applied to synthetic data with a known ground truth for class distribution and clinical images correlated with histopathology findings.

Simulated image experiments

The aims of these experiments were to test and evaluate the clustering process with regard to noise.

Creation

Simulation of multiparametric MR images is a difficult process. Moreover, a complete simulation should model and simulate the tumour behaviour for all of the MR sequences (T2W, DCE, DWI and spectroscopy). To our knowledge, these behaviours are not yet well known.

Nevertheless, to highlight the performance of the ECM clustering combined with spatial neighbouring relaxation on multisource data, we rendered multiparametric images using 3 MR clinical sequences: T2W, DCE and DWI. For each sequence, a radiologist delineated the peripheral and transition zones. The transition zone (TZ) was then filled with its mean value. The peripheral zone (PZ) level was deduced using a predetermined contrast value C defined as

$$\begin{aligned} {C}=\frac{\left| {{I}_{\mathrm{TZ}} -{I}_{\mathrm{PZ}}} \right| }{\left| {{I}_{\mathrm{TZ}} +{I}_{\mathrm{PZ}}} \right| } \end{aligned}$$
(9)

where \({I}_{\mathrm{TZ}}\) and \({I}_{\mathrm{PZ}}\) are the mean MR signals of TZ and PZ in the considered sequence, respectively.

Gaussian noise was added, and a median filter was applied to reduce the salt-pepper aspect of the noisy images so that they would be more similar to the real data. Figure 3 illustrates this process.

Fig. 3
figure 3

The image simulation process. From left to right, original MRI with predelineated peripheral and transition zones, the zones labelled with their respective mean level, data with Gaussian noise and data after a smoothing median filter. The first, second and third lines represent the T2W, DWI and DCE images, respectively

The signal-to-noise ratio of the simulated data was defined as:

$$\begin{aligned} \hbox {SNR}_{\mathrm{db}} =10\hbox {log}\left[ {\frac{\mathop \sum \nolimits _{{i}=1,\ldots ,\hbox {n}} \left( {{I}_0 } \right) ^{2}}{\mathop \sum \nolimits _{{i}=1,\ldots ,{n}} \left( {{I}-{I}_0 } \right) ^{2}}} \right] \end{aligned}$$
(10)

where \({I}_{0}\) and \(I\) are the intensities before and after adding the Gaussian noise, respectively.

Finally, the rendered images had different quality levels. The signal-to-noise ratios ranged from 18.21 to 7.77 dB, and the contrast levels (\(C\), Eq. 9) ranged from 0.1 to 0.25.

Results

The evaluation of the classification method for the synthetic data consisted of the accurate segmentation of the peripheral and transition zones. The evaluation was performed using the classification error from the voxel labelling.

Figure 4 illustrates the classification errors observed for different noise levels.

Real data evaluation

Description

For clinical evaluation, we considered the abilities of the method to segment cancerous regions. For this purpose, a retrospective study was done. Multiparametric MR images were collected from 15 patients using a Philips ACHIVA 1.5T machine. The data consisted of T2W images with a \(0.48\times 0.48\times 4.00\hbox { mm}^{3}\) voxel size (15 slices by volume), T1W DCE MRI images with a \(0.61\times 0.61\times 4.00\hbox { mm}^{3}\) voxel size (15 slices by volume, 20 dynamics) and an ADC map computed from 2 DWI acquisitions using b=0 and b=600 with a \(1.12\times 1.12\times 4.00\hbox { mm}^{3}\) voxel size (15 slices by volume). The DCE images were processed in-house using software implementing Tofts’ model to generate the pharmacokinetic \({K}_\mathrm{trans,}\) \({V}_\mathrm{e}\) and \({K}_\mathrm{ep}\) maps.

As described above, the MR volumes were registered and interpolated to fit the T2W resolution.

Fig. 4
figure 4

The results expressed as classification errors for labelling voxels belonging to either the peripheral or transition zones from multisource data with different noise levels

All of the patients underwent radical prostatectomy. To have a true diagnosis, the histological findings were correlated with MRI data. The prostate specimens were stained, fixed and sectioned according to the Stanford protocol [28]. A reconstructed histological map of each prostate was created. The contours of the histological zones, as well as the outlines of each cancer, were drawn on the slides under a microscope (Fig. 5a). The results from these analyses were reported on the MR images by manual correspondence with the histology images (Fig. 5b). This task was performed by experienced uro-radiologists, according to the method described in [29]. The prostate was divided into eight regions of interest by the octant technique. The top and bottom were divided into four quadrants, corresponding to the transition zone (TZ) and peripheral zone (PZ) on the left and right. Within each octant, the TZ, PZ and anterior fibromuscular stromal boundaries were traced. The tumour was located according to these histological zone boundaries.

Out of 15 patients, a total of 43 tumours were considered, including 25 tumours with a volume greater than 0.5 cc (Table 2).

Fig. 5
figure 5

a Histopathology image. b Cancer maps created from the histopathology images

Results

For quantitative evaluation, we measured the ability of the method to distinguish tumour patterns from healthy tissue patterns. For this purpose, at the end of the clustering process, the resulting 3D map was merged with the anatomical T2W MR images, allowing to the radiologist to label and associate the different classes to the tissue classes (peripheral zone, transition zone, etc.) and thus isolate the suspicious regions (tumours). By comparing the results to the actual diagnosis for each patient, the numbers of true positives, false positives, true negatives and false negatives were evaluated for different regions of interest. Sensitivity and specificity were computed, respectively, as follows:

$$\begin{aligned} \hbox {sensitivity}&= \frac{\hbox {true positives}}{\hbox {true positives}+\hbox {false negatives}}\\ \hbox {specificity}&= \frac{\hbox {true negatives}}{\hbox {true negatives}+\hbox {false positives}} \end{aligned}$$

The Dice (DSC) measure was also calculated as:

$$\begin{aligned} \hbox {DSC}=\frac{2.\left| {{S}\mathop \cap \nolimits {G}} \right| }{\left| {S} \right| +\left| {G} \right| } \end{aligned}$$

where \(S\) and \(G\) are the segmentation and the ground-truth results, respectively, while \(\left| . \right| \) operator denotes the pixels number in the set.

Table 2 Histopathologic analysis of the 15 patients included in the study

Table 3 summarizes the obtained results. The mean sensitivity and specificity were 65 % (0–100 %) and 81% (50–100 %), respectively. The worst scores corresponded to tumours with volumes less than 0.5 cc. This value is currently accepted as a limit for tumour detection on MR imaging. By considering only tumours with volumes greater than the 0.5 cc threshold, the mean sensitivity and specificity grew to 70 and 88 %, respectively.

Table 3 Quantitative evaluation of the proposed method

Figure 6 shows a case in which 3 classes were detected. The result (Fig. 6f) depicts the class map, which highlights a class for the peripheral zone, a class for the transition zone tissue and a third class composed of data from 2 patterns that could be interpreted as a tumour. Returning to the MR images, one of these patterns was clearly depicted in the 4 images (T2W, ADC, \({K}_\mathrm{trans}\) and \({V}_\mathrm{e})\), while the second was only visible as a hyper-vascular area on the \({K}_\mathrm{trans}\) map. The fusion allowed us to place these two patterns in the same class. The ground truth image (G) revealed two tumours (circles).

Fig. 6
figure 6

Classification results—Example 1. a T2W image with the region of interest to be analysed. Three tissues are defined: part of the PZ, part of the TZ and a suspected area. b The ADC map. c The \({K}_\mathrm{trans}\) map. d The \({V}_\mathrm{e}\) map. e Basic belief assignment (bba) map showing the membership degree of each voxel to the 3 classes. f Fusion map showing the distribution of the 3 tissues. Each colour is associated with a class. g The ground truth

Figure 7 depicts a case in which the analysis focused only on the peripheral zone, using 3 sources: the T2W (Fig. 7a), the \({K}_\mathrm{trans}\) (Fig. 7b) and the ADC (Fig. 7c) maps. Initialization was performed starting with the ADC map. The result of the fusion (Fig. 7e) is displayed in the superposition mode for the T2W image, whereas Fig. 7d shows the bba map. Both maps (d and e) highlight a region corresponding to the optimal fusion among the hypo-signal of the ADC map, the hyper-signal of the \({K}_\mathrm{trans}\) and the T2W hypo-signal. Figure 7f shows the ground truth.

Fig. 7
figure 7

Classification results—Example 2. a T2W image with peripheral zone delineation. b The \({K}_\mathrm{trans}\) map. c The ADC map. d The basic belief map. f The fusion map of images (a), (b) and (c). Each colour is associated with a class. The result is merged with the T2W (a) image. f The ground truth

As a last experiments and in order to highlight to proposed method efficiency, the obtained results were compared to one of the literature approaches. As it was indicated in the “Introduction” section and in Table 1, two methods classes exist: supervised algorithms needing a learning step and unsupervised algorithms without a learning step. Since our aim was to investigate a clustering technique to perform an unsupervised analysis of the image, the closest method from Table 1 to the proposed method is the one by Liu et al. [21]. This method is unsupervised and acts on MP MRI. The fuzzy random field modelling was implemented as described (Table 4).

Discussion: Conclusion

In this study, we described an unsupervised fusion scheme to analyse multiparametric MR images. This proposed framework was not designed for cancer detection or characterization but rather as a tool to assist radiologists in analysing multisource data. Because it is fully automatic, it can be applied in clinical routines for preliminary analysis and to fuse multiple MR sequences into a single 3D map in which the voxels that exhibit similar behaviours in all of the sequences are grouped into classes.

The method is based on a spatial registration and a normalization step that standardizes the data, followed by a multisource clustering step that is driven by the evidential C-means algorithm. A relaxation step is introduced in this algorithm to integrate the voxel spatial neighbourhood information. As a result of this relaxation, the basic belief for the assignment of each voxel can be corrected using information from connected neighbours, via a conjunctive combination of the bba functions.

The combination of the ECM algorithm with this spatial relaxation allowed us to perform clustering and segmentation on simulated multisource data with different noise levels (Fig. 3), with accuracy of 95 % (Fig. 4).

Two-level result reporting was proposed. The first level used a single map that represented homogeneous tissue distribution, while the second level was a membership degrees image. This image maps the source conflict regions.

We validated our method by analysing clinical data from 15 patients. The method had mean sensitivity and specificity of 65 and 81 %, respectively. These tests considered data consisting of grouped tumours, with different volumes from both the peripheral zone and the transition zone.

To our knowledge, there has been only one previously published study, by Liu et al. [21], in which the authors used an unsupervised classification method to diagnose prostate cancer using multiparametric MR images. The method was based on fuzzy Markov random fields, in which the parameters were implicitly estimated and combined with the segmentation process. However, the previous method was parametric because Gaussian distributions were assumed for the classes. Our approach was free from any assumptions regarding class distributions.

Compared with the previously reported methods, which have mainly been based on supervised classification algorithms, our results might appear weak. Indeed, the supervised techniques described in Table 1 reported sensitivity and specificity scores of at least 85 %. However, as discussed in the “Introduction” section, supervised classification requires an extensive learning process, and the most important issue with this type of approach is that the learning step must be updated each time the data change due to changes in the acquisition protocol.

In a study by Ozer et al. [22], two supervised methods were compared with an unsupervised algorithm (Liu et al. [21]). It was concluded that the supervised algorithms performed better than the unsupervised algorithm. In general, when the data are similar to those used during the learning process, in which all of the classification parameters are optimized, supervised approaches outperform unsupervised approaches. This situation changes if new data are used. In clinical practice, such a change often occurs due to changes in the acquisition protocols, when new MR sequences parameters or new machine are used.

Without the inclusion of any learning or calibration steps, the preliminary results for our unsupervised approach to computer-aided analysis of multiparametric MR images showed promising results. Moreover, the global framework described here could be extended to other multimodality sources. Indeed, photon emission tomography (PET) has the ability to analyse quantitative biomarkers that assess a host of physiological and biochemical tumour characteristics. Ultrasound elastography could also be valuable for tissue characterization. The only limitation for the integration of all of these sources into the proposed approach is spatial registration.

Table 4 Comparison of the proposed method with the Lui et al. method

The fusion strategy employed in this study attributed equally distributed confidence levels to the different image sources. However, specific parameterization using different weights is also possible. Tiwari et al. [30] investigated the weighted combination of multiparametric MR imaging for the evaluation of radiation therapy outcomes, and they reported promising results. The weighted combination of multiparametric or multimodality sources provided specific confidence levels for each imaging modality, according to its sensitivity and specificity. This combination could easily be applied for basic belief assignment modelling. It could also be used at different stages during initialization or before final fusion and conversion of the bba functions into class membership degrees.

Another important issue concerns validation. The evaluation described here was a single-observer study. Currently, we are preparing a more extensive evaluation with a more relevant patient base and better selection criteria, such as limitation to only one zone (peripheral or transition zones) and consideration of only clinically significant tumours larger than the threshold of 0.5 cc. This evaluation will be a multiobserver study.

Lastly, it is understood that the proposed approach will not replace supervised approaches. As clearly indicated, our method is not a diagnostic technique. The development and implementation of supervised techniques for distinguishing aggressive tumours from benign lesions remain of great importance to clinical practice.