1 Introduction

Hyperspectral imaging has gained great importance with the development of remote sensing technologies in recent years. A hyperspectral image (HSI) is obtained by measuring energy reflection through hundreds of continuously narrow wavelength bands. These images are used for different applications, such as environmental monitoring, agriculture, and defense systems [1, 2]. HSI applications in the mentioned areas generally focus on problems like classification, target-anomaly detection and spectral unmixing [3,4,5]. The performance of HSI applications is limited due to degradation on HSI by several types of noise caused by sensor malfunction or atmospheric effects. Therefore, denoising is an important pre-processing technique for HSI analysis. Traditional HSI denoising methods apply denoising onto each band one by one owing to the fact that each band is a grayscale image. The 2D methods ignore spatial–spectral correlation and denoising capacity of these 2D methods may be limited [6]. Transformation-based methods like Fourier transform or wavelet transform are also used to exploit spatial–spectral correlation in denoising [7]. On the other hand, recent studies which are based on low-rank matrix decomposition achieve better performance for the reconstruction of noisy HS images [8,9,10]. Low-rank methods capture spectral information successfully but they may cause spatial distortion in denoising.

Researchers investigated tensor decomposition-based techniques to enhance denoising quality, owing to the fact that an HSI can naturally be represented by a tensor [11]. Spatial–spectral correlation of the HSI can be captured more efficiently by some certain tensor decomposition techniques like CANDECOMP/ PARAFAC (CP) decomposition [12] and Tucker decomposition [13]. One of the proposed denoising studies that uses Tucker decomposition is a low-rank tensor approximation (LRTA) method, which provides spatial low-rank approximation, and spectral dimensionality reduction simultaneously [14]. Tucker decomposition is also used for learning overcomplete dictionaries of HSI data sets and these dictionaries were used in denoising problem as tensor dictionary learning (TDL) [15]. Another denoising method on tensor decomposition is hyper-Laplacian regularized unidirectional low-rank tensor recovery (LLRT) [16] which is not efficient on mixed noise as on Gaussian noise. CP decomposition, also known as PARAFAC, is a commonly used algorithm for HSI denoising, which may lead to spectral distortion due to the estimation of the CP rank that is dependent on variance of noise [17, 18]. In addition, other commonly used tensor decomposition techniques have not only similar problems, but also high computational complexity.

This paper proposes a statistical tensor decomposition method for HSI denoising, enhanced multivariance product representation (EMPR), unlike the other tensor-based methods which are the subjects of multivariate linear algebra. EMPR is a divide-and-conquer algorithm which is based on Sobol’s high-dimensional model representation (HDMR) [19]. Recently, EMPR has been used to solve different problems in image processing [20, 21]. Compared with the other tensor-based denoising methods, our main contributions are as follows:

  • EMPR is capable of capturing spatial–spectral correlation with components that have less dimension than the HSI. This feature of EMPR empowers denoising performance on mixed-type noise.

  • EMPR’s components are easy to compute and certain components which are called “support vectors” can be calculated flexibly with prior knowledge, if there is any, of the noise model.

  • EMPR is a statistical representation of a multivariate function which can be used as a dimensionality reduction method. EMPR’s components can be easily evaluated with marginal expectations under support and weight functions. Therefore, EMPR-based denoising is computationally more efficient than the other tensor-based denoising methods.

This paper is organized as follows: The next section gives an explanation on the mathematical background of EMPR. Section 3 clarifies how to use the EMPR algorithm to remove noise on HSI. We carried out simulated and real experiments in Sect. 4 to show compared quantitative and illustrative assessments. Section 5 concludes the study and informs readers on future works.

2 Enhanced multivariance product representation

EMPR is a dimensionality reduction method for expressing an N-direction tensor in terms of less-dimensional tensors and certain support vectors. Using EMPR method, the element with index \({i_1\ldots i_N}\) of the N-dimensional tensor structure \({\mathcal {X}}\) can be expressed as follows.

$$\begin{aligned}&{\mathcal {X}}_{i_1\ldots i_N}={\mathcal {X}}^{(0)} \prod \limits _{j=1}^Ns_{i_{j}}^{(j)} +\sum \limits _{j_1=1}^N{\mathcal {X}}_{i_{j_1}}^{(j_1)}\prod \limits _{\genfrac{}{}{0.0pt}{}{j=1}{j\ne j_1}}^Ns_{i_{j}}^{(j)}\nonumber \\&\quad +\sum \limits _{\genfrac{}{}{0.0pt}{}{j_1,j_2=1}{j_1<j_2}}^N{\mathcal {X}}_{i_{j_1},i_{j_2}}^{(j_1j_2)} \prod \limits _{\genfrac{}{}{0.0pt}{}{j=1}{j\ne j_1,j_2}}^Ns_{i_{j}}^{(j)} +\cdots +{\mathcal {X}}_{i_{j_1}i_{j_2}\ldots i_{j_N}}^{(j_1j_2\ldots j_N)}, \nonumber \\&i_j=1,2,\ldots ,n_j,\quad j=1,2,\ldots ,N. \end{aligned}$$
(1)

Here, \({\mathcal {X}}^{(0)}\) is a constant, \({\mathcal {X}}^{(j_1)}\)s are one-way tensors (or vectors), and \({\mathcal {X}}^{( j_1j_2)}\)s are two-way tensors, that is, matrices. The right hand side components can be considered as tensors of increasing number of dimensions. So the last element \({\mathcal {X}}^{(j_1j_2\ldots j_N)}\) is the N-way tensor. This expansion contains a finite number of sums. \(s^{(j)}\)s in the expansion are called support vectors. The main purpose of this method is to determine the general structures of EMPR’s components using support vectors and to represent the multivariate data with less variate components. To calculate the components in EMPR expansion, a weight vector (\(W_{i_j}^{(j)}\)) is used in each direction. This weight vector satisfies the normalization conditions given below.

$$\begin{aligned} \sum \limits _{i_j=1}^{n_j}W_{i_j}^{(j)}=1,\quad \sum \limits _{i_j=1}^{n_j}W_{i_j}^{(j)}\left( s_{i_j}^{(j)}\right) ^2=1,\nonumber \\ j=1,2,\ldots ,N. \end{aligned}$$
(2)

To uniquely obtain the EMPR components, vanishing conditions are defined over the weight and support vectors as follows

$$\begin{aligned}&\sum \limits _{i_{j_l}=1}^{n_{j_ l }}W_{i_{j_l}}^{(j_l)}s_{i_{j_l}}^{(j_l)}{\mathcal {X}}_{i_{j_1}\ldots i_{j_k}}^{({j_1}\ldots {j_k})}=0, \nonumber \\&l=1,2,\ldots ,k,\quad k=1,2,\ldots ,N. \end{aligned}$$
(3)

Under the conditions described above, the constant component of the EMPR expansion, \({\mathcal {X}}^{(0)}\) can be calculated as follows.

$$\begin{aligned} {\mathcal {X}}^{(0)}=\sum \limits _{i_1=1}^{n_1}\cdots \sum \limits _{i_N=1}^{n_N} \left[ \prod \limits _{k=1}^NW_{i_k}^{(k)}s_{i_k}^{(k)}\right] {\mathcal {X}}_{i_1\ldots i_N}. \end{aligned}$$
(4)

The univariate components denoted by \({\mathcal {X}}^{(j)}\) are calculated with the exclusion of the relevant direction.

$$\begin{aligned} {\mathcal {X}}_{i_j}^{(j)}= & {} \sum \limits _{i_1=1}^{n_1}\cdots \sum \limits _{i_{j-1}=1}^{n_{j-1}} \sum \limits _{i_{j+1}=1}^{n_{j+1}}\cdots \sum \limits _{i_{N}=1}^{n_{N}} \left[ \prod \limits _{\begin{array}{c} k=1 \\ j \ne k \end{array}}^{N}W_{i_k}^{(k)}s_{i_k}^{(k)}\right] \nonumber \\&\times {\mathcal {X}}_{i_1\ldots i_N}-{\mathcal {X}}^{(0)}s_{i_j}^{(j)}\nonumber \\&i_j=1,2,\ldots ,n_j,\quad j=1,2,\ldots ,N. \end{aligned}$$
(5)

Bivariate components are similarly calculated excluding the two related directions as follows.

$$\begin{aligned} {\mathcal {X}}_{i_j,i_k}^{(j,k)}= & {} \sum \limits _{i_1=1}^{n_1}\cdots \sum \limits _{i_{j-1}=1}^{n_{j-1}}\sum \limits _{i_{j+1}=1}^{n_{j+1}}\cdots \sum \limits _{i_{k-1}=1}^{n_{k-1}}\sum \limits _{i_{k+1}=1}^{n_{k+1}}\cdots \nonumber \\&\cdots \sum \limits _{i_{N}=1}^{n_{N}} \left[ \prod \limits _{\begin{array}{c} k=1 \\ j \ne k \end{array}}^{N}W_{i_k}^{(k)}s_{i_k}^{(k)}\right] {\mathcal {X}}_{i_1\ldots i_N}\nonumber \\&-{\mathcal {X}}^{(k)}_{i_k}s_{i_j}^{(j)}-{\mathcal {X}}^{(j)}_{i_j}s_{i_k}^{(k)}-{\mathcal {X}}^{(0)}s_{i_j}^{(j)}s_{i_k}^{(k)},\nonumber \\&j,k=1,2,\ldots ,N, \quad i_j=1,2,\ldots ,n_j,\nonumber \\&i_k=1,2,\ldots ,n_k. \end{aligned}$$
(6)

The other EMPR components at the right hand side of Eq. 1 are calculated in a similar way.

It is important to determine the support vectors in the EMPR method since it directly affects the performance of the method. Although there is no restriction on the evaluation of the support vectors, it is well known from the previous studies that the calculation of the support vectors by normalizing the directional averages of the data set has produced good results [23]. For this reason, the support vectors obtained in this way are used in this study, and their mathematical definition is given below.

$$\begin{aligned} {s}_{i_j}^{(j)}=\frac{\sum \limits _{i_1=1}^{n_1}\cdots \sum \limits _{i_{j-1}=1}^{n_{j-1}}\sum \limits _{i_{j+1}=1}^{n_{j+1}} \cdots \sum \limits _{i_{N}=1}^{n_{N}}{\mathcal {X}}_{i_1\ldots i_N}}{\left[ \sum \limits _{i_{j}=1}^{n_{j}}\left[ \sum \limits _{i_1=1}^{n_1} \cdots \sum \limits _{i_{j-1}=1}^{n_{j-1}}\sum \limits _{i_{j+1}=1}^{n_{j+1}} \cdots \sum \limits _{i_{N}=1}^{n_{N}}{\mathcal {X}}_{i_1\ldots i_N}\right] ^2\right] ^{\frac{1}{2}}}.\nonumber \\ \end{aligned}$$
(7)

In the next section, it is explained how noise can be removed in HSI by using EMPR components.

3 Proposed method

The mathematical expression of a noisy HSI is as follows

$$\begin{aligned} {\mathcal {Y}} = {\mathcal {X}} + {\mathcal {N}} \end{aligned}$$
(8)

where \({\mathcal {Y}}\), \({\mathcal {X}}\) and \({\mathcal {N}}\) are the observed noisy HSI, noise-free HSI and noise tensor, respectively. All are three-way tensors with the same dimension, \((n_1,n_2,n_3)\). Due to the multi-band nature of HSI, 2D denoising methods may cause image distortions or artifacts. The main reason for this is that such traditional methods are effective only in spatial dimensions and are inefficient at capturing correlated structures through the bands. In the EMPR method, the spatial relationship between the pixels as well as the relationship between the bands is included in the HSI data representation through the EMPR components. In other words, if the spatial coordinates of the first and second directions are indexed by ij and the spectral coordinate of the third direction (band order) is indexed by k in the HSI containing noise, the three-way component of the EMPR method, \({\mathcal {Y}}^{(1,2,3)}\), and the two-way components, \({\mathcal {Y}}^{(1,2)}\), \({\mathcal {Y}}^{(1,3)}\) and \({\mathcal {Y}}^{ (2,3)}\), define a statistical relationship between spatial and spectral dimensions in different directions. These EMPR components can be considered as high-frequency components in multi-resolution analysis-based methods and some of these high-frequency components are loaded by heavy noise.

The element in the \(\{i,j,k\}-th\) position of this HSI data set can be expressed exactly by the EMPR method with the following equation.

$$\begin{aligned} {\mathcal {Y}}_{i,j,k}= & {} {\mathcal {Y}}^{(0)}s_{i}^{(1)}s_{j}^{(2)}s_{k}^{(3)}+{\mathcal {Y}}_{i}^{(1)}s_{j}^{(2)}s_{k}^{(3)} \nonumber \\ {}+ & {} {\mathcal {Y}}_{j}^{(2)}s_{i}^{(1)}s_{k}^{(3)}+{\mathcal {Y}}_{k}^{(3)}s_{i}^{(1)}s_{j}^{(2)} +\quad {\mathcal {Y}}_{i,j}^{(1,2)}s_{k}^{(3)}\nonumber \\ {}+ & {} {\mathcal {Y}}_{i,k}^{(1,3)}s_{j}^{(2)}+{\mathcal {Y}}_{j,k}^{(2,3)}s_{i}^{(1)} +\quad {\mathcal {Y}}_{i,j,k}^{(1,2,3)}\nonumber \\ i=1,2,&\ldots&,n_1,j=1,2,\ldots ,n_2,k=1,2,\ldots ,n_3.\nonumber \\ \end{aligned}$$
(9)

To develop a novel denoising method using the EMPR method, some high-frequency terms have been extracted from the EMPR expansion of the HSI. For this purpose, the components, \({\mathcal {Y}}_{i,j,k}^{(1,2,3)}\), \({\mathcal {Y}}_{i,k}^{(1,3)}\) and \({\mathcal {Y}}_{j,k}^{(2,3)}\), are excluded from the element in the \(\{i,j,k\}-th\) position of this HSI which is \({\mathcal {Y}}_{i,j,k}\) as follows

$$\begin{aligned}&{\mathcal {X}}_{i,j,k}\approx {\mathcal {Y}}_{i,j,k}-{\mathcal {Y}}_{i,j,k}^{(1,2,3)}-{\mathcal {Y}}_{i,k}^{(1,3)}-{\mathcal {Y}}_{j,k}^{(2,3)}\nonumber \\&i=1,2,\ldots ,n_1,\quad j=1,2,\ldots ,n_2,\quad k=1,2,\ldots ,n_3\nonumber \\ \end{aligned}$$
(10)

From another point of view, the high-frequency EMPR terms \({\mathcal {Y}}_{i,j,k}^{(1,2,3)}\), \({\mathcal {Y}}_{i,k}^{(1,3)}\) and \({\mathcal {Y}}_{j,k}^{(2,3)}\) are taken as noise and the results suggest that EMPR was relatively successful in the tests. The performance of the proposed method and the results of the comparison between the other state-of-the-art tensor-based denoising methods are given in the next section.

Fig. 1
figure 1

Denoising results for band 50 of Abu-Urban data set

4 Experimental design and analysis

4.1 Experimental design

EMPR’s denoising performance is compared with PARAFAC, LRTA, TDL, and LLRT which all are tensor-based denoising methods. For quantitative assessment, peak signal-to-noise ratio (PSNR), relative dimensionless global error in synthesis (ERGAS), the structural similarity (SSIM), and spectral angle mapper (SAM) are chosen as evaluation metrics [24]. Larger values of PSNR and SSIM indicate better denoising performance. Smaller values of ERGAS and SAM point out high performance in denoising. Two types of experiments are designed for the assessment of the proposed method. The first type of experiment is simulated on two different HSI data sets. The second type of experiment is the visual comparison of denoising performance of EMPR on a real HSI. For the simulated experiment, the ABU-Urban and the ABU-AirportFootnote 1 from AVIRIS (Airborne Visible/Infrared Imaging Spectrometer) are chosen [25]. The Abu-Airport data set consists of \(100\times 100\) pixels with 205 bands while Abu-Urban has the same dimensions and 207 bands. For the real data experiment, UrbanFootnote 2 data set has been chosen.

Table 1 Quantitative Assessment of Denoising Performance for Abu-Urban
Table 2 Running Time for Abu-Urban
Fig. 2
figure 2

Spectral signature restoration of different methods for the pixel in [15,45] of Abu-Urban. a Noisy pixel, b PARAFAC, c LRTA, d TDL, e LLRT, f (EMPR)

4.2 Simulated experiments

Abu-Urban data set is chosen for the first simulated experiment. The noise model for this experiment includes zero-mean Gaussian noise with different standard deviation randomly sampled from [0.1,0.2] for each band, and randomly selected 20 bands were added noise with 20% density salt and pepper noise. Table 1 shows ERGAS, mean PSNR, mean SSIM, and SAM, respectively, for different tensor-based denoising methods. The results are given for the mean of twenty runs. The best results in Tables 14 for each quality metric are marked in bold and the second best results are underlined. According to the results given in this table, it is clear that EMPR outperforms for three quality indexes and it is the second best for MPSNR. Table 2 gives the running time and EMPR is the fastest denoising method among them all.

Fig. 3
figure 3

PSNR and SSIM values of different denoising methods in each band of the first simulated experiment on Abu-Urban

Figure 1 presents visual quality of denoising performance for each tensor-based method on Abu-Urban data set. The results are given for band 50 of the simulated HSI. The outputs of LRTA and LLRT algorithms are still noisy. PARAFAC and TDL yield better visional results, however, EMPR is demonstrably superior denoiser.

Table 3 Quantitative Assessment of Denoising Performance for Abu-Airport
Table 4 Running Time for Abu-Airport
Fig. 4
figure 4

Denoising results for band 20 of Abu-Airport data set

Spectra of a selected noisy pixel from Abu-Urban data set along with reconstruction results are illustrated in Fig. 2. EMPR provides better suppression of noise while representing pixel spectra. However, there are slight numerical differences between band 40 and band 100 for EMPR’s reconstruction as seen in Fig. 2f. Figure 3 presents PSNR and SSIM values through the bands, respectively. PSNR values are lower than PARAFAC, which is compatible with MPSNR results in Table 1.

The simulated experiment on the Abu-Airport data set is designed with four different types of noise to show the performance of the proposed method [22]. This consists of zero-mean Gaussian noise with different standard deviations randomly sampled from [0.1, 0.2] for each band, impulse noise with 20% density percentage for randomly 20 selected bands, dead lines with a width from 1 to 3 were added to 20% of the bands, which were randomly selected and stripes with a width from 6 to 15 were added to a randomly selected 40% of the bands. Table 3 indicates that EMPR effectively removes the mixed noise compared to the other methods. Figure 4 presents visual results which are compatible with quantitative results in the table. Figure 4 points out that all the methods except EMPR do not accomplish the removal of stripes. Running time of the methods are presented in Table 4.

Restoration of a noisy pixel for all of the methods is given in Fig. 5. EMPR-based denoising fluctuates less than the other methods and achieves better approximation to the original pixel.

PSNR and SSIM values are given in Fig. 6. As it can be seen in the figure, EMPR provides lower PSNR values than PARAFAC in the vast majority of the first fifty bands.

4.3 Experiment on real data

Urban data set is used to verify denoising performance of EMPR on HSI. This data set has some degradation due to noisy bands. A subscene of this data set with \(150\times 150\) size is cropped where the noise is dense and includes stripes. This data set has 210 bands. Bands 139 and 208 are corrupted by heavy noise. Figures 7 and 8 show visual results of denoising on bands 139 and 208, respectively. According to the visual results of the restored bands, EMPR has superiority over the other tensor-based methods.

Fig. 5
figure 5

Spectral signature restoration of different methods for the pixel in [45,89] of Abu-Airport-3. a Noisy pixel, b PARAFAC, c RTA, d TDL, e LLRT, f (EMPR)

Fig. 6
figure 6

PSNR and SSIM values of different denoising methods in each band of the simulated experiment-1 on Abu-Airport

Fig. 7
figure 7

Urban data set: a original band 139, b PARAFAC, c LRTA, d TDL, e LLRT, f EMPR

4.4 Discussion

According to the experiments with both simulated and real data, it can be concluded that EMPR is particularly effective at HSI denoising. Bivariate and trivariate EMPR components have spatial–spectral correlation for different directions of HSI data cube. We excluded EMPR’s most noisy components for the denoising process. As pointed out in Sect. 4.2, there are some numerical differences for some of the bands at EMPR-based denoising. These results potentially demonstrate that EMPR’s components may not be able to capture the spectrality totally on these bands of the HSI signals. Another topic of discussion is EMPR’s PSNR results in Fig. 3. Looking at these results, it can be inferred that EMPR is more sensitive to the Gaussian noise than to the other types of noise, since Gaussian noise added to all of the bands. Nevertheless, it should be noted that EMPR is still the best denoiser considering the other quantitative metrics like SSIM. Moreover, it is clear from the experiments that EMPR provides a better performance on the removal of the mixed-type noise.

Fig. 8
figure 8

Urban data set: a original band 208, b PARAFAC, c LRTA, d TDL, e LLRT, f EMPR

5 Conclusion and future work

This article proposes a new HSI denoising method using EMPR. EMPR naturally represents tensor-type data sets by describing different statistical features of different dimensions. The EMPR method makes it easy to process multidimensional data sets through pre-calculated support vectors. These support vectors can be considered as priors of the denoising model. The features of the support vectors and divide-and-conquer mechanism of EMPR allow us to develop an efficient tensor decomposition-based denoising method for HSI. Both simulated and real experiments demonstrated promising results on the EMPR’s denoising quality. Running time results also revealed that EMPR has lower computational complexity than the other state-of-the-art tensor-based denoising methods. On the other hand, as discussed in Sect. 4.4, EMPR’s sensitivity to the Gaussian noise should be explored in future work. A non-local approach can be utilized to improve EMPR’s performance on Gaussian noise and can empower spectrality of the EMPR components. Another future study for consideration is the detailed analysis of EMPR noisy components’ spectrality.