1 Introduction

Image fusion has become an important part of image processing. It refers to integrating multiple images of the same or different scenes into a new image that provides more comprehensive support for a specific field [43]. The fused image should minimize information redundancy while retaining all useful information of the source images [5]. In recent years, image fusion technology has received widespread attention and is widely used in many areas, including multi-focus imaging [34, 61], medical imaging [10, 60], infrared imaging [48] and remote sensing [19].

Image fusion can be divided into three levels: data-level fusion, feature-level fusion and decision-level fusion. Data-level fusion, also known as pixel-level fusion, directly processes the data collected by the sensor. Its advantage is that it retains as much of the raw data as possible, providing subtle information that the other fusion levels cannot. Feature-level fusion first extracts regional features from the acquired source images according to a feature extraction principle, then analyzes the extracted feature information and summarizes the most representative characteristics, which are then integrated further. Decision-level fusion first applies filtering and signal enhancement, continues with feature extraction as in feature-level fusion, and focuses on the decisions supported by the target region itself. According to their characteristics, image fusion algorithms are usually divided into two categories: algorithms based on the spatial domain and algorithms based on the frequency domain. The former processes images directly in the spatial domain; because it works on pixels instead of transform-domain coefficients, it can preserve image details at all scales rather than only the finite scales determined by the decomposition layers, while also avoiding the additional computational burden of multi-scale decomposition. The latter obtains the fused image in the frequency domain: coefficients are extracted according to the local features of the source images, appropriate fusion rules are selected, and the fused image is reconstructed from the coefficients [2]. Common frequency-domain fusion methods include the contourlet transform [25], discrete wavelet transform [30, 44], dual-tree complex wavelet transform [9], stationary wavelet transform [16], curvelet transform [6], slicelet transform [22] and so on.

With the increasing application of image fusion, research on image fusion has become more and more extensive. Fusion algorithms based on the discrete cosine transform (DCT) and the discrete wavelet transform (DWT) are the most common. Image fusion based on the wavelet transform has been successful and widely used. For example, Naeem et al. [38] used the DWT to fuse an image with few details and another image with rich details, which can change the uniformity of the image with encrypted details. Yang [59] proposed a DWT-based image fusion method that uses the maximum-coefficient fusion rule. Haghighat [24] proposed an efficient multi-focus image fusion method based on wavelet-domain variance, which improves the quality of the fused image and reduces the computational complexity. Tang [52] proposed a new image fusion method based on local contrast measurement in the DCT domain; however, the fused images obtained by this method tend to be blurred. In [21], the maximum pixel replacement and pixel average fusion rules are proposed; experimental results show that this method is more sensitive to noise and artifacts. Abdollahzadeh et al. [1] proposed to calculate the Sum-Modified-Laplacian (SML) in the DCT domain.

Multi-modal medical image fusion can improve the clinical accuracy of medical images. Two medical source images are fused through a fusion algorithm so that the fused image contains the effective information of both sources. Medical image fusion based on the wavelet transform has achieved good results. Vijayarajan [53] used a DWT-based averaging principal component fusion method to fuse computed tomography (CT) and magnetic resonance (MRI) images, decomposing the source images into multi-scale inputs and obtaining good experimental results. In [23], the authors proposed an image fusion algorithm based on DWT-DBSS and used the maximum selection rule to obtain the detail fusion coefficients. Rajarshi et al. [46] proposed using the maximum local extrema fusion rule to fuse MRI and CT images; experimental results show that the fused image obtained by the DWT algorithm retains most of the useful information of the source images. However, DWT-based image fusion has low efficiency due to its high computational complexity and long running time. Moreover, the fused image suffers from problems such as blocking effects and quality loss. Therefore, the authors of [39] proposed a new multi-focus image fusion algorithm based on correlation coefficients. In addition, the authors of [40] used singular value decomposition (SVD) to fuse multi-focus images directly in the DCT domain. The results show that fused images obtained by DCT-based algorithms are relatively clear, and the experiments take less time and are more efficient. However, existing DWT- and DCT-based image fusion algorithms are mainly designed for grayscale images. For color images, the three channels are usually processed separately. In [7], the authors used a DCT algorithm to fuse satellite images, processing the multiple channels separately and finally integrating them into the fused image. Such methods ignore the correlation between the image channels, resulting in incomplete information in the fused image.

Fortunately, geometric algebra (GA) provides a computational framework for multi-dimensional signal processing that can treat multi-channel images as a whole [20, 42, 51]. Wang et al. [57] proposed the Sparse Fast Clifford Fourier Transform (SFCFT), which selectively uses input data in scalar and vector fields to deal with big-data problems. Felsberg [18] used Clifford algebra to define a corresponding Clifford-Fourier transform (CFT). Berthier et al. [8] focused on geometric methods based on group actions and performed a Clifford Fourier transform for spectral analysis of color images. Julia et al. [17] proposed a Clifford Fourier transform that extends the Fourier transform to general elements of Clifford algebra. The DCT has been a basic tool for signal and image processing for many years: experiments can be performed directly in the DCT domain, avoiding the complicated image encoding and decoding process, which saves time and improves efficiency. The Geometric Algebra Discrete Cosine Transform (GA-DCT) represents a multi-modal medical image in a holistic way and considers the correlation between channels, so we propose to extend the DCT to the geometric algebra domain to fuse multi-modal medical images.

Building on the development of image fusion algorithms and the application of GA to image fusion, a novel multi-vector image fusion algorithm is proposed. Firstly, the source images are divided into several blocks. Then, each multi-modal medical image block is represented as a multi-vector using GA, and GA-DCT is performed on each block. The fusion coefficients are obtained by averaging the coefficients of the corresponding GA-DCT blocks. The Inverse Geometric Algebra Discrete Cosine Transform (IGA-DCT) is then applied to each block, and the fused image is reconstructed by merging all the blocks. To test the performance of the proposed algorithm, this paper conducts fusion experiments on four sets of multi-modal color medical images of the brain. The experimental results show that the fused images obtained by the proposed algorithm have higher resolution and more comprehensive information, and hold a clear advantage in both subjective visual quality and objective evaluation.

The rest of this paper is organized as follows. Section 2 introduces the fundamentals of geometric algebra. Section 3 introduces the GA-DCT algorithm and the fusion steps of the proposed algorithm in detail. Section 4 presents the experimental analysis, including subjective and objective fusion image quality evaluations. Finally, we conclude in Sect. 5.

2 Geometric Algebra

Geometric algebra (GA) [26], also known as Clifford algebra, was proposed by William K. Clifford and provides a new approach to the research and application of image representation. It supports geometric operations and analysis in high-dimensional spaces [12, 15, 27, 32, 50] and has become an important research tool in theoretical mathematics, computer vision and physics [13, 33].

In this section, we will introduce the relevant knowledge of GA in detail.

2.1 Fundamentals of Geometric Algebra

Let Gn denote an n-dimensional GA. Its set of orthogonal generators \( \left\{ 1, \beta _{1}, \beta _{2}, \ldots , \beta _{n}\right\} \) leads to a complete basis under the geometric product,

$$\begin{aligned} \{ 1 , \{ \beta _ { i } \} , \{ \beta _ { i } \beta _ { j } \} , \ldots , \{ \beta _ { 1 } \beta _ { 2 } \ldots \beta _ { n } \} \}. \end{aligned}$$
(2.1)

The orthogonal basis vectors introduced above are non-commutative and satisfy the following formulas,

$$\begin{aligned}&\beta _{i}^{2}=1, \quad i=1, \ldots , n, \end{aligned}$$
(2.2)
$$\begin{aligned}&\quad \beta _ { i } \beta _ { ij } = \beta _ { i } \beta _ { i } \beta _ { j } = \beta _ { j } , \quad i , j = 1 , \ldots , n, \quad i \ne j, \end{aligned}$$
(2.3)
$$\begin{aligned}&\quad \beta _{i j}=\beta _{i} \beta _{j}=-\beta _{j} \beta _{i}=-\beta _{j i}, \quad i, j=1, \ldots , n, \quad i \ne j. \end{aligned}$$
(2.4)

It can be seen from the above formulas that Gn has \( 2^{n} \) basis elements. For example, G2 contains four basis elements and G3 contains eight. The structure of the bases of G2 and G3 is shown below,

$$\begin{aligned}&G _ { 2 } : \{ 1 , \{ \beta _ { 1 } , \beta _ { 2 } \} , \beta _ { 1 } \beta _ { 2 } \} = \{ 1 , \beta _ { 1 } , \beta _ { 2 } , \beta _ { 12 } \}, \end{aligned}$$
(2.5)
$$\begin{aligned}&\quad \left. \begin{array} { c } { G _ { 3 } : \{ 1 , \{ \beta _ { 1 } , \beta _ { 2 } , \beta _ { 3 } \} , \{ \beta _ { 1 } \beta _ { 2 } , \beta _ { 2 } \beta _ { 3 } , \beta _ { 1 } \beta _ { 3 } \} , \beta _ { 1 } \beta _ { 2 } \beta _ { 3 } \} } \\ { = \{ 1 , \beta _ { 1 } , \beta _ { 2 } , \beta _ { 3 } , \beta _ { 12 } , \beta _ { 23 } , \beta _ { 13 } , \beta _ { 123 } \} }. \end{array} \right. \end{aligned}$$
(2.6)

Just as vectors are the basic elements of linear algebra, multi-vectors are the basic elements of GA. Comparing the forms of complex numbers, quaternions and GA shows that the multi-vector structure of GA is the n-dimensional extension of complex numbers and quaternions [36, 49]. If a multi-vector \( a \in G_{n} \) consists of scalar and vector parts only, then a can be represented as

$$\begin{aligned} a = a _ { 0 } + \sum _ { i = 1 } ^ { n } a _ { i } \beta _ { i }, \end{aligned}$$
(2.7)

where \( a_{0}, a_{1}, \ldots , a_{n} \in \mathbb {R} \).
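For concreteness, a multi-vector can be stored as an array of its \( 2^{n} \) blade coefficients. The following minimal Python sketch (an illustrative aid with hypothetical helper names, not part of the original formulation) does this for G2:

```python
import numpy as np

# Blade order for G2: (1, b1, b2, b12), i.e. 2^2 = 4 coefficients.
BLADES = ("1", "b1", "b2", "b12")

def multivector(a0=0.0, a1=0.0, a2=0.0, a12=0.0):
    """Pack the blade coefficients of a G2 multi-vector into an array."""
    return np.array([a0, a1, a2, a12], dtype=float)

# Eq. (2.7) with n = 2: a = a0 + a1*b1 + a2*b2.
a = multivector(a0=1.0, a1=2.0, a2=3.0)
```

For G3 the array would have \( 2^{3}=8 \) entries, one per blade listed in Eq. (2.6).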

2.2 Basic Operation of Geometric Algebra

The product operation in GA space is called the geometric product. For vectors, the geometric product is composed of an inner product and an outer product. For vectors p and q, the geometric product is defined as follows,

$$\begin{aligned} p q = p \cdot q + p \wedge q, \end{aligned}$$
(2.8)

where \( p \cdot q \) is the scalar part, which represents the inner product within the geometric product, and \( p \wedge q \) is the bivector part, which represents the outer product. Since the outer product is anti-commutative, that is, \( p \wedge q=-q \wedge p \), the geometric product is also non-commutative. The relationship among the geometric, inner and outer products is shown in Eqs. (2.9) and (2.10).

$$\begin{aligned}&p \cdot q = \frac{ 1 }{ 2 } ( p q + q p ), \end{aligned}$$
(2.9)
$$\begin{aligned}&p \wedge q = \frac{ 1 }{ 2 } ( p q - q p ). \end{aligned}$$
(2.10)

If p and q are grade-1 vectors, then \( p \wedge q \) is called a bivector, which is interpreted in geometric algebra as the oriented plane segment spanned by the two vectors, as shown in Fig. 1; the trivector \( p \wedge q \wedge m \) can be interpreted as the oriented volume element spanned by the plane segment \( p \wedge q \) and the vector m, as shown in Fig. 2.
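The geometric product of G2 can be tabulated directly from Eqs. (2.2)-(2.4). The sketch below (with a hypothetical helper gp, written for G2 only) multiplies two multi-vectors via a Cayley table and verifies the symmetric/antisymmetric split of Eqs. (2.9) and (2.10):

```python
import numpy as np

# Cayley table of G2 over the blade order (1, b1, b2, b12):
# CAYLEY[i][j] = (sign, index) of the geometric product blade_i * blade_j,
# derived from b1^2 = b2^2 = 1 and b1*b2 = -b2*b1 (Eqs. 2.2-2.4).
CAYLEY = [[(1, 0), (1, 1), (1, 2), (1, 3)],
          [(1, 1), (1, 0), (1, 3), (1, 2)],
          [(1, 2), (-1, 3), (1, 0), (-1, 1)],
          [(1, 3), (-1, 2), (1, 1), (-1, 0)]]

def gp(a, b):
    """Geometric product of two G2 multi-vectors (length-4 arrays)."""
    out = np.zeros(4)
    for i in range(4):
        for j in range(4):
            s, k = CAYLEY[i][j]
            out[k] += s * a[i] * b[j]
    return out

# Two grade-1 vectors: p = 3*b1 + 1*b2 and q = 1*b1 + 2*b2.
p = np.array([0.0, 3.0, 1.0, 0.0])
q = np.array([0.0, 1.0, 2.0, 0.0])

inner = 0.5 * (gp(p, q) + gp(q, p))  # Eq. (2.9): scalar part only
outer = 0.5 * (gp(p, q) - gp(q, p))  # Eq. (2.10): bivector (b12) part only
print(inner)  # [5. 0. 0. 0.]  ->  p . q = 3*1 + 1*2 = 5
print(outer)  # [0. 0. 0. 5.]  ->  p ^ q = (3*2 - 1*1) * b12 = 5*b12
```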

Fig. 1 Bivector graph

Fig. 2 3D outer product space

2.3 Geometric Algebraic Representation of Multi-modal Image

As we know, a complex number is composed of a scalar part and an imaginary part, and a quaternion is composed of a scalar part and three imaginary parts. The GA space Gn is a geometric extension of Rn. Therefore, any multi-vector \( Z \in G_{n} \) can be expressed as in Eq. (2.11).

$$\begin{aligned} {\mathbf {Z}}=E_{0}({\mathbf {Z}})+\sum _{1 \le i \le n} E_{i}({\mathbf {Z}}) \beta _{i}+\sum _{1 \le i<j \le n} E_{i j}({\mathbf {Z}}) \beta _{i j}+\cdots +E_{1 \ldots n}({\mathbf {Z}}) \beta _{1 \ldots n}. \end{aligned}$$
(2.11)

Expressing a multi-modal image in GA form allows the image to be processed in a holistic manner, which takes the correlation between the channels of a color image into account; this representation is therefore widely used in image processing [14, 54,55,56]. Given a multi-modal image \( K \in G_{n} \), its GA form is

$$\begin{aligned} {\mathbf {K}}=0+\sum _{1 \le i \le n} E_{i}({\mathbf {K}}) \beta _{i}+\sum _{1 \le i<j \le n} E_{i j}({\mathbf {K}}) \beta _{i j}+\cdots +E_{1 \ldots n}({\mathbf {K}}) \beta _{1 \ldots n}, \end{aligned}$$
(2.12)

where \( {E}({\mathbf {K}}) \in \mathbb {R} \) represents the value of each channel of the multi-modal image and \(\beta _ { i }\) denotes an orthogonal basis element of the geometric algebra. All spectral channels of the multi-modal image are represented by one set of orthogonal basis elements. Since the scalar part is not used, it is set to zero.
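As an illustration of Eq. (2.12), an RGB image can be embedded into G2 with a zero scalar part. The sketch below assumes the channel-to-blade assignment used later in Eq. (3.11), namely R to β1, G to β2 and B to β12; the helper names are hypothetical:

```python
import numpy as np

def rgb_to_g2(img):
    """Embed an RGB image of shape (H, W, 3) as a field of G2 multi-vectors.

    Channel-to-blade assignment as in Eq. (3.11): R -> b1, G -> b2,
    B -> b12; the scalar part is zero as in Eq. (2.12).  Output shape is
    (4, H, W) over the blade order (1, b1, b2, b12).
    """
    h, w, _ = img.shape
    K = np.zeros((4, h, w))
    K[1], K[2], K[3] = img[..., 0], img[..., 1], img[..., 2]
    return K

def g2_to_rgb(K):
    """Inverse of rgb_to_g2: drop the (zero) scalar part."""
    return np.stack([K[1], K[2], K[3]], axis=-1)
```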

3 Our Proposed Algorithm

The discrete cosine transform (DCT) is an effective tool in signal and image processing. As it has become more widely used, researchers have tried to extend it to higher-dimensional signals. For multi-modal images, the traditional approach is to first split the multi-modal image into its channels and apply the DCT to each spectral channel separately. The disadvantage of this approach is that it ignores the correlation between the spectral channels. Therefore, this paper proposes a geometric algebra form of the discrete cosine transform, named the geometric algebra discrete cosine transform (GA-DCT). The GA-DCT treats a multi-modal image as a multi-vector and processes it holistically by mapping each spectral channel to a blade of GA.

3.1 Geometric Algebra Discrete Cosine Transform

For a multi-modal image f(x, y) of size \( {M} \times {N} \), and in view of the non-commutativity of geometric algebra, the GA-DCT can be defined in two forms. Formulas (3.1) and (3.2) define the left-sided and right-sided GA-DCT, respectively.

$$\begin{aligned} C_{L}(u, v)=\alpha (u) \alpha (v) \sum _{x=0}^{M-1} \sum _{y=0}^{N-1} \lambda f(x, y) \cos \Big [\frac{\pi (2 x+1) u}{2 M}\Big ] \cos \Big [\frac{\pi (2 y+1) v}{2 N}\Big ], \end{aligned}$$
(3.1)
$$\begin{aligned} C_{R}(u, v)=\alpha (u) \alpha (v) \sum _{x=0}^{M-1} \sum _{y=0}^{N-1} f(x, y) \cos \Big [\frac{\pi (2 x+1) u}{2 M}\Big ] \cos \Big [\frac{\pi (2 y+1) v}{2 N}\Big ] \lambda . \end{aligned}$$
(3.2)

Each form of the GA-DCT has a corresponding inverse transform. Formulas (3.3) and (3.4) give the inverse transforms of the left-sided and right-sided GA-DCT, respectively.

$$\begin{aligned} f_{L}(x, y)=-\sum _{u=0}^{M-1} \sum _{v=0}^{N-1} \alpha (u) \alpha (v) \lambda C(u, v) \cos \Big [\frac{\pi (2 x+1) u}{2 M}\Big ] \cos \Big [\frac{\pi (2 y+1) v}{2 N}\Big ], \end{aligned}$$
(3.3)
$$\begin{aligned} f_{R}(x, y)=-\sum _{u=0}^{M-1} \sum _{v=0}^{N-1} \alpha (u) \alpha (v) C(u, v) \cos \Big [\frac{\pi (2 x+1) u}{2 M}\Big ] \cos \Big [\frac{\pi (2 y+1) v}{2 N}\Big ] \lambda , \end{aligned}$$
(3.4)

where \( \lambda \) is a GA multi-vector of unit magnitude with no scalar part, i.e., \( \lambda \) satisfies the following properties,

$$\begin{aligned} \lambda =\sum _{1 \le i \le n} E_{i}(\lambda ) \beta _{i}+\sum _{1 \le i<j \le n} E_{i j}(\lambda ) \beta _{i j}+\cdots +E_{1 \ldots n}(\lambda ) \beta _{1 \ldots n}, \quad E(\lambda ) \in \mathbb {R}, \end{aligned}$$
(3.5)
$$\begin{aligned}&\quad | \lambda | ^ { 2 } = \lambda * \tilde{ \lambda } = 1. \end{aligned}$$
(3.6)

Similar to the traditional DCT, \( \alpha (u) \) and \( \alpha (v) \) are defined in formula (3.7),

$$\begin{aligned} \alpha (u)=\left\{ \begin{array}{ll}\frac{1}{\sqrt{M}}, &{} u=0 \\ \sqrt{\frac{2}{M}}, &{} u \ne 0,\end{array}\right. \quad \alpha (v)=\left\{ \begin{array}{ll}\frac{1}{\sqrt{N}}, &{} v=0 \\ \sqrt{\frac{2}{N}}, &{} v \ne 0.\end{array}\right. \end{aligned}$$
(3.7)
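Since the cosine kernel in Eqs. (3.1)-(3.4) is scalar, the right-sided GA-DCT reduces to a per-blade orthonormal 2-D DCT followed by a right geometric product with λ. The sketch below is one possible realization for G2, assuming λ = β12 (a unit multi-vector with zero scalar part satisfying Eqs. (3.5)-(3.6); its property λ² = −1 is what the leading minus sign of the inverse transform cancels for this particular choice):

```python
import numpy as np
from scipy.fft import dctn, idctn

# Assumed choice of lambda: the G2 pseudoscalar beta_12.  It has zero
# scalar part and unit magnitude (Eqs. 3.5-3.6), and lambda^2 = -1,
# which the leading minus sign of Eqs. (3.3)-(3.4) cancels.
def rmul_lambda(a):
    """Right geometric product a * beta_12 over blade order (1, b1, b2, b12)."""
    a0, a1, a2, a12 = a
    return np.stack([-a12, -a2, a1, a0])

def ga_dct(block):
    """Right-sided GA-DCT (Eq. 3.2) of a (4, n, n) G2 multi-vector block.

    The cosine kernel is scalar, so the transform is a per-blade
    orthonormal 2-D DCT (the alpha(u)alpha(v) scaling of Eq. 3.7)
    followed by right-multiplication with lambda.
    """
    return rmul_lambda(dctn(block, axes=(1, 2), norm="ortho"))

def iga_dct(coeffs):
    """Right-sided IGA-DCT (Eq. 3.4)."""
    return -rmul_lambda(idctn(coeffs, axes=(1, 2), norm="ortho"))

# Round trip on a random 8x8 block with zero scalar part:
blk = np.random.rand(4, 8, 8)
blk[0] = 0.0
assert np.allclose(iga_dct(ga_dct(blk)), blk)
```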

3.2 Algorithm Details

Consider a multi-modal image \( F \in \left( G_{n}\right) ^{M \times N} \), which is divided into \( n \times n \) pixel blocks. Let \( \left\{ f_{i, j}\right\} \) be an \( n \times n \) block of a source image and \( \left\{ D_{u, v}\right\} \) be its GA-DCT coefficients. In this paper, the right-sided GA-DCT and IGA-DCT are adopted for multi-modal medical image fusion. The GA-DCT of an image block is shown in formula (3.8),

$$\begin{aligned} D_{u, v}=\alpha (u) \alpha (v) \sum _{i=0}^{n-1} \sum _{j=0}^{n-1} f_{i, j} \cos \left[ \frac{\pi (2 i+1) u}{2 n}\right] \cos \left[ \frac{\pi (2 j+1) v}{2 n}\right] \lambda , \end{aligned}$$
(3.8)

where \( u, v=0,1, \ldots , n-1 \).

The source image block \( \left\{ f_{i, j}\right\} \) can be recovered from the GA-DCT coefficients by employing the IGA-DCT as shown in formula (3.9),

$$\begin{aligned} f_{i, j}=-\sum _{u=0}^{n-1} \sum _{v=0}^{n-1} \alpha (u) \alpha (v) D_{u, v} \cos \left[ \frac{\pi (2 i+1) u}{2 n}\right] \cos \left[ \frac{\pi (2 j+1) v}{2 n}\right] \lambda , \end{aligned}$$
(3.9)

where \(i, j=0,1, \ldots , n-1\).

The GA-DCT coefficients of an image block of size \( n \times n \) are arranged as in Eq. (3.10). The source image is usually divided into \( 8 \times 8 \) blocks; each image block is then a 64-point discrete signal. GA-DCT takes these samples as input and decomposes them over 64 orthogonal basis signals, so the output of GA-DCT consists of the amplitudes of these 64 basis signals, which are the GA-DCT coefficients. The transform coefficients are functions of the two-dimensional frequency-domain variables u and v. The coefficient corresponding to \( u=0 \) and \( v=0 \) is called the DC component (the DC coefficient), and the remaining 63 coefficients are called AC components (AC coefficients). Each data block is therefore a matrix of 64 coefficients. Among them, the DC coefficient is located in the upper left corner of the block and is proportional to the average of the 64 samples; the remaining 63 entries are the AC coefficients. The farther a coefficient lies from the DC component, the higher the spatial frequency of the AC component it represents. The distribution of the GA-DCT frequency band coefficients is shown in Fig. 3.

$$\begin{aligned} D=\left[ \begin{array}{llllllll}d_{00} &{} d_{01} &{} d_{02} &{} d_{03} &{} d_{04} &{} d_{05} &{} d_{06} &{} d_{07} \\ d_{10} &{} d_{11} &{} d_{12} &{} d_{13} &{} d_{14} &{} d_{15} &{} d_{16} &{} d_{17} \\ d_{20} &{} d_{21} &{} d_{22} &{} d_{23} &{} d_{24} &{} d_{25} &{} d_{26} &{} d_{27} \\ d_{30} &{} d_{31} &{} d_{32} &{} d_{33} &{} d_{34} &{} d_{35} &{} d_{36} &{} d_{37} \\ d_{40} &{} d_{41} &{} d_{42} &{} d_{43} &{} d_{44} &{} d_{45} &{} d_{46} &{} d_{47} \\ d_{50} &{} d_{51} &{} d_{52} &{} d_{53} &{} d_{54} &{} d_{55} &{} d_{56} &{} d_{57} \\ d_{60} &{} d_{61} &{} d_{62} &{} d_{63} &{} d_{64} &{} d_{65} &{} d_{66} &{} d_{67} \\ d_{70} &{} d_{71} &{} d_{72} &{} d_{73} &{} d_{74} &{} d_{75} &{} d_{76} &{} d_{77}\end{array}\right] \end{aligned}$$
(3.10)
Fig. 3 Frequency band information of GA-DCT coefficients
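A quick numerical check of the DC-coefficient interpretation (an illustrative sketch using the orthonormal scaling of Eq. (3.7); for an 8 × 8 block the DC entry equals the block sum divided by 8, i.e. 8 times the mean of the 64 samples):

```python
import numpy as np
from scipy.fft import dctn

blk = np.random.rand(8, 8)           # one blade channel of an 8x8 block
D = dctn(blk, norm="ortho")          # orthonormal 2-D DCT, Eq. (3.7) scaling
# DC entry: alpha(0)*alpha(0)*sum = sum/8 = 8 * mean of the 64 samples.
assert np.isclose(D[0, 0], 8 * blk.mean())
```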

According to the above introduction, a multi-modal medical image fusion rule can be designed. The common fusion rules based on the discrete cosine transform include the local energy maximum rule, the image contrast maximum rule and the coefficient average rule. This paper uses the GA-DCT with the coefficient average rule. Let M1 and M2 be two source color images of size \( M \times N \), suppose they are divided into \( n \times n \) blocks, and represent each block in GA multi-vector form. Let \( X=x_{i, j} \) and \( Y=y_{i, j} \) be the GA forms of two corresponding image blocks of the source color images M1 and M2,

$$\begin{aligned} \left. \begin{array} { l } { x _ { i , j } = x _ { R } \beta _ { 1 } + x _ { G } \beta _ { 2 } + x _ { B } \beta _ { 12 } }, \\ { y _ { i , j } = y _ { R } \beta _ { 1 } + y _ { G } \beta _ { 2 } + y _ { B } \beta _ { 12 } }, \end{array} \right. \end{aligned}$$
(3.11)

where \( x_{i, j} \) and \( y_{i, j} \) represent blocks of the two source images respectively.

Then, (3.8) can be applied to obtain the GA-DCT coefficients of \( x_{i, j} \) and \( y_{i, j} \), denoted \( D_{x}=\left\{ d_{x, u, v}\right\} \) and \( D_{y}=\left\{ d_{y, u, v}\right\} \). The coefficients of the fused image are obtained by averaging the corresponding block coefficients of the two source images, as shown in (3.12),

$$\begin{aligned} D _ { f , u , v } = 0.5 \times ( d _ { x , u , v } + d _ { y , u , v } ), \end{aligned}$$
(3.12)

where \( d_{x, u, v} \) and \( d_{y, u, v} \) are the corresponding DC and AC coefficients of the input image blocks \( x_{i, j} \) and \( y_{i, j} \) respectively. The fused image block is then obtained by applying the IGA-DCT.

These steps are repeated for all image blocks to obtain the fused blocks of the two source images, and all fused blocks are then combined to produce the final fused image.

In conclusion, the steps of the GA-DCT algorithm are:

1. Let M1 and M2 represent two source color images of size \( M \times N \) and suppose that they can be divided into \( n \times n \) blocks;

2. Represent each divided image block in geometric algebra form;

3. Perform GA-DCT on each block to obtain the transform coefficients;

4. Apply the coefficient-averaging fusion rule to the corresponding coefficients of the two source images to obtain the coefficients of the fused image;

5. Apply IGA-DCT to obtain the fused image.

Figure 4 shows the framework of the multi-modal image fusion based on the GA-DCT.
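Putting steps 1-5 together, the following minimal end-to-end sketch reuses the hypothetical rgb_to_g2, g2_to_rgb, ga_dct and iga_dct helpers from the earlier sketches and assumes image sides divisible by the block size:

```python
import numpy as np

def fuse_ga_dct(img1, img2, n=8):
    """Fuse two equally sized RGB images via GA-DCT coefficient averaging.

    Sketch of steps 1-5: block partition, GA embedding, GA-DCT,
    coefficient averaging (Eq. 3.12), IGA-DCT, block reassembly.
    Assumes H and W are multiples of the block size n.
    """
    K1, K2 = rgb_to_g2(img1), rgb_to_g2(img2)    # step 2
    fused = np.zeros_like(K1)
    _, H, W = K1.shape
    for r in range(0, H, n):                     # step 1: n x n blocks
        for c in range(0, W, n):
            D1 = ga_dct(K1[:, r:r+n, c:c+n])     # step 3
            D2 = ga_dct(K2[:, r:r+n, c:c+n])
            Df = 0.5 * (D1 + D2)                 # step 4: Eq. (3.12)
            fused[:, r:r+n, c:c+n] = iga_dct(Df) # step 5
    return g2_to_rgb(fused)
```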

Fig. 4 The framework of multi-modal image fusion based on GA-DCT

4 Experimental Analysis

To test the effectiveness of the proposed algorithm, experiments are conducted on four sets of multi-modal medical images of the brain in the MATLAB environment. For comparison, we choose five commonly used fusion algorithms with good fusion performance: Laplacian Pyramid [35], DWT-DBSS [23], SIDWT-Haar [58], Morphological Difference Pyramid [37] and DCT based on variance [3]. The source image sets are selected from the medical image database provided by Harvard Medical School [11]. Each image set contains a SPECT-T1 image and a SPECT-TC image, each of size \( 256 \times 256 \).

4.1 Evaluation Standard

The performance of image fusion algorithms is usually evaluated with subjective and objective indicators. For subjective measurement, we mainly compare the fusion results through visual observation. Objective quality evaluation of color fused images usually requires an ideal fused image, but it is difficult to agree on a standard for such an ideal; in this article, we take the two source images as the reference images. The most widely used fused image quality evaluation indicators include Multi-scale Structural Similarity (MSSSIM) [29], Peak Signal-to-Noise Ratio (PSNR) [28], Root-Mean-Square Error (RMSE) [47], Mutual Information (MI) [45], Entropy [41] and Correlation Coefficient (CC) [31]. This article uses these six objective evaluation standards to quantify the fused images.

SSIM measures the structural similarity between the source images and the fused image. Its value lies between 0 and 1: the larger the SSIM value, the more similar the fused image is to the source images and the better the fusion effect. The calculation of SSIM is shown in formulas (4.1) and (4.2).

$$\begin{aligned} {\text {SSIM}}_{(x, y, f)}=0.5 \times \left( {\text {SSIM}}_{(x, f)}+{\text {SSIM}}_{(y, f)}\right) . \end{aligned}$$
(4.1)

where

$$\begin{aligned} \begin{aligned} {\text {SSIM}}_{(x, f)}&=\frac{\left( 2 \mu _{x} \mu _{f}+C_{1}\right) \left( 2 \sigma _{x f}+C_{2}\right) }{\left( \mu _{x}^{2}+\mu _{f}^{2}+C_{1}\right) \left( \sigma _{x}^{2}+\sigma _{f}^{2}+C_{2}\right) }, \\ {\text {SSIM}}_{(y, f)}&=\frac{\left( 2 \mu _{y} \mu _{f}+C_{1}\right) \left( 2 \sigma _{y f}+C_{2}\right) }{\left( \mu _{y}^{2}+\mu _{f}^{2}+C_{1}\right) \left( \sigma _{y}^{2}+\sigma _{f}^{2}+C_{2}\right) }, \end{aligned} \end{aligned}$$
(4.2)

\( \mu _{x}, \mu _{y} \) and \( \mu _{f} \) represent the mean values of the source images x, y and the fused image f; \( \sigma _{x}^{2}, \sigma _{y}^{2} \) and \( \sigma _{f}^{2} \) represent the variances of the source images and the fused image respectively; \( \sigma _{x f} \) and \( \sigma _{y f} \) represent the covariances between the two source images and the fused image respectively; \( \mathrm {C}_{1} \) and \( \mathrm {C}_{2} \) are constants that avoid a zero denominator and maintain stability, with \( C_{1}=\left( K_{1} \times L\right) ^{2} \), \( C_{2}=\left( K_{2} \times L\right) ^{2} \) and usually \(K _ { 1 } = 0.01 , K _ { 2 } = 0.03 , L = 255\).
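A direct transcription of Eqs. (4.1)-(4.2) using global image statistics (a sketch; windowed SSIM implementations, such as the multi-scale variant cited above, differ in detail):

```python
import numpy as np

K1, K2, L = 0.01, 0.03, 255.0
C1, C2 = (K1 * L) ** 2, (K2 * L) ** 2

def ssim_pair(x, f):
    """Global-statistics SSIM between one source x and the fused f (Eq. 4.2)."""
    mx, mf = x.mean(), f.mean()
    vx, vf = x.var(), f.var()
    cov = ((x - mx) * (f - mf)).mean()
    return ((2 * mx * mf + C1) * (2 * cov + C2)) / \
           ((mx**2 + mf**2 + C1) * (vx + vf + C2))

def fusion_ssim(x, y, f):
    """Eq. (4.1): average similarity of the fused image to both sources."""
    return 0.5 * (ssim_pair(x, f) + ssim_pair(y, f))
```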

PSNR is an indicator based on the mean square error. The higher the PSNR value, the closer the fused image is to the source images. The PSNR calculation is shown in formula (4.3),

$$\begin{aligned} P S N R=10 \times \log _{10}\left( \frac{L^{2}}{M S E}\right) =20 \times \log _{10}\left( \frac{L}{R M S E}\right) . \end{aligned}$$
(4.3)

RMSE denotes the root-mean-square error between the source image and the fused image, and is inversely related to the quality of the fused image: the lower the RMSE value, the better the quality. The calculation formula is

$$\begin{aligned} R M S E=\sqrt{\frac{\sum _{m=1}^{M} \sum _{n=1}^{N}[{\text {source}}(m, n)-{\text {fused}}(m, n)]^{2}}{M \times N}}. \end{aligned}$$
(4.4)

MI represents the degree of interdependence between the source image and the fused image. The greater the MI value, the better the fusion effect.

$$\begin{aligned} M I=\frac{J E(x, f)+J E(y, f)}{I E_{x}+I E_{y}}. \end{aligned}$$
(4.5)

where

$$\begin{aligned} \left. \begin{array} { l } { JE ( x , f ) = \sum _ { i = 0 } ^ { L - 1 } \sum _ { k = 0 } ^ { L - 1 } P _ { x , f } ( i , k ) \log \frac{ P _ { x , f } ( i , k ) }{ P _ { x } ( i ) \times P _ { f } ( k ) } }, \\ { JE ( y , f ) = \sum _ { i = 0 } ^ { L - 1 } \sum _ { k = 0 } ^ { L - 1 } P _ { y , f } ( i , k ) \log \frac{ P _ { y , f } ( i , k ) }{ P _ { y } ( i ) \times P _ { f } ( k ) } }, \end{array} \right. \end{aligned}$$
(4.6)

JE(x, f) and JE(y, f) denote the joint entropy between each source image and the fused image respectively. IE denotes the information entropy of an image.

Entropy reflects the richness of image information from the perspective of information theory. The information entropy reflects the amount of information carried by an image: the greater the entropy, the richer the information and the better the quality. The formula is

$$\begin{aligned} E N=-\sum _{i=0}^{L-1} P_{i} \times \log _{2} P_{i}, \end{aligned}$$
(4.7)

where L represents the number of gray levels and \( P_{i} \) represents the proportion of pixels with gray value i among all pixels. The larger the EN, the larger the amount of information in the fused image.

The Correlation Coefficient reflects the degree of correlation between the fused image and the source image. The larger the correlation coefficient, the higher the similarity between the two images. The calculation formula is as follows,

$$\begin{aligned} C C(X, Y)=\frac{\sum _{i=1}^{M} \sum _{j=1}^{N}\left( X_{i, j}-\bar{X}\right) \left( Y_{i, j}-\bar{Y}\right) }{\sqrt{\left( \sum _{i=1}^{M} \sum _{j=1}^{N}\left( X_{i, j}-\bar{X}\right) ^{2}\right) \left( \sum _{i=1}^{M} \sum _{j=1}^{N}\left( Y_{i, j}-\bar{Y}\right) ^{2}\right) }}, \end{aligned}$$
(4.8)

where X and Y represent the source image and the fused image respectively.
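The remaining indicators, Eqs. (4.3)-(4.8), can be transcribed as follows. This is a sketch assuming single-channel images with integer gray levels in [0, 255]; the helper names are hypothetical, and a base-2 logarithm is assumed in Eq. (4.6):

```python
import numpy as np

def rmse(src, fused):
    """Eq. (4.4)."""
    return np.sqrt(np.mean((src.astype(float) - fused.astype(float)) ** 2))

def psnr(src, fused, L=255.0):
    """Eq. (4.3)."""
    return 20 * np.log10(L / rmse(src, fused))

def entropy(img, levels=256):
    """Eq. (4.7): Shannon entropy of the gray-level histogram (integer input)."""
    p = np.bincount(img.ravel(), minlength=levels) / img.size
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

def joint_entropy_term(a, f, levels=256):
    """JE(a, f) of Eq. (4.6) via the joint gray-level histogram."""
    joint = np.histogram2d(a.ravel(), f.ravel(), bins=levels,
                           range=[[0, levels], [0, levels]])[0] / a.size
    pa, pf = joint.sum(axis=1), joint.sum(axis=0)
    nz = joint > 0
    return np.sum(joint[nz] * np.log2(joint[nz] / np.outer(pa, pf)[nz]))

def mutual_information(x, y, f):
    """Eq. (4.5)."""
    return (joint_entropy_term(x, f) + joint_entropy_term(y, f)) / \
           (entropy(x) + entropy(y))

def cc(X, Y):
    """Eq. (4.8): correlation coefficient of two images."""
    xd, yd = X - X.mean(), Y - Y.mean()
    return np.sum(xd * yd) / np.sqrt(np.sum(xd**2) * np.sum(yd**2))
```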

Fig. 5 Fusion results of image set 1: a source image SPECT-T1, b source image SPECT-TC, c Laplacian Pyramid, d DWT-DBSS, e SIDWT-Haar, f Morphological Difference Pyramid, g DCT-Variance, h GA-DCT-Average

Fig. 6 Fusion results of image set 2: a source image SPECT-T1, b source image SPECT-TC, c Laplacian Pyramid, d DWT-DBSS, e SIDWT-Haar, f Morphological Difference Pyramid, g DCT-Variance, h GA-DCT-Average

Fig. 7 Fusion results of image set 3: a source image SPECT-T1, b source image SPECT-TC, c Laplacian Pyramid, d DWT-DBSS, e SIDWT-Haar, f Morphological Difference Pyramid, g DCT-Variance, h GA-DCT-Average

Fig. 8 Fusion results of image set 4: a source image SPECT-T1, b source image SPECT-TC, c Laplacian Pyramid, d DWT-DBSS, e SIDWT-Haar, f Morphological Difference Pyramid, g DCT-Variance, h GA-DCT-Average

4.2 Subjective Fusion Image Quality Evaluations

The visual results of the fusion experiments on image sets 1-4 are shown in Figs. 5, 6, 7 and 8 respectively. Each figure shows the brain medical source images and the fused images obtained using the Laplacian Pyramid, DWT-DBSS, SIDWT-Haar, Morphological Difference Pyramid, DCT-Variance and GA-DCT-Average algorithms.

Subjectively, Figs. 5, 6, 7 and 8c-f are the fused images obtained by the Laplacian Pyramid, DWT-DBSS, SIDWT-Haar and Morphological Difference Pyramid algorithms. It can be clearly seen that the boundary regions of these fused images are relatively complete, but the central region is dark overall, and the sharpness and contrast are very low. This indicates that these four algorithms do not fuse the two source images well, distorting the fused image and leaving its information incomplete. From Figs. 5, 6, 7 and 8g, we can see that the resolution and contrast of the fused image obtained by the DCT-Variance algorithm are improved. However, comparing the white frames in Figs. 5 and 6, it is obvious that image (g) contains a large red area that obscures the original information and may mislead medical workers. The four sets of images obtained by the DCT-Variance algorithm also lose key areas of source image (a), as shown by the red frame in each image (g). This means that the DCT-Variance algorithm cannot accurately fuse the information of the source images, which is likely to cause confusion in subjective judgment and makes it harder for doctors to obtain accurate information. Figures 5, 6, 7 and 8h are the fused images obtained by the GA-DCT-Average algorithm. The fusion results obtained by GA-DCT-Average are generally clearer than the others, and the fused images contain essentially all the key information of the source images.

Table 1 Qualitative results of image-set 1
Table 2 Qualitative results of image-set 2
Table 3 Qualitative results of image-set 3
Table 4 Qualitative results of image-set 4

4.3 Objective Fusion Image Quality Evaluations

Tables 1, 2, 3 and 4 show the objective quality evaluation of the results obtained by fusing the four groups of images with the different fusion algorithms. The bold values in each table mark the algorithm with the best score for each index among the six algorithms.

Table 5 Time consumption of different fusion algorithms

Objectively, the four groups of fused images obtained by the GA-DCT-Average algorithm have a clear advantage in the PSNR and RMSE indicators, which shows that, by these two measures, its fusion results are closer to the source images and its fusion effect is better than that of the other algorithms. The correlation coefficients of the images obtained by GA-DCT-Average are significantly higher than those of the other algorithms in Tables 1, 2 and 3, while in Table 4 the result is only slightly lower than that of the Laplacian Pyramid algorithm. This indicates that the images obtained by GA-DCT-Average have the highest correlation with, and are the most similar to, the source images. From Tables 1, 2, 3 and 4 we can also see that the GA-DCT-Average results are close to the best on the SSIM indicator, with only a slight gap to the DCT-Variance algorithm. Entropy indicates the amount and richness of information carried by an image. Tables 1 and 2 show that the fused images obtained by GA-DCT-Average have the highest entropy, indicating that these images carry the most information and have better quality; Tables 3 and 4 show that on the entropy indicator the GA-DCT-Average results are only slightly lower than those of the DCT-Variance algorithm. In general, the proposed algorithm also holds an advantage on the objective evaluation indicators.

4.4 Time Consumption with Different Fusion Algorithms

Running time is an important criterion for evaluating the performance of an algorithm. Table 5 compares the time consumed by the six algorithms on the four sets of medical images. Since the running times of the six algorithms are very short, each individual measurement carries some error; to ensure the accuracy of the data, the average over ten runs is taken as the time consumed by each algorithm. It can be seen from Table 5 that the algorithm proposed in this article takes longer than the others because of the complexity of some geometric algebra computations. In general, all six algorithms require relatively little time and are reasonably efficient.

Fig. 9 Fusion performance of the first group medical image under various compression ratios

Fig. 10 Fusion performance of the second group medical image under various compression ratios

Fig. 11 Fusion performance of the third group medical image under various compression ratios

Fig. 12 Fusion performance of the fourth group medical image under various compression ratios

4.5 Fusion Performance with Different Compression Ratios

Figures 9, 10, 11 and 12 show the PSNR values of the four groups of color medical images fused by the six fusion algorithms under different compression ratios [4]. The compression ratio is defined as the ratio between the compressed image and the source image. It can be seen from Figs. 9, 10, 11 and 12 that the PSNR value of GA-DCT-Average is significantly higher than that of the other algorithms at every ratio, and it keeps increasing as the compression ratio grows. This means that the proposed algorithm retains a clear advantage under different compression ratios.

Overall, the algorithm proposed in this paper holds a clear advantage in multi-modal medical image fusion, both subjectively and objectively. Its fusion effect is better than that of several common fusion algorithms, which can provide great help to medical staff in diagnosis.

5 Conclusion

This paper proposes a multi-modal medical image fusion algorithm based on the GA-DCT and conducts fusion experiments on four groups of brain medical color images. Considering the connection between the color image channels, we use a multi-vector to represent each source image as a whole. Firstly, the source images are divided into several blocks, each expressed as a multi-vector in GA form; then GA-DCT is applied to each block, and the DC and AC coefficients of the corresponding blocks of the two source images are averaged to form the coefficients of the fused image; finally, IGA-DCT is performed to obtain the result. Experimental results show that the proposed algorithm overcomes the problem of image blur and considerably improves sharpness and contrast. Under different compression ratios, the PSNR of the fused image obtained by the proposed algorithm is better than that of the other algorithms, so it can serve as an effective method for multi-modal medical image fusion.

According to these results, the performance of the proposed algorithm is improved compared with traditional algorithms, but it does not lead on every objective evaluation indicator. The algorithm therefore needs continued improvement, and the application of geometric algebra based sparse representation and neural networks to image fusion will be studied in subsequent research.