
3.1 Introduction

In most imaging applications, a high resolution image is desired and often required. Here, "high resolution" means not only a larger number of pixels but also greater resolving power; resolution is thus related to the ability to distinguish details in an image.

The International Organization for Standardization (ISO) has described a precise method to measure the resolution of a digital camera [1]. The resolution can be measured as the highest frequency pattern of black and white lines where the individual black and white lines can still be visually distinguished in the image. It is expressed in line widths per picture height.

The standard also describes a method to compute the spatial frequency response (SFR) of a digital camera, the digital imaging equivalent of the modulation transfer function (MTF) used in analogue imaging systems. The SFR describes the visible variation between the maximum and minimum values as a function of spatial frequency, i.e., the number of black and white lines per millimeter. It can be measured using an image of a slanted black and white edge, and is expressed in relative spatial frequencies (relative to the sampling frequency), line widths per picture height, or cycles per millimeter on the image sensor. The resolution chart used in the ISO standard is shown in Fig. 3.1.

Fig. 3.1

ISO resolution chart used to compute the SFR of a digital camera

Image resolution is one of the most important factors in digital camera design, since cameras are widely used to capture images for numerous imaging applications. Digital cameras have evolved rapidly toward a steadily increasing number of pixels. From about 0.3 Mega-pixels in 1993, the number of pixels on the charge-coupled device (CCD) or complementary metal oxide semiconductor (CMOS) sensor in a digital camera has increased by several orders of magnitude in some of the latest professional models. This pixel count has become the major selling argument for camera manufacturers. Although the pixel count keeps increasing, the current resolution grade of digital cameras and their price do not satisfy consumer demands and may not satisfy future demands either. Thus, a way to increase the current resolution level is needed.

The most direct way to increase the spatial resolution of an image is to reduce the pixel size of the image sensor. This requires advanced sensor manufacturing technology and is very costly. Its most critical problem is shot noise, which degrades image quality severely, because the amount of light collected per pixel decreases as the pixel size shrinks. The pixel-size reduction approach therefore faces a hard limit, and the current state of the technology is almost saturated.

The other hardware approach, the reverse of the one above, is to increase the chip size, which leads to an increase in capacitance [2]. The larger capacitance makes it difficult to speed up the charge transfer rate, so this approach is not considered effective either.

Therefore, a novel approach that overcomes the above limitations of sensor and optics manufacturing technologies is required. One promising approach is to use digital image signal processing techniques to obtain a high resolution image or video sequence from multiple observed low resolution images. Such resolution enhancement has recently been one of the most active research areas, and it is called super resolution, or simply resolution enhancement, in the literature [2–62]. The major advantages of super resolution algorithms are their low cost and the fact that existing low resolution images can still be utilized. Here, "using the existing low resolution images" means that the high resolution image obtained by a super resolution algorithm consists of real data from several low resolution images, rather than artificial data computed from just one image.

The basic condition for super resolution techniques is that multiple low resolution images are captured from the same scene and are sub-sampled and aliased, as well as shifted with sub-pixel displacements. If the low resolution images have different sub-pixel displacements from each other and aliasing is present, then a high resolution image can be obtained, since the new information in each low resolution image can be exploited, as shown in Fig. 3.2a. If, however, the low resolution images are shifted by integer pixel units, it is difficult to generate a high resolution image because each image contains the same information, as shown in Fig. 3.2b. That is, there is no new information that can be used to reconstruct a high resolution image.
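The role of sub-pixel displacement can be illustrated with a minimal one-dimensional sketch (Python with NumPy; the signal and sampling factors are hypothetical, not from the text): two low resolution observations that differ by half a low resolution pixel interleave into the full high resolution grid, whereas an integer low resolution pixel shift contributes no new samples.

```python
import numpy as np

# Stand-in high resolution signal (toy data).
hr = np.arange(16, dtype=float)

lr_a = hr[0::2]   # low resolution observation, no shift
lr_b = hr[1::2]   # shifted by one HR pixel = half an LR pixel

# Fusing the two sub-pixel-shifted observations recovers every HR sample.
merged = np.empty_like(hr)
merged[0::2] = lr_a
merged[1::2] = lr_b

# An integer LR-pixel shift (two HR pixels) samples the same grid:
# every value it sees is already present in lr_a, so nothing new is gained.
lr_c = hr[2::2]
```

The first pair corresponds to Fig. 3.2a (complementary samples), the second to Fig. 3.2b (redundant samples).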

Fig. 3.2

Basic condition for Super resolution. a Sub-pixel displacement. b Integer-pixel displacement

Generally, the super resolution problem subsumes image restoration techniques [63, 64], which produce high quality images from noisy, blurred images, although its main concern is to reconstruct a high resolution image from under-sampled low resolution images. Therefore, the goal of super resolution techniques is to restore a high resolution image from several degraded and aliased low resolution images, as illustrated in Fig. 3.3.

Fig. 3.3

Process sequence of Super resolution technique

The big difference between restoration and super resolution is that restoration does not change the size of the image. In fact, restoration and super resolution reconstruction are closely related theoretically, and super resolution reconstruction can be considered a second-generation problem of image restoration.

Another problem related to super resolution reconstruction is image interpolation, which is used to increase the size of a single image. Although this field has been extensively studied [65–67], the quality of an image magnified from an aliased low resolution image is inherently limited even when the ideal "sinc" basis function is employed. That is, single-image interpolation cannot recover the high-frequency components lost or degraded during the low resolution sampling process. For this reason, image interpolation methods are not considered super resolution techniques.
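Why no single-image interpolator can undo aliasing can be seen with a small sketch (the frequencies and lengths are illustrative assumptions): two distinct high resolution signals that become identical after undersampling, so any interpolator acting on the low resolution samples alone necessarily produces the same answer for both.

```python
import numpy as np

n = 64
t = np.arange(n)
f = 0.1  # base frequency in cycles per HR sample (assumed)

# A band-limited signal and a high-frequency signal that aliases onto it:
# cos(2*pi*(f+0.5)*2k) = cos(2*pi*f*2k + 2*pi*k) = cos(2*pi*f*2k).
hr_low = np.cos(2 * np.pi * f * t)
hr_high = np.cos(2 * np.pi * (f + 0.5) * t)

lr_low = hr_low[::2]    # identical low resolution observations,
lr_high = hr_high[::2]  # although the HR signals clearly differ
```

Since the two undersampled sequences coincide sample for sample, the high-frequency content is irrecoverable from one image; multiple shifted observations are needed, which is exactly the super resolution setting.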

To achieve further improvement in this field, the next step is to utilize multiple data sets, in which additional data constraints from several observations of the same scene are available. The fusion of information from various observations of the same scene allows super resolution reconstruction of the scene.

3.2 Observation Model

To analyse the super resolution problem comprehensively, the relation between a high resolution image and several low resolution images must be defined. One famous and widely used image formation model is the observation model. Its basic idea is that if we know how several low resolution images are generated from a high resolution image, then we can reconstruct the high resolution image from the low resolution images by reversing the observation model.

In this chapter, we employ the observation model for video sequences, since our goal is to obtain super resolution images in a general video recording system. Let f(x, y, t) denote the dynamic scene being captured, continuous in time and space. If the scene is sampled according to the Nyquist criterion in time and space, it is represented by the high resolution sequence f l (m, n), where \( l = 1, \ldots ,L \), \( m = 0, \ldots ,PM - 1 \), and \( n = 0, \ldots ,PN - 1 \) are the discrete temporal and spatial coordinates, respectively.

For reasons that will become clear shortly, the parameter P is referred to as the magnification factor. Note that although different magnification factors P r and P c could be used for rows and columns, respectively, for simplicity and without loss of generality we use the same factor P for both directions. It is, however, important to note that, depending on the available low resolution images, we may not be able to improve the spatial resolution in both directions to the same degree.

Before we proceed, a matrix-vector representation of images and image sequences is introduced, to be used alongside the point-wise representation. Using matrix-vector notation, each PM × PN image can be transformed into a (PM × PN) × 1 column vector, obtained by lexicographic ordering of the image.

The (PM × PN) × 1 vector that represents the l-th image in the high resolution sequence is denoted by f l , with \( l = 1, \ldots ,L \). Additionally, if all frames f l , \( l = 1, \ldots ,L \), are lexicographically ordered, the vector f of dimensions (L × PM × PN) × 1 is obtained.
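The lexicographic ordering described above can be sketched in a few lines (sizes are illustrative assumptions, not values from the text):

```python
import numpy as np

# Magnification factor and frame sizes (toy values).
P, M, N, L = 2, 3, 4, 5
frames = [np.random.rand(P * M, P * N) for _ in range(L)]

# Each PM x PN frame becomes a (PM*PN) x 1 column vector f_l
# (row-major order, i.e. lexicographic ordering).
f_l = [frame.reshape(-1, 1) for frame in frames]

# Stacking all L ordered frames gives the (L*PM*PN) x 1 vector f.
f = np.vstack(f_l)
```

The ordering is lossless: reshaping any f_l back to PM × PN recovers the original frame.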

The high resolution sequence f is input to the imaging system which generates the low resolution images denoted by g as illustrated in Fig. 3.4. The goal of super resolution is to obtain a high resolution frame, f k , from the available low resolution images. All of the described techniques, however, may be applied to the super resolution of video by using, for example, a sliding window approach, as illustrated in Fig. 3.5. Alternatively, temporally recursive techniques can be developed in estimating a super resolution sequence of images. To obtain f k , the imaging system and the temporal relationship between high resolution and low resolution sequences need to be modeled.

Fig. 3.4

Low resolution video acquisition model

Fig. 3.5

Obtaining sequence of high resolution images from a set of low resolution images (The sliding window approach)

  • f k : the lexicographically ordered image of the k-th high resolution frame, vector f

  • g k : the lexicographically ordered image of the k-th low resolution frame, vector g

In the majority of the published work, the sought-after high resolution images \( {\text{f}}_{1} , \ldots ,{\text{f}}_{L} \) are assumed to satisfy

$$ f_{l} \left( {m,n} \right) = f_{k} \left( {m + d_{l,k}^{x} \left( {m,n} \right), n + d_{l,k}^{y} \left( {m,n} \right)} \right) $$
(3.1)

where \( d_{l,k}^{x} \left( {m,n} \right) \) and \( d_{l,k}^{y} \left( {m,n} \right) \) denote respectively the horizontal and vertical components of the displacement, that is,

$$ d_{l,k} \left( {m,n} \right) = \left( {d_{l,k}^{x} \left( {m,n} \right), d_{l,k}^{y} \left( {m,n} \right)} \right) $$
(3.2)

The model of Eq. (3.1) is a reasonable one under the assumption of constant illumination conditions in the scene. It leads to the estimation of the optical flow in the scene, not necessarily to the estimation of the true motion. Note that the above model applies to both local and global motion. Also note that there may exist pixels in one image for which no motion vector exists (the occlusion problem), and pixels for which the displacement vector is not unique. Finally, note that we do not include noise in the above model, since we will incorporate it later when describing the process of obtaining the low resolution observations.

Equation (3.1) can be rewritten using matrix-vector notation as

$$ {\mathbf{f}}_{l} = {\mathbf{C}}\left( {{\mathbf{d}}_{l,k} } \right){\mathbf{f}}_{k} $$
(3.3)

where C(d l,k ) is the (PM × PN) × (PM × PN) matrix that maps frame f l to frame f k , and d l,k is the (PM × PN) × 2 matrix defined by lexicographically ordering the vertical and horizontal components of the displacements between the two frames. We will use the scalar and matrix-vector notations interchangeably throughout this manuscript.
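For the special case of a global, integer-pixel translation with circular boundary handling, C(d l,k ) reduces to a permutation matrix, which a short sketch makes concrete (the frame size and displacement are assumptions for illustration):

```python
import numpy as np

H, W = 4, 5          # toy frame size
dy, dx = 1, 2        # assumed global integer displacement
n = H * W

# Build the warp matrix C(d): row (y*W + x) picks the source pixel that
# moves to (y, x) under a circular shift by (dy, dx).
C = np.zeros((n, n))
for y in range(H):
    for x in range(W):
        src = ((y - dy) % H) * W + (x - dx) % W
        C[y * W + x, src] = 1.0

f_k = np.arange(n, dtype=float)   # lexicographically ordered frame
f_l = C @ f_k                     # Eq. (3.3): f_l = C(d) f_k

# Should agree with shifting the 2-D frame directly.
reference = np.roll(f_k.reshape(H, W), (dy, dx), axis=(0, 1)).reshape(-1)
```

Each row of C contains a single one, reflecting that for this motion model every target pixel has exactly one (unique) source; sub-pixel motion would instead spread each row over several interpolation weights.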

The motion estimation problem, as encountered in many video processing applications, consists of the estimation of d l,k or C(d l,k ) given f l and f k . What makes the problem even more challenging in super resolution is that although a high resolution motion vector field is required, the high resolution images are not available, and therefore this field must be estimated from the low resolution images. The accuracy of d l,k is of the utmost importance in determining the quality of the sought-after high resolution images.

3.2.1 The Warp-Blur Model

As the name implies, with this model the warping of an image is applied before it is blurred. This case is shown as Fig. 3.6.

Fig. 3.6

Warp–blur model relating low resolution images to high resolution images

The low resolution discrete sequence is denoted by g l (i, j), with \( i = 0, \ldots ,M - 1 \), \( j = 0, \ldots ,N - 1 \). Using matrix-vector notation, each low resolution image is denoted by the (M × N) × 1 vector g l . The low resolution image g l is related to the high resolution image f l by

$$ {\mathbf{g}}_{l} = {\mathbf{A}}_{l} {\mathbf{H}}_{l} {\mathbf{f}}_{l} + \eta_{l} $$
(3.4)

where the matrix H l of size (PM × PN) × (PM × PN) describes the filtering of the high resolution image, A l is the down sampling matrix of size MN × (PM × PN), and η l denotes the observation noise. The matrices A l and H l are generally assumed to be known.

Equation (3.4) expresses the relationship between the low resolution and high resolution frames g l and f l , while Eq. (3.3) expresses the relationship between frames l and k in the high resolution sequence. Combining these two equations we obtain the following equation which describes the acquisition of a low resolution image g l from the unknown high resolution image f k ,

$$ {\mathbf{g}}_{l} = {\mathbf{A}}_{l} {\mathbf{H}}_{l} {\mathbf{C}}\left( {{\text{d}}_{l,k} } \right){\mathbf{f}}_{k} + \eta_{l} +\upmu_{l,k} = {\mathbf{A}}_{l} {\mathbf{H}}_{l} {\mathbf{C}}\left( {{\text{d}}_{l,k} } \right){\mathbf{f}}_{k} + {\mathbf{e}}_{l,k} $$
(3.5)

where μ l,k represents the registration noise and e l,k represents the combined acquisition and registration noise. It is clear from Eq. (3.5) that C(d l,k )—the warp—is applied first on f k , followed by the application of the blur H l . This process is pictorially illustrated in Fig. 3.7.
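The warp-blur acquisition of Eq. (3.5) can be simulated on a toy image. The shift, the box point spread function, the decimation factor, and the noise level below are all assumptions for illustration, not the operators of any particular camera:

```python
import numpy as np

rng = np.random.default_rng(0)
P = 2                         # magnification / decimation factor (assumed)
f_k = rng.random((8, 8))      # stand-in high resolution frame

# C(d): warp first (global circular shift, assumed displacement).
warped = np.roll(f_k, (1, 1), axis=(0, 1))

# H: blur second, here a 3x3 box PSF applied with wrap-around boundaries.
kernel = np.ones((3, 3)) / 9.0
padded = np.pad(warped, 1, mode="wrap")
blurred = np.zeros_like(warped)
for dy in range(3):
    for dx in range(3):
        blurred += kernel[dy, dx] * padded[dy:dy + 8, dx:dx + 8]

# A: decimate by P, then add Gaussian observation noise eta.
g_l = blurred[::P, ::P] + 0.01 * rng.standard_normal((4, 4))
```

Note the order of operations matches the model's name: warp, then blur, then down-sampling; the blur-warp model of Sect. 3.2.2 swaps the first two steps.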

Fig. 3.7

Graphical depiction of the relationship between the observed low resolution images and the high resolution images

Note that the above equation shows the dependency of g l on both unknowns, the high resolution image f k and the motion vectors d l,k . This observation model was first formulated without matrix notation in [34], and later written in matrix form. Wang and Qi [68] attribute this model to [31]. The acquisition model utilized in [11] for deriving frequency domain super resolution methods can also be written using this model. If we assume that the noise e l,k in Eq. (3.5) is Gaussian with zero mean and variance \( \sigma^{2} \), denoted by \( N\left( {0, \sigma^{2} I} \right) \), the above equation produces the following conditional probability density function to be used within the Bayesian framework,

$$ {\mathbf{P}}_{G} \left( {{\mathbf{g}}_{l} |{\mathbf{f}}_{k} ,{\text{d}}_{l,k} } \right)\,{ \propto }\,{ \exp }\left[ { - \frac{1}{{2\sigma^{2} }}\left\| {{\mathbf{g}}_{l} - {\mathbf{A}}_{l} {\mathbf{H}}_{l} {\mathbf{C}}\left( {{\text{d}}_{l,k} } \right){\mathbf{f}}_{k} } \right\|^{2} } \right] $$
(3.6)

Such a noise model has been widely used.
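As a sketch, the (un-normalized) Gaussian log likelihood of Eq. (3.6) is simply a scaled negative residual energy; the toy one-dimensional forward operator below stands in for A l H l C(d l,k ) and is an assumption for illustration:

```python
import numpy as np

def forward(f, P=2):
    # Assumed toy A H C: a 3-tap moving-average blur followed by
    # P-fold decimation (no warp for simplicity).
    blurred = np.convolve(f, np.ones(3) / 3.0, mode="same")
    return blurred[::P]

sigma = 0.1
f = np.linspace(0.0, 1.0, 16)      # candidate high resolution image
g_clean = forward(f)               # observation consistent with f
g_noisy = g_clean + 0.05           # observation with a constant offset

def log_likelihood(g, f, sigma):
    # log P(g | f) up to a constant: -||g - A H C f||^2 / (2 sigma^2)
    r = g - forward(f)
    return -np.sum(r ** 2) / (2.0 * sigma ** 2)

ll_clean = log_likelihood(g_clean, f, sigma)
ll_noisy = log_likelihood(g_noisy, f, sigma)
```

A perfectly consistent observation attains the maximum (zero) log likelihood, and any residual lowers it quadratically, which is what makes the MAP estimators discussed later penalize the data misfit with an L2 term.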

A uniform noise model was proposed in [25–28]. The noise model used by these authors is oriented toward the use of the projection onto convex sets (POCS) method in super resolution problems. The associated conditional probability density function has the form

$$ {\mathbf{P}}_{G} \left( {{\mathbf{g}}_{l} |{\mathbf{f}}_{k} ,{\text{d}}_{l,k} } \right){ \propto }\left\{ {\begin{array}{*{20}c} {const} & {{\text{if}} \left| {\left[ {{\mathbf{g}}_{l} - {\mathbf{A}}_{l} {\mathbf{H}}_{l} {\mathbf{C}}\left( {{\text{d}}_{l,k} } \right){\mathbf{f}}_{k} } \right]\left( i \right)} \right| \le c, \forall i} \\ 0 & {elsewhere} \\ \end{array} } \right. $$
(3.7)

where the interpretation of the index i is that it represents the i-th element of the vector inside the brackets.

The case c = 0 can be thought of as the limit of P G (g l |f k , d l,k ) in Eq. (3.6) when σ → 0. Farsiu et al. [69, 70] have proposed the use of a generalized Gaussian Markov random field (GGMRF) [71] to model the noise in the image formation process for super resolution problems. The conditional probability density function then becomes

$$ {\mathbf{P}}_{GG} \left( {{\mathbf{g}}_{l} |{\mathbf{f}}_{k},{\text{d}}_{l,k} } \right)\,{ \propto }\,{ \exp }\left[ { -\frac{1}{{2\sigma^{p} }}\left\| {{\mathbf{g}}_{l} - {\mathbf{A}}_{l}{\mathbf{H}}_{l} {\mathbf{C}}\left( {{\text{d}}_{l,k} }\right){\mathbf{f}}_{k} } \right\|_{p}^{p} } \right] $$
(3.8)

3.2.2 The Blur-Warp Model

Another acquisition model which has been used in the literature [29, 71, 72] first considers the blurring of the high resolution image, followed by warping and down-sampling, as shown in Fig. 3.8. In this case, the observation model becomes

Fig. 3.8

Blur-warp model relating low resolution images to high resolution images

$$ {\mathbf{g}}_{l} = {\mathbf{A}}_{l} {\mathbf{M}}\left( {{\mathbf{m}}_{l,k} } \right){\mathbf{B}}_{l} {\mathbf{f}}_{k} + \eta_{l} +\upmu_{l,k} = {\mathbf{A}}_{l} {\mathbf{M}}\left( {{\mathbf{m}}_{l,k} } \right){\mathbf{B}}_{l} {\mathbf{f}}_{k} + {\mathbf{w}}_{l,k} $$
(3.9)

where w l,k denotes the acquisition and registration noise, B l the blurring matrix for the l-th high resolution image, M(m l,k ) the motion compensation operator for the blurred high resolution images through the use of motion vector m l,k , and A l again the down-sampling matrix.

Different notation is used in Eqs. (3.5) and (3.9) for the blur and warping operators in order to distinguish between the two models in the rest of the text. The three conditional probability density functions in Eqs. (3.6)–(3.8) can be rewritten for the blur-warp model by substituting A l H l C(d l,k ) with A l M(m l,k )B l (for brevity we do not reproduce them here). The question as to which of the two models (blur-warp or warp-blur) should be used is addressed in [68]. The authors claim that when the motion has to be estimated from the low resolution images, using the warp-blur model may cause systematic errors, and in this case it is more appropriate to use the blur-warp model. They also showed that when the imaging blur is spatiotemporally shift invariant and the motion has only a global translational component, the two models coincide. Note that in this case the blur and motion matrices are convolution matrices and thus commute.

Before concluding this section on image formation for uncompressed observations, we mention that for both the warp-blur and blur-warp models we have defined conditional probability density functions for each low resolution observation g l given f k and d l,k . Our goal, however, is to define the conditional probability density function P(g|f k , d), that is, the distribution when all the observations g and all the motion vectors d that compensate the corresponding high resolution frames to the k-th frame are taken into account. The approximation used in the literature for this joint conditional probability density function is

$$ {\mathbf{P}}\left( {{\mathbf{g}}\left| {{\mathbf{f}}_{k} } \right., {\mathbf{d}}} \right) = \prod\limits_{l = 1}^{L} {{\mathbf{P}}\left( {{\mathbf{g}}_{\varvec{l}} \left| {{\mathbf{f}}_{k} } \right., {\mathbf{d}}_{{\varvec{l},\varvec{k}}} } \right)} $$
(3.10)

which implies that the low resolution observations are independent given the unknown high resolution image f k and motion vectors d.

3.3 Survey of the Super Resolution Algorithms

The idea of super resolution was first introduced in 1984 by Tsai and Huang [11] for multi-frame image restoration of band-limited signals. A good overview of existing algorithms is given by [3] and [73]. Most super resolution methods are composed of two main steps: first all the images are aligned in the same coordinate system in the registration step, and then a high-resolution image is reconstructed from the irregular set of samples. In this second step, the camera point spread function is often taken into account. The scheme of super resolution is illustrated in Fig. 3.9.

Fig. 3.9

Scheme for super resolution

Precise sub-pixel image registration is a basic requirement for a good reconstruction. If the images are inaccurately registered, the high-resolution image is reconstructed from incorrect data and is not a good representation of the original signal. Zitova and Flusser [74] present an overview of image registration methods. Registration can be done either in the spatial or in the frequency domain. By the nature of the Fourier transform, frequency domain methods are limited to global motion models. In general, they also consider only planar shifts and possibly planar rotation and scale, which can be easily expressed in the Fourier domain. However, aliasing is much easier to describe and handle in the frequency domain than in the spatial domain.

3.3.1 Registration

3.3.1.1 Frequency Approach

Tsai and Huang [11] describe an algorithm that registers multiple frames simultaneously using nonlinear minimization in the frequency domain. Their method for registering multiple aliased images is based on the fact that the original, high resolution signal is band-limited. They derived a system equation that relates the low resolution images to a desired high resolution image by using the relative motion between the low resolution images. The frequency domain approach is based on three principles: (i) the shifting property of the Fourier transform, (ii) the aliasing relationship between the continuous Fourier transform (CFT) of the original high resolution image and the discrete Fourier transform (DFT) of the observed low resolution images, and (iii) the assumption that the original high resolution image is band-limited.

These properties make it possible to formulate a system equation relating the aliased discrete Fourier transform (DFT) coefficients of the observed low resolution images to samples of the continuous Fourier transform (CFT) of the unknown image. For example, assume that there are two one-dimensional low resolution signals sampled below the Nyquist rate. From the above three principles, the aliased low resolution signals can be decomposed into the un-aliased high resolution signal, as shown in Fig. 3.9.

Let f l (m, n) denote a continuous high resolution image and F l (w m , w n ) its continuous Fourier transform (CFT). Global translations, the only motion considered in the frequency domain approach, yield the k-th shifted image of Eq. (3.1). By the shifting property of the CFT, the CFT of the shifted image, F k (w m , w n ), can be written as

$$ {\mathbf{F}}_{k} \left( {{\mathbf{W}}_{m} ,{\mathbf{W}}_{n} } \right) = { \exp }\left[ {j2\pi \left( {d_{l,k}^{x} {\mathbf{W}}_{m} + d_{l,k}^{y} {\mathbf{W}}_{n} } \right)} \right]{\mathbf{F}}_{l} \left( {{\mathbf{W}}_{m} ,{\mathbf{W}}_{n} } \right) $$
(3.11)

The shifted image f k (m, n) is sampled with the sampling period T m and T n to generate the observed low resolution image g k (m, n). From the aliasing relationship and the assumption of band-limitedness of F l (w m, w n)

$$ \left| {{\mathbf{F}}_{k} \left( {{\mathbf{W}}_{m} ,{\mathbf{W}}_{n} } \right)} \right| = 0\; {\text{for }}\left| {{\mathbf{W}}_{m} } \right| \ge \left( {L_{m} \pi /T_{m} } \right), \left| {{\mathbf{W}}_{n} } \right| \ge \left( {L_{n} \pi /T_{n} } \right) $$
(3.12)

The relationship between the continuous Fourier transform (CFT) of the high resolution image and the discrete Fourier transform (DFT) of the k-th observed low resolution image can be written as [75]

$$ \gamma_{k} \left[ {\Omega _{m} ,\Omega _{n} } \right] = \frac{1}{{T_{m} T_{n} }}\mathop \sum \limits_{{m = 0}}^{{L_{m} - 1}} \mathop \sum \limits_{{n = 0}}^{{L_{n} - 1}} {\text{F}}_{k} \left( {\frac{2\pi }{{T_{m} }}\left( {\frac{{\Omega _{m} }}{M} + m} \right), \frac{2\pi }{{T_{n} }}\left( {\frac{{\Omega _{n} }}{N} + n} \right)} \right) $$
(3.13)

By using lexicographic ordering for the indices m, n on the right-hand side and k on the left-hand side, a matrix vector form is obtained as:

$$ {\mathbf{Y}} =\Phi {\mathbf{F}} $$
(3.14)

where Y is a p × 1 column vector containing the discrete Fourier transform (DFT) coefficients of the observed low resolution images y k [m, n], F is an L m L n × 1 column vector containing the samples of the unknown continuous Fourier transform of f l (m, n), and Φ is a p × L m L n matrix which relates the DFTs of the observed low resolution images to the samples of the continuous high resolution image.

Therefore, the reconstruction of a desired high resolution image requires determining Φ and solving this inverse problem. It is not clear, however, whether such a solution is unique or whether such an algorithm might converge to a local minimum. Most frequency domain registration methods are based on the fact that two shifted images differ in the frequency domain only by a phase shift, which can be found from their correlation. Using a log-polar transform of the magnitude of the frequency spectra, image rotation and scale can be converted into horizontal and vertical shifts, which can therefore also be estimated using a phase correlation method.

3.3.1.2 Phase Shift and Correlation

Reddy and Chatterji [76, 77] describe such planar motion estimation algorithms. The authors apply a high-pass emphasis filter to strengthen high frequencies in the estimation. Kim and Su [78–80] also apply a phase correlation technique to estimate planar shifts. To minimize errors due to aliasing, their methods rely on a part of the frequency spectrum that is almost free of aliasing, typically the low-frequency part of the images. It was shown in [81] that the signal power in the phase correlation corresponds to a polyphase transform of a filtered unit impulse. A rotation estimation algorithm was developed in [82] based on the property that the magnitude of the Fourier transform of an image and the mirrored magnitude of the Fourier transform of a rotated image have a pair of orthogonal zero-crossing lines. The angle that these lines make with the axes is equal to half the rotation angle between the two images. The horizontal and vertical shifts are estimated afterwards using a standard phase correlation method.
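The standard phase correlation method referred to above can be sketched in a few lines: two images differing by a circular translation have a normalized cross-power spectrum whose inverse FFT is an impulse located at the displacement. The image content and the shift below are toy assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
ref = rng.random((32, 32))
shift = (5, 9)                                  # assumed true displacement
moved = np.roll(ref, shift, axis=(0, 1))

F1 = np.fft.fft2(ref)
F2 = np.fft.fft2(moved)

# Normalized cross-power spectrum: magnitude discarded, phase kept.
cross_power = np.conj(F1) * F2
cross_power /= np.abs(cross_power) + 1e-12

# Its inverse FFT peaks at the displacement of `moved` relative to `ref`.
impulse = np.real(np.fft.ifft2(cross_power))
est = np.unravel_index(np.argmax(impulse), impulse.shape)
```

For sub-pixel accuracy, practical methods interpolate around the peak or fit it in the aliasing-free low-frequency band, as the cited works do.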

3.3.1.3 Regularization

An extension of this approach to blurred and noisy images was provided by [12], resulting in a weighted least squares formulation. In their approach, it is assumed that all low resolution images have the same blur and the same noise characteristics. This method was further refined by [13] to consider different blurs for each low resolution image. Here, the Tikhonov regularization method is adopted to overcome the ill-posed problem resulting from the blur operator. Bose et al. [14] proposed a recursive total least squares method for super resolution reconstruction to reduce the effects of registration errors (errors in Φ). A discrete cosine transform (DCT) based method was proposed by [15]. They reduce memory requirements and computational cost by using the DCT instead of the DFT. They also apply multichannel adaptive regularization parameters to overcome ill-posedness, such as underdetermined cases or cases with insufficient motion information.

Theoretical simplicity is a major advantage of the frequency domain approach: the relationship between the low resolution images and the high resolution image is clearly demonstrated in the frequency domain. The frequency domain method is also convenient for parallel implementation, which can reduce hardware complexity. However, the observation model is restricted to global translational motion and linear shift-invariant (LSI) blur. Due to the lack of data correlation in the frequency domain, it is also difficult to apply spatial domain a priori knowledge for regularization.

Generally, super resolution image reconstruction is an ill-posed problem because of an insufficient number of low resolution images and ill-conditioned blur operators. Procedures adopted to stabilize the inversion of an ill-posed problem are called regularization. In this section, we present deterministic and stochastic regularization approaches for super resolution image reconstruction; typically, constrained least squares (CLS) and maximum a posteriori (MAP) methods are used.
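A minimal Tikhonov-regularized (CLS-type) reconstruction for a toy one-dimensional problem might look as follows; the warp, blur, and decimation operators, the noise level, and the regularization weight are all assumptions for illustration:

```python
import numpy as np

rng = np.random.default_rng(2)
n_hr, P = 16, 2   # HR length and decimation factor (assumed)

def observation_matrix(shift, n_hr, P):
    # Toy W_l = A H S: circular warp, 3-tap moving-average blur, decimation.
    S = np.roll(np.eye(n_hr), shift, axis=1)
    H = sum(np.roll(np.eye(n_hr), s, axis=1) for s in (-1, 0, 1)) / 3.0
    A = np.eye(n_hr)[::P]
    return A @ H @ S

f_true = np.sin(2 * np.pi * np.arange(n_hr) / n_hr)
Ws = [observation_matrix(s, n_hr, P) for s in (0, 1)]  # sub-pixel in LR grid
gs = [W @ f_true + 1e-4 * rng.standard_normal(n_hr // P) for W in Ws]

# Solve  min_f  sum_l ||g_l - W_l f||^2 + alpha ||f||^2
# in closed form via the regularized normal equations.
alpha = 1e-3
lhs = sum(W.T @ W for W in Ws) + alpha * np.eye(n_hr)
rhs = sum(W.T @ g for W, g in zip(Ws, gs))
f_hat = np.linalg.solve(lhs, rhs)
```

The α I term stabilizes the inversion exactly as described above: without it, the stacked observation matrix can be nearly singular and the solution blows up on the weakly observed components.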

3.3.1.4 Spatial Approach

Spatial domain methods generally allow for more general motion models, such as homographies. They can be based on the whole image or on a set of selected corresponding feature vectors, as discussed in [83], for example using the RANSAC algorithm [84]. Keren et al. [85] developed an iterative planar motion estimation algorithm based on Taylor expansions. A pyramidal scheme is used to increase the precision for large motion parameters. A hierarchical framework to estimate motion in a multi-resolution data structure is described in [86]. Different motion models, such as affine flow or rigid body motion, can be used in combination with this approach. Irani et al. [87] presented a method to compute multiple, possibly transparent or occluding, motions in an image sequence. Motion is estimated using an iterative multi-resolution approach based on planar motion. Different objects are tracked using segmentation and temporal integration. Gluckman [88] described a method that first computes planar rotation from the gradient field distribution of the images to be registered. Planar shifts are then estimated after cancellation of the rotation, using a phase correlation method.

3.3.2 Reconstruction

3.3.2.1 Interpolation-Based and Frequency Domain

In the subsequent image reconstruction phase, a high resolution image is reconstructed from the irregular set of samples obtained from the different low-resolution images. This can be achieved using an interpolation-based method such as the one used in [85]. Tsai and Huang [11] describe a frequency domain method, writing the Fourier coefficients of the high-resolution image as a function of the Fourier coefficients of the registered low-resolution images. The solution is then computed from a set of linear equations. This algorithm uses the same principle as the time domain formulation given in [89].

3.3.2.2 POCS

A high-resolution image can also be reconstructed using a projection onto convex sets (POCS) algorithm [27], where the estimated reconstruction is successively projected onto different convex sets. Each set represents constraints on the reconstructed image that are based on the given measurements and assumptions about the signal. Capel and Zisserman [83] and [90] use a maximum a posteriori (MAP) statistical method to build the high-resolution image.

Other methods iteratively create a set of low-resolution images from the estimated image using the imaging model. The estimate is then updated according to the difference between the real and the simulated low-resolution images [32, 85]. This method is known as iterative back-projection. Zomet et al. [91] improved the results obtained with typical iterative back-projection algorithms by taking the median of the errors in the different back-projected images. This proved to be more robust in the presence of outliers. Farsiu et al. [70] proposed a new and robust super resolution algorithm.

Instead of the more common L2 minimization, they use the L1 norm, which produces sharper high-resolution images. They also showed that this approach performs very well in combination with the algorithm by [91]. Elad and Feuer [31] present a super resolution framework that combines a maximum-likelihood/MAP approach with a projection onto convex sets (POCS) approach to define a new convex optimization problem. Next, they show the connections between their method and different classes of other existing methods.
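The iterative back-projection idea described above can be sketched on a one-dimensional toy problem: simulate the low resolution frames from the current high resolution estimate, and back-project the residuals. The point spread function, the back-projection kernel, the step size, and the iteration count below are assumptions, not the settings of any cited work:

```python
import numpy as np

def down(f, P=2):
    # Toy imaging model: 3-tap moving-average PSF, then P-fold decimation.
    blurred = np.convolve(f, np.ones(3) / 3.0, mode="same")
    return blurred[::P]

def up(g, P=2):
    # Back-projection: zero-fill upsampling followed by the same kernel.
    f = np.zeros(len(g) * P)
    f[::P] = g
    return np.convolve(f, np.ones(3) / 3.0, mode="same") * P

f_true = np.sin(2 * np.pi * np.arange(32) / 32)
frames = [down(np.roll(f_true, s)) for s in (0, 1)]  # sub-pixel shifted LR

f_init = np.repeat(frames[0], 2)   # crude initial HR guess
f_hat = f_init.copy()
for _ in range(50):
    for s, g in zip((0, 1), frames):
        residual = g - down(np.roll(f_hat, s))   # observed minus simulated
        f_hat += 0.5 * np.roll(up(residual), -s) # back-project and unwarp
```

Replacing the averaged residual update with a median across the back-projected frames gives the robust variant of [91], and an L1 data term gives the robust formulation of [70].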

3.4 Novel Super Resolution Registration Algorithm Based on Frequency

In this chapter, we present the flowchart of the proposed algorithm and the sources for each implementation step. We describe the methodologies in detail and present their experimental results, including the obtained high resolution images, their image quality, and their computational complexity compared with other super resolution algorithms. First, the main flowchart is shown in Fig. 3.10.

Fig. 3.10
figure 10

Main flowchart of the proposed algorithm

Secondly, we obtained the low resolution video sequence by down-sampling the original video sequence by a factor of two, as shown in Fig. 3.11; the resulting resolution is 320 × 240.

Fig. 3.11
figure 11

Input low resolution video sequence generating scheme

3.4.1 Pre-processing

In the second step, we designed an automatic selection algorithm for the low resolution input images in order to reduce the registration error. The whole video sequence contains images that are unsuitable with respect to the reference image, so selecting suitable input images is very important.

The video sequence has some temporal linearity, since it is captured at 30 or 25 frames per second (fps). However, this linearity is not highly accurate. According to the extensive literature on motion estimation and motion compensation, the probability that a motion vector lies within a 1/4-pixel distance is over 90 % for practical video sequences, and the motion compensation error reaches its maximum at a 1/2-pixel distance [92–100], as shown in Fig. 3.12.

Fig. 3.12
figure 12

Distribution of the registration error depending on the sub-pixel shift

We designate the center image of the specified video sequence window as the reference input image, and analyze the registration error between each reference image and its compared low resolution input images. We limit the number of low resolution input images to a maximum of five frames, because using more input images leads to much higher computational complexity. The registration error is computed using the sum of absolute differences (SAD), since it is a simple and inexpensive measure of motion compensation quality. We set the block size for the SAD computation to 8 × 8 to keep the computational complexity low. The SAD value thus serves as the motion compensation error (MCE). If the SAD of a low resolution input image (ILRI) satisfies 0 ≤ SAD ≤ maximum motion compensation error (MMCE, i.e., the maximum SAD), we select it as an input low resolution image candidate (ILRIC). This is illustrated in Fig. 3.13.

Fig. 3.13
figure 13

The flowchart of input low resolution image candidate selection
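The candidate selection step above can be sketched as follows. This is a simplified illustration: it computes 8 × 8 block SADs without an actual motion search, and the helper names (`block_sad`, `select_candidates`) and the threshold value are our own assumptions.

```python
import numpy as np

def block_sad(ref, frame, block=8):
    """Sum of absolute differences per 8 x 8 block (no motion search,
    a simplification of the chapter's motion-compensated SAD)."""
    h, w = ref.shape
    h, w = h - h % block, w - w % block
    diff = np.abs(ref[:h, :w].astype(np.int64) - frame[:h, :w].astype(np.int64))
    return diff.reshape(h // block, block, w // block, block).sum(axis=(1, 3))

def select_candidates(ref, frames, max_mce):
    """Keep the frames whose mean block SAD stays below the maximum
    motion compensation error (MMCE) threshold."""
    return [i for i, f in enumerate(frames)
            if block_sad(ref, f).mean() <= max_mce]
```

A frame that differs too strongly from the reference exceeds the MMCE threshold and is discarded before registration.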

In the next step, we compare the number of input low resolution image candidates of each reference image. The reference image with the largest number of candidates is chosen as the optimal reference image. We also propose an advanced architecture for choosing the reference image with reduced computational complexity, as shown in Fig. 3.14. This method removes duplicate sum of absolute differences (SAD) calculations at each frame, based on the partial distortion elimination (PDE) method. The basic idea is that if the difference between the current and candidate blocks is small, the candidate has a higher probability of being the optimal reference. Therefore, the method becomes more efficient when an input image with a larger initial accumulated SAD value is selected.

Fig. 3.14
figure 14

Flowchart of the advanced reference image selection based on partial distortion elimination (PDE)
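The partial distortion elimination idea, i.e., abandoning a SAD computation as soon as its running sum exceeds the best value found so far, can be sketched as below (hypothetical function name; row-wise accumulation is our assumption).

```python
import numpy as np

def sad_with_pde(ref_block, cand_block, best_so_far):
    """Accumulate the SAD row by row and stop early once the partial
    sum already exceeds the best SAD found so far (PDE)."""
    partial = 0
    for r in range(ref_block.shape[0]):
        partial += int(np.abs(ref_block[r].astype(np.int64)
                              - cand_block[r].astype(np.int64)).sum())
        if partial >= best_so_far:   # this candidate cannot win
            return None              # eliminated without finishing
    return partial
```

Ordering the candidates so that a small best-so-far value is found early makes the elimination trigger sooner, which is the source of the speed-up.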

3.4.2 Planar Motion Estimation

Fourier based image registration methods only allow global motion in a plane parallel to the image plane. In such a case, the motion between two images can be described as a function of three parameters that are all continuous variables: horizontal and vertical shifts x 1,h and x 1,v and a planar rotation angle θ 1.

A frequency domain approach allows us to estimate the horizontal and vertical shift and the (planar) rotation separately. Assume we have a continuous two-dimensional reference signal f 0(x) and its shifted and rotated version f 1(x):

$$ f_{1} \left( x \right) = f_{0} \left( {R\left( {x + x_{1} } \right)} \right) $$
(3.15)
$$ {\text{with}} \;x = \left( {\begin{array}{*{20}c} {x_{h} } \\ {x_{v} } \\ \end{array} } \right), x_{1} = \left( {\begin{array}{*{20}c} {x_{1,h} } \\ {x_{1,v} } \\ \end{array} } \right), R = \left( {\begin{array}{*{20}c} {\cos \theta_{1} } & { - \sin \theta_{1} } \\ {\sin \theta_{1} } & {\cos \theta_{1} } \\ \end{array} } \right) $$

This can be expressed in Fourier domain as

$$ \begin{aligned} F_{1} \left( u \right) & = \iint\limits_{x} {f_{1} \left( x \right)e^{{ - j2\pi u^{T} x}} dx} \\ & = \iint\limits_{x} {f_{0} \left( {R\left( {x + x_{1} } \right)} \right)e^{{ - j2\pi u^{T} x}} dx} \\ & = e^{{ - j2\pi u^{T} x_{1} }} \iint\limits_{x} {f_{0} \left( {Rx^{\prime } } \right)e^{{ - j2\pi u^{T} x^{\prime } }} dx^{\prime } } \\ \end{aligned} $$
(3.16)

With F 1(u) the two-dimensional Fourier transform of f 1(x) and the coordinate transformation x′ = x + x 1. After another transformation x″ = R x′, the relation between the amplitudes of the Fourier transforms can be computed as

$$ \begin{aligned} \left| {F_{1} \left( u \right)} \right| & = \left| {e^{{ - j2\pi u^{T} x_{1} }} \iint\limits_{{x^{\prime}}} {f_{0} \left( {Rx^{\prime}} \right)e^{{ - j2\pi u^{T} x^{\prime}}} dx^{\prime}}} \right| \\ & = \left| {\iint\limits_{{x^{\prime}}} {f_{0} \left( {Rx^{\prime}} \right)e^{{ - j2\pi u^{T} x^{\prime}}} dx^{\prime}}} \right| \\ & = \left| {\iint\limits_{{x^{\prime\prime}}} {f_{0} \left( {x^{\prime\prime}} \right)e^{{ - j2\pi u^{T} R^{T} x^{\prime\prime}}} dx^{\prime\prime}}} \right| \\ & = \left| {\iint\limits_{{x^{\prime\prime}}} {f_{0} \left( {x^{\prime\prime}} \right)e^{{ - j2\pi \left( {Ru} \right)^{T} x^{\prime\prime}}} dx^{\prime\prime}}} \right| \\ & = \left| {F_{0} \left( {Ru} \right)} \right| \\ \end{aligned} $$
(3.17)

We can see that |F 1(u)| is a rotated version of |F 0(u)| over the same angle θ 1 as the spatial domain rotation, as shown in Fig. 3.15. |F 0(u)| and |F 1(u)| do not depend on the shift values x 1, because the spatial domain shifts only affect the phase of the Fourier transforms. Therefore we can first estimate the rotation angle θ 1 from the amplitudes of the Fourier transforms |F 0(u)| and |F 1(u)|. After compensation for the rotation, the shift x 1 can be computed from the phase difference between F 0(u) and F 1(u).

Fig. 3.15
figure 15

Rotation estimation (θ 1 = 25°) and the corresponding Fourier transforms. a Original image and its Fourier transform amplitude. b Rotated image and its Fourier transform amplitude
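That spatial shifts change only the phase, and not the amplitude, of the Fourier transform can be checked numerically. The check below uses a circular shift, for which the property holds exactly; for a real camera shift it holds only approximately because of border effects.

```python
import numpy as np

rng = np.random.default_rng(0)
img = rng.random((32, 32))
# Circularly shift the image by (5, 3) pixels.
shifted = np.roll(img, shift=(5, 3), axis=(0, 1))

amp_ref = np.abs(np.fft.fft2(img))
amp_shifted = np.abs(np.fft.fft2(shifted))
print(np.allclose(amp_ref, amp_shifted))  # prints True: amplitudes coincide
```

The phases, by contrast, differ by a linear term 2π uᵀx₁, which is exactly what the shift estimation of Sect. 3.4.4 exploits.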

3.4.3 Rotation Estimation

The rotation angle between |F 0(u)| and |F 1(u)| can be computed as the angle θ 1 for which the Fourier transform of the reference image |F 0(u)| and the rotated Fourier transform of the image to be registered |F 1(Ru)| have maximum correlation. This implies the computation of a rotation of |F 1(u)| for every evaluation of the correlation, which is computationally heavy and thus practically difficult.

If |F 0(u)| and |F 1(u)| are transformed into polar coordinates, the rotation over the angle θ 1 is reduced to a (circular) shift over θ 1. We can compute the Fourier transform of the polar spectra |F 0(u)| and |F 1(u)|, and compute θ 1 as the phase shift between the two [76, 77]. This requires a transformation of the spectrum to polar coordinates: the data from the uniform u h , u v grid need to be interpolated to obtain a uniform u r , u θ grid. Mainly for the low frequencies, which generally contain most of the energy, the interpolations are based on very few function values and thus introduce large approximation errors. An implementation of this method is also computationally intensive.

Our approach is computationally much more efficient than the two methods described above. First of all, we compute the frequency content A as a function of the angle θ by integrating over radial lines:

$$ \varvec{A}\left( \theta \right) = \int\limits_{\theta - \varDelta \theta /2}^{\theta + \varDelta \theta /2} {\int\limits_{0}^{\infty } {\left| {F\left( {u_{r} ,u_{\theta } } \right)} \right|du_{r} du_{\theta } } } $$
(3.18)

In practice, |F 0(u r , u θ )| is a discrete signal. Different methods exist to relate discrete directions to continuous directions, like for example digital lines [101]. Here, we compute the discrete function A(θ) as the average of the values on the rectangular grid that have an angle θ − ∆θ/2 < u θ < θ + ∆θ/2. As we want to compute the rotation angle with a precision of 0.1 degrees, A(θ) is computed every 0.1 degrees. To get a similar number of signal values |F 0(u r , u θ )| at every angle, the average is only evaluated on a circular disc of values for which u r < ρ (where ρ is the image radius, or half the image size). Finally, as the values for low frequencies are very large compared to the other values and are very coarsely sampled as a function of the angle, we discard the values for which u r < ερ, with ε = 0.1. Thus, A(θ) is computed as the average of the frequency values on a discrete grid with θ − ∆θ/2 < u θ < θ + ∆θ/2 and ερ < u r < ρ.

This results in a function A(θ) for both |F 0(u)| and |F 1(u)| as shown in Fig. 3.16. The exact rotation angle can then be computed as the value for which their correlation reaches a maximum. Note that only a one-dimensional correlation has to be computed, as opposed to the two-dimensional correlation approaches in [76] and [77].

Fig. 3.16
figure 16

Rotation estimation. a Average Fourier domain amplitude as a function of the angle A(θ) for the two images from Fig. 3.11. b Correlation between A 0 (θ) and A 1 (θ), with a maximum at the rotation angle θ 1  = 25°
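The angular projection A(θ) and the one-dimensional correlation can be sketched as follows. This is an illustration under our own assumptions (1° bins instead of 0.1°, a simple bin average, and no resolution of the 180° ambiguity of the amplitude spectrum), not the exact implementation of the chapter.

```python
import numpy as np

def angular_profile(img, n_bins=360, eps=0.1):
    """Average Fourier amplitude A(theta): bin the shifted spectrum by
    angle, keeping only the samples with eps*rho < u_r < rho."""
    amp = np.abs(np.fft.fftshift(np.fft.fft2(img)))
    h, w = img.shape
    yy, xx = np.mgrid[:h, :w]
    dy, dx = yy - h // 2, xx - w // 2           # offsets from the DC bin
    r = np.hypot(dy, dx)
    theta = np.degrees(np.arctan2(dy, dx)) % 360.0
    rho = min(h, w) / 2.0
    mask = (r > eps * rho) & (r < rho)
    bins = (theta[mask] * n_bins / 360.0).astype(int) % n_bins
    total = np.bincount(bins, weights=amp[mask], minlength=n_bins)
    count = np.maximum(np.bincount(bins, minlength=n_bins), 1)
    return total / count

def estimate_rotation(ref, rotated, n_bins=360):
    """Rotation angle as the circular shift maximizing the 1-D correlation
    of the two angular profiles. A 180-degree ambiguity remains, since
    |F(-u)| = |F(u)| for real images."""
    a0 = angular_profile(ref, n_bins)
    a1 = angular_profile(rotated, n_bins)
    corr = [float(np.dot(a0, np.roll(a1, s))) for s in range(n_bins)]
    return int(np.argmax(corr)) * 360.0 / n_bins
```

Only a one-dimensional correlation over the angle bins is computed, which is the source of the efficiency gain over the two-dimensional approaches of [76, 77].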

Of course, the use of such a radial projection also reduces the available information, and might introduce ambiguities in the estimation. The simulation result of our rotation estimation algorithm is shown in Fig. 3.17.

Fig. 3.17
figure 17

The simulation result of the rotation estimation. a Reference image. b Object image. c Inverse rotation estimated image of (b)

3.4.4 Shift Estimation

A shift of the image parallel to the image plane can be expressed in Fourier domain as a linear phase shift:

$$ \begin{aligned} F_{1} \left( u \right) & = \iint\limits_{x} {f_{1} \left( x \right)e^{{ - j2\pi u^{T} x}} }\;dx = \iint\limits_{x} {f_{0} \left( {x + x_{1} } \right)}\;e^{{ - j2\pi u^{T} x}} dx \\ & = e^{{j2\pi u^{T} x_{1} }} \iint\limits_{{x^{'} }} {f_{0} \left( {x^{'} } \right)e^{{ - j2\pi u^{T} x^{'} }} }\;dx^{'} = e^{{j2\pi u^{T} x_{1} }} F_{0} \left( u \right) \\ \end{aligned} $$
(3.19)

It is well known that the shift parameters x 1 can thus be computed as the slope of the phase difference ∠(F 1(u)/F 0(u)) [7679, 81, 82, 102]. To make the solution less sensitive to noise, a least squares method is widely used.

When we apply the inverse shift estimation to the object image after rotation compensation with respect to the reference image, the resulting image matches the reference image exactly (see Fig. 3.18); therefore, this shift estimation process is used in the initial registration operation.

Fig. 3.18
figure 18

The simulation result of the shift estimation. a Reference image. b Object image. c Inverse shift estimated image of (b), after rotation estimation
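A closely related frequency-domain estimator is phase correlation, which recovers an integer circular shift directly from the phase difference; the least-squares fit to the phase slope mentioned above refines this to sub-pixel accuracy. A minimal sketch:

```python
import numpy as np

def estimate_shift(ref, shifted):
    """Integer shift via phase correlation: the normalized cross-power
    spectrum is a pure linear phase whose inverse FFT peaks at the shift."""
    F0, F1 = np.fft.fft2(ref), np.fft.fft2(shifted)
    cross = F1 * np.conj(F0)
    cross /= np.maximum(np.abs(cross), 1e-12)   # keep only the phase
    corr = np.fft.ifft2(cross).real
    dy, dx = np.unravel_index(np.argmax(corr), corr.shape)
    h, w = ref.shape
    dy = int(dy) - h if dy > h // 2 else int(dy)  # wrap to signed shifts
    dx = int(dx) - w if dx > w // 2 else int(dx)
    return dy, dx
```

For a circular shift the correlation surface is an exact delta at the shift; for real images the peak is merely dominant, which is why a least-squares fit over the phase plane is more robust to noise.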

We select three candidate images for each reference image. To do this, we use the Hilbert space method; that is, we execute the initial registration process shown in Fig. 3.19.

Fig. 3.19
figure 19

The initial registration process

In Fig. 3.19, LR1 denotes the reference image and LR2 to LR4 are the chosen candidate low resolution images. These candidate images are placed on the high resolution grid by using the inverse shift estimation. For example, four sample images of 320 × 240 resolution used to generate a high resolution image are shown in Fig. 3.20.

Fig. 3.20
figure 20

Four sample images to generate a high resolution image

3.4.5 Reconstruction

We can then obtain a high resolution image, but its resolving power is not good, because the obtained image combines multiple sampling channels with unknown offsets. To reduce the offsets and the number of multichannel sampling frequencies, we apply mean value filtering; that is, every pixel value is regenerated from the five pixels of a cross-shaped neighborhood (the pixel itself and its four neighbors). The graphical diagram and the resulting image for the four sample images are shown in Fig. 3.21.

Fig. 3.21
figure 21

Graphical diagram and results image for four sample images
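The cross-shaped mean filtering can be sketched as below; border handling by edge replication is our assumption, as the chapter does not specify it.

```python
import numpy as np

def cross_mean_filter(img):
    """Replace every pixel by the mean of itself and its four cross-shaped
    neighbors (up, down, left, right), with edge replication at borders."""
    padded = np.pad(img, 1, mode='edge')
    return (padded[1:-1, 1:-1]            # center pixel
            + padded[:-2, 1:-1]           # neighbor above
            + padded[2:, 1:-1]            # neighbor below
            + padded[1:-1, :-2]           # neighbor to the left
            + padded[1:-1, 2:]) / 5.0     # neighbor to the right
```

Each output pixel is thus a five-point average, which smooths the offset differences between the interleaved sampling channels.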

Fig. 3.22
figure 22

Secondly obtained high resolution image by using the mean value filtering

The high resolution image obtained in this second step does not look clear. Therefore, we apply a de-blurring operation to further reduce the multichannel sampling artifacts, and then we apply a sharpening process, as shown in Fig. 3.23.

Fig. 3.23
figure 23

Result image after applying the de-blurring and sharpening to Fig. 3.22

We then apply mean value filtering, bi-cubic interpolation, de-blurring, and sharpening again, in the manner of an iterative back-projection (IBP) method, and obtain the image shown in Fig. 3.24.

Fig. 3.24
figure 24

Result image after applying IBP to Fig. 3.22

These processes can be expressed by the equations below. The initial registered image obtained by the rotation and shift estimation has non-uniform sampling with unknown offsets. It can be expressed as

$$ {\text{Y}}_{m} = \mathop \sum \limits_{{i_{1} ,i_{2} }} e^{{j2\pi \left( {i_{1} N_{1} t_{m,1} + i_{2} N_{2} t_{m,2} } \right)}} D_{{t_{m} }}^{'} \alpha_{{i_{1} ,i_{2} }} $$
(3.20)

3.5 Conclusion

We have described super resolution methods, focusing in particular on super resolution imaging from multichannel sampling with unknown offsets. In such algorithms, accurate registration determines the algorithm performance. We proposed an advanced registration algorithm with efficient rotation and shift estimation. The order of these two processes in our algorithm follows the warp-blur observation model: in practice, the cases in which the blurring parameters depend on camera rotations and vibrations are much more common than the reverse.

First, our algorithm selects the optimal reference image to reduce the registration error, whereas many other super resolution algorithms either ignore this registration error or assume it to be uniform. In this framework, the registration error is calculated using the sum of absolute differences (SAD) based on partial distortion elimination (PDE). This process achieves noticeable improvements compared with conventional algorithms.

Second, the proposed algorithm estimates the rotation and then the shift, in that order, because it is based on the warp-blur observation model. The blurring effects are caused by the point spread function (PSF) of the camera, which changes according to the rotation parameter of the image.

Finally, we have reconstructed a high resolution image by using the planar motion estimation. This results in a set of nonlinear equations in the unknown signal coefficients and the offsets. Using this formulation, we have shown that the solution is generally unique if the total number of sample values is larger than or equal to the total number of unknowns (signal parameters and offsets).

We present one reference image and its three candidate images for 10 sample images used to reconstruct a high resolution image with the proposed registration algorithm. These candidate images are obtained from the first step. We also show, for each sample, the bi-cubic interpolated image and the super resolution image produced by the proposed algorithm. The image quality of the proposed algorithm is much higher than that of the bi-cubic interpolation method: the average PSNR of the proposed algorithm is about 38 dB, while the other methods score lower, as shown in Table 3.1.

Table 3.1 Comparison of the different methods presented in this chapter