1 Introduction

Generally, the post-stack seismic signal can be assumed as the convolution of seismic wavelet and reflectivity. Since the wavelet is always band-limited, the seismic signal is band-limited and some important geology details are lost. Therefore, seismic deconvolution to remove the wavelet effect from seismic data is an important step in seismic data processing (e.g. Robinson 1984; Yuan and Wang 2011). However, because of the band-limitation of wavelet and noise, the seismic deconvolution is commonly an ill-posed inverse problem. Many deconvolution and reflectivity inversion methods (e.g. Taylor et al. 1979; Levy and Fullagar 1981; Debeye and van Riel 1990; Sacchi 1997; Herrmann 2005; Baziw and Ulrych 2006; Wu et al. 2007; Yuan and Wang 2013a) had been introduced to address the issue by adding a regularization or a constraint along the temporal direction and successfully applied on various seismic data sets and have the advantages that they are very fast and favor parallel computation. However, these techniques are realized trace by trace. Besides ignoring the spatial connection among traces, trace-by-trace processing often suffer from the lateral instability of the estimated reflectivity or impedance (e.g. Zhang et al. 2013; Yuan et al. 2015), probably mainly due to the influence of high-wavenumber components in model error or the inconsistency of the energy and waveforms among seismic traces.

Several multi-trace deconvolution or reflectivity inversion methods had been proposed to explore spatial dependencies among traces. For example, Lavielle (1991) proposed a multi-trace non-blind deconvolution method, which makes use of Gibbs distributions and Markov random fields to integrate a priori information to model the lateral coherency of the reflectors. For this multi-trace reflectivity inverse problem, the simulated annealing algorithm is used to locate the global minimum multi-trace reflectivity with a high degree of accuracy. Kaaresen and Taxt (1998) proposed a multi-trace blind deconvolution method to simultaneously invert for a short wavelet with known duration and multi-trace reflectivity with local continuities among neighboring traces by alternatively implementing a least-square algorithm and an iterated window maximization algorithm. Wang et al. (2006) proposed a structure-preserving multi-trace deconvolution method by using adaptive FX filtering to enhance the coherence of seismic events across midpoints. Heimer and Cohen (2008) introduced a multi-trace deconvolution method by using a Markov-Bernoulli random-field modeling combined with the Viterbi algorithm to further explore layer discontinuities. Kumar (2009) proposed a multi-trace non-spiky deconvolution method by using multiscale and multidirectional curvelet transform to exploit the continuities along reflectors for cases in which the assumption of spiky reflectivity may not hold. Zhang et al. (2013) extended the single-trace Basis pursuit reflectivity inversion (2011) to a multi-trace case by adopting a ‘Z’ shape spatial derivative as a regularization term. Gholami and Sacchi (2013) proposed a fast 3D blind seismic deconvolution method to simultaneously invert for a wavelet with sparseness in a wavelet transform domain and 3D reflectivity along 3D singularities in the time–space domain via a split Bregman iteration algorithm. These works mainly focus on developing different (fast) deconvolution or reflectivity inversion methods with a spatial regularization or a constraint about spatial connection among traces to process 2D or 3D stationary data.

In this paper, we also explore spatial (local) continuities of the reflectivity by adopting a spatial regularization. Furthermore, we focus on clarifying the physical roles of the spatial regularization. Besides preserving spatial continuities of structures, we also believe that the spatial regularization can help stabilizing reflectivity inversion, preventing trails like noodles, and recovering the reflectivity of the missing traces, as explained by the theory and verified by the synthetic and field data examples. In fact, the recovery of the reflectivity for the missing traces implicitly reveals that the spatial regularization can evidently and truly explore lateral continuity of the estimated reflectivity. Moreover, the spatial regularization has an ability to help yielding a high resolution and meanwhile high fidelity reflectivity result, or achieving a good tradeoff between reflectivity fidelity and resolution improvement, which is obviously observed in our synthetic data example.

2 Theory

According to Robinson and Treitel (1980), a single trace signal in the post-stack seismic data set is widely considered as the convolution of a seismic wavelet with a reflectivity series and can be mathematically written in a matrix–vector form as

$$ {\mathbf{S}}_{j} = {\mathbf{WR}}_{j} , $$
(1)

where column vectors S j and R j represent the j-th trace seismic signal and reflectivity series respectively, the wavelet convolution matrix \( {\mathbf{W}} = \left[ {\begin{array}{*{20}c} {w_{1} } & 0 & \cdots & 0 \\ {w_{2} } & {w_{1} } & \ddots & \vdots \\ \vdots & {w_{2} } & \ddots & 0 \\ {w_{L} } & \vdots & \ddots & {w_{1} } \\ 0 & {w_{L} } & \ddots & {w_{2} } \\ \vdots & \ddots & \ddots & \vdots \\ 0 & \cdots & 0 & {w_{L} } \\ \end{array} } \right] \) is a Toeplitz matrix, vector w = [w 1,w 2,…,w L ] denotes the seismic wavelet and L is the length of the wavelet.

Based on the assumption that the wavelet is spatiotemporal stationary, the multi-trace seismic reflection signal can be described as the following matrix–matrix system according to Eq. (1)

$$ {\mathbf{S}} = {\mathbf{WR}}, $$
(2)

where matrices S = [S 1, S 2,…, S M ] and R = [R 1, R 2,…, R M ] with M the number of traces.

Mathematically, Eq. (2) also can be rewritten in a matrix–vector system as follows

$$ {\mathbf{d}} = {\mathbf{Gm}}, $$
(3)

where vector d = vec(S), matrix G = kron(I,W), I is the identity matrix, vector m = vec(R), vec means arranging the columns of a matrix into a long concatenated vector, and kron defines a Kronecker product that reformulates matrix–matrix multiplication into matrix–vector product. Note G is a blocky diagonal matrix and exceedingly sparse with most entries zero.

Assuming that the wavelet w is known, the multi-trace reflectivity series can be simultaneously estimated by minimizing the following objective function

$$ O_{1} ({\hat{\mathbf{m}}}) = ||{\mathbf{d}}^{obs} - {\mathbf{Gm}}||_{2}^{2} , $$
(4)

where column vector \( {\hat{\mathbf{m}}} = \left[ {{\hat{\mathbf{R}}}_{1} ,{\hat{\mathbf{R}}}_{2} , \cdots ,{\hat{\mathbf{R}}}_{M} } \right]^{T} \), and \( {\hat{\mathbf{R}}}_{j} \) represents the j-th trace estimated reflectivity. From the viewpoint of inversion, d obs = vec(D obs) and Gm represent the observed multi-trace data and the calculated multi-trace data respectively, where D obs = [D obs1 D obs2 ,…, D obs M ] and D obs j represents the j-th trace observed data. Because the length of the seismic wavelet is always limited, the above objective function is a strict convex function at the absence of noise. Therefore, Eq. (4) has a unique minimum extreme, which can be obtained by taking the derivatives of \( O_{1} ({\hat{\mathbf{m}}}) \) with respect to variables m to be 0. Then a normal equation can be derived as follows

$$ {\mathbf{G}}^{T} {\mathbf{Gm}} = {\mathbf{G}}^{T} {\mathbf{d}}^{obs} , $$
(5)

where G T G = kron(I,W T W) is a non-negative definite square symmetric matrix, and superscript T represents the transpose.

If we perform singular value decomposition (SVD) for matrix W, the solution of Eq. (5) can be given as

$$ {\hat{\mathbf{m}}} = kron\left( {{\mathbf{I}},\sum\limits_{k = 1}^{K} {\frac{1}{{\delta_{k} }}{\mathbf{V}}_{k} {\mathbf{U}}_{k}^{T} } } \right){\mathbf{d}}^{obs} , $$
(6)

where δ 1 ≥ δ 2 ≥ ··· ≥ δ K  ≥ 0 are singular values of W, column vector U k (k = 1,…,K) is the left singular vector, and column vector V k (k = 1,2,…,K) is the right singular vector. Equation (6) means that \( {\hat{\mathbf{R}}}_{j} \) can be considered as the linear weighted superposition of K eigen-signals V k U T k D obs j (k = 1,2,…,K) with weights 1/δ k . Further, \( {\hat{\mathbf{R}}}_{j} \) can be regarded as the linear weighted superposition of the K right singular vectors V k (k = 1,…,K) with weights U T k D obs j /δ k . Since the amplitude of high-frequency components of seismic wavelet is always very small even close to zero mainly due to the absorption of media, matrix W has many small singular values. In general, the left singular vector U k and the right singular vector V k corresponding to small singular values commonly act as high-frequency signals, whereas those associated with large singular values own plentiful low-frequency components but very little high-frequency components. When D obs j does not include high-frequency noise (or model error) outside the frequency band of wavelet, the dependency of the left singular vector U k corresponding to small singular values on D obs j (j = 1,2,…,M) is poor. Therefore, U T k D obs j /δ k will not be too large to cause the estimated \( {\hat{\mathbf{R}}}_{j} \) strongly oscillating. In this case, the reflectivity inversion is stable. However, when D obs j includes high-frequency noise (or model error), U T k D obs j /δ k corresponding to most small singular values will be probably large, thus bringing in strong high-frequency components, further blurring the other frequency components of the estimated \( {\hat{\mathbf{R}}}_{j} \). In a word, it will give rise to the instability of the inversion.

In this paper, we impose a spatial regularization (Yuan et al. 2012) on Eq. (4) to reduce even overcome this issue. The improved objective function can be given as

$$ O_{2} ({\hat{\mathbf{m}}}) = ||{\mathbf{d}}^{obs} - {\mathbf{Gm}}||_{2}^{2} + \sum\limits_{l} {\gamma_{l} ||{\mathbf{C}}_{l} {\mathbf{m}}||_{2}^{2} } , $$
(7)

where C l is a first-order difference operator, which can be taken as the spatial difference along the interpreted horizon(s), dipping spatial direction(s) and/or horizontal spatial direction, and γ l is regularization parameter(s) determining the balance between the data residual and the smoothness degree or length of C l m. We do not take C l as a conventional frequency-wavenumber (FK) operator or a scale-position-angle (Curvelet) operator, since minimizing the energy of FK-domain or Curvelet-domain coefficients does not realize the spatial regularization role. Here, we only consider C l to be a first-order difference operator along the horizontal direction in order to conveniently clarify the roles of the spatial regularization. Therefore, Eq. (7) can be simplified as

$$ O_{3} ({\hat{\mathbf{m}}}) = ||{\mathbf{d}}^{obs} - {\mathbf{Gm}}||_{2}^{2} + \gamma ||{\mathbf{Cm}}||_{2}^{2} . $$
(8)

The function is a typical convex function, since both the data misfit term and the regularization term are convex. The optimum solution of Eq. (8) can be obtained by taking the derivatives of \( O_{3} ({\hat{\mathbf{m}}}) \) with respect to variables m being 0, hence we get

$$ {\hat{\mathbf{m}}} = \left( {{\mathbf{G}}^{T} {\mathbf{G}}\,{ + }\,\gamma {\mathbf{C}}^{T} {\mathbf{C}}} \right)^{ - 1} {\mathbf{G}}^{T} {\mathbf{d}}^{obs} \,{ = }\, \left( {{\mathbf{G}}^{T} {\mathbf{G}}\,{ +\, 2}\gamma {\mathbf{I}} + \gamma {\mathbf{A}}} \right)^{ - 1} {\mathbf{G}}^{T} {\mathbf{d}}^{obs} , $$
(9)

where C T C = 2I + A. If the term γ A is ignored, we have

$$ {\hat{\mathbf{m}}} = \left[ {\sum\limits_{k = 1}^{K} {\frac{{{\mathbf{U}}_{k}^{T} {\mathbf{D}}_{1}^{obs} }}{{\delta_{k} + 2\gamma /\delta_{k} }}} {\mathbf{V}}_{k} ,\sum\limits_{k = 1}^{K} {\frac{{{\mathbf{U}}_{k}^{T} {\mathbf{D}}_{2}^{obs} }}{{\delta_{k} + 2\gamma /\delta_{k} }}} {\mathbf{V}}_{k} , \ldots ,\sum\limits_{k = 1}^{K} {\frac{{{\mathbf{U}}_{k}^{T} {\mathbf{D}}_{N}^{obs} }}{{\delta_{k} + 2\gamma /\delta_{k} }}} {\mathbf{V}}_{k} } \right]^{T} . $$
(10)

It is obvious that \( \delta_{k} + 2\gamma /\delta_{k} \ge 2\sqrt {2\gamma } \), thus \( \frac{{{\mathbf{U}}_{k}^{T} {\mathbf{D}}_{j}^{obs} }}{{\delta_{k} + 2\gamma /\delta_{k} }} \) will not be so large even for small δ k if γ is not too small. In other words, the term 2γ/δ k plays a key role in reducing the amplification of noise. It demonstrates that the term γ C T C only derived from the regularization term includes a stabilization factor. If the term γ A is taken into account, the computation of the inverse of matrix G T G + γ C T C is usually stable so long as γ is not too small, as denoted in Fig. 1 and also demonstrated by the synthetic and field examples in the following Examples section.

Fig. 1
figure 1

Singular values of matrix G T G + γ C T C with different parameters γ. Matrix G is constructed by a 30-Hz Ricker wavelet. When γ = 0, many singular values of matrix G T G + γ C T C are very small and close to zero

If we take the derivatives of \( O_{3} ({\hat{\mathbf{m}}}) \) with respect to variable R i,j , we have

$$ \begin{aligned} \frac{{\partial O_{3} ({\hat{\mathbf{m}}})}}{{\partial {\mathbf{R}}_{i,j} }} &= {\mathbf{W}}_{i}^{T} \left( {{\mathbf{WR}}_{j} - {\mathbf{D}}_{j}^{obs} } \right) + \gamma \left( {2{\mathbf{R}}_{i,j}^{obs} - {\mathbf{R}}_{i,j - 1}^{obs} - {\mathbf{R}}_{i,j + 1}^{obs} } \right) \hfill \\ &= {\mathbf{W}}_{i}^{T} \left( {{\mathbf{WR}}_{j} - {\mathbf{D}}_{j}^{obs} } \right) + \gamma \left[ { - 1,2, - 1} \right]\left[ {{\mathbf{R}}_{i,j - 1}^{obs} ,{\mathbf{R}}_{i,j}^{obs} ,{\mathbf{R}}_{i,j + 1}^{obs} } \right]^{T} \end{aligned} , $$
(11)

where vector W i represents the i-th column of matrix W, R obs i,j−1 , R obs i,j and R obs i,j+1 represent the i-th elements at (j − 1)-th column, j-th column and (j + 1)-th column of matrix R respectively. Matrix W plays a temporal band-pass filtering role, eliminating the high- and low-frequency components of the reflectivity series R j outside the frequency band of the wavelet. In other words, the data misfit term is not sensitivity to high- and low-frequency components of reflectivity. Vector [−1, 2, −1] plays a spatial band-pass filtering role, which can mainly filter out the high-wavenumber components of the reflectivity series. Further, it can help overcoming lateral instability of the estimated reflectivity, probably mainly caused by the influence of high-wavenumber noise and/or the inconsistency of the energy and waveforms among seismic traces. Although a conventional spatial filter such as a smoothing filter can also be applied on the conventional deconvolution result to reduce the influence of the high-wavenumber noise, its operation works independently from the deconvolution. The smoothing filter can attenuate high-wavenumber noise, but meanwhile, they will deteriorate or destroy the match between the calculated data and the observed data. The spatial regularized deconvolution method, as an inversion technique, can simultaneously suppress high-wavenumber noise and keep the consistence with the data.

For the post-stack data, some seismic traces are sometimes missing, and/or some traces are probably relative poor-quality, such as low signal-to-noise ratio and arc phenomenon at the position of missing traces or faults caused by conventional migration. Generally, the robust inversion (e.g. Crase et al. 1990; Yuan et al. 2015) is a choice to deal with the under-sampled data set or low quality migration data set to obtain an acceptable inversion result. In this paper, we illustrate that the spatial regularization can help recovering the reflectivity of the missing and/or poor-quality traces by slightly rewriting Eq. (8) as

$$ O_{4} ({\hat{\mathbf{m}}}) = ||{\mathbf{P}}\left( {{\mathbf{d}}^{obs} - {\mathbf{Gm}}} \right)||_{2}^{2} + \gamma ||{\mathbf{Cm}}||_{2}^{2} , $$
(12)

where matrix P is a restriction operator discarding missing and/or poor-quality traces. Note that the row of matrix P is not larger than its column, thereby essentially playing a dimensionality-reduction role for the data misfit term. However, P leads to an underdetermined linear equation. It is obvious that the reflectivity corresponding to the discarding traces cannot be recovered if there is no spatial constraint. In fact, Eq. (8) is a special case of Eq. (12), when P = I. To solve Eq. (12) can be broken up into two parts: live traces part and discarding traces part. For live traces case, the derivatives of \( O_{4} ({\hat{\mathbf{m}}}) \) with respect to variable R i,j are in the same form as Eq. (11). For discarding traces case,

$$ \frac{{\partial O_{4} ({\hat{\mathbf{m}}})}}{{\partial {\mathbf{R}}_{i,j} }} = \gamma \left( {2{\mathbf{R}}_{i,j}^{obs} - {\mathbf{R}}_{i,j - 1}^{obs} - {\mathbf{R}}_{i,j + 1}^{obs} } \right) . $$
(13)

Let \( \frac{{\partial O_{4} ({\hat{\mathbf{m}}})}}{{\partial {\mathbf{R}}_{i,j} }} = 0 \), we get

$$ {\mathbf{R}}_{i,j}^{obs} = \left( {{\mathbf{R}}_{i,j - 1}^{obs} + {\mathbf{R}}_{i,j + 1}^{obs} } \right)/2 . $$
(14)

R obs j could be estimated by averaging its two neighboring estimated reflectivity traces. By repeatedly updating the reflectivity of the living traces and discarding traces, the frequency components of the reflectivity for the whole section or volume including discarding traces can be recovered. In essence, Eq. (14) reveals that the spatial regularization indeed can help exploring spatial continuities of the inverted model.

3 Examples

In this section, a 2D synthetic data example and a 2D field data example are used to illustrate the effectiveness of the spatial regularization in seismic deconvolution. We also compare spatial regularized deconvolution with conventional temporal deconvolution implemented by minimizing a data misfit and a L2 norm or a L1 norm of reflectivity to reveal the benefits of spatial regularized deconvolution.

We adopt different regularization schemes in the two examples, but use the same conjugate gradient (CG) algorithm (Trefethen and Bau 1997; Yuan and Wang 2013b) to iteratively solve the objective functions. Moreover, all deconvolution methods utilize all traces with the same initial model 0 and the maximum iteration number 30 to simultaneously invert for the whole reflectivity section.

3.1 Synthetic data example

A noisy synthetic data (Fig. 2a) is generated by adding 20 % random noise (the ratio of noise energy to signal energy is 20 %) to the convolution of a 30-Hz Ricker wavelet with the reflectivity derived from a synthetic BG impedance model. The data set contains 700 traces with a sample interval of 2 ms. We use a 0-0-65-80 Hz band-pass filtered result of the original BG reflectivity as a reference (Fig. 2b), in order to concentrate on discussing the roles of the spatial regularization in seismic deconvolution. Compared with the reference reflectivity, the resolution of the original data (Fig. 2a) is lower and the stratigraphic contact relationship is harder to discontinuity than the reference reflectivity due to the wavelet interference and noise, denoted by the rectangles and ellipses in Fig. 2a and b.

Fig. 2
figure 2

The comparisons among the synthetic noisy data (a), the reference reflectivity (b) and the estimated reflectivity obtained by using different deconvolution methods: the spatial regularized deconvolution with the optimum regularization parameter 30 (c), the conventional deconvolution by using a L2 norm of model parameters as a regularization with the optimum regularization parameter 1.5 (d), the conventional deconvolution by using a L1 norm of model parameters as a regularization with the optimum regularization parameter 0.1 (e), and the unregularized deconvolution (f)

We quantitatively define a relative error \( E = ||{\hat{\mathbf{m}}} - {\mathbf{m}}_{ref} ||_{2}^{2} /||{\mathbf{m}}_{ref} ||_{2}^{2} \), where column vector m ref represents the reference reflectivity, as an evaluation criterion to choose the optimum regularization parameter and appraise different deconvolution methods. The black curve with circles in Fig. 3 is the relative error E versus regularization parameter γ (called E-γ curve for short) of spatial regularized deconvolution. When γ = 30, the optimum deconvolution result can be obtained, as Fig. 2c shows. The blue curve with triangles in Fig. 3 is E-γ curve for using temporal regularized deconvolution implemented by minimizing a L2 norm of model parameters. It is obvious that when γ = 1.5, the optimum deconvolution result, as shown in Fig. 2d, is obtained. The red curve with asterisks in Fig. 3 is E-γ curve for using temporal regularized deconvolution implemented by minimizing a L1 norm of model parameters. When γ = 0.1, the optimum deconvolution result, as shown in Fig. 2e, is obtained. The two optimum regularization parameters for using temporal regularized deconvolution are also consistent to these obtained by using the classical L-curve method (Hansen 1992). For the sake of completeness, we also show the unregularized deconvolution result by setting γ = 0, shown in Fig. 2f. Figure 4 is the amplitude spectra of the original data (Fig. 2a), the reference (Fig. 2b) and deconvolution results obtained by using different methods (Fig. 2c–f). As both Fig. 2 in the time–space domain and Fig. 4 in the frequency-space domain show, the spatial regularization can help to stably invert for reflectivity (Fig. 2c, f), just like the regularization along the time direction (Fig. 2d, e). In addition, the spatial regularized deconvolution effectively reduces the influence of wavelet interference as denoted by the rectangles and ellipses in Fig. 2a–c, and gives rise to a high resolution and meanwhile high signal-to-noise ratio (SNR) result, which matches best with the reference. As well, the spatial regularized deconvolution avoids lateral instability of the inverted reflectivity, thereby preventing noisy trails like noodles always caused by the conventional temporal regularized deconvolution (Figs. 2d–e, 4d–e). It can be observed from Fig. 3 that the minimum relative errors of the two E-γ curves for using temporal regularized deconvolution are larger than that for using the spatial regularization. Figure 5 is the normalization wavenumber spectra of Fig. 2a–f, respectively. As Fig. 5 shows, the wavenumber components of spatial regularized deconvolution result is closest to those of the reference, whereas the wavenumber components of temporal regularized deconvolution result are still close to those of the original data.

Fig. 3
figure 3

The relative error E curves versus regularization parameter γ for using three different regularization methods: the spatial regularized deconvolution marked “Spatial” in the legend, the temporal regularized deconvolution adopting a L2 norm of model parameters as the regularization marked “Temporal L2” in the legend and the temporal regularized deconvolution adopting a L1 norm of model parameters as the regularization marked “Temporal L1” in the legend. The corresponding minimum relative errors of these three curves are 0.0693, 0.2804 and 0.3696, respectively

Fig. 4
figure 4

The amplitude spectra corresponding to Fig. 2a–f

Fig. 5
figure 5

The normalization wavenumber spectra corresponding to Fig. 2a–f. Every wavenumber spectrum curve is obtained by summing wavenumber spectra of all time slices and then normalizing

In order to further illustrate the ability of the spatial regularization for evidently exploring the spatial continuities among traces, we randomly set 175 traces to be null, and a new synthetic section with 25 % discarding traces (Fig. 6a) is generated. By searching the smallest E value in the E-γ curve, we get the optimum deconvolution result, shown in Fig. 6b, when γ = 20. It can be observed form Fig. 6b that the spatial regularized deconvolution result for the data set is comparable to that for the full-sampled data set (Fig. 2c), even for the discarding traces. However, the conventional temporal regularized deconvolution methods cannot recover the reflectivity series corresponding to the discarding traces (not shown).

Fig. 6
figure 6

The synthetic noisy data with 175 discarding traces (a) and the spatial regularized deconvolution result with the optimum regularization parameter 20 (b)

3.2 Field data example

A field post-stack seismic data is exploited to test the effectiveness of the spatial regularization for dealing with the real data. The dataset includes 608 traces with a sampling interval 1 ms, as shown in Fig. 7a. The deconvolution without regularization (Fig. 7e) yields a blurry result mainly aroused by the high-frequency model error above ~80 Hz. When the smoothness or sparseness regularization of model parameters along the temporal direction is adopted, deconvolution result with the optimum regularization parameter obtained by using the classical L-curve method becomes stable, but includes some noisy trails like noodles and suffers from weak lateral instability (Fig. 7c, d). As expected, the seismic deconvolution via the spatial regularization addresses these issues and yields a high resolution result with more clear details, as pointed out by the ellipses in Fig. 7b. These details improved by reducing wavelet interference also appear in temporal regularized deconvolution results (Fig. 7c, d). It is noticeable that the temporal regularized deconvolution methods do not introduce any constraints about spatial information of signals. The spatial regularized deconvolution does not visibly destroy the feature of structures thanks to the good trade-off between the data misfit term and the regularization term.

Fig. 7
figure 7

The comparisons among the field seismic data (a) and the estimated reflectivity obtained by using different deconvolution methods: the spatial regularized deconvolution with the regularization parameter 10 (b), the conventional deconvolution by using a L2 norm of model parameters as a regularization with the optimum regularization parameter 0.5 (c), the conventional deconvolution by using a L1 norm of model parameters as a regularization with the optimum regularization parameter 0.1 (d), and the unregularized deconvolution (e). The regularization parameters for Fig. 6c and d are chosen by the classical L-curve method, whereas the regularization parameter for Fig. 6b is chosen by trial and error

In order to clearly illustrate the continuity-preserving role of the spatial regularization, we also randomly set 152 traces to be null and generate a new section (Fig. 8a). Therefore, only 456 traces are utilized to implement 608 traces deconvolution. The result is shown in Fig. 8b. By comparing Fig. 8b with Fig. 7b, we find that the spatial regularized deconvolution result considering some discarded traces out of the inversion is close to that for processing the whole data. The recovery of the reflectivity for the discarded traces demonstrates that the spatial regularization indeed help exploiting the spatial (local) continuities among traces.

Fig. 8
figure 8

The field data with 152 discarding traces (a) and the spatial regularized deconvolution result (b) with the optimum regularization parameter 5 which is chosen by trial and error

4 Conclusions

The spatial regularization used in seismic deconvolution plays important roles in preventing the lateral instability of the estimated reflectivity, exploring spatial continuities among traces, and stabilizing reflectivity inversion. Compared with the conventional deconvolution adopting temporal regularization implemented by a L2 norm or a L1 norm of model parameters, spatial regularized deconvolution yields the higher resolution and meanwhile high SNR deconvolution result, mainly attributed to the optimal trade-off between the multi-trace data misfit term and the spatial regularization term. Different from the temporal regularization via a L1 norm, the spatial regularization has a limitation that it cannot help recovering information outside frequency band of seismic signal. Although we only show 2D data examples in this paper, our objective function Eq. (12) can be directly adopted to simultaneously process 3D seismic data.