1 Introduction

Stochastic processes on spheres provide a valuable mathematical framework to capture the spatial uncertainty of geophysical processes located over large portions of the Earth (Marinucci and Peccati 2011). Global data are typically characterized by dissimilar behaviors in different parts of the world, which can be attributed to diverse factors, including wind directions and teleconnections. As a result, the search for sophisticated models for globally dependent data has attracted growing interest from statisticians in recent decades. We refer the reader to Jeong et al. (2017) and Porcu et al. (2018) for thorough reviews about this topic.

The assumption of isotropy, commonly used in spatial data analysis, implies that the statistical properties of the process do not vary for different points on the surface of a sphere. The literature on isotropic processes is substantial. For instance, the design of parametric families of covariance functions has been addressed by Gneiting (2013), Guinness and Fuentes (2016), Peron et al. (2018) and Alegria et al. (2018). Lang and Schwab (2015), Hansen et al. (2015) and Clarke et al. (2018) discussed the regularity properties of Gaussian processes on spheres and hyperspheres. Computationally efficient simulation algorithms have been proposed by Creasey and Lang (2018), Cuevas et al. (2020), Lantuéjoul et al. (2019), Emery et al. (2019a), Emery and Porcu (2019) and Alegría et al. (2020). Although isotropy considerably simplifies the modeling of processes on spheres, it is generally a questionable assumption, as it does not allow for spatially varying dependencies (Stein 2007).

In recent years, there has been renewed interest in a more flexible class of processes, referred to as axially symmetric processes (Jones 1963), for which the spatial dependency is stationary with respect to longitude but may substantially change with shifts in latitude. While Hitczenko and Stein (2012), Huang et al. (2012) and Bissiri et al. (2020) studied several theoretical aspects of axially symmetric processes, Vanlengenberg et al. (2019) and Emery et al. (2019b) paid particular attention to the formulation of fast and efficient simulation algorithms. Other authors, including Stein (2007), Jun and Stein (2008), Castruccio and Genton (2014), Castruccio (2016), and Porcu et al. (2019), have illustrated the relevance of axially symmetric processes in environmental and climatological applications.

The Karhunen–Loève expansion in terms of spherical harmonic functions is a convenient mathematical framework to analyze Gaussian processes on spheres (Jones 1963; Marinucci and Peccati 2011). In the axially symmetric scenario, Karhunen–Loève expansions have received little attention, with the works of Stein (2007), where axially symmetric models are fitted to total column ozone data, and Hitczenko and Stein (2012), who concentrated on models based on differential operators, being notable exceptions. This paper is devoted to the study of axially symmetric Gaussian processes through their Karhunen–Loève expansions. The main contributions of this work are listed below:

  1. 1.

    We investigate a parametric family of Karhunen–Loève coefficients that allows for flexible second-order dependency structures. Our proposal permits us to gradually go from processes that are constant along the parallels of latitude (longitudinal independence) to processes whose finite-dimensional distributions are invariant under spatial rotations (isotropy).

  2. 2.

    We propose a general and simple strategy to render any longitudinally reversible model irreversible, i.e.; our approach allows covariance functions to be built with certain types of asymmetries along longitudes.

  3. 3.

    We focus on the approximation of axially symmetric processes through finitely truncated Karhunen–Loève expansions. We provide a theoretical bound for the \(L^2\)-error associated with this approximation. Such an approximation suggests a natural simulation method, which is examined through numerical experiments.

The article is organized as follows. Section 2 provides background material on axially symmetric processes and their Karhunen–Loève expansions. Section 3 contains the main results of this work. Specifically, we propose a parametric family of Karhunen–Loève coefficients that connects the isotropic case and the case of longitudinal independence. We present a strategy for building longitudinally irreversible processes. The bounds for the \(L^2\)-error of truncated Karhunen–Loève expansions are derived as well. In Sect. 4, our findings are illustrated through numerical experiments. Section 5 concludes the paper with a discussion.

2 Preliminaries

2.1 Spherical harmonic functions

The aim of this section is to introduce preliminary material about spherical harmonic functions. We denote the latitude and longitude coordinates of a spatial point on \({{\mathbb {S}}}^2 = \{{\textit{\textbf{x}}}\in {{\mathbb {R}}}^3: \Vert {\textit{\textbf{x}}}\Vert = 1\}\) by \(L \in [0,\pi ]\) and \(\ell \in [0,2\pi )\), respectively. For two locations on \({{\mathbb {S}}}^2\), with coordinates \((L_1,\ell _1)\) and \((L_2,\ell _2)\), the great circle distance between them is given by

$$\begin{aligned} {\text {d}}_{{\text {GC}}}\left( L_1,L_2,\Delta \ell \right) = 2 \arcsin \left\{ \left[ \sin ^2\left( \frac{L_1-L_2}{2}\right) + \sin L_1 \sin L_2 \sin ^2\left( \frac{\Delta \ell }{2} \right) \right] ^{1/2}\right\} , \end{aligned}$$

where \(\Delta \ell = \ell _1-\ell _2\). This metric represents the length of the shortest arc joining two spherical locations, so it is always true that \({\text {d}}_{{\text {GC}}}\left( L_1,L_2,\Delta \ell \right) \in [0,\pi ]\).

Spherical harmonic functions, denoted by \({{\mathcal {Y}}}_{nm}(L,\ell )\), for \(n\in {{\mathbb {N}}}_0\) and \(m\in \{-\,n,\ldots ,n\}\), form an orthogonal basis of the Hilbert space of complex-valued square integrable functions on \({{\mathbb {S}}}^2\). When \(n\in {{\mathbb {N}}}_0\) and \(m\in \{0,\ldots ,n\}\), we have

$$\begin{aligned} {{\mathcal {Y}}}_{nm} (L,\ell ) = \sqrt{ \frac{2n+1}{4\pi } \frac{(n-m)!}{(n+m)!} } P_{nm}(\cos L) \exp (\imath m \ell ), \end{aligned}$$

for \((L,\ell )\in [0,\pi ]\times [0,2\pi )\), where \(P_{nm}\) is the associated Legendre polynomial (Abramowitz and Stegun 1964) and \(\imath \in {{\mathbb {C}}}\) is the complex unit. For \(n\in {{\mathbb {N}}}\) and \(m\in \{-\,n,\ldots ,-1\}\), spherical harmonic functions are instead given by

$$\begin{aligned} {{\mathcal {Y}}}_{nm} (L,\ell ) = (-\,1)^m \overline{{{\mathcal {Y}}}_{n-m}} (L,\ell ), \end{aligned}$$

where \({\overline{c}}\) denotes the complex conjugate of c. The addition theorem is a mathematical identity of great importance when dealing with spherical harmonic functions (see, e.g., Marinucci and Peccati 2011), which states that

$$\begin{aligned} P_n\left( \cos {\text {d}}_{{\text {GC}}}\left( L_1,L_2,\Delta \ell \right) \right) = \frac{4\pi }{2n+1} \sum _{m=-n}^n {{\mathcal {Y}}}_{nm}(L_1,\ell _1) \overline{{{\mathcal {Y}}}_{nm}}(L_2,\ell _2), \end{aligned}$$

or equivalently,

$$\begin{aligned} P_n\left( \cos \text{d}_{{\text {GC}}}\left( L_1,L_2,\Delta \ell \right) \right) = P_n(\cos L_1) P_n(\cos L_2) + 2 \sum _{m=1}^n \frac{(n-m)!}{(n+m)!} \cos (m \Delta \ell ) P_{nm}(\cos L_1) P_{nm}(\cos L_2), \end{aligned}$$
(2.1)

with \(P_n = P_{n0}\) standing for the Legendre polynomial of degree n (Abramowitz and Stegun 1964). A more comprehensive discussion on spherical harmonic functions and Fourier analysis on \({{\mathbb {S}}}^2\) can be found in Marinucci and Peccati (2011).

2.2 Axially symmetric processes

A zero-mean real-valued Gaussian process, \(Z(L,\ell )\), which is indexed by latitude \(L\in [0,\pi ]\) and longitude \(\ell \in [0,2\pi )\), with finite second-order moments, defined on a probability space \((\Omega ,{{\mathcal {F}}},P)\), is referred to as an axially symmetric process (Jones 1963; Stein 2007) if its covariance function can be written as

$$\begin{aligned} {\text {cov}}\{Z(L_1,\ell _1), Z(L_2,\ell _2)\} = C(L_1,L_2,\Delta \ell ), \end{aligned}$$
(2.2)

for some function \(C: [0,\pi ]^2\times [-\,2\pi ,2\pi ] \rightarrow {{\mathbb {R}}}\). The covariance function of an axially symmetric process is stationary with respect to longitude and may have heterogeneous behaviours along latitudes. Two important particular cases are discussed.

Isotropy:

When the covariance function of the process is a function of locations \((L_1,\ell _1)\) and \((L_2,\ell _2)\) only through their great-circle distance, \({\text {d}}_{{\text {GC}}}\left( L_1,L_2,\Delta \ell \right)\), the process is called isotropic. It is clear that (2.2) includes an isotropic structure as a special case. The finite-dimensional distributions of isotropic Gaussian processes are invariant under the group of rotations on \({{\mathbb {S}}}^2\) (Marinucci and Peccati 2011).

Longitudinal independence:

Another limit scenario of axial symmetry is the longitudinal independence presented by Emery et al. (2019b), which means that \(C(L_1,L_2,\Delta \ell )\) in (2.2) does not depend on \(\Delta \ell\). Emery et al. (2019b) showed that longitudinally independent processes are constant along the parallels of latitude and argued that they can be useful in structural geology and geotechnics.

The presence or absence of longitudinal symmetry in the covariance function (2.2) provides a classification for axially symmetric processes. Following Stein (2007), an axially symmetric process is called longitudinally reversible if

$$\begin{aligned} C(L_1,L_2,\Delta \ell ) = C(L_1,L_2,-\Delta \ell ), \end{aligned}$$

for every \((L_1,L_2,\Delta \ell )\in [0,\pi ]^2 \times [-\,2\pi ,2\pi ]\). However, in general, C need not be symmetric in \(\Delta \ell\), in which case the process is said to be longitudinally irreversible.

2.3 Karhunen–Loève expansions of axially symmetric processes

An approach developed by Jones (1963) states that axially symmetric Gaussian processes on \({{\mathbb {S}}}^2\) admit Karhunen–Loève expansions in terms of spherical harmonic functions. Consider the expansion

$$\begin{aligned} Z(L,\ell ) = \sum _{n=0}^{\infty } \sum _{m=-n }^n c_{nm} {{\mathcal {Y}}}_{nm}(L,\ell ), \end{aligned}$$
(2.3)

where \(c_{nm}\) are zero-mean Gaussian random variables. The convergence in (2.3) holds in the \(L^2\) sense (the same comment applies for similar series throughout the manuscript). To obtain a real-valued process, the condition \(c_{nm} = (-\,1)^m \overline{c_{n-m}}\) must be imposed, where in particular we have that \(c_{n0}\) is a real-valued random variable. Under this symmetry condition and the explicit expressions for spherical harmonic functions, it is evident (2.3) reduces to

$$\begin{aligned} Z(L,\ell )& = \sum _{n=0}^{\infty } a_{n0} {\widetilde{P}}_{n0}(\cos L) \\ &\quad + 2 \sum _{n=1}^\infty \sum _{m=1}^n \left\{ a_{nm} \cos (m\ell )+ b_{nm} \sin (m\ell ) \right\} {\widetilde{P}}_{nm}(\cos L), \end{aligned}$$
(2.4)

where \(a_{nm}\) and \(b_{nm}\) are zero-mean Gaussian random variables, representing real and imaginary parts of \(c_{nm}\), respectively, and

$$\begin{aligned} {\widetilde{P}}_{nm} = \sqrt{ \frac{2n+1}{4\pi } \frac{(n-m)!}{(n+m)!} }\, P_{nm}. \end{aligned}$$

To ensure that \(Z(L,\ell )\) is axially symmetric, the (bi) sequences of coefficients, \(\{a_{nm}\}\) and \(\{b_{nm}\}\), must be uncorrelated in the index m. Specifically, Jones (1963) considered the following conditions:

  1. (C1)

    \({\text {cov}}\{ a_{n0}, a_{n'0} \} = {\text {cov}}\{ b_{n0}, b_{n'0} \} = f_{0}(n,n')\), for all \(n,n'\ge 0\).

  2. (C2)

    \({\text {cov}}\{ a_{nm}, a_{n'm'} \} = {\text {cov}}\{ b_{nm}, b_{n'm'} \} = \delta _{m}^{m'} f_{m}(n,n')/2\), for all \(n,n'\ge m\), with \(m>0\).

  3. (C3)

    \({\text {cov}}\{ a_{nm}, b_{n'm'} \} =-\, {\text {cov}}\{b_{nm},a_{n'm'}\} = \delta _{m}^{m'} g_{m}(n,n')/2\), for all \(n,n'\ge m\), with \(m>0\).

Here, \(\delta _{m}^{m'}\) denotes the Kronecker delta, \(f_m(n,n')\) captures the covariance function of each individual sequence, and \(g_m(n,n')\) characterizes the cross-covariance function between these sequences. Following Jones (1963), the cross-covariance function satisfies the identity \(g_m(n,n') = - \,g_m(n',n)\), and so \(g_m(n,n) = 0\). Defining

$$\begin{aligned} {\varvec{\upsilon }}_m =(a_{n_1m}, a_{n_2m},\ldots , b_{n_1m}, b_{n_2m},\ldots )^\top , \end{aligned}$$

for integers \(n_1,n_2,\ldots\) being greater than or equal to m, we observe that the covariance matrix of \({\varvec{\upsilon }}_m\) can be written as

$$\begin{aligned} {\varvec{\Gamma }}_m= \begin{pmatrix} {\textit{\textbf{F}}}_m &{} \quad {\textit{\textbf{G}}}_m\\ {\textit{\textbf{G}}}_m^\top &{}\quad {\textit{\textbf{F}}}_m, \end{pmatrix}, \end{aligned}$$
(2.5)

where \({\textit{\textbf{F}}}_m\) and \({\textit{\textbf{G}}}_m\) are matrices with entries \(f_m(n_i,n_j)\) and \(g_m(n_i,n_j)\), respectively. Hence, \({\textit{\textbf{F}}}_m\) is a positive semidefinite matrix, whereas \({\textit{\textbf{G}}}_m\) is an antisymmetric matrix, in such a way that the block matrix (2.5) is always positive semidefinite. The covariance function of \(Z(L,\ell )\), taking into account conditions (C1)–(C3), is given by

$$\begin{aligned} C(L_1,L_2,\Delta \ell )&= \sum _{n,n'=0}^\infty f_0(n,n') {\widetilde{P}}_{n0}(\cos L_1) {\widetilde{P}}_{n'0}(\cos L_2) \\&\quad + 2 \sum _{m=1}^\infty \sum _{n,n'=m}^\infty \{ f_m(n,n') \cos (m\Delta \ell )\\&\quad + g_m(n,n') \sin (m\Delta \ell ) \} {\widetilde{P}}_{nm}(\cos L_1) {\widetilde{P}}_{n'm}(\cos L_2), \end{aligned}$$
(2.6)

where the summability condition

  1. (C4)

    \(\sum _{n,n'=0}^\infty f_0(n,n') {\widetilde{P}}_{n0}(\cos L) {\widetilde{P}}_{n'0}(\cos L) + 2 \sum _{m=1}^\infty \sum _{n,n'=m}^\infty f_m(n,n') {\widetilde{P}}_{nm}(\cos L) {\widetilde{P}}_{n'm}(\cos L) < \infty ,\)

for all \(L\in [0,\pi ]\), ensures that \(Z(L,\ell )\) is a well-defined second-order process. Note that a zero-mean axially symmetric Gaussian process is completely characterized by \(f_m(n,n')\) and \(g_m(n,n')\).

Before concluding this section, the following comments are provided.

  • When \(f_{m}(n,n') = \delta _n^{n'} \xi _n\), for some sequence \(\{\xi _n: n\in {{\mathbb {N}}}_0\}\) of nonnegative real numbers, and \(g_m(n,n')\) is identically equal to zero, we obtain an isotropic process. In this special case, the addition theorem for spherical harmonic functions implies that

    $$\begin{aligned} C(L_1,L_2,\Delta \ell ) = \sum _{n=0}^\infty { \frac{\xi _n(2n+1)}{4\pi } } P_n\left( \cos {\text {d}}_{{\text {GC}}}\left( L_1,L_2,\Delta \ell \right) \right) , \end{aligned}$$
    (2.7)

    where the condition for finite variance is

    $$\begin{aligned} {\sum _{n=0}^\infty \xi _n(2n+1) < \infty .} \end{aligned}$$
    (2.8)

    Condition (2.8) together with the inequality \(|P_n(t)| \le 1\), for \(|t| \le 1\), imply the uniform convergence of (2.7) and ensure its continuity (this is a consequence of Weierstrass M-test). Equation (2.7) is the characterization of any continuous covariance function associated with an isotropic process on \({{\mathbb {S}}}^2\) (Schoenberg 1942).

  • Suppose now that the coefficients in (2.6) vanish for \(m>0\); then, the covariance function has a longitudinally independent structure because under this choice, we eliminate the terms depending on \(\Delta \ell\).

  • When \(g_m(n,n')\) is different from zero, we obtain a longitudinally irreversible process. Thus, the function \(g_m(n,n')\) is responsible for the asymmetry of the covariance function along longitudes.

3 Main results

3.1 A bridge between isotropy and longitudinal independence

In this section, inspired by the work of Emery et al. (2019b), we describe how to obtain a unified representation of the limit cases, isotropy and longitudinal independence, by means of a parametric family of coefficients \(f_m(n,n')\). The antisymmetric part \(g_m(n,n')\) will be analyzed in the next subsection, so for the moment we assume that it is identically equal to zero.

Consider the covariance function (2.6) with coefficients of the form

$$\begin{aligned} f_{m}(n,n') = \sqrt{\xi _n \xi _{n'} } \rho (n-n') \lambda _m, \end{aligned}$$
(3.1)

where \(\{\xi _n: n\in {{\mathbb {N}}}_0\}\) is a nonnegative sequence, \(\rho\) is a stationary correlation function, and \(\{\lambda _m: m\in {{\mathbb {N}}}_0\}\) is a nonnegative and bounded sequence. For every fixed \(m\in {{\mathbb {N}}}_0\), (3.1) is clearly a positive semidefinite function.

We restrict attention to the following limit cases:

  • Suppose first that \(\lambda _m = 1\), for all \(m\in {{\mathbb {N}}}_0\). Thus, as the range of \(\rho\) decreases to zero, \(f_{m}(n,n')\) goes to \(\delta _n^{n'} \xi _n\), converging to the isotropic case.

  • Longitudinal independence arises when \(\lambda _0 > 0\) and \(\lambda _m = 0\) for every \(m\ge 1\), regardless of the choice of \(\rho\).

There exist various ways to construct a sequence \(\{\lambda _m: m\in {{\mathbb {N}}}_0\}\) that unifies both cases. For instance, the sequence could be taken as \(\lambda _m = 1_{[0,\alpha ]}(m)\), where \(\alpha \in {{\mathbb {N}}}_0\) is a parameter and \(1_A\) denotes the indicator function of A. While \(\alpha =0\) represents longitudinal independence, \(\alpha \rightarrow \infty\) corresponds to isotropy (provided that \(\rho\) is a Kronecker delta). Another interesting alternative is \(\lambda _m = (1 + \gamma m^2)^{-1}\), where \(\gamma \ge 0\) is a continuous parameter. Here, \(\gamma = 0\) and \(\gamma \rightarrow \infty\) correspond to isotropy and longitudinal independence, respectively. Any isotropic model introduced in the literature can be obtained as a special case of (3.1). In this work, we prefer the sparse structure \(\lambda _m = 1_{[0,\alpha ]}(m)\) because it provides computational advantages. Series expansions with a large proportion of zeros have also been applied by Stein (2007) in the study of total column ozone.

To ensure that (3.1) yields a well-defined second-order Gaussian process, condition (C4) must be verified. The following proposition shows that under adequate assumptions of the asymptotic decay of \(\{\xi _n: n\in {{\mathbb {N}}}_0\}\), condition (C4) holds.

Proposition 3.1

Consider \(f_m(n,n')\) as in (3.1), and suppose that, for \(n > n_0\),

$$\begin{aligned} \xi _n \le r n^{-\beta }, \end{aligned}$$

for some constants \(r>0\) and \(n_0\in {{\mathbb {N}}}\). Then, condition (C4) holds if either

  1. (i)

    \(\beta >4\); or

  2. (ii)

    \(\beta >2\) and \(\rho (h) = \delta _0^h\).

Proof

Using the inequality (Siegel 1955)

$$\begin{aligned} \max _{x \in [-\,1,1]} |{\widetilde{P}}_{nm}(x)| \le \sqrt{2n+1}, \end{aligned}$$

we observe that a sufficient condition to obtain a process with finite variance is

$$\begin{aligned} \displaystyle \sum _{n,n'=0}^\infty f_0(n,n') \sqrt{(2n+1)(2n'+1)} + 2 \sum _{m=1}^\infty \sum _{n,n'=m}^\infty f_m(n,n') \sqrt{(2n+1)(2n'+1)} < \infty . \end{aligned}$$

We only analyze the terms with \(m>n_0\) (the other terms, for fixed \(m\le n_0\), can be studied in a similar manner). Since \(\rho (h) \le 1\), for all h, and \(\{\lambda _m: m\in {{\mathbb {N}}}_0\}\) is a nonnegative and bounded sequence, we have

$$\begin{aligned} \displaystyle \sum _{m>n_0}^\infty \sum _{n,n'=m}^\infty f_m(n,n') \sqrt{(2n+1)(2n'+1)}&\le \sum _{m>n_0}^\infty \lambda _m \sum _{n,n'=m}^\infty \sqrt{\xi _n \xi _{n'} {(2n+1)(2n'+1)} }\nonumber \\&\le \left( \sup _m \lambda _m\right) \sum _{m>n_0}^\infty \left( \sum _{n=m}^\infty \sqrt{ \xi _n(2n+1)} \right) ^2\nonumber \\&\le \left( \sup _m \lambda _m\right) 3r \sum _{m>n_0}^\infty \left( \sum _{n=m}^\infty n^{-(\beta -1)/2} \right) ^2. \end{aligned}$$
(3.2)

Employing an integral bound, one has

$$\begin{aligned} \sum _{n=m}^\infty n^{-(\beta - 1)/2} \le m^{-(\beta - 1)/2} + \int _{m}^\infty x^{-(\beta - 1)/2} {\text {d}}x = m^{-(\beta - 1)/2} + \frac{2m^{-(\beta - 3)/2}}{\beta - 3}, \end{aligned}$$

for all \(\beta >3\). Thus, we have that

$$\begin{aligned} m \mapsto \left( \sum _{n=m}^\infty n^{-(\beta - 1)/2} \right) ^2 \end{aligned}$$

decays algebraically with order \(\beta - 3\). We conclude that (3.2) is finite provided that \(\beta > 4\). The first part of the proof is completed.

Suppose now that \(\rho (h) = \delta _0^h\). To ensure (C4), we must verify that

$$\begin{aligned} \displaystyle \sum _{n=0}^\infty \xi _n \lambda _0 {\widetilde{P}}_{n0}(\cos L) {\widetilde{P}}_{n0}(\cos L) + 2 \sum _{m=1}^\infty \lambda _m \sum _{n=m}^\infty \xi _n {\widetilde{P}}_{nm}(\cos L) {\widetilde{P}}_{nm}(\cos L) < \infty . \end{aligned}$$

Again, using that \(\{\lambda _m: m\in {{\mathbb {N}}}_0\}\) is bounded, it is sufficient to show that

$$\begin{aligned} \displaystyle \sum _{n=0}^\infty \xi _n {\widetilde{P}}_{n0}(\cos L) {\widetilde{P}}_{n0}(\cos L) + 2 \sum _{m=1}^\infty \sum _{n=m}^\infty \xi _n {\widetilde{P}}_{nm}(\cos L) {\widetilde{P}}_{nm}(\cos L) < \infty , \end{aligned}$$

which is equivalent to

$$\begin{aligned} \sum _{n=0}^\infty \xi _n \left\{ {\widetilde{P}}_{n0}(\cos L) {\widetilde{P}}_{n0}(\cos L) + 2 \sum _{m=1}^n {\widetilde{P}}_{nm}(\cos L) {\widetilde{P}}_{nm}(\cos L) \right\} < \infty . \end{aligned}$$

From the addition theorem for spherical harmonic functions, we obtain the simplified condition (2.8), which is true provided that \(\beta > 2\). □

The second part of Proposition 3.1, i.e., when \(\rho (h) = \delta _0^h\), matches previous literature related to the isotropic case (Schoenberg 1942; Lang and Schwab 2015). However, it is slightly more general because we are not necessarily assuming that \(\{\lambda _m: m\in {{\mathbb {N}}}_0\}\) is a sequence of ones.

3.2 Modeling the antisymmetric part

We illustrate a simple approach to construct the antisymmetric coefficients \(g_m(n,n')\) from the coefficients \(f_m(n,n')\).

Proposition 3.2

For each \(m\in {{\mathbb {N}}}\), consider a covariance function \(f_m(n,n')\) of type (3.1). Thus, for all \(\kappa \in {{\mathbb {R}}}\), the cross-covariance function

$$\begin{aligned} g_m(n,n') = \frac{\sqrt{\xi _n \xi _{n'} } \lambda _m}{{4}} \left\{ \rho (n-n' - \kappa ) - \rho (n-n'+\kappa )\right\} , \end{aligned}$$

is an admissible model, in the sense that (2.5) is a positive semidefinite matrix.

Proof

Consider two independent sequences of zero-mean random variables, \(\{{\widetilde{a}}_{nm}\}\) and \(\{{\widetilde{b}}_{nm}\}\), with a covariance structure of type (3.1). For a nonnegative integer q, new sequences of coefficients are defined

$$\begin{aligned} {\left\{ \begin{array}{ll} a_{nm} = \displaystyle \frac{1}{\sqrt{2}}\left( {\widetilde{a}}_{nm} + \sqrt{\frac{\xi _n}{\xi _{(n+q)}}} {\widetilde{b}}_{(n+q)m} \right) ,\\ b_{nm} = \displaystyle \frac{1}{\sqrt{2}}\left( \sqrt{\frac{\xi _n}{\xi _{(n+q)}}} {\widetilde{a}}_{(n+q)m} - {\widetilde{b}}_{nm}\right) . \end{array}\right. } \end{aligned}$$
(3.3)

Hence, \({\text {cov}}\{ {a}_{nm}, {a}_{n'm'} \}\) and \({\text {cov}}\{ {b}_{nm}, {b}_{n'm'} \}\) are given by (3.1), that is, the marginal covariance functions are preserved by transformation (3.3). In addition, the cross-covariance function between the sequences \(\{{a}_{nm}\}\) and \(\{{b}_{nm}\}\) is given by

$$\begin{aligned} {\text {cov}}\{ {a}_{nm}, {b}_{n'm'} \} = -\,{\text {cov}}\{ {b}_{nm}, {a}_{n'm'} \} = \frac{\delta _{m}^{m'} \sqrt{\xi _n \xi _{n'} } \lambda _m}{{4}} \left\{ \rho (n-n' - q) - \rho (n-n'+q)\right\} . \end{aligned}$$
(3.4)

Following a similar scheme with continuous indices, we conclude that the cross-covariance function (3.4) is also valid if we replace the integer q with a parameter \(\kappa\) that varies continuously on \({{\mathbb {R}}}\). More precisely, we consider random variables \({\widetilde{a}}_{nm}\) and \({\widetilde{b}}_{nm}\) with indices n and m on the real line. The sequences \({a}_{nm}\) and \({b}_{nm}\) are constructed as in (3.3) (with \(\kappa\) instead of q) by restricting these indices to be nonnegative integers. The obtained sequences will have the desired cross-covariance function with \(\kappa \in {{\mathbb {R}}}\). □

The antisymmetric part is identically equal to zero when \(\kappa = 0\), recovering a longitudinally reversible process. In Sect. 4.2, we will illustrate the impact of \(g_m(n,n')\) on the covariance function as well as on the realizations of the process.

3.3 Finite Karhunen–Loève expansion and its \(L^2\)-error

This section focuses on the approximation of axially symmetric Gaussian processes through a finite linear combination of spherical harmonic functions. This approximation can be performed by means of a truncated version of (2.4), where truncation is taken with respect to index n. Specifically, given a large \(N\in {{\mathbb {N}}}\), we consider

$$\begin{aligned} {\widehat{Z}}_N(L,\ell ) = \sum _{n=0}^{N} a_{n0} {\widetilde{P}}_{n0}(\cos L) + 2 \sum _{m=1}^N \sum _{n=m}^N \left\{ a_{nm} \cos (m\ell )+ b_{nm} \sin (m\ell ) \right\} {\widetilde{P}}_{nm}(\cos L). \end{aligned}$$
(3.5)

Note that (3.5) is a truncation of (2.4), where we have simply modified the order of summation for convenience. This approximation technique was developed for isotropic spatial processes by Lang and Schwab (2015) and for spatially isotropic space-time processes by Clarke et al. (2018).

The purpose now is to derive a bound for the \(L^2\)-error in terms of N. The following proposition characterizes the accuracy of the approximation.

Proposition 3.3

Let \(N\in {{\mathbb {N}}}\) and consider the stochastic processes \(Z(L,\ell )\) and \({\widehat{Z}}_N(L,\ell )\) in (2.4) and (3.5), respectively, under conditions (C1)–(C4). Then,

$$\begin{aligned} \Vert Z-{\widehat{Z}}_N \Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \sum _{n=N+1}^\infty f_0(n,n) + 2\sum _{n=N+1}^\infty \sum _{m=1}^n f_m(n,n). \end{aligned}$$
(3.6)

Proof

First, observe that \(Z-{\widehat{Z}}_N\) can be split into three parts, namely, \(Z -{\widehat{Z}}_N = T_1 + 2 T_2 + 2 T_3\), where

$$\begin{aligned} T_j = \sum _{(n,m)\in \Delta _j} \left\{ a_{nm} \cos (m\ell )+ b_{nm} \sin (m\ell ) \right\} {\widetilde{P}}_{nm}(\cos L), \end{aligned}$$

with \(\Delta _1 = \{ (n,m): n > N {\text { and }} m=0 \}\), \(\Delta _2 = \{ (n,m): n > N {\text { and }} 1 \le m \le N \}\), and \(\Delta _3 = \{ (n,m): n \ge m {\text { and }} m > N \}\). According to conditions (C1)–(C4), \(T_i\) and \(T_j\) are uncorrelated processes for all \(i\ne j\) since the sets \(\Delta _i\) and \(\Delta _j\) are disjoint sets in the index m. Thus,

$$\begin{aligned} \Vert Z-{\widehat{Z}}_N \Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \Vert T_1\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} + 4\Vert T_2\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} + 4\Vert T_3\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)}. \end{aligned}$$

The decomposition of \(Z-{\widehat{Z}}_N\) into three mutually uncorrelated processes is a key part of this proof. Using integration in terms of spherical coordinates, one has

$$\begin{aligned} \Vert T_1\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \sum _{n,n'=N+1}^\infty E\{a_{n0}a_{n'0}\} \int _0^{2\pi } \int _0^{\pi } {\widetilde{P}}_{n0}(\cos L) {\widetilde{P}}_{n'0}(\cos L) \sin L \, {\text {d}}L \, {\text {d}}\ell . \end{aligned}$$

Recall that \(E\{a_{n0}a_{n'0}\} = f_0(n,n')\). Additionally, using the orthogonality properties of the associated Legendre polynomials (see Abramowitz and Stegun 1964),

$$\begin{aligned} \int _0^{\pi } {\widetilde{P}}_{nm}(\cos L) {\widetilde{P}}_{n'm}(\cos L) \sin L \, {\text {d}}L = \frac{\delta _n^{n'}}{2\pi }, \end{aligned}$$
(3.7)

we conclude that \(\Vert T_1\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \sum _{n=N+1}^\infty f_0(n,n).\) In addition, we have

$$\begin{aligned} \Vert T_2\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)}&= \int _0^{2\pi } \sum _{m,m'=1}^N \sum _{n,n'=N+1}^\infty E\left( \left\{ a_{nm} \cos (m\ell )+ b_{nm} \sin (m\ell ) \right\} \right. \\&\quad \times \left. \,\left\{ a_{n'm'} \cos (m'\ell )+ b_{n'm'} \sin (m'\ell ) \right\} \right) \times \int _0^{\pi } {\widetilde{P}}_{nm}(\cos L) {\widetilde{P}}_{n'm'}(\cos L) \sin L \, {\text {d}}L \, {\text {d}}\ell . \end{aligned}$$

Using conditions (C1)–(C4) and the orthogonality properties of the trigonometric Fourier basis on \([0,2\pi ]\), we have

$$\begin{aligned} \Vert T_2\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \sum _{m=1}^N \sum _{n,n'=N+1}^\infty \pi f_m(n,n') \int _0^{\pi } {\widetilde{P}}_{nm}(\cos L) {\widetilde{P}}_{n'm}(\cos L) \sin L \, {\text {d}}L. \end{aligned}$$

Thus, (3.7) implies that

$$\begin{aligned} \Vert T_2\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \sum _{m=1}^N \sum _{n=N+1}^\infty \frac{f_m(n,n)}{2}. \end{aligned}$$

Using similar arguments, we obtain

$$\begin{aligned} \Vert T_3\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \sum _{m=N+1}^\infty \sum _{n=m}^\infty \frac{f_m(n,n)}{2}. \end{aligned}$$

We conclude the proof by noting that

$$\begin{aligned} \Vert T_2\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} + \Vert T_3\Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \sum _{n=N+1}^\infty \sum _{m=1}^n\frac{f_m(n,n)}{2}. \end{aligned}$$

A remarkable feature of Proposition 3.3 is that the \(L^2\)-error is completely characterized by the decay of the diagonal elements of the matrices \({\varvec{\Gamma }}_m\) in (2.5). Therefore, the antisymmetric part does not influence this quantity. In the isotropic case, we obtain a corollary result previously reported by Lang and Schwab (2015).

Corollary 3.1

Let \(N\in {{\mathbb {N}}}\) and consider \(Z(L,\ell )\) and \({\widehat{Z}}_N(L,\ell )\) in (2.4) and (3.5), respectively, under conditions (C1)–(C4). In addition, suppose that \(f_m(n,n') = \delta _n^{n'} \xi _n\). Then,

$$\begin{aligned} \Vert Z-{\widehat{Z}}_N \Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} = \sum _{n=N+1}^\infty (2n+1) \xi _n. \end{aligned}$$
(3.8)

Since Proposition 3.3 gives the \(L^2\)-error for every axially symmetric process, we derive an explicit bound when the coefficients \(f_m(n,n')\) are of the form (3.1). Such a result is reported in the following corollary.

Corollary 3.2

Let \(N\in {{\mathbb {N}}}\) and consider \(Z(L,\ell )\) and \({\widehat{Z}}_N(L,\ell )\) in (2.4) and (3.5), respectively, under conditions (C1)–(C4) with \(f_m(n,n')\) of the form (3.1). Suppose that there exist constants \(\beta >2\), \(r>0\) and \(n_0\in {{\mathbb {N}}}\) such that \(\xi _n \le r n^{-\beta }\), for all \(n > n_0\). Thus,

$$\begin{aligned} \Vert Z-{\widehat{Z}}_N \Vert ^2_{L^2(\Omega \times {{\mathbb {S}}}^2)} \le c N^{-(\beta -2)}, \end{aligned}$$
(3.9)

for some constant \(c>0\) that depends on \(r,\beta ,n_0\) and that is independent of N.

Corollary 3.2 above extends Proposition 5.2 in Lang and Schwab (2015), from the isotropic to the axially symmetric case. When \(f_m(n,n')\) is of the form (3.1), we have

$$\begin{aligned} \sum _{n=N+1}^\infty f_0(n,n) + 2\sum _{n=N+1}^\infty \sum _{m=1}^n f_m(n,n) \le \left( \sup _m \lambda _m \right) \sum _{n=N+1}^\infty (2 n+1) \xi _n. \end{aligned}$$

Thus, the proof of Corollary 3.2 follows the same arguments given by Lang and Schwab (2015). Additionally, it is worth noting that the same rate of convergence obtained in Corollary 3.2 can be derived in terms of \(L^p\) norms for every \(p>0\). The proof of this assertion is completely analogous to the proofs reported in Lang and Schwab (2015) and Cleanthous et al. (2020), so it is omitted.

4 Numerical experiments

4.1 Simulating axially symmetric Gaussian processes

The approximation methodology developed in the previous section suggests a natural simulation algorithm based on a weighted sum of finitely many spherical harmonic functions. We simulate longitudinally reversible Gaussian processes (antisymmetric coefficients are explored in the next subsection). Consider the following particular cases of (3.1).

Example 1

Let \(\rho (h) = \delta _0^{h}\) and consider a sequence \(\{\xi _n: n\in {{\mathbb {N}}}_0\}\) of Legendre–Matérn type (Guinness and Fuentes 2016), that is,

$$\begin{aligned} \xi _n = (\tau ^2 + n^2)^{-\nu -1/2}, \end{aligned}$$

where \(\tau\) and \(\nu\) are positive parameters. The sequence \(\{\lambda _m: m\in {{\mathbb {N}}}_0\}\) is taken as \(\lambda _m = 1_{[0,\alpha ]}(m)\). When \(\alpha \rightarrow \infty\), the covariance function of the process converges to the Legendre–Matérn model of Guinness and Fuentes (2016), which is associated with an isotropic process for which \(\tau\) and \(\nu\) regulate the range and mean square differentiability of the sample paths, respectively (see Guinness and Fuentes 2016 for details). Recall that a longitudinally independent structure is obtained with \(\alpha = 0\). Using Proposition 3.1, it is straightforward to verify that the summability condition (C4) holds when \(\nu >1/2\). In Fig. 1, we report simulated realizations with \(\nu = 1.5\), \(\tau ^2 = 100\) and different values for \(\alpha\). We set \(N = 200\) and a grid of longitudes and latitudes of size \(500 \times 500\). As \(\alpha\) increases, the features of the realizations gradually change from longitudinal independence to isotropy.

Fig. 1
figure 1

Simulated processes on a grid of longitudes and latitudes of size \(500\times 500\), with \(N=200\), and the covariance function described in Example 1, with \(\nu = 1.5\), \(\tau ^2 = 100\), and different values for \(\alpha\). The same random seed has been used for each realization

Example 2

The multiquadric model (see, e.g., Gneiting 2013) is characterized by the sequence

$$\begin{aligned} \xi _n = (1-\delta ) \delta ^n, \qquad n\in {{\mathbb {N}}}_0, \end{aligned}$$
(4.1)

where \(0<\delta <1\) is a parameter. The correlation function \(\rho\) and the sequence \(\{\lambda _m: m\in {{\mathbb {N}}}_0\}\) are taken as in Example 1. Here, the summability condition (C4) is visibly satisfied for any \(0<\delta <1\). Figure 2 displays simulated realizations with \(\delta =0.7\) and different values for \(\alpha\). We again consider \(N = 200\) and a grid of longitudes and latitudes of size \(500 \times 500\). The effect of \(\alpha\) on the realizations is similar to that reported in Example 1.

Fig. 2
figure 2

Simulated processes on a grid of longitudes and latitudes of size \(500\times 500\), with \(N=200\), and the covariance function described in Example 2, with \(\delta =0.7\), and different values for \(\alpha\). The same random seed has been used for each realization

Example 3

Unlike the previous examples, where \(\rho\) has been taken as a Kronecker delta, we now explore the effect of a different correlation on the realizations of the process. We consider the multiquadric coefficients (4.1), \(\lambda _m = 1_{[0,\alpha ]}(m)\) and an exponential correlation \(\rho (h) = \exp (-\,\phi |h|)\), where \(\phi\) is a positive parameter. Figure 3 reports realizations with \(\delta =0.7\), \(\alpha = 2\), and different values for \(\phi\). Again, we set \(N = 200\) and a spatial grid of size \(500 \times 500\). As \(\phi\) increases, the range of the correlation decreases, and we obtain a behavior that is similar to that reported in Example 2, as expected. However, as \(\phi\) decreases, we observe realizations that are characterized by strong correlations along latitudes.

Fig. 3
figure 3

Simulated processes on a grid of longitudes and latitudes of size \(500\times 500\), with \(N=200\), and the covariance function described in Example 3, with \(\delta =0.7\), \(\alpha =2\), and different values for \(\phi\). The same random seed has been used for each realization

For additional examples of sequences \(\{\xi _n: n\in {{\mathbb {N}}}_0\}\), we refer the reader to Ma (2012), Terdik (2015) and Leonenko et al. (2018).

4.2 Illustrating the effect of the antisymmetric part

The aim of this section is to show the impact of \(g_m(n,n')\) on the covariance function of an axially symmetric process. Figure 4 shows the contour plots of the covariance function given in Example 1 after adding the antisymmetric part (3.4) in terms of \(L_2\) and \(\Delta \ell\) for different fixed values of \(L_1 = \pi /3, \pi /2, 2\pi /3\). We set \(\tau ^2=100\), \(\nu =1.5\) and \(\alpha = 8\). Each panel considers \(\Delta \ell\) varying in the range \([-\,0.2,0.2]\), whereas \(L_2\) varies in the range \([L_1-0.2,L_1+0.2]\). The first row corresponds to a longitudinally reversible process (\(\kappa =0\)), whereas the second and third rows depict the distortion produced by \(g_m(n,n')\) with \(\kappa = 1\) and \(\kappa =-\,1\), respectively. We detect a rotation of the contour plots when the parameter \(\kappa\) is different from zero. The orientation of this rotation depends on the sign of \(\kappa\).

Fig. 4
figure 4

Contour plots of the covariance function given in Example 11 after adding the antisymmetric part (3.4) in terms of \(L_2\) and \(\Delta \ell\) for fixed values of \(L_1 = \pi /3, \pi /2, 2\pi /3\) (from left to right). Here, \(\tau ^2=100\), \(\nu =1.5\) and \(\alpha = 8\). The first row corresponds to a longitudinally reversible process (\(\kappa =0\)), whereas the second and third rows correspond to longitudinally irreversible processes with \(\kappa = 1\) and \(\kappa =-\,1\), respectively. Light blue colors indicate lower values

Figure 5 shows the contour plots of this covariance function in terms of \(L_1\) and \(L_2\) for different fixed values of \(\Delta \ell = -\,0.2,0,0.2\), with \(\tau ^2=100\), \(\nu =1.5\), \(\alpha = 8\), and \(\kappa =1\). Each panel considers \((L_1,L_2)\in [1.2,1.9]^2\). It is clear that the antisymmetric part produces a shift in the contour plots. The covariance function differs for \(\Delta \ell =-\,0.2\) and \(\Delta \ell =0.2\), illustrating the longitudinal irreversibility generated by our proposal. This asymmetry could also be observed in the second and third rows of Fig. 4 because the graphs do not reflect across the horizontal line given by \(\Delta \ell = 0\).

Fig. 5
figure 5

Contour plots of the covariance function given in Example 1 after adding the antisymmetric part (3.4) in terms of \(L_1\) and \(L_2\), for \(\Delta \ell = -\,0.2\) (left), \(\Delta \ell = 0\) (middle) and \(\Delta \ell = 0.2\) (right). Here, \(\tau ^2=100\), \(\nu =1.5\), \(\alpha = 8\), and \(\kappa =1\). Light blue colors indicate lower values

In Fig. 6, we report the realizations from the covariance function given in Example 1 after adding \(g_m(n,n')\) as in (3.4). Specifically, we consider \((\alpha , \kappa ) \in \{2, 4\}\times \{0, 1\}\). The inclusion of the antisymmetric part generates processes with a stronger anisotropy in the northwest direction than in other directions. This behavior is consistent with the contour plots previously reported.

Fig. 6
figure 6

Simulated processes on a grid of longitudes and latitudes of size \(500\times 500\), with \(N=200\), and the covariance function in Example 1 after adding the antisymmetric part (3.4). We set \(\nu = 1.5\), \(\tau ^2 = 100\), and different values for \(\alpha\) and \(\kappa\). The same random seed has been used for each realization

4.3 Assessing the accuracy of the simulation algorithm

We turn to a validation study of the quality of the simulation algorithm. We first study how the algorithm reproduces the theoretical variogram structure. We simulate 1000 independent realizations over 1000 spatial locations, considering the covariance function given in Example 1, with \(\nu =1.5\), \(\tau ^2=100\) and \(\alpha =10\). Then, the empirical local variograms are obtained for fixed latitudes and compared to the theoretical variograms. Figure 7 displays the results for four distinct latitudes. Note that on average, the empirical variograms match the theoretical models. The variability of the empirical variograms increases as the longitudinal lag increases, which is commonly observed in practice (see, e.g., Cuevas et al. 2020). Additionally, for latitudes close to the south pole, the variabilities of the empirical variograms are more severe than those far from the south pole.

We also verify the theoretical bound for the \(L^2\)-error via simulations. We consider the same setting as in the previous examples, i.e.; in particular we have that \(\nu =1.5\), which corresponds to a quadratic algebraic decay of the error. The true process is taken as the Karhunen–Loève expansion with \(N=1000\), since for larger N, we do not observe substantial variations. Then, we progressively truncate the expansion at different values of N and look at the decay of the error on a logarithmic scale. Following Lang and Schwab (2015), instead of the \(L^2\)-error in space, we quantify a stronger error given by the maximum error over all grid points. Figure 8 shows that, on average, we obtain the expected convergence rate for 1000 independent repetitions of this experiment. More precisely, the empirical convergence rate is given by \(N^{-\,2.009}\), which is close to the theoretical convergence rate.

Fig. 7
figure 7

Empirical variograms for 1000 independent simulations from Example 1 versus theoretical variograms. Here, we consider four distinct latitudes and the parameters \(\nu =1.5\), \(\tau ^2=100\) and \(\alpha =10\). For each panel, the red dashed line is the theoretical variogram, the blue dotted line is the average empirical variogram, and the gray zones show the empirical variogram envelopes based on the 1000 repetitions

Fig. 8
figure 8

Convergence rates in terms of N, in a logarithmic scale, for 1000 independent realizations. The theoretical rate is given by the blue line. The average empirical rate, associated with the 1000 repetitions, is given by the black line

5 Discussion

In this paper, we discuss several aspects related to Karhunen–Loève expansions of axially symmetric Gaussian processes. We illustrate how to obtain the limit cases, isotropy and longitudinal independence, by means of an adequate choice of the Karhunen–Loève coefficients. We have also incorporated an antisymmetric coefficient, which allows for the parametric regulation of longitudinal reversibility. Bounds for the \(L^2\)-error associated with a truncated version of the Karhunen–Loève expansion have been derived. This weighted sum of finitely many spherical harmonic functions serves as a natural simulation strategy. Simulation experiments are performed that show the effectiveness of our proposal: (1) it reproduces the prescribed second-order dependency, and (2) the empirical convergence rate of the truncation error matches the theoretical one.

The investigation of more complex coefficients and their impact on the attributes of the axially symmetric process is an interesting topic that merits more attention. The smoothness and Hölder continuity properties could be explored in a similar fashion to the works of Lang and Schwab (2015), Kerkyacharian et al. (2018) and Cleanthous et al. (2020). The simultaneous modeling of multiple correlated spatial processes on spheres, each one having an axially symmetric structure, is also a promising research direction. The findings of Jun (2011), Alegría et al. (2019) and Emery et al. (2019b) might be useful here. Exploring axially symmetric processes that evolve temporally is another interesting topic.

Our findings are not limited to the simulation of axially symmetric processes and may certainly be used for both the modeling and prediction of global data. The search for accurate and efficient methods to estimate the parameters involved in our models is a challenging topic that we expect to tackle in the future.