1 Introduction

In ocean data assimilation (or analysis), the coordinates (x, y, z) are usually represented by the position vector r, with grid points denoted by r n , n = 1, 2, …, N, and observational locations by r (m), m = 1, 2, …, M. Here, N is the total number of grid points, and M is the total number of observational points. A single variable or multiple variables c = (u, v, T, S, …), whether two- or three-dimensional, can be ordered by grid point and by variable to form a single vector of length NP, with P the number of variables. For multiple variables, non-dimensionalization is conducted before forming the single vector c (Chu et al. 2015), with the “true”, analysis, and background fields (c t , c a , c b ) and the observational data (c o ) represented by N- and M-dimensional vectors,

$$ {\mathbf{c}}_{t,a,b}^T=\left[{c}_{t,a,b}\left({\mathbf{r}}_1\right),{c}_{t,a,b}\left({\mathbf{r}}_2\right),\dots, {c}_{t,a,b}\left({\mathbf{r}}_N\right)\right],\kern0.75em {\mathbf{c}}_o^T=\left[{c}_o\left({\mathbf{r}}^{(1)}\right),{c}_o\left({\mathbf{r}}^{(2)}\right),\dots, {c}_o\left({\mathbf{r}}^{(M)}\right)\right], $$
(1)

where the superscript ‘T’ denotes transpose. The innovation (also called the observational increment),

$$ \mathbf{d}\equiv \left({\mathbf{c}}_o-\mathbf{H}{\mathbf{c}}_b\right), $$
(2)

represents the difference between the observational and background data at the observational points r (m). Here, H = [h mn ] is an M × N linear observation operator matrix converting the background field c b (at the grid points, r n ) into “first guess observations” at the observational points r (m) (Fig. 1).

Fig. 1
figure 1

Illustration of ocean data assimilation with c b located at the grid points and c o located at the observational points (asterisks). Ocean data assimilation converts the innovation, d = c o  − H c b , from the observational points to the grid points

The analysis error (ε a ) and observational error (ε o ) are defined by

$$ {\boldsymbol{\upvarepsilon}}_a={\mathbf{c}}_a-{\mathbf{c}}_t,\kern0.75em {\boldsymbol{\upvarepsilon}}_o\equiv {\mathbf{H}}^T{\mathbf{c}}_o-{\mathbf{c}}_t, $$
(3a)

which are evaluated at the grid points. The two errors are usually assumed to be independent of each other,

$$ \left\langle {\boldsymbol{\upvarepsilon}}_o^T{\boldsymbol{\upvarepsilon}}_a\right\rangle =0,\kern1em \left\langle \right\rangle \equiv \frac{1}{N-1}{\displaystyle \sum_{n=1}^N\left[\right]}. $$
(3b)

Minimization of the analysis error variance

$$ {E}^2=\left\langle {\boldsymbol{\upvarepsilon}}_a^T{\boldsymbol{\upvarepsilon}}_a\right\rangle \to \min $$
(4)

gives the optimal analysis field c a for the “true” field c t .

A common practice in ocean data assimilation (or analysis) is to use an N × M weight matrix W = [w nm ] to blend c b (at the grid points r n ) with the innovation d (at the observational points r (m)) (Evensen 2003; Tang and Kleeman 2004; Chu et al. 2004a, 2015; Galanis et al. 2006; Oke et al. 2008; Han et al. 2013; Yan et al. 2015)

$$ {\mathbf{c}}_a={\mathbf{c}}_b+\mathbf{W}\mathbf{d}. $$
(5)

Minimization of the analysis error variance with respect to weights,

$$ \partial {E}^2/\partial {w}_{nm}=0, $$
(6)

determines the weight matrix

$$ \mathbf{W}=\mathbf{B}{\mathbf{H}}^T{\left(\mathbf{H}\mathbf{B}{\mathbf{H}}^T+\mathbf{R}\right)}^{-1}. $$
(7)

Here, B is the N × N background error covariance matrix; R is the M × M observational error covariance matrix and is usually simplified as a product of an observational error variance (\( {e}_o^2 \)) and an identity matrix I,

$$ \mathbf{R}={e}_o^2\mathbf{I}. $$
(8)

Substitution of (7) into (5) leads to the optimal interpolation (OI) equation,

$$ {\mathbf{c}}_a={\mathbf{c}}_b+\mathbf{B}{\mathbf{H}}^T{\left(\mathbf{H}\mathbf{B}{\mathbf{H}}^T+\mathbf{R}\right)}^{-1}\mathbf{d}, $$
(9)

which produces the analysis field c a from the innovation d. The challenge for the OI method is the determination of the background error covariance matrix B.
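
For readers who prefer code to matrix algebra, a minimal NumPy sketch of the OI update (9) is given below; it assumes c b , c o , H, B, and R are already available as arrays, and the function name is illustrative.

```python
import numpy as np

def oi_update(c_b, c_o, H, B, R):
    """Optimal interpolation update, Eq. (9)."""
    d = c_o - H @ c_b                               # innovation, Eq. (2)
    S = H @ B @ H.T + R                             # M x M matrix  H B H^T + R
    return c_b + B @ (H.T @ np.linalg.solve(S, d))  # analysis c_a, Eqs. (5) and (7)
```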

An alternative approach is to use a spectral method with lateral boundary (Γ) information to decompose the variable anomaly at the grid points [c(r n ) − c b (r n )] into (Chu et al. 2015),

$$ {c}_a\left({\mathbf{r}}_n\right)-{c}_b\left({\mathbf{r}}_n\right)={s}_K\left({\mathbf{r}}_n\right),\kern0.75em {s}_K\left({\mathbf{r}}_n\right)\equiv {\displaystyle \sum_{k=1}^K{a}_k\ }{\phi}_k\left({\mathbf{r}}_n\right), $$
(10)

where {ϕ k } are basis functions and K is the mode truncation. The eigenvectors of the Laplace operator with the same lateral boundary condition as (c − c b ) can be used as the set of basis functions {ϕ k } and written in matrix form (Chu et al. 2015)

$$ \boldsymbol{\Phi} =\left\{{\phi}_{kn}\right\}=\left[\begin{array}{cccc}\hfill {\phi}_1\left({\mathbf{r}}_1\right)\hfill & \hfill {\phi}_2\left({\mathbf{r}}_1\right)\hfill & \hfill \dots \hfill & \hfill {\phi}_K\left({\mathbf{r}}_1\right)\hfill \\ {}\hfill {\phi}_1\left({\mathbf{r}}_2\right)\hfill & \hfill {\phi}_2\left({\mathbf{r}}_2\right)\hfill & \hfill \dots \hfill & \hfill {\phi}_K\left({\mathbf{r}}_2\right)\hfill \\ {}\hfill \dots \hfill & \hfill \dots \hfill & \hfill \dots \hfill & \hfill \dots \hfill \\ {}\hfill {\phi}_1\left({\mathbf{r}}_N\right)\hfill & \hfill {\phi}_2\left({\mathbf{r}}_N\right)\hfill & \hfill \dots \hfill & \hfill {\phi}_K\left({\mathbf{r}}_N\right)\hfill \end{array}\right]. $$
(11)

For a given mode truncation K, minimization of the analysis error variance (4) with respect to the spectral coefficients

$$ \partial {E}_K^2/\partial {a}_k=0,\kern0.75em k=1,\ldots,K $$
(12)

gives the spectral ocean data assimilation equation (Chu et al. 2004b, 2015),

$$ {\mathbf{c}}_a={\mathbf{c}}_b+\mathbf{F}{\boldsymbol{\Phi}}^T{\left[\boldsymbol{\Phi} \mathbf{F}{\boldsymbol{\Phi}}^T\right]}^{-1}\boldsymbol{\Phi} {\mathbf{H}}^T\mathbf{d}, $$
(13)

where F is an N × N (diagonal) observational contribution matrix

$$ \mathbf{F}=\left[\begin{array}{cccccc}\hfill {f}_1\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill {f}_2\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill \ddots \hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill {f}_n\hfill & \hfill 0\hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill \ddots \hfill & \hfill 0\hfill \\ {}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill {f}_N\hfill \end{array}\right],\kern1.25em {f}_n\equiv {\displaystyle \sum_{m=1}^M{h}_{nm}}. $$
(14)

Here, the matrices Φ, F, and H are all known, in contrast to the OI Eq. (9), where the background error covariance matrix B needs to be determined.
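
A corresponding sketch of the OSD analysis Eq. (13) follows. Note that in Eq. (13) Φ enters as a K × N matrix, so the array Phi below, stored with the (N, K) layout displayed in Eq. (11), appears through its transpose; the routine is a direct transcription of Eq. (13), and all names are illustrative.

```python
import numpy as np

def osd_update(c_b, c_o, H, Phi):
    """Spectral (OSD) analysis, Eq. (13).

    Phi : (N, K) array whose columns are the retained basis functions, as laid out in Eq. (11).
    H   : (M, N) observation operator.
    """
    d = c_o - H @ c_b                   # innovation, Eq. (2)
    f = H.sum(axis=0)                   # f_n: total operator weight attached to grid point n, Eq. (14)
    F = np.diag(f)                      # diagonal observational contribution matrix F
    rhs = Phi.T @ (H.T @ d)             # Phi H^T d in Eq. (13)
    A = Phi.T @ F @ Phi                 # Phi F Phi^T in Eq. (13)
    a = np.linalg.solve(A, rhs)         # spectral coefficients a_k
    return c_b + F @ (Phi @ a)          # c_a = c_b + F Phi^T [Phi F Phi^T]^{-1} Phi H^T d
```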

This spectral method has been proven effective for ocean data analysis. Chu et al. (2003a, b) named the spectral method the optimal spectral decomposition (OSD). With it, several new ocean phenomena have been identified from observational data, such as a bi-modal structure of chlorophyll-a with winter/spring (February–March) and fall (September–October) blooms in the Black Sea (Chu et al. 2005a), a fall–winter recurrence of the current reversal from westward to eastward on the Texas–Louisiana continental shelf from current-meter and near-surface drifting buoy data (Chu et al. 2005b), the propagation of long Rossby waves at mid-depths (around 1000 m) in the tropical North Atlantic from Argo float data (Chu et al. 2007), and the temporal and spatial variability of the global upper ocean heat content (Chu 2011) from the data of the Global Temperature and Salinity Profile Program (GTSPP, Sun et al. 2009).

The spectral mode truncation is key to the success of the OSD method. It acts as a spatial low-pass filter, allowing the highest wave numbers (those corresponding to the highest spectral eigenvalues) that can be retained without aliasing, given the information provided by the observational network.

Questions arise: Can a simple and effective mode truncation method be developed that takes into account the model resolution (i.e., the total number of model grid points)? What are the major differences between OI and OSD? What are the quality and uncertainty of the OSD method? The purpose of this paper is to answer these questions. The remainder of the paper is organized as follows. Section 2 describes the error analysis. Section 3 presents the steep-descending mode truncation method. Section 4 shows the idealized “truth” and “observational” fields. Section 5 compares the analysis fields between OSD and OI. Section 6 introduces three synoptic monthly gridded world ocean temperature, salinity, and absolute geostrophic velocity datasets produced with the OSD method and quality controlled by the NOAA National Centers for Environmental Information (NCEI). Conclusions are given in Section 7. Appendices A and B briefly describe several methods to determine the H matrix. Appendix C shows the determination of the basis functions. Appendix D presents the Vapnik-Chervonenkis dimension for mode truncation. Appendix E depicts the special B matrix used in this study.

2 Error analysis

Low mode truncation does not represent reality well, while high mode truncation may contain too much noise. Let the truncated spectral representation s K in (10) at the grid points form an N-dimensional vector,

$$ {\mathbf{s}}_K^T=\left[{s}_K\left({\mathbf{r}}_1\right),{s}_K\left({\mathbf{r}}_2\right),\dots, {s}_K\left({\mathbf{r}}_N\right)\right]. $$
(15)

The M-dimensional innovation vector [see (2)]

$$ {\mathbf{d}}^T=\left[d\left({\mathbf{r}}^{(1)}\right),d\left({\mathbf{r}}^{(2)}\right),\dots, d\left({\mathbf{r}}^{(M)}\right)\right] $$

at observational points can be transformed into the grid points

$$ {D}_n\equiv D\left({\mathbf{r}}_n\right)=\frac{{\displaystyle \sum_{m=1}^M{h}_{nm}{d}^{(m)}}}{f_n},\kern0.75em {f}_n\equiv {\displaystyle \sum_{m=1}^M{h}_{nm}}, $$
(16)

where D(r n) represents the observational innovation at the grid points,

$$ D\left({\mathbf{r}}_n\right)={c}_o\left({\mathbf{r}}_n\right)-{c}_b\left({\mathbf{r}}_n\right). $$
(17)

From Eq. (3a), the observations at the grid points are computed as c o (r n ) = [H T c o ] n , while the original background state c b (r n ) remains in grid space. The matrix form of (16) is

$$ \mathbf{F}\mathbf{D}={\mathbf{H}}^T\mathbf{d}, $$
(18)

where f n denotes the contribution of all observational data to the grid point r n . The larger the value of f n , the larger the observational influence on that grid point (r n ). D is an N-dimensional vector at the grid points,

$$ {\mathbf{D}}^T=\left({D}_1,{D}_2,\dots, {D}_N\right) $$
(19)
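
A short sketch of the transformation (16)-(18), assuming H is stored as an (M, N) NumPy array; leaving D = 0 at grid points with f n  = 0 (no observational influence) is an illustrative choice not specified in the text.

```python
import numpy as np

def innovation_to_grid(d, H):
    """Map the innovation d (length M) to the grid points, Eqs. (16)-(18)."""
    f = H.sum(axis=0)                              # f_n: observational contribution per grid point
    D = np.zeros(H.shape[1])                       # N-dimensional vector D, Eq. (19)
    touched = f > 0                                # grid points with non-zero observational influence
    D[touched] = (H.T @ d)[touched] / f[touched]   # D_n, Eq. (16); equivalently F D = H^T d, Eq. (18)
    return D, f
```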

The analysis error (i.e., the analysis c a versus the “truth” c t ) in the spectral data assimilation [see (10)] is given by

$$ \begin{array}{l}{\varepsilon}_a\left({\mathbf{r}}_n\right)\equiv {c}_a\left({\mathbf{r}}_n\right)-{c}_t\left({\mathbf{r}}_n\right)\\ {}=\left[{c}_a\left({\mathbf{r}}_n\right)-{c}_b\left({\mathbf{r}}_n\right)\right]-\left[{c}_o\left({\mathbf{r}}_n\right)-{c}_b\left({\mathbf{r}}_n\right)\right]+\left[{c}_o\left({\mathbf{r}}_n\right)-{c}_t\left({\mathbf{r}}_n\right)\right]\\ {}={s}_K\left({\mathbf{r}}_n\right)-D\left({\mathbf{r}}_n\right)+{\varepsilon}_o\left({\mathbf{r}}_n\right)\end{array} $$
(20)

Here, (10) and (17) are used. The analysis error is decomposed into two parts

$$ {\varepsilon}_a\left({\mathbf{r}}_n\right)={\varepsilon}_K\left({\mathbf{r}}_n\right)+{\varepsilon}_o\left({\mathbf{r}}_n\right), $$
(21)

with the truncation error given by

$$ {\varepsilon}_K\left({\mathbf{r}}_n\right)={s}_K\left({\mathbf{r}}_n\right)-D\left({\mathbf{r}}_n\right), $$
(22a)

and the observational error given by

$$ {\varepsilon}_o\left({\mathbf{r}}_n\right)={c}_o\left({\mathbf{r}}_n\right)-{c}_t\left({\mathbf{r}}_n\right). $$
(22b)

3 Steep-descending mode truncation

The Vapnik-Chervonenkis dimension (Vapnik 1983; Chu et al. 2003a, 2015) was used to determine the optimal mode truncation K OPT. As depicted in Appendix D, it depends only on the ratio of the total number of observational points (M) to the spectral truncation (K) and does not depend on the total number of model grid points (N). This method neglects the observational error and ignores the model resolution. In fact, the analysis error variance over the whole domain is given by

$$ {E}_a^2\equiv \left\langle \left[{\boldsymbol{\upvarepsilon}}_a^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_a\right]\right\rangle =\left\langle \left[{\boldsymbol{\upvarepsilon}}_K^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_K\right]\right\rangle +2\left\langle \left[{\boldsymbol{\upvarepsilon}}_K^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_o\right]\right\rangle +\left\langle \left[{\boldsymbol{\upvarepsilon}}_o^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_o\right]\right\rangle, \kern0.5em \left\langle \left[{\boldsymbol{\upvarepsilon}}_o^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_o\right]\right\rangle =\frac{M}{N}{e}_o^2, $$
(23)

where \( {e}_o^2 \) is the observational error variance [see (8)]. Here, the observational error is assumed to be the same at the grid points as at the observational points, a consequence of the simplification of the error covariance matrix R = \( {e}_o^2 \) I. The Cauchy-Schwarz inequality shows that

$$ \begin{array}{l}{E}_a^2\le \left\langle \left[{\boldsymbol{\upvarepsilon}}_K^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_K\right]\right\rangle +2\sqrt{\left\langle \left[{\boldsymbol{\upvarepsilon}}_K^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_K\right]\right\rangle}\sqrt{\left\langle \left[{\boldsymbol{\upvarepsilon}}_o^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_o\right]\right\rangle }+\left\langle \left[{\boldsymbol{\upvarepsilon}}_o^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_o\right]\right\rangle \\ {}={E}_K^2+2{E}_K\sqrt{M/N}\,{e}_o+\left(M/N\right){e}_o^2.\end{array} $$
(24)

The relative analysis error reduction at the mode-K can be expressed by the ratio

$$ {\gamma}_K=\mathit{\ln}\left[\frac{E_{K-1}^2+2{E}_{K-1}\sqrt{M/N}\,{e}_o+M{e}_o^2/N}{E_K^2+2{E}_K\sqrt{M/N}\,{e}_o+M{e}_o^2/N}\right],\kern1.25em K=2,3,\dots $$
(25)

Both E K and E K−1 are large for small K (low-mode truncation), which may lead to a small value of γ K . Both E K and E K−1 are small for large K (high-mode truncation), which also leads to a small value of γ K . An optimal truncation should lie between the low-mode and high-mode truncations, with a larger value of γ K (above a threshold). This procedure is illustrated as follows. The values (γ 2 , γ 3 , …, γ KB ) are calculated using (25) up to a large K B (say 250). The mean and standard deviation of γ can be computed as,

$$ \overline{\gamma}=\frac{1}{K_B-1}{\displaystyle \sum_{K=2}^{K_B}{\gamma}_K},s=\sqrt{\frac{1}{K_B-2}{\displaystyle \sum_{K=2}^{K_B}{\left({\gamma}_K-\overline{\gamma}\right)}^2}}. $$
(26)

Suppose that the relative error reductions (γ 2 , γ 3 , …, γ KB ) follow a Gaussian distribution. A 100(1 − α) % upper one-sided confidence bound on γ is given by

$$ {\gamma}_{th}=\overline{\gamma}+{z}_{\alpha }s, $$
(27)

which is used as the threshold for the mode truncation. Here, z α is the upper α quantile of the standard Gaussian distribution (zero mean and unit standard deviation). If several γ values exceed the threshold, the highest mode

$$ {K}_{\mathrm{OPT}}=\underset{\gamma_K\ge {\gamma}_{th}}{\mathit{\max}}(K) $$
(28)

is selected for mode truncation. After the mode truncation K OPT is determined, the spectral coefficients (a k , k = 1, 2, …, K OPT) can be calculated, and so can the truncation error variance \( {E}_{K_{\mathrm{OPT}}}^2 \).
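
The steep-descending procedure (25)-(28) can be summarized in a few lines of code. The sketch below assumes the truncation errors E K for K = 1, …, K B have already been computed (e.g., from least-squares fits of the first K modes to the gridded innovation); the function name, the default z quantile, and the fallback when no γ K exceeds the threshold are illustrative.

```python
import numpy as np

def steep_descending_truncation(E, M, N, e_o, z_alpha=1.645):
    """Steep-descending mode truncation, Eqs. (25)-(28).

    E : array of truncation errors E_K for K = 1, ..., K_B.
    Returns the optimal truncation K_OPT.
    """
    obs_term = (M / N) * e_o**2                       # observational term, Eq. (23)
    cross = 2.0 * np.sqrt(M / N) * e_o                # cross-term coefficient, Eq. (24)
    bound = E**2 + cross * E + obs_term               # upper bound on E_a^2, Eq. (24)
    gamma = np.log(bound[:-1] / bound[1:])            # gamma_K for K = 2, ..., K_B, Eq. (25)
    gamma_th = gamma.mean() + z_alpha * gamma.std(ddof=1)   # threshold, Eqs. (26)-(27)
    exceed = np.nonzero(gamma >= gamma_th)[0]         # modes whose gamma_K exceeds the threshold
    return int(exceed[-1]) + 2 if exceed.size else 1  # highest such K, Eq. (28); fallback K = 1
```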

3.1 Multi-platform observations

Let observations be conducted by L instruments with different observational errors \( {e}_o^{(l)} \), deployed at \( {\mathbf{r}}_l^{\left({m}_l\right)} \) (m l  = 1, 2, …, M l ; l = 1, 2, …, L). The total number of observations is \( M={\displaystyle \sum_{l=1}^L{M}_l} \). The M-dimensional observational vector is represented by

$$ {\mathbf{c}}_o^T=\left[\begin{array}{l}{c}_o\left({\mathbf{r}}_1^{(1)}\right),{c}_o\left({\mathbf{r}}_1^{(2)}\right),\dots, {c}_o\left({\mathbf{r}}_1^{\left({M}_1\right)}\right),{c}_o\left({\mathbf{r}}_2^{(1)}\right),{c}_o\left({\mathbf{r}}_2^{(2)}\right),\dots, {c}_o\left({\mathbf{r}}_2^{\left({M}_2\right)}\right),\dots, \\ {}{c}_o\left({\mathbf{r}}_L^{(1)}\right),{c}_o\left({\mathbf{r}}_L^{(2)}\right),\dots, {c}_o\left({\mathbf{r}}_L^{\left({M}_L\right)}\right)\end{array}\right] $$
(29)

The observational error variance is given by

$$ \left\langle {\boldsymbol{\upvarepsilon}}_o^T\mathbf{F}{\boldsymbol{\upvarepsilon}}_o\right\rangle =\left[{M}_1{\left({e}_o^{(1)}\right)}^2+{M}_2{\left({e}_o^{(2)}\right)}^2+\dots +{M}_L{\left({e}_o^{(L)}\right)}^2\right]/N. $$
(30)

The relative error reduction γ K for mode truncation (25) is replaced by

$$ {\gamma}_K=\mathit{\ln}\left[\frac{E_{K-1}^2+2{E}_{K-1}{\displaystyle \sum_{l=1}^L\sqrt{M_l/N}{e}_o^{(l)}}+{\displaystyle \sum_{l=1}^L{M}_l{\left({e}_o^{(l)}\right)}^2}/N}{E_K^2+2{E}_K{\displaystyle \sum_{l=1}^L\sqrt{M_l/N}{e}_o^{(l)}}+{\displaystyle \sum_{l=1}^L{M}_l{\left({e}_o^{(l)}\right)}^2/N}}\right],\kern1.25em K=2,3,\dots $$
(31)

After the mode truncation is determined, the OSD Eq. (13) is used to get the analysis field.
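
For multi-platform observations, only the observational terms in the bound (24) change; below is a sketch of the modified ratio (31), assuming per-instrument arrays of observation counts M l and error standard deviations e o (l) (names illustrative).

```python
import numpy as np

def gamma_multi(E, M_l, e_o_l, N):
    """Relative error reduction gamma_K for L instrument types, Eq. (31)."""
    M_l = np.asarray(M_l, dtype=float)
    e_o_l = np.asarray(e_o_l, dtype=float)
    cross = 2.0 * np.sum(np.sqrt(M_l / N) * e_o_l)   # cross-term coefficient in Eq. (31)
    obs_term = np.sum(M_l * e_o_l**2) / N            # observational term in Eq. (31)
    bound = E**2 + cross * E + obs_term
    return np.log(bound[:-1] / bound[1:])            # gamma_K, K = 2, ..., K_B
```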

4 “Truth,” “background,” and “observational” fields

Consider an artificial non-dimensional horizontal domain (−19 < x < 19, −15 < y < 15) with the four curved rigid boundaries (Fig. 2):

$$ \begin{array}{l}\frac{x}{10}-0.3 \cos \left(\frac{y}{8}\right) \sin \left(\frac{x}{10}\right)=\xi =\left\{\begin{array}{c}\hfill -\pi /2\kern1.25em \left(\mathrm{west}\right)\hfill \\ {}\hfill \kern0.75em \pi /2\kern1.5em \left(\mathrm{east}\right)\kern2.5em \hfill \end{array}\right.\\ {}\frac{y}{8}-0.2 \sin \left(\frac{x}{5}\right)\left[1- \cos \left(\frac{y}{8}\right)\right]=\eta =\left\{\begin{array}{c}\hfill -\pi /2\kern1.25em \left(\mathrm{south}\right)\hfill \\ {}\hfill \kern0.75em \pi /2\kern1.5em \left(\mathrm{north}\right)\kern2.5em \hfill \end{array}\right.\end{array} $$
(32)
Fig. 2
figure 2

Horizontal non-dimensional domain with four curved rigid boundaries with each boundary given by Eq. (32)

The domain is discretized with Δx = Δy = 0.5. The total number of grid points inside the domain is N = 3569. Figure 3 shows the first 12 basis functions {ϕ k }, which are the eigenvectors of the Laplacian operator with the Dirichlet boundary condition, i.e., b 1 = 0 in (61) of Appendix C.

Fig. 3
figure 3

Basis functions from ϕ1 to ϕ12 for the domain depicted by Eq. (32)

The first basis function ϕ 1 (r n ) shows a one-gyre structure. The second and third basis functions ϕ 2 (r n ) and ϕ 3 (r n ) show east-west and north-south dual-eddy structures. The fourth basis function ϕ 4 (r n ) shows an east-west slanted dipole pattern with opposite signs in the northeastern region (positive) and the southwestern region (negative). The fifth basis function ϕ 5 (r n ) shows a tripole pattern with negative values in the western and eastern regions and positive values in between. The higher order basis functions have more complicated variability structures.
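
A minimal sketch of how such basis functions can be computed numerically: build the 5-point Laplacian on the grid points inside the domain with the homogeneous Dirichlet condition (b 1 = 0) and take its leading eigenvectors. The mask describing the curved domain (32) is assumed to be given, and a dense eigensolver is used for brevity; for the N = 3569 grid points of this study a sparse solver would normally be preferred.

```python
import numpy as np

def laplacian_eigenbasis(mask, h, K):
    """Eigenvectors of the 5-point Laplacian with a Dirichlet condition on a masked grid.

    mask : 2-D boolean array, True for grid points inside the domain.
    h    : grid spacing (here dx = dy = 0.5).
    K    : number of basis functions (modes) to return.
    """
    idx = -np.ones(mask.shape, dtype=int)
    idx[mask] = np.arange(mask.sum())                 # number the interior points
    N = mask.sum()
    L = np.zeros((N, N))
    for (i, j), n in np.ndenumerate(idx):
        if n < 0:
            continue
        L[n, n] = -4.0 / h**2                         # center point of the 5-point stencil
        for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ii, jj = i + di, j + dj
            if 0 <= ii < mask.shape[0] and 0 <= jj < mask.shape[1] and idx[ii, jj] >= 0:
                L[n, idx[ii, jj]] = 1.0 / h**2        # neighbor inside the domain; outside = 0 (Dirichlet)
    vals, vecs = np.linalg.eigh(-L)                   # -L is symmetric; eigenvalues returned ascending
    return vecs[:, :K]                                # columns are phi_1, ..., phi_K as in Eq. (11)
```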

Two “truth” fields for the non-dimensional domain with four rigid and curved boundaries (Fig. 2) contain multiple mesoscale eddies and are given by

$$ \left\{\begin{array}{c}\hfill {c}_t\left(x,y\right)=25-{y}^2/40+3 \cos \left[{L}_x\xi \left(x,y\right)\right]\mathit{\sin}\left[{L}_y\eta \left(x,y\right)+\beta \right]\hfill \\ {}\hfill \begin{array}{l}\xi =\frac{x}{10}-0.3 \cos \left(\frac{y}{8}\right) \sin \left(\frac{x}{10}\right),\kern0.75em \eta =\frac{y}{8}-0.2 \sin \left(\frac{x}{5}\right)\left[1- \cos \left(\frac{y}{8}\right)\right]\\ {}\left({L}_x,{L}_y,\beta \right)=\left(3,2,\pi /2\right)\end{array}\hfill \end{array}\right., $$
(33)

for the large-eddy field (Fig. 4a) and given by

$$ \left\{\begin{array}{c}\hfill {c}_t\left(x,y\right)=25-{y}^2/40+3 \cos \left[{L}_x\xi \left(x,y\right)\right]\mathit{\cos}\left[{L}_y\eta \left(x,y\right)+\beta \right]\hfill \\ {}\hfill \begin{array}{l}\xi =\frac{x}{10}-0.3 \cos \left(\frac{y}{8}\right) \sin \left(\frac{x}{10}\right),\kern0.75em \eta =\frac{y}{8}-0.2 \sin \left(\frac{x}{5}\right)\left[1- \cos \left(\frac{y}{8}\right)\right]\\ {}\left({L}_x,{L}_y,\beta \right)=\left(7,5,0\right)\end{array}\hfill \end{array}\right. $$
(34)
Fig. 4
figure 4

“Truth” field c t taken as a the analytical function (33) with large-scale eddy field L x  = 3, L y  = 2, β = π/2, and b the analytical function (34) with small-scale eddy field L x  = 7, L y  = 5, β = 0

for the small-eddy field (Fig. 4b). The background field is given by

$$ {c}_b\left(x,y\right)=25-{y}^2/40 $$
(35)

The “observational” points {r (m)} are randomly selected inside the domain (Fig. 5), with a total number M = 300, and are kept the same for all the sensitivity studies.

Fig. 5
figure 5

Randomly selected locations (total: 300) inside the domain as “observational” points

Sixteen sets of “observations” (c o ) are constructed from Fig. 4a, b using the analytical values plus white Gaussian noise (ε o ) of zero mean and various standard deviations (σ) from 0 (no noise) to 2.0, with a 0.1 increment from 0 to 1.0 and a 0.2 increment from 1.0 to 2.0 (16 sets in total), generated in MATLAB,

$$ {c}_o\left({\mathbf{r}}^{(m)}\right)={c}_t\left({\mathbf{r}}^{(m)}\right)+{\varepsilon}_o\left({\mathbf{r}}^{(m)}\right). $$
(36)

Figure 6a, b shows 6 out of the 16 constructed sets with σ = (0, 0.2, 0.5, 1.0, 1.6, 2.0). Both the OSD and OI methods are used to obtain the analysis field c a (r n ) from these “observations”. Bilinear interpolation (see Appendix B) is used for the observation operator H in this study.
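
A sketch of how the test data can be generated following (33)-(36) is given below. The rectangular sampling box and the random seed are illustrative simplifications; in the paper the 300 points lie inside the curved domain (32), so sampled points falling outside the boundary would have to be rejected, and the observation operator H is then built by bilinear interpolation (Appendix B).

```python
import numpy as np

def truth_field(x, y, Lx, Ly, beta, trig):
    """'Truth' fields, Eqs. (33)-(34); trig is np.sin for (33) and np.cos for (34)."""
    xi = x / 10 - 0.3 * np.cos(y / 8) * np.sin(x / 10)
    eta = y / 8 - 0.2 * np.sin(x / 5) * (1 - np.cos(y / 8))
    return 25 - y**2 / 40 + 3 * np.cos(Lx * xi) * trig(Ly * eta + beta)

def background_field(x, y):
    return 25 - y**2 / 40                                  # background field, Eq. (35)

rng = np.random.default_rng(0)                             # seed is illustrative
x_obs = rng.uniform(-15, 15, 300)                          # M = 300 random "observational" points
y_obs = rng.uniform(-15, 15, 300)                          # (rectangular box used for simplicity)
sigmas = np.r_[np.arange(0, 1.01, 0.1), np.arange(1.2, 2.01, 0.2)]        # the 16 noise levels
c_t_obs = truth_field(x_obs, y_obs, 3, 2, np.pi / 2, np.sin)              # large-eddy "truth", Eq. (33)
obs_sets = [c_t_obs + rng.normal(0.0, s, c_t_obs.size) for s in sigmas]   # "observations", Eq. (36)
d_example = obs_sets[5] - background_field(x_obs, y_obs)   # innovation d for one noise level, Eq. (2)
```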

Fig. 6
figure 6figure 6

a “Observational” data (c o ) from Fig. 4a with added white Gaussian noises of zero mean and various standard deviations: a 0 (i.e., no noise), b 0.2, c 0.5, d 1.0, e 1.6, and f 2.0. b “Observational” data (c o ) from Fig. 4b with added white Gaussian noises of zero mean and various standard deviations: a 0 (i.e., no noise), b 0.2, c 0.5, d 1.0, e 1.6, and f 2.0

5 Comparison between OSD and OI

a. OSD analysis fields

The steep-descending mode truncation K OPT depends on the user-input parameter e o [see (25)] and the observational noise σ. \( {E}_a^2 \) and γ K are computed from the “observational” data in Fig. 6a, b. The threshold of mode truncation (27) varies with the significance level α. In this study, (e o , σ) vary between 0 and 2; α has two levels (0.05, 0.10) with z 0.05  = 1.645 and z 0.10  = 1.282 in (27). For given values of e o (= 0.2) and σ (= 0.8), the optimal mode truncation depends on the significance level α, with K OPT = 58 for α = 0.05 (Fig. 7a) and K OPT = 67 for α = 0.10 (Fig. 7b). Most results shown in this section are for α = 0.05 since it is a commonly used significance level.

Fig. 7
figure 7

Dependence of \( {E}_a^2 \) and γ K on K for the “observational” data for the small-scale eddy field with σ = 0.8 and e o  = 0.2 at two significance levels, a α = 0.05 (z 0.05  = 1.645) and b α = 0.10 (z 0.10  = 1.282), used as the threshold of mode truncation [see Eq. (27)]. The optimal mode truncation is 58 for α = 0.05 and 67 for α = 0.10

For the large-eddy field, K OPT is not sensitive to the values of σ and e o : it is 7 in the upper-left portion and 6 in the lower-right portion of Table 1. For the small-eddy field, K OPT takes values of 58 or 67 in most cases, 178 for high noise levels (σ ≥ 1.8) combined with low e o values (e o  ≤ 1.0), and 82 for low noise levels (σ ≤ 0.1) combined with low e o values (e o  ≤ 0.3) (Table 2).

Table 1 Dependence of K OPT on (σ, e o ) for the large-eddy field shown in Fig. 6a with significance level α = 0.05
Table 2 Dependence of K OPT on (σ, e o ) for the small-eddy field shown in Fig. 6b with significance level α = 0.05

The analysis field using the OSD data assimilation (13) for a particular user-input parameter e o and noise level σ, \( {c}_a^{OSD}\left({\mathbf{r}}_n,\sigma, {e}_o\right) \), is presented in Fig. 8a (the large-eddy field) using the “observations” in Fig. 6a (with various σ), and in Fig. 8b (the small-eddy field) using the “observations” in Fig. 6b (with various σ). Comparison between Figs. 8a, b and 4a, b demonstrates the capability of the OSD method, with the analysis fields \( {c}_a^{OSD}\left({\mathbf{r}}_n,\sigma, {e}_o\right) \) fully reconstructed in all cases.

b. OI analysis fields

With the assumption that the c field is statistically stationary and homogeneous, the OI Eq. (9), with the R and B matrices represented by (8) and (65) [see Appendix E], is used to analyze the “observational” data with three user-defined parameters (r a , r b , e o ). Here, r a and r b are the decorrelation scale and the zero crossing (r b  > r a ), and e o is the standard deviation of the observational error. Let these parameters take discrete values, with totals P a for r a , P b for r b , and P e for e o . In this study, we set P a  = P b  = P e  = 5: e o takes five values (0.2, 0.5, 1.0, 1.5, 2.0); considering the horizontal domain from −15 to 15 in both (x, y) directions, r a takes five values (2, 3, 4, 5, 6) and (r b  − r a ) takes five values (0.5, 1.0, 1.5, 2.0, 2.5). There are 125 combinations of (r a , r b , e o ) for the test.
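
The parameter sweep can be organized as sketched below. The exact covariance form of Eq. (65) in Appendix E is not reproduced here; the Gaussian-damped parabolic shape in background_covariance (which crosses zero at r = r b ) is only a placeholder with the same two length parameters, and oi_update refers to the OI sketch given after Eq. (9).

```python
import numpy as np
from itertools import product

def background_covariance(pts, r_a, r_b):
    """Placeholder distance-based covariance with decorrelation scale r_a and zero crossing
    at r = r_b; the exact form used in the paper is Eq. (65) of Appendix E."""
    r2 = ((pts[:, None, :] - pts[None, :, :]) ** 2).sum(-1)
    return (1.0 - r2 / r_b**2) * np.exp(-0.5 * r2 / r_a**2)

# the 5 x 5 x 5 = 125 parameter combinations swept in the OI experiments
r_a_vals = (2, 3, 4, 5, 6)
gap_vals = (0.5, 1.0, 1.5, 2.0, 2.5)      # r_b - r_a
e_o_vals = (0.2, 0.5, 1.0, 1.5, 2.0)
combos = [(r_a, r_a + gap, e_o) for r_a, gap, e_o in product(r_a_vals, gap_vals, e_o_vals)]
# for each (r_a, r_b, e_o): B = background_covariance(grid_pts, r_a, r_b),
# R = e_o**2 * np.eye(M), and c_a = oi_update(c_b, c_o, H, B, R) as in Eq. (9).
```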

Fig. 8
figure 8figure 8

a The analysis field c a obtained by the spectral data assimilation [see Eq. (13)] using the steep-descending mode truncation with a significance level of α = 0.05 from the “observations” shown in Fig. 6a with six noise (σ) levels (0, 0.2, 0.5, 1.0, 1.6, 2.0) and four values of e o : a 0.2, b 0.5, c 1.0, and d 2.0. b The analysis field c a obtained by the spectral data assimilation [see Eq. (13)] using the steep-descending mode truncation with a significance level of α = 0.05 from the “observations” shown in Fig. 6b with six noise (σ) levels (0, 0.2, 0.5, 1.0, 1.6, 2.0) and four values of e o : a 0.2, b 0.5, c 1.0, and d 2.0

The analysis fields from the OI data assimilation (9), \( {c}_a^{OI}\left({\mathbf{r}}_n,\sigma, {r}_a,{r}_b,{e}_o\right) \), with four different sets of user-input parameters (r a , r b , e o ): (2, 2.5, 1), (4, 5.5, 1), (6, 8.5, 1), and (6, 8.5, 2), are presented in Fig. 9a (the large-eddy field) using the “observations” in Fig. 6a, and in Fig. 9b (the small-eddy field) using the “observations” in Fig. 6b. Comparison between Figs. 9a, b and 4a, b demonstrates the strong dependence of the OI output on the selection of the parameters (r a , r b , e o ). For the large-scale eddies (Fig. 9a), the analysis fields c a are very different from the “truth” field c t for r a  = 2, r b  = 2.5, e o  = 1 for all “observations” (Fig. 6a); the difference between the reconstructed and “truth” fields decreases as r a and r b increase; and the two fields are quite similar when r a  = 6, r b  = 8.5 for both e o  = 1 and 2, although the similarity reduces with increasing e o . For the small-scale eddies (Fig. 9b), the analysis fields c a are totally different from the “truth” field c t for r a  = 6, r b  = 8.5, e o  = 1 and 2 for all “observations” (Fig. 6b); they become less different as r a and r b decrease; and they are quite similar to c t when r a  = 2, r b  = 2.5, e o  = 1.

c. Root mean square error

The analysis field from OSD, \( {c}_a^{OSD} \), depends on only one user-input parameter, the observational error variance \( {e}_o^2 \); its uncertainty is represented by the root mean square error R OSD,

$$ {R}^{OSD}\left(\sigma, {e}_o\right)=\sqrt{\frac{1}{N}{\displaystyle \sum_{n=1}^N{\left[{c}_a^{OSD}\left({\mathbf{r}}_n,\sigma, {e}_o\right)-{c}_t\left({\mathbf{r}}_n\right)\right]}^2}}. $$
(37a)

Average over all the values of e o leads to the overall uncertainty

$$ {\overline{R}}^{OSD}\left(\sigma \right)=\sqrt{\frac{1}{N{P}_e}{\displaystyle \sum_{e_o}{\displaystyle \sum_{n=1}^N{\left[{c}_a^{OSD}\left({\mathbf{r}}_n,\sigma, {e}_o\right)-{c}_t\left({\mathbf{r}}_n\right)\right]}^2}}}. $$
(37b)

The analysis field using OI (\( {c}_a^{OI} \)) depends on three user-defined parameters (r a , r b , e o ). Its uncertainty due to a particular parameter is represented by

$$ {R}^{OI}\left(\sigma, {r}_a\right)=\sqrt{\frac{1}{N{P}_b{P}_e}{\displaystyle \sum_{r_b}{\displaystyle \sum_{e_o}{\displaystyle \sum_{n=1}^N{\left[{c}_a^{OI}\left({\mathbf{r}}_n,\sigma, {r}_a,{r}_b,{e}_o\right)-{c}_t\left({\mathbf{r}}_n\right)\right]}^2}}}}, $$
(38a)
$$ {R}^{OI}\left(\sigma, {r}_b\right)=\sqrt{\frac{1}{N{P}_a{P}_e}{\displaystyle \sum_{r_a}{\displaystyle \sum_{e_o}{\displaystyle \sum_{n=1}^N{\left[{c}_a^{OI}\left({\mathbf{r}}_n,\sigma, {r}_a,{r}_b,{e}_o\right)-{c}_t\left({\mathbf{r}}_n\right)\right]}^2}}}}, $$
(38b)
$$ {R}^{OI}\left(\sigma, {e}_o\right)=\sqrt{\frac{1}{N{P}_a{P}_b}{\displaystyle \sum_{r_a}{\displaystyle \sum_{r_b}{\displaystyle \sum_{n=1}^N{\left[{c}_a^{OI}\left({\mathbf{r}}_n,\sigma, {r}_a,{r}_b,{e}_o\right)-{c}_t\left({\mathbf{r}}_n\right)\right]}^2}}}}, $$
(38c)

which are compared to \( {\overline{R}}^{OSD}\left(\sigma \right) \) and R OSD(σ, e o ).
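
These diagnostics reduce to root mean square errors pooled over grid points and parameter values; a small sketch follows, with the storage layout of the analysis fields left as an illustrative choice.

```python
import numpy as np

def rmse(c_a, c_t):
    """Root mean square error of a single analysis field against the truth, Eq. (37a)."""
    return np.sqrt(np.mean((c_a - c_t) ** 2))

def pooled_rmse(fields, c_t):
    """RMSE pooled over several analysis fields (one per parameter choice), Eqs. (37b), (38a-c)."""
    fields = np.asarray(list(fields))       # shape (P, N): P parameter combinations, N grid points
    return np.sqrt(np.mean((fields - c_t) ** 2))
```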

Fig. 9
figure 9figure 9

a The analysis field c a obtained by the OI data assimilation [see Eq. (9)] for the “observations” shown in Fig. 6a at various noise levels, with various combinations of user-defined parameters (r a , r b , e o ): (2, 2.5, 1), (4, 5.5, 1), (6, 8.5, 1), and (6, 8.5, 2). b The analysis field c a obtained by the OI data assimilation [see Eq. (9)] for the “observations” shown in Fig. 6b at various noise levels, with various combinations of user-defined parameters (r a , r b , e o ): (2, 2.5, 1), (4, 5.5, 1), (6, 8.5, 1), and (6, 8.5, 2)

Figure 10 shows the comparison between R OI(σ, r a ) and \( {\overline{R}}^{OSD}\left(\sigma \right) \) for five different r a values (2, 3, 4, 5, 6) and the two types (large-scale and small-scale) of “observational” field. R OI(σ, r a ) monotonically increases with σ and is generally larger than \( {\overline{R}}^{OSD}\left(\sigma \right) \). For the “observations” representing the large-scale eddy field (L x  = 3, L y  = 2, see Fig. 6a), \( {\overline{R}}^{OSD}\left(\sigma \right) \) increases slightly from 0.32 for σ = 0 to 0.34 for σ = 2.0. However, R OI(σ, r a  = 2) is always larger than \( {\overline{R}}^{OSD}\left(\sigma \right) \) and increases from 0.37 for σ = 0 to 1.13 for σ = 2.0; R OI(σ, r a  ≥ 3) is smaller than \( {\overline{R}}^{OSD}\left(\sigma \right) \) for small σ, equals \( {\overline{R}}^{OSD}\left(\sigma \right) \) at a certain σ 0 , and is larger than \( {\overline{R}}^{OSD}\left(\sigma \right) \) for σ > σ 0 . The value of σ 0 increases with r a , from 0.4 for r a  = 3 to 1.0 for r a  = 6. R OI(σ, r a  = 6) increases from 0.13 for σ = 0 to 0.62 for σ = 2.0. For the “observations” representing the small-scale eddy field (L x  = 7, L y  = 5, see Fig. 6b), \( {\overline{R}}^{OSD}\left(\sigma \right) \) increases slightly from 0.22 for σ = 0 to 0.27 for σ = 0.4, markedly from 0.27 for σ = 0.4 to 0.40 for σ = 0.5, and slowly from 0.40 for σ = 0.5 to 0.71 for σ = 2.0. However, R OI(σ, r a ) is much larger than \( {\overline{R}}^{OSD}\left(\sigma \right) \) for any r a . For example, R OI(σ, r a  = 2) increases from 0.43 for σ = 0 to 1.14 for σ = 2.0; …, R OI(σ, r a  = 6) increases from 0.89 for σ = 0 to 1.06 for σ = 2.0.

Fig. 10
figure 10

Comparison between R OI(σ, r a ) and \( {\overline{R}}^{OSD}\left(\sigma \right) \) of the analysis fields from the same “observations” with different noise levels with varying parameter r a  = (2, 3, 4, 5, 6) from top to bottom with the left panels using “observations” shown in Fig. 6a and the right panels using “observations” in Fig. 6b. The solid curves represent the OSD with the significance level of α = 0.05; and the dotted curves refer to the OI

Figure 11 shows the comparison between R OI(σ, r b ) and \( {\overline{R}}^{OSD}\left(\sigma \right) \) for five different (r b  − r a ) values (0.5, 1.0, 1.5, 2.0, 2.5) and the two types (large-scale and small-scale) of “observational” field. R OI(σ, r b ) monotonically increases with σ and is generally larger than \( {\overline{R}}^{OSD}\left(\sigma \right) \). For the “observations” representing the large-scale eddy field (L x  = 3, L y  = 2, see Fig. 6a), R OI(σ, r b  − r a ) monotonically increases with σ from around 0.2 for σ = 0 to around 0.78 for σ = 2.0 for all values of (r b  − r a ), with σ 0 increasing from 0.4 for (r b  − r a ) = 0.5 to 0.6 for (r b  − r a ) = 2.5. For the “observations” representing the small-scale eddy field (L x  = 7, L y  = 5, see Fig. 6b), R OI(σ, r b  − r a ) is much larger than \( {\overline{R}}^{OSD}\left(\sigma \right) \) for any (r b  − r a ) and σ. For example, R OI(σ, r b  − r a  = 0.5) increases from 0.53 for σ = 0 to 1.00 for σ = 2.0; …, R OI(σ, r b  − r a  = 2.5) increases from 0.58 for σ = 0 to 1.00 for σ = 2.0.

Fig. 11
figure 11

Comparison between R OI(σ, r b ) and \( {\overline{R}}^{OSD}\left(\sigma \right) \) of the analysis fields from the same “observations” with different noise levels with different (r b  − r a ) = (0.5, 1.0, 1.5, 2.0, 2.5) with the left panels using “observations” shown in Fig. 6a and the right panels using “observations” in Fig. 6b. The solid curves represent the OSD with the significance level of α = 0.05; and the dotted curves refer to the OI

Figure 12 shows the comparison between R OI(σ, e o ) and R OSD(σ, e o ) for five different e o values (0.2, 0.5, 1.0, 1.5, 2.0) and the two types (large-scale and small-scale) of “observational” field. First, R OI(σ, e o ) monotonically increases with σ and is evidently larger than R OSD(σ, e o ) for all σ and e o . Second, the dependence of R OSD(σ, e o ) on σ is insensitive to changes of e o . For the “observations” representing the large-scale eddy field (L x  = 3, L y  = 2, see Fig. 6a), R OI(σ, e o ) is close to R OSD(σ, e o ) for σ < 1.2 and much larger than R OSD(σ, e o ) for σ > 1.2 with e o  = 0.2 and 0.5, and vice versa with e o  = 1.0, 1.5, and 2.0. R OI(σ, e o  = 2.0) increases slightly from 0.98 at σ = 0 to 1.08 at σ = 2.0 and is almost twice R OSD(σ, e o ) for all σ. For the “observations” representing the small-scale eddy field (L x  = 7, L y  = 5, see Fig. 6b), R OI(σ, e o ) is also larger than R OSD(σ, e o ). For example, R OI(σ, e o  = 2.0) increases slightly from 1.37 at σ = 0 to 1.42 at σ = 2.0, which is two to three times R OSD(σ, e o  = 2.0) for σ < 1.0.

Fig. 12
figure 12

Comparison between R OI(σ, e o ) and R OSD(σ, e o ) of the analysis fields from the same “observations” with different noise levels with varying parameter e o  = (0.2, 0.5, 1.0, 1.5, 2.0) from top to bottom with the left panels using “observations” shown in Fig. 6a and the right panels using “observations” in Fig. 6b. The solid curves represent the OSD with the significance level of α = 0.05; and the dotted curves refer to the OI

The overall performance between OI and OSD with various noise levels (σ) can be estimated by the error ratio,

$$ \kappa \left(\sigma \right)=\frac{{\overline{R}}^{OSD}\left(\sigma \right)}{{\hat{R}}^{OI}\left(\sigma \right)},\kern0.75em {\widehat{R}}^{OI}\left(\sigma \right)\equiv \sqrt{\frac{1}{N{P}_a{P}_b{P}_e}{\displaystyle \sum_{r_a}{\displaystyle \sum_{r_b}{\displaystyle \sum_{e_o}{\displaystyle \sum_{n=1}^N{\left[{c}_a^{OI}\left({\mathbf{r}}_n,\sigma, {r}_a,{r}_b,{e}_o\right)-{c}_t\left({\mathbf{r}}_n\right)\right]}^2}}}}}. $$
(39)

Figure 13 shows the dependence of κ(σ) (mostly less than 1) on σ for the two types (large-scale and small-scale eddies) of “observational” field represented by Fig. 6a, b, with two different significance levels (α = 0.05, 0.10) for the threshold of mode truncation in the OSD method (27). At α = 0.05 (Fig. 13a), for the large-scale eddy field, κ(σ) takes 0.71 at σ = 0, fluctuates with σ, and decreases to 0.57 at σ = 2.0; for the small-scale eddy field, κ(σ) increases monotonically with σ from 0.43 at σ = 0 to 0.67 at σ = 2.0. At α = 0.10 (Fig. 13b), for the large-scale eddy field, κ(σ) takes 1.17 at σ = 0 and decreases monotonically with σ to 0.40 at σ = 2.0; for the small-scale eddy field, κ(σ) increases monotonically with σ from 0.36 at σ = 0 to 0.70 at σ = 2.0. This means that the OSD performs better in the test cases. Integration of κ(σ) over the whole interval of the noise level [0, 2.0] yields

$$ \hat{\kappa}=\frac{1}{2}{\displaystyle \underset{0}{\overset{2}{\int }}\kappa \left(\sigma \right)d\sigma }=\left\{\begin{array}{ccc}\hfill \alpha =0.05\hfill & \hfill \alpha =0.1\hfill & \hfill \hfill \\ {}\hfill 0.76\hfill & \hfill 0.72\hfill & \hfill \mathrm{large}\hbox{-} \mathrm{scale}\ \mathrm{eddy}\hfill \\ {}\hfill 0.51\hfill & \hfill 0.59\hfill & \hfill \mathrm{small}\hbox{-} \mathrm{scale}\ \mathrm{eddy}\hfill \end{array}\right. $$
(40)
Fig. 13
figure 13

Dependence of the error ratio κ [see Eq. (39)] on σ using “observations” in Fig. 6a (represented by dots) and in Fig. 6b (represented by asterisks) with two different significance levels: a α = 0.05, and b α = 0.10

which means that the overall error for the OSD is 76 % (51 %) of the OI error for the large-scale (small-scale) eddy field for α = 0.05. The overall performance of the OSD method is relatively insensitive to the selection of the significance level α.
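
The error ratio (39) and its average (40) over the noise interval [0, 2] can be evaluated directly; the trapezoidal rule is written out explicitly below to handle the non-uniform σ grid, and the input arrays of pooled RMSE values are assumed to be ordered by σ.

```python
import numpy as np

sigmas = np.r_[np.arange(0, 1.01, 0.1), np.arange(1.2, 2.01, 0.2)]   # the 16 noise levels

def overall_ratio(R_osd_bar, R_oi_hat, sigma=sigmas):
    """Error ratio kappa(sigma), Eq. (39), and its mean over [0, 2], Eq. (40)."""
    kappa = np.asarray(R_osd_bar) / np.asarray(R_oi_hat)
    # trapezoidal rule on the (non-uniform) sigma grid, divided by the interval length 2
    kappa_hat = np.sum(0.5 * (kappa[1:] + kappa[:-1]) * np.diff(sigma)) / 2.0
    return kappa, kappa_hat
```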

The computational costs of the OSD and OI methods are comparable in the test cases. In the OSD method, the steep-descending mode truncation requires (a) the computation of a large number [K B in Eq. (26)] of eigenvectors, which can be done once and for all, and (b) the construction and solution of the OSD Eq. (13). In the OI method, however, the construction and solution of the OI Eq. (9) must be repeated each time the background/observations change.

6 Synoptic monthly gridded temperature and salinity fields

The OSD method is used to produce the synoptic monthly gridded (SMG) temperature (T) and salinity (S) datasets (Chu and Fan 2016a; Chu et al. 2016) from the two world ocean observational (T, S) profile datasets: the NOAA National Centers for Environmental Information (NCEI) World Ocean Database (WOD) and the Global Temperature and Salinity Profile Program (GTSPP). The synoptic monthly gridded absolute geostrophic velocity dataset (Chu and Fan 2016b) is also established from the SMG-WOD (T, S) fields using the P-vector method (Chu 1995; Chu and Wang 2003). These datasets have been quality controlled by NCEI professionals and can be openly downloaded for public use at http://data.nodc.noaa.gov/geoportal/rest/find/document?searchText=synoptic+monthly+gridded&f=searchPage. The duration is January 1945 to December 2014 for the synoptic monthly gridded WOD (T, S) and absolute geostrophic velocity fields, and January 1990 to December 2009 for the synoptic monthly gridded GTSPP (T, S) fields.

7 Conclusions

Ocean spectral data assimilation has been developed on the basis of the classic theory of the generalized Fourier series expansion, such that any ocean field can be represented by a linear combination of products of basis functions (also called modes) and the corresponding spectral coefficients. The basis functions are the eigenvectors of the Laplace operator, determined only by the topography with the same lateral boundary condition as the assimilated variable anomaly. They are pre-calculated and independent of any observational data and background fields. The mode truncation K depends on the observational data and a user-input parameter \( {e}_o^2 \) (i.e., the observational error variance) and is determined via the steep-descending method.

The OSD completely changes the common ocean data assimilation procedures such as OI, Kalman filtering (KF), and variational methods, in which the background error covariance matrix B needs to be pre-determined because the weight matrix W is used. The OSD instead uses the spectral form to represent the observational innovation at the grid points [see (17)]. Minimization of the truncation error variance leads to the optimal selection of the spectral coefficients. Thus, the background error covariance matrix B does not appear in the OSD procedure since the weight matrix W is not used. This is in contrast to the existing OI method, where the B matrix is often assumed to be stationary and homogeneous with user-defined parameters.

The capability of the OSD method is demonstrated through its comparison to OI using analytical 2D fields of large and small mesoscale eddies inside a domain with four rigid and curved boundaries as the “truth”, with white Gaussian noise of zero mean and standard deviations (σ) varying from 0 (no noise) to 2.0 added to the “truth” at randomly selected locations to form the “observations”. A simple covariance function (Bretherton et al. 1976) was used for the OI procedure with three user-defined parameters (r a , r b , e o ), each taking five possible values. The OSD uses the same values of e o . The performance of OSD and OI is compared by (1) the patterns for each of the 125 combinations of parameters, (2) the root mean square errors for varying parameters, and (3) the overall root mean square errors. The results show that the error reduction using the OSD is evident: the overall OSD error is 76 % (51 %) of the OI error for the large-scale (small-scale) eddy field at the significance level α = 0.05, and 72 % (59 %) at α = 0.10. In the context of practical application, synoptic monthly gridded world ocean temperature, salinity, and absolute geostrophic velocity datasets have been produced with the OSD method and quality controlled by the NOAA National Centers for Environmental Information (NCEI).

Two issues need to be addressed regarding the covariance matrix. First, the comparison between the OSD and OI is at one particular instant in time, and the B matrix used in the OI is based only on distance. Second, in covariance matrix-based methods with the covariance matrix fixed once and for all, it is well known that the very first data assimilation cycle performs well, but subsequent cycles are less effective because the remaining error tends to be orthogonal to the directions of the covariance matrix. In the OSD method, the correction is based on spectral (i.e., basis) functions chosen once and for all. A more sophisticated, flow-dependent covariance matrix would allow OI to perform much better. Further verification and validation under real-time ocean conditions are needed to assess the quality of OSD over assimilation cycles and to compare the OSD and OI methods.

In the two test cases (large and small eddy fields), it is clear that the optimal mode truncations K OPT (around 6 for the large-eddy field and around 60 for the small-eddy field) are very close to the number of eigenvectors required to represent the truth field (Fig. 4). This shows the capability of the steep-descending mode truncation. However, the performance of the method when the truth field is a mixture of large and small scales in different parts of the domain needs to be further investigated.