Regionalized Variable Theory

Oliver, Margaret A.; Webster, Richard

doi:10.1007/978-3-319-15865-5_2

Margaret A. Oliver³ &
Richard Webster⁴

Part of the book series: SpringerBriefs in Agriculture ((BRIEFSAGRO))

4298 Accesses

Abstract

Many physical, chemical and biological processes have acted to create the current environment with the result that the variation appears to be random. Practical geostatistics treats the results as if they were the outcomes of correlated random processes and is underpinned by assumptions of stationarity. Variation may be treated as second-order stationary and represented by covariance functions. The somewhat weaker assumption of intrinsic stationarity leads to a more general analysis based on the variogram as a description of the variation. Quasi-stationarity limits stationarity to local areas, and with sufficient data the assumptions can be applied locally. If there is trend then more complex assumptions are needed; these usually comprise a combination of deterministic spatially smooth trend plus random residuals that are spatially correlated and stationary to some degree.

Access provided by Autonomous University of Puebla. Download chapter PDF

Keywords

2.1 Random Variables and Regionalized Variable Theory

The processes that act in the environment obey the laws of physics, and are in that sense deterministic: the variation we observe has its physical causes. Nevertheless, numerous processes have combined and interacted to produce the current environment, and the results are so complex that the variation appears as though it were generated randomly (Webster 2000). In geostatistical terms, we regard the grades of ores, properties of the soil, or the rainfall of a region, of almost any size, as the realizations of random processes.

Based on this view the value of, for example, a soil property such as its pH, at any place, x denoting its coordinates in two dimensions, is just one of the infinitely many that are possible there. We associate with each place x not just one value but a whole suite of values with a mean, a variance and higher-order moments of a distribution. The actual value at x is regarded as just one value from that distribution, allocated at random. Thus the value of the variable at x is treated as a random variable, which we denote with the capital Z. The set of random variables for all x in ℜ constitutes a random function, random process or stochastic process. Random variables in the real space, which may be one-, two- or three-dimensional, are also called ‘regionalized variables’, and hence we have the theory of regionalized variables mentioned above.

A random function has no mathematical description in the way that a deterministic one has, i.e. it cannot be written as an equation. Nevertheless, it may have ‘structure’ in that there is correlation in space, or in time (for signals). This means that values at different places may be related to one another in a statistical sense. Intuitively, we expect the features of the environment at places near to one another to be similar, whereas those at widely separated places are less likely to be. This intuition is formalized in the theory of random functions. We must realise that the randomness is a mental model of the world and not a property of the environment.

2.1.1 Stationarity

Stationarity underpins the practicality of geostatistics; it is an assumption that enables us to treat data as though they have the same degree of variation over a region of interest. We can represent the random process by the model

$$Z\left( {\mathbf{x}} \right) \, = \mu + \varepsilon \left( {\mathbf{x}} \right),$$

(2.1)

where μ is the mean of the process and ε(x) is a random quantity with a mean of zero and a covariance, C(h), where h is the separation in space and known as the lag.The covariance is

$$C\left( {\mathbf{h}} \right) \, = {\text{ E}}[\varepsilon \left( {\mathbf{x}} \right)\varepsilon \left( {{\mathbf{x}} + {\mathbf{h}}} \right)],$$

(2.2)

which is equivalent to

$$C\left( {\mathbf{h}} \right) \, = {\text{ E}}[\{ {\text{Z}}\left( {\text{x}} \right) \, - \mu \} \left\{ {Z\left( {{\mathbf{x}} + {\mathbf{h}}} \right)} \right\} \, - \mu \} \left] { \, = {\text{ E}}} \right[\left\{ {Z\left( {\mathbf{x}} \right)} \right\}\left\{ {Z\left( {{\mathbf{x}} + {\mathbf{h}}} \right)} \right\} \, - \mu^{ 2} ] .$$

(2.3)

Here Z(x) and Z(x + h) are the values of the random variable Z at places x and x + h and E denotes the expectation. This covariance depends on h and only on h, the separation between samples in both distance and direction; it is a function of h. The assumption on which this is based is that of second-order stationarity. In the real world, we often encounter situations in which we cannot assume that the mean is constant, and if so the covariance cannot exist. Such a situation need not be a stumbling block; we can simply weaken the assumption of stationarity to that of what Matheron (1963) called intrinsic stationarity in which the expected differences are zero,

$${\text{E}}\left[ {Z\left( {\mathbf{x}} \right) \, - Z\left( {{\mathbf{x}} + {\mathbf{h}}} \right)} \right] \, = \, 0,$$

(2.4)

and the covariance of the residuals is replaced by the variance of the differences to measure the spatial relations:

$${\text{var}}\left[ {Z\left( {\mathbf{x}} \right) \, - Z\left( {{\mathbf{x}} + {\mathbf{h}}} \right)} \right] \, = {\text{ E}}\left[ {\left\{ {Z\left( {\mathbf{x}} \right) \, - Z\left( {{\mathbf{x}} + {\mathbf{h}}} \right)} \right\}^{ 2} } \right] \, = { 2}\gamma \left( {\mathbf{h}} \right) .$$

(2.5)

Here γ(h) is the semivariance at lag h, and as a function of h it is the variogram. The variogram is based on differences, and provided Eq. (2.4) holds locally it is valid. This property makes the variogram more generally useful than the covariance function. In Chap. 3 we describe how to compute the covariance and variogram functions. We focus on the variogram because of its generality and go on in Chap. 3 to describe variogram modelling.

For second-order stationary processes the covariance function and variogram are equivalent:

$$\gamma \left( {\mathbf{h}} \right) \, = C\left( {\mathbf{0}} \right) - C\left( {\mathbf{h}} \right),$$

(2.6)

where C(0) = σ ² is the variance of the process.

A process that appears stationary at one scale might at another scale appear to embody trend, that is, a systematic component. At this scale we might have to elaborate the simple model represented in Eq. (2.1) by

$${\text{Z}}\left( {\mathbf{x}} \right) \, = u\left( {\mathbf{x}} \right) \, + \varepsilon \left( {\mathbf{x}} \right),$$

(2.7)

in which u(x) is a deterministic trend term that replaces the constant mean, μ. Its variogram,

$$\gamma ({\mathbf{h}} ) { } = \frac{ 1}{ 2}{\text{ E}}\left[ {\left\{ {\varepsilon ({\mathbf{x}}) - \varepsilon ({\mathbf{x}} + {\mathbf{h}})} \right\}^{2} } \right],$$

(2.8)

is no longer the same as

$$\gamma ({\mathbf{h}} ) { } = \frac{ 1}{ 2}{\text{ E}}\left[ {\left\{ {Z ({\mathbf{x}}) - Z ({\mathbf{x}} + {\mathbf{h}})} \right\}^{2} } \right],$$

(2.9)

of Eq. (1.5). It is the variogram of the residuals from the trend. We explain what to do to estimate the variogram in the presence of trend in Chap. 6.

Author information

Authors and Affiliations

Soil Research Centre, University of Reading, Reading, RG6 6DW, UK
Margaret A. Oliver
Rothamsted Research, Harpenden, AL5 2JQ, UK
Richard Webster

Authors

Margaret A. Oliver
View author publications
You can also search for this author in PubMed Google Scholar
Richard Webster
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Margaret A. Oliver .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Oliver, M.A., Webster, R. (2015). Regionalized Variable Theory. In: Basic Steps in Geostatistics: The Variogram and Kriging. SpringerBriefs in Agriculture. Springer, Cham. https://doi.org/10.1007/978-3-319-15865-5_2

Download citation

DOI: https://doi.org/10.1007/978-3-319-15865-5_2
Published: 31 March 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-15864-8
Online ISBN: 978-3-319-15865-5
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics