Introduction

Understanding within-field yield variability as affected by soil properties and field topography is important in site-specific management practices. Topographic variables such as elevation and slope have been shown to relate to crop yield (Yang et al. 1998; Kravchenko and Bullock 2000; Jiang and Thelen 2004). Sometimes, these variables can account for as much as 60% of the within-field variability (Yang et al. 1998). However, these relationships vary across years depending on weather conditions. In site-specific management, soil apparent electrical conductivity (ECa) has also become an important tool in assessing soil suitability and productivity because it relates to a wide range of soil chemical and physical properties that affect crop yield (McNeill 1992; Lund et al. 2000; Kitchen et al. 2003; Sudduth et al. 2005). Applications of mapped ECa have included characterizing soil spatial variability (Corwin and Lesch 2005), delineating management zones (Kitchen et al. 2005), and estimating topsoil depth for claypan soils (Kitchen et al. 1999; Sudduth et al. 2003). Together, these soil and topographic variables can provide guidance for site-specific management decision-making, partly because high resolution datasets can be obtained at reasonable costs using on-the-go sensing technologies.

Yield monitoring and mapping technologies that enable measuring, geo-referencing and recording grain yield with high precision and spatial resolution have created a great opportunity to study the spatial and temporal variability observed in crop yield. Yield monitors collect dense datasets in a pointwise fashion as the combine harvester moves through the field, and these data can be imported to a GIS system for display, processing and analysis. A common procedure for processing yield monitor data is to aggregate point yield data to a network of grids with an appropriate cell size for analysis as areal data (Birrell et al. 1996; Perez-Quezada et al. 2003).

Data analysis relating yield and soil and/or topographic variables has mostly adopted classical multiple linear regression techniques. As non-linear alternatives, projection pursuit regression (Sudduth et al. 1996), multiple quadratic regression (Kitchen et al. 1999), and neural network methods (Drummond et al. 2003) have also been employed. However, none of these approaches were able to explicitly model the spatial association of yield information. Further challenges for these analysis techniques include soil and topographic variables that are often interrelated, causing over-fitting of the model, and the general inability of these methods to assess the effects of temporal climatic conditions for the number of site-years commonly available (Drummond et al. 2003).

Recent development of the spatial Bayesian hierarchical framework has gained increasing popularity in ecological studies that often deal with spatial data (Wikle 2003; Oleson and He 2004). One of the important advantages of spatial Bayesian models is that the spatial effect can be explicitly expressed in the model by assuming a prior distribution with its parameters specified from the information of spatial neighbors. In soil science, spatial Bayesian models have been applied to estimate the correlation coefficients of socio-ecological variables most strongly associated with soil NO3–N and C, while accounting for varying spatial associations among the two dependent soil variables caused by different land use types (Oleson et al. 2006).

Texts devoted to the theory and applications of Bayesian hierarchical models include, e.g., Gelman et al. (2005) and Banerjee et al. (2004), the latter focusing on spatial data analysis applications. For the reader’s convenience, basic background information is presented here. The Bayesian hierarchical framework is developed from Bayes’ rule of probability which provides a relationship between joint probability distribution and conditional distributions of two variables. For example, assuming there are two variables Y and θ, Bayes’ theorem states that the joint probability distribution of Y and θ, p(Y, θ), can be obtained by the product of the conditional distributions and marginal distributions of Y and θ. Mathematically, Bayes’ rule is expressed as p(Y, θ) = p(Y|θ)p(θ) = p(θ|Y)p(Y). Thus, the conditional probability of Y can be obtained by p(Y|θ) = p(θ|Y)p(Y)/p(θ), and likewise, the conditional probability of θ by p(θ|Y) = p(Y|θ)p(θ)/p(Y).

As summarized in Banerjee et al. (2004), given observed data y = (y 1,…,y n) with probability density function f(y|θ), and a vector of unknown parameters θ = (θ 1,…,θ k ), the fundamental difference of Bayesian approach from the classical statistical models is the former assumes that θ is a random quantity sampled from a prior distribution π(θ|λ) based on previous knowledge, where λ is a vector of hyper-parameters (i.e., parameters for the prior distribution). If λ is known, inference concerning θ is based on the posterior distribution of θ,

$$ p({\varvec{\uptheta}}|{\mathbf{y}},{\varvec{\lambda}}) = \frac{{p({\mathbf{y}},{\varvec{\uptheta}}|{\varvec{\lambda}})}}{{p({\mathbf{y}}|{\varvec{\lambda}})}} = \frac{{p({\mathbf{y}},{\varvec{\uptheta}}|{\varvec{\lambda}})}}{{\int {p({\mathbf{y}},{\varvec{\uptheta}}|{\varvec{\lambda}})d{\varvec{\uptheta}}} }} = \frac{{f({\mathbf{y}}|{\varvec{\uptheta}})\pi ({\varvec{\uptheta}}|{\varvec{\lambda}})}}{{\int {f({\mathbf{y}}|{\varvec{\uptheta}})\pi ({\varvec{\uptheta}}|{\varvec{\lambda}})d{\varvec{\uptheta}}} }} $$
(1)

It can be seen in this relationship that both the data (f) and the prior knowledge (π) contribute to the posterior. In reality, however, λ is often not known. Thus, a second-stage (or a hyper–prior) distribution h(λ) is required, and Eq. 1 becomes

$$ p({\varvec{\uptheta}}|{\mathbf{y}}) = \frac{{p({\mathbf{y}},{\varvec{\uptheta}})}}{{p({\mathbf{y}})}} = \frac{{\int {f({\mathbf{y}}|{\varvec{\uptheta}})\pi ({\varvec{\uptheta}}|{\varvec{\lambda}})h({\varvec{\lambda}})d{\varvec{\lambda}}} }}{{\int {f({\mathbf{y}}|{\varvec{\uptheta}})\pi ({\varvec{\uptheta}}|{\varvec{\lambda}})h({\varvec{\lambda}})d{\varvec{\uptheta}}d{\varvec{\lambda}}} }} $$
(2)

Usually, λ can be replaced by its estimator, \( {\hat{\mathbf{\varvec{\lambda}}}} \), which maximizes the marginal distribution p(y|λ) with respect to θ. Then inference about θ can be based on the estimated posterior distribution \( p({\varvec{\uptheta}}|{\mathbf{y}},{\hat{\mathbf{\varvec{\lambda}}}}) \).

One obstacle to adopting Bayesian analysis applications for scientists working in fields of ecology, agriculture and natural resources has been the programming skills required to evaluate the full conditionals required by the Bayesian models. The development of the free computer program “Windows version of Bayesian Updating using Gibbs Sampler” (WinBUGS, http://www.mrc-bsu.cam.ac.uk/bugs/winbugs/contents.shtml), which has a built-in Gibbs sampler, an algorithm commonly used for generating random samples in Bayesian analysis, greatly facilitated Bayesian model formulation and computation. The WinBUGS program eliminated the need for specifying the full conditional distributions of the model, allowing the user to create their own models by providing only model distribution and prior distributions of model parameters. White and Sun (2006) illustrated the use and efficacy of this program using an ecological case study and reported that WinBUGS produced results compatible with those using the same model programmed in FORTRAN.

The objective of this project was to apply a spatial Bayesian hierarchical approach, implemented using the WinBUGS program, to examine the effects of soil, topographic and climatic variables on corn yield for a Missouri field.

Materials and methods

The dataset

The study site was a 36-ha corn-soybean field located near Centralia, Missouri. Elevation of the field ranged from 262 to 266 m. Soil types found in the field were claypan soils and included Mexico (fine, smectitic, mesic Aeric Vertic Epiaqualfs), Adco (fine, smectitic mesic Aeric Vertic Eqiaqualfs), and Leonard (fine smectitic, mesic, Vertic Epiaqualfs). These soils were somewhat poorly or poorly drained and have a restrictive layer with high clay content (the claypan) occurring below topsoil at a highly variable depth. Corn yield data for 1997, 1999, 2001 and 2003 were used in the analyses. These yield data were collected using commercial yield monitors mounted on combine harvesters. During the harvest, the combine usually traveled at approximately 5–8 km h−1 and yield data were recorded every second. Thus, depending on swath width, a single yield data point represented an average yield for an area approximately 6–10 m2. An automatic yield data processing program—Yield Editor (Sudduth and Drummond 2007)—was used to remove questionable and unrealistic yield data points caused by operating errors such as abrupt changes of speed, partial swath and combine stops and starts. Descriptive statistics of corn grain yields are given in Table 1. Growing season precipitation records showed that droughty conditions prevailed in July, 1999 and July, 2003. In August 2003, monthly precipitation was above the long-term average, but the timing of the rain events was too late to alleviate the water stress of the corn plants (Fig. 1). Thus, corn yield was severely reduced in these 2 years. Continuous surfaces of yield data were generated using ordinary kriging with an exponential semivariogram using the ArcGIS Spatial Analyst (ESRI, Redlands, CA, USA) and then were output to a 30 m by 30 m cell resolution, resulting in a total of 390 data cells. The interpolated yield maps are shown in Fig. 2. The choice of 30 m cell size was mainly a consideration of data capacity of the WinBugs program, even though others have also used a 30-m resolution to evaluate the relationship between crop yield and soil information for the same field (Sudduth et al. 1996; Drummond et al. 2003). At this resolution some fine spatial structures maybe lost, but by visual examination of kriged maps the general spatial pattern was largely preserved.

Table 1 Descriptive statistics of corn grain yield and growing-season weather conditions
Fig. 1
figure 1

Growing season (April–September) precipitation (mm). The cross bar of the hollow boxes indicates 30-year average for the month (i.e., average from 1971 to 2000)

Fig. 2
figure 2

Yield maps for 1997, 1999, 2001, and 2003. The dimension of each map cell is 30 m by 30 m

Elevation of the field was mapped using a total station surveying instrument and standard mapping procedures. Soil ECa was measured using an on-the-go Dualem-2S sensor (Dualem Inc., Milton, Ontario, Canada)Footnote 1 every second on a 10-m transect spacing. The Dualem-2S sensor had two effective sensing depths of 1.2 and 3.0 m, and the 1.2-m sensing depth was used. The speed of travel of the sensor was approximately 7.2 km h−1. Both elevation and ECa data were kriged to the same spatial extent as the yield data. Maximum slope in degrees was derived from the interpolated elevation data using Spatial Analyst in ArcGIS (ESRI 2007). The Spatial Analyst calculates slope values along the rate of maximum change in elevation for each cell to its neighbors using a 3 by 3 search window. The kriged elevation and ECa maps are presented in Fig. 3.

Fig. 3
figure 3

Elevation (a) and ECa (b) maps of the field. The dimension of each map cell is 30 m by 30 m

Three weather variables for the period of July and August were included as temporal co-variates in the model. They were mean maximum daily temperature (Max.temp), mean daily temperature range (Temp.range) and cumulative precipitation (cPREC; Table 1). These particular climatic variables were chosen because they have been shown to significantly affect corn yield using a dataset collected over 100 years in Central Missouri (Hu and Buyanovsky 2003).

The model

The baseline model

The baseline model was a linear regression model including a mean structure of co-variates (elevation, slope, soil ECa, Max.temp, Temp.range, and cPREC), and a random spatial effect.

Let Y i,j denote the observed yield at cell i in year j, and assume

$$ Y_{{i,j}} = {\mathbf{X}}_{{i,j}} {\varvec{\upbeta}} + Z_{i} + e_{{i,j}} $$
(3)

where \( e_{{i,j}} \) represents the measurement errors which are assumed to be independent and identically distributed (iid) as N(0, \( \sigma _{e}^{2} \)), \( Z_{i} \) is the spatial effect of cell i. \( {\varvec{\upbeta}} \) is the × 1 parameter coefficient vector of co-variates and \( {\mathbf{X}}_{{i,j}} \) is the 1 × n covariate vector. The spatial co-variates were considered fixed in this case.

Spatial effect modeling

A conditional auto-regressive (CAR) model was used to account for the spatial association among yield (i.e., \( Z_{i} \) in Eq. 3). In general, a CAR model can be expressed as follows:

$$ (Y_{i} |y_{m} ,m \ne i)\sim N\left( {\sum\limits_{m} {b_{{im}} y_{m} ,\tau ^{2} } } \right),\quad i = 1, \ldots ,n. $$
(4)

where m denotes the set of “neighbor cells” around cell i. It is assumed that \( Y_{i} \) has a normal distribution with conditional mean given by the average of its neighbors, y m , and a common variance component, τ 2. b im are elements of a symmetric matrix B, which is called an adjacency matrix, where b im are equal to 1 if cell i and cell m are adjacent, and 0 otherwise. The CAR model allows for the “borrowing” of information from adjacent cells in estimating parameters for each individual cell.

Specification of prior distributions and hyper-parameters

As mentioned above, each model parameter needed a prior distribution, and the parameters of the prior distribution (i.e., hyper-parameters) were also needed to complete the model. In our dataset, all the coefficients of co-variates in Eq. 3 were given a normal distribution N(0, \( \sigma _{0}^{2} \)), where \( \sigma _{0}^{2} \) is a positive constant. The model parameters, variance (\( \sigma _{e}^{2} \)), and the spatial variance (τ 2), were assumed to follow the inverse gamma distribution IG(a, b), which is a common prior distribution specification for error variance (Gelman et al. 2005). Therefore, the two variance terms were specified as:

$$ \sigma _{e}^{2} \sim IG\left( {{\text{a}}_{e} ,{\text{b}}_{e} } \right)\quad {\text{and}}\quad \tau ^{ 2} \sim IG({\text{a}}_{z} ,{\text{b}}_{z} ) $$
(5)

where the hyper-parameter values of (a e , b e ) and (a z , b z ) are positive constants.

All hyper-parameters (i.e., \( \sigma _{0}^{2} \), a e , b e , a z , and b z ) were given such that prior distributions were non-informative. Non-informative priors have large or infinite variance so parameter estimations are largely driven by data and less by the prior-distributions specified.

A complete list of prior distributions and hyper-parameters are summarized in Table 2. Further, a diagram illustrating the inter-relationships between model components (Eqs. 3–5) is given in Fig. 4.

Table 2 Prior distributions and hyper-parameter values for co-variates (β), model variance (\( \sigma _{e}^{2} \)) and spatial variance (\( \tau ^{2} \))
Fig. 4
figure 4

Diagram of the spatial Bayesian hierarchical model described by Eqs. 3–5

Bayesian computation using WinBUGS

Posterior distributions are estimated using the Gibbs sampler in WinBUGS. If the model has k parameters, θ = (θ 1,…,θ k )′, the Gibbs sampler generates random samples from each of the full conditional distributions {p(θ i |θ ji , y), i = 1,…,k} in the model. Given initial values of \( \left\{ {\theta _{2}^{{(0)}} , \ldots ,\theta _{k}^{{(0)}} } \right\} \), The Gibbs sampler iterates as follows (Banerjee et al. 2004):

  • Step 1: Draw \( \theta _{1}^{{\left( t \right)}} \;{\text{from}}\;p\left( {\theta _{ 1} |\theta _{2}^{{(t - 1)}} ,\theta _{3}^{{(t - 1)}} , \ldots ,\theta _{k}^{{(t - 1)}} ,{\mathbf{y}}} \right) \)

  • Step 2: Draw \( \theta _{2}^{{\left( t \right)}} \;{\text{from}}\;p\left( {\theta _{ 1} |\theta _{1}^{{(t)}} ,\theta _{3}^{{(t - 1)}} , \ldots ,\theta _{k}^{{(t - 1)}} ,{\mathbf{y}}} \right) \)

  • ·

  • ·

  • Step k: Draw \( \theta _{k}^{{\left( t \right)}} \;{\text{from}}\;p\left( {\theta _{ 1} |\theta _{1}^{{(t)}} ,\theta _{2}^{{(t)}} , \ldots ,\theta _{{k - 1}}^{{(t)}} ,{\mathbf{y}}} \right) \)

Thus, when t is sufficiently large, \( \left\{ {\theta _{1}^{{(t)}} , \ldots ,\theta _{k}^{{(t)}} } \right\} \) can be shown to converge to the true joint posterior distribution p(θ 1,…,θ k |y).

Model implementation

Three versions of the baseline model (Eq. 3) were implemented. The first version, Model (a), was run for each year with neither temporal co-variates nor the spatial effect Z i in the model; the second version, Model (b), was run for each year with the spatial effect Z i , but without temporal co-variates; and the last version, Model (c), was run for the 4-year combined dataset with the three temporal co-variates and the spatial effect Z i .

For each version of the baseline model, posterior densities for each model parameter were obtained after 100,000 iterations, with the first 10,000 iterations discarded as “burn-in”. Model runtime for all model versions was less than 10 min using a Pentium IV computer.

Results and discussion

By-year analysis without spatial effect

Parameter posterior means, standard deviations (SD) and 95% credible intervals obtained using Model (a) are given in Table 3. Soil ECa was negatively related to corn yield in three (1997, 1999 and 2001) out of the four site-years, with statistical significance in 1997 and 1999 (i.e., the 95% credible intervals excluded zero). For claypan soils, negative correlations between ECa and crop yield under droughty growing seasons has been previously reported, especially when rainfall was below normal in July and August (Kitchen et al. 1999). In agreement with these findings, 1997 and 1999 had well-below-normal precipitation during July or August (Fig. 1). Slope was negatively and significantly related to yield for all four site-years. The relationships between slope and crop yield are generally negative (Yang et al. 1998; Kravchenko and Bullock 2000; Jiang and Thelen 2004) because steep slopes usually result in severe erosion characterized by thinner topsoil, higher clay content, lower infiltration rate, and greater runoff and, hence, lower soil productivity. Another topographic variable, elevation, was negatively significant only in 2003. Relationships of elevation to crop yield varied, depending on location and year. As Kravchenko and Bullock (2000) pointed out, the effect of elevation on yield is reflected through water availability, and this effect is more readily observed under extreme weather conditions and field topography. The elevation change was gradual in our field and thus elevation did not affect crop yield in most of the site-years. The negative relationship in 2003 was probably because more soil water was stored in the low-elevation areas of the field. The low-elevation areas (depositional areas) had greater topsoil (silt loam) thickness, and topsoil usually provides about twice as much plant available water capacity as the underlying claypan. On the other hand, erosional areas, especially the highly eroded side-slopes where claypan was near the soil surface, often show deficiencies in soil water storage due to slow recharge of the clay material (Jiang et al. 2007). This effect was particularly an issue in 2003 because below-normal precipitation was also recorded for the fall/winter recharge period before the 2003 growing season (data not shown). This resulted in very low level of soil water, except for the depositional areas in the low elevations.

Table 3 Posterior means, standard deviations, and 95% credible intervals obtained using Model (a)

Model residuals of yield (predicted yield–measured yield) are mapped in Fig. 5. Spatial patterns in the residuals were similar to those seen in the yield maps (Fig. 2). Clearly, these spatial patterns in the yield were preserved in the residuals without a spatial component in the model. Large positive residuals (over-estimation) were mostly in the most-eroded side-slope areas, whereas large negative residuals (under-estimation) occurred along the lower drainage-way and the outlet on the north. Another distinctive pattern can be found around the south-west corner of the field. The south end of the field had not been in crop production until 1980, and the south-west portion of the field was the location of an old farmhouse and livestock housing area. Therefore, animal manures, as well as less eroded soil, contributed to the higher yield in that part of the field (under-estimation).

Fig. 5
figure 5

Residual (predicted yield–observed yield) maps of corn yield obtained using Model (a), without a spatial component. Numbers in parentheses are number of cells in each yield range. The dimension of each map cell is 30 m by 30 m

By-year analysis with spatial effect

Parameter posterior means, standard deviations and 95% credible intervals obtained using Model (b) are given in Table 4. In general, consistent relationships were found for soil and topographic co-variates with those obtained by Model (a). For example, soil ECa was negatively significant in 1997 and 1999 and slope was negatively significant for all four site-years. The standard deviations of the soil ECa and slope estimates were comparable between Models (a) and (b). The standard deviation of elevation, however, increased for all four years compared with Model (a). For example, in 2003 the standard deviation increased to 0.107 Mg ha−1 in Model (b) from 0.063 Mg ha−1 in Model (a) and as a result, elevation was no longer significant. This result was possibly because the spatial association of yield was related to elevation changes in parts of the field. With the spatial effect added, the model standard deviations (σ e ) decreased for all 4 years, from an average of 0.778 Mg ha−1 for Model (a) to 0.382 Mg ha−1 for Model (b). The standard deviations of the spatial effect (τ) were lower for the dry and low-yielding years (1999 and 2003) and higher for the high-yielding years (1997 and 2001). The correlation coefficient was 0.7 (not shown).

Table 4 Posterior means and 95% credible intervals for regression coefficients, and model and spatial error, using Model (b)

Spatial effect maps are presented in Fig. 6. The Model (b) spatial maps were, in large, opposite of the residual maps using Model (a) (Fig. 5). For the spatial effect, patterns of negative values were observed in the eroded side-slope areas and patterns of positive values were found along the drainage outlet areas and in the south-west corner of the field. Visually, more resemblance seemed to exist between 1999 and 2003, the two dry years, and between 1997 and 2001, the two moderately-dry years. Allowing for the small number of site-years, this result suggests that the spatial patterns of yield can be temporally consistent under similar weather patterns. This result agreed with Sadler (1998) and Jiang et al. (2007), who found that spatial pattern of corn yield was temporally consistent when soil water was limited.

Fig. 6
figure 6

Spatial effect maps for corn grain yield using Model (b). Numbers in parentheses are number of cells in each yield range. The dimension of each map cell is 30 m by 30 m

Residual maps obtained using Model (b) are presented in Fig. 7. With yield spatial association accounted for by the CAR model, residuals were greatly reduced, the spatial patterns observed in the residual maps from Model (a) were resolved and the residuals began to look more or less random. This indicated that the spatial CAR model successfully explained the spatial association of yield.

Fig. 7
figure 7

Residual (predicted yield–observed yield) maps of corn yield obtained using Model (b), with a spatial component. Numbers in parentheses are number of cells in each yield range. The dimension of each map cell is 30 m by 30 m

Analysis for four-year data combined with spatial effect

Parameter posterior means, standard deviations and 95% credible intervals obtained using Model (c) are given in Table 5. The Temp.range and Max.temp were negatively related to corn yield, similar to the findings in Hu and Buyanovsky (2003), who reported high corn yields were related to cooler and stable daily temperatures from June through August. Surprisingly, cPREC was not significant, emphasizing that the timing of precipitation during the critical development stages corn plants can be more important than the total amount (Hu and Buyanovsky 2003). When data were combined, the model standard deviation was substantially increased to 0.606 Mg ha−1, compared with the average SD value (0.382 Mg ha−1) of Model (b). This increase was because overall yield variance increased when data were combined. The standard deviation for the spatial effect, however, decreased to 0.900 Mg ha−1 from an average of 1.204 Mg ha−1 using Model (b). These results demonstrated the spatial Bayesian model was able to fit the additional complexities introduced by the inclusion of climate co-variates, and was able to detect the effect of weather conditions on corn yield with a small number of site-years, fewer than generally needed in more traditional methodologies such as multiple regression and some non-linear fitting techniques (Drummond et al. 2003).

Table 5 Posterior means and 95% credible intervals for regression coefficients, and model and spatial effect standard deviations, using Model (c)

The spatial effect map for the 4-year combined yield is presented in Fig. 8. The combined map seemed to be smoother than the spatial effects mapped for individual years using Model (b) (Fig. 6). Nonetheless, the most distinctive patterns were retained. This map indicated how strongly and persistently the corn yield was spatially associated within the study period.

Fig. 8
figure 8

Strength of spatial association, demonstrated using Model (c), for the combined corn yield of 1997, 1999, 2001, and 2003. Numbers in parentheses are number of cells in each yield range. The dimension of each map cell is 30 m by 30 m

Conclusions

A spatial Bayesian hierarchical approach was employed to model the relationships of soil, topographic and climate variables to corn yield. Our results generally agreed with previous research findings in the literature. Compared with other methodologies, the Bayesian hierarchical approach distinguishes itself by being able to explicitly model the spatial association of yield data. This is an advantage to the classical linear regression technique, which assumes independence of the dependent variables in the model. Thus, all data points could be utilized in the analysis, which greatly increased the efficiency of the data and the power of statistical inference. The CAR model successfully captured the structures of spatial association of corn yield both for individual years and for 4 years combined. Model errors were reduced by about 50% when adding the spatial effect component to the model. Further, the approach was able to assess the effects of temporal variables (i.e., climatic variables) on corn yield with a small number of site-years. When more site-years become available, it is conceivable and reasonable to consider a temporal random effect to capture the time dependency in the corn yield over time. The Bayesian model provides the flexibility to accommodate that possibility. The flexible nature of the model also makes it possible to consider the inclusion of additional spatial (e.g., soil or topographic) variables when appropriate.

The WinBUGS software and its spatial module, GEOBUGS, proved to be useful tools in constructing spatial Bayesian models. However, some limitations need to be noted for the sake of future development of the software. First, the ability of the software to handle large datasets is limited. For example, we attempted to run a 10 m by 10 m cell-size resolution (which would give 3634 data points) for yield and each covariate; WinBUGS was unable to handle this. Second, WinBUGS has primitive color schemes for displaying spatial results. Also, it lacks an exporting function by which spatial results can be output to a data file, which then can be imported to a GIS or other package for display and further analysis. As a result, subtle changes in spatial patterns may be challenging to discern. Overall, the WinBUGS software was able to facilitate the adoption of the spatial Bayesian approach by providing relatively easy access to the full Bayesian formulation and computation to scientists working in soil science and other natural resource fields where spatial data are commonplace.