Introduction

A range of responses in the mobilisation and transport of phosphorus (P) by rainfall-runoff has been reported for tilled and untilled conditions across varying agro-climatic environments and soil types (McDowell et al. 2001; Hart et al. 2004). Many studies have sought to establish a relationship between soil P and runoff P concentrations, in the hope that a soil P test could be used for environmental management, and ideally for agronomic management also. No general relationship has yet been established. Although runoff P mostly increases with soil P concentration, such relationships tend to be specific to site, land-use, and experimental method including the soil P test (Wendt and Corey 1980; Schärer et al. 2006; Nash et al. 2007; Tunney et al. 2007). Confounding factors in addition to tillage include recent fertiliser application (Hart et al. 2004) and variation in soil P-buffering capacity (Dougherty et al. 2010).

Studies of P runoff from dairy pastures in Australia show at paddock scale (Baginska et al. 1998; Cornish et al. 2002) and plot scale (Cornish et al. 2002; Dougherty et al. 2008a) that total P concentrations in runoff can be high (>1 mg L−1), predominantly in the dissolved form, and broadly related to soil P concentration. Other studies of pasture grazed by sheep at various locations in south-eastern Australia have also established rising relationships between the concentrations of P in soil and in runoff (McCaskill et al. 2003). However, methodological differences make it impossible to derive a single relationship from these studies that holds for all or even most of them, complicating the use of soil testing for environmental P management.

A further complication with managing P runoff from pastures is the lack of agreement about whether P is mobilised from pastoral land primarily by dissolution, or by particle detachment, as it is from cultivated land. Many studies at catchment scale indicate that particulate P dominates export from agricultural land, including pasture (e.g. McColl 1978; Smith 1987; Stevens et al. 1999; Douglas et al. 2007). However, whilst the majority of P measured in outflow from a catchment may occur as particulate P, much of that P may have been dissolved initially and adsorbed on to suspended particulate matter either in-stream or between collection and analysis of water samples (Sharpley et al. 1981, 2000; Lambert et al. 1992). In several Australian studies, up to 90 % of P in surface runoff from pastoral sites was in the form of dissolved reactive P (Nash and Murdoch 1997; Nash et al. 2000; Cornish et al. 2002). However, there was considerable variation in the proportions of total P as dissolved reactive P amongst these findings, which may be accounted for by variable sorption of P following mobilisation, or simply by differences in vegetative cover and soil erosion. Given typical total soil P concentrations of 0.04–0.4 % (Whitehead 2000), even low sediment concentrations may contribute appreciable P to runoff.

The aims of our research are first to understand the relative importance of dissolution and particle detachment as P mobilisation processes in surface runoff from pasture, and second to evaluate whether a single relationship between soil properties and runoff P could be used for environmental management across soil types and varying land-use intensity, if confined to pastoral land and developed with a consistent methodology across sites.

This paper reports on-farm research using a rainfall-simulator to investigate P loss in surface runoff from pastoral sites and its relationships with land use, groundcover, soil type, and soil P properties. The context was the Sydney Drinking Water Catchments in New South Wales, Australia, where the Sydney Catchment Authority provides guidance to land managers regarding fertiliser use, effluent disposal and other practices that may impact on water quality. This is a large catchment (15,700 km2) with low population density, so the detailed data required for complex water quality models or risk assessment tools are unavailable. The Sydney Catchment Authority requires broadly applicable guidelines that use readily obtainable inputs, to assist it to meet its water quality targets.

Materials and methods

Study area

The Sydney Drinking Water Catchments are situated in SE NSW, Australia. More than 60 % of the catchment area is non-agricultural land, mostly forest. Land use in the remaining 485,000 ha is mostly pastoral, ranging from intensive dairy on improved pasture through to fine wool production on unfertilised native pasture. Geology is varied, with sandstone, metamorphic rock, granite, basalt and shale comprising the most common lithological groups, making up 70 % of the catchment area (SCA/DWLC 2003). The soils derived from these parent materials cover a wide range of Australian Soil Classification orders, including Ferrosols derived from basalt, Chromosols and Kurosols from shale and metamorphic rock, and Kandosols and Tenosols from sandstone. Average annual rainfall for the catchment ranges from around 1,500 mm in the eastern-most areas, to around 600 mm in the west.

Locations and sites

Fifteen locations were included in the study, spread across the middle section of the catchment between the towns of Robertson (34°35′S, 150°35′E) in the east and Goulburn (34°45′S, 149°44′E) in the west. At each location, from two to six sites were selected for rainfall simulation, within 30-40 m of each other, thus with the same general land-use and lithology but varying soil P and vegetative cover. This resulted in a total of 38 sites for rainfall simulations. With the design of the rainfall simulator, each site was comprised of two plots, each with its own soil P and runoff data, giving a total of 76 plots.

All locations fell into one of the following designations of current land use/pasture type: native/unimproved pasture; rural residential (cattle, sheep or horses); improved pasture (excluding dairying); and dairy pasture. Dairying is a relatively small and declining industry in the region, but as dairy pasture is often associated with high P in runoff, one-third of the locations were selected because they were used currently or in the recent past for dairy production. Based on information provided by the present landowner on the fertiliser used in recent years, pasture improvement and stocking rates, we broadly considered the locations as being presently under ‘low’, ‘moderate’ or ‘high’ land use intensity.

The locations covered the most common soil lithological groups under agriculture in the catchment, namely metamorphic rock, basalt, and shale. Of the 76 plots, 18 were on basalt and 58 were on soils with non-basaltic lithology. Sites were avoided where fertiliser had been applied in the weeks prior to rainfall simulation and with obvious recent defecation, to minimise confounding by ‘incidental’ losses of P (Hart et al. 2004). All sites had a pasture cover, although of varying amount, and none showed obvious signs of soil erosion.

At each site, slope was measured using a spirit level clinometer, and the percentage vegetation cover, using a double layer grid point frame (50 points) of the same dimensions as the runoff plots, after Stanton (1960). Location descriptions are given in Table 1.

Table 1 Location details (range and mean are for sites at each location)

Soil sampling and analysis

For general characterisation of each location, two sets of soil samples were taken along transects in the vicinity of the runoff plot sites. One set of 25–30 core samples (2.5 cm diameter) was collected from the standard depth of 0–10 cm used for agronomic purposes. The other set of 25–30 samples (5 cm diameter) was collected from 0 to 2 cm, the depth commonly used for environmental purposes because surface runoff is thought to interact mainly with the top few mm of soil (Sharpley 1985). The cores were bulked, air dried at 40 °C, and passed through a 2 mm sieve.

Samples from both depths were analysed for bicarbonate-extractable (Colwell 1963) soil test P. The Colwell test is widely used in Australia for agronomic purposes, thus providing a large data base of potentially useful information for environmental-P management as well as agronomic management for which critical values are well established (Gourley et al. 2007). Briefly, 1.0 g soil was extracted with 100 mL 0.5 M NaHCO3, adjusted to pH 8.5, by shaking end-over-end for 16 h. Extracts were filtered (Adventec 5A) and extractable P was then measured, after colour development based on the molybdate blue method of Murphy and Riley (1962) (Rayment and Higginson 1992), using a Lachat Quickchem 8000 FIA+ ion flow injection analyser (LaChat Instruments, Milwaukee, WI).

The transect soil samples from the 0–10 cm depth were also analysed for phosphorus buffering index, the currently accepted commercial approach to measuring soil buffering capacity in Australia (Burkitt et al. 2002), as well as soil total P (TKP digestion in 18 M H2SO4 in the presence of a Cu catalyst, measured by FIA—QuickChem method 13-115-01-1-B), Total C and N (LECO CNS2000 Analyser, Leco Corporation, St. Joseph, MI) and pH (1:5 water).

Soil properties determined from the transect sampling are summarised in Table 2. With respect to soil P properties at the 15 locations, total soil P ranged from 40 mg kg−1 for low input pasture on soil of metamorphic origin to 2,400 mg kg−1 for soil under intensive dairy management on soils derived from basalt. Colwell soil test P for the 0–10 cm depth (Colwell0–10) varied between locations from 7 to 307 mg kg−1 reflecting land use intensity, with soil test P for low input locations ranging from 7 to 21 mg kg−1 (mean 19 ± 1.9), moderate input locations from 15 to 60 mg kg−1 (54 ± 8.8), and high input locations from 37 to 307 mg kg−1 (159 ± 27.6). Colwell soil test P values for 10 of the 15 locations were in the range where pastures may respond to added P (<75 mg kg−1, refer to Table 2 footnotes and Havilah et al. 2005). The other 5 locations had soil test P values of between 60 and 307 mg kg−1. Four of these 5 locations were used now or in the past for dairy production. Phosphorus buffering index varied with lithology. Mean values for basalt (991, no units), shale (236) and metamorphic (61) locations fell within phosphorus buffering index categories of high, moderate and very low, respectively (Table 2).

Table 2 Soil properties at each location, rated low (L), medium (M) or high (H) for land-use intensity (0–10 cm depth, unless specified otherwise)

To relate soil P to runoff P concentration, soils were also sampled from within the 76 rainfall-runoff simulator plots (10 cores per plot, bulked) and analysed for bicarbonate-extractable soil test P (0–2 and 0–10 cm) and phosphorus buffering index (0–10 cm) as described above.

Rainfall simulator and simulations

An oscillating boom rainfall simulator was used to generate surface runoff. It consisted of three Veejet 80100 flat fan nozzles attached to a hollow boom, mounted on an aluminium A-frame, 2.4 m above the ground, similar to Loch et al. (2001). The simulator allows ‘rainfall’ to be applied to two adjacent plots, 2.0 × 0.75 m in size. The plot boundaries were defined by lengths of 3 mm sheet steel, 150 mm wide, bolted together to form open-ended rectangles. These were aligned parallel with the direction of slope, open at the down-slope end, with a gap of 10 cm between the two plots. Steel boundaries were driven into the ground to a depth of 75 mm. A shallow trench was dug at the open end of each plot into which a stainless steel trough was located to collect runoff. A thin band of bentonite clay applied to the front edge formed a watertight seal between the trough and the plot surface. Perspex sheets were placed at an angle over the troughs, to prevent water being sprayed directly into them. The surface runoff from the plots was collected in gravity-fed containers a few meters down slope of the plots.

The first set of 27 simulations (locations 1–9) was conducted in Oct-Nov 2005. One of the paired plots in each of these simulations was chosen randomly and covered with shadecloth (70 % mesh) supported 150 mm above the soil surface on a rectangular wooden frame placed on the metal boundary sheets. Thus, the covered plots received the same rainfall intensity as the uncovered plots but with significantly reduced raindrop energy (Loch and Foley 1994). Vegetative cover on the paired plots varied from 46 to 100 %. Of the 27 pairs, 24 were on non-basalt soils. The pairs were matched well for groundcover (%), with the relationship between plots ‘with’ and ‘without’ shadecloth being:

$$ {\text{y}} = 1.035{\text{x}} - 3.21\left( {R^{2} = 0.99,P < 0.001} \right) $$
(1)

In a second set of 11 simulations conducted in Nov. 2006, the shadecloth cover was not used. These provided a further 22 uncovered plots from locations 10–15. Twelve of these plots (6 pairs) were on basalt soils and the other 10 plots (5 pairs) were on non-basalt soils.

The total data-set comprised of 76 plots, each with its own soil P and runoff data; 27 plots with shadecloth from the first set, and 49 plots without shadecloth including 27 and 22 plots from the first and second set, respectively.

Water applied by the simulator was sourced from the Sydney Catchment Authority Pumping Station at Burrawang, NSW. Previous laboratory testing showed that the P concentration of this water was close to detectable limits, typically <0.01 mg L−1. To determine rainfall rate prior to each rainfall-runoff simulation, plots were covered with plastic sheeting that directed 100 % of the water applied into the collection troughs. The volume of runoff collected from each plot over 5 min was then measured, and if necessary the simulator was re-positioned so that each of the paired plots received approximately the same volume of water during a simulation run. In this study, a rainfall intensity of 60 mm h−1 was applied until runoff commenced, after which rainfall application was continued for a further 15 min.

Runoff sample collection and analysis

Surface runoff collected over this period from individual plots was bulked before sampling for analysis. Samples were stored on ice in the field before being returned to the laboratory, where they were stored at −20 °C until analysis.

Sub-aliquots of the collected runoff were filtered in the field (0.45 μm) immediately after collection for dissolved reactive P analysis, based on the molybdate blue method of Murphy and Riley (1962), using FIA as described above. Bulk sub-samples of the collected runoff were taken for analysis of total P and total suspended solids. Total P was determined by alkaline persulphate oxidation, after Hosomi and Sudo (1986) and Maher et al. (2002). The percentage of total P present as dissolved reactive P (%DRP) was calculated to express the relative magnitude of the two mobilisation processes, dissolution and particle detachment.

Although alkaline persulphate oxidation is widely used for total P determination in turbid water, some of the total P was evidently not fully extracted from 6 of the runoff samples in which dissolved reactive P values substantially exceeded their total P values (3 sites, each covered and uncovered with shadecloth). These sites were all at Location 4, which had soils derived from shale and a long history of dairy production with high inputs (Table 1) resulting in concentrations of Colwell P (211 mg kg−1) and total P (1,340 mg kg−1) that are extreme for soils with this lithology in Australia. Incomplete extraction of total P may result from condensation, complexation or occlusion of unfiltered (high total suspended solids) high-P samples during storage (Maher and Woo 1998; Jarvie et al. 2002). Repeating the analysis and the use of more aggressive extraction techniques (concentrated nitric + sulphuric acid; Pote and Daniel 2000) failed to resolve this issue. Data from Location 4 were therefore excluded from any statistical analyses requiring total P but retained in analyses of dissolved reactive P. We assume the problem was confined to total P in runoff samples from the shale soil with extreme P concentrations. Basalt soils are inherently higher in total P than non-basalt soils, but samples from basalt soils revealed no problem with total P extraction.

Total suspended solids were determined after method 2540D, described in Standard Methods (APHA 2005).

Statistical analysis

Statistical analyses were conducted using MS Excel 2003, Minitab 14.1 and S-Plus 6.1. Regression relationships between soil P and P in runoff for all of the soil types were compared statistically before dividing soil types into two groups for analysis, namely basalt and non-basalt soils. A paired t test was used for comparisons involving paired plots. The measure of dispersion given after a mean is the standard error of the mean. In linear regression, where the intercept was not significantly different from zero (P ≥ 0.05) the regression has been forced through the origin. Multiple regression was used to predict runoff P concentration from soil test P and phosphorus buffering index. The equations were developed with data from plots with shadecloth, to minimise confounding from soil erosion, and tested with an independent set of data from plots without shadecloth but with high vegetative cover and low surface erosion.

Results

Soil test P at two sampling depths

Concentrations of soil test P in the 0–2 cm layer were higher than the 0-10 cm layer (Fig. 1). For basalt soils, the relationship between extractable P at the two depths was curvilinear, tending towards a 1:1 relationship as the concentration increased. For non-basalt soils, the relationship between depths was linear. Both regressions were highly significant (P < 0.001).

Fig. 1
figure 1

Bicarbonate-extractable (Colwell) soil P at two sampling depths in non- basalt soil (diamond), y = 1.46x + 7.3 (R 2 = 0.95, P < 0.001) and basalt soil (square), y = − 0.002x2 + 1.82x + 26.5 (R 2 = 0.97, P < 0.001)

The percentage of total P present as dissolved reactive P in runoff (%DRP)

The  %DRP varied widely, regardless of soil type and almost regardless of the absolute concentration of total P in runoff (Fig. 2). Three factors contributed to this variation.

Fig. 2
figure 2

The percentage of total P present as dissolved reactive P in runoff (%DRP) from plots uncovered or covered with shadecloth on metamorphic (triangle), shale (diamond), and basalt (square) soils. A subset of 24 uncovered plots was paired with covered plots for paired t test comparison of  %DRP (see text)

First, %DRP was greater in plots with shadecloth (44 %, range 7–100 %) than paired-plots without shadecloth (28 %, range 1–96 %) (P < 0.01). These 24 plot-pairs comprised all of the covered plots and a sub-set of the uncovered plots shown in Fig. 2 (Location 4 was not included, see “Materials and methods”). Greater %DRP with shadecloth was associated with lower concentrations of total suspended solids, viz. 167 mg L−1 compared with 690 mg L−1 from uncovered plots (P < 0.01).

Second, for uncovered plots, %DRP was greater in basalt soils (73 ± 8.5 %) than shale (56 ± 8.23 %) and metamorphic (34 ± 11.4 %) soils (P < 0.05), thus appearing to mirror the differences in phosphorus buffering index shown in Table 2.

Third, weak but significant positive relationships between %DRP (y) and soil test P(x) were observed within the non-basalt group of sites, whether shadecloth was used (Eq. 2) or not (Eq. 3):

$$ {\text{y}} = - 72.7 + 38.7{\text{Ln}}\left( {\text{x}} \right)\left( {R^{2} = 0.53,P < 0.01} \right) $$
(2)
$$ {\text{y}} = - 14.9 + 14.7{\text{Ln}}\left( {\text{x}} \right)(R^{2} = 0.24,P < 0.05) $$
(3)

Effects of soil cover on concentrations of dissolved reactive P, total P and total suspended solids in runoff

For paired-plots (excluding Location 4), the mean dissolved reactive P concentration with shadecloth (0.23 mg L−1) was not significantly different (P ≥ 0.05) from dissolved reactive P concentration without shadecloth (0.28 mg L−1). When the plots with extreme soil P from Location 4 were included, shadecloth cover resulted in a small but significant (P < 0.05) decrease in dissolved reactive P concentration, from 0.76 to 0.64 mg L−1.

Shadecloth had potentially much greater effects on total P concentration (Fig. 3), but this was manifest only at the seven sites shown by closed symbols, for which total P averaged 2.78 mg L−1 when uncovered and 0.69 mg L−1 when covered (P < 0.001). With these seven plot-pairs, the difference in total P reflected a difference in sediment (total suspended solids) concentration which averaged 422 mg L−1 from the covered plots and 1,751 mg L−1 from the paired uncovered plots (P < 0.05). These plots were at sites with relatively low pasture cover in Table 1. Shadecloth had little effect on total P in runoff from plots with high vegetative cover and low total suspended solids (open diamonds), as shown by the near 1:1 regression for these data (Fig. 3).

Fig. 3
figure 3

The effect of shadecloth on total P (TP) concentration (mg L−1) in runoff from non-basalt soils (excluding Location 4). The regression uses only data from plots with high vegetative cover and low total suspended solids (TSS) in runoff (open diamond), y = 0.043 + 1.09x (R 2 = 0.96, P < 0.01)

Increasing vegetative cover in the absence of shadecloth substantially reduced sediment concentrations in runoff (Fig. 4). However, there was no such relationship between vegetative cover and total P in runoff because total P in soil (Table 2) varied between locations with similar groundcover (Table 1), resulting in P concentrations in the sediment in runoff ranging between 0.03 and 2.0 % (mean 0.34 %).

Fig. 4
figure 4

Relationship between total suspended solids (TSS) in runoff and ground cover, for sites with no shadecloth, y = 32.301e−0.0529x (R 2 = 0.65)

Effect of soil P concentration on P concentrations in runoff

Total P in runoff from plots without shadecloth

Despite a general trend for total P to rise with soil test P in the non-basalt group of soils, the data were variable and the relationships were not statistically significant for either the non-basalt or basalt soils, at either sampling depth (0–2 or 0–10 cm) (Fig. 5).

Fig. 5
figure 5

Concentrations of total P (TP) in runoff in response to Colwell soil P, from basalt and non basalt soils and soil sampling from 0–2 to 0–10 cm depths. Sites with no shadecloth (excludes sites at location 4, see “Materials and methods” for explanation)

Dissolved reactive P in runoff from plots without shadecloth

Dissolved reactive P concentration in runoff responded positively to soil P concentration amongst both basalt and non-basalt soils (Fig. 6). Differences in runoff dissolved reactive P concentration between soil types were small at low soil test P but increased as soil test P increased. The slopes of the regressions were significantly different (P < 0.001). Consequently, dissolved reactive P concentrations in runoff were much greater in non-basalt than basalt soils at higher soil P concentrations.

Fig. 6
figure 6

Concentrations of dissolved reactive P (DRP) in runoff in response to Colwell soil P, from non-basalt and basalt soils and soil sampling from 0–2 to 0–10 cm depth. Sites with no shadecloth, includes location 4. For non-basalt, 0–2 cm depth, y = −0.315 + 0.014x (R 2 = 0.90) and for 0–10 cm depth, y = − 0.21 + 0.02x (R 2 = 0.89). For basalt soils, 0–2 cm depth, y = 0.047 + 0.002x (R 2 = 0.55) and for 0–10 cm depth, y = 0.136 + 0.002x (R 2 = 0.61)

Soil sampling depth affected the slope of the regression within non-basalt soils (P < 0.05), but the strength of the relationship was comparable for the two sampling depths (R 2 = 0.90 and 0.89).

Effect of shadecloth on the relationship between soil test P and P in runoff

This analysis was restricted to non-basalt sites because of the small number of basalt sites with shadecloth. Data were from the 0-10 cm soil sampling depth. There was a positive relationship between soil test P and total P in runoff when plots were covered with shadecloth but no relationship without shadecloth (Fig. 7, top). Dissolved reactive P concentrations also rose with increasing soil test P, but this relationship was significant whether plots were covered with shadecloth or not (Fig. 7, bottom). Linear regressions were fitted to these data because there was no strong statistical argument to use a non linear model across all of the data, but it is important to note that substantial increases in either total P or dissolved reactive P in the covered plots (and dissolved reactive P in uncovered plots) occurred only when Colwell0–10 soil test P was greater than about 30 mg kg−1.

Fig. 7
figure 7

Effects of shadecloth on concentrations in runoff total P (TP, top) and dissolved reactive P (DRP, bottom) in response to Colwell soil P (0–10 cm), non basalt soils (total P excludes data from location 4, see text). For total P plus shadecloth, y = − 0.08 + 0.016x (R 2 = 0.64). For dissolved reactive P minus shadecloth, y = − 0.21 + 0.02x (R 2 = 0.89). For dissolved reactive P plus shadecloth, y = − 0.32 + 0.021x (R 2 = 0.91)

Effect of soil P buffering index on P concentrations in runoff, with shadecloth

This analysis was restricted to plots with shadecloth cover to minimise the confounding effects of surface soil erosion, thus restricting the analysis to non-basalt soils. The response of P in runoff to soil phosphorus buffering index is given in Fig. 8, but because soil test P and phosphorus buffering index may be confounded, the soil test P associated with each value for phosphorus buffering index is shown in lieu of the customary symbols. P concentrations in runoff clearly rose as phosphorus buffering index fell, but all of the notably elevated values for total P (>0.3 mg L−1) and dissolved reactive P (>0.1 mg L−1) were also associated with soils with relatively high soil test P (>30 mg kg−1). Nevertheless, the possible importance of phosphorus buffering index independently of soil test P is suggested by the sites falling within a narrow range of soil test P (26–35 mg kg−1), shown in the inset graphs in Fig. 8.

Fig. 8
figure 8

The response of total P (TP) and dissolved reactive P (DRP) in runoff to phosphorus buffering index and soil test P for non-basalt soils with shadecloth cover. Numbers on the figure replacing symbols refer to soil test P (mg kg−1) for each plot. Insert graphs are for plots within a narrow range of soil test P (26–35 mg kg−1) below the agronomic critical value (refer to text). For the inset graphs: total P = 10.49x−0.84 (R 2 = 0.94); dissolved reactive P = 10.95x−1.08 (R 2 = 0.86)

The interaction between phosphorus buffering index and soil test P implicit in Fig. 8 for the covered non-basalt soils parallels the significant interaction in Fig. 6 where runoff dissolved reactive P concentrations from the highly buffered basalt soils were similar to the more weakly buffered non-basalt soils at low soil test P, but much lower at higher soil test P.

Predicting P concentrations in runoff

No statistical model was significant when fitted to the soil-P/runoff-P data for all of the sites. However, even without statistical analysis it was clear that high P concentrations in runoff could result from surface soil erosion (Fig. 3) and extreme soil P concentrations (Figs. 6, 7). The greater need is to predict runoff-P concentrations in the more uncertain situation where erosion is not a major factor and soil test P is within the range where farmers are most likely to be making decisions about fertiliser use and disposal of manure and effluent. Therefore, data were examined from plots with shadecloth (to manage erosion) and with <100 mg P kg−1 to determine if P concentrations in runoff could be predicted satisfactorily from the measured soil variables. The resultant regression equations were then tested using an independent set of data from plots with good vegetative cover (or <500 mg total suspended solids L−1).

The sites with shadecloth included 20 non-basalt and four basalt plots. The highest soil test P value in these data was 69 mg kg−1. The multiple regressions that best predicted total P and dissolved reactive P in runoff from soil Colwell0-10 soil test P, phosphorus buffering index and their interaction were:

$$ \begin{aligned} {\text{Total}}\;{\text{P}} & = 0.254-0.014\;{\text{soil}}\;{\text{test}}\;{\text{P}} + 0.0003\;{\text{soil}}\;{\text{test}}\;{\text{P}}^{2} -6.59\left( {1/{\text{phosphorus}}\;{\text{buffering}}\;{\text{index}}} \right) \\ & \quad + 0.606\left( {{\text{soil}}\;{\text{test}}\;{\text{P}} \times 1/{\text{phosphorus}}\;{\text{buffering}}\;{\text{index}}} \right) \\ & \quad \left( {R^{2} = 0.91,P < 0.001} \right) \\ \end{aligned} $$
(4)
$$ \begin{aligned} {\text{Dissolved}}\;{\text{reactive}}\;{\text{P}} & = 0.293-0.022{\text{ soil}}\;{\text{test}}\;{\text{P}} + 0.0003{\text{ soil}}\;{\text{test}}\;{\text{P}}^{2} -14.08\left( {1/{\text{phosphorus}}\;{\text{buffering}}\;{\text{index}}} \right) \\ & \quad + 0.813\left( {{\text{soil}}\;{\text{test}}\;{\text{P}} \times 1/{\text{phosphorus}}\;{\text{buffering}}\;{\text{index}}} \right) \\ & \quad \left( {R^{2} = 0.94, P < 0.001} \right) \\ \end{aligned} $$
(5)

Regressions of the observed values for total P and dissolved reactive P and those predicted from Eqs. 4 and 5 passed through the origin with slopes of 1.08 (total P) and 0.78 (dissolved reactive P) (Fig. 9). The models systematically slightly under-predicted dissolved reactive P at high concentrations but there was no evident bias in the predictions of total P.

Fig. 9
figure 9

Measured and predicted total P (TP) and dissolved reactive P (DRP) in runoff from soils with soil test P < 70 mg kg−1, with shadecloth cover. For total P, y = 1.08x (R 2 = 0.83). For dissolved reactive P, y = 0.78x (R 2 = 0.85)

The independent data comprised 29 plots without shadecloth but with good vegetative cover. There were 23 plots on non-basalt soils and six on basalt. Plots had >90 % groundcover and the highest soil test P value was 94 mg P kg−1. Some of these plots were pairs for the uncovered plots from the first set of simulations in 2005 and others were from the second set of simulations in 2006. Both regressions were significant (P < 0.01), with high R 2. However, Eq. 4 tended to under-predict total P at higher concentrations and Eq. 5 under-predicted dissolved reactive P at all concentrations.

Discussion and conclusion

Numerous studies in south-eastern Australia have together established that relatively higher P concentrations in runoff from pastures are generally associated with higher P concentrations in soil (Costin 1980; Baginska et al. 1998; Cornish et al. 2002; McCaskill et al. 2003; Barlow et al. 2005; Dougherty et al. 2008a, b). However, they provide no guideline for soil P that may be used for environmental risk assessment and management because any quantitative relationships between the concentrations of soil P and runoff P are location and experiment-specific, as with other research internationally (Hart et al. 2004).

We aimed to determine if a generally applicable relationship between concentrations of soil P and runoff P could be developed for pastoral land by using a single methodology on fields that covered a range of soil types and land-use intensity, and if so to propose guidelines for soil P to use in environmental management. The sites covered a wide range of values for soil properties thought to influence P concentrations in runoff, but avoided sites with recent fertiliser or manure accretion to minimise confounding by ‘incidental’ losses of P (Hart et al. 2004). In some cases runoff plots were paired, with one of each pair being covered with shadecloth to reduce rainfall energy to help distinguish between dissolution and particle detachment (soil erosion) as primary mobilisation processes.

Before considering relationships between soil P and runoff P, the following discussion addresses the relative importance of the mobilisation processes. This is important, as the form in which P is mobilised determines the approach taken to managing P losses in runoff. For example, if particulate P is the predominant form mobilised, then it may be beneficial to mitigate soil erosion or construct vegetated buffer strips near to the source, but these measures may have little or no effect on P exports if dissolved reactive P predominates (Dorioz et al. 2006; Fiener and Auerswald 2009). Despite the fundamental importance of this principle, there is no agreement in the literature as to whether P is mobilised from pastures predominantly as dissolved reactive P or in the particulate form resulting from surface soil erosion (Hart et al. 2004; Bilotta et al. 2008). This uncertainty arises partly from the possibility of dissolved reactive P sorption to soil in subsequent overland flow (Sharpley et al. 1981, 2000; Lambert et al. 1992), which we minimised by collecting and filtering samples immediately after runoff.

The mobilisation process in pastures

The relative importance of dissolution and particle detachment

Despite the lack of cultivation and the absence of obvious soil erosion at the pastoral study sites, dissolution did not always dominate P mobilisation. Three factors contributed to variation in the relative magnitude of P mobilisation as dissolved P, expressed as the percentage of total P as dissolved reactive P (%DRP).

First, the amount of vegetative cover was not always adequate to protect against particle detachment. Shadecloth reduced the mean concentrations of sediment in runoff from 690 to 167 mg L−1, thereby decreasing particulate P concentrations and increasing the mean %DRP from 28 % in uncovered plots to 44 % from the paired covered plots. The other two factors, soil type and soil test P, affected the %DRP through their effects on the concentration of dissolved reactive P. Soils derived from basalt had higher average %DRP (73 %) than soils derived from either shale (56 %) or metamorphic parent material (34 %). Whilst this appeared to reflect their differences in soil P buffering capacity, the relationship may not be a causal as the red Ferrosols derived from basalt tend to be structurally more stable than the other soils (mostly Kurosols). High soil test P also contributed to higher %DRP, which was most evident at four sites with non-basalt soils and a history of dairy production. Other research in Australia has found a high proportion of total P as dissolved reactive P in runoff from high-input dairy pastures (e.g. Nash and Murdoch 1997; Barlow et al. 2005; Dougherty et al. 2008a, b).

Surface soil erosion and the loss of P in runoff from pasture

Total P concentration in runoff was related to soil test P, but this was evident only when the confounding effect of soil erosion was reduced by protecting soils with shadecloth (Fig. 7, top). The importance of soil erosion was at first surprising, given that there was no obvious surface erosion at any of the sites. However, amongst the uncovered plots, sediment concentrations rose sharply as ground cover fell below 90 % (Fig. 4), with the highest total suspended solids concentrations (~3 g L−1) observed when vegetative groundcover was lowest (~50 %). The effect of ground cover on soil erosion is well known (Costin 1980; Castillo et al. 1997; Loch 2000) and there are published standards for the amount of cover recommended to protect against erosion. In arable farming the amount of residue recommended may be as low as 30 % (Freebairn et al. 2006), whilst for managing surface erosion in permanent pasture the New South Wales Department of Primary Industries and the Hawkesbury-Nepean Catchment Management Authority advise >70 % cover (Anon. 2006).

Managing surface-soil erosion from pastures is thus an obvious first line of attack on P losses in runoff, but our data suggest that targets for managing soil erosion are too low for managing P losses in runoff. This is because erosion rates that are negligible from the perspective of soil conservation may nevertheless result in significant loss of sediment-P in runoff. The following estimates illustrate this point.

With total suspended solids of 3 g L−1, we estimate an average annual rate of soil erosion of <2.0 t ha−1 assuming 600 mm annual rainfall (typical for the west of the study region) and an annual runoff coefficient of 0.1 (Young et al. 2001). This also assumes that total suspended solids concentrations in rainfall-runoff are the same as with the rainfall simulator which, with its high rainfall rate of 60 mm h−1, may over-estimate average annual erosion. Whilst 2 t ha−1 exceeds the “tolerable” rate of soil loss of <0.5 t ha−1 year−1 espoused by Loughran et al. (2004), it is comparable to other studies on well-managed grazed pasture in SE NSW, for example 2.2 t ha−1 year−1 in Mahmoudzadeh et al. (2002) and 3.3 t ha−1 year−1 in Erskine et al. (2002). It is much less than may occur from tillage-based agriculture, e.g. 19 t ha−1 year−1 from vegetable farms near Sydney (Hollinger et al. 2001).

Despite sediment losses from the sites with 50 % groundcover being low from an erosion perspective, the concentrations of total P in runoff were high from an environmental perspective, with 3 mg P L−1 from sites with greatest total suspended solids being 150 times the default trigger value (0.02 mg L−1) requiring risk assessment in upland streams in slightly disturbed ecosystems in SE Australia (ANZECC/ARMCANZ 2000). Thus, a relatively low erosion rate (<2 t ha−1 year−1) and low soil P (Fig. 5) may still lead to runoff P concentrations that concern environmental regulators.

Exports of P or ‘P loading’ are just as important as P concentrations, as the following estimates of annual export coefficient illustrate for sites generating runoff with 3 g L−1 total suspended solids. These calculations assume an average total P of 3 mg L−1 (Fig. 3), rainfall of 600 mm, and runoff coefficient of 0.1 (as above). The estimated export coefficient of 1.8 kg P ha−1 year−1 is comparable with farm-scale estimates in the region that have highlighted concerns with P runoff from pasture land (Baginska et al. 1998; Cornish et al. 2002). Thus, P losses of environmental concern may occur despite the rates of soil erosion being low from the perspective of soil conservation.

The groundcover required to manage P losses in runoff

Although surface erosion rates were low with around 50 % vegetative cover, as shown above, increasing cover to >90–95 % further reduced erosion (Fig. 2). Effective cover reduced total P concentrations in runoff to an order of magnitude less than occurred with 50 % ground cover, whether the cover came from vegetation (Fig. 3) or from shadecloth (Fig. 7). The exceptions to this were sites with extreme soil test P concentrations arising from a history of dairy production. We conclude that maintaining groundcover above the thresholds recommended for erosion management has the potential to significantly reduce total P losses from pasture sites without a history of dairy production.

In our study, low ground cover was sometimes associated with low soil P (Tables 1, 2), so consideration should be given to applying P fertiliser where soil infertility is likely to have contributed to the low ground cover (Robinson and Lazenby 1976; McGufficke 2003).

With respect to environmental water quality guidelines, it is notable that even with good ground cover and low soil test P, total P in runoff at the point of generation will be up to 0.2 mg L−1 (Fig. 7 top, right), which is well above the environmental ‘trigger’ value (0.02 mg L−1) for the streams to which field runoff discharges. Clearly it is inappropriate for environmental regulation to apply these trigger values to in-field or edge-of-field measurements.

Runoff P responses to soil test P and P buffering index

Where surface erosion was controlled through high ground cover or shadecloth, much of the remaining variation in P concentrations in runoff across the diverse soil types and land-use intensity was accounted for by variation in soil test P and phosphorus buffering index. Dissolved reactive P concentrations in runoff increased with soil test P, but in linear regressions the increase was much greater from non-basalt soils with low or moderate phosphorus buffering index than from basalt soils with high phosphorus buffering index (Fig. 6). Consequently, soil test P alone was unrelated to P concentration in runoff for the combined soil types, whereas multiple regression including both soil test P and phosphorus buffering index predicted P concentration in runoff reasonably well (Figs. 9, 10).

Fig. 10
figure 10

Measured and predicted total P (TP) and dissolved reactive P (DRP) for plots without shadecloth but with > 90 % groundcover (or < 500 mg L−1 total suspended solids) and soil test P < 100 mg kg−1. For total P, y = 0.74x (R 2 = 0.92). For dissolved reactive P, y = − 0.12 + 0.94x (R 2 = 0.86)

Whilst a range of runoff-P responses to soil test P has been reported in the literature (McDowell et al. 2001; Hart et al. 2004), the interacting effect of phosphorus buffering index is less well documented under field conditions. However, the chemistry behind the effect has been discussed (Ehlert et al. 2003; Moody 2007) and there is supporting evidence from controlled field experiments. In one laboratory study, Olsen soil test P (also bicarbonate extraction)/P sorption index (PSI) was a good predictor of ‘readily desorbable’ P (Hughes et al. 2006). Under controlled field conditions, dissolved reactive P concentration in overland flow from pasture was related, in one study, to Olsen P and P sorption index (PSI) (McDowell and Condron (2004) and, in another, to degree of P saturation which is a function of soil test P and PSI (Allen et al. 2006). In Australia, Dougherty et al. (2010) found that 80 % of the variation in dissolved reactive P concentration in runoff from pastures growing in reconstructed soils in runoff trays was explained by bicarbonate-extractable soil test P (Colwell) and phosphorus buffering index, which measure soil P quantity and buffering, respectively.

The present study included a heterogeneous set of field sites. In basalt soil, the runoff-P response to soil test P was clearly linear, as well as being smaller than in the non-basalt soils as discussed above. Statistically, the non-basalt soils behaved as a group, although contrasting responses may have been revealed with more data, given that phosphorus buffering index ranged from low to moderate (Table 2) and the response to soil test P appears much more likely to be non-linear in soils with low phosphorus buffering index (Dougherty et al. 2010).

For the non-basalt soils, the data for runoff P over the full range of soil test P were fitted to linear regression rather than non-linear regression, for statistical reasons, but inspection of the data suggests the response may be non-linear, at least at lower values of soil test P. There was no systematic rise in runoff P concentration up to about 50 mg Colwell0–2 P kg−1 (or 30 mg Colwell0–10 P kg−1) (Fig. 7). Some authors have fitted ‘bent stick’ statistical models to their data to capture this non-linearity, using the ‘break point’ to define a critical P concentration for environmental purposes (McDowell et al. 2003).

In the absence of data for P buffering capacity, 30 mg Colwell0–10 P kg−1 could be considered as a single ‘critical P’ concentration for environmental management of non-basalt pasture soils. However, there was clearly an effect of phosphorus buffering index even if the non-basalt soils were statistically homogeneous. For example, for a selection of non-basalt sites with soil test P of 30 (±5) mg kg−1, both total P and dissolved reactive P increased by an order of magnitude as phosphorus buffering index fell from 400 (moderate) to 25 (very low) (Fig. 8). This undoubtedly explains some of the variation in the relationship between soil test P and runoff P evident in Fig. 7.

Environmental risk prediction from soil test P and P buffering index

Multiple regression was used to predict total P and dissolved reactive P from soil test P and phosphorus buffering index, first with the data from plots covered with shadecloth to minimise confounding from soil erosion. The resultant equation was then used without re-parameterisation on independent data from plots without shadecloth but high vegetative cover, which we took to represent good pasture management. In the model providing the best fit, the runoff P responses to both soil test P and phosphorus buffering index were curvilinear (Eqs. 4, 5), which makes it risky to apply the model to data beyond the range used for parameterisation.

The model gave satisfactory predictions of P concentrations in runoff when applied to the independent data with soil test P < 100 mg kg−1 and vegetative cover >90 % and no shadecloth (Fig. 10).

The coefficient of determination for total P (R 2 = 0.91) using Eq. 4 and for dissolved reactive P (R 2 = 0.94) using Eq. 5 were superior to those using the model published by Dougherty et al. (2010) when parameterised with the present data, but the relationships were nevertheless highly significant:

$$ \begin{aligned} {\text{Log}}\;{\text{dissolved}}\;{\text{reactive}}\;{\text{P}} & = -4.71 + 1.038 \cdot (\log {\text{Colwell}}\;{\text{P}} \\ & \quad + 0.15 \cdot (\log {\text{Colwell}}\;{\text{P}} \cdot \log {\text{phosphorus}}\;{\text{buffering}}\;{\text{index}}) \\ & \quad (R^{2} = 0.76,P < 0.001) \\ \end{aligned} $$
(6)
$$ \begin{aligned} {\text{Log}}\;{\text{total}}\;{\text{P}} & = - 3.90 + 1.625 \cdot \log {\text{Colwell}}\;{\text{P}} \\ & \quad - 0.1869\,(\log \;{\text{Colwell}}\;{\text{P}} \cdot \log {\text{phosphorus}}\;{\text{buffering}}\;{\text{index}}) \\ & \quad \quad (R^{2} = 0.85,P < 0.001) \\ \end{aligned} $$
(7)

These results confirm that soil P guidelines for environmental management need to reflect both soil P capacity and buffering. This is the conclusion reached by Gourley et al. (2007) for agronomic management in Australia, where current practice is to measure Colwell soil test P (a capacity test) and phosphorus buffering index and from these determine a critical concentration using Eq. 8:

$$ {\text{Critical}}\;{\text{P}}\,\left( {\text{Colwell}} \right) = 19.6 + 1.1\left( {{\text{phosphorus}}\;{\text{buffering}}\;{\text{index}}^{0.55} } \right) $$
(8)

Calculations using Eq. 8 result in critical concentrations of Colwell P for agronomic management ranging from 29 mg kg−1 for soils from very low phosphorus buffering index (36) through to 40 mg kg−1 for moderately buffered soils (141–280), and 55 mg kg−1 for soils with high phosphorus buffering index (281–840).

Management guidelines

The first step to reducing P concentrations in runoff is to maintain greater than 90–95 % ground cover, which is greater than recommended for managing soil erosion. Where ground cover is low because of P-deficiency, pasture improvement should be undertaken including the use of P fertiliser.

Where cover is high, fertiliser use should be guided by measurements of both soil P capacity and buffering. In Australia, where the Colwell soil test is common, we suggest that Eq. 8 (Gourley et al. 2007) can be used to provide critical P concentrations for environmental risk assessment that have been normalised for phosphorus buffering index.

Available soil P can be raised with fertiliser to improve agronomic production without fear of significantly increasing P losses in runoff provided soil test P remains below the agronomic critical concentration defined by the soil phosphorus buffering index. This supposition needs to be verified experimentally, but in the meantime it provides a defensible approach to environmental risk management.

The Colwell test uses a sampling depth of 10 cm for agronomic application. Although surface runoff is thought to interact mainly with the top few mm of soil (Sharpley 1985), values for soil test P in the 0–2 and 0–10 cm depths are highly correlated (Fig. 1), so the vast amount of soil P data collected for agronomic management should also be useful for environmental management.

Where P-fertiliser can be applied to pasture with little risk of significantly increasing P in runoff, care is still needed to avoid the ‘incidental losses’ associated with recently applied fertiliser (or manure). Once fertiliser P has been sorbed and entered into the various mineral and organic pools, it is much less likely to be mobilised than freshly applied fertiliser (Hart et al. 2004; Davis et al. 2005). Thus, it is management of the fertiliser that matters for environmental protection of water quality, not the concentration of P in the soil provided it remains below the critical value and soil erosion is managed by retaining good vegetative cover.