Main

Total emissions of oxidized plus reduced N from intensive agriculture and the burning of fossil fuels increased markedly from the middle of the twentieth century in industrialized nations7. There is strong evidence from comparisons of similar habitats along N deposition gradients3,8 that these increases have led to declining biodiversity in semi-natural ecosystems through acidification and eutrophication. These ‘space-for-time’ studies assume that air pollution has only increased, and that the deposition gradient is representative of a unidirectional temporal shift in grassland biodiversity. Since the late 1980s, however, measures to reduce atmospheric pollution have successfully reduced UK emissions of NOx by ~50% and of sulfur (S) by ~90% (ref. 9). Quantifying the potential recovery of biodiversity in response to reducing air pollution requires an alternative to the space-for-time approach, ideally monitoring long-term community dynamics on permanent plots2. In this context, the Park Grass Experiment at Rothamsted, which started in 1856, presents a unique opportunity to study shifts in biodiversity in response to environmental change both pre- and post-industrialization6.

Park Grass consists of permanent plots with different fertilizer treatments that were established on a uniform pasture that was at least 100 years old in 1856. In the early 1900s, most plots were divided in two, and lime was applied to one half—designated the limed (L) or unlimed (U) sub-plots. In 1965, the limed sub-plots were further split into sub-plots ‘a’ and ‘b’, and the unlimed sub-plots were further divided into sub-plots ‘c’ and ‘d’. Since this time, varying amounts of lime have periodically been added to maintain a target pH of 7, 6 and 5 for sub-plots a, b and c, respectively; sub-plot d is left unlimed (Extended Data Table 1). The liming treatments mean that the eutrophication effect of atmospheric N deposition on plant community dynamics can be quantified independently of soil pH (which also responds to changes in S deposition). Park Grass is in a semi-urban environment, close to a road and on the edge of the town of Harpenden, which act as local sources of atmospheric pollutants4,10. Local measurements of ammonium and nitrate deposited in rainfall show that they have both declined by a comparable amount since 1985, and reflect the current national downward trend in total N emissions (Fig. 1). Our measurements did not include all species of N, including ammonia; however, estimated total N deposition for grassland at this site in 2010–2012, including all N species, was ~21 kg ha−1 yr−1 (http://www.apis.ac.uk) compared with ~45 kg ha−1 yr−1 measured in 1996 (ref. 4). Analysis of the N concentration in archived herbage samples from the first hay cut from the plot that has never received any fertilizer inputs also showed a significant decline in percentage N since the 1980s (Fig. 2).

Figure 1: Changes in atmospheric N deposition, pH and proportion of legumes on the limed and unlimed sub-plots of the Park Grass nil plot.
figure 1

a, Changes in wet (×) and dry (+) N deposition; line indicates moving 5-year average (the small increase in the early 2000s may be a legacy of a run of mild winters in the 1990s). b, Change in proportion of legumes (by dry weight) measured in the first herbage cut. Lines indicate the change in decadal average of percentage legumes on the limed plot (blue circles, blue dashed line), or unlimed plot (red circles, red dashed line). The plots had an average pH over the period of 7.0 when limed (blue diamond, blue continuous line) and 5.2 when unlimed (red diamond, red continuous line).

PowerPoint slide

Source data

Figure 2: Change in N concentration measured on archived herbage samples taken from Park Grass sub-plot 3d between 1960 and 2012 that has never received any external inputs of lime or fertilizer.
figure 2

A split-line regression fitted to the data has an intersection point of 1980 (95% confidence limits, 1976 and 1988), with a significant decline after 1980 (R2 = 0.67, P < 0.001 using least squares linear regression). During the sampling period, legumes never exceed 5% of total biomass, and the contribution of nitrogen fixation to N in the herbage is therefore expected to be minimal.

PowerPoint slide

We analysed data on relative biomass of vascular plant species sampled on a range of sub-plots between 1903 and 2012, with a focus on both the ‘nil plot’ (plot 3), which has never received any fertilizers, and the ‘transition plots’ (Extended Data Table 2). The latter received 96 kg N ha−1 (plus P, K, Na and Mg), either as ammonium sulfate (plot 9) or sodium nitrate (plot 14) until 1989, when the plots were split. Since then, no further N was applied to one-half of the plots (now 9/1 and 14/1). The original treatment continued on the remaining halves (now plots 9/2 and 14/2). Generalized linear models (GLMs) and mixed models (GLMMs) were used to quantify the effect of changes in wet atmospheric N deposition (measured as either a 3- or 5-year moving average) on the proportion of plant functional groups, species richness and the exponent of the Shannon diversity index (eH′). Temporal trends in relative species abundance and dissimilarity between plots were also analysed using multivariate methods.

On the nil plot, the proportion of legumes tracked changes in atmospheric N deposition, declining to low relative abundance at the end of the twentieth century before showing a degree of recovery over the recent sampling period (Fig. 1 and Extended Data Table 3). The addition of lime also increased the proportion of legumes and forbs at the expense of grasses. A decrease in pH was observed between 1985 and 1991 resulting from the deposition of S and N that was not always compensated for by the addition of lime11 (Extended Data Table 1). It is likely that this contributed to some of the observed decline in the proportion of legumes at this time. However, in the recent sampling period (1991–2012), pH has largely remained constant while N deposition has continued to decline, and pH was not significantly correlated with wet N deposition in any of the models, allowing them to be treated as independent variables. Comparisons of species richness between historical sampling periods are confounded by the fact that the area sampled and protocol used has changed through time. However, as the Simpson diversity index has a low sensitivity to sample size, it can be used as an indication of temporal trends in species diversity on Park Grass (Fig. 3). A decline in diversity was observed on the nil sub-plots between the 1940s and 1990s—declines were steeper on the unlimed sub-plot because of the combined effect of eutrophication and acidification. The latest samples, taken since 2010, show diversity is recovering, although it is still at levels below those recorded in the 1930–1940s on the unlimed nil plot (Extended Data Table 3). An analysis across all the individual sub-plots sampled confirmed the positive effect of decreasing atmospheric deposition on plant species richness and diversity, as well as an increase in the proportion of legumes (Extended Data Table 4).

Figure 3: Historical trends in Simpson’s plant diversity index between 1910 and 2012 for unlimed and limed sub-plots.
figure 3

a, b, Results are from unlimed (a) and limed (b) sub-plots. Decadal mean and s.e.m. are presented for: plot 3 (no fertilizers; blue circles), plot 7 (PKNaMg; green circles), plot 9/2 (PKNaMg plus 96 kg N ha−1 applied as ammonium sulfate; red circles), plot 14/2 (PKNaMg plus 96 kg N ha−1 applied as sodium nitrate; purple circles), plot 9/1 (N withheld since 1989; red open circles, dashed line) and plot 14/1 (N withheld since 1989; purple open circles, dashed line). For the limed plots in b, the data post-1965 are from the ‘a’ sub-plots, as they are closest to the pH maintained on the limed half of the plots before they were split. The relatively high value for plot 14/1d compared to plot 3d and 7d in the 1990s can be explained by the temporary increase in diversity during the transition period. Plot 7a was not sampled between 2010 and 2012.

PowerPoint slide

The expected increase in diversity and directional shift in species communities on the transition plots after the cessation of N fertilization in 1989 was observed, except on plot 9/1d, which continues to be constrained by very low soil pH (Figs 3 and 4). Plot 7, which receives the same amount of the other nutrients as plots 9/1 and 14/1 but has never had any N fertilizer additions, can be viewed as the plot towards which the transition plots should be moving. In the case of the b sub-plots, this appears to be the case, but plot 9/1d is only recovering very slowly from a low pH, and 14/1d appears to still have a community that is intermediate between 14/2 d and 7d. Over most of the recent sampling period, the plant community dynamics on plots 9/2 and 14/2, which continued to receive N, unexpectedly showed a temporal trend that was largely parallel with the transition plots (Extended Data Figs 1 and 2 and Extended Data Tables 5 and 6). This suggests that the effect of withholding N fertilizer became apparent within the first few years of the treatment change, and since then all the plots have been responding to the same underlying environmental trend. A comparison of multivariate analyses using either year or ± N fertilizer as the explanatory variable showed that the species that responded to withholding N on the transition plots were very similar to those driving the temporal trends in plant communities observed on the wider experiment (Extended Data Fig. 3). In particular, abundance of the legume species, Trifolium pratense and Lathyrus pratensis, and the forbs, Plantago lanceolata and Ranunculus acris, all increased significantly when N fertilizer was withheld and also across the whole experiment with time. This was confirmed from the analysis at the sub-plot level for plots 3, 9 and 14 (Extended Data Table 7).

Figure 4: Temporal trends in plant communities, between 1991 and 2012, on selected Park Grass plots.
figure 4

ad, Redundancy analysis ordinations are presented for plot 9b (a), plot 9d (b), plot 14b (c) and plot 14d (d), including transition plots (purple circles) and plots that have continued to receive fertilizer N (grey circles). In each case, the samples from plot 7b or 7d have been included as having the species composition to which the transition plots are moving towards (green circles). Sub-plots a and c are excluded as they were not sampled on plot 7 in 2010–1012. The size of the symbols is proportional to the numbers of species in each sample, and the relative proportion of the plant functional groups have been projected as supplementary variables.

PowerPoint slide

Source data

The positive responses of plant diversity to decreasing atmospheric deposition and N fertilizer inputs on Park Grass shows that grasslands have the capacity to recover from the negative effect of eutrophication, particularly where the confounding effect of decreasing pH had been removed by applying lime. The fact that legumes showed the strongest temporal response (coinciding with measured reductions in the N concentration of the cut herbage on the nil plot) supports the view that reducing N deposition was a causal factor of the observed community dynamics. However, only wet N deposition was included in our models, and we were unable to quantify the contributions from dry deposition and other species of N over a sufficient time period, including ammonia and nitric acid. Sulfur emissions have also declined since the 1980s, and although the liming treatments mean that the indirect effect of changing S deposition on soil pH can be treated independently of the eutrophication effect of N deposition, we cannot fully discount the direct nutritional effect of S. However, the plots included in the analysis with K, Na and Mg as part of the fertilizer treatment also receive up to 122 kg S ha−1 yr−1, meaning they are unlikely to be limited by S.

The continuity of the experimental treatments on Park Grass, together with the measurement of atmospheric chemistry and plant community data on the same local scale, avoids some of the problems associated with attributing large-scale ecological changes observed in national vegetation surveys to anthropogenic drivers12. This may partly explain why a clear signal of a recovery of plant diversity from eutrophication has yet to be detected in the wider landscape13,14. However, it is also the case that the magnitude of local scale reductions in N deposition we observed at Rothamsted are not yet reflected at the national scale to the same degree9. Interpreting changes on the Park Grass Experiment more widely in the context of comparisons with other grassland studies and its relevance to the wider landscape must also take into account the specific management context. The twice-yearly mowing and removal of biomass on Park Grass may explain the relatively rapid transient dynamics observed on the experiment when compared to equivalent studies in systems with less disturbance, leading to the accumulation of litter5, or dominated by slower growing, woody species15,16. In addition, the close proximity of plots with differing plant communities means that limitation of propagules is not likely to be as important a constraint in the recovery of the communities as may be the case for larger scale grassland restoration17. Despite these considerations, the Park Grass Experiment remains a unique indicator of the effects of environmental change and an important part of the evidence base for assessing the biological effects of changes in management or policy on the wider environment.

Methods

Description of Park Grass Experiment and vegetation sampling protocol

The Park Grass Experiment was established on old grassland at Rothamsted in 1856 to examine the effects of different mineral fertilizers and organic manures on productivity of permanent pasture cut for hay6. The experiment is located on a moderately well-drained silty clay loam overlying clay-with-flints, a chronic or vertic Luvisol according to the FAO classification. The soil pH was slightly acidic when the experiment began (5.4–5.6), and the nutrient status was poor. The original vegetation of Park Grass was classified as dicotyledon-rich Cynosurus cristatus–Centaurea nigra grassland; one of the mesotrophic grassland communities in the British National Vegetation Classification system18. Treatments imposed in 1856–1865 included controls (nil, no fertilizer or manure), and various combinations of P, K, S, Mg and Na, with N applied as either sodium nitrate or ammonium salts. Each plot (ranging from 75 to 634 m2) now consists of plant communities adapted to the fertilizer treatments naturally assembled from the local species pool. Farmyard manure was applied to two plots but was discontinued after 8 years, because when applied annually to the surface in large amounts it did not decompose quickly and had adverse effects on the sward. Farmyard manure, applied every 4 years, was re-introduced on three plots in 1905.

The experiment consists of 20 main plots. The plots are cut in mid-June and made into hay. For 19 years, the re-growth was grazed by sheep penned on individual plots, but since 1875 a second cut, usually carted green, has been taken in place of grazing. The plots were originally cut by scythe, then by horse-drawn and then tractor-drawn mowers. Yields were originally estimated by weighing the produce, either of hay (first harvest) or green crop (second harvest), and dry matter was determined from the whole plot. Since 1960, yields of dry matter have been estimated from strips cut with a forage harvester. However, for the first cut the remainder of the plot is still mown and made into hay, continuing earlier management and ensuring the return of seed. For the second cut, the whole plot is cut with a forage harvester. A small amount of lime, 4 t CaCO3 ha−1, was added to all plots in the late 1880s. Most plots were divided in two in 1903 or 1920 to introduce a test of regular liming on one half. In 1965, they were further divided into four sub-plots (a–d). The a, b and c sub-plots now receive lime every 3 years, if necessary, sufficient to maintain a target soil pH of 7, 6 and 5, respectively. The d sub-plots are unlimed. In 1989, the plots receiving 96 kg N ha−1 were split, and nitrogen fertilizer withheld from half of the plots to investigate the ability of the plant communities to recover from chronic nitrogen addition.

Vegetation surveys have been carried out on Park Grass on more than 30 occasions since the experiment began19. The original botanical sampling protocol was to take handfuls of cut herbage at regular intervals from every swath after the scythe or cutting machine. Each sample was then sub-sampled until a weight of approximately 12–20 lb (5.4–9.1 kg) was obtained. For the samples taken between 1973 and 1976, samples were cut by hand every two to three paces along ten transects on the larger plots and six on the smaller plots. Approximately 600 g of material was analysed from each sub-plot. Between 1991 and 2012, above-ground biomass from six randomly located 50 × 25 cm quadrats was sampled from all sub-plots using a standard protocol. The herbage was cut with scissors to ground level in early June, immediately before harvesting the first hay crop. The plant material was taken back to the laboratory where it was sorted into species. Samples were oven-dried at 80 °C for about 24 h, after which dry mass was determined for each species. Data from the six quadrats were aggregated to provide an estimate of species richness for each plot in each year.

Monitoring of atmospheric nitrogen deposition: wet ‘bulk’ deposition

The methods of collection and the amount of data available have varied over time. Initial precipitation data (1853–1968) were collected using a rain gauge with a surface area of one-thousandth of an acre (7 ft 3.12 in. × 6 ft, or 4.04 m2). The gauge being constructed at ground level of lead supported by wood over a brick lined cellar housing four collection tanks from which a sample of rain water was taken. From 1969 to 1986, precipitation was collected in what is described as a ‘simple funnel-and-bottle bulk gauge’20. In the latter years, 1986 to present, precipitation has been collected in a bulk rain water collector of a design described previously21. All these collection methods took place within the Rothamsted meteorological enclosure, which is located approximately 817 m east-northeast of the Park Grass Experiment. The amounts of nitrate-N and ammonium-N, in mg l−1 in solution, were then determined. The amount of nitrogen in kg ha−1 deposited by wet deposition is determined by the formula: kg ha−1 = (mg l−1 × R)/(A × 105), in which R is the amount of rain water collected in mm, and A is the surface area of collector funnel in m2. The total amount of nitrogen was then calculated for each year.

Monitoring of atmospheric nitrogen deposition: dry deposition

Nitrogen deposition in the form of NO2 was collected passively using diffusion samplers over an exposure period of 2 weeks. The samplers are made up of a 30-μ l aliquot of 20% triethanolamine/water absorbent sandwiched between two stainless steel meshes housed in a coloured thermoplastic rubber cap. Into this cap, a 70-mm-long × 11-mm-diameter acrylic tube is inserted with a protective white thermoplastic rubber cap on the opposite end. These samplers were then placed, in sets of three, at locations around the edge of the Park Grass Experiment at a height of 1.5 m above ground and the protective cap removed. After 2 weeks, samplers were sealed using a protective cap and collected. They were then extracted into 2 ml 18.2 MΩ RO (reverse osmosis) water and analysed for nitrite N (NO2-N) by continuous colourimetric flow analysis. The resulting levels of NO2-N in μ g N m−3 were then averaged over the year and converted to kg N ha−1 by multiplying by the deposition velocity for managed grassland at Rothamsted (0.3751 mm s−1)22.

Analysis of herbage samples for nitrogen concentration.

Representative sub-samples of plant material from the archived hay or herbage samples on sub-plot 3d were dried at 40 °C for 48 h, ball-milled to a homogenous fine powder, dried again at 60 °C for 24 h and analysed with an elemental analyser (NA 1110; Carlo Erba) interfaced (ConFlo II, Finnigan MAT) to a continuous-flow isotope ratio mass spectrometer (Delta Plus; Finnigan MAT), EA-CF-IRMS. After every tenth sample, a solid internal laboratory standard (SILS) with similar C/N ratio as the respective sample material (fine ground wheat flour) was run as a control. The precision (s.d.) for sample repeats was better than 0.04%. Samples taken from herbage cut between 1960 and 2012 were analysed, before this date, the herbage sampling protocol differed with material dried in situ, which is affected by disintegration losses in the hay making process.

Statistics

For the plots where all the sub-plots were sampled (3, 9 and 14), all sub-sets regression using GLMs was used to identify the model that explained the maximum variability in species richness, eH and the relative proportion of functional groups using only independent explanatory variables with P < 0.05. The following explanatory variables were included: pH, wet atmospheric nitrogen deposition (included both as a 3- and 5-year moving average), total rainfall in the previous growing season (March–August) and rainfall in current growing season (March–May)—rainfall has been found to explain short-term variability in community composition significantly23. For the proportion of the different functional groups (legumes, grasses and ‘other’), a binomial distribution with a logit link function was used to allow for the variability in the total first cut biomass to be accounted for. The proportion of each functional group was analysed separately. A normal distribution with an identity link was used for species richness and e H ' except for the acid plots with a high frequency of low species counts, in which a Poisson distribution with a log link was used. As opposed to a step-wise approach, all sub-sets regression analyses included all possible combinations of explanatory variables, using the adjusted R2 and Mallows’ Cp as criteria for comparing models.

For the nil treatment (plot 3), that has never received any fertilizers, data were available on relative proportion of functional groups that covered an important historical period from 1903 in which wet nitrogen deposition and pH were also measured on the experiment. This whole data set (n = 70) was therefore included in the models. Although less frequent data were available on species richness and diversity, changes in the area sampled and sorting effort meant that a comparison of data over the whole historical period would not have been valid. However, between 1991 and 2012, a standard area and sampling protocol was used. This time period coincided with the reductions in atmospheric nitrogen deposition observed on the experiment and was, therefore, used to quantify responses of species richness and eH (n = 13) across all the plots.

For the analysis of the data from all sub-plots sampled across a range of fertilizer treatments, a GLMM was used with sub-plot and year input as random factors. Fixed effects were input in the same order to a model previously fitted to explain variance in species richness between fertilizer treatments24: pH, nitrogen addition (three levels; +48, +96 or +144 kg N ha−1) and ±phosphorus before inputting N deposition as a final continuous explanatory variable as either a 3- or a 5-year moving average. Two further fixed effects were initially included, ±potassium and whether the plot was a transition plot, but neither significantly explained any additional variance in any of the diversity metrics. Plot 13 was the only farmyard manure plot in the data set and was not included in this analysis. Both the GLMs and the GLMMs were run using the software GenStat25.

The temporal shifts in plant communities was analysed at the species level using multivariate approaches. To investigate any directional response of the transition plots following the cessation of nitrogen fertilization, the Bray–Curtis dissimilarity index was calculated using the first sample date, 1991, as a reference point and regression models fitted to the data using year as the explanatory variable. This was also done for the plots that continued to receive nitrogen fertilizer with the expectation that these plots would increasingly diverge from the transition plots with time. Nonlinear regression with groups was used to quantify differences in the responses of the Bray–Curtis index to time of the transition plots and those that had continued to receive nitrogen. Redundancy analysis, using rainfall in the current growing season as a covariate, was used to identify community shifts over time, using GLMs to identify species that responded significantly to the first ordination axis, constrained by year. This was done for each of the sub-plots separately for plots 3, 9/1, 9/2, 14/1 and 14/2. Finally, the species responding to withholding N fertilizer were compared with those driving the temporal responses using two additional partial canonical correspondence analyses. First, data from 1991–2012 for plots 9 and 14 were analysed with year input as a categorical covariate and ±nitrogen as the explanatory variable. Second, the data from all the sub-plots sampled from 1991–2012 were analysed, excluding the transition plots (9/1 and 14/1) with plot input as a covariate, and year as a continuous explanatory variable. Only species that were recorded at least three times at the level of the sub-plot were included in the analysis. The software, Canoco 5, was used for all the multivariate analyses26.