Introduction

In the face of environmental degradation due to agriculture intensification, many European countries adopted Agri-environment Schemes (AES) as a response (Batáry et al. 2015). Among those schemes, flower strips are often proposed as a strategy to enhance flower-feeding insect biodiversity (Haaland et al. 2011) as well as ecosystem services, such as pollination (Nicholls and Altieri 2013; Uyttenbroeck et al. 2017), natural pest control (Landis et al. 2000; Hatt et al. 2017, 2018) and improvement in landscape aesthetics (Junge et al. 2015). The type of strips and their management may vary between countries, depending on their policy (Haaland et al. 2011). Several countries adopted AES prescription for perennial field margins made of indigenous forbs and grasses managed by mowing, with the aim to keep meadow-like vegetation (Smith et al. 2010; Haaland et al. 2011; Tarmi et al. 2011). In this way, Walloon (South Belgium) AES management prescriptions for wildflower strips include the use of species typical from local lowland hay meadows (Piqueray et al. 2016) and mowing as a management. They therefore aimed at creating habitats analogous to these ecosystems, known to be particularly attracting to insects, but in strong decline in Europe (Ridding et al. 2015; Staab et al. 2015). Doing this, they may contribute to the preservation of species related to this ecosystem, alongside historical site preservation and restoration (Kiehl and Pfadenhauer 2007; Staab et al. 2015). Such strips are likely to provide a further advantage for farmers in the form of forage production (De Cauwer et al. 2006a). However, farmers can be at a loss as to how using forage from their strips (Bruinenberg et al. 2002; De Cauwer et al. 2006a). Some other concerns may rose from farmers adopting wildflower strip AES. One major concern is about the risk to promote noxious weeds (Smith et al. 2010). Despite several implications in terms of farmer acceptance of AES, these multiple aspects have rarely been studied (Uyttenbroeck et al. 2016), but see works by De Cauwer et al. (2006a, b, 2008). It is therefore imperative to know how to implement and manage flower strips in order to make them both effective regarding their aims and acceptable to farmers.

To create and maintain wildflower strips analogous to hay meadows on nutrient-rich cultivated soils reveals somewhat challenging. Indeed, especially the high phosphorus content is known to be unsuitable to the development of species-rich grasslands (Janssens et al. 1998). In such fertile conditions, grasses tend to become dominant at the expense of flowering species (Schellberg et al. 1999; Mountford et al. 2016). Especially, the growth of legumes may be restricted due to their low competitive abilities against grasses in nutrient-rich ecosystems (Zanetti et al. 1996). This in turns affects forage quality, as fewer legumes in the forage can lead to lower protein content. The inclusion of grasses in wildflower strips seed mixture is therefore a questionable management practice (Staab et al. 2015). On the one hand, it may affect the emergence of the sown flowers which would affect both insect attraction and landscape aesthetics improvement. But on the other hand, grass species may be useful at the implementation of field margins, as they can help controlling weed emergence through clonal field occupation (Hansson and Fogelfors 1998). The question arises therefore which proportion of grass to sow in order to promote flowers while maintaining weeds at an acceptable level.

Furthermore, the mowing regime has been shown to be a main driver of plant community composition in grassland ecosystems (Kahmen et al. 2002). Mowing can indeed decrease the inter- and intraspecific competition, therefore permitting the co-existence of numerous species in a small scale (Tälle et al. 2016). In many countries, wildflowers strips are left unmanaged (Scheper et al. 2015). In others, annual late-summer or winter mowing may be prescribed (Haaland et al. 2011; Tarmi et al. 2011). However, a recent study pointed out that early-summer mowing was more efficient to maintain forbs in nutrient-rich contexts (Kirmer et al. 2018). It also offers the advantage that flowers may re-grow in the late-summer and that a second forage harvest may be available to farmers in autumn. This results in a twice-a-year mowing frequency, identified by Uchida and Ushimaru (2014) as an intermediate disturbance regime (between land abandonment and intensification) particularly favourable to maintain plant and insect species richness in agricultural lands. Also, in the absence of soil fertilisation, increased biomass uptake through multiple cuttings is likely to diminish soil nutrient stocks (Oelmann et al. 2007), therefore conducting to soil condition more suitable for the maintenance of meadow species (Critchley et al. 2002). Moreover, adapting mowing date may help to prevent weed infestation (Smith et al. 2010). Forage yield and quality are also likely to be decreased in cases of delayed mowing (Bruinenberg et al. 2002). There is therefore a need to test how mowing regime is able to modulate benefits generated by wildflower strips.

In this study, we aimed to explore, through an experimental design, how management options in grassland-like wildflower strips may influence their ability to provide environmental benefits (flower provision for insects and landscape purposes, reduction of soil nutrient load) and respond to main farmer concerns (noxious weed promotion, forage quantity and quality). We first hypothesize that decreasing grass density should result in increasing flower provision, but should also promote weed development. Second, increasing mowing frequency should increase flower provision, forage quantity and quality and help controlling weeds. It should result in soil nutrient depletion due to increasing forage exportation. Through the verification of these hypotheses, we aimed at determining which option may optimise ecological benefits and farmer acceptance for AES.

Materials and Methods

Study Site and Experimental Design

The experiment was implemented in April 2010. It was located in Gembloux (Belgium, Wallonia; 50°33’45”N; 4°42’22”E; alt. 170 m; annual mean temperature ca. 9 °C, annual rainfall ca. 800 mm). Soil at the site is very fertile (WRB soil group: retisol). It was previously occupied by an intensive arable field devoted to growing cereals and row crops (previous 3-year rotation: potatoes, spelt and winter barley). Two seeds mixtures, differing in amount of grass seed present, were tested (Table 1): (1) the seed mixture with high grass density (Ghigh) was composed of 85% grasses and 15% flowers. This proportion, as well as the applied sowing density (30 kg/ha) correspond to the recommendation of the seed provider; (2) the seed mixture with lower grass density (Glow) had the same flower seed density and composition as Ghigh but the grasses seed density was divided by two. The Glow was therefore sown at 17.25 kg/ha (Table 1). The three mowing regimes applied were: (1) once a year in August–September (MOW1); (2) once every two years in August–September (MOW0.5); (3) twice a year in late June and in August–September (MOW2). MOW1 corresponds to the prescription that was applied in the Walloon wildflower strips AES at the beginning of the experiment. MOW0.5 and MOW2 were introduced as alternative management options. Mowing regime application began in 2011. In 2010 (first year), all plots were mown only in September. Mowing was accomplished by a plot harvester (Haldrup© with Busatis © cutting blade) and forage was removed. The experiment was therefore made of six treatments (2 sown grass densities*3 mowing regimes). It contained four replicates, corresponding to four blocks, and was therefore composed of 24 experimental plots. Plot size was 60 m² (6 × 10 m). Within each block, plots were placed according to a strip-plot design, with seed mix placed longitudinally and mowing regimes transversely within each block (Fig. 1).

Table 1 List of species and sowing densities in the Ghigh and Glow modalities
Fig. 1
figure 1

Schematic representation of the experimental design in Gembloux (Belgium; 50°33’45”N; 4°42’22”E). Grass buffer zones were maintained to facilitate circulation and working in the experimental plots

Floristic Survey

In each of the 24 plots, six permanent 1 m²-quadrats were placed for vegetation surveys. In these quadrats, the horizontal cover of all the sown species was recorded each year from 2010 to 2014 in early June, at the peak of vegetation. Unsown weed species were not recorded individually, but instead as a global problematic weed species cover (annual arable weeds, Cirsium arvense and Rumex crispus/obtusifolius). As indicators for flower-feeding insects and landscape interests, we computed the following metrics:

  • Flowering plant species richness (insects)

  • Total flowering plant cover (insects and landscape)

  • Flowering plant Shannon’s diversity (insects)

  • Flower colour Shannon’s diversity (landscape)

Flower colour diversity indeed proved to be an important feature in landscape aesthetics (Stilma et al. 2009; Junge et al. 2015). Flower colour was assessed based on field observation and is provided in Table 1. To compute the colour diversity, horizontal cover of the species with identical colour was summed to obtain a total cover for each colour. Species richness and Shannon’s diversity were calculated, as they have been shown to be relevant for flower-feeding insects in previous studies (Ebeling et al. 2008; Fründ et al. 2010).

Forage Yield and Forage Analyses

The plot harvester directly provided fresh matter yields (FMY) for each mown plot, each year between 2010 and 2014, in three distinct 1.5 × 6 m strips within the plot. In each plot, one ca. 1 kg fresh forage sample was taken at the moment of mowing. It was weighed and then dried in an oven (65 °C for 36 h) in order to measure dry matter content (DMC) and subsequently calculate dry matter yield (DMY).

In 2014, dried samples were ground in a hammer mill (1 mm screen; Waterleau, BOA, Belgium) and then ground again in a Cyclotec mill (1 mm screen; FOSS Electric, Hillerød, Denmark). They were then submitted to NIRS analysis (XDS spectrometer, FOSS Electric, Hillerød, Denmark), and the absorption data was recorded as log 1/R from 1100 to 2498 nm, every 2 nm (WINISI 1.5, FOSS Tecator Infrasoft International LCC, Hillerød, Denmark). Chemical characteristics of forage, i.e. proteins (% DM), fibres (% DM) according to the Van Soest method, i.e. neutral detergent fiber (NDF) and acid detergent lignin (ADL) and digestible organic matter (DOM, % DM, cellulase method) were then estimated using the NIRS calibrations previously developed at CRA-W for hay and grass (Table 2). The correspondence between the predicted sample and the NIR spectral database was evaluated through the standardised H value (distance between the predicted sample and the centroid of the spectral database) according to Shenk and Westerhaus (1991). NIRS predictions were correct when averaged H value of predicted samples was lower than 3. For the MOW2 regime, retained values were the means weighted by DMY at each date (June and September).

Table 2 NIRS calibration characteristics for estimating the chemical composition of wildflowers strips

Soil Analyses

In March 2014, at the resumption of the growing season, soil samples were collected in each experimental plot (n = 24) using a 2 cm-diameter auger. Five soil samples were randomly collected at 20 cm of soil depth. The five soil samples were merged in a composite sample.

Mobile nitrogen (N-NO3) was measured on fresh samples sieved to 8 mm. Concentrations were determined in a soil-solution mixture at 1:5 w:v ratio after extraction with KCl (0.1N) and agitation for 30 min. After a 30 min decantation and subsequent filtration (filter: Whatman© 602H1/2), nitrate was reduced into nitrite using a cadmium column. Nitrite was analysed by the modified Griess-llosvay method (Bremner 1965; Guiot 1975). All soil samples were dried at 40 °C and sieved to 2 mm. A subsample of each sample was finely ground (<200 µm) for C and N analyses. Total organic carbon (Ctot) and total nitrogen content (Ntot) were measured by dry combustion (ISO10694 and ISO13878, respectively for Ctot and Ntot). The available potassium (Kav) and phosphorus (Pav) concentrations were determined in a soil-solution mixture at a 1:5 w:v ratio after extraction with CH3COONH4 (0.5 M) and EDTA (0.02 M) at pH 4.65 and agitation for 30 min (Lakanen and Erviö 1971). The concentration of K was measured by flame atomic absorption spectrometry while the concentration of P was measured by colourimetry at 430 nm.

Data Analyses

We tested effects of sown grass density, mowing regime, and their interaction using mixed-effects models with block as a random effect. When yearly data was available (i.e. floristic data and forage yield), year effect was included in models. Analyses were also computed for each year separately and for total (sum) forage yield values. ANOVAs were computed on mixed-effect models in order to determine effects significance. Analyses were computed using Minitab 16 (Minitab Inc.).

Results

Flowering Species Response

Our models revealed year effects (P < 0.001) for all tested variables where yearly data was available (Table 3). Concerning flowering species related variables (abundance, richness, diversity and colour diversity) interaction between year and mowing regime was significant.

Table 3 ANOVAs for variables measured at each of the 5 years of the experiment (2010–2014)

Species richness was significantly influenced by mowing regime only in 2013 (P < 0.05, Table 4) and 2014 (P < 0.01, Table 4). It tended to decrease with time, whatever the mowing regime. However, the decrease tended to be lower with increasing mowing frequency. Consequently, in the last year (2014), the species richness was the highest in MOW2 and the lowest in MOW0.5. MOW1 had an intermediate value. The response was similar for plant diversity, but to a lower extent. In 2014, it was also slightly higher for MOW2, but only with marginal significance (0.05 < P < 0.1, Table 4). Also, colour diversity responded this way. However, in this case, the highest value was observed for MOW0.5 in 2014. In 2012, the interaction between mowing regime and seed mix was significant for both colour and plant diversity (P < 0.05, Table 4). Flowering plant cover was influenced by sown grass density in the first three years 2010–2012 (Table 4), with higher values for Glow seed mix (Table 4). In the last year (2014), difference in flowering plant cover was only due to mowing regime, with the highest value for MOW2 (P < 0.01, Table 4).

Table 4 Mean values (±SD) by mowing regime and by seed mix of all tested variables

Weed Cover

Weed cover showed a yearly variation, but was not influenced by management options. It was the highest in the first year (2010). The lowest values were observed in 2011 and 2012 (0.9% and 1.4%, respectively). It then increased again in 2013 and 2014 until 9.2% in average (Table 4), mainly due to Cirsium arvense expansion.

Forage Quantity and Quality

The mowing regime significantly influenced the forage DMY (P < 0.05, Table 4), except in the first year (2010) when a single mowing regime (once in September) was applied to all plots. Over the 5 year period, wildflower strips cut twice a year (MOW2) had the highest DM forage production compared to the low mowing rate. Grass proportion in the seed mix did not influence the forage DMY (Table 4).

As confirmed by the averaged H value of predicted samples lower than 3 (H = 2,44), NIRS calibrations developed from hay and grass samples could be used for predicting wildflower strip characteristics. Cutting regime of wildflower strips significantly impacted the chemical characteristics and the digestibility of forage. Forage from MOW2 had a higher protein content (P < 0.001, Table 4), lower fibre content (NDF) (P < 0.001, Table 4) and lower indigestible fibre content (ADL) (P < 0.01, Table 4). Finally, the digestible organic matter (DOM) was significantly higher in MOW2 than in MOW0.5 and MOW1 (P < 0.001, Table 4). As in the case of DMY, decreasing the proportion of grass seeds in the seed mixture did not influence the chemical composition of the forage.

Soil Nutrients

Plots with the highest mowing rate tended to experience a Kav depletion. For this nutrient, we observed contents varying from 23.8 mg/100 g in MOW2 to 27.7 mg/100 g in MOW1 29.2 mg/100 g in MOW0.5 (P < 0.05, Table 4). None of the other soil variables were affected by management options.

Discussion

Grass Seed Density Effect

Our study revealed effects of grass seed density on flowering plant abundance in the first three years, with a higher flower cover in the seed mix with reduced grass proportion. This was primarily due to improved development of the legumes Lotus corniculatus and Trifolium pratense during the first years after strip implementation (Table 5). This is congruent with the general low competitive ability of legumes under eutrophic conditions (Zanetti et al. 1996). However, this was not observed beyond the fourth year after sowing, as mowing regime effects became more and more pronounced and overwhelmed initial sowing conditions. No effect was found on colour and species diversities. It was previously shown by Staab et al. (2015) that detrimental effect of grass on flower species diversity appeared when grass biomass proportion is higher than 90%. In our case, it is likely that this threshold was not reached even with our higher grass density.

Table 5 Mean species cover each year, by mowing regime and by seed mix

Contrarily to our initial assumption, grasses did not hamper noxious weed, as weed cover was not significantly influenced by grass proportion in the seed mix. Earlier studies on weed suppression by flower strips showed that weeds only overdevelop in the case of spontaneous unsown flower strips, while sown flower strips efficiently cover the soil against weeds (Denys and Tscharntke 2002; De Cauwer et al. 2008). After the third year, such a conclusion could have been driven from our experience, as annual weed emergence on the first year was rapidly pulled up to a ca. 1% weed cover. However, from the fourth year, weed cover increased again to ~10% on average on the fifth year, i.e. the end of AES commitment. This was moreover mainly due to C. arvense, a particularly pernicious species that is likely to spread into arable fields where it is difficult to control (Tiley 2010). This rather high cover level is therefore likely to discourage farmers from continuing with their implemented flower strips.

Mowing Effect

Colour diversity, reflecting potential for landscape aesthetics (Junge et al. 2015), was the highest under the MOW0.5 mowing regime after 5 years of application. A tradeoff is therefore observed with flower abundance and richness that were the lowest under this mowing regime. The highest values for these variables were observed under the twice-a-year mowing regime. T. pratense was one of the main drivers for this pattern, as it responded the best to the MOW2 regime (Table 5). This result is particularly relevant as this species is considered to be a keystone species for bumblebee conservation in Europe and could facilitate the pollination of wild and cultivated plants (Kleijn and Raemakers 2008; Rundlöf et al. 2014).

The increased abundance of this species in the MOW2 regime may also partly explain the better forage quality in those plots. Of course, it is well known that digestibility correlates positively with the utilisation frequency, either mowing or grazing (Gardarin et al. 2014). Also, the stage of maturity of individual species when forages are mown, can explain the difference in digestibility (Bruinenberg et al. 2002). However, most species in our study, including the dominant Festuca rubra, belong to the plant functional type (PFT) C according to the classification proposed by Cruz et al. (2002). That type is characterised by low digestibility values in the beginning of the growing season but a slower decline during the growth of the plant. Therefore, over-maturity may be only part of the explanation for the difference in forage quality, species composition being another one. Trifolium pratense is known for its participation in DMY (De Cauwer et al. 2006a) and is frequently used in hay meadows with multi-cut management (Halling et al. 2004). Its digestibility is commonly high and less dependent on the ageing of the plant. Moreover, dicotyledonous species, that had higher cover in the MOW2 plots, have lower NDF content and higher pectin content, resulting in better digestibility (Bruinenberg et al. 2002). Forage production is not the main aim of wildflower strips. However, it may lead to a better integration of AES within usual farming procedures, and therefore make AES more acceptable (Sattler and Nagel 2010).

Concerning the soil variables, we only found that mowing regime influenced K availability, in the sense of an increased K depletion with increasing biomass exportation, i.e. the MOW2 regime. Plant offtake plays a major role in K cycle. Repeated mowing without K supply is likely to provoke a K depletion within 3–10 years (Kayser and Isselstein 2005). For other soil variables, installation of flower strips and subsequent biomass exportation through mowing did not result in a clear soil nutrient depletion. It is well known from ecological restoration literature that repeated biomass exportation is unlikely to rapidly reduce availability of these nutrients in the soil (Marrs 1993; Walker et al. 2004; Piqueray and Mahy 2010). In case of conversion from arable land to grassland, P uptake from mowing can fail at diminishing plant-available P, as it can be replaced through mineralisation from larger non-available pools (Walker et al. 2004). Pav content in our study site (ca. 30 mg/100 g) was far higher than the 5 mg/100 g considered as the upper limit for maintaining species-rich grasslands in semi-natural systems (Janssens et al. 1998). We indeed observed a decrease in species richness and flower cover over the 5 years of the experiment, that was limited through multiple mowing. This confirms the need for a rather early mowing date to maintain flowering plant species under fertile conditions (Kirmer et al. 2018).

Conclusions and implications for management

Accordingly to our assumptions, mowing twice a year (in late June and in September) resulted in the most interesting option. It permitted enhancement of flowering plant cover and resulted in better forage production, both in quantity and quality. We showed that this option would likely contribute to decreased K availability in soils, but did not have an impact on other soil nutrients over 5 years. This option should therefore be promoted in wildflower management and was therefore accepted as an alternative management option in Walloon AES following this study. One can argue that a first mowing in late June, at the end of the flowering peak, may be a problem for insect conservation. However, a meta-analysis by Humbert et al. (2012b) revealed that there was generally minimal advantage of delaying the first mowing date beyond early summer in grasslands. Also, several species are able to regrow after late June-mowing, and therefore extend flower availability in late summer (Kirmer et al. 2018). However, there is a great amount of evidence that keeping unmown refuges is of primary importance for insect conservation in grasslands (Humbert et al. 2012a; Lebeau et al. 2015) and therefore probably in grassland-like wildflower strips too. Such zones were efficient at preserving insects as well as insect-mediated ecosystem services (Buri et al. 2014). Therefore, maintaining an unmown refuge zone of min. 3-m width all along the wildflower strips was retained as compulsory management in Wallonia (southern Belgium). In case of a second mowing, the refuge zone of the first mowing has to be maintained or enlarged in order to keep a less disturbed overwintering zone (Schmidt et al. 2008). However, it is recommended that it be moved annually within the wildflower strip in order to avoid species richness decrease due to management abandonment (Schmidt et al. 2008; Uchida and Ushimaru 2014). Mowing regime options other than twice-a-year should not be completely rejected as, with the exception of forage production aspects, they were valuable regarding the tested variables. They can therefore be advantageous in farms without livestock, which is increasingly the case in cropland regions where wildflower strips are often implemented.

None of our hypotheses regarding weed control were verified. Their relative abundance in the first year was due to annual weeds from the soil seed bank. Indeed, soil seed bank is usually dense in arable lands and mainly depends on the former management such as crop rotation or herbicide use (Asteraki et al. 2004; Albrecht 2005). Their emergence in wildflower strips mainly relies on the capacity of sown species to outcompete them (Asteraki et al. 2004; De Cauwer et al. 2008), which was the case in our study in the second and third year. From the fourth year, neither increasing grass seed density nor mowing frequency resulted in a decrease of weed cover. Regardless of the treatment, wildflower tended to shelter noxious weeds after 5 years, notably the thistle Cirsium arvense. This supports the need to test the efficacy of specific management practices such as selective clipping or spraying, both being allowed in the Walloon AES.