Introduction

Both population growth and anthropogenic warming are concentrated in cities. Emerging solutions to these related issues reimagine where and how to grow food. Relocating production to underutilized urban areas like vacant lots and lawns can increase yields (McDougall et al. 2019) while reducing greenhouse gas emissions (Cleveland et al. 2017). Further incorporating useful trees diversifies agricultural outputs while storing more C in wood and soils (De Stefano and Jacobson 2018). Combining the high provisioning services of urban agriculture with the climate regulating services of agroforestry may be possible in Urban Food Forests (UFF). Clark and Nicholas (2013) defined UFF as ‘‘the intentional and strategic use of woody perennial food producing species in urban edible landscapes to improve the sustainability and resilience of urban communities.” Because this broad definition includes isolated trees, other authors emphasize a more systems-based approach by stipulating that UFF should further mimic the structural and functional complexity of natural forests where diverse perennials grow together (Salbitano et al. 2019). Early assessments of UFF are promising. A mature 1 ha food forest can feed 5–6 people per year (Nytofte and Henriksen 2019) while sequestering 40,000 kg of C in living plant tissues (Schafer et al. 2019). Whether or not UFF can approximate the C dynamics of other forest ecosystems while maintaining yields remains unknown.

A major factor that could limit services from UFF is soil function. Soils supply roots with key resources and provide a medium for interactions with microbial symbionts and pathogens. Moreover, soils can store C in more chemically stable forms than living vegetation (Trumbore 2000). However, the physical, chemical and biological properties of both urban and agricultural soils tend to reduce a range of soil functions (Cardoso et al. 2013). Urban soil compaction limits root growth and water availability (Nawaz et al. 2013). Whether or not trees planted in UFF can either tolerate or reverse compaction remains unknown. Furthermore, urban and agricultural soils can have low microbial diversity and activity, which could limit nutrient availability and C cycling (Guilland et al. 2018; Díaz-Vallejo et al. 2021). Finally, UFF explicitly incorporates woody species with contrasting functional traits, which could change fine-scale soil properties relative to more homogenous land cover types, like lawns.

To understand how belowground changes support ecosystem services from UFF, we analyzed fine-scale variation in soil structure and function in an urban land use experiment. The New College Food Forest and Carbon Farm is a small (0.057 ha) demonstration UFF in a rapidly urbanizing area of Florida, U.S.A. Shortly after installation in 2017, we measured soil C at 56 locations across the UFF and adjacent lawn in different vertical strata. We compared C measurements using a novel before-after impact-control (BACI) design that incorporated uncertainty from calibration curves while controlling for spatial autocorrelation in a heirarchical Bayesian framework. Our research objectives were to (1) document differences in physical, chemical and biological aspects of soil function between UFF and adjacent lawn and (2) to analyze fine-scale variation in soil C sequestration within and between land cover types. We expected that conversion to UFF would improve soil function by decompacting soils and increasing nutrient concentrations, microbial activity and richness relative to lawns, which had experienced consistent, low-intensity management for at least 70 years. Furthermore, we expected to observe higher rates of soil C sequestration beneath UFF, with increasing spatial variation driven by contrasting tree traits.

Methods

Site description

The land use experiment took place on the New College of Florida Campus at [27.380446°N, -82.562416°W] in southwestern Florida approximately 100 m east of Sarasota Bay and 600 m west of the Sarasota-Bradenton International Airport. The climate is humid temperate-subtropical with mean annual temperature of 22.8 ℃ and annual precipitation of 1346 mm varying between a hot summer wet season and a cool winter dry season. Local soils are Eau Galle—Myakka Fine Sands, with sandy and loamy marine deposits as the main parent materials (Soil Survey Staff 2023). Prior to urbanization, the predominant land cover types in the area were pine flatwoods, a savannah-like ecosystem maintained by periodic fire with a discontinuous canopy of Pinus elliottii, a dense shrub layer dominated by Serenoa repens, and a rich mixture of grasses and forbs (Sparkman and Bryant 2016). In 1921, the study area was developed by a real estate mogul who constructed an estate with two major buildings and roadways. By 1948, when the oldest aerial imagery that is available today was captured (USDA 1948), several large pines remained but the groundcover on the property was more uniform than surrounding undeveloped parcels, consistent with conversion to lawn (Fig. S1). Lawn management in the United States typically promotes uniform cover by desirable species of grasses through some combination of periodic mechanical mowing, irrigation, and nutrient enrichment while controlling undesirable species with herbicide, fungicide or pesticides (Thompson and Kao-Kniffin 2019).

The establishment of the UFF began in February 2016 with sheet mulching over the existing lawn. Community volunteers then combined locally sourced topsoil, compost and biochar into berms that were approximately 10 cm high and 1 m wide. Starting in May 2016, students planted 50 seedlings representing 33 different useful woody species (Table S1) into rows along the berms along with assorted short-lived woody and pseudo-woody plants, vines, and herbaceous plants. Ongoing maintenance consisted of watering, trail-making, mulching and spreading compost generated by students on campus from food scraps. During this period, the adjacnt lawn area was mowed biweekly at a 6—12 cm height, retruning clippings to the turf and without irrigation or chemical fertilizer application. An informal survey of plants present in lawn habitats identified 36 species, with grasses being the most diverse plant family, represented by at least 8 different taxa, with the most frequently encountered species in the footprint of the study area being the common turfgrass, Paspalum ntoatum, and the introduced subshrub, Sida acuta (unpublished data).

Sampling design

To understand how implementation affected soil function, we employed a novel Before-After-Control-Impact (BACI) sampling design. Originally developed to estimate the environmental effects of unique interventions, BACI compares the magnitude of change in repeated measurements before and after implementation to similar measurements at a control location while statistically controlling for autocorrelation from non-random sampling (Conner et al. 2015). We applied different aspects of the BACI design to our different research objectives. For our primary research objective, which was to quantify fine-scale variation in C sequestration, we established impact and control sampling arrays in the UFF and adjacent lawn, respectively. We use fixed landmarks to collect soil samples at 4 m intervals along each berm for 28 sampling locations in a hexagonal array, which we reproduced in the lawn. We sampled soils in both arrays shortly after construction in 2017 and after two years of managing the UFF in 2019. We controlled for spatial autocorrelation using a hierarchical Bayesian model (see Supplementary Methods).

In addition to representing spatial variation in the horizontal dimensions, we also analyzed changes across vertical strata. At each sampling location and time interval, we removed herbaceous vegetation and detritus and then used a gauge auger to extract a soil core. In the UFF, we split the upper 30 cm of each core into three vertical strata representing the berm stratum of imported soils (0–10 cm), the upper stratum of original lawn soils (10–20 cm) and the lower stratum of original lawn soils (20–30 cm) (n = 84). In the lawn, we only sampled the top 20 cm of soils and split samples into two strata (0–10 cm and 10–20 cm, n = 56). Berm soils should show the greatest impact effect, followed by the original upper stratum and finally the original lower stratum compared to the corresponding control strata in the lawn.

For the secondary research objective, comparing other physical, chemical and biological differences we only sampled after implementation in 2019 but again compared measurements between the impact and control areas using a spatial model to account for autocorrelation. In each area, we added four sampling sites that corresponded to an expanded margin of the UFF and an equivalent area of the lawn for 64 sampling locations. At all locations, we measured soil bulk density (SBD) in every stratum (n = 160). We also measured soil respiration at every location and stratum except for the lowest (n = 96).

For soil nutrients and microbial diversity, we used different sampling approaches. In a random subset of 20 sites from each area, we collected additional soil samples that we physically aggregated and analyzed for soil nutrient concentrations. Finally, we leveraged preliminary results from the soil C analysis to identify six sampling locations—four in the UFF and two in the lawn—from which we remeasured soil nutrients along with microbial diversity. For a summary of the sampling strategy for each dataset and analysis, refer to Table S2.

Soil C measurements, calibration and projection

During the 2017 campaign, we air-dried all 140 samples to constant weight and analyzed them for organic carbon (OC) content (%) using chromic acid wet oxidation (Walkley and Black 1934). During the 2019 campaign, we transferred all 160 samples to sealed polyethylene bags for storage up to two days at 4 °C. After measuring soil respiration from a subset of samples, we dried samples to 105 °C for a minimum of 8 h and calculated Soil Bulk Density (SBD) as the ratio of sample dry mass to fresh volume. We then measured soil organic matter via loss on ignition (LOI). Specifically, we combusted up to 15 mL of dried sample for 4 h at 550 °C in a Lindberg Blue M muffle furnace (Fisher Scientific, Waltham MA). We calculated LOI as the ratio of the combusted sample mass to its initial dry mass.

Because we measured different but related variables in each sampling campaign (OC in 2017; SBD and LOI in 2019), we generated calibration relationships using a stratified dataset. In 2019, we collected an additional 15 samples representing the highest, lowest, and approximate mean LOI from each stratum and area, where we measured SBD and LOI using the methodology described above and Walkley–Black OC using the same analytical lab as in 2017 (Fig. S2). To identify the most adequate calibration relationships, we used the function “lm” in R-package “stats” to fit pairwise regressions between original and log-transformed values of OC and LOI as well as between SBD and OC while including every combination of area, stratum and their interactions as covariates. We selecting the functional form of the relationship with the lowest AIC (Table S3) for further analysis.

Uncertainty from calibration curves and spatial autocorrelation are common in soil C analyses and BACI designs (Conner et al. 2015). To address each of these complications in a coherent statistical framework, we used a Hierarchical Bayesian (HB) approach. HB models are modular and integrate uncertainty from multiple sources into posterior distributions of key quantities (Ogle and Barber 2012). We describe the model in detail in Supplementary Methods S1, which also includes all code and data necessary to reproduce the analysis. We note that this approach tends to broaden credible intervals without biasing the means. Therefore, estimates of the statistical significance of differences between land cover types and years are conservative relative to conventional analyses that would use calibration constants, censor observations or ignore spatial autocorrelation.

To estimate the attainable SOC stocks in the UFF, we used fine-scale variation in C sequestration rates to parameterize a simple mass balance equation for SOC (Morais et al. 2019). The model assumes that the change in SOC depends on the difference between the SOC input (K) and the mineralization rate (α):

$$\frac{d\mathrm{SOC}}{dt}=K-\alpha *\mathrm{SOC}$$
(1)

We calculated these quantities in the UFF by applying the moment-based estimators for the intercept and slope parameters to the spatially detrended estimates of SOC in 2017 and 2019 across all (o = 1…28) sampling locations (See Supplementary Methods, Eqs. S1-S8):

$$\alpha =\frac{Cov\left({\mathrm{SOC}17}_{o},{ {\mathrm{SOC}19}_{o}-SOC17}_{o}\right)}{Var\left({SOC17}_{o}\right)}$$
(2)
$$K=\overline{{{\mathrm{SOC}19 }_{o}-\mathrm{SOC}17}_{o}}-\alpha *\overline{{\mathrm{SOC}17 }_{o}}$$
(3)

In addition to using these values as estimates for SOC input and mineralization in the UFF, we projected SOC accumulation through time to its equilibrium value (K/α) using the solution to the differential equation:

$$\mathrm{SOC}(t)=\frac{K}{\alpha }\left(1-{e}^{-\alpha t}\right)+{e}^{-\alpha t}*\overline{{\mathrm{SOC}17 }_{o}}$$
(4)

where t is time in years. To relate the quantities from the Mass Balance Equation (Eq. 1) to system level regulation of the UFF, we estimated the minimum quantity of mulch necessary to meet the estimated C supply rate (K). Specifically, we used the standard C content of wood of 50% (w/w) (USDA 2011) and bulk density of dry wood chips of 200 kg m−3 (Forest Products Laboratory 2010) to convert supply rate estimates into mass and depth of mulch.

To propagate uncertainty from all aspects of the model, including calibration curve estimation (Supplementary Methods Eqs. S1, S3, S7, S9), and spatial detrending (e.g. Supplementary Methods Eqs. S5, S6), we calculated the SOC input and mineralization parameters (Eq. 1) during Markov Chain Monte Carlo (MCMC) sampling and drew 1200 samples from the MCMC chains for the projection (Eq. 4). For details of model fitting, see Supplementary Methods.

Soil respiration

We measured soil respiration in 2019 as the mass specific respiration rate (MSRR) under standard laboratory conditions. Within 48 h of collection, we allowed samples to equilibrate to room temperature inside 500 ml glass tissue culture flasks, which served as static respiration chambers. We measured initial headspace CO2 concentrations by injecting 10 mL of soda-lime scrubbed air and pumping the syringe three times to mix. We measured the resulting 10 mL headspace sample using a PP Systems EGM-4 Infrared Gas Analyzer (Amesbury, MA) and then repeated the procedure after an incubation of approximately 20 min. We calculated the headspace C masses by applying the ideal gas law:

$$mass\ C\ =12.05\ *\ PPM\ \frac{pressureATM\ *\ (jarVolume\ -coreLength\ *\ \pi \ *\ {augerRadius}^{2})}{(273.15\ +\ labTemp) * R}$$
(5)

where R = 82.05 cm3 atm g−1 mol−1 K−1. We then calculated MSRR as the increase in headspace C min−1 g−1 of dry soil.

To assess the biological relevance of differences in SBD and MSRR between upper strata of the lawn and UFF in 2019, we quantified the frequencies with which samples exceeded US Department of Agriculture thresholds for each soil health metric (Soil Science Division Staff 2017). Soil compaction limits plant growth in sandy soils with bulk densities greater than 1.6 g cm−3 and in sandy loams with bulk densities greater than 1.4 cm−3. Those with respiration rates below 0.032 μg C min−1 g−1 are considered to have very low soil activity relative to a typical value of 0.056 μg C min−1 g−1 in agricultural soils.

Soil nutrients and microbes

To represent differences in soil nutrients, we air dried samples and mailed them to the University of Georgia Soil, Plant, and Water Laboratory for quantification of soil pH, phosphorus (P), potassium (K), calcium (Ca), magnesium (Mg), zinc (Zn), manganese (Mn), iron (Fe), copper (Cu), molybdenum (Mo) and cation exchange capacity (CEC) by Mehlich I sum with percent base saturation. We summarized variation in scaled nutrient concentrations using Principal Component Analysis (PCA) using the function “prcomp” in R v 4.0.2 packages “stats.” (R Core Team 2020).

To describe soil microbial diversity, we collected six total samples using the stratified approach described above. At each location, we collected five cores to 20 cm depth with a sterilized gauge auger. After removing the top 5 cm, the soil was homogenized and frozen at -80 °C. DNA was extracted using a Qiagen DNeasy PowerSoil kit (Qiagen USA, Germantown, MD) following the standard protocol except for increasing the vortexing time to 10 min to improve inhibitor removal. DNA was extracted into 100μL nanopure water. We sent 25μL of total DNA to Molecular Research LP Laboratory (MR DNA, Shallowater, TX) for diversity assays based on DNA barcoding. For bacteria identification, 16S rRNA V4 region was amplified (515/806 primers, Parada et al. 2016) with a HotStar Taq Plus Master Mix Kit following manufacturer instructions (Qiagen, USA). For fungal identification, the ITS region was amplified (ITS1 and ITS4 primers, White et al. 1990). Amplicons were barcoded using bTEFAP (Dowd et al. 2008), adapted for the sequencing technology used here. Sequencing was performed with ThermoFisher Scientific Ion S5, following manufacturers’ instructions. Sequences were processed by removing barcodes, primers, short sequences < 150 bp, sequences with ambiguous base calls and sequences with homopolymer runs exceeding 6 bp. Then sequences were denoised, OTUs were generated by clustering at 97% similarity (3% divergence), and singleton sequences and chimeras were removed. The resulting OTUs were taxonomically classified using BLASTn against a curated database. The number of sequences per taxonomic level were compiled as counts.

We calculated observed richness and the Chao 1 alpha diversity metric from fungal and bacterial samples using the function “estimate_richness” in the R package phyloseq (McMurdie and Holmes 2013) and considered areas different if their estimated standard errors did not overlap. To represent overall differences in community composition, we used Principal Coordinate Analysis (PCoA) of Bray–Curtis distances implemented using “ordinate” in phyloseq. To identify distinctive fungal taxa in each area, we used indicator species analysis as implemented by the “multipatt” function in R package “indicspecies” (De Cáceres and Legendre 2020), correcting for differences in sample numbers in each area. We selected any taxa with marginally significant point biserial correlation coefficients (p < 0.1) for further functional characterization by cross referencing fungal OTUs to the FunGuild Database (Nguyen et al. 2016). We analyzed evidence for functional differentiation by comparing the trophic guild classification of distinctive fungal OTUs in the UFF versus all others using Fisher’s Exact Test in R package “stats”.

Results

Converting lawn to UFF precipitated a dramatic change in soil structure and function after only two years (Fig. 1). The majority of samples from the top 10 cm of lawn soil (24/32) exceeded the soil compaction standard for sandy loams (> 1.4 g cm−3) and five samples exceeded the soil compaction standard for sands (> 1.6 cm−3). In contrast, just 1/32 samples from the same stratum beneath the UFF exceeded 1.4 cm−3, despite the added weight of plants and soils on berms. Taking the spatial structure of the sampling locations into account, the average SBD of the upper stratum of the lawn was 1.478 g cm−3 ([1.435, 1.521] 95%CI), which was nearly double that of the corresponding stratum beneath the UFF, 0.751 g cm−3 ([0.689, 0.820] 95%CI). The differences in the original lower strata were less extreme but still significantly lower under the UFF (Lawn 10–20 cm mean SBD = 1.336 g cm−3 [1.297, 1.375] 95% CI, UFF original 10–20 cm mean SBD = 0.969 g cm−3 [0.885, 1.058] 95%CI). The lowest average SBD occurred in the imported soils in the berm stratum of the UFF (mean SBD = 0.583 g cm−3 [0.534, 0.637] 95%CI).

Fig. 1
figure 1

Urban food forest soils were more active and less compacted than lawn soils. Dashed lines correspond to agronomically relevant soil function thresholds (see Materials and Methods) with long dashes corresponding to the least functional range, and short dashes corresponding to intermediate function. Triangles correspond to samples from the Urban Food Forest and squares to samples from the lawn. Dark fill indicates samples from the imported berm stratum in the Urban Food Forest. Light fill corresponds to the original soil strata in either area

Rapid soil decompaction in the UFF was associated with an agriculturally relevant increase in microbial activity (Fig. 1). Taking the spatial structure of the 32 sampling locations into account, the average mass specific respiration rate in the upper stratum of lawn soil was 0.0015 μg C g−1 min−1 ([0.0012, 0.0019] 95%CI), approximately 20 times slower than the common threshold for the least healthy category of soil microbial activity of 0.032 μg C g−1 min−1. In the corresponding stratum under the UFF, average respiration rates were an order of magnitude faster, 0.0175 μg C g−1 min−1 ([0.0112, 0.0260] 95%CI) and similar to the respiration rate of berm soil (0.0156 μg C g−1 min−1 [0.0100, 0.0230] 95%CI). Among 64 UFF samples, 21 exceeded the threshold for the least healthy category of soil activity compared to just one sample of 32 from the lawn. Among all 96 samples, 11/12 with healthy levels of soil activity (> 0.056 μg C g−1 min−1) occurred in the UFF.

Compared to control sites in the lawn, UFF soils also had higher concentrations of soil nutrients (Fig. 2A). Cation exchange capacity (CEC), which measures overall nutrient availability, was only 7.75 meq per 100 g in a physically aggregated sample from the lawn. In contrast, CEC in the UFF was 4.63 times higher. Every measurement of K, Ca, Mg, Mn and Fe concentrations from the five UFF samples exceeded those from the three lawn samples.

Fig. 2
figure 2

Urban food forest soils have higher nutrient concentrations (a) and distinctive fungal communities (b) compared to lawn soils. Large symbols in panel (a) correspond to physically aggregated samples from each area. Small symbols in panel a correspond to 6 additional stratified samples. Triangles correspond to samples from the Urban Food Forest and squares to samples from the lawn

Microbial communities from more functional UFF soil were diverse and distinctive compared to lawn soil (Fig. 2B). Among 417 unique fungal OTUs, the Chao 1 index of the four UFF samples was 423.1 (± 4.4 s.e.) compared to 395.3 (± 18.5 s.e.) in two samples from the lawn. Overall, 99 OTUs were unique to the UFF compared to just 24 from the lawn. The UFF included 26 marginally significant indicator taxa that matched the FunGuild traits database. The trophic modes for the 26 distinctive UFF soil fungi were significantly different from 224 other fungi (Fisher’s Exact Test, p = 0.012). Fungal communities exhibited fine scale variability, with different taxa dominating specific samples (Supplemental Table S4A). For example, Trechispora was the most abundant taxon in one UFF soil sample (16.95%) and was present at low abundance in the rest of the samples (0.04–0.09%). Similarly, Pisolithus was the most abundant taxon present in a lawn sample (29.82%) but was present at low abundance in the rest of the samples (0.08–0.13%).

Bacterial communities from the UFF were also more diverse and distinctive. Among 2323 bacterial OTUs, the Chao 1 index for the UFF samples was 2317.5 (± 10.7 s.e.) with 130 unique taxa compared to an alpha diversity of 2291.3 (± 19.4 s.e.) and 39 unique taxa in the lawn. The most abundant bacterial taxa in each sample were variable with some overlap between UFF samples and the lawn samples (Supplemental Table S4B). Steroidobacter were the most abundant bacteria in UFF soils (1.40–4.91%) compared to the lawn soils (0.17–1.24%). This genus has been previously reported present in farmed soils (Huang et al. 2019). Holophaga, which has been reported from natural forest soils (Hackl et al. 2004), was also abundant in UFF soils (1.18–2.89%) but had low abundance in lawn soil (0.06–0.42%). The most abundant genus in the lawn soil was Actinoallomurus, which was reported present in mangrove soils (Tang et al. 2013), however it was found in small amounts (2.4%).

As soil functions improved, and distinctively rich microbial communities developed, the UFF sequestered much more organic C belowground than traditionally managed lawn. Shortly after establishment, all of the difference in SOC stocks between areas was attributable to C added with the berm stratum (Fig. 3). After two years of management for useful tree growth, SOC more than doubled among 168 samples from the top 30 cm of the UFF. Taking the spatial structure into account, the average soil C concentration in the UFF increased from 5.88 ([4.48, 7.65] 95%CI) to 14.14 ([8.94, 22.28] 95%CI) kg C m−2. In contrast, soil C concentrations in the lawn increased only 25% from 3.92 ([3.00, 5.02] 95%CI) to 4.91 ([3.29, 7.25] 95%CI) kg C m−2 (Fig. 3), which was not significantly different. The original soil strata beneath the UFF sequestered slightly more C than corresponding areas of the lawn (UFF original soil C sequestration = 0.77 ([0.22,1.72] 95%CI) kg m−2 y−1; Lawn soil C sequestration = 0.49 ([0.14,1.11] 95%CI kg m−2 y−1), while the vast majority of C was sequestered in the berm stratum (3.35 ([1.38,6.56] 95%CI) kg m−2 y−1).

Fig. 3
figure 3

Soil carbon density approximately doubled in the Urban Food Forest compared to a negligible increase in the lawn. Error bars represent posterior 95% credible intervals for each stratum and area. Dark fill corresponds to carbon concentration within the berm stratum of the Urban Food Forest. Medium fill corresponds to carbon concentration in the original upper stratum of soil in either area (0–10 cm depth). Light fill corresponds to the original lower soil stratum in either area (10–20 cm depth)

While differences between areas demonstrated how quickly UFF adds SOC, fine scale variation within the UFF indicated approaching limits to its SOC sequestration capacity. Contrary to our expectation that different tree functional traits would exaggerate fine-scale variation in C sequestration, C densities became more homogenous because locations that began with relatively high SOC sequestered less C than locations with relatively low initial SOC (Fig. 4A). The negative linear relationship between initial SOC stocks and SOC sequestration is consistent with a simple soil C mass balance model (Eq. 1). After parameterizing the model from spatially detrended C stock and sequestration values, the SOC supply rate was estimated as 6.57 kg C m−2 yr−1 ([3.73, 10.9] 95%CI) and the SOC mineralization rate was estimated as 0.41 yr−1 ([0.04, 0.70] 95%CI). Meeting the estimated SOC supply rate from added mulch alone would have required 13.2 kg m−2 yr−1 ([7.46, 21.9] 95%CI) corresponding to a layer 6.6 cm deep converted directly into soil C each year. Given the estimated SOC supply and mineralization rates, the mass balance equation projected an equilibrium SOC stock of 14.48 kg C m−2 ([8.94, 19.02] 95%CI). After just two years, the UFF had attained nearly 90% of its estimated SOC stock capacity and was projected to achieve 95% of its capacity after only 4 years. By contrast, the estimated SOC supply rate in the lawn was significantly lower than in the UFF, 2.56 ([1.63,3.87] 95%CI) kg C m−2 yr−1 with a slightly higher mineralization rate (0.526 [0.41,0.67] 95%CI yr−1) and was within 1% of its estimated equilibrium, 4.88 ([3.31,7.20] 95% CI) kg C m−2 (Fig. S3), which aligns with our statistically insignificant estimated increase in SOC.

Fig. 4
figure 4

Fine scale changes in soil carbon concentration over two years implies that the Urban Food Forest exhibits system-level mass balance (a) and a rapidly approaching limit to sequestration capacity (b). Transparent line overlay represents 1200 draws from the posterior distribution for the estimated SOC supply rate (K) and mineralization rate (α) parameters (Eq. 1)

Discussion

Established ecological theory (Loreau 2010; Hulvey et al. 2013) and emerging empirical syntheses (Feliciano et al. 2018; Salbitano et al. 2019) agree with policy analysis (McElwee et al. 2020) that replacing unproductive urban land uses with UFF can mitigate related problems of urban food security and climate change. Only two years after converting lawn to UFF, our land use experiment demonstrated agriculturally relevant improvements in multiple dimensions of soil function with especially rapid soil C sequestration. However, fine-scale changes in C stocks generated more uniform conditions, despite the high diversity of tree species involved, implying that system level constraints have rapidly emerged in the UFF. Comparing our results to existing analyses of urban agriculture and agroforestry contextualizes the promise and limitations of UFF as a sustainable solution to major environmental challenges.

UFF improves soil function

Soil compaction, whereby stress from mechanical loads reduces soil pore space and associated biological functions, is the most widespread and important physical means for soil degradation in urbanized and agricultural soils (Nawaz et al. 2013; Smith et al. 2016; Thompson and Kao-Kniffin 2019). Natural recovery can take decades of physical cycling and bioperturbation (Nawaz et al. 2013). However, we found rapid decompaction following land use change, as measured by significant differences in SBD between the same soil strata under the UFF and at control sites in the lawn. The magnitude of change was similar to that reported in other recent studies of land use change. Two years after converting vacant lots in Ohio, U.S.A. to vegetable gardens, surface SBD decreased from 1.79 g cm−3 to 0.98 g cm−3 (Beniston et al. 2016), a change approximately 11% greater than what we observed. Rapid soil decompaction in the UFF without direct mechanical perturbation probably reflects a combination of processes, including exclusion of heavy lawn mowers, increased woody root growth and infiltration of organic matter produced in situ and added with management.

Converting lawn to UFF also improved biological and chemical metrics for soil function. Soil activity, as measured by short-term respiration, was significantly higher in both the imported berm soil and original topsoil than it was in the topsoil of the lawn. Similar increases in soil respiration have been documented close to established trees in both experiments and observations of agroforestry systems (Lee and Jose 2003; Hoosbeek et al. 2018). We note that the short duration of our ex situ measurements exclude soil C efflux from more recalcitrant compounds and root respiration. Longer incubations and in situ measurements would be necessary to estimate ecosystem-level soil C efflux in a way that is comparable to the mineralization rates that we estimated using the mass balance model (Eq. 1).

Higher soil activity tends to increase nutrient availability (Van Der Heijden et al. 2008), which is consistent with the higher nutrient concentrations and CEC that we measured in the UFF compared to the lawn. In the control lawn area, leaf clippings were retained, which tends to increase nutrient availability and SOC storage relative to management that removes clippings (Thompson and Kao-Kniffin 2019), but lawn management did not add fertilizer. Some of the improvement in soil activity and nutrient availability in the UFF could directly reflect input from compost, which was instrumental in improving C stocks in experimental reforestation plots in New York City (Ward et al. 2021). However, a forest garden in France without exogenous inputs exhibited higher CEC and nutrient availability than conventional agricultural soils, likely because tree roots improved mineral weathering and plant litter retained nutrients within the system (de Tombeur et al. 2018). Determining whether improved soil chemistry sustainably increases yields will require more detailed analysis of soil nutrient speciation as well as an assessment of nutrient losses with harvest compared to input from exogenous sources.

Improvements in both soil activity and nutrient availability coincided with the development of a more diverse and functionally distinctive soil microbial community in the UFF compared to the lawn. The differences that we observed may combine the known beneficial effects of both agroforestry and urban agriculture. Converting forests to agriculture generally decreases the diversity of soil bacteria, which recovers following reforestation (Díaz-Vallejo et al. 2021). These changes can have important functional consequences. Compared to farmland, agroforestry systems in China had higher soil C and respiration, which correlated with increased diversity of both bacteria and fungi (Ren et al. 2018). The increases in bacteria were also associated with changes in nutrient composition (Ren et al. 2018). Urbanization also impacts soil microbial community structure and function, such that urban soils tend to have reduced bacterial biomass and diversity (Guilland et al. 2018). However, microbial diversity within urban greenspaces varies greatly depending on cultivation and the presence of woody plants (Thompson and Kao-Kniffin 2019). In Adelaide, Australia, nutrient availability and bacterial richness were highest in urban garden plots and much lower in lawns of sports fields (Baruch et al. 2021). Sports fields also had lower fungal richness, particularly among saprotprohic taxa, compared to urban greenspaces with woody vegetation (Baruch et al. 2020). Taken together with our analysis, these results strongly suggest that cultivating diverse woody plant species in urban settings changes microbial communities in ways that promote physical and chemical dimensions of soil function relative to traditionally managed lawn.

Rapid SOC sequestration exhibits system-level constraints

Gains in soil function coincided with rapid SOC sequestration, reinforcing the potential for urban land use to mitigate climate change. While lawns can store more SOC than other land covers, depending on their management and biogeography (Thompson and Kao-Kniffin 2019), we measured relatively low SOC concentrations (4.91 kg C m−2) that were nearly identical to those measured in urban lawns in another region of Florida (4.9 kg C m−2, Nagy et al. (2014)). Moreover, lawn SOC did not significantly increase after two years, implying that 70 years of consistent management generated an SOC equilibrium, as has been documented after just 30 years in other urban grasslands (Shi et al. 2012). Compared to the relatively low equilibrium SOC concentrations in the lawn, establishing the UFF precipitated a dramatic increase in SOC.The magnitude was high but within the range observed for certain agroforestry systems. An early modelling analysis estimated potential soil C sequestration of up to 23 kg m−2 by converting traditional agriculture to agroforestry (Albrecht and Kandji 2003), which is consistent with empirical measurements from Canada which found over 20.1 kg C m−2 in soils from forested sites compared to just 15 kg C m−2 in herblands (Baah-Acheamfour et al. 2015). The comparable land cover types in our study showed a similar difference in surface SOC stocks. Our measurements were also similar to values reported from coffee polyculture in Costa Rica (16.8 kg C m−2, Alpizar et al. 1986), which is very similar to the maximum attainable SOC that we projected of approximately 16 kg C m−2. Our measurements differ from others primarily due to the rapid initial rate of sequestration in berm soils (3.35 kg C m−2 yr−1), which was nearly five times faster than observed in most agroforestry systems (Feliciano et al. 2018). However, the rate of C sequestration that we observed in the original upper stratum of soil (0.77 kg m−2 y−1) is much closer to values reported from other agroforestry systems.

Several factors could contribute to the high values for SOC stocks and sequestration that we observed in our study. First, our measurements occurred along berms where soil amendments and tree plantings were concentrated. Because SOC concentrations tend to increase closer to trees (Cardinael et al. 2020; Clivot et al. 2020), our values represent a maximum SOC stock for a subset of the UFF soils. Additional sampling from intervening alleys would more accurately estimate the SOC stocks of the entire project area and would probably yield lower average SOC concentrations that are closer to values that have been reported for other agroforestry experiments. Geographic and biological factors could also contribute to the high maximum values for SOC concentrations that we observed. Agroforestry projects in warmer locations tend to exhibit higher soil C concentrations (Feliciano et al. 2018). Also, ecological theory and experiments indicate that increasing woody plant diversity and associated functional trait variation tends to increase SOC sequestration (Hulvey et al. 2013). Our experiment included 33 species of trees and many more perennial pseudowoody herbs, vines and grasses with contrasting biogeographic origins. The extreme functional diversity of species involved could have increased the maximum SOC concentration. Additional experiments that vary species composition across climatic gradients are necessary to estimate how and why the SOC capacity of UFF may vary from other land use types.

While maximum SOC stocks and sequestration rates that we estimated were high, fine-scale variability implied a rapidly approaching limit to SOC sequestration. Contrary to our prediction that contrasting tree functional traits would exaggerate initial differences in SOC concentrations, with some species storing more C belowground than others, fine-scale variability in SOC concentrations actually decreased through time. The negative correlation between initial SOC concentration and subsequent SOC sequestration was consistent with a mass balance model (Eq. 1). The model, which has been used to estimate SOC capacity for global land cover types (Morais et al. 2019), is considerably simpler than widely used mechanistic models for soil C cycling (e.g. CENTURY, Parton et al. 1988; RothC, Coleman et al. 1997), and assumes uniform conditions with constant SOC supply and proportionally constant mineralization. Our finding that these assumptions approximately hold in a two-year old land use experiment has two major implications.

The first implication is that the system has fixed C sequestration capacity under current management and biological conditions. Indeed, 90% of attainable SOC had been reached just two years after establishment. For this reason, UFF soils may quickly meet discrete C sequestration goals, for instance as carbon credits, but not for mitigating ongoing emissions. It is possible that succession above and belowground could alter the parameters of the mass balance model to change the equilibrium SOC concentration or break the assumptions of the model altogether such that ongoing SOC sequestration is more likely. A recent metanalysis indicated that agroforestry soil C sequestration rates may be multiphasic, with an initial decelerating trend yielding to a transitory increase before a long term steady state (Feliciano et al. 2018). For a holistic picture of how UFF can factor into mitigation schemes, further measurements and modelling will be necessary to estimate the capacity and rate of C sequestration aboveground, which are reported to be considerable (Schafer et al. 2019), as well as the extent to which local food production offsets GHG emissions as has been demonstrated for other forms of urban agriculture (Cleveland et al. 2017).

The second major implication is that converting lawn to UFF precipitated the emergence of system-level regulation in just two years. Exogenous and endogenous factors could contribute. Added compost and mulch supply SOC and could homogenize microbial communities in a way that promotes constant proportional mineralization. The mass balance model constrains the quantity of exogenous inputs necessary to explain the increase in UFF SOC concentrations. Assuming that mulch was the main exogenous source of SOC during establishment, several tons added per year would be necessary to account for all of the additional C, which is inconsistent with the management history of the site. However, differences in management intensity and duration certainly contributed to the differences in stocks and sequestration between the UFF and the lawn. The lawn received neither irrigation nor fertilizer, which tend to increase SOC concentrations and sequestration (Thompson and Kao-Kniffin 2019). Elsewhere in Florida, where lawns likely received more inputs and urban forests, less, differences in soil SOC concentrations between these two land covers were not significantly different (Nagy et al. 2014). Given that short term changes in management can modify long-term equilibrium conditions in both lawns and forests (Peach et al. 2019), strategic use of irrigation and fertilizers may increase soil function and carbon sequestration across a range of urban land covers.

While management may contribute to system-level regulation in both lawn and UFF, the speed with which it emerged in the UFF despite the extreme functional diversity of plants raises the possibility that endogenous mechanisms played some role., The community of woody plants and associated organisms, including people, may reinforce negative feedback regulation over belowground C cycling at the system-level. For example, shared mycorrhizal networks can redistribute photosynthetic resources to reduce variation in resource stress among trees in forests (Klein et al. 2016). Analogous exchanges involving SOC supply and mineralization may operate within the UFF. In other words, from the perspective of soil C cycling, the collection of plants under current management began to behave like a forest. As such, soil C mass balance could serve as an objective criterion for delineating forest ecosystems independent of arbitrary definitions that depend on tree stem density and size distributions. Identifying which mechanisms generate this pattern, as well as their characteristic temporal and spatial scales, could be a productive avenue for basic and applied research on C cycling and sequestration in forest ecosystems.