Introduction

Earth system modeling lends a great potential for supporting regional environmental management (Ward et al. 2020; Dixon et al. 2021). Earth system modeling is evolving as an unprecedent research infrastructure that provides quality Earth system data and climate services for the society (Joussaume et al. 2017; Roberts et al. 2018b; Fiedler et al. 2021). For example, the publicly available data of the Coupled Model Intercomparison Project (CMIP) projects provide diverse climate services in water, agriculture, energy, and health sectors (Ilori and Balogun 2021; Usta et al. 2021; Ahmed 2021). Global climate models (GCMs) mainly simulate the physical atmospheric and oceanic processes with generally predetermined inputs of atmospheric composition. Earth system models (ESMs) reach far beyond GCMs by including explicit representation of biogeochemical processes and their interactions with the physical climate. Accordingly, ESMs can better explore the human impacts on the Earth systems through coupled representations of human activities with the physics and biogeochemistry of the atmosphere, ocean, land, rivers, and cryosphere. Using ESMs we can explore and understand how Earth systems respond to natural and anthropogenic forcings, while assessing future climate changes and mitigation plans (Eyring et al. 2019). Thus, ESMs have the potential to serve as a unique interdisciplinary research infrastructure for regional environmental management.

Coupled Model Intercomparison Project Phase Six (CMIP6) offers avenues for regional environmental services. ESMs have been utilizable for regional environmental management through coupled regional ESMs such as the regionally coupled atmosphere‐ocean‐sea ice‐marine biogeochemistry model (ROM, Sein et al 2015) and the Earth system regional climate model (RegCM-ES, Reale et al 2020). These regional ESMs include the same components as a global ESM, but cover a specific area with boundary conditions provided by global ESMs, or observation data. However, the compatibility across different model components is often restricted (Giorgi and Gao 2018), and there are often large uncertainties associated with the boundary conditions (Adachi and Tomita 2020). As such, coupled regional ESMs have been applied only to limited regional settings (Giorgi and Gao 2018). Yet out of the pressure to expand the scope of modeling in climate science, the CMIP6 became larger and more extensive in scope (Eyring et al. 2016). For example, CMIP6 endorsed a High-Resolution Model Intercomparison Project (HighResMIP, Haarsma et al. 2016) allowing ESMs with variable fine scale-grid resolution to focus on regional phenomena. This new generation of advanced high-resolution Earth system models (HR-ESMs) with improved horizontal resolution and process representation, particularly at the regional scale with unprecedented fidelity, enhances our confidence in predictions and projections (Roberts et al. 2018b). While these high-resolution models have very high computational cost, the data of these CMIP model runs are publicly available. HR-ESMs, that have nominal resolution of 25 km or less as being evaluated by CMIP6, go far beyond the standard low-resolution Earth system models (LR-ESMs) such as most of the simulations of CMIP5 and CMIP6 DECK historical experiment, which has a nominal resolution of 100 km. In addition, the Coordinated Regional Downscaling Experiment (CORDEX) uses CMIP outputs and provides, for selected regions, dynamically downscaled climate change experiments (Gutowski Jr. et al. 2016; Gutowski et al. 2020). This unprecedented resolution and model fidelity result in improved simulation of Earth processes that are of growing interest to societal decision-making. Accordingly, HR-ESMs may serve as a more reliable tool for assessing smaller-scale phenomena such as tropical cyclones (Jiaxiang et al. 2020), climate extremes such as heatwaves and heavy precipitation (Iles et al. 2020), costal processes (Ward et al. 2020), coral reef conservation (Dixon et al. 2021), and regional ocean currents (Le et al. 2020).

This study discusses the potential use of ESMs for regional environmental management using the Florida red tide as a case study. Red tides are worldwide occurrences of intense harmful algae blooms (Tian and An 2013; Wen et al. 2013; Xu et al. 2014; Liu and Fang 2017). The Florida red tide is composed of mixotrophic dinoflagellate, K. brevis. These microscopic protists occur regularly along the West Florida Shelf and cause substantial environmental and socioeconomic damage. This includes impacts on human health (e.g., respiratory, skin, and eye irritation), fisheries (e.g., massive fish kills, and shellfish poisoning), ecosystem services (e.g., harming sea turtles, marine mammals, and birds), tourism (e.g., hotels, and restaurants), recreational activities, and local small business. The occurrence of red tides in the Gulf of Mexico may involve multiple system processes, including land-to-ocean nutrient and sediment transport from rivers and submarine groundwater discharge, ocean currents and upwelling, ocean biogeochemical processes, African Sahara dust, wind direction, and tropical cyclones (Brand and Compton 2007). Many of the physical and biogeochemical processes that are related to the occurrence of red tide can be simulated ESMs as discussed in "ESMs for regional environmental management" section. Accordingly, this research direction aims at using ESMs to understand the impact of climate changes on red tide frequency, and accordingly assess the environmental and socioeconomic impacts of red tide under different Shared Socioeconomic Pathways (SSPs, Riahi et al. 2017) of the CMIP6.

Given the above general research direction, here we focus on Loop Current, a regional ocean current in the Gulf of Mexico, which is particularly an important factor for predicting red tide occurrences (Weisberg et al. 2014; Maze et al. 2015; Liu et al. 2016b). The main objective of this manuscript is to present this case study to serve as an example to discuss the prospects and limitations of the current generation of CMIP6 and next generation development of ESMs for red tide management. In other words, we highlight the current modeling gaps and future research directions of ESMs as a useful tool for providing regional environmental management services.

Methods

Red tide hypothesis and data

A current working hypothesis of the occurrence of red tide in the West Florida Shelf is based on the position of the Loop Current and its eddies (Perkins 2019) such that the Loop Current position can be “the first definitive predictor of bloom possibility” (Maze et al. 2015). The Loop Current and its eddies can be easily detected from sea surface height variability (Weisberg et al. 2014; Maze et al. 2015). Maze et al. (2015) showed that the differences in the Loop Current’s position between periods of large blooms and no bloom are statistically significant such that a northern Loop Current (LCN) position (Fig. 1a) penetrating through the Gulf of Mexico is a necessary condition for a large red tide bloom to occur. In other words, there is no large bloom for southern Loop Current (LCS) position (Fig. 1b), while there could be a large bloom or no bloom for LCN (Fig. 1a). Similar to the definition of Maze et al. (2015), we defined a large bloom as an event with the cell count exceeding 1 × 105 cells/L for ten or more successive days without a gap of more than five consecutive days, or 20% of the bloom length in the study area (Fig. 1a). The K. brevis cell count data used in this study are from the harmful algal bloom database of the Fish and Wildlife Research Institute at the Florida Fish and Wildlife Conservation Commission (FWRI 2020). For the analysis period of 22 years from 1993 to 2015, there were 15 periods of large blooms, and 29 periods with no bloom given a 6-month period length.

Fig. 1
figure 1

Reanalysis data of sea surface height above geoid (zos) [m] reveal (a) LCN and (b) LCS. Two red segments along the 300 m isobath in (b) are used to determine Loop Current position, and the red box in (a) shows the area where red tide blooms are considered by Maze et al. (2015) and this study

Reanalysis data

The Loop Current can be identified from altimetry reanalysis data as an area with sea surface height above geoid (CMIP variable: zos) higher than the surrounding waters. We use the global-reanalysis-phy-001–030 monthly product of Copernicus Marine Environment Monitoring Service (CMEMS), a global ocean eddy-resolving reanalysis covering the altimetry from 1993 to 2015 and onward with approximatively 8 km horizontal resolution (Drévillon et al. 2018; Fernandez and Lellouche 2018). Following Maze et al. (2015), the difference between the mean altimetry height along the two segments of the 300 m isobath (Fig. 1b) is used as a proxy for the Loop Current position (Appendix) such that positive and negative difference between the north and south segments indicates LCN (Fig. 1a) and LCS (Fig. 1b), respectively.

Earth system model data

We use multi-model ensemble of HR-ESMs with six ensemble members as shown in Table 1. For comparison, we use an ensemble of LR-ESMs with two ensemble members (i.e., EC-Earth3P and E3SM-1–0). The members are from both CMIP6 historical experiment (Eyring et al. 2016) and hist-1950 experiment (Haarsma et al. 2016). These are two sibling experiments that use historical forcings (e.g., historical greenhouse gases concentrations, solar forcing, etc.) of recent past until 2015. To reduce computational cost, the hist-1950 experiment starts from 1950 with initial conditions from the spin-up 1950 run. The historical experiment starts from 1850, initialized from any point early enough in the pre-industrial control run. Each member can have multiple runs with perturbed realizations (r), initializations (1), physics (p), and forcings (f) as shown in Table 1. For example, r(1–6) of ECMWF-IFS-HR means six runs of six perturbed realizations. The HR-ESMs have an ocean nominal resolution ranging from 8 to 25 km. The two last digits of HadGEM3-GC31-HH/HM/MM refer to the atmosphere and ocean, respectively, each with high (H) or medium (M) resolutions. For LR-ESMs, E3SM-1–0 has variable ocean resolution of 30–60 km from the pole to the equator, and EC-Earth3P has a nominal ocean resolution of 1° (about 100 km).

Table 1 ESMs models used in this study consisting of six HR-ESMs (i.e., ocean model resolution of less than or equal 25 km) and two LR-ESMs

Model validation

We investigate the agreement between model simulations and reanalysis data with respect to Loop Current position, and accordingly the possibility of large bloom occurrence following the empirical relation of Maze et al. (2015). To validate ESMs for this purpose, we identified three basic tasks with evolving difficulties. The first task is simulating the phenomena of interest at the regional spatial scale, which is the Loop Current in this case. The second task is to provide adequate estimation of the frequency of an oscillating event over the simulation period. This is achieved in this study by validating the frequency of LCN and LCS from model simulation with reanalysis data. This validation task is particularly important as the main purpose of this research direction is to understand the future frequency of red tide under different SSPs of CMIP6. Simulating the regional physical phenomena of interest (e.g., Loop Current) with an accurate frequency of oscillation are among the important steps toward this purpose along with the other steps that are presented in the discussion section. The third validation task is to examine the temporal agreement of Loop Current positions between model and reanalysis data.

Results and discussion

Loop current position simulation

The first task is to validate whether the models can simulate LCN and LCS positions. High-resolution eddy-permitting grids of HR-ESMs meet this modeling need as demonstrated by Fig. 2, which shows a snapshot comparison of the simulated sea surface height above geoid (variable: zos) from a low-resolution eddy closure EC60to30 mesh, and the high-resolution ORCA12 grid. Fig 2a shows that the low-resolution mesh cannot simulate LCN, whereas Fig. 2b indicates that the high-resolution ORCA12 grid resolves mesoscale eddies. Compared with LR-ESMs, HR-ESMs not only have a finer horizontal resolution, but also improved process description as reflected by the ocean grid in HR-ESMs. The low-resolution EC60to30 grid of E3SM-1–0, which is an eddy closure (EC) grid, is not expected to resolve the regional spatial phenomena of interest, but rather to produce global or regional average. This is mainly because low-resolution grids (e.g., EC60to30 of E3SM-1–0 and ORCA1 of EC-Earth3P in Table 1) require global parametrization of mesoscale eddies, rather than explicitly resolving mesoscale eddies and boundary currents. By contrast, high-resolution eddy-permitting grids (e.g., ORCA12 and ORCA025 in Table 1) do not require ocean eddy flux parameterization.

Fig. 2
figure 2

A snapshot (March 2010) of sea surface height above geoid (zos) [m] simulated using a (a) LR-ESM, and (b) HR-ESM with nominal resolution of 100 km and 10 km, respectively

In this study, we are not arguing that global ESMs can replace regional models (e.g., the Gulf of Mexico HYCOM by the HYCOM Consortium) in accurately simulating the Loop Current and eddy positions. The Loop Current has a chaotic and random cycle with the average period being around one cycle every 8–18 months (Sturges and Evans 1983; Maze et al. 2015). The objective of this work is neither to accurately represent the shape of the Loop Current and its anticyclonic ring, nor to precisely simulate the time of Loop Current occurrence, which are even challenging tasks for regional models. The object of this research direction is to understand assess the validity of ESMs in regional environmental management. The specific objective of this study is model validation given the criteria of Maze et al. (2015) that is based Loop Current frequency.

Frequency of loop current position

The second validation task is related to event frequency, which is adequately estimated by the HR-ESMs ensemble. We plot the time series of the Loop Current position as inferred from the reanalysis data against occurrence of large blooms (Fig. 3a). For the study period of 22 years with 6 month interval length, there is a total of 44 intervals. For the reanalysis data, the LCN count is 32, and the LCS count is 12 (Eqs. A2A4). The higher frequency of LCN than LCS is expected. This frequency ratio between LCS and LCN of 0.375 (i.e., 12/32) is very similar to that of Maze et al (2015), which is 0.364 given their study period from 1993 to 2007 and their reanalysis product (Eq.A2–A4). Additionally, Fig. 3a indicates that no large blooms are associated with LCS (i.e., zos anomaly less than 0). This is consistent with the findings of Maze et al. (2015) that for LCS no large bloom occurs, and that LCN is a necessary condition for large bloom to occur.

Fig. 3
figure 3

Temporal match of large bloom/no bloom with Loop Current positions given by (a) observation reanalysis, and (b) simulations of HR-ESMs for multi-model ensemble average. Positive and negative bars indicate LCN and LCS, respectively

To evaluate to what degree HR-ESMs can simulate the reanalysis data, the HR-ESMs predictive performance is shown in Fig. 3b, and the results are summarized in Table 2. A comparison between HR-ESMs and LR-ESMs with respect to frequency of an event and temporal agreement is summarized in Table 2. The LCN frequency over the study period is reasonably reproduced by HR-ESMs multi-model ensemble that has a frequency of 35 compared to 32 for the reanalysis data (Eqs. A2A4). The best single-model ensemble ECMWF-IFS-MR, which has three repeated runs r(1–3)i1p1f1 as shown in Table 1, has a LCN frequency of 33 that is very close to the reanalysis data (Eqs. A2A4). These results show that unlike LR-ESMs, the HR-ESMs are generally capable of reproducing the frequencies of the Loop Current north and south positions according to the relation of Maze et al (2015).

Table 2 Frequencies of LCN and LCS positions, and their relation to the occurrence of large blooms

Temporal match

Data-model temporal agreement is the third validation task, which shows relatively large mismatch specifically with respect to LCS. The HR-ESMs multi-model ensemble average has a total temporal match of 33 intervals out of 44. This means that during 44 intervals of the study period, the Loop Current position (i.e., LCN or LCS) of the model simulation matched that of the reanalysis data for 33 intervals, and for 11 intervals they were mismatching. Out of 32 intervals showing LCN by the reanalysis data, the model simulations matched 26 of them with an error of 18.75% (Eq. A6). Out of the 12 intervals indicating LCS by the reanalysis data, the model simulations matched only five of them with an error of 58.33% (Eq. A5). In summary, the HR-ESMs multi-model ensemble average has a total temporal match error of 25% error as calculated from Table 2 (i.e., [44–33]/44 as shown by Eq. A7), and this error is more pronounced for the LCS than LCN. The error is invariant to the first year period (i.e., Jan–Jun), and second year period (i.e., Jul–Dec). Out of the 15 large blooms, this error in the LCS has resulted in 2 false-positive periods of bloom formation that is 2 LCS intervals with large bloom (Eq. A8). In comparison, LR-ESM shows 15 false positives of bloom formation that is 15 LCS intervals with large bloom (Eq. A8).

Poor temporal match is expected. Given one year operational scale of ESMs (Chou et al. 2018), temporal mismatch for shorter intervals is expected due to weak signal-to-noise ratio in short-timescale predictions, systematic biases, and drifting. In addition, both the historical and hist-1950 experiments are free-running, which means that they are not expected to temporally coincide with real-world conditions. This can be particularly true for the historical simulations, which might be further out of synchronization with real conditions relative to hist-1950. This temporal match can even suffer from systematic bias over several decades. For example, Tokarska et al (2020) noted that for certain models (e.g., E3SM) the uncertainty in climate feedbacks and the aerosol forcing can result in historical warming similarly as observation, but with poor temporal agreement. This remark on simulated global mean surface air temperature may also be applicable for sea surface height. In other words, as no temporal match with historical record is generally expected, global predictions can be analyzed for processes and cycles, but not temporal alignment of specific ocean features. Accordingly, we attempt to find a coarse temporal agreement based on long interval (i.e., 6 month interval) using the coarse relationship of Maze et al. (2015) between Loop Current positions and red tide blooms. Thus, this task can be considered as pseudotemporal correspondence that captures the general pattern of a dynamic process. While temporal match is not critical for the main modeling purpose, which is to understand future trends and frequencies of red tide, it could have additional benefits such as using ESMs to develop an early warning system for red tide. ESMs are generally designed to make predictions at coarse time scales of decades to centuries. However, multiple techniques (e.g., downscaling, pattern scaling, and use of analogues) can be used to provide information at fine time horizons that match the decision contexts (van den Hurk et al. 2018). Additionally, coupled ESMs are now being tested for global prediction on short-range timescales (Brassington et al. 2015; Hewitt et al. 2017). These points are further discussed in "ESMs for regional environmental management" section.

Study findings and limitations

Several relationships have been established between Loop Current and red tide blooms (Weisberg et al. 2014, 2019; Maze et al. 2015; Liu et al. 2016a). The purpose of this work is not to support or refute any of these relations, but to use the empirical correlation of Maze et al. (2015) to illustrate the potential use of the publicly available data of CMIP6 for providing environmental services and to facilitate this discussion.

We illustrate data-model evaluation of Loop Current simulation using HR-ESMs for three basic tasks. First, adequate simulation of the spatial phenomena of interest is a bottleneck condition. According to Haarsma et al. (2016) to provide information relevant for stakeholders and adaptation strategies, regional climate information focuses on smaller scales and extreme events and requires high-resolution modeling to better capture local processes and teleconnections with distant regions that has a strong impact on the region of interest. For example, our case study shows the importance of simulating the regional phenomena of Loop Current that loops the Gulf of Mexico, which requires HR-ESMs versus a global phenomenon such as the Gulf Stream, which is a strong ocean current that circulate warm water from the Gulf of Mexico into the Atlantic Ocean. Unlike low-resolution ESMs, HR-ESMs can resolve the phenomena of interest at the regional scale (e.g., LCN and LCS) as suggested by other studies (Caldwell et al. 2019; Hoch et al. 2020). Being able to simulate the underlying physical processes of interest is a prerequisite to any meaningful ocean biogeochemical modeling at the regional scale.

Second, adequate estimation of the frequency of event oscillation (e.g., frequency of LCS and accordingly the absence of large blooms) is important for understanding the impacts of different climate scenarios on the frequency of red tide as explained below. Since LR-ESMs cannot simulate the physical phenomena of interest, this class of models failed to reproduce the observed frequency (Table 2).

Third, temporal matching at management timescale can permit additional services. Theoretically, the temporal correspondence obtained in this study could be a mere coincident. Otherwise, this could be attribute to the use heuristic relation with coarse temporal resolution. This might suggest that a pseudotemporal correspondence might be possible in the absence of a large drift. If such temporal correspondence cannot be established, this should not impact the main modeling purpose of understanding the frequency and trend of red tide under different climate scenarios and of estimating the socioeconomic impacts accordingly. Such temporal correspondence is generally not currently possible without downscaling, and substantial long-term investment in climate science capability and model design are still needed to resolve finer spatiotemporal scales (Fiedler et al. 2021). However, if temporal match with ocean observations at short timescale is required, strategies to reduce mismatch include improving initial conditions (Hewitt et al. 2017), bias correction and recalibration methods (Manzanas 2020), and pattern scaling (van den Hurk et al. 2018). These can be simpler techniques compared to the more challenging dynamical downscaling techniques. In addition, this analysis can be repeated by replacing the CMIP6 data with CORDEX data as soon as they become available.

This study supports the evidence that improved horizontal resolution with weather-resolving global model resolutions (∼25 km or finer) and using coupled ESMs (i.e., coupled simulations of atmosphere–cryosphere–land–ocean) can improve predictive performance at short timescales (Scaife et al. 2011, 2019; Hewitt et al. 2017; Little et al. 2019). While the focus of this study is the ocean component, the results are based on coupled simulations. By better capturing transient mesoscale motions, these simulations correspond to the observed ocean phenomena exhibiting fine scale boundary currents, transient fluctuations, coastal upwelling zones, meanders and jets (Hewitt et al. 2017), which can permit these models to be used for environmental management.

This study is a preliminary showcase of the possibility of using CMIP6 data for red tide management, and many further steps are needed. The presented results are mainly for preliminary validation of CMIP6 data, but are of limited use for predicting red tide in the Gulf of Mexico. With respect to the relationship between Loop Current and red tide, the only relation that we consider is that large blooms are unlikely to occur for the case of LCS (Maze et al. 2015). For the case of LCN, further relations (Weisberg et al. 2014, 2019; Liu et al. 2016a) are needed to constrain the cases of large and no blooms, respectively. In addition to Loop Current, other relevant drivers (e.g., African Sahara dust, offshore and alongshore wind speed, atmospheric CO2 concentration, sea surface temperature, etc.) will be considered under different SSPs of CMIP6 in which socio‐economic scenarios are used to derive emissions scenarios with and without climate polices for mitigation. These drivers may be simultaneously considered using machine learning in a probability framework in a future study. For example, Tonelli et al. (2021) used machine learning with CMIP6 data to study marine microbial communities under different future scenarios. While the abovementioned drivers of red tide can be mostly assessed with CMIP6 data, several limitations persist as discussed next.

ESMs for regional environmental management

Limitations and prospects of ESMs for regional environmental management

While the presented case study is on red tide, ESMs can be used to provide multiple environmental management services that are controlled by land to ocean nutrient and sediment transport, land and ocean biogeochemical reactions, tropical cyclones, and wind speed and direction. To illustrate this potential use of HR-ESMs in providing environmental management services, we discuss the outputs of ESMs that are useful for red tide management, which can be generalized to other regional environmental problems. Brand and Compton (2007) discussed different drivers that contribute to the initiation, growth, maintenance, and termination of red tide. Box 1 summarizes these drivers with their corresponding ESMs outputs, using the Energy Exascale Earth System Model (E3SM) as an example. The E3SM project is an ongoing state-of-the-art Earth system modeling project that attempts to answer more demanding questions related to human activities interaction with the Earth system. Physical ocean and atmospheric processes presented in Box 1 are already implemented in current generation of HR-ESMs. River flow will be available for HR-ESMs, yet river nutrient transport and simulation of anthropogenic impacts will be addressed in next generation development (Leung et al. 2020). With respect to the ocean biogeochemical processes (e.,g., Heil et al., 2014), the implementation of the biogeochemical processes presented in Box 1 in HR-ESMs is still in progress. However, biogeochemical cycles are not yet coupled between ocean and land in the current generation development. The feedbacks between the ocean biogeochemistry and the physical ocean and climate states, transport of nitrogen and phosphorus from land to ocean, and river–ocean biogeochemistry fluxes, and the incorporation of anthropogenic nutrient sources at a regional scale are ongoing in E3SM project (Burrows et al. 2020; Leung et al. 2020).

Conclusions

This article discusses the potential uses of the HR-ESMs of CMIP for environmental management at regional scale. We present a case study about harmful algae blooms in Florida commonly known as red tide, and the position of Loop Current, which is a warm ocean current that enters the Gulf of Mexico, can be the first predictor of red tide. This case study was intended to tailor the outputs of CMIP6 models to an environmental application at the regional scale. Three basic criteria for evaluating model predictions have been presented. Unlike LR-ESMs, HR-ESMs can adequately simulate the physical phenomena of interest (i.e., Loop Current position), which is prerequisite to any meaningful biogeochemical modeling. In addition, HR-ESMs can adequately reproduce the frequency of the event of interest (i.e., LCN and LCS), which is crucial for assessing the impact of climate change on red tide. At last, the most challenging evaluation task is the temporal agreement of model simulations with observation data at the management timescale. Large temporal mismatch is observed suggesting that this is an unsuitable criterion for evaluating global ESMs. Loop Current position alone is only one predictor. Machine learning seems to be a viable option for prediction using additional Loop Current relations and red tide drivers. The article also identifies the current gaps and development needs of ESMs with respect to environmental management services, while realizing that the new generation of HR-ESMs in CMIP6 is a remarkable development. The development gaps include (1) coupling of land, river, and ocean biogeochemistry, (2) accounting of anthropogenic disturbances on natural systems (e.g., anthropogenic nutrient sources and freshwater withdrawal), (3) coupling between human activities and ESMs, and (4) advancement of interactive coupled regional ESMs. Such developments are needed for red tide management, and many other environmental management services. Stakeholder engagement in model development is essential to facilitate the translation of scientific understanding to better inform decision-making of regional environmental management.