Introduction

Biological indicators are increasingly used in aquatic ecosystem health monitoring due to their potential over other evaluation methods like physico-chemical parameters (Lopez-Lopez & Sedeno-Diaz, 2015; Moog et al., 2018). Bioindicators can reflect long-term aquatic ecosystem conditions (Uherek & Gouveia, 2014; Young et al., 2014). Biomonitoring is a fast and cost-effective method, since it requires less field and laboratory equipment compared to physico-chemical parameters (Asif et al., 2018; Fontanetti et al., 2012; Kumari & Paul, 2021). This approach has gained recognition and has been used in freshwater assessment worldwide (Karmakar et al., 2021; Mahaut et al., 2013; Parmar et al., 2016; Poulíčková et al., 2004; Riato & Leira, 2020). In particular, a technique of grouping biological metrics into a single index, known as the multimetric index (MMI), is being used in health monitoring of both lentic and lotic ecosystems. Multimetric index represents various aspects of ecological information, such as abundance, diversity, composition, richness, and function (Arman et al., 2019; Gabriels et al., 2010; Huang et al., 2015; Zhang et al., 2019). Since the first attempt of MMI by Karr (1981) using fish, other organisms such as diatoms (Reavie et al., 2008; Schowe & Harding, 2014; Yu et al., 2021), macrophytes (Aguiar et al., 2011; Beck et al., 2010; Penning et al., 2008), and macroinvertebrates (Edegbene et al., 2019; Menetrey et al., 2011; Utami & Fajar, 2022) have been used worldwide. The use of MMIs as freshwater monitoring tools has been reported in various regions including North America (Haugerud, 2006; Karr, 1981; Wesolek et al., 2010), South America (Baptista et al., 2007; Carayon et al., 2020; Touron-poncet et al., 2014), Europe (McGoff et al., 2013; Miler et al., 2013; Solimini et al., 2008; Trigal et al., 2006), Asia (Ding et al., 2021; Zhang et al., 2019), and Africa (Edegbene et al., 2019; Wondmagegn & Mengistou, 2023).

Macroinvertebrates have for long been used as bioindicators of water quality, owing to their ability to respond differently to environmental stressors, mainly of anthropogenic origin (Assefa et al., 2020; Ntitslidou et al., 2021). Most macroinvertebrates are easy to collect/sample and have relatively long life cycle, thus representing seasonal changes and functional traits that enable them to indicate disturbance in the habitats (Bêche et al., 2006; Bonada et al., 2006; Young et al., 2014). Unlike relatively big-sized organisms such as fish with higher habitat restrictions (Kramer & Chapman, 1999), macroinvertebrates can easily be found in almost all aquatic ecosystems (Feio et al., 2023). Their responses to the environment are guided by their sensitivity/tolerance levels to pollution. For instance, Ephemeroptera, Plecoptera, and Trichoptera (EPT) have been reported to be very sensitive to organic pollution (Akamagwuna et al., 2021; Menetrey et al., 2008). As such, the abundance of EPT is often very low in an ecosystem or site with high organic pollution, often reflecting the status of impairment. Considering the same stressor (organic pollution), some chironomids are considered to be very tolerant, i.e., their abundance is not much affected by organic pollution (Auta et al., 2016 ; Cornette et al., 2015). The variation in sensitivity of organisms prompted studies to illustrate some macroinvertebrate taxa with their respective sensitivity/tolerance levels (Bonada et al., 2006). Indeed, the macroinvertebrate-based sensitivity/tolerance metrics have been used in a number of biomonitoring studies (Baptista et al., 2007; Edegbene et al., 2022; Macedo et al., 2016; Shah et al., 2011). In addition, macroinvertebrates have been considered in MMI development, an approach that has been widely applied in lotic environments (rivers/streams) in comparison to lentic (lakes/reservoirs) (Assefa et al., 2020; Ndatimana et al., 2023; Odountan et al., 2018).

The development and application of macroinvertebrate-based MMI in lakes are drastically increasing in developed countries (Ding et al., 2021; McGoff et al., 2013; Miler et al., 2013; Zhang et al., 2019) and a few developing countries, including African countries (Wondmagegn & Mengistou, 2023). In developed countries, various metrics are assessed in response to the main stressors of lake integrity such as eutrophication, acidification, and geomorphological changes (Young et al., 2014). The profundal-based macroinvertebrate metrics (Jyväsjärvi et al., 2010, 2012), littoral-based metrics (Johnson, 2003; McFarland et al., 2010; Schartau et al., 2008), and assessment of geomorphological alteration (Brauns et al., 2007; Miler et al., 2013) were applied to handle these main disturbances of lake integrity. In addition, collaboration and the implementation of water-related policies such as the European Union Water Framework Directive (WFD) and the United States Environmental Protection Agency (USEPA) have helped the development and application of MMIs in developed countries (Gabriels et al., 2010; Poikane et al., 2011; Sreeja, 2018).

In Africa, lakes are supporting the life of riparian communities, both directly and indirectly, for instance, through provision of fishery resources (Abo-Taleb et al., 2023; Ogutu-Ohwayo et al., 2016), water for irrigation (Agodzo, 2013; Salomonsson & Thoresson, 2010; Zamdayu & Japhet, 2021), municipal and domestic use (Salomonsson & Thoresson, 2010; Shishaye, 2018; Yonnana et al., 2017), as well as recreation services (Kolawole et al., 2020; Nakirya et al., 2015). However, these activities are affecting water quality and the health status of inhabiting organisms (Cohen et al., 2019; Onywere et al., 2012). Indeed, laundry activities, agricultural wastes, direct dumping of wastes, oil spillage, and automobile washing, among others, cause organic and heavy metal pollution, which alter the ecological functioning of lakes (Gwenzi & Chaukura, 2018; Obinnaa & Ebere, 2019; Redwan & Elhaddad, 2022). The effective response of macroinvertebrates to such disturbances makes them good indicators of aquatic health (Gezie et al., 2017; Wondmagegn & Mengistou, 2020). Therefore, the application of MMI is reliable especially when the exact stressors affecting the lake are not known (Schoolmaster et al., 2012). Thus, macroinvertebrate-based MMI is a good option in monitoring African lakes which are being affected by multiple stressors. Indeed, macroinvertebrates have been recommended in MMI development to improve the use of biomonitoring approaches on African lakes (Ndatimana et al., 2023; Odountan et al., 2018). Macroinvertebrates showed potentials of application in MMI for African rivers (Alemneh et al., 2019; Aura et al., 2010; Edegbene et al., 2019), but their development and application in lakes are still at infancy.

Old Gawu and Tagwai Lakes in North Central Nigeria are currently threatened by agricultural and municipal wastes from their catchments (Musa & Saba, 2011), which increase organic pollutants in the lakes. In addition, non-point source pollution mainly from human activities in and around the lakes such as laundry, agricultural runoff, fishing, and washing automobiles portends the continuous pollution of these municipal lakes. This hinders the ecological functioning of the lakes and affects the inhabiting organisms (Egwu et al., 2020). Thus, there is a need for consistent water quality monitoring to ensure desirable ecological integrity. Unfortunately, the existing water quality assessment of lakes in the region is mainly based on physico-chemical parameters (Ibrahim et al., 2019), which are relatively expensive, time-consuming, and do not provide early warning signals compared to biological indicators. Currently, there is no documented macroinvertebrate-based MMI developed for lakes in Nigeria. Therefore, the use of metrics of macroinvertebrates from Old Gawu and Tagwai Lakes to develop an MMI will provide a baseline for successful biomonitoring of lakes in Nigeria, particularly, North Central region. It is anticipated that macroinvertebrate-based MMI will be able to discriminate highly impaired from least impaired sites. In addition, the developed MMI will help in rapid assessment of lake health status, as well as effective and evidence-based policy development for lake water quality management in the region.

Methods

Study area description

The study was conducted in two municipal lakes in North Central Nigeria: Old Gawu, located in the Federal Capital Territory (FCT) Abuja, and Tagwai Lake, located in Minna, Niger state Capital. Old Gawu Lake is situated on longitude 6°35′55.6″E and latitude 9°30′41.7″N (Fig. 1), located in Abaji local government area. Old Gawu is surrounded by agricultural farms dominated by subsistence crops such as yams, maize, beans, and white-seed melon locally known as Egusi. The main activities from this lake are livestock watering, laundry, and water abstraction for irrigation especially during dry season, recreation, and fishing. Meanwhile, Tagwai Lake is located in Minna, situated on longitude 6°39′40.95″E and latitude 9°33′59.958″N (Fig. 1). The lake is surrounded by agricultural farms dominated by maize, okra, and yams. This lake supports community in the vicinity through irrigation of agricultural farms, fishing, livestock watering, and recreation. Ecologically, the lakes are habited by a diversity of organisms such as fish, invertebrates (micro and macro), reptiles, and amphibians. The North Central Nigeria region has a tropical climate with a mean annual temperature of 30 °C, relative humidity of 61%, and rainfall of 1334 mm.

Fig. 1
figure 1

Map showing sampling stations (in set: Nigeria, (A) Tagwai Lake, (B) Old Gawu Lake)

Macroinvertebrate sampling and laboratory analysis

Macroinvertebrates were sampled from eight sites, i.e., four sites per lake. For each site, three replicates from randomly selected sampling points were collected using a D-shaped hand net with a mesh size of 0.5 mm (Jones et al., 2004). Macroinvertebrates were collected within 25 to 40 cm depth, with a distance between sampling points approximately 3 to 5 m. The three replicate samples from each site were then pooled together into a single composite sample. Sampling was done monthly from April to August and October 2022, covering the dry period (April–May), wet period (June–August), and transition/before the dry season (BDS) (October). From the field, samples were fixed in 80% ethanol and transported to the Animal Biology Laboratory, Federal University of Technology, Minna (FUTMINNA).

In the laboratory, samples were sorted using a Digital microscope (YIZHAN, 38 MP FHD camera V6) connected to a computer and identified to family level using taxonomic keys by Arimoro and James (2008), de Moor et al. (2003), Gooderham and Tsyrlin (2002), and Thorp and Rogers (2011). Macroinvertebrates were assigned to functional feeding groups (FFG) using trait-based fuzzy coding by Tachet et al. (2010) and the aid of specialists from the Hydrobiology Unit, FUTMINNA.

Environmental variables

At each site, water temperature, dissolved oxygen (DO), hydrogen ion concentration (pH), and electric conductivity (EC) were determined using HACH multi-meter probe (HQ40d, USA). The laboratory analysis consisted of nitrate (NO3), calcium (Ca), phosphate (PO4), magnesium (Mg), BOD5, total alkalinity (TA), chlorine (Cl), total hardness (TH), chemical oxygen demand (COD), and chlorophyll-a (Chl-a), which were analysed based on standard methods by Gilcreas (1966).

Site categorization

The sampled sites were classified based on observed disturbance in the littoral zones, mainly the inlets of sewage/effluents, automobile washing, and recreational activities associated with point source pollution, as well as vegetation assemblage. Indeed, observation of riparian zone is an important aspect during site classification (Wondmagegn & Mengistou, 2023). The study area is mainly threatened by organic pollution, and the site clustering was emphasized by the level of impairment detected using selected environmental variables indicating organic pollution. The selected variables were DO, BOD, TDS, EC, and nutrients: NO3 and PO4. Thus, the Bray-Curtis clustering revealed four least impacted sites/reference and four impacted/impaired sites and was further confirmed using the Mann–Whitney U test (p < 0.05). The reference condition was given to sites with the least disturbance since it is not practical to find sites with undisturbed conditions (Zhang et al., 2019). The rest were classified as impaired/highly disturbed. The reference sites were Old Gawu Site 3 (OGS3), Old Gawu Site 4 (OGS4), Tagwai Site 2 (TS2), and Tagwai Site 3 (TS3), whereas impaired sites were Old Gawu Site 1 (OGS1), Old Gawu Site 2 (OGS2), Tagwai Site 1 (TS1), and Tagwai Site 4 (TS4). The reference sites were characterised by high DO and low BOD, EC, TDS, and nutrients, unlike impaired sites with low DO and high BOD, EC, TDS, and nutrients.

Candidate metrics selection

The macroinvertebrate metrics to be considered in MMI development are presumed to be available in the environment, with the potential ability to discriminate the sites effectively (Zhang et al., 2019). Since MMI combines several metrics into a single value (Gabriels et al., 2010), the constituent metrics are assumed to be representatives of various ecological information. Indeed, variation in metric groups creates diversity of response to disturbances, hence increasing the ecological information to be represented in an index (Baptista et al., 2007; Barbour et al., 1995, 1996).

To assess the response of the macroinvertebrate community to disturbances, 54 macroinvertebrate-based metrics were selected. The metrics selection consisted of measuring various aspects of macroinvertebrate communities gathered in five groups. The metric groups include (i) taxa abundance, i.e., the absolute number of macroinvertebrate individuals; (ii) taxa composition, i.e., relative abundance; (iii) taxa richness metrics; (iv) diversity metrics; and (v) functional feeding group (FFG). Of the 54 selected metrics, 15 were abundance metrics, 14 composition, 12 richness, six diversity indices, and seven FFG (Table 1).

Table 1 The candidate metrics and their expected response to pollution

Metrics computation and screening tests

To ensure better discrimination of references and impaired sites by candidate metrics, a series of tests were used. The screening tests included (i) sensitivity/discriminatory, (ii) stability/seasonality, (iii) redundancy, and (iv) correlation with environmental variables (abiotic factors) resulting into core metrics (Baptista et al., 2007; Jun et al., 2012; Wang et al., 2015; Wondmagegn & Mengistou, 2023). The metrics’ ranges (minimum-maximum) were considered in selection, whereby the metrics whose median value of 0 in both sites (reference and impaired) were excluded to avoid inefficient indication of the difference between these sites (Hering et al., 2006; Zhang et al., 2019). After this test, the normality of candidate metrics was performed using Shapiro-Wilk normality test. The metrics with normal and non-normal distribution were tested with parametric and parametric tests, respectively.

Discriminatory analysis, which consists of testing whether the metrics were able to differentiate the reference from impaired sites, was performed using box and whisker plots, accompanied with the Mann–Whitney U test. The box and whisker plots were used to visualize the variation of metrics range between reference and impaired sites, according to Barbour et al. (1996). The degree of interquartile (IQ) overlap helped to define the sensitivity value, by assigning the score to each metric. The IQ scores were 3 to 0, indicating no overlap to highly overlapped ranges. The IQ scores were assigned as IQ = 3, IQ = 2, and IQ = 1 when there was no overlap between two groups, overlap with all medians outside the IQ range, and moderate overlap with one median outside the IQ range, respectively. Meanwhile, IQ = 0 was assigned when the ranges were completely overlapping each other with all medians inside the IQ ranges. The metrics selected for further tests were those that showed higher sensitivity/discriminatory power (IQ 2), with a significant difference between the reference and impaired sites (p < 0.05).

The metrics which successfully met the previous criteria were tested for seasonal stability using the Kruskal-Wallis test. This was done by testing whether the metrics can withstand the seasonal variations. The metrics with stability power can withstand the seasonal variation, indicated by the absence of significant difference (p > 0.05) among seasons (Baptista et al., 2007). The redundancy test was applied to stable metrics using Spearman’s rank correlation to eliminate more redundant metrics, which showed a significantly high correlation (Spearman r ≥ 0.7 and p < 0.05). These metrics are considered not to represent new information in the index. In this case, one or more metrics were eliminated to ease the index application, reduce the analysis cost, and avoid redundant information (Barbour et al., 1996; Hering et al., 2006; Wondmagegn et al., 2019). The metrics that met redundancy tests were tested for correlation with environmental variables. Indeed, the potential metrics to be integrated in MMI should be correlated with environmental variables to be able to indicate the changes associated with disturbances. The retained metrics had significant correlation with environmental variables as indicated by Spearman’s correlation. The final metrics were expected to be negatively or positively correlated depending on explanatory factors and the prediction response to stressor/disturbance. Therefore, the metrics that passed all the aforementioned criteria were considered core/final metrics and were integrated in NCN-LMMI.

Development of North central Nigeria Lake multimetric index (NCN-LMMI)

The metrics which successfully met all the three criteria were integrated into NCN-LMMI as core metrics. The integration of core metrics into the final MMI required standardization of metrics since they had different numerical scales (numbers or percentages). Therefore, discrete scoring systems were performed to enable the collection of their values into single metrics (Baptista et al., 2007). For each metric, the range of values was sectioned into three possible scores, corresponding to the upper (75%), median (50%), and lower quartile (25%). The metrics’ quartile was used as a threshold, referring to the responses of metrics to the increase of disturbance or pollution. For metrics anticipated to decrease in response to the rise in disturbance/pollution, the values above the lower quartile of the reference distribution were given a score 5. Nonetheless, for metrics anticipated to increase with disturbance, each value below the upper quartile of the reference distribution was assigned a score 5. The scores 5, 3, and 1 represent the reference, intermediate, and highly degraded condition, respectively (Wang et al., 2015; Wondmagegn & Mengistou, 2023). Thus, the values of each metric, including the minimum, lower quartile, median, upper quartile, and maximum at reference sites, were used as the threshold to split score ranges for the establishment of the MMI.

NCN-LMMI performance validation

To evaluate the performance of the developed MMI, we estimated precision and compare the coefficient of variation (CV) of MMI values (Chen et al., 2014, 2017; Hawkins et al., 2010; Macedo et al., 2016). The estimation of precision was obtained using the correct classification percentage (CCP). The CCP of the reference cluster was computed as the percentage of sites appraised as good and fair from sites in the entire cluster, whereas the CCP of the impaired cluster was computed as the percentage of sites appraised as poor and very poor from sites in the entire group. Furthermore, to estimate the MMI precision, the CV of MMI values of the two clusters were compared, whereby the more precise the MMI, the lower the CV (Chen et al., 2014).

Data analysis

The metrics were organized, and arranged in Microsoft Excel (Microsoft Excel, 2019 Office), and further analyses were performed in R environment (RCore Team, 2022). The cluster analysis was performed using the Factoshiny package (Vaissie et al., 2021), for site classification. The Shapiro test was used to check for the normality of data sets. Mann–Whitney U test was used to assess the discriminatory power of metrics between reference and impaired sites, along with box and whisker plots visualization. Kruskal-Wallis test was used to check for seasonal differences of metrics across the study period. Spearman’s correlation was applied to determine the relationship between the metrics themselves and with environmental variables for core metrics selection, using the corrplot package (Taiyun & Simko, 2021).

Results

Macroinvertebrate metrics selection

Out of 54 metrics tested for discriminatory/sensitivity test, only 16 metrics showed discriminatory potential between reference and impaired sites (Table 2). This was revealed by the discriminatory power/sensitivity score (IQ ≥ 2) and significant difference (p < 0.05, Mann–Whitney test) expressed by the metrics between impaired and reference sites. Among these 16 metrics, five were abundance metrics, three composition, two richness, four FFG, and two diversity metrics. However, Per_lee, abun_Tric, and Tric_rich were not found in the reference sites (Median = 0) and thus were eliminated in further analysis. Therefore, 13 metrics were considered sensitive with their ability to discriminate reference from impaired sites, as indicated by the Mann–Whitney test and box–whisker plot (Table 2).

Table 2 Metrics response in discriminatory test (comparison between reference and impaired)

The sensitive metrics were tested for their ability to withstand seasonal changes. Macroinvertebrate assemblages often depend on the temporal changes in ecosystems (Bêche et al., 2006). The selected metrics for this test did not show significant difference between seasons, i.e., stable metrics. Therefore, only seven out of 13 metrics were able to show stability in various seasons (Table 3). The selected metrics, i.e., seasonally stable metrics were Eph_abun, ET_abun; CE_abun; Gast_rich; Shan_H, Marg_Index, Per_filt_feeder, and Per_Shre+Pred+Scrap.

Table 3 The median values with range (minimum-maximum) in parentheses of sensitive metrics during seasonality test

The redundancy test was applied to seven stable metrics selected for seasonality test using Spearman’s rank correlation. Metrics were considered redundant when expressed a significant correlation coefficient greater than or equal to 0.7 (r ≥ 0.7; p < 0.05) (Hering et al., 2006; Wondmagegn & Mengistou, 2023; Fig. 2). For two or more redundant metrics, we selected one based on either knowledge of its potential indicator, i.e., ecological information derived from the metrics comparably to other(s) or its simplicity in calculation, to ease the index. Based on the aforementioned criteria, Shan_H and Marg_Index were redundant. However, Shan_H was chosen, ascribed to its ability to inform macroinvertebrate diversity aspect in the site and is often applied in diversity measurements, particularly during MMI development (Miler et al., 2013; Timm & Möls, 2012). Likewise, Per_filt_feeder was selected comparably to Per_Shre + Pred + Scrap, owing to its relatively easy identification cost, thus contributing to the simplicity of the index. Indeed, the main target of MMI is to compile the ecological information from various metrics response to disturbance into a single, simple, effective, and robust index (Chowdhury et al., 2016). The five metrics that succeeded redundancy test were ET_abun, CE_abun, Gast_rich, Shan_H, and Per_filt_feeder.

Fig. 2
figure 2

Correlation matrix of the seasonally stable metrics of macroinvertebrates applied for data collected from Old Gawu and Tagwai Lakes during the redundancy test

While assessing the relationship between metrics and environmental variables, the five selected metrics that passed through sensitivity, stability, and redundancy were considered. The Spearman’s rank correlation revealed that all selected metrics were correlated with at least two environmental variables (Table 4). Therefore, the selected metrics at this stage were considered core metrics and were finally integrated into the final NCN-LMMI.

Table 4 Correlation matrix of the seasonally stable metrics and environmental variables for data collected from Old Gawu and Tagwai Lakes.

Development of North central Nigeria Lake multimetric index (NCN-LMMI)

From all the 54-candidate metrics tested, 9.25% were integrated into the index. The final metrics integrated into the NCN-LMMI were ET_abun, CE_abun, Gast_rich, Shan_H, and Per_Filt-feeder (Fig. 3). For each of the five-core metrics integrated into the NCN-LMMI, the values for apt quartile at reference sites were used as a threshold for splitting high score from the lower score (Table 5). To standardize the metrics values, the discrete scoring was applied following the response of final metrics (either increase or decrease with increase in disturbance) as described in Table 5. NCN-LMMI was attained by summing the scores of five metrics, thus generating the range of 5 to 25 as the possible range of the index. Thereafter, the NCN-LMMI range was quadrisected to provide four water quality categories I, II, III, and IV portending to good, fair, poor, and very poor ecological condition of the lake (site), respectively (Table 6).

Fig. 3
figure 3

Box plots of the core metrics and their distribution along the clustered sites of Old Gawu and Tagwai Lake: Boxes represent IQ (25–75% percentiles), the line divides the box represents the median, whereas range bars show maximum and minimum values.

Table 5 The computed metric threshold based on macroinvertebrate assemblages of reference for each of the five-core metrics integrated into NCN-LMMI
Table 6 North Central Nigerian Lake Multimetric Index score portend different water quality category and their respective ecological status

NCN-MMI validation

The CCP values of the reference cluster and the impaired cluster were 100% and 75%. respectively, an indication of a very good precision of MMI. Indeed, the scores of the newly developed NCN-LMMI were applied to the impaired sites, OG1, OG2, TS1, and TS4 data, and revealed that all were in poor ecological condition (Table 7). This indicates the NCN-LMMI’s ability to differentiate the impaired from reference sites. This was emphasized by the CVs of the reference cluster and impaired cluster which were 0.14 and 0.15, respectively, proving that NCN-LMMI was more precise with both reference and impaired sites.

Table 7 Scoring the impaired site dataset using the newly developed MMI (NCN-LMMI) to test for its robustness.

Discussion

Site classification

Based on the nature of Tagwai and Old Gawu Lakes, with common stressors including agricultural activities and the domestic and municipal wastes (Oladosu et al., 2019), the NCN-LMMI was developed for organic pollution. Indeed, these stressors are the primary sources of organic pollution that lead to nutrient enrichment of the lakes (Zhang et al., 2019), thus affecting aquatic ecosystem health. Dissolved oxygen (DO), BOD, EC, and nutrients (nitrate and phosphate) were used, since their concentration can indicate the levels of organic pollution in a water body (Menetrey et al., 2011; Šidagyte et al., 2013). The impairment of sites can also be linked to physical degradation such as water abstraction and clearing of the vegetation buffering the lakes leading to direct entry of pollutants. The reference sites should be able to express the natural patterns and processes basically with no anthropogenic disturbances, to be able to differentiate them from disturbed sites (Whittier et al., 2007). However, it is hard to find the reference sites as described (Stoddard et al., 2006; Whittier et al., 2007); instead, the least-disturbed sites were considered as reference in this study. Generally, site classification depends on water body stressor/disturbance; thus, various parameters act as indicators, for instance, trophic level (Ding et al., 2021; Zhang et al., 2019), degree of acidity (e.g., Schartau et al., 2008), or combination of multiple factors (Menetrey et al., 2011).

The site classification resulted in two final clusters namely reference sites, i.e., less disturbed, and impaired sites, i.e., highly degraded/disturbed. Two clusters were also obtained by Ruse (2010) and Shah et al. (2011), during MMI development. However, various MMIs resulted in the use of three categories: reference/no-impacted/less impacted, moderately impaired, and disturbed/impaired group (Chowdhury et al., 2016; Ding et al., 2021; Wondmagegn & Mengistou, 2023). Therefore, the number of clusters depends on degradation gradients and site classification methods.

Metrics selection and NCN_MMI development

The choice of candidate metrics for NCN-LMMI reflects metrics used locally and in the bioassessment of the water quality of lake ecosystems. Metrics selection can be done based on taxa indicators used in the local/regional area (Edegbene et al., 2019; Wondmagegn & Mengistou, 2023) or the successful metrics used in the same ecosystem/type of water body, either lake or river (Hering et al., 2006; Zhang et al., 2021). Indeed, a similar approach was applied while selecting metrics for MMI Flanders, where metrics which were successfully used throughout the targeted area were considered (Gabriels et al., 2010).

A total of five-core metrics from four metrics groups: two abundance, one composition, one diversity, and one FFG were integrated into the final MMI. The metrics from various groups indicate the complexity (diversity of information/responses to disturbance) by macroinvertebrate assemblages, hence improving the robustness of MMI (Hering et al., 2006; Stoddard et al., 2008). The screening tests helped eliminate the metrics from 54 to only five with intention of obtaining the simple, effective, and more informative metrics governing the index. Through this process, some MMIs end up screening process with different numbers of core/final metrics. Final metrics can be four (Ding et al., 2021), often between five and ten final metrics (McGoff et al., 2013; Shah et al., 2011; Wesolek et al., 2010; Zhang et al., 2019), or even from ten and above (Keizer-Vlek, 2014; Trigal et al., 2006). The variation in the number of final metrics is mostly based on the nature of water body, metrics computation and analysis, and target of MMI as well.

The abundance metrics were used in water quality monitoring, specifically in MMI development (Šidagyte et al., 2013; Timm & Möls, 2012; Zhang et al., 2019). Indeed, the absolute abundance of macroinvertebrate provides a response to the environment, owing to their affinity to stressors (Camargo et al., 2011). For instance, the abundance of macroinvertebrates showcases water chemistry accruing to pollution that may be occurring in the environment (Arimoro et al., 2015). Composition metrics have been used in many MMIs (Hering et al., 2006; McGoff et al., 2013; Schartau et al., 2008; Shah et al., 2011; Taowu et al., 2008), due to their details in indicating the organisms’ responses to stressors. Meanwhile, diversity metrics are widely used in water quality monitoring including MMI (Ding et al., 2021; Hering et al., 2006; Timm & Möls, 2012; Zhang et al., 2019) and other biological approaches of water quality assessment such as surveys and multivariate analysis (Djamai et al., 2019; Ghosh & Biswas, 2015; Latha & Thanga, 2010). Shannon–Wiener index has a strong influence on indicating the status of macroinvertebrates (Clews et al., 2014), hence predicting their response to stressors. The FFG was considered as a group of metrics while developing NCN-LMMI, a similar approach that has previously been applied (Chowdhury et al., 2016; Ding et al., 2021; Huang et al., 2015; Wesolek et al., 2010). The selection of FFG metrics was associated with their sensitivity to nutrient enrichment, thus acting as a good indicator of organic pollution, compared to other taxonomic metrics (Menezes et al., 2010; Xu et al., 2014). In addition, FFG encompasses various taxa that could respond differently to stressors (Newbold et al., 2020); however, as single feeding behaviour, assemblage provides efficient information to the final MMI. Furthermore, this metric (Per_filter feeders) was able to successfully pass through all screening tests, mainly due to its sensitivity to stressors, stability among seasons, and ability to correlate to environmental variables (Figs. 2 and 3).

The core metrics correlated significantly with environmental variables, justifying the robustness of NCN-LMMI. For instance, Shan_H positively correlated with DO and negatively correlated with phosphate, TA, Ca, and Mg. Indeed, Shan_H decreases with the increase in pollution (Moiseenko et al., 2009). The pollution-sensitive metrics such as ET_abun and EC_abun, which decrease with the increasing pollution negatively correlated with variables like BOD, TDS, EC, and nutrients. This expresses the ability of NCN-LMMI to notice the change, and status of pollution, and provide ideal signals/early warning. Similar findings, i.e., the correlation of the metrics with environmental parameters, often used as indicators of pollution (Gabriels et al., 2010; Shah et al., 2011; Šidagyte et al., 2013). On the other hand, a relationship was revealed among the two pollution tolerant metrics: the Gast_rich and Per_filt_feeder with environmental variables such as TDS, EC, Cl, and TH. This is congruent with the reported studies by Schartau et al. (2008) and Zhang et al. (2019), where a positive correlation was found between metrics that respond positively to the disturbance.

Ephemeroptera and Trichoptera taxa have been reported to be highly sensitive to organic pollution (Akamagwuna et al., 2019; Chowdhury et al., 2016; Mohammed et al., 2022). Similarly, these taxa are part of EPT, the commonly used metric in both biological assessment and MMIs (Baptista et al., 2007; Gabriels et al., 2010; McGoff et al., 2013; Menetrey et al., 2011). However, EPT did not show stability by discriminating reference from impaired sites. The ET_abun is considered sensitive, its abundance decreased with increasing pollution, and the trend was emphasized in the data collected at Old Gawu and Tagwai Lakes. On the contrary, ET can respond positively to pollution when the sample is dominated by the less sensitive taxa such as Caenidae and Heptageniidae (Brown, 2009), which could alter the response to disturbance. Ephemeroptera appeared in two final metrics (ET_abun and CE_abun), not only to efficiently represent their information in the MMI but also to contemplate its interactions with Trichoptera and Coleoptera. Individually, Coleoptera and Trichoptera did not show discriminatory power; however, when combined with Ephemeroptera, they were able to differentiate reference from impaired and withstood the seasonal variation. Indeed, Coleoptera were recorded in high abundance from reference sites compared to impaired sites. In addition, studies reported metrics comprising Coleoptera as sensitive metrics (Dong et al., 2014; McGoff et al., 2013; Menetrey et al., 2011).

Gastropoda richness was integrated into MMI, owing to its response to disturbances. The most sampled family was Thiaridae, a pollution-tolerant taxa (Akindele & Liadi, 2013; Weir & Salice, 2012). Nevertheless, some gastropods are sensitive, e.g., Caenogastropoda, also known as gilled snails (Puglisi, 2021). Caenogastropoda’s breathing mechanism requires absorption of DO through gills, making them highly dependent on good water quality with sufficient DO concentration for survival, hence indicating high sensitivity to pollution (Anyanwu et al., 2019). Gastropoda richness has been integrated into various MMIs as a core metric (Schartau et al., 2008; Taowu et al., 2008; Zhang et al., 2019), owing to their ability to discriminate efficiently reference from impaired sites.

In NCN-LMMI development, Shan_H was able to differentiate reference from impaired sites, owing to its response to environmental degradation. Shannon–Wiener index is the most common metric used in macroinvertebrate-based MMI (Burton et al., 1999; Gabriels et al., 2010; Trigal et al., 2006; Wesolek et al., 2010). Its reliable measure and indicator help in understanding community diversity (Ghosh & Biswas, 2015), hence indicating the diversity of taxa in the targeted sites. In addition, Shannon–Wiener index is a pollution indicator, which is expected to be lower in impaired areas compared to reference sites (Kartikasari et al., 2013). Apart from being used in MMI development as diversity metrics, Shannon–Wiener is often used in biomonitoring and management of various water bodies (Djamai et al., 2019; Ghosh & Biswas, 2015; Latha & Thanga, 2010).

Macroinvertebrates express various feeding habits, which give them the ability to respond differently to disturbance. The percentage of filter feeders was part of NCN-LMMI after successful completion of all metrics screening. Filter feeding is common among macroinvertebrates like bivalves, using gills (ctenidia) (Ward et al., 2003), and Ephemeroptera, Trichoptera, and Diptera using setae, mouth brushes, and silk secretions, acting as sieves (Yee & Kaufman, 2019). The Per_filt_feeders was an increasing metric, probably due to the abundance of Culicidae, a pollution-tolerant taxa (Jeanrenraud et al., 2023). This metric was also considered by O’Toole et al. (2008) in testing the relationship between nutrients and selected benthic macroinvertebrates in lakes of Europe.

Precision through CCP and CV was estimated to prove whether NCN-LMMI reacts appropriately to environmental condition, and it has been revealed accordingly. The validation of metrics is one step to ensure the MMIs developed are indeed reflecting the expectations. During validation, some developers consider the same dataset used during MMI development (Alemneh et al., 2019; Zhang et al., 2019) or validate using separate dataset collected from the same sites (e.g., Huang et al. (2015) and Wondmagegn & Mengistou, (2023)). A similar validation method was applied by Chen et al. (2014), Krynak and Yates (2018), and Zhang et al. (2019).

Conclusion

NCN-LMMI is the first attempt as a lake-based multimetric index, using macroinvertebrates in North Central Nigeria and Nigeria at large to the best of our knowledge. It is anticipated to be helpful in the monitoring of lake water quality especially municipal lakes in this region, while contributing to addressing public health issues associated with pollution, such as waterborne diseases. Policy makers and water resource managers can use this tool to develop effective management strategies. Indeed, this can be a better alternative in water quality monitoring of lakes over traditional methods of physico-chemical parameters, owing to its cost-effectiveness, both in terms of finance and time.