Introduction

Periphytic microfauna is an important component of benthic communities and plays necessary roles in energy transfer from bacteria and microalgae to the high trophic levels in microbial food webs (Azovsky 1988; Norf et al. 2009; Xu et al. 2011a, 2014a). With rapid responses to environmental changes, ease of sampling, and standardized methodologies for determining water quality status, microperiphyton has been considered as a useful bioindicator for monitoring programs to date (Cairns and Henebry 1982; Coppellotti and Matarazzo 2000; Xu et al. 2009, 2014b). Furthermore, colonization parameters of periphytic microfauna have proved to be a potentially robust indicator for assessing the loading capacity or self-purification capacity of an aquatic ecosystem for pollutant inputs (Cairns and Henebry 1982; Railkin 1995; Burkovskii and Mazei 2001; Xu et al. 2009; Zhang et al. 2013). However, the community/colonization-based bioassessments have traditionally been based on full species abundance datasets (Jiang et al. 2007; Xu et al. 2014a). This is time-consuming and laborious and significantly limits their use for monitoring research on large spatial scales.

Taxonomic sufficiency is a useful tool to reduce the time cost for monitoring research (Ellis 1985; Heino and Soininen 2007; Jiang et al. 2014). An effective approach is to identify a single taxonomic or ecological group as suitable surrogates that reflects the sensitivity of the entire species assemblage (Heino and Soininen 2007; Xu et al. 2011a; Jiang et al. 2014). Although the feasibility of these surrogates has been reported for many benthic faunas, studies on the utility of an optimal species pool of a full species set in a community for colonization-based bioassessment surveys has yet to be carried out (Stark et al. 2003; Xu et al. 2011b).

In this study, the step-best-matching analysis was used to identify an optimal species pool for colonization-based bioassessment using periphytic microfauna, based on a dataset that was collected in coastal waters of the Yellow Sea, near Qingdao, northern China. The aim of this study was to develop a feasible protocol for identifying an optimal species pool for marine bioassessment using periphytic microfauna.

Material and methods

Data collection

The study site was located in the harbor of the Olympic Sailing Center at Qingdao, northern China (Fig. 1). This is a typical coastal area of Yellow Sea with an average depth of ~8 m and an average tidal range of 3 m.

Fig. 1
figure 1

Sampling station, which was located in the harbor of the Olympic Sailing Center (OSC) at Qingdao, on the Yellow Sea coast of northern China

A total of 280 glass slides (17.5 cm2 for each) were used as artificial substrates for collecting microperiphyton communities at two depths of 1 and 3 m below the water surface. For each depth, a total of seven PVC frames were used to hold a total of 140 slides, 20 of which were randomly collected from each PVC frame at the time interval of 1, 3, 7, 10, 14, 21, and 28 days, during the study period. From both depths, samples were collected simultaneously.

Identification and enumeration of microperiphyton species were carried out following the methods outlined by Xu et al. (2009, 2011b). Taxonomic classification of ciliates was based on published keys and guides such as Song et al. (2009) and Berger (1999).

Data analyses

The colonization process of the periphytic ciliates can be followed by sampling replicate glass slides over time to follow the rate and magnitude of each species accrual process. Data can thus fit the colonization equilibrium model equation developed by MacArthur and Wilson (1967):

$$ {S}_{\mathrm{t}}={S}_{\mathrm{e}\mathrm{q}}\left(1-{\mathrm{e}}^{\hbox{--} Gt}\right) $$

where S t = the taxon number at time t, S eq = the estimated equilibrium species number of the ciliate colonization, G = the colonization rate constant, and t = the colonization time. Three functional parameters (S eq, G, and T 90) were estimated using the software SIGMAPLOT (v12.5) (e.g., Zhang et al. 2013). Fitness tests were conducted to determine whether the taxon numbers observed at each time interval fit with the MacArthur-Wilson model at the 0.05 significance level. The regression analysis and fitness tests were carried out using the regression wizard-equation “Exponential Rise to Maximum (single, 2 parameter)” in the software SIGMAPLOT. The time taken to reach 90 % S eq (T 90) was calculated according to the equation T 90 = −ln (0.1)/G (e.g., Zhang et al. 2013).

The increase of individual abundance over total experimental phase was tested if it was fitted to the logistic model:

$$ {A}_{\mathrm{t}}={A}_{\max }/\left[1+{e}^{\left(a-rt\right)}\right] $$

where A t = the individual abundance at time t, A max = the carrying capacity of individual abundance (maximum abundance), r = the growth rate constant, and a = the coefficient constant of initial individual abundance; T 50 = the time to 50 % A max. All parameters (e.g., A max, r, and T 50%) were estimated using the SIGMAPLOT. Fitness tests were to determine whether the individual abundance recorded at each time interval fit with the logistic model at the 0.05 significance level. The regression analysis and fitness tests were carried out using the regression wizard-equation “Sigmoidal (sigmoid, 3 parameter)” in the software SIGMAPLOT (Zhang et al. 2012).

Species diversity (H′), evenness (J′), and species richness (D) of samples were calculated as follows:

$$ \begin{array}{l}H^{\prime }=-{\displaystyle \sum_{i=1}^s{P}_i\left( \ln {P}_i\right)}\hfill \\ {}J^{\prime }=H^{\prime }/ \ln S\hfill \\ {}D=\left(\mathrm{S}-1\right)/ \ln N\hfill \end{array} $$

where H′ = observed diversity index, P i = proportion of the total count arising from the ith species, S = total number of species, and N = total number of individuals.

The step-best-matching analysis was used to exhaustively select the best matching subsets that were significantly correlated with the matrix of the full species abundance dataset from the same dataset by discarding the former selections using the submodule BVSTEP in the PRIMER v6.1.16 package (Clarke and Gorley 2006). The initial selection was define as best 1, second selection after discarding the first selection as best 2, third selection after discarding the first and second selection as best 3,…, and last selection with significant correlation with the matrix of full species set after peeling out selections 1–n from full species set as best n (Zhong et al. 2014). Using the selections identified at a 0.95 correlation level with the full species dataset, a smallest subset of species was determined as an optimal species pool with sufficient information of the full species set (Clarke and Gorley 2006).

The community structures of samples were analyzed using the PRIMER 6 package (Clarke and Gorley 2006). The distance-based redundancy analysis (dbRDA) of PERMANOVA+ was used to summarize temporal variations in community pattern at five taxonomic levels, using Bray-Curtis similarities from the fourth root-transformed species abundance data, respectively (Anderson et al. 2008). Mental analysis was used to summarize the relationships between pairs (the full species set and the optimal species pool) of similarity matrices using the Spearman rank correlation coefficients (p values) which were computed by the submodule RELATE (Clarke and Gorley 2006).

The nonparametric Kolmogorov-Smirnov test was used to evaluate the differences in the species number and colonization rates between depths of 1 and 3 m at the 0.05 significance level, using the software SPSS (v16.0) (e.g., Xu et al. 2012; Zhang et al. 2012). Data were log-transformed before analyzing.

Results

Full species dataset and optimal species pool

The species distribution, occurrence, and abundance in the full microperiphyton dataset studied were summarized in Supplementary Table S1 (see Appendix). A total of 77 microperiphyton species were recorded at two depths in coastal waters of the Yellow Sea, northern China, during the study period.

By successive selections of species that were generated by the BVSTEP procedure, a 23-species subset was identified from the full 77-species set (see Supplementary Table S1). Regression analyses revealed that the values of both species number and abundance represented a significantly closed relationship between the subset and the full set (R 2 > 0.98; Fig. 2).

Fig. 2
figure 2

Scatter plots of species numbers (a) and abundance (b) between the 23-species subset and the full set, showing linear relationships. R 2: determination coefficients

Colonization curves based on small subset

Colonization curves in species composition based on both the small subset and the full set are plotted in Fig. 3a. Regression analyses demonstrated that all colonization curves in species composition were well fitted to MacArthur-Wilson model equation (Table 1). Three functional parameters—S eq, G, and T 90—were calculated following the MacArthur-Wilson model equation (Table 1). Compared to the values based on the full dataset, the colonization rates (G) were significantly high and the time reaching 90 % equilibrium species number (T 90 ) significantly low (P < 0.05) at both depths, respectively.

Fig. 3
figure 3

Colonization curves in species number (a) and abundance (b) based on 23-species subset and the full set, showing the fitness to MacArthur-Wilson and logistic model

Table 1 Functional parameters of microperiphyton fauna, based on the 23-species subset and the full 77-species set, at two depths of 1 and 3 m of the Yellow Sea during the period

Growth curves based on both the small subset and the full set were plotted in Fig. 3b. Regression analyses revealed that all growth curves were well fitted to logistic model equation (Table 2). Three parameters—A max, r, and T 50—were obtained from the logistic model equation. However, the growth rates (r) and the time reaching 50 % maximum abundance (T 50) showed no significant changes (P > 0.05) between the two datasets.

Table 2 Estimated parameters of logistic curves in abundance based on the 23-species subset and the full 77-species set, at two depths of 1 and 3 m of the Yellow Sea during the period

Colonization dynamics in community structure based on small subset

The colonization dynamics in community pattern based on both the small subset and the full set during colonization period are shown in Fig. 4. The first dbRDA axis (dbRDA1) mainly separated the immature (1–7 days) communities (on the left) from the mature (10–28 days) (on the right), while the second dbRDA axis (dbRDA2) mainly discriminated the initial (1 day) (upper left) from the transitional (3–7 days) communities (lower left) (Fig. 4). It should be noted that almost all vectors (correlation coefficients >0.3) for microperiphyton species pointed toward the mature sample clouds (right; Fig. 4).

Fig. 4
figure 4

Distance-based redundancy analysis (dbRDA) of 14 data points of the 23-species subset and the full set, showing the colonization dynamics

Mental analysis revealed that community pattern of the 23-species subset was significantly correlated with that of the full set (Spearman correlation coefficient >0.97; P < 0.05).

Temporal variations in biodiversity based on small subset

The linear relationships of the species richness, evenness, and diversity between the small subset and the full set are summarized in Fig. 5. The values of all three biodiversity indices represented a significantly closed relationship between the two datasets (R 2 > 0.91; Fig. 5).

Fig. 5
figure 5

Scatter plots of species richness (a), diversity (b), and evenness (c) between the 23-species subset and the full set, showing linear relationships. R 2: determination coefficients

Discussion

Many investigations have revealed that the colonization process of periphytic microfauna could be assigned into the initial, transitional exponential, and the equilibrium stages (Azovsky 1988; Burkovskii and Mazei 2001; Zhang et al. 2012, 2013). Zhang et al. (2012, 2013) reported that the periphytic ciliate assemblage showed significantly high similarities in community structure in the equilibrium stage but represented a significant difference from those during the initial/exponential stage (Zhang et al. 2012, 2013). Our previous works have also demonstrated that the colonization curves in species number and abundance well fitted the MacArthur-Wilson and logistic models, respectively.

In this study, a 23-species subset was identified from the full 77-species dataset of periphytic microfauna using the step-best-matching analysis. Multivariate analysis showed that this small subset maintained sufficient information of the entire communities and represented a consistent colonization dynamics with the full species dataset. Otherwise, regression analysis revealed that the community structural parameters, i.e., species richness, abundance, and biodiversity indices, represented a close linear relationship between the subset and the full species set. Thus, we propose that this subset might be used as an optimal species pool of periphytic microfauna for colonization-based bioassessment surveys.

It should be noted that the use of an optimal species pool as a surrogate for species-level identification offers two advantages: (1) more reliable since full species identification is complex and laborious, and (2) less time-consuming, reducing the costs of monitoring programs. Based on this work, the community pattern and biodiversity indices based on the optimal species pool may explain >97 and >90 % variation of those based on the full species dataset, respectively. Furthermore, the colonization rates (G) were significantly high and the time reaching 90 % equilibrium species number (T 90) significantly low based on this optimal species pool compared to the full species dataset (P < 0.05). This implies that it may shorten the exposure time to use the small species pool compared to the full species dataset and allows developing a time-efficient sampling strategy for marine bioassessment.

In summary, a 23-species subset with sufficient information of the whole community was identified. The small subset maintained sufficient information of colonization pattern of the full species set and may be used as a robust surrogate of entire periphytic microfauna in colonization surveys. This allows developing a time-efficient protocol for marine monitoring programs using of periphytic microfauna. However, further studies on a range of marine nichs and over large temporal/spatial scales are needed in order to verify this conclusion.