Introduction

The Caizi Lake wetland is an important freshwater ecosystem located in the middle and lower reaches of the Yangtze River, which is an important wintering habitat for Anser fabalis and Cygnus columbianus along the migration route from East Asia to Australia, as well as for East Asian endangered species such as Grus monacha and the Ciconia boyciana (An et al. 2021). Therefore, strengthening the protection of the Caizi Lake wetland is critical for the conservation of the ecosystem and species diversity. The Yangtze-to-Huaihe Water Diversion Project was launched in December 2016 and the Caizi Lake is an important node lake. The project carries water northward through the Caizi Lake, with 60% of the total pilotage volume and more than 85% of the spontaneous pilotage volume flowing through the lake. After the completion of the project, the diversity of wetland plants and the survival of migratory birds in the tidal flat would be impacted as the water level rises and the wetland area decreases (Li et al. 2019a, b). In the face of the current reduction of wetland area and wetland destruction, the Chinese government has mandated the restoration of wetlands from farmlands to promote the protection of wetlands and the habitat of migratory birds. Particularly, the conversion of farmlands to wetlands in the Caizi Lake wetland should be prioritized.

Species diversity indices serve as indicators of the community structure and function and are therefore used as sensitive indicators of wetland community structure, function, migratory bird habitats, and ecosystem stability (Zheng et al. 2019). Understanding the species diversity of wetland vegetation is helpful to assess the community type, community succession stage, habitat quality, and ecosystem stability (Garbisu et al. 2020). Particularly, this study focuses on the species diversity of reclaimed farmlands that are being converted back into wetlands. Our findings indicated that the soil heavy metal content in the study area exceeded the screening value established by the Chinese GB15618-2018 standard (GB 15618–2018 n.d.). Therefore, pesticides and heavy metal stress could affect the complexity and specificity of regional plant diversity (Yang et al. 2020). Environmental pollutants such as pesticides and heavy metals are important factors to consider when planning the restoration of the Caizi Lake ecosystem. Therefore, it is important to understand the distribution and source of heavy metals in degraded or reduced farmlands, as well as to explore the influence on species diversity and plant tolerance mechanisms. Most studies on soil heavy metals have focused on biological effectiveness (Cyriac et al. 2021; Zuran et al. 2019), heavy metal toxicity (Matayoshi et al. 2021; Yang et al. 2021), and ecological risk to individuals (Ekengele et al. 2017; Tóth et al. 2016). Moreover, studies on group effects largely focus on the community diversity of microbes, aquatic organisms, animals, and mine scrap lands (Deng et al. 2021; Jia et al. 2019; Yang et al. 2020). However, few studies have evaluated the influence of heavy metals on the diversity of plant communities in degraded or reduced farmlands. Therefore, this study evaluates wild hygrophytes in Xinjian Dyke in the Caizi Lake National Wetland Park and discusses the PMF source apportionment of heavy metals and its influence on the species diversity of the plant community. This study thus provides a reference for the ecological restoration of reclaimed farmlands for the Yangtze-to-Huaihe Water Diversion Project.

Materials and methods

Study area

The Caizi Lake wetland is a typical silted shallow freshwater lake wetland located in a subtropical monsoon climate region, with abundant rainfall and a pleasant climate. The wetland is located in the north bank of the Yangtze River, at the junction between Yixiu District, Zongyang, and Tongcheng County, Anqing City, Anhui Province (117°01′–117°10′E, 30°45′–30°56′N), with an average elevation of 9.1 m. The flat water area of the wetland is 16,667 hm2, the wet water area is 24,230 hm2, and the dry water area is 14,520 hm2 (Gao et al. 2011). An analysis of the plant and bird species during the soil and plant sampling in this study revealed that there are 402 species of plants and 315 species of birds in Caizi Lake. The species and number of wintering waterbirds have been stable for many years, with 30–40 species and more than 20,000 individuals (Wang et al. 2018).

Xinjian Dyke is a restoration and reconstruction area that is less than 1 km away from the migratory bird gathering point. In view of the impact of the Yangtze-to-Huaihe Water Diversion Project on the habitat reduction of migratory birds, it is necessary to return farmland to wet. But more than 85% of the district is occupied by farmland, and rice, wheat, and vegetables were planted on it (Fig. 1); therefrom, the lots not occupied by human activities present narrowly transects and only four which are distributed in the river highlands and basically have no spatial heterogeneity. The damage of human activities is very serious in Xinjian Dyke. Wild plant communities are distributed along the benchland in a band-like plaque in land-lake ecotones. Wild vegetation species have distinct seasonal characteristics with fewer species before June, increasing from June to September, and decreasing again to only a single or a few species after October. Generally, very few species dominate each plaque.

Fig. 1
figure 1

Localization of the study area and distribution of the four belt transects

Soil sampling and determination

Soil samples were collected in February 2021 in four 200 m × 20 m strip zones (No. YD1, YD2, YD3, YD4) (Fig. 1) locating on land and water alternating zones because of the large area of farmland and marsh restriction in this study. Moreover, three sampling sites were arranged in the central region of each strip zone, resulting in a total of 12 soil sample sites. Meanwhile, three sample sites were selected in the north forestland free of exogenous pollution as a control (CK) (Fig. 1). Soil samples with a 20-cm depth were collected with a columnar soil sampler. Each sampling site was collected three times and mixed, and 2 kg of fresh samples was gathered through a quad method (Sun et al. 2021). The samples were then naturally air-dried for testing after removing impurities such as animal and plant residues. Next, 100 g of sample was passed through a 200-mesh nylon sieve and digested by HCl-HNO3-HF-HClO4 solution in a screw-cap polytetrafluoroethylene digestion vessel. The samples were sent to a certified laboratory to determine heavy metal concentration after they were pretreated with 0.45-μm microporous filter membrane. Concentrations of Cd, Cr, Pb, Cu, As, and Zn were determined using an inductively coupled plasma-atomic emission spectrometry (Agilent 7900 ICP-MS, Agilent, USA) with the lowest detection limit of 0.07 mg/kg, 2 mg/kg, 2 mg/kg, 0.5 mg/kg, 0.6 mg/kg and 7 mg/kg, respectively, according to HJ 803–2016, that of Hg was determined using a hydride generation-atomic fluorescence spectrometry (HG-AFS, HD-CG10 made in Shandong Holder, China) with the lowest detection limit of 0.002 mg/kg according to HJ/T 166 -2004. The pH of the soil was measured with a soil-to-deionized water ratio of 1:2.5 and a minimum pH value display unit of 0.01 (Seven Compact™, Mettler-Toledo, Switzerland).

Plant diversity survey

The survey was conducted from October 11 to 14, 2021. A tidal flat survey method was adopted to estimate community type and species richness based on the geographical and topographical characteristics of the study area. Soil samples can be taken every 3 years according to HJ/T 166 -2004. Through the analysis on the local development status and 5-year plan, the major pollution sources are basically unchanged in the first and last 5 years. The soil data measured in this study can reflect the situation in last recent years. Based on this, plant samplings were selected in the same four strip zones as soil sampling. The plant community was distributed along the strips, and 1 m × 1 m quadrats were set according to the distribution characteristics of the plants in the strip zone. Furthermore, 10 m × 10 m quadrats were set for plots with shrubs to statistically analyze the species composition (GB T 30363-2013 n.d.; HJ 710.1.11–2014 n.d.), abundance, coverage, and dominant species of herbaceous and shrub communities (Xie et al. 2014). Scattered and artificially grown plants were not evaluated in this study. Twelve representative wild plant communities were investigated. The strip of each type of community is illustrated in Fig. 2.

Fig. 2
figure 2

Distribution of surveyed communities

Calculation and statistical analysis

PMF model

Positive matrix factorization (PMF) models are relatively advanced receptor models that are often used in ecological assessment. Particularly, this approach has been widely recognized and applied due to its low dependence on source spectrum information. In this model, the original matrix (Xij) is decomposed into the source component spectrum matrix (fkj), sharing rate matrix (gik), and a residual matrix (eij) (Sofowote et al. 2008). The calculation process does not require inputs from the source fingerprint, as shown in Eq. (8):

$${X}_{\mathrm{ij}}={\sum }_{k=1}^{p}{g}_{\mathrm{ik}}{f}_{\mathrm{kj}}+{e}_{\mathrm{ij}}$$
(1)

where Xij is the content matrix in the receptor and represents the content of element j in soil sample i (mg·kg−1); p is the number of factors (i.e., generating sources); gik is the factor contribution matrix, representing the content of j element in source k (mg·kg−1); fkj is the spectral matrix of the factor component, representing the content of j element in the k component of the source (mg·kg−1); eij is the residual matrix, which is calculated according to the definition of the objective function.

The minimum value of the objective function Q in the PMF model is then calculated as shown in Formula 2:

$$\mathrm{Q}=\sum\nolimits_{\mathrm{i}=1}^{n}\sum\nolimits_{\mathrm{j}=1}^{\mathrm{m}}{\left(\frac{xij-{\sum }_{k=1}^{p}{g}_{ik{f}_{kj}}}{{u}_{ij}}\right)}^{2}$$
(2)

where n represents the number of samples, m represents the number of species, and Uij represents the uncertainty of the j species in sample i.

If the heavy metal content is less than or equal to the method detection limit (MDL), the uncertainty (UNC) calculation formula is shown in Formula 3:

$$UNC=\frac{5}{6}\times \mathrm{MDL}$$
(3)

If the heavy metal content is greater than MDL, the calculation formula of UNC is shown in Formula 4:

$$UNC=\sqrt{{\left(c\times RSD\right)}^{2}+0.5\times {\mathrm{MDL}}^{2}}$$
(4)

where c is the species content (mg·kg−1), RSD is the relative standard deviation, and MDL is the given limit of species detection (mg·kg−1).

Importance value

The importance value (IV) is a comprehensive quantitative indicator reflecting the role and position of a certain species in the forest community (GB T 30,363–2013), which is calculated based on various growth forms:

$$\mathrm{IV}=\frac{\left(\mathrm{relative abundance}+\mathrm{relative dominance}+\mathrm{relative frequency}\right)}{3}\times 100$$
(5)

where,

Relative abundance = a plant’s abundance/sum of all plants in the sample plot × 100%

Relative dominance = a plant’s dominance/sum of dominance values of all plants in the sample plot × 100%

Relative frequency = a plant’s frequency/sum of frequency values of all plants in the sample plot × 100%

Plant diversity indices

Plant diversity is frequently characterized using the Shannon–Wiener, Margalef, and Simpson indices (Mulya et al. 2021, GB/T 30363-2013 n.d.).

$$H=-\sum_{i=1}^{s}{P}_{i}\mathrm{ln}{P}_{i},{D}_{M}=\frac{\left(S-1\right)}{\mathit{ln}N},Ds=1-{\sum }_{i=1}^{S}{P}_{i}^{2}$$

where H is the Shannon–Wiener index; S is the number of species present in the community; Pi = ni /N, where ni is the number of individuals of a species and N is the total number of individuals of all species in the community. DM is the Margalef index; DS is the Simpson index.

Statistical analyses

The ArcGIS software was used to draw the spatial distribution map of heavy metal content and plant communities. Heavy metal source analysis was conducted using the EPA PMF 5.0 software. Pearson correlation analysis and regression analysis (RA) were conducted using SPSS 21.0. The standard deviation is shown as an error bar on the graph, and significance thresholds were set at P<0.05 (significant) and P<0.01 (highly significant). The plant species in this study were named according to the global biodiversity information facility (GBIF).

Results

Distribution characteristics and source analysis of heavy metals in soil

The Hg content in the soil of Xinjian Dyke ranged from 2.14 to 2.23 μg/kg, with HgYD2 (the Hg content of strip YD2) exhibiting the maximum value and HgYD3 exhibiting the minimum (Table 1, Fig. 3); the As content ranged from 27.32 to 35.33 μg/kg, AsYD1 was the maximum and AsYD3 was the minimum; the Cd content ranged from 1.38 to 2.4 μg/kg, CdYD2 was the maximum and CdYD4 was the minimum; the Cr content ranged from 115.83 to 134.32 μg/kg, CrYD4 was the maximum and CrYD2 was the minimum; the Pb content ranged from 54.33 to 69.44 μg/kg, PbYD4 was the maximum and PbYD3 was the minimum; the Cu content ranged from 50.11 to 86.89 μg/kg, CuYD4 was the maximum and CuYD1 was the minimum; the Zn content ranged from 205.76 to 223.42 μg/kg, ZnYD3 was the maximum and ZnYD4 was the minimum.

Table 1 Distribution of heavy metal content in soil of four strip zones (mg/kg)
Fig. 3
figure 3

Distribution of heavy metal content in four strips

The PMF model was originally used in the analysis of air pollution sources and has been applied for the assessment of water and soil pollution in recent years (Salim, et al. 2019; Meng, et al. 2021). The average contents of the seven heavy metals were all higher than the CK values, and the contents of Hg, Cd, Cu, and Zn were higher than the maximum allowable values established by the GB15618-2018 (Table 1). This demonstrated that although the distribution difference of soil heavy metals in the study was not obvious, it was affected by human activities to a certain extent. Therefore, PMF sources were analyzed for seven heavy metals. After 20 iterations, the minimum q (robustness value = 128.5) was obtained. When the number of factors was 5, the model had the smallest difference between the objective function and objective function true value, meaning that the calculation scheme was the most stable. Therefore, the source of heavy metals was locked into 5 factors. Fitting results obtained by source analysis are shown in Fig. 4. The Hg element fitting curve R2 was 0.73, and those of the other elements were greater than 0.9, which indicated that the overall accuracy of the PMF model was relatively good. Therefore, this model could fully explain the information contained in the original data, thereby meeting the needs of source analysis. The output results of the model operation are shown in Fig. 5. The source component spectrum overlap was high, indicating the high similarity of the results. Previous studies have shown that heavy metals originate from a variety of industrial sources such as mining and fossil fuel burning, nonferrous metal smelting, machinery manufacturing and electroplating, instrument fabrication, organic synthesis, and manufacturing (Chaturvedi et al. 2018; Kartal et al. 2006; Loska and Wiechuła, 2003; Men et al. 2018). These industrial sources were excluded after field investigation in this study. Additionally, coal heating emission sources were not considered and no film has been used in agricultural land in recent years.

Fig. 4
figure 4

PMF model operation output results

Fig. 5
figure 5

Spectrum of heavy metal sources in four belt strips. F1 is the traffic source; F2 is the agricultural non-point source of organic fertilizer emissions; F3 is the mixed source of soil parent material and organic fertilizer; F4 is a mixture of vehicle exhaust and organic fertilizer; F5 is the soil parent material source

As indicated in Table 2, the main contributing elements of F1 were Cd and Pb, with a contribution rate of 83.9%, whereas the contribution of Hg was 0. The external sources of Cd include industrial sources, fossil fuel combustion, and pesticides, among others. Pb mainly originates from industrial raw material production, metal mining, and transportation emissions (He et al. 2013; Li and Jia 2018). After the investigation, industrial sources such as mining, fossil fuel combustion, nonferrous metal smelting, machinery manufacturing, electroplating instruments, and organic synthesis were excluded. There were no coal heating emission sources and no film had been used in farmlands in recent years. Moreover, the roads were mainly internal roads with less traffic pollution. Therefore, F1 was the pesticide source.

Table 2 The contribution rate of each factor to heavy metals

The main contributing elements of F2 were Cu and Hg, which contributed to 73.7%, whereas Cd made an extremely low contribution. Cu mainly comes from mining, smelting processes, fossil fuel combustion, and organic fertilizers. As is mainly related to fossil fuel combustion, soil-forming matrix, pesticides, and thin film use (Cai et al. 2015). Therefore, F2 was a mixed source of soil-forming matrix and organic fertilizer.

The main contributing elements of F3 were Zn and Pb, which contributed to 64.2%, and Cd had an extremely low contribution. The main sources of Zn are zinc mining, smelting processing, and industrial emissions, as well as the use of livestock manure (Xu et al. 2014; Yuan et al. 2019). The production of Pb is related to mineral mining, smelting and processing, industrial emissions, and transportation emissions (Cloquet et al. 2006). Therefore, F3 was an agricultural source of livestock manure.

The main contributing elements of F4 were Cu and Pb, which contributed to 66.5%. Therefore, F4 was an organic fertilizer pollution source.

The main contributing elements of F5 were Hg and Cr, which contributed to 75.7%, whereas the contributions of Cu and Pb were 0. Hg mainly comes from industrial sources, sewage irrigation, household garbage, and coal burning for heating (Wu and Zhang 2010; Talukder et al. 2021). Cr is a typical product of the plating process and is also derived from diagenesis (Du et al. 2008), which was also shown as such in previous studies (Xu et al 2015). Therefore, F5 is a mixed source of sewage irrigation, domestic waste, and soil-forming matrix.

In summary, the main source of Cd was pesticide; Cu and As originated from a mixed source of the soil matrix and organic fertilizer; Zn from agricultural livestock manure; Pb mainly from pesticide, organic fertilizer, and livestock manure; Hg and Cr from sewage irrigation, domestic waste, and the soil matrix.

Analysis of the community species composition

The structure of plant communities at different taxon levels (family, genus, and species) can serve as an indicator of the responses of these communities to various environmental factors. There were 25 species of wild plants belonging to 22 genera and 13 families in the Xinjian Dyke wetland, among which Gramineae and Compositae accounted for 24% and 16% of the total species, respectively (Table 3). The most common species were Zizania latifolia (Griseb.) Stapf, Cynodon dactylon (L.) Pers., Eccoilopus cotulifer (Thunb.) A.Camus, Monochoria vaginalis (Burm.f.) C.Presl, Artemisia annua L., Erigeron annuus (L.) Pers., Pterocypsela elata (Hemsl.) C.Shih, Ranunculus polii Franch. ex Hemsl., Polygonum criopolitanum Hance, Polygonum perfoliatum (L.) L., Broussonetia papyrifera (L.) Vent., Morus alba L., Calystegia sepium (L.) R.Br., Cerastium glomeratum Thuill., and Pyrus betulifolia Bunge.

Table 3 Species composition of hygrophytic plant species in four strip zones

The community was divided into eight types based on the IV values: Z. latifolia community (Two), E. cotulifer community (Two), C. dactylon/Setaria viridis (L.) P.Beauv. community, Kalimeris indica (L.) Sch.Bip./C. dactylon community, Astragalus sinicus L. community, R. polii community, B. papyrifera community (Two), and P. betulifolia community (Two). With the highest IV value as the dominant species and an IV of 15 as the companion species, the shrub and herb community similarities were very high. The plots showed a single species as the absolute dominant species. The dominant species with the highest IV value was the shrub P. betulifolia (IV = 33.6%), and the highest IV in the herb communities belonged to Z. latifolia (29.45%). The communities were mostly consociations, the six communities with the dominant species of Z. latifolia, B. papyrifera, and P. betulifolia contained 3–4 companion species, the A. sinicus community had 2, and the K. indica community had 1. In contrast, the R. polii community, E. cotulifer community, and C. dactylon/S. viridis communities had no companion species. The important value (IV) is a characteristic indicator of community building species and dominant species, reflecting the functional status of species in the community. The B. papyrifera community and Z. latifolia communities had 3–4 companion species and higher IV, with no more than 7 community species, two E. cotulifer communities, the K. indica/C. dactylon communities, and the K. indica/S. viridis communities had lower IV and only had 0–1 companion species. However, the number of species exceeded nine. This was because higher importance values or more dominant species will occupy more niche temporal or spatial dimensions, and the inhibition of non-companion species growth reduced species diversity.

Analysis of the community diversity index

The diversity index reflects the community species diversity. The species diversity of the four strip zones was compared using different indices.

Shannon–Wiener index

The H values of the four belt strips were the highest in the E. cotulifer-YD4 community and lowest in the B. papyrifera community-YD2 (Fig. 6). The species diversity of the community exhibited the following descending order according to the H value: E. cotulifer community-10/YD4 > E. cotulifer community-3/YD1 > K. indica/C. dactylon community-7/YD3 > A. sinicus community-12/YD4 > C. dactylon/S. viridis community-8/YD3 > R. polii community-11/YD4 > B. papyrifera community-9/YD3 > P. betulifolia community-6/YD2 > P. betulifolia community -4/YD2 > Z. latifolia community-1/YD1 > Z. latifolia community-5/YD2 > B. papyrifera community-2/YD1.

Fig. 6
figure 6

Diversity index statistics of the twelve plant communities. Note: a comparison of diversity indices for the populations possessing same dominant species, P < 0.05; b, c, d comparison of diversity indices for the same strip zone, P < 0.05

Simpson index

The DS values of the four strip zones were all the highest in the E. cotulifer community-YD4 and the lowest in the B. papyrifera-YD2 community. The community species diversity exhibited the following descending order based on the DS values: E. cotulifer community-10/YD4 > E. cotulifer community-3/YD1 > K. indica/C. dactylon community-7/YD3≈ A. sinicus community-12/YD4 > C. dactylon/ S. viridis community-8/YD3≈ R. polii community-11/YD4 > B. papyrifera community-9/YD3 > P. betulifolia community-6/YD2≈ Z. latifolia community-1/YD1 > P. betulifolia community-4/YD2 > Z. latifolia community-5/YD2 > B. papyrifera community-2/YD1 (Fig. 6). The above-described analyses indicated that the change trend of the DS value of the four strip communities is basically the same as the H value. Correlation analysis showed that the H and DS values were significantly correlated (p < 0.01, r = 0.969) (Table 4).

Table 4 Correlation analysis between soil heavy metals and diversity indices in four strip zones

Margalef index

The highest DM value of the four belt strips was that of the E. cotulifer community-YD4 community, whereas the lowest DM value was that of the Z. latifolia community-YD1. According to the DM value, the community species diversity exhibited the following descending order: E. cotulifer community-10/YD4 > K. indica/C. dactylon community-7/YD3 > E. cotulifer community-3/YD1 > C. dactylon/S. viridis community-8/YD3 > R. polii community-11/YD4 > B. papyrifera community-9/YD3 > A. sinicus community-12/YD4 > P. betulifolia community-6/YD2 > B. papyrifera community-2/YD1 > P. betulifolia community-4/YD2 > Z. latifolia community-5/YD2 > Z. latifolia community-1/YD1. However, the change trend was inconsistent with that of the Shannon–Wiener and Simpson indices. Correlation analysis showed that the DM value was not correlated with the H and DS values (Table 4).

Discussion

Relationship between index change and species diversity

The Shannon–Wiener index includes richness and evenness and can therefore comprehensively reflect the diversity and evenness of the community (Wilsey and Stirling 2007). A larger Shannon–Wiener index means that there are more plant species in the community, and therefore, there is a higher species richness. At the same time, the uniformity of the individual distribution among species increases the community diversity. The Margalef index reflects the species richness, whereas the Simpson index is more closely related to frequency and species uniformity. Zhang et al. (2015) reported that species richness takes the number of species as an indicator, implying that all species contribute equally to the community. However, plants do not have equivalent functions in a community (Zhang et al. 2015; Adrián et al. 2021), and therefore, species richness alone cannot explain the functional aspects of plant communities. Our findings suggested that the Simpson and Shannon–Wiener indices of the plant community in the study area remained consistent. The developing wetland community has a more uniform succession trend, and therefore, the uniformity of the individual distribution among species was an important determinant of community diversity. The Margalef index was strongly complementary to the Simpson and Shannon–Wiener indices. After comparing the community diversity index in the same zone (Fig. 6), the H values and DS values of the three clusters in the YD1 strip change trends were consistent. The highest value was observed in the E. cotulifer community, the lowest value was observed in the B. papyrifera community, with both showing significant differences (p < 0.05). The highest DM value was observed in the E. cotulifer community, whereas the lowest value was observed in the Z. latifolia community, all communities showed significant differences (p < 0.05). Among the H and DM trends of the three clusters in the YD2 strip, the highest value was observed in the P. betulifolia community, whereas the lowest value was observed in the Z. latifolia community, and all of these values showed significant differences (p > 0.05). The lowest DS value was observed in the Z. latifolia community; however, there was no significant difference between the two P. betulifolia communities (p > 0.05). Among the H and DM trends of the three clusters in the YD3 strip, the highest value was observed in the K. indica/C. dactylon community, whereas the lowest value was observed in the B. papyrifera community, all of these values showed significant differences (p > 0.05). The highest DS value was observed in the K. indica/C. dactylon community. However, no significant difference was observed between the R. polii community and the B. papyrifera community (p > 0.05). The changing trend of the H values and DS values of the E. cotulifer community in the YD4 strip was as follows. The highest value was observed in the E. cotulifer community. There were no significant differences between the A. sinicus community and the C. dactylon/S. viridis community. The highest DM value was observed in the E. cotulifer community, whereas the lowest value was observed in the A. sinicus community, both with significant differences (p > 0.05).

The trend of the H and DS values within the YD1 and YD4 strips was the same, showing that the Shannon–Wiener index was closely related to the enriched species. The H and DM values exhibited identical trends in the clusters within the YD2 and YD3 strips, indicating that species richness contributes significantly to the Shannon–Wiener index. From the same zone of species diversity index comparison and the overall comparison of the diversity of four strips, the convergence and specificity of the three indices have some conflict. This uncertainty indicated that the diversity indices were not only related to biological factors such as abundance, richness, dominant species, and individual growth, but also to abiotic environmental factors (Fu et al. 2020; Jin et al. 2011; Zhang et al. 2016).

Effect of heavy metal stress on plant diversity

The survey in February 2021 indicated that the high degree of anthropogenic disturbance was evidenced by the high degree of community similarity and small patch aggregation and distribution in the study area. The abiotic factors that affect the diversity of terrestrial plants in the study area include water level, water quality, grazing, mowing, the Yangtze-to-Huaihe Water Diversion Project, and soil quality. The four belt strips were all located in the alternate areas of land and water, and therefore, the effects of the difference in water level and water quality can be excluded. There was no difference in grazing and mowing activity intensity between the four strips. The research area was carrying out the Yangtze-to-Huaihe Water Diversion Project, which had little interference in plant diversity, and sampling in this study was conducted away from the engineering area. Therefore, the above factors did not contribute to the differential variation of the community species diversity. The factors affecting soil quality included artificial factors such as pH, salinity, nutrients, soil texture, and structure. The pH value of the four strips was 5.37–6.63, the soil was weakly acidic, the soil organic matter (SOM) was 1.09–1.88%, with a variation coefficient of 0.112 (previous investigation of this study, unpublished) and weak dispersion. Moreover, there was a small spatial difference in soil texture and structure, with pollutants likely being the main influencing factor. Therefore, this study explored the impact of heavy metal pollution on species diversity.

Table 4 summarizes the results of the correlation analysis of heavy metal content and plant diversity indices. There was no significant correlation between Hg and the three diversity indices. Cr was positively correlated with the Shannon–Wiener and Simpson indices (P < 0.01), with correlation coefficients of 0.741 and 0.762, respectively. As was positively correlated with the Margalef index (P < 0.05, r = 0.652). These findings indicated that as the content of Cr and As in the strip increased, the corresponding diversity index also increased. The positive effect of these metals on plant diversity could potentially be attributed to a combination of factors including SOM and other soil physical and chemical parameters, as well as the biological characteristics of plant species, microorganisms, and other neglected environmental factors (Ma et al. 2022). Alternatively, the presence of heavy metals may reduce the advantages of a single plant, increase competition among plants, and promote plant coexistence (Garbisu et al. 2020; Lalor et al. 2004; Li et al. 2019a, b). The positive effect of heavy metals on species diversity might be related to their complementary effect or selection effect on plant species. The complexity and specificity of regional plant diversity increased with increasing soil heavy metal content, which may be due to the use of supplementary resources by some species to promote inter-species diversity. It may also be that increasing species numbers increases the likelihood of the random introduction of high-yielding species (Zhang et al. 2015). Additionally, the existence of some species with heavy metal enrichment ability alleviates the toxic stress of heavy metals on other species, thereby improving community species diversity.

In contrast, Cd and Zn were negatively correlated with the diversity index. The correlation coefficient between Cd and the Shannon–Wiener index was − 0.666 (P < 0.05), whereas that between Cd, Zn, and the Simpson index was − 0.636 and − 0.594 (P < 0.05), respectively, indicating that as Cd and Zn content increased, the corresponding diversity index decreased. The content of Cd and Zn in this study exceeded the GB 15,618–2018 guidelines and exhibited different degrees of toxicity and ecological risk to individual plants and their populations. Therefore, high concentrations of these heavy metals had negative effects on community diversity indices. The content of Hg exceeded GB 15,618–2018 but was not correlated with the community species diversity index. The toxicity of Hg on individuals and populations may be inhibited by other environmental factors, thus indirectly reducing the negative effect of Hg on species diversity. This hypothesis was mentioned in Connell’s theory of intermediate interference (Connell 1978). Fantaw et al. (2006) demonstrated that the distribution characteristics of vegetation were greatly influenced by the soil and other environmental factors, in addition to their mutual influence.

Four strips had a high Cu background value (66.40 mg/kg) and were above the standard value (50 mg/kg). The Cu contents of the YD1 and YD2 strips were lower than the background values, whereas the Cu contents of the YD3 and YD4 strips were higher than the background values. Cu occurred mainly as a mixed source of the soil matrix and organic fertilizer, which made the Cu distribution uneven. Deng et al. (2021) found that the Cu concentrations under the plant tolerable conditions were significantly positively correlated with plant height, stem thickness, crown diameter, and plot coverage, which promotes the species diversity of the community. Therefore, the plant species in the study area developed a certain tolerance to Cu, and Cu showed a very significant positive correlation with the Shannon–Wiener and Simpson indices (p < 0.01; the correlation coefficient r was 0.983 and 0.970, respectively). After analysis, the Cu content of the four strips exceeded the Chinese GB 15,618–2018 standard value (Table 1). Although Cu is a trace element needed for plant growth, the growth and distribution of plants can be limited if the Cu content exceeds the upper limit of the plant’s nutrient tolerance. This might partly explain the negative correlation between Cu and the Margalef index, which indicated that Cu had a certain inhibitory effect on the community species composition and limited the community richness.

Conclusion

Plant communities often exhibit a single absolute dominant species, with dominant and associated species with high IV occupying key ecological niches. However, this inhibits the growth of non-associated species and thus reduces the community species diversity. Source analyses of seven heavy metals by PMF in the wetland indicated that the seven heavy metals originated from pesticides, fertilizers, livestock manure, sewage irrigation, agricultural sources, and soil matrix sources. Heavy metals affect community species diversity by influencing the enriched species evenness and richness of plant communities.

In conclusion, we must reasonably increase the number of tolerant species and maximize species niche differentiation to improve the differences and functional complementarity between coexisting individuals. Native pioneer plants should be given priority in artificial planting, whereas functional redundant species should be avoided. Additionally, more studies are needed to confirm the changes in soil heavy metal distribution characteristics and heavy metal accumulation process before, during, and after ecological restoration.