Introduction

Biogeographic regionalizations represent fundamental abstractions of the geographic organization of life on Earth in response to past or current physical and biological forces, and are central to many basic and applied questions in biogeography, ecology, evolution, and conservation (Kreft & Jetz, 2010; Whittaker et al., 2013; Lomolino et al., 2010). Since biogeographic regionalizations are traditionally developed by experts and are based on qualitative evidence, the lack of transparency and quantitative support has set constraints on their utility (Kreft & Jetz, 2010). In fact, these kinds of regionalizations have not always been based on detailed species distribution data, while accurate distribution mapping is the basis for detecting biogeographic patterns and processes (Riddle et al., 2011). The recent availability of multivariate techniques, enhanced computational power, and information on species distribution now enable a quantitative scrutiny and expansion of biogeographic regionalizations that will facilitate new and more rigorous uses (Kreft & Jetz, 2010). Different quantitative methods have been applied to define statistically derived biogeographic regionalizations: cluster analysis (Márquez et al., 2001; Kreft & Jetz, 2010; Rueda et al., 2010; Heikinheimo et al., 2012; Linder et al., 2012; Mateo et al., 2013; Abbate et al., 2016; Divíšek et al., 2016), the network approach (Vilhena & Antonelli, 2015), and fuzzy logic (Olivero et al., 2013). All these methods provide objective approaches for classifying biota, because they use quantitative measures of similarity between areas to reveal natural patterns of distribution. Furthermore, both Olivero et al. (2013) and Vilhena and Antonelli (2015) have remarked that in a biogeographic regionalization it is important to combine sharp and precise boundaries with gradual transition zones.

In terms of its biogeographic features, Italy is a very interesting country, belonging to both the Eurosiberian and Mediterranean Regions. Italy is of particular interest due to its latitudinal extension, great environmental and climatic heterogeneity, richness in vascular flora (it contains approximately half of the number of species found in Europe), richness in vegetation types, and its position in the Mediterranean Basin (see Abbate et al., 2015, and references therein). The ecoregional classification of Italy composed by Blasi et al. (2014) reflects this complexity, providing 7 provinces, 11 sectors, and 33 subsectors. Italy’s countryside is characterised by a rich diversity of cultural landscapes, shaped by traditional land-uses (Plieninger et al., 2006). Moreover, the Italian peninsula provided refuge areas for flora during the glacial period (Svenning et al., 2008); the modern heterogeneity of vegetation was also observed in the Holocene history of trees (Magri et al., 2015).

Despite different studies being conducted over the last 60 years (Giacomini, 1958; Takhtajan, 1986; Rivas-Martinez et al., 2004; Blasi et al., 2007; Abbate et al., 2016), the phytogeographic regionalization of peninsular Italy, in particular with regard to the central sector, is not completely well defined. In fact, the extension, boundaries, names, and the inner delimitation of the Eurosiberian and Mediterranean regions have repeatedly changed over time. Following the regionalization by Rivas-Martinez et al. (2004), inland areas of central Italy belong to the Apennine sector of the Eurosiberian Region; coastal areas of the western Tyrrhenian side and of the eastern Adriatic one both belong to the Mediterranean Region and can be referred to the “Settore Italico of Provincia Tirrenica” and the “Settore Apulo of Provincia Adriatica”, respectively. However, the regionalizations by Rivas-Martinez et al. (2004), and previously those by Giacomini (1958) and Takhtajan (1986), are expert-based and not statistically derived like the regionalizations by Blasi et al. (2007) and by Abbate et al. (2016). Particularly, the recent regionalization by Abbate et al. (2016), based on native woody flora and on the higher-taxon approach, has contributed to describe the phytogeographic structure of Italy at the scale of administrative regions; in central Italy, a Tyrrhenian Apennine sector and an Adriatic Apennine sector have each been described. Unfortunately, the database used did not allow for further distinction at a more detailed spatial scale within the Mediterranean and the Temperate Regions.

The choice to analyse woody plants (including trees, shrubs, and lianas) was made because they are important biodiversity surrogates for highly heterogeneous areas. In Italy, the richness of native woody plants has been shown to be a good predictor of the overall native vascular flora richness at a medium spatial scale (corresponding to administrative regions) (Abbate et al., 2015). Moreover, woody plants represent one of the best groups of organisms for examining the role of climate in shaping the geographic variation of species richness (Qian, 2013). As concerns forest ecosystems, data on woody plant species are also valid predictors of both beta and gamma diversity of all plant species (Giorgini et al., 2015); despite this, evidences also occurred that this is not always true: for instance, in recent studies tree species diversity appeared not to be among the drivers of the overall species diversity relatively to the understorey of European forests (Ampoorter et al., 2016). Woody flora and trees, providing habitat structure and therefore being biotic drivers of animal distribution, also have a strong influence on the biogeographic structure of many animal groups (Rueda et al., 2010; Heikinheimo et al., 2012). Therefore, for phytogeographic regionalization, using suitable surrogate groups, for which taxonomic and distributional data are more easily available, can be considered a useful approach, especially for highly biodiverse territories.

In this framework, the use of an higher-taxon approach also represents a good tool; in fact, reduction in taxonomic resolution is desirable because it could allow for more rapid acquisition of knowledge while requiring less effort, if only some information is lost (Landeiro et al., 2012). The higher-taxon approach has been largely tested as a surrogate for species richness for a number of taxa, including woody plants (e.g. Balmford et al., 1996; La Ferla et al., 2002; Prinzing et al., 2003; Villaseñor et al., 2005; Mandelik et al., 2007; Landeiro et al., 2012; Alves et al., 2016); some studies have also tested the usefulness of the higher-taxon approach to describe patterns in the compositional turnover of species (e.g. Prinzing et al., 2003; Bergamini et al., 2005; Heino & Soininen, 2007; Mandelik et al., 2007; Landeiro et al., 2012; Alves et al., 2016; Abbate et al., 2016; Latini et al., in prep.).

Nowadays, the importance of rich datasets covering various components of biodiversity and comprising data from different sources is emphasized. For the last 200 years, information on species distribution has mostly been scattered in natural history collections and scientific publications or monographs. Now, this information is increasingly available in digital formats: databasing of plant diversity data became one of the major objectives in biodiversity informatics, and floristic records provide baseline data for several studies (Lavoie, 2013; Bedini et al., 2016). In the last few years, several georeferenced floristic databases have been created for central Italy (Conti et al., 2010; Latini et al., 2014; Peruzzi & Bedini, 2015).

In this study, we selected an area in central peninsular Italy and used rich georeferenced floristic databases. We aimed to (1) develop a statistically derived biogeographic regionalization based on native woody flora, investigating environmental correlates, and (2) assess congruence between the biogeographic patterns of native woody species, genera, and families.

Materials and Methods

Study Area

The study area is located in central Italy and includes two administrative regions (Lazio and Abruzzo) (Fig. 1). It has a surface area of ca. 28,040 km2 (ISTAT, 2016a). A continuous mountain ridge (the Central Apennines) separates the western Tyrrhenian slope from the eastern Adriatic one, with an altitudinal range of 2912 m from the coast to the highest peak of the entire Apennines; in the western slope, the Anti-Apennine mountains and volcanoes are present. Six small islands in the Tyrrhenian Sea are included. The Tyrrhenian coast is ca. 375 km long, and the Adriatic one is ca. 140 km. The percentage cover of the three standard elevation belts (ISTAT, 2011) is 10% plains, 44% hills, and 46% mountains. The climate is characterised by four climatic regions (Temperate, Mediterranean and two transitional regions) and six bioclimates (Blasi & Michetti, 2007). The most widespread lithotypes are limestones and dolomites (34%), alluvial sediments and clays (22%), volcanic rocks (18%), and flysch deposits and arenaceous assemblages (17%) (Geoportale Nazionale, 2016). Thus, a clear west-to-east gradient is present in terms of topography, climate, and lithology, producing high environmental diversification. According to the Coordination of Information on the Environment (CORINE) Land Cover 2012 classification (ISPRA, 2012), agricultural areas cover 52% of the area, forests cover 27%, scrub and herbaceous vegetation covers 14%, and artificial surfaces cover 5%. In terms of vegetation, 34 series and seven geosygmeta have been described (Blasi, 2010), mostly including deciduous Quercus-Fagus forests and secondarily sclerophyllous ones. The 7,200,000 inhabitants are mostly concentrated in coastal areas, especially in the cities of Roma and Pescara (ISTAT, 2016b).

Fig. 1
figure 1

Study area (Lazio and Abruzzo administrative regions, central Italy) with the used 10 × 10 km grid cells (colours from white to black correspond to the increasing elevation; geographic coordinates in WGS84)

Data Collection

For our study, we selected all native woody taxa at present certainly occurring in the Lazio and/or Abruzzo administrative regions (Bartolucci et al., in prep.; see Bartolucci et al., 2016). All taxa belonging to phanerophyte and nano-phanerophyte life-forms sensu Raunkiaer (Pignatti, 1982) were considered, including some taxa being phanerophytes or nano-phanerophytes only secondarily (namely Asparagus acutifolius, Atriplex halimus, and Santolina etrusca). We excluded from the analysis groups of species and hybrids. Alien taxa (national and/or regional) were generally not considered; however, Abies alba (doubtfully native for Lazio), Olea europaea, and Cercis siliquastrum (both naturalized to Abruzzo), were considered in both regions. Periploca graeca (doubtfully native to Abruzzo) was not considered. There were 229 selected species belonging to 106 genera and 51 families.

Taxa distribution data were compiled from various sources including herbarium specimens as well as published and unpublished data; data were stored in the georeferenced floristic databases of Lazio and Abruzzo regions (Conti et al., 2010; Latini et al., 2014). The basic herbarium collection data were provided by RO, APP, and UTV (acronyms follow Thiers, 2016). Published data included mainly floristic and phytosociological papers. Unpublished data included field observations we recently collected in central Italy. Only data identified to the species level were considered in this study. We excluded doubtfully identified taxa. Only data collected after 1950 were considered; this threshold was chosen following botanical and floristic literature, in which records collected after this year are considered recent (Scoppola & Magrini 2005; Celesti-Grapow et al., 2009; Conti et al., 2010). Cultivated and naturalized taxa were excluded. Although herbarium and published records did not usually provide latitudinal and longitudinal information, we were able to georeference many of these records based on the original location descriptions. An accuracy value, according to Conti et al. (2010), was used to express spatial resolution of the georeferenced data. Only georeferenced data with accuracy values of 1 (originally georeferenced data or point site), 2 (corresponding to an area of nearly 1 km2 around the reference point) or 3 (roughly corresponding to a municipality area) were considered. Data with an accuracy value of 4 (wide geographic range) or 5 (regional presence) were not considered. Nomenclature follows Bartolucci et al. (in prep.; see Bartolucci et al., 2016). The resulting dataset included a total of 43,968 taxa occurrence data.

A 10 km × 10 km equal-area grid (European Environmental Agency, 2016) was chosen to appropriately reflect the spatial accuracy of the distribution data, and in accordance with the study goal (Kreft & Jetz, 2010); this spatial resolution (analysis grain) was chosen because a higher resolution was not possible due to the spatial error in the geo-referenced data. The 10 × 10 km grid was superimposed on the map of the Lazio and Abruzzo administrative regions (ISTAT, 2016a). All the cells and portions of cells were initially considered, the correlations between cell surface and number of species being weak (Pearson r = 0.33). The resulting grid included a total of 344 cells (201 for Lazio, 112 for Abruzzo, and 31 at the boundaries between the two regions) (Fig. 1).

Subsequently, the 10 × 10 km grid was superimposed onto taxa distribution data. Three incidence matrices (for species, genera, and families) were created, containing species, genera, and families presence-absence data, respectively, within each grid cell. Cells with five or fewer species, showing also a low sampling intensity, were removed from the analysis (Fig. 2). The final dataset comprises 290 cells, 224 species, 103 genera, and 80 families, with in all 11,682 species presence data, 8194 genera presence data, and 5342 families presence data.

Fig. 2
figure 2

Number of records and number of species per grid cells

For each grid cell or portion of a grid cell, a set of environmental variables was recorded. Coordinates (X and Y, WGS84 UTM 33 T) and distance from the sea (DIST_SEA, km) were calculated, considering centroids. Minimum, mean, maximum and range values for elevation (ELEV_MIN, ELEV_MEAN, ELEV_MAX, ELEV_RANGE, m a.s.l.) were calculated using a 20 m DEM (ISPRA, 2016). The mean values of the following bioclimatic variables were obtained from WorldClim (Hijmans et al. 2005; WorldClim, 2016): annual mean temperature (T_MEAN, °C), maximum temperature of the warmest month (T_MAX, °C), minimum temperature of the coldest month (T_MIN, °C), annual temperature range (T_RANGE, °C), annual precipitation (P_ANN, mm), precipitation of the wettest month (P_WET, mm), and precipitation of the driest month (P_DRY, mm). Surface covered by each geolithological type was obtained using geolithological map from the Geoportale Nazionale (2016), with the types abbreviated as follows: limestones, dolomites, marls, travertines, and glacial deposits (LIT_LIM), alluvial sediments and clays (LIT_ALL), volcanic rocks (LIT_VOL), arenaceous assemblages and flysch deposits (LIT_ARE), sands, conglomerates, and aeolian deposits (LIT_SAN), lakes and glaciers (LIT_LAK), and gypsum and evaporite deposits (LIT_EVA). Surface covered by each CORINE Land Cover category was obtained using CORINE Land Cover 2012 map (ISPRA, 2012), with the category abbreviated as follows: artificial surfaces (CLC_1), agricultural areas (CLC_2), forests and seminatural areas (CLC_3), wetlands (CLC_4), water bodies (CLC_5).

All the analyses were performed using the software QGIS version 2.6.1 (QGIS Development Team, 2014).

Data Analysis

To quantitatively delineate phytogeographical regions of central Italy based on native woody flora, we used multivariate techniques, as recommended by Kreft & Jetz (2010).

Analyses were performed at species, genus, and family taxonomic levels.

Using the βsim index between cells based on presence-absence data of species, genera, and families respectively, dissimilarity matrices for species, genera, and families were created. Values of βsim varied from 0, for the identical taxa composition of two grid cells, to 1 for grid cells that do not share any taxon. Βsim does not take shared absences (zero values) into account: for biogeographic regionalizations, presence-only measures are important because areas which lack many species and share none should not to be found similar purely due to shared absence (Linder et al., 2012). The advantage of βsim over other distance metrics is its independence of species richness gradients in the study area, since it is not sensitive to major differences in species richness (Koleff et al., 2003; Kreft & Jetz, 2010; Leprieur & Oikonomou, 2014).

To extract the discontinuity sets in native woody flora composition, each dissimilarity matrix was submitted to the clustering procedure using Ward’s minimum variance method (Ward, 1963). Hierarchical algorithms, which construct a hierarchy of clusters, are particularly useful in biogeographical analysis, because biogeographical regions are hierarchically arranged (Kreft & Jetz, 2010; Vilhena & Antonelli, 2015). Ward’s method minimizes the sum of the within-group sums of squares merging the clusters only if they increase the within-cluster variation the least; therefore, it was chosen because it is recommended when within-cluster homogeneity is desired (Legendre & Legendre, 1998; Ramette, 2007). Since Ward’s method works in Euclidean space, but the βsim index produces a non-Euclidean dissimilarity matrix (Legendre & Legendre, 1998), we used a βsim matrix corrected by using the correction by Cailliez (1983), which computes the smallest positive number and adds it to each dissimilarity value. This procedure was successfully applied by Divíšek et al. (2014). We did not include spatial constraints during clustering; cells were clustered based on their taxa composition without regard to their spatial proximity so as to not force cohesion of clusters not justified by the actual distributions of taxa.

Indicator species analysis (ISA) (Dufrêne & Legendre, 1997) was used to identify taxa (species, genera, and families) associated with each cluster or with the combination of clusters (De Cáceres et al., 2010). The indicator value index (IndVal) resulting from the analysis is the product of two components (A and B); component A (specificity), in the case of presence/absence data (Bakker, 2008), is the ratio between the number of cells in a cluster occupied by the taxon and the total number of cells occupied by the taxon; component B (frequency) is the ratio between the number of cells in a cluster occupied by the taxon and the total number of cells in the cluster. Values of IndVal varied from 0 for a species that is absent from a cluster and 1 for species that occurs in all samples within a cluster and does not occur in other clusters. For each cluster or combination of clusters, indicator taxa were those with an IndVal higher than 0.25, as suggested by Dufrêne and Legendre (1997), and for p < 0.05. ISA was successfully used by Mateo et al. (2013) to identify species associated to biogeographic units.

To look for interpretable clusters to retain, we based on the visual inspection of the dendrograms and on the taxa-based maps, and on the ISA results.

In order to analyse the relationships between clusters and environmental variables, basic statistics for environmental variables were obtained and visually examined using boxplots and barplots.

In order to assess congruence among the taxonomic levels, woody flora patterns at the species, genus, and family levels were compared to each other. Dissimilarity matrices were compared using a Mantel test based on Pearson’s product-moment correlation (Mantel, 1967; Legendre & Legendre, 1998); the significance of the resulting Mantel statistics was evaluated by a permutation test (999 permutations). Species and genera site topology resulting from cluster analysis was compared by applying Pearson’s chi-square test (with simulated p value based on 2000 replicates) on a contingency table.

All analyses were performed in R software (R Core Team, 2015) using ‘vegan’ (Oksanen et al., 2016) and ‘indicspecies’ (De Cáceres & Legendre, 2009) packages.

Results

At all the considered taxonomic levels, the dendrograms (Fig. 3), the mapping of the clusters (Fig. 4), and the ISA results suggest the presence of three well-defined clusters (named S1, S2, and S3 at the species level; G1, G2, and G3 at the genus level; and F1, F2, and F3 at the family level). Although we did not include spatial constraints to influence the clustering of neighbouring cells, phytogeographic regions for native woody flora were quite spatially cohesive, especially at the species level. Further interpretable subclusters can be identified at the species level (Figs. 5, 6).

Fig. 3
figure 3

Dendrograms and phytogeographic units resulting from the cluster analysis (Ward method on βsim dissimilarity with Cailliez correction) conducted on the occurrence data of native woody flora in the 10 × 10 km grid cells at species, genus, and family levels

Fig. 4
figure 4

The three phytogeographic units of Lazio and Abruzzo resulting from the cluster analysis (Ward method on βsim dissimilarity with Cailliez correction) conducted on the occurrence data of native woody flora in the 10 × 10 km grid cells at species, genus, and family levels

Fig. 5
figure 5

Dendrograms and phytogeographic subunits resulting from the cluster analysis (Ward method on βsim dissimilarity with Cailliez correction) conducted on the occurrence data of native woody flora in the 10 × 10 km grid cells at species level

Fig. 6
figure 6

The six phytogeographic subunits of Lazio and Abruzzo resulting from the cluster analysis (Ward method on βsim dissimilarity with Cailliez correction) conducted on the occurrence data of native woody flora in the 10 × 10 km grid cells at species level

Results obtained across different taxonomic levels are congruent with each other. In fact, Mantel tests show a strong correlation between dissimilarity matrices of species and genera (r = 0.8784, p = 0.001), between genera and families (r = 0.8368, p = 0.001), and between species and families (r = 0.7286, p = 0.001); chi-square tests calculated on contingency tables reveal that site topologies are also congruent with each other for both the two-cluster and the three-cluster solutions (Table 1). However, the hierarchical relationships between clusters present some differences between taxonomic levels. In fact, the highest dendrogram division separates clusters S1 and S2 from S3 at the species level, clusters G1 and G2 from G3 at the genus level, while at the family level, the first division separates cluster F1 from F2 and F3.

Table 1 Contingency tables for phytogeographic units at species (S1, S2, S3), genus (G1, G2, G3), and family (F1, F2, F3) levels and Pearson’s Chi-squared test with simulated p-value (based on 2000 replicates): a) species/genera contingency table: X-squared = 203.14, p-value <0.001; b) genera/families contingency table: X-squared = 128, p-value <0.001. c) genera/families contingency table: X-squared = 179.76, p-value <0.001

Hereafter, we report the most important features of the clusters to highlight affinities and differences between taxonomic levels. Moreover, features of the subclusters identified at the species level are also reported. For the three-cluster solutions, dendrograms are in Fig. 3 and maps are in Fig. 4; boxplots for the most important environmental variables are in Fig. 7; ISA results are in Table 2; extended results of numbers of cells, taxa and basic statistics for environmental variables are in Appendix, Tables 3, 4, and 5; extended results of ISA are in Appendix, Tables 6, 7, and 8. For the six-cluster solution, dendrogram is in Fig. 5 and map is in Fig. 6; other results are available under request.

Fig. 7
figure 7

Boxplot for the most important environmental variables for the three phytogeographic units resulting from the cluster analysis (Ward method on βsim dissimilarity with Cailliez correction) conducted on the occurrence data of native woody flora in the 10 × 10 km grid cells at species, genus, and family levels

Table 2 Indicator Species Analysis results: indicator taxa and indicator values (IndVal) for each native woody flora phytogeographic unit or combination of units at species (S1, S2, S3), genus (G1, G2, G3), and family (F1, F2, F3) levels

Clusters S1, G1, and F1 can all be identified as a Mediterranean unit. They are characterized by the lowest number of cells and number of taxa per cluster; they are located in the low-elevation coastal areas of both the Tyrrhenian and Adriatic slopes and have the highest annual mean temperature values and the lowest annual precipitation values. Alluvial sediments (i.e. LIT_ALL), limestones (i.e. LIT_LIM), and sands (i.e. LIT_SAN) prevail for lithotypes, and agricultural areas prevail for the CORINE Land Cover classification. In the ISA results 32 indicator species were identified for S1 (e.g. Pistacia lentiscus, Myrtus communis, and Lonicera implexa), of which six are exclusive (e.g. Juniperus turbinata and Euphorbia dendroides); 17 indicator genera were identified for G1 (e.g. Myrtus, Ceratonia, and Rosmarinus), of which three are exclusive (e.g Anthyllis and Medicago); 6 indicator families were identified for F1 (e.g. Myrtaceae, Cistaceae, and Lamiaceae), of which one is exclusive (Arecaceae).

Clusters S2, G2, and F2 can all be identified as a transition unit between the Mediterranean and Eurosiberian units. For cluster S2, the number of cells and taxa is the highest among species-level clusters (i.e. 190 species in S2); for clusters G2 and F2, the numbers of taxa are the highest among genera- and family-level clusters (i.e. 98 genera in G2 and 49 families in F2). Clusters S2, G2, and F2 are located in middle-elevation inland areas of both Tyrrhenian and Adriatic slopes. Alluvial sediments (i.e. LIT_ALL) and volcanic rocks prevail for the lithotypes in these areas, and agricultural areas prevail for the CORINE Land Cover classification. Only one species was identified as an indicator of S2 (Adenocarpus complicatus), one family was identified as an indicator of F2 (Staphyleaceae), and no indicator genus was detected for G2.

Clusters S3, G3, and F3 can all be identified as a Eurosiberian unit. For clusters G3 and F3, the numbers of cells are the highest among genus and family clusters, while the numbers of taxa in these clusters are the lowest among genus and family clusters. These clusters are located in the inland high-elevation areas; they have the lowest annual mean temperature values and the highest annual precipitation values. Limestones (i.e. LIT_LIM) and arenaceous assemblages (i.e. LIT_ARE) prevail for lithotypes of these areas, and forests and seminatural areas dominate for the CORINE Land Cover classification. Overall, 40 species are indicators of S3 (e.g. Fagus sylvatica, Sorbus aria, and Laburnum anagyroides), five of which are exclusive (e.g. Oreoherzogia fallax and Ribes uva-crispa); seven genera are indicators of G3 (e.g. Fagus, Laburnum, and Amelanchier); two families are indicator of F3 (i.e. Taxaceae and Grossulariaceae).

The shared indicator taxa between clusters are also noteworthy. There were 27 species indicators of S1-S2 (e.g. Rubia peregrina, Rosa sempervirens, and Smilax aspera), 17 genus indicators of G1-G2 (e.g. Rubia, Smilax, and Asparagus), and nine family indicators of F1-F2 (e.g. Rubiaceae, Asparagaceae, Smilacaceae). There were two species indicators of S1-S3 (i.e. Tilia cordata and Rosa spinosissima), five genus indicators of G1-G3 (e.g. Juniperus, Daphne, and Ilex), five family indicators of F1-F3 (e.g. Thymelaeaceae, Cupressaceae, and Rhamnaceae). There were 38 species indicators of S2-S3 (e.g. Acer campestre and Quercus pubescens), 20 genus indicators of G2-G3 (e.g. Acer, Rosa, and Crataegus), and 12 family indicators of S2-S3 (e.g. Sapindaceae, Betulaceae, and Fagaceae).

At the species level, it was possible to further divide the units in subunits; at genus and family levels, interpretable subunits cannot be clearly identified, being geographically scattered and not well supported by indicator taxa. First of all, within the Transition unit, a subcoastal subtype (S2a) and an inland one (S2b) can be identified; in particular, in the subcoastal subtype, elevation values are lower, alluvial sediments and clays prevail, and most indicator species are hygrophilous (e.g. Populus nigra, Salix purpurea, Populus alba, and Salix triandra). Secondly, the Mediterranean unit can be divided into an eurimediterranean subtype (S1a) and a stenomediterranean one (S1b). In the stenomediterranean subtype, including islands and cells close to the coast mostly located in the Tyrrhenian slope, stenomediterranean indicator species prevail (e.g. Juniperus turbinata, Anthyllis barba-jovis, and Ceratonia siliqua). Finally, within the Eurosiberian unit, we can recognize two subtypes corresponding to the average elevation: a mountain subtype (S3a) and a subalpine one (S3b).

Discussion

The proposed phytogeographic regionalization of the analysed sector of peninsular Italy based on native woody flora shows the presence of three well-defined and spatially coherent units. These units can be clearly identified at all the taxonomic levels considered. Further divisions in subunits can be detected only at the species level.

Congruence, at the adopted scale, between native woody flora patterns at the species, genus, and family levels is evident from the visual comparison of the species-, genera-, and families-based maps and is well supported by the Mantel and chi-squared tests. These results are consistent with previous findings by Abbate et al. (2016) and by Latini et al. (in prep.). Moreover, most of the indicator taxa for each unit are taxonomically linked to each other, especially for the Mediterranean unit, and secondly for the Eurosiberian unit. Obviously, the correspondence is particularly good in the case of monotypic genera and families. Contrarily, for genera and families including species with very different ecological preferences (e.g. Daphne gnidium, D. sericea, Juniperus macrocarpa, and J. turbinata, indicator of S1; D. mezereum, D. alpina, J. communis and J. sabina, indicator of S3) this correspondence is lower, since the species can work as an indicator while the genus and family cannot.

At each considered taxonomic level, the low-elevation coastal unit and the high-elevation inland unit can be identified as Mediterranean and Eurosiberian units, respectively, which are already identified in previously existing phytogeographic regionalizations by Giacomini (1958), Takhtajan (1986), and Rivas-Martinez et al. (2004). Moreover, a middle-elevation inland unit highly biodiverse was identified, representing a Transition unit between them. Compared with the above-cited regionalizations, our findings further reduce the spatial extension of the Mediterranean unit, moving it towards the coast. In addition, the extension of the Eurosiberian unit was reduced, being partially replaced by the Transition unit. An analysis by Latini et al. (in prep.) of the woody flora distribution patterns along a west-east transect in Lazio and Abruzzo made it evident that three woody flora types (i.e. low-elevation coastal type, middle-elevation inland type, and high-elevation inland one) are present; these three types describe well the phytogeographic units here detected for the whole of Lazio and Abruzzo.

The Mediterranean unit, geographically restricted to coastal areas and islands, is characterized by the lowest number of cells and taxa, but by the strongest floristic identity. This identity is supported at the family level by the dendrogram divisions and by possessing the highest number of indicator families. At the genus level, we observed the highest number of indicator genera; at the species level the presence of many exclusive indicator taxa was detected, mostly Mediterranean ones. Such floristic identity at the family level, and to a small extent at the genus and species levels, is arguably linked to the fact that many families and genera, especially those that are monotypic or paucitypic (at least for Italian woody flora), are geographically restricted to coastal areas characterized by a Mediterranean climate (Blasi & Michetti, 2007). Particularly in this unit, indicator taxa are taxonomically linked. For example, the species Myrtus communis is an indicator of S1, the genus Myrtus is an indicator of G1, and the family Myrtaceae is an indicator of F1. The Mediterranean area was already identified as a well-circumscribed region also in regionalizations based on bryophyte (Mateo et al., 2013) and on vascular plants (Heikinheimo et al., 2012), given the presences of species that are specific to it and are largely distributed across the region. It is noteworthy the good match between the detected Mediterranean unit and the vegetation series assigned by Blasi (2010) to the Mediterranean Bioclimatic Region.

The Eurosiberian unit results to be very wide (it comprises the highest number of cells), particularly when detecting at the genus and family levels. This unit shows its maximum floristic identity especially at the species level, as indicated by the dendrogram and the fact that it possesses the highest number of indicator species, mostly Eurosiberian ones. This result is consistent with the strong floristic identity highlighted by Latini et al. (in prep.) for high-elevation inland areas of the Abruzzo Apennine mountains. Here, species associated with the inland high-elevation type were mostly mesophilous Eurasiatic/Atlantic or microthermal ones (some of which were conifers), with boreal or high mountainous distributions. Also in this unit, a clear taxonomic linkage between indicator taxa was assessed: e.g. Taxus baccata, Ribes uva-crispa, and R. alpinum are indicators at the species level, Taxus and Ribes at genera level, and Taxaceae and Grossulariaceae are indicators at family one.

The identification of the middle-elevation inland unit as a Transition unit was supported by many results: it is spatially located between the Mediterranean and Eurosiberian units, its floristic identity is the lowest among units, and the relationships with other units are not completely well defined. For example, the transition unit is characterized by the lowest number of indicator taxa (i.e. only one species and one family, and no genera). It is interesting how, the richness in the total number of taxa notwithstanding, there are only few indicator taxa. In fact, the highest number of taxa was detected in this unit, at all taxonomic levels (i.e. 190 at the species level, 98 at the genus level, and 49 at the family level). At the species level, the highest number of species corresponds to the highest number of cells. This floristic richness of middle-elevation areas is consistent with previous findings by Abbate et al. (2016), who noted a high level of plant diversity for the hilly belt in the administrative regions of Italy. Noteworthy is the richness of native woody taxa in areas with a strong presence of agricultural and seminatural areas. As stated above, hierarchic relationships between the Transition unit and the other units are not completely clear, as testified by some discordant results obtained: the Transition unit is more similar to the Mediterranean unit in some respects, and is more similar to the Eurosiberian unit in other respects, confirming its transitional nature. In fact, as shown by dendrograms, the Transition unit is closely related to the Mediterranean one at the species and genus levels, and to the Eurosiberian unit at the family level. Conversely, at all taxonomic levels, ISA results point to a higher affinity of the Transition unit with the Eurosiberian unit, as testified by the highest number of shared indicator taxa. Therefore, we retain the identification of a Transition unit as being particularly advisable and appropriate. Moreover, the identified Transition unit is exactly located in a zone previously ascribed to the Mediterranean Region by Giacomini (1958) and by Takhtajan (1986), but to the Eurosiberian one by Rivas-Martinez et al. (2004). This historic doubt behind the location of the boundaries between the Mediterranean and Eurosiberian regions can be resolved by identifying a Transition zone between them. Other transition zones in correspondence of historically debated biogeographic boundaries were also recently identified by Olivero et al. (2013).

No qualitative differences between the western Tyrrhenian and the eastern Adriatic slope have been detected, since the three identified phytogeographic units are present in both slopes; conversely, a strong difference in the spatial extension of these units—the units are wider in the Tyrrhenian slope—is present. It is the difference in physiography of the Tyrrhenian slope and of the Adriatic one that arguably plays an important role in determining these extensions, together with the climatic features. Previous findings by Giacomini (1958), Takhtajan (1986), Rivas-Marinez et al. (2004), Blasi et al. (2007), Abbate et al. (2016), and Latini et al. (in prep.) have reported differences between the two peninsulas’ slopes. Particularly, neither the division of the Mediterranean Region into Tyrrhenian and Adriatic sectors, as carried out by Giacomini (1958), Takhtajan (1986), and Rivas-Martinez et al. (2004), nor the differences between the Tyrrhenian and Adriatic subtypes within high-elevation inland types evidenced by Latini et al. (in prep.), are confirmed by our findings. The supposed floristic identity of an Adriatic Mediterranean subunit was not successfully delineated, we argue, due to the small areas occupied by this unit in the eastern portion of territory here considered; the small extension of the Mediterranean Region, located in the southern coastal sector, has been already highlighted by Conti (2004) in a previous phytogeographic regionalization of Abruzzo. The floristic differences within the high-elevation inland type were probably not detected due to the here adopted spatial scale.

Environmental features like lithology, distance from the sea, and elevation play important roles in defining subunits at species levels. However, the 10 × 10 km grid we used, especially in highly heterogeneous areas like the considered one, was not fine enough to satisfactorily delineate the boundaries of subunits.

In conclusion, the obtained regionalization based on native woody flora contributes to the knowledge of the phytogeographic structure of central Italy, identifying environmental correlates of the units. Further research, once data will be available, will be aimed at comparing the observed spatial patterns of native woody flora to those of the whole vascular flora or to other taxonomic/functional groups, to highlight congruences and differences between the obtained phytogeographic regionalizations.

The higher taxon-approach has proven to be useful for detecting native woody flora patterns and for providing phytogeographical regionalization at a fine scale. In particular, genus and family levels are especially suitable for coarse/medium-scale regionalizations, while species data are needed to provide fine-scale regionalizations.

Furthermore, if provided for wider study area, for example the whole Italy, such a statistically derived phytogeographic regionalization could be useful for many basic and applied purposes, e.g. to contextualize national conservation actions according to biogeographic boundaries. In particular, the identification of the boundaries of the Transion unit along the whole peninsular Italy could be very interesting, because it is a very highly biodiverse and heterogeneous unit, where agricultural systems and high nature-value rural landscapes are prevailing.