Introduction

Clayey deposits, like all natural soil deposits, generally show a broad variability in their geotechnical properties and parameters. This is due to geological and environmental processes related to deposition and diagenesis (Kulhawy 1992; Kim et al. 2012), and to shrinking–swelling dynamics (Vogel et al. 2005; Galeandro et al. 2013a), which cause variability of geomechanical properties and uncertainty in their values (Cherubini and Orr 1999; Phoon and Kulhawy 1999). In addition to this inherent soil variability, the variability of soil properties is also associated with other causes of uncertainty, such as the representativeness of samples, measurement errors due to measuring instruments and procedures, procedural-operator variation, random testing effects, and transformation uncertainty due to the adoption of semi-empirical and theoretical models for design parameters (Phoon and Kulhawy 1999; Baecher and Christian 2003; Akbas and Kulhawy 2010; Kim et al. 2012; Di Matteo et al. 2013).

All these uncertainties affect the variability of measured soil strength, and thus the design process (Harr 1987; Cherubini 2000a, b). For this reason, a reliability analysis is needed to choose soil parameters required for the design of geo-works interacting with soils.

Several theoretical frameworks aim to explicitly quantify and process the uncertainty related to geotechnical engineering applications (Harr 1987; Phoon et al. 1995; Cherubini and Orr 1999; Akbas and Kulhawy 2010). These are based on statistical analyses, aimed at reducing the uncertainty, and those errors related to the non-linearity of the processes. Therefore, these approaches can accurately estimate geotechnical properties (Que et al. 2008), supporting the evaluation of the reliability of design parameters and of the whole design process in general.

In particular, the values of soil parameters necessary for deterministic geotechnical design should be selected in a reliable way considering a specific safety level. Their evaluation represents a critical step in design processes, especially because of the various sources of uncertainty (Schneider 1997; Cherubini and Orr 1999; Phoon and Kulhawy 1999a; Cortellazzo 2000; Cardoso and Fernandes 2001; Baxter et al. 2008; Bond and Harris 2008; Kim et al. 2012).

A reliable estimation of the inherent variability of soil properties is the key to reliability-based design in geotechnical engineering. For this purpose, the coefficient of variation (COV) and probability distribution function (PDF) are commonly used for quantifying the inherent variability of geotechnical properties (Cortellazzo and Mazzuccato 1996; Phoon and Kulhawy 1999; Cortellazzo 2000; Akbas and Kulhawy 2010). The COV values of a number of soil properties were extensively investigated (Lumb 1974; Lacasse and Nadim 1996; Phoon and Kulhawy 1999; Cherubini and Orr 1999; Baecher and Christian 2003 and others). In particular, starting from a broad literature review, Phoon and Kulhawy (1999) present the COVs of soil properties along a general soil type, and the approximate range of mean values for which the COVs are applicable.

This work focuses on contributing to our knowledge of the local variability of soil strength by examining a case study related to the analysis of variability of the undrained shear strength of a marine stiff clay, measured by a pocket penetrometer. It is noteworthy, that the analysis of local variability of soil strength was pursued here by looking at shear strength data obtained by testing materials from a single borehole. Therefore, we did not face any problem with the spatial variability of soil properties. In fact, the aim was to emphasise the importance of determining the local variability of shear strength data, since a single borehole is usually available for any geotechnical engineering design procedures of small structures.

The use of a pocket penetrometer allows provision of a large dataset of measurements, and the development of detailed statistical analysis of COVs, as well as PDFs that better fit the measured data.

A global statistical analysis was performed, i.e., considering the entire dataset with a measurement step of about 25 cm across the whole investigated clayey deposit (about 25 m), and a local statistical analysis, i.e., analysing the variability of the ten measurements at each investigated depth.

The case study presented shows that the variability of the measured strength values is quite large, both locally and across the whole deposit, even if the tested clay deposit is homogeneous both geologically and visually. In addition, a discussion about the inherent variability of soil properties and its influence on the characteristic values of soil strength for geotechnical engineering design is presented.

Test site

The investigated site is located at the top of a hill close to Grottole (Matera, South Italy, Fig. 1a). The site is characterised by a marine regressive sequence of the lithological terms of Bradanic foredeep domain (Pieri et al. 1996; Tropeano et al. 2002; Galeandro et al. 2013b). The outcropping deposits are coarse-grained terraced deposits (Monte Marano Sands) topping the deposits of the marine sequence of the Bradanic foredeep domain. These are characterised mainly by silty-clayey deposits of clays and consist of stiff and jointed grey-blue marly-silty clays and clayey silts, characterised by the presence of silty and sandy levels up to 10 cm thick, whose frequency and thickness increases upward. These deposits are usually blue-grey silty clays, except for the upward levels; there the deposits assume a yellowish-ocher color, due to weathering processes, which induced drying and caused degradation of the clay bounding. On the one hand this weathering process usually induces a mechanical decay, on the other, diagenesis phenomena could increase soil strength.

Fig. 1
figure 1

Location and schematic stratigraphy of the study site

Figure 1b shows the schematic stratigraphic log of the investigated site, obtained by continuous rotary sampling of a 30-m-deep borehole. The stratigraphic sequence of silty-clayey deposits, all belonging to the geological formation of the Pleistocenic Sub-Apennine grey-blue clays and clayey silts (Pieri et al. 1996), can be summarised as follows (Fig. 1b):

  • Layer A: 6–15 m (yellowish ocher sandy clayey silts);

  • Layer B: 15–20 m (grey blue clayey silts);

  • Layer C: 20–30 m (grey-blue clays).

Even though the whole deposit seems to be quite homogeneous, it is characterised by inherent variability, as a consequence of the presence of sandy intercalations of variable thickness, sedimentation compression, the weathering process, etc. The variability in strength is also due to small-scale geological variations, i.e., microstructures within geological material (Cafaro and Cherubini 2002). Thus, this geological formation may be an interesting case study for the characterisation of the inherent variability of soil geotechnical properties. In fact, the investigated soil deposit is not affected by geological structure superimposed by tectonics, or by a complex geological history like other tectonically deformed clayey deposits. Soil laboratory tests performed on undisturbed samples (Table 1) showed that the studied deposits are stiff silty clays of medium plasticity, in good agreement with literature values for sub-Apennine clays in the same area (Genevois et al. 1984).

Table 1 Main geotechnical properties of the studied deposit by laboratory test on undisturbed samples

Data collection

To perform a detailed reliable statistical analysis of soil strength variability for engineering purposes, it is necessary to collect a large number of reliable measurements, involving soil strata of relevant thickness, like those commonly investigated for geotechnical design purposes. In order to gather a large dataset of soil strength values along the entire extracted soil column, a hand pocket penetrometer is used, allowing quick determination of approximate values of soil strength. Readings are obtained by pushing the loading piston against the soil sample, and reading the approximate unconfined compressive strength (kg/cm2) on the permanent scale on the piston barrel. The measurements were performed on the extracted column of sub-Apennine grey-blue clays, after removing the external layer of the samples, likely disturbed by the sampler itself. The extracted soil column was studied by dividing it into vertical segments of 25 cm. Around ten measurements (Fig. 2) were extracted for each increment, obtaining a population of collected data characterised by 870 strength values (kg/cm2). These were used to perform a local study of the inherent variability of the soil strength, analysing the ten values measured at each measurement step, and to perform a global analysis across the whole investigated strata with steps of 25 cm.

Fig. 2
figure 2

Data collection. a Removal of the disturbed surficial portion of samples. b Execution of measurement on the collected samples. c, d Some samples subjected to the measurements

Local analysis of the inherent variability of pocket penetrometer strength

To evaluate the inherent variability in strength of the studied deposit measured by pocket penetrometer, the average value μ (kg/cm2), the standard deviation σ (kg/cm2) and the COV of the ten measured compressive strength values (Fig. 3) were estimated for each measurement step.

Fig. 3
figure 3

a Average strength and standard deviation vs depth. b Coefficient of variation (COV) vs depth

Locally, COVs range between 0.05 and 0.2, sometimes exceeding this value, peaking at 0.4 or more. These COV values may appear rather high for measurements performed at the same point, if compared with COV literature values (Harr 1987; Cherubini and Orr 1999; Phoon and Kulhawy 1999). However, they refer to a specific point and show how soil strength may be locally affected by a severe variability, related to sedimentation, diagenesis and weathering phenomena.

The soil strength profile (Fig. 3a) was very variable along the studied log. The highest values of strength were measured at a depth of 13–14 m, and were due partly to the presence of drier soil. It is interesting to note that average strength values decrease dramatically below a depth of 15 m, at the transition from the yellowish sandy-clayey loams to grey-blue clayey loams (Figs. 1b, 2). This difference is consistent with the different consistence indexes measured on laboratory samples (Table 1). Similar results about the relationship between geotechnical behaviour of weathered and unweathered sub-Apennine clays were reported by Cotecchia (1996) and by the present authors for other sites, where it was reported that weathering may introduce an increase of strength due to drying. Unweathered silty clays below the depth of 15 m, or more, are characterised by average measured values consistent with those depths. The measured local variability is quite high for such apparently homogeneous silty clay deposits.

Analysis of the inherent variability across the whole soil strata

To complete the analysis of the inherent variability of soil strength across all soil strata, an analysis of variability of the entire set of measured strength data was undertaken, based on the assumption that the soil column from 6 m to the base of the borehole at 30 m is a unique soil stratum belonging to the same geological formation.

The outcomes of the analysis are summarised in Tables 2 and 3, where Table 2 summarises the results for all samples, while Table 3 reports the results for each batch of data, representative of single tests. The value of COV is 0.33, which is slightly higher than the local COVs previously evaluated for each set of ten measurements (Fig. 3b), but still considerably lower than COV values from literature (Harr 1987; Phoon and Kulhawy 1999; Cherubini and Orr 1999) for undrained strength: 0.45–0.55. This quite low value can be considered consistent with the presumable homogeneity of the studied geological formation, and with the absence of variability due to testing procedure, measurement and data transformation procedures. In fact, all tests are perfectly homogeneous since they were implemented by the same person and on the same day, without any transformation procedure.

Table 2 Mean values, and standard deviations (SD) of the whole dataset
Table 3 Mean values, and SD of each batch of measures, grouped according to the average depth of test

Subsequently, the analysis was detailed as follows: the whole dataset was divided into three layers (A, B and C), corresponding to the three soil levels of the stratigraphic profile. For each layer, the average value, the standard deviation and COV (Table 4) were estimated. We compared the mean local COVs, i.e. the average value of local COVs evaluated for each dataset of ten measurements included in the considered soil column. It was interesting to observe that, in the upper weathered layer A, both average strength values μ (kg/cm2) and COVs were higher than in the lower unweathered strata (B and C). This is probably due to weathering phenomena affecting the inherent variability of the geological formation, thus, values for unweathered levels were lower than for weathered ones. In addition, it could be due also to a more severe sedimentation variability affecting the upper part of the deposit as consequence of the regressive phases of sedimentation.

Table 4 Mean values, SD and COV for each stratigraphic level

It was interesting to observe that COVs evaluated for these three layers were moderately higher than the locally evaluated COV values, considering the ten data points at each depth increment of about 25 cm. It was also interesting to observe that the coefficient of variation evaluated for the layer B (between 15 and 20 m) was slightly smaller than that evaluated for the deeper soil stratum C (between 20 and 30 m). This is reasonable for quite homogeneous strata, showing that the smaller the stratum, the lower the COV value.

A detailed analysis of the inherent variation was performed by dividing each stratigraphic layer into more thin sub-layers (Tables 4, 5, 6). Results are described in the following, grouped by lithotype.

Table 5 Mean values, SD and COV for the yellowish ocher sandy clayey loams, layer A
Table 6 Mean values, SD and COV for grey blue clayey silts group of data, layer B

Yellowish-ocher sandy-clayey silt (layer A)

The detailed analysis of the inherent variability across the upper sandy-clayey silts layer A performed by dividing it in thinner levels of 4.5 (6–10.5 and 10.5–15 m) and 3 m (6–9; 9–12; 12–15) (Table 5) shows that COV values are slightly lower for thinner sub-layers. Only levels between 9 and 12 m show high variability, with COV >0.40. This increase of COV between 9 and 12 m is probably due to some disturbances in lithology and consistency index at a depth of about 10–10.5 m, which becomes evident only when considering thin levels. A special zone characterised by significant anomalies in the sedimentation process or in weathering processes may give a relevant variation of measured values and their relevance may be strong, particularly if the sub-layer is thin. This result emphasises the importance of performing a detailed stratigraphic analysis before defining the representativeness of the tested samples and using the results for geotechnical modelling.

Clayey silts and grey-blue clay (layers B and C)

For grey-blue clay (layer B) and clayey silts (layer C), the COVs obtained by dividing the layer into thinner sub-layers are quite low, being almost equal to, or even lower than, the local COV values (Tables 6, 7). This shows the quite good homogeneity of the deposits, and allows the variation across the strata to be compared with the punctual values. Looking at layer C, higher COV values are obtained for levels between 20 m and 22.5 m, and between 22.5 m and 25 m from the ground level, which is also characterised by a low shear strength value, due to the presence of thicker sandy intercalations and a higher water content. COVs for these levels are equal to 0.25 and 0.37, respectively. This result is consistent with the analysis of intraclass correlation coefficient RI (Wickremesinghe and Campanella 1991; Phoon et al. 2003) and Bartlett test statistics B stat (Kanji 1993) profiles, generally used for CPT soundings, which allows statistical detection of homogeneous layer boundaries by identifying the peaks of these variables.

Table 7 Mean values, standard deviations and COVs for grey blue clay group of data, layer C

The intraclass coefficient index RI is generated by moving two contiguous windows containing m data points each over a measurement profile and computed as follows:

$$ RI = \frac{1}{{1 + \frac{1}{{\frac{m - 1}{m} + \frac{{\left( {\mu_{1} - \mu_{2} } \right)^{2} }}{{2\left( {s_{1}^{2} + s_{2}^{2} } \right)}}}}}} $$

where μ I and s 2I represent in the order average value and variance of samples on a window. For the case of variances of two samples, s 21 and s 22 , the Bartlett test statistic can be evaluated as follows (Kanji 1993):

$$ B_{stat} = \frac{{2.30259\left( {m - 1} \right)}}{C}\left[ {2\log s^{2} - \left( {\log s_{1}^{2} + \log s_{2}^{2} } \right)} \right] $$

where m is the number of data points used to evaluate s 21 (or s 22 ), the total variance s is defined as:

$$ s_{2} = \frac{{s_{1}^{2} + s_{2}^{2} }}{2} $$

And the constant C is given by:

$$ C = 1 + \frac{1}{{2\left( {m - 1} \right)}} $$

It is noteworthy that this additional analysis not only identifies statistically homogeneous sections consistent with geological boundaries, but also detects a change of homogeneous section between ca. 20 m and 25 m (Fig. 4). This confirms the importance of a detailed stratigraphic analysis of the layers, in order to gather evidence of variability due to soil characteristics. Then, a more detailed analysis of the soil log shows that the investigated layers are characterised by some anomalies, whereas sandy levels are thicker, as already observed for the previous yellowish-ocher sandy-clayey silt layer.

Fig. 4
figure 4

Identification of soil boundaries using intraclass correlation coefficient index (RI) and Bartlett statistic (B stat)

Statistical distribution of data

For each considered group of data, a statistical analysis was performed, dividing the measured data into 0.5 kg/cm2 wide classes of strength. Class width was chosen by considering the resolution of the instrument of 0.1 kg/cm2, in order to obtain classes characterised almost by the same number of values. Data fitting was estimated with some common probability distribution functions used to analyse geotechnical data. In particular, normal distribution, lognormal distribution and Gamma distribution were accounted for (Table 8). In particular, the latter two distributions were considered in order to account for the asymmetrical distribution of samples, with respect to the average value.

Table 8 Probability distribution functions: µ (kg/cm2) and σ (kg/cm2) indicate mean value and standard deviation, respectively

Figure 5 shows the fitting of data to the assumed probability distribution functions for each sub-layer (A, B and C) and for the entire soil column. In order to determine the reliable COVs and PDFs for the investigated soil strength, once parameters of distributions are estimated and then fit to the collected data, goodness-of-fit tests, such as the Kolmogorov–Smirnov (K–S) and Pearson (χ 2) tests are performed, assuming a significance level α equal to 0.05 (Benjamin and Cornell 1970; Ang and Tang 1975; Baecher and Christian 2003; Fenton and Griffiths 2008; Kim et al. 2012).

Fig. 5
figure 5

Fitting data to probability distribution functions (PDFs) and cumulative probability functions (CPFs) for soil samples between depths of a 6 and 30 m; b 6 and 15 m; c 15 and 20 m; d 20 and 30 m; e results of statistical tests

Table 9 shows results of goodness-of-fit tests.

Table 9 Results of statistical tests and determination of reliable probability distribution functions (PDFs)

If data from 6 m to 30 m are analysed all together, all PDFs are rejected, according to Pearson test. Dividing the soil strength profile into layers (A, B and C) corresponding to the geological boundaries of the soil profile (Fig. 2), both tests reject all distributions for the layer between 6 m and 15 m, and for the deepest layer, 20–30 m. Assuming thinner layers, it is noteworthy that central layers seem to fit the assumed probability distributions, with no clear prevalence of a distribution over the others. The deepest layers, 25–30 m seems to fit better to Gamma probability distribution, although the Kolmogorov–Smirnov test does not reject the assumption. For the remaining layer, there is a general uncertainty, for which it is not possible to clearly assume a probability distribution among the selected layers; this is likely due to the variability in the measurements.

Table 9 summarises the test results and indicates the best distributions, i.e. the distributions verified by one or both of the performed goodness-of-fit tests, considering the best fitting distributions to be those simultaneously verified by both tests. Where no distribution was verified by the tests, the distribution where tests fail less often was chosen. It is clear that none of the considered distributions prevailed in terms of fitness to the measured data. Anyway, a normal distribution, which was also suggested by Eurocode 7, was not rejected at least by one test 8 times out of 17. Gamma distribution was not rejected 7/15 times, being the second best performing distribution. Finally, the lognormal distribution was not rejected at least by one test 5/17 times. It is interesting to note that normal and lognormal distributions constitute the best-fitting distribution for the datasets characterised by relatively low COVs, i.e. lower than 0.25, except the layer at 6–15 m where the COV is equal to 0.31, while the Gamma distribution can be considered the best fitting distribution for datasets with highest COV, i.e. higher than 0.31.

Characteristic values

According to the common practices of geotechnical engineering, soil strength parameters are obtained by applying partial factors to the characteristic values of strength parameters (Potts and Zdravkovic 2012). Characteristic values are defined as a careful estimate of the value affecting the occurrence of the limit state, and are obtained starting from in situ or laboratory observations (Orr 2000; Frank et al. 2004; Baxter et al. 2008). Then, starting from measured data, characteristic values can be evaluated by using an engineering expert approach (Cherubini and Orr 1999). Anyway, a commonly adopted approach is the statistical one, which suggests the following formulation for the characteristic values:

$$ x_{c} = \mu - k_{n} \sigma = \mu \left( {1 - k_{n} COV} \right) $$
(1)

where k n is a statistical coefficient, depending on the chosen confidence level and on the assumed distribution function. In structural engineering, a confidence level of 95 % is normally assumed, which can be adopted also in geotechnical engineering. However, the use of such a level in engineering geology design may sometimes be too conservative. Statistical methods can be reliably applied if the numerosity of the pool of samples is higher than ten (Schneider 1997); however, in engineering geology practice, such numerosity is unusual. Schneider (1997) proposed an approximation of Eq. (1) assuming a value of k n = 0.5, which is a good approximation for n > 10 tests (Schneider 1997). Zupan and Turk (2002) proposed another formulation for the characteristic value at 95 % of confidence level based on the normal distribution, using the following improved unbiased estimate of Eq. (1) evaluating the coefficient k n as:

$$ k_{n} = 1.645 \cdot \varepsilon_{n} $$
(2)

where ε n is a sort of scaling coefficient depending on the number of performed tests (Table 10). Figure 6 and Table 11 show the characteristic values for the dataset of the investigated case study, evaluated as the 5 % percentile related to the studied probability distributions, based on Schneiders’ (1997) approximation and on the approach of Zupan and Turk (2002), in order to show how different procedures can affect the final result. Results show interesting differences among the estimated characteristic values. The characteristic values evaluated as the 5 % percentile of the three assumed probability distributions were rather similar, despite the depth at which they are estimated. They are always lower than Schneider values, while remaining comparable with Zumpan and Turk values. This means that values based on probability distributions are less conservative but more economically convenient, since they lead to cheaper solutions in engineering geology design. This is particularly true when looking at the normal distribution, as recommended by Eurocode 7.

Table 10 Scaling coefficient ε n (from Zupan and Turk 2002)
Fig. 6
figure 6

Characteristic values evaluated according to different statistical distribution and other approaches

Table 11 Characteristic values evaluated according to different statistical distribution and other approaches

In particular, from the engineering geologist and geotechnical designer’s point of view, Schneider approximation provides higher characteristic values, and these values are less conservative than those coming from the classical Statistics and Zupan and Turk’s (2002) approach. Zupan and Turk (2002) estimated characteristic values, which are really close to the values evaluated as 5 % percentile of normal distribution.

Table 12 and Fig. 7 show characteristic values evaluated as 5 % percentile of normal distribution considering different sample sizes n, characterised by n random values for some considered layers. The results show that, for sample pool size n higher than 20 or 30, it is possible to obtain characteristic values affected by low uncertainty.

Table 12 Characteristic values evaluated as 5 % percentile of normal distribution for some sample size n and for some soil layers
Fig. 7
figure 7

Characteristic values evaluated as 5 % percentile of normal distribution for some sample size n and for some soil layers

Conclusions

The work aimed to make a contribution to knowledge about the inherent variability of soil strength related to geological structures due to sedimentation, diagenesis and weathering phenomena, in order to show the importance of its reliable evaluation for engineering geology design. For this purpose, the paper focussed on characterising and estimating the variability of the undrained strength of a marine stiff clay measured by a pocket penetrometer by means of a statistical analysis, aimed at evaluating the representative coefficients of variation and the PDFs fitting the experimental data.

The soil strength profile showed that the relevant variability of soil strength along the studied log was higher than expected for such homogeneous silty clay deposits. Locally, COVs ranged between 0.05 and 0.2, and rarely exceeded 0.4. For the entire soil column in the same geological formation, the value of the COV was 0.33, higher than the average local value, and lower than literature values of 0.45–0.55. COVs evaluated by dividing the whole log in three layers corresponding to some variation of stratigraphic features of the deposits were slightly higher than the single COVs evaluated locally at each depth increment.

If the soil column is divided in small thickness levels, COVs were lower than the values corresponding to the entire layer and were not particularly high, being equal to ca. 0.20, except for some levels as a consequence of high variability due to weathering phenomena, the presence of thicker sandy intercalations, and higher water content. This result is perfectly consistent with the statistically homogeneous sections detected by evaluating RI and B stat profiles. The analysis shows that an accurate evaluation of geological and grain size differences of soil strata is important for evaluation of the representativeness of samples. A pocket penetrometer is an interesting tool for controlling local variability and the representativeness of samples for laboratory tests.

The fitting analysis of the measured dataset with different statistical distributions showed that none of the considered probability distributions (normal, lognormal and Gamma) may be considered fully reliable, even if the best is the normal, according to Eurocode 7 indications. In any case, normal and lognormal distributions constitute the best fitting distributions for homogeneous datasets with COV <0.25. Gamma distribution gives interesting results for soil layers characterised by different materials and with a COV of the measured dataset larger than 0.31.

Finally, the available dataset was used to check the reliability of different approaches for the evaluation of characteristic values. The characteristic values of soil properties were estimated as 5 % percentile of PDFs and by other approaches, showing that characteristic values evaluated as 5 % percentile of the normal distribution are quite consistent with those evaluated by the approach of Zupan and Turk (2002). The characteristic values evaluated according to the assumed PDFs corresponds to low values, i.e. cheaper solutions, than the Schneider approximation, which provides higher characteristic values and is then less conservative.