Introduction

The standard penetration test (SPT) is one of the most frequently used tests in geotechnical studies. The SPT was developed in 1927 (Bowles 1988), and this test is currently used and has become one of the most popular in situ tests. The allowable bearing pressure and SPT blow count in sands were correlated and published by Terzaghi and Peck (1948). Gibbs and Holtz (1957) considered the overburden pressure and length of the drill rod as a cause of non productibility between adjacent boreholes, and they suggested an overburden correction. Lacroix and Horn (1973) proposed a correction for correlations of nonstandard penetration resistances with standard penetration resistances. Skempton (1986) defined the corrected blow count (N 60) as the SPT N value corrected for field procedures and apparatus, including the hammer efficiency, borehole correction, sample barrel correction and rod length correction. The Gibbs and Holtz overburden correction results in higher blow counts for uniform soil deposits lying over a stiffer material, so Liao and Whitman (1986) suggested an overburden correction termed (N 1)60. Numerous researchers have proposed and used empirical correlations between SPT N values and shear wave velocity: Seed and Idriss (1981), Lee (1990), Dikmen (2009), Hasancebi and Ulusay (2007), Ulugerli and Uyanık (2007), Chatterjee and Deepankar (2013), and Ayolabi and Adegbolo (2014). There are also many empirical equations in the literature derived from correlations of SPT blow count and other in situ test parameters; Yagiz et al. (2008), Bozbey and Togrol (2010), and Kayabaşı (2012) studied the correlations of SPT blow counts and pressuremeter parameters. Researchers have proposed a series of empirical equations for sandy, silty, and clayey soils. The empirical equations between cone penetration tip resistance and SPT blow count number N values are also common in the literature: Schmertmann (1970), Chang (1988), Emrem and Durgunoglu (2000), and Akca (2003). Standard penetration test is also used in geotechnical researches widely. Topsakal and Topal (2014) studied slope stability of a re-activated landslide on the Artvin-Savsat road in Meydancik, Turkey. They performed SPT test in boreholes. The SPT-N values generally increased with depth but researchers stated that there is a relationship between the failure surface and the SPT-N values, it can be stated that the SPT-N values decreases around the failure surface. The application of the SPT test for geographic information system for the development of soil suitability map was carried out by Stalin and Vurukan Arun (2014). Investigators countered study area with SPT N values and bearing capacity for various depths and they suggested that the potential of GIS to find the solution of phenomena that was very tedious by any other means, not only in terms of accuracy but also by the use of derived information as the input for other correlation analysis.

The related standards for SPT are the ASTM D 1586 (Bowles 1988), British Standard 1377 (1975), ENV (Eurocode 7) 1997, and Türk Standartları Enstitüsü (TSE) 5744 (1988). SPT is a dynamic test performed for the determination of the strength parameters of silty, sandy, and clayey soils and heavily weathered rock units. A standard split sampler is driven into the soil by dropping a 63.5-kg hammer on the drill rod from a height of 76 cm. The sampler is advanced in the soil by 450 mm (Fig. 1). The number of blow counts is recorded for each 150 mm. The blow counts of the first 150 mm are not included in the calculation to avoid disturbances of the soil. The blow counts of the second and third 150 mm advance of the sampler are recorded separately. These blow counts are the raw blow counts of the test. Then, a groundwater correction, rod energy ratio correction, rod length correction, sample barrel correction and borehole diameter correction are applied, and the corrected blow counts (N 60) are obtained.

Fig. 1
figure 1

Sketch of standard penetration test instrumentation (Coduto 1994)

In this study, data from an investigation of the foundations of a sewerage station (Kayabasi and Gürsoy 1995a, b, c), central pumping station (Kayabasi and Gürsoy 1995a, b, c), and west pumping station (Kayabasi and Gürsoy 1995a, b, c) in Mersin City from Turkey are employed. The location plan of stations is shown in Fig. 2. A total of 20 boreholes in the sewerage station, 10 boreholes in the central station, and 4 boreholes in the west pumping station were drilled. One hundred thirty SPTs at the sewerage station, 72 SPTs at the central pumping station, and 18 SPTs at the west pumping station were performed in the boreholes. The drilling program of the boreholes can be summarized as follows: drill 0.00–1.50 m, perform the SPT test at 1.50–1.95 m, drill 1.95–2.10 m, clear the borehole, sample 2.10–2.70 m undisturbed soil, drill 2.70–3.00 m, clear the borehole, and perform the SPT test at 3.00–3.45 m. This program is conducted for the entire determined depth of the borehole. Constant head permeability tests and pressuremeter tests were also performed in certain boreholes. During the drilling program of the sewerage station, central pumping station, and west pumping station, disturbed and undisturbed samples were collected from boreholes. The disturbed samples from the split-spoon sampler were collected, examined, labeled, and placed in a sealed-waxed jar, and the undisturbed samples are also labeled, waxed, and sent to the General Directory of Electrical Power Resources Survey and Development (EIEI) laboratory daily.

Fig. 2
figure 2

Location map of the study area

Laboratory testing

A series of laboratory tests were applied for the sewerage project of Mersin. Bulk unit weight tests, sieve tests, Atterberg limit tests, natural moisture content tests, direct shear tests, consolidation tests and triaxial compression tests were conducted on the borehole samples. All of the samples were collected and tested in accordance with the procedures suggested by ASTM (American Society for Testing and Materials) (1994). The undisturbed samples were not representative of SPT test levels because of depth differences between the undisturbed samples and SPT test levels. Therefore, the geotechnical properties of the undisturbed samples determined in the laboratory were not correlative with the SPT blow counts; however, the index properties of the samples from the split spoon sampler were correlative with the SPT blow counts.

According to the mean grain size distribution, the soil samples of the sewerage station consist of 74.9 % fine-grained material (no. <200 sieve) and 39.5 % clay-sized material (USBR United States Dep. Int. Bur. Reclamation 1974). The 78 of 130 fine-grained sample locates over the A line. The fine-grained material is mainly highly plastic clay (CH) (Fig. 3). Others locate under the A line, so they can be named as OH or MH soil. The mean values of the liquid limit (LL) and plastic limit (PL) are 61.5 and 31.0 %, respectively. The plasticity index (PI) mean value is 28.89 (Table 1). The activity (Ac) of the clayey soil is determined by the PI divided by the percent of clay-sized particles (less than 2 μm). The mean activity of the sewerage station clayey soil is 0.57 with a minimum value of 0.22 and maximum value of 3.14 (Fig. 4a). According to the activity classification proposed by Seed et al. (1964), the soil samples are inactive clay. Skempton (1953) classified soils according to activity, and the activity value of montmorillonite changes between 1 and 7. The Ca montmorillonite amount increases to approximately 1, and the Na montmorillonite amount increases to 7 with increasing activity value. The activity of illites is between 0.5 and 1, and the activity of kaolinite is approximately 0.5 (Mitchell 1975). According to this classification, the sewerage foundation soil can be classified as Ca montmorillonite. The consistency (Ic) of silt and clay is defined as the difference between the liquid limit (LL) and moisture content (w) divided by the plasticity index. Therefore, consistency can be defined as the resistance of soil to penetration. Peck et al. (1953) proposed a chart showing the relationship between SPT blow count and soil consistency (Table 2). Clay and silts can be classified as very soft to very stiff (British Standart EN ISO 14688-2 2013), (Table 3), and 63 % of the foundation soils had a consistency greater than 1, which indicates very stiff or hard soil, and 18 % of the foundation soil consistency is between 0.75 and 1, which indicates stiff soil (Fig. 5).

Fig. 3
figure 3

Distribution of the foundation clays on a plasticity chart

Table 1 Statistical evaluation of foundation soil sample properties from sewerage station
Fig. 4
figure 4

Activity histograms of the study area: a central pumping station; b west pumping station; and c, sewerage station

Table 2 Penetration resistance, SPT blowcount, and undrained shear strength (Peck et al 1953)
Table 3 Defining of qualitative and quantitative consistency (BS EN ISO 14688-2)
Fig. 5
figure 5

Consistency histograms of the study area: a central pumping station; b west pumping station; and c, sewerage station

The soil samples of the central pumping station consist of fine-grained material 47 % (no. <200 sieve) and 18 % clay-sized material (USBR United States Dep. Int. Bur. Reclamation 1974); 48 of 72 samples are mainly low plastic (CL), with the exceptions being13 sample organic clays or inorganic silts with high plasticity (OH-MH) and 11 samples that are high plastic clay (CH) (Fig. 3). All of the samples of the central pumping station separate nearly parallel to the A line of the plasticity chart. The mean values of the liquid limit (LL) and plastic limit (PL) are 48 and 28 %, respectively (Table 4). The plasticity index mean value is 21 %. The mean activity of the central pumping station clayey soil is 0.34 with a minimum value 0.096 and maximum value of 1.31 (Fig. 4b). According to the mean value and the classification of activity proposed by Seed et al. (1964), the central pumping station soil is inactive clay. According to Mitchell (1975) clay classification, the clayey soil of central pumping station can be classified as kaolinitic clay soil. The 51 % of the foundation soil sample of central pumping station have consistency greater than 1, which indicates very stiff or hard soil. 22.5 % of the central pumping foundation soil consistency is between 0.75 and 1, which indicates stiff soil (Fig. 5b).

Table 4 Statistical evaluation of foundation soil sample properties from central pumping station

The soil samples of the west pumping station consist of fine-grained material 69 % (no. <200 sieve) and 30 % clay-sized material (USBR1974) of 18 samples are low plastic (CL) that accumulates close to 50 % of the liquid limit line and 3 samples over the A line are CH, with the exceptions being 5 samples of organic clays or inorganic silts with high plasticity (OH-MH) close to the A line and 1 sample of organic clays or inorganic silts with high plasticity (OH) (Fig. 3). The mean values of the liquid limit (LL) and plastic limit (PL) is 64.6 and 25.87 %, respectively. The plasticity index mean value is 30.09 (Table 5). The mean activity of the west pumping station clayey soil is 0.62 with a minimum value 0.24 and maximum value of 1.59 (Fig. 4c). According to the classification of activity proposed by Seed et al. (1964), the west pumping station soil is inactive clay. West pumping foundation clayey soil is named as ca monmorillonite according to Mitchell (1975) clay classification. Of the west pumping station soil, 94 % has a consistency greater than 1, which indicates very stiff or hard soil, and 6 % of the foundation soil consistency is between 0.75 and 1, which indicates stiff soil (Fig. 5c).

Table 5 Statistical evaluation of foundation soil sample properties from west pumping station

Regression analysis

Simple regression analysis

The regression analysis is a widespread statistical method used for the derivation of empirical equations (Bieniawski 1978; Hoek and Brown 1997; Kayabasi et al. 2003; Yagiz and Gokceoglu 2010; Kayabaşı 2012; Chen-Chang et al. 2011; AlHomadhi Emad 2014), and such analyses require comparable data. In this study, the SPT blow count and parameters derived from the laboratory tests on the disturbed samples are compared. Thus, a total of 220 corrected SPT blow counts and the same number of laboratory test results are correlated. A series of simple regression analyses are performed to determine the empirical relationships between the SPT blow count and the following parameters: moisture content (w, in percent), fine-grained material percent (FGP, in percent), liquid limit (LL, in percent), plastic limit (PL, in percent), plasticity index (PI, in percent), consistency index (Ic) of clayey material, and activity (Ac) of clayey material (Table 6).

Table 6 Estimating emprical equations for N 60 versus soil parameters

As shown in Table 6, the acceptable results of the simple regression analysis can be summarized as follows: the equations with the highest coefficients of regression between N 60 and activity (Ac) are represented by a logarithmic function (Fig. 6); the correlation between N 60 and moisture content produces the highest regression coefficient with a linear function, and the correlation between N 60 and FGP (in percent) produces the highest regression coefficient with a linear function also. A power function produces the highest regression coefficient between N 60 and Ic, and it is also a significant result. A summary of the simple regression analyses results shows that increasing activity and consistency of soil increases the N 60 value with a logarithmic function and power function, respectively. Decreasing water content and fine-grained percent increases the N 60 value with a linear function. However, the coefficient regressions of the simple regression analyses for N 60-LL (in percent), N 60-PL (in percent) and N 60-PI (in percent) are low, so these regressions are considered to be unacceptable results and are not evaluated as input parameters in the multiple regression analysis.

Fig. 6
figure 6

Graphs showing the simple regression analyses: a between N 60 and activity; b between N 60 and moisture content; c between N 60 and fine-grained percent; d between N 60 and consistency

The prediction performance of the simple regression equations are evaluated with root mean square error (RMSE) and values accounted for (VAF) indices. Determinations of RMSE and VAF have long been used by various researchers (Alvarez Grima and Babuska 1999; Gokceoglu 2002; Dagdelenler et al. 2011; Minaeian and Ahangari 2013). If the RMSE values are closer to 0 and VAF values are closer to 100, the prediction performance of the equation is excellent. The RMSE and VAF indices for each parameter are provided in Tables 7 and 8.

Table 7 The values of the root mean sguare errors (RMSE) and values account for (VAF) for the predicted N 60 equation derived from simple regression analyses
Table 8 The values of the root mean square errors (RMSE) and values account for (VAF) for the predicted N 60 equation derived from multiple regression analyses with two independent variables

The optimum results are from the equation derived from the correlation between the moisture content and measured SPT blow count; however, the other equations also provide acceptable results.

Multiple regression analysis with two independent parameters

Four parameters of clayey soil properties have been determined from simple regression analyses that could be used as independent input parameters for the multiple regression analyses. These four parameters are soil activity (Ac), fine-grained percent (FGP), soil consistency (Ic) and moisture content (w). A combination of these parameters in two groups results in the following six functions:

$$ f\left(\mathrm{w},\mathrm{A}\mathrm{c}\right)={N}_{60} $$
(1)
$$ f\left(\mathrm{A}\mathrm{c},\mathrm{F}\mathrm{G}\mathrm{P}\right)={N}_{60} $$
(2)
$$ f\left(\mathrm{A}\mathrm{c},\mathrm{I}\mathrm{c}\right)={N}_{60} $$
(3)
$$ f\left(\mathrm{w},\mathrm{F}\mathrm{G}\mathrm{P}\right)={N}_{60} $$
(4)
$$ f\left(\mathrm{w},\mathrm{I}\mathrm{c}\right)={N}_{60} $$
(5)
$$ f\left(\mathrm{I}\mathrm{c},\mathrm{F}\mathrm{G}\mathrm{P}\right)={N}_{60} $$
(6)

When developing an empirical equation, the input parameters should not be interrelated each other. Functions 2, 3, and 5 consist of related parameters. Activity and fine-grained percent of clayey soil are the input parameters of function 2. FGP is a determining parameter of activity, so there is relation between input parameters of Function 2. If we examine the input parameters of Function 3, plasticity index is determining parameter both activity and consistency index, so there is also relationship between activity and consistency index. Lastly, water content is determining parameter consistency index, there is relationship between input parameters of Function 5 also. Therefore, functions 2, 3, and 5 are eliminated in the multiple regression analyses. The Functions 1, 4, and 6 are accepted for multiple regression analysis due to there is not interrelationship between input parameters of functions.

The simple regression equation between N 60 and activity (Ac) is a logarithmic relationship, whereas the simple regression equation between N 60 and moisture content (w, in percent) is a linear relationship. The nature of these equations shows that the nonlinear multiple regression is more suitable than the linear multiple regression.

$$ {N}_{60}=7.92\; \ln \left(\mathrm{A}\mathrm{c}\right)+24.70 $$
(7)
$$ {N}_{60}=-0.52\left(\mathrm{w}\right)+31.20 $$
(8)

The combination of Eqs. 7 and 8 can be defined as follows:

$$ {N}_{60}=A\; \ln \left(\mathrm{A}\mathrm{c}\right)+B\left(\mathrm{w}\right)+C $$
(9)

A, B, and C are the coefficients of the multiple regression equation. The following equation is obtained for the predictions of the SPT blow count N 60 from a nonlinear regression analysis using a statistical program (SPSS 2002).

$$ {N}_{60}=5.55\; \ln \left(\mathrm{A}\mathrm{c}\right)-0.32\left(\mathrm{w}\right)+31.7 $$
(10)

The coefficient of determination (R 2) between N 60 (measured) and N 60 (predicted) from Eq. 10 is 0.78 (Fig. 7).

Fig. 7
figure 7

Graph showing the relationship between N 60 (measured) and N 60 (predicted) with two variables: activity and moisture content

$$ {N}_{60\left(\mathrm{measured}\right)}=1.0002\;{N}_{60\left(\mathrm{predicted}\right)}-0.0117\left({R}^2=0.78\right) $$
(11)

Subsequently, the combination of equations N 60-moisture content (w) and N 60-fine-grained percent (in percent) equation is conducted. The moisture content and fine-grained percent are defined as a function of the standard penetration blow count N 60 in Eqs. 8 and 12.

$$ {N}_{60}=-0.52\left(\mathrm{w}\right)+31.20 $$
(8)
$$ {N}_{60}=-0.21\left(\mathrm{F}\mathrm{G}\mathrm{P}\right)+31.03 $$
(12)

Equations 8 and 12 are both linear, so the multiple linear regression analysis can be conducted. The combination of Eq. 8 and 12 can be expressed as follows:

$$ {N}_{60}=A\left(\mathrm{w}\right)+B\left(\mathrm{F}\mathrm{G}\mathrm{P}\right)+C $$
(13)

A, B, and C are the coefficients of the equations. Equation 14 is obtained by employing a linear multiple regression analysis:

$$ {N}_{60}=-0.344\left(\mathrm{w}\right)-0.122\left(\mathrm{F}\mathrm{G}\mathrm{P}\right)+34.879 $$
(14)

The coefficient of determination (R 2) between N 60 (measured) and N 60 (predicted) from Eq. 14 is 0.69 (Fig. 8).

Fig. 8
figure 8

Graph showing the relationship between N 60 (measured) and N 60 (predicted) with two variables: fine-grained percent and moisture content

The last stage of the multiple regression analysis is the combination of the fine-grained percent and consistency as a function of the standard penetration blow count N 60 in Eq. 6.

The previously performed simple regression analyses produce Eqs. 12 and 15:

$$ {N}_{60}=-0.21\left(\mathrm{F}\mathrm{G}\mathrm{P}\right)+31.03 $$
(12)
$$ {N}_{60}=9.1166\;{\mathrm{e}}^{0.537\left(\mathrm{I}\mathrm{c}\right)} $$
(15)

The combination of the Eqs. 12 and 15 can be expressed as follows:

$$ {N}_{60}=A\;{\mathrm{e}}^{\mathrm{B}\left(\mathrm{I}\mathrm{c}\right)}+C\left(\mathrm{F}\mathrm{G}\mathrm{P}\right)+D $$
(16)

Performing a nonlinear analysis with two variables versus the standard penetration blow count N 60 produces Eq. 17.

$$ {N}_{60}=2559\;{\mathrm{e}}^{0.002\left(\mathrm{I}\mathrm{c}\right)}-0.147\left(\mathrm{F}\mathrm{G}\mathrm{P}\right)-2539 $$
(17)

The coefficient of determination (R 2) between N 60 (measured) and N 60 (predicted) from Eq. (17) is 0.73 (Fig. 9).

Fig. 9
figure 9

Graph showing the relationship between N 60 (measured) and N 60 (predicted) with two variables: consistency index and fine-grained percent

All of the empirical equations derived in this study are provided in Table 9. The high coefficient of regression in all of the equations is noteworthy. The equation derived from the input parameters activity and moisture content produces the highest determination coefficient.

Table 9 Emprical equations derived in this study

Results and discussion

Soil mechanical properties are known to determine the standard penetration blow count number. The purpose of this study is to explain the relationship between the corrected SPT blow counts (N 60) and fine-grained size content, atterberg limits, activity, and consistency. For this purpose, data were obtained from foundation investigations of a sewerage station, central pumping station, and west pumping station in Mersin City, Turkey. The foundation areas of three projects mainly consisted of clayey soils. A total of 20 boreholes in the sewerage station, 10 boreholes in the central station, and 4 boreholes in the west pumping station were drilled. There were 220 N 60 values in the statistical correlation with the laboratory test results of the split spoon samples from the same standard penetration tests.

A series of simple regression analyses were conducted between N 60 and the following parameters: liquid limit, plasticity limit, plasticity index, activity, and consistency index of soil samples. The determination coefficient of the simple regression analyses for N 60-LL, N 60-PL, and N 60-PI were considered unacceptable, so these soil index properties were not used as input data in subsequent regression analyses. The highest determination coefficient was found between N 60 and activity of soils. The simple regression analysis between N 60 and moisture conent, N 60 and fine-grained percent content also resulted in high determination coefficients. The prediction performance of the simple regression equations were tested with RMSE indices and VAF values. All of the accepted empirical equations resulted in accurate prediction performances.

In a second step, multiple regression analyses were conducted. The moisture content and activity of clayey soils were used as the initial input parameters, the moisture content and fine-grained percent of clayey soil were used as the second input parameters, and the fine-grained percent content and consistency index of the clayey soils were used as the final input parameters. All of the derived empirical equations had high regression coefficients, especially the equation derived from the independent input parameters activity and moisture content. The prediction performances of the equations with two independent parameters derived from the multiple regression analyses were more accurate than the equations derived from simple regression analyses.

It is concluded that the performance of the empirical equations obtained in this study is satisfactory. However, all empirical models depend on the number and quality of the data employed. For this reason, before using the empirical equations developed in this study for design purposes, they must be checked. The amount of data limited in this study, so this study must be supported by additional studies. In order to be general equations, the empirical equations of this study must be supported by countless data.