1 Introduction

All mechanical soil properties fall into two categories: intrinsic and state properties. The intrinsic properties include particle size distribution, particle sphericity, roundness, surface roughness, and properties related to mineralogy such as hardness. State properties include void ratio, effective confining stress and fabric. Intrinsic properties are those that are unaltered by sampling, densification or other disturbance. Abundant studies have shown that soils with similar intrinsic and state properties exhibit similar macro mechanical behavior. Some of the studies include: Eisma [1], Koerner [2], Holubec and d’Appolonia [3], Youd [4], Zelasko et al. [5], Edil et al. [6], Oda et al. [7], Sladan et al. [8], Vepraskas and Cassel [9], Moroto and Ishii [10], Sukumaran and Ashmawy [11, 12], Cubrinovski and Ishihara [13], Yasin and Safiullah [14], Santamarina and Cho [15], Cerato and Lutenegger [16], Cho et al. [17], Guo and Su [18], Masad et al. [19], Rouse et al. [20], Bareither et al. [21], Cavarretta et al. [22], Shin and Santamarina [23], Zheng and Hryciw [24] and many others.

The Discrete Element Method (DEM) is commonly used to simulate particulate materials under mechanical loading to investigate micro scale behavior that may not be easily observed or measured in physical tests. Hypothetically, if a virtual soil specimen is created that has the same intrinsic and state properties as a real soil specimen its behavior in a DEM simulation should faithfully mimic the actual physical test. The question then becomes how to reproduce the intrinsic and state properties in the DEM model.

Although major advances have occurred in DEM modeling over the last few decades, the virtual soil specimens often still consist of idealized disks or spheres as they did at the origin of DEMs in the 1970s [25]. Admittedly, disks or spheres have the merits of simple contact detection and force calculation, and therefore low computational cost. However, they oversimplify the geometries of natural soil particles and, as a result, may not accurately simulate the micro or macro behavior of real soils. Therefore, techniques for generating irregular particle shapes to more faithfully reproduce real soil grains have been sought.

Non-spherical particles such as ellipsoids [26,27,28,29,30], spherical cylinders [31], pentagons [32], rounded-cap elongated rectangles [33], polyhedrons [36,37,38,39,40,41], and Non-Uniform Rational Basis-Splines (NURBS) [41] have been developed and utilized in DEM. ‘While these have been improvements over perfect disks or spheres, they are still idealized geometries. An improvement towards simulating more realistic particle geometries uses clumps of overlapping spheres (or circles) to create various irregular shapes. Many clump generation algorithms have been proposed including the methods of Ferellec and McDowell [41] and Taghavi [42]. The basic procedures of these algorithms are similar; they exhaustively fit many interior circles (or spheres) to the particle boundary then eliminate some of the less needed circles (or spheres) to reduce computational effort. However, the criteria used to eliminate the circles vary from particle to particle depending on particle size, angularity and image magnification [43]. Therefore, users must subjectively tune the criteria parameters for each clump to achieve an optimal number of circles to represent each particle. Consequently, considerable effort is needed to generate a statistically significant large number of clumps to reproduce a natural soil specimen in a DEM model.

Recently, Zheng and Hryciw [43] developed a two dimensional (2D) “corner-preserving” clump generation algorithm. They did so by utilizing and extending the classic definition of particle roundness developed by Wadell [44,45,46]. The corner preserving algorithm significantly reduces the need for user intervention in the clump generation process and therefore minimizes subjectivity and enhances the accuracy of the generated clumps. Compared to previous approaches, the corner-preserving algorithm uses many fewer circles to represent a soil particle thereby also improving computational efficiency.

To reproduce an actual soil specimen perfectly, the corner-preserving algorithm could conceivably be used to generate as many clumps as there are particles in the soil specimen. To do so would require an image of every soil particle. It is certainly worth considering if such a large number of particle images must necessarily be collected each time a new soil is to be simulated. Instead, could virtual soil specimens be constructed from a permanent library of clumps based only on desired distributions of particle shapes and sizes? The desired distributions could be to model an actual soil specimen for which the size distribution would be determined either by conventional sieving or image analysis and for which the shape distribution would be determined by image analysis on a statistically representative number of particles. Alternatively, the library could be used for parametric studies without an actual soil to be modeled.

Some intrinsic properties including surface roughness and hardness are accounted for through defined mechanical model parameters such as the contact friction coefficient and particle stiffness. The remaining parameters including the distributions of particle size, particle sphericity and roundness have limitless possibilities and must be preserved by the clump geometries. The question narrows down to how to create appropriate clump geometries to satisfy the desired distributions. As mentioned earlier, exhaustively reproducing all of a specific sand specimen’s particles by clumps is not an efficient approach. Creation of the clump library would eliminate the need for reproducing tens of thousands of particles for each simulation.

This paper describes the development of a large permanent clump library for use with DEMs. It shows how the library was constructed from numerous previously collected images of individual sand particles and how it may be used. At the present time, the library consists of only 2D particles developed from images of 2D particle projections. While soils must naturally be simulated by 3D clumps in DEMs, there are compelling reasons and distinct advantages to first developing a 2D particle library. First, the classic definitions of R and S are based on 2D particle projections and there is a long history of their usage. The corner-preserving algorithm is also presently only developed for 2D. It is easier to demonstrate the construction and usage of a 2D library. Results of 2D simulations are easier to visualize and interpret. Finally, the computational power required to create and test the library is far less demanding. Indeed, a 3D DEM simulation using a statistically valid number of clumps to faithfully simulate a real soil problem is still somewhat unrealistic.

2 Intrinsic soil properties

Based on 2D particle projections, particle size will be defined in this paper as the diameter, d, of circle having the same area, A, as the soil particle:

$$\begin{aligned} d=2\times \sqrt{\frac{A}{\pi }} \end{aligned}$$
(1)

Particle shape could be characterized at three observation scales according to Barrett [47]. From largest to smallest scale, the descriptors are sphericity (S), roundness (R), and surface roughness. Sphericity describes how close the overall particle shape is to a perfect circle (or sphere). For 2D projections, the S has been defined differently by Tickell [48], Wadell [46], Santamarina and Cho [15], Altuhafi et al. [49], and Krumbein and Sloss [50]. Zheng and Hryciw [51, 52] compared the various definitions and found the one proposed by Krumbein and Sloss [50] to be the most practical for describing S. Cho et al. [17] and Zheng and Hryciw [53] showed that various mechanical properties of soils including packing, compressibility, critical state friction angle, and small strain modulus are closely related to the S of Krumbein and Sloss [50]. Therefore, their simple definition of sphericity is adopted for this study:

$$\begin{aligned} S=\frac{d_2 }{d_1 } \end{aligned}$$
(2)

where \(d_{1}\) and \(d_{2}\) are the length and width of a particle respectively as shown in Fig. 1.

Fig. 1
figure 1

Geometric parameters for defining various intrinsic properties

At the second observation scale, R describes the sharpness of corners on soil particles. The definition of particle roundness was first proposed by Wadell [18,19,20] as the ratio of the average radius of curvature of the corners of a particle to the radius of the maximum inscribed circle as shown in Fig. 1:

$$\begin{aligned} R=\frac{{\sum \nolimits _{i=1}^{N_c } {r_i } }\Big /{N_c }}{r_{ins} } \end{aligned}$$
(3)

where \(r_{\mathrm{i}}\) is the radius of the ith corner circle; \(r_{\mathrm{ins}}\) is the radius of the maximum inscribed circle; and \(N_{\mathrm{c}}\) is the number of the corners around particle perimeter. Determination of Wadell roundness was recently automated using a computational geometry algorithm developed by Zheng and Hryciw [51]. The algorithm was verified by Zheng and Hryciw [51, 52] using all of the particles in the R reference charts provided by Krumbein [54], Krumbein and Sloss [50] and Power [55].

Surface roughness describes the soil surface at smallest observation scale as shown in Fig. 1. In DEM simulations, while d, S and R can be preserved exactly by the geometries of the generated clumps, surface roughness must be represented by the contact frictional coefficient \(\mu \). The determination of \(\mu \) will be addressed later in this paper.

Fig. 2
figure 2

The corner-preserving algorithm

3 Corner-preserving algorithm for creating clumps

The corner preserving algorithm for creating 2D clumps begins with a binary particle projection such as shown in Fig. 2a. Two steps are involved in creating a clump to represent the particle silhouette. In step 1, the computational geometry algorithm developed by Zheng and Hryciw [51] identifies corners on the particle perimeter and fits circles to them as shown in Fig. 2b. The maximum inscribed circle and minimum circumscribing box are also determined. At this point, d, A, R and S can be computed. In two-dimensional DEM models, the third dimension is of unit width and therefore the particle volume is \(V=A \times 1\).

The particle perimeter along with its corner circles (Fig. 2c) are the input to the second stepwhere the non-corner portions of the particle perimeter are identified and fitted with additional circles as shown in Fig. 2d. In the end, the soil particle perimeter is represented by a cluster of several corner circles and many non-corner circles. The example particle shown in Fig. 2d was approximated by a total of 61 circles of which 9 are corners circles and 52 are non-corner circles.

In the process of clump generation, various actual intrinsic soil properties including d, R and S are computed for each particle, i.e. clump. These parameters will serve as unique indices for the clumps in the library.

4 Clump library construction

To date, a total of 98,489 images of real soil particles have been collected and modeled by clumps using the 2D corner-preserving algorithm. The “clump library” includes a wide range of particle sizes, angularities, and sphericities. The library can be expanded by adding more clumps. Ten select clumps from the library are shown in Fig. 3. The listed R and S values have been rounded and saved in the library with two decimals. The number of circles used to construct each clump \((n_{\mathrm{c}})\) is also shown. Having complex surface structures, the angular particles clearly require more circles to approximate them. Very elongated (low S) particles also naturally need more circles. Each clump is archived in the library by an index “\(R_{L}\_S_{\mathrm{L}}\_\hbox {id}\)” consisting of three integers as shown in Fig. 3. The \(R_{\mathrm{L}}\) and \(S_{\mathrm{L}}\) are hundred-folds of the computed R and S values respectively. Since many clumps may have the same \(R_{\mathrm{L}}\) and \(S_{\mathrm{L}}\) values, the third parameter “id” is used to discriminate them. The combination of these three digits is able to catalog an unlimited number of clumps in the library. The information stored in the library for each clump also includes basic and structural information. The basic information includes the clump volume v and size d. The structural information includes the total number of circles \(n_{{c}}\), as well as the radius and center location of each circle.

Fig. 3
figure 3

Ten selected clumps from the library

Fig. 4
figure 4

Map of the clump library

A convenient map of the clump library is shown in Fig. 4. The locations of clumps in the library are determined by their \(R_{\mathrm{L}}\) and \(S_{\mathrm{L}}\) values. As just mentioned, many particles may possess the same R and S combination. Thus, multiple clumps may exist at one map location. For example, there are 34 clumps at the spot \((R_{\mathrm{L}}, S_{\mathrm{L}}) = (55, 63)\) in Fig. 4. The 34 clump are distinguished by the id numbers from 55_63_1, 55_63_2, ... up to 50_63_34.

In Fig. 4, the library has 100 \(\times \) 100 locations but not all locations contain clumps. Locations at \(S_{\mathrm{L}} < 20\) and \(R_{\mathrm{L}} < 15\) barely have any occupants. This is somewhat expected. Extremely elongated and angular particles are rare in nature since such particles are vulnerable to breakage [56]. Histograms of the clumps for \(R_{\mathrm{L}}\) and \(S_{\mathrm{L}}\) are shown in Fig. 5a, b respectively. Angular particles have more complex outlines than rounded particles and the variability of particle shapes increases with increasing angularity (decreasing roundness). Therefore, more angular particles in the range \(20< R_{\mathrm{L}} < 50\) were deliberately collected for the library as shown in Fig. 5a. The \(S_{\mathrm{L}}\) histogram in Fig. 5b reflects the prevalence of shapes in the range \(60<S_{\mathrm{L}} < 80\) found in nature.

Fig. 5
figure 5

The number of clumps in the library versus \(R_{\mathrm{L}}\) and \(S_{\mathrm{L}}\)

Fig. 6
figure 6

Overview of the virtual specimen preparation technique

5 Steps in virtual soil specimen creation

The DEM virtual soil specimen creation process includes four steps as shown in Fig. 6.

  1. Step 1:

    Given a specific soil, users must characterize it to obtain particle d, S, and R distributions. The d can be determined by traditional sieve analysis or by spectral of optical techniques such as the University of Illinois Aggregate Imaging System (UIAIA) [57,58,59,60], the Aggregate Imaging System (AIMS) [61,62,63,64], the Translucent Segregation Table system (TST) [65, 66], the QICPIC system [49], the Vision cone penetrometer (VisCPT) [67,68,69,70] and Sedimaging [71]. The particle shape distributions including R and S distributions can be computed using computational geometry methods from either two-dimensional binary projections [51] or from images of three-dimensional particle assemblies [52]. This study found that 200 particles were sufficient to provide satisfactory characterization of a real soil. However, users can create their own particle size and shape distributions to construct a customized virtual soil specimen.

  2. Step 2:

    Users must input the testing vessel dimensions, a traditional weight-based particle size distribution (real or assumed) and a target packing porosity \(n_{\mathrm{p}}\). For normal (not gap-graded) soils, weight (or volume) based particle size distributions may be fitted with the Rosin–Rammler function [72]:

    $$\begin{aligned}&V\left( d \right) =1-\exp \left[ {-\left( {\frac{d}{D}} \right) ^{\lambda }} \right] \end{aligned}$$
    (4)

    in which D and \(\uplambda \) are the fitting parameters related to the 50% particle size by weight, \(D_{50}\) and the coefficient of uniformity, \(C_{\mathrm{u}}\) respectively. The total number of particles (N) required to fill the virtual testing vessel at porosity \(n_{\mathrm{p}}\) and their distribution by size (d) are then computed.

  3. Step 3:

    The distributions of R and S are unwieldy in discrete form. As such, they may be modeled by a two-dimensional Gaussian probability density functions (PDF):

    $$\begin{aligned}&f\left( {x|\mu ,\Sigma } \right) =\frac{1}{2\pi }\frac{1}{\left| \Sigma \right| ^{1/2}}\nonumber \\&\qquad \qquad \qquad \quad \times \exp \left\{ {-\frac{1}{2}\left( {x-\eta } \right) ^{\prime }\Sigma ^{-1}\left( {x-\eta } \right) } \right\} \nonumber \\ \end{aligned}$$
    (5)

    where x is a matrix of R and S values of k evaluated particles:

    $$\begin{aligned}&x=\left[ {{\begin{array}{l} {R_1 ,R_2 ,R_3 ,\ldots ,R_k } \\ {S_1 ,S_2 ,S_3 ,\ldots ,S_k } \\ \end{array} }} \right] ; \end{aligned}$$

    \(\eta \) is the mean vector of R and S values:

    $$\begin{aligned}&\eta =\left[ {{\begin{array}{l} {\eta _R } \\ {\eta _S } \\ \end{array} }} \right] , \end{aligned}$$

    and \(\Sigma \) is the covariance matrix: \(\Sigma =\left[ {{\begin{array}{l} {\Sigma _{RR} } \\ {\Sigma _{SR} } \\ \end{array} }{\begin{array}{l} {\Sigma _{RS} } \\ {\Sigma _{SS} } \\ \end{array} }} \right] \). The program retrieves the N clumps from the library based on a soil specimen’s R & S PDF.

  4. Step 4:

    The N clumps are input to the DEM code to assemble the virtual soil specimen. Several DEM codes are compatible with clump simulation including the commercial code Particle Flow Code (PFC) by Itasca [73] and open source codes including the Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) [74] and its improved version General Granular and Granular Heat Transfer Simulations (LIGGGHTS) [75]. The PFC by Itasca was used in this study.

6 Example of virtual soil specimen creation

The process used to prepare virtual specimens for DEM simulation of direct shear testing is demonstrated using an Indiana Beach sand at dense and loose conditions.

Fig. 7
figure 7

Characterization of Indiana Beach sand

In Step 1 traditional sieve analysis was used to determine the particle size distribution of an Indiana Beach sand. The result is shown in Fig. 7a as a traditional weight-based cumulative distribution. A total of 200 particles were then randomly selected from the physical specimen. The particles were spread on a flat surface and binary images were captured. Zheng and Hryciw’s [51] computational geometry method was used to compute R and S values for every particle. The cumulative distributions of R and S values by number of particles are shown in Fig. 7b, c.

In Step 2, the particle size distribution shown in Fig. 7a is fitted using Eq. (4). The values \(D = 1.2\) and \(\lambda = 2.3\) were found to match the Indiana Beach Sand sieving results very well. The curve is discretized into 200 equal fractions as shown in Fig. 8. Each fraction represents a volume increment of \(\Delta V = 0.5\%\). The particle size \(d_{\mathrm{i}}\) representing the ith volume fraction can be back-calculated using Eq. (4). Knowing \(d_{\mathrm{i}}\), the volume \(v_{\mathrm{i}}\) of each particle in the ith volume fraction can be computed as shown in Fig. 8.

Fig. 8
figure 8

Discretization of particle size distribution curve

To determine the number of particles in the ith fraction, the volume of solids in this fraction must be computed as \(V_{\mathrm{t}}(1-n_{\mathrm{p}})\Delta V\) where \(V_{\mathrm{t}}\) is the testing vessel volume and \(n_{\mathrm{p}}\) is the porosity, both specified by the user. Finally, the number of particles required for the ith volume fraction can be expressed as:

$$\begin{aligned} num_i =\frac{V_t \times \left( {1-n_p } \right) \times \Delta V}{v_i } \end{aligned}$$
(6)

For example, the volume of a 2D shear box \(V_{\mathrm{t}}\) used in this study was \(100 \times 48 \times 1 = 4800\hbox { mm}^{3}\) and the target porosity \(n_{\mathrm{p}}\) was set to 0.2. The particle size of ith volume fraction was 0.94 mm. Based on Eq. (6), \(num_{\mathrm{i}}= 27\) particles were required to model this fraction. This computation is repeated for each volume fraction. The total number of required particles N to fill the direct shear box will be the sum of \(num_{\mathrm{i}}\). In this case, N was 12,518.

In Step 3, N clumps are retrieved from the library according to their R and S distributions as follows. The R and S distributions in Fig. 7b, c are fitted to the two-dimensional Gaussian probability density function, Eq. (5). In this example, \(\eta \) and \(\Sigma \) were computed as:

$$\begin{aligned} \eta =\left[ {{\begin{array}{c} {0.61} \\ {0.72} \\ \end{array} }} \right] \hbox { and }\Sigma =\left[ {{\begin{array}{cc} {0.0193} &{}\quad {-0.0019} \\ -0.0019&{} \quad 0.0293 \\ \end{array} }} \right] . \end{aligned}$$

These were inserted into Eq. (5) to determine the probability density function f which is plotted in Fig. 9. This f(RS) map in Fig. 9 dictates the sampling probability for each location in the clump library shown in Fig. 4. The program will sample the library N times as was determined in Step 2. It should be noted that one location can be sampled multiple times. The number of stored clumps at each location in Fig. 4 will generally not be the same as the needed number from that location. Statistically speaking, if the total number of particles needed from a single location in the library is m and the number of clumps at that location is k, then each clump from that library location will be used, on average, m / k times.

Fig. 9
figure 9

The normalized probability density map

Fig. 10
figure 10

Number of clumps retrieved from each location in the library for the example

Using the above procedure, \(N = 12{,}518\) clumps were retrieved from the library for the Indiana Beach sand. Figure 10 shows the number of clumps sampled from each location in the library.

The d values obtained in Step 1 and 2 are now randomly assigned to the N generated clumps. This is done by enlarging or reducing each clump to meet its assigned d value. For example, the 15.2 mm clump shown in Fig. 2 is one of the clumps retrieved from the library. The particle was assigned a size of \(d = 0.94\hbox { mm}\). Therefore, the clump in Fig. 2 is reduced by a factor of 0.94/15.2 to achieve the target d. The final size and shape information for this clump is \((R_{\mathrm{i}}, S_{\mathrm{i}}, d_{\mathrm{i}}) = (0.21, 0.70, 0.94\hbox { mm})\).

The d, R and S distributions of clumps in the DEM model are compared to the distributions of the original soil in Fig. 11. The DEM model successfully reproduced the target distributions of the original soil.

Fig. 11
figure 11

Sand size and shape distributions generated from the clump library compared to their target distributions

In Step 4, the N generated clumps were input into PFC and rained into a direct shear box. The top cap was added and the specimen was consolidated under a normal stress of 100 kPa. A linear contact model was used in this simulation. The model parameters were set as follows: the clump density \(\rho \) was 2650 \(\hbox { kg}/\hbox {m}^{3}\); the effective modulus of the clumps \(E_{\mathrm{c}}^{*}\) was \(5 \times 10^{8}\hbox { N}/\hbox {m}^{2}\); the effective modulus of the wall \(E_{\mathrm{w}}^{*}\) was \(5 \times 10^{9}\hbox { N}/\hbox {m}^{2}\); the normal-to-shear stiffness ratio \(\kappa ^{*}\) was 1.5; both normal and shear critical damping ratio \((\beta _{\mathrm{n}}\) and \(\beta _{\mathrm{s}})\) of the clumps were 0.5. To achieve a dense packing state, the friction coefficients for both the clump–clump and clump–wall interfaces were set to zero. The specimen is shown after consolidation in Fig. 12a. Two small square areas are enlarged and shown in Fig. 12b, c. As observed, each clump possesses a unique size and shape.

Fig. 12
figure 12

The generated DEM model at a dense condition

After consolidation, the final porosity n was 0.13. It should be noted that the n value after consolidation is not necessarily equal to the initially specified \(n_{\mathrm{p}}\). The \(n_{\mathrm{p}}\) is a presumed value for initially computing the number of clumps required to be retrieved from the library. The n after consolidation may be different than \(n_{\mathrm{p}}\). Correspondingly, the final vessel height may also be different from the previously specified value to accommodate the change in porosity due to consolidation. In this example, the height of the specimen was 44 mm after consolidation instead of previously specified 48 mm dimension. It was found in this study that such small changes in shear box height did not affect the final simulation results. Naturally, the \(n_{\mathrm{p}}\) should be as close to n as possible. Therefore, steps 1–4 could be repeated using the resulting n value as the new \(n_{\mathrm{p}}\). Readers are encouraged to do so if they feel it is necessary. However, in this study, \(n_{\mathrm{p}}\) was fixed at 0.2 and was not adjusted.

To achieve a loose packing condition, the friction coefficient for both the clump–clump and clump–wall interfaces were set to 0.3 for the consolidation stage. The final porosity n was 0.2 which is the same as the presumed value \(n_{\mathrm{p}}\). All other steps in setting up the loose condition were the same as for the dense state.

7 DEM simulation of direct shear tests

The normal stress was kept as 100 kPa during specimen shearing. The clump–clump and clump–wall contact friction coefficients were set to 0.50 and 0.95 respectively. The upper half of the box was fixed and the lower half was displaced at a constant velocity of 0.004 mm/s. The top wall was continuously adjusted via a serve-control mechanism [73] to maintain the constant vertical normal stress during shearing. The horizontal displacement, vertical displacement, and shear force results are shown in Fig. 13.

Fig. 13
figure 13

DEM simulations of Indiana Beach sand at dense and loose conditions

The peak friction angle and dilation angle can be computed from the test curves as

$$\begin{aligned} \phi _p= & {} \hbox {a }\tan \left( {\frac{T_p }{P}} \right) \end{aligned}$$
(7)
$$\begin{aligned} \psi _p= & {} \hbox {a }\tan \left( {\frac{\Delta u_y }{\Delta u_x }} \right) \end{aligned}$$
(8)

where P is the normal force and \(T_{p}\) is the shear force at the peak; \(\Delta u_{\mathrm{x}}\) and \(\Delta u_{\mathrm{y}}\) are the horizontal and vertical displacement rates at the peak strength state. The resulting \(\phi _{\mathrm{p}} \) and \(\psi _{\mathrm{p}}\) are listed in Fig. 13.

Fig. 14
figure 14

Intrinsic property distributions of the simulated soils

8 Clump library usage in parametric studies

Direct shear tests on five additional 2D versions of soils including 30A, Crushed Gabbro (CG), Leighton Buzzard (LB), Class IIA (IIA), and Glass Beads (GB) were also simulated to demonstrate the versatility of the clump library. The 30A and Crushed Gabbro are very angular and crushed angular soils respectively. The Leighton Buzzard is a subangular soil frequently used in research as reported by Lings and Dietz [76] and Jewell [77]. The Class IIA and Glass Beads are subrounded and rounded materials respectively. The particle size distribution of Leighton Buzzard (LB) shown in Fig. 14b was extracted from Lings and Dietz [76]. The particle size distributions of the remaining four sands shown in Fig. 14a, b were determined through sieving analysis. The dashed lines are the Rosin–Rammler curves fitted to the sieving data. The fitting parameters D and \(\lambda \) are summarized in Table 1. The volume of the shear box \(V_{t}\) was set to \(100 \times 48 \times 1 = 4800\hbox { mm}^{3}\) and the target porosity \(n_{\mathrm{p}}\) was set to 0.2 for all of the test simulations. The resulting numbers of particles (N) required to fill the direct shear box for the five soils are reported in Table 1.

Table 1 The parameters of virtual soils in DEM simulations

For each soil except the Glass Beads, 200 particles were characterized to determine their R and S distributions. The results are shown as solid lines in Fig. 14c, d. The mean R and S values (\(R_{\mathrm{m}}\) and \(S_{\mathrm{m}})\) are listed in Table 1. These solid lines were used to fit two-dimensional Gaussian distributions of R and S using Eq. (5) for retrieval of clumps from the library. The resulting R and S distributions of the picked clumps are plotted as dashed lines in Fig. 14c, d. They agree well with the target distributions. For the Glass Beads, both R and S are equal to 1.0 therefore perfect disks were used in the simulation. The model parameters including \(\rho \), \(E_{\mathrm{c}}^{*}\), \(E_{\mathrm{w}}^{*}\), \(\kappa ^{*}\), \(\beta _{\mathrm{n}}\), and \(\beta _{\mathrm{s}}\) were the same as used in the earlier simulation of Indiana Beach sand. Only dense conditions were simulated. Therefore, the friction coefficient \(\mu \) was set to zero for the consolidation stage and to 0.5 for the shearing stage. The normal stresses were set to 100 kPa. The porosities n of the five sands after consolidation are listed in Table 1.

The simulation results are shown in Fig. 15 while the \(\phi _{\mathrm{p}} \) and \(\psi _{\mathrm{p}}\) values are plotted in Fig. 16. As shown both \(\phi _{\mathrm{p}} \) and \(\psi _{\mathrm{p}}\) increase with decreasing R (increasing angularity) which agrees with the experimental observations of Santamarina and Cho [35], Cho et al. [37], Bareither et al. [56] and countless others.

Although average S values of natural soils fall in a narrow range of 0.6–0.8 [56], it would be interesting to observe the behavior of soils having low S. This is easily facilitated using the clump library. Two virtual soils were created using the Class IIA soil but defining two additional new S distributions as shown in Fig. 17: \(S_{\mathrm{m}} = 0.52\) and \(S_{\mathrm{m}}= 0.32\). The size and R distributions as well as other parameters were maintained the same as in the original Class IIA. After consolidation, the porosity values were expectedly different: the elongated soils had larger n. The simulation results from these newly created virtual soils are shown in Fig. 18. The computed \(\phi _{\mathrm{p}} \) and \(\psi _{\mathrm{p}}\) for each test are shown in Fig. 19. Both \(\phi _{\mathrm{p}} \) and \(\psi _{\mathrm{p}}\) increase as S decreases suggesting elongated particles increase strength and dilation. The same laboratory observation was also made by Santamarina and Cho [35] and Cho et al. [37].

Fig. 15
figure 15

DEM simulations of five sands with different roundnesses

Fig. 16
figure 16

The effects of roundness by DEM simulations

Fig. 17
figure 17

Sphericity distributions of actual and three virtual Class IIA specimens

Fig. 18
figure 18

Simulations of Class IIA at three sphericities

Fig. 19
figure 19

The effects of sphericity by DEM simulations

To investigate the importance of particle gradation, the parameters D and \(\uplambda \) for the Leighton Buzzard sand were changed to create a virtual Well-Graded Leighton Buzzard sand (WGLB) having a \(C_{\mathrm{u}}\) of 8.7. The particle size distribution curve for WGLB was added to Fig. 14b. The fine portion smaller than \(D_{10}\) (0.15 mm) was ignored because it would dramatically increase the total number of clumps, thereby exceeding current computational ability. The R and S distributions and the remaining parameters were maintained the same as they were for the original Leighton Buzzard sand as listed in Table 1. The DEM simulation results show that strength was enhanced slightly by increasing \(C_{\mathrm{u}}\) from 1.3 (LB) to 8.7 (WGLB) as shown in Fig. 20. Kokusho et al. [78] performed a series of laboratory tests on sands with different \(C_{\mathrm{u}}\) values and found that shear strength could either increase or decrease with increasing \(C_{\mathrm{u}}\) depending on particle crushability. If particles are not crushable, the shear strength will increase with increasing \(C_{\mathrm{u}}\) while the opposite trend was observed for crushable particles. The crushing of particle will suppress the relative movement and overriding of particles resulting in smaller strength and dilatancy. In this study, both \(\phi _{\mathrm{p}} \) and \(\psi _{\mathrm{p}}\) increase with \(C_{{u}}\) because the clumps were not allowed to crush. This agrees with Kokusho et al.’s [78] observation.

Fig. 20
figure 20

The effects of gradation in DEM simulations

9 Discussion

  1. 1.

    The current 2D clump library consists of 98,489 geometries. In the real world, every particle, particularly if it is angular, has a unique geometry. Furthermore, even for the same particle, different scanning direction will yield different projected 2D geometries. Therefore, the 2D library obviously cannot include all possible particle geometries. However, as discussed earlier, many studies have shown that different soils with similar intrinsic properties will exhibit similar macro-mechanical behavior under the same state conditions. The 2D library does not aim to exhaustively contain all possible particle geometries encountered in nature. Rather, it contains sufficient geometries for reproducing all possible intrinsic parameter distributions.

  2. 2.

    To simplify image acquisition, the clumps in the library were generated from images of particles spread out on a flat surface. As such, the particles are more likely to have been displaying their maximum area projections. While it would appear that such images are most reflective of particles lying in a depositional plane, the library does not ascribe any specific orientation to the clumps. As such, the clumps in the library may be used for simulating other orientations having appropriate user-specified R and S distributions.

  3. 3.

    The shortcomings of two-dimensional DEMs are well known: 2D particles have only three degrees for freedom compared to six in real soils. Nevertheless, as discussed in the Introduction, 2D DEMs hold several attractive benefits including computational efficiency, simplicity of model preparation and easier visualization of particle motions. Furthermore, the direct linkage between Wadell’s classic 2D definition of particle roundness, through the corner-preserving algorithm, preserves the exact shape of particles and facilitates more efficient clump generation. In the future, Wadell’s definition of roundness could and should be extended to 3D thus providing a basis for 3D clump generation. Such clumps would require hundreds of spheres making them still computationally unrealistic at the present time. Nevertheless, even in 2D, the value of clump libraries towards parametric study of particle micromechanics is significant.

10 Conclusions

A library consisting of almost 100,000 two-dimensional clumps was developed for use with Discrete Element Methods. Each clump was created from a binary image of a sand particle. The particles in the clump library are indexed by their values of roundness (R) and sphericity (S). The R follows the classical definition of roundness developed by Wadell [44,45,46] while S follows a simple definition proposed by Krumbein and Sloss [50]. A previously developed computational geometry technique fits circles to a particle’s corners thereby facilitating the computation of R. A previously developed “corner-preserving” algorithm fits many additional interior circles to the particle perimeter. The particles in the clump library are based on sands taken from many sources and thus they possess a wide range of R and S values.

The clump library may be used to simulate an actual soil whose particle size distribution is determined by either sieving or by image analysis and whose distributions of R and S are determined by computational geometry on 200 randomly selected particles. The particle size distribution is fitted to the Rosin–Rammler function while the R and S distributions are smoothed by a two-dimensional Gaussian probability density function. The clump library may also be used in parametric studies in which the particle size, R and S distributions are designed by the user. In either case, the clumps are picked from the library to create a virtual soil specimen according to their Gaussian distributions of R and S. The clump sizes are then adjusted to match the desired particle size distribution. Example problems showed very good matches between the actual distributions of particle size, R and S and the distributions of these intrinsic properties in a virtual specimen composed of clumps from the library.

DEM simulations of direct shear tests on five vastly different sands were performed using the clump library. The simulation results qualitatively match well known trends documented in the literature. Both \(\phi _{\mathrm{p}} \) and \(\psi _{\mathrm{p}}\) were observed to increase with decreasing R (increasing angularity), decreasing S, and increasing \(C_{\mathrm{u}}\) (for non-crushable soils). Those simulations illustrate the usefulness and versatility of the clump library in creating virtual specimens possessing various shapes and particle gradations.