1 Introduction

This paper attempts to demonstrate how the combination of the Finite Element Method (FEM) and the Multiple Response Surface (MRS) optimization can be used to obtain the optimal operating conditions in the design phase of a double-row TRB. TRBs are mechanical devices that are used in many industrial applications. Basically, the TRB is formed by a pair of inner raceways that are separated by a gap of constant distance (δ), an outer raceway, and a group of tapered rollers that are located between the inner and outer raceways. This type of mechanical device is designed to support preload (P), axial (Fa), radial (Fr) and torque (T) loads (operating conditions) that are applied under both static and dynamic conditions. An incorrect combination of these operating conditions can result in high contact stresses (even higher than 1000 MPa) on the TRB rolling elements. This can cause significant local deformations (α max ) and defects, such as pitting and fatigue spalling (Harris and Kotzalas 2006). In addition, if the operating conditions that are applied to the TRB differ from the values that the manufacturer provided, the distribution of contact stresses on the outer raceway in some areas of contact can become nil, cause the rollers to detach from the outer raceways (Nagatomo et al. 2012). Also, a torque that greatly exceeds that recommended by the manufacturer, combined with a reduced preload, may cause the rotation of one of the raceways and a variation of δ that causes the TRB works with difficulty. In this sense, the design, optimization and determination of optimum operating conditions of TRB are mainly based on parameters such as contact stresses, local deformation in the raceways, misalignments and distribution of forces on the rolling elements (Harris and Kotzalas 2006). These parameters are generally studied individually in the design phase of the bearings. However, this form of study has the disadvantage of not obtaining a complete and realistic design of the mechanical device. The novel aspect of this work is that the combination of FEM and the Multiple Response Surface (MRS) optimization could be used to obtain the optimal operating conditions in the design phase of a double-row TRB, while considering at the same time three of the most important design parameters:

  • The contact stress radio that is calculated using the maximum contact normal stress [obtained in the top side of the outer raceway (Stop)] and the minimum contact normal stress [obtained in the bottom side of the outer raceway (Sbottom)].

  • The difference between the gaps obtained in the top and bottom sides of the inner raceways (δtop and δbottom) or misalignment.

  • The local deformation of the contact surface of the outer raceway (αmax).

The optimal operating conditions in a double-row TRB can be achieved by adjusting the values of P, Fa, Fr and T loads when: (1) do not exceed a fixed threshold of contact stress radio for both rows of rollers, (2) that the difference between the gaps or misalignment should be as reduced as much as possible, and (3) that the local deformation on the contact surface of the outer raceway (αmax) not exceed the maximum elastic deformation value. Due to the complexity of finding the optimal operating condition of double-row TRB when three of the most important parameters used in the design phase of TRB are fulfilled simultaneously, 3D FE models, including the P, Fa, Fr and T loads, are developed. The results obtained from these FE models are used (1) to generate quadratic regression models, (2) to predict new operating conditions covering the entire space of possibilities, and (3) to obtain the optimal operating condition of a double-row TRB using MRS with desirability functions.

1.1 Use of the FEM for modeling and optimizing bearings

The FEM is commonly used to design and optimize mechanical devices since it reduces the number of physical prototypes and experiments needed during the design phase (Shigley et al. 2004). In this way, FEM has been applied by many researchers to design bearings. The object of its application is to reduce the produce’s cost by experimental tests once the prototypes have been manufactured.

The FEM, combined with experimental data, has been used to identify and characterize the static and dynamic properties of the bearings. In this sense, Arora et al. (2010) and Arora et al. (2011) used a combination of FEM and experimental data to find the structural characteristics of radial Air-foil bearings and axial Air-foil bearings respectively. The proposed method uses the natural frequencies and modal shapes of the complete bearing to identify structural characteristics of the mechanical device. Also, FEM has been used to obtain the bearing stiffness of a wide range of bearing types and parameters (Guo and Parker 2012). This proposed method was validated by experiments in the literature and is compared to analytical models. Despite the obvious advantages of using the FEM, there are some disadvantages when this method is applied to design mechanical components, such as convergence problems, due to the adjustment of the mesh size, especially if the FE model includes mechanical contacts. It is well known that if the contact surface between the bodies is limited and the mesh size is large, the number of nodes in that contact surface will be reduced and the calculation of contact stresses will not be accurate (Zhang et al. 2003; Demirhan and Kanber 2008). Thus, it is concluded that the FEM is not an appropriate method to use in design and optimization processes if a considerable number of FE simulations are required and the FE models includes nonlinearities. One strategy to reduce the number of simulations required for modeling and optimizing the FE models is to build regression models, such as those based on soft computing techniques and Response Surface Methodology (RSM), that learn from the most characteristic samples obtained from FEM simulations and to use their outputs as a substitute of new FE simulations. RSM is a group of mathematical techniques that utilize a low-degree polynomial function to model the relationships between the independent variables (input variables) and one or more response variables (output variables). Sometimes, these polynomial functions do not provide good results in complex problems that have many nonlinearities and a high number of inputs as the polynomials should be continuous functions. In contrast, soft computing techniques is the set of techniques that are used to explore databases automatically or semi-automatically in order to find patterns, trends or rules that explain the behavior of the data in a given context (Borgelt et al. 2013). Models based on soft computing techniques normally require a larger amount of data to obtain a good model than that required for RSM (Escribano et al. 2014; Lostado et al. 2014). Using soft computing techniques, Lostado et al. (2009) studied the contact pressure on a hub where it was mounted a double-row TRB. In this case, different types of regression techniques were used to predict the distribution of the contact pressures on the hub as function of a preload, a load and a friction coefficient. Applying these regression techniques usually requires a large number of data to generate models with an appropriate generalization capability.

1.2 Response surface method for modeling and optimizing problems

In many cases, the RSM is used to model experimental responses. However, it recently is being used in combination with other techniques (i.e., FEM) to optimize products and industrial processes. RSM is an approximate optimization method that looks at various controllable and independent variables (inputs) and their responses (outputs), in order to identify the combination of inputs that provide the best response [Eq. (1)] (Box and Wilson 1951).

$$ {\text{Y}} = {\text{f}}\left( {{\text{x}}_{1} ,{\text{x}}_{2} ,{\text{x}}_{3} , \ldots ,{\text{x}}_{\text{k}} } \right) $$
(1)

where Y is the response for the experiments and (\( {\text{x}}_{1} ,{\text{x}}_{2} ,{\text{x}}_{3} , \ldots ,{\text{x}}_{\text{k}} \)) are the vectors of input. To optimize the response Y, it is necessary to find an approximation functional relationship between the inputs and the response surface. A second-order polynomial is the functional relationship used by RSM [(Eq. (2)] in this kind of work.

$$ {\text{Y}} = {\text{b}}_{0} + \sum\limits_{i = 1}^{n} {{\text{b}}_{\text{i}} \cdot {\text{X}}_{\text{i}} } + \sum\limits_{i = 1}^{n} {{\text{b}}_{\text{ii}} \cdot {\text{X}}_{\text{i}}^{2} } + \sum\limits_{i = 1}^{n - 1} {\sum\limits_{j = i + 1}^{n} {{\text{b}}_{\text{ij}} \cdot {\text{X}}_{\text{i}} \cdot {\text{X}}_{\text{j}} } } + \varepsilon $$
(2)

where the first summation is the linear part, the second is the quadratic part, the third is the product of pairs of all inputs of the polynomial, and ε is the approximation error. The values of the coefficients b0, bi, bii and bij are calculated using regression analysis to determine the relationship between inputs and outputs. Also, the terms that are selected to form the equation are chosen according to their levels of significance (Gelman 2005). One computes this level by using the analysis of variance (ANOVA) and selecting terms according to the p value obtained. When a problem has more than one output, as in this work, the optimization problem can be solved using MRS (Derringer and Suich 1980). In this sense, Harrington developed the desirability functions to obtain a compromise between the different outputs [Eqs. (3) and (4)], and the overall desirability, which is defined as the geometric mean of the desirability of each output [Eq. (5)] (Harrington 1965).

$$ {\text{d}}_{\text{r}}^{ \hbox{max} } = \left\{ {\begin{array}{*{20}c} 0 & {\quad {\text{if f}}_{\text{r}} \left( {\text{X}} \right) < {\text{A}}} \\ {\left( {\frac{{{\text{f}}_{\text{r}} \left( {\text{X}} \right) - {\text{A}}}}{{{\text{B}} - {\text{A}}}}} \right)^{\text{S}} } & {\quad {\text{if A}} \le {\text{f}}_{\text{r}} \left( {\text{X}} \right) \le {\text{B}}} \\ 1 & {\quad {\text{if f}}_{\text{r}} \left( {\text{X}} \right) > {\text{B}}} \\ \end{array} } \right. $$
(3)
$$ {\text{d}}_{\text{r}}^{ \hbox{min} } = \left\{ {\begin{array}{*{20}c} 1 & {\quad {\text{if f}}_{\text{r}} \left( {\text{X}} \right) < {\text{A}}} \\ {\left( {\frac{{{\text{f}}_{\text{r}} \left( {\text{X}} \right) - {\text{A}}}}{{{\text{A}} - {\text{B}}}}} \right)^{\text{S}} } & {\quad {\text{if A}} \le {\text{f}}_{\text{r}} \left( {\text{X}} \right) \le {\text{B}}} \\ 0 & {\quad {\text{if f}}_{\text{r}} \left( {\text{X}} \right) > {\text{B}}} \\ \end{array} } \right. $$
(4)
$$ {\text{D}} = \left( {\prod\limits_{{{\text{r}} = 1}}^{\text{R}} {{\text{d}}_{\text{r}} } } \right)^{{{ 1\mathord{\left/ {\vphantom { 1{\text{R}}}} \right. \kern-0pt} {\text{R}}}}} $$
(5)

In these equations, parameters A and B correspond to the limits of the inputs range, S is an exponent that determines how important it is to reach the target value, X corresponds to the input vector, and fr is the polynomial function that is used to predict the response.

1.3 Combining FEM and MRS to optimize mechanical problems

For decades, many researchers have used the FEM as an alternative in attempting to reduce the costs during the design and optimization phases of mechanical problems. In this work, we show that FEM has a disadvantage of requiring a large amount of computational time when the studied problem has a non-linear behavior. In contrast, RSM method tends to be immune to computational cost, as it provides a crude approximation for the same problem modeled with FEM (Marwala 2010). In this sense, the combination of RSM when the optimization involves several features and FEM has been used extensively to optimize industrial processes. Thus, for example, Bahloul et al. (2006) develops a 3D FE model for the prediction of the punch load and the stress distribution during the wiping-die bending process. In this work, the FE simulation is carried out through ABAQUS/Standard code, and the prediction of the punch load is performed using RSM to optimize the values of the main parameters involved in the bending process. Moreover, Zeng et al. (2009) combines RSM with FEM for optimizing the design of roll profiles for cold roll forming. In this case, the spring-back angle is considered to be an objective function and the edge membrane longitudinal strains is considered as constrain condition to ensure high forming accuracy with the minimum roll diameter of the forming process. Yanhui et al. (2010) improved the deformation homogeneity in aerospace forgings and Di Lorenzo et al. (2010) developed an optimization strategy for hydro-forming process design. In these research studies, a combination of FEM and RSM were used to model and optimize both processes. More recently, Azaouzi et al. (2012) studied an optimization approach to design a multi-step stamping tool while using a combination of RSM and FEM. Other authors also have used these methods to optimize the design of buildings. In this regard, del Coz Diaz et al. (2014) used RSM and FEM to develop predictive models to simulate and optimize the heat transfer process in buildings.

In the current paper, regression techniques using the results of FE analysis are applied to model S1, S2, Δδ, and αmax when P, Fa, Fr and T are acting under static conditions on a double-row TRB. These models are based on polynomial regression techniques, and were built using RSM. Since TRB has more than one output to be optimized (S1, S2, Δδ and αmax), the optimization problem is solved by using MRS by means of desirability functions. Two criteria, Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE), are used to determine the accuracy of the regression models and in the adjustment process of the FE models.

2 Modeling the double-row TRB using FEM

During the process, all FE models of the double-row TRB are considered to be symmetrical three-dimensional models to reduce the computational cost. The FE models proposed includes the outer raceway, the inner raceways, the tapered rollers and the hub on which the TRB is mounted (Fig. 1a). The FE models are assembled from a combination of elements with 8 and 6 nodes formulated in linear shape functions. The coefficient of friction between the hub and the inner raceway is considered to be 0.2, whereas the coefficient between the rollers and the inner raceway and between the rollers and the outer raceway is considered to be 0.001 (Demirhan and Kanber 2008). Raceways and rollers are modeled using linear elastic and isotropic steel with a Young’s modulus (E) equal to 208 GPa and a Poisson’s ratio (ν) of 0.29. The hub is modeled on the basis of E = 200 GPa and ν = 0.29 (Demirhan and Kanber 2008). For all simulated FE models, the contact detection method used was the segment-to-segment method (MSC Marc User's Guide 2010). In contrast to the classical method of contact detection node-to-segment method, the segment-to-segment method is a consistent and stable method, and is able to easily pass the patch test (El-Abbasi and Bathe 2001). Also, the segment-to-segment method is able to reduce the error in calculating contact stresses with a mesh refinement (Meguid and Czekanski 2008; Zavarise and De Lorenzis 2009). The numerical technique used to solve the mechanical problem was the solver constraints technique. This technique does not require any special element to be placed at the points of contact, and the algorithm automatically detects nodes making contact and generates the appropriate constrains to ensure that no penetration occurs. Also, the friction model used in this case was the bilinear coulomb friction stick slip model. It was a quasi-static problem in which the movements between the different contact surfaces were practically negligible.

Fig. 1
figure 1

FEM of the TRB: a Different component parts of the TRB, b Initial mesh of size 5.1 mm. c Final mesh of size 0.2 mm

In this case, the FE models were adjusted and validated using radial loads Fr and preloads P well-known by this way: First, the mesh size in the contact area between the outer raceway and the tapered roller was adjusted comparing the normal contact stress obtained by the FE model and by a theoretical model based on the Hertz’s theory. Then and once the mesh size was adjusted, the relative displacements between inner and outer raceway obtained by the FE model were compared to the results obtained in a test bench. These test results were used to ensure that the mesh size of the FE model and the coefficients of friction and elastic properties (E and ν), were perfectly defined. The following subsection shows the procedure to adjust and validate the FE models.

2.1 Adjustment of TRB based on the mesh size of raceways and rollers

One of the main difficulties in solving mechanical contact by the use of FEM is to obtain realistic contact stresses. It is well known that if the contact surface between the FE models is low and the mesh size is very large, the number of nodes in the contact surface is smaller and therefore the contact stresses obtained by this method does not are accurate (Feng and Prinja 2001). In this respect, researchers have studied how mesh size affects the contact stresses in the contact zone. For example, (Satyanarayana and Melkote 2004) studied the influence of the element size on the contact stress between ball bodies and flat surfaces. Similarly, Demirhan and Kanber (2008) developed two dimensional FE models based on cylindrical roller bearings and studied the influence of element size on the contact stress. The FE models are validated experimentally by two methods. One was by using theoretical models based on Lundberg’s and Hertz’s theory to calculate the contact stresses in the outer raceways (Lundberg and Sjövall 1958; Hertz 1896). The other method was by determining experimentally the relative displacement between the inner and outer raceways of the cylindrical roller bearings. The contact stresses obtained from the theoretical model based on Hertz’s theories are calculated from the force applied to the cylindrical rollers, which is obtained after FE models are simulated. The convergence of the FE models based on the cylindrical roller bearings are achieved and considered to be valid when the results of the contact stresses and displacements obtained by the FE models proposed did not differ significantly from those results obtained, theoretically and experimentally respectively. In the current paper, and in a way that is similar to Demirhan´s work, the mesh size of raceways and rollers were adjusted by comparing the contact stress obtained from the FE models and those obtained from the theoretical model based on Hertz’s theories. Because the contact detection method that was proposed in this case for all FE models was the “segment to segment” method, it was possible to reduce the error in calculating the contact stresses with a mesh refinement (Zavarise and De Lorenzis 2009). In this case, FE models with localized fine mesh sizes in the contact area were created successively with increasingly smaller mesh size (mesh refinement) until the contact stresses on these FE models on the outer raceway did not differ significantly from the results obtained by theoretical models based on the Hertz’s theory (Demirhan and Kanber 2008). This study concentrated on the contact zone (upper and bottom zones of the outer raceway) for both rows of rollers (Fig. 1b). The loads used for adjusting the mesh size of the FE model by comparison to theoretical models were P with 300, 400, 500, and 600 N and an Fr equal to 2000 N. The theoretical models, in which the FE models were compared, were based on the Hertz’s theory for cylindrical bodies (Hertz 1896). Equation (6) shows how to calculate the contact normal stress \( \sigma \) on the outer raceway based on Hertz’s theory (Fig. 2a).

$$ \sigma = \frac{2 \cdot Q}{{\pi \cdot l_{t} \cdot b_{o} }} $$
(6)
Fig. 2
figure 2

a Theoretical distribution of contact stresses between the roller and the flat surface, b Main geometrical dimensions of a double-row TRB (Harris and Kotzalas 2006)

In this equation, \( b_{o} \) is the width of the contact area, \( l_{t} \) is the rollers’ effective length, and \( Q \) is the normal force on each tapered roller that is obtained directly from the FE model when the preload P (300, 400, 500, and 600 N) and Fr (2000 N) are simulated. Similarly, \( b_{o} \) is calculated by using Eqs. (9),  (10) and  (11), where \( \sum {\rho_{o} } \) is the radii of curvature, \( D_{m} \) is the bearing pitch diameter, \( d_{m} \) is mean diameter of tapered roller, and \( \alpha_{o} \) is the free contact angle.

$$ b_{o} = 3.35 \cdot 10^{ - 3} \left( {\frac{Q}{{l_{t} \cdot \sum {\rho_{o} } }}} \right)^{{{1 \mathord{\left/ {\vphantom {1 2}} \right. \kern-0pt} 2}}} $$
(7)
$$ \sum {\rho_{o} = } \frac{1}{{D_{m} }}\left[ {\frac{2}{{1 + \gamma_{o} }}} \right] $$
(8)
$$ \gamma_{o} = \frac{{D_{m} \cdot \cos \alpha_{o} }}{{d_{m} }} $$
(9)

The particular geometrical dimensions of the studied TRB are: \( D_{i} \) (Bore diameter) = 78 mm, \( D_{o} \) (Outer diameter) = 130 mm, L (Longitude) = 90 mm, and Z (Number of rollers) = 25, \( \alpha \) (Contact angle) = 14.07°, \( \alpha_{o} \) (Free contact angle) = 15.7°, \( \alpha_{R} \) (Tapered roller included angle) = 3.25°, \( \alpha_{i} \) (Inner raceway-roller contact angle) = 12.45°, \( l_{t} \) (Rollers’ effective length) = 20.9 mm, \( D_{m} \) (Mean diameter of tapered roller) = 11.925 mm, \( D_{\hbox{max} } \) (Diameter of tapered roller at large end) = 12.5 mm, \( D_{\hbox{min} } \) (Diameter of tapered roller at small end) = 11.35 mm, and \( d_{m} \) (Bearing pitch diameter) = 102.4 mm. Figure 2(b) shows the main geometrical dimensions of a double-row TRB (Harris and Kotzalas 2006).

Then, like the work performed by Illera et al. (2014) to set FE models in contact problems, the FE models based on the TRB are considered to be valid when the computational costs of each simulation are not excessive, and when:

$$ \forall y_{i} \, / \, MAPE_{yi} \, < \, \varphi_{yi} $$
(10)

where \( y_{i} \, \) is the adjustment parameter and corresponds to the mesh size of the elements of the contact area. Also, \( \varphi_{yi} \) is a previously established threshold and \( MAPE_{yi} \, \) is the error that is defined as Eq. (11).

$$ MAPE_{yi} = \frac{1}{m}\sum\limits_{k = 1}^{m} {\left| {\frac{{Y_{yi\;FEM} - Y_{{yi{\kern 1pt} \;TH}} }}{{Y_{yi\;TH} }}} \right|} $$
(11)

where, \( Y_{yi\;FEM} \) and \( Y_{yi\;TH} \) are the contact stresses obtained from the FE model and the theoretical model, respectively, on the contact zone (upper and bottom zones of the outer raceway) of both rows of rollers and for each different combination of P and Fr. k is one of each result obtained for the nodes that belong to the localized contact zone and m is the total number of nodes considered at each mesh size. Figure 3 shows the contact stresses localized in the upper zone of the outer raceway when the mesh sizes are 5.1 and 0.2 mm. In this case, eight computers with Intel Xeon Processor, CPU 3.4 GHz (2 processors) and 10.00 GB (RAM), were used to simulate the FE models.

Fig. 3
figure 3

Contact stresses on the upper zone of the outer raceway that correspond to the first row of rollers when the mesh size is: a 5.1 mm b 0.2 mm

Table 1 shows the MAPE calculated with Eq. (11) to adjust the mesh of the FE models when P is 300, 400, 500, and 600 N; Fr is 2000 N; and the mesh sizes are 5.1, 1.0 and 0.2 mm. Table 1 shows the results obtained for the upper zone of the outer raceway in the first row of rollers. It also shows the average computational time to simulate each FE model with its corresponding mesh density.

Table 1 MAPE and computational cost of the FE models during the mesh density adjustment of the upper zone of the outer raceway that corresponds to the first row of rollers

In this study, the thresholds that are considered to be valid are \( \varphi = 6\,\% \) for MAPE and 14,400 min (10 days) for maximum computational cost. According to Table 1, MAPE is lower than the threshold considered for all (\( \varphi = 6\,\% \)) P: 4.58, 4.68, 4.78, and 4.88 %). The average computational time is 11,952 min (8.3 days). Similarly, the MAPE was calculated for the upper zone of the outer raceway that corresponds to the second row of rollers and for the bottom zone of the outer raceway to both rows of rollers and to each different combination of P and Fr when the mesh sizes were 5.1, 1.0 and 0.2 mm. For all these latter cases studied, when the mesh size was 0.2 mm, the computational time and the MAPE obtained were, respectively, <6 % and 14,400 min. In addition, all of the most important parameters used to determine the quality of the mesh were fulfilled strictly for all mesh sizes proposed and for all loads applied (Kelly et al. 1983). So, for example, the aspect ratio (defined as the ratio of the longest edge of an element to its shortest edge) never exceeded a value of 5:1 for any of the elements of the meshes studied. The Jacobian factor was always greater than 0.6 so that elements’ zero volume did not exist, and the skew angles were less than 45° for all meshes and loads studied.

According to these criteria, the FE models selected had a mesh size equal to 0.2 mm in the contact zone (upper and bottom zones of the outer raceway) of both rows of rollers. Also, the coefficients of friction (μ = 0.2 and μ = 0.001) and elastic properties (ERollers = 208 GPa, νRollers = 0.29; EHub = 200 GPa and ν Hub = 0.29) proposed for the adjustment of TRB that were based on the mesh size of raceways and rollers were considered to be valid.

2.2 Adjustment of TRB based on relative displacements of the raceways

Once the mesh size, the coefficients of friction and the elastic properties where fixed and considered to be valid, the relative displacements between the inner and outer raceways that were obtained by the FE model were compared to the results obtained on the test bench. This study was conducted primarily to determine if the stiffness of the proposed FE models were very similar to that obtained experimentally for the TRB studied. The relative displacements between inner and outer raceway for each of the combinations of P and Fr were obtained experimentally from a test bench and also from the FE models based on the TRB proposed once the mesh size had been adjusted. This relative displacement is measured experimentally by two comparator pencils (Red Crown developed by Testar Marposs) (Fig. 4a). The adjustments of the relative displacements between the raceways are used mainly to verify that the mesh size of the FE model, their different coefficients of friction, and their elastic properties (E and ν) are defined perfectly. The experimental application of P on the TRB was conducted with a screw that, in turn, pushed a steel sleeve on the inner raceway, while the Fr was applied using a load cell (HBM U3 with 5 KN of capacity) (Fig. 4a). P loads were calibrated previously using a strain gauge with the same value of the preload that was acting on the TRB when it was mounted on a steel sleeve. Figure 4b shows the points where the relative displacement between the inner and outer raceway in the FE model was measured. Figure 4c also shows the differences between the relative displacements obtained from the FE models and from the test bench for the variation of P and Fr. The solid line and the dashed line show, respectively, the relative displacement obtained from the FE models and that obtained experimentally. From these pairs of curves, it can be seen that the difference between the relative displacements obtained (one from the FE models and one from the test bench) are similar. Also, Fig. 4b shows that the greater relative displacement occurred when the lower P was applied, and the minimum relative displacement was obtained when the higher P was applied. Moreover, it is seen in this figure that the greater the preload applied is, the greater is the stiffness of the mechanical contacts and the lower is the deformation of the mechanical device. This stiffness variation is in good agreement with the load–deflection relationship that was previously reported in the literature by Kania (2006) and Harris and Kotzalas (2006).

Fig. 4
figure 4

a Test bench for the TRB: assembly of the comparators pencils, strain gauge and load cell. b FE model to measure the relative displacement between raceways. c Relative displacement of the FE model and experimental data for different values of P

The relative displacements obtained from the FE models were consistent with the experiments and the preset parameters in the FE model (Ebearing = 200 GPa, Ehub = 208 GPa, ν = 0.29, coefficient of friction = 0.001 and 0.2) when the mesh size is considered to be valid as explained previously.

2.3 Operating conditions for double-row TRB

Double-row TRBs are mechanical devices that are designed to support the combination of the P, Fa, Fr and T loads (Fig. 5). Due to this combination of loads, the TRB is subjected to contact normal stresses and local deformations on the different parts of the mechanical device (tapered rollers and outer and inner raceway). Also, the inner raceway is able to move a gap δ along the axis where the bearing is mounted. This gap is the distance that allows the TRB to maintain a preset value of P.

Fig. 5
figure 5

Main components of a typical TRB and the corresponding loads acting on it: a Lateral view b Frontal view

Depending on how the loads that are acting on the TRB are combined, the distribution of contact normal stresses, localized normal deformations, and the gap δ can produce values that are undesirable for the functioning of the mechanical device (Harris and Kotzalas 2006). Thus, for example, if the value of the external loads applied to the TRB corresponds to the standard values provided by the manufacturer, the distribution of normal contact stresses on the outer raceway will usually be similar to that shown in Fig. 6a. In this case, since both the top and bottom zones of the outer raceway show a non-zero value for normal contact stresses, all rollers maintain mechanical contact with the raceways at all times. If the mechanical contact of the rotating parts (tapered rollers and outer and inner raceway) is always maintained, the operating conditions of the TRB are correct and follow the manufacturer specifications.

Fig. 6
figure 6

Distribution of contact stresses on the outer raceway when: a the takeoff of the rollers does not occur b the takeoff of the rollers does occurs. δtop and δbottom when the external loads applied to the TRB are for: c standard values and d incorrect values (Harris and Kotzalas 2006)

In contrast, if the values of the external loads that was applied to the TRB were incorrect (i.e., a highly reduced P combined with a very high Fr and T), the distributions of contact normal stresses on the outer raceway usually would be similar to that shown in Fig. 6b. According to Fig. 6b, the top zone of the outer raceway had a non-zero value for normal contact stresses, whereas the bottom zone of the outer raceway had a zero value for normal contact stresses. In this condition, only a reduced number of rollers in the top zone maintained mechanical contact with the raceways, which can cause the rollers to detach from the outer raceway (Harris and Kotzalas 2006). Also, when the values of the external loads that are applied to the TRB correspond to the standard values that the manufacturer provided, the gap δ on the top (δtop) and the bottom (δbottom) should have been the same (Fig. 6c). In contrast, if the values of the applied external loads on the TRB were incorrect (i.e., a highly reduced P combined with a very high T), the gap on the top (δtop) and the bottom (δbottom) would have been different, which would have caused faulty operation of the TRB (Fig. 6d). In addition, an excessive local deformation on the outer raceway can cause undesirable effects, such as pitting and fatigue spalling of the TRB (Harris and Kotzalas 2006; Eschmann et al. 1985). According to these three requirements, we considered in this paper the following conditions for optimal operation of the TRB:

  1. (i)

    The contact ratio obtained between the normal top contact stress (Stop) and normal bottom contact stress(Sbottom) on the outer raceway and for each of the two columns of rollers should be higher than 20 % to prevent detachment of the rollers in the TRB [Eq. (12)] (Harris and Kotzalas 2006).

    $$ {\text{Contact ratio }}({\text{S}}) = \frac{{{\text{S}}_{\text{top}} }}{{{\text{S}}_{\text{bottom}} }} \cdot 100 $$
    (12)
  1. (ii)

    The difference between the gaps δtop and δbottom should be reduced as much as possible to prevent a forced TRB operation [Eq. (13)] or misalignment (Harris and Kotzalas 2006).

    $$ {\text{Gap difference }}({\Delta \delta }) =\updelta_{\text{top}} -\updelta_{\text{bottom}} $$
    (13)
  1. (iii)

    The localized normal deformation of the outer raceway must not exceed the maximum elastic deformation (αmax) of the contact zone with regard to the diameter of the tapered rolling (d) according to Eq. (14) to avoid pitting and fatigue spalling (Eschmann et al. 1985).

    $$ \upalpha_{\hbox{max} } = 0.0001 \cdot d $$
    (14)

In this case, the minimum diameter of the tapered roller is 11.3 mm. Therefore, the value of α max must not exceed any point on the outer raceway by more than 0.0013.

In this work, the optimal operating condition in a double-row TRB is considered to have been reached when:

  • The contact stress radio obtained for both rows of rollers (S1 and S2) should be between 25 and 50 %.

  • The difference between the gaps obtained in the top and bottom sides of the inner raceways (δtop and δbottom) or misalignment have been reduced as much as possible.

  • The local deformation on the contact surface of outer raceway (α max ) does not exceed the maximum elastic deformation value of 0.00113.

2.4 Design of experiments and design matrix

All statistical methods need a previously decided set of Design of Experiments (DoE) to determine the minimum number of experiments needed to fully consider the space of possibilities and ensure that the data obtained from the experiments is representative to support the hypotheses proposed (Fisher 1935). In general, the initial hypotheses are defined when a number of controllable variables (inputs or design factors) and uncontrollable variables (noise factors) determine the number of responses (outputs) by a continuous and differentiable function (Box and Behnken 1960; Taguchi 1986). There are many methods available to develop a DoE, but all this methods involve the construction of a design matrix, in which inputs and outputs are considered. One of the most widely used methods to develop a DoE is the full factorial design method (Montgomery 2008). Using this method, the experiments can adopt all possible combinations for each of the values (or levels) and each of the factors. Specially, a 2k  factorial design is a particular, full factorial design that has two levels and generates 2k experiments, where k is the number of factors. Using this method in the optimization of the proposed TRB, the number of factors is k = 4 (P, Fr, Fa, and T) and the number of experiments or FE simulations needed is 16. This limited number of experiments or FE simulations is not enough to cover the entire space of possibilities and, subsequently, to obtain accurate regression models with which to search for the optimal operating condition for a double-row TRB. However, a factorial design 3K generates 3k experiments or 81 FE simulations. This amount of data may be sufficient to cover completely the entire space of possibilities, but has the disadvantage of generating a large number of experiments or FE simulations. In this work, the DoE was performed using Central Composite Design (CCD). This method is considered to be a fractional three-level design that is useful in obtaining quadratic regression models, thereby reducing the number of experiments in comparison to a factorial design 3k. Table 2 shows the values of the variables, their code, and the levels needed to implement the DoE based on CCD method.

Table 2 Independent variables and experimental design levels used with the CCD method

Once the factors and levels have been set as in Table 2, the design matrix and their corresponding combination of operating conditions are generated using the “R” statistical software and subsequently simulated in the FE models (R 2014). In this case, 25 experiments or FE simulations are needed to cover the entire space of possibilities and to subsequently search for the optimal operating condition for a double-row TRB. The design matrix (See Table 3) shows the number of experiments or FE simulations and the corresponding values for the operating conditions (P, Fr, Fa, and T).

Table 3 Design matrix with the combination of operating conditions (inputs) to be simulated

3 Results

3.1 Results obtained from the FE models

Once the FE models had been validated, an automatic procedure was performed on parameterized FE models in order to run the 25 simulations according to the design matrix. The results of these FE simulations are shown in Table 4 [contact ratio for the first row of rollers (S1), contact ratio for the second row of rollers (S2), maximum normal deformation (αmax), and gap difference (Δδ)].

Table 4 Results obtained from the FE models using the inputs of Table 3

3.2 Analysis of variance

Equation (2) was fitted using the data shown in Tables 3 and 4 to obtain the regression equations for all responses with the use of the RMS “R” package (Lenth 2009). Second order polynomial models were built for each response, and several criteria (R2, p value, MAPE and RMSE) were used to select the most accurate model.

Equations (17), (18), (19) and (20) show the second degree polynomial function obtained to model S1, S2, αmax and Δδ.

$$ \begin{aligned} {\text{S}}_{1} \, = \, - 0.3553 \, + \, 0.00013 \cdot {\text{P }} - \, 4.2\cdot10^{ - 9} \cdot {\text{P}}^{2} \, + \, 4.14 \cdot 10^{ - 7} \cdot {\text{F}}_{\text{r}} - \, 3 \cdot 10^{ - 10} \cdot {\text{P}} \cdot {\text{F}}_{\text{r}} \hfill \\ - \, 1.4174 \cdot 10^{ - 5} \cdot {\text{F}}_{\text{r}} \, + \, 2.2 \cdot 10^{ - 9} \cdot {\text{P}} \cdot {\text{F}}_{\text{a}} \, - \, 3 \cdot 10^{ - 10} \cdot {\text{F}}_{\text{r}} \cdot {\text{F}}_{\text{a}} + \, 1.001 \cdot 10^{ - 7} \cdot {\text{F}}_{\text{a}}^{2} \, - \, 6.593 \cdot 10^{ - 7} \cdot {\text{T }} \hfill \\ \end{aligned} $$
(15)
$$ \begin{aligned} {\text{S}}_{2} \, = \, - 0.7544 \, + \, 0.00015 \cdot {\text{P }} - \, 4.2 \cdot 10^{ - 9} {\text{P}}^{2} \, + \, 8.2536 \cdot 10^{ - 6} {\text{F}}_{\text{r}} \, - \, 4 \cdot 10^{ - 10} {\text{PF}}_{\text{r}} \, - \, 1 \cdot 10^{ - 10} {\text{F}}_{\text{r}}^{2} \hfill \\ - \, 0.000106 \cdot {\text{F}}_{\text{a}} \, - \, 2 \cdot 10^{ - 10} \cdot {\text{P}} \cdot {\text{F}}_{\text{a}} \, + \, 3 \cdot 10^{ - 10} \cdot {\text{F}}_{\text{r}} \cdot {\text{F}}_{\text{a}} \, + \, 1.604 \cdot 10^{ - 7} \cdot {\text{F}}_{\text{a}}^{2} \, + \, 5.987 \cdot 10^{ - 7} \cdot {\text{T}} \hfill \\ \end{aligned} $$
(16)
$$ \begin{aligned}\upalpha_{\hbox{max} } \, = \, 0.001693 \, - \, 7.29 \cdot 10^{ - 7} \cdot {\text{P }} + \, 3.37 \cdot 10^{ - 8} \cdot{\text{F}}_{\text{r}} \, - \, 1.3504 \cdot 10^{ - 6} \cdot {\text{F}}_{\text{a}} \, + \, \hfill \\ 2 \cdot 10^{ - 10} \cdot {\text{P}} \cdot {\text{F}}_{\text{a}} \, + \, 1 \cdot 10^{ - 10} \cdot {\text{F}}_{\text{a}}^{2} \, + \, 2.9 \cdot 10^{ - 9} \cdot {\text{T}} \hfill \\ \end{aligned} $$
(17)
$$ \begin{aligned} \varDelta\updelta \, = \, 0.03729 \, - \, 9.0738 \cdot 10^{ - 6} \cdot {\text{P }} + \, 5 \cdot 10^{ - 10} \cdot {\text{P}}^{2} \, + \, 4.363 \cdot 10^{ - 7} \cdot {\text{F}}_{\text{r}} \hfill \\ + 1.5571 \cdot 10^{ - 6} \cdot {\text{F}}_{\text{a}} \, + \, 2 \cdot 10^{ - 10} \cdot {\text{P}} \cdot {\text{F}}_{\text{a}} \, - \, 3.1 \cdot 10^{ - 9} \cdot {\text{F}}_{\text{a}}^{2} \, - \, 1.81 \cdot 10^{ - 8} \cdot {\text{T}} \hfill \\ \end{aligned} $$
(18)

These equations show how each output was defined by first and second degree polynomials. Then, ANOVA was used to reduce the size of these regression models by removing the insignificant terms of the polynomial (Sathiya et al. 2006). This method, called step-wise regression, automatically removes the features of no significance for each model (Wilkinson and Dallal 1981). Equations (19), (20), (21) and (22) show the reduced quadratic models.

$$ \begin{aligned} {\text{S}}_{1} \, = \, - 0.004182 \, + \, 5.66892 \cdot 10^{ - 5} \cdot{\text{P }} - \, 3 \cdot 10^{ - 10} \cdot {\text{P}} \cdot {\text{F}}_{\text{r}} \, \hfill \\ - \, 3\cdot10^{ - 10} \cdot {\text{F}}_{\text{r}} \cdot {\text{F}}_{\text{a}} \, + \, 1.078 \cdot 10^{ - 7} \cdot {\text{F}}_{\text{a}}^{2} \, - \, 6.54 \cdot 10^{ - 7} \cdot {\text{T}} \hfill \\ \end{aligned} $$
(19)
$$ \begin{aligned} {\text{S}}_{2} \, = \, - 0.434307 \, + \, 7.55976\cdot10^{ - 5} \cdot {\text{P }} + \, 8.7701 \cdot 10^{ - 6} \cdot{\text{F}}_{\text{r}} \, - \, 4\cdot10^{ - 10} \cdot {\text{P}} \cdot {\text{F}}_{\text{r}} \, \hfill \\ - \, 1 \cdot 10^{ - 10} \cdot {\text{F}}_{\text{r}}^{2} \, - \, 8.39731 \cdot 10^{ - 5} \cdot{\text{F}}_{\text{a}} + \, 1.532 \cdot 10^{ - 7} \cdot{\text{F}}_{\text{a}}^{2} \, + \, 8.1 \cdot 10^{ - 7} \cdot {\text{T}} \hfill \\ \end{aligned} $$
(20)
$$ \upalpha_{\hbox{max} } \, = \, 0.000546 \, - \, 1.685\cdot10^{ - 7} \cdot {\text{P }} - \, 1.5291 \cdot 10^{ - 6} \cdot {\text{F}}_{\text{a}} \, + \, 2 \cdot 10^{ - 10} \cdot {\text{P}} \cdot {\text{F}}_{\text{a}} $$
(21)
$$ \begin{aligned} \Delta\updelta \, = \, 0.036197 \, - \, 8.4334\cdot10^{ - 6} \cdot {\text{P }} + \, 5\cdot10^{ - 10} \cdot {\text{P}}^{2} + \, 3.95 \cdot 10^{ - 7} \cdot {\text{F}}_{\text{r}} \, \hfill \\ + \, 3\cdot10^{ - 10} \cdot {\text{P}} \cdot {\text{F}}_{\text{a}} \, - \, 3.1 \cdot 10^{ - 9} \cdot {\text{F}}_{\text{a}}^{2} \, - \, 9.5 \cdot 10^{ - 9} \cdot {\text{T}} \hfill \\ \end{aligned} $$
(22)

Tables 5, 6, 7 and 8 show the results of ANOVA for each of the final quadratic models. Most of the variables have a p value that is less than 0.01. This indicates that the variables used on the reduced quadratic models are statistically significant. The multiple correlation coefficient (R2) is calculated as the measure of the amount of variation around the mean obtained by the regression model. The results showed that all values of R2 are close to 1, which indicates that these models possess a good predictive capacity.

Table 5 ANOVA results for the variable S1 using the final reduced quadratic model
Table 6 ANOVA results for the variable S2 using the final reduced quadratic model
Table 7 ANOVA results for the variable αmax using the final reduced quadratic model
Table 8 ANOVA results for the variable Δδ using the final reduced quadratic model

MAPE and RMSE are calculated to determine the generalization capacity of the reduced quadratic models using the samples shown in Table 4 according to Eqs. (23) and (24).

$$ MAPE = \frac{1}{m} \cdot \sum\limits_{k = 1}^{m} {\left| {\frac{{Y_{k \, FEM} - Y_{{k{\text{ Model}}}} }}{{Y_{k \, FEM} }}} \right|} $$
(23)
$$ RMSE = \sqrt {\frac{1}{m} \cdot \sum\limits_{k = 1}^{m} {\left( {Y_{k \, FEM} - Y_{{k{\text{ Model}}}} } \right)^{2} } } $$
(24)

In this case, Y kFEM are the normalized responses obtained from the FE models and Y kModel are those obtained from the reduced quadratic models developed with RSM. The normalization of the data is commonly used in statistical processes in order to transform all variables to the same scale (from 0 to 1). In this case, this transformation is carried out subtracting the minimum value from each original value and dividing by the range of each variable according to Eq. (25).

$$ Y_{k, \, norm} = \frac{{Y_{k} - \hbox{min} \left( Y \right)}}{range(Y)} $$
(25)

Table 9 shows the prediction errors, where the maximum error corresponds to α max (MAPE equal to 8.06 % and RMSE equal to 13.29 %), and the minimum error corresponds to Δδ (MAPE equal to 0.71 % and RMSE equal to 1.46 %).

Table 9 Results of the predicted error criteria for S1, S2, αmax and Δδ using the reduced quadratic models

Additionally, 22 new FE models were implemented to test the quadratic regression models with new operating conditions that had not been used previously to generate reduced quadratic models. The errors obtained during the testing stage are shown in Table 10, where the maximum error corresponds to S2 (MAPE equal to 5.47 % and RMSE equal to 7.70 %), and the minimum error corresponds to Δδ (MAPE equal to 1.72 % and RMSE equal to 2.87 %).

Table 10 Test dataset with 22 new instances

Figure 7 shows the relationship between the actual (FEM) and predicted (reduced quadratic models) values of S1 (Fig. 7a), S2 (Fig. 7b), αmax (Fig. 7c), and Δδ (Fig. 7d). The figures show that these models are adequate to predict these values because the residuals that were obtained are small and the correlations between actual and predicted values are high.

Fig. 7
figure 7

Scatter diagram of: a contact ratio for the first row of rollers (S1), b contact ratio for the second row of rollers (S2), c the normal maximum deformation (αmax) and d the gap difference (Δδ)

3.3 Multi-response optimization

Table 11 shows the combination of operating conditions and outputs that were studied in searching for the optimal operating condition in a double-row TRB through desirability functions using the RMS “R” package (Kuhn 2014). The first column of this table shows the inputs and outputs that were studied. The second column of the table shows the goal set in the optimization process for both inputs and outputs. In this case, the process of searching for the best combination was accomplished by prefixing a range for the preload (between 8000 and 10,000 N), while Fr, Fa and T were analyzed for a fixed number of settings. In this case, the combination of the operating conditions Fr, Fa and T were considered to be the load capacity, while P is preset when the TRB is mounted on the hub. Thus, for example, Fr varies from 60,000 to 100,000 N (every 500 N), Fa varies from −200 to 600 N (every 10 N) and T varies from −100,000 to 100,000 N mm (every 2500 N mm). The requirements of searching for the optimal operating condition are set between 20 and 50 % for S1 and S2 (goal in range), between −0.0011 and −0.0006 mm for αmax (goal equal to maximum), and between 0.0197 and 0.0366 mm for Δδ (goal equal to minimum). The combination of inputs (Fr, Fa and T) for the optimal operating condition required (S1, S2, αmax and Δδ) involves a total of 5,31,441 different combinations with their corresponding different desirabilities.

Table 11 Results of the optimal operating condition to obtain the requirements of S1, S2, αmax and Δδ (Total number of combinations studied equal to 5,31,441)

Some of the combinations obtained to reach the optimal operating conditions according to the requirements of the TRB are shown in Table 12. Thus, for example, the first row of the table (Combination N° 1) shows the highest values of desirability (=1) when the load capacity of the TRB is the maximum (Fr, Fa and T), while P can reach any value (P = 8666.75 N; Fr = 62,500 N; Fa = −200 N, and T = 52,500 N mm). Also, if the value of Fr is the maximum when the values of Fa and T are the maximum negatives values (Combination N° 2), P = 8451.01 N and the desirability is equal to 0.199. This combination of operating conditions shows that for large values of Fr, the desirability values obtained are lower. In contrast, if the values of Fa are the maximum and minimum possible for any of the values of Fr and T, the values obtained for P and for the desirability are, respectively, 9905.42 and 0.936 (N° 3) and 8605.55 and 1 (N° 4). Furthermore, the combinations N° 5, 6, 7 and 8 show the operating condition when the values of S1, S2, αmax and Δδ are the lowest. Also, these four combinations of operating conditions show high values of desirability. Finally, the combination N° 9 has a desirability value equal to zero when the values of Fr, Fa and T are the highest.

Table 12 Results of the operating condition necessary to reach the requirements of S1, S2, αmax and Δδ

Figure 8 shows the desirability function versus the normalized load capacity used to obtain the requirements with the optimal operating conditions based on combinations from Table 11. In this case, the normalized load capacity is defined as the sum of the normalized values that correspond to Fa, Fr and T [Eq. (26)].

Fig. 8
figure 8

Desirability function versus the normalized load capacity

$$ {\text{LC}}_{\text{norm}} = {\text{F}}_{\text{a norm}} + {\text{F}}_{\text{r norm}} + {\text{T}}_{\text{norm}} $$
(26)

Each one of the normalized values of Fa, Fr and T are calculated using Eq. (25). Figure 8 shows that the optimal operating condition to obtain the requirements of S1, S2, αmax and Δδ (desirability equal to 1) is reached for a reduced number of combinations (top line from point A to point B). In this case, point A corresponds to the minimum values of normalized load capacity (P = 8694.88 N; Fr = 60,000 N; Fa = 0 N and T = 0 N mm), and point B corresponds to the maximum values of normalized load capacity (P = 8666.75 N; Fr = 62,500 N; Fa = −200 N and T = 52,500 N mm). In contrast, the lower line (from point C to point D) corresponds to the worst operating condition to obtain the requirements (equal desirability to 0). Point C corresponds to values of the minimum load capacity (P = 8087.38 N; Fr = 96,000 N; Fa = −200 N and T = 52,500 N mm), and point D corresponds to values of the maximum load capacity (P = 8058.11 N; Fr = 100,000 N; Fa = −200 N and T = 100,000 N mm). Finally, point E corresponds to the combination of the maximum load capacity (P = 9999.87 N; Fr = 100,000 N; Fa = 600 N and T = 100,000 N mm), which corresponds to a desirability value of 0.126.

Figure 9 shows the desirability function versus several loads: Fr (Fig. 9a), Fa (Fig. 9b) and T (Fig. 9c). The desirability function may achieve maximum values equal to 1 in a reduced range of Fr (from point F (60,000 N) to point G (62,500 N) in Fig. 9a), in a wider range of Fa (from point H (−200 N) to point I (70 N) in Fig. 9b), and in a wider range of T (from point J (50,000 N) to point K (100,000 N) in Fig. 9c). Finally, from these figures one can see that the maximum values of the desirability function are achieved only for values of Fr that are close to the minimum of its range.

Fig. 9
figure 9

Desirability function versus a Fr, b Fa, and c T

Finally, in this work, several more FE simulations are conducted with some of the optimized operating conditions in order to compare the FE results to the predicted ones. Figure 10 provides, for some of the optimal operating conditions shown in Table 12, a comparison between the values of S1, S2, αmax and Δδ obtained from the RSM method and the values obtained from the FE model.

Fig. 10
figure 10

Comparison of the predictions of S1, S2, α max and Δδ using the RSM method (cross) and FE method (open circle)

This figure clearly shows that the errors in the prediction of S2 obtained with the RSM method (MAPE equal to 5.45 %) with respect to the values obtained from the FE model are higher than with the other parameters (MAPE for S1 equal to 2.92 %, for αmax equal to 4.4 % and for Δδ equal to 1.76 %).

4 Conclusion

It has been clearly demonstrated that a combination of FEM and MRS optimization can be used to search for the optimal operating condition in the design phase of double-row TRB. The FEM has been used successfully to simulate the TRB and a good agreement between simulations and experiments has been obtained. Then, the validated FE models are used to generate three of the most important parameters of the TRB design, which was used subsequently to develop quadratic regressions models. In order to check their generalization ability, these quadratic regressions models were tested with new operating condition. Several error criteria were calculated. In addition, a multi-response optimization study was conducted using a desirability approach and based on the reduced quadratic regressions models obtained. The optimization results indicated that the optimal operating conditions, which are P = 8666.75 N; Fr = 62,500 N; Fa = −200 N and T = 52,500 N mm, correspond to the maximum load capacity for the range of operating conditions preset. The study also showed that, for the range of preset operating conditions, the combination that makes the load capacity maximum does not obtain the requirements for which the TRB has been designed. Finally, from the results obtained, it is concluded that combining FEM and MRS optimization can be used successfully in the design of any mechanical device.