1 Introduction

The global precast construction system has developed rapidly in recent years, with an annual rate of about 5% [1], and the market size of the precast industry reached nearly USD 200 billion in 2017 [2]. In China, over 600 precast factories have been established the past 3 years, and over 1000 precast factories cover more than 30,000 m3 [3]. Facing the rapid growth of market, urgent issue of the shortage of skilled labor in precast industry has been discussed frequently. Production methods and manufacturing processes used for precast components are different from traditional ones and demands workers greater both knowledge and technical precision [4].

Skilled workers are undoubtedly important because they could provide high and stable productivity. However, on the other side, the productivity of unskilled workers seems often being ignored. As the saying goes, “Rome wasn’t built in 1 day,” newly employed workers wouldn’t become skilled one of a sudden. They have to go through a learning process to be skillful, and researchers have studied that and developed the learning theory, which has been applied to many different industries, including the construction industry [5,6,7,8,9,10,11]. During the learning process, the productivity of unskilled workers will increase over time and gradually become stable. If the manager regards unskilled workers’ productivity as a constant, their productivity may not be fully utilized. Therefore, a precise model to describe the changes in the productivity of unskilled worker is critical. However, literature shows that only simple models such as the straight-line model or cubic model have been applied in the construction or precast industry [9,10,11,12,13,14]. Considering the complexity of the precast industry, whether these simple models could precisely describe the learning status is doubtful.

Therefore, the purpose of this study is as follows: (1) to develop different learning curve models for trainees during their initial learning of each precast component production process; (2) to evaluate different learning curve models and find out the best one. Since measuring productivity difference for individuals due to workers’ ages is complicated, we have assumed that workers whose ages are in the range of 15–65 have the same productivity.

2 Precast Production Management and Process

Precast method has been considered to be effective production methods to control cost, improve productivity, and ensure quality within the construction industry, while maintaining fast and automatic production processes [15]. It is regarded as one of the most common and advanced industrialization methods in the construction industry, with the utilization of the methods of normalization, standardization, and modularization. The building is divided into many elements or components, such as columns, walls, beams, plates, and so on. After being produced in a factory via industrial processes, these elements or components are transported to the construction site to be assembled into a building structure [16].

Studies have been conducted to improve the productivity of precast factories through various methods, such as management practices, process reengineering, and simulation [16,17,18,19,20].

Li et al. organized the literature in the Management of Prefabricated Construction (MPC) research field between 2000 and 2013. These studies are categorized into five major themes within, including the “Future Development of the Industry”, “Technology Development and Application”, “Performance Evaluation”, “Technology Application Environment” and “Design, Production, Transportation and Assembly Strategies”.[19].

The production process of the precast industry has also been reviewed. By reviewing studies based on production process models of precast factories, the manufacturing processes for precast components can be understood [21]. Based on previous research, we regard the production processes of precast components as the following 14 basic activities (photos of some construction activities are shown in Fig. 1), including: (1) steel mold cleaning (clearing molds); (2) module assembly; (3) lofting (positioning for iron components); (4) dipping of steel rod cages; (5) laying of embedded parts; (6) checking before pouring concrete; (7) pouring concrete; (8) surface whitewashing; (9) concrete curing; (10) mold removing; (11) demolding; (12) component repair; (13) inspection of finished components and (14) warehouse storage, [22,23,24]. Further work including data collection, analysis, and discussion will base on 14 activities in this study.

Fig. 1
figure 1

Related production activities of precast structure

3 Learning Curve Theory and Its Application

In 1936, Wright found that when yield is doubled in aircraft component production lines, the required work time can be reduced by 20%. He then proposed a straight-line model that speculates a constant rate of learning or improvement, by which the work time of a given production cycle can be reduced by a constant percentage each time a new cycle is added [5, 9, 25]. After Wright proposed the straight-line learning curve model, many other learning curve models that are different from this model were proposed. Since Wright's discovery of the learning effect on repetitive activities in aircraft component production lines in 1936 [5], the question of how to use the learning curve effect to improve the productivity of repetitive production activities has been a subject of concern to many scholars and applied to many industries. Many studies have published papers on whether this theory can improve productivity, predict output value, assess project progress, and improve cost-effectiveness [7, 26,27,28,29]. In addition, construction industry-related researchers have applied the learning curve theory to improve industry productivity [11, 25].

Jordan Srour et al. divided the various learning curve models proposed by scholars based on Wright’s straight-line model into five categories: (1) the Wright model and its variations; (2) polynomial models; (3) exponential models; (4) hyperbolic models; (5) the recursive model proposed by Srour himself [11]. Among these models, 7 learning curve models are well-known and frequently used, and include the following:

  1. 1.

    Straight-line model

    This model assumed that the degree of improvement of work time is a result of learning at a fixed logarithmic ratio, resulting in a straight line forming in double logarithmic coordinates (Eq. 1) [5, 10, 30, 31].

    $$Y = aX^{ - n} ;\quad L = 2^{ - n} ,$$
    (1)

    where Y time required to produce unit X (cost or man-hours), X quantity of units reproduced, a time required to produce unit 1 (in cost or man-hours), n slope of learning curve in double logarithmic coordinates, L learning rate.

  2. 2.

    Stanford B model

    With concern that the straight-line model was not fully applicable to certain data from the WWII era, the Stanford Research Institute of the United States Department of Defense took into account the existing experience of the workers that the straight-line model did not include. An improved model named the Stanford B model was proposed based on straight-line model theory in 1949 (Eq. 2) [11, 32].

    $$Y = a\left( {X + b} \right)^{ - n} ;\quad L = 2^{ - n} ,$$
    (2)

    where b is the degree of experience that already exists \(\left( {1 \le b \le 10} \right)\), and the rest of the parameters are set the same as those set by the straight-line model.

    Parameter b in this model is generally preset to 4. When b = 0, it represents the complete absence of existing experience on the part of the operator, under these conditions the Stanford B model is identical to the straight-line model [33].

  3. 3.

    DeJong model

    DeJong developed the DeJong model in 1957, considering whether mechanized operations would affect the learning curve. He argued that if operations were primarily controlled by machinery, the potential compression of production time proportional to the increase in the number of operations could be damped, and added an ‘incompressibility factor’ to the learning curve model to define the degree to which the production time could be compressible (Eq. 3) [6, 34].

    $$Y = a\left[ {m + \left( {1 - m} \right) \times X^{ - n} } \right];\quad L = 2^{ - n} ,$$
    (3)

    where m incompressibility factor \(\left( {0 \le m \le 1} \right)\).

    In general, if the operation is performed manually, m = 0.25; if m = 0, it means that the operation is under complete manual control, in which case the DeJong model is identical to the Straight-line model. Meanwhile, if m = 1, the operation is fully automated, and as such undergoes no learning effect [31].

  4. 4.

    S-Curve model

    The S-Curve model was developed by Carr in 1946. Since subsequent studies found that the Stanford B model was more suitable for the first half of the curve and the DeJong model was more suitable for the second half of the curve, Carr combined these two learning curves into the S-Curve model (Eq. 4), where the parameter settings are the same as those of the above model [26, 32, 35]

    $$Y = a\left[ {m + \left( {1 - m} \right)\left( {X + b} \right)^{ - n} } \right];\quad L = 2^{ - n} .$$
    (4)
  5. 5.

    Cubic model

    The cubic model included the effects of existing experience and the cessation of productivity improvement after operational proficiency had been achieved and assumed that the learning rate would not be constant (Eq. 5) [10, 36]

    $$\log Y = \log a - n\left( {\log X} \right) + c\left( {\log X} \right)^{2} + d\left( {\log X} \right)^{3} ;\quad L = 2^{ - n} .$$
    (5)
  6. 6.

    Exponential model

    The concept of the exponential learning curve was first developed by Thurstone in 1919 and was refined by Kientzle et al. [8, 37,38,39,40,41], The mathematical formula of the constant time model developed by Towill is shown in Eq. (6)

    $$Y = A + B*\left( {1 - e^{{c\left( {x - 1} \right)}} } \right);\quad L = 2^{ - n} ,$$
    (6)

    where A initial performance, the time it takes to produce the first unit (synonymous with the above variable a). B difference between the asymptotic and initial performance. A + B asymptotic or final performance, the production time that tends to stabilize after the learning process has been completed. c learning constant.

  7. 7.

    Piecewise model

The comparison among these 7 methods is listed in Table 1. In addition, the time used in this formula is slightly different from that used in the several aforementioned formulas, the aforementioned time x is defined as the production efficiency on the xth day, and here the time is defined as the production efficiency after x days of study, so x − 1 is taken as the parameter for the equation [11, 26].

Table 1 Comparison between commonly used learning curve models

Since the learning curve theory came into being, many scholars have also applied it toward the cause of improving the productivity of the construction industry. As far as the learning process is concerned, it can be divided into the initial operation learning phase and the later routine procedure phase [42, 43]. In 1986, Thomas et al. collected data from 65 of precast component utilization procedures at construction sites, conducted fitting to five learning curve models including the straight-line, Stanford B, cubic, piecewise, and exponential models in order to examine their R2 value. The results show that the cubic model has the best fit for historical data and is also best suited to predict the production time for independent sampling data at the same phase [10]. Everett and Farghal studied the fit of 12 learning curves to historical data and the ability to predict future performance against 60 sets of construction data covering the on-site assembly process of precast components. The results show that the cubic model is more suitable for fitting existing historical data compared to other models. However, the cubic model performs the worst at predicting future production data; the straight-line model performs the worst in fitting existing historical data but the best at predicting future production data [9].

Learning curve theory was also applied to different construction projects in other studies. Lee et al. studied cases of high-rise buildings in Korea and developed a set of learning curves which considered several factors that could affect the learning curve in the construction of high-rise building projects and were then converted into another set of suggested learning curves to improve labor productivity [14]. Based on the data of 15-storey concrete buildings in Italy, Pellegrino et al. conducted a fitting using a straight-line model and discussed the influence of interrupting construction projects on the learning curve [13]. Many scholars also have applied the learning curve theory to formwork engineering, reinforcement fixing operations, roof insulation engineering, and other projects [25, 44, 45]. Based on the above literature, it is found that the learning curve models most commonly used in the construction industry are the straight-line model and cubic model. Researches attempt to apply learning curve theory to increase the productivity of the construction industry has a very long history with many research results having been achieved in this field [10, 11]. However, the application of learning curves in the precast industry has only involved a few analysis on the assembly operation of precast components at construction sites [9, 10], and there has been little research on the production processes of precast components. Furthermore, regarding the complexity of the construction industry, both the straight-line model and the cubic model could be questioned as too simple. Therefore, this study will analyze the training data of precast workers in learning the production process of precast components and validate the fit and predictive accuracy by using straight-line, cubic, and exponential models, allowing the results of the analysis to help the precast industry improve its productivity.

4 Data Collection and Basic Analysis

This study gathered and analyzed the precast structural component data from more than 90% of new precast construction projects in Taiwan among 5 years (2015–2019). To participate, understand and investigate the production system of precast plants through thorough field study, we observed, measured, collected, and verified the characteristics and the duration of each manufacturing activities in the field. Data collected mainly targeted on the main production time of three types of structural components, namely the main beam, minor beam, and column. Our team measured every trainee’s daily production time based on 14 basic activities mentioned above, and the production data was collected from the first day they learned to work on those activities until their performance becomes steady. There are 4352 workers involved in the data collection project where 354,240 data points are recorded from the field and none of them has been used or published in any other work. The research contents of 14 activities in precast factories conducted by the research team are described as follows:

  1. 1.

    Type of projects for data collection

    There are seven project types for the collected data, including collective housing, schools, office buildings, large shopping malls, technology plants, biotech factories, and composite shopping malls, as shown in Fig. 2. The data collected include production times for basic activities in the primary construction of precast building structures, and the recorded production times are calculated in minutes.

  2. 2.

    Objects for data collection

    1. (1)

      Newly employed workers: the manager of the precast factory will allocate training activities as need, and each newly employed worker is considered able to formally conduct production activities after completing one of the 14 training activities.

    2. (2)

      In-service workers: the training of other activities is carried out to increase worker’s skill levels according to human resource planning and assignment of the precast factory, as well as personal preference on the part of the workers themselves.

    3. (3)

      The trainees include both domestic workers and foreign workers.

      Fig. 2
      figure 2

      Scope analysis of precast projects

  3. 3.

    Object background information for data collection

    1. (1)

      The trainees used for data collection in this study are all actively employed workers at a precast factory.

    2. (2)

      Each trainee has undergone a physical examination and was in good health before becoming an active employee.

    3. (3)

      The experience of the trainees, whether related to the precast industry or not, is irrelevant to the training activities.

  4. 4.

    Data collection methods

    1. (1)

      In the first year, our team observed the training status of workers within the precast factory, and during each training session at the precast factory, the team mainly performed measurement and video recording from 8:00 AM to 5:00 PM that day. However, some activities (such as lofting, laying of embedded parts, surface whitewashing, and component repair) were trained on a non-periodic basis, and the team also made appropriate cooperative efforts. The next 4 years, with the consent of the precast factory, data collection was mainly performed via CCTV video recording, and videos were regularly exported for data analysis.

    2. (2)

      In this study, data were collected for each individual’s training sessions across all 14 activities, via random sampling mode. If the person exited the training period prematurely, that data was excluded.

    3. (3)

      The collected data correspond to the training modules produced on that day, with a maximum of 6 sets and a minimum of 3 sets, and the work time of various production processes have been recorded for each of the 14 activities.

    4. (4)

      The training of some activities is sometimes conducted privately by the workers themselves, mainly for activities such as lofting, the laying of embedded parts, surface whitewashing, and component repair (evidence states that employee pay is increased after they have completed training for the above four activities), so training time does not necessarily occur during working hours.

  5. (5)

    Analysis of production trainee numbers

    The analysis of the number of production trainees in this study is shown in Table 2, and the total number of trainees for whom data has been collected is 4352. There are 3432 domestic (78.9%) and 920 foreign (21.1%) workers participated during training. In terms of age distribution, the highest proportion for domestic workers in Taiwan is at age of 30–39 (1605 persons, 46.8%), while the largest proportion for foreign workers is at age of 20–39 (396 persons, 43.0%). The total average time of employment of those involved in precast projects was 1.61 years, and the overall time employed of those involved was not high. The total average time employed of domestic workers was 1.68 years, while that of foreigners was 1.53 years.

    Table 2 The analysis of the number of production trainees

This study distinguishes 14 activities into 3 modules according to the categories used by Chen et al.: the molding module, the filling module, and the repair and storage module [18]. The data analyses of component production trainees are described separately below:

  1. 1.

    Molding module

    The analysis of data from the molding module is shown in Table 3 and consists of five activities: steel mold cleaning (mold clearing), module assembly, lofting, dipping of steel rod cages, and laying of embedded parts. Within the research database of this module, domestic workers accounts for over 70% in most training activities, except in the laying for embedded parts, where it was 59.6% (293 persons). Most of the trainees aged 20–39 (over 70% in every activity of this module), and more than one-third of them aged 30–39. In terms of the average time employed for participants in precast projects, the average time employed of personnel in lofting and laying of embedded parts is significantly higher than that of other activities in this module.

  2. 2.

    Filling module

    The analysis of the data for the filling module is shown in Table 3 and consists of four activities: checking before pouring concrete, pouring concrete, surface whitewashing, concrete curing. Within the research database of this module, domestic workers accounts for over 50% in every training activity, with the highest ratio being in concrete pouring at 86.2% (424 persons). Over 70% workers aged 20–39 for all four activities. In terms of the average time employed of participants in precast projects, the average time employed of employees engaged in surface whitewashing is significantly higher than that of other activities in this module.

    Table 3 Analysis of trainee data
  3. 3.

    Repair and storage module

    The analysis of the data for the repair and storage module is shown in Table 3, and this consists of five activities: mold removing, demolding, component repair, inspection of finished components, and warehouse storage. Within the research database, domestic workers accounts for over 60% in every training activity, with the ratios of component repair and inspection for finished components being as much as 74.6% (367 persons) and 73.6% (362 persons) respectively. Over 70% of workers aged 20–39 for all four activities. In terms of average time employed for participants in the precast projects, time employed for those trained in mold removal and component repair was significantly higher than those of other activities in this module.

Due to the different degrees of difficulty for each activity, the number of days required to perform data collection also varied. For some activities, the work time of workers tended to stabilize within 10 days, but other activities required more than a hundred days to stabilize. The number of observations, observation days, and data points collected for each activity are shown in Table 4 Each worker produced 3–6 sets of modules per day, and the work time was recorded according to the 14 prescribed activities. Within the production data of 14 activities, work time on the final measurement day can be reduced by 32–87% compared with that of the first day, as shown in Table 5. I It follows that the learning effect clearly increases productivity in trainees. However, if human resources are to be deployed to take advantage of this effect, it is important to know how the trainees' work time changes before they enter a stable phase under the learning effect. Therefore, in the next section, various models of learning curve theories will be applied to identify the most suitable model to describe the changes in production data for trainees during the initial learning phase, which can serve as an important foundation for subsequent research or practical applications in improving the productivity of precast factories.

Table 4 Collected data
Table 5 Production time comparison between the first and final measurement for each activity

5 Learning Curve Model and Validation for Precast Component Production

To acquire the learning curve for each activity, we analyze the obtained data through tenfold cross-validation and makes ten analyses by dividing the 354,240 datasets for all 14 activities in the building precast structure into ten equal parts. In each analysis, 90% of the data are used as training data to perform regression analysis of the straight-line, cubic, and exponential learning curve models, and the R2 value is used to check the degree of fit. The other 10% of the data are then used as testing data to perform validation, and the MAPE value is then calculated to judge the prediction accuracy of the model. The results of the analysis are shown in Table 6, for each activity, there is a range of R2 values and MAPE values due to a total of 10 times of cross-validation. For each activity, both the R2 and the MAPE value of the cubic and exponential learning curve models perform better than those of the straight-line learning curve model. It is thus known that, for the precast component production data at the initial production phase, the cubic and exponential models can more accurately fit the historical data than the straight-line model, and also are more suitable for predicting the production data of trainees. Therefore, this study will continue the subsequent analysis based on the cubic and exponential models and generate learning curves for each activity.

Table 6 Learning curve model comparative analysis for each activity

In the above-mentioned cross-validation, the data have undergone ten cross-validation analyses, so ten sets of learning curves are generated for each activity. In the analysis results of the cubic and exponential models, the difference between the maximum and minimum values of R2 is within 0.008, and the difference between the maximum and minimum values of MAPE is within 10%, so the ten sets of learning curves for each activity can be regarded as being very similar curves. To generate a learning curve representing each activity, the study has selected the minimum MAPE value among the ten sets of learning curves of each activity as the learning curve LCbest represented the activity.

After comparing the LCbest curves of the cubic and exponential models, it is found that the differences in MAPE values for the two models are within 5%, indicating that the two models have similar performance. Among these differences, the biggest comes from the activity of lofting, in which the MAPE value of the exponential model exceeds that of the cubic model by 4.07%. In addition, if the learn curve is actually drawn, it can be found that many activities are affected by the cubic model’s characteristics of inflection points and more inconsistent fluctuations in the initial phase, while the exponential model can mitigate this problem (as shown in Fig. 3). Therefore, we suggest that the learning curve of each activity in the precast factory should adopt the exponential model.

Fig. 3
figure 3

Cubic and exponential learning curve of lofting

The exponential model learning curve, initial learning rate, R2 value, and MAPE value for each activity are shown in Table 7. The R2 values for all the activities are above 0.88, indicating that the degree of fit is extremely high and the MAPE values are all less than 10%, in line with the high-accuracy prediction defined by Lewis [46]. Therefore, the learning curve model of 14 basic activities developed in this study can fit the data collected by this study and can also accurately predict the production data of newly employed workers having undergone initial training.

Table 7 Exponential model data for each activity

Based on the above research results, learning rate can be seen to not be a fixed value in learning processes in which trainees have learned how to conduct precast component production activities, so the finding of this study is in line with that of Thomas et al. [10]. Moreover, we know that the cubic learning curve model proposed by Thomas et al. has both good fit and predictability. However, the performance of the cubic model in the initial learning of some activities undergoes major fluctuations, so we propose that the exponential model performs as a more appropriate model to represent the learning curve of precast component production activities.

6 Results and Discussion

According to the above research results, we further divide the learning curves of all activities into two groups (as shown in Fig. 4) by using the K-means algorithm and the learning curve formula from Table 9. The result of this grouping can be seen from Table 8, and there are 10 activities in the first group while four activities in the second. The initial performance (A) and the asymptotic performance of the second group are both high, and the absolute value of the learning constant is lower. As a result, the complex activities can be defined including lofting, laying of embedded parts, surface whitewashing, and component repairs. This helps managers to understand training difficulty for each activity and take advantage of it to ensure sufficient professional human resources for each activity. Figure 5 shows the exponential learning curves for all activities where Figs. 6, 7 and 8 illustrate closer looks at the learning curves for each modules. As observed, it takes a long time for the complex activities to stabilize. When further analyzing the asymptotic performance (A + B) for each activity (as shown in Table 9), it is found that those complex activities indicate negative extremes for lofting and surface whitewashing. These two activities difficultly achieve convergence because their learning constants are particularly small (< 0.01). The other two complex activities achieve high asymptotic performance, as observed, due to their learning constants rational for convergence. This specifies that it still takes longer to complete these two activities than that of the others even if workers are skilled.

Fig. 4
figure 4

Clustering result of exponential learning curve (with centroids)

Table 8 K-means grouping results
Fig. 5
figure 5

Exponential learning curves of all activities

Fig. 6
figure 6

Exponential learning curves of molding module

Fig. 7
figure 7

Exponential learning curves of filling module

Fig. 8
figure 8

Exponential learning curves of repair and storage module

Table 9 Asymptotic performance of all activities

To sum up, the exponential model provides the value of asymptotic performance serving as the production time that workers may achieve under maximum proficiency. This study therefore suggests the adoption of the exponential model to model the learning curves for production workers learning to make precast components. The model has a satisfactory degree of fit (R2 > 0.88), and the post-cross-validation results also show that the model has a highly accurate prediction capability (MAPE value < 10%). The other findings show that 4 difficult activities have been identified as lofting, laying of embedded parts, surface whitewashing, and component repairs. No matter how well trained workers carry out these four activities, their performance does not show much learning effect by the reason of various circumstances on-site, customized orders, and high-quality demands.

7 Conclusion

Based on literature and field visits, and the production data of 14 basic precast activities obtained from precast factories in Taiwan are studied and analyzed using the learning curve theory. Using a total of 373,077 datasets regarding 14 production activities sorted among a total of 4352 workers, the findings show that the exponential model is more suitable than the straight-line model for fitting historical data and predicting the production data of trainees during their initial training. This also indicates that the learning rate is not a fixed value during the learning process as previously considered in the construction industry. The second finding expresses that the learning curve model proposed in the study has a good fit to the historical data (R2 values all > 0.88), and the model is highly accurate in predicting the production data of trainees through their initial learning curves (MAPE values < 10%). The third finding reveals that, through using the K-means method, the 14 basic activities are divided into two groups due to the convergence of their learning curves respectively. As a result, the complex activities can be defined including lofting, laying of embedded parts, surface whitewashing, and component repairs. This helps managers to understand training difficulty for each activity and take advantage of it to ensure sufficient professional human resources for each activity. It is an important reference for the production planning and personnel training planning of precast factories to improve the productivity of the precast industry. The contributions by the study are substantial especially for practitioners.

The results of this study can serve as a well-developed and accurate foundation, and it is suggested that future studies make efforts in this direction. Follow-up studies focus on the threshold value for worker proficiency standards that is another important step for managerial practice. To achieve it, since asymptotic performance for those difficult activities (1−ec(x−1)) in the model is as close as 1, it implies that the workers’ training never goes effective. Therefore, it is suggested that future research can seek a threshold as standard proficiency for workers based on the level of difficulty or complexity toward each activity. Studies dealing with formulas grouped to a couple of general formulas for all activities are also recommended to possibly simplify and to increase practicability for the findings. Additionally, since productivity difference for an individual based on workers’ ages and nationality is possible, future work regarding productivity difference among workers’ ages and nationality is practicable to enhance the current work.