1 Introduction

Logistics is the flow of raw materials or other goods to end customers (Waters 2003; Yildiz et al. 2010). Delivering items to the correct place at an appropriate time at a reasonable cost is an essential success factor in the logistics industry (Liu and Lyons 2011). Concern arises when any stage of this process causes customer dissatisfaction, which potentially leads to loss of business. Logistics companies currently engage in activities that transcend the traditional goods delivery model, such as service delivery, rendering, and integration (Esper et al. 2003; Renko and Ficko 2010). For example, UPS has expanded its business, serving as a solution provider that offers service integration in e-commerce, accounts receivable, insurance, and financing. These activities relate to UPS’s core business, and such expansion into other areas develops customer dependency and loyalty, which eventually translate into sustainable business.

A long-term relationship with customers is crucial in the logistics industry because numerous aspects of service encounters (such as cost, speed of delivery, and courtesy) can easily be imitated by competitors. Customer churn (or customer attrition) is key for gauging success in the logistics industry. Van den Poel and Lariviere (2004) showed that the cost of a high churn rate (at 25 % compared with 7 %) for a UK retail bank was nearly 220,000 euros after 25 years. Although this figure varies among industries, a highly competitive market, such as the logistics industry, is unlikely to have a low customer churn rate. Therefore, researchers recommend a defensive marketing strategy (Fornell and Wernerfelt 1987; Ahn et al. 2006) that prevents customers from switching service providers. Customer churn prediction is a popular element for retention and loyalty reasons (De Bock and Van den Poel 2012; Glady et al. 2009; Huang et al. 2012; Hung et al. 2006; Kisioglu and Topcu 2011; Li et al. 2011b; Neslin et al. 2006).

The purpose of customer churn management is to minimize losses caused by customer churn and to retain high-value customers, thereby maximizing profit. The 80/20 rule suggests that 80 % of revenue is provided by 20 % of customers (Xu and Walton 2005). Focusing on retaining high-value customers is a reasonable strategy. Consequently, studying the profile of lost customers and predicting customer churn is crucial for survival in highly competitive industries. Customer churn prediction has been applied in numerous fields, particularly in the telecommunication and financial industries, in which target audiences are primarily individuals (Huang et al. 2012; Verbeke et al. 2012; Kisioglu and Topcu 2011; Nie et al. 2011; Tsai and Lu 2009). Most high-value customers in the logistics industry are businesses that are bound by regulations, corporate policy, and accumulated business practice. Because of this difference, switching cost is considerably lower for individual customers than for business customers.

The contribution of this study is twofold. First, we expanded the body of knowledge in customer value analysis, specifically, the length, recency, frequency, monetary, and profit (LRFMP) model, to study the effect of such a model on business customers in the logistics industry (Li et al. 2011a; Verbeke et al. 2012). The results indicated that this model is generalizable in the logistics industry business context. Second, we investigated treating the effect of all five LRFMP variables concurrently. If some LRFMP variables overshadowed other variables, then further insight may be attained regarding how variables become predictors when variability of primary variables is limited or controlled.

2 Related work

2.1 Customer value analysis

Customer value analysis involves identifying patterns and group associations by using a high number of customer data. By conducting customer value analysis, businesses can identify valued customers who have contributed to business revenue the most for a considerable period of time (Cheng and Chen 2009). Among techniques for assessing customer value, the recency, frequency, and monetary (RFM) model has received considerable attention in recent literature (Chang et al. 2010; Chen et al. 2005; Cheng and Chen 2009; Hosseini et al. 2010; Liang 2010; Liu and Shih 2005; Yeh et al. 2009). The RFM technique is used widely in customer behavior analysis to study customer value and market segmentation.

Hughes (1994, 2005) studied numerous historical transaction data and determined that the following three key behavioral indicators are relevant to customer value analysis.

  1. 1.

    Recency refers to the period of time between the previous purchase date and a set date (determined before analysis begins). The likelihood of repurchase is high when the duration between these two dates is short.

  2. 2.

    Frequency refers to the total number of purchases within a particular period of time. It is used to measure the interaction frequency between a customer and the business. A high level of interaction indicates customer loyalty.

  3. 3.

    Monetary refers to the total dollar amount of a customer’s purchases within a particular period of time. It can be used to measure the contribution of a customer to revenue. The greater the amount spent on purchases is, the more the customer contributes to revenue.

The importance of these three indicators varies among industries. Identifying the relative weight of these indicators for the specific domain of business is useful. Studies have recently begun expanding the RFM model by including additional variables. For example, Wei et al. (2012) suggested that the longevity of a relationship with a customer affects customer loyalty. They suggested an LRFM model including the longevity of a relationship, a CRFM model in which the RFM model is applied to various product categories, and a CLVRFM model based on traditional RFM analysis.

Chang and Tsai (2011) added product category groups to develop the GRFM model. Yeh et al. (2009) developed a RFMTC model by including the first purchase time (T) and customer churn probability (C). These extensions to the traditional RFM model have generated mixed results.

2.2 Customer churn

To retain customers, companies engage in activities to satisfy customers and reduce customer defection. Customer retention is the ultimate goal of customer relationship management (Payne and Frow 2005; Reinartz et al. 2004). In increasingly competitive business environments, acquiring new customers is increasingly expensive and difficult. Retaining existing customers is a popular strategy that is comparatively less costly than attracting new customers (Reinartz and Kumar 2003). Several studies have shown that acquiring a new customer is usually five to six times more expensive than retaining a customer (Athanassopoulos 2000; Slater and Narver 2000).

The scenario in which customers cease transacting with a company is called customer churn or customer attrition (Neslin et al. 2006; Yu et al. 2011). Customers who end a relationship with a company and develop a new relationship with a different company are called “churners” (Kisioglu and Topcu 2011).

Customer churn causes revenue loss and other negative effects on corporate operations (Saradhi and Palshikar 2011). Therefore, establishing an accurate customer churn prediction model for identifying key factors that cause churn is crucial. Several key recent studies on customer churn are summarized in Table 1.

Table 1 Recent studies on customer churn prediction

According to Table 1, most previous studies on customer churn prediction have focused on the banking, retail, and telecommunication industries. According to a review of relevant literature, no study has investigated customer churn prediction in the logistics industry. In addition, variables that affect churn vary substantially among industries. For example, average minutes of usage, age, and place of residence, which are used in the telecommunication industry, are not relevant to logistics (Kisioglu and Topcu 2011). Similarly, debt ratio, home ownership, and credit risk, which are applied in the banking industry, are not relevant to logistics (Burez and Van den Poel 2008). One method for gaining insight into customer churn in an industry on which little empirical guidance has been provided in the literature is to apply a common set of classification techniques derived from previous churn studies that used a set of variables relevant to the industry. The immediate benefit of using common techniques is that it enables researchers to provide new evidence regarding the generalizability of existing methods.

Table 1 shows a common set of techniques used in recent churn studies. This set of techniques includes logistic regression (LGR), decision tree analysis (C4.5), artificial neural network (multilayer perceptron, MLP), and support vector machine (SVM). These four techniques were applied in this study.

3 Methods

3.1 Data collection and preprocessing

The company investigated in this study was founded in 1938 and is one of the largest logistics companies in Taiwan. It has approximately 2,500 transportation vehicles and over 100,000 business customers. In 2010, its total revenue was over TWD 8 billion (roughly USD 271 million). The original dataset comprised data on 106,747 business customers who engaged in over 210 million transactions between March, 2010 and August, 2012.

Before customer value analysis and customer churn prediction were conducted, a series of data preprocessing tasks, including merging customer, shipping, and delivery tables; removing records with missing values; deleting duplicate records; and aggregating records for each business customer, was performed. Moreover, recently acquired customers were excluded from the analysis because they had not been with the company long enough to be considered retained customers. These recent customers were defined as those for whom the transaction length (i.e., the number of days between the first and the final transaction) was shorter than 30 days. After recently acquired customers had been excluded, a total of 69,170 business customers remained in the final data set.

Before a customer churn prediction model was developed, the groups of active and lost customers were defined. In the case company, lost business customers were defined as those who engaged in no transactions in the past month. A customer service representative usually contacts lost customers to determine why they have not engaged in transactions. The case company classifies lost customers into four categories: those who changed location, those who went bankrupt, those who switched to a competitor, and those in debt. After consulting with experienced account managers, we considered only customers who switched to a competitor. The case company cannot address the attrition of customers in the other three categories. Among the 69,170 business customers, the numbers of active and lost customers were 67,849 and 1,321, respectively.

After consulting with the case company, we determined that a total of 18 variables (see Table 2) in three general categories were relevant to customer churn (customer profile, customer transaction behavior, and quality of delivery service). The descriptive statistics for both active and lost customers are shown in Table 3.

Table 2 Definition of variables
Table 3 Descriptive statistics of the complete customer data set

3.2 Experimental design

3.2.1 Customer value analysis

Figure 1 shows the research process, which can be divided into two main steps: customer value analysis and customer churn prediction. First, the differences in the scales of the LRFMP variables (see Table 2 for the scales) were standardized before further processing. The standardization procedure follows Hughes’ (1994) equal depth approach (or called customer quintile method by Miglautsch 2000). Specifically, the customers were sorted in ascending order according to the variable CsnRcn (recency) and in descending order according to the other four variables. For each of the LRFMP variables, the customers were partitioned into equal quintiles. These quintiles were assigned numbers from 5 (highest customer value) to 1 (lowest customer value). This approach to standardizing scales has been the primary approach applied in other LRFMP studies (Cheng and Chen 2009; McCarty and Hastak 2007; Coussement et al. 2014).

Fig. 1
figure 1

Research process

To determine the weighting of each LRFMP variable, this study employed the analytic hierarchy process (AHP). Five senior sales managers at the case company were invited to evaluate the relative importance of the LRFMP variables. An example of the AHP pair-comparison matrix for the LRFMP model is shown in Table 4. According to the AHP assessment, the relative LRFMP weights were 0.077, 0.225, 0.43, 0.241, and 0.026, indicating that frequency carries the most weight, followed by recency, monetary, length, and profit.

Table 4 An example of AHP pair-comparison matrix for the LRFMP model

After the standardized LRFMP score for each customer was collected and the weight was determined for each of the LRFMP variables by using the AHP, a composite score for each customer was calculated as follows:

Assume that the standardized LRFMP scores of customer u i (u i  ∊ U) are NL i , NR i , NF i , NM i , and NP i , respectively. The LRFMP composite score of customer u i , denoted as Score ui , is defined in the following equation:

$$LRFMP(u_{i} ) = 0.077 \times NL_{i} + 0.225 \times NR_{i} + 0.43 \times NF_{i} + 0.241 \times NM_{i} + 0.026 \times NP_{i} .$$
(1)

The coefficients of the variables in this equation were determined using the aforementioned AHP study. The composite scores were subjected to a median split to study patterns among the customers. The top 50 % of customers were labeled “valuable customers,” whereas the remaining customers were labeled “less valuable customers.” Median split is a common approach for studying patterns, behavioral difference, intention, and satisfaction in churn-related studies. Examples include word of mouth in customer lifetime value (Lee et al. 2006), and customer loyalty (Zhang et al. 2010).

3.2.2 Customer churn prediction

Because valuable customers contribute more to the profitability of the firm than do less valuable customers, we concentrated on these customers and their churn rate. We used Weka 3.7.7, a widely used open-source data mining software (www.cs.waikato.ac.nz/ml/weka), to study the performance of classification techniques, namely J48 (C4.5 in Weka), MultilayerPerceptron (MLP in Weka), SMO (SVM in Weka), and SimpleLogistic (LGR in Weka) (Linoff and Berry 2011; Tan et al. 2006).

The churn rate of valuable customers was approximately 1 %. Such a low customer churn rate can cause a class imbalance problem, wherein the majority class or group influences the prediction more than the minor class or group because of unequal representation. Previous studies have suggested that the sample size can be adjusted to improve the prediction performance of supervised learning (Tan et al. 2006). A resampling procedure was conducted to ensure that the lost/active ratio remained balanced. Specifically, we generated the dataset by undersampling the majority instances and retaining the complete set of minority instances so that the sample sizes of the two groups were approximately equal. In addition, useful instances in the majority class can be lost if the resampling procedure is applied only once. Therefore, a random resampling technique was applied 30 times to generate multiple datasets; for each generated dataset, tenfold cross-validation was applied to evaluate sample quality.

3.3 Parameter settings

According to the industry-wide data mining process model, the Cross Industry Standard Process for Data Mining, parameter calibration for the modeling phase of data mining involves testing the model by using a range of parameter values to optimize the model. Details on the parameter settings are shown in Table 5. Regarding C4.5, a decision tree stops growing if the number of instances in a node does not satisfy a user-specified threshold (i.e., 15, 20, or 25). Regarding MLP, we chose a single-hidden-layer network topology with a sigmoid function. Four other crucial parameters were adjusted in the experiments, including the number of nodes in the hidden layer, learning rate, momentum factor, and maximal number of epochs. In SVM, both the PolyKernel and RBFKernel were selected as the kernel function in all tests.

Table 5 Parameter tuning in different classification techniques

3.4 Performance measure

The performance of the classification models can be evaluated using several widely accepted indicators (accuracy, precision, recall, and F1) based on the confusion matrix. The confusion matrix in Table 6 can be used to calculate these four metrics as follows:

$$Accuracy = \frac{TP + TN}{TP + FP + FN + TN}$$
(2)
$$Precision = \frac{TP}{TP + FP}$$
(3)
$$Recall = \frac{TP}{TP + FN}.$$
(4)
$$F1 = \frac{2 \times Precision \cdot Recall}{Precision + Recall}.$$
(5)
Table 6 Confusion matrix

4 Results and discussion

4.1 Results of customer value analysis

The purpose of customer value analysis is to identify valuable customers that potentially contribute to the profitability of the company. As previously discussed, valued customers are the top 50 % of customers (a total of 34,675) based on the composite scores (Score ui ) calculated using the LRFMP variables. The churn rate for this group was approximately 1.1 %. The customers in this 1.1 % were lost customers, whereas the customers who remained with the company were active customers. Although a low churn rate in this group is a positive sign for a healthy future, it caused the problem of unequal representation of the two groups in data analysis. A resampling procedure was conducted to ensure that the sizes of the two groups were approximately equal. Table 7 shows descriptive statistics on LRFMP variables for the two customer groups. Table 8 shows how the two groups differed in other related variables.

Table 7 Descriptive statistics of the complete customer data set in customer value analysis
Table 8 Descriptive statistics of the valuable customer data set

4.2 Results of customer churn prediction

The data sets produced in the previous section were used to measure the performance of four classifiers, namely C4.5, MLP, SVM, and LGR. The results and accuracy measures are presented in Table 9. Accuracy, precision, recall, and the F1 measure were used to guide the direction of model tuning. To facilitate explanation, we report only the average values based on 30 generated datasets.

Table 9 Experimental results for all classifiers

For each classification algorithm, we report only the optimal result (bold-faced numbers in Table 9) among various parameter settings. The highest accuracy rates for C4.5, MLP, SVM, and LGR were 93.1, 90.9, 88.4, and 87.6 %, respectively. The highest F1 measures of C4.5, MLP, SVM, and LGR were 93.3, 91.1, 87.2, and 86.8 %, respectively. A one-tailed paired t test was conducted to compare the prediction accuracy of the classifiers. The results indicated that C4.5 significantly outperformed the other three algorithms (C4.5 vs. MLP: t = 22.4057, p < 0.001; C4.5 vs. SVM: t = 12.4619, p < 0.001; C4.5 vs. LGR: t = 34.2875, p < 0.001). In addition, SVM and MLP significantly outperformed LGR (MLP vs. LGR: t = 18.7484, p < 0.001; SVM vs. LGR: t = 2.144, p < 0.05). Similar results were obtained regarding the F1 measure. C4.5 significantly outperformed the other three algorithms (C4.5 vs. MLP: t = 18.7992, p < 0.001; C4.5 vs. SVM: t = 62.5325, p < 0.001; C4.5 vs. LGR: t = 36.2557, p < 0.001). Furthermore, SVM and MLP significantly outperformed LGR (MLP vs. LGR: t = 18.6723, p < 0.001; SVM vs. LGR: t = 3.063, p < 0.01). These results indicated that C4.5 was the optimal algorithm according to all of the reported performance measures. LGR performed the least favorably. In addition, we observed that the sensitivity for parameter variation was low, indicating that all techniques used in the analysis are reliable classification models.

A customer churn prediction model can be used as an early warning tool for businesses, and extracting critical factors related to customer churn can provide additional useful knowledge that supports decision making. We conducted an additional experiment to identify the top three variables that exerted the most substantial effect on whether a customer is lost or active. The information gain ratio (or gain ratio) was used to determine the variables most relevant to the model. The results indicated that CsnRcn (days since the most recent transaction) was the most influential variable, followed by CsnLng (longevity of the relationship) and CsnMnt (average revenue gained from the customer). In other words, customers for whom the number of days since the previous transaction (CsnRcn) is higher, those who have maintained a shorter relationship with the company (CsnLng), and those who have provided lower average revenue to the case company (CsnMnt) have a higher probability of being a churner. This result was expected because most business customers have a constant need for the delivery of products or services. Unless a replacement delivery method is implemented (e.g., mail replaced by e-mail or electronic communications), the need for delivery does not fluctuate drastically in a short period of time. Therefore, a high value in either recency (CsnRcn) or longevity (CsnLng) is a possible indication of customer churn. This finding is consistent with those of previous studies, such as Chen et al. (2008) and Li et al. (2011a). The association of the longevity of the relationship (CsnLng) and average revenue provided to the case company (CsnMnt) with the target variables in the aforementioned experiments is consistent with the results reported by Buckinx and Van den Poel (2005). The consistency of these customer churn variables with those used in existing studies has numerous implications because the respondents were business users of delivery services. Such a combination of context and user profiles has not received sufficient attention in relevant literature.

4.3 Profile analysis

This section presents the results of profile analysis, of which the goal is to determine patterns between lost and active customers based on the similar magnitudes of the top three variables (CsnRcn, CsnLng, and CsnMnt) reported in the previous section. The data were first sorted according to CsnRcn (ascending order), CsnLng (descending order), and then CsnMnt (descending order). Because of this sorting procedure, the variability of records among these three variables in each quartile was limited. The top 50 % of this sorted data consisted of only 5 churners and 17,332 active customers, representing a churn rate of 0.03 % [5/(5 + 17,332)]; however, the bottom 50 % consisted of 404 churners and 16,934 active customers, representing a churn rate of 2.33 %. As these three variables were the top three predictors, churners were located mainly in the bottom portion of the sorted data set. Quartile 3 contained only 6 churners and 8,663 active customers. Therefore, most churners were located in Quartile 4, which contained 398 churners and 8,271 active customers, representing a churn rate of 4.59 %. This high churn rate is 153 times higher than that of the top 50 % of the sorted data set.

The next step involved explaining the top predictors of churn in the fourth quartile, wherein the variability of CsnRcn, CsnLng, and CsnMnt was limited because of the aforementioned sorting procedure. According to the gain ratio and Chi square statistic, the top predictors of churn in Quartile 4 were CsnRcn (recency), CsnLng (length), CsnStpItvMax, CsnFrq (frequency), and CsnStpItvAvg (as shown in Table 10). CsnMnt (monetary) did not qualify as a top predictor, whereas CsnFrq (frequency) did, indicating that the effect of frequency may have been shadowed until the variability of other variables was limited or controlled in the fourth quartile. In all analyses, profit was never influential. This insight improved our understanding of the relationship between the LRFMP variables and churn.

Table 10 Variable contribution to churn when variability of recency, length and CsnStpItvMax are limited

Regarding other profile variables, both customer region (CsrRgn) and distance to the nearest branch of the case company (CsrNrRng) exerted little effect on whether the customer was lost to competition. One possible reason is that the rural–urban disparity in Taiwan is low and the case company provides a free package pickup service for business customers. The statistics in Table 10 indicate that none of the quality variables were among the top predictors. Further analysis revealed that the case company provides high-quality delivery service according to several metrics. For example, the delivery failure rate was 0.026 %, the redelivery rate was 0.014 %, the rate of deliveries completed in 1 day was 94.1 %, and the average number of days for delivery was 1.06. These numbers indicate that the company provided excellent delivery service. In other words, delivery quality was not likely a key cause of customer churn.

Table 11 lists several crucial decision rules generated using C4.5. We report only some of the crucial rules for customer churn prediction because of space limitations. The first number at the end of each rule represents the number of customers satisfying the antecedent part of the rule, whereas the second number represents the number of customers incorrectly classified according to the rule. Decision tree induction can be used to characterize a group of churn customers, and the results can be applied in customer relationship management.

Table 11 The decision rules extracted by the C4.5

5 Conclusion

This study examined the variables that contribute to the customer churn of valuable customers at a logistics company. Valuable customers were identified based on their composite scores, which were calculated using the LRFMP variables. Because the overall ratio of lost customers to active customers was low, a resampling procedure was adopted to balance the representation of the two customer groups. Various prediction models were then used to compare the prediction performance. The effects of the LRFMP variables, as well as other profile variables, on customer churn were examined.

The decision tree model outperformed the other models (MLP, SVM, and LGR) in predicting customer churn. The top three most influential predictors for customer churn were recency (CsnRcn), length (CsnLng), and monetary (CsnMnt), whereas the other two LRFMP variables (i.e., frequency and profit) were not significant predictors; these results are not consistent with those of several other customer churn studies. This points to an issue that relates to whether the five LRFMP variables equally influence churn.

This study addressed how and when these other LRFMP variables become useful or influential. We limited the variability of the top three churn variables (i.e., recency, length, and monetary) by sorting the records based on the three variables and focusing on the quartile that contained the highest number of lost customers. The results indicated that recency (CsnRcn), length (CsnLng), the maximal time interval between two adjacent transactions (CsnStpItvMax), frequency (CsnFrq), and the average time interval between two adjacent transactions (CsnStpItvAvg) became predictors after the variability of the top three variables was limited. This result indicates that the effects of the five variables in the LRFMP model do not equally affect churn. Even in the quartile containing the customers that are most likely to churn (high recency, short length, and short duration between transactions), many customers still decided to remain with the company. Our identification of these “secondary” predictors, frequency, length, and time between adjacent transitions, provides insight into the intention to churn.

The primary contributions of this study are described as follows. First, our sample of business customers in the logistics industry represents a population that has not been previously explored. Many previous studies on churn have focused on individual users who usually have a lower switching cost than do business customers (Miguéis et al. 2012; Huang et al. 2012). Although insight into the logistics industry and churn of business customers is limited, studying the effect of LRFMP variables in this context provides evidence regarding the generalizability of the LRFMP and traditional RFM models. Second, variables related to churn vary greatly among industries. As indicated in Sect. 2, numerous variables used in other industries are either not relevant or not applicable (e.g., average minutes of usage, place of residence, debt ratio, home ownership, and credit risk). This study clarifies how churn can be predicted more accurately in the logistics industry.

Third, not all of the five LRFMP variables exerted an equal effect on churn. Previous studies (Wei et al. 2012) have laid the foundation for these five variables, whereas this study expanded on this foundation by showing that only recency, longevity, and monetary are the most influential predictors of the five variables. Frequency was not considered a crucial predictor until the variability of the aforementioned three variables was limited. These results revealed that frequency, monetary, and length are the variables that are the most related to churn for customers who are the most likely to leave the company. The profit variable of the LRFMP model never became a significant predictor in our analyses.

Several directions can be taken in the future. First, the data were provided by a single case company. Data from numerous companies can be collected and compared to enhance the generalizability of the LRFMP model further. Second, the LRFMP model is a popular customer value analysis model, but it is not the only model. As shown in the current study, not all five of the LRFMP variables exerted a notable influence on churn. Future studies can consider multiple customer value analysis models. Third, as mentioned in the Introduction, logistics companies have begun including other services, such as banking and accounts receivable, in their core business. Such an expansion increases customer “lock-in,” a common business practice used to retain customers. Future studies can report the effect of customer lock-in on customer churn.