Keywords

Introduction

The process of buying, selling, transferring, or exchanging products, services, and/or information via computer networks is defined as E-Commerce [1]. Starting from the invention of electronic fund transfer, and the development of internet e-commerce applications flourished, and various business models have emerged [2]. A standard classification of E-commerce is by the nature of the transactions or the relationship among the participants. The major types can be listed as business-to-business, business-to-consumer, consumers-to-business. As mobile technologies emerge and adapt to social media increases, new e-commerce models such as mobile commerce and social-commerce have become popular. E-commerce can take different forms in a company, depending on the technology.

Customer segmentation is the practice of separating customers into different, meaningful, and identical subsets according to various attributes. For marketing efforts, it is generally used for gathering insights about the customers and provide customized actions according to their characteristics. To achieve this, a company needs to understand the customers’ segments and the properties and desires of the segments [3]. Companies apply various forms of segmentation, such as value-based, lifetime period, or demographics based. Also, different sources of data can be included in the process, and segments can be formed based on demographics and characteristic properties, and their needs and loyalty levels [4].

Segmentation can be used in three decision levels, strategic, managerial, and operational level [5]. At the strategic level, segmentation is used to assist the corporate strategic plans and highlight product positions according to diversified ranges of buyers. At the managerial level, it is used to deal with resource assignments and targets setting based on customer segments in order to involve the customer groups into the marketing activities and compose organizational processes considering them.

This study aims to summarize potential segmentation perspectives and apply fuzzy clustering with a real-world case study. Fuzzy sets proposed by Zadeh [34] provide tools and operations for handling imprecision and vagueness in real-world problems. Fuzzy sets have been used for many other real-world problems such as, company selection Estralla et al. [29] technology selection [6], multi-criteria decision making Yatsalo et al. [33], public transportation [7], economic analysis [8], and risk management [9]. The case study is from Modanisa.com, which is one of the top e-commerce websites which focus on selling textile products globally.

The remaining of the paper is as follows, in the second section, a brief literature review is provided, then in the third section, fuzzy clustering is introduced. Section four presents the real-world application steps and the results of the study. Finally, the conclusion is given in the last section.

Literature Review

The literature provides various approaches and methods for customer segmentation, such as demographic-based segmentation, value-based segmentation, propensity based segmentation, life period segmentation. Some of the procedures use heuristics and expert opinions for segmentation, some of them use basic arithmetic operations. The most popular analytical tool for segmentation is cluster analysis. It is a data mining technique that assigns data elements to unknown groups with considering high similarity [10]. The literature also provides various clustering techniques that can be grouped as partitional or hierarchical algorithms. The partitional methods focus on grouping the data elements into a predefined number of clusters. On the other hand, hierarchical methods take advantage of enabling the optimization of a criterion considering the compatibilities of objects within clusters or incompatibilities between clusters.

In recent years various perspectives on market segmentation have been investigated in the literature. Ahani et al. [11] focus on integrating social media data to segmentation efforts to propose a market segmentation model and choice prediction model for SPA hotel market. Liu et al. [12] develop a market segmentation approach by integrating preference analysis and multi-criteria decision-making methods. In the proposed approach, additive value function and pairwise comparison matrices are used to gather preference data. Then this data is used with the hierarchical clustering method to form the segments. Lim et al. [13] develop a market segmentation approach by using a Bayesian spatial profile regression. By this approach, both spatial autocorrelations present in warehouse rents and multicollinearity among the known rent price determinants are handled for segmentation. Diaz Perez and Bethencourt-Cejas [14] focus on the segmentation of tourists that visit a location and propose using the Chi-square Automatic Interaction Detection method, which is a multivariate analysis technique. The results reveal that the proposed approach is superior to the traditional methods used in the tourism domain. Qin et al. [15] focus on market segmentation problem for demand-side platforms. In their study, the researchers model the optimal market segmentation granularity as an optimization problem and form a mathematical programming model to find the optimum granularity. Huerta-Munoz et al. [16] addresses a customer segmentation problem from a beverage distribution firm. Under a given requirement, the researchers try to form customer clusters where similar customers are involved in. The researchers propose a mathematical attribute formulation and use a greedy heuristic that iteratively destroys and reconstructs customer segmentations. Hong [17] proposes a novel segmentation method by employing the Taguchi method. The Taguchi method is used as a tool to select the initial seeds. The author compares the results with the results of a Self-Organizing Map and shows that the proposed method is superior. Oztaysi and Cevik Onar and Oztaysi [18] propose a user segmentation approach by using twitter data. The author uses the data from social media and uses a fuzzy c-means algorithm to segment the users. Oner [19] propose a two steps segmentation approach,in the first step, they use hesitant fuzzy sets to segment the retail locations, and then in the second step, they use customer segmentation by adding the results of the first step into the analysis. Murray et al. [20] deal with a real-world customer segmentation problem where the existing descriptive variables are not suitable for defining the similarities between customers. The authors employ data mining techniques and identify behavior patterns in historical data that involve noise. The authors claim that the proposed results are suitable for strategic decision-making. In a recent study [21] propose a multi-criteria decision-making model for determining the approach for customer segmentation. The authors use Neutrosophic sets to select the most propose approach. Oztaysi et al. [31] propose a segmentation approach based on customer location data. Dogan et al. [22] focus on customer segmentation based on indoor customer paths. Oner and Oztaysi [19] focus on retailer clustering based on hesitant fuzzy sets. Oztaysi [23] use fuzzy c-medoids clustering for gender prediction.

Methodology

In the domain of segmentation, the clustering procedure is used to define subgroups of customers who have common properties. Each customer is defined by a data point in a multi-dimensional space where each dimension represents different properties. There are various techniques in the literature which convert input data into clusters. Chen et al. [24]. From one perspective, these techniques can be grouped as crisp and fuzzy. The main difference between the two groups is the definition of membership of an element to a cluster. Crisp clustering assigns a data element into a single specific cluster, while fuzzy clustering algorithms assign a data element to diversified clusters simultaneously with a membership degree [25]. In the literature, fuzzy clustering is used by using two points of view, either considering uncertain data or considering crisp data with uncertain clusters [26]. One of the most commonly adopted techniques for fuzzy clustering is the fuzzy c-means algorithm, in which data elements are assigned to a predefined number of clusters with different membership values [24].

The fuzzy c-means algorithm is based on similarity or dissimilarity measures, which are extracted from distance measurement such as Euclidean distance [27]. The main definition is the partition matrix, which represents the extracted clusters. A fuzzy partition matrix is defined from [28] with the conditions given in the following:

$$\begin{aligned} & \mu_{ik} \in \left[ {0,1} \right],1 \le i \le c,1 \le k \le N, \\ & \mathop \sum \limits_{i = 1}^{c} \mu_{ik} = 1,\quad \le k \le N, \\ & 0 < \mathop \sum \limits_{k = 1}^{N} \mu_{ik} < N,1 \le i \le c \\ \end{aligned}$$

Equation (1b) defines the sum of each cluster should be equal to 1, and the membership degree should be represented with an interval [11].

Fuzzy c-means clustering utilizes an objective function and focuses on minimizing the objective function to find the appropriate clusters.

$$J\left( {Z,U,V} \right) = \sum\limits_{i = 1}^{c} {\sum\limits_{j = 1}^{N} {} } (\mu_{ij} )^{m} \left\| {z_{j} - v_{i}^{2} } \right\|$$

where Z is the set of data elements to be clustered, U shows the fuzzy partition matrix, V is the vector which indicates the cluster centers. As seen from the given formula, N represents the number of observations, µ denotes the related membership value, c is the number of appeared clusters, and m is the parameter called fuzzifier that identifies the fuzziness degree of the final clusters, and fuzzifier parameter can get values greater than 1. When the fuzzifier parameter equals to one, then the clusters are formed with crisp clustering. Besides that, \(z_{j} - v_{i}\) denotes the distance between observation j and the center of cluster i.

The minimization of the given objective function comprised of a nonlinear optimization problem that can be calculated with a wide range of techniques such as iterative minimization and heuristic approaches such as simulated annealing or genetic algorithms. The steps of fuzzy c-means (FcM) clustering algorithm is defined as in the following [29]:

  1. 1.

    Initialize U = [uij] matrix, U(0)

  2. 2.

    At k-step: calculate the centers vectors V(k) = [vi] with U(k)

    $$v_{i} = \frac{{\mathop \sum \nolimits_{i = 1}^{N} \mu_{ij}^{m} .z_{j} }}{{\mathop \sum \nolimits_{i = 1}^{N} \mu_{ij}^{m} }}$$
  3. 3.

    Update U(k), U(k+1)

    $$\mu_{ij} = \frac{1}{{\mathop \sum \nolimits_{k = 1}^{c} \left( {\frac{{\left\| {z_{j} - v_{i} } \right\|}}{{\left\| {z_{j} - v_{k} } \right\|}}} \right)^{{\frac{2}{m - 1}}} }}$$

If \(\left\| {U^{{\left( {k + 1} \right)}} - U^{\left( k \right)} } \right\| < \delta\) then STOP; otherwise, go to step 2.

The number of clusters must be determined to reach meaningful clustering results. To this end, various clustering tests are accomplished with different cluster numbers. The results of the tests are compared by using the Xie-Beni index [30], and the clustering with the lowest value is selected for segmentation.

$$X(Z;U,V) = \frac{{\sum\nolimits_{i = 1}^{c} {\sum\nolimits_{j = 1}^{N} {\left( {\mu_{ij} } \right)^{m} \left\| {z_{j} - v_{i} } \right\|^{2} } } }}{{c \cdot \mathop {\min }\limits_{i \ne j} \left\| {v_{i} - v_{j} } \right\|^{2} }}$$

Results

Modanisa.com is one of the leading e-commerce companies located in Turkey, which focuses on selling textile products to more than 150 countries worldwide. The company sells 70.000 products of 650 different brands. Segmenting customers from different countries is very important for marketing activities. While there can be various segmentation perspectives, segmentation based on purchase variety is selected for this study. Purchase variety refers to the purchase activities of the customers based on several categories the purchased, the number of brands they preferred, and the number of different products they purchased.

The first step of the application is data preparation. The source of the data needed for the segmentation analysis is purchase transactions. The first step of the application is the preprocessing step. The purchase transactions include Product Id, customer Id, quantity data, so Category Id and Brand Id values are added to the transaction table by using SQL commands. Then by raw data that will be used for segmentation are formed. A sample data set is given in Table 1.

Table 1 Sample dataset used for segmentation

The next step is outlier detection and normalization. The values in each column are checked for z values, and the values larger than 3 and lower than −3 are excluded from the data set. The z scores of the remaining data are used for fuzzy clustering.

Fuzzy c-means clustering is used for segmentation. In order to find the exact number of clusters, various different cluster number parameters are selected, and the results are compared by using Xei-Beni index (Table 2).

Table 2 Clustering performances by using Xei-Beni index

The lower values of the Xie-Beni index refer to better clustering results; thus, for this study, c value is selected as five, which means in the study, five clusters are formed. The results can be summarized by the centroid table given in Table 3.

Table 3 Centroid table

The results present very clear and actionable results. As a summary, Cluster 1 is composed of customers with a very wide variety. They buy from various categories, brands, and products. Cluster 2 is loyal to some categories; they buy a variety of products from a very selected number of categories. Cluster 3 is composed of customers who are loyal to brands and products. Customers in Cluster 4 buy from different categories, but the variety of products does not change much. The final cluster, Cluster 5 is composed of customers with focused categories and brands.

Conclusion

In this study, a real-world case study is examined, and steps of segmentation by using the fuzzy c-means algorithm is given. After preprocessing, the data are normalized, and the outliers are eliminated. A set of c parameter is used for different runs of fuzzy c-means. Xie-Beni index is used to obtain the best parameter, and the result reveals that five clusters are best for segmentation.

For further studies, other perspectives of clustering, such as value-based clustering or intention based clustering, can be examined. Besides, other fuzzy and crisp clustering methods can be used with the same data, and the results can be compared with the outcome of this study.