Keywords

1 Introduction

There are several ways a business tries to attract new customers all the while trying to retain its current customers. A few decades back, these businesses practiced what was known as mass marketing—in which companies tried to sell the most popular product to all their customers or they practiced product differentiation—in this, companies offered a variety of products to a large market.

But, as technology rapidly evolved, companies moved to a newer approach—personalizing the products and targeting them to a specific market segment. Customer segmentation (or market segmentation as it is widely known) is the most approachable method for obtaining the above result. Understanding their customers and the market has allowed businesses to service each of their customers with care and personalization and proved the value of focusing on them. The oldest and most traditional form of segmentation is demographic segmentation, although with recent developments in big data, newer forms have emerged [1].

These approaches have also taken into consideration buyer attitudes, motivations, patterns of usage, and preferences [2]. The platform that utilizes segmentation the most is the e-commerce industry. Since the birth of e-commerce, sellers are constantly looking for ways to expand their reach while also maintaining a stable relationship with their frequent customers. And, in the age of booming social media, consumer data is as readily available as daily bread. Companies like Facebook, Google sell billions of dollars worth of consumer data to corporations which are then used as a basis of market segmentation. These e-commerce giants like Amazon, Flipkart target each segment with personalized promotional offers as well as personalized advertisements driving in clicks and eventually large amounts of profits. In this article, we shall look at the currently available approaches to customer segmentation and their merits and demerits [14].

2 Background

The applications of customer segmentation are irreplaceable. It has now become the heart of product marketing and strategy in any industry. It is an indispensable tool for organizations to understand the market, whom to target with what product, and how to optimize the marketing strategy [13].

With the increasing popularity of social media, consumer data is readily available for businesses to use and profile the customers, to understand them better, and to act accordingly.

Let us say a website has a new user. The user would be asked to register an account on the website, for which they would be offered a discount coupon. Instead of offering every new user the same coupon, the website could—with the help of customer segmentation—give out targeted discount coupons. This will increase the likelihood of that user buying a product rather than window shop. It is also useful for planning customer communications as it allows for personalized recommendations for each group (Fig. 1).

Fig. 1
figure 1

Market segmentation

3 Customer Segmentation

Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters) [7] (Fig. 2).

Fig. 2
figure 2

Types of customer segmentation

Domains: Customer segmentation can be applied to many domains. These domains are defined by the type of data they are using for segmentation. The most prominent of these are listed below.

3.1 Demographic Segmentation

The oldest and the most traditional form of market segmentation, demographic customer segmentation, focuses on the structure of the population based on everyone’s current living status. Age, gender, income, education, marital status, etc. are generally considered as demographic data. Segmentation performed on such data often yields basic groups which are the go-to for target marketing—if any other type of data is not available.

Most frequent segments formed after demographic segmentation include:

  • Students—this group involves unmarried, young adults/adolescents who have little or no income.

  • Parents—this group usually has married, middle-aged adults who have kids (dependents) and have a stable income.

  • Women restarting their careers—these middle-aged women tend to have a low academic background and low income.

  • Senior citizens—this group includes old people who either live on a monthly pension or rely on their children, and they tend to live alone and so on.

These segments have different spending habits which can be directly related to their living scenario (Fig. 3).

Fig. 3
figure 3

Demographic customer segmentation

3.2 Psychographic Segmentation

These are subjective attributes about a person that is not as easily available as other forms of data.

In this, data cannot be directly fed into an algorithm and is expected to yield results. The data and results must be interpreted by a psychological professional. Just as demographic segmentation emphasizes a person’s current living situation, psychographic segmentation utilizes a person’s real-life behavior which pertains to things like personality traits, values, attitudes, interests, lifestyles, subconscious, and conscious beliefs, motivations [2], etc. (Fig. 4).

Fig. 4
figure 4

Psychographic customer segmentation

3.3 Behavioral Segmentation

While demographic and psychographic segmentation focuses on who a customer is, behavioral segmentation focuses on how the customer acts [13]. The most widely used and most efficient form of segmentation—behavioral—relies on how the customer acts toward the company’s services or services related to it. This involves gathering data, performing analysis, and targeting the customers in real time. Therefore, this type requires higher computational power than others. Purchasing habits, spending habits, user status, and brand interactions are the main attributes companies look toward for behavioral analysis (Fig. 5).

Fig. 5
figure 5

Behavioral customer segmentation

3.4 Geographic Segmentation

The simplest form of segmentation involves gathering the location of a user and segmenting them based on it. Geographic segmentation utilizes a customer’s ZIP code, city, country, a radius around a certain location, climate, urban, or rural [14] attributes for segmentation.

Geographic segmentation is an effective methodology used by organizations with large national or international markets to better understand the location-based attributes that comprise a specific target market [11] (Fig. 6).

Fig. 6
figure 6

Geographic customer segmentation

4 Pros and Cons

4.1 Demographic Segmentation

Pros:

  • Demographic variables are quite easier to collect and measure when compared with other segmentation techniques.

  • Targeting is usually more straightforward when using demographics as a metric—for example, you can target consumer groups, such as women or men between the ages of 40 and 50.

  • Consumer profiles are more comprehensible across the board, making it easier to develop strategy among various departments (sales, customer service).

Cons:

  • The model offers limited insight for marketers. Similar demographics among customers do not always mean that they all have the same needs.

  • Because customers can have varying needs, a ‘one-size-fits-all’ approach to consumers based solely on broad demographics can make your marketing message ineffective.

  • Skewed or problematic demographic data within a given region can produce unreliable assumptions, which can reduce the accuracy of your marketing methods.

4.2 Behavioral Segmentation

Pros:

  • Marketers can build targeted consumer segments based on their responsiveness to certain product categories, promotion types, or path-to-purchase preferences.

  • Monitoring and understanding the behavior of consumers online has become easier due to advances in data collection and tracking technologies.

Cons:

  • While consumer behavior can be tracked, it is not always easy to pinpoint the motivations behind those behaviors with segmentation models, because they can vary greatly from person to person.

  • Behavioral segmentation is often based on complex data constructs that are not always easy to understand without the help of a large team of data scientists and marketers.

4.3 Psychographic Segmentation

Pros:

  • Marketers can get some insight into customer motivations.

  • Psychographic segmentation can help brands in executing more emotive marketing to highly responsive segments.

Cons:

  • Psychographic surveys which are self-reported can be inaccurate.

  • Although marketers can use predictive modeling to create statistical projections, the accuracy of all these predictions firmly depends on the quality of the data used.

  • Marketers need clear rules about how to interpret psychographic data to ensure a consistent approach among the individuals or departments that engage in customer segmentation analysis.

4.4 Geographic Segmentation

Pros:

  • People in different communities have different needs. Something useful for someone in a more rural area, like gardening supplies, may not have any appeal to city dwellers. Separating consumers by where they live can ensure that marketers are only targeting those that may want or need your products or services.

  • Marketers can adjust advertised pricing based on the cost of living of the customers that they are targeting.

  • You can tailor products based on the local preferences of your customers. For example, certain clothes are more popular in some Canadian regions than others.

  • You can use this segmentation to target customers in new areas where you want to grow your business.

Cons:

  • This type of audience segmentation makes the assumption that people living in the vicinity of each other have got similar needs, which is not always true. This is true if you are targeting a wider geographic area.

5 Methodology

Although customer segmentation can be done through several methods, the structure is similar for all.

Prerequisites: Segmentation involves processing data; though any system can run the algorithms, larger data requires higher computational power.

5.1 Data Collection

There are many ways to collect consumer data, but since each domain requires a different type of data, data collection methods vary between them.

Demographic data can either be directly asked from the customer through surveys or be bought from data resources such as social media, census. Psychographic data is the least readily available data and contains sensitive personal data; therefore, it must be directly obtained from the users. Behavioral data is obtained through analysis of users’ actions—be it clicks on advertisements or purchase habits, etc. Lastly, obtaining geographic data is a one-step process since it includes only the location.

5.2 Data Preprocessing

The raw data collected needs to be processed before analysis and segmentation. This involves removing outliers, null values, duplicates, and corrupted data. Principal component analysis also needs to be performed to compute the principal component and change the dimensionality of the data by selecting the first few principal components and ignoring the rest.

This ensures that machine learning algorithms are not overburdened and makes data visualization easier.

5.3 Data Analysis

In statistics, exploratory data analysis is an approach to analyzing datasets to summarize their main characteristics, often using statistical graphics and other data visualization methods [6]. EDA is a valuable tool that is used to analyze the data at hand and develop a model based on which the machine learning algorithm is performed.

Mainly performed in two ways: univariate analysis—for data consisting of a single variable—and multivariate analysis (aka relationship analysis)—for data consisting of more than one variable.

Data visualization is a major step involved in EDA, different forms of visualization of data include—box plots and histograms (for univariate analysis)—scatterplot, and bar charts (for multivariate analysis) (Fig. 7).

Fig. 7
figure 7

Exploratory data analysis

5.4 Segmentation

This is the main step in the whole process. This involves performing machine learning algorithms to obtain the segments and various fields attributed to it. There are several ways to perform segmentation (some do not involve machine learning), but to achieve the highest efficiency, some are more suitable to a type of segmentation than others.

But, before segmentation is performed, we need to find the optimal number of clusters or segments. This can be done using hyperparameter tuning and elbow method.

Parameters that define the model architecture are referred to as hyperparameters, and thus, this process of searching for the ideal model architecture is referred to as hyperparameter tuning [8]. When it comes to clustering algorithms, the hyperparameter in question is the number of clusters that need to be made. This is performed with the help of the elbow method.

These methods are discussed in the next section (Fig. 8).

Fig. 8
figure 8

Customer segmentation process

6 Methods Available

There are many ways to perform customer segmentation; in this article, we will mainly focus on how machine learning algorithms compare in terms of usability and efficiency for segmentation.

6.1 Rule-Based

Rule-based segmentation is based on criteria such as Boolean logic or thresholds and is often two-dimensional; this is traditionally the easiest type of targeting for many professionals, as one can do it by filtering in Excel, e.g., marketers have traditionally segmented customers based on heuristics such as the industry, company size (in B2B) or age, income, etc. in B2C [3]. This follows an ‘if A then B’ type of rules and forms an algorithm (Fig. 9).

Fig. 9
figure 9

Rule-based segmentation

6.2 Supervised Clustering with Decision Tree

This method uses a specific target or dependent variable, and the target would predict differences in independent variables (input). Data utilized in this method is previous purchase patterns and customer demographic. The algorithm used is the decision tree with the target on their nodes. According to Baer, although this method connects the target with the other customer attributes, it shows only one aspect of customer behavior [4]. This is used for demographic and behavioral segmentation [13, 14].

6.3 k-means

First, for each centroid, the algorithm finds the nearest points (in terms of distance that is usually computed as Euclidean distance) to that centroid and assigns them to its category. Second, for each category (represented by one centroid), the algorithm computes the average of all the points which has been attributed to that class. The output of this computation will be the new centroid for that class [5]. To find the optimal number of clusters, we use the elbow method (hyperparameter tuning). The elbow method is a heuristic used in determining the number of clusters in a dataset. The method consists of plotting the explained variation as a function of the number of clusters and picking the elbow of the curve as the number of clusters to use [9]. This algorithm is the best algorithm for behavioral segmentation (Figs. 10 and 11).

Fig. 10
figure 10

Elbow method

Fig. 11
figure 11

k-means clustering

6.4 k-prototype

K-prototype is a clustering method based on partitioning. Its algorithm is an improved form of the k-means and k-mode clustering algorithm to handle clustering with mixed data types [10]. It is used for datasets that have both numerical and categorical data types. For numerical data types, it uses the k-means algorithm and calculates the average after each iteration. For categorical data types, it uses the k-mode algorithm which calculates Euclidean distance from each cluster center and calculates modes after each iteration. To find the optimal number of clusters, we use the elbow method.

6.5 k-medoid (PAM)

A problem with the k-means and k-means++ clustering is that the final centroids are not interpretable. The idea of k-medoids clustering is to make the final centroids of the actual data points. This result makes the centroids interpretable [12].

The working is the same as the k-means, and the only difference comes in the updating centroids step in between iterations. Instead of computing the mean of points in a cluster (like in k-means), we swap the previous centroid with all other (m−1) (if there are m-point in a cluster) points from the cluster and finalize the point as new centroid that has a minimum loss [12] (Table 1).

Table 1 Methods for customer segmentation

7 Conclusion

We discussed several ways a company profits from customer segmentation, its methodology, and various ways it can be performed. Even though there are many types of and any ways to perform customer segmentation, the purpose of each remains the same to personalize a business service and experience for its customers. For the last two decades, market segmentation has taken over every big and small company’s marketing scheme. Every corporate has a data analyst in its marketing department. It is left to be seen what the future holds for artificial intelligence incorporated with business.