Keywords

1 Introduction

One of the major challenges faced by both online and offline retailers is the problem of assortment optimization, in which they choose a specific group of products to offer to customers such that their expected revenue can be maximized. The revenue generated by an assortment of products is usually determined by two factors: the revenue generated by selling each individual product, and the purchasing behavior of consumers. The latter is often captured by discrete choice models such as a multinomial logit (MNL) model [26] and a nested logit (NL) model [7]. Unlike previous studies that assume fixed choice models, we take into account the fact that customer purchasing behavior may be influenced by sophisticated selling practices such as advertising. Specifically, advertising is an important and effective strategy for establishing brand recognition and communicating the value of a product effectively to the public. Given the importance of advertising, determining how to allocate the promotional budget over products and time is a critical aspect of retailers’ decision making [13, 19, 22], hence, it is important for a retailer to consider the impact of advertising on their product choices to increase revenue. To maximize this effect, the retailer should align their advertising and product recommendations.

In this paper, we propose and investigate a joint advertising and assortment optimization problem (JAAOP). We employ the MNL model to understand consumer purchasing behavior, in which every product, including the choice not to purchase, is assigned a random utility. Once presented with a variety of products, the consumer chooses the one with the highest utility. Our study differs from previous research on traditional assortment optimization by taking into account the influence of advertising. That is, rather than just selecting a group of products, we investigate the potential of combining advertising with traditional product selection to enhance the overall optimization. Specifically, we assume that the platform can increase the attractiveness (a.k.a. utility) of a product by advertising it, the effectiveness of which is represented by a product-specific response function and the amount of advertising efforts allocated to that product. With constraints on the advertising budget, our goal is to jointly determine which products to present to consumers and how to allocate the advertising budget among them in order to maximize expected revenue. In one extension of our work, we also examine the sequential choice behavior of consumers [15], a common feature on online shopping platforms such as Amazon and Taobao, where a large number of products are displayed to the consumer in stages. If the consumer does not select any products in a stage, they will move on to the next set of products. This requires the platform to not only select which products to display, but also their positions. We formulate this problem as a joint multi-stage advertising and assortment optimization problem.

1.1 Summary of Contributions

This section summarizes the major contributions of our work.

  • We introduce the JAAOP in which the platform must concurrently select (1) an advertising strategy and (2) a set of products to present to consumers. By using the MNL model and assuming no constraint on the maximum number of products that can be displayed to a user, we can obtain an optimal revenue-ordered assortment and an efficient advertising strategy.

  • When a constraint on the maximum number of products that can be displayed to a user is present, we analyze the problem under different response functions. If the response function is a log function, the optimal advertising strategy is to allocate all the advertising budget to a single product. If the response function is a general concave function, we formulate our problem as a nonlinear continuous optimization problem and use McCormick inequalities to convert it into a convex optimization problem. We then develop an efficient algorithm to find the optimal strategy.

  • In Appendix A and B, we study several extensions. In one extension of this study, we include the price of each product as a decision variable and consider the joint product assortment, pricing, and advertising optimization problem. We also extend our model to incorporate the multi-stage purchase behavior and investigate the structural properties of the problem. We develop a heuristic method that comes with a performance guarantee.

  • We also conduct a series of experiments to evaluate the performance of our solutions and further confirm the value of advertising in Appendix C. Our proposed heuristic method is robust and outperforms other methods in different settings. Specifically, the results suggest that allocating the advertising budget uniformly or greedily leads to substantial revenue loss.

2 Literature Review

Our work is closely related to the assortment optimization problem in revenue management, which aims to select a subset of products to maximize the expected revenue. Various discrete choice models have been proposed to model consumer decision-making behavior, including the MNL model [26], the Nested Logit (NL) model, the d-level NL model and so on. Recently, several works have considered sequential choice behavior. For example, Flores et al. [12] investigated a two-stage MNL model, where the consumer sequentially browses two stages that are disjoint in terms of potential products and [17] extended to the multi-stage setting. Moreover, [15] developed a sequential MNL model, where the utility of the no-purchase option is fixed at the beginning instead of being resampled each time, and studied the assortment and pricing problem with impatient customers.

Another related problem is the advertising budget allocation problem. [2] proposed a model to allocate resources among multiple brands in a single period. [9] further considered advertising budget allocation across products and media channels. [11] considered the lagged effect of advertising and studied the dynamic marketing budget allocation problem for a multi-product, multi-country setting. [1] proposed a single-product spatiotemporal model that includes the spatial differences and sale dynamics.

Finally, our work belongs to the growing literature that aims to improve revenue through sophisticated selling practices beyond product selection. The approaches in this area include offering certain items only through lotteries [18] and making certain products unattractive to consumers [4]. [4] studied the refined assortment optimization problem for several regular choice models, including the MNL, latent-class MNL (LC-MNL), and random consideration set (RCS) models. While the authors in [4] focus on reducing the utilities of some products to improve revenue, our approach aims to increase revenue by increasing the utilities of some products. The main differences are: 1. [4] focus on strategically reducing the utilities of products, whereas our study centers on increasing the utilities of products. 2. [4] assume that changing the utility of a product has no cost, while our model takes into account the cost of increasing the utility of a product through advertising and considers budget constraints in the optimization problem. We also show that the platform has no incentive to decrease the utilities of any products in the MNL model under a cardinality constraint. A similar result was discovered independently by [4] for an unconstrained MNL model.

3 Preliminaries and Problem Formulation

3.1 MNL Model

We list the main notations in Table 1. Generally, the input of our problem is a set of n products \(\mathcal {N} = \{1, 2, \cdots , n\}\). In the MNL model, each product \(i\in \mathcal {N}\) has a utility \(q_i+\epsilon _i\), where \(q_i\) is a constant that captures the initial utility of product i, and \(\epsilon _i\) is a random variable that captures the error term. We assume that \(\epsilon _i\) follows a Gumbel distribution with a location-scale parameter (0, 1). Let \(\textbf{v}\) denote the preference vector of \(\mathcal {N}\), where \(v_i := e^{q_i}\) for each \(i\in \mathcal {N}\). Given an assortment \(S \subseteq \mathcal {N}\) and a preference vector \(\textbf{v}\), for each product \(i\in \mathcal {N}\), a consumer will purchase product i with a probability of

$$\begin{aligned} \phi _{i}(S, \textbf{v})=\left\{ \begin{array}{ll}\frac{v_{i}}{1+\sum _{j \in S} v_{j}} &{} \text{ if } i \in S \\ 0 &{} \text{ otherwise. } \end{array}\right. \end{aligned}$$
(1)

The no-purchase probability is \(\phi _{0}(S, \textbf{v}) = \frac{1}{1+\sum _{j \in S} v_{j}}\). Let \(\textbf{r}\) denote the revenue vector of \(\mathcal {N}\), where for each product \(i\in \mathcal {N}\), \(r_i >0\) represents the revenue from product i. Based on the above notations, the expected revenue \( R(S, \textbf{v})\) of the assortment S is given by

$$\begin{aligned} R(S, \textbf{v}) = \sum _{i \in S} r_i \cdot \phi _{i}(S, \textbf{v}) = \frac{\sum _{i \in S} r_i v_i}{1 + \sum _{i \in S} v_i}. \end{aligned}$$
(2)

3.2 Joint Advertising and Assortment Optimization

We use a vector \(\textbf{x}\) to represent an advertising strategy where for each \(i\in \mathcal {N}\), \(x_i\) represents the amount of advertising efforts allocated to i. Let \(\textbf{c}\) denote the advertising effectiveness of \(\mathcal {N}\). We assume that the utility of each product \(i\in \mathcal {N}\), increases by \( f(c_i x_i)\) if it receives \(x_i\) advertising efforts from the platform, where \(f(\cdot )\) is called response function and \(c_i\) is the advertising effectiveness of product i. Intuitively, \(\textbf{c}\) and \(f(\cdot )\) together determine the degree to which a product’s preference weight is influenced by the advertising it receives from the platform. For a given preference vector \(\textbf{v}\), the expected revenue \(R(S, \textbf{v}, \textbf{x})\) of an assortment S under an advertising strategy \(\textbf{x}\) is calculated as

$$\begin{aligned} R(S, \textbf{v}, \textbf{x}) = \frac{\sum _{i \in S}r_i e^{q_i+f(c_i x_i)}}{1 + \sum _{i \in S} e^{q_i+f(c_i x_i)}} = \frac{\sum _{i \in S}r_i v_i g(c_i x_i)}{1 + \sum _{i \in S} v_i g(c_i x_i)}, \end{aligned}$$
(3)

where \(g(\cdot )=e^{f(\cdot )}\). Hence, \( R(S, \textbf{v}) = R(S, \textbf{v}, 0)\).

We next formally introduce the JAAOP.

Definition 1

Let \(\mathcal {X} = \{\textbf{x} | \sum _{i=1}^n x_i \le B\}\) denote the set of all feasible advertising strategies subject to the advertising budget B. JAAOP aims to jointly find an assortment S of size at most K and a feasible advertising strategy \(\textbf{x} \in \mathcal {X} \) to maximize the expected revenue, that is,

$$\begin{aligned} \max _{\textbf{x} \in \mathcal {X}} \max _{S: |S|\le K} R(S, \textbf{v}, \textbf{x}). \end{aligned}$$
(4)

Let \(S^*\) and \( \textbf{x}^*\) denote the optimal assortment and advertising strategy, respectively, subject to the advertising budget B and cardinality constraint K. In case of multiple optimal assortments, we select the one with the smallest number of items. For ease of presentation, let \(S_{\textbf{v}}\) denote the optimal assortment when \(B=0\), that is, \(S_{\textbf{v}}=\arg \max _{S: |S|\le K} R(S, \textbf{v})\). In this paper, we make two assumptions about \(g(\cdot )\).

Assumption 1

\(g(\cdot )\) is differentiable, concave, and monotonically increasing.

We will now provide the reasoning behind this assumption. Several researchers have investigated the impact of advertising on customer utility, including [10, 25], and [28]. These studies all assumed logarithmic response functions, which imply that market share is a concave function of advertising efforts, meaning that the benefit of incremental advertising decreases as advertising efforts increase. This property, also known as the law of diminishing returns, has been widely used in other works [3, 9, 21]. The assumption we made in our study, known as Assumption 1, captures this property effectively. For the market share of product i in assortment S, that is, \(\frac{v_i g(c_i x_i)}{1 + \sum _{i \in S} v_i g(c_i x_i)}\), we can verify the concavity of the market share function = by observing the negativity of the second derivative. The second assumption states that the advertising effect is zero if a product receives zero amount of advertising efforts from the platform.

Assumption 2

\(g(0)=1\).

We present a useful lemma that states that there exists an optimal advertising strategy that always uses the entire advertising budget.

Lemma 1

There exists an optimal advertising strategy \(\textbf{x}^*\) for problem (4) such that \(\sum _{i \in S^*} x_i= B\).

4 Unconstrained JAAOP

We start by examining a special case of the JAAOP, where \(K=n\), meaning there is no limit on the assortment size.

In the absence of any size constraints and advertising budget, our problem becomes the standard unconstrained assortment optimization problem. As proven by [26], the optimal assortment in this scenario is a revenue-ordered assortment, i.e. all products generating revenue greater than a certain threshold are included. This threshold, as demonstrated in [24], is the expected optimal revenue.

Lemma 2

[24, Theorem 3.2].    If \(K=n\) and \(B=0\), there exists an optimal assortment \(S_{\textbf{v}}\) such that \(S_{\textbf{v}} = \{i \in \mathcal {N} | r_i > R(S_{\textbf{v}}, \textbf{v})\}\).

This characteristic has been noted in other contexts as well, such as the joint pricing and assortment optimization problem [27] and the robust assortment optimization problem [24]. The optimal assortment, given a fixed advertising strategy, remains revenue-ordered. Thus, to find the best solution, we find the optimal advertising strategy for each possible revenue-ordered assortment, and choose the one with the highest expected revenue as the final result. The number of possible revenue-ordered assortments is at most n. The efficiency of this algorithm can be improved by taking into consideration the following observations.

Lemma 3

There exists an optimal assortment \(S^*\) such that \(S^* \subseteq S_{\textbf{v}}\).

Lemma 3 implies that to find the optimal advertising strategy, we must evaluate all the revenue-ordered assortments within \(S_{\textbf{v}}\) and determine the optimal advertising plan. Then, for any revenue-ordered assortment S, we find the optimal advertising strategy to obtain the complete solution, i.e.,

$$\begin{aligned} \mathop {\textrm{max}}_{\textbf{x} \ge 0} & \qquad \frac{\sum _{i \in S} r_i v_i g(c_i x_i)}{1 + \sum _{i \in S} v_i g(c_i x_i)} \\ \mathop {\mathrm {s.t.}}& \qquad \sum _{i \in S} x_i = B. \nonumber \end{aligned}$$
(5)

With \(u_i = v_i g(c_i x_i)\), (5) can be rewritten as:

$$\begin{aligned} \mathop {\textrm{max}}_{\textbf{u} } & \qquad \frac{A(\textbf{u})}{B(\textbf{u})} = \frac{\sum _{i \in S} r_i u_i}{1 + \sum _{i \in S} u_i} \\ \mathop {\mathrm {s.t.}}& \qquad \textbf{u} \in \mathcal {U}, \nonumber \end{aligned}$$
(6)

where \(\mathcal {U} = \{\textbf{u} | \sum _{i=1}^n m_i(u_i) \le {B} , u_i \ge v_i, i = 1, \ldots , n\}\) and \(m_i (\cdot ) = g^{-1}(\frac{\cdot }{v_i})/c_i\). Here (6) is a single-ratio fractional programming (FP) problem. Before presenting our solution to (6), we show that \(\mathcal {U}\) is a convex set.

Lemma 4

The constraint set \(\mathcal {U}\) in (6) is a convex set.

This part describes our solution to (6) in detail. Lemma 4 indicates that (6) is a concave-convex FP problem. We apply the classical Dinkelbach transform [8] by iteratively solving the following parameterized problem:

$$\begin{aligned} \begin{array}{cl} \mathop {\textrm{max}}_{\textbf{u}} &{} \qquad h(y) = A(\textbf{u}) - y B(\textbf{u}) \\ \mathop {\mathrm {s.t.}}&{} \qquad \textbf{u} \in \mathcal {U}. \end{array}\end{aligned}$$
(7)

In particular, our algorithm starts with iteration \(t=0\) and \(y_0 = \frac{A(\textbf{v})}{B(\textbf{v})}\), and in each subsequent iteration \(t+1\), we find \(\textbf{u}_{t+1}\) to maximize \(h(y_t)\) by solving (7) and update \(y_{t+1}= \frac{A(\textbf{u}_{t+1})}{B(\textbf{u}_{t+1})}\). This process iterates until the optimal solution of (7) is 0 and we output the corresponding maximizer \(\textbf{u}^F\). Equation (7) can be solved efficiently because \(A(\textbf{u})\) is a concave function and \(B(\textbf{u})\) is a convex function. This algorithm is guaranteed to converge to the optimal solution [8]. After solving (6) and obtaining \(\textbf{u}^F\), we transform (6) to an optimal advertising strategy such that for each \(i \in S\), we set \(x_i=m_i(u^F_i)\); that is, we allocate \(m_i(u^F_i)\) efforts to i. A detailed description of our solution is presented in Algorithm 1.

Algorithm 1
figure a

Optimal Solution for Unconstrained JAAOP

5 Cardinality-Constrained JAAOP

We next study our problem under a cardinality constraint of \(K>0\). First, we examine a scenario where \(g(\cdot )\) is a linear function, and then we delve into the general case where \(g(\cdot )\) is a concave function.

5.1 g(x) as Linear Function

We first study the scenario where \(g(\cdot )\) is a linear function, expressed as \(1 + ax\) for some \(a\ge 0\). The next lemma demonstrates the existence of an optimal advertising strategy that allocates the entire budget to a single product. For each \(i\in \mathcal {N}\), we define \(\textbf{x}_i\) as a vector in which the i-th element is B and all other elements are zero.

Lemma 5

For any assortment S, the optimal solution for the following problem is achieved at \(\textbf{x}_i\) for some \(i\in S\):

$$\begin{aligned} \begin{array}{cl} \mathop {\textrm{max}}_{\textbf{x} \ge 0} &{}\qquad L(S, \textbf{x}) = \frac{\sum _{i \in S} r_i v_i (1 + a c_i x_i)}{1 + \sum _{i \in S} v_i (1+ a c_i x_i)} \\ \mathop {\mathrm {s.t.}}&{} \qquad \sum _{i \in S} x_i = B. \end{array} \end{aligned}$$
(8)

This lemma implies that to find the optimal advertising strategy, we need to consider at most n candidate advertising strategies: \(\{\textbf{x}_i|i\in \mathcal {N}\}\). Specifically, considering \(\textbf{x}_i\), we replace the original preference weight \(v_i\) of i using \(v_i g(c_i B)\) and then solve the standard capacity-constrained assortment optimization problem to obtain an optimal assortment. Among the n returned solutions, we return the best one as the final solution. [23] showed that the standard cardinality-constrained assortment optimization problem for each \(\textbf{x}_i\) can be solved in \(O(n^2)\) time. Thus, the overall time complexity of our solution is \(n\times O(n^2) = O(n^3)\). Assume all products are indexed in non-increasing order of \(r_i\). The next lemma shows that we can further narrow the search space and reduce the time complexity to \(O(n^2 T)\), where \(T = \max \{i | i \in S_{\textbf{v}}\}\) represents the index of the product that has the smallest revenue in \(S_{\textbf{v}}\).

Lemma 6

Assume all products are indexed in non-increasing order of \(r_i\). Let \(T =\max \{i | i \in S_{\textbf{v}}\}\), there exists an optimal assortment \(S^*\) such that \(S^* \subseteq \{1, 2, ..., T\}\).

We present the detailed implementation of our algorithm in Algorithm 2.

Algorithm 2
figure b

Optimal Cardinality Constrained Solution for Log Response Function

5.2 g(x) as a General Concave Function

We next discuss the general case. Before presenting our solution, we first construct an example to demonstrate that allocating the entire budget to a single product is not necessarily optimal.

Example 1

Consider three products with revenue \(\textbf{r} = (8, 7.5, 2.8 )\), preference weight \(\textbf{v} = (1.2, 1, 1.7)\) and the effectiveness \(\textbf{c} = (0.9, 0.8, 1)\). Assume the cardinality constraint is \(K = 2\) and the total advertising budget is \(B = 10\). We consider a concave function \(g(x) = \sqrt{x} +1\). If we are restricted to allocating the entire budget to a single product, then the optimal advertising strategy is (10, 0, 0), the optimal assortment is composed of the first two products, and the expected revenue of this solution is 6.75. However, the actual optimal advertising strategy is (approximately) (8.285, 1.715, 0), the actual optimal assortment contains the first two products, and it achieves expected revenue of 6.812. The above example shows that the single-product advertising strategy is no longer optimal for a general concave response function.

We next present our solution. Let \(u_i = v_i g(c_i x_i)\) and define \(m_i (\cdot ) =g^{-1}(\frac{\cdot }{v_i})/c_i\) for each \(i\in \mathcal {N}\), we first transform (4) to an equivalent nonlinear mixed integer program (9) by replacing \(\sum _{i = 1}^n x_i = B\) using \(\sum _{i =1}^n m_i(u_i) \le B\),

$$\begin{aligned} \max _{ \textbf{u} \in \mathcal {U}} & \qquad \max _{\textbf{t} \in \{0,1\}^n} \frac{\sum _{j=1}^n u_j r_j t_j}{1 + \sum _{j=1}^n u_j t_j} \\ \mathop {\mathrm {s.t.}}& \qquad \sum _{i=1}^n t_i \le K, \nonumber \end{aligned}$$
(9)

where \(\mathcal {U} = \{\textbf{u} | \sum _{i=1}^n m_i(u_i) \le {B} , u_i \ge v_i, i = 1, \ldots , n\}\). We next present a useful lemma from [6].

Lemma 7

(Theorem 1 [6]). The inner problem of (9) is equivalent to the following linear program

$$\begin{aligned} \max _{\textbf{w}, w_0} & \qquad \sum _{j=1}^n r_i w_i \end{aligned}$$
(10)
$$\begin{aligned} \mathop {\mathrm {s.t.}}& \qquad \sum _{i=1}^n w_i + w_0 = 1, \end{aligned}$$
(11)
$$\begin{aligned} & \qquad \sum _{i=1}^n \frac{w_i}{u_i} \le K w_0, \end{aligned}$$
(12)
$$\begin{aligned} & \qquad 0 \le \frac{w_i}{u_i}\le w_0 \qquad \forall i \in \mathcal {N}. \end{aligned}$$
(13)

Notice that (12) and (13) involve some nonlinear constraints if \(\textbf{u}\) is not fixed. Thus we introduce new variables \(\ell _i = \frac{w_i}{u_i}\), \(i\in \mathcal {N}\) and rewrite (9) as follows:

$$\begin{aligned} \text {(NO)} \quad \max _{ \textbf{u} \in \mathcal {U}} \max _{\textbf{w}, \mathbf {\ell }, w_0} \quad &\sum _{j=1}^n r_i w_i \end{aligned}$$
(14)
$$\begin{aligned} \mathop {\mathrm {s.t.}}\quad &\sum _{i=1}^n w_i + w_0 = 1, \end{aligned}$$
(15)
$$\begin{aligned} \qquad \quad &\sum _{i=1}^n \ell _i \le K w_0, \end{aligned}$$
(16)
$$\begin{aligned} \quad \qquad &0 \le \ell _i \le w_0 \qquad \forall i \in \mathcal {N} , \end{aligned}$$
(17)
$$\begin{aligned} \quad \qquad &w_i = \ell _i u_i \qquad \forall i \in \mathcal {N} . \end{aligned}$$
(18)

We further use the classic McCormick inequalities ([20]) to relax the nonconvex constraints (18):

$$\begin{aligned} \text {(MC)} \quad &w_i \ge \ell _i v_i \qquad \forall i \in \mathcal {N} , \\ &w_i \ge u_i + \ell _i v_i g(B c_i) - v_ig(B c_i) \qquad \forall i \in \mathcal {N} ,\\ &w_i \le \ell _i v_i g(B c_i) \qquad \forall i \in \mathcal {N} , \\ &w_i \le u_i + \ell _i v_i - v_i \qquad \forall i \in \mathcal {N}. \end{aligned}$$

Through the above relaxation, we transform (NO) into a convex optimization problem that can be solved efficiently. After solving this relaxed problem and obtaining a solution \(\textbf{w}\), we can compute the final assortment S as follows: we first find the product for which the \(w_i\) is strictly larger than 0, that is \(S^w = \{i |i\in \mathcal {N}, w_i \ne 0\}\). Then we sort the products in \(S^w\) by the value of \(w_i\) and choose the first K products. Notice that the advertising strategy that is obtained from solving the previous relaxed problem may not be optimal. One can solve (6) to find the optimal advertising strategy for S. Lastly, if the size of the input is large, we can reduce the problem size by selecting a smaller group of candidate products based on Lemma 6. A detailed description of our solution is listed in Algorithm 3.

Algorithm 3
figure c

Cardinality Constrained Solution for General Response Function

6 Conclusion

This paper considers the JAAOP problem under the MNL model, where the seller decides their advertising strategy for all products to improve the current revenue. We consider both the log and general concave response functions. If there are no capacity constraints, we show that the optimal assortment is still revenue-ordered. However, this result does not hold in the presence of a cardinality constraint. When the response function is a log function, we prove that the optimal advertising strategy is a single-product advertising strategy, thus the optimal solution could be found in polynomial time. For the general concave response function, we develop an efficient algorithm to find a near-optimal solution. We further consider the seller could adjust the price simultaneously, and show that such a problem can be efficiently solvable under unconstrained setting or be transformed as a mixed-integer nonlinear programming for the cardinality constraint setting. Additionally, as an extension, we study the multi-stage MNL choice model, in which the customer browses the assortments sequentially. Our results demonstrate that the seller has no incentive to decrease the utility of any product, even under the capacity constraint. Finally, we conduct extensive experiments to illustrate that the advertising strategy is more effective with small cardinality constraint and large no-purchase utility.