1 Introduction

This paper is an attempt to extend the random utility model (RUM) to the realm of dynamic choice. The framework that I am going to use is the one introduced by Kreps (1979). In this framework, the decision is made in two stages. In the first stage, the decision-maker chooses a menu from a set of available menus, out of which he/she would choose an individual item in the second stage. For example, he/she might first choose a restaurant to then choose a meal from its menu once in the restaurant. In this framework, menus derive all their values from what they contain. In other words, we abstract away from situations in which menus have intrinsic values.

Following Kreps (1979), choices over menus have been employed by numerous papers to model various issues and behavioral biases that are unidentifiable within the classical framework when we only look at choices from menus, i.e., choices in the second stage.Footnote 1 All of these models characterize the behavior of a single decision-maker. These models are often compared with the standard model where the decision-maker chooses a menu that offers the best item according to her preferences over individual items.Footnote 2 However, in many empirical studies, such as experimental settings, choice data come in an aggregate form from a population of decision-makers who might differ in their preferences and attitudes. In such scenario, choice data will appear stochastic in the sense that different menus might be observed to be chosen from the same set of available menus. This fact calls for modeling choices over menus in a stochastic fashion to better test, calibrate, compare, and contrast different models of behavior.

This paper does precisely the above task when the underlying model of individual behavior is the standard model. To do so, I assume that choices are generated via random utility maximization. Specifically, I assume that there exists a probability distribution over the set of all possible preferences over individual items, which in turn leads to a probability distribution over the set of preferences over menus of items, according to the standard model. For example, restaurant A is better than B if the best meal of A is better than the best meal of B, according to the realized preference relation over meals.Footnote 3 I provide the necessary and sufficient conditions on stochastic choices over menus for them to be compatible with the described model.

2 Related literature

On one hand, this paper is related to the literature on stochastic choice and in particular the random utility model. To understand what the random utility model entails, let X be the finite universal set of items. Menus, denoted by \(m,m',m_1\), etc. are non-empty subsets of X. Denote the probability of choosing an item x in a menu m by \(\rho (x|m)\). \(\rho : X \times 2^X\setminus \emptyset \rightarrow [0,1]\) is called the random choice rule and satisfies \(\sum _{x\in m} \rho (x|m)=1\). This condition is a feasibility condition requiring the chosen option to be available, i.e., to belong to m. Also, let O(X) be the set of all strict orderings of X and denote a generic element of it by u. According to RUM, there exists \(\mu \in \varDelta (O(X))\), such that

$$\begin{aligned} \rho (x|m)= \mu (u \in O(X)| \forall y\in m, u(x)\ge u(y)). \end{aligned}$$

In words, \(\rho (x|m)\) is equal to the likelihood of having an strict ordering that places x above all other items in m.Footnote 4

Not all random choices are consistent with RUM. McFadden and Richter (1990) provided the condition that characterizes RUM.Footnote 5 The condition is called the axiom of revealed stochastic preferences (ARSP), which goes as the following:

Axiom of Revealed Stochastic Preferences: For any list \(x_{1}\in m_{1},\ldots , x_{n}\in m_{n}\), the following is true:

$$\begin{aligned} \sum _{i=1}^{n} \rho (x_{i}|m_{i}) \le \max _{u\in O(X)} \sum _{i=1}^{n} 1_ {\{ \forall \quad x_j \in m_i, u(x_{i})\ge u(x_{j}) \}}. \end{aligned}$$

The intuition for ARSP is the following: For every \(u \in O(X)\), let T(u) be the number of items in the list that are ranked above all other items in their respective menus, according to u. Also, let the right-hand side of the inequality above be equal to \(T^{*}\). Therefore, for any \(u \in O(X)\), \(T(u)\le T^{*}\). On the other hand, the left-hand side of the inequality above is some weighted average of \((T(u))_{u \in O(X)}\) and, therefore, is less than \(T^{*}\), hence the inequality above.

On the other hand, this paper is closely related to the literature that studies choices over menus. Menus are meant to represent the first stage of a dynamic decision process that determine the available options in future stages. Apart from their relevance to everyday decisions, in the theoretical literature, menus have been used to model and identify some anticipated behavioral aspect or irregularity in decision-making that cannot be fully identified in the classical static environments.

In the first paper on the subject, Kreps (1979) first characterizes deterministic preferences over menus that are compatible with a single standard decision-maker. It turns out a single axiom is all that is needed, which requires the union of any two menus to be indifferent to the more preferred one. Furthermore, Kreps (1979) studies the effect of subjective uncertainty regarding future mood (i.e., second-stage preferences) on preferences over menus and contrasts it with preferences of a standard decision-maker, who does not perceive such uncertainty. He shows that while a standard decision-maker does not find the combination of two menus more valuable than both, a decision-maker who perceives uncertainty regarding his/her future preferences might do so. In other words, such uncertainty leads to a preference for flexibility.

Gul and Pesendorfer (2001), henceforth GP (2001), models the effect of temptation and costly self-control in the second stage on preferences over menus. They consider two scenarios. In one scenario, the decision-maker has temptation, but has the ability to abstain by exerting costly self-control. In the other scenario, the decision-maker falls to his/her temptation, no matter how much will they exert their will. The second scenario is equivalent to the dual-self-model of intertemporal decision-making proposed by Strotz (1955). In contrast to Kreps (1979), the decision-makers modeled in GP (2001) might prefer commitment rather than flexibility. In particular, for these decision-makers, the combination of any two menus will always be ranked in between the two.

Ortoleva (2013) studies the effect of costly contemplation of choice or thinking in the second stage on preferences over menus. In this model, such costly contemplation might also lead to a preference for commitment. Also, Kopylov (2012) studies the effect of perfectionism and guilt aversion on choices over menus. In all these models, in the first stage, the decision-maker anticipates these irregularities and adjusts her choices over menus accordingly to maximize her own utility. In some cases, the preferences of the first- and second-stage selves are aligned, such as in Kreps (1979), Ortoleva (2013) and temptation with self-control model of GP (2001). In some other cases, there might be a conflict of interest between the two, such as the temptation without self-control model of GP (2001).

There are a number of papers that incorporate random preferences in modeling choices over menus. Stovall (2010) models a decision-maker who has temptation with self-control as defined in GP (2001), but is uncertain about the temptation preference which he/she would have in the second stage. In other words, the decision-maker expects to have random temptations in the second stage when he/she is to choose an item from the menu chosen in the first stage. Stovall (2010) assumes that the random temptation preferences come from a finite set. The decision-maker anticipates this randomness in the first stage and attaches an “expected utility” to each available menu with the expectation being taken with respect to the probabilities of different temptation preferences. In contrast to my work, since the randomness is resolved in the second stage in this model, preferences and choices over menus will be deterministic.

In a similar model, Dekel and Lipman (2012), henceforth DL (2012), characterize a Strotzian decision-maker—one who suffers from temptation but without self-control as defined in GP (2001)Footnote 6—who has a single normative preference over individual items, but is uncertain about which temptation preference which he/she is going to have in the second stage. They call such decision-maker random Strotzian. In this paper, as in Stovall (2010), since the randomness is resolved in the second stage, it will lead to deterministic preferences and choices in the first stage. DL (2012) show that preferences that are modeled in Stovall (2010)—what they call random GP preferences—have also a random Strotzian representation, assuming that the probability measure over the set of temptation preferences that the decision-maker considers is nontrivial. They define a measure as nontrivial if it assigns zero measure to the set of utility functions (preferences) that are indifferent between all individual items.

Chatterjee and Krishna (2009) consider a decision-maker that has two selves, a long run self and a virtual self who have deterministic but different preferences over the individual items. At stage one, the decision-maker—possessed by his/her long run self—anticipates that his/her virtual self will take over in the second stage with some privately known probability. Responding to this anticipation, the decision-maker evaluates each menu by the expected utility of the chosen item in the second stage evaluated by the long run self’s utility function and the expectation being taken with respect to the probability that the virtual self will take over. Once again, this model leads to deterministic preferences and choices in the first stage.

Another close paper to mine is a more recent work by Fudenberg and Strzalecki (2015), henceforth FS (2015). This paper also aims at importing stochastic properties of choice to a dynamic environment. In some aspects, FS (2015) Fudenberg and Strzalecki (2015) is more general than my approach. In particular, this paper allows for environments with an arbitrary finite number of stages, rather than only two stages. Moreover, at each stage, they allow the menus to not only determine the future available options, but also to have intrinsic values of their own. In contrast to my approach, FS (2015) assumes that we observe choices in all stages. Moreover, it focuses on specific stochastic choice models, such as logit, and does not characterize the general effect of having random utilities on choices over menus.

In the next section, I introduce the stochastic version of the standard model and provide a characterization of it.

3 Standard model

3.1 Preliminaries

X is the universal set of items and is finite. Menus are subsets of X and are denoted by \(m, m',m_1\) and so on.Footnote 7\(M \subseteq 2^X\setminus \{\{\emptyset \}\}\) denotes a generic collection of menus that contain more than just the empty menu. \(S_M\) denotes the union of all menus in M. For every M and m, let \(M-m= \{m'-m| m'\in M\}\). Call m a singleton if \(|m|=1\). The set of all strict orderings of X is denoted by O(X), and the set of weak orderings of X is denoted by WO(X). Without loss of generality, we can represent each ordering of X, weak or strict, by an ordinal utility function. For the sake of simplicity, I use orderings and utility functions interchangeably.

I assume that observable data consist of the probability of choosing an arbitrary menu m in an arbitrary collection of menus M that contains m. Such probabilities are represented via a function called the random menu choice rule, (RMCR), denoted by \(\rho : 2^X \times 2^{2^X}\setminus \{\{\emptyset \}\} \rightarrow [0,1]\), such that \(\sum _{m\in M} \rho (m |M)=1\). Also, for all menus m and collection of menus M, \(\rho \) satisfies \(\rho (m| M \cup \{\emptyset \})= \rho (m|M)\). The latter means that empty menus do not affect the choice probability of other available menus.

3.2 Representation

Before the first stage, \(u \in O(X)\) is realized according to a probability distribution that is denoted by \(\mu \in \varDelta (O(X))\). In the first stage, one of the available menus in M whose best item is better than that of all others according to u is chosen. In this model, indifferences can arise whenever the intersection of m with some other menu in M is not empty. To be more precise, whenever the best item according to the realized utility function u lies in the intersection of m and some other menu in M, and then, the decision-maker becomes indifferent between m and those menus containing that best item. This model stays silent about how these ties are broken. In other words, conditional on a realized utility function, u, I only assume that the chosen menu is one of the best available ones according to \(V_u\).Footnote 8 Formally, this translates to the following: every \(u \in O(X)\) induces a weak ordering \(V_u\in WO(2^X \setminus \{\emptyset \})\) of menus in the following way:

$$\begin{aligned} V_{u}(m)= \max _{x\in m} u(x). \end{aligned}$$

As such, the standard model is equivalent to:

$$\begin{aligned}&\mu [u| \forall m'\in M, V_{u}(m)> V_{u}(m')]\le \rho (m|\{m\}\cup M)\\&\quad \le \mu [u|\forall m'\in M, V_{u}(m)\ge V_{u}(m')]. \end{aligned}$$
(1)

The inequality on the left requires that the probability of choosing m against the menus in M to be weakly greater than the probability of having a utility function u, such that \(V_u\) ranks m strictly above all menus in M. The inequality on the right requires the probability of choosing m against the menus in M not to be greater than the probability of having a utility function u, such that \(V_u\) ranks m weakly above all menus M.

3.3 Axioms

In this section, I provide the axioms that characterize the standard model. Notice that in this model, the restriction of observed choices to the collection of singletons is similar to choices that are generated by a random utility maximizer in the realm of static choice. Thus, the restriction of \(\rho \) to the set of singletons must satisfy ARSP. This is stated formally in the following axiom.

  1. 1.

    ARSP Over Singletons Let n be a an arbitrary natural number. Also, for every \(i\in \{1,2,\ldots ,N\}\), let \(M_i\) be a collection of singletons and \(\{x_i\} \in M_i\). Then, the following holds:

    $$\begin{aligned} \sum _{i=1}^{n} \rho (\{x_{i}\}|M_{i}) \le \max _{u\in O(X)} \sum _{i=1}^{n} 1_ {\{ \forall \{x_j\} \in M_i, u(x_{i})\ge u(x_{j}) \}}. \end{aligned}$$

    Now, consider three available restaurants with mutually exclusive menus \(m,m',m''\). Suppose that after a while, m and \(m'\) merge into a single restaurant that serves the menu \(m\cup m'\). After this change, the probability of choosing the new restaurant, \(m\cup m'\) would be the same as the probability of choosing either m or \(m'\), before the merging took place. This observation leads us to the next merging additivity axiom.

  2. 2.

    Merging Additivity For all mutually exclusive menus m, \(m'\), and \(m''\), we have:

    $$\begin{aligned} \rho (m\cup m'| \{m\cup m',m''\})= \rho (m| \{m, m',m''\}) + \rho (m'|\{m, m',m''\}). \end{aligned}$$

    To see how the next axiom works, consider a set of available restaurants M whose menus do not overlap with restaurant m. The probability of choosing m against the restaurants in M is only sensitive to the union of the items offered by restaurants in M and not to how these items are distributed across different restaurants. This property is formalized in the following reorganization invariance axiom.

  3. 3.

    Reorganization invariance For every menu m and sets of menus M and \(M'\), \(S_M=S_{M'}\) and \(m\cap S_M= \emptyset \) imply:

    $$\begin{aligned} \rho (m|\{m\}\cup M)=\rho (m|\{m\}\cup M'). \end{aligned}$$

    The next axiom, disjointing inequalities, consists of two inequalities. To understand the axiom, consider an arbitrary set of available restaurants. If one of the restaurants discards all the items on their menu that are found on the menu of the other available restaurants, its choice probability would weakly decrease. Moreover, the level of decrease in its choice probability is not greater than its choice probability when it offers only the discarded items, and those items are removed from the menus of all other available restaurants. These inequalities are formally stated below.

  4. 4.

    Disjointing inequalities For every menu m and set of menus M, we have:

    1. (a)
      $$\begin{aligned} \rho (m-S_{M}| \{m-S_{M}\} \cup M\})\le \rho (m|\{m\}\cup M). \end{aligned}$$
    2. (b)
      $$\begin{aligned}&\rho (m|\{m\}\cup M) - \rho (m-S_{M}| \{m-S_{M}\} \cup M\})\\&\quad \le \rho (m\cap S_{M}| \{m\cap S_{M}\} \cup (M-m)). \end{aligned}$$

Now, we are ready to state the theorem.

Theorem

An RMCR \(\rho \) satisfies axioms 1–4 if and only if there is a probability distribution over \(\textit{O(X)}\). such that the \(\rho \) satisfies (1), i.e.. \(\rho \) is rationalizable by the standard model.

Proof

The formal proof is in the appendix. Here, I provide an informal sketch for the “only if” part. First, by ARSP over singletons and the fact that ARSP characterizes RUM, it is guaranteed that a probability measure \(\mu \) over preferences exists, such that the restriction of \(\rho \) to singletons can be represented by (1). Next, using merging additivity and reorganization invariance, for any such \(\mu \), I prove the validity of (1) for bigger menus and when the menu in consideration is disjoint with other available menus. Finally, using the disjointing inequalities, I prove the validity of (1) for cases where the available menus might overlap. \(\square \)

Identification Note that when we confine ourselves to menus of deterministic outcomes in X, the inferred probability measure \(\mu \in O(X)\) above may not be unique.Footnote 9 However, in an alternative framework, we can allow menus to contain lotteries over X and assume that the decision-maker has a random cardinal utility function that evaluates lotteries in \(\varDelta (X)\). In this framework, if we require the decision-maker to be an expected utility maximizer in the realm of lotteries, then we can replace the axioms of random expected utility model of Gul and Pesendorfer (2006) with our ARSP axiom over singletons. Since the random expected utility model of Gul and Pesendorfer (2006) uniquely identifies the probability distribution of the cardinal utility function, our model would do so, as well. Other than that, the rest of our axioms remain unchanged.

One important feature of the model is that adding an item x to a menu m will always increase the chance of choosing that menu against any collection of menus M that are disjoint with \(m\cup \{x\}\). This is because all the preferences that rank m above all menus in M still do so for \(m\cup \{x\}\). However, there are extra new preferences that rank \(m\cup \{x\}\) above all menus in M, those that place x above all items that are found in m and the menus in M. Therefore, all such preferences also choose \(m\cup \{x\}\) over all menus in M. One can think of this property as a form of stochastic monotonicity.

4 Conclusion

This paper is an attempt to characterize the implications of random preferences over a set of items, when the actual choice objects are menus of such items. I assumed that menus are not intrinsically valuable, and they draw all their values from what they contain. I studied choices over menus generated by a population of decision-makers, each of whom behaves according to the standard model. I showed that such choices are characterized by four axioms, namely ARSP over singletons, merging additivity, reorganization invariance, and disjointing inequalities.

One natural step forward is to investigate stochastic choices of a population of Strotzian decision-makers who have different pairs of time-inconsistent preferences over the set of individual items. Another natural future step is to incorporate the stochasticity of choice in other famous models of choice over menus, such as the temptation with self-control model of GP (2001) and the model of preference for flexibility in Kreps (1979), among others.