1 Introduction

Purchases of many goods are characterized by a tradeoff between accepting the best price currently available and waiting to see if a better offer appears at a later date. For example, a seller of a home must decide whether to accept an offer from a buyer or to turn it down and wait for a higher price from another prospective purchaser. This decision may be made a number of times before a transaction occurs. Markets with this feature are called search markets. These markets innovate over time as information technology progresses. Recent years have seen the advent of additional services that are offered as part of search processes, changing the decision problem that searchers face. This paper focuses on a particular innovation that has become prevalent in such markets, the Price Freeze Option, or PFO. A PFO is a service offered by firms, which guarantees the availability of an observed offer for potential future acceptance. Purchasing a PFO gives a searcher the possibility of accepting an offer at a later stage, while not committing herself to that offer. That is, once an offer is frozen, the searcher can go back and accept the frozen offer at a later date.

PFOs are making inroads into various important markets. As an example, consider commercial airlines, a large number of whom have introduced the option to freeze the price of airline tickets. For example, at the time of this writing, United Airlines offers an option, called Farelock, to freeze a price of an airline ticket for one week, for a fee of between 5 and 10 US dollars.Footnote 1 Lufthansa, Air France, and other global and regional carriers offer similar possibilities.Footnote 2 The proliferation of such offers suggests that there is some demand for the option to lock in prices, and that the airlines find the practice profitable, or at least necessary to stay competitive. Similar features exist in the market for mortgages, in which it is often possible to lock in an interest rate for a limited period of time, and for hotel and rental car reservations that can be canceled for some time after they are made.Footnote 3

To our knowledge, the institution of price freezing has not been studied by economists.Footnote 4 Thus, it is not well-understood how the existence of the option to freeze prices affects the behavior of consumers. It has not been established under which, if any, conditions the availability of a PFO lengthens or shortens searches, benefits the searcher, or is profitable for the party offering the option. Furthermore, it is unknown how the response of consumers to the opportunity to purchase such a price guarantee depends on the length of time they have to make the purchase and the cost of the freezing option.

We use both theoretical and experimental methods to study the effect of PFOs on search behavior and outcomes. We study the searcher’s decision only, and take the price setting process and freeze fees as exogenous. While this emphasis on one side of the market does not allow us to evaluate equilibrium predictions, it does permit us to focus on the searcher’s ability to solve a dynamic search problem without having to consider strategic uncertainty about the behavior of agents on the other side of the market. The theoretical analysis identifies benchmark decisions and outcomes that result from optimal behavior of a risk neutral agent. The experiment is used to consider which aspects of the model are likely to find empirical support, and where its predictions might exhibit inaccuracies.

Our purpose is to study price freezing as an institution, and not to simulate or investigate the airline, mortgage, or any other specific market. Our focus is, rather, on the PFO itself and its implications on searchers’ behavior, and our goal is to obtain some general insights regarding the properties of this institution. While we do find it striking that numerous firms have recently adopted price freezing with such vigor, we do not address the forces behind the decision to adopt a policy of offering PFOs. Rather, in our experiment, individuals are randomly assigned in different phases of the sessions to different markets that may or may not permit price freezing, and the implications of this institution on search behavior are studied ceteris paribus.

Our model builds on a classical homogeneous good sequential search model with a finite horizon and a risk neutral agent, which we extend to include the presence of PFOs. We consider how behavior and outcomes respond to changes in (i) the price of the freezing option, and (ii) the length of the time horizon available to the decision maker. We do so both in the absence and the presence of the possibility of recalling and accepting a prior offer other than the one that was frozen. We consider a setting in which there is a particular good available for purchase that has no close substitutes, such as an airline ticket to travel to a meeting on the only flight available on the day that one must travel.

We characterize the optimal decision rule, which is a non-stationary policy that we call a reservation / double reservation policy. This policy dictates that in the stages just before the terminal stage, there are two price thresholds, and in sufficiently early periods, there is one threshold. When there are two thresholds, offers more favorable than a cutoff level are accepted, those in an intermediate range are frozen, and those that are less favorable than a second cutoff are rejected. If there is one threshold, only acceptances and rejections occur. The threshold price levels depend on the offer distribution, the cost of search, the fee for freezing an offer, and the number of periods available to continue the search.

We then report a laboratory experiment, in which we study whether some of the conclusions of the model are borne out in the data. To evaluate the comparative statics of the model, we vary, in different treatments, the fee charged for the freeze option and the length of the time horizon. We test whether higher freeze fees decrease the average length of searches and lower the incidence of the use of the freeze option. We also consider whether lengthening the time horizon increases search length, leads to less use of the freeze option, and increases profits. We evaluate these predictions in an environment in which recall of prior offers is not possible, as well as one in which recall is possible, but imperfect.Footnote 5 Finally, we investigate individual decisions and consider how well these conform to the optimal strategy that the model predicts.

The data show that the model predicts the general patterns in the experiment very well. The differences between experimental treatments with regard to search length, the usage of freezing, and the payoff to the searcher, are consistent with the comparative statics of the model. The treatment effects are strong despite a relatively small number of observations. There is strong evidence for the use of reservation and double reservation price rules in the predicted manner. A logistic functional form describes the probability of accepting an offer at a given price very accurately. Our model also outperforms four alternatives, with different underlying mechanisms, that have been applied to sequential search in previous literature. We adapt these mechanisms to our setting, where freezing is allowed, and compute the resulting predictions. The alternative mechanisms are: (1) risk aversion, (2) cognitive information acquisition costs, (3) anticipated regret, and (4) inclusion of sunk costs in the payoff calculation. Models based on these mechanisms have been applied to experimental data on search without freezing by Cox and Oaxaca (1989), Gabaix et al. (2006),Footnote 6 Weng (2009) and Kogut (1990), respectively. We consider whether these mechanisms explain our data better than the model we propose in Sect. 3.

The main departure from the model is a modest, though statistically significant, tendency to end searches too early in most treatments. As discussed in Sect. 2, this pattern has also been documented in a number of prior studies in which price freezing is not possible. Beyond documenting that under-searching is common, we observe two other patterns. Firstly, we show that lowering the freeze fee magnifies the extent to which searchers’ exploration is below the optimal level. Secondly, we show that while under-searching is prevalent, its adverse effects on individuals’ profits are small. Thus, in the short run, one-sided model we consider, where firms do not respond to consumer behavior, searchers’ welfare loss is not substantial.Footnote 7

As we describe in Sect. 4, in some of our treatments, offers that have been rejected cannot be recalled later. These conditions are referred to as the No Recall treatments. In other treatments, if an offer is rejected and search continues, there is a positive probability less than one of being able to recall the offer and accept it later. We refer to these as Imperfect Recall treatments.Footnote 8 Imperfect recall can typically arise in two ways in the field. Firstly, the good may become sold out. For example, popular concert tickets may run out in minutes. On retailing websites on Cyber Monday, it may be a matter of seconds. In such cases, there is limited opportunity to recall earlier offers. Secondly, prices may exhibit high volatility, so that even if revisiting previous vendors is possible, the price is likely to have changed. In such cases, recall of earlier offers may be possible, but is far from guaranteed. It is evident that the value of a PFO decreases in the recall probability. While a PFO and the ability to recall earlier offers share some similar features, they also have important differences. In particular, a PFO pertains to a specific offer that was chosen by the searcher to be frozen, whereas recall applies to the best offer seen at any prior stage. Our results regarding the effects of PFOs on behavior are robust to both settings with no recall and imperfect recall.

The paper proceeds as follows. In Sect. 2 we discuss related literature. We briefly review the experimental literature on search, and discuss several explanations that have been proposed for the tendency for search to be terminated too early. We then briefly mention the varying asssumptions on the recall of prior offers in prior literature. Section 3 develops a theoretical model of sequential search featuring a PFO and characterizes the optimal solution for a risk neutral agent when recall of offers is not possible. The first part of the section assumes that the PFO is exogenous and shows that if it is accepted, the acceptance must occur in either the first period of search or in the last period in which it is possible. The second part extends this analysis to allow the searcher to purchase the PFO at any time. The results in this section serve as the source of hypotheses for our experiment. Section 4 describes our experimental design, indicates the model’s specific predictions, and states the hypotheses for our experiment. Section 5 presents the results from the No Recall treatments. We first present results regarding treatment effects and then move on to individual decision patterns. Later in the section, we evaluate the four alternative models mentioned above against the model we derive in Sect. 3. Section 6 summarizes the findings and offers concluding thoughts. The results from treatments with Imperfect Recall are reported in Appendix D.

2 Background

The economic literature on search dates back to the seminal analysis of Stigler (1961). He develops a simultaneous search model, where a consumer decides ex-ante how many price offers to sample. Sequential search, in which a decision maker takes a sequence of decisions about whether or not to accept new offers, was introduced by McCall (1970), DeGroot (1970), Kohn and Shavell (1974) and others.Footnote 9 The basic sequential search model is very simple. Every period, a risk-neutral agent draws an offer from a fixed and known distribution, and then chooses between accepting and rejecting it. Acceptance terminates the decision problem, while rejection moves the task on to the next period, where a new offer is received, and so on. A constant search cost is paid for every offer drawn. Under these simple assumptions, the optimal strategy is a reservation rule (i.e. a cutoff strategy), where the cutoff is chosen so that the expected marginal benefit of continuation to the next stage equals the per-period search cost. It is well-known that this reservation rule is stationary when the time horizon is infinite. Under a finite horizon assumption, the reservation rule has the property that the cutoff is monotonically increasing over time, as individuals become less selective when the expected benefit from continuation decreases.Footnote 10

The experimental literature on testing predictions of search models dates back to Kahan et al. (1967) and Rapoport and Tversky (1970). Kahan et al. (1967) vary the distribution of offers and do not observe a strong effect on the quality of decisions. They observe that subjects tested in groups search longer than those participating individually. Rapoport and Tversky (1970) observe decisions close to optimal with some early stopping. Schotter and Braunstein (1981) evaluate various theoretical implications. For instance, they study how search behavior is affected by exogenously-induced risk aversion, changes in the offer distribution and in the information the searcher holds, as well as the degree of recall. They also test whether individuals are following an optimal threshold rule directly by eliciting the payment that subjects require as compensation for not engaging in search. A large part of the subsequent experimental search literature focuses on whether individuals apply reservation rules in optimal stopping problems. While the elegant, simple, and intuitive optimal reservation price rule prediction is one of the most appealing attributes of these models, the typical empirical finding is that individuals tend to stop earlier than is predicted by this rule (see, for example, the experiments of Cox and Oaxaca (1989), Sonnemans (1998), Kogut (1990), Einav (2005) and Schorvitz (1998)).Footnote 11

Various explanations have been proposed to rationalize early stopping. We consider how well models based on these explanations predict decisions in our task compared to our model. A commonly discussed explanation is risk aversion, under which searchers value the action of stopping at a premium because it provides a deterministic payoff, avoiding the variability involved in continuation. Thus, risk averse buyers have higher reservation prices. While Cox and Oaxaca (1989) attribute under-searching to risk aversion, Sonnemans (1998) argues that only a small fraction of decisions to stop early can be rationalized by reasonably risk averse preferences.

A second mechanism, based on cognitive information acquisition costs, is proposed by Gabaix et al. (2006). Their behavioral assumption is that at any stage, individuals treat the search problem as having fewer remaining future stages than there actually are. This immediately implies under-searching because reservation prices are increasing over time as the end of the horizon approaches. Their model was intended for application to an environment with heterogeneous goods, in which individuals direct their search. The model successfully predicts experimental results for that setting. Here, we have a homogeneous good setting, no directed search and an additional action, freezing, that can be taken. Thus, applying their model to our data is not a test of their theory, but rather an exploration of whether a similar mechanism might be at work in our environment.

The third mechanism we consider, anticipated regret preferences, is based on the anticipated regret theory of decision making under uncertainty proposed by Loomes and Sugden (1986). In their model, agents anticipate that after uncertainty is resolved, they will compare the realized payoff of the chosen alternative with the payoff that would have been obtained by choosing a different action (and having the uncertainty resolved in the same manner). Weng (2009) shows that in a model with perfect recall, incorporating anticipated regret implies a standard reservation rule strategy, but with higher buyer reservation prices than in the benchmark model, leading to under-searching.

The fourth mechanism that we consider is one proposed by Kogut (1990). Under this account, individuals do not treat the accumulated search costs incurred before the current period as sunk. Instead, though the actual search costs are constant over time, individuals act as if they are increasing. This leads them to stop their search earlier than they would otherwise. We consider a version of this mechanism, with a failure to treat both search costs and freeze fees as sunk.Footnote 12

The impact of different assumptions on the capacity to recall previous offers has also been studied. Landsberger and Peled (1977) and Karni and Schwartz (1977) formalize sequential search with imperfect recall, and Janssen and Parakhonyak (2014) analyze the implications of costly recall. Landsberger and Peled (1977) augment the homogeneous good sequential search model by allowing for any recall probability (encompassing perfect, imperfect and no recall). In their model, the recall probability is independent of the time elapsed since the best prior offer, and consumers know the price distribution. They interpret imperfect recall as an indicator of market conditions - prices are less likely to be available for recall in the future when demand is greater or supply is more constrained. In the model of Karni and Schwartz (1977), the recall probability decreases as time elapses since the best offer has been seen, and consumers do not know the price distribution ex-ante. They characterize a family of learning processes for which a reservation price strategy is optimal. The way we model imperfect recall is similar to the approach taken in Landsberger and Peled (1977).

3 Theory

This section is organized in the following manner. In Subsection 3.1, we develop a model of sequential search with an exogenously given outside option. In Subsection 3.2, we endogenize the outside option by introducing a PFO, which is in effect an opportunity to purchase the availability of an outside option. We assume a finite horizon throughout the entire section. In both subsections, we assume that recall of prior offers is not possible. We consider the case of imperfect recall in Appendix B, solving it using numerical methods. The model is formulated, in line with the theoretical literature on consumer search, as the decision problem of a buyer facing a sequence of offers from potential sellers and who searches for the lowest price. The theory can be readily translated in a symmetric manner into the environment of the experiment, in which subjects are sellers confronting a sequence of offers to buy, and we perform this translation when we evaluate its predictions. The proofs of the propositions in this section are given in Appendix C.

3.1 Search with an outside option

We begin by analyzing a sequential search model with an outside option, which can also be interpreted as an offer that was previously frozen. Consider a potential buyer of a good, who can receive a price offer in each of a sequence of T stages, indexed by \(t \in \{1,...,T\}\). Assume that price offers \((p_{t})_{t=1}^{T}\) are independently and identically distributed on \([0,\bar{p}]\), according to a continuous distribution F and are drawn sequentially. Each offer in a stage \(t > 1\) is drawn at a cost c (the first offer, at \(t = 1\), is free). The searcher is risk neutral and knows the price distribution. The price under the outside option available by withdrawing from the search is denoted by k.

In each stage t, the searcher chooses between (1) accepting \(p_t\), (2) rejecting \(p_t\), and continuing to the next stage after paying the search cost of c, or (3) taking the outside option k. If the player accepts an offer in stage t, she purchases the item and pays the current offer price \(p_t\). Denote by \(\tilde{R}_t^T(k)\) the expected payment that the searcher would make if she rejects at stage t and continues optimally thereafter, when the horizon is of length T. We call \(\tilde{R}_t^T(k)\) the post-freeze rejection payment. By “payment” we mean the expected price to be paid for the item and all expected search costs to be expended in subsequent stages, provided that the agent proceeds optimally. We suppress the dependence of \(\tilde{R}_t(k)\) on T to simplify the notation.

The post-freeze expected payment of the individual in stage t is denoted by \(\tilde{V}_t\left( p_t,k\right)\). This is what the individual expects to pay, in terms of both the price for the item and in search costs, if she makes optimal decisions from stage t onward. It is the minimum of the current price, the outside option, and the post-freeze rejection payment. Thus, when \(t<T\), we have

$$\begin{aligned} \tilde{V}_t\left( p_t,k\right)= & {} \min \{p_t,k,\tilde{R}_t(k)\} \end{aligned}$$
(1)
$$\begin{aligned} \tilde{R}_t(k)= & {} \intop _{0}^{\bar{p}} \tilde{V}_{t+1}(p_{t+1},k) dF(p_{t+1}) + c. \end{aligned}$$
(2)

Rejection in the terminal period T yields a payoff of 0, so that \(\tilde{V}_T\left( p_T,k\right) = \min \{p_T,k\}\).

The following functions h(x) and g(x) are useful in deriving some of our results.Footnote 13

$$\begin{aligned} h\left( x\right)= & {} \intop _0^{\bar{p}} \min \{ p', x \} dF(p') = \intop _{0}^{x}p'dF(p') + (1-F(x)) x \end{aligned}$$
(3)
$$\begin{aligned} g\left( x\right)= & {} x-h\left( x\right) = \intop _{0}^{x}F\left( p'\right) dp'. \end{aligned}$$
(4)

A threshold strategy is one in which there exists a cutoff price for every stage. The buyer accepts all offers below the cutoff, and rejects all prices above it. Consider the function h(x), as formulated in equation (3). The expression denotes the expected price paid in the last search period when an outside option is available. The first term describes the case of an offer lower than the outside option, in which case the offer is accepted. The second term corresponds to the case in which the offer is greater than the outside option, in which event the outside option is taken. Notice that \(h(\bar{p}) = E(p)\), so that if the buyer adopts the strategy of accepting any price offered, she accepts all offers and pays in expectation the average offer.Footnote 14

Assumption 1

\(c < g(\bar{p})\).

Let \(p^* = g^{-1}(c)\). It is well known that \(p^*\) is the stationary optimal reservation price in a sequential search model with no recall and an infinite horizon.Footnote 15 This price is the basis of our first proposition, which describes the optimal decision rule for the buyer in the presence of an outside option. The proposition states that if the outside option price is lower than \(p^*\), the buyer accepts the outside option in the first stage. If not, the buyer uses a reservation price strategy with a dynamic threshold that increases in each stage.

Proposition 1

When \(k \le p^*\), search ends immediately by accepting either \(p_1\) or k. When \(k>p^*\), an increasing reservation price strategy is optimal and k is either never chosen or chosen at the terminal stage.

Figure 1 shows the optimal strategy in p-space. The bold line indicates the optimal decision rule, the ranges of offers for which it is optimal to accept \(p_t\), settle for k, or reject \(p_t\). The left panel shows that the post-freeze rejection payment is always higher than the outside option if \(k \le p^*\). The right panel illustrates how, in the case where \(k > p^*\), post-freeze rejection payments increase over time as the end of the horizon approaches.

Fig. 1
figure 1

Post-freeze expected payment in p-space

We end this subsection by noting a limit result, which we use in Subsection 3.2. Namely, provided that \(k>p^*\), for any current stage t, as the number of future stages \(T - t\) becomes large, the post-freeze rejection payment approaches \(p^*\).

Proposition 2

The sequence \(\big (\tilde{R}_t ^T(k)\big )_{T=2} ^\infty\) converges uniformly to \(\tilde{R}_t ^\infty (k)=p^*\) for \(k>p^*\).

3.2 Endogenous freezing of offers

Now suppose that the searcher can, at any stage, purchase an option to freeze the current offer, in effect buying an outside option of the type described in the previous subsection. Assume that the searcher can freeze one offer by paying a fixed fee \(f>0\), which we will refer to as the freeze fee. Once one offer is frozen, a second offer may not be frozen.Footnote 16 Offers may not be unfrozen. There is no other outside option available. To analyze this situation, we introduce several functions that are analogous to those used in Subsect. 3.1. \(V_t(p_t)\) is the pre-freeze expected payment, \({R}_t\) is the pre-freeze rejection payment, and \(K_t(p_t)\), which we shall refer to as the freeze payment, is the expected payment when freezing \(p_t\) and continuing to the next stage with \(p_t\) as an outside option. Formally,

$$\begin{aligned} V_t\left( p_t\right)&= \min \{p_t, R_t, K_t \left( p_t\right) \} \end{aligned}$$
(5)
$$\begin{aligned} K_t\left( p_t\right)&= \tilde{R}_t (p_t) + f \end{aligned}$$
(6)
$$\begin{aligned} R_t&= \intop _{0}^{\bar{p}} V_{t+1}(p_{t+1}) dF(p_{t+1}) + c. \end{aligned}$$
(7)

We begin with a lemma stating the straightforward fact that the pre-freeze rejection payment is weakly lower than the worst possible post-freeze rejection payment.

Lemma 1

\(R_t \le \tilde{R}_t (\bar{p})\) for all \(t<T\)

Next, we make the assumption that the freeze fee is sufficiently low to guarantee the existence of prices that are frozen. To assure this, we assume that the fee is low enough so that there exists some price that would be frozen in the next-to-last stage:

Assumption 2

The freeze fee f satisfies

$$\begin{aligned} f< & {} \intop _{h(\bar{p})+c}^{\bar{p}} [p-(h(\bar{p})+c)]dF(p) \end{aligned}$$
(8)
$$\begin{aligned}= & {} h(\bar{p}) - h( h( \bar{p}) + c) \end{aligned}$$
(9)

To gain intuition for this condition, notice that it can be written as \(K_{T-1}(R_{T-1})<R_{T-1}\). This means that at \(T-1\), it is better to freeze an offer equal to the pre-freeze rejection payment than rejecting. We now turn to the optimal strategy of the consumer when a PFO is available. We use the following terminology to describe the optimal rule.

Definition 1

Under a reservation rule (RR), in stage t,

$$\begin{aligned} V_t(p_t) = {\left\{ \begin{array}{ll} p_t &{} \text{ if } p_t<R_t\\ R_t &{} \text{ otherwise. } \end{array}\right. } \end{aligned}$$
(10)

Under a double reservation rule (DRR), in stage t,

$$\begin{aligned} V_t(p_t) = {\left\{ \begin{array}{ll} p_t &{} \text{ if } p_t<a_t\\ K_t\left( p_t\right) &{} \text{ if } p_t\in \left[ a_t,b_t\right) \\ R_t &{} otherwise \end{array}\right. } \end{aligned}$$
(11)

where \(0<a_t<b_t<\bar{p}\). A subset of stages in which a RR is used is denoted by \(T^{RR}\subseteq T\) and a subset of stages in which a DRR is used is denoted by \(T^{DRR}\subseteq T\).

A RR specifies a stage-specific threshold price below which offers are accepted and above which they are rejected. A DRR specifies a stage-specific threshold below which offers are accepted, another one above which offers are rejected, and an intermediate range between the two thresholds, in which offers are frozen. We define a strictly increasing RR sub-policy to be one in which \(R_t<R_{t'}\) for \(t<t'\), \(\{t,t'\} \subseteq T^{RR}\). Similarly, we define a strictly increasing DRR sub-policy to be one in which \(a_t<a_{t'}\) and \(b_t<b_{t'}\) for \(t<t'\), \(\{t,t'\} \subseteq T^{DRR}\). That is, a DRR is strictly increasing if the acceptance region strictly increases and the rejection region strictly decreases over time.

The following lemma allows us to restrict ourselves to particular strategies when solving for the optimal policy. The lemma shows that the consumer will always use either a RR or a DRR at every stage.

Lemma 2

Under the optimal solution, \(t \in T^{RR} \cup T^{DRR}\) for all \(t<T\).

The following lemma is a monotonicity result which is useful for characterizing the optimal solution. It shows that the range of offers that is rejected decreases over time.

Lemma 3

Any offer that is rejected in some period t is also rejected in period \(t-1\). That is,

$$\begin{aligned} V_{t}(p) = R_{t} \implies V_{t-1}(p) = R_{t-1} \end{aligned}$$
(12)

In Proposition 3, we describe the structure of the optimal policy for the consumer to follow.

Proposition 3

In a given stage t, as long as an offer was not frozen before, the optimal policy consists of a strictly increasing RR for \(t \le t^*\) and a strictly increasing DRR for \(t>t^*\), where \(0\le t^*<T-1\). If an offer was frozen before stage t, then the optimal policy follows an increasing RR thereafter. In period T it is optimal to accept \(p_T\) or the previously frozen offer in the event that such an offer exists, whichever is lower.

Thus, the solution can take two forms. When \(t^*=0\), the optimal policy is an increasing DRR. When \(t^*>0\), an increasing RR is optimal in stages 1 through \(t^*\) followed by an increasing DRR thereafter. We now present a final result, which states that when the horizon is sufficiently long, there will be at least one stage at the beginning of the search sequence where it is optimal not to freeze any offer.

Proposition 4

There exists a \(T^*<\infty\) such that for \(T>T^* \implies t^*>0\).

The left panel of Fig. 2 illustrates a stage in which a DRR is optimal (\(t\in T^{DRR}\)). The pre-freeze expected payment for the stage is given by the lower envelope of the functions \(p_t, K_t(p_t)\), and \(R_t\). The figure illustrates how \(a_t\) is a cutoff value, above which it optimal to freeze an offer, and below which it is optimal to accept. \(b_t\) defines a similar threshold between freezing and rejection as optimal actions. The right panel plots the solution at \(t=1\) when \(T \rightarrow \infty\), where we must have \(1\in T^{RR}\).

4 Experimental design

4.1 General structure

The sessions were conducted in the Economic Science Laboratory at the Eller College of Management of the University of Arizona in late 2016. All subjects were undergraduate students at the university. A total of 177 subjects participated in the experiment and the number of individuals present varied across sessions. The experiment consisted exclusively of individual choice tasks. Upon arrival, a first set of instructions, which pertained to the risk elicitation protocols, was read aloud by the experimenter. Subjects then performed the protocols. Upon completing these two tasks, a second set of instructions, describing the 180 search problems participants were about to face, were read aloud. Afterward, the main part of the experiment began.Footnote 17 These were followed by the main part of the experiment, which consisted of 180 search problems that had the possibility of counting toward earnings.Footnote 18 The session concluded with a brief questionnaire. Subjects could complete the sequence of tasks at their own pace. The entire sequence of tasks took between 60 and 100 minutes to complete. The experiment was programmed using Z-tree (Fischbacher (2007)).

Fig. 2
figure 2

Example of a DRR, and of behavior as \(T \rightarrow \infty\)

The 180 search problems that could count toward subjects’ earnings were divided into three equal blocks of 60 search problems, as described in Subsection 4.2. There was a mandatory two-minute pause between each block of 60 trials to allow participants to rest. There was also a requirement that subjects stay in the laboratory for at least one hour, to prevent them from completing the task as rapidly as they could in order to leave the session early.Footnote 19 Subjects were paid for one randomly selected task (either one of the two risk measurement tasks or one of the 180 search problems), plus a $5 show up fee.Footnote 20 Each of the 182 tasks was equally likely to be selected to count toward participants’ earnings.

There are two notions of time in the experiment. We will use the term stage to refer to each time the subject must make a decision on an offer (as this term was used in Sect. 3), whereas the term round refers to an entire search problem, which consists of a sequence of stages. We use the terms round and sequence interchangeably. Our experiment, therefore, includes 180 rounds, and each round consists of multiple stages.

The subjects in the experiment are potential sellers of a fictitious item. In each stage of a round, subjects receive an offer, drawn from a discrete uniform distribution on \(\{0, \ldots , 1000\}\), in which each of the 1001 integers in the range is equally likely. A search cost of c = 10 is paid for every offer (except for the offer in the first stage of each round). The offers in each stage are independent of those in preceding or subsequent stages. A player may accept an offer at any stage. If she accepts an offer, she receives the offer price minus the accumulated search costs within the round. The round ends when an offer is accepted. If she rejects the offer in a given stage, the round continues to the next stage. Rejection is not possible in the terminal stage of a round. Offers and costs are denominated in terms of an experimental currency, which is convertible to US dollars at a rate of 70 to 1.

4.2 The three 60-period blocks

Every subject faced the exact same 180 offer sequences. These 180 sequences consisted of 60 sequences that were drawn in advance. The sequences were repeated three times, with a modification that we describe below. Thus, there were three blocks of 60 rounds. The environment was identical for all individuals, except for the exogenous variations in the freeze fee, horizon length and the recall probability (denoted as f, T and q respectively), described in the next subsection. The number of stages T and the recall probability q were varied between subjects, while f was varied within subjects across blocks. Each 60-period block was preceded by four practice periods.

The first block of sequences, those employed in rounds 1–60, were random independent draws from \(U\{0, \ldots , 1000\}\). In the second and third blocks, subjects face perturbed offers. These are created by adding a random, relatively small, integer to the corresponding offers from the first block.Footnote 21

4.3 The treatments

In different treatments, we vary the freeze fee f, the time horizon T, and the recall probability q. The freeze fee is varied within-subject across blocks. Under the Low Freeze Fee (Lo) condition, individuals must pay 10 to freeze an offer, and under High Freeze Fee (Hi) they must pay 40. In a third condition, No Freezing (No) freezing is not possible, so that the freeze fee can be thought of as infinite. In some sessions, the freeze fee is varied in ascending order across the three blocks; Lo in the first 60 rounds of the search task, Hi in rounds 61–120, and No in the last 60 rounds. In other sessions, the reverse sequence is in effect, and the fees appear in descending order across the three blocks. The cost c of generating an offer in the next stage is always 10.

T and q are varied between subjects. The time horizon is fixed at \(T = 4\) in some sessions and \(T = 10\) in others. The recall probability is set to \(q = 0\) in some sessions (No Recall or NR conditions) and \(q =.5\) (referred to as Imperfect Recall or IR) in others. When \(q =.5\), recall during a given stage of a sequence is possible with probability .5. When recall is possible in stage t, the highest offer from stages 1 to t - 1 may be recalled and accepted. Whether recall is available within a given stage of a sequence is independently drawn in each stage. For example, recall of prior offers from the first two stages may not be possible when facing the third offer in a sequence, but may be possible when facing the fourth.Footnote 22

Thus, the experiment has a 2 x 2 x 3 structure. We refer to the treatments in an abbreviated form by the time horizon (4 or 10 periods), whether recall was not possible or imperfect (IR or NR), and the level of the freeze fee (Lo, Hi or No). For example, in Hi10IR, subjects could sample up to 10 offers in a round, the high freeze fee of 40 was in effect, and there was a recall probability of .5. Our hypotheses concern comparisons between different levels of f and T, which are evaluated under No Recall and Imperfect Recall separately.

Table 1 provides some information about the participants in each treatment, in terms of gender distribution, number of years of university study, the number of previous economic experiments and the percentage of participants who studies economics or business. Our sample is relatively experienced in participating in experiments. This is reassuring as this increases our confidence that subjects are aware of the direct positive relationship between understanding the instructions and expected earnings. Participant earnings averaged $15.30 with a standard deviation of $3.66. The sessions averaged approximately 1 hour and 25 minutes in duration.

Table 1 Subject numbers and characteristics in the different treatments

4.4 Optimal decisions

The optimal strategy, using the parameters described in Subsection 4.3, in each of the No Recall and Imperfect Recall treatments, are illustrated in the panels on the left and right sides of Fig. 3, respectively. In each of these panels, we plot the solution for one level of f, when no offer has yet been frozen, for periods 1 to T\(-1\).Footnote 23 The black region denotes offers that are accepted, gray stands for offers that are frozen, and offers in the white area are rejected. For the IR treatments, we plot the solutions for the case in which the highest offer seen so far is zero, implying that recall is not available (so that the value of recalling enters through the freeze and pre-freeze rejection payments only).

The figure shows that the optimal thresholds under No Recall are monotonic and concave. The acceptance and freezing regions are increasing over time. More freezing occurs when the freeze fee is low. For example, in the Lo conditions (\(f=10\)), the freezing region is already larger than 10% of the range of possible offers in stage \(T-9\), whereas under Hi the freezing region surpasses 10% only in stage \(T-5\). The horizon \(T = 10\) is long enough so that no freezing is predicted to occur in the initial stage. In the Imperfect Recall case, optimal thresholds are also monotonic, and are concave once the DRR sub-policy is in effect.

Fig. 3
figure 3

Optimal policy with \(p\sim U[0,1000]\). Optimal behavior, according to the model, for the different treatments. The left panels correspond to the case of No Recall. The right panels are for Imperfect Recall, assuming that in each stage the highest offer observed so far is zero, so that the impact of recall is through the freeze and pre-freeze rejection payments when freezing or rejecting. Each row plots the solution for a different freeze fee condition. Offers are accepted in the black area, frozen in the gray area, and rejected in the white area. In treatments where \(T = 10\) \((T = 4)\), the data from \(T - 9\) \((T - 3)\) onward are applicable

4.5 Hypotheses

The hypotheses guiding the design of our experiment are derived from the theoretical results presented in the Sect. 3 for the No Recall treatments, and from the computation of the optimal solution for the Imperfect Recall treatments, as described in Appendix B. We solve the models for each of our treatments using the offer and recall realizations faced by subjects in the experiment. Table 2 presents the resulting mean search lengths, the percentage of rounds in which it is optimal to freeze an offer, the average earnings of the searcher (which equal 1000 minus the price paid minus total search costs minus the freeze fee if an offer is frozen) and the other party’s surplus (the earnings of a hypothetical individual on the other side of the market, whose surplus equals 1000 minus any offer accepted, plus any freeze fee paid). The numbers in the table reveal some of the comparative statics of the model. On average, a higher freeze fee results in shorter search and less freezing. It can also be seen from the table that increasing the horizon T results in longer search and higher earnings. The effect on search length of changes in the freeze fee and horizon length constitute our Hypothesis 1. Hypothesis 2 concerns use of the freeze option, and asserts that it is more common when the freeze fee is lower and when the horizon is shorter.Footnote 24

Within Hypothesis 2, our model makes a prediction regarding the effect of freezing that at first glance may seem counterintuitive. One’s intuition might be that, just like a financial option, freezing is more valuable when there is more time remaining during which one can accept the frozen offer, and thus for a given freeze fee, one might be more likely to pay to freeze an offer when there are more stages remaining. However, the optimal policy has the property that freezing is more likely when the horizon is relatively short (4 periods) than when it is long (10 periods). We interpret support for this prediction as strong evidence in favor of our model. Our third hypothesis is that individuals employ the optimal RR-DRR policy. That is, they make acceptance, freezing, and rejection decisions that are consistent with the model. They employ the thresholds depicted in Fig. 3.

Table 2 Model predictions, by treatment

Summarizing, the experiment is designed to test the following hypotheses. If all three hypotheses are supported, we would conclude that our model is strongly supported.

Hypothesis 1::

Search length is longer when the freeze fee f is smaller and the time horizon T is greater.

Hypothesis 2::

Freezing is less frequent when the freeze fee f is greater and the time horizon T is greater.

Hypothesis 3::

Individuals employ the optimal RR-DRR policy.

5 Results

5.1 Summary statistics

Figure 4 illustrates some general patterns in the data from each of the twelve treatments. This figure shows the means and percentages of key variables in the different treatments, aggregated over subjects and rounds, compared to the theoretical predictions (represented by circles and xs, respectively). The data displayed are the average search length (panel 4a), the percentage of rounds in which an offer has been frozen (panel 4b) and the earnings per round (panel 4c)). In panel 4b, the different shades indicate the eventual fate of frozen offers, whether they are accepted in the last possible round, taken in a round other than the last, or not accepted at all.Footnote 25

Panel (a) shows that search length tends to be greater for smaller f and larger T under both No Recall and Imperfect Recall. Search length tends to be shorter than predicted. Panel (b) shows that there are more offers frozen under T = 4 than T = 10. These patterns are consistent with our model. While under T = 4, the vast majority of acceptances of frozen offers are in the terminal stage, as predicted, this is not the case for the T = 10 treatments. This raises some doubt about the validity of the model’s prediction that frozen offers are only accepted in the terminal round. Panel (c) reveals that earnings are greater when the time horizon is longer, as the model predicts.

5.2 Treatment effects

In thie section, we consider the effect of treatments in the No Recall treatments. The estimated treatment effects on search length, freezing usage and earnings are reported in Tables 3 and 4. In Table 3, we regress each of these dependent variables on six treatment dummy variables, corresponding to each of the No Recall treatments. The regression has no constant and standard errors are clustered by subject. Therefore the coefficients can be interpreted as treatment averages. In each row, there are four entries: (1) the estimated coefficient, (2) the standard error, (3) the value predicted by the model (as given in Table 2), and (4) the p-value from a t-test for the equality of the coefficient to its predicted value.

In Table 4 we use the results of Table 3 to conduct hypothesis tests for each of our directional hypotheses. The first column contains the null hypothesis, the second column is its interpretation in terms of the coefficients from the regression in Table 3 and the third column presents the p-value of the hypothesis test. To correct for mutiple hypothesis testing, we require a Bonferroni correction factor of 19 for the 19 directional hypotheses in the table so that we consider a result as significant at \(p<\).05/19 =.0026. To correct for testing the three hypotheses of no treatment differences, we consider \(p<\).05*3 =.15 as a conservative threshold to reject the hypothesis of no difference. Table 4, along with the patterns shown in Fig. 4, provide the basis for our first four results.

Fig. 4
figure 4

Within-Treatment Means and Frequencies. This figure contains the observed within-treatment means and frequencies for key variables (as circles) and the theoretical predictions (as xs). Panel (b) also indicates whether frozen offers were accepted later and whether the acceptances occur in the last stage

Table 3 The effect of treatment on earnings, search length, and usage of the freezing option

Result 1

The length of search is (1) decreasing in the freeze fee and (2) increasing in the horizon length, as predicted by the model. Search length is lower than predicted in most treatments.

Support for Result 1: The tests reported in Table 4 show that search length exhibits the predicted treatment differences as T changes in all three treatment comparisons (tests 10–12). It moves in the predicted direction as f changes in 4 of 6 cases (tests 13–18). Table 3 shows that search length is significantly less than the model prediction in all treatments except for Hi4 and No4. \(\Box\)

Table 4 Hypothesis tests of treatment differences

Result 2

The usage of the freezing option decreases in the freeze fee and the horizon length, as predicted by the model. There is less freezing than predicted by the model. When T = 4, the large majority of frozen offers are accepted in the last stage, as predicted, whereas this is not the case when T = 10.

Support for Result 2: The last four tests in Table 4 indicate that the incidence of freezing is significantly lower when T = 10 than when T = 4. It is also decreasing in the freeze fee f and the effect is significant when T = 10. The last column of Table 3 shows that there is significantly less freezing than predicted in every treatment. Figure 4 shows that under T = 4, the majority of the acceptances of frozen offers do occur in the last stage, as the model predicts. \(\Box\)

Result 3

Earnings are affected by freeze fees when T = 4, but not when T = 10, as predicted by the model. Earnings are 96.6% of the optimal level.

Support for Result 3: The first nine tests of Table 4 show that eight out of nine treatment effects are in the same direction as those that the model predicts. Tests (4)–(9) reveal that earnings are affected by freeze fees when T = 4, but not when T = 10. Tests (1)–(3) in the table show that earnings are greater in the T = 10 than in the T = 4 treatments, as the model anticipates. The estimates in the first column of Table 3 indicate the average earnings compared to the model prediction. Averaging the data across treatments reveals that earnings average 96.6% of the predicted level. \(\Box\)

As indicated earlier, some previous studies have found under-searching (searching for fewer stages than is optimal) relative to the optimal length of search when no freezing is possible. As summarized in Result 4, this general tendency, though modest, also appears when freezing is possible. Our next result is that low freeze fees tend increase the tendency to under-search.

Result 4

The extent to which individuals under-search decreases in f. That is, the availability of an affordable freezing option amplifies under-searching.

Support for Result 4: The left half of Table 5 presents the average number of stages that individuals under-search. In all treatments, except for No4, there is under-searching, albeit by negligible amounts in some treatments. In the T = 10 treatments, the under-searching ranges between 0.39 and 0.74 stages, with the extent of under-searching monotonically decreasing in f.

To shed light on heterogeneity in under-searching, we classified whether each of our subjects was an under-searcher. We ran one-tailed Wilcoxon signed-rank tests, one for each subject, where the null hypothesis is that the observed search length is equal to or higher than the predicted search length. The right panel of Table 5 presents, for each treatment, the percentage of subjects for whom the null hypothesis is rejected at a 0.05 significance level. Thus, the table can be interpreted as the share of under-searchers in each treatment. The share of under-searchers is greater, the lower the freeze fee, and the longer the time horizon T.Footnote 26\(\Box\)

Table 5 Average number of stages under-searched and percentage of individuals who are under-searchers

5.3 Individual responses to changes in freeze fees

Our experimental design allows us to examine the effects of a higher freezing fee, within-subject. Tables 10–12 in Appendix F present augmented confusion matrices, portraying the predicted effects of freeze fee variation on search length, PFO usage and earnings, respectively. Since we have paired observations, the comparisons are for the same round and thus involve the same sequence of offers (modulo the minor perturbation) and the same person, and only the freeze fee varies. For each freeze-cost variation (given in the first column), we report in the third column the overall percentage of rounds in which this variation was predicted to yield a positive, negative or no effect on the outcome variable. Then, in the next three columns we report the percentage of rounds where the outcome variable increased, decreased, or stayed the same in the instances in which the model made each possible prediction. The last column reports, out of the observations where either a positive or negative change was observed, the percentage where the change was as predicted. This last column aims to measure the extent to which subjects’ behavioral adjustments are in the same direction as the theory predicts. The unit of observation is the behavior of an individual subject in one of the 60 matched pairs of sequences.

Table 10 shows the percentage of changes in search length that are consistent with the model predictions. The percentage of all changes that are in the predicted direction range between 51.7% and 90.2%. The most common deviation is a failure to change one’s search length when a change is predicted. Thus, there is under-responsiveness to treatment variation, in that there are more zero changes observed than are predicted. However, when there is responsiveness it tends to be in the direction predicted by our model as only 8.15% of all decisions involve changes that are in the opposite direction of the model. Table 11 shows the data for freezing usage. It shows that over 90% of the changes between treatments are consistent with model predictions, and that the principal deviation is a failure to adjust at all when a change is predicted. The data for earnings are shown in Table 12. They show a similar pattern in that most decisions involve no change in earnings even when a change is predicted. Most changes are in the correct direction, with cells indicating a majority in the incorrect direction involving relatively few observations.

5.4 Do individuals use optimal reservation price strategies?

In this section, we evaluate Hypothesis 3, which concerns the use of the optimal RR-DRR policy. Figure 5 illustrates the fraction of offers of different magnitudes that have been accepted and frozen in the first stage of each round under each treatment. Behavior in the first stage provides the most stringent test of our model, since the backward induction task involved in optimizing in the first stage is the most demanding among all of the stages.Footnote 27 Each panel corresponds to one treatment condition. The graphs show the pooled data from all individuals. Each dot denotes the percentage of offers at the value indicated on the horizontal axis that were accepted. The x symbol indicates the percentage of offers at different levels that were frozen.

Our model predicts that all offers that are below a certain threshold are rejected. In treatments with no freezing, shown in the bottom two panels, it is predicted that all offers at or above this threshold are accepted. In some of the conditions in which freezing is possible, given in the other four panels, there is one threshold distinguishing offers that are rejected and those that are frozen (the reject-freeze threshold), and another cutoff dividing those offers that are frozen and those that are accepted (the freeze-accept threshold). In each panel, the predicted thresholds are denoted with vertical lines.

First consider acceptance decisions. We use nonlinear least squares minimization to fit the plotted acceptance frequencies to the logistic function

$$\begin{aligned} l(x;m,r) = [1+\exp (-r(x-m))]^{-1} \end{aligned}$$
(13)

where m and r are parameters representing the midpoint and the curvature, respectively. Under each panel in Fig. 5, we report the fitted values of m and r, and a 95% confidence interval in parentheses. Large curvature is evidence for the use of a threshold strategy in the acceptance decision, because it indicates that the acceptance probability increases very rapidly at or near a particular value. We interpret large curvature, along with a midpoint m close to the optimal threshold, as evidence supporting the reservation strategy indicated by the model. The logistic functional form has the feature that it permits more deviations from the optimal strategy for offers close to the threshold. This means that errors are less likely, the more costly that they are. Note that as observations in this figure are aggregated across individuals, the results we report pertain to a representative consumer, potentially masking heterogeneity in underlying individual behavior.

In the treatments where there exist offers that are predicted to be frozen in the first stage, we apply a similar strategy to evaluate the threshold between the regions of rejection and freezing. For fitting the logistic function to the freezing fractions, we consider all offers below the offer for which the freezing frequency is highest.

The observed patterns can be summarized as the following result.

Result 5

Aggregated acceptance and freezing decisions in stage 1 are well approximated by a logistic function. However, the estimated acceptance thresholds are typically lower than predicted by the model. The freezing thresholds are close to predicted levels.

Support for Result 5: The figures reveal some strong and consistent patterns. The first is that the percentage of offers in excess of the theoretical acceptance thresholds that are indeed accepted is very high. In other words, in almost all instances in which acceptance of the current offer is predicted, acceptance occurs. The second is that the acceptance probabilities are described very well by a logistic function. The third is that the estimated acceptance thresholds are modestly, though significantly, lower than the predicted level in all treatments. This means that there is a greater tendency to accept offers below the theoretical cutoff for acceptance than to reject offers that are above the cutoff. This corresponds to the early termination of search on average. Fourth, the probability of an offer being frozen is much higher in the range in which it is predicted than when it is not predicted, though it does not reach a level greater than .8 for any range of offers in any treatment. Fifth, a logistic specification fits the relationship between the probability of freezing and that of rejecting quite well in the treatments in which freezing is predicted to occur. Sixth, the estimated thresholds between rejection and freezing are very close to the models’ predictions, which lie within the 95% confidence interval in Lo4, and just outside of it in the other two treatments in which freezing is predicted. \(\Box\)

5.5 Alternative models

We now consider whether adaptations of four models of decision making that have been proposed in previous literature outperform our model, presented in Sect. 3, which assumes optimal decision making under risk neutrality. We recognize that the models were proposed for environments without freezing, and in some cases, for settings that differed in other ways. Thus, any failure of one of the models to explain patterns in our data does not suggest that they are not appropriate in other environments. The four models are expressed for the situation in which the searcher is a seller, as in our experiment.

The first model we consider is in the spirit of the Regret Model developed by Loomes and Sugden (1986) and applied to sequential search by Weng (2009). Under this model, the individual incurs a disutility cost when she accepts an offer that was less favorable than the best offer that she has previously rejected. Specifically, the individual incurs a disutility \(\beta (x - y)\) if she chooses an alternative that yields pecuniary payoff x rather than another which would have resulted in a greater payoff y. In other words,

$$\begin{aligned} u(x, y) = x - \beta \textbf{1}\{y>x\}(y-x) \end{aligned}$$
(14)
Fig. 5
figure 5

Empirical frequencies of decisions in the first stage. The plots contain empirical frequencies of accepting and freezing in the first stage of each round for the six No Recall treatments. We fit a logistic function to the acceptance frequencies by nonlinear least squares minimization. We also fit a logistic function to the freezing frequencies for offers that are at or below the offer for which the freezing frequency is highest. Vertical lines represent the decision thresholds implied by the model

Applied to our setting, in which x is the offer the searcher accepts and y is the best offer foregone, the rejection and freezing payments are:

$$\begin{aligned} R_t(\check{p}_t)&= \intop _0^{\bar{p}} \max \{ u(p_{t+1}, \check{p}_t ), K_{t+1}(p_{t+1}, \check{p}_t), R_{t+1}(p_{t+1}) \} dF(p_{t+1}) - c, \end{aligned}$$
(15)
$$\begin{aligned} \tilde{R}_t(k, \check{p}_t)&= \intop _0^{\bar{p}} \max \{ u(p_{t+1}, \max \{ \check{p}_t, k \}), u(k, \check{p}_t), \tilde{R}_{t+1}(k, \check{p}_t) \} dF(p_{t+1}) - c, \end{aligned}$$
(16)
$$\begin{aligned} K_t(p_t, \check{p}_t)&= \tilde{R}_t(p_t, \check{p}_t) - f, \end{aligned}$$
(17)

where \(\check{p}_t = \max \{ p_1, \ldots , p_t \}\). Under this specification, the possibility of regret lowers the value of accepting, freezing, and rejecting compared to the risk neutral model. However, compared with the risk neutral optimum, it lowers the value of accepting the most and rejecting the least. Thus, it can be shown that regret of this form leads to longer searches than under our model. One intuition for this is the following. In our model, individuals become less picky over time, so that they will sometimes be in a situation in which they are accepting an offer that they turned down at an earlier stage. However, an individual who feels regret incurs an additional cost when accepting such an offer, making her less likely to do so. The tendency to reject these offers serves to lengthen the searches of those who experience regret. This behavior is at odds with our data, and we do not discuss this model further.

The second model we consider is what we shall term the Cognitive Acquisition Cost (CAC) model. This model is inspired by the work of Gabaix et al. (2006), who modeled under-search over a finite horizon as a consequence of applying backward reasoning for an insufficient number of stages. We consider three versions of this model. The first version assumes that the choice made in stage \(t<T\) is the optimal decision for stage min\(\{t + z, T - 1\}\), with \(z = 1\). That is, decisions taken at stage \(t < T\) are those that would be optimal at stage \(t + 1\), as long as \(t+1<T\). The second version of the Cognitive Acquisition Cost model is similar, except that the searcher behaves optimally under the assumption that there are T - 2 stages. Thus, the choice made in stage \(t<T\) is the optimal decision for stage min\(\{t + z, T - 1\}\), with \(z = 2\). The third version assumes that the individual always behaves as if it were stage \(T - 1\), which is the assumption in Gabaix et al. (2006). Each of the three versions assumes a different type of failure of backward reasoning. The first two are consistent with applying an insufficient number of steps of backward reasoning, by 1 and 2 periods, respectively. The third version is consistent with the capacity to reason backward for only one step. The three versions of the model are evaluated here in this subsection, with some additional detail provided in Appendix G. The conclusion from the analysis is that none of the adaptations of the Cognitive Acquisition Cost Model predict the decisions made by our participants as effectively as our model assuming risk-neutral and optimal decisions. The model predicts more freezing then the risk-neutral model, since the probability of freezing increases in later stages under the optimal policy, while we observe the opposite pattern in our data.

The third model is that of Kogut (1990), who proposes that agents under-search because they are susceptible to a type of sunk cost fallacy. We refer to this model as the Sunk Cost Fallacy (SCF) model. Under the SCF model, searchers use the total costs incurred to date rather than the marginal cost of searching for an additional stage when they make their decisions, as if they do not realize that previous costs are sunk. Specifically, they behave as if their freezing and rejection payments are given by:

$$\begin{aligned} \tilde{R}_t(k)&= \intop _{0}^{\bar{p}} \tilde{V}_{t+1}(p_{t+1},k) dF(p_{t+1}) + tc + f, \end{aligned}$$
(18)
$$\begin{aligned} \tilde{K}_t(p)&= \tilde{R}_t(p), \end{aligned}$$
(19)
$$\begin{aligned} R_t&= \intop _{0}^{\bar{p}} V_{t+1}(p_{t+1}) dF(p_{t+1}) + tc. \end{aligned}$$
(20)

Thus, the expected costs of rejecting the current offer and obtaining a new one are always perceived as the accumulated search costs of all stages up to the present, tc. Because the individual perceives the cost of continuing the search as greater than under optimal decision making, searches tend to terminate earlier.

The fourth model we examine assumes optimal behavior under risk aversion. This has been previously proposed as an explanation for under-searching (see for example Cox and Oaxaca (1989)). We model risk aversion using the CRRA utility function \(u(x) = \frac{x^{1-\alpha }}{1-\alpha }\) and assume that subjects integrate their earnings and costs in each stage in the following manner. Accepting an offer of p yields u(p). The other actions, namely rejecting, freezing and rejecting while having an offer k frozen, respectively, yield:

$$\begin{aligned} R_t= & {} \intop _0^{\bar{p}} \max \{ u(p_{t+1}), K_{t+1}(p_{t+1}), R_{t+1} \} dF(p_{t+1}) - u(c), \end{aligned}$$
(21)
$$\begin{aligned} K_t(p_t)= & {} \tilde{R}_t(p_t) + u(c) - u(c+f), \end{aligned}$$
(22)

and

$$\begin{aligned} \tilde{R}_t(k) = \intop _0^{\bar{p}} \max \{ u(p_{t+1}), u(k), \tilde{R}_{t+1}(k) \} dF(p_{t+1}) - u(c). \end{aligned}$$
(23)

We evaluate the alternative models, in comparison to the risk neutral model, by considering only those rounds in which there are differing predictions for the first stage. For each model and every treatment, Table 6 reports the proportion of decisions in these instances that were correctly predicted by our risk neutral model, the remaining proportion that was predicted by the alternative model, the p-value from testing for a difference in proportions between the two models, the number of rounds for which first round predictions differ between the two models and the corresponding number of observations (the number of participants in the treatment times the number of rounds in which the two models make competing predictions). We report only those model-treatment combinations for which there are at least five rounds of competing predictions, limiting the extent to which our conclusions are driven by the particular sequences observed by subjects (the treatment-pairs with fewer than five rounds differing are included Table 14, provided in Appendix F). For the model with risk aversion, we take the value of \(\alpha =.3\), which is close to typical estimates reported in the literature (Holt and Laury (2002); Noussair et al. (2014)). In all treatments, the risk neutral model outperforms the model assuming risk aversion and the three versions of the CAC model. It also does better than the SCF model in two of three comparisons. Overall, none of the alternative models presented here outperforms the model we have proposed in Sect. 3.

Table 6 Comparative performance of the optimal risk neutral model against alternative models

6 Discussion

We have analyzed the effect of an option to freeze price offers on the behavior of agents engaged in sequential search. Our model generates a number of predictions, which serve as hypotheses for our experiment. The model predicts that the existence of the freeze option and the length of the time horizon available increase search length. Lower freeze fees also increase search length. These patterns are observed in the data. Furthermore, as predicted, the usage of freezing decreases as it becomes more expensive and as the time horizon increases. Our results regarding freezing are robust to having imperfect recall, though the impact of the presence of a freeze option is more pronounced when recall is impossible. Individuals have a strong tendency to behave as if they employ threshold strategies with regard to the range of offers that they accept, freeze and reject. The frequency of accepting offers increases sharply at specific threshold levels. These levels tend to be at or somewhat below those predicted by our model. Freezing is most common in the range of offers for which it is predicted. Our overall interpretation is that the model is quite successful in predicting behavior and outcomes. In our view, this result provides a behavioral foundation for the assumption of optimality in search behavior in structural modeling of demand in markets featuring consumer search.

The main deviation from optimal behavior is a modest tendency toward under-search, the termination of the search earlier than predicted. Under-searching relative to the risk-neutral optimum has been a prominent finding in the experimental search literature. Beyond documenting that it continues to appear when freezing is possible, we document that affordable PFOs do increase the extent of under-searching. Nevertheless, this under-searching leads only to small losses in earnings.Footnote 28

Several other avenues for future research come to mind. On the empirical side, PFO data, which is increasingly available online, along with data on behavior and/or prices, can be used to structurally estimate demand, assuming decision making that is as in the model presented in the current paper. To that end, our model can be of use even though that it allows only for one offer to be frozen, provided that one can show evidence that freezing multiple offers is rare. This is particularly likely to be the case when typical consumers search very little, as in, for example, Moraga-Gonzalez et al. (2018), or in settings where search or freezing carries a high cost.

Our theoretical model could be extended to allow the freezing of multiple offers, or the possibility of freezing offers that expire before date T. The price of the PFOs could also be modeled as proportional to the underlying price of the good. Similar experiments could be conducted, but augmented to include sellers. Studying both sides of the market would allow for the evaluation of long run welfare effects. Finally, policymakers and firms may want to experiment with introducing PFOs for wage offers in labor markets.