Introduction

Rogers (1962) describes the theory of Mimetics as a biological predisposition to learn through the emulation of observed behavior. Babies mimic the facial expressions of adults long before they understand the intricacies of the emotions those facial expressions convey. This type of social learning through imitation is one of the earliest forms of learning and never leaves us, even as adults (Marsden 1998). Since mimetic behavior is conducted on a sub-conscious level, people are unaware, and often do not believe, they can be readily influenced by simply observing the behavior of others.

In the context of information cascades, Bikhchandani et al. (1992, 1998) describe that in circumstances where private information is incongruent with publicly observed behavior in the stock market, it is perfectly rational to abandon one’s private information in favor of the collective intelligence of the market. Extending the argument, in situations where the best course of action is not obvious, individuals will look to others before making decisions, and will often base those choices on observed behavioral cues.

Adopting the behavior of those around us can be done for several reasons. Bearden et al. (1989) suggest individuals adopt behavior for either information or social reasons. In situations where an individual believes the group possesses a collective information advantage, the person will follow the herd in the hope of obtaining the group’s superior information. At other times, an individual will follow the herd in order to fulfill a sub-conscious need to fit into a greater social network. Informational gains and social acceptance are two of the components that combine to create utility in individuals.Footnote 1

In the mortgage markets, an economic default describes a situation when a homeowner defaults on his mortgage due to an inability to make monthly payments, whereas a strategic default is said to occur when a homeowner makes the conscious choice to default on his mortgage when he is fully capable of making his monthly payments. Wheaton and Nechayev (2011) posit that strategic defaults are an innocuous and even necessary clearing mechanism to restore balance in the housing markets.Footnote 2 Alternatively, Wyman (2010), White (2010), FICO (2011), and Guiso et al. (2011) vehemently argue that the increase in strategic defaults is a major contributor to the foreclosure epidemic and resulting stalled economy.Footnote 3 Hence, the purpose of this study is to investigate if/how individuals mimetically herd following the observed behavior of their peers as it relates to the decision to strategically default on one’s mortgage.

We find that people’s a priori behavior is significantly altered by observing the actions of their peers, and is pronounced in the presence of a Maven, or real estate expert. Conversely, mimetic herding is significantly mitigated when the individual holds a strong moral objection to strategic default, and is not influenced by signal strength. Finally, people tend to herd more for information gains as opposed to social reasons.Footnote 4

Literature Review

The concept of mimetic herding dates back to Roman times when wealthy families hired individuals to mourn at funerals to create a greater sense of loss due to the passing of the deceased (Bikhchandani et al. 1998). In modern times, knowing that people are genetically predisposed to mimic the behavior of others, comedians plant friends in the audience who laugh early and loudly in the hopes of whipping the crowd into a tickled frenzy. Restaurant owners often close off the back rooms to create the illusion that the food is so good the establishment is filled to capacity. Even the U.S. Navy recognizes the impact of mimetics as it is the current practice to have judges render decisions in reverse order of rank to minimize the tendency of younger judges to follow rulings of older, more established ones. Ben Bernanke also avoids herding by soliciting the opinion of the other members of the Board of Governors before stating his position on economic issues facing the country.Footnote 5 In France and Israel, polls are banned during the weeks leading up to a political election so as not to influence the pattern of voting behavior during the actual election (Farnsworth 2007).

Alevy et al. (2007) discuss how herding is found in fields as varied as medicine, adoption of technology, responses to environmental hazards, and of course, in financial markets. Diamond and Dybvig (1983), Shiller (1990), Eichengreen et al. (1998), Morris and Shin (1999), Bikhchandani and Sharma (2000), Persaud (2000), and Chari and Kehoe (2004) argue that herding adds volatility to the financial markets because these mimicking behaviors cause excessive price movements, or over-reactions, in security prices. The result is an unnecessarily unstable environment. Shiller (2008) argues that herding is responsible for the recent real estate bubble, and suggests that home prices did not need to go nearly as high as they did, and therefore, need not come down as far as they have.Footnote 6 Alternatively stated, in the absence of herding behavior, excessive booms and crashes could be avoided, or at least certainly mitigated.

We are careful to disclose that herding in not necessarily an irrationality argument. Bikhchandani et al. (1992, 1998) use Bayesian statistics to mathematically demonstrate that it can be fully rational to abandon private information in favor of the herd if one believes the market possesses a superior collective information set. Two supporting (yet non-Bayesian) explanations in favor of rational herding are offered by Anderson and Holt (1997) who argue that utility is derived by both social proximity to peer positions and the avoidance of being alone in one’s convictions. Interestingly, Keynes (1965) expresses that the positive utility derived from being the only one who is correct is far less than the negative utility experienced when being the only one who is wrong.

Although (too) often used interchangeably in the extant literature, Celen and Kariv (2004) and Chamley (2004) correctly note an important distinction between an information cascade and a herd. While both result in a large number of people making the same decision, in a herd, private information/signals are not necessarily being ignored. Alternatively stated, social learning can take place in a herd, but not in an information cascade where private information is being ignored in favor of the public choices of the group. Whether it is referred to in the literature as herding, information cascades, social contagion, or observational learning, mimetic herding has long been recognized as a means of describing how people influence each other through mere observation. Noting the increased incidence of strategic mortgage default, we seek to test just how malleable homeowners are to the adoption of an advocacy of strategic default.

Experimental Design

Past studies examining the theory of information cascades and herding can be divided into two classes: regression-based tests using real world data and laboratory experiments using primary data collected from individual participants.Footnote 7 Some experimental studies have relied upon Bayesian statistical experiments involving the drawing of balls from a bag. This type of abstract approach points to the difficulty in investigating theories hoping to discern the behavioral motivations of individuals. In the current investigation, we seek a much more direct investigatory approach to the empirical testing of herding behavior.

The experiment begins by participants being presented with a common setting and set of assumptions as presented in Exhibit 1 (see Appendix). Specifically, participants are assumed to have purchased a home at a price of $300,000 making a down payment of $30,000 while taking out a fixed rate mortgage. Now, 4 years later, the home has dropped in value to one of nine different levels (Scenarios 1 through 9) leaving the home underwater.Footnote 8 Assuming, the participant does not expect home prices to go up or down at all over the next 3 years, and assuming he is able to continue making his mortgage payments,Footnote 9 we ask the person to share the likelihood that he will strategically default on his mortgage in each of the scenarios sequentially.Footnote 10

After the participant registers his likelihood of strategically defaulting under the first scenario, a bar chart of the results is shown to reveal the collective responses of past participants (see Exhibit 2 of the Appendix). As such, it is not possible to influence the answers of participants in Scenario 1. Instead, these values are used as a baseline measurement or starting reference point. However, an ability to influence responses is put to the test in all subsequent scenarios. Specifically, in order to measure the differential effects of what each theory would predict, we must carefully create alternative laboratory environments. To accomplish this task, it is necessary to control the feedback the group receives through altering the bar charts. So, while we are able to record the true responses of each participant, we falsely report back to participants a set of behaviors necessary to tease out each specific hypothesis. Walking the participant through each of these nine scenarios constitutes Phase I of the experiment. Phase II uses the exact same scenarios and assumptions, but involves a slightly altered setting. Instead of asking people what they would do in each of the scenarios, participants are asked what they would recommend a friend do in each scenario. To test the various hypotheses, we need a reason to ask the same set of questions as in Phase I. However, to realistically allow for different responses, we need for there to be the possibility that the participant would respond differently between Phases I and II. In sum, our goal is to hold constant the effective settings between the two phases while at the same time plausibly allowing for the participant to observe different collective or aggregated responses. Below, we explain how these effects are controlled for in each successive experimental design variant.

Control Sample

When considering the various hypotheses to be tested and the control measures that need to be in place, it is necessary to create 50 distinct deviations from our base experiment. The base experiment involves each participant (within this control sample group) walking through each of the phases and indicating their likelihood of defaulting without receiving any feedback concerning the responses of other participants. This control group enables us to see how participants respond in the absence of feedback manipulations. Beyond the control group, the (falsely) reported group responses are constructed in multiple ways to isolate each particular theory as discussed below.

Skewing Order

We posit that people can be swayed from their a priori actions and opinions simply by observing a different behavior from others around them. This issue is examined by falsely reporting extremely low group mean responses (low likelihood of strategically defaulting) in one of the phases and extremely high group mean responses (high likelihood of strategically defaulting) in another. Skewing order simply refers to whether the high means are shown in Phase 1 or Phase 2. As a random design variable, we do not expect skewing order to be significant.

Presentation Order

In constructing the base experimental design, we associate Phase I with the likelihood of the participant defaulting, while in Phase II, we change the setting to reference advice given to the participant’s friend. The base design also uses low-mean graphs in Phase I, and high-mean graphs in Phase II. To avoid problems with order effects, we then randomize the order in which these controls appear so as not to systematically influence the results.

Signal Strength

The next variation in the design relates to the strength of the herding signal implied through the shape of the distribution of participant responses. A strong signal is seen when the group’s answers converge around the mean score resulting in a tall and tight distribution of responses (a leptokurtotic distribution). In this case, Hirshleifer et al. (1994) predict the participant may feel compelled to express the same view even though he may not share it. Conversely, a weak signal is seen when the group’s responses are spread out more evenly across the 9-point “likelihood of default” scale (a platykurtotic distribution). Jegadeesh and Kim (2010) suggest less herding in this situation because individuals are implicitly given greater freedom to choose any score they want. Alternatively, Fernández et al. (2011) interpret a wider dispersion in the distribution of participant responses as being reflective of a more uncertain market environment. If this is the case, the uncertainty would be a reason for participants to herd more, not less. Based on the competing predictions in the extant literature, we leave the sign on the signal strength variable as an empirical question.

To hold all else constant and truly isolate the effect of signal strength, it is necessary to create a set of graphs whose mean is the same between both groups, but whose distribution is tight for one group and wide for another. We do this for all nine scenarios and across both phases. An example of the signal strength design is illustrated in Exhibit 3 (see Appendix).

The central hypothesis in this study is that the (falsely) reported aggregated responses will significantly impact participant indications of likelihood of strategically defaulting in each successive scenario. We further control for various order effects in the experimental design to remove potential confounding of the results. Until this point in the paper, we have only discussed variations of the experiment where the bar charts reflect previous participant’s likelihood of strategic default indications. We next extend our discussion to variations in the design which consider the influence that specific individuals might have on the results.

Maven Influencer

A “Maven” is an expert or knowledge leader in a particular subject (Feick and Price 1987). These are individuals whose opinions are sought particularly when the decision to be made is complex and/or ambiguous. We hypothesize that observing the behavior/choice of a Maven will have a greater influence on participants than observing the behavior of an average person. To test this hypothesis, we create a variation of the design where after each scenario, in addition to reporting (false) group responses, we also report the response from a real estate expert.

Based on Engelberg and Parsons (2011), 18 major newspapers from across the United States were identified–one for each of the scenarios (9 × 2). We then selected 18 last names, 9 male first names and 9 female first names. Using a Fisher and Yates (1938)Footnote 11 shuffling algorithm to ensure proper randomization of draws without replacement, we then assigned a full reporter name to each newspaper. In this variant of the experiment, participants saw not only falsified means and a corresponding bar chart for each scenario, they also saw the exact score associated with a Maven from a leading newspaper. To be sure the Maven effect is consistent with the High-Low skewing effect, the Maven’s score was randomly selected from integers strictly greater (less) than the reported mean in the High (Low) graph phase. Exhibit 4 (see Appendix) reports the newspapers and reporter names utilized in this study.

Participant (Non-Maven) Influencer

White (2010) and Seiler et al. (2011) demonstrate that there are substantial levels of shame associated with the act of strategically defaulting. Thus far, individual responses have not been publicly associated with a particular individual. If we introduce the possibility that participant’s likelihood of default scores will be shown to others in the experiment, we would expect people to express a lesser willingness to strategically default or at least a lesser willingness to break away from the herd. To carry out this variation in design, we begin by collecting the names of participants entering the experiment and convey that their responses have a (1/n) random chance of being shared along with their name, city and state with future participants.Footnote 12 We then use the same Fisher and Yates (1938) algorithm to randomize the 18 fictitious name combinations shown in Exhibit 4 (see Appendix). In this sub-sample, after the participant answers each scenario, they see the (falsely) aggregated responses of those before them as well as what they believe to be a randomly selected past participant, that participant’s city and state, as well as their likelihood of strategic default score for that scenario. The city and state names were randomly selected from our population-weighted database.

To clarify the experimental design, each participant is exposed to just one of the 50 variant combinations discussed above. As such, we are following a “between subjects” design. The 50 variant combinations of the study are graphically presented in Exhibit 5 (see Appendix).

Methodology

We seek to measure how susceptible people are to changing their responses based on outside influences. Due to our unique design, we create Mimetic Herding scores in several different ways to examine the issue from different perspectives. First, we create Mimetic Herding scores for a given respondent by taking the absolute difference between their Phase II and Phase I responses. Due to the controlled environment, we are able to attribute the reason for these potential shifts from one phase to the next to controls within our model. Next, we create Mimetic Herding scores at the scenario level by comparing responses to the corresponding answers from the control sample. Control sample means make sense because they represent a good proxy for how the participants likely would have responded in the absence of our experimental controls. Finally, we create Mimetic Herding scores at the scenario level by comparing responses to the fabricated mean values shown to respondents during the experiment. These reported means are appropriate in the sense that they were perceived by participants to be the true mean responses of their peers. These three techniques enable us to clearly identify the amount of mimetic herding that occurs.

For our initial measure of Mimetic Herding, we are interested in the deviation of a respondent’s answer on a given question q 1 from Phase I with the respondent’s answer on the corresponding question r 1 from Phase II. For a question q, take individual i in group j (either A for Aggregated Responses Influencer group, M for Maven Influencer group, P for Participant Influencer group), to have a private belief x ijq about the appropriate answer for the question. The Mimetic Herding Score, θ jq , for group j, question q is then

$$ \matrix{ {{\mathrm{Mimetic}}\;{\mathrm{Herding}}\;{\mathrm{Scor}}{{\mathrm{e}}_1}} \hfill &{{\theta_{jq}} = ABS\left( {{x_{ijq}} - {x_{ijr}}} \right)} \hfill \\ }<!end array> $$
(1)

where, x ijr is the respondent’s answer to question r, which is the corollary to question q, but is from Phase II. The MSE, \( \vartheta_{jq}^2 \), for group j, question q, is then

$$ \vartheta_{jq}^2 = \frac{1}{{{N_j}}}\sum\limits_{i = 1}^{{N_j}} {{{\left( {ABS\left( {{x_{ijq}} - {x_{ijr}}} \right) - {{\overline x}_{ijqr}}} \right)}^2}} $$
(1a)

where, N j is the number of group j members in the sample and x ijr is the participant’s answer to question r, which is the corollary to question q, but is from Phase II, and \( {\overline x_{ijqr}} \) is the mean of the absolute value series. A group that is exhibiting Mimetic Herding will have a higher MSE for that question.

For our second measure of Mimetic Herding, we are interested in the deviation of a respondent’s answer on a given question q from the control sample’s response on question q. To create our measure of Mimetic Herding, we use a squared deviation from the mean approach similar to Healy (2009), as follows. For a question q, take individual i in group j (either A for Aggregated Responses, M for Maven Influencer group, or P for Participant Influencer group), to have a private belief x ijq about the appropriate answer for the question. The Mimetic Herding score for group j, question q is then

$$ \matrix{ {{\mathrm{Mimetic}}\;{\mathrm{Herding}}\;{\mathrm{Scor}}{{\mathrm{e}}_2}} \hfill &{{\phi_{jq}} = {{\left( {{x_{ijq}} = Contro{l_q}} \right)}^2}} \hfill \\ }<!end array> $$
(2)

where, Control q is the control sample’s answer for question q. The MSE, \( \Delta_{jq}^2 \), for a group j, question q is then

$$ \Delta_{jq}^2 = \frac{1}{{{N_j}}}\sum\limits_{i = 1}^{{N_j}} {{{\left( {{x_{ijq}} - Contro{l_q}} \right)}^2}} $$
(2a)

where, N j is the number of group j members in the sample and Control q is the control sample’s answer for question q. A group that is exhibiting Mimetic Herding will have a higher MSE for that question.

For our final measure of Mimetic Herding, we are interested in the deviation of a respondent’s answer on a given question q from the mean response shown to the respondent on question q. For question q, take individual i in group j (either A for Aggregated Responses, M for Maven Influencer group, or P for Participant Influencer group), to have a private belief x ijq about the appropriate answer for the question. The Mimetic Herding score for group j, question q is then

$$ \matrix{ {{\mathrm{Mimetic}}\;{\mathrm{Herding}}\;{\mathrm{Scor}}{{\mathrm{e}}_3}} \hfill &{{\chi_{jq}} = {{\left( {{\chi_{ijq}} - AggrRes{p_q}} \right)}^2}} \hfill \\ }<!end array> $$
(3)

where, AggrResp q is the value that was reported as being the mean answer given by previous survey participants for that question. The MSE, \( \delta_{jq}^2 \), for group j, question q is then

$$ \delta_{jq}^2 = \frac{1}{{{N_j}}}\sum\limits_{i = 1}^{{N_j}} {{{\left( {{x_{ijq}} - AggrRes{p_q}} \right)}^2}} $$
(3a)

where, N j is the number of group j members in the sample and AggrResp q is the value that was reported as being the mean answer given by previous survey participants for question q. A group that is exhibiting Mimetic Herding will have a lower MSE for that question.

Now that the three Mimetic Herding scores have been created, we next specify a model to explain these scores as follows:

$$ {\mathrm{Mimetic}}\;{\mathrm{Herding}} = f\left\{ {{\mathrm{Experimental}}\;{\mathrm{Design}}\;{\mathrm{Controls}},{\mathrm{Miscellaneous}},{\mathrm{Demographics}}} \right\} $$
(8)

where the specific variables are defined as:

Dependent Variables: Mimetic Herding

Mimetic Herding Score–1

Absolute difference between Phase I and Phase II scores for each of the final eight scenarios

Mimetic Herding Score–2

Mimetic Herding Scores using the control group means

Mimetic Herding Score–3

Mimetic Herding Scores using the fabricated means

Independent Variables: Experimental Design Controls

Skewing Order

Refers to the scenario averages experimenters inserted in an attempt to herd participant scores either higher or lower. 1 = high means; 0 = low means.

Presentation Order

Refers to whether phase 1 of the experiment referred to the actions the participant would do if she were in the situation versus what they would recommend their friend would do. 0 = you; 1 = friend.

Signal Strength

Refers to the situation where the distribution of past participant responses was falsely reported to be extremely tightly centered about the mean (leptokurtotic). 1 = a Tight-Tight, 0 otherwise.Footnote 13

Maven Influencer

A dummy variable equal to 1 if an expert opinion was seen by the current participant, 0 otherwise.

Participant Influencer

A dummy variable equal to 1 if a past participant opinion was seen by the current participant, 0 otherwise.

Independent Variables: Miscellaneous

Can Be Easily Influenced

Refers to how easily the participant thinks his answers can be influenced by others from 1 (not easily influenced) to 9 (easily influenced).

Real Estate Knowledge

The self-assessed level of knowledge from 1 (not knowledgeable) to 9 (very knowledgeable).

Susceptibility to Informational Influence (SII)

The self-assessed level of malleability due to information gains from 1 (not malleable) to 9 (very malleable) based on Bearden et al. (1989).

Susceptibility to Normative Influence (SNI)

The self-assessed level of malleability due to the social need to fit in from 1 (not malleable) to 9 (very malleable) based on Bearden et al. (1989).

Moral Viewpoint

The degree to which the participant believes it is morally wrong to strategically default on one’s mortgage from 1 (not wrong) to 9 (very wrong).

Independent Variables: Demographics

Age

Age

Number of Children under 18

The number of children under the age of 18

Children under 18 Dummy

1 if the participant has at least one child under the age of 18, 0 otherwise

Education Level

1 = Ph.D.; 2 = Master’s Degree; 3 = Bachelor’s Degree; 4 = Some college; 5 = High School Diploma; and 6 = Less than High School Diploma

College Dummy

1 if the participant has a college degree, 0 otherwise

Ethnicity:

1 = African American; 2 = Asian; 3 = Caucasian; 4 = Hispanic; 5 = Native American; 6 = other

Male Dummy

1 if the participant is a male, 0 otherwise

Income

The participant’s income expressed as an increasing function across seven $20,000 buckets.

Single Dummy

1 if the participant is single, 0 otherwise

Net Worth

The participant’s net worth expressed as an increasing function across nine $200,000 buckets ranging from less than −$400,000 through over $1,000,000

Positive Net Worth Dummy

1 if the participant has a positive net worth, 0 otherwise

Data

Data from this study were collected in the last week of March 2011, using an existing on-line network of homeowners who stand ready to complete Web-based experiments. Our Institute simply posts a notice on the server for a task we would like to be performed and lets potential participants self-select into the pool. Participants hail from all across the United States and generally reflect the overall population of U.S. homeownersFootnote 14 with the exception that our participants are somewhat younger and have attained a higher level of education.Footnote 15 Originally, 1,495 individuals participated in our experiment. However, after screening the data, the final sample consists of 1,365 usable responses.

Most economic experiments use a reward-based system where compensation is tied to the correctness of the answers provided by participants. However, in the context of strategic default, there truly are no “correct” answers. We merely collect data on people’s strategic default acceptance threshold. As such, it is not possible to tie compensation to participant responses in a meaningful way. Instead, we posted the experiment with a flat participation fee of $2.06,Footnote 16 an amount large enough to encourage participation, but not so large as to attract people who might engage for the wrong reasons.

Having collected data from this general pool of participants for a multitude of different studies in the past, we have evolved an intricate screening system to ensure cleaner tests of our hypotheses. Concerning missing data, we screen to be sure participants answered all 18 default scenarios in order to obtain a complete picture of each person’s strategic default acceptance threshold. Further, at two different places within the experiment, we ask the person to answer a certain number, say “3,” for one question and say “7,” for another. This simple technique ensures participants are at least reading the questions carefully.Footnote 17 Moreover, we place a hidden timer in the experiment to record the time it took for the person to read the instructions. If the person advances the screen before 10 seconds have elapsed, we discard all their responses.Footnote 18 Another technique to identify people who do not take the experiment seriously is to have them begin the experiment by selecting their city and state from a pull-down menu early in the experiment. Then, at the end of the experiment, we ask them for their zip code.Footnote 19 If the two do not match up, we remove them from the final sample.

In addition to the above methods, we also have the ability to require participants to have an overall service satisfaction rating of no less than 95 % on past tasks completed for us and other posting organizations. To clarify, participants are approved (or disapproved) by each posting organization for having performed sufficiently (insufficiently) on a given task. These ratings are made available to all subsequent posting organizations so that they may choose to disallow people from performing tasks for them in the future. On an individual organization level, we have also built in screens to permanently block people who do not meet our series of screens in past experiments from ever participating in future experiments for us. We accomplish this by banning the unique individual identifiers created when a person signs up to create an account.

Beyond individual task evaluations, we further identify those who do not fully engage in the experiment by cross-referencing the demographic data from the current collection effort to those we have collected in the past. If the person’s unique identifying number has been used in a past study, we are able to cross-reference their demographic data and run further screens to see if these reported values have unrealistically changed. If, for example, a person reports different genders or nationalities from one experiment to the next, a red flag is sent up to identify the person as falsifying their responses. These folks would be permanently banned from our individualized pool.

Readers might be concerned that people within this system may have created multiple accounts using multiple e-mail addresses, and as such, may have participated in our current experiment multiple times. This is highly unlikely given that in order to establish an account within this system, an active credit card matching the name of the accountholder must be provided. Moreover, once a person completes our experiment, they are unable to re-enter the task. Importantly, if a person were to complete the same task multiple times, it would seem reasonable to assume they were doing so for the money. As such, they would be likely to want to hurry through the experiment, taking as little time as possible to discern the intricacies of our experimental design. Consequently, this type of behavior would work toward a null result, not towards a significant finding. In sum, as with any experimental design, it is in our best interest to produce as pure an experimental environment as possible if we hope to definitively answer our research questions. Table 1 provides descriptive statistics for the sample.

Table 1 Descriptive statistics

Results

To learn how participants might have responded in the absence of our experimental manipulations of the environment, we begin by analyzing data collected from a control group. Panel A of Table 2 reports that their uninfluenced scores range from a low of 1.62 (not likely to strategically default) up to 6.99 (more likely to strategically default). The shape of the progression moving from lower scenarios (where the home is not far underwater) to higher scenarios (where the home is extremely far underwater) is upward sloping as hypothesized. From a design perspective, it is fundamentally sound to see results that are not compressed against either the ceiling or floor of the scale. These baseline scores can more formally be interpreted as the person’s strategic default acceptance threshold.

Table 2 Univariate examination of the experimental design controls in the absence of an influencer

Panel B reports the fabricated means that were shown to all other participants (not the control group). In Panel C, we begin to examine the data using simple univariate techniques. First, the low versus high means are considered. Note that in this single-level aggregation, all other influences are mixed in with the low versus high means. Therefore, interpretation of the results must be met with caution. Panel C is simply being reported to get an initial feel for the data.

Paired-Samples T-Tests report significance in almost all high versus low skewing scenarios indicating that participants might have been swayed from their unbiased estimates based on seeing (fabricated) collective results that were artificially presented as being extremely high or low. Scores are statistically similar when considering the single-aggregation effect of the distribution of past responses (leptokurtosis versus platykurtosis). Early indications are, therefore, that participants were not affected by the degree of dispersion of responses about the mean (signal strength). Concerning presentation order, it appears that whichever group went first had higher reported values. This did not matter whether the phase asked about the participant or the participant’s friend. This presentation order is consistent with the effect seen in the control group as well. Observing an order effect is another confirmation that the steps taken to control the experimental design were necessary to isolate the key variables of interest–skewing and signal strength. Panels D and E present secondary and tertiary dissections of the likelihood of strategic default by the three control variables. These breakdowns support earlier indications that presentation order and skewing matter, but not so much the strength of the signal.

Table 3 presents a correlation matrix of potential independent variables used in the regression analyses to follow. Most of the demographic variables are intuitively linked. For example, married participants are associated with being slightly older, more likely to have graduated college, make more money and have greater wealth, have children, and so forth. Moving away from demographics, people who believe they can more easily be influenced, have lower self-assessed levels of knowledge, and rate higher on the Susceptibility to Normative Influence scale. But, again, none of the correlation coefficients send up a red flag in terms of multicollinearity problems moving forward.Footnote 20

Table 3 Correlation matrix for independent variables subsequently used to explain mimetic herding

Exhibits 6 and 7 (see Appendix) visually present the results of our central hypotheses in the paper. Exhibit 6 (see Appendix) shows the average mimetic herding scores for the main sample relative to the control sample’s responses on each question. The upward trend shows strong evidence that the respondent’s answers were pulled farther and farther from the control sample’s responses on each question. This is clear evidence of mimetic herding. Exhibit 7 (see Appendix) shows the average mimetic herding scores for the main sample relative to the fabricated means that the participant observed on each question. The downward trend shows that the homeowner’s answers were pulled closer to the fabricated means over time again providing evidence of mimetic herding.

Table 4 provides a direct multivariate test of our central hypotheses in the paper. We specify the measure of Mimetic Herding as the absolute difference in participant scores between the first and second phases of the experiment.Footnote 21 As hypothesized, the presence of a Maven significantly encourages herding well beyond the 99 % level. However, signal strength does not differentially invoke herding. Interestingly, people who thought their behavior could be influenced by others were correct. Based on susceptibility to information influence (SII) and susceptibility to normative influence (SNI) measures developed by Bearden et al. (1989), it appears participants herd more for informational gains as opposed to social reasons.Footnote 22 Finally, as hypothesized, herding behavior is significantly mitigated when the person holds a moral objection to strategic default.

Table 4 Regressions to explain mimetic herding

While Table 4 examines herding behavior scenario by scenario, in Table 5 we use the Healy (2009) inspired sum of squared distance from the mean measure to calculate an accumulated Mimetic Herding score that aggregates static measurements across each of the eight scenarios.Footnote 23 Mimetic herding is said to increase when the participant moves farther from the control sample means (Eq. 2) or when the participant moves closer to the fabricated means (Eq. 3). Two sets of means are considered in this table. The first set of means to which participant scores are compared is the means observed within the control sample where nothing was done to influence participant responses. The justification for considering these is that this is a reasonable approximation for what the participants would have indicated in the absence of our experimental influences. The second set of means we use in the calculation of mimetic herding is the fabricated means artificially introduced by the authors. It makes sense to use these from the standpoint that the participants are led to believe that these are the responses from past participants.

Table 5 Regressions to explain mimetic herding scores based on control group and fabricated means

To examine accumulated mimetic herding behavior, we consider the combination of experimental controls where the participant saw the high means in Phase I. The reason for this focus is because, based on the scores reported in the control group samples, we know that this experimental variant is the one where participants were confronted with reported group means that were most incongruent with their a priori answers. Alternatively stated, participants who began the experiment with high mean feedback in Phase 1 had to decide right away if they were going to retain their internal likelihood of strategically defaulting under each scenario or herd by following the (falsely) reported means.

The first column of Table 5 presents the results associated with the control sample mean comparisons.Footnote 24 The observation of Maven behavior again significantly increases the herding activity of the sample, while signal strength does not. Participants who indicate they can be influenced by others are correct. Finally, those with a strong moral belief that it is wrong to strategically default are statistically significantly less likely to follow the (perceived) herd.

The second column of Table 5 again employs the Healy (2009) based measure, but this time uses the fabricated group means as the relevant benchmark instead of the control group means. We expect the signs of all variables to be reversed as we are now hypothesizing that participants will move towards the fabricated means (as opposed to away from their control group means), and this is exactly what happens. Upon first inspection, it seems surprising that the presence of a Maven becomes non-significant for the first time in the study. However, recall that the reported Maven score does not coincide with the fabricated means. Instead, it is intentionally reported as being on the extreme side of the mean (higher than the mean in “high mean” graphs and lower than the mean in “low mean” graphs). As such, we do not expect it to pull participants to the means. We expect Mavens to pull participants beyond the mean. As before, participants appear to herd for informational gains rather than for social reasons.

Robustness Check

Experimental settings are necessary to create an environment where all else is held constant and where a sufficient number of variants (50 in this study) can be carried out (Alevy et al. 2007). However, a question that often arises in an experimental study is whether or not the results would apply to a group of professionals (Harrison and List 2004; Locke and Mann 2005; Bikhchandani and Sharma 2000). For this reason, we conduct a variant of this study in a face-to-face setting using real estate professionals who are also homeowners. This is accomplished through the use of an Instant Response Device (IRD). An IRD is a credit card sized, hand-held electronic device that allows participants to register their responses to scenarios we construct on an interactive, real-time basis. Since participants are all located in the same room, we are able to test whether or not their responses are being influenced by the collective responses of their peers.

As with the Internet sample, we create the exact same (falsified) graphs that reflect both the mean and full distribution of participant responses. Since it is not possible to replicate all 50 variants of the study at once, we selected the variant where the professionals’ Phase I (Phase II) graphs reflected low (high) means, a tight (tight) distribution, they were asked to describe their (friend) likelihood of strategic default, and with no external influencer. The results are shown in Table 6.

Table 6 T-test and regression results using real estate professionals in a face-to-face setting

Panel A of Table 6 reports the means associated with the scenarios in each phase. Note that the Scenario 1 scores are almost identical indicating no difference between the advice the real estate professional would provide to a friend and what he would do for himself. Interestingly, significant differences do not occur until Scenario 4. This implies that it took three scenarios for participants to herd towards the values (they believe to be) reported by their peers.Footnote 25 Scores stayed higher through Phase II, as hypothesized, based on the higher reported means shown to the audience.

In Panel B, regressions are estimated for each of the eight scenarios consistent with the process carried out in Table 4. The main difference is that the experimental design controls are necessarily absent, again due to the fact that only a single variant of this study (not all 50) was administered. These remaining independent variables do little to explain which particular participant attributes contribute to the herding behavior. In sum, it is clear that even professional real estate experts follow the herd in this context.

Conclusions

This study examines mimetic herding in an experimental setting to measure whether or not it is possible to change homeowners’ stated willingness to strategically default at varying levels of negative equity. By falsely reporting the mean willingness to strategically default of their peers, participants significantly deviated from their prior positions. This behavioral departure is intensified when the homeowner (falsely) observes a Maven’s, or market leader’s choice in each scenario. When incorporating the participant’s moral viewpoint on strategic default, homeowners are less likely to herd when they hold an a priori moral objection. Mimetic herding tests did not support malleability in a priori positions on the part of homeowners simply by reporting a difference in signal strength. Finally, based on the Bearden et al. (1989) measure of SII and SNI, homeowner participants appear to herd more for informational gains rather than for social purposes.

Motivated by Bikhchandani and Sharma (2000), we then performed a robustness check on the data by collecting a (second) sample of real estate industry professionals who are also homeowners. Consistent with theoretical expectations, even real estate professionals significantly herd after observing the behavior of their peers. Due to the difficulty of convening large groups of industry professionals, we leave further confirmatory variants of our experimental design to future research efforts.