1 Introduction

In standard signaling games, a sender has private information and a receiver tries to infer that information from a message sent by the sender. There are some environments in which there is a message that induces the receiver to take an action that maximizes the senders payoff. For example, Kihlstrom and Riordan (1984) showed that firms may use costly advertising in order to signal product quality, but in the settings we have in mind, a promotional campaign might also make more consumers aware of the product and consequently, that campaign may be more beneficial to companies than no advertising.Footnote 1 Similarly, Spence (1973) suggested that workers invest in education in order to signal their productivity in the labor market, but a higher level of education may also give access to better job opportunities in the situations we consider, in which case education may be beneficial to the worker. In both examples, the sender prefers the receivers response to one of the messages. Specifically, some consumers may consider buying a product after observing a marketing campaign that makes them aware of its existence, but that product would be overlooked by those consumers without the advertising expenditure. In the second example, a company may consider offering a highly productive job only to those workers whose level of education is sufficiently high. Finally, it is easy to imagine situations in which only the highest senders type would have access to the receivers best responses with perfect information. For example, with perfect information, those customers with the highest willingness to pay for a product would prefer to buy the product from only the best companies. Likewise, companies would prefer to offer certain high-profile jobs to only the most productive workers.

In this article, we analyze a general class of signaling games in which the highest message induces the uninformed receiver to take the action that is most preferred by the sender. Additionally, the receivers best response to the highest type of sender with perfect information is so good that all senders types will have strong incentives to imitate the highest. In this type of games, there will only be pooling equilibria and as expected, the outcome in which all senders types choose the highest message is the only equilibrium selected by the neologism-proof refinement developed by Farrell (1993). However, other standard refinements do not rule out the implausible equilibria.

As out-of-equilibrium beliefs are unconstrained in a perfect Bayesian equilibrium, there are usually multiple equilibria in signaling games and previous researchers have proposed different refinements in order to eliminate some of those equilibria. In the literature on game theory, there is a wide variety of articles which have identified a general class of signaling games in which the unique equilibrium selected by previous refinements is a separating equilibrium (Riley 1979; Cho and Sobel 1990; Esó and Schummer 2009; Liu and Pei 2020). Differently, Mailath et al. (1993) use a numerical example of an educational signaling game in order to illustrate that the unique result selected by their refinement will be a separating or a pooling equilibrium depending on the prior distribution of workers types. Unlike that literature, we identify a general class of signaling games in which there are only pooling equilibria, but the refinement introduced by Farrell (1993) selects the plausible equilibrium. Interestingly, divinity or D1 as defined by Cho and Kreps (1987) and Cho and Sobel (1990) would also select the same equilibrium in our class of games if we added the single-crossing condition, but we found that Farrells criterion does not need that condition to select the same equilibrium.

As an application, we study an educational model which belongs to our general class of signaling games. Here, the workers productivity in the labor market increases with his level of education. Additionally, only those workers with the highest level of education will gain access to a highly productive job with training opportunities. Furthermore, the productivity difference between workers types is so high for all levels of education that low-productivity workers will have strong incentives to imitate high-productivity workers educational investment. Finally, the prior distribution of workers types is sufficiently concentrated on the highest possible type. Unsurprisingly, in a non-selective educational system with low costs of education, we obtain a unique neologism-proof equilibrium in pure strategies in which all workers types choose the highest possible level of education in order to get the job with training opportunities. However, divinity and other criteria do not eliminate the other implausible equilibria.

This article is organized as follows. Section 2 briefly reviews previous literature and shows our main contribution. Section 3 presents the main ingredients of a typical signaling game, whereas Sect. 4 introduces the main assumptions which differentiate our general class of games from other signaling games and characterizes all the equilibria obtained. After this, Sect. 5 proves that the refinement proposed by Farrell (1993) selects the most plausible equilibria even when the single-crossing condition is not satisfied, whereas other refinements require that condition in order to select that equilibria. Next, in Sect. 6, we study a similar model to that introduced by Spence (1973) in his seminal article as an application of our class of games. Finally, Sect. 7 summarizes the main conclusions and the appendix includes the proofs of all lemmas, propositions and theorems.

2 Literature review and contribution

As there are multiple equilibria in most signaling games, previous literature has proposed different criteria in order to constrain the out-of-equilibrium beliefs and in many cases, the unique equilibrium selected is a separating equilibrium (Riley 1979; Cho and Sobel 1990; Esó and Schummer 2009; Clark and Fudenberg 2021). Similarly, Mailath et al. (1993) use a specific example of the Spences model with separating and pooling equilibria in order to illustrate that their criterion only selects the least-cost separating equilibrium under certain assumptions, but they also found the conditions under which the selected equilibrium is pooling. Unlike that literature, we study a general class of signaling games in which there are only pooling equilibria, but one of them is more plausible than others. In those games, we show that only the refinement introduced by Farrell (1993) will be useful to select that plausible equilibrium.

In his seminal article, Farrell (1993) defined the neologism-proof equilibrium (NPE), which is a sequential equilibrium with no credible neologism. A credible neologism is an out-of-equilibrium message for which there is a set of senders types who are strictly better off than in equilibrium when the receiver responds to that deviation by concentrating his beliefs on those types, whereas other types are worse off. Farrell (1993) called pooling equilibria uncommunicative and he proved that the NPE are uncommunicative when the senders preferences over the receivers beliefs are independent of the senders type. However, Farrell (1993) acknowledged that the NPE needs not be unique if it exists. In this article, we have identified a general class of monotonic signaling games in which there are multiple pooling equilibria that survive standard refinements, but only one is NPE. Therefore, the concept of equilibrium introduced by Farrell is more useful than other refinements in our class of games.Footnote 2

Interestingly, Cho and Sobel (1990) found that their refinement, namely divinity, selects a unique pooling equilibrium in a class of signaling games with the single-crossing condition. In our class of games, we found that the set of divine equilibria coincides with the set of NPE when we add that condition. As Farrells criterion does not need this assumption in order to select those equilibria, it is more robust to the specification of the single-crossing condition than divinity in our specific class of signaling games.

In comparing different refinements, there is an abundance of literature demonstrating that divinity is the strongest criterion. For example, Cho and Kreps (1987) showed that the intuitive refinement gives rise to a lower set of equilibria than those criteria suggested by McLennan (1985), Milgrom and Roberts (1986) and Kohlberg and Mertens (1986) and that all divine equilibria also satisfy the intuitive criterion, but the opposite is not true. Furthermore, Sobel et al. (1990) proved that the set of equilibria that survives divinity is a subset of a set of equilibria obtained after deleting iteratively weakly dominated strategies. Additionally, Cho and Sobel (1990) showed that divinity, universal divinity and never a weak best response criterion are equivalent in monotonic signaling games. Therefore, divine equilibria are strategically stable as defined by Kohlberg and Mertens (1986) in monotonic signaling games. Finally, Fudenberg and He (2017, 2018, 2020) proved that the set of divine equilibria is a subset of the set of type compatible equilibria and of the set of rationality compatible equilibria, Clark and Fudenberg (2021) showed that justified communication equilibria are path equivalent to divine equilibria in co-monotonic signaling games, and Dominiak and Lee (2023) proved that each divine equilibrium is a rational hypothesis testing equilibrium, which is their notion of equilibrium. As we consider monotonic signaling games and those articles showed that divinity selects a subset of the set of equilibria selected by those refinements mentioned above in monotonic signaling games, we only compare the neologism-proof equilibrium selected to that selected by divinity with and without the single-crossing condition.

Before applying any refinement, we have identified a class of games in which all the equilibria in pure strategies are pooling, which are monotone equilibria as defined by Liu and Pei (2020) because lower senders types do not choose higher messages than higher types. Our model and that introduced by Liu and Pei (2020) complement each other because both articles provide sufficient conditions under which all equilibria in pure strategies are monotone. They require the senders utility function to be strictly decreasing with the senders message, whereas cheap-talk games in which the senders utility function does not depend on the message may also belong to our class of games.Footnote 3 Additionally, Liu and Pei assume that the senders utility function is strictly increasing in the receivers action, while we only require that function to be weakly increasing. Furthermore, their receivers payoff function satisfies a quasiconcavity-preserving property, but we assume that it has increasing differences in the senders type and the receivers actions. The main difference between our models is that they assume that the senders utility function has strictly increasing differences in her types and messages and it has increasing differences in her types and the receivers actions, but we do not even need a single-crossing condition. However, Liu and Pei do not make assumptions about the prior distribution of senders types, whereas we assume that it is sufficiently concentrated on the highest possible type. As a result of that assumption, all senders types have strong incentives to imitate the highest type in our model.

Our article shares some similarities with that by Pei (2020). For example, a key assumption of our general class of signaling games is that the receivers best response to the senders highest message depends on the senders type. This is also one of the assumptions of the model introduced by Pei (2020). Additionally, in his theorem 2, he also assumes that the receivers highest action best replies against the senders highest type and message. This is also a key assumption in our general class of signaling games that gives strong incentives to all the senders types to imitate the highest when he sends the highest possible message. Apart from those key assumptions, like Pei, we assume that the receivers best response to each message is a singleton under perfect information, which is satisfied for generic payoff functions of the receiver. However, there are important differences between our class of games and that analyzed by Pei (2020). First, he assumes that the senders utility function is strictly decreasing with the message, but our class of signaling games also accommodates cheap-talk games. Secondly, he considers senders utility functions that are strictly increasing with the receivers action, but we allow those utility functions to be weakly increasing. A key difference between both models is that the senders utility function has increasing differences in the senders type and messages and actions in Peis model. Although we do not impose this type of single-crossing condition, all the senders types have strong incentives to imitate the highest possible type in our specification of the senders utility function. Finally, Pei considers signaling games in which the receivers utility function has strictly increasing differences in his action and the senders types and messages, whereas we assume weakly increasing differences. Therefore, our models can be seen as complements because Pei studies reputation building in repeated signaling games, whereas we consider one-shot signaling games with a different type of monotonicity condition.

In this article, we also study an extension of the model introduced by Spence (1973) that satisfies all the assumptions of our class of signaling games. In particular, we assume that education contributes to increasing productivity in the labor market. In our setting, a worker has private information on his productivity and chooses an education level. After that, a company observes the workers decision and chooses the type of job offered and the wage associated to that job. If the workers level of education is different from the maximum level, the workers productivity inferred by the company is so low, that only a low-quality job without training opportunities can be offered and the wage paid will equal the expected productivity. However, if the worker achieves the maximum level of education, the company may find it profitable to offer a high-quality job with training opportunities and a higher wage. As the educational system is not selective in the model, we obtain a plethora of pooling equilibria, but the only NPE in pure strategies is that in which the highest level of education is selected. Interestingly, Dominiak and Lee (2023) use their refinement to constrain the set of pooling equilibria in the standard Spences model, but they select equilibria in which the level of education chosen by the worker is lower than a certain threshold. Additionally, they cannot discard the Rileys separating equilibrium with their criterion.

3 Model

Now, we can introduce the main elements of our generic class of signaling games. There are two players, a Sender (S) and a Receiver (R). The Sender has private information, summarized by his type, t, an element of a finite set, \(T=\left\{{t}_{1},{t}_{2},\cdots ,{t}_{n}\right\}\subset {\mathbb{R}}\), where \({t}_{1}<{t}_{2}<\cdots <{t}_{n}\). There is a strictly positive probability distribution p(t) on T, where p(t) is the ex-ante probability that S′s type is t and is common knowledge. After observing his type, S sends a message, m, to R, where \(m\in M=\left\{{m}_{1}{,m}_{2},\cdots {m}_{N}\right\}\) and \({m}_{1}<\cdots {<m}_{N}\). After observing m, R chooses an action, a, from a finite partially ordered set A(m) and both players obtain their levels of utility. The sender and the receiver have von Neumann-Morgenstern utility functions u(t, m, a) and v(t, m, a), respectively. As usual, we assume that u(·) increases with a. Furthermore, the receivers utility function has increasing differences in t, that is, when \(a^{\prime } > a\) and \(t^{\prime } > t\), \(v\left( {t^{\prime },m,a^{\prime }} \right) - v\left( {t^{\prime },m,a} \right) \ge v\left( {t,m,a^{\prime }} \right) - v\left( {t,m,a} \right)\) for all \(m\). This assumption means that the receivers marginal utility from his action does not decrease with the senders type.Footnote 4

In this model, the strategy chosen by the sender will be a function \(q:T \to {\Delta }_{{\overline{M}}}\), where \(q\left(m|t\right)\) is the probability that S sends message m, given that his type is t, and \({\Delta }_{\overline{M} }\) denotes the set of possible mixed strategies that the sender can choose. Likewise, the receivers strategy will be a function \(r:M \to {\Delta }_{{\overline{A}}}\), where \(r\left(a|m\right)\) is the probability that R chooses action a, given that he has observed message m, and \({\Delta }_{\overline{A} }\) denotes the set of possible mixed strategies that the receiver can choose. Finally, the receivers posterior belief upon receiving the senders message is a function \(\mu : M \to {\Delta }_{{\overline{T}}}\), where \(\mu \left(t|m\right)\) is the posterior probability assigned by R to t after observing m and \({\Delta }_{\overline{T} }\) is the set of probability distributions on T.

As usual, we consider perfect Bayesian equilibria (PBE), which are sets of signaling rules, \(q\left(m|t\right)\), action rules, \(r\left(a|m\right)\), and beliefs, \(\mu \left(t|m\right)\), that satisfy two conditions:

  1. 1.

    Rationality: Each player maximizes his or her utility given the other players strategy:

    1. i.

      \(\forall t\in T, q\left({m}^{*}|t\right)>0\) only if \(\overline{u }\left[t,{m}^{*},r\left(a|{m}^{*}\right)\right]=\underset{m\in M}{\text{max}}\overline{u }\left[t,m,r\left(a|m\right)\right]\).

    2. ii.

      \(\forall m\in M, r\left({a}^{*}|m\right)>0\) only if \(\sum_{t\in T}v\left(t,m,{a}^{*}\right)\mu \left(t|m\right)=\underset{a\in A(m)}{\text{max}}\sum_{t\in T}v(t,m,a)\mu \left(t|m\right)\).

    where \(\overline{u }\left[t,m,r\left(a|m\right)\right]=\sum_{a\in A(m)}u\left(t,m,a\right)r\left(a|m\right)\).

  2. 2.

    Consistency: The equilibrium posterior beliefs are consistent with Bayes rule: If \(\sum_{t\in T}q\left(m|t\right)p\left(t\right)>0\), then \(\mu \left({t}^{*}|m\right)=\frac{q\left(m|{t}^{*}\right)p\left({t}^{*}\right)}{\sum_{t\in T}q\left(m|t\right)p\left(t\right)}\).

From now on, \(\overline{u }\left(\cdot \right)\) \(\left(u\left(\cdot \right)\right)\) denotes Ss utility when R chooses a mixed (pure) strategy.

4 Potential equilibria

In this section, we characterize our class of games and the equilibria obtained. To start with, we introduce some notation: Let \(BR(I,m)\) represent the union of sets of mixed best responses to assessments concentrated on the subset I of T and similarly, \({BR}_{PB}(I,m)\) is the set of mixed best responses to m when all the senders types belonging to I send m with probability one, but the rest of types send it with probability zero, and the receivers posterior beliefs after m are prescribed by Bayes rule in that situation. Finally, \(a\left(t,m\right)=arg\underset{a\in A(m)}{\text{max}}v\left(t,m,a\right)\) represents R′s optimal response to \(m\) when he knows that S′s type is \(t\) and \({a}_{p}\left(m\right)=arg\underset{a\in A(m)}{\text{max}}\sum_{t\in T}v(t,m,a)p\left(t\right)\) denotes Rs optimal response to \(m\) under his prior beliefs.

We assume that all those best responses exist for each value of m and for each set of posterior beliefs. Additionally, for each \(t\) and each \(m\), \(a\left(t,m\right)\) and \({a}_{p}\left(m\right)\) are unique pure strategies and due to increasing differences of \(v\left(\cdot \right)\), \(a\left(t,m\right)\) is non-decreasing with \(t\) and \(a\left({t}_{1},m\right)\le {a}_{p}\left(m\right)\le a\left({t}_{n},m\right) \forall m\in M\).

Besides, we assume that \({a}_{p}\left({m}_{N}\right)=a\left({t}_{n},{m}_{N}\right)\). According to this assumption,Footnote 5 when the posterior beliefs are equal to the prior probabilities, R will choose the same response to the highest possible message as that chosen when R is certain that the sender has the highest possible type. This assumption may be satisfied in two different scenarios. First, when the prior probability of the highest possible type is sufficiently high, the receiver will respond to a message in a pooling equilibrium as if he were playing against that type. Second, after observing \({m}_{N}\), the receiver may be much more penalized from not choosing his optimal action against \({t}_{n}\) than from not choosing his optimal action against other senders types. In this second scenario, the receiver has also strong incentives to respond to \({m}_{N}\) by choosing his best response to \({t}_{n}\). In our class of games, an equilibrium in which all senders types send the highest message will be more plausible than others due to this specific assumption.Footnote 6

Since we are assuming that u(·) increases with a, our class of signaling games are monotonic as defined by Cho and Sobel (1990) because all types of sender have the same preferences for the receivers strategies. The next assumptions characterize the subset of monotonic signaling games we study:

C1

The following conditions are satisfied:

  1. i.

    \(u\left(t,{m}_{N},a({t}_{n},{m}_{N})\right)>u\left(t,m,a({t}_{n},m)\right) \forall t\in T \hspace{0.5cm} \forall m\ne {m}_{N}\)

  2. ii.

    For all \(I,J \subset T\) such that \(I \cap J = \emptyset\) and \(t_{n} \in I\), for all \(m,m^{\prime } \in M\), with \(m \ne m^{\prime }\), for all \(r_{m} \in BR_{PB} \left( {I,m} \right)\) and for all \(r_{m^{\prime}} \in BR_{PB} \left( {J,m^{\prime}} \right)\), then there will be some \(t \in J\) such that \(\overline{u}\left( {t,m,r_{m} } \right) > \overline{u}\left( {t,m^{\prime },r_{m^{\prime}} } \right)\).

C.2

For all \(t\in T\) and for all \(m,m^{\prime} \in M\): \(u\left( {t,m^{\prime },a\left( {t_{1} ,m^{\prime }} \right)} \right) \le u\left( {t,m,a_{p} \left( m \right)} \right)\).

The first part of C1 means that all types of sender prefer the highest possible message to a different message when the receiver knows that those messages were sent by the highest type of sender and responds to them accordingly.Footnote 7 This assumption is satisfied when the highest message sent by the highest type makes the receiver choose some action that would never be chosen with a different message. For example, imagine that education is productive, and is especially productive among those workers with the highest ability in Spences model. In this scenario, if employers responded to each level of education by paying the monetary value of the productivity of the highest-ability worker with that level of education, all workers would choose the highest level of education regardless of their ability. According to part ii in C1, any type of sender will always prefer to send a different message whenever this new message makes the receiver believe that he or she belongs to a set of types with the highest possible type. This assumption implies that the receivers response to the highest possible type with certainty is so good that all types will try to imitate him. In our educational example, if the productivity of the worker with the highest ability is much greater than the productivity of workers with different abilities, the former will receive an extremely high wage when he is identified by employers and for this reason, workers with different abilities will have strong incentives to mimic the highest-ability worker by choosing the same level of education. According to C2, any type of sender will be weakly better off with a message when the receiver reacts to that message by using the prior beliefs than with any message when the receiver responds to it by concentrating his beliefs on the lowest type. This is a standard assumption in many signaling games. In our educational example, it means that a worker prefers one level of education to another when employers respond to the former with a wage equal to the ex-ante expected productivity and responds to the latter with a wage equal to the productivity of the lowest-ability worker.Footnote 8 In this educational example, assumptions C1 and C2 are satisfied as long as the cost of education is sufficiently low regardless of the workers ability. Otherwise, low-ability workers would find it prohibitively expensive to choose the same level of education as the highest-ability worker.

Our assumptions describe situations in which the cost of the signal is sufficiently low and the receivers response to the best possible sender is so good that all senders types have strong incentives to imitate the highest type. This is the reason why there are multiple pooling equilibria. Additionally, as there is one message that makes the receiver respond to the highest type of sender by choosing the senders favorite action, we should anticipate a result in which the highest sender chooses that message and other types make the same choice. Consequently, a pooling equilibrium in which all senders types send the highest possible message will be more plausible than others under the assumptions of the model.

Since the game is finite, the existence of equilibrium is guaranteed. Now, we can characterize the equilibria in pure strategies obtained in our class of games.

Lemma 1

Under assumptions C1 and C2, there are only N PBE in pure strategies, which are completely pooling and in each PBE, all sender′s types send the same message from \(M=\left\{{m}_{1}{,m}_{2},\cdots ,{m}_{N}\right\}\).

Under assumption C2, any outcome with all types sending the same message is an equilibrium because there is a response to any other message that prevents S from defecting. Part ii of assumption C1 guarantees that the sender will always find it profitable to convince the receiver that he belongs to the set with the highest type. For this reason, a type belonging to a set without \({t}_{n}\) would prefer to deviate from a non-pooling equilibrium by sending the message sent by the set with \({t}_{n}\).

Now, in order to identify the PBE in mixed strategies obtained, we introduce two definitions.

Definition 1

We say that R′s beliefs are induced by prior probabilities concentrated on \(I\) whenever they are defined as: \({p}_{I}\left(t\right)=\frac{p\left(t\right)}{\sum_{t\in I}p\left(t\right)}\) if \(t\in I\) and \({p}_{I}\left(t\right)=0\) otherwise. Let us call \({P}_{I}\left({t}_{i}\right)=\sum_{t={t}_{1}}^{{t=t}_{i}}{p}_{I}\left(t\right)\), which is the cumulative distribution function associated with \({p}_{I}\left(t\right)\).

Imagine that there were a PBE in which only those types belonging to \(I\) choose \(m\) with probability one. In that equilibrium, R′s beliefs after \(m\) would be calculated using \({p}_{I}\left(t\right)\) and R would respond to \(m\) with a best response from \({BR}_{PB}\left(I,m\right)\).

Definition 2

If only a set \(I\subset T\) sends \(m\) with positive probability in a PBE, we say that R′s beliefs are induced by that equilibrium message whenever they are defined as: \({\mu }_{I}\left(t|m\right)=\frac{q\left(m|t\right)p\left(t\right)}{\sum_{t\in I}q\left(m|t\right)p\left(t\right)}\) if \(t\in I\) and \({\mu }_{I}\left(t|m\right)=0\) otherwise. Similarly, \({M}_{I}\left({t}_{i}|m\right)=\sum_{t={t}_{1}}^{{t=t}_{i}}{\mu }_{I}\left(t|m\right)\).

In a PBE in which only those types belonging to \(I\) send a message, definition 2 implies that R′s posterior beliefs after that message will be prescribed by Bayes rule.

Now, we use those definitions in order to limit the PBE in mixed strategies obtained.

Lemma 2

In a signaling game in which C1 holds, there cannot be an equilibrium in mixed strategies which satisfies the following conditions simultaneously:

  1. i.

    A set of sender′s types, \(I\subset T\), send a message, \(m\in M\), and another set, \(J\subset T\), send another message, \(m^{\prime } \in M\) , with positive probabilities, where \({t}_{n}\in I\), \(I\cap J=\varnothing\) and \(m \ne m^{\prime }\).

  2. ii.

    The receiver′s beliefs induced by \(m\), \({M}_{I}\left(t|m\right)\), first-order stochastically dominate the beliefs induced by the prior probabilities concentrated on \(I\), \({P}_{I}\left(t\right)\).

  3. iii.

    The receiver′s beliefs induced by the prior probabilities concentrated on \(J\), \({P}_{J}\left(t\right)\), first-order stochastically dominate the beliefs induced by \(m^{\prime }\), \(M_{J} \left( {t{|}m^{\prime }} \right)\).

This lemma allows us to restrict the number of equilibria in mixed strategies significantly. According to this result, any senders type will have incentives to send the same message as that sent by the highest type whenever that message increases the type expected by the receiver. In fact, part ii of C1 gives any senders type strong incentives to imitate the highest type because the receivers response to that type provides the action preferred by all senders types.

The next result is a summary of lemmas 1 and 2.

Theorem 1

In a signaling game in which C1 and C2 are satisfied, there will only be two types of PBE. First, there are N completely pooling PBE in pure strategies and in each equilibrium, all types send one of the messages from \(M=\left\{{m}_{1}{,m}_{2},\cdots ,{m}_{N}\right\}\) with probability one. Second, there will be equilibria in mixed strategies which do not satisfy the conditions of lemma 2.

This theorem shows that there are only completely pooling PBE in pure strategies. For example, imagine that we have a signaling game with \(T=\left\{\text{1,2},\text{3,4},\text{5,6},7,\text{8,9},10\right\}\), \(M=\left\{{m}_{1},{m}_{2},{m}_{3}\right\}\) and satisfying assumptions C1 and C2. In this game, there will be 3 pooling PBE in pure strategies: one in which all types send \({m}_{1}\), another in which all types send \({m}_{2}\) and the last in which all types send \({m}_{3}\). Additionally, there is a multiplicity of PBE in mixed strategies. However, lemma 2 discards some potential equilibria. For instance, there cannot be an equilibrium in which type 10 chooses \({m}_{1}\) and \({m}_{3}\) with positive probability, types 1–4 and 6–9 choose \({m}_{3}\) with probability one, and type 5 chooses \({m}_{2}\) and \({m}_{3}\) with positive probability. This equilibrium would satisfy all the conditions of lemma 2. Using the notation of that lemma, \(m={m}_{1}\), \(m^{\prime } = m_{2}\), \(I=\left\{10\right\}\), \(J=\left\{5\right\}\) and consequently, condition i of lemma 2 holds. After observing \({m}_{1}\), the receivers beliefs should be concentrated on type 10. As these beliefs coincide with the beliefs induced by the prior probabilities concentrated on \(I\), the former first-order stochastically dominate the latter and condition ii of lemma 2 is also satisfied. Finally, after observing \({m}_{2}\), the receiver should concentrate his beliefs on type 5. As these beliefs are equal to the beliefs induced by the prior probabilities concentrated on \(J\), the latter first-order stochastically dominate the former and condition iii of lemma 2 also holds. Thus, the result in question cannot be a PBE in mixed strategies.

Now, we introduce the single-crossing condition, which is a typical assumption in many applications:

C3

\(\overline{u }\left(\cdot \right)\) satisfies the single-crossing condition if when \(t < t^{\prime }: \hspace{0.2cm}\overline{u}\left( {t,m,r} \right) \le \overline{u}\left( {t,m^{\prime },r^{\prime }} \right)\) and \(m \le m^{\prime }\) imply \(\overline{u}\left( {t^{\prime },m^{\prime },r^{\prime }} \right) - \overline{u}\left( {t^{\prime },m,r} \right) \ge \overline{u}\left( {t,m^{\prime },r^{\prime }} \right) - \overline{u}\left( {t,m,r} \right)\) and strictness in either inequality implies \(\overline{u}\left( {t^{\prime },m^{\prime },r^{\prime }} \right) - \overline{u}\left( {t^{\prime },m,r} \right) > \overline{u}\left( {t,m^{\prime },r^{\prime }} \right) - \overline{u}\left( {t,m,r} \right).\)

C3 is equivalent to the generalized Spence-Mirrlees condition specified by Liu and Pei (2020), but assumption A4 of Cho and Sobel (1990) provides a different version of this condition. C3 implies that higher types are more willing to choose higher messages than lower types.

5 Equilibrium selection

In this section, we formally introduce the NPE as defined by Farrell (1993) and compare the equilibria selected by this criterion to those selected by divinity.

Definition 3

(Neologism-proof equilibrium). Given a PBE and the utility obtained by the t-type of sender in that equilibrium, \({\overline{u}}^{*}\left(t\right)\), a credible neologism is an out-of-equilibrium message, \(m\), such that there is a set of senders types, \(K\), that satisfies the following conditions simultaneously:

$${\overline{u}}^{*}\left(t\right)<\overline{u}\left(t,m,{r}_{m}\right) \forall t\in K, \forall {r}_{m}\in {BR}_{PB}\left(K,m\right)$$
(1)
$${\overline{u}}^{*}\left(t\right)>\overline{u}\left(t,m,{r}_{m}\right) \forall t\in T\setminus K,\forall {r}_{m}\in {BR}_{PB}\left(K,m\right)$$
(2)

A neologism-proof equilibrium is a PBE that has no credible neologism.

According to this definition, a PBE will pass this test as long as we cannot find a set of types, K, associated with an out-of-equilibrium message, m, that satisfies two conditions. First, when the receivers beliefs are induced by the prior probabilities concentrated on K, those senders types belonging to K prefer m to their equilibrium message. Second, given those beliefs, every type \(t\in T\setminus K\) would prefer the equilibrium message to m.

In order to compare this refinement with the divinity criterion or D1 in Cho and Kreps (1987) and in Cho and Sobel (1990), we include the next definition.

Definition 4

(Divinity) Let \((q,r,\mu )\) be an equilibrium and let \({\overline{u} }^{*}(t)\) be the equilibrium expected utility of the t-type of sender. Given an off-the-equilibrium-path signal, m, we define:

$$P\left(t|m\right)=\left\{r\in BR\left(T,m\right): {\overline{u} }^{*}(t)<\overline{u }(t,m,r)\right\}$$
(3)
$${P}^{0}\left(t|m\right)=\left\{r\in BR\left(T,m\right): {\overline{u} }^{*}(t)=\overline{u }(t,m,r)\right\}$$
(4)

where \(P\left(t|m\right)\) is the R′s set of best responses to m that incentivizes the t-type to defect and \({P}^{0}\left(t|m\right)\) is the set of best responses that gives that type the same utility as in equilibrium.

D1 rules out a disequilibrium message, m, of the t-type of sender if there exists t′ such that:

$$P\left(t|m\right)\cup {P}^{0}\left(t|m\right)\subset P\left({t}^{\prime}|m\right)$$
(5)

An equilibrium survives D1 if, for all off-the-equilibrium-path messages m, \(\mu \left(t|m\right)=0\) whenever (5) holds for some t′ such that \(P\left( {t^{\prime }{|}m} \right)\)\(\ne \varnothing\).

Now, we show that the set of NPE coincides with the set of equilibria that survives divinity in our class of games when C3 is satisfied.

Proposition 1

Under assumptions C1-C3, there is only a unique divine equilibrium in pure strategies in which all types of sender send \({m}_{N}\). This is also the unique NPE in pure strategies. Additionally, only those equilibria in mixed strategies in which some sender′s type sends \({m}_{N}\) with positive probability survive divinity and are NPE.

It is easy to understand the intuition behind this result. Under part i of C1, when the receiver perfectly identifies the highest type, he responds to \({m}_{N}\) with the best possible action. This assumption gives the highest type strong incentives to choose the highest message. Besides, part ii of C1 incentivizes other senders types to imitate the highest type. For those reasons, the most plausible equilibrium in pure strategies is that in which all types select the highest message and the most plausible equilibria in mixed strategies are those in which some type sends that message. Therefore, when assumptions C1-C3 hold, we select the same equilibria with divinity and with the criterion introduced by Farrell (1993).

As our class of games are monotonic, divinity is equivalent to universal divinity, never a weak best response and not vulnerability to credible deviations as defined by Esó and Schummer (2009). Thus, Farrells criterion also selects the same PBE as those refinements in our class of signaling games and the equilibria selected are strategically stable as defined by Kohlberg and Mertens (1986) because divinity is equivalent to strategic stability in monotonic signaling games (Cho and Sobel 1990). Finally, if we substituted assumptions C1 and C2 with a senders utility function which is strictly quasi-concave, a receivers utility function with strictly increasing differences and continuous utility functions with respect to m and a, divinity would still select the same equilibrium outcome under the single-crossing condition (Cho and Sobel 1990).

However, our final result shows that we do not need C3 in order to select the pooling equilibrium with the highest message with Farrells refinement in our class of signaling games, but all equilibria are divine when C3 is not satisfied.

Theorem 2

Under assumptions C1 and C2, the only NPE in pure strategies is the equilibrium in which all sender′s types choose \({m}_{N}\) and only those PBE in mixed strategies in which some type chooses \({m}_{N}\) with positive probability may be NPE.

Under our assumptions, there always exists a set of NPE. Specifically, we have proven that this refinement selects the same type of equilibria in our class of signaling games even without the single-crossing condition. In those equilibria, low types imitate the highest type by choosing the highest message. Hence, this theorem shows that we have characterized a general class of signaling games in which the concept of NPE introduced by Farrell (1993) is more useful than other criteria because it selects the most plausible equilibria in this class of games. Under our assumptions, all senders types have strong incentives to choose the same message as that chosen by the highest type. Moreover, all senders types are better off when they choose the highest message as long as the receiver responds to that message by using the prior beliefs. Consequently, the equilibrium in which all senders types choose the highest possible message is the most plausible result in our class of games and this is the outcome selected by Farrells criterion.

6 Economic application: educational signaling

In this section, we analyze an extension of the Spences model in which education contributes to increasing productivity and to opening new job opportunities. As the model belongs to our class of signaling games, an overeducation equilibrium will be the only NPE in pure strategies.

In this setting, there is a worker who has private information on his ability. That ability is equal to \({t}_{1}\) with probability \(p\) and \({t}_{2}\) with probability \(1-p\), where \({t}_{1}<{t}_{2}\). Using the usual notation, \(T=\left\{{t}_{1},{t}_{2}\right\}\). After observing her ability, the worker may choose one of the following educational levels: \(M=\left\{{e}_{1},{e}_{2}\right\}\), where \({e}_{1}<{e}_{2}\). Let us denote the workers cost of each level of education by \(c\left(t,e\right)\), which increases with the level achieved: \(c\left(t,{e}_{1}\right)<c\left(t,{e}_{2}\right) \forall t\in T\).

After observing the workers level of education, companies will compete for the worker a la Betrand and as a result, they pay a wage equal to the workers expected productivity. Those companies may offer two types of jobs. First, there are certain inferior jobs that do not require a high level of education and the workers productivity in that type of job is given by \({\pi }_{l}\left(t,{e}_{1}\right)\), which increases with her ability, that is, \({\pi }_{l}\left({t}_{1},{e}_{1}\right)<{\pi }_{l}\left({t}_{2},{e}_{1}\right)\). In this job, a higher level of education does not increase the workers productivity because the skills acquired with that level of education are not related to those required by the job, i.e., \({\pi }_{l}\left({t}_{1},{e}_{1}\right)={\pi }_{l}\left({t}_{1},{e}_{2}\right)\) and \({\pi }_{l}\left({t}_{2},{e}_{1}\right)={\pi }_{l}\left({t}_{2},{e}_{2}\right)\). Besides, the company may offer a second type of jobs that require a minimum level of performance. In particular, the company would be able to sell the products generated by a worker occupying this premium job as long as her productivity is equal to \(\overline{\pi }\), that may also measure the quality of the products produced by the worker. For this reason, if the workers level of productivity is lower than the required level, the company has to train her so that she can reach it. Before that training, the workers productivity in the premium job depends on her ability and educational level and is denoted by \({\pi }_{h}\left(t,e\right)>{\pi }_{l}\left(t,e\right) \forall t\in T, \forall e\in M\). Now, this productivity increases with the workers ability and with the level of education because some of the mental skills required by the job are acquired with education. Consequently, \({\pi }_{h}\left({t}_{1},e\right)<{\pi }_{h}\left({t}_{2},e\right) \forall e\in M\) and \({\pi }_{h}\left(t,{e}_{1}\right)<{\pi }_{h}\left(t,{e}_{2}\right) \forall t\in T\). As \(\overline{\pi }>{\pi }_{h}\left({t}_{2},{e}_{2}\right)\), the company should train even the high-ability worker with the highest level of education in the premium job.

Now, we introduce the key assumptions of the model. Firstly, when the worker occupies the premium job, she generates some positive synergy or externality in the company denoted by \(E\) and this is only known by the company. In order to simplify the model, we assume that the companys cost of training in the premium job is linear. Specifically, \(K:{\mathbb{R}}\to {\mathbb{R}}^{+}\) is the training cost function defined by:

$$K\left(x\right)=\left\{\begin{array}{ll}\kappa \cdot x&\quad if\quad x\ge 0\\ 0&\quad if\quad x<0\end{array}\right.$$
(6)

This function means that the training cost generated by increasing the workers productivity by \(x=\overline{\pi }-\pi >0\) in the premium job is proportional to that increase in productivity because \(\kappa\) is a constant parameter. In this context, we make the following assumption:

$$\frac{E}{\overline{\pi }-{\pi }_{h}\left({t}_{2},{e}_{1}\right)}<\kappa <\frac{E}{\overline{\pi }-{\pi }_{h}\left({t}_{2},{e}_{2}\right)}$$
(7)

This assumption implies that the company would only offer the premium job to the high-ability worker with the highest level of education if it had perfect information.Footnote 9 Other models, such as those introduced by Gibbons and Waldman (1999) and Liu and Pei (2020), also assume that the workers benefit from education increases with her talent.

Next, we add an assumption that guarantees that the company will offer the premium job when it observes the high level of education and is completely uninformed:

$$p<\frac{\frac{E}{\kappa }-\left[\overline{\pi }-{\pi }_{h}\left({t}_{2},{e}_{2}\right)\right]}{{\pi }_{h}\left({t}_{2},{e}_{2}\right)-{\pi }_{h}\left({t}_{1},{e}_{2}\right)}$$
(8)

This assumption will be satisfied if the positive externality generated by the worker in the premium job, \(E\), is sufficiently high, or if the prior probability of the low-ability worker is sufficiently low.

When the worker occupies the premium job, she is trained by the company and receives a wage equal to her productivity, \(\overline{\pi }\). Although companies compete for hiring the worker, they may differ with respect to the positive synergy generated by the worker in the premium job, in which case the company with the highest synergy may offer the worker a greater wage and will eventually hire the worker. In that case, \(E\) measures the difference between the synergy generated by the hiring company and that generated by the second best, which explains why the worker receives a wage of \(\overline{\pi }\) in the premium job after receiving the training.Footnote 10

In this section, we will analyze what happens in a non-selective or universal educational system in which the cost of an additional level of education does not increase with the workers ability. Specifically, we assume that \(c\left({t}_{1},{e}_{2}\right)-c\left({t}_{1},{e}_{1}\right)=c\left({t}_{2},{e}_{2}\right)-c\left({t}_{2},{e}_{1}\right)\). Hence, the single-crossing condition is not satisfied.

In our set-up, \(\overline{\pi }>{\pi }_{l}\left({t}_{2},{e}_{1}\right)+c\left(t,{e}_{2}\right)-c\left(t,{e}_{1}\right)\) and consequently, the high level of education is a highly effective signal of productivity because it allows the worker to access a well-paid job with training opportunities. This assumption is satisfied if the company sells the products generated by the worker in the premium job to consumers with a high willingness to pay for the product, or if the training course offered by the company is effective and increases the workers productivity in the premium job significantly. In both cases, \(\overline{\pi }\) will be sufficiently high and the assumption will hold.

It is easy to see that this model belongs to our general class of signaling games. For example, it is a monotonic game because the workers utility function increases with the wage received. Under the assumption described by condition (7), companies never offer the premium job to a worker with the low level of education because her productivity is too low. Therefore, companies will offer the inferior job to a worker with a low or a high level of education with a wage belonging to this closed interval, \(\left[{\pi }_{l}\left({t}_{1},{e}_{1}\right),{\pi }_{l}\left({t}_{2},{e}_{2}\right)\right]\), or they will offer the premium job with a wage equal to \(\overline{\pi }>{\pi }_{h}\left({t}_{2},{e}_{2}\right)>{\pi }_{l}\left({t}_{2},{e}_{2}\right)\), which implies that companies action set is equivalent to offering a wage belonging to \(\left[{\pi }_{l}\left({t}_{1},{e}_{1}\right),{\pi }_{l}\left({t}_{2},{e}_{2}\right)\right]\cup \left\{\overline{\pi }\right\}\), which is a partially ordered set with the usual order. The existence of increasing differences in the receivers utility function is trivially satisfied because the workers productivity increases with her type and for this reason, companies profit from paying a low wage to a worker increases with her productivity. As shown above, all the companys best responses exist and are wages equal to the workers expected productivity in the inferior job and \(\overline{\pi }\) in the premium job. Similarly, the optimal type of job offered by the company with perfect information is the premium job if the worker has a high ability and a high level of education and the inferior job otherwise. The assumptions shown above guarantee that the companys best response to the prior beliefs and the low level of education is the inferior job and a wage equal to the expected productivity, \({a}_{p}\left({e}_{1}\right)=\left\{\text{inferior job}, p{\pi }_{l}\left({t}_{1},{e}_{1}\right)+\left(1-p\right){\pi }_{l}\left({t}_{2},{e}_{1}\right)\right\}\), whereas the best response to the prior beliefs and the high educational level is the premium job and a wage equal to the required productivity in that job, \({a}_{p}\left({e}_{2}\right)=\left\{\text{premium job}, \overline{\pi }\right\}\). Given those optimal responses, \({a\left({t}_{1},{e}_{1}\right)\le a}_{p}\left({e}_{1}\right)\le a\left({t}_{2},{e}_{1}\right)\), \({a\left({t}_{1},{e}_{2}\right)\le a}_{p}\left({e}_{2}\right)\le a\left({t}_{2},{e}_{2}\right)\) and \({a}_{p}\left({e}_{2}\right)=a\left({t}_{2},{e}_{2}\right)\) as required in the general model.

In this simplified model, it is straightforward to prove that assumptions C1 and C2 hold because \(\overline{\pi }>{\pi }_{l}\left({t}_{2},{e}_{1}\right)+c\left(t,{e}_{2}\right)-c\left(t,{e}_{1}\right)\). In fact, the high wage received in the premium job implies that the worker will be better off with the high level of education in the premium job than with the low level of education in the inferior job and the best possible wage in that job. Therefore, the first part of assumption C1 is satisfied. As the premium job is optimally offered by the company only if the worker has a high ability and a high level of education, the low-ability worker has strong incentives to imitate the high-ability worker and the second part of assumption C1 also holds. Lastly, as formal education only increases the workers productivity when she occupies the premium job, the worker prefers the high level of education as long as it allows her to access the premium job. Similarly, the worker prefers the low level of education as long as it prevents the company from treating her as the low-ability worker in the inferior job. For those reasons, assumption C2 is satisfied.

To finish off this section, we obtain our last result, which is a direct application of the results obtained in the previous section.

Proposition 2

There are only two pooling PBE in pure strategies that survive divinity: One in which all workers′ types choose \({e}_{1}\) and another in which they choose \({e}_{2}\). In addition to this, there are PBE in mixed strategies in which both worker′s types choose \({e}_{2}\) with positive probability and they survive divinity. However, only the pooling PBE in pure strategies in which the worker chooses \({e}_{2}\) and the PBE in mixed strategies are NPE.

For the sake of space, we omit the proof of this result because it is a direct application of the results obtained in the general model.Footnote 11 In our model, the high-ability worker has strong incentives to choose the high level of education in order to access the premium job and receive a high wage. As the companys best response to the high level of education under the prior beliefs is offering the premium job and the high wage, the low-ability worker has also strong incentives to choose the high level of education. As a result, the pooling equilibrium in which the worker invests in the high level of education is the most plausible result, but standard refinements are not able to discard other equilibria in a universal or non-selective educational system because the single-crossing condition is not satisfied. For this reason, we apply the refinement introduced by Farrell (1993) that selects the most plausible equilibrium of this model. As shown in proposition 1, if we introduce a single-crossing condition by assuming that the cost of education decreases with the workers productivity, then divinity selects the same overeducation pooling equilibrium, which is the most plausible equilibrium in this model. However, we have shown that the equilibrium selection proposed by Farrell (1993) is the same with and without this condition in our class of signaling games, which suggests that the neologism proof equilibrium is more robust to the specification of the single-crossing condition than divinity in this class of games.

7 Conclusions

In this article, we have characterized a class of signaling games in which an oversignaling equilibrium is the most plausible outcome. In that equilibrium, low senders types have incentives to imitate the highest type by sending the highest possible message. As expected, this kind of behavior will appear when the highest message induces the receiver to choose the senders favorite action and the consequences of being treated as a weak type are sufficiently adverse.

In our class of signaling games, a multiplicity of pooling equilibria arises, but the equilibrium in which the sender chooses the highest message is more plausible than others. In those games, there is a certain alignment of the senders and the receivers preferences. Specifically, when the receiver responds to messages with his prior beliefs, all senders types prefer a particular message. Furthermore, when the receiver responds to that message against his prior information, he chooses the action most preferred by all types of sender. As a result, all senders types will have strong incentives to choose that particular message in order to keep the receiver uninformed. Despite the plausibility of this result in our class of games, when the single-crossing condition is not satisfied, divinity and other standard refinements cannot eliminate implausible outcomes, but the refinement introduced by Farrell (1993) is able to select that plausible equilibrium. Thus, this paper has identified a class of signaling games in which the notion of equilibrium introduced by Farrell (1993) is more useful than other refinements.

In the literature on refinements, we have shown that divinity is stronger than other criteria in our class of signaling games because it selects a subset of the equilibria selected by other refinements. However, the undefeated equilibrium suggested by Mailath et al. (1993) cannot be compared with divinity this way because, in that refinement, the receivers beliefs after an out-of-equilibrium message must be equal to the posterior beliefs prescribed by an alternative equilibrium in which that message is in the equilibrium path. In our class of games, there are multiple pooling equilibria and each of the out-of-equilibrium messages is an equilibrium message in a different PBE in pure strategies and in some equilibrium in mixed strategies. In this context, all the equilibria are undefeated because the out-of-equilibrium beliefs are consistent with those prescribed by Bayes rule in a PBE in mixed strategies and as a result, this refinement does not help to select the oversignaling equilibrium.

As an application of our general class of signaling games, we have analyzed an extension of Spences model in which the educational investment increases the workers productivity in the labor market, and the highest level of education allows the worker to access a premium job with training opportunities. As expected, when the cost of education does not depend on the workers ability and the potential wage received in the premium job with the highest level of education is sufficiently high, there are multiple pooling equilibria, but the outcome in which all workers types choose the highest level of education is the only neologism-proof equilibrium in pure strategies.Footnote 12