1 Introduction: KK and being in a position to know

Call the thesis, imprecisely stated, that knowledge iterates freely the KK-Principle. (KK) is the simplest formulation of the KK-Principle.

(KK) (S knows that p) ⊃ (S knows that (S knows that p)).

(KK) is widely taken to be untenable. It is standardly assumed that knowing that p entails believing that p (though see Radford (1966) and Myers-Schulz and Schwitzgebel (2013)). If so, then since, plausibly, S can know that p without believing that S knows that p (though see Greco (2014a, 174, fn17, 2014b)), (KK) will admit counter-instances. Similarly, if S must possess the concept of F to know that F(ai,aj…), then S may fail to know that S knows that p despite knowing that p due to failing to possess the concept of knowledge (Feldman (1981), Das and Salow (2016); though see Stalnaker (1999)). As a result, much discussion of the KK-Principle has instead focused on weaker variants of (KK), such as \( ({\text{KK}}^{-})\).Footnote 1

\( ({\text{KK}}^{-})\) (S knows that p) ⊃ (S is in a position to know that (S knows that p)).

\( ({\text{KK}}^{-})\) employs the notion of being in a position to know. No full analysis of this state is necessary for present purposes. However, we can assume, minimally, that being in a position to know that p is factive (i.e., that S is in a position to know that p implies that p) and strictly weaker than knowledge that p (i.e., that S knows that p implies that S is in a position to know that p but that S is in a position to know that p does not imply that S knows that p).

It has been argued, notably in Greco (2014a), that accounts of knowledge which involve essential appeal to normality are particularly conducive to defence of the KK-principle. This paper investigates the plausibility of that claim. It is argued, first, that Greco’s defence of the KK-principle depends upon an implausible assumption about the logical principles governing iterated normality claims. Once this assumption is dropped, counter-instances to the principle can be expected to arise. Second, it is argued that even if the assumption is maintained, there are other logical properties of normality which can be expected to lead to failures of KK.

Section 2 presents a version of Williamson’s margins-for-error argument against KK; Sects. 3, 4 discuss the defence of the KK-Principle due to Greco (2014a) and show that it is dependent upon the implausible assumption that the logic of normality ascriptions is at least as strong as K4; finally, Sects. 5, 6 argue that a weakened version of Greco’s constraint on knowledge is plausible and demonstrate that this weakened constraint will, given uncontentious assumptions, systematically generate counter-instances to the KK-principle of a novel kind.

2 Inexact knowledge and margins for error

Sometimes our knowledge of the world is inexact. Say that S has inexact knowledge of the value of a parameter x iff, for any value taken by x, there is some non-trivial constant size of interval centred on that value such that S knows that the value of x falls within the interval, but there is no subset of that interval such that S is in a position to know that x falls in that subset.

Inexact knowledge can arise in a variety of ways. Our information about some physical magnitude may depend upon a measuring device which issues reports with a non-trivial degree of inaccuracy. Alternatively, our best available judgement about a state of affairs may involve estimation, as when evaluating the size of crowd, or the number of coffee beans in a jar. Williamson (1992, 2000) has argued that the existence of known inexact knowledge is incompatible with (KK) (and, more generally, any acceptable precifisication of the KK-Principle). Call this the Margins-for-Error Argument.

Suppose that S has inexact knowledge of the temperature in a room. Whatever the value of the temperature, there is a non-trivial range such that S knows that the temperature falls within that range of its actual value. However, she is not in a position to know, of any value within that range, that it is not the value of the temperature in the room. Suppose that this range is ± 5 °F. Then for all n, if the temperature in the room is n °F, the strongest proposition S knows is that the temperature is between (n − 5) °F and (n + 5) °F. It follows, assuming closure, that if the temperature is n °F, then, for any n′ ≤ 5, S does not know that the temperature is not (n + n′) °F.

Assume that, by reflecting on the imperfections of her means of coming to know the temperature, S knows, for some 0 < i ≤ 5, that if the temperature is n °F, then she does not know that the temperature is not (n + i) °F. That is, she knows that any knowledge she has of the temperature is inexact.Footnote 2 Likewise assume that, due to her diligence, her knowledge is closed under entailment. While these assumptions may be idealisations for normal agents under normal conditions, they are harmless ones in the present context.

(KK) is inconsistent with S’s knowledge of her inexact knowledge of the temperature. Suppose that the temperature is 70 °F. Then S knows that the temperature is between 65 °F and 75 °F. Accordingly, by closure, she knows that the temperature is not 80 °F. By (KK), she knows that she knows that the temperature is not 80 °F. Furthermore, by contraposition of the above, S knows, for some 0 < i ≤ 5, that if she knows that the temperature is not (n + i) °F, then the temperature is not n °F. Hence, by closure, S knows that the temperature is not (80 − i) °F. By a second application of (KK), S knows that S knows that the temperature is not (80 − i) °F. Thus, by a second application of closure, S knows that the temperature is not (80 − 2i) °F… Repetition of this procedure will yield the conclusion that S knows that the temperature is not 70 °F. Yet by factivity, this is inconsistent with the original supposition that S does not have exact knowledge of the temperature. Accordingly, Williamson concludes that we must reject (KK) in its full generality.

The margins-for-error argument only directly supports rejection of (KK). However, Williamson suggests (2000, 115), it can be extended into an argument against \( ({\text{KK}}^{-})\) with the additional assumption that if S is in a position to know that p then S knows that p. That is, S acquires all knowledge she is in a position to acquire (making \( ({\text{KK}}^{-})\) and (KK) equivalent). Hence, if the margins-for-error argument succeeds in demonstrating the existence of counter-examples to (KK), it also succeeds in demonstrating the existence of counter-examples to \( ({\text{KK}}^{-})\) in those cases in which this further idealisation holds.

3 Knowledge and normality

The conclusion of the margins-for-error argument has been claimed (by e.g., Hawthorne and Magidor (2009), Stalnaker (2009) and Greco (2014a, b)) to be incompatible with the success of putative explanations which make essential appeal to the KK-principle in pragmatics, computer science, game theory and other areas. Insofar as one takes these concerns to be well-founded and yet seeks to preserve such explanations, there is reason to investigate alternative accounts of knowledge which validate some version of the KK-principle.

A number of defences of the KK-principle do so by appealing to an account of knowledge stated in terms of normality. Versions of this strategy can be found in Dretske (1981), Greco (2014a), Stalnaker (2015) and Goodman and Salow (2018). As the most explicit such defence of the KK-principle, I will focus on Greco’s version of the normality theory (though the concerns generalise to the other accounts). Where C is a schematic variable ranging over agents’ total cognitive states, Greco’s normality theory can be formulated as (NTK):Footnote 3

(NTK) S knows, in C, that p iff

(i.) Normally ((S is in C) ⊃ p); and

 

(ii.) Conditions are normal.

For present purposes, an agent’s cognitive state can be thought of as the minimal state of affairs comprising all and only those states of the agent to which her acquisition of knowledge is sensitive. For example, it may include, but need not be limited to, facts about her evidence, the concepts in her possession, the methods of belief formation available to her, &c. Construed in this way, (NTK) will not offer a non-circular analysis of knowledge, but might nevertheless be hoped to generate informative predictions regarding, e.g., structural properties of knowledge.

Like Dretske (1981) and Stalnaker (2015), Greco motivates (NTK) by appeal to the notion of a state ‘carrying’ information. Under a popular account of this notion, a system carries the information that p in a given state iff normally, if the system is in that state, p is true (see, e.g., Stampe (1977), Dretske (1981), Millikan (1984) and Stalnaker (1999), a.o.)Footnote 4 On such accounts, (NTK.i) (as a necessary condition on knowledge) follows directly from the bridge principle that S knows that p only if her cognitive state carries the information that p. (NTK) can then be restated as the theory that S knows that p iff S is in a cognitive state which carries the information that p and conditions are normal.

Greco argues that (NTK) entails (KK). We can consider an informal instance of this argument, before stating it precisely in more general terms. Suppose that S knows that the temperature is between 65 and 75 °F. Then, by the L-to-R direction of (NTK): (a) conditions are normal and (b) S is in a state C such that normally, if S is in C then the temperature is between 65 and 75 °F. Next, by the R-to-L direction of (NTK), it is sufficient for S to know that S knows that the temperature is between 65 and 75 °F that: (a′) conditions are normal; and (b′) S is in some state C* such that normally, when S is in C*, (a) and (b) obtain. Yet, Greco claims, C itself is such a state. That is, he claims it follows from (b) that normally, if S is in C, (b) obtains. Thus, (b) entails (b′), under the assumption that, normally, conditions are normal. Since (a) = (a′), (a) trivially entails (a′). If this is correct, then S’s satisfaction of the necessary conditions for knowledge that p is sufficient for S’s satisfaction of the sufficient conditions for knowledge that S knows that p. If S knows that p then S knows that S knows that p. Accordingly, Greco concludes, (NTK) entails (KK).

Formulating the argument in greater generality, let ■φ be the proposition that normally φ and □φ the proposition that necessarily φ. We assume that ■ is weaker than □; that is, ⊨□φ ⊃ ■φ. Let CS be the proposition that S is in C.Footnote 5 Let KSp be the proposition that S knows that p. (∀p (p ⊃ ¬■¬p)) is the proposition that conditions are normal. In a Kripke semantics, if N(w) is the set of worlds normal at w, w⊨(∀p (p ⊃ ¬■¬p)) iff w∈N(w). Greco’s argument assumes the principle (SR)—that is, the principle that normally, conditions are normal.

(SR) ■(∀p (p ⊃ ¬■¬p)).Footnote 6

Finally, we assume the logic for normality is a normal modal logic. We can restate (NTK) as follows:

(NTK) □(Cs ⊃ (KSp ≡ (■(CS ⊃ p) ∧ ∀p (p ⊃ ¬■¬p)))).

Assuming CS and KSp, Greco argues KS(KSp) can be derived as follows:

(P1)

KSp

(premise)

(P2)

(KSp ≡ (■(CS ⊃ p) ∧ ∀p (p ⊃ ¬■¬p)))

(from CS, (NTK), ⊨□φ ⊃ φ, by MP)

(P3)

■(CS ⊃ p)

(from (P1),(P2), by MP, ∧E)

(P4)

p (p ⊃ ¬■¬p)

(from (P1),(P2), by MP, ∧E)

(P5)

KS(KSp) ≡ ■(CS ⊃ (■(CS ⊃ p) ∧ ∀p (p ⊃ ¬■¬p))) ∧ ∀p (p ⊃ ¬■¬p)

(from (P2), (NTK))Footnote 7

(P6)

KS(KSp) ≡ ■(CS ⊃ ■(CS ⊃ p))

(from (P4),(P5),(SR))

(P7)

■(CS ⊃ ■(CS ⊃ p))

(from (P3), by ■(φ ⊃ ψ)⊨■(φ ⊃ ■(φ ⊃ ψ)))

(C)

KS(KSp)

(from (P6),(P7), by MP)

Substituting KSp/p in (P2) yields (P5). From the assumption that KSp and (SR), we know that ■∀p (p ⊃ ¬■¬p) and ∀p (p ⊃ ¬■¬p). Thus, (P5) simplifies to (P6) by ∧-elimination and: ■(φ ⊃ (ψ∧χ)), ■χ⊨■(φ ⊃ ψ). Yet, Greco claims, ■(φ ⊃ ψ)⊨■(φ ⊃ ■(φ ⊃ ψ)). Thus, from (P3), it follows that ■(CS ⊃ ■(CS ⊃ p)). So, by the R-to-L direction of (P6), KS(KSp). Having derived KS(KSp) from KSp (and auxiliary assumptions), Greco concludes (NTK) entails the KK-principle.

4 Iterated normality

The crucial step in Greco’s argument is the derivation of (P7) from (P3). This inference depends upon the validity of the schema (CI) (for ‘Crucial Inference’):

(CI) ■(φ ⊃ ψ)⊨ ■(φ ⊃ ■(φ ⊃ ψ))

In the informal statement of the argument in Sect. 3, (CI) was required infer from (b) (i.e., the claim that normally, if S is in C then the temperature is between 65 and 75 °F), that normally, if S is in C, then (b) obtains. (CI), in turn, entails the axiom schema 4 of modal logic.Footnote 8

(4) ■φ⊨■■φ.

(4) says that what is normal is normally normal—normality iterates freely. This corresponds to the constraint that N be transitive; for any w, if w′∈N(w), then N(w′) ⊆ N(w). In order to evaluate the acceptability of Greco’s argument then, we need to assess the plausibility of (4) for ■.

Most extant logics of normality either validate the schema (Smith (2008) and Boutilier (1994)), do not permit iterated normality statements (Veltman (1996)) or have trivial logics of iterated normality (Delgrande (1987) and Asher and Morreau (1995)). Nevertheless, cases such as the following suggest there is good reason to reject it.

Suppose that Jaime has a relatively regular schedule; normally, she arrives home from work between 5.00 pm and 6.00 pm, though it would not be abnormal for her to arrive home at any time between 5.00 pm and 6.00 pm. Furthermore, she normally does not come close to arriving home abnormally early or abnormally late. For any time, normally, if she arrived home at that time, it would not have been abnormal for her to have arrived up to 5 min earlier or up to 5 min later.

Jaime’s situation appears perfectly coherent. Yet it involves a violation of the principle that normality iterates freely. Since it would not be abnormal for her to have arrived home at any time between 5.00 pm and 6.00 pm, there are normal situations in which she arrives home at 1 min before 6 pm. Yet, normally, whenever she arrives home, it would not have been abnormal for her to arrive up to 5 min later. Thus, in a normal situation in which she arrives home 1 min before 6 pm, it would be normal for her to have arrived home 1 min after 6.00 pm. Thus, despite the fact that she does not normally arrive 6.01 pm, it is not the case that normally, she does not normally arrive at 6.01 pm.

We can model Jaime’s situation as follows: Let wn.nn be the world at which Jaime arrives at n.nnpm. Then, suppose that w5.30 is a world at which (a)–(c) obtain.

  1. (a)

    Normally, Jaime arrives between 5.00 pm and 6.00 pm;

  2. (b)

    For any time between 5.00 pm and 6.00 pm, it would not be abnormal for Jaime to arrive at that time;

  3. (c)

    Normally, Jaime does not come within 5 min of arriving abnormally early or abnormally late.Footnote 9

By (b), w5.59 is accessible from w5.30 (since otherwise, it would be abnormal for Jaime to arrive at 5.59 pm). Yet, since w5.59 is normal relative to w5.30, by (c), w6.01 is accessible from w5.59 (since otherwise, Jaime would not normally not come within 5 min of arriving abnormally late). Yet, by (a), w6.01 is inaccessible from w5.30. Thus, in order for (a)–(c) to be satisfied at w5.30, the relevant accessibility relation must be non-transitive.

On the assumption that what is normal is a contingent matter, failures of normality to iterate appear unsurprising. Plausibly, facts about what is normal depend on other, non-modal facts (e.g., statistical facts, facts about evolutionary history, &c.). Furthermore, it is plausible that the facts on which facts about normality depend can vary, both across normal worlds and between the actual world and worlds which are actually normal. For example, facts about when Jaime normally arrives home can be expected to depend (in part) on facts about when she in fact arrives home. If, over a sufficiently long period of time (and holding all else fixed), Jaime arrives home later in w′ than she does in w, then the latest she would normally arrive home in w′ can be expected to be (marginally) later than the latest she would normally arrive home in w. A little lateness can make lateness the new normal. Yet, if Jaime’s arrivals in w′ would not have be abnormal in w, then, in w, there will be some times at which it would be abnormal for Jaime to arrive home, but which would be normal in normal conditions.

As long as failures of (4) are permitted, we can construct counterexamples to (KK) while assuming (NTK). Suppose that, at the actual world, S is in a cognitive state C such that normally, if S is in C then the temperature is between 65 and 75 °F and that normally, if S is in C then it would not be abnormal for the temperature to be 1 degree cooler or hotter. Suppose, additionally, that conditions are normal (meaning that the temperature must actually be between 66 and 74 °F). By (NTK), S knows that the temperature is between 65 and 75 °F. Nevertheless, there is some normal world at which S is in C, the temperature is 65 °F and at which it is not abnormal for S to be in C and for the temperature to be 64 °F. Hence, by (NTK), at that world, S does not know that the temperature is between 65 and 75 °F. Thus, it is not the case that in the actual world, at all normal worlds in which S is in C, S knows that the temperature is between 65 °F and 75 °F. So, by (NTK), at the actual world, S knows that the temperature is between 65 °F and 75 °F but does not know that she knows that the temperature is between 65 °F and 75 °F.Footnote 10

5 Knowledge in abnormal conditions

Section 4 argued that (4) was too strong a constraint on the logic of normality. Some abnormal worlds are not normally abnormal. As a result, it was argued, counter-instances to (KK) should be expected under (NTK). Yet, even in the absence of failures of (4), there is reason to think that normality-based accounts of knowledge are less conducive to a defence of the KK-principle than has generally been thought. In the remainder of the paper, we will turn our attention to a less contentious logical property of normality and show how it also creates problems for the free iteration of knowledge.

Conditions are sometimes abnormal. The logic of normality must, as such, be weaker than KT; some worlds do not count as normal by their own standards. This corresponds to the constraint that N be non-reflexive; for some w, w∉N(w). Sections 5, 6 argue that regardless of whether (4) is accepted, failures of the T axiom schemaFootnote 11 make (NTK) implausibly strong, and, furthermore, that such failures can be expected to yield counter-instances to (KK) on the theory’s most natural weakening.

(NTK.ii) entails that if conditions are abnormal, then, no matter what cognitive state S is in, there is no p such that she knows that p. Any actual abnormality implies actual global scepticism for Greco (though not necessary global scepticism). Since the actual world is in fact admirably abnormal in numerous ways, this is an especially serious concern. A number of revisions to (NTK) are available which would allow the theory to avoid this problem. However, each revision faces serious problems of its own.

One response would be to index the relevant normality claims in (NTK) to the cognitive state of the individual. Let ■ψφ be the proposition that φ is true in all worlds normal with respect to ψ. The current proposal is that each instance of ■ is replaced with ■Cs.Footnote 12 That is, S knows that p when in C iff in all worlds normal with respect to S being in C, if S is in C, then p, and conditions are normal with respect to S being in C. Clearly, under this revision, S can know that p when in C despite conditions being abnormal, as long as conditions are nevertheless normal with respect to S being in C. A version of this strategy is considered in passing by Greco (2014a, 181, fn35), possibly motivated by this concern. Stalnaker’s proposal (2006, 2015) that normality be indexed to particular sources of information can also be seen as a variant of the same strategy.

However, while this will avoid the immediate problem that any actual abnormality implies global scepticism, it remains too strong. Abnormal conditions can be beneficial to the acquisition of knowledge. Suppose that, while in C, S finds herself in an environment which is more epistemically hospitable than is normal (with-respect-to-CS). Plausibly, this will be to her epistemic advantage—acquisition of knowledge should, if anything, be easier. Yet, on the proposed revision, any abnormality (with-respect-to-CS) is incompatible with S possessing knowledge while in C.

For example, imagine an animal which has evolved an acute sense of smell in order to navigate the subterranean environment it normally inhabits. Under normal conditions, below ground, its olfactory system is subject to a non-negligible degree of noise. When the same system is employed above ground (i.e., in abnormal conditions with respect to cognitive states produced by the system) it is subject to a lesser degree of noise (though the states it produces are of the same kind). Intuitively, if the animal can acquire knowledge via its olfactory system below ground, it can acquire knowledge via the same system above ground at least as easily. Yet, since conditions above ground are abnormal with respect to the cognitive state it is in (since normally, when in that state, it is below ground), any acquisition of knowledge in such conditions is predicted to be impossible.

Or, consider a measuring device designed for use in environments with a high level of interference (e.g., a radio receiver designed for use in areas of high electromagnetic interference). Under normal conditions for its use, it generates inexact information about the parameter it measures; reports on the value of the parameter are subject to a risk of error; however, there will be (abnormal) conditions of lower interference in which its reports are subject to a less substantial risk of error. Intuitively, it should be easier to acquire knowledge of the value of the relevant physical parameter in the latter case than in the former. Yet this is not what is predicted under the proposed revision.

Note that it need not be claimed that abnormally hospitable epistemic conditions are always knowledge-conducive. If a system is normally wholly unreliable, it may never be capable of generating knowledge, even if its present behaviour in fact abnormally reliable. That is, there may be a difference in kind between a normally accurate system functioning with abnormally high accuracy, and a normally inaccurate system functioning with the same level of abnormally high accuracy.

Rather than indexing, the proponent of (NTK) might alternatively appeal to the apparent context-sensitivity of normality ascriptions.Footnote 13 As Dretske (1981) and Stalnaker (2015) observe, standards of normality appear capable of variation across contexts. For example, the range within which the acidity of a cup of coffee must fall to qualify as normal for Colombian medium roast will plausibly vary depending on whether the assessor is a distributor selecting beans for a high-end blend or a non-connoisseur evaluating whether her stock of beans has expired.

Dretske appeals to the context-sensitivity of normality ascriptions to explain how knowledge can be attained by agents whose cognitive states are compatible with sceptical alternatives. S may know that p when in C, despite being unable to rule out ¬p scenarios on the basis of C, as long as, given the contextually determined standards of normality, all CS∧¬p-worlds are abnormal. For S to know that p when in C, the standards of normality must be sufficiently high that they do not categorise any CS∧¬p-world as normal.

In contrast, the present sceptical concern does not arise from an inability to rule out sceptical alternatives. According to (NTK), knowledge that p requires the world of evaluation to be normal. Thus, for S to know that p, the standards of normality must be sufficiently low that they do not categorise the world at which S is located as abnormal.

The problem is that these two anti-sceptical requirements impose conflicting demands on context. The elimination of sceptical alternatives requires the standards determined by context to be sufficiently high for sceptical scenarios to qualify as abnormal. In contrast, the satisfaction of (NTK.ii) requires the standards determined by context to be sufficiently low for the world of evaluation to qualify as normal.

In any world which is itself less normal than the most normal sceptical alternative worlds, these two conditions will be incapable of being jointly satisfied. Yet, plausibly, where the kind of abnormality exhibited by the world of evaluation is conducive to true belief formation (as discussed above), a degree of abnormality at that world which is (arbitrarily) higher than the abnormality of the most normal sceptical alternative world is consistent with knowledge.

Eliminating (NTK.ii) would leave (NTK.i) as a sufficient condition on knowledge that p (conditional on CS). (NTK.i) is clearly too weak for this role, since it does not even entail factivity. Since conditions are sometimes abnormal, (NTK.i) can be satisfied despite p being false.Footnote 14 In summary, (NTK.ii) is too strong as a necessary condition. Yet without (NTK.ii), (NTK.i) is too weak as a sufficient condition by itself.

However, the prospects of identifying an interesting principle relating normality and knowledge should not be dismissed outright. In particular, adopting (NTK.i) as a merely necessary condition on knowledge avoids each of these issues. Call the resulting thesis the Weak Normality Condition on knowledge:

(WNC) S knows, in C, that p only if Normally(CS ⊃ p).

(WNC) states that S knows, in C, that p only if in all normal worlds in which S is in C, p is true. That is, S does not know that p if it would not be abnormal for S to be in the state she is in and p be false. (WNC) is asymmetrically entailed by (NTK). However, it avoids each of the above problems. For all that has been said thus far, it is a potential necessary condition on knowledge. Indeed, there are at least some theoretical and abductive reasons in favour of adopting (WNC).

Most significantly, (WNC) is the weakest condition on knowledge compatible with the requirement that S knows that p only if her cognitive state carries the information that p. If a system carries the information that p iff it is in a state which normally implies p, and S knows that p only if she is in state which carries the information that p, then S knows that p only if she is in a state which normally implies p. To the extent this account of carrying information supports (NTK), it also supports (WNC). Indeed, (WNC) follows on the weaker condition that a system in given state carries the information that p only if it is in a state which normally implies p. This appears prima facie reasonable. If it wouldn’t be abnormal for the system to be in its present state and p be false, then the information its present state carries cannot include p.

(WNC) also derives abductive support from its ability to explain judgements about certain minimal pairs. Nelkin (2000) imagines a computer screen whose background color is selected at random each time it is switched on. On any given occasion, there is a .99999999 chance that the color selected is blue, and a .00000001 chance that the color selected is red. Without having seen the color of the screen, Nelkin suggests, it is not possible for an agent to come to know that it is blue on the basis of probability alone, even if the screen is in fact blue (imagine that the agent and screen are in separate rooms). However, if an agent enters the room and sees the red screen, it appears possible for her to come to know it is red (assuming that it is, in fact, red). In particular, this is possible in spite of the fact that the likelihood having a (hallucinatory) appearance as of a red screen when the screen is in fact blue will be higher than the likelihood of the screen being red.Footnote 15

As has been noted by, e.g., Smith (2010), facts about normality can easily explain the contrast between these cases.Footnote 16 Nothing abnormal would have to happen for the screen to be red. As such, (WNC) predicts that the agent without direct evidence of the color of the screen is not in a position to come to know that it is blue on the basis of the probabilities alone (even if it is, in fact, blue). It would be unlikely, but not abnormal, for her to be in the cognitive state she is in and for the screen be red. However, it would be abnormal for an agent to have an appearance as of the screen being red when it is in fact blue. (WNC) precludes attribution of knowledge of the screen color in former, but not the latter case (regardless of the low probability of a red screen). As such, the condition appears well-suited to explain otherwise problematic judgements. In line with the theoretical motivation for (WNC), it is plausible to suppose that the relevant difference between the agents in the two cases is that the latter, but not the former, is in a state which carries (non-probabilistic) information about the colour of the screen.

6 Higher-order ignorance inside the margins

(WNC) constitutes a more plausible implementation of the normality-based picture of knowledge underlying (NTK). (WNC) is the weakest principle entailed by Greco’s primary argument for (NTK) and avoids the latter’s sceptical implications. However, failures of the KK-principle can be expected to arise from (WNC) in cases in which agents are subject to systematic gettierization.

Suppose that S comes to truly believe that the time is 8am as a result of consulting a clock which stopped precisely 12 hours prior. Under these conditions, S’s belief that the time is 8am is gettierized; it is merely a matter of luck that S’s belief is true. Gettierized belief that p is (i.) factive and (ii.) incompatible with knowledge that p.

Say that S is systematically gettierized when in C with respect to p iff normally, if S is in C, then S believes that p and S’s belief that p is gettierized. Systematic gettierization is unusual, but does not appear to be in any way incoherent. However, given (WNC), counter-examples to (KK) and \( ({\text{KK}}^{-})\) arise naturally from cases of systematic gettierization. I will first present the abstract structure of such cases before considering concrete examples instantiating this structure.

Suppose that S is systematically gettierized when in C with respect to p. Since gettierization is factive, S satisfies (WNC) with respect to p. Normally, if she is in C, p is true. It is consistent with S being systematically gettierized that she is in C, believes that p and her belief is not gettierized—sometimes, conditions are abnormal. In such cases, since she is not gettierized and satisfies (WNC), under the assumption that any further conditions on knowledge are satisfiable, there appears prima facie reason to think that she can know that p. Nevertheless, she cannot know that she knows that p. Since gettierization is incompatible with knowledge, normally, if she is in C, she does not know that p. Hence, by (WNC) she is not in a position to know that she knows that p. \( ({\text{KK}}^{-})\) fails. Assuming that knowing that p entails being in a position to know that p, (KK) also fails.

Such counter-examples are compatible with the assumption that normality iterates freely. Instead, they depend on the fact that a world may fail to be normal by its own standards. An agent may be systematically gettierized without being in fact gettierized. We can consider a pair of concrete cases exemplifying this structure:

Industrial Saboteurs.

Suppose that it is Xander’s job to check the pressure in the water cooler of a nuclear power plant. Every morning, Xander checks a gauge at his desk which is connected to the cooler and fixes the conditions in the plant relative to its pressure. Unbeknownst to Xander, a saboteur, Yasmin, arrives before him in the morning and tampers with the gauge so that it reports the wrong pressure. Luckily, though also unbeknownst to Xander, a counter-saboteur, Zadie, manually changes the pressure in the cooler so that it matches the report Yasmin has set on the gauge.

However, suppose that one day, both Yasmin and Zadie oversleep and the gauge remains untampered with, reliably reporting the pressure in the cooler as it is designed to do. Xander checks the gauge and forms a true belief about the pressure inside the water cooler.

Building Facades.

Suppose that Ava has recently moved to a new city. Unbeknownst to her, every building in the city is currently undergoing renovation, and is hidden behind a façade. However, each façade is designed to perfectly match the exterior of the building behind it. Whenever Ava sees a building, her beliefs about its appearance are based on its façade, which she falsely believes to be its exterior.

However, suppose that one day, the façade is temporarily removed from the building opposite Ava’s house. She sees its exterior directly and forms a true belief about the building’s appearance.

Xander’s beliefs about the pressure in the water cooler are systematically gettierized. Normally, when he forms a belief about the pressure, his belief is gettierized. However, on the day on which Yasmin and Zadie oversleep, it appears reasonable to judge that, upon consulting the report on the gauge, Xander knows the pressure in the cooler. In particular, Xander’s belief about the temperature satisfies (WNC) and is not gettierized. However, by (WNC), he is not in a position to know that he knows, since normally, his belief would be gettierized and hence fail to constitute knowledge. Hence, if Xander possesses first-order knowledge about the pressure, he is a counter-instance to the KK-Principle.

Ava’s beliefs about the appearance of buildings in her town are likewise systematically gettierized. Normally, when she forms a belief about the appearance of a building in the city, her belief is gettierized. However, when the façade on the building opposite her house is removed, it appears reasonable to judge that she can come to know what its exterior looks like. Yet by (WNC), she is not in a position to know that she knows, since normally, her beliefs about its appearance (along with the appearance of other buildings in the city) would be gettierized and hence fail to constitute knowledge. Hence, if Ava possesses first-order knowledge of the appearance of the building, she is, likewise a counter-instance to the KK-principle.

Judgements here are by no means conclusive. The cases are primarily intended to function as concrete examples of the theoretical features which allow for failures of higher-order knowledge under (WNC), even when normality is assumed to iterate freely. Accordingly, there is still potential for a proponent of (WNC) to preserve the KK-principle by resisting the attribution of first-order knowledge to any agent who is systematically gettierized. Before concluding, it will be worthwhile to briefly consider the prospects for this form of response.

The KK-principle sympathiser who accepts (WNC) must deny that S can know, in C, that p if in all normal CS-worlds it is merely a matter of luck that p; that is, they must deny that systematic gettierization is compatible with first-order knowledge. An analogous concern is raised by Stalnaker (2015) regarding safety. Stalnaker suggests that S’s belief, in C, that p, is no better if in all nearby CS-worlds, p could easily have been false, than it would be if in some nearby CS-worlds, p were in fact false.Footnote 17 If it was merely a matter of luck that one’s success was not merely a matter of luck, one is no better off (Stalnaker claims) than if one’s success was merely a matter of luck. The present objection would correspondingly hold that S’s belief, in C, that p, is no better if in all normal CS-worlds, p could easily have been false than it would be if in in some normal CS-worlds p were in fact false. If one’s success is normally merely a matter of luck, one is no better off than if it would not be abnormal for one to be unsuccessful. However, accepting the latter as a constraint on knowledge has some seemingly undesirable consequences. In particular, it implies that agents are never in a position to know precisely what could easily be the case.

Consider the following simple model of the kind employed in Williamson (2013) and Goodman (2013), and which is well-suited to our current purposes. A world in the model is a pair of real numbers, ⟨ e, f ⟩. Informally, a world ⟨ e, f ⟩ represents two pieces of information: (i.) the real value, e, of some physical parameter, and (ii.) the value, f, of the parameter as represented by some system. For present purposes, the second element of each point can be identified with the (class of) state(s) of a cognitive system which represent the parameter as having the relevant value.

Let N(⟨ e, f ⟩) be the set of normal worlds at ⟨ e, f ⟩ in which the system is in the state f. Let R(⟨ e, f ⟩) be the set of worlds which could easily obtain at ⟨ e, f ⟩. We assume (potentially idealizing) that what is normal given that the system is in the state f is determined by the value of f. That is, there is some i  > 0, such that N(⟨ e, f ⟩) = {⟨ e′, f ⟩: |f − e′ | ≤ |f − i|}. The normal worlds at ⟨ e, f ⟩ are worlds at which the value of the parameter does not differ from its apparent value by more than i. Likewise we assume that what could easily be the case at ⟨ e, f ⟩ is determined by e. That is, there is some j  > 0, such that R(⟨ e, f ⟩) = {⟨ e′, f ⟩: |e − e′| < |e − j|}. The nearby worlds at ⟨ e, f ⟩ are the worlds at which the value of the parameter does not differ from e by more than j. Let Easy(k) be the proposition: {⟨ e, f ⟩: ⟨ k, f ⟩ ∈ R(⟨ e, f ⟩); that is, Easy(k) is the set of worlds at which k could easily be the value of the parameter. We can consider two constraints on KSp, the set of worlds at which S knows that p:

  1. I.

    If ⟨ e, f ⟩ ∈ KSp, then N(⟨ e, f ⟩) ⊆  p.

  2. II.

    If ⟨ e, f ⟩ ∈ KSp, then R(⟨ e, f ′⟩) ⊆  p, for each ⟨ e, f ′ ⟩ ∈ N(⟨ e, f ⟩).

(I) and (II) correspond to (WNC) and the proposed further constraint, respectively. The former says that S can know that p only if p is true at all the normal worlds where the system is in f. The latter says that S can know that p only if p is true at all of the worlds near to those worlds. Yet, it follows from (II) alone that S is never in a position to gain precise knowledge of what the value of the physical parameter could easily be. That is, for all k, KS(Easy(k)) = ∅.Footnote 18

To see why, in intuitive terms, consider the diagram above. Worlds, as pairs of reals, can be identified with points on a plain. N(⟨ e, f ⟩) and R(⟨ e, f ⟩) can be identified with line segments (since their elements differ only with respect to e). The solid line in Fig. 1 corresponds to N(⟨ e, f ⟩). The dashed lines correspond to the sets of worlds which could easily be the case at the minimal and maximal worlds in N(⟨ e, f ⟩), respectively. The dotted lines correspond to the worlds which could have easily been the case at the minimal and maximal worlds in those sets, respectively (for comprehensibility, line segments are separated on the vertical axis, though, clearly, they will in fact share the same y-co-ordinate). As can be seen, there is no world which: (i.) could easily have been the case at the minimal world easily the case at the minimal N(⟨ e, f ⟩)-world, and also (ii.) could easily have been the case at the maximal world easily the case in the maximal N(⟨ e, f ⟩)-world. Thus, there will be no k such that, for every N(⟨ e, f ⟩)-world, Easy(k) is true throughout the worlds easily the case at that world. As such, by (II), there is no k such that KS(Easy(k)).

Fig. 1
figure 1

Diagrammatic representation of proof in footnote 18

Yet, it is frequently possible for us to know that there is some precise value a parameter could easily take, even if it is not possible for us to know the precise value it in fact takes. For example, consider a set of scales which are subject to an error of ± .5 kg. Suppose that the scales display 100 kg, and the object on the scales in fact weighs 100 kg. It appears plausible that it is possible to know, under these conditions, that the object could easily be 100 kg (even if one is not in a position to know that it is in fact 100 kg). Yet the proponent of the revised principle must deny this. In order to preserve the KK-principle under (WNC) the defender of the normality-based approach is forced to deny the possibility of precise knowledge of what could easily be the case.Footnote 19

7 Conclusion

The structural properties of normality have been widely assumed to make it well-suited to defence of the KK-principle. This paper has questioned this assumption in two regards. First, it was shown to depend upon the tacit assumption that normality iterates freely, an assumption which, it was claimed, is untenable. Second, it was argued that the most plausible way of formulating the relation between knowledge and normality can be expected to give rise to counter-instances to the principle, even on the assumption that normality iterates freely.

The kind of counter-instances generated by (WNC) are independent of assumptions about inexact knowledge or corresponding margin-for-error constraints (although (WNC) remains compatible with such constraints). The latter forms of counter-instance can arise if an agent’s knowledge that p depends upon p’s truth at some set of appropriately related worlds, where the relation in question is non-transitive. In contrast, the form of counter-example discussed in Section 5 arises due to the non-reflexivity of the relation in question. Where the set of appropriately related worlds need not include the world of evaluation, an agent can know that p despite her belief that p possessing a property in all accessible worlds which is factive yet incompatible with knowledge that p (for example, gettierization).

It is for this reason that conditions on knowledge stated in terms of normality can be expected to generate counter-instances to the KK-principle which do not arise from conditions stated in terms of relations such as modal closeness, such as safety. Every world is close to itself, but many worlds classify themselves as abnormal.