Keywords

1 Introduction

Sentence (1) has a distributive reading (see (1a)) and a cumulative reading (see (1b)). This paper focuses on its cumulative reading.

figure a

Our intuition for the cumulative reading of (1) crucially involves the notion of maximality. As described in (1b), the two modified numerals (see the underlined parts of (1)) denote and count the totality of boys who saw any movies and the totality of movies seen by any boys.

In Brasoveanu (2013)’s compositional analysis of the cumulative reading of (1), two mereology-based maximality operators are applied simultaneously (at the sentential level) to derive the truth condition that matches our intuition.

In this paper, I further investigate the nature and source of this maximality. In particular, I follow Krifka (1999) to show that there are natural language cumulative-reading sentences that cannot be naturally interpreted with mereological maximality.

In a nutshell, I propose that (i) although the cumulative reading of (1) involves multiple modified numerals, it actually does not involve multiple independent maximality operators, but only one, and (ii) this maximality operator is not necessarily mereology-based, but rather informativeness-based, with regard to the resolution of a contextually salient degree QUD (Question under discussion). Thus Brasoveanu (2013)’s analysis for (1b) can be considered a special case within a more generalized theory on maximal informativeness.

The rest of the paper is organized as follows. Section 2 presents Brasoveanu (2013)’s mereological-maximality-based analysis of cumulative-reading sentences like (1). Then Sect. 3 presents Krifka (1999)’s discussion on a case that challenges a direct extension of Brasoveanu (2013)’s analysis. In Sect. 4, I propose to adopt the notion of QUD-based maximality of informativeness and show how this new notion of maximality provides a unified account for the data addressed by Brasoveanu (2013) and Krifka (1999). Section 5 compares the current QUD-based view with von Fintel et al. (2014)’s entailment-based view on maximality of informativeness. Section 6 further shows a wider empirical coverage for the notion of QUD-based maximality of informativeness. Section 7 concludes.

2 Brasoveanu (2013)’s Analysis of Cumulative Reading

Cumulative-reading sentences involve modified numerals, which bring maximality (see e.g., Szabolcsi 1997, Krifka 1999, de Swart 1999, Umbach 2006).

The contrast in (2) shows that compared to bare numerals (here two dogs), modified numerals (here at least two dogs) convey maximality, as evidenced by the infelicity of the continuation perhaps she fed more in (2b). Thus, the semantics of two in (2a) is existential, but the semantics of at least two in (2b) is maximal, indicating the cardinality of the totality of dogs fed by Mary.

figure b

According to Brasoveanu (2013), the semantics of the cumulative reading of (1) involves the simultaneous application of two maximality operators.

Fig. 1.
figure 1

The genuine cumulative reading of (1) is true in this context.

Fig. 2.
figure 2

The genuine cumulative reading of (1) is false in this context.

figure c

As sketched out in (3), Brasoveanu (2013) casts his analysis in dynamic semantics. The semantic contribution of modified numerals is two-fold. They first introduce plural discourse referents (drefs), x and y (assigned to u and \(\nu \) respectively). Then after restrictions like \(\textsc {boy}(x)\), \(\textsc {movie}(y)\), and \(\textsc {see}(x, y)\) are applied onto these drefs, modified numerals contribute maximality tests and cardinality tests. As shown in (3), two maximality operators \(\sigma \) are applied simultaneously to x and y at the global/sentential level, picking out the mereologically maximal x and y that satisfy all the relevant restrictions. Finally, these mereologically maximal x and y are checked for their cardinality, so that eventually the sentence addresses the cardinality of all the boys who saw any movies (which is 3) and the cardinality of all the movies seen by any boys (which is 5).

Crucially, the genuine cumulative reading characterized in (3) is distinct from the non-attested pseudo-cumulative reading shown in (4), where exactly three boys takes a wider scope than exactly five movies:

figure d

The analysis shown in (4) can be ruled out by the contrast between our intuitive judgments: sentence (1) is judged true under the context shown in Fig. 1 but false under the context shown in Fig. 2. However, the truth condition characterized in (4) is actually true under the context shown in Fig. 2, where boy-sums \(b_2 \oplus b_3 \oplus b_4\) and \(b_1 \oplus b_2 \oplus b_4\) each saw a total of 5 movies, and there are no larger boy-sums such that they saw in total 5 movies between them. Therefore, as concluded by Brasoveanu (2013), only (3), but not (4), captures our intuitive interpretation of sentence (1). In other words, our intuitive interpretation for the cumulative reading of (1) (see (3)) involves no scope-taking between the two modified numerals (see also Krifka 1999, Charlow 2017 for more discussion).

Although Brasoveanu (2013) and relevant discussions in existing literature have shown that the reading of (4) is empirically non-attested and needs to be ruled out, they do not explain why (4) is not attested. In this sense, the simultaneity of applying two maximality operators seems a stipulation.

3 A Challenging Case Discussed by Krifka (1999)

Krifka (1999) uses sentence (5) to address his observations on cumulative reading.

figure e

First, the intuitively most natural interpretation of (5) also indicates that there is no scope-taking between the two modified numerals here:

‘The problem cases discussed here clearly require a representation in which NPs are not scoped with respect to each other. Rather, they ask for an interpretation strategy in which all the NPs in a sentence are somehow interpreted in parallel, which is not compatible with our usual conception of the syntax/semantics interface which enforces a linear structure in which one NP takes scope over another.’ (Krifka 1999)

Then Krifka 1999 further points out that the simultaneous mereology-based maximization strategy that works for data like (1) does not work for (5):

‘Under the simplifying (and wrong) assumption that foreigners do not own land in Guatemala, and all the land of Guatemala is owned by someone, this strategy would lead us to select the alternative In Guatemala, 100 percent of the population own 100% of the land, which clearly is not the most informative one among the alternatives – as a matter of fact, it is pretty uninformative. We cannot blame this on the fact that the NPs in (27) (i.e., (5) in the current paper) refer to percentages, as we could equally well express a similar statistical generalization with the following sentence (assume that Guatemala has 10 million inhabitants and has an area of 100,000 square kilometers):

figure f

Again, the alternative In Guatemala, 10 million inhabitants own 100,000 square kilometers of land would be uninformative, under the background assumptions given.

What is peculiar with sentences like (27) is that they want to give information about the bias of a statistical distribution. One conventionalized way of expressing particularly biased distributions is to select a small set among one dimension that is related to a large set of the other dimension. Obviously, to characterize the distribution correctly, one should try to decrease the first set, and increase the second. In terms of informativity of propositions, if (27) is true, then there will be alternative true sentences of the form In Guatemala, n percent of the population own m percent of the land, where n is greater than three, and m is smaller than seventy. But these alternatives will not entail (27), and they will give a less accurate picture of the skewing of the land distribution.’ (krifka 1999)

In short, Krifka (1999)’s discussion on (5) suggests that in accounting for cumulative-reading sentences, (i) a direct application of simultaneous mereology-based maximization strategy does not always work, and (ii) what kind of concern interlocutors aim to address via the use of a cumulative-reading sentence matters for sentence interpretation, and in particular, the interpretation of the interplay between modified numerals.

4 Proposal: QUD-Based Maximal Informativeness

As suggested by Krifka (1999), QUD should matter in our intuitive interpretation of cumulative-reading sentences. Based on this idea, I start with an informal discussion on the underlying QUD in interpreting cumulative-reading sentences (Sect. 4.1). Then I propose a QUD-based view on maximality of informativeness (Sect. 4.2) and develop a compositional analysis for cumulative-reading sentences like (1) and (5) within a dynamic semantics framework (Sect. 4.3), à la Bumford (2017) and in the same spirit as Brasoveanu (2013).

4.1 Cumulative-Reading Sentences and Their Underlying QUD

Here I first show that numerals or measure phrases provide quantity/measurement information, but quantity/measurement information alone does not determine how we interpret an uttered sentence and reason about its informativeness. The same sentences (e.g., (6) and (7)) can lead to different patterns of meaning inference, depending on a potentially implicit degree QUD.

In (6), the measurement information provided by 7 o’clock directly addresses what time it is (see (6a)). However, it is not (6a), but rather an underlying degree question, that determines whether (6) is interpreted as it’s as late as 7 o’clock (\(\approx \) already 7 o’clock) or it’s as early as 7 o’clock (\(\approx \) only 7 o’clock).

If the underlying QUD is how late it is (see (6b)), then (6) is interpreted as it’s as late as 7 o’clock, conveying a stronger meaning than it’s 6/5/...o’clock by indicating a higher degree of lateness. Thus, to resolve how late it is, we consider a temporal scale from earlier to later time points, and higher informativeness correlates with later time points, i.e., the increase of numbers.

On the other hand, if the underlying QUD is how early it is (see (6c)), then (6) is interpreted as it’s as early as 7 o’clock, conveying a stronger meaning than it’s 8/9/...o’clock by indicating a higher degree of earliness. Thus, to resolve how early it is, we consider rather a temporal scale from later to earlier time points, and higher informativeness correlates with earlier time points, i.e., the decrease of numbers.

figure g

Similarly, along a scale of length, we intuitively feel that John is 5 feet tall is stronger than John is 4 feet tall. This intuition is actually based on the degree QUD – How tall is John. However, it is not guaranteed that measurement sentences containing a higher number are always more informative. Depending on whether the underlying degree QUD is (7b) or (7c), (7) can be interpreted as John is as tall as 5 feet and more informative than an alternative sentence with a smaller number, or (7) can be interpreted as John is as short as 5 feet and more informative than an alternative sentence with a larger number.Footnote 1

figure i

Therefore, as illustrated by (6) and (7), in the interpretation of sentences containing numerals, it is not always the case that the use of larger numbers leads to higher level of informativeness. Rather, the inference on informativeness hinges on (i) an underlying degree QUD (along with the direction of the scale associated with the degree QUD) and (ii) how numerals are used to resolve the degree QUD. Sometimes the use of smaller measurement numbers leads to higher informativeness in resolving degree QUDs (e.g., (6c) and (7c)).

Fig. 3.
figure 3

QUD: How much is the overall film consumption among boys? The cardinalities of some boy-sums and movie-sums in the context of Fig. 1 are plotted as dots. The extreme case that addresses the degree QUD in the most informative way is represented by the right-uppermost dot, i.e., the one corresponding to the boy-sum \(b_2 \oplus b_3 \oplus b_4\) and the 5 movies they saw between them.

Fig. 4.
figure 4

QUD: How skewed is wealth distribution? The plotting of the percentages of the population and their owned land should form a parallelogram-like area. The extreme case that addresses the degree QUD in the most informative way is represented by the left-uppermost corner, which means that 3% of the population own 70% of the land.

The above observation can be extended to cumulative-reading sentences that contain multiple numerals: the interpretation of a sentence and our inference on its informativeness depend on its underlying degree QUD.

In particular, when multiple numerals are used together to address a degree QUD, their interplay brings new patterns for connecting numbers and meaning inference on informativeness. Higher informativeness does not correlate with the increase or decrease of a single number, but an interplay among numbers.

According to our intuition, the cumulative-reading of sentence (1) addresses and can be a felicitous answer to QUDs like (8a) or (8b) , but it does not address QUDs like (8c) or (8d).Footnote 2 Therefore, as illustrated in Fig. 3, higher informativeness correlates with the increase along both the dimensions of boy-cardinality and movie-cardinality, and the right-uppermost dot on this two-dimensional coordinate plane represents maximal informativeness. I.e., maximal informativeness amounts to simultaneous mereology-based maximality.

figure j

On the other hand, as pointed out by Krifka (1999), the cumulative-reading of sentence (5) addresses and can be a felicitous answer to degree QUDs like (9a), but it does not address QUDs like (9b) (cf. (8a) as a felicitous QUD for 8). Therefore, as illustrated in Fig. 4, the plotting of the percentage of the population and their entire owned land forms a parallelogram-like area, and higher informativeness correlates with a higher ratio between the quantity of owned land and its owner population. In other words, higher informativeness correlates simultaneously with the decrease along the dimension of population and the increase along the dimension of land quantity. It is the left-uppermost corner of this parallelogram-like area that represents maximal informativeness. In this case, obviously, maximal informativeness is not mereology-based.

figure k

In brief, although the interpretation of both types of cumulative-reading sentences is based on maximal informativeness, they are different with regard to how maximal informativeness is computed from numbers, and crucially, this computation is driven by an underlying degree QUD.

Further evidence comes from the monotonicity of numerals used in these cumulative-reading sentences. In the case represented by (1)/(8), the two numerals cannot have opposite monotonicity, while in the case represented by (5)/(9), the presence of two numerals with opposite monotonicity is perfectly natural (e.g., in (5)/(9), the use of a downward-entailing expression, at most 3%, along with the use of an upward-entailing expression, at least 70%). Evidently, in the former case, the two numerals contribute to the informativeness of a sentence in a parallel way, while in the latter case, the two numerals contribute to the informativeness of a sentence in opposite ways.

It is worth mentioning that multi-head comparatives (see von Stechow 1984, Hendriks and De Hoop 2001) also provide empirical support for (i) a degree-QUD-based interpretation and informativeness inference as well as (ii) the connection between QUDs and the pattern of monotonicity.

As illustrated in (10)–(12), the underlying QUD determines how the changes of quantity/measurement contribute to sentence interpretation.

In contrast, (13) sounds degraded because with the use of fewer dogs and more rats, the sentence fails to suggest a QUD that it can felicitously address: (i) the evaluation in terms of the quantity and quality of preys and (ii) the quantity of dogs as successful predators are at odd with each other in conveying coherent meaning.

figure l
figure m
figure n
figure o

4.2 A QUD-Based Maximality Operator

Based on the informal discussion in Sect. 4.1, I propose a QUD-based maximality operator and implement it within a dynamic semantics framework:

figure p

As shown in (14), I assume meaning derivation to be a series of updates from an information state to another, and an information state m (of type \(g \rightarrow \left\{ g \right\} \)) is represented as a function from an input assignment function to an output set of assignment functions (see also Bumford 2017).

The QUD-based maximality operator \({\textbf {M}}_{u_1, u_2, ...}\) works like a filter on information states. With the application of \({\textbf {M}}_{u_1, u_2, ...}\), the discourse referents (drefs, which are assigned to \(u_1, u_2, \ldots \)) that lead to the maximal informativeness in resolving a QUD will be selected out.

More specifically, the definition of \({\textbf {M}}_{u_1, u_2, ...}\) includes an operator \(G_\textsc {qud}\), which, when applied on drefs, returns a value indicating informativeness. This informativeness amounts to a measurement in addressing a contextually salient degree QUD: e.g., in the case of (8) (see Fig. 3), how much the overall film consumption is among boys; in the case of (9) (see Fig. 4), how skewed wealth distribution is in Guatemala. In this sense, \(G_\textsc {qud}\) can be considered a measure function.

4.3 Analyzing Cumulative-Reading Sentences

The step-by-step semantic derivation for the core example (1) is shown in (15).

(15a) first shows the introduction of plural drefs and relevant restrictions.

Given that this sentence is interpreted with a contextually salient QUD like how high film consumption is among boys (see (8) and Fig. 3), higher informativeness amounts to higher degree of consumption level (e.g., with \(d_1 > d_2\), the consumption level is \(d_1\)-high is more informative than the consumption level is \(d_2\)-high). Thus the measurement of informativeness amounts to the measurement of cardinalities of both plural drefs (see 15b).

Maximal informativeness is achieved when the mereologically maximal drefs (i.e., \(b_2 \oplus b_3 \oplus b_4\) and \(m_2 \oplus m_3 \oplus m_4 \oplus m_5 \oplus m_6\) in Fig. 1) are assigned (see (15c)).

figure q

The step-by-step semantic derivation of the core example (5) is shown in (16). The crucial difference between the analysis in (15) vs. (16) consists in their QUD, i.e., \(G_{\textsc {qud}}\), as reflected in (15b) vs. (16b).

Given that (5) is interpreted with a QUD like how skewed wealth distribution is in Guatemala (see (9) and Fig. 4), higher informativeness amounts to higher degree of skewedness. Thus the measurement of informativeness amounts to the ratio between the quantity of drefs (see (16b)).

Maximal informativeness is achieved when the quantity of a dref y satisfying \(\textsc {land}(y) \wedge \textsc {own}(x, y)\) divided by the quantity of a dref x satisfying \(\textsc {human}(x) \wedge \textsc {own}(x, y)\) yields the maximal ratio/quotient (see (16c)).

Finally, modified numerals at most 3% and at least 70% impose cardinality tests on the drefs selected out from the step in (16c) (see (16d)).

figure r

Here I would also like to address an issue raised by an anonymous reviewer: is it valid to use QUD in the derivation of (truth-conditional) meaning of a sentence? Works like Grindrod and Borg (2019) point out that the framework of QUD is pragmatic, mainly accounting for phenomena like the use of prosodic focus in question-answer congruence, and further extension to account for truth-conditional meaning is illegitimate.

As sketched in the following examples (17)–(19), modified numerals actually parallel exactly with items that bear prosodic focus: their interpretation all involve (i) a certain QUD, (ii) the application of a maximality operator that results in maximal informativeness (in addressing the QUD), and (iii) a further post-suppositional test (on cardinality or identity/part-whole relationship). Thus the current proposed treatment of sentences containing modified numerals is actually compatible with, not against, the view of Grindrod and Borg (2019) (see also Krifka 1999; Zhang 2023 for relevant discussions).

figure s
figure t
figure u

5 Discussion: Comparison with von Fintel et al. (2014)

Under the current analysis, it is a contextually salient degree QUD (i.e., what interlocutors care about, their ultimate motivation behind their utterance, see Roberts 2012) that determines how informativeness is actually measured (see the implementation of \(G_\textsc {qud}\) in (15b) vs. 16b). This degree-QUD-based informativeness measurement, \(G_{\textsc {qud}}\), further determines how the maximality operator \({\textbf {M}}_{u_1, u_2, ...}\) filters on drefs and how (modified) numerals affect meaning inference.

The notion of degree-QUD-based maximality of informativeness proposed here is in the same spirit as but more generalized than the entailment-based one proposed by von Fintel et al. (2014) (which primarily aims to account for the interpretation of the; see also Schlenker 2012). According to von Fintel et al. (2014), informativeness ordering is based on entailment relation (see (20)).

figure v

Thus as shown in (21), depending on the monotonicity of properties, maximal informativeness corresponds to maximum or minimum values.

figure w

Compared to von Fintel et al. (2014), the notion of QUD-based maximal informativeness developed in the current paper is more generalized in two aspects.

First, the current QUD-based view on maximality of informativeness can be easily extended from dealing with a single value to a combination of values.

As shown in Sect. 4, in cumulative-reading sentences where multiple numerals are involved, maximal informativeness does not directly correspond to whether the uttered numbers are considered maximum or minimum values. In example (5), as observed by Krifka (1999), each of the numerals (i.e., at most 3% and at least 70%) alone cannot be maximum or minimum values. It is how the combination of these uttered numbers contributes to resolve an implicit, underlying QUD that leads to the achievement of maximal informativeness.

Second, and more importantly, the current degree-QUD-based view on maximality of informativeness can overcome the issue that sometimes we intuitively feel that one sentence has a stronger meaning (or is more informative) than another, but the former does not directly entail the latter.

In (21a), Miranda is 1.65 m tall means that the height of Miranda reaches the measurement of 1.65 m, i.e., \(\lambda w. \textsc {height}(\textsc {Miranda})(w) \ge 1.65 \text { m}\). Thus it does entail Miranda is 1.60 m tall\(\lambda w. \textsc {height}(\textsc {Miranda})(w) \ge 1.60 \text { m}\).

The two sentences mentioned in footnote 1 (repeated here as (22)) should be interpreted in a way parallel to the two sentences in (21a). Actually we do have a natural intuition that (22a) has a stronger meaning than (22b). However, it is evident that (22a) does not directly entail (22b).

figure x

Under the current degree-QUD-based view on maximality of informativeness, I tease apart (i) the height measurement (typically with units like feet, meters, etc.) and (ii) the degree of tallness. Presumably, items of different comparison class can share the same scale for height measurement (e.g., the height of humans, giraffes, and mountains can be measured along the same scale and with the same units). However, the degrees of tallness and the comparison between them hinge on the notion of comparison class (e.g., toddlers are usually compared with other toddlers in terms of tallness). Thus it is evident that although ‘\(\lambda w. \textsc {height}(\textsc {John})(w) \subseteq [6', +\infty )\)’ does not entail ‘\(\lambda w. \textsc {height}(\textsc {John})(w) \subseteq [4', 5']\)’, under a degree QUD like to what extent is John tall, the measurement ‘\([6', +\infty )\)’ represents a higher degree in addressing this degree QUD and is thus more informative than ‘\( [4', 5']\)’. Therefore, our intuition that (22a) has a stronger meaning than (22b) can be accounted for.

For the core example (1), it is also worth noting that under the scenario of Fig. 1, although exactly 3 boys saw exactly 5 movies holds true, exactly 1 boy saw exactly 4 movies does not hold true (in Fig. 1, no boys saw more than 3 movies). Thus, this example also shows that it is problematic to build informativeness ordering directly upon the entailment relation between uttered sentences and their alternatives (derived by replacing uttered numbers with other numbers). However, exactly 3 boys saw exactly 5 movies does indicate a higher film consumption level (or a more prosperous situation) than the consumption level indicated by exactly 1 boy saw exactly 4 movies. Thus, the uttered sentence indicates a higher informativeness in addressing an underlying QUD than its alternatives. In this sense, with the use of a degree QUD, the current proposal provides a more generalized view on informativeness than an entailment-based one.

6 Extension: QUD-Based Informativeness and even

Beyond cumulative-reading sentences (and measurement sentences like (22)), here I use the case of even to show a broader empirical coverage of the proposed QUD-based view on maximality of informativeness (see also Zhang 2022).Footnote 3

According to the traditional view on even, its use brings two presuppositions (and presuppositions are considered a kind of entailment): (i) entity-based additivity (see (23a)) and (ii) likelihood-based scalarity (see (23b)).

figure y

However, it seems that the notion of entailment is too strong to characterize the meaning inferences with regard to the use of even. As illustrated by an example from Szabolcsi (2017), under the given scenario in (24), the use of an even-sentence in (25) is perfectly natural, but it challenges the traditional entailment-based view on our natural inferences for even-sentences. First, as shown in (25a), the presuppositional requirement of additivity is not met, because Eeyore was the only one who took a bite of thistles and spit them out. Second, if no one other than Eeyore took a bite of thistles, it seems also questionable to claim that the likelihood of the truth of the prejacent is lower than that of X spit thistles out (\(X \in \text {Alt}(\text {Eeyore})\)), as shown in (25b).

figure z
figure aa

Zhang (2022) proposes a degree-QUD-based analysis for the use of even (see Greenberg 2018 for a similar view). The use of even is always based on a contextually salient degree QUD (for (25), how prickly are those thistles). The prejacent of even (here Eeyore spit those thistles out) provides information to resolve this degree QUD with an increasingly positive answer, and compared with alternatives, this prejacent is also considered maximally informative in resolving this degree QUD. I.e., here compared to X spit those thistles out (\(X \in \text {Alt}(\text {Eeyore})\)), Eeyore spit those thistles out is maximally informative in resolving the degree question how prickly are those thistles.

Therefore, as illustrated in (26), the presupposition of even contains two parts. First, in all the accessible worlds where the prejacent is true, the range of the prickliness measurement of thistles, \(I_p\), exceeds the threshold \(d_\text {stdd}\) (i.e., the degree QUD is resolved by the prejacent with a positive answer). Second, compared to \(I_q\) (i.e., the range of the prickleness measurement of thistles informed by an alternative statement X spit thistles out), \(I_p\) is maximally informative.Footnote 4

figure ab

It is interesting to see that our interpretation for both cumulative-reading sentences and focus-related sentences can be based on the same degree-QUD-based mechanism of informativeness and demonstrate the maximality of informativeness.

7 Conclusion

Starting from the discussion on our intuitive interpretation for two kinds of cumulative-reading sentences, this paper proposes a degree-QUD-based view on the maximality of informativeness. The informativeness of a sentence basically stands for how the sentence resolves a contextually salient degree QUD.

For cumulative-reading sentences like Exactly three boys saw exactly five movies, its informativeness means the degree information in addressing how high the film consumption level is among boys, and the uttered numbers reflects mereological maximality. Then for cumulative-reading sentences like In Guatemala, at most 3% of the population own at least 70% of the land, its informativeness means rather the degree information in addressing how skewed wealth distribution is in Guatemala, and the uttered numbers reflects the maximality of the ratio between land and their owner population.

It seems that the current QUD-based view on the maximality of informativeness can overcome some issues that challenge the existing entailment-based view on informativeness and provide a broader empirical coverage. A further development of the current proposal to account for other related phenomena, especially with regard to the interpretation of numerals and focus items (e.g., even), as well as a more detailed discussion on its theoretical implications are left for future research.