Numerals do not need numerosities: robust evidence for distinct numerical representations for symbolic and non-symbolic numbers

Marinova, Mila; Sasanguie, Delphine; Reynvoet, Bert

doi:10.1007/s00426-019-01286-z

Numerals do not need numerosities: robust evidence for distinct numerical representations for symbolic and non-symbolic numbers

Original Article
Published: 18 January 2020

Volume 85, pages 764–776, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Psychological Research Aims and scope Submit manuscript

Numerals do not need numerosities: robust evidence for distinct numerical representations for symbolic and non-symbolic numbers

Download PDF

1092 Accesses
18 Citations
Explore all metrics

Abstract

In numerical cognition research, it has traditionally been argued that the processing of symbolic numerals (e.g., digits) is identical to the processing of the non-symbolic numerosities (e.g., dot arrays), because both number formats are represented in one common magnitude system—the Approximate Number System (ANS). In this study, we abandon this deeply rooted assumption and investigate whether the processing of numerals and numerosities can be dissociated, using an audio-visual paradigm in combination with various experimental manipulations. In Experiment 1, participants performed four comparison tasks with large symbolic and non-symbolic numbers: (1) number word–digit (2) tones–dots, (3) number word–dots, (4) tones–digit. In Experiment 2, we manipulated the number range (small vs. large) and the presentation modality (visual–auditory vs. auditory–visual). Results demonstrated ratio effects (i.e., the signature of ANS being addressed) in all tasks containing numerosities, but not in the task containing numerals only. Additionally, a cognitive cost was observed when participants had to integrate symbolic and non-symbolic numbers. Therefore, these results provide robust (i.e., independent of presentation modality or number range) evidence for distinct processing of numerals and numerosities, and argue for the existence of two independent number processing systems.

Evidence for distinct magnitude systems for symbolic and non-symbolic number

Article 26 December 2015

Toward a unified account of nonsymbolic and symbolic representations of number: Insights from a combined psychophysical-computational approach

Article 16 December 2021

Processing Symbolic Numbers: The Example of Distance and Size Effects

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

During the past decade of research in the domain of numerical cognition, the relation between symbolic numbers (e.g., Arabic numerals, number words) and non-symbolic numerosities (e.g., dot arrays) has been the subject of debate. According to the most popular view, “[…] numerical symbols and nonnumerical numerosities converge onto shared neural representations” (Piazza, Pinel, Le Bihan, & Dehaene, 2007, p. 302), and are thus represented by one and the same evolutionary-determined brain system (Nieder, 2016), referred to as the ‘Approximate Number System’, or shortly ANS (e.g., Dehaene, 2007; Nieder & Dehaene, 2009). Consequently, it has been extensively argued that the processing (and the acquisition) of numerical symbols requires these symbols to be mapped on their corresponding pre-existing non-symbolic numerosities (for explicit statements, see Cantlon, et al., 2009, p. 2218; Dehaene & Cohen, 1995, p. 85–86; Feigenson, Dehaene, & Spelke, 2004, p. 309; see also Kutter, Bostroem, Elger, Mormann, & Nieder, 2018, p. 7; Nieder, 2016, p. 379; Piazza, 2010, p. 4; Piazza et al., 2007, p. 302). One of the main arguments in favor of the mapping of numerical symbols onto the ANS is the presence of a ratio and or/distance effects in both symbolic and non-symbolic numerical tasks. The distance effect and the ratio effect are two strongly related observations, showing that, respectively, the absolute distance (|n₁–n₂|) or the relative distance (n₁/n₂) between two numbers have an impact on the behavioral performance. Typically, the distance effect is reported in studies using symbolic numbers (Defever, Sasanguie, Gebuis, & Reynvoet, 2011; Sasanguie, De Smedt, Defever, & Reynvoet, 2012), while the ratio effect is the most common metric in studies involving numerosities only (Barth, Kanwisher, & Spelke, 2003; Halberda & Feigenson, 2008), and in studies where the symbolic and non-symbolic numbers are combined, (e.g., Marinova, Sasanguie, & Reynvoet, 2018; Sasanguie, De Smedt, & Reynvoet, 2017; Van Hoogmoed & Kroesbergen, 2018). More specifically, when participants have to compare two numbers with a small absolute difference between them, their performance is worse than when comparing numbers with a large absolute difference (e.g., comparing 4 and 6 is harder than comparing 4 and 8, hence a distance effect) irrespective of whether both numbers were presented in the same notation (i.e., two digits) or not (i.e., word number and a digit, Dehaene & Akhavein, 1995). Similarly, participants’ performance is also worse when the relative distance (i.e., the ratio) of the numbers to be compared is closer to 1 (e.g., comparing 6 and 8 (ratio = 1.33) is harder than comparing 2 and 4 (ratio = 2)). Both the distance and the ratio effects are typically explained by the ‘mental number line’ (Dehaene, 2001). Thus, it is argued that the smaller the absolute distance and/or the relative distance between two numbers on this number line is, the greater the representational overlap of their magnitude distributions, making it harder to discriminate between them (Gallistel & Gelman, 1992; Moyer & Landauer, 1967). It is proposed that the relative metric (i.e., ratio) accounts for the compressive nature of the mental number line.

Alternatively, other researchers proposed the existence of two independent numerical systems: one for exact symbolic numbers and another for approximate quantities (e.g., Krajcsi, Lengyel, & Kojouharova, 2016; Núñez, 2017; Reynvoet & Sasanguie, 2016). They did so in part on the basis of two behavioral observations. First, several studies observed a different impact of the ratio on the performance in different conditions, in which the numerical notation was manipulated (i.e., symbolic, non-symbolic, or mixed). More specifically, Sasanguie, De Smedt, and Reynvoet (2017), and Marinova, Sasanguie, and Reynvoet (2018) used an audio-visual paradigm in which participants were instructed to numerically match (i.e., to decide whether two numbers are numerically the same or not, Sasanguie et al., 2017) or to compare (Ex.3 in Marinova et al., 2018) pairs of symbolic numbers (i.e., visually presented digits or auditory presented number words) and/or non-symbolic quantities (i.e., visually presented dot arrays or auditory presented sequences of beeps). In both studies, no ratio effect was present when participants had to match/compare purely symbolic pairs (i.e., a digit and a number word). By contrast, the ratio effect was observed when at least one of the numbers to be matched/compared was a non-symbolic numerosity. The authors interpreted these findings as evidence for two distinct systems underlying the performances in these tasks: a symbolic system, in which symbolic numbers are processed exactly, and an approximate system processing the non-symbolic numerosities. A second behavioral observation supporting the distinct numerical systems view is the presence of cognitive switch cost when participants have to integrate symbolic and non-symbolic numbers in a specific task. For instance, Lyons, Ansari, and Beilock (2012) instructed participants to compare pairs of visually presented pure symbolic numbers (i.e., number words and digits), pure non-symbolic quantities (i.e., pairs of dot arrays) and mixed pairs (i.e., dots and digits). The authors reasoned that if only one common system is activated for all notations, similar performances could be expected in pure and mixed pairs. Alternatively, if different numerical representations would be used to perform the task depending on the presentation format, it is more plausible to assume slower reaction times in mixed pairs than in pure pairs, due to a cognitive cost needed to switch between the two different numerical representation systems. Confirming the latter hypothesis, slower RTs for mixed pairs compared to pure pairs were indeed observed. Later on, Marinova et al. (2018) demonstrated similar findings using an audio-visual paradigm. In this study, again slower responses were observed for mixed compared to pure number pairs. Altogether, these studies support the idea of two independent number processing systems.

The absence of the ratio effect in the symbolic tasks could, however, also be explained by specific stimulus set characteristics which were shared in all previously mentioned studies. For example, the above studies only have used small numbers and small sets of numerosities (Experiments 1 and 3 in Marinova et al., 2018; Sasanguie et al., 2017). Small numbers are frequently encountered in daily life (Dehaene & Mehler, 1992; Gielen, Brysbaert, & Dhondt, 1991) and, therefore, distinct symbolic representations may have been formed for these small numbers only (see Verguts, Fias, & Stevens, 2005). In Lyons et al. (2012) and in Ex. 2 of Marinova et al. (2018), in fact, also large numbers were used—more specifically, the tens (i.e., 10, 20, 30, 40). However, many studies have shown that two-digit numbers can be decomposed under specific task settings (e.g., Moeller, Huber, Nuerk, & Willmes, 2011; Nuerk & Willmes, 2005; Reynvoet, Notebaert, & Van den Bussche, 2011), something we believe is very likely to occur in a stimulus set containing only tens. In these studies, participants most probably based their decisions on the decades of the numbers only, which is equivalent to comparing 1, 2, 3, and 4 (i.e., small numbers). Therefore, the question remains whether a dissociation between symbolic and non-symbolic numbers, reflected by the different impact of the ratio effect on the behavior across pure symbolic, pure non-symbolic, and mixed tasks, can be replicated when using larger numbers.

We investigated exactly this by conducting two audio-visual experiments in adults. In Experiment 1, participants were presented with four audio-visual comparison tasks with large numbers (> 5), falling outside of the subitizing range (Kaufman, Lord, Reese, & Volkmann, 1949): (1) a number word–digit task, (2) a tones–dots task, (3) a number word–dots task and (4) a tones–digit task. Each task contained number/quantity pairs ranging between 10 and 40, with ratios of various difficulties (from 1.11 to 2.00). Using this paradigm, we first avoid the possibility that participants base their judgements on the physical properties of the stimuli (see also Barth et al., 2003; Marinova et al., 2018; Sasanguie et al., 2017). Second, we also avoid the possibility that numbers are decomposed (e.g., Nuerk & Willmes, 2005). A decomposition strategy is very efficient in unimodal presentations, because participants can base their comparison decisions on the decade only. However, with audio-visual presentation, a decomposition strategy would be inefficient because, in languages like Dutch where a unit–decade inversion exists, the place of units and decades differs between the two consecutive stimuli (e.g., auditory: “one-and-twenty”; visual: “21”). In Experiment 2, we wanted to replicate and extend the findings of Experiment 1 by additionally manipulating the number range and the order of the stimulus presentation (i.e., visual stimulus first or auditory stimulus first). This way, we could directly address potential differences between small and large numbers, and we could examine whether the order of the stimulus modality presentation matters. In line with the distinct system account, we expected that the ratio effect would differ across the four audio-visual tasks, depending on whether the tasks contained only symbolic, only non-symbolic, or mixed number pairs. Consequently, this should result in an interaction between the task and the ratio. More precisely, in all tasks containing non-symbolic numbers (i.e., tones–dots, number word–dots, and tones–digits), where we expected the ANS to be activated, a ratio effect should be observed. In contrast, in the purely symbolic condition (i.e., number word–digit), where numbers will be processed exactly without activating the ANS, a smaller ratio effect is to be expected, if present at all (Marinova et al., 2018; Sasanguie et al., 2017). Furthermore, we expect the participants to show slower responses for mixed number pairs—where they presumably have to switch between different systems, in contrast to pure number pairs—where such a switch is not required. Alternately, if all tasks are performed via ANS mapping mechanisms, we expect to observe only main effect of ratio, and no difference between the pure and mixed trails.

Experiment 1

Method

Participants

Participants were recruited via a university online subscription system.^{Footnote 1} Twenty-four university students and university employees participated in exchange of a non-monetary reward. The experimental protocol was approved by the university’s ethical committee (file number G-20160679). All participants gave written informed consent. All of them had normal or corrected to normal vision and hearing. Three participants were removed because they were too slow (> 3SD from the group mean, per audio-visual task) or because they did not perform above chance level in one of the tasks. Consequently, the final sample consisted of 21 adults aged between 18 and 28 (M_age = 21.05 years, SD = 3.37, 9 males). We performed power analysis to determine the sample size, using the G*Power software version 3.1 (Faul, Erdfelder, Lang, & Buchner, 2007). To obtain the effect size, \(\eta_{\text{p}}^{2}\) = 0.19 (the lowest size of ratio effect reported in Appendix in Marinova et al., 2018), with α = 0.05 and power set at 80%, the required sample was 16 participants. As a consequence, power is guaranteed with our current sample size of 21 participants (data available upon request to the corresponding author).

Procedure, tasks, and stimuli

All participants performed four audio-visual comparison tasks (1) a number word–digit task, (2) a number word–dots task, (3) a tones–digit task and (4) tones–dots task (see Fig. 1). The stimuli consisted of numbers between 10 and 40, presented in the auditory modality as spoken number words or tones (i.e., beep sequences) and in the visual modality as digits or dot arrays. There were six ratios: 1.11, 1.14, 1.20, 1.25, 1.50, and 2.00.^{Footnote 2} The complete stimulus list is shown in Table 1.

Table 1 The six ratios of the audio-visual tasks, with their corresponding number pairs

Full size table

The number words were digitally recorded (sampling rate 44.1 kHz, 16-bit quantization) by a female native Dutch speaker. The recordings were band-pass filtered (180–10,000 Hz), resampled at 22.05 kHz and matched for loudness. The beep sequences were generated with a custom Python 2.7 script in a way that each individual beep lasted a fixed 40 ms. Its pitch randomly varied (300–1200 Hz) and also the duration of silence between the beeps (i.e., the inter-tone interval) was randomly varied (the minimal silence duration allowed by the parameters of the program was 10 ms).^{Footnote 3} This way, we ensured that the presentation rate of the beeps in each sequence was fast enough to encourage participants to use approximations, instead of engaging in counting strategies as demonstrated in previous studies (see Barth et al., 2003; Philippi, van Erp, & Werkhoven, 2008; Tokita, Ashitani, & Ishiguchi, 2013; Tokita & Ischiguchi, 2012, 2016). The dot stimuli were generated with the MATLAB script of Gebuis and Reynvoet (2011), controlling for non-numerical cues (i.e., total surface, convex hull, density, dot size and circumference). The digits were written in font Arial, size 40. The number words and beeps were presented binaurally through headphones at about 65 dB SPL. Participants were tested simultaneously in small groups of six people, in a quiet room equipped with 15-in. LG LCD displays and individual active noise control headphone sets. E-prime 2.0 software (Psychology Software Tools, http://pstnet.com) was used for controlling the stimulus presentation and recording of the data.

Each trial began with a 600-ms white fixation cross in the center of a black screen. Then the auditory stimulus was presented for 2500 ms in the case of number words, or 3500 ms in the case of beep sequences, immediately followed by the visual stimulus presented for 2500 ms. Afterwards, a blank screen was presented. Participants were instructed to judge which quantity (the auditory or the visual) was larger by pressing the “a” or “p” buttons on an AZERTY keyboard. Participants could respond either during the presentation of the visual stimulus, or during the blank screen, presented immediately after the second stimulus. The next trial began after a 1500-ms intertrial interval. Prior to each audio-visual task, each subject received five practice trials, during which feedback was provided. The practice trials were followed by 72 randomly presented trials of the same type (without feedback). In half of the trials, the small number of the number pair was presented first, followed by the larger number (e.g., 19–21); in the other, half the larger number was presented first, followed by the smaller number (e.g., 21–19). Each audio-visual task was presented in a separate block. Consequently, the order of the tasks was fully counterbalanced across participants in a Latin square design.

Results

Ratio effect

First, the median reaction times on correct responses (18% errors, leaving 4943 trials) and the mean accuracies (6048 trials) were submitted to a repeated measures ANOVA with task (four levels) and ratio (six levels) as within-subject factors. Whenever the assumption of sphericity was violated, the Greenhouse–Geisser correction was applied. Mean accuracies and median reaction times are depicted in Table 2. To make our data as informative as possible, next to the classical statistics, we also report the Bayes factors (BF)—or log(BF) in case the BF values are too large to interpret (Jarosz & Wiley, 2014; Wagenmakers, Marsman, et al., 2018; Wagenmakers, Love, et al., 2018). To obtain both classical and Bayesian results, the JASP statistical package version 0.9.0.1 (https://jasp-stats.org/) was used. The default Cauchy prior was used for calculating the BFs.^{Footnote 4}

Table 2 Mean accuracies and median reaction times (with corresponding standard deviations), depicted per audio-visual task and ratio

Full size table

The ANOVA on the reaction times showed a main effect of task, F(3, 60) = 28.19, p < 0.001, \(\eta_{\text{p}}^{2}\) = 0.59, 90% CI [0.43, 0.66],^{Footnote 5} log(BF_Inclusion) = 36.33, a main effect of ratio, F(2.58, 51.54) = 17.80, pGG < 0.001, \(\eta_{\text{p}}^{2}\) = 0.47, 90% CI [0.28, 0.57], log(BF_Inclusion) = 16.04, and moderate evidence for a task × ratio interaction, F(6.74, 134.87) = 3.74, pGG = 0.001, \(\eta_{\text{p}}^{2}\) = 0.16, 90% CI [0.04, 0.21], BF_Inclusion = 3.97 (see Fig. 2). The presence of a ratio effect in each task was further investigated via separate post hoc one-way ANOVAs.

In the number word–digit task, there was no main effect of ratio, F(2.82, 56.30) = 1.78, pGG = 0.12, \(\eta_{\text{p}}^{2}\) = 0.08, 90% CI [0.00, 0.18], BF_Inclusion = 0.36. In the tones–dots task, there was strong evidence for the presence of a ratio effect, F(3.42,68.48) = 8.74, pGG < 0.001, \(\eta_{\text{p}}^{2}\) = 0.304, 90% CI [0.13, 0.41], BF_Inclusion = 23.68. In the number word–dots task there was extreme evidence for the presence of a ratio effect, F(2.66, 53.26) = 10.01, pGG < 0.001, \(\eta_{\text{p}}^{2}\) = 0.33, 90% CI [0.143, 0.451], BF_Inclusion = 152.51. Finally, in the tones–digits task, the evidence for the presence of ratio effect was moderate, F(4.06, 81.17) = 3.74, pGG = 0.007, \(\eta_{\text{p}}^{2}\) = 0.33, 90% CI [0.03, 0.24], BF_Inclusion = 8.75.

The ANOVA on the accuracies showed a main effect of task, F(3, 60) = 65.66, p < 0.001, \(\eta_{\text{p}}^{2}\) = 0.77, 90% CI [0.66, 0.81], BF_Inclusion = ∞,^{Footnote 6} a main effect of ratio, F(3.13, 62.54) = 64.98, pGG < 0.001, \(\eta_{\text{p}}^{2}\) = 0.77, 90% CI [0.66, 0.81], BF_Inclusion = ∞, and a task × ratio interaction, F(15, 300) = 9.63, p < 0.001, \(\eta_{\text{p}}^{2}\) = 0.33, 90% CI [0.22, 0.36], BF_Inclusion = 34.49 (see Fig. 2). In the number word–digit task, there was no ratio effect, F(5, 100) = 0.36, p = 0.88, \(\eta_{\text{p}}^{2}\) = 0.02, 90% CI [0.00, 0.02], BF_Inclusion = 0.04. In the tones–dots task, there was extreme evidence for the presence of ratio effect, F(5, 100) = 31.56, p < 0.001, \(\eta_{\text{p}}^{2}\) = 0.61, 90% CI [0.49, 0.67], BF_Inclusion = ∞. In the number word–dots task, there was again extreme evidence for a ratio effect, F(5, 100) = 29.41, p < 0.001, \(\eta_{\text{p}}^{2}\) = 0.60, 90% CI [0.47, 0.65], log(BF_Inclusion) = 36.04. In the tones–digit task, there was extreme evidence for a ratio effect, F(5, 100) = 19.95, p < 0.001, \(\eta_{\text{p}}^{2}\) = 0.50, 90% CI [0.36, 0.57], log(BF_Inclusion) = 26.65 (Fig. 3).

Switch cost

To examine whether a cost for switching between two magnitude systems is present in tasks where integrating two independent number representations was required (i.e., number word–dots and tones–digits), we analyzed these tasks together as mixed pair tasks, and compared them to the pure number pairs, where no integration was required (number word–digit and tones–dots). Presence of a switch cost should be indicated by significantly slower RTs for the mixed number pairs.

For the analysis, we only included the ratios where the accuracies in all audio-visual tasks were above 70%, i.e., the ratios 1.25, 1.5 and 2.00 (see Table 2). This was done because we wanted to compare the current switch cost results for large numbers with the results for small numbers from our previous study (Experiment 3 in Marinova et al., 2018), while keeping the ratios as similar as possible (i.e., in Ex. 3 Marinova et al., 2018 the easy ratios were 1.8 and 2.00, and hard ratios were 1.28 and 1.33). Moreover, in the other three more difficult ratio conditions, participants made a lot of mistakes which made a reaction time analysis of the switch cost unreliable.

The one-tailed paired t tests showed that in all ratios, responses were significantly slower for mixed number pairs than for pure number pairs, ratio 1.25, t(20) = 5.72, p < 0.001, d = 1.25, 95% CI [0.66, 1.18], BF₊₀ = 3348.41; ratio 1.5, t(20) = 3.62, p < 0.001, d = 0.79, 95% CI [0.29, 1.28], BF₊₀ = 45.54; and ratio 2.00, t(20) = 2.60, p = 0.009, d = 0.57, 95% CI [0.10, 1.02], BF₊₀ = 6.42 (see Fig. 4). The size of the switch cost between the ratios was not significantly different: 1.50 vs 1.25, t(20) = 1.57, p = 0.13, d = 0.34, 95% CI [− 0.10, 0.78], BF₁₀ = 0.65, 2.00 vs 1.25, t(20) = 1.76, p = 0.09, d = 0.39, 95% CI [− 0.06, 0.82], BF₁₀ = 0.85, and 2.00 vs. 1.50, t(20) = 0.459, p = 0.65, d = 0.10, 95% CI [− 0.33,0.53], BF₁₀ = 0.25.

Discussion

In line with previous studies (Marinova et al., 2018; Sasanguie et al., 2017), the current data showed ratio effect in all tasks containing a non-symbolic numerosity (i.e., tones–dots, number words–dots, tones–digits), but not in the tasks with symbolic numbers only (i.e., number word–digits), suggesting distinct representations for non-symbolic and symbolic numbers. Also with regard to the switch cost, the results confirmed our (and those of others) previous findings: responses to the mixed number pairs were significantly slower than those to the pure number pairs (Lyons et al., 2012; Marinova et al., 2018), indicating that, when symbolic and non-symbolic numbers have to be integrated for the task requirements, participants have to link two distinct representations.

Before drawing any strong conclusions, however, we additionally wanted to verify whether the absence of a ratio effect in the number word–digit task was not due to other factors. First, we wanted to verify that the presence of ratio effect was not masked by a floor effect in the RTs in most of the ratio conditions. Therefore, to increase the strength of the potentially masked ratio effect, we averaged the RT performance in the two easiest ratio conditions (1.50 and 2.00) and compared it with the averaged RT in the two hardest ratio conditions (1.11 and 1.14). If a ratio effect is present in the number word–digit task, the easy ratios should be significantly faster than the hard ratios. One-tailed paired t test showed no support for this claim, t(20) = − 1.63, p = 0.06, d = − 0.356, 95% CI [− 0.00, 0.02], BF₁₀ = 1.32.

Second, although we try to avoid the decomposition of double-digit numbers with the current cross-modal presentation technique, it remains possible that participants adopt such a strategy, especially given the long presentation of the number words. Second, although the ratio between numbers did not affect the performance in the pure symbolic condition, the absolute distance between the numbers could still be relevant. To exclude both alternatives, we conducted multiple linear regressions on the accuracy, and on the reaction time data, with: (a) absolute distance (the absolute distance between the auditory and the visual numbers), (b) ratio, (c) unit distance (the distance between the units of the auditory and the visual stimulus), and (d) decade distance (the distance between decades of the auditory and the visual stimulus). If participants would decompose the numbers, the data should be best predicted by the decade distance (and/or the unit distance). In addition, if participants based their decisions on the absolute distance between numbers and not on the ratio between them, absolute distance should also be a significant predictor. In the accuracy data, the effect of decade distance approached significance (p = 0.053), however, this finding was not supported by the Bayesian linear regression (BF₁₀ = 0.34). In the RT data, none of the predictors were significant (all ts < 1.5, all ps > 0.05 all BFs₁₀ < 1). Therefore, we did not find evidence for decomposition, neither with the classical nor with the Bayesian analyses.

The lack of ratio effect in the number word–digit condition could also be caused by the order in which the stimuli were presented (auditory first, then visual) in the current and in all of our previous studies (Sasanguie et al. 2017; Marinova et al., 2018; and the current Exp. 1). In an EEG study measuring the mismatch negativity (MMN) by Finke et al. (2018), no ratio-dependent MMN in the symbolic auditory–visual condition was observed, whereas a significant ratio-dependent MMN amplitude in the visual–auditory condition was present. Although it is not clear what caused this asymmetry in Finke et al. (2018), it is in any case not consistent with the current interpretation of the results of Experiment 1. More specifically, if the findings in Experiment 1 are due to two distinct representations for symbolic and non-symbolic numbers, the order of presentation (auditory–visual vs. visual–auditory) should not matter. Therefore, in Experiment 2, we manipulated the order of modality presentation. In addition, we included both small (i.e., single digit) and large (i.e., two-digit) numbers with exactly the same ratios, making it possible to test directly whether the same effects occur for frequent small numbers and less frequent large numbers. Finally, we averaged data across items (i.e., across number pairs; see also Brysbaert, 2007), to test whether our current and also previous findings can be generalized across item as well. Because this results in a lot of repetitions (see also Brysbaert & Stevens, 2018), participants could not complete the experiment in one single session. Participants completed three 1-h sessions, spread over three consecutive days, during different times of the day. Participants performed the same four audio-visual tasks as in Experiment 1, either in a “visual–auditory” or in “auditory–visual” presentation order. A ratio effect was expected for both small (i.e., frequent) numbers and large (less frequent) numbers in the tones–dots, tones–digit, and numbers word–dots tasks, but not in the number word–digit task. Furthermore, we expected similar findings for the two presentation order conditions.