Dissociation of Size and Distance Effect in Numerical Magnitude Comparison in Less Familiar Number Ranges

Alexis Garsmeur; Roxane Morand; André Knops

doi:10.5334/joc.486

Introduction

While our understanding of the neural and cognitive mechanisms underlying the processing of numerical information has advanced over the last few decades, one of the most fundamental questions remains open: is numerical magnitude represented in a format-independent manner? One can broadly differentiate non-symbolic formats, such as simultaneously presented sets of elements, from symbolic formats, such as Arabic digits or number words. How numerical symbols, which are initially meaningless, come to represent numerosity is a specific case of the more general “symbol-grounding problem” (Harnad, 1990). In order to manipulate symbols efficiently, children need to connect symbolic representations to their semantic meaning. In the field of numerical cognition, several proposals have been made concerning the nature of this connection process. They put forward different cognitive capacities as foundations for symbolic meaning, such as subitizing, which is the ability to perceive and keep track of the exact quantities of items in a small cardinality set (Carey & Barner, 2019), or cognitive control (Leibovich & Ansari, 2016).

According to one commonly accepted idea, the same semantic code underlies both symbolic and non-symbolic formats (Dehaene, 1992; Piazza et al., 2007). The fact that key behavioral signatures emerge in both formats further substantiates this account. In magnitude comparison tasks, where participants have to indicate the numerically larger of two stimuli, performance improves (faster reaction times, lower error rates) with increasing numerical distance between the to-be-compared stimuli. This is referred to as the distance effect. For a given distance, performance decreases with increasing numerical magnitude, which is referred to as the size effect. These effects have been reported in experiments using single-digit symbolic number comparison (Moyer & Landauer, 1967), as well as non-symbolic numerosity comparison (Dehaene, 1992). Further evidence from brain imaging research supports this account: the intraparietal sulcus (IPS) is engaged during both symbolic and non-symbolic number processing (Dehaene, Piazza, Pinel, & Cohen, 2003). From these results, it has been inferred that the meaning of symbolic representations (Indo-Arabic digits) derives from the activation of a common, analogue psychological mechanism called the approximate number system (ANS). The ANS is therefore supposed to underlie the processing of both symbolic and non-symbolic representations. It is often described by the metaphor of the mental number line, a spatial representation of numbers on a logarithmically organized unidimensional manifold where smaller numbers are located to the left of larger numbers. Distance and size effects are supposed to be by-products of the ratio effect, a key signature that governs a multitude of sensory dimensions and which can be described by Weber’s law.
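As a rough illustration of how a single ratio signature could give rise to both effects, the sketch below computes the ratio of the smaller to the larger number for a few hand-picked pairs. The helper name `ratio` and the example pairs are illustrative assumptions and are not stimuli taken from any of the cited studies.

```python
def ratio(a: int, b: int) -> float:
    """Smaller number divided by the larger one; values near 1 mean harder comparisons."""
    return min(a, b) / max(a, b)

# Distance manipulation: the smaller number is fixed, the distance grows.
distance_pairs = [(5, 6), (5, 7), (5, 9)]
# Size manipulation: the distance is fixed at 1, the magnitude grows.
size_pairs = [(2, 3), (5, 6), (8, 9)]

distance_ratios = [ratio(a, b) for a, b in distance_pairs]
size_ratios = [ratio(a, b) for a, b in size_pairs]

print("distance manipulation:", [round(r, 3) for r in distance_ratios])
print("size manipulation:    ", [round(r, 3) for r in size_ratios])
```

Under Weber’s law, ratios closer to 1 correspond to harder comparisons. The first list moves away from 1 as distance grows (a distance effect), and the second moves toward 1 as magnitude grows (a size effect).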

However, recent evidence challenges the view of a unitary system, suggesting instead that non-symbolic and symbolic quantities are processed by different cognitive mechanisms (Krajcsi, 2017; Krajcsi & Kojouharova, 2017). In this dual system view, the ANS is limited to the processing of non-symbolic quantities. Symbolic numbers, which refer to exact quantity information (as opposed to the approximate nature of the ANS), are processed by the Discrete Semantic System (DSS). The DSS is composed of number representations stored in the form of semantic networks, similar to the mental lexicon (Krajcsi et al., 2016). Numbers are represented by nodes. Nodes are connected via edges that can represent arithmetic relations like parity or whether two numbers are prime numbers or not. The DSS account provides a different explanation for the presence of distance and size effects. According to this account, the more frequently a given symbol is encountered, the shorter its processing time. Because the frequency of encountering a given number symbol and its numerosity is inversely proportional to their numerical magnitude (Dehaene & Mehler, 1992), larger numbers require more processing time than smaller numbers (size effect). In a semantic network, the strength of the connections between nodes is the product of their semantic relationship. When two concepts are strongly associated, they are closely linked in the semantic network. This strong connection can lead to competition during retrieval. When accessing any given concept, connected concepts will be co-activated, creating interference and slowing down processing. Therefore, the strength of the link between two number symbols corresponds to their numerical proximity and produces the distance effect.

In order to test this hypothesis, Krajcsi (2017) evaluated the correlation between the distance and size effects for both symbolic and non-symbolic number comparison tasks. According to the unitary ANS explanation, the distance and size effects should be correlated because they emerge from the ratio effect. That is to say, for a given participant, the stronger the distance effect, the stronger the size effect. However, for the DSS account, the distance effect (semantic proximity) and the size effect (frequency of exposure) do not emerge from the same mechanism and therefore there is no reason to expect any relationship between the two. Confirming this hypothesis, Krajcsi (2017) found a high and significant correlation between distance and size effects when comparing non-symbolic stimuli (dot arrays) but no significant correlation when comparing symbolic stimuli (Indo-Arabic integers).

However, Krajcsi used numbers from 1 to 9 as stimuli, and we speculate that this limited numerical range reduced the chances of a size effect emerging. In the ANS account, logarithmic compression is the key feature that leads to a size effect. It has also been argued that, over the course of an experiment, participants acquire automatic stimulus-response mappings that circumvent semantic elaboration when the stimulus set is of limited size (Kunde, Kiesel & Hoffman, 2003).

According to the DSS, the size effect emerges from a frequency gradient in the given number range. Since numbers 1 through 9 are the most frequently encountered numbers, their high familiarity may have diminished the variance in the reaction times, and hence the gradient may not be sufficiently pronounced to allow for the size effect to emerge. Some authors proposed that repeated exposure to symbols referring to quantities leads to an association between symbols which gradually decreases the automatic activation of the underlying non-symbolic quantities. This phenomenon is called symbolic estrangement (Lyons, Ansari & Beilock, 2012). In this view, the ANS is a unitary system that only subserves newly acquired stimuli. Consequently, sufficient practice shifts the processing of symbols to another system that could potentially correspond to the DSS described by Krajcsi (2017).

One way to test this idea is to evaluate the correlation between the distance and size effects for a wider range of numbers. Therefore, in the first experiment, we applied the paradigm used by Krajcsi (2017) to two-digit integers ranging from 11 to 99. We expected that this range would give us sufficient variation to observe size effects. Additionally, this allowed us to directly match symbolic and non-symbolic quantities without applying a multiplicative transformation to the symbolic quantities when creating the non-symbolic numerosities as in Krajcsi (2017).

For the DSS account, size effects are not expected because the range of exposure frequency in the number range 11 to 99 is smaller than in the range between one and nine (Pajot et al., 2025). Accordingly, as with one-digit integers, there should be no correlation between size and distance effects. On the other hand, the symbolic estrangement account predicts that, with sufficient variation, a correlation should emerge for less familiar stimuli. However, this correlation could be weaker than with non-symbolic quantities, depending on the degree of familiarity and therefore of “estrangement” with these symbols.

Experiment 1

To probe any potential correlation between the distance and size effects, Experiment 1 was designed as a magnitude comparison task in which symbolic and non-symbolic magnitudes between 11 and 99 were compared with a fixed reference (55). This increased the size range (calculated as the sum of the two to-be-compared numbers), which varies between 66 and 154 (excluding 110) in the current task, compared to Krajcsi (2017), where sizes ranged from 3 to 17. We reasoned that this design change would allow for more variability in RT and therefore increase the chance for a potential correlation to emerge.

Participants

We recruited 34 participants through advertisements on college campuses. Most participants were students in the Master “Economie et Psychologie” of Paris 1 Panthéon Sorbonne, all residing in France. At the beginning of the experiment, participants were asked to provide personal information, including their age, gender, and handedness. We excluded one participant from the analysis because his error rate exceeded the mean error rate of the group by more than 3 SD. The final sample included 14 males and 18 females (one participant did not provide this information). The mean age was 26.45 years (ranging from 20 to 59). The protocol was carried out in compliance with the ethical standards of the Declaration of Helsinki.

Stimuli and Materials

We used MATLAB to generate stimuli for both formats.

Non-symbolic quantities (sets of dots)

Numerosities ranged from 11 to 99 and each numerosity (except 55) was presented twice, counterbalancing the presentation side of each number. Thus, each of the 88 numerosities was shown twice, for a total of 176 trials.

To control for the non-numerical features that inevitably covary with numerosity, we generated eight stimulus sets. In a given set, either the total surface (TS) area or the convex hull (CH) was kept constant between the two presented dot arrays. The visual property that necessarily varied between the two presented arrays was either congruent or incongruent with the numerosity, yielding four stimulus sets. Since the larger stimulus was equally often presented on the left or right side of central fixation, we created these four sets for each side. For numbers below the reference, size increases as the distance to the reference decreases. For numbers above the reference, size increases as the distance to the reference increases. Therefore, distance and size are uncorrelated in the stimulus set.

Symbolic quantities (two-digit numbers)

Analogous to the non-symbolic condition, we presented the numbers 11 through 99 (except 55). Each of the 88 numbers was shown twice.

Design and Procedure

To mitigate the potential influence of the primacy effect, the order of the formats alternated between participants. The experiment was controlled by Psychtoolbox (Brainard, 1997), running on MATLAB R2023b. The following instructions were presented to the participants with symbolic stimuli: “Please decide whether the shown number is larger or smaller than 55! Press g when it is smaller. Press h when it is larger.” With non-symbolic stimuli, participants were instructed as follows: “Please indicate which side the more numerous set is displayed on! Press g for left and h for right!”. When participants took too much time to respond, a message appeared on the screen: “Please respond as fast and as accurately as possible.” Ten practice trials preceded each format.

Non-symbolic quantities

In the numerosity comparison, two sets of dots were presented to the left and right of the screen center, respectively. One of the two sets had a numerosity of 55 dots (reference) while the other changed numerosity across trials. For each numerosity, the reference was displayed on the left in 50% of the trials. Dot arrays were presented until participants responded or until the maximum response duration of 2000 ms had elapsed. Each response was followed by a blank screen for 500 ms before the next trial started. The 176 trials were presented in 4 blocks, each lasting approximately 2 minutes. In order to control for confounding, 25% of stimuli were incongruent in Total area (the largest number also had the largest Total area) and 25% of the stimuli were congruent in the Convex hull (the largest number also had the largest Convex hull). The 50% remaining trials were congruent in both visual properties.

Symbolic quantities

In the symbolic magnitude comparison task, participants were instructed to compare a number displayed in the center of the screen to the number 55. The displayed number remained visible until the participant pressed a key or until the maximum response duration of 2000 ms had elapsed. After the response, a blank screen was shown for 500 ms before the next trial started. The experiment consisted of 4 blocks, each lasting approximately 2 minutes.

Analysis

We used R (R Core Team, 2010) for all analyses.

First, we excluded trials with erroneous, missed or premature (reaction times (RTs) < 200 ms) responses. Next, we excluded trials that fell outside a participant- and experiment-wise range of mean RT ± 3 standard deviations (SDs). Finally, we excluded participants if their error rate (or mean reaction time) in at least one of the two tasks exceeded the mean group error rate (or mean reaction time) by more than 3 SDs. Applying this criterion we excluded one participant. All analyses are based on the data from the remaining 33 participants.
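The two trial-level exclusion steps can be sketched as follows. This is a minimal illustration for one participant in one task, not the authors' R code. The dictionary keys `rt` and `correct`, the function name `clean_trials`, and the choice to compute the mean and SD once on the trials that survive the first step are all assumptions.

```python
from statistics import mean, stdev

def clean_trials(trials, min_rt=200.0, sd_cutoff=3.0):
    """Apply the two trial-level exclusion steps to one participant's trials in one task.

    Each trial is a dict with the hypothetical keys 'rt' (ms, None if missed)
    and 'correct' (bool).
    """
    # Step 1: drop erroneous, missed, and premature (< 200 ms) responses.
    valid = [t for t in trials
             if t["rt"] is not None and t["correct"] and t["rt"] >= min_rt]
    if len(valid) < 2:
        return valid  # not enough trials to estimate an SD
    # Step 2: drop trials outside mean +/- 3 SD (computed once, on step-1 trials).
    m = mean(t["rt"] for t in valid)
    s = stdev(t["rt"] for t in valid)
    return [t for t in valid if abs(t["rt"] - m) <= sd_cutoff * s]

# Toy data: 30 ordinary trials plus one error, one miss,
# one premature response, and one very slow outlier.
trials = [{"rt": 500.0 + 10 * (i % 5), "correct": True} for i in range(30)]
trials += [
    {"rt": 510.0, "correct": False},
    {"rt": None, "correct": False},
    {"rt": 150.0, "correct": True},
    {"rt": 1900.0, "correct": True},
]
kept = clean_trials(trials)
print(len(trials), "trials in,", len(kept), "trials kept")
```

Participant-level exclusion (error rate or mean RT more than 3 SDs above the group mean) would be applied afterwards on per-participant summaries and is not shown here.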

To capture the distance effect, we calculated the absolute distance between the probe number and the reference, following Krajcsi (2017). The size effect was assessed as the sum of the reference and the probe number (55 + displayed number).
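The two predictors can be written out for the full stimulus set as a quick check of the design. The sketch below is illustrative only. The names `probes`, `distance`, `size`, and `pearson_r` are assumptions introduced here.

```python
from math import sqrt

REFERENCE = 55
# Probe numbers 11..99 without the reference itself (88 values).
probes = [n for n in range(11, 100) if n != REFERENCE]

distance = [abs(n - REFERENCE) for n in probes]  # absolute distance to 55
size = [n + REFERENCE for n in probes]           # sum of reference and probe

def pearson_r(x, y):
    """Plain Pearson correlation between two equally long sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

design_r = pearson_r(distance, size)
print("distance range:", min(distance), "to", max(distance))
print("size range:", min(size), "to", max(size))
print("design correlation:", design_r)
```

Because the probes are symmetric around the reference, the correlation between the two predictors across the complete stimulus set is zero. After trial exclusions, the predictors may no longer be exactly orthogonal for a given participant.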

To assess the presence of distance and size effects at the group level, we computed a 2 x 2 repeated measures ANOVA with size and distance as factors using the ezANOVA function from the ez package in R. We categorized distances 1 to 22 as small distances and distances 23 to 44 as large distances. Number pairs with sums between 66 (55 + 11) and 109 (55 + 54) were categorized as small sizes and pairs with sums between 111 (55 + 56) and 154 (55 + 99) as large sizes. The ANOVA results are reported in the supplementary materials. To exploit the continuous nature of the stimuli, we also tested distance and size effects at the group level using a linear mixed model with participants as random intercepts.
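The categorization into ANOVA cells can be made concrete with a short sketch. It only uses the cut-offs stated above. The function name `anova_cell` and the string labels are assumptions.

```python
from collections import Counter

REFERENCE = 55

def anova_cell(probe):
    """Return the (distance level, size level) of a probe, using the cut-offs in the text."""
    distance = abs(probe - REFERENCE)
    size = probe + REFERENCE
    distance_level = "small" if distance <= 22 else "large"
    size_level = "small" if size <= 109 else "large"
    return distance_level, size_level

probes = [n for n in range(11, 100) if n != REFERENCE]
cell_counts = Counter(anova_cell(p) for p in probes)

for cell, count in sorted(cell_counts.items()):
    print(cell, count)
```

With these cut-offs, the 88 probes are distributed evenly over the four cells of the design.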

In order to assess the presence of a ratio effect in both format conditions, we tested the ratio effect at the group level using another linear mixed model with random intercepts for participants. The ratio for a given pair was calculated as the smaller of the probe number and the reference divided by the larger of the two.

We also calculated the slopes of the distance effect for numbers below 55 and those above, for each participant and each format condition (symbolic and non-symbolic). Under the assumption of a logarithmic compression of the number representation, we would expect the slopes for numbers above 55 to be less steep than those for numbers below 55. In order to compare the magnitude and not the directionality of the slopes, we ran a dependent-samples t-test on the absolute values of the slopes of the linear regression of RTs on presented numbers, separately for each format. Moreover, we conducted an analysis on the mean response time across participants for symbolic/non-symbolic numbers below and above 55. We expected to see, in both formats, longer response times for numbers above 55 than for numbers below 55 (as the mean ratio is supposed to be closer to one for the set of numbers above 55).
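The claim that the mean ratio is closer to one above the reference can be checked directly from the stimulus set. The following sketch uses the ratio definition given earlier (smaller number divided by larger number). The variable names are assumptions.

```python
REFERENCE = 55

def ratio(probe, reference=REFERENCE):
    """Smaller of probe and reference divided by the larger of the two."""
    return min(probe, reference) / max(probe, reference)

below = [ratio(n) for n in range(11, REFERENCE)]       # probes 11..54
above = [ratio(n) for n in range(REFERENCE + 1, 100)]  # probes 56..99

mean_ratio_below = sum(below) / len(below)
mean_ratio_above = sum(above) / len(above)

print("mean ratio below 55:", round(mean_ratio_below, 3))
print("mean ratio above 55:", round(mean_ratio_above, 3))
```

The same absolute distances thus correspond to ratios nearer to 1 on the upper side of the reference, which is why a ratio-based account predicts slower responses and shallower distance slopes there.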

For each participant and each format, we assessed size and distance effects via unstandardized regression coefficients in a multiple stepwise regression with distance as the first and size as the second regressor. Size was introduced second because we used a fixed reference comparison task in which the distance effect counteracts the size effect for numbers higher than the reference.
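A simplified version of the per-participant regression is sketched below. It fits distance and size simultaneously by ordinary least squares rather than reproducing the stepwise procedure, which is an assumption. The function name `fit_distance_size` and the noise-free toy coefficients are likewise made up for illustration.

```python
def fit_distance_size(rt, distance, size):
    """Ordinary least squares for rt = b0 + b_dist * distance + b_size * size."""
    n = len(rt)
    my, m1, m2 = sum(rt) / n, sum(distance) / n, sum(size) / n
    # Centered sums of squares and cross-products.
    s11 = sum((d - m1) ** 2 for d in distance)
    s22 = sum((s - m2) ** 2 for s in size)
    s12 = sum((d - m1) * (s - m2) for d, s in zip(distance, size))
    s1y = sum((d - m1) * (y - my) for d, y in zip(distance, rt))
    s2y = sum((s - m2) * (y - my) for s, y in zip(size, rt))
    det = s11 * s22 - s12 ** 2
    if det == 0:
        raise ValueError("distance and size are perfectly collinear")
    b_dist = (s22 * s1y - s12 * s2y) / det
    b_size = (s11 * s2y - s12 * s1y) / det
    intercept = my - b_dist * m1 - b_size * m2
    return intercept, b_dist, b_size

REFERENCE = 55
probes = [n for n in range(11, 100) if n != REFERENCE]
distance = [abs(n - REFERENCE) for n in probes]
size = [n + REFERENCE for n in probes]
# Noise-free toy participant with made-up coefficients (ms per unit).
rt = [600.0 - 2.0 * d + 0.5 * s for d, s in zip(distance, size)]

intercept, b_dist, b_size = fit_distance_size(rt, distance, size)
print(round(intercept, 3), round(b_dist, 3), round(b_size, 3))
```

The returned `b_dist` and `b_size` are unstandardized coefficients in ms per unit of distance or size. When the two predictors are orthogonal, as they are over the complete stimulus set, the order of entry should not change these estimates.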

Unstandardized regression coefficients of the distance and size effects were calculated for all participants for both formats. Regressions between the distance and size effect coefficients were first calculated using (a) all participants, then recalculated (b) excluding outlying coefficients, identified by a two-dimensional outlier detection using an ellipse with a radius set to the 95% confidence interval and assuming a multivariate t-distribution (Fox and Weisberg 2011; Friendly and Monette 2013), and finally (c) using only participants showing both a significant size effect and a significant distance effect (Roth et al., 2024). The analyses after excluding participants according to (b) and (c) aimed at restricting subsequent correlation analyses to those participants who do manifest both effects at the individual level. This can be considered a conservative way of identifying those who rely on the ANS. Note, however, that it does not in and of itself predict a significant correlation between these effects.

The Pearson product-moment correlation between distance effect coefficients and size effect coefficients across participants was calculated for both formats. Since the regression analyses without outliers and with only participants showing both significant size and distance effects produced effectively the same results as the analysis including all participants (n = 33), they will not be reported here.

In order to test for the presence of a common mechanism across both formats, we developed a second measure that was not present in Krajcsi (2017). We computed the ratio effect directly for both format conditions and then correlated the two effects across the two tasks. If the same system, characterized by the ratio effect, underlies symbolic and non-symbolic performance, we should observe a high correlation between both ratio effects. To calculate the ratio effect, we regressed the RT of a given stimulus on its ratio using a simple linear model.
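A minimal sketch of the ratio effect for a single participant and format is given below, assuming that the effect is indexed by the slope of the simple regression of RT on ratio. The helper `ols_slope` and the toy RT values are assumptions.

```python
def ols_slope(x, y):
    """Slope of the simple linear regression of y on x."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / sxx

REFERENCE = 55
probes = [n for n in range(11, 100) if n != REFERENCE]
ratios = [min(n, REFERENCE) / max(n, REFERENCE) for n in probes]
# Made-up participant whose RT increases by 300 ms per unit of ratio.
rt = [450.0 + 300.0 * r for r in ratios]

ratio_effect = ols_slope(ratios, rt)
print("ratio effect (ms per unit ratio):", round(ratio_effect, 3))
```

One such slope would be obtained per participant and format, and the two sets of slopes would then be correlated across participants.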

Finally, we computed the split-half reliability and, using the Spearman-Brown correction, corrected our correlation coefficients for the lack of reliability of our variables, following the method described in Krajcsi (2017). Negative correlation scores were considered equivalent to a reliability score of zero. Conversely, corrected correlation coefficients greater than 1 (or smaller than –1) were considered equivalent to a perfect correlation of 1 (or –1). This allowed us to correct for the presence of noise in, for example, the visual perception of our stimuli, which could differ from one task to another (symbolic vs non-symbolic). Different levels of noise could decrease the strength of the correlation between our two effects for a given task, possibly creating artificial differences in correlation coefficients between tasks.
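The reliability correction can be sketched as follows, assuming the standard Spearman-Brown step-up formula and the classical correction for attenuation. The text does not spell out the exact formulas or how a zero reliability is handled, so both the formulas and the NaN return value are assumptions, and all numeric inputs are hypothetical.

```python
from math import sqrt, nan

def spearman_brown(half_r):
    """Step up a split-half correlation to a full-length reliability.

    Negative split-half correlations are treated as zero reliability,
    as described in the text.
    """
    if half_r <= 0:
        return 0.0
    return 2 * half_r / (1 + half_r)

def disattenuate(r_xy, rel_x, rel_y):
    """Correct an observed correlation for the unreliability of both measures."""
    if rel_x <= 0 or rel_y <= 0:
        return nan  # assumption: the correction is undefined at zero reliability
    corrected = r_xy / sqrt(rel_x * rel_y)
    # Values beyond +/-1 are treated as a perfect correlation.
    return max(-1.0, min(1.0, corrected))

rel_distance = spearman_brown(0.60)  # hypothetical split-half correlation
rel_size = spearman_brown(0.45)      # hypothetical split-half correlation
corrected_r = disattenuate(-0.50, rel_distance, rel_size)  # hypothetical observed r
print(round(rel_distance, 3), round(rel_size, 3), round(corrected_r, 3))
```

Because the correction divides by the square root of the product of the reliabilities, it can only increase the absolute value of a correlation, which is why the clamping to ±1 is needed.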

Results

Accuracy differed between the symbolic and non-symbolic formats (mean = 95.21%, SD = 2.60 and mean = 63.22%, SD = 4.03, respectively), with notably higher variability observed in the dot condition. This variability originates from the pairs with ratios close to 1, such as 54–55, 53–55, or 52–55, which fell below the detectability threshold of around 1.1 for adults (Halberda, Mazzocco, & Feigenson, 2008; Pica et al., 2004).

Non-symbolic comparisons (Mean (SD) = 777 (167) ms) were slower than comparisons of symbolic two-digit integers (Mean (SD) = 562 (68) ms), t(32) = 8.72, p < .001, d = 1.46.

For non-symbolic quantities, a linear mixed model at the group level on all participants’ trials, with participants as random intercepts, yielded significant effects of distance (t(3586) = 3.104, p = .002) and size (t(3586) = 3.307, p < .001) but no significant interaction (t(3586) = 1.162, p = .245). The second linear mixed model, with ratio as the only regressor, also yielded a significant effect (t(3588) = 22.16, p < .001). For symbolic numbers, a linear mixed model showed significant effects of distance (t(5388) = 6.175, p < .001) and size (t(5388) = 2.689, p = .007), as well as a significant interaction (t(5388) = 2.512, p = .012). As in the non-symbolic task, the second linear mixed model with ratio as the only regressor also yielded a significant effect (t(5390) = 13.87, p < .001).

If size and distance effects reflect a common underlying representation governed by Weber’s law, overall reaction times and the distance effect should be modulated by the magnitude range in which these markers are computed. In line with this, for non-symbolic quantities, participants reacted significantly faster to numbers below 55 (M = 743 ms, SD = 149 ms) than to numbers above 55 (M = 815 ms, SD = 187 ms, t(32) = 7.47, p < .001, d = 0.32). Conversely, for symbolic stimuli, participants reacted faster to numbers above 55 (M = 556 ms, SD = 68 ms) than to numbers below 55 (M = 568 ms, SD = 70 ms, t(32) = 2.36, p = .025, d = 0.16).

At the group level, the distance effect for non-symbolic quantities, as measured via the magnitude (absolute value) of the slope of the regression of RT on number, was larger for numbers below 55 (M = 8.79 ms/number, SD = 4.93 ms) than for quantities above 55 (M = 7.03 ms/number, SD = 4.24 ms, t(32) = 2.52, p = .017, d = 0.38). Similarly, for symbolic stimuli, the distance effect was more pronounced for numbers below 55 (M = 3.57 ms/number, SD = 1.46 ms) than for numbers above 55 (M = 2.32 ms/number, SD = 1.58 ms, t(32) = 3.52, p = .0013, d = 0.80) (see Figure 1). This could reflect the stipulated compression of the underlying magnitude representation in both formats.

Figure 1. Reaction times plotted as a function of the displayed numbers in the non-symbolic (left) and symbolic (right) conditions in Experiment 1. The blue line represents the regression of reaction time on numerical distance, separately for both formats and for numbers larger or smaller than the reference (55). The red line represents the mean reaction time for each presented number.

To probe the notion that distance and size effects emerge from a unitary system (ANS account), we computed the correlation of the unstandardized regression coefficients of the two effects in the two formats (see Figure 2). The estimated correlation coefficient for the non-symbolic task was r(31) = –.65 (p < .001), implying a high degree of correspondence, which is in line with the idea that both effects emerge from the ratio effect. For the symbolic task, the estimated correlation coefficient was not significant (r(31) = –.01, p = .968). These results closely replicate the findings by Krajcsi (2017), who found correlations of r = –.88 and r = –.96 for the non-symbolic task and r = –.11 and r = –.13 with symbolic one-digit integers. The confidence intervals of the correlation coefficients did not overlap between the non-symbolic (95% CI[–.86, –.44]) and symbolic tasks (95% CI[–.37, .35]). The correlation between the ratio effects of both formats was r(31) = .01 (p = .962).

Figure 2. Relation between the distance and size effect unstandardized regression coefficients, displayed on scatterplots and measured with correlation coefficients for dot comparison.

Recently, the idea that interindividual variability should not be considered as noise but rather as a reflection of differences in cognitive processes has gathered some interest (Roth et al., 2024). Roth and colleagues investigated the significance of the SNARC effect at the individual level and argued that this effect, albeit robust at the group level, is not present in all participants. This discussion inspired us to rerun the above analyses with only those participants showing both significant distance and size effects at the individual level. In the non-symbolic task, most distance effects (31 participants; 93.94% of the sample) and a small portion of size effects (11 participants; 33.33% of the sample) were significant (see Figure 3). For symbolic two-digit integers, most of the distance effects were significant (31 participants; 93.94% of the sample) but the number of significant size effects was greatly reduced (6 participants; 18.18%). Therefore, in the case of the symbolic two-digit integers, we were not able to run a meaningful regression analysis on so few participants. In the non-symbolic task, results remained unchanged: the correlation between significant size and distance effects was still significant.

Figure 3. Histogram of participants with significant size or distance effects in Experiment 1 in the two-digit symbolic (Symbolic) and dot array (Non-symbolic) conditions. The black rectangles (overlap) depict the number of participants showing both effects in both formats.

The split-half reliability correction increased the strength of our correlations in both formats (Non-symbolic: r = –.65; r corrected = –.88; Symbolic two-digit integers: r = –.01; r corrected = –.18). A complete report of reliability scores for each condition and experiment can be found in the supplementary materials.

Discussion

Experiment 1 replicated the main results obtained by Krajcsi (2017). We observed a clear correlation between size and distance effects in the non-symbolic format, corroborating existing evidence that the processing of non-symbolic numerosities relies on a unitary analog magnitude representation that is governed by Weber’s law (Dehaene, 2003). For symbolic stimuli, we observed no correlation between size and distance effects, undermining the idea that symbolic number processing is governed by the same system as non-symbolic quantity processing. This result may be due to the inconsistent emergence of a size effect in the symbolic format, as we hypothesized for the results by Krajcsi (2017).

The absence of a consistent size effect across analyses could be due to the overlearned nature of integers. We hypothesized that using less familiar and less automatized symbolic stimuli should produce behavioral signatures typical of the approximate magnitude system. In Experiment 2, we therefore used decimals ranging from 0.01 to 0.98.

Experiment 2

For Experiment 1, we reasoned that using a larger stimulus range would increase the chances of observing a size effect and thereby a correlation between size and distance effects, which would follow from accessing an approximate number representation that is governed by Weber’s law. However, even when using two-digit integers, we failed to observe a correlation between size and distance effects. One may argue that two-digit numbers are still too familiar and overlearned and therefore mostly governed by a discrete semantic system. To test this hypothesis, we used less familiar and less automatized stimuli in Experiment 2: decimal numbers in the range 0.01 to 0.98.