Ring models of binocular rivalry and fusion

Wang, Ziqi; Dai, Wei; McLaughlin, David W.

doi:10.1007/s10827-020-00744-7

Published: 03 May 2020

Volume 48, pages 193–211, (2020)

Journal of Computational Neuroscience

Abstract

When similar visual stimuli are presented to the two eyes, one perceives a fused single image. However, when the two stimuli are distinct, one does not perceive a single image; instead, one perceives binocular rivalry. That is, one perceives one of the stimulated patterns for a few seconds, then the other for a few seconds, and so on, with random transitions between the two percepts. Most theoretical studies focus on rivalry, with few considering the coexistence of fusion and rivalry. Here we develop three distinct computational neuronal network models which capture binocular rivalry with realistic stochastic properties, fusion, and the hysteretic transition between them. Each is a conductance-based point neuron model, which is multi-layer with two ocular dominance columns (L & R) and with an idealized “ring” architecture where the orientation preference of each neuron labels its location on a ring. In each model, the primary mechanism initiating binocular rivalry is cross-column inhibition, with firing rate adaptation governing the temporal properties of the transitions between percepts. Under stimulation by similar visual patterns, each of the three models uses its own mechanism to overcome cross-column inhibition, and thus to prevent rivalry and allow the fusion of similar images: The first model uses cross-column feedforward inhibition from the opposite eye to “shut off” the cross-column feedback inhibition; the second model “turns on” a second layer of monocular neurons as a parallel pathway to the binocular neurons, rivaling out of phase with the first layer, and together these two pathways represent fusion; and the third model uses cross-column excitation to overcome the cross-column inhibition and enable fusion. Thus, each of the idealized ring models depends upon a different mechanism for fusion that might emerge as an underlying mechanism present in real visual cortex.

1 Introduction

“Binocular rivalry” and “binocular fusion” are two fascinating perceptual phenomena that arise in the cortical processing of visual information. Binocular rivalry is a visual phenomenon that appears when two incompatible monocular images are presented, one image to the left eye and the other image to the right eye. Under such stimulation, the perception is not that of a fused image; rather, only one of the images is perceived at a time, for a random duration of a few seconds, with irregular temporal transitions between the perception of one image and the other. This rivalry phenomenon was described centuries ago, for example by Della Porta (1593), as referenced by Wade (1998), and by Dutour (1760), and has attracted extensive research, including psychophysical experiments (e.g., Blake and Fox 1974; Levelt 1965; O’Shea and Crassini 1981), electrophysiology in animals (e.g., Leopold and Logothetis 1996; Gail et al. 2004), and neural imaging studies (e.g., fMRI) in humans (e.g., Polonsky et al. 2000; Tong and Engel 2001). On the other hand, binocular fusion (or single vision) generates the perception of a single image by “fusing” the two monocular stimulations in certain ways. For example, position fusion, as discussed in this paper, is the phenomenon in which two images of oriented objects with similar orientation, presented binocularly, give rise to the perception of a single object that is a combination of the two monocular orientations (Nelson 1975).

Oriented bars and gratings are common visual stimuli with which to test binocular rivalry quantitatively. When the angles of grating orientation are similar, fusion results; when orthogonal, rivalry.

The cortical mechanisms that underlie binocular rivalry and fusion are not completely understood. Most likely these mechanisms are initiated by front end “lower level” processing in the primary visual cortex (V1) (Xu et al. 2016), and significantly modulated by top-down feedback from higher cortical regions, such as attention. See Brown and Norcia (1997), Zhang et al. (2011), Brascamp and Blake (2012), Cavanagh and Holcombe (2006), and Ling and Blake (2012), and a review by Dieter et al. (2016). Interocular (or “cross-column”) inhibition plays a significant role in binocular rivalry, and theoretical models have been proposed to explain rivalry (Laing and Chow 2002; Wilson 2003; Moreno-Bote et al. 2007), each relying on some form of interocular inhibition as a key mechanism. However, less common are models of binocular rivalry for distinct stimuli that also include the binocular fusion of similar stimuli, and those which do study rivalry and fusion all use the same mechanism (Said and Heeger 2013; Li et al. 2017; Wilson 2017). Previously, two differing general conceptual ideas of fusion have been proposed: one by Blake and O’Shea (1988) and Blake (1989), and the other by Wolfe (1986). In Blake’s view, fusion is the default percept for similar stimulations; and only when the two binocular stimuli are too dissimilar to fuse does rivalry result (Blake 1989). On the other hand, in Wolfe’s view, rivalry is always present, and the perception of fusion results from the interaction of two pathways (the stereopsis and rivalry pathways) (Wolfe 1986). In addition to these two conceptual ideas of fusion, we propose a third: to introduce cross-column excitatory connections to overcome cross-column inhibition and allow the fusion of similar binocular stimulation.

In this work, we investigate how each one of these three conceptual mechanisms for binocular rivalry and fusion can be explicitly realized in computational models by developing three distinct neuronal network models. Each model consists of conductance-based, integrate-and-fire point neurons with firing rate adaptation, divided into subpopulations for each functional layer, each type (excitatory or inhibitory), and each ocular dominance “column”. These models are similar to more realistic large-scale neuronal models, but with two significant idealizations: i) a one-dimensional “ring” architecture, and ii) the explicit incorporation into each model of a mechanism designed to produce rivalry and fusion. Our goal for these models is to understand in detail each potential conceptual mechanism; and (since these types of models are easy to compare with more realistic models) to develop some understanding about which of the conceptual frameworks is likely to emerge in large-scale comprehensive neuronal models of layers of primate V1, that is, models (such as Cai et al. 2005; Zhou et al. 2013; Chariker et al. 2016) that are constructed through anatomical and physiological constraints, and not by inputting special architecture designed to achieve binocular rivalry and fusion. Through this, we hope to develop intuition about which of the conceptual mechanisms are likely to be present in human or primate primary visual cortex.

Each of the three models captures both fusion and rivalry, and the hysteretic transitions (Wilson 2017; Buckthought et al. 2008) between them, but through distinct underlying cortical mechanisms. These models address the front end of the visual system and do not contain higher level feedback such as attention; thus, they address only front end mechanisms that initiate binocular rivalry and fusion. Each model consists of several “layers” of neurons (see Figs. 1, 2, & 3), with L-R ocular dominance layers of monocular neurons that receive excitation from the left (right) eye, together with a binocular “summation” or “perception” layer that sums information from the two ocular dominance layers. In this Introduction, we give a general overview of each of the three models, with more detailed descriptions in the section Methods and models.

We begin with the features common to each of the three models: Locally, and within each ocular dominance column, the excitatory synaptic connections between point neurons are mediated by (fast) AMPA type glutamate receptors; local inhibition is mediated by (fast) GABA A type receptors. All long-range cross-column projections are excitatory, with synaptic connections mediated by (slow) NMDA type receptors, selectively targeting both inhibitory and excitatory neurons with orientation preference similar to that of the projecting excitatory neuron. The models are idealized through the use of a “ring” architecture, in which the neurons of each subpopulation reside on a “ring” labeled by the neuron’s orientation preference, 𝜃_j, j = 1,...,N. Theoretical studies (Shpiro et al. 2007; Moreno-Bote et al. 2007; Wilson 2003; Laing and Chow 2002) have convincingly argued that the primary mechanism underlying binocular rivalry is cross-column inhibition, which is common to all three models. This inhibition is generated by excitatory neurons in the left (right) ocular dominance column projecting to inhibitory neurons in the other [right (left)] column. These cross-column excitatory projections are presumably long-range; thus, they project selectively for orientation preference, to NMDA receptors on inhibitory neurons.

To prevent rivalry and allow fusion, some mechanism must overcome cross-column inhibition and allow similar stimuli to fuse. Each of the three models has its own distinct mechanism to overcome cross-column inhibition. The first model follows Said and Heeger (2013) and implements fusion as a default percept through a subpopulation of excitatory neurons (“opponency neurons”). These opponency neurons drive feedback cross-column inhibition, but themselves are inhibited by “cross-column feedforward inhibition”; thus, they act as an “XOR” logic gate that is only active when the visual stimuli to each eye are sufficiently distinct. The second model realizes, for the first time, the conceptual framework of “two parallel pathways”, replacing the opponency neurons by a second layer of monocular neurons that provides a second pathway to the summation layer. This additional monocular layer acts as an “AND” logic gate which is active only when similar images are presented to each eye. The third model does not use a distinct subpopulation of neurons as a gate; rather, it introduces cross-column excitation between monocular neurons to overcome cross-column inhibition. These cross-column excitatory connections may be similar to connections suggested in the review paper of Tong et al. (2006), where the exact function of the connections is not specified. In this third model, a balance between cross-column excitation and cross-column inhibition is achieved (we believe for the first time), allowing distinct images to rival and similar images to fuse.

2 Methods and models

Our models are multilayer, with each layer containing two ocular dominance columns corresponding to the two eyes. (For convenience, we use the term “layers” to organize the description of the sub-populations of neurons. These “layers” do not correspond to different layers of V1 in the visual cortex; rather, the sub-populations will most likely reside in a single layer of V1 such as 4Cα.) Each layer consists of conductance-based, integrate-and-fire point neurons with firing rate adaptation, divided into sub-populations for each layer and each type (excitatory or inhibitory). Each sub-population consists of 90 neurons, each labeled by its orientation preference, 𝜃_j = jΔ_𝜃, j = 1,...,90, with Δ_𝜃 = 2 degrees. Thus, each sub-population can be thought of as residing on a “ring”. The membrane potential $V^{j}_{\sigma}$, σ = E, I, of each excitatory (inhibitory) neuron satisfies

$$ \begin{array}{@{}rcl@{}} C_{\sigma} \frac{d}{dt}V^{j}_{\sigma} &=& - g_{L,\sigma}\left( V^{j}_{\sigma} - V_{L}\right) - g^{j}_{\sigma E}(t) \left( V^{j}_{\sigma} - V_{E}\right) \\ && - g^{j}_{\sigma I}(t) \left( V^{j}_{\sigma} - V_{I}\right) , \end{array} $$

(1)

where C_σ, σ = E, I, denotes the capacitances, g_L,σ are the leak conductances, $g^{j}_{\sigma E}(t)$ and $g^{j}_{\sigma I}(t)$ are the time-dependent excitatory and inhibitory conductances, and V_L, V_E, and V_I are reversal potentials. The neuron generates a spike when its voltage reaches a threshold value, $V^{j}_{\sigma}(t = t^{j}_{\sigma}) = V_{T}$, after which $V^{j}_{\sigma}$ is reset to V_reset for $t \in (t^{j}_{\sigma}, t^{j}_{\sigma} + \tau_{ref})$. Biophysical parameters are used: C_E = 0.5 nF, C_I = 0.2 nF, g_L,E = 25 nS, g_L,I = 20 nS, V_L = V_reset = −70 mV, V_E = 0 mV, V_I = −70 mV, V_T = −55 mV, τ_ref = 2 ms.
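As an illustration of Eq. (1) with the parameters just listed, the following is a minimal sketch of a single point neuron driven by constant conductances. It is not the authors' Brian 2 code: the function name `simulate_lif`, the constant (rather than synaptic) conductances, the forward Euler step, and the unit conversion to μS are all assumptions made for the example.

```python
import math

# Parameters quoted in the text, expressed in uS / nF / mV / ms so that
# conductance * voltage / capacitance comes out directly in mV per ms.
C_NF = {"E": 0.5, "I": 0.2}
G_LEAK_US = {"E": 0.025, "I": 0.020}   # 25 nS and 20 nS
V_L = V_RESET = -70.0
V_E, V_I, V_T = 0.0, -70.0, -55.0
TAU_REF_MS = 2.0


def simulate_lif(g_exc_us, g_inh_us=0.0, kind="E", t_stop_ms=500.0, dt_ms=0.05):
    """Return spike times (ms) of one point neuron under constant conductances."""
    c, g_l = C_NF[kind], G_LEAK_US[kind]
    v = V_L
    refractory_until = -math.inf
    spikes = []
    for step in range(int(round(t_stop_ms / dt_ms))):
        t = step * dt_ms
        if t < refractory_until:
            v = V_RESET          # voltage is held at reset while refractory
            continue
        # Right-hand side of Eq. (1), divided by the capacitance.
        dv_dt = (-g_l * (v - V_L) - g_exc_us * (v - V_E) - g_inh_us * (v - V_I)) / c
        v += dt_ms * dv_dt       # forward Euler (an assumption; the paper uses RK4)
        if v >= V_T:
            spikes.append(t)
            v = V_RESET
            refractory_until = t + TAU_REF_MS
    return spikes
```

For example, `simulate_lif(0.02)` fires repeatedly, while `simulate_lif(0.02, g_inh_us=0.1)` stays silent because the added inhibitory conductance pulls the steady-state voltage below threshold.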

Conductance profiles $g^{j}_{\sigma \sigma ^{\prime }}(t)$ contain an external drive, the cortical-cortical interactions within and between layers, and firing rate adaptation:

$$ \begin{array}{@{}rcl@{}} g^{j}_{\sigma E}(t) &=& g^{j}_{LGN}(t) + g^{j}_{\sigma E, c-c}(t), \\ g^{j}_{\sigma I}(t) &=& g^{j}_{\sigma I, c-c}(t) + g^{j}_{\sigma I, adapt}(t). \end{array} $$

(2)

The excitatory external drive $g^{j}_{LGN}(t)$ loosely represents the drive from the visual grating stimuli, through the retina and the LGN, to V1. Only the neurons in the monocular layers are driven by visual stimuli, which are gratings presented to each “eye”, with the left (right) eye driving the left (right) monocular layers. This excitatory drive is represented by Poisson spike trains, producing AMPA synaptic inputs (with decay time scales of 2ms) to the monocular neurons. The strength of external drive (i.e., Poisson firing rate) to the j th monocular neuron with orientation preference 𝜃_j decays with 𝜃 − 𝜃_j (where 𝜃 is the orientation of the grating) as a Gaussian with standard deviation σ = 12.0 degrees (FWHM = 28 degrees). These Poisson spike trains provide stochasticity, and there is no other source of noise in the model.
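The tuned Poisson drive described above can be sketched as follows. This is an illustrative reading of the text, not the authors' implementation: the peak rate passed to `lgn_rates` is a hypothetical value, and treating the orientation difference as periodic over 180 degrees is an assumption suggested by the ring architecture rather than something the text states.

```python
import math
import random

N_NEURONS = 90
DELTA_THETA_DEG = 2.0
SIGMA_DEG = 12.0   # FWHM = 2 * sqrt(2 ln 2) * sigma, roughly 28 degrees


def orientation_difference(a_deg, b_deg):
    """Smallest difference between two orientations, taken as 180-periodic."""
    d = abs(a_deg - b_deg) % 180.0
    return min(d, 180.0 - d)


def lgn_rates(theta_stim_deg, peak_rate_hz):
    """Poisson rate for each neuron j = 1..90 with preference theta_j = 2 j degrees."""
    rates = []
    for j in range(1, N_NEURONS + 1):
        d = orientation_difference(theta_stim_deg, j * DELTA_THETA_DEG)
        rates.append(peak_rate_hz * math.exp(-d * d / (2.0 * SIGMA_DEG ** 2)))
    return rates


def poisson_spike_times(rate_hz, t_stop_s, rng):
    """Homogeneous Poisson train built from exponential inter-spike intervals."""
    times, t = [], 0.0
    if rate_hz <= 0.0:
        return times
    while True:
        t += rng.expovariate(rate_hz)
        if t >= t_stop_s:
            return times
        times.append(t)
```

A grating at 36 degrees would then drive neuron j = 18 most strongly, with `poisson_spike_times(rate, t_stop_s, random.Random(seed))` supplying the input spike train for each neuron.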

There are three different types of visual stimulation: i) For rivalry: the left (right) column is stimulated by an oriented grating with orientation 𝜃_L (𝜃_R = 𝜃_L + 90 degrees); ii) For fusion: each column is stimulated simultaneously by gratings with very similar orientations; iii) For the hysteretic transition: the left column is stimulated by a grating oriented at 𝜃_L, while the right column is stimulated by a grating at 𝜃_R = 𝜃_L + Δ, ramping away from fusion by slowly increasing Δ from 0, and ramping back to fusion by fixing 𝜃_L sufficiently positive and slowly decreasing Δ, with a speed of 2 degrees per second. In each model, the strength of stimulation is chosen to produce firing rates of a single neuron in the range of 10–50 Hz.
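The ramp used for the hysteretic transition can be written as a simple function of time. The sketch below assumes that the upward and downward ramps are run back to back and that the maximum separation `delta_max_deg` is a free choice; neither is specified in the text, and the function names are hypothetical.

```python
RAMP_SPEED_DEG_PER_S = 2.0   # ramp speed stated in the text


def ramp_delta(t_s, delta_max_deg):
    """Orientation difference Delta(t): ramp 0 -> delta_max, then back to 0."""
    t_up = delta_max_deg / RAMP_SPEED_DEG_PER_S      # duration of the upward ramp
    if t_s <= t_up:
        return RAMP_SPEED_DEG_PER_S * max(t_s, 0.0)
    return max(delta_max_deg - RAMP_SPEED_DEG_PER_S * (t_s - t_up), 0.0)


def stimulus_orientations(t_s, theta_left_deg, delta_max_deg):
    """Grating orientations (left, right) at time t_s during the ramp protocol."""
    return theta_left_deg, theta_left_deg + ramp_delta(t_s, delta_max_deg)
```

With `delta_max_deg = 90`, the upward ramp takes 45 s, so the full sweep from fusion to orthogonal gratings and back lasts 90 s.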

Within each ocular dominance column, the excitatory synaptic transmissions, including those that project to binocular neurons, are mediated by AMPA type glutamate receptors. Local inhibition is mediated by GABA A type receptors. All long-range cross-column projections are excitatory, with synaptic transmissions mediated by NMDA type receptors. The dynamics of AMPA and GABA A synaptic gating variables are modeled as instantaneous “rise-time” jumps of magnitude 1 when a spike occurs presynaptically, followed by an exponential decay with time constants of 2 ms (10 ms) for AMPA (GABA A), respectively. Each neuron in the model undergoes an adaptation which contributes to the inhibitory GABA A conductance, and whose efficacy increases by 0.15 with each spike and decays to zero with a time constant of 2000 ms. NMDA synaptic dynamics are modeled as follows:
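The jump-and-decay dynamics of the AMPA, GABA A, and adaptation variables can be evaluated in closed form. The sketch below is an assumption-laden illustration: it treats successive jumps as additive, and the function name `gating_trace` is hypothetical. It shows why adaptation, with its 2000 ms time constant, accumulates over a dominance period while AMPA gating does not.

```python
import math

TAU_AMPA_MS, TAU_GABA_MS, TAU_ADAPT_MS = 2.0, 10.0, 2000.0
ADAPT_INCREMENT = 0.15   # adaptation efficacy added per spike


def gating_trace(spike_times_ms, tau_ms, jump, sample_times_ms):
    """Value of a jump-and-decay variable at each sample time.

    Every spike at or before the sample time adds `jump`, which then decays
    exponentially with time constant tau_ms; contributions superpose.
    """
    trace = []
    for t in sample_times_ms:
        value = sum(jump * math.exp(-(t - ts) / tau_ms)
                    for ts in spike_times_ms if ts <= t)
        trace.append(value)
    return trace
```

For a regular 50 Hz spike train lasting one second, the adaptation variable builds up to several units while the AMPA variable has almost fully decayed 20 ms after the last spike.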

$$ \begin{array}{@{}rcl@{}} \tau_{s}\frac{d}{dt} s &=& -s + \alpha \tau_{s} x (1-s), \\ \tau_{x} \frac{d}{dt} x &=& - x + \sum\limits_{j} \delta(t-t^{E}_{j}), \end{array} $$

(3)

where s is a gating variable with decay time constant τ_s = 100 ms, whose rise toward saturation is controlled by α = 0.5/ms, and x is an intermediate variable which jumps instantaneously to magnitude 1 when an excitatory spike occurs presynaptically at spike time ${t^{E}_{j}}$, and decays exponentially with time constant τ_x = 2 ms.
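A minimal sketch of how Eq. (3) might be integrated numerically is given below. It uses forward Euler rather than the Runge-Kutta scheme used in the paper, and it implements each presynaptic spike as an additive unit jump in x, which is an assumption (for isolated spikes it coincides with a jump to magnitude 1). The function name `nmda_trace` is hypothetical.

```python
TAU_S_MS = 100.0     # decay time constant of the gating variable s
TAU_X_MS = 2.0       # decay time constant of the intermediate variable x
ALPHA_PER_MS = 0.5   # controls the rise of s toward saturation


def nmda_trace(spike_times_ms, t_stop_ms, dt_ms=0.01):
    """Forward-Euler integration of Eq. (3); returns s sampled every dt_ms."""
    pending = sorted(spike_times_ms)
    next_spike = 0
    s = x = 0.0
    trace = []
    for step in range(int(round(t_stop_ms / dt_ms))):
        t = step * dt_ms
        while next_spike < len(pending) and pending[next_spike] <= t:
            x += 1.0                      # unit jump per presynaptic spike (assumed additive)
            next_spike += 1
        trace.append(s)
        ds_dt = -s / TAU_S_MS + ALPHA_PER_MS * x * (1.0 - s)
        dx_dt = -x / TAU_X_MS
        s += dt_ms * ds_dt
        x += dt_ms * dx_dt
    return trace
```

After a single spike, s rises within a few milliseconds and then decays over hundreds of milliseconds, which is the sustained time course that distinguishes NMDA from AMPA in these models.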

The coupling strengths used in the three models are specified, together with the network architecture in the captions of Figs. 1, 2, and 3. We emphasize that these are idealized models, in which each neuron receives many fewer synaptic contacts than a real cortical neuron; thus, we do not use biophysical measurements to set the synaptic coupling strengths. Rather, the strengths are tuned to produce the phenomena of rivalry and fusion with realistic characteristics.

The numerical simulation is carried out using the package Brian 2 in Python (Stimberg et al. 2013), with Runge-Kutta 4 as the numerical method. We use MATLAB (MATLAB 2018) for all data analysis.

Turning to the coupling architecture, we first describe the architecture that is common to all three models: All within-column excitatory connections target neurons of similar orientation preference; thus, these excitatory connections have weights following a Gaussian footprint as a function of difference in orientation preferences:

$$ W_{ij} = W(\theta_{i} - \theta_{j}) = \exp\left[-\frac{(\theta_{i} - \theta_{j})^{2}}{2\sigma^{2}}\right], $$

(4)

where (𝜃_i − 𝜃_j) is the difference between the orientation preferences of the i-th and j-th neurons, with σ = 12.0 degrees. Note that all synaptic projections across ocular dominance columns are excitatory and preferentially target neurons of similar orientation preference. Often the inhibitory neurons receiving cross-column excitatory projections themselves project non-selectively with respect to orientation, with coupling weights that are equal for all differences in orientation preference; however, each of the first two models has a sub-population of inhibitory neurons which project selectively with a Gaussian footprint, with σ = 12.0 degrees. We note that, in primate visual cortex, inhibitory neurons near the pinwheel centers of the ordered map of orientation preference would project to all angles of preference; and we note that the nonselective projection of inhibitory neurons is consistent with the psychophysical observation that “cross-eye inhibition” suppresses all orientations equally (Blake and Lema 1978).
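The two kinds of footprint described here, selective (Eq. 4) and non-selective, can be sketched as weight matrices over the 90 orientation preferences. As an assumption, the difference 𝜃_i − 𝜃_j is wrapped onto the ring with period 180 degrees; the overall coupling strengths, which the paper gives in the figure captions, are left out, and the function names are hypothetical.

```python
import math

N_NEURONS = 90
DELTA_THETA_DEG = 2.0


def ring_difference(theta_i_deg, theta_j_deg):
    """Orientation difference wrapped onto the ring (period 180 degrees, assumed)."""
    d = abs(theta_i_deg - theta_j_deg) % 180.0
    return min(d, 180.0 - d)


def gaussian_weights(sigma_deg=12.0):
    """Selective footprint of Eq. (4): exp(-(theta_i - theta_j)^2 / (2 sigma^2))."""
    thetas = [j * DELTA_THETA_DEG for j in range(1, N_NEURONS + 1)]
    return [[math.exp(-ring_difference(ti, tj) ** 2 / (2.0 * sigma_deg ** 2))
             for tj in thetas] for ti in thetas]


def uniform_weights():
    """Non-selective footprint: equal weight for every orientation difference."""
    return [[1.0] * N_NEURONS for _ in range(N_NEURONS)]
```

Each matrix would then be scaled by the coupling strength of the corresponding projection.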

The primary mechanism underlying binocular rivalry is common to all three models – cross-column inhibition. This inhibition is generated by excitatory neurons in the left (right) ocular dominance column projecting to inhibitory neurons in the other [right (left)] column. These cross-column excitatory projections project selectively (for orientation preference) to NMDA receptors on inhibitory neurons. To prevent rivalry, some mechanism(s) must overcome cross-column inhibition and allow similar visual stimuli to fuse. Each of the three models has its own distinct architecture and mechanism to overcome cross-column inhibition. We describe these separately for each model.

The first model (shown schematically in Fig. 1) is a three-layer model: monocular neurons which reside in the left (right) columns of the “lowest” layer; opponency neurons which reside in the left (right) columns (though they receive synaptic inputs from both left and right monocular neurons, we classify them by their source of excitation); and summation neurons which are binocular, shared by the two columns, and constitute the model’s “top” layer. Neurons in the binocular layer receive and sum excitatory inputs from the layer of monocular neurons (projection 5 in Fig. 1, together with its symmetric partner (not shown)), and their activities are assumed to reflect the percepts. This top summation layer has 90 excitatory binocular neurons that receive inputs from excitatory monocular neurons of both eyes, and 90 inhibitory binocular neurons (not shown in Fig. 1) that provide local, recurrent inhibition of the excitatory binocular neurons.

External stimuli of oriented gratings on the left (right) “eye” provide Poisson spike trains that drive the left (right) monocular layer, as described above. In each ocular dominance column, the monocular layer has 90 excitatory neurons receiving external stimuli, together with 90 inhibitory neurons (not shown in Fig. 1) that receive external stimuli and recurrent excitation from excitatory monocular neurons in the same column, and in turn inhibit locally. The monocular layer also has 90 additional inhibitory neurons (explicitly shown in Fig. 1) that receive cross-column feedback excitation from excitatory opponency neurons of the other ocular dominance column (projection 4 in Fig. 1); in turn, these monocular inhibitory neurons locally inhibit the excitatory monocular neurons equally at all angles of preference (projection 6 in Fig. 1). This type of inhibition of the excitatory monocular neurons (projection 4 composed with projection 6 in Fig. 1, together with its symmetric partner (not shown)) will be referred to as cross-column feedback inhibition.

Each layer of opponency neurons contains 90 excitatory neurons and 90 inhibitory neurons. The excitatory opponency neurons receive excitatory inputs from the same-eye monocular neurons (projection 1 in Fig. 1) and inhibitory inputs from inhibitory neurons in the opponency layer of similar orientation preference. These inhibitory neurons themselves are driven selectively by feedforward projections from excitatory monocular neurons of the other ocular dominance column. This pathway of inhibition of the excitatory opponency neurons (projection 2 composed with projection 3 in Fig. 1) will be referred to as cross-column feedforward inhibition.

We emphasize that, in this first model, the synaptic projections across ocular dominance columns are all excitatory and preferentially target neurons of the same orientation preferences. The monocular inhibitory neurons driven by cross-column excitation locally project to monocular neurons non-selectively with respect to orientation preference; while the inhibitory neurons in the opponency layer that participate in cross-column feedforward inhibition selectively project to excitatory opponency neurons of similar orientation preference through a Gaussian footprint with σ = 12.0 degrees.

The second model (shown schematically in Fig. 2) replaces opponency neurons with a second layer of monocular neurons that provide a second parallel projection to the binocular neurons. It is a four-layer model, with two monocular layers (A and B), a layer of excitatory neurons providing cross-column excitation, and a binocular summation layer. The excitatory neurons in the new monocular layer B are inactive under dissimilar binocular stimulation, because their external stimulation is cancelled by the local inhibition within the layer (which is also stimulated externally). Under similar stimulation, layer B becomes active (through cross-column dis-inhibition) and provides a second active pathway to the binocular layer (projection 3 in Fig. 2). We find it easiest to describe this second model as a two-step conversion of the first: (i) The elimination of opponency neurons by removing from the opponency layer the inhibitory neurons involved in cross-column feedforward inhibition, while retaining the excitatory neurons in the (formerly) opponency layer as a source of cross-column excitation. (This layer is renamed the cross-column excitation layer. Its presence provides a source of cross-column feedback inhibition, and is just a matter of convenience, as this source could also be provided by the excitatory neurons in the “original” monocular layer.) (ii) The addition of layer B of monocular neurons, with its inhibitory neurons themselves inhibited selectively (by projection 1 of Fig. 2) by projections from the inhibitory neurons of similar orientation preference in the original monocular layer A of the same column; and with the excitatory neurons in layer B inhibited selectively by layer B’s inhibitory neurons of similar orientation preference (by projection 2 of Fig. 2). The excitatory neurons in both layers A and B project to the binocular summation neurons, providing the two parallel projection pathways. When layer B’s inhibitory neurons are active, they silence layer B’s excitatory neurons; hence, they silence layer B’s projection to the summation layer. On the other hand, when layer B’s inhibitory neurons are inhibited, layer B’s excitatory neurons are dis-inhibited and produce a second active projection to the summation layer.

In addition to the projections that mediate the disinhibition mechanism mentioned above, there are two additional projections (not shown in Fig. 2) from layer B to layer A’s inhibitory neurons, one excitatory and one inhibitory. These connections facilitate the disinhibition mechanism by forming a feedback loop. In this second model, the coupling strengths are retained for all of the remaining local and global couplings of the first model. The newly introduced couplings all have strengths following a Gaussian footprint of σ = 12.0 degrees.

In this second model, we note that all excitatory cross-column projections project selectively to the same orientation preferences. The inhibitory neurons driven by cross-column excitation project to monocular layer A neurons non-selectively; while the inhibitory projections from monocular layer A to the new monocular layer B and the inhibitory projections within layer B project selectively.

The third model (shown schematically in Fig. 3) is a two-layer model, with two ocular dominance columns of monocular neurons and a binocular summation layer. There is neither an opponency layer nor a secondary monocular layer; rather, excitatory monocular neurons in each ocular dominance column possess two types of cross-column connections, each selectively targeting neurons in the other column of similar orientation preference (standard deviation of 3 degrees). The first type selectively targets inhibitory neurons, and is the source of cross-column inhibition (projection 1 in Fig. 3); the second type selectively targets excitatory neurons, and is the source of (a new) cross-column excitation (projection 3 in Fig. 3). In this third model, rather than shutting off cross-column inhibition, binocular stimulation with similar stimuli produces sufficient cross-column excitation that, together with the external drive, it overcomes the cross-column inhibition and allows fusion to occur; whereas, under orthogonal stimulation, cross-column excitation targets neurons that are not externally stimulated, and thus is not strong enough to overcome cross-column inhibition, so rivalry occurs.

3 Results

We study each model’s responses to three different types of binocular stimulation: i) For rivalry: orthogonal gratings; ii) For fusion: similarly oriented gratings; iii) For the hysteretic transition: ramping the orientation difference between the two gratings, by slowly increasing (decreasing) the orientation difference from a fused (rivaling) configuration.

3.1 Rivalry and fusion in the first model

Binocular rivalry of orthogonal gratings

First, we stimulate with orthogonal gratings oriented at 𝜃_L = 36^∘ and 𝜃_R = 126^∘. Stimuli to the left and right eyes have identical strengths. Neurons located near the L-monocular excitatory neuron with orientation preference 36^∘ [and near the R-monocular excitatory neuron with orientation preference 126^∘] also receive stimulation through the left eye (right eye), but with strengths falling off as a Gaussian, as described in Methods.

As shown in the spike-time raster plots of Fig. 4, the model rivals when orthogonal gratings are presented dichoptically. Figure 4 shows the firing patterns of the excitatory neurons in each of three layers (L & R monocular, L&R opponency, and binocular). The L & R monocular neurons rival – fire stochastically in alternations, with approximate average duration of 2-3 seconds (Fig. 4c). The R(L) opponency neurons have a similar alternating firing pattern to the R(L) monocular neurons (Fig. 4b), because the former are driven directly by the latter. These alternations are caused by cross-column feedback inhibition (projection 4 composed with projection 6 in Fig. 1). Under this orthogonal stimulation, the excitatory opponency neurons are not inhibited by those nearby inhibitory neurons driven by cross-column feed-forward projections from monocular neurons (projection 2 composed with projection 3 in Fig. 1), because those monocular neurons themselves are not excited. (See Fig. 1.) Figure 4a shows that the summation neurons stochastically alternate. These alternations at the summation layer are summed from projections from the (rivaling) L-R monocular layers.

Performance details of the model’s rivalry conform qualitatively with experimental observations. The distribution of dominance durations simulated in our model follows a skewed gamma distribution (Fig. 5a), which is in agreement with previous literature (Laing and Chow 2002). We also confirm that the model conforms with two of Levelt’s rules (Levelt 1965). Levelt’s rules 1 & 2 predict that if stimulus contrast to one eye is fixed while that to the other eye is varied, (1) the mean dominance duration of the eye presented with increasing input will also increase, and (2) changes in the mean dominance duration are more significant for the image with relatively higher contrast (Li et al. 2017). To test this within the model, the Poisson firing rate of the incoming spike train is held constant for the left ocular dominance column, while the firing rate to the right column is varied. As shown in Fig. 5b, the model captures the changes of the mean dominance duration predicted by Levelt’s rules 1 & 2. Since the other two models use the same mechanism to realize rivalry, as explained in Methods, they conform with experimental observations as well (results not shown).
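Dominance durations of the kind analyzed here can be extracted from two population rate traces. The sketch below is not the authors' MATLAB analysis: it assumes that dominance is assigned to whichever column has the larger rate at each sample (ties go to the right column), and it uses a method-of-moments estimate of gamma parameters as a stand-in for whatever fitting procedure was actually used. Both function names are hypothetical.

```python
def dominance_durations(left_rate, right_rate, dt_s):
    """Split two equally sampled rate traces into (eye, duration) dominance epochs."""
    epochs = []
    current, length = None, 0
    for l, r in zip(left_rate, right_rate):
        eye = "L" if l > r else "R"
        if eye == current:
            length += 1
        else:
            if current is not None:
                epochs.append((current, length * dt_s))
            current, length = eye, 1
    if current is not None:
        epochs.append((current, length * dt_s))
    return epochs


def gamma_moments(durations):
    """Method-of-moments gamma parameters (shape, scale) for a list of durations."""
    n = len(durations)
    mean = sum(durations) / n
    var = sum((d - mean) ** 2 for d in durations) / (n - 1)
    return mean * mean / var, var / mean
```

Repeating the extraction while varying the input rate to one column, and comparing the mean durations per eye, is one way the test of Levelt's rules 1 & 2 could be set up.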

Binocular fusion of similarly oriented gratings

Psychophysics (Kaufman and Arditi 1976) shows that similar images fuse. To study fusion in the first model, we stimulate each ocular dominance column (L, R) dichoptically, with an oriented grating and a grating at a nearby orientation, respectively. Figure 6 shows model one’s response when the gratings are separated by only 2 degrees. The oriented gratings that stimulate each column are very similar; hence, they evoke largely overlapping neuronal activity in both left and right dominance columns. As the stimulation for each column is so similar, the excitatory and inhibitory drives of the excitatory opponency neurons cancel each other out, so that the opponency neurons rarely fire (Fig. 6b&c). These silenced opponency neurons cannot drive cross-column feedback inhibition, and so allow persistent firing of excitatory monocular neurons (Fig. 6d&e). The binocular neurons sum the excitations from both eyes, and the dominant “percept” is a combination of the two stimulated orientations. As shown in Fig. 6a, the images fuse, as described in the experimental literature (Nelson 1975).

More trivially, identical images (such as gratings at the same orientation, or identical plaids shown to both eyes) must fuse. In all three of our models, identical images fuse (not shown).

NMDA mediates opponency mechanism more effectively than AMPA

The models show that NMDA type receptors, due to their relatively long and sustained time course, facilitate the opponency mechanisms underlying fusion. To demonstrate this within the first model, we replaced the decay time constant of NMDA with a much smaller time constant (2 ms) like that of AMPA type receptors, and found (not shown) that the monocular neurons always rival, even under the presentation of identical stimulations to both columns. This difference in response between NMDA and AMPA is due to NMDA’s longer sustained time course, which provides a higher level of cross-column feedforward excitation to inhibitory neurons; and in turn, inhibits the excitatory opponency neurons more effectively than AMPA, whose time course is less-sustained. Thus, the opponency neurons drive cross-column feedback inhibition which results in rivalry – even with the presentation of identical stimuli. If fusion were to occur with cross-column excitation mediated by AMPA rather than NMDA receptors, a stronger cross-column feedback would be required.

3.2 Rivalry and fusion in the second model

The second model is realized as a four-layer model, with the opponency layer replaced by a purely excitatory layer, together with the addition of a second layer of monocular neurons (layer B). Both the excitatory and inhibitory neurons in this second monocular layer are externally driven, balanced in such a way that the external excitatory drive on the monocular excitatory neurons in layer B is cancelled by that layer’s local inhibition; thus, the projection of this second layer to the binocular summation neurons is silent, unless this balance is broken by the other layers of the model.

Given the ever-present cross-column feedback inhibition, left and right column monocular neurons in layer A will always rival, under both similar and distinct stimuli. In addition, the layer A inhibitory neurons project selectively to inhibitory neurons in monocular layer B. Thus, the inhibitory neurons in monocular layer B can be inhibited, providing local “dis-inhibition” to that layer’s excitatory neurons. These layer B excitatory neurons will only be active when this feedback dis-inhibition is present, which in turn will only be active with similar stimuli to both columns. When active, this dis-inhibition breaks the excitatory-inhibitory balance in layer B, and releases the layer B excitatory monocular neurons in the left (right) column in temporal phase with the layer A excitatory monocular neurons in the other [right (left)] column. That is, the rivalry of layer B neurons will be out of phase with the rivalry of layer A neurons; and the two parallel projections to the binocular neurons will sum together at the binocular layer for a coherent “perception” of fusion.

Binocular rivalry of orthogonal gratings

The response of model 2 to orthogonal binocular stimulation is shown in Fig. 7. Indeed, the new layer B of monocular neurons is silent (Fig. 7c); the original monocular layer A rivals (Fig. 7b); and the summation layer inherits this rivalry (Fig. 7a). As this rivalry is driven solely by monocular layer A, it is identical to the rivalry in model 1, with identical stochastic characteristics (not shown).

Binocular fusion of similarly oriented gratings

Figure 8 shows the response of model 2 to binocular stimulation by gratings of similar orientation, with gratings separated by two degrees of orientation preference. The original monocular layer A continues to rival (Fig. 8b); the new monocular layer B is now active and rivals (Fig. 8c), out of phase with layer A; and the two parallel projections produce fusion in the binocular layer (Fig. 8a). The same mechanism of model 2 causes identical binocular images such as plaids to fuse at the summation layer into a single image (not shown).

3.3 Rivalry and fusion in the third model

The third model is a two-layer model – a monocular layer externally driven and projecting to a binocular summation layer (see Fig. 3). The excitatory monocular neurons have two essential types of cross-column projections – i) cross-column inhibition, that broadly inhibits the receiving column’s monocular excitatory neurons; and ii) cross-column excitation, where the monocular excitatory neurons selectively target monocular excitatory neurons of similar orientation preference in the other column. Under similar binocular stimulation, this additional direct and selective cross-column excitation overcomes the cross-column inhibition and allows similar stimuli to fuse.

Binocular rivalry of orthogonal gratings

When binocularly stimulated by orthogonal gratings, cross-column excitation is not effective in driving the monocular excitatory neurons in the other column. To see this, recall that the monocular excitatory neurons in the left column are driven by external gratings of orientation 𝜃_L, and thus their cross-column excitatory projections selectively target excitatory neurons in the right column of orientation preference near 𝜃_L. However, the orthogonal external stimulus to the right column does not drive these 𝜃_L neurons; thus, the total excitation of these right column excitatory neurons is not strong enough to overcome their cross-column inhibition, and rivalry occurs. (See Fig. 9.)

Binocular fusion of similarly oriented gratings

On the other hand, when binocularly stimulated by gratings of similar orientation, monocular neurons are driven strongly by both the external drive and the cross-column excitation, which together overcome cross-column inhibition and allow similarly oriented gratings to fuse. (See Fig. 10.) We note that in the fused state of the third model, the spread of the raster plot of monocular neurons is much narrower than in rivalry. This is because the spread is determined by the cross-column excitation, which has a tight orientation selectivity (sd of 3 degrees); thus, the driven neurons common to both columns benefit the most from it.
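The selectivity argument for the third model can be sketched numerically. The snippet below is an illustrative toy, not the network itself: it approximates the activity of the source column by a Gaussian bump around its stimulus orientation and asks how much orientation-selective cross-column excitation reaches the neurons that the other eye's stimulus drives. The 3 degree standard deviation of the cross-column excitation is taken from the text; the 15 degree width of the activity bump, the 180-neuron ring, and the function names are assumptions.

```python
import numpy as np

def circ_diff(a, b, period=180.0):
    """Signed orientation difference on a ring of the given period (degrees)."""
    return (a - b + period / 2.0) % period - period / 2.0

def gaussian(d, sd):
    return np.exp(-0.5 * (d / sd) ** 2)

def cross_excitation_at(theta_target, theta_source_stim,
                        sd_exc=3.0, sd_drive=15.0, n=180):
    """Weighted average of source-column activity seen by a target neuron.

    sd_exc   : orientation selectivity of cross-column excitation (from the text)
    sd_drive : width of the stimulus-evoked activity bump (assumed)
    """
    prefs = np.arange(n) * 180.0 / n
    source_activity = gaussian(circ_diff(prefs, theta_source_stim), sd_drive)
    weights = gaussian(circ_diff(prefs, theta_target), sd_exc)
    return float(np.sum(weights * source_activity) / np.sum(weights))

theta_L = 90.0
similar = cross_excitation_at(theta_target=theta_L + 2.0, theta_source_stim=theta_L)
orthogonal = cross_excitation_at(theta_target=theta_L + 90.0, theta_source_stim=theta_L)
print(f"cross-column excitation at the driven neurons: "
      f"similar {similar:.3f}, orthogonal {orthogonal:.2e}")
```

Under these assumptions the externally driven neurons receive nearly the full cross-column excitation when the gratings differ by 2 degrees and essentially none when they are orthogonal, which is the asymmetry the fusion and rivalry regimes rely on.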

3.4 Hysteretic transition between rivalry and fusion

In all three models, the transitions between fusion and rivalry are hysteretic, as they are in psychophysical experiments (Buckthought et al. 2008). To investigate the transitions between fusion and rivalry, we follow Wilson (2017) and very slowly (2 degrees per sec) change the difference in angle between the gratings stimulating the left and right eyes – first by slowly increasing this difference from identical angles (to study the transition from the fused state to a rivalry state), and then by slowly decreasing this difference from distinct angles (to study the transition from rivalry to fusion). The results for all three models are shown in Figs. 11 and 12, where, for each model, the average firing rates of the binocular layer are shown for both cases [“diverging” (increasing Δ𝜃) and “converging” (decreasing Δ𝜃)]. In all three models, the hysteretic response is clear, with the transition of the converging (Δ𝜃 decreasing) branch occurring at a smaller transition angle than that of the diverging branch. The transition values of Δ𝜃 (from fusion to rivalry and from rivalry to fusion) and the width in Δ𝜃 of the bi-stable region are in reasonable agreement with psychophysics (Buckthought et al. 2008). Note that in model three, the averaged firing rate for the binocular neurons is very similar in both the rivalry and fusion states. This is because cross-column inhibition is always present, even with similar stimuli which fuse. Thus, we find it clearest to use the averaged firing rates of the monocular neurons to identify the transition angles (see Fig. 12b & c).
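The sweep protocol and the way transition angles are read off can be sketched as follows. This is a toy stand-in, not the network simulation: the fused/rivalry state is tracked with a simple two-threshold rule rather than with spiking neurons. The 2 degrees per second sweep rate comes from the text; the maximum angle, the time step, the two threshold values (15 and 8 degrees), and all function names are hypothetical.

```python
import numpy as np

def sweep(rate_deg_per_s=2.0, max_deg=30.0, dt_s=0.01):
    """Diverging branch (0 -> max) followed by converging branch (max -> 0)."""
    n = int(round(max_deg / (rate_deg_per_s * dt_s)))
    up = np.linspace(0.0, max_deg, n + 1)
    return np.concatenate([up, up[::-1]])

def track_state(delta_theta, theta_up, theta_down):
    """Two-threshold (hysteretic) state rule: True means fused."""
    fused = True
    states = []
    for d in delta_theta:
        if fused and d > theta_up:
            fused = False          # fusion -> rivalry on the diverging branch
        elif not fused and d < theta_down:
            fused = True           # rivalry -> fusion on the converging branch
        states.append(fused)
    return np.array(states)

def transition_angles(delta_theta, states):
    """Angles at which the state changes, in order of occurrence."""
    idx = np.flatnonzero(states[1:] != states[:-1]) + 1
    return delta_theta[idx]

delta_theta = sweep()
states = track_state(delta_theta, theta_up=15.0, theta_down=8.0)  # hypothetical thresholds
angles = transition_angles(delta_theta, states)
print("fusion->rivalry at", angles[0], "deg; rivalry->fusion at", angles[1], "deg")
```

In an actual analysis the boolean state would be replaced by a thresholded firing-rate measure, and the gap between the two transition angles gives the width of the bi-stable region.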

4 Discussion

Summary

We have studied point neuron ring models of the front-end initiation of binocular rivalry, fusion, and the hysteretic transition between rivalry and fusion. It is generally accepted (see, e.g., Laing and Chow 2002; Wilson 2003; Shpiro et al. 2007; Moreno-Bote et al. 2007) that the mechanism underlying binocular rivalry is cross-column inhibition; thus, the issue is how this inhibition is overcome under similar stimulation to each eye, allowing fusion. Model 1 is a three-layer model that uses a layer of opponency neurons to shut off cross-column inhibition; Model 2 is a four-layer model that allows cross-column inhibition to be always present (by eliminating opponency neurons), and introduces an additional layer of monocular neurons as a parallel pathway projecting to the binocular neurons – with the two pathways working together to fuse similar images at the binocular layer; Model 3 is a two-layer model that introduces no additional layers of neurons, but includes cross-column selective excitation to overcome cross-column inhibition, and together the cross-column excitation and inhibition allow similar images to fuse.

Interpretation as logic gates

The first model (shown schematically in Fig. 1) realizes an “exclusive or” (XOR) circuitry through opponency neurons, as can be seen explicitly as follows: The layer of opponency neurons serves as logic gates, one gate for each pair of opponency neurons of identical orientation preference. Each gate has two input ports: i) the feedforward excitation & inhibition to the left & right opponency neurons from left monocular neurons, and ii) the feedforward excitation & inhibition to the right & left opponency neurons from right monocular neurons; and one output: the feedback inhibition on monocular neurons. This output is active only with active drive from either the left OR the right monocular neuron, but not both; it is inactive when both the left and right monocular neurons do not fire (no excitatory drive), or when both are active (feedforward cross-column inhibition on the opponency neurons cancelling same-column excitation).

Alternatively, the second model (shown schematically in Fig. 2) replaces the layer of opponency neurons (the XOR gate) with a second monocular layer B whose excitatory neurons serve as AND gates, one for each orientation preference, as can be seen explicitly as follows: The excitatory neurons in monocular layer B have two input ports: i) the excitation by the visual stimuli and ii) the feedback dis-inhibition from the monocular layer A; and an output: projection to the binocular summation layer. In this case, the output projection is active if and only if the excitation from the visual stimuli and the feedback dis-inhibition from monocular layer A are both active. In all other cases, the output will be inactive – either due to no stimulation or no dis-inhibition or the lack of both. This logic gate interpretation provides an intuitive description of the different mechanisms by which the two models overcome cross-column inhibition: Under similar stimulation, the XOR mechanism “shuts off” cross-column inhibition, while the AND mechanism “turns on” the second pathway from monocular layer B to the binocular layer.
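The two gate interpretations can be written out as truth tables. The sketch below is an idealization with binary inputs, not a reduction of the spiking models; the function names and the use of a rectified difference to stand for the excitation/inhibition cancellation in the opponency pair are illustrative assumptions.

```python
def rectify(x):
    """Opponency neurons cannot fire at negative rates."""
    return max(0.0, x)

def xor_feedback_inhibition(left_active, right_active):
    """Model 1: is cross-column feedback inhibition switched on?

    Each opponency neuron receives same-column excitation and cross-column
    inhibition; equal drive from both columns cancels.
    """
    left, right = float(left_active), float(right_active)
    left_opponency = rectify(left - right)
    right_opponency = rectify(right - left)
    return (left_opponency + right_opponency) > 0.0

def and_layer_b_output(stimulus_on, disinhibition_on):
    """Model 2: layer B projects only if it is both stimulated and dis-inhibited."""
    return bool(stimulus_on and disinhibition_on)

for a in (False, True):
    for b in (False, True):
        print(f"inputs ({a!s:5}, {b!s:5}): "
              f"XOR inhibition={xor_feedback_inhibition(a, b)!s:5} "
              f"AND output={and_layer_b_output(a, b)}")
```

Read this way, similar stimulation to both columns corresponds to the (True, True) row: the XOR output (feedback inhibition) is off, while the AND output (the layer B pathway) is on.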

From the perspective of dynamical systems

Each of the three models is a high-dimensional stochastic dynamical system of dimension 3 × N, where N is the total number of neurons, each described by three variables (voltage v and conductances g_E and g_I). The source of noise is the Poisson processes representing the visual (LGN) drive to the system. In all three models, the rivalry state consists of two meta-stable states of the dynamical system – representing the two (left and right) percepts. The long-time rivalry state approached by the dynamical system has random, noise-induced, temporal jumps between these two meta-stable states. Without noise, under distinct dichoptic visual stimulation, each model would have two stable states (one for each percept), each with its own “basin of attraction”. Which stable state is approached dynamically would depend upon the basin of attraction in which the initial condition resides.
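To make the three-variable description concrete, here is a minimal sketch of one conductance-based integrate-and-fire point neuron driven by a Poisson spike train. It is a generic textbook-style neuron, not the parameterization used in the models: the normalized voltage units, leak rate, reversal potentials, synaptic time constants, kick size, and input rate are all assumed values, adaptation is omitted, and there is no inhibitory input, so g_I stays at zero and appears only to show the state variables.

```python
import numpy as np

def simulate_neuron(rate_hz, t_end_s=0.5, dt_s=1e-4, seed=0):
    """Forward-Euler integration of (v, g_E, g_I); returns the spike count."""
    rng = np.random.default_rng(seed)
    g_leak = 50.0                           # leak rate in 1/s (assumed)
    v_exc, v_inh = 14.0 / 3.0, -2.0 / 3.0   # normalized reversal potentials (assumed)
    v_thresh, v_reset = 1.0, 0.0            # normalized threshold and reset (assumed)
    tau_e, tau_i = 0.005, 0.005             # conductance decay times in s (assumed)
    kick_e = 5.0                            # conductance jump per input spike (assumed)

    v, g_e, g_i = 0.0, 0.0, 0.0
    n_spikes = 0
    for _ in range(int(round(t_end_s / dt_s))):
        # Poisson external drive: number of input spikes in this time step.
        g_e += kick_e * rng.poisson(rate_hz * dt_s)
        dv = -g_leak * v - g_e * (v - v_exc) - g_i * (v - v_inh)
        v += dt_s * dv
        g_e -= dt_s * g_e / tau_e
        g_i -= dt_s * g_i / tau_i
        if v >= v_thresh:
            v = v_reset
            n_spikes += 1
    return n_spikes

spikes_driven = simulate_neuron(rate_hz=2000.0)
spikes_silent = simulate_neuron(rate_hz=0.0)
print("spikes with Poisson drive:", spikes_driven, "| without drive:", spikes_silent)
```

A network of N such neurons carries 3 × N state variables, and the only stochastic element in this sketch is the Poisson input, mirroring the description above.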

On the other hand, the dynamics of the binocular fusion of similarly-oriented gratings is very different in model 2 from that in models 1 and 3. In the first and third models, there is one dynamical state that represents the fused percept. This dynamical state is stable. In contrast, the second model has two distinct meta-stable states, which rival out of phase as is apparent in the raster plots of the two monocular layers in Fig. 8. Note however, these two states possess identical firing patterns for the layer of binocular summation neurons, representing the fused percept. We note that in all three models at low contrast orthogonal input, the rivalry state is lost to a fused state (not shown). This is consistent with the psychophysical observation (Liu et al. 1992) that rivalry is lost to fusion near the detection threshold.

Possible experimental tests of the model mechanisms

Could differences in the response properties of the three models be detected experimentally, and thus identify which mechanism is present in the real cortex? The three models have very similar response properties in the rivalry regime; that is, when stimulated by distinct visual patterns such as orthogonal gratings. However, there are differences in how the models fuse similar visual patterns, and how the models transition between fusion and rivalry. These differences are not apparent in the response properties of the binocular summation neurons; however, there are differences in the response properties of the models’ (left vs right) monocular neurons. In the first and third models, the monocular neurons do not rival in the fused state (Figs. 6, 10). In the second model with two monocular layers, the monocular neurons always rival in the fused state, with the rivalry in the two monocular layers out of phase with each other (Fig. 8). This out-of-phase rivalry might be measurable experimentally. (There is an fMRI experiment (Xu et al. 2016) showing that large patches of V1 cortex do not rival during fusion, although “monocular layer B” could be below the resolution of fMRI.) Distinguishing between the mechanisms that underlie fusion in the first and third models seems more difficult. One distinction is that, in the first model, fusion is allowed by shutting off cross-column feedback inhibition; while in the third model, cross-column inhibition is always present, but overcome by cross-column excitation in the fusion regime. This distinction with respect to inhibition might be measurable, perhaps by combining pharmacology with electrophysiology.

Relationship to other work

Although our models have similarities to earlier models in the literature, we believe that our work is the first to study three distinct mechanisms to overcome the cross-column inhibition underlying rivalry; thus, to construct explicitly three distinct computational models that combine fusion, rivalry and the transition between. While our point neuron ring models are idealized, they employ the same components as realistic large-scale models of V1 (conductance-based integrate-and-fire point neurons, with adaptation; excitatory, selective for orientation preference, long-range connections, mediated at the time scales of NMDA type receptors; inhibition, mediated at the time scales of GABA_A type receptors). The one-dimensional ring architecture is the primary idealization in our models. Most of the earlier models addressed rivalry, but not fusion. While mean field firing rate models of rivalry were most common, some previous studies have employed point neuron models (e.g., Wilson 2003; Moreno-Bote et al. 2007), including point neuron models with ring structures (e.g., Laing and Chow 2002; Cohen et al. 2019). The work of Laing and Chow (2002) provides a rather complete study of the rivalry between Hodgkin-Huxley point neurons driven by two gratings with distinct orientations; however, they do not model ocular dominance columns, and do not study fusion. The work of Cohen et al. (2019) addresses the issue of operating states of idealized balanced cortical networks displaying rivalry – important because large-scale comprehensive neuronal networks mostly operate in a balanced state. While we have not focused on the operating points of our ring models, we do note that the monocular layers exhibit asynchronous firing, one of the most important properties of balanced networks. Said and Heeger (2013) and Li et al. (2017) used mean firing rate models with divisive normalization to realize opponency neurons that they introduced to rival stimuli by distinctly oriented gratings, while fusing identical plaids. Recently, Wilson (2017) developed a rather complete rate model to capture rivalry, fusion of plaids, and the hysteretic transition between fusion and rivalry within a single model with XOR circuitry. That model’s realization of fusion is somewhat different from our first model – in that the inhibitory neurons mediating rivalry in Wilson’s model target only orientations away from the orientation preference of the projecting neuron. This causes rivalry to cease under similar stimulations, and allows fusion. In our first model, the inhibitory neurons mediating mutual inhibition target all orientations, and fusion is achieved by using similar stimulation to “shut off” that inhibition.

Remark on “perceived contrast”

In this work, we have not studied contrast perception during monocular and binocular viewing. Nonetheless, model 3 has an interesting response property that is related to “perceived contrast”; namely, the binocular neurons’ firing rates (interpreted as “perceived contrast”) have similar values in the fused state and rivalrous state under the same input contrast (Fig. 12), similar to what experiments have found during binocular and monocular viewing (Legge and Rubin 1981; Wilson 2017). This is in contrast with models 1 and 2, whose binocular response in the rivalry state is much less than that in the fused state (Fig. 11). This one special case (with equal contrast to each eye) hints at the possibility that model 3 might have realistic contrast perception. In this case, we note that in model 3 the balance between cross-column excitation and inhibition itself normalizes the binocular responses, instead of requiring an additional global inhibition such as the divisive normalization that is shown to produce realistic contrast perception in the firing rate model of Wilson (2017).

An AND pathway may be necessary

There was a discussion in the literature (Wolfe 1986; Blake and O’Shea 1988; Blake 1989; Wolfe 1988) about whether AND or XOR is the more likely biological mechanism. Although Wolfe argued for the AND pathway, later experiments and computational studies focused almost solely on the XOR theory. Psychophysics experiments (Blake and Boothroyd 1985) were key to the debate, with Blake and O’Shea (1988) arguing that the AND theory would have difficulty explaining them. Here, by comparing the performance of our first and second models on one variant of these experiments, we re-examine the almost forgotten, but possibly valid, AND mechanism of binocular vision (see the extensive review by Wolfe 1986).

The psychophysical finding in Blake and Boothroyd (1985) may be briefly summarized: a vertical grating is presented to each eye with an additional horizontal grating presented to only one of the two eyes. The percept is a stable plaid. As the contrast of each of the three gratings is separately and abruptly decreased to a very low contrast, no difference is observed in the ability to detect the abrupt contrast reduction of each one of the three gratings, and detection is comparable to that under fusion (when only the two vertical gratings are present). The result leads to the conclusion that the horizontal grating is not suppressed under this stimulus setup – for if it were suppressed, say by rivalry with the other eye’s vertical grating, its contrast reduction would be more difficult to detect. Under the AND conceptual theory, the horizontal grating could only activate the rivalry pathway (having no similar stimuli in the other eye) but not the fusion pathway, and thus the horizontal grating should be intermittently suppressed – leading to the conclusion that the AND theory would predict that it is more difficult to detect the contrast decrement of the horizontal grating. However, Wolfe argued that this is not the case: the horizontal grating would not be suppressed because of a permanent dominance of the eye receiving both vertical and horizontal stimulation (Wolfe 1988).

When presenting the stimuli described above to our first two models, we found that both models realize Wolfe’s explanation – the eye receiving both horizontal and vertical gratings remains constantly dominant (Fig. 13). This supports Wolfe’s argument that the eye with the horizontal grating can always be dominant. Nevertheless, the two models do have some differences in response to these stimuli: (i) In the first (XOR) model, this constant dominance nullifies fusion of the two vertical gratings. (See Fig. 13a&b.) (The cross-column feedback inhibition originating from the eye stimulated with both vertical and horizontal gratings is too strong for it to be shut off by the other eye that is driven only by the single vertical grating. Thus, in model 1 the monocular neurons driven by that eye’s single vertical grating are soon inactive.) We note that model 3 performs similarly to model 1 for these visual stimuli. (ii) In the second (AND) model, the pathway for the eye with a single vertical grating remains dis-inhibited due to the constant dominance of the other eye, allowing fusion of the vertical stimuli from the two eyes. (See Fig. 13c&d.) Thus, in the second model, all three stimuli are “perceived”, while in the first and third models, only the vertical-horizontal (plaid) stimulus from the one eye is “perceived”, and the sole vertical stimulus at the other eye is not. Model 2 would predict no difference in the ability to detect an abrupt decrease in the contrast of any one of the three grating stimuli; on the other hand, models 1 and 3 would not be able to detect the sole vertical grating. This does not account for the full list of experiments and interpretations discussed in O’Shea (1987); however, it does suggest that the XOR mechanism does not provide a complete view of binocular vision, and that some form of a parallel pathway mechanism might be needed in the visual system.

Major assumptions in our models

First, in all three models, all cross-column projections are assumed to be excitatory. Since these cross-column projections must be at least moderately long range, we view it as more realistic to model them as excitatory, selectively targeting NMDA receptors on local inhibitory neurons with similar orientation preferences. These assumptions have a basis in experiments. Long distance excitatory projections are known to target neurons of similar orientation preference (Bosking et al. 1997), and are known to target NMDA (as well as AMPA) receptors. Moreover, NMDA type receptors do occur on inhibitory interneurons in the cerebral cortex (see, e.g., Jones and Bühl 1993; Maccaferri and Dingledine 2002; Homayoun and Moghaddam 2007; Wang and Gao 2009), and in visual cortex (see, e.g., Huntley et al. 1994; Wong-Riley et al. 1998). And there is some evidence (Homayoun and Moghaddam 2007) that the NMDA receptors on inhibition can dominate those on excitation – an additional assumption necessary in our models. Also, all three models require that the local inhibition itself acts selectively in some sub-populations and broadly at all angles of orientation preference in other sub-populations. While there is evidence supporting these assumptions, taken together they do constitute rather strong theoretical assumptions underlying the models. Alternatively, one could represent cross-column projections as inhibitory, with projections of moderate spatial extent that activate slow time scale inhibitory receptors. As the existence of long range, orientation selective, excitatory projections is more certain, we chose to represent the cross-column projections as excitatory.

Shortcomings of our models

One notable weakness of all three models is that they fail to capture the psychophysical observation that decreasing the angle between the two nearly orthogonal gratings will slightly increase the mean dominance durations of rivalry (Andrews and Purves 1997). (Note the model of Laing and Chow 2002 captures this effect.) In our models, the strengths of mutual inhibition are equal for all neurons. In order to capture this observation, the models would need to have a fall-off in inhibition in terms of the orientation difference of the neurons, but a broad fall-off because the neurons oriented orthogonally must receive enough mutual inhibition to be suppressed and therefore produce rivalry.

Conclusion

Of the three models, we believe that the mechanism of the third model is most likely to emerge in large-scale comprehensive models; and hence, we believe that it is the most likely to underlie rivalry and fusion in the real visual cortex. The third model does not rely on additional layers of neurons with additional architectures (as do Models 1 and 2). Rather, it simply relies on a balance between cross-column inhibition and cross-column excitation – a balance which changes depending upon the similarity or distinctness of the visual stimuli.

References

Andrews, T.J., & Purves, D. (1997). Similarities in normal and binocularly rivalrous viewing. Proceedings of the National Academy of Sciences, 94(18), 9905–9908.
Blake, R. (1989). A neural theory of binocular rivalry. Psychological Review, 96(1), 145.
Blake, R., & Boothroyd, K. (1985). The precedence of binocular fusion over binocular rivalry. Perception & Psychophysics, 37(2), 114–124.
Blake, R., & Fox, R. (1974). Adaptation to invisible gratings and the site of binocular rivalry suppression. Nature, 249(5456), 488.
Blake, R., & Lema, S.A. (1978). Inhibitory effect of binocular rivalry suppression is independent of orientation. Vision Research, 18(5), 541–544.
Blake, R., & O’Shea, R.P. (1988). “Abnormal fusion” of stereopsis and binocular rivalry. Psychological Review, 95, 151–154.
Bosking, W.H., Zhang, Y., Schofield, B., & Fitzpatrick, D. (1997). Orientation selectivity and the arrangement of horizontal connections in tree shrew striate cortex. Journal of Neuroscience, 17(6), 2112–2127.
Brascamp, J.W., & Blake, R. (2012). Inattention abolishes binocular rivalry: perceptual evidence. Psychological Science, 23(10), 1159–1167.
Brown, R.J., & Norcia, A.M. (1997). A method for investigating binocular rivalry in real-time with the steady-state VEP. Vision Research, 37(17), 2401–2408.
Buckthought, A., Kim, J., & Wilson, H.R. (2008). Hysteresis effects in stereopsis and binocular rivalry. Vision Research, 48(6), 819–830.
Cai, D., Rangan, A.V., & McLaughlin, D.W. (2005). Architectural and synaptic mechanisms underlying coherent spontaneous activity in V1. Proceedings of the National Academy of Sciences, 102(16), 5868–5873.
Cavanagh, P., & Holcombe, A.O. (2006). Successive rivalry does not occur without attention. Journal of Vision, 6(6), 818–818.
Chariker, L., Shapley, R., & Young, L.S. (2016). Orientation selectivity from very sparse LGN inputs in a comprehensive model of macaque V1 cortex. Journal of Neuroscience, 36(49), 12368–12384.
Cohen, B.P., Chow, C.C., & Vattikuti, S. (2019). Dynamical modeling of multi-scale variability in neuronal competition. Communications Biology, 2(1), 1–11.
Della Porta, G. (1593). De refractione optices parte: libri novem... Ex officina Horatii Salviani, apud Jo. Jacobum Carlinum, & Antonium Pacem.
Dieter, K.C., Brascamp, J., Tadin, D., & Blake, R. (2016). Does visual attention drive the dynamics of bistable perception? Attention, Perception, & Psychophysics, 78(7), 1861–1873.
Dutour, E.F. (1760). Discussion d’une question d’optique [discussion on a question of optics]. Mémoires de Mathématique et de physique présentés par Divers Savants, 3, 514–530.
Gail, A., Brinksmeyer, H.J., & Eckhorn, R. (2004). Perception-related modulations of local field potential power and coherence in primary visual cortex of awake monkey during binocular rivalry. Cerebral Cortex, 14(3), 300–313.
Homayoun, H., & Moghaddam, B. (2007). NMDA receptor hypofunction produces opposite effects on prefrontal cortex interneurons and pyramidal neurons. Journal of Neuroscience, 27(43), 11496–11500.
Huntley, G.W., Vickers, J., Janssen, W., Brose, N., Heinemann, S., & Morrison, J. (1994). Distribution and synaptic localization of immunocytochemically identified NMDA receptor subunit proteins in sensory-motor and visual cortices of monkey and human. Journal of Neuroscience, 14(6), 3603–3619.
Jones, R., & Bühl, E. (1993). Basket-like interneurones in layer II of the entorhinal cortex exhibit a powerful NMDA-mediated synaptic excitation. Neuroscience Letters, 149(1), 35–39.
Kaufman, L., & Arditi, A. (1976). The fusion illusion. Vision Research, 16(5), 535–543.
Laing, C.R., & Chow, C.C. (2002). A spiking neuron model for binocular rivalry. Journal of Computational Neuroscience, 12(1), 39–53.
Legge, G.E., & Rubin, G.S. (1981). Binocular interactions in suprathreshold contrast perception. Perception & Psychophysics, 30(1), 49–61.
Leopold, D.A., & Logothetis, N.K. (1996). Activity changes in early visual cortex reflect monkeys’ percepts during binocular rivalry. Nature, 379(6565), 549.
Levelt, W.J. (1965). On binocular rivalry. PhD thesis, Van Gorcum Assen.
Li, H.H., Rankin, J., Rinzel, J., Carrasco, M., & Heeger, D.J. (2017). Attention model of binocular rivalry. Proceedings of the National Academy of Sciences, 114(30), E6192–E6201.
Ling, S., & Blake, R. (2012). Normalization regulates competition for visual awareness. Neuron, 75(3), 531–540.
Liu, L., Tyler, C.W., & Schor, C.M. (1992). Failure of rivalry at low contrast: evidence of a suprathreshold binocular summation process. Vision Research, 32(8), 1471–1479.
Maccaferri, G., & Dingledine, R. (2002). Control of feedforward dendritic inhibition by NMDA receptor-dependent spike timing in hippocampal interneurons. Journal of Neuroscience, 22(13), 5462–5472.
MATLAB. (2018). Version 9.4.0 (R2018a). Natick: The MathWorks Inc.
Moreno-Bote, R., Rinzel, J., & Rubin, N. (2007). Noise-induced alternations in an attractor network model of perceptual bistability. Journal of Neurophysiology, 98(3), 1125–1139.
Nelson, J.I. (1975). Globality and stereoscopic fusion in binocular vision. Journal of Theoretical Biology, 49(1), 1–88.
O’Shea, R.P. (1987). Chronometric analysis supports fusion rather than suppression theory of binocular vision. Vision Research, 27(5), 781–791.
O’Shea, R.P., & Crassini, B. (1981). The sensitivity of binocular rivalry suppression to changes in orientation assessed by reaction-time and forced-choice techniques. Perception, 10(3), 283–293.
Polonsky, A., Blake, R., Braun, J., & Heeger, D.J. (2000). Neuronal activity in human primary visual cortex correlates with perception during binocular rivalry. Nature Neuroscience, 3(11), 1153.
Said, C.P., & Heeger, D.J. (2013). A model of binocular rivalry and cross-orientation suppression. PLoS Computational Biology, 9(3), e1002991.
Shpiro, A., Curtu, R., Rinzel, J., & Rubin, N. (2007). Dynamical characteristics common to neuronal competition models. Journal of Neurophysiology, 97(1), 462–473.
Stimberg, M., Goodman, D.F., Benichoux, V., & Brette, R. (2013). Brian 2-the second coming: spiking neural network simulation in Python with code generation. BMC Neuroscience, 14(1), P38.
Tong, F., & Engel, S.A. (2001). Interocular rivalry revealed in the human cortical blind-spot representation. Nature, 411(6834), 195.
Tong, F., Meng, M., & Blake, R. (2006). Neural bases of binocular rivalry. Trends in Cognitive Sciences, 10(11), 502–511.
Wade, N.J. (1998). A natural history of vision. MIT press.
Wang, H.X., & Gao, W.J. (2009). Cell type-specific development of NMDA receptors in the interneurons of rat prefrontal cortex. Neuropsychopharmacology, 34(8), 2028.
Wilson, H.R. (2003). Computational evidence for a rivalry hierarchy in vision. Proceedings of the National Academy of Sciences, 100(24), 14499–14503.
Wilson, H.R. (2017). Binocular contrast, stereopsis, and rivalry: toward a dynamical synthesis. Vision Research, 140, 89–95.
Wolfe, J.M. (1986). Stereopsis and binocular rivalry. Psychological Review, 93(3), 269.
Wolfe, J.M. (1988). Parallel ideas about stereopsis and binocular rivalry: a reply to Blake and O’Shea (1988). Psychological Review.
Wong-Riley, M., Anderson, B., Liebl, W., & Huang, Z. (1998). Neurochemical organization of the macaque striate cortex: correlation of cytochrome oxidase with Na+ K+ ATPase, NADPH-diaphorase, nitric oxide synthase, and N-methyl-D-aspartate receptor subunit 1. Neuroscience, 83(4), 1025–1045.
Xu, H., Han, C., Chen, M., Li, P., Zhu, S., Fang, Y., Hu, J., Ma, H., & Lu, H.D. (2016). Rivalry-like neural activity in primary visual cortex in anesthetized monkeys. Journal of Neuroscience, 36(11), 3231–3242.
Zhang, P., Jamison, K., Engel, S., He, B., & He, S. (2011). Binocular rivalry requires visual attention. Neuron, 71(2), 362–369.
Zhou, D., Rangan, A.V., McLaughlin, D.W., & Cai, D. (2013). Spatiotemporal dynamics of neuronal population response in the primary visual cortex. Proceedings of the National Academy of Sciences, 110(23), 9517–9522.

Acknowledgements

We thank David Heeger and John Rinzel for very helpful and informative discussions.

Author information

Authors and Affiliations

Integrated Program in Neuroscience, McGill University, 3801 Rue Université, Montréal, QC, H3A 2B4, Canada
Ziqi Wang
New York University – Courant Institute of Mathematical Science, 251 Mercer Street, New York, NY, 10012, USA
Wei Dai & David W. McLaughlin
New York University – Tandon School of Engineering, 6 Metro Tech Center, Brooklyn, NY, 11201, USA
David W. McLaughlin
New York University Shanghai, 1555 Century Ave, Pudong, Shanghai, 200122, China
David W. McLaughlin
Neuroscience Institute at NYU Langone Medical Center, Science Building, 435 East 30th Street, New York, NY, 10016, USA
David W. McLaughlin

Corresponding author

Correspondence to David W. McLaughlin.

Ethics declarations

Conflict of interests

The authors have no conflict of interests.

Additional information

Action Editor: A. Borst

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Wang, Z., Dai, W. & McLaughlin, D.W. Ring models of binocular rivalry and fusion. J Comput Neurosci 48, 193–211 (2020). https://doi.org/10.1007/s10827-020-00744-7

Received: 05 August 2019
Revised: 19 March 2020
Accepted: 24 March 2020
Published: 03 May 2020
Issue Date: May 2020
DOI: https://doi.org/10.1007/s10827-020-00744-7
