A Geometric Model of Multi-scale Orientation Preference Maps via Gabor Functions

Baspinar, Emre; Citti, Giovanna; Sarti, Alessandro

doi:10.1007/s10851-018-0803-3

A Geometric Model of Multi-scale Orientation Preference Maps via Gabor Functions

Published: 07 April 2018

Volume 60, pages 900–912, (2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Mathematical Imaging and Vision Aims and scope Submit manuscript

A Geometric Model of Multi-scale Orientation Preference Maps via Gabor Functions

Download PDF

Emre Baspinar^1,2,
Giovanna Citti^1,2 &
Alessandro Sarti^1,2

367 Accesses
15 Citations
1 Altmetric
Explore all metrics

Abstract

In this paper we present a new model for the generation of orientation preference maps in the primary visual cortex (V1), considering both orientation and scale features. First we undertake to model the functional architecture of V1 by interpreting it as a principal fiber bundle over the 2-dimensional retinal plane by introducing intrinsic variables orientation and scale. The intrinsic variables constitute a fiber on each point of the retinal plane and the set of receptive profiles of simple cells is located on the fiber. Each receptive profile on the fiber is mathematically interpreted as a rotated Gabor function derived from an uncertainty principle. The visual stimulus is lifted in a 4-dimensional space, characterized by coordinate variables, position, orientation and scale, through a linear filtering of the stimulus with Gabor functions. Orientation preference maps are then obtained by mapping the orientation value found from the lifting of a noise stimulus onto the 2-dimensional retinal plane. This corresponds to a Bargmann transform in the reducible representation of the $\text {SE}(2)=\mathbb {R}^2\times S^1$ group. A comparison will be provided with a previous model based on the Bargmann transform in the irreducible representation of the $\text {SE}(2)$ group, outlining that the new model is more physiologically motivated. Then, we present simulation results related to the construction of the orientation preference map by using Gabor filters with different scales and compare those results to the relevant neurophysiological findings in the literature.

A sub-Riemannian model of the visual cortex with frequency and phase

Article Open access 29 July 2020

Monochromaticity of Orientation Maps in V1 Implies Minimum Variance for Hypercolumn Size

Article Open access 08 April 2015

Orientation Maps in V1 and Non-Euclidean Geometry

Article Open access 17 June 2015

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

As it is well known the simple cells in the primary visual cortex are organized in structures called orientation preference maps. This special organization has been studied with geometric instruments starting by the work of Petitot and Tondut [29]. In that study the orientation maps were obtained as the superposition of randomly weighted orientation fields corresponding to all possible orientation angles around the pinwheels (see the geometric explanations of Petitot [27] related to the pinwheels). A different model always based only on orientation was introduced by Barbieri et al. [2, 3]. In that paper the orientation preference structure was recovered starting from the observation that its Fourier transform is concentrated on an annulus. This model as the previous recalled one is based on properties apparently independent of the other aspects of the cortical models. Additionally to those studies, the models, in terms of its cortical orientation and orientation-frequency selectivity, which were provided by Bressloff and Cowan [7, 9], and the model proposed for the cortical spatiotemporal selective behavior by Barbieri et al. [1] could be useful references for the reader.

In this article we present a new model for the generation of the orientation preference maps, considering both orientation and scale features. Hypercolumns of the simple cell receptive profiles are the fundamental units of the set of receptive profiles. They build a 2-dimensional subgroup of rotation–dilation at each point (x, y) of the retinal plane $M\subset \mathbb {R}^2$. In other words the base variables are the spatial components $(x,y)\in M$ and the intrinsic variables are orientation and scale parameters $(\theta , \sigma )\in [0, \pi )\times \mathbb {R}^+$. Having two intrinsic variables in hand, we can either fix scale and obtain the orientation map of the simple cells or we can employ a range of scale values and obtain a multi-scale orientation map. In this way the model integrates several visual features observed in neurophysiology, psychophysics and neuroimaging experiments and provides a more precise orientation map.

The main novelty of our approach is that the orientation map description is strongly related to the functionality of the cortex and the simple cell responses in presence of a visual stimulus. Indeed we start with a random stimulus I on the retinal plane, and obtain the responses of the cells through a linear filtering with translated, rotated and dilated Gabor functions. Finally we employ integration of the output over fiber and maximum selection in order to select the prevalent orientation and scale. This whole procedure starting with obtaining the simple cell responses and ending with application of the integration over fiber and the maximum selection over the full set of receptive profiles is called lifting. Consequently we propose to obtain the orientation maps by employing a lifting of noise stimulus through the functional structure of the cortex. We will outline that this corresponds to a Bargmann transform [4] in the reducible representation of the $\text {SE}(2)$ group, which is different than the case in [2] where Barbieri et al. considered the irreducible representation. Hence our model is neural based.

The theoretical criterion underpinning the modeling we propose in this paper relies on the so-called neurogeometrical approach described by Citti and Sarti [11], Petitot and Tondut [29], Sarti et al. [32]. Following this approach, processing capabilities of sensorial cortices and particularly of the visual cortex are modeled based on the geometrical structure of neural connectivity. Global and local symmetries of the visual stimuli are inherited by the cortical structure that presents their invariances (see Sanguinetti et al. [30]). Then the structure is defined on group of invariances that are also spaces, meaning Lie groups. Particularly the simple cells are sensitive to local position and orientation features of stimuli, which are elements of the roto-translation group $\text {SE}(2)$. The corresponding Lie algebra and its integral curves model neural connectivity between cells. Moreover, since the algebra is not commutative, it is possible to pose an uncertainty principle, whose minimization gives rise to the shape of receptive profiles of the simple cells. The model has been extended to other variables such as scale by Sarti et al. [32], and to other cell types such as complex cells sensitive to movement by Barbieri et al. [1] and Cocci et al. [13]. Furthermore some image processing applications employing scale and orientation information extracted from a given image via a suitable wavelet transform were proposed in [35] by Sharma and Duits. In [12, 31], a neurogeometrical field theory has been introduced by Sarti and Citti to model connectivity between different cortices and it has been shown that harmonic analysis on the neurogeometry excited by the stimulus accounts for the constitution of perceptual units, while in [34] semiotic forms have been obtained through the same principle by Sarti and Piotrowski.

The orientation maps of V1 have been introduced in [2] by Barbieri et al. as a Bargmann transform in the irreducible representation of $\text {SE}(2)$, while in the present article here a model of the orientation maps is proposed in terms of the reducible representation, that is more neurophysiologically plausible. Then all the principal morphologies present in the visual cortex are modeled in a compact way in the neurogeometrical framework.

As a general consideration about the choice of the receptive profile model, let us recall that receptive field models consisting of cascades of linear filters and static nonlinearities may be adequate to account for responses to simple stimuli such as gratings and random checkerboards, but their predictions of responses to complex stimuli such as natural scenes are only approximately correct. A variety of mechanisms such as response normalization, gain controls, cross-orientation suppression, intra-cortical modulation can intervene to change radically the shape of the profile. Then any static and linear model for the receptive profiles has to be considered just as a very first approximation of the complex behavior of a real dynamic receptive profile, which is not perfectly described by any of the static wavelet frames.

For example derivatives or difference of Gaussian functions are very good approximations of the behavior of classical receptive profiles of the simple cells. In [26], Lindeberg shows that the functions of Gaussian derivatives obtained from the extension of the family of rotationally symmetric Gaussian kernels to the family of affine Gaussian kernels are able to model closedness under affine image transformations. This property is important in the case of image deformations, in particular perspective deformations, which are essential in 3D perception. Thanks to the closedness under affine transformations the Gaussian derivatives are able to handle the perspective deformations (see the explanations of Lindeberg in [26] for details). On the other hand, the Gabor functions which we use in our model are for 2D perception and they are restricted to the similarity group.

Lindeberg shows in [25, Section 4] that an offset term appears throughout the axiomatic derivation of the Gaussian derivatives as receptive field models. This offset term can be used for compensating spatial shifts (translations) and time delays of the kernels of the Gaussian derivatives in order to avoid spatial disparities and provide time causality of the kernels. In the case of the Gabor functions here, we are not interested in the temporal behavior of the simple cell receptive profiles and differently from the case of Lindeberg [25], the offset term corresponds to a phase shift of the Gabor function. It changes the shape of the Gabor function by introducing the phase shift to the wave content of the function (see Fig. 1 where we present an example of a phase shifted Gabor function). Therefore the Gabor functions are able to model both asymmetric simple cells and even/odd symmetric simple cells while the functions of the Gaussian derivatives account only for the symmetric simple cells. On the other hand it is true that the majority of the receptive profiles of the simple cells in the primary visual cortex are even and odd symmetric, and it is an open issue to evaluate the importance to discard a minority of the asymmetric profiles.

In the specific model which we propose in this article, we have used only the Gabor filters without any phase shift. In this case the Gabors can be easily replaced with the derivatives of Gaussians, without loss of generality. However the choice we made based on the Gabor functions allows to extend the model to the true distribution of the profiles in the primary visual cortex (including the asymmetric receptive profiles with the phase shifts) in a straightforward way. In the works of Koenderink [22, 23] and Lindeberg [26] the reader can find more information about some other models as well, which employ alternative choices of the receptive profiles in terms of the Gaussian derivatives.

We test the model at different scales, in order to represent properties of orientation maps in different cortical areas where the scale of the receptive profile changes. Our simulation results are compared with neural experimental results. A comparison will be provided with a previous model based on the Bargmann transform in the irreducible representation of the $\text {SE}(2)$ group, outlining that the new model is more physiologically motivated. Moreover we remark that it is possible to extend the model in order that additional visual features such as frequency and phase are taken into account.

In Sect. 2 we explain the receptive profiles of the simple cells and describe the group structure with its geometrical properties. Then we give explicitly the procedure of the construction of cortical map in Sect. 3. Afterward, in Sect. 4, we show that the Gabor functions are minimizers of an uncertainty principle and the filtering with the Gabors can be interpreted as a Bargmann transform in the reducible representations. Then we provide the relation of the Bargmann transform to the orientation map construction procedure and we compare it to another method using the Bargmann transform with the Gabor functions in the irreducible representations on the Fourier domain. Finally, in Sect. 6 we present the simulation results of the model and compare them to the experimental results given in the literature.

2 Receptive Profiles of Simple Cells

2.1 Receptive Fields and Receptive Profiles

The simple cells of visual areas evoke impulse responses to stimulus applied on the retinal plane $M\subset \mathbb {R}^2$. Every simple cell is identified by its receptive field (RF) which is defined as the domain of the retina to which the cell is sensitive and connected through the retino-geniculo-cortical paths. Once a RF is stimulated it evokes a spike response.

In classical sense a RF contains on and off regions, i.e., positive and negative contrast regions, respectively. The decomposition of RF into those regions depends on the nature of the cell response given to light and dark luminance Dirac stimulations. The response is realized by the simple cell receptive profile. Receptive profile (RP) of a simple cell is defined on RF and it is simply the impulse response of the cell. Conceptually it is the measurement of the response of the corresponding cell to a stimulus at some point $(x,y)\in M$. We denote the RP at the retinal position $(x,y)\in M$ with orientation $\theta \in [0,\pi )$ and scale $\sigma \in \mathbb {R}^+$ by $\varPsi _{(x,y,\theta ,\sigma )}:M\times [0, \pi )\times \mathbb {R}^+\rightarrow \mathbb {C}$. The simple cells of the primary visual cortex are strongly oriented and they are sensitive to several visual features, in particular to orientation and scale. Their RPs are often interpreted as Gabor functions [19] since the Gabor functions are mathematically convenient for encoding such features as Daugman [15] explained based on a minimum uncertainty condition. In the literature other models employing alternative choices of RPs in terms of Gaussian derivatives were proposed as well, following the works of Koenderink [22, 23] where he pointed out the resemblance between the simple cell receptive profiles and the kernels in terms of the Gaussian derivatives. The reader can refer to Lindeberg [26] where he proposed a family of functions in terms of the Gaussian derivatives as a natural choice of the simple cell receptive profile with respect to certain symmetry properties.

2.2 The Set of Receptive Profiles

Once the retinal layer is activated by some visual stimulus $I(x_0,y_0)\in \mathbb {R}$, at the point $(x_0,y_0)\in M$ the simple cells process the retinal stimulus through their RPs which are denoted by $\varPsi _{(x_0,y_0)}$ where the subindex refers to the corresponding spatial position on M at which $\varPsi $ is centered. Each RP at the point $(x_0,y_0)$ is dependent on a preferred orientation $\theta $ and a scale $\sigma \in \mathbb {R}^+$ (see Figs. 2 and 3). The set of RPs is obtained through the translation to the point $(x_0,y_0)$ and the rotation by $\theta $, i.e.,

$$\begin{aligned} \begin{aligned} T_{(x_0,y_0,\theta ,\sigma )}(\xi ,\eta )=&\begin{pmatrix} x_0 \\ y_0 \end{pmatrix}+\text {e}^{\sigma }\begin{pmatrix} \cos (\theta ) &{} \;-\sin (\theta )\\ \sin (\theta ) &{} \;\cos (\theta ) \end{pmatrix} \begin{pmatrix} \xi \\ \eta \end{pmatrix} \\ =&(x,y), \end{aligned} \end{aligned}$$

(1)

applied on the Gabor mother function

$$\begin{aligned} \varPsi _0(\xi ,\eta )=\text {e}^{-(\xi ^2+\eta ^2)}\text {e}^{i2\eta }. \end{aligned}$$

(2)

The general expression of the Gabor functions obtained from the mother function is given by

$$\begin{aligned} \varPsi _{(x_0,y_0,\theta ,\sigma )}(x,y)=\varPsi _0(T^{-1}_{(x_0,y_0,\theta ,\sigma )}(x,y)). \end{aligned}$$

(3)

Note that we find the transformation law of the group

$$\begin{aligned} G\simeq \{T_{(x_0,y_0,\theta _0,\sigma _0)}: (x_0,y_0,\theta ,\sigma )\in \mathbb {R}^2\times [0,\pi )\times \mathbb {R}^+ \}, \end{aligned}$$

(4)

by applying the coordinate transform given by (1) successively as follows:

$$\begin{aligned} \begin{aligned} \,&T_{(x_1, y_1, \theta _1,\sigma _1)} T_{(x_0, y_0,\theta , \sigma )}(\xi ,\eta )\\ =\,&\begin{pmatrix} x_1 \\ y_1 \end{pmatrix}+\text {e}^{\sigma _1} R_{\theta _1}\begin{pmatrix} x_0 \\ y_0 \end{pmatrix}+\text {e}^{\sigma _1+\sigma }R_{\theta _1+\theta }\begin{pmatrix} \xi \\ \eta \end{pmatrix}\\ =\,&T_{(x_2, y_2, \theta _2, \sigma _2)}(\xi ,\eta ), \end{aligned} \end{aligned}$$

(5)

where $R_{\theta }$ represents the rotation matrix and

$$\begin{aligned} \begin{aligned} \sigma _2\, =\,&\sigma _1+\sigma ,\quad \theta _2=\theta _1+\theta ,\\ \begin{pmatrix} x_2 \\ y_2 \end{pmatrix}=&\begin{pmatrix} x_1 \\ y_1 \end{pmatrix}+\text {e}^{\sigma _1}R_{\theta _1}\begin{pmatrix} x_0 \\ y_0 \end{pmatrix}. \end{aligned} \end{aligned}$$

(6)

We refer to the explanations provided by Sarti et al. in [32] for more details.

2.3 Functional Connectivity of the Cortex

The hypercolumns are endowed with internal isotropic short range connections which we specifically call vertical connections. The vertical connections do not provide the inter-hypercolumnar interactions and without such inter-connections, the hypercolumns located at different retinal points $(x_0,y_0)\in M$ would be isolated from each other. We know from the neurophysiological results (see the works of Bosking et al. [6], Das and Gilbert [14]) that there are long ranged, strongly anisotropic connections between the hypercolumns. This second type of connections within the primary visual cortex is called horizontal connections. The horizontal connections play the main role in inter-columnar information flow, i.e., contour integration and image inpainting.

Moreover Bosking et al. [6] observed that the horizontal connections link preferentially the simple cells at different spatial locations $(x_0,y_0)\in M$ but corresponding to the same orientation (approximately). In other words, the horizontal connections characterize the contour integration along the aligned curve fragments with approximately same orientations, respecting the saliency (see the saliency description of Wertheimer [39]) of the global structure obtained through the integration. Contour integration in a salient way is closely related to the existence of specific connectivities within the primary visual cortex, which are named as association fields by Field et al. [17]. The association fields confirm the anisotropic behavior of the horizontal connections in the psychophysical level.

In order to implement this functional connectivity we associate with each receptive profile $\varPsi _{(x_0,y_0,\theta ,\sigma )}$ the following 1-form

$$\begin{aligned} \omega _{(\theta ,\sigma )}=\text {e}^{-\sigma }(-\sin (\theta )\mathrm{d}x+\cos (\theta )\mathrm{d}y), \end{aligned}$$

(7)

where $\mathrm{d}x,\,\mathrm{d}y\in T^{*}M$ represent the covector fields dual to the vector fields $\partial _x,\partial _y\in TM$. The 1-form $\omega $ is the main instrument describing the orientation selectivity of a simple cell since it selects the direction along the vector field

$$\begin{aligned} X_3=\text {e}^{\sigma }(-\sin (\theta )\partial _x+\cos (\theta )\partial _y), \end{aligned}$$

(8)

and the vector $X_3\vert _{(x_0,y_0,\theta ,\sigma )}$ at point $(x_0,y_0)$ gives the image gradient at that point corresponding to the receptive profile $\varPsi _{(x_0,y_0,\theta ,\sigma )}$. The direction along $X_3$ is associated with the orientation angle which the simple cells at $(x_0, y_0)$ are sensitive to (see also Fig. 2). Furthermore, with the additional exponential $\text {e}^{-\sigma }$, the 1-form $\omega $ weights the contour fragment at $(x_0, y_0)$, lying orthogonal to $X_3$, in such a way that the fragment corresponding to the same scale as $\omega $ produces the highest simple cell response magnitude. In short $\omega _{(\theta , \sigma )}$ is the main instrument which renders both orientation and scale selectivity of the primary visual cortex simple cells.

Finally we find the horizontal left invariant vector fields as

$$\begin{aligned} {\text {ker}}\omega ={\text {span}}\{X_1, X_2, X_4 \}, \end{aligned}$$

(9)

where

$$\begin{aligned} X_1=\,&\text {e}^{\sigma }(\cos (\theta )\partial _x+\sin (\theta )\partial _y),\nonumber \\ X_2=\,&\partial _{\theta },\\ X_4=\,&\partial _{\sigma }.\nonumber \end{aligned}$$

(10)

Here we note that due to the fact that

$$\begin{aligned} \begin{array}{ll} &{}[X_1, X_2]= -X_3,\\ &{}[X_1, X_4]= -X_1, \end{array} \end{aligned}$$

(11)

the horizontal vector fields are non-commutative. Yet they span the whole tangent bundle together with their commutators, i.e.,

$$\begin{aligned} { TM}={\text {span}}\{X_1, X_2, X_4, [X_1,X_2]\}. \end{aligned}$$

(12)

That is, the horizontal vector fields given by (10) fulfill the Hörmander condition [20].

3 The Model of Multi-scale Orientation Maps

In this section we present our model of orientation cortical maps. As we explained in the introduction, we propose that cortical maps are obtained via a two step procedure: First the simple cells act on a random stimulus, and consequently maximally activated orientation and scale are selected, producing the cortical map.

The response given to a stimulus by a simple cell with the orientation preference $\theta $, the scale $\sigma $ and located at $(x_0, y_0)\in M$ is expressed by

$$\begin{aligned} \begin{aligned} O_{(\theta ,\sigma )}(x_0,y_0)=&\int \limits _M I(x,y)\varPsi _{(x_0,y_0,\theta ,\sigma )}(x,y)\,\mathrm{d}x\,\mathrm{d}y. \end{aligned} \end{aligned}$$

(13)

See Fig. 4 for a visualization of such outputs. For every retinal point $(x_0,y_0)$ a particular value of orientation is selected via the integration on the fiber:

$$\begin{aligned} \overline{\theta }(x_0, y_0)= \frac{1}{2}{\text {arg}}\Big (\int \limits _0^{\pi }{\text {Re}}\Big \{O_{(\theta ,\sigma )}(x_0,y_0)\Big \}\text {e}^{i\theta }\mathrm{d}\theta \Big ). \end{aligned}$$

(14)

We considered here only the real part of the output but alternative choices are possible, for example the energy or the imaginary part of the output. We refer to [33] for more information about such choices.

Lindeberg, in [24], describes a scale selection technique in terms of Gaussian derivatives normalized by scale. Basically the method finds extrema over scales corresponding to normalized receptive field responses by scale. A similar approach in our particular framework associated with Gabor functions is considered and the scale selectivity is provided by the maximum of the output at the point $(x_0,y_0)$ over the scale fiber at the selected value of $\overline{\theta }$:

$$\begin{aligned} \overline{\sigma }(x_0, y_0)= \underset{{\sigma \in \mathbb {R}^+}}{{\text {argmax}}}\Big ({\text {Re}}\Big \{O_{(\overline{\theta }, \sigma )}(x_0,y_0)\Big \}\Big ). \end{aligned}$$

(15)

Let us note that we employ the maximum selectivity given in (15), for selecting the scale value, as Sarti et al. did in [32], while we use the integration over fiber given in (14) in order to find the orientation preference over the fiber at the point $(x_0,y_0)\in M$. This procedure allows us to achieve a more robust orientation selectivity. Here we assume that generically there is a unique maximum, so that it is equivalent the order in which we select $\overline{\theta }$ and $\overline{\sigma }$. Note that the procedure described by (14) and (15) is done for every fixed point $(x_0, y_0)$ on the retinal plane and the selected orientations $\overline{\theta }(x_0, y_0)$ and scales $\overline{\sigma }(x_0,y_0)$ are represented at the corresponding fiber locations $(x_0, y_0)\in M$. In such a way we obtain the multi-scale orientation map $\overline{\theta }(x,y)$ that is represented in Fig. 11 with the same type of color map as in the classical case Fig. 6. The overall procedure for obtaining cortical maps is schematized in Fig. 5.

This procedure corresponds to the lifting of a general stimulus I(x, y) provided by simple cells circuitry. We explicitly note here that the cortical orientation maps will be obtained by using the lifting of a random stimulus. This choice is motivated also by the fact that experimentally the cortical maps arise in the early post-natal period in the absence of any visual experience, just in the presence of an intrinsic random basal activation (see the studies of Jegelka et al. [21], Tanaka et al. [38], Bednar and Miikkulainen [5]). A refinement of the orientation maps is performed subsequently by activation patterns based on random waves (see the results provided by Cang et al. [10], Stellwagen and Shatz [36]).

Note that convolution with a Gabor filter will provide a smooth function. Indeed the Gabor is simply a Gaussian function multiplied by a complex exponential. The resulting function will then be a smooth function depending on the variance of the Gaussian, which is the scale. Finally the orientation selection will provide smooth functions, with values in $S^1$. It is well known that even the harmonic functions with values in $S^1$ develop vortices, which will the models for the pinwheels.

It is natural to build feature cortical maps by means of Gabor functions, since they are strictly related to all the functional geometry. In fact we will see that they arise as minimizers of the uncertainty principle in this setting.

4 Orientation Maps as Cortical Bargmann Transforms

4.1 An Uncertainty Principle

Orientation maps have been constructed by Barbieri et al. in [3] where an uncertainty principle related to the functional geometry of the cortex and its non-commutative structure were used.

The uncertainty principle in its general form always applies in presence of two self-adjoint non-commutating vector fields $P_1$ and $P_2$. In our framework, as given by Folland in [18], it is written as the following:

Proposition 1 Let us denote $\mathcal {H}$ an Hilbert space endowed with the scalar product $\langle .\,,\, . \rangle $. Consider two self-adjoint vector fields $P_1$ and $P_2$ on $\mathcal {H}$. Then the following inequality holds:

$$\begin{aligned} |{\langle f, [P_1, P_2] f\rangle } |\le 2 \Vert P_1 f \Vert \Vert P_2 f\Vert , \end{aligned}$$

(16)

for all $L^2(\mathbb {R}^2)$ functions f in the domain of $[P_1, P_2]$.

Proof

Since $P_1$ and $P_2$ are self-adjoint, we can write that

$$\begin{aligned} \begin{aligned} \langle f,\, [P_1,\, P_2]f \rangle =&\langle f,\, (P_1P_2-P_2P_1)f \rangle \\ =\,&\langle P_1 f,\,P_2f \rangle - \langle P_2 f,\,P_1 f \rangle \\ =\,&2i{\text {Im}}\{\langle P_1 f, P_2 f \rangle \}. \end{aligned} \end{aligned}$$

(17)

We employ the Cauchy–Schwarz inequality and write:

$$\begin{aligned} \begin{aligned} \langle f,\, [P_1,\, P_2]f \rangle \le 2|{ \langle P_1 f, P_2 f \rangle } | \le 2 \Vert P_1 f \Vert \Vert P_2f \Vert . \end{aligned} \end{aligned}$$

(18)

$\square $

The first inequality in (18) becomes an equality when $\langle P_1 f, P_2 f \rangle $ is purely imaginary and the second one turns into equality when $P_1f$ and $P_2f$ are linearly dependent. As a consequence minima of the uncertainty inequality (i.e., for that the inequality (16) turns into equality) satisfy the following equation

$$\begin{aligned} P_1 f= i\lambda P_2 f. \end{aligned}$$

(19)

The condition $P_1 f= i\lambda P_2$ with $\lambda \in \mathbb {R}$ gives the minimizers which are called coherent states.

4.2 Gabors as Minimizers of the Uncertainty Principle

We have seen that the Gabor functions are defined on the 2-dimensional retinal plane and generated by the action on a mother filter of $T^{-1}_{(x_0,y_0,\theta ,\sigma )}$ where $T_{(x_0,y_0,\theta ,\sigma )}$ is defined in (1). Accordingly the differential of $T^{-1}$ sends the vector fields $X_1$ and $X_2$ acting in the 4-dimensional manifold of variables $(x,y,\theta , \sigma )$ to new vector fields $Y_1$ and $Y_2$ defined on the retinal plane as:

$$\begin{aligned} \begin{aligned} (dT^{-1})(X_1)\, =\,&\partial _{\xi },\\ (dT^{-1})(X_2)\, = \,&\eta \partial _{\xi }-\xi \partial _{\eta }. \end{aligned} \end{aligned}$$

(20)

It is well known that Gabor filters are minimizers of the uncertainty principle in the Heisenberg group, less known is that they are indeed also coherent states related to the non-commutating vector fields $Y_1$ and $Y_2$ induced by the functional architecture of V1. As a result they satisfy the analogue of (19). Precisely the Gabor functions of the type $\varPsi _{(x_0,y_0,\theta ,\sigma )}$ given by (3) satisfy

$$\begin{aligned} \begin{aligned}&Y_1\varPsi _{(x_0,y_0,\theta ,\sigma )}(x,y)=-2\xi \varPsi _0(\xi ,\eta ),\\&Y_2\varPsi _{(x_0,y_0,\theta ,\sigma )}(x,y)=-i2\xi \varPsi _0(\xi ,\eta ), \end{aligned} \end{aligned}$$

(21)

which fulfill (19) for $\lambda =-1$.

4.3 Interpretation of the Cortical Maps as a Bargmann Transform of a Random Stimulus

Let us recall that the operator associated with the coherent states is the Bargmann transform:

$$\begin{aligned} \begin{aligned} (B^{{ SE}(2)}I)(x,y,\theta , \sigma ):=\,&\langle \varPsi _{(x_0,y_0,\theta , \sigma )},\; I\rangle _{L^2(\mathbb {R}^2)}\\ =\,&\int \limits _{\mathbb {R}^2}\varPsi _{(x_0,y_0,\theta , \sigma )}(\xi , \eta ) I(\xi , \eta )\mathrm{d}\xi , \mathrm{d}\eta . \end{aligned} \end{aligned}$$

(22)

In particular, the response of simple cells, being defined by (13) as the convolution with a Gabor coherent state, can be interpreted as the Bargmann transform in the functional architecture:

$$\begin{aligned} O_{(\theta , \sigma )}(x,y ) = (B^{{ SE}(2)}I)(x,y,\theta , \sigma ). \end{aligned}$$

(23)

As a consequence, the orientation maps are associated with the Bargmann transform of a random stimulus.

5 Comparison with Previous Models of the Cortical Maps

5.1 Superposition of Random Waves

One of the first models for the construction of the orientation preference maps is proposed by Petitot in [27] where the map is obtained through the superposition of randomly weighted complex sinusoids

$$\begin{aligned} \sum \limits _{k=1}^{k=N}c_k\text {e}^{i2\pi \big (x\cos (2\pi k/N)+y\sin (2\pi k/N)\big )}, \end{aligned}$$

(24)

with N denoting the number of frequency samples and where the coefficients $c_k\in [0, 1]$ are the white noise.

In this way the functional role of the Gabor functions as receptive profiles is disregarded since the orientation map is constructed via direct superimposition of the waves with randomly generated magnitudes, avoiding that the Gabors naturally process the stimulus by lifting it to the phase space of the corresponding intrinsic variables. More specifically in this procedure the complex sinusoid functions are not localized while it is known from neurophysiological experiments that the orientation selectivity is performed locally by the simple cells (see for example the work of Field and Tolhurst [16]).

5.2 The Bargmann Transform of the Irreducible Representations

In the model proposed by Barbieri et al. in [3], the orientation map is built starting from the coherent states in the irreducible representation.

Definition 1 The representation of a group G is a map $\varPhi : G\rightarrow A(V)$, from the group G to the space of automorphisms of a vector space V, such that $\varPhi $ is compatible with the group law. The representation will be denoted by $(\varPhi , V)$, and it is called irreducible if it has no proper group subrepresentation $(\varPhi , W)$, where W is a subspace of V.

Taking the Fourier transform of the vector fields $Y_1$ and $Y_2$ defined in (20), we obtain:

$$\begin{aligned} \mathcal {F}(Y_1f)=iz_1\hat{f},\quad \mathcal {F}(Y_2f)=(z_2\partial _{z_1}-z_1\partial _{z_2})\hat{f}. \end{aligned}$$

(25)

We can write those vector fields also in terms of the polar coordinates $(z_1,z_2)=(\varOmega \cos (\varphi ), \varOmega \sin (\varphi ))$ with $\varOmega \in \mathbb {R}^+$ and $\varphi \in S^1$. In this case the fields become

$$\begin{aligned} \hat{Y}_1\hat{f}=i\varOmega \cos (\varphi )\hat{f},\quad \hat{Y}_2\hat{f}=\partial _{\varphi }\hat{f}. \end{aligned}$$

(26)

The vector fields $\hat{Y}_1$ and $\hat{Y}_2$ do not contain any radial derivative and only depend on the angular direction in the Fourier space. Therefore they act independently on every circle, of arbitrary radius $\varOmega $. Then it is possible to restrict the action of these vector fields to any circle with radius $\varOmega $ on the Fourier space separately (see the explanations of Sugiura [37] for details). This is the reason why the vector fields $Y_1, Y_2$ on the whole space (in the Fourier domain as well) are called reducible, while $\hat{Y}_1$ and $\hat{Y}_2$ which cannot be further reduced once $\varOmega $ is fixed, are called irreducible.

If we write the coherent state condition (19) on the Fourier domain in terms of $\hat{Y}_1$ and $\hat{Y}_2$,

$$\begin{aligned} \hat{Y}_1\hat{f}=i\lambda \hat{Y}_2\hat{f}, \end{aligned}$$

(27)

we find the coherent states

$$\begin{aligned} \begin{aligned} \hat{\varPsi }^{\varOmega }_{(x_0,y_0,\theta ,\sigma )}(\varphi )=\hat{\varPsi }_{(x_0,y_0,\theta ,\sigma )}(\varOmega \cos (\varphi ),\varOmega \sin (\varphi )),\\ \end{aligned} \end{aligned}$$

(28)

where $\hat{\varPsi }_{(x_0,y_0,\theta ,\sigma )}$ is the Fourier transform of the Gabor filters, while $\hat{\varPsi }^{\varOmega }_{(x_0,y_0,\theta ,\sigma )}$ is a function of the angular variable, defined on the circle of the radius $\varOmega $.

In [2, 3] Barbieri et al. use the family of the coherent states obtained for a fixed value of $\sigma $, and for a single value of $\varOmega $

$$\begin{aligned} \hat{\varPsi }^{\varOmega }_{(x_0,y_0,\theta )}. \end{aligned}$$

(29)

In perfect analogy with Eq. (13) the Bargmann transform in these variables is expressed as the operator with the kernel $\hat{\varPsi }^{\varOmega }_{(x_0,y_0,\theta )}$ as:

$$\begin{aligned} \begin{aligned} B^{\varOmega } g(x,y,\theta ):=\,&\langle \hat{\varPsi }^{\varOmega }_{(x_0,y_0,\theta )},\; g\rangle _{L^2(S^1)}\\ =\,&\int \limits _0^{2\pi }\hat{\varPsi }^{\varOmega }_{(x_0,y_0,\theta )}(\varphi )g(\varphi )\mathrm{d}\varphi . \end{aligned} \end{aligned}$$

(30)

In [2, 3], this transform is applied to a white noise g defined on the annulus (on the Fourier domain). For every point (x, y) an orientation is selected by means of an integration analogous to the one expressed in (14):

$$\begin{aligned} \overline{\theta }^{\varOmega }(x,y)= \frac{1}{2}{\text {arg}}\Big (\int \limits _0^{\pi }\Big \{ B^{\varOmega } g(x,y,\theta )\Big \}\text {e}^{i\theta }\mathrm{d}\theta \Big ). \end{aligned}$$

(31)

In this way they find an orientation preference at each point (x, y) which depends on the fixed value of $\varOmega $ and they obtain the orientation preference maps (with no scale parameters).

Although both our model and the model proposed in [2, 3] by Barbieri et al. make use of the idea of a Bargmann transform they differ on three points.

Firstly our method employs the coherent states corresponding to the reducible representations while the other one uses the states restricted to the irreducible representations in the Fourier domain.

Secondly we start from a noise generated on the real domain and apply the Bargmann transform, while the other method introduces the noise in the Fourier domain on the irreducible representations, and apply the Bargmann transform in the Fourier space. The choice made in the present paper here is physiologically more plausible since experimentally the cortical maps can arise in the early post-natal period in the absence of any external stimulus, just in the presence of a random basal activation (see Bednar and Miikkulainen [5] and Jegelka et al. [21]). The present model has the potential to provide a reasoning and an explanation of how the formation of cortical maps occurs at the neurophysiological level.

The third main difference is that the present model can also consider the scale selectivity while in the other model the scale is fixed. More generally it is possible to extend the present model in order to include other visual features by using generalized Gabor functions.

6 Experiments

We consider a stimulus I(x, y) of $128\times 128$ pixels with the random values generated from a uniform distribution over $[-1, 1]$ at each pixel.

We obtain the total set of simple cell responses via the linear filtering of the test image with the rotated and translated Gabor filter banks as described in (13) with different scale values $\sigma $. Then we represent the selected orientation $\overline{\theta }(x,y)$ and $\overline{\sigma }(x,y)$, via (14) and (15), at every point (x, y) on the $128\times 128$ image plane.

Previously in the literature it was reported from the physiological experiments of Bosking [6] (see Fig. 6) that the orientation preference map had certain characteristics (see the explanations of Bressloff and Cowan [8], and Petitot [12, p.27], [27, p.87]). To begin with, orientation preferences on the map are distributed almost continuously across the cortex and the pinwheel architecture is crystalline-like. In other words there is a regular lattice of the pinwheels on the orientation preference map with a certain spatial periodicity. Furthermore the orientation map contains three types of points as described by Petitot [27, p.87], namely: (a) Regular points around which the orientation iso-lines are parallel (the zones with regular points are called linear zones), (b) Singular points which are located at the center of the pinwheels (Those singularities might have positive or negative chirality. That is, when we turn around a pinwheel in the clockwise direction, the orientations turn in the clockwise direction—positive chirality—or in the counter-clockwise direction—negative chirality. The pinwheels represent opposite chiralities when they are adjacent to each other), (c) Saddle points at the center of regions where iso-orientation lines bifurcate (the case where two iso-orientation lines start from the same pinwheel and arrive at opposite pinwheels).

We will see that in the present study, we are able to produce all the three kinds of points. In the first experiment we consider different fixed scales and apply (13) and (14) to obtain the orientation maps: The results are shown in Fig. 7, where the orientation maps are visualized and the three kinds of points are outlined.

Figure 8 shows the cross-correlation between the simulated cortical maps where several picks are present and they testify the crystalline structure of the map. Notice that the periodicity of the peaks is linearly dependent on the scale of the Gabor filters employed for the construction of the map.

The size of the pinwheel structure is also strictly correlated with the scale of the Gabor filters, as shown in Fig. 9.

Let us note that as one passes through V1–V2–V3 areas of the cortex the sizes of the simple cells increase and the lattice of the orientation map extends while the pinwheels are preserved, as visualized on the top of Fig. 10. Our simulations are able to reproduce the same type of orientation maps, which preserve the pinwheels through different cortex layers, by simply changing the scale of the Gabor filter bank as shown on the bottom of Fig. 10.

In the next series of experiments we will compute the orientation maps by selecting at every point orientation and scale by using the three Eqs. (13), (14) and (15). This case is the closer one to the physiological situation of a normal visual cortex, where cells with different orientations and sizes are present. In Fig. 11 the relevant simulation result of the model is visualized, showing the orientation map rendering both orientation and scale selectivity.

In the final experiment, which is given in Fig. 12, we used the same procedure as in Fig. 11 but using three different sets of scales and we obtained a result similar to Fig. 10. This procedure is closer to the real receptive field composition of the primary visual cortex.

7 Conclusion

In this paper we presented a new model for the generation of orientation preference maps in the primary visual cortex, considering both orientation and scale features. We considered modeling the functional architecture of the primary visual cortex by taking into account orientation and scale features and using a framework inspired by Sarti et al. [32]. Furthermore, we also provided the physical reasoning behind the choice of the generalized Gabor function by showing that it is a coherent state of the non-commutative framework corresponding to the cortex functional architecture. The intrinsic variables of orientation and scale constitute a fiber on each point of the retinal plane and the set of receptive profiles of simple cells is located on the fiber. The orientation preference maps are then obtained simply as the lifting of a noise stimulus by a set of Gabor filters, mapping the orientation value on the 2-dimensional plane. This corresponds to the Bargmann transform in the reducible representation of the $\text {SE}(2)$ group which is followed by a maximum response selection procedure. A comparison has been provided with a previous model based on the Bargmann transform in the irreducible representation of the $\text {SE}(2)$ group, outlining that the new model is more physiologically motivated. From the simulation results it appears that this technique is able to reproduce the cortical maps of different areas with morphological characteristics comparable to the experimental data. A clear advantage of the method consists also in its versatility since a number of different features could be considered, such as frequency and phase. Further studies will be conducted in this direction in the close future.

References

Barbieri, D., Citti, G., Cocci, G., Sarti, A.: A cortical-inspired geometry for contour perception and motion integration. J. Math. Imaging Vis. 49(3), 511–529 (2014)
Article MATH Google Scholar
Barbieri, D., Citti, G., Sanguinetti, G., Sarti, A.: Coherent states of the euclidean group and activation regions of primary visual cortex. arXiv preprint arXiv:1111.0669 (2011)
Barbieri, D., Citti, G., Sanguinetti, G., Sarti, A.: An uncertainty principle underlying the functional architecture of v1. J. Physiol. Paris 106(5), 183–193 (2012)
Article Google Scholar
Bargmann, V.: On a hilbert space of analytie functions and an associated integral transform. part II. A family of related function spaces application to distribution theory. Commun. Pure Appl. Math. 20(1), 1–101 (1967)
Article MATH Google Scholar
Bednar, J.A., Miikkulainen, R.: Constructing visual function through prenatal and postnatal learning. Neuroconstructivism Perspect. Prospects 2, 13–37 (2004)
Google Scholar
Bosking, W.H., Zhang, Y., Schofield, B., Fitzpatrick, D.: Orientation selectivity and the arrangement of horizontal connections in tree shrew striate cortex. J. Neurosci. 17(6), 2112–2127 (1997)
Article Google Scholar
Bressloff, P.C., Cowan, J.D.: The functional geometry of local and horizontal connections in a model of v1. J. Physiol. Paris 97(2), 221–236 (2003)
Article Google Scholar
Bressloff, P.C., Cowan, J.D.: A spherical model for orientation and spatial-frequency tuning in a cortical hypercolumn. Philos. Trans. R. Soc. Lond. B Biol. Sci. 358(1438), 1643–1667 (2003)
Article Google Scholar
Bressloff, P.C., Cowan, J.D., Golubitsky, M., Thomas, P.J., Wiener, M.C.: Geometric visual hallucinations, euclidean symmetry and the functional architecture of striate cortex. Philos. Trans. R. Soc. B Biol. Sci. 356(1407), 299–330 (2001)
Article Google Scholar
Cang, J., Rentería, R.C., Kaneko, M., Liu, X., Copenhagen, D.R., Stryker, M.P.: Development of precise maps in visual cortex requires patterned spontaneous activity in the retina. Neuron 48(5), 797–809 (2005)
Article Google Scholar
Citti, G., Sarti, A.: A cortical based model of perceptual completion in the roto-translation space. J. Math. Imaging Vis. 24(3), 307–326 (2006)
Article MathSciNet MATH Google Scholar
Citti, G., Sarti, A.: Neuromathematics of Vision. Springer, Berlin (2014)
Book MATH Google Scholar
Cocci, G., Barbieri, D., Citti, G., Sarti, A.: Cortical spatiotemporal dimensionality reduction for visual grouping. Neural Comput. 27(6), 1252–1293 (2015)
Article Google Scholar
Das, A., Gilbert, C.D.: Long-range horizontal connections and their role in cortical reorganization revealed by optical recording of cat primary visual cortex. Nature 375(6534), 780 (1995)
Article Google Scholar
Daugman, J.G.: Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortical filters. JOSA A 2(7), 1160–1169 (1985)
Article Google Scholar
Field, D., Tolhurst, D.: The structure and symmetry of simple-cell receptive-field profiles in the cat’s visual cortex. Proc. R. Soc. Lond. B Biol. Sci. 228(1253), 379–400 (1986)
Article Google Scholar
Field, D.J., Hayes, A., Hess, R.F.: Contour integration by the human visual system: evidence for a local association field. Vis. Res. 33(2), 173–193 (1993)
Article Google Scholar
Folland, G.B.: Harmonic Analysis in Phase Space, (AM-122), vol. 122. Princeton University Press, Princeton (2016)
Google Scholar
Gabor, D.: Theory of communication. part 1: the analysis of information. J. Inst. Electr. Eng. Part III Radio Commun. Eng. 93(26), 429–441 (1946)
Google Scholar
Hörmander, L.: Hypoelliptic second order differential equations. Acta Mathematica 119(1), 147–171 (1967)
Article MathSciNet MATH Google Scholar
Jegelka, S., Bednar, J.A., Miikkulainen, R.: Prenatal development of ocular dominance and orientation maps in a self-organizing model of v1. Neurocomputing 69(10), 1291–1296 (2006)
Article Google Scholar
Koenderink, J.J.: The structure of images. Biol. Cybern. 50(5), 363–370 (1984)
Article MathSciNet MATH Google Scholar
Koenderink, J.J., van Doorn, A.J.: Representation of local geometry in the visual system. Biol. Cybern. 55(6), 367–375 (1987)
Article MathSciNet MATH Google Scholar
Lindeberg, T.: Feature detection with automatic scale selection. Int. J. Comput. Vis. 30(2), 79–116 (1998)
Article Google Scholar
Lindeberg, T.: Generalized gaussian scale-space axiomatics comprising linear scale-space, affine scale-space and spatio-temporal scale-space. J. Math. Imaging Vis. 40(1), 36–81 (2011)
Article MathSciNet MATH Google Scholar
Lindeberg, T.: A computational theory of visual receptive fields. Biol. Cybern. 107(6), 589–635 (2013)
Article MathSciNet MATH Google Scholar
Petitot, J.: The neurogeometry of pinwheels as a sub-riemannian contact structure. J. Physiol. Paris 97(2), 265–309 (2003)
Article Google Scholar
Petitot, J.: Neurogéométrie de la vision. Modeles mathématiques et physiques des architectures fonctionelles. Éd. École Polytech, Paris (2008)
Google Scholar
Petitot, J., Tondut, Y.: Vers une neurogéométrie. fibrations corticales, structures de contact et contours subjectifs modaux. Mathématiques informatique et sciences humaines 145, 5–102 (1999)
Google Scholar
Sanguinetti, G., Citti, G., Sarti, A.: A model of natural image edge co-occurrence in the rototranslation group. J. Vis. 10(14), 37–37 (2010)
Article Google Scholar
Sarti, A., Citti, G.: The constitution of visual perceptual units in the functional architecture of v1. J. Comput. Neurosci. 38(2), 285–300 (2015)
Article MathSciNet Google Scholar
Sarti, A., Citti, G., Petitot, J.: The symplectic structure of the primary visual cortex. Biol. Cybern. 98(1), 33–48 (2008)
Article MathSciNet MATH Google Scholar
Sarti, A., Citti, G., Petitot, J.: Functional geometry of the horizontal connectivity in the primary visual cortex. J. Physiol. Paris 103(1), 37–45 (2009)
Article Google Scholar
Sarti, A., Piotrowski, D.: Individuation and semiogenesis: an interplay between geometric harmonics and structural morphodynamics. In: Sarti, A., Montanari, F. & Galofaro, F. (eds.) Morphogenesis and Individuation, pp. 49–73. Springer (2015)
Sharma, U., Duits, R.: Left-invariant evolutions of wavelet transforms on the similitude group. Appl. Comput. Harmon. Anal. 39(1), 110–137 (2015)
Article MathSciNet MATH Google Scholar
Stellwagen, D., Shatz, C.: An instructive role for retinal waves in the development of retinogeniculate connectivity. Neuron 33(3), 357–367 (2002)
Article Google Scholar
Sugiura, M.: Unitary Representations and Harmonic Analysis: An Introduction, vol. 44. Elsevier, Amsterdam (1990)
MATH Google Scholar
Tanaka, S., Miyashita, M., Ribot, J.: Roles of visual experience and intrinsic mechanism in the activity-dependent self-organization of orientation maps: theory and experiment. Neural Netw. 17(8), 1363–1375 (2004)
Article Google Scholar
Wertheimer, M.: Laws of organization in perceptual forms. In: A Source Book of Gestalt Psychology. Harcourt Brace, New York (1923)

Download references

Author information

Authors and Affiliations

Dipartimento di Matematica, Università di Bologna, Bologna, Italy
Emre Baspinar, Giovanna Citti & Alessandro Sarti
CAMS/CNRS-EHESS, Paris, France
Emre Baspinar, Giovanna Citti & Alessandro Sarti

Authors

Emre Baspinar
View author publications
You can also search for this author in PubMed Google Scholar
Giovanna Citti
View author publications
You can also search for this author in PubMed Google Scholar
Alessandro Sarti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alessandro Sarti.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Baspinar, E., Citti, G. & Sarti, A. A Geometric Model of Multi-scale Orientation Preference Maps via Gabor Functions. J Math Imaging Vis 60, 900–912 (2018). https://doi.org/10.1007/s10851-018-0803-3

Download citation

Received: 07 July 2017
Accepted: 26 February 2018
Published: 07 April 2018
Issue Date: July 2018
DOI: https://doi.org/10.1007/s10851-018-0803-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Geometric Model of Multi-scale Orientation Preference Maps via Gabor Functions

Abstract

Similar content being viewed by others

A sub-Riemannian model of the visual cortex with frequency and phase

Monochromaticity of Orientation Maps in V1 Implies Minimum Variance for Hypercolumn Size

Orientation Maps in V1 and Non-Euclidean Geometry

1 Introduction

2 Receptive Profiles of Simple Cells