- Research
- Open access
- Published:
How Uncertainty Bounds the Shape Index of Simple Cells
The Journal of Mathematical Neuroscience volume 4, Article number: 5 (2014)
Abstract
We propose a theoretical motivation to quantify actual physiological features, such as the shape index distributions measured by Jones and Palmer in cats and by Ringach in macaque monkeys. We will adopt the uncertainty principle associated to the task of detection of position and orientation as the main tool to provide quantitative bounds on the family of simple cells concretely implemented in primary visual cortex.
Mathematics Subject Classification (2000)2010:62P10, 43A32, 81R15.
1 Introduction
One of the fundamental tasks performed by simple cells in primary visual cortex is that of detecting position and local orientation of a stimulus [1]. On the other hand, the functional behavior of simple cells as visual detectors is characterized in terms of standard linear filtering and with other so-called nonclassical behaviors [2]. We will concentrate on linear aspects, and consider classical receptive profiles modeled with a planar oscillation under a spatially localizing window. In [3], such receptive profiles were studied in terms of two dimensionless indexes of shape corresponding to the product of the frequency of the oscillation and the sizes of the window in the direction of the oscillation and in the orthogonal one, showing that the distribution of such feature on V1 simple cells of macaque monkeys is confined to a specific region. This result is summarized in Fig. 1. Remarkably, the same confinement was found also in cats [4, 5], and this suggests that this pattern can be associated with some criteria of optimality with respect to perceptive tasks. Notable proposals of such criteria were stated in terms of sparse coding in [6], already discussed in [3], and more recently in [7], or in terms of Bayesian learning [8].
In this paper, we will focus on the task of position and orientation detection, and propose theoretical motivations based on the uncertainty principle for the corresponding geometry to explain such confinement. In general, the uncertainty principle is indeed a tool that gives information on the possible localization of functions with respect to competing symmetries that in this case are those of the well-known group of translations and rotations of the Euclidean plane. The role of symmetries in the mechanisms of visual perception in V1 is a well recognized point [9–11], as well as the uncertainty principle was already invoked to explain relevant cortical morphologies [12, 13]. Here, we will use such concepts to characterize the resolution that can be obtained with joint spatial and angular measurements, based on the localization properties of receptive profiles. In terms of such characterizations, we will deduce the bounds observed in Fig. 1 as the result of intrinsic notions of balance between joint measurements resolutions.
2 Receptive Profiles and Relevant Symmetries
We will assume isotropic Gaussian Gabor filters as a model for standard V1 simple cells classical receptive profiles, defined on the Euclidean image plane:
with parameters , , . Each V1 simple cell is assumed to perform a linear filtering with a function shaped as in (1), so that it can be characterized by these parameters. Their mapping on the two dimensional cortical layers are referred to as cortical maps [14]. In particular, the centers q of receptive fields are in a so-called retinotopic correspondence on the cortex [1], while the size σ is in average larger at the periphery and smaller close to the fovea [15]. The frequency parameters p are generally considered in polar coordinates , where is called spatial frequency and the angle θ up to a factor of π is called preferred orientation, and their cortical maps are also well studied [11, 13, 16].
The family of functions (1) were proposed in [12] due to their optimal localization in space and frequency with respect to the classical Heisenberg uncertainty principle, and their fitness to model the linear behavior of simple cells was thoroughly tested [3]. We note, however, that here we are dealing with a simplified model of isotropic receptive fields, since as we will see this provides enough information for the present study, with the advantage that the results can be stated in a clearer form. We also recall that the real and imaginary parts in (1) correspond to so-called even and odd cells
but it will be sufficient for our purposes to deal with the full complex function as a whole.
2.1 Groups of Transformations
Let us introduce the following unitary operators on :
-
(i)
translations: , ,
-
(ii)
modulations: , ,
-
(iii)
dilations: , ,
-
(iv)
rotations: , ,
where stands for the usual counterclockwise rotation of an angle θ on the Euclidean plane. In particular, we note that rotations commute with dilations, and it is easy to see that
If we denote with , a normalized isotropic Gaussian with unit standard deviation
then we can characterize the functions (1) in terms of the operators (i), (ii), and (iii) as
Such a family is the prototype of a so-called wave packet systems [17], and much is known about these structures [18, 19].
In this work, we will deal with the localization properties of (1) with respect to translations and local rotations, i.e., making use of the symmetries (i) and (iv), since they constitute two fundamental symmetries related to the mechanisms of visual perception in V1 (see, e.g., [9] and references therein).
Local rotations are defined by
where is a rotation of the Euclidean plane around point q, and with respect to these transformations we have the following.
Lemma 2.1 Let be as in (1). Then
Proof Using (2) and the definition of local rotations (3), we get
so (4) follows since rotations commute with dilations and is isotropic, i.e., . □
Actually, the fact that is isotropic allows to write the whole family (1) in terms of all the operators (i) to (iv). Indeed, denoting with θ the polar angle of p, that means , we can write (1) as
where , so another way to characterize the system of functions (1) is to consider a family and rotate and translate each of its members. The aim of next section is actually to deduce properties on the localization of with respect to the parameters q and θ, expressed in terms of the parameters and σ.
3 Measures of Uncertainty
In this section, we characterize the uncertainty associated to joint measurements of positions and local orientations in terms of the properties of the measurement devices, expressed by functions, and quantify such uncertainties for the case of receptive profiles.
We recall that the generators of translations along the Cartesian axis are given by partial derivatives
while the generator of a rotation around point q can be written in terms of the ordinary infinitesimal rotation operator
and acts as the skew self-adjoint operator on
We will measure averages and variances using the standard definitions for operators on , denoting with the scalar product and with the associated norm.
Definition 3.1 Let L be a densely defined skew self-adjoint linear operator on . We define its mean value over as
and its variance over as
Since skew self-adjoint operators are the infinitesimal generators of a one parameter group of unitary transformations, the meaning of the average (7) is that of measuring the deformation of f under such transformations
and the imaginary constant is merely a convention to ensure the result to be real. With this averaging, the variance (8) has the usual meaning of strength of the fluctuations of f under the considered transformations that corresponds to the second moment of the distribution . This means then that the variance (8) provides a measure of the localization of f with respect to the symmetry .
When applied to the operators (5) and (6) of linear and rotational derivatives, these variances correspond respectively to a measure of linear and rotational fluctuations of a function f. The more f is insensitive to translations (f smooth and close to a constant function), the smaller is its P variance, while a small variance means that f has little sensitivity to rotations around q.
The notion of localization in orientation that arises indicates that a function consisting of a set of parallel stripes, independently on their widths, is maximally localized in orientation, while a function that is circular symmetric around q is minimally localized.
If we are interested in the joint localization properties of a function with respect to a two parameters group of unitary transformations, generated by two skew self-adjoint operators and , we are led to consider the distribution
In this case, if the operators and do not commute, then the second moments of the distribution (9) are influenced by their commutator. Such an effect of competing symmetries is quantified by the uncertainty principle.
3.1 The Uncertainty Principle
The operators (5) and (6) satisfy the commutation relations of angular momentum [20]
These commutators define the algebra of the group (see, e.g., [9] and references therein), and for them the following generalized uncertainty principle holds [13, 21], with respect to the quantities of Definition 3.1. Since we are dealing with densely defined operators, we will skip in what follows the technicalities related to operator domains, and refer the statements simply to . For more details, see [21].
Theorem 3.2 ( uncertainty principle)
For any , it holds
These inequalities play the same role for the noncommutative symmetries of rotations and translations as the one played by the ordinary uncertainty inequality for the noncommutativity of quantum mechanical operators. The main difference is that in this case if we consider separately each of the two inequalities, we cannot obtain a constant lower bound. Indeed for a function f the product of variances of an infinitesimal rotations and a translations along one axis can be arbitrarily small, provided that the average of translations along the other axis on f is small. This effect disappears when we consider translations on both axis, which is natural whenever we do not want to discriminate one direction over the other. In this case, we can actually recast the two inequalities (11) into one inequality with a constant lower bound.
The following definition is closely related to that of [22], and for this reason we use the same notation Angv.
Definition 3.3 Let us define the functional
where
We denote with the corresponding measure of angular uncertainty
With this definition, a direct consequence of the uncertainty principle is the following.
Theorem 3.4 For all
This inequality resembles the ordinary Heisenberg uncertainty inequality, since the presence of a constant lower bound provides a clear constraint on the joint localizations quantified by and . However, as first noted in [23], the uncertainty inequalities (11) cannot be simultaneously minimized, so also (13) does not admit minimizers. This is related to the issue of nonexistence of a canonically conjugate observable for angular momentum [24, 25]. Indeed, if we had a well defined self-adjoint operator canonically commuting with angular momentum, we would end up with a well-known complex equation defining minimal uncertainty states [21], while in this case we have two such equations, whose solutions provide CR function functions on the for two noncompatible almost complex structures [26].
3.2 Autocorrelations
We pass now to the study of the properties of the distribution (9) applied to the symmetries under study, that we call autocorrelation since it has the form of the autocorrelation of a function with respect to the group of rotations and translations, and extends naturally the ordinary definition of autocorrelation with respect to translations. We will actually restrict the analysis to the square modulus of correlations, since as we will see it contains enough information for the present purposes. In particular, we will show that such correlations can be used to characterize the uncertainty in the detection of position and local preferred angle associated to a function.
Definition 3.5 Given f in , we define its autocorrelation centered at q as
In general, provides a natural way to study the joint localization properties of f with respect to position and local preferred angle. Indeed, when we specialize to translations we get the usual autocorrelation, and by Plancherel theorem
so we have that by Young inequality and the Riemann–Lebesgue lemma is bounded and goes to 0 as ξ becomes large. Moreover, by the usual uncertainty principle, we have that when f is well localized in space, then is broadly localized, hence passing under another Fourier transform will decay rapidly, uniformly on q, and vice versa.
On the other hand, if we consider correlations only with respect to rotations, for simplicity centered at
essentially the same argument applies to the decay of correlations for functions that are localized with respect to rotations.
Remark 3.6 (What does “essentially the same argument” mean)
Since , we get
so setting polar coordinates, with the notation
where and the last transition is Parseval identity.
Since as a tensor product of Hilbert spaces, and since f is localized with respect to rotations in the real plane if and only if it is localized with respect to rotations in the Fourier plane, then we can assume without loss of generality that , where decays rapidly away from and . So,
and now strictly the same argument used for (15) applies.
3.3 Uncertainty Associated to Measurements with Receptive Profiles
When specialized to receptive profiles, the introduced uncertainties can be explicitly computed. In the proofs, we will use the shorthand notation , and θ will be the polar angle of p.
Lemma 3.7 The variance of the operators (5) on receptive profiles (1) is
Proof Since , we get , and
□
Lemma 3.8 The variance of the operator (6) on receptive profiles (1) is
and we will call it angular momentum variance.
Proof Since
then
Its mean value vanishes on , due to the isotropy of :
To compute the variance, by analogous arguments
□
We have then obtained the following proposition, which shows that for receptive profiles the angular momentum variance is inversely proportional to the angular uncertainty quantified in terms of Angv.
Proposition 3.9 Let be as in (1). Then
Proof Using Definition 3.3, Lemma 3.7, and Lemma 3.8, we have
□
We will now consider autocorrelations of receptive profiles, and see that they indeed contain precisely the desired joint information on localizations in space and local orientation associated to the uncertainties we computed.
Proposition 3.10 Let be defined by (1). Then its -autocorrelation reads
Proof By Lemma 2.1, and computing the Fourier transform of a Gaussian
so the result follows since . □
This proposition shows that the decay of the autocorrelation in space is a Gaussian with the same width of the corresponding receptive profile that characterize spatial uncertainty. With respect to rotations, we have ended up with a Von Mises distribution in orientations. Such distributions appear naturally when discussing the uncertainty principle [20, 27], but they also provide a good model for orientation tuning of simple cells [28], which is defined as the response curve of a cell to oriented stimuli [28, 29]. This confirms that the introduced notion of localization is compatible with the resolution of measurements performed with receptive profiles. Moreover, we note that the commonly used circular variance [30] of the Von Mises distribution in (19), up to a normalization constant, is
which results to be numerically close to what we have introduced as angular uncertainty (12) when applied to receptive profiles (18)
In particular, as we will see in next section, typical values of in the filters encountered in V1 are around 1.7, where the difference between these two notions of variance is around .
4 Bounds on the Shape Index Induced by Uncertainty
In this section, we will use the measures of uncertainty referred to receptive profiles (18) and (19) to deduce relevant features about the physiological data measured in [3] and depicted in Fig. 1. In particular, we will see how the information provided by the analysis of uncertainty relations of Sect. 3 are sufficient to establish bounds on the number of subregions observed in the family of filters implemented in V1, and permit to reobtain characteristic sampling rates commonly used in image analysis.
A receptive profile consists of an oscillation of frequency under a Gaussian bell of width σ, so it appears natural to define a dimensionless index of shape [3]
This quantity is related to the number of subregions defining a receptive profiles, since if we let be the number of half wavelength of receptive profile’s oscillation within k standard deviations σ, we obtain . As it is apparent from the data measured in [3], we see that approximately standard deviations are sufficient to represent the main content of the filters, so that we can relate the effective subregions N to n as .
In terms of n, the angular momentum variance (17) of a receptive profile reads
while its angular variance (12), after (18), reads
4.1 Lower Bound for Orientation Measurements
By the discussion in Sect. 3, we have seen how we can quantify with ΔΘ the angle resolution allowed by a linear filtering. If we refer to the task of orientation detection, we can set as a reasonable bound that of angle uncertainty less than , that is expressed by
This condition can be stated in terms of the shape index using (22)
As we can see in Fig. 1 and by the discussions in [3], cells which show a selectivity in orientation all lie above this threshold. Moreover, we note that for indexes , it can be a hard task to distinguish an even cell from being represented only by a Gaussian, while odd cells under this threshold all appear identical up to a multiplicative factor, so the parametric fit of the Gabor model (1) is quite delicate in this region. We can then interpret the bunch of broadly tuned cells around the zero value of the shape index n as generally below the minimal uncertainty bound that allows a consistent detection of orientations.
4.2 Upper Bound
In order to discuss the upper bound, we introduce a notion of characteristic length associated to a specific level set of the correlations (1), intrinsically related to the task of detection of positions and local orientations. Its purpose is to quantify the minimum distance that one needs to cover in order to decorrelate a function f as much as f is decorrelated when compared at orthogonal directions.
Definition 4.1 The correlation length for is the smallest distance λ for which
If we apply this notion to receptive profiles (1), we obtain the following.
Proposition 4.2 The shape index (20) is bounded from above by the ratio of the correlation length λ and the spatial uncertainty σ
Proof Condition (23) on receptive profiles , by (19) reads , since
so (24) follows by the relation (21) between and the shape index. □
On the other hand, as discussed when dealing with the relation between the shape index and the number of subregions, we have also that the effective field of influence of a receptive profile can be set within two standard deviations σ. From this point of view, we can then assume that the distance d at which a receptive profile is effectively spatially uncorrelated corresponds to the distance that one has to cover in order to let its effective field of influence not intersect with its translation at a distance d, i.e., .
In order to couple with both position and orientation measurements, we will then consider the hypothesis of balance of the two characteristic scales introduced that is the identification . By (24), this condition can be stated in terms of the shape index as
To compare this bound with Fig. 1, we recall that here we are dealing with the simplified model of isotropic receptive fields, while in [3] the analysis is performed considering two anisotropic indexes and . In terms of such indexes, we can see that the largest part of the population lies within two bounds and , and looks in good accordance with their mean value.
The question of whether this identification of characteristic distances is truly implemented in the cortex cannot be answered at this point, but we note that a cortical scale related to the symmetries under study that is possibly compatible with the proposed relation is the mean correlation length of orientation preference maps (see, e.g., [13, 14] and references therein). Indeed, by the measurements performed in [31], we see that such scale is comparable with the size of a so-called cortical point image, that is, the cortical region that is activated after a highly spatially localized stimulus, and at least when we reduce to linear behavior of cells this notion corresponds to what we have indicated as effective field of influence.
4.3 Sampling on Orientations
Another intriguing consequence of the performed uncertainty analysis can be stated in terms of optimal sampling rates for orientation detection. Indeed, if we consider the mean value on shape index measured in [3], or equivalently, in terms of the deduced bounds, for , we have that
With respect to Gabor filters possessing such n, one way to use such result is to consider that the detection of orientations at angles that are closer than this uncertainty do not provide an actual improvement in the resolution of the local orientation present in the stimulus, so that it can be sufficient to cover the interval of orientations with a sampling having a spacing. This actually compares well with the notions of optimal sampling adopted in image analysis tasks (see, e.g., in [32] and references therein), generally justified with independent arguments. Moreover, this uncertainty analysis permits to set clear sampling spacings depending on the shape index of the filter used.
5 Conclusions
In this paper, we have studied theoretical aspects of an analytic characterization of uncertainty that generalizes the well-known Heisenberg uncertainty principle to the symmetries associated with the task of joint measurements of position and local orientation. The implications of this analysis, together with an hypothesis of balance between characteristic correlation distances, allowed us to obtain bounds comparable with experimental data on the shape index of the V1 simple cells that are selective for orientation, and to separate them from broadly tuned cells, which lie below the uncertainty bound for consistent orientation detection.
We remark that this was possible even if our working assumptions on the functional behavior of simple cells were reduced to linear filtering with symmetric receptive fields, and the only considered task is the one associated to the sole symmetries of rotations and translations.
Whether such elementary principles could be directly responsible of the observed distribution of receptive profiles is a question that can hardly find an answer. Nevertheless, the present study shows that they are sufficient to describe many of the relevant features that concern the shape of simple cells.
References
Hubel DH, Wiesel TN: Functional architecture of macaque monkey visual cortex. Proc R Soc Lond B 1977, 198: 1–59. 10.1098/rspb.1977.0085
Graham NV: Beyond multiple pattern analyzers modeled as linear filters (as classical V1 simple cells): useful additions of the last 25 years. Vis Res 2011, 51: 1397–1430. 10.1016/j.visres.2011.02.007
Ringach DL: Spatial structure and symmetry of simple cell receptive fields in macaque primary visual cortex. J Neurophysiol 2002, 88: 455–463.
Jones JP, Palmer LA: The two-dimensional spatial structure of simple receptive fields in cat striate cortex. J Neurophysiol 1987, 58: 1187–1211.
Jones JP, Palmer LA: An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. J Neurophysiol 1987, 58: 1233–1258.
Olshausen BA, Field DJ: Sparse coding with an overcomplete basis set: a strategy employed by V1? Vis Res 1997, 37: 3311–3325. 10.1016/S0042-6989(97)00169-7
Shelton J, Sterne P, Bornschein J, Sheikh AS, Luecke J: Why MCA? Nonlinear sparse coding with spike-and-slab prior for neurally plausible image encoding. Adv Neural Inf Process Syst 2012, 25: 2285–2293.
Hosoya H: Multinomial Bayesian learning for modeling classical and nonclassical receptive field properties. Neural Comput 2012, 24: 2119–2150. 10.1162/NECO_a_00310
Citti G, Sarti A: A cortical based model of perceptual completion in the roto-translation space. J Math Imaging Vis 2006, 24: 307–326. 10.1007/s10851-005-3630-2
Chossat P, Faugeras O: Hyperbolic planforms in relation to visual edges and textures perception. PLoS Comput Biol 2009, 5: 1–16.
Keil W, Wolf F: Coverage, continuity and visual cortical architecture. Neural Syst Circuits 2011. 10.1186/2042-1001-1-17
Daugman JG: Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two dimensional visual cortical filters. J Opt Soc Am A 1985, 2: 1160–1169. 10.1364/JOSAA.2.001160
Barbieri D, Citti G, Sanguinetti G, Sarti A: An uncertainty principle underlying the functional architecture of V1. J Physiol (Paris) 2012, 106: 183–193. 10.1016/j.jphysparis.2012.03.001
Basole A, White LE, Fitzpatrick D: Mapping multiple features in the population response of visual cortex. Nature 2003, 423: 986–990. 10.1038/nature01721
Hubel DH, Wiesel TN: Uniformity of monkey striate cortex: a parallel relationship between field size, scatter, and magnification factor. J Comp Neurol 1974, 158: 295–305. 10.1002/cne.901580305
Ohki K, Chung S, Kara P, Hubener M, Bonhoeffer T, Reid RC: Highly ordered arrangement of single neurons in orientation pinwheels. Nature 2006, 442: 925–928. 10.1038/nature05019
Córdoba A, Fefferman C: Wave packets and Fourier integral operators. Commun Partial Differ Equ 1978, 3: 979–1005. 10.1080/03605307808820083
Kalisa C, Torrésani B: N-dimensional affine Weyl–Heisenberg wavelets. Ann Inst Henri Poincaré. Phys Théor 1993, 59: 201–236.
Labate D, Weiss G, Wilson E: An approach to the study of wave packet systems. Contemporary Mathematics 345. In Wavelets, Frames and Operator Theory. Birkhäuser, Basel; 2004:215–235.
Carruthers P, Nieto MM: Phase and angle variables in quantum mechanics. Rev Mod Phys 1968, 40: 411–440. 10.1103/RevModPhys.40.411
Folland GB, Sitaram A: The uncertainty principle: a mathematical survey. J Fourier Anal Appl 1997, 3: 207–238. 10.1007/BF02649110
Breitenberger E: Uncertainty measures and uncertainty relations for angle observables. Found Phys 1985, 15: 353–364. 10.1007/BF00737323
Jackiw R: Minimum uncertainty product, number phase uncertainty product and coherent states. J Math Phys 1968., 9: Article ID 339 Article ID 339
Dubin DA, Hennings MA, Smith TB: Quantization in polar coordinates and the phase operator. Publ Res Inst Math Sci 1994, 30: 479–532. 10.2977/prims/1195165908
Kastrup HA: Quantization of the canonically conjugate pair angle and orbital angular momentum. Phys Rev A 2006., 73: Article ID 052104 Article ID 052104
arXiv: http://arxiv.org/abs/arXiv:1301.3783 Barbieri D, Citti G: Reproducing kernel Hilbert spaces of CR functions for the Euclidean Motion group. arXiv:1301.3783.
Hradil Z, Řehác̆ek J, Bouchal Z, C̆elechovský R, Sánchez-Coto LL: Minimum uncertainty measurement of angle and angular momentum. Phys Rev Lett 2006., 97: Article ID 243601 Article ID 243601
Swindale NV: Orientation tuning curves: empirical description and estimation of parameters. Biol Cybern 1998, 78: 45–56. 10.1007/s004220050411
Mooser F, Bosking WH, Fitzpatrick D: A morphological basis for orientation tuning in primary visual cortex. Nat Neurosci 2004, 7: 872–879. 10.1038/nn1287
Jammalamadaka SR, Sengupta A: Topics in Circular Statistics. World Scientific, Singapore; 2001.
Bosking WH, Crowley JC, Fitzpatrick D: Spatial coding of position and orientation in primary visual cortex. Nat Neurosci 2009, 5: 874–882.
Lee TS: Image representation using 2D Gabor wavelets. IEEE Trans Pattern Anal Mach Intell 1996, 18: 959–971. 10.1109/34.541406
Acknowledgements
The research of the first author was supported by Grant DIM2011—Région Île de France. The research of the second author was supported by Project AGAPE.
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing Interests
The authors declare that they have no competing interests.
Authors’ Contributions
All authors contributed equally to the writing of this paper. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Barbieri, D., Citti, G. & Sarti, A. How Uncertainty Bounds the Shape Index of Simple Cells. J. Math. Neurosc. 4, 5 (2014). https://doi.org/10.1186/2190-8567-4-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/2190-8567-4-5