- Open Access
Analysis of a hyperbolic geometric model for visual texture perception
The Journal of Mathematical Neuroscience volume 1, Article number: 4 (2011)
We study the neural field equations introduced by Chossat and Faugeras to model the representation and the processing of image edges and textures in the hypercolumns of the cortical area V1. The key entity, the structure tensor, intrinsically lives in a non-Euclidean, in effect hyperbolic, space. Its spatio-temporal behaviour is governed by nonlinear integro-differential equations defined on the Poincaré disc model of the two-dimensional hyperbolic space. Using methods from the theory of functional analysis we show the existence and uniqueness of a solution of these equations. In the case of stationary, that is, time independent, solutions we perform a stability analysis which yields important results on their behavior. We also present an original study, based on non-Euclidean, hyperbolic, analysis, of a spatially localised bump solution in a limiting case. We illustrate our theoretical results with numerical simulations.
Mathematics Subject Classification:30F45, 33C05, 34A12, 34D20, 34D23, 34G20, 37M05, 43A85, 44A35, 45G10, 51M10, 92B20, 92C20.
The selectivity of the responses of individual neurons to external features is often the basis of neuronal representations of the external world. For example, neurons in the primary visual cortex (V1) respond preferentially to visual stimuli that have a specific orientation [1–3], spatial frequency , velocity and direction of motion , color . A local network in the primary visual cortex, roughly 1 mm2 of cortical surface, is assumed to consist of subgroups of inhibitory and excitatory neurons each of which is tuned to a particular feature of an external stimulus. These subgroups are the so-called Hubel and Wiesel hypercolumns of V1. We have introduced in  a new approach to model the processing of image edges and textures in the hypercolumns of area V1 that is based on a nonlinear representation of the image first order derivatives called the structure tensor [8, 9]. We suggested that this structure tensor was represented by neuronal populations in the hypercolumns of V1. We also suggested that the time evolution of this representation was governed by equations similar to those proposed by Wilson and Cowan . The question of whether some populations of neurons in V1 can represent the structure tensor is discussed in  but cannot be answered in a definite manner. Nevertheless, we hope that the predictions of the theory we are developing will help deciding on this issue.
Our present investigations were motivated by the work of Bressloff, Cowan, Golubitsky, Thomas and Wiener [11, 12] on the spontaneous occurence of hallucinatory patterns under the influence of psychotropic drugs, and its extension to the structure tensor model. A further motivation was the following studies of Bressloff and Cowan [4, 13, 14] where they study a spatial extension of the ring model of orientation of Ben-Yishai  and Hansel, Sompolinsky . To achieve this goal, we first have to better understand the local model, that is the model of a ‘texture’ hypercolumn isolated from its neighbours.
The aim of this paper is to present a rigorous mathematical framework for the modeling of the representation of the structure tensor by neuronal populations in V1. We would also like to point out that the mathematical analysis we are developing here, is general and could be applied to other integro-differential equations defined on the set of structure tensors, so that even if the structure tensor were found to be not represented in a hypercolumn of V1, our framework would still be relevant. We then concentrate on the occurence of localized states, also called bumps. This is in contrast to the work of  and  where ‘spatially’ periodic solutions were considered. The structure of this paper is as follows. In Section 2 we introduce the structure tensor model and the corresponding equations. We also link our model to the ring model of orientations. In Section 3 we use classical tools of evolution equations in functional spaces to analyse the problem of the existence and uniqueness of the solutions of our equations. In Section 4 we study stationary solutions which are very important for the dynamics of the equation by analysing a nonlinear convolution operator and making use of the Haar measure of our feature space. In Section 5, we push further the study of stationary solutions in a special case and we present a technical analysis involving hypergeometric functions of what we call a hyperbolic radially symmetric stationary-pulse in the high gain limit. Finally, in Section 6, we present some numerical simulations of the solutions to verify the findings of the theoretical results.
2 The model
By definition, the structure tensor is based on the spatial derivatives of an image in a small area that can be thought of as part of a receptive field. These spatial derivatives are then summed nonlinearly over the receptive field. Let denote the original image intensity function, where x and y are two spatial coordinates. Let denote the scale-space representation of I obtained by convolution with the Gaussian kernel :
The gradient is a two-dimensional vector of coordinates which emphasizes image edges. One then forms the symmetric matrix of rank one , where T indicates the transpose of a vector. The set of symmetric positive semidefinite matrices of rank one will be noted throughout the paper (see  for a complete study of the set of symmetric positive semidefinite matrices of fixed-rank ). By convolving componentwise with a Gaussian we finally form the tensor structure as the symmetric matrix:
where we have set for example:
Since the computation of derivatives usually involves a stage of scale-space smoothing, the definition of the structure tensor requires two scale parameters. The first one, defined by , is a local scale for smoothing prior to the computation of image derivatives. The structure tensor is insensitive to noise and details at scales smaller than . The second one, defined by , is an integration scale for accumulating the nonlinear operations on the derivatives into an integrated image descriptor. It is related to the characteristic size of the texture to be represented, and to the size of the receptive fields of the neurons that may represent the structure tensor.
By construction, is symmetric and non negative as by the inequality of Cauchy-Schwarz, then it has two orthonormal eigenvectors , and two non negative corresponding eigenvalues and which we can always assume to be such that . Furthermore the spatial averaging distributes the information of the image over a neighborhood, and therefore the two eigenvalues are always positive. Thus, the set of the structure tensors lives in the set of symmetric positive definite matrices, noted throughout the paper. The distribution of these eigenvalues in the plane reflects the local organization of the image intensity variations. Indeed, each structure tensor can be written as the linear combination:
where is the identity matrix and . Some easy interpretations can be made for simple examples: constant areas are characterized by , straight edges are such that , their orientation being that of , corners yield . The coherency c of the local image is measured by the ratio , large coherency reveals anisotropy in the texture.
We assume that a hypercolumn of V1 can represent the structure tensor in the receptive field of its neurons as the average membrane potential values of some of its membrane populations. Let be a structure tensor. The time evolution of the average potential for a given column is governed by the following neural mass equation adapted from  where we allow the connectivity function W to depend upon the time variable t and we integrate over the set of symmetric definite-positive matrices:
The nonlinearity S is a sigmoidal function which may be expressed as:
where μ describes the stiffness of the sigmoid. is an external input.
The set can be seen as a foliated manifold by way of the set of special symmetric positive definite matrices . Indeed, we have: . Furthermore, , where is the Poincaré Disk, see, for example, . As a consequence we use the following foliation of , which allows us to write for all with . z and Δ are related by the relation and the fact that z is the representation in of with .
It is well-known  that (and hence SSPD(2)) is a two-dimensional Riemannian space of constant sectional curvature equal to −1 for the distance noted defined by
The isometries of , that are the transformations that preserve the distance are the elements of unitary group . In Appendix A we describe the basic structure of this group. It follows, for example, [7, 18], that SDP(2) is a three-dimensional Riemannian space of constant sectional curvature equal to −1 for the distance noted defined by
As shown in Proposition B.0.1 of Appendix B it is possible to express the volume element in coordinates with :
We note and equation (2) can be written in coordinates:
We get rid of the constant by redefining W as .
In , we have assumed that the representation of the local image orientations and textures is richer than, and contains, the local image orientations model which is conceptually equivalent to the direction of the local image intensity gradient. The richness of the structure tensor model has been expounded in . The embedding of the ring model of orientation in the structure tensor model can be explained by the intrinsic relation that exists between the two sets of matrices and . First of all, when goes to zero, that is when the characteristic size of the structure becomes very small, we have , which means that the tensor degenerates to a tensor , which can be interpreted as the loss of one dimension. We can write each as , where and is the polar representation of x. Since, x and −x correspond to the same θ is equated to . Thus , where is the real projective space of dimension 1 (lines of ). Then the integration scale , at which the averages of the estimates of the image derivatives are computed, is the link between the classical representation of the local image orientations by the gradient and the representation of the local image textures by the structure tensor. It is also possible to highlight this explanation by coming back to the interpretation of straight edges of the previous paragraph. When then and the orientation is that of . We denote by the projection of a symmetric definite positive matrix on the set defined by:
where is as in equation (1). We can introduce a metric on the set which is derived from a well-chosen Riemannian quotient geometry (see ). The resulting Riemannian space has strong geometrical properties: it is geodesically complete and the metric is invariant with respect to all transformations that preserve angles (orthogonal transformations, scalings and pseudoinversions). Related to the decomposition , a metric on the space is given by:
The space endowed with this metric is a Riemannian manifold (see ). Finally, the distance associated to this metric is given by:
where and denotes the polar coordinates of for . The volume element in coordinates is:
where we normalize to 1 the volume element for the θ coordinate.
Let now be a symmetric positive semidefinite matrix. The average potential of the column has its time evolution that is governed by the following neural mass equation which is just a projection of equation (2) on the subspace :
In coordinates, (4) is rewritten as:
This equation is richer than the ring model of orientation as it contains an additional information on the contrast of the image in the orthogonal direction of the prefered orientation. If one wants to recover the ring model of orientation tuning in the visual cortex as it has been presented and studied by [1, 2, 19], it is sufficient i) to assume that the connectivity function is time-independent and has a convolutional form:
and ii) to look at semi-homogeneous solutions of equation (4), that is, solutions which do not depend upon the variable r. We finally obtain:
It follows from the above discussion that the structure tensor contains, at a given scale, more information than the local image intensity gradient at the same scale and that it is possible to recover the ring model of orientations from the structure tensor model.
The aim of the following sections is to establish that (3) is well-defined and to give necessary and sufficient conditions on the different parameters in order to prove some results on the existence and uniqueness of a solution of (3).
3 The existence and uniqueness of a solution
In this section we provide theoretical and general results of existence and uniqueness of a solution of (2). In the first subsection (Section 3.1) we study the simpler case of the homogeneous solutions of (2), that is, of the solutions that are independent of the tensor variable . This simplified model allows us to introduce some notations for the general case and to establish the useful Lemma 3.1.1. We then prove in Section 3.2 the main result of this section, that is the existence and uniqueness of a solution of (2). Finally we develop the useful case of the semi-homogeneous solutions of (2), that is, of solutions that depend on the tensor variable but only through its z coordinate in .
3.1 Homogeneous solutions
A homogeneous solution to (2) is a solution V that does not depend upon the tensor variable for a given homogenous input and a constant initial condition . In coordinates, a homogeneous solution of (3) is defined by:
Hence necessary conditions for the existence of a homogeneous solution are that:
the double integral (6) is convergent,
does not depend upon the variable . In that case, we write instead of .
In the special case where is a function of only the distance between and :
the second condition is automatically satisfied. The proof of this fact is given in Lemma D.0.2 of Appendix D. To summarize, the homogeneous solutions satisfy the differential equation:
3.1.1 A first existence and uniqueness result
Equation (3) defines a Cauchy’s problem and we have the following theorem.
Theorem 3.1.1 If the external inputand the connectivity functionare continuous on some closed interval J containing 0, then for allin, there exists a unique solution of (7) defined on a subintervalof J containing 0 such that.
Proof It is a direct application of Cauchy’s theorem on differential equations. We consider the mapping defined by:
It is clear that f is continuous from to . We have for all and :
Since, is continuous on the compact interval J, it is bounded there by and:
We can extend this result to the whole time real line if I and are continuous on .
Proposition 3.1.1 Ifandare continuous on, then for allin, there exists a unique solution of (7) defined onsuch that.
Proof We have already shown the following inequality:
Then f is locally Lipschitz with respect to its second argument. Let V be a maximal solution on and we denote by β the upper bound of . We suppose that . Then we have for all :
This implies that the maximal solution V is bounded for all , but Theorem C.0.2 of Appendix C ensures that it is impossible. Then, it follows that necessarily . □
3.1.2 Simplification of (6) in a special case
Invariance In the previous section, we have stated that in the special case where W was a function of the distance between two points in , then did not depend upon the variables . As already said in the previous section, the following result holds (see proof of Lemma D.0.2 of Appendix D).
Lemma 3.1.1 Suppose that W is a function ofonly. Thendoes not depend upon the variable.
Mexican hat connectivity In this paragraph, we push further the computation of in the special case where W does not depend upon the time variable t and takes the special form suggested by Amari in , commonly referred to as the ‘Mexican hat’ connectivity. It features center excitation and surround inhibition which is an effective model for a mixed population of interacting inhibitory and excitatory neurons with typical cortical connections. It is also only a function of .
In detail, we have:
with and .
In this case we can obtain a very simple closed-form formula for as shown in the following lemma.
Lemma 3.1.2 When W is the specific Mexican hat function just defined then:
where erf is the error function defined as:
Proof The proof is given in Lemma E.0.3 of Appendix E. □
3.2 General solution
We now present the main result of this section about the existence and uniqueness of solutions of equation (2). We first introduce some hypotheses on the connectivity function W. We present them in two ways: first on the set of structure tensors considered as the set SPD(2), then on the set of tensors seen as . Let J be a subinterval of . We assume that:
(H1): , ,
(H2): where W is defined as for all where is the identity matrix of ,
(H3): , where .
Equivalently, we can express these hypotheses in coordinates:
(H1 bis): , ,
(H2 bis): where W is defined as for all ,
(H3 bis): , where
3.2.1 Functional space setting
We introduce the following mapping such that:
Our aim is to find a functional space where (3) is well-defined and the function maps to for all t s. A natural choice would be to choose ϕ as a -integrable function of the space variable with . Unfortunately, the homogeneous solutions (constant with respect to ) do not belong to that space. Moreover, a valid model of neural networks should only produce bounded membrane potentials. That is why we focus our choice on the functional space . As is an open set of , is a Banach space for the norm: .
Proposition 3.2.1 Ifwithand W satisfies hypotheses (H 1bis)-(H 3bis) thenis well-defined and is fromto.
Proof, we have:
3.2.2 The existence and uniqueness of a solution of (3)
We rewrite (3) as a Cauchy problem:
Theorem 3.2.1 If the external currentbelongs towith J an open interval containing 0 and W satisfies hypotheses (H 1bis)-(H 3bis), then fo all, there exists a unique solution of (10) defined on a subintervalof J containing 0 such thatfor all.
Proof We prove that is continuous on . We have
Because of condition (H2) we can choose small enough so that is arbitrarily small. This proves the continuity of . Moreover it follows from the previous inequality that:
with . This ensures the Lipschitz continuity of with respect to its second argument, uniformly with respect to the first. The Cauchy-Lipschitz theorem on a Banach space yields the conclusion. □
Remark 3.2.1 Our result is quite similar to those obtained by Potthast and Graben in. The main differences are that first we allow the connectivity function to depend upon the time variable t and second that our space features is no longer abut a Riemanian manifold. In their article Potthast and Graben also work with a different functional space by assuming more regularity for the connectivity function W and then obtain more regularity for their solutions.
Proposition 3.2.2 If the external currentbelongs toand W satisfies hypotheses (H 1bis)-(H 3bis) with, then for all, there exists a unique solution of (10) defined onsuch thatfor all.
Proof We have just seen in the previous proof that is globally Lipschitz with respect to its second argument:
then Theorem C.0.3 of Appendix C gives the conclusion. □
3.2.3 The intrinsic boundedness of a solution of (3)
In the same way as in the homogeneous case, we show a result on the boundedness of a solution of (3).
Proposition 3.2.3 If the external currentbelongs toand is bounded in timeand W satisfies hypotheses (H 1bis)-(H 3bis) with, then the solution of (10) is bounded for each initial condition.
Let us set:
Proof Let V be a solution defined on . Then we have for all :
The following upperbound holds
We can rewrite (11) as:
If this implies for all and hence for all , proving that is stable. Now assume that for all . The inequality (12) shows that for t large enough this yields a contradiction. Therefore there exists such that . At this time instant we have
The following corollary is a consequence of the previous proposition.
Corollary 3.2.1 Ifandthen:
3.3 Semi-homogeneous solutions
A semi-homogeneous solution of (3) is defined as a solution which does not depend upon the variable Δ. In other words, the populations of neurons is not sensitive to the determinant of the structure tensor, that is to the contrast of the image intensity. The neural mass equation is then equivalent to the neural mass equation for tensors of unit determinant. We point out that semi-homogeneous solutions were previously introduced in  where a bifurcation analysis of what they called H-planforms was performed. In this section, we define the framework in which their equations make sense without giving any proofs of our results as it is a direct consequence of those proven in the general case. We rewrite equation (3) in the case of semi-homogeneous solutions:
We have implicitly made the assumption, that does not depend on the coordinate Δ. Some conditions under which this assumption is satisfied are described below and are the direct transductions of those of the general case in the context of semi-homogeneous solutions.
Let J be an open interval of . We assume that:
(C1): , ,
(C2): where is defined as for all ,
(C3): where .
Note that conditions (C1)-(C2) and Lemma 3.1.1 imply that for all , . And then, for all , the mapping is integrable on .
From now on, and the Fischer-Riesz’s theorem ensures that is a Banach space for the norm: .
Theorem 3.3.1 If the external currentbelongs towith J an open interval containing 0 andsatisfies conditions (C 1)-(C 3), then for all, there exists a unique solution of (13) defined on a subintervalof J containing 0.
This solution, defined on the subinterval J of can in fact be extended to the whole real line, and we have the following proposition.
Proposition 3.3.1 If the external currentbelongs toandsatisfies conditions (C 1)-(C 3) with, then for all, there exists a unique solution of (13) defined on.
We can also state a result on the boundedness of a solution of (13):
Proposition 3.3.2 Let, with. The open ballofof center 0 and radius ρ is stable under the dynamics of equation (13). Moreover it is an attracting set for this dynamics and ifandthen:
4 Stationary solutions
We look at the equilibrium states, noted of (3), when the external input I and the connectivity W do not depend upon the time. We assume that W satisfies hypotheses (H1 bis)-(H2 bis). We redefine for convenience the sigmoidal function to be:
so that a stationary solution (independent of time) satisfies:
We define the nonlinear operator from to , noted , by:
Finally, (14) is equivalent to:
4.1 Study of the nonlinear operator
We recall that we have set for the Banach space and Proposition 3.2.1 shows that . We have the further properties:
Proposition 4.1.1satisfies the following properties:
is continuous on.
Proof The first property was shown to be true in the proof of Theorem 3.3.1. The second property follows from the following inequality:
We denote by and the two operators from to defined as follows for all and all :
where H is the Heaviside function.
It is straightforward to show that both operators are well-defined on and map to . Moreover the following proposition holds.
Proposition 4.1.2 We have
Proof It is a direct application of the dominated convergence theorem using the fact that:
4.2 The convolution form of the operator in the semi-homogeneous case
It is convenient to consider the functional space to discuss semi-homogeneous solutions. A semi-homogeneous persistent state of (3) is deduced from (14) and satisfies:
where the nonlinear operator from to is defined for all and by:
We define the associated operators, :
We rewrite the operator in a convenient form by using the convolution in the hyperbolic disk. First, we define the convolution in a such space. Let O denote the center of the Poincaré disk that is the point represented by and dg denote the Haar measure on the group (see  and Appendix A), normalized by:
for all functions of . Given two functions in we define the convolution ∗ by:
We recall the notation .
Proposition 4.2.1 For allandwe have:
Proof We only prove the result for . Let , then:
and for all , so that:
Let b be a point on the circle . For , we define the ‘inner product’ to be the algebraic distance to the origin of the (unique) horocycle based at b through z (see ). Note that does not depend on the position of z on the horocycle. The Fourier transform in is defined as (see ):
for a function such that this integral is well-defined.
Lemma 4.2.1 The Fourier transform in, ofdoes not depend upon the variable.
Proof For all and ,
We recall that for all is the rotation of angle ϕ and we have , and , then:
We now introduce two functions that enjoy some nice properties with respect to the Hyperbolic Fourier transform and are eigenfunctions of the linear operator .
Proposition 4.2.2 Letandthen:
Proof We begin with and use the horocyclic coordinates. We use the same changes of variables as in Lemma 3.1.1:
By rotation, we obtain the property for all .
For the second property , Lemma 4.7] shows that:
A consequence of this proposition is the following lemma.
Lemma 4.2.2 The linear operatoris not compact and for all, the nonlinear operatoris not compact.
Proof The previous Proposition 4.2.2 shows that has a continuous spectrum which iimplies that is not a compact operator.
Let U be in , for all we differentiate and compute its Frechet derivative:
If we assume further that U does not depend upon the space variable z, we obtain:
If was a compact operator then its Frechet derivative would also be a compact operator, but it is impossible. As a consequence, is not a compact operator. □
4.3 The convolution form of the operator in the general case
We adapt the ideas presented in the previous section in order to deal with the general case. We recall that if H is the group of positive real numbers with multiplication as operation, then the Haar measure dh is given by . For two functions , in we define the convolution ⋆ by:
We recall that we have set by definition: .
Proposition 4.3.1 For allandwe have:
Proof Let be in . We follow the same ideas as in Proposition 4.2.1 and prove only the first result. We have
We next assume further that the function W is separable in z, Δ and more precisely that where and for all . The following proposition is an echo to Proposition 4.2.2.
Proposition 4.3.2 Let, andthen:
whereis the usual Fourier transform of.
Proof The proof of this proposition is exactly the same as for Proposition 4.2.2. Indeed:
A straightforward consequence of this proposition is an extension of Lemma 4.2.2 to the general case:
Lemma 4.3.1 The linear operatoris not compact and for all, the nonlinear operatoris not compact.
4.4 The set of the solutions of (14)
Let be the set of the solutions of (14) for a given slope parameter μ:
We have the following proposition.
Proposition 4.4.1 If the input currentis equal to a constant, that is, does not depend upon the variablesthen for all, . In the general case, if the conditionis satisfied, then.
Proof Due to the properties of the sigmoid function, there always exists a constant solution in the case where is constant. In the general case where , the statement is a direct application of the Banach fixed point theorem, as in . □
Remark 4.4.1 If the external input does not depend upon the variablesand if the conditionis satisfied, then there exists a unique stationary solution by application of Proposition 4.4.1. Moreover, this stationary solution does not depend upon the variablesbecause there always exists one constant stationary solution when the external input does not depend upon the variables. Indeed equation (14) is then equivalent to:
and β does not depend upon the variablesbecause of Lemma 3.1.1. Because of the property of the sigmoid function S, the equationhas always one solution.
If on the other hand the input current does depend upon these variables is invariant under the action of a subgroup ofthe group of the isometries of (see Appendix A), and the conditionis satisfied then the unique stationary solution will also be invariant under the action of the same subgroup. We refer the interested reader to our workon equivariant bifurcation of hyperbolic planforms on the subject.
When the conditionis satisfied we call primary stationary solution the unique solution in.
4.5 Stability of the primary stationary solution
In this subsection we show that the condition guarantees the stability of the primary stationary solution to (3).
Theorem 4.5.1 We suppose thatand that the conditionis satisfied, then the associated primary stationary solution of (3) is asymtotically stable.
Proof Let be the primary stationary solution of (3), as is satisfied. Let also be the unique solution of the same equation with some initial condition , see Theorem 3.3.1. We introduce a new function which satisfies:
where and the vector is given by with . We note that, because of the definition of Θ and the mean value theorem . This implies that for all .
If we set: , then we have:
and G is continuous for all . The Gronwall inequality implies that:
and the conclusion follows. □
5 Spatially localised bumps in the high gain limit
In many models of working memory, transient stimuli are encoded by feature-selective persistent neural activity. Such stimuli are imagined to induce the formation of a spatially localised bump of persistent activity which coexists with a stable uniform state. As an example, Camperi and Wang  have proposed and studied a network model of visuo-spatial working memory in prefontal cortex adapted from the ring model of orientation of Ben-Yishai and colleagues . Many studies have emerged in the past decades to analyse these localised bumps of activity [25–29], see the paper by Coombes for a review of the domain . In [25, 26, 28], the authors have examined the existence and stability of bumps and multi-bumps solutions to an integro-differential equation describing neuronal activity along a single spatial domain. In [27, 29] the study is focused on the two-dimensional model and a method is developed to approximate the integro-differential equation by a partial differential equation which makes possible the determination of the stability of circularly symmetric solutions. It is therefore natural to study the emergence of spatially localized bumps for the structure tensor model in a hypercolumn of V1. We only deal with the reduced case of equation (13) which means that the membrane activity does not depend upon the contrast of the image intensity, keeping the general case for future work.
In order to construct exact bump solutions and to compare our results to previous studies [25–29], we consider the high gain limit of the sigmoid function. As above we denote by H the Heaviside function defined by for and otherwise. Equation (13) is rewritten as:
We have introduced a threshold κ to shift the zero of the Heaviside function. We make the assumption that the system is spatially homogeneous that is, the external input I does not depend upon the variables t and the connectivity function depends only on the hyperbolic distance between two points of . For illustrative purposes, we will use the exponential weight distribution as a specific example throughout this section:
The theoretical study of equation (20) has been done in  where the authors have imposed strong regularity assumptions on the kernel function W, such as Hölder continuity, and used compactness arguments and integral equation techniques to obtain a global existence result of solutions to (20). Our approach is very different, we follow that of [25–29, 31] by proceeding in a constructive fashion. In a first part, we define what we call a hyperbolic radially symmetric bump and present some preliminary results for the linear stability analysis of the last part. The second part is devoted to the proof of a technical Theorem 5.1.1 which is stated in the first part. The proof uses results on the Fourier transform introduced in Section 4, hyperbolic geometry and hypergeometric functions. Our results will be illustrated in the following Section 6.
5.1 Existence of hyperbolic radially symmetric bumps
From equation (20) a general stationary pulse satisfies the equation:
For convenience, we note the integral with . The relation holds for all .
Definition 5.1.1 V is called a hyperbolic radially symmetric stationary-pulse solution of (20) if V depends only upon the variable r and is such that:
and is a fixed point of equation (20):
whereis a Gaussian input andis defined by the following equation:
andis a hyperbolic disk centered at the origin of hyperbolic radius ω.
From symmetry arguments there exists a hyperbolic radially symmetric stationary-pulse solution of (20), furthermore the threshold κ and width ω are related according to the self-consistency condition
The existence of such a bump can then be established by finding solutions to (23) The function is plotted in Figure 1 for a range of the input amplitude . The horizontal dashed lines indicate different values of ακ, the points of intersection determine the existence of stationary pulse solutions. Qualitatively, for sufficiently large input amplitude we have and it is possible to find only one solution branch for large ακ. For small input amplitudes we have and there always exists one solution branch for . For intermediate values of the input amplitude , as αβ varies, we have the possiblity of zero, one or two solutions. Anticipating the stability results of Section 5.3, we obtain that when then the corresponding solution is stable.
We end this subsection with the usefull and technical following formula.
Theorem 5.1.1 For all:
where is the Fourier Helgason transform of and
withand F is the hypergeometric function of first kind.
Remark 5.1.1 We recall that F admits the integral representation:
Remark 5.1.2 In Section 4we introduced the function. In, it is shown that:
Remark 5.1.3 Let us point out that this result can be linked to the work of Folias and Bressloff inand then used in. They constructed a two-dimensional pulse for a general radially symmetric synaptic weight function. They obtain a similar formal representation of the integral of the connectivity function w over the diskcentered at the origin O and of radius a. Using their notations
whereis the Bessel function of the first kind andis the real Fourier transform of w. In our case instead of the Bessel function we findwhich is linked to the hypergeometric function of the first kind.
We now show that for a general monotonically decreasing weight function W, the function is necessarily a monotonically decreasing function of r. This will ensure that the hyperbolic radially symmetric stationary-pulse solution (22) is also a monotonically decreasing function of r in the case of a Gaussian input. The demonstration of this result will directly use Theorem 5.1.1.
Proposition 5.1.1 V is a monotonically decreasing function in r for any monotonically decreasing synaptic weight function W.
Proof Differentiating with respect to r yields:
We have to compute
It is result of elementary hyperbolic trigonometry that
we let , and define
It follows that
We conclude that if then for all and
which implies for , since .
To see that it is also negative for , we differentiate equation (24) with respect to r:
The following formula holds for the hypergeometric function (see Erdelyi in ):
Substituting in the previous equation giving we find:
Consequently, for . Hence V is monotonically decreasing in r for any monotonically decreasing synaptic weight function W. □
As a consequence, for our particular choice of exponential weight function (21), the radially symmetric bump is monotonically decreasing in r, as it will be recover in our numerical experiments in Section 6.
5.2 Proof of Theorem 5.1.1
The proof of Theorem 5.1.1 goes in four steps. First we introduce some notations and recall some basic properties of the Fourier transform in the Poincaré disk. Second we prove two propositions. Third we state a technical lemma on hypergeometric functions, the proof being given in Lemma F.0.4 of Appendix F. The last step is devoted to the conclusion of the proof.
5.2.1 First step
In order to calculate , we use the Fourier transform in which has already been introduced in Section 4. First we rewrite as a convolution product:
Proposition 5.2.1 For all:
Proof We start with the definition of and use the convolutional form of the integral:
In , Helgason proves an inversion formula for the hyperbolic Fourier transform and we apply this result to W:
the last equality is a direct application of Lemma 4.2.1 and we can deduce that
Finally we have:
which is the desired formula. □
It appears that the study of consists in calculating the convolution product .
Proposition 5.2.2 For allforwe have:
Proof Let for we have:
for all , so that:
5.2.2 Second step
In this part, we prove two results:
the mapping is a radial function, that is, it depends only upon the variable r.
the following equality holds for :
Proposition 5.2.3 Ifand z is writtenwithin hyperbolic polar coordinates the functiondepends only upon the variable r.
Proof If , then and . Similarly . We can write thanks to the previous Proposition 5.2.2:
which, as announced, is only a function of r. □
We now give an explicit formula for the integral .
Proposition 5.2.4 For allwe have:
Proof We first recall a formula from .
Lemma 5.2.1 For all the following equation holds:
Proof See . □
It follows immediately that for all and we have:
We integrate this formula over the hyperbolic ball which gives:
and we exchange the order of integration:
We note that the integral does not depend upon the variable . Indeed:
and indeed the integral does not depend upon the variable b:
Finally, we can write:
because (as solutions of the same equation).
This completes the proof that:
5.2.3 Third step
We state a useful formula.
Lemma 5.2.2 For allthe following formula holds:
Proof See Lemma F.0.4 of Appendix F. □
5.2.4 The main result
At this point we have proved the following proposition thanks to Propositions 5.2.1 and 5.2.4.
Proposition 5.2.5 If, is given by the following formula:
We are now in a position to obtain the analytic form for of Theorem 5.1.1. We prove that
Indeed, in hyperbolic polar coordinates, we have:
On the other hand:
and we use Lemma 5.2.2 to establish (24).
5.3 Linear stability analysis
We now analyse the evolution of small time-dependent perturbations of the hyperbolic stationary-pulse solution through linear stability analysis. We use classical tools already developped in [29, 31].
5.3.1 Spectral analysis of the linearized operator
Equation (20) is linearized about the stationary solution by introducing the time-dependent perturbation:
This leads to the linear equation:
We separate variables by setting to obtain the equation:
Introducing the hyperbolic polar coordinates and using the result:
Note that we have formally differentiated the Heaviside function, which is permissible since it arises inside a convolution. One could also develop the linear stability analysis by considering perturbations of the threshold crossing points along the lines of Amari . Since we are linearizing about a stationary rather than a traveling pulse, we can analyze the spectrum of the linear operator without the recourse to Evans functions.
With a slight abuse of notation we are led to study the solutions of the integral equation:
where the following equality derives from the definition of the hyperbolic distance in equation (25):
Essential spectrum If the function ϕ satisfies the condition
then equation (28) reduces to:
yielding the eigenvalue:
This part of the essential spectrum is negative and does not cause instability.
Discrete spectrum If we are not in the previous case we have to study the solutions of the integral equation (28).
This equation shows that is completely determined by its values on the circle of equation . Hence, we need only to consider , yielding the integral equation:
The solutions of this equation are exponential functions , where γ satisfies:
By the requirement that ϕ is 2π-periodic in θ, it follows that , where . Thus the integral operator with kernel has a discrete spectrum given by:
is real since:
We can state the folliwing proposition:
Proposition 5.3.1 Provided that for all, then the hyperbolic stationary pulse is stable.
We now derive a reduced condition linking the parameters for the stability of hyperbolic stationary pulse.
Reduced condition Since is a positive function of r, it follows that:
Stability of the hyperbolic stationary pulse requires that for all , . This can be rewritten as:
Using the fact that for all , we obtain the reduced stability condition:
From (22) we have: