 Short report
 Open Access
 Published:
Stability analysis of a neural field selforganizing map
The Journal of Mathematical Neuroscience volume 10, Article number: 20 (2020)
Abstract
We provide theoretical conditions guaranteeing that a selforganizing map efficiently develops representations of the input space. The study relies on a neural field model of spatiotemporal activity in area 3b of the primary somatosensory cortex. We rely on Lyapunov’s theory for neural fields to derive theoretical conditions for stability. We verify the theoretical conditions by numerical experiments. The analysis highlights the key role played by the balance between excitation and inhibition of lateral synaptic coupling and the strength of synaptic gains in the formation and maintenance of selforganizing maps.
1 Introduction
Selforganizing maps (SOMs) are neural networks mapping a highdimensional space to a lowdimensional one through unsupervised learning. They were first introduced by Grossberg [14] and later by Kohonen [19]. SOMs are widely used in computer science and data analysis for quantization and visualization of highdimensional data [25, 38]. They also constitute a suitable tool in computational neuroscience to study the formation and maintenance of topographic maps in primary sensory cortices such as the visual cortex [24, 31] and the somatosensory cortex [13, 34]. Many variations and applications of Kohonen’s SOM algorithm can be found in [16] and [27].
A type of selforganizing map based on neural fields theory has been introduced in [8], where neural fields are used to drive the selforganizing process. Neural fields are integrodifferential equations that describe the spatiotemporal dynamics of a cortical sheet [3–5]. The SOM proposed in [8] describes the topographic organization of area 3b of the primary somatosensory cortex of monkeys [21, 28]. The model relies on an earlier work [29] known as the dynamic SOM (DSOM) algorithm. DSOM provides an online SOM learning algorithm, where the Kohonen’s SOM timedependent learning rate and neighborhood function have been replaced by timeinvariant ones. The DSOM neighborhood function and learning rate solely depend on the distance of the winner unit (i.e., the most active neuron) from the input. The model proposed in [8, 9] combines the DSOM timeinvariant learning rate and neighborhood function with Oja’s learning rule [26]. As thoroughly described in [8, 9], the model is compatible with anatomical evidence of how area 3b in monkeys develops, maintains, and reorganizes topographic representations of a skin patch of the index finger.
In this work, we provide theoretical insights on the stability and convergence of the neural field SOM algorithm proposed in [8, 9] by studying a more general class of systems than that originally proposed in [8]. We use Lyapunov’s stability theory adapted to neural field dynamics [11]. Since typical activation functions employed in the model (such as absolute values or rectification functions) are not necessarily differentiable, we do not rely on linearization techniques but rather directly assess the stability of the original nonlinear dynamics. Yet, the obtained results are local, meaning that they are valid only for initial conditions in the vicinity of the considered equilibrium. Nonetheless, we show that they agree with numerical simulations. The stability conditions derived in this work can be used toward the direction of tuning neural field models such that they achieve the best possible results in developing selforganizing maps and thus more generalized representations. Moreover, the conditions we propose indicate that the balance between lateral excitation and inhibition keeps the system stable, thus ruling out possible configurations in which learning does not take place properly. These findings are in line with both experimental observations [18, 30] and computational modeling [35–37].
The paper is organized as follows. In Sect. 2, we recall the SOM model under concern and its basic mechanisms. In Sect. 3, we present our main theoretical results, which we confront to numerical simulations in Sect. 4. A discussion on the obtained results is provided in Sect. 5. Mathematical proofs are given in Sect. 6.
2 Selforganizing neural fields
2.1 Neural population dynamics
We consider the following neural fields equation:
where Ω is a connected compact subset of \(\mathbb {R}^{q}\) (\(q=1,2,3\)). For \(q=2\), the integral of a function \(g:\Omega =\Omega _{1}\times \Omega _{2}\to \mathbb{R}\) is to be understood as \(\int _{\Omega }g(r)\,dr=\int _{\Omega _{1}}\int _{\Omega _{2}} g(r_{1},r_{2})\,dr_{2}\,dr_{1}\) with \(r=(r_{1},r_{2})\), and similarly for \(q=3\); \(u(r,t)\) represents the mean membrane potential at position \(r\in \Omega \) and time \(t\geq 0\), τ is a positive decay time constant, I denotes an external input, and \(w_{l}\) is a function that represents the strength of lateral synaptic coupling. It is given by
where the excitation and inhibition synaptic weights are typically given by
and
with \(K_{e},K_{i},\sigma _{e},\sigma _{i}>0\). In [8, 9] the input is provided through a twodimensional skin model. The skin model is composed of a twodimensional grid and receptors. The receptors are points distributed on the surface of the grid (uniformly). When a stimulus is applied on the grid, the receptors sample the input signal and convey the information to the cortical model. The skin stimulus is a noisy Gaussianlike function, and the input to the neural fields model is provided by the following function:
where \(\cdot _{1}\) denotes the 1norm: \(x_{1}=\sum_{i=1}^{m} x_{i}\), and \(s:\mathbb{R}^{2} \rightarrow [0,1]^{m}\) is a function that maps the raw input from the twodimensional skin space to \([0,1]^{m}\). For instance, for a tactile stimulus at position \(p\in \mathbb {R}^{2}\) on the skin, \(s(p)\in \mathbb {R}^{m}\) could be defined as the normalized distance from p to the location of each receptor, thus potentially of much higher dimension than 2. For a more detailed description of receptor model, see [9]. The function \(w_{f}: \Omega \times \mathbb {R}_{\geq 0}\to \mathbb {R}^{m}\) represents feedforward synaptic weights with value updated according to
where γ is a positive constant that represents the learning rate, and \(\operatorname{rect}(x) = \max \{x, 0\}\). It is worth observing that since \(s(p)\in [0,1]^{m}\), \(w_{f}(r,t)\in [0,1]^{m}\) for all \(r\in \Omega \) and \(t\geq 0\) given any initial conditions satisfying \(w_{f}(r,0)\in [0,1]^{m}\) for all \(r\in \Omega \) (this can be seen by observing that the entries of \(\frac{\partial w_{f}}{\partial t}(r,t)\) are negative as soon as the corresponding entries of \(w_{f}(r,t)\) become greater than 1; similarly, they are positive when the corresponding entries of \(w_{f}(r,t)\) get below 0: see (5)). Hence \(\frac{w_{f}(r,t)s(p)_{1}}{m}\in [0,1]\) at all times. Thus expression (4) can be interpreted as a high input when the feedforward weights are close to \(s(p)\) and as a lower input when these are more distant.
The overall model (1), (4), (5) reflects the dynamics of a cortical neural population in combination with a learning rule of the feedforward connections \(w_{f}\), which convey information from receptors to the cortical sheet. As described in [8, 9], this model can express a variety of different behaviors, depending on the lateral connectivity kernels \(w_{e}\) and \(w_{i}\).
The main advantage of the learning rule given by Eq. (5) is that it is a biologically plausible modification of the DSOM learning rule [29]. In DSOM the learning rate and neighborhood function are timeinvariant and can adapt to the input according to one single parameter, called elasticity. This particular modification leads to the following behavior: if the winner neuron (i.e., the neuron that has the shortest distance from the input stimulus to its corresponding codebook–weight) is close to the stimulus, then the neighborhood function shrinks around it. This results in making the weights of neurons within the dynamic neighborhood stronger and the weights of the other units weaker. However, when the winning unit is very far from the current input, the neighborhood function exhibits a broad activity pattern, promoting learning of every unit in the network. Therefore in [8] the neighborhood function has been replaced by the term \(\int _{\Omega }w_{e}(rr')\operatorname{rect}(u(r',t))\,dr'\), providing a more realistic and biological plausible learning algorithm for selforganizing maps in the context of neuroscience.
2.2 Selforganizing maps
We start by briefly describing how the SOM model introduced in [8] and [9] works. The algorithm starts by initializing the feedforward weights randomly (usually uniformly), and the neural field activity \(u(r, 0)\) is set to zero. The second step is sampling the input space by randomly drawn samples of dimension m from an input distribution. At every epoch, one sample is given to the neural field (1) and (5) through Eq. (4). This first step is depicted in Fig. 1(A), where a twodimensional point \(p=(p_{1}, p_{2})\) is sampled from a uniform distribution, \(p_{1}, p_{2} \sim \mathcal{U}(0, 1)\). The samples are mapped to the neural space through the function s and then are passed to Eq. (4). At this point, we should point out that there are two ways of presenting stimuli while training a selforganizing map. The first is predetermining an amount of input samples and present one at each epoch (online learning) and the second is collecting all the input samples into a batch and giving all of them at once to the network (batch learning). In this work, we use the former (online learning) since it is biologically plausible.
Then the algorithm proceeds with computing the numerical solution of Eqs. (1) and (5). To that aim, Eqs. (1) and (5) are discretized and solved numerically using Euler’s forward method. The numerical solution of Eq. (1) is typically a bellshaped curve (bump) centered on the neuron that is the closest unit to the input sample and therefore is called the winner neuron or best matching unit (BMU). In Fig. 1(B), this is depicted as a black disc on a discrete lattice. The lattice represents a discretization of the field where each tile corresponds to a neuron. Neurons that lie within the vicinity (within the black disc in Fig. 1(B)) defined by the solution of Eq. (1) update their weights based on Eq. (5). The rest of the neurons feedforward weights remain in their previous state. Once the temporal integration of Eqs. (1) and (5) is complete, the activity of the field is reset to its baseline activity. Then another input sample is drawn, and the whole process repeats itself. Once the number of epochs is exhausted, the learning stops, and the mapping process is completed.
To make the aforementioned algorithm directly comparable to Kohonen SOM [19], we provide some insights. First, in Kohonen’s SOM, we compute the distance between the input and the codebooks. Here we do the same using Eq. (4). The neighborhood function that Kohonen’s SOM uses to update the feedforward weights is replaced here by the numerical solution of the neural field (Eq. (1)) and more precisely by the term \(\int _{\Omega }w_{e}(rr')\operatorname{rect}(u(r', t))\,dr'\). Both the learning rate and the width of the neighborhood function are timeindependent in our case, as opposed to Kohonen’s SOM, where they are both timedependent. Our learning rule is different since we use a modified Oja rule [26], which is based on Hebbian learning [15], and it is therefore biologically plausible [1]. The dimensionality reduction in both models, the Kohonen and ours, takes place at the level of the learning rule. This means that Eq. (5) is responsible for learning the representations and mapping the input distribution (of dimensions m) on a manifold of lower dimension \(q\in \{1,2,3\}\).
3 Explicit conditions for stability
The most important question when one trains a selforganizing map is: Will the learning process converge and properly map the input space to the neural one? In most of the cases, it is not possible to predict this. However, in the specific case of the selforganizing algorithm provided by [8], here we show that it is possible to obtain an analytical condition that guarantees the stability of the equilibrium point of system (1)–(5). Stability during learning is a prerequisite to generate a meaningful mapping and thus a proper topographic map. Moreover, a byproduct of deriving such a stability condition is providing some insights on how to properly tune model parameters.
To this end, we now proceed to the mathematical analysis of the model. For generality, the adopted mathematical framework is slightly wider than merely Eqs. (1), (4), (5) and encompasses more general classes of activation functions and synaptic kernels. We start by introducing the considered class of systems and then provide sufficient conditions for its stability and convergence.
3.1 Model under study
The selforganizing neural field (1), (4), (5) is a particular case of the more general dynamics
where \(\tau ,\gamma >0\), \(w_{l},w_{e}\in L_{2}(\Omega ^{2},\mathbb{R})\), the set of all squareintegrable functions from \(\Omega ^{2}\) to \(\mathbb{R}\), and \(f_{e}\), \(f_{l}\) and \(f_{s}\) are Lipschitz continuous functions.
3.2 Existence of equilibrium patterns
Assuming that \(\inf_{r\in \Omega }\int _{\Omega } w_{e}(r,r') f_{e}({u^{*}}( r'))\,dr'\neq 0\), any equilibrium pattern \(({u^{*}},w_{f}^{*})\) of (6a) and (6b) satisfies the following equations:
Since \(\omega _{l}\in L_{2}(\Omega ^{2},\mathbb{R})\), [12, Theorem 3.6] ensures the existence of at least one such equilibrium pattern.
3.3 Stability analysis of Eq. (6a) and (6b)
We recall that an equilibrium \(x^{*}\) of a system \(\dot{x}(t)=f(x(t))\), where \(x(t):\Omega \to \mathbb {R}^{n}\) for each fixed \(t\geq 0\), is called globally exponentially stable if there exist \(k,\varepsilon >0\) such that, for all admissible initial conditions,
where \(\\cdot \\) denotes the spatial \(L_{2}\)norm. This property ensures that all solutions go to the equilibrium configuration \(x^{*}\) in the \(L_{2}\) sense (global convergence) and that the transient overshoot is proportional to the \(L_{2}\)norm of the distance between the initial configuration and the equilibrium (stability). The equilibrium pattern \(x^{*}\) is said to be locally exponentially stable if (8) holds only for solutions starting sufficiently near from it (in the \(L_{2}\) sense). We refer the reader to [11] for a deeper discussion on the stability analysis of neural fields.
Our main result proposes a sufficient condition for the local exponential stability of Eq. (6a) and (6b). Its proof is given in Sect. 6.1.
Theorem 1
Let Ω be a compact connected set of \(\mathbb {R}^{q}\), let \(w_{l}\in L_{2}(\Omega ^{2},\mathbb{R})\), and let \(w_{e}:\Omega ^{2}\to \mathbb {R}\) be a bounded function. Assume further that \(f_{l}\), \(f_{s}\), and \(f_{e}\) are Lipschitz continuous functions, and let \(\ell _{l}\) denote the Lipschitz constant of \(f_{l}\). Let \((u^{*},w_{f}^{*})\) denote any equilibrium of Eq. (6a) and (6b), as defined in Eq. (7a) and (7b). Then, under the conditions
and
the equilibrium pattern \((u^{*},w_{f}^{*})\) is locally exponentially stable for Eq. (6a) and (6b).
Condition (9) imposes that the synaptic weights of the lateral coupling \(w_{l}\) are sufficiently small: stronger lateral synaptic weights can be tolerated if the maximum slope \({\ell_{l}}\) of the activation function \(f_{l}\) is low enough, meaning that the system given by Eq. (6a) is less selfexcitable. Recall that if \(f_{l}\) is a differentiable function, then \(\ell _{l}\) can be picked as the maximum value of its derivative. Nonetheless, Theorem 1 does not impose such a differentiability requirement, thus allowing us to consider nonsmooth functions such as absolute values, saturations, or rectification functions. Note that it was shown in [33] that condition (9) ensures that the system owns a single equilibrium pattern. It is also worth stressing that the slopes of the functions \(f_{s}\) and \(f_{e}\) do not intervene in the stability conditions.
Condition (10) requires a sufficient excitation in the vicinity of the equilibrium \(u^{*}\). Roughly speaking, it imposes that the considered equilibrium pattern \(u^{*}\) does not lie in a region where \(f_{e}\) is zero.
3.4 Stability analysis of the SOM neural fields
Theorem 1 provides a stability condition for the model described by Eq. (6a) and (6b). We next apply it to the model given in [8] to derive more explicit and testable stability conditions. More precisely, the selforganizing neural fields (1), (4), (5) can be put in the form of Eq. (6a) and (6b) by letting \(f_{e}(x)=f_{l}(x)=\text{rect}(x)\), \(f_{s}(x)= 1\frac{x_{1}}{m}\), and
In view of (7a) and (7b), the equilibrium patterns of Eqs. (1), (4), (5) are given by
The Lipschitz constant of \(f_{l}\) is \(\ell _{l}=1\). Based on this, we can also derive the following corollary, whose proof is provided in Sect. 6.2.
Corollary 1
Assume that Ω is a compact connected set of \(\mathbb{R}^{q}\), and let \(w_{e}\), \(w_{i}\), and \(w_{l}\) be as in (11a)–(11c). Then, under the condition that
the equilibrium \((u^{*},w_{f}^{*})\), as defined in Eq. (12a) and (12b), is locally exponentially stable for Eqs. (1)–(5).
A particular case for which local exponential stability holds is when the excitation and inhibition weight functions are sufficiently balanced. Indeed, it appears clearly that Eq. (13) is fulfilled if \(K_{e}\simeq K_{i}\) and \(\sigma _{e}\simeq \sigma _{i}\). See the discussion in Sect. 5 for further physiological insights on this condition.
The integral involved in (13) can be solved explicitly. For instance, in the twodimensional case (\(q=2\)) the condition boils down to the following:
Corollary 2
Let \(\Omega =[a,b]\times [a,b]\) for some \(a,b\in \mathbb{R}\) with \(b\geq a\), and let \(w_{e}\), \(w_{i}\), and \(w_{l}\) be as in (11a)–(11c). Define
where \(\operatorname{Erf}:\mathbb{R}\to (1,1)\) denotes the Gauss error function. Then, under the condition
the equilibrium \((u^{*},w_{f}^{*})\), as defined in Eq. (12a) and (12b), is locally exponentially stable for Eq. (1)–(5).
Plenty of approximations are available for the Erf function in the literature. For instance, the following expression approximates it with a 5.10^{−4} error:
with \(a_{1} = 0.278393\), \(a_{2} = 0.230389\), \(a_{3} = 0.000972\), and \(a_{4} = 0.078108\); see, for instance, [2]. The Erf function is also commonly implemented in mathematical software, thus making Eq. (15) easily testable in practice.
4 Numerical assessment on a twodimensional map
To numerically assess whether the above stability condition correctly predicts the performance of the learning process, we focus on a simple example of a twodimensional map (\(q=2\)) and a twodimensional input space (\(n=2\)). Furthermore, we choose \(s(p)\) to be the identity function since we do not consider any receptors: the position of the tactile stimuli is assumed to be directly available. This choice is motivated by the fact that the presence or absence of a receptors grid does not affect the theoretical results of the current work. We refer to [8, 9] for a more complex application of the neural field selforganizing algorithm.
We sample twodimensional inputs from a uniform distribution. Therefore we have \(s_{i}(p) = (p_{1}, p_{2})\), where i indicates the ith sample, and \(p_{1}, p_{2} \sim \mathcal{U}(0, 1)\). In all our simulations, we use 7000 sample points and train the selforganizing map over each of them (7000 epochs). It is worth stressing the difference between the training time (epochs) and the simulation time. The former refers to the iterations over all the input samples (stimuli): one such input is presented to the model at each epoch. The latter is attributed to the numerical temporal integration of Eqs. (1)–(5). Thus each epoch corresponds to a predefined number of simulation steps. At the end of each epoch the activity of the neural field is reset to baseline activity before proceeding to the next epoch.
4.1 Parameters and simulation details
The neural fields equations are discretized using \(k = 40\times 40\) units. Accordingly, the twodimensional model (1)–(5) is simulated over a spatial uniform discretization \(\Omega _{d}\) of the spatial domain \(\Omega =[0,1]\times [0,1]\), namely \(\Omega _{d}=\bigcup_{i,j=1}^{40} (\frac{i}{40},\frac{j}{40})\). The considered input space, over which the stimuli are uniformly distributed, is also \([0, 1]\times [0,1]\) (twodimensional input vectors). The temporal integration is performed using the forward Euler method, whereas the spatial convolution in Eqs. (1)–(5) is computed via the fast Fourier transform (FFT). The learning process runs for 7000 epochs. The components of the feedforward weights are initialized from a uniform distribution \(\mathcal{U}(0, 0.01)\), and the neural field activity is set to zero. At each epoch, we feed a stimulus to Eqs. (1)–(5), and the system evolves according to its dynamics, whereas the feedforward weights are being updated. Then we reset the neural fields activity to zero. We run each experiment ten times using a different pseudorandom number generator (PRNG) seed each time (the PRNG seeds are given in Appendix 7: the same initial conditions and the set of PRNG seeds were used in each experimental condition).
The source code is written in Python (Numpy, Numba, Sklearn, and Matplotlibdependent) and are freely distributed under the GPL 3Clause License (https://github.com/gdetor/som_stability). All the parameters used in numerical simulations are summarized in Table 1. All simulations ran on an Intel NUC machine equipped with an Intel i710th generation processor and 32 GB of physical memory, running Ubuntu Linux (\(20.04.1\) LTS, Kernel: \(5.4.0\)47generic). The simulation of one selforganizing map consumes 493 MB of physical memory, and it took 2671 seconds to run the 7000 epochs.
4.2 SOM quality measures
We measure the quality of the selforganizing maps using two performance indicators, the distortion \(\mathcal{D}\) [6] and the \(\delta x\delta y\) representation [7]. We recall here that \(\Omega _{d}\) is the spatial uniform discretization of \(\Omega =[0, 1] \times [0, 1]\) and \(k=40 \times 40\) is the number of nodes (neurons). Furthermore, for each \(j\in \{1, \ldots , k\}\), \(w^{j}_{f}(t^{*})\) denotes the steadystate value of the feedforward weights at the jth node of the spatial discretization, and \(t^{*}\) corresponds to the time at the end of an epoch.
The distortion assesses the quality of a selforganizing map. It measures the loss of information over the learning process. In other words, it indicates how good a reconstruction of an input will be after the mapping of all inputs to a lowerdimensional neural map. In a sense, distortion measures how well a SOM algorithm “compresses” the input data with respect to the neighborhood structure. Mathematically, the distortion is computed according to its discrete approximation:
where n is the number of samples we use during the training of the selforganizing map.
Distortion is essentially an indicator of the map convergence, but it is not a reliable tool for assessing its quality. To gauge the quality of the map, we use the \(\delta x  \delta y\) representation [7]. It shows when a map preserves the topology of the input space and hence how well a topographic map is formed. To estimate the \(\delta x  \delta y\), we compute all the pairwise distances between the feedforward weights, \(\delta x = \delta x(i, j) = w_{f}^{i}(t^{*})  w_{f}^{j}(t^{*}) \), and all the distances between the nodes of the uniform discretization of the input space \([0, 1]^{2}\), \(\delta y(i, j) =y_{i}  y_{j}\) for \(i, j = 1, \ldots , k\), where \(y_{i}\) are the discrete nodes of \(\Omega _{d}\). We plot the \(\delta x  \delta y\) (i.e., δx is the ordinate, and δy the abscissa) along with a straight line, named \(\mathcal{L}_{\delta x  \delta y}\), that crosses the origin and the mean of δx points. If the point cloud representation of \(\delta x  \delta y\) closely follows the line \(\mathcal{L}_{\delta x  \delta y}\), then the map is considered wellformed and preserves the topology of the input space.
To quantify the \(\delta x  \delta y\) representation through a scalar performance index, we perform a linear regression on the point cloud of \(\delta x  \delta y\) without fitting the intercept (magenta line in figures), and we get a new line named \(\mathcal{L}_{\Delta }\). Then we define the measure \(\mathcal{P} = \sqrt{\sum_{i=1}^{k}(a_{i}  b_{i})^{2}}\), where \(a_{i} \in \mathcal{L}_{\delta _{x}\delta y}\) and \(b_{i} \in \mathcal{L}_{\Delta }\). Naturally, \(\mathcal{P}\) should approach zero as the two lines are getting closer, indicating that the selforganizing map respects the topology of the input space, and thus it is wellformed.
4.3 Stable case
We start by simulating the model described by Eqs. (1)–(5) with the parameters given in the first line of Table 1. With these parameters, condition (15) is fulfilled (\(0.47 < 1\)), and Corollary 2 predicts that the equilibrium is exponentially stable over each epoch. Accordingly, the model succeeds in building up a selforganizing map as shown in panel (A) of Fig. 2. The white discs indicate the feedforward weights after learning, and the black dots indicate the input data points (twodimensional rectangular uniform distribution).
Panels (B) and (C) show the \(\delta x  \delta y\) representation and the distortion, respectively. We observe that the \(\delta x  \delta y\) representation indicates a correlation between the feedforward weights and the rectangular grid points (aligned with the mean of δx–red line). This means that the selforganizing map is wellformed and conserves the topology of the input. Moreover, the distortion declines and converges toward 0.0025, pointing out first that the loss of information during learning is low and that the structure in the selforganizing map is preserved. However, the boundary effects (the density of points is higher at the boundary of the map in panel (A)) affect both the distortion (it does not converge to zero; see panel (C) in Fig. 2) and the \(\delta x  \delta y\) representation (it is not perfectly aligned with the red line; see panel (B) in Fig. 2). In spite of these boundary effects, the obtained \(\delta x\delta y\) performance indicator is good (\(\mathcal{P}=0.01\)).
The evolution of the norm2 of feedforward weights of three randomly chosen units (\(r^{\ast }=(0.25, 0.25)\), \((0.1, 0.225)\), \((0.35, 0.075)\)) is shown in the panel (D) of Fig. 2. This implies that the weights converge to an equilibrium after a transient period of about 2000 epochs. The oscillations around the equilibrium are due to a repeated alteration of the input stimulus, which causes a shift to the feedforward weights values of each winner neuron (see [8] for more detail).
4.4 Unstable case
The second line of Table 1 provides parameters for which Condition (15) is violated (\(5.25 > 1\)). According to our theoretical predictions, the model might not be stable and thus may not be able to develop any selforganizing map at all. To make sure that this is the case (and not merely a transient effect), we have let the training take more epochs (\(20{,} 000\)). Nevertheless, here we present only the 7000 first epochs for consistency with the rest of our experiments. This situation is illustrated in Fig. 3, where the selforganizing process has failed to generate a wellformed map (panel (A)). In this case, it is apparent that selforganization process has failed to generate a topographic map.
The \(\delta x  \delta y\) representation in panel (B) of Fig. 3 looks like a diffused cloud, indicating that there is no correlation between the grid points and the feedforward weights, meaning that there is no preservation of the topology of the input space. Accordingly, the performance index reaches the value \(\mathcal{P}=0.41\), thus higher than the stable case. Moreover, the distortion in panel (C) of Fig. 3 oscillates without converging to an equilibrium, pointing out that the loss of information remains high and therefore the mapping is not successful. Finally, the norm2 of feedforward weights of three units (\(r^{\ast }=(0.25, 0.25)\), \((0.1, 0.225)\), \((0.35, 0.075)\)) are shown in panel (D): it is apparent that they do not converge to an equilibrium. Instead, they oscillate violently and never stabilize around an equilibrium configuration.
4.5 Numerical assessment of Corollary 2
Finally, we numerically tested condition (15) of Corollary 2 for different values of the parameters \(K_{e}\) and \(K_{i}\) (all other parameters remained the same as in Table 1). For each pair \((K_{e}, K_{i})\), we computed the lefthand side of Eq. (15), the distortion \(\mathcal{D}\) (averaged over the last 10 epochs), and the \(\delta x  \delta y\) performance index \(\mathcal{P}\); see Fig. 4. We observe that for high values of \(K_{e}\) and \(K_{i}\), the stability condition of Corollary 2 is violated (the black solid line overpasses the black dashed line). The distortion (orange curve) closely follows the lefthand side of condition (15) (up to a scaling factor), suggesting that distortion can serve as a measure of stability of system (1)–(5). Furthermore, the distortion and the \(\delta x  \delta y\) performance index \(\mathcal{P}\) indicate that the learning process degrades for high values of \((K_{e},K_{i})\), in line with the fact that condition (15) is violated. Figure 5 confirms this good alignment between the theoretical stability condition and the performance of the selforganizing map: for the first five cases, it properly maps the input space to the neural one, whereas the topology of the input space is not preserved in the last two cases, and a malformed topographic map is obtained.
5 Conclusion
In this work, we have presented theoretical conditions for the stability of a neural field system coupled with an Ojalike learning rule [26]. Numerical assessments on a twodimensional selforganizing map indicate that the theoretical condition is closely aligned with the capacity of the network to form a coherent topographic map.
Previous works have shown through simulations that the dynamical system described by Eqs. (1)–(5) can develop topographic maps through an unsupervised selforganization process [8, 9]. The model relies on the activity of a neural field to drive a learning process. This type of models are capable of developing topographic maps and reorganize them in face of several kinds of disturbances. Here we proceed to a rigorous theoretical analysis of such kind of models by employing neural field Lyapunov theory.
The obtained stability conditions are reminiscent of those obtained for general neural fields dynamics, in which the spatial \(L_{2}\)norm of synaptic weights plays an essential role [11, 22, 33]. In our setting, these conditions translate in a good balance between excitation and inhibition for the exponential stability of the model equilibrium, thus allowing the selforganizing process to develop topographic maps. It is worth stressing that the proof techniques employed here do not rely on a linearization of the system around the considered equilibrium; it thus allows us to cover nondifferentiable activation functions (such as classical saturation or rectification functions).
These stability conditions provide a means to identify the parameters set within which the unsupervised learning works efficiently and thus provides an indication on how to tune them in practice. In particular, they can be used to further investigate how the dynamics of an underlying system affects the learning process during an unsupervised training process and what is the effect of the parameters on the final topographic map: as Fig. 4 indicates, the parameters of the model directly affect the quality of the topographic map. However, a limitation of the present work is that it does not offer a way of choosing the parameters in an optimal way. Furthermore, although the conditions provided by Theorem 1 guarantee the stability of the neural field, they do not predict the quality of the obtained map: Stability ensures that the learning will converge to an equilibrium, but the quality of the obtained map strongly depends on the structure of this equilibrium and hence on the chosen initial values of the feedforward weights. This is a wellknown problem with selforganizing maps [19], which is generally solved using a decreasing neighborhood, starting from a very wide one. In our case the neighborhood function is directly correlated with the profile of the field activity and is fixed (stereotyped). We thus cannot always ensure the proper unfolding of the map. It is to be noted that when the neighborhood of a Kohonen is kept fixed, it suffers from similar problems. Nevertheless, the numerical assessment of the proposed theoretical stability conditions suggests that the stability condition accurately predicts the emergence of topographic maps through unsupervised learning: see Figs. 4 and 5.
Other works have studied stability conditions for Kohonen maps and vector quantization algorithms using methods from linear systems stability theory [32] or through energy functions [10]. However, these works focus on the learning rule for the Kohonen selforganizing maps [20], and the dynamics are not explicitly given by dynamical systems. Our work goes beyond by taking into account not only the learning dynamics, but also the neural dynamics that drives the selforganizing process.
Last but not least, it has been shown that neural adaptation is crucial in the development of the neocortex [23] and neurons tend to adapt their input/output relation according to the statistics of the input stimuli. Our theoretical results provide conditions under which this input/output adaptation successfully takes place at least at a computational level.
6 Proof of the theoretical results
6.1 Proof of Theorem 1
To place the equilibrium at the origin, we employ the following change of variables:
where \(u^{*}\) and \(w_{f}^{*}\) denote the equilibrium patterns of Eq. (6a) and (6b), as defined in Eq. (7a) and (7b). Then system (6a) and (6b) can be written as
where for all \(x\in \mathbb{R}\) and all \(r\in \Omega \),
With this notation, we have \(\tilde{f}_{l}(r,0)=\tilde{f}_{s}(0)=0\) for all \(r\in \Omega \), meaning that (17a) and (17b) owns an equilibrium at zero. Thus the stability properties of the origin of (17a) and (17b) determine those of the equilibria of (6a) and (6b).
First, observe that since \(w_{e}\) is a bounded function and Ω is compact, there exists \(\bar{\omega }_{e}>0\) such that
To assess the stability of (17a) and (17b), we may be tempted to rely on linearization techniques. Nevertheless, the linearized system (17a) and (17b) around the origin would necessarily involve the derivative of \(f_{s}\) at zero, which may be undefined if \(f_{s}\) is not differentiable at zero (which is the case for the system of interest (1)–(5), where \(f_{s}\) involves an absolute value). Consequently, the proof we propose here relies on Lyapunov methods [17], which were extended to neural fields in [11].
Consider the following Lyapunov functional:
where \(\rho >0\) denotes a parameter whose value will be decided later. First, observe that the following bounds hold at all \(t\geq 0\):
where \(\underline{\alpha }:=\frac{1}{2}\min \{\tau ; \rho /\gamma \}>0\) and \(\overline{\alpha }:=\frac{1}{2}\max \{\tau ; \rho /\gamma \}>0\). The derivative of V along the solutions of (6a) and (6b) reads
Moreover, denoting by \(\ell _{s}\), \(\ell _{e}\), and \(\ell _{l}\) the Lipschitz constants of \(f_{s}\), \(f_{e}\), and \(f_{l}\) respectively, we have that for all \(x\in \mathbb{R}\) and all \(r\in \Omega \).
Applying the Cauchy–Schwarz inequality and using Eq. (22), it follows that
Hence, using again the Cauchy–Schwarz inequality,
Observing that \(\int _{\Omega }\tilde{u}(r',t)^{2}\,dr'\) is independent of r and defining
it follows that
Furthermore, using Eq. (23), we have that
Invoking the inequality \(2ab \leq (a^{2}/\lambda + \lambda b^{2})\) for all \(a, b \in \mathbb{R}\) and \(\lambda > 0\), we obtain that
for any \(\lambda >0\).
Now assumption (10) ensures that \(\inf_{r\in \Omega }\int _{\Omega }w_{e}(r,r') \tilde{f}_{e}(r',0)\,dr' >0\). It follows that there exists \(c>0\) such that
Consequently, using (24) and the Cauchy–Schwarz inequality, we get that for any \(v\in L_{2}(\Omega ,\mathbb{R})\),
where the last bound comes from (18). Let \(\mathcal{B}_{\varepsilon }\) denote the ball (in \(L_{2}\)norm) of radius \(\varepsilon >0\), that is, \(\mathcal{B}_{\varepsilon }:= \{ v\in L_{2}(\Omega ,\mathbb{R}) : \v\< \varepsilon \} \). Letting \(\varepsilon := c/\ell _{e}\bar{w}_{e}\), we conclude from the above expression that
Consider an initial condition such that \(\tilde{u}(\cdot ,0)\in \mathcal{B}_{\varepsilon }\) and let \(T\in [0,+\infty ]\) denote the time needed for \(\tilde{u}(\cdot ,t)\) to leave \(\mathcal{B}_{\varepsilon }\). Then, by definition, \(\tilde{u}(\cdot ,t)\in \mathcal{B}_{\varepsilon }\) for all \(t\in [0,T)\), and \(\tilde{u}(\cdot ,T)\notin \mathcal{B}_{\varepsilon }\) if T is finite. Note that by the continuity of solutions, \(T>0\). Moreover, in view of (28),
Combining Eqs. (21), (22), (23), and (29), we obtain that for all \(t\in [0,T)\),
Pick \(\lambda =(1\ell _{l}\bar{w}_{l})/\ell _{s}\). Note that \(\lambda >0\) since \(\ell _{l}\bar{w}_{l}<1\) by assumption (see Eq. (9)). Then the choice \(\rho =\frac{\ell _{s}}{c\lambda }= \frac{\ell _{s}^{2}}{c(1\ell _{l}\bar{w}_{l})}>0\) leads to
Using (20) and letting \(\alpha :=\frac{1}{2\overline{\alpha }}\min \{1 ; \rho c\}>0\), we finally obtain that
Integrating this gives \(V(t)\leq V(0)e^{\alpha t}\) for all \(t\in [0,T)\), which yields, using (20),
Thus if initial conditions are picked within the \(L_{2}\)ball of radius \(\frac{\sqrt{\underline{\alpha }}}{\varepsilon \sqrt{\overline{\alpha }}}\), then \(\\tilde{u}(\cdot ,t)\+\\tilde{w}_{f}(\cdot ,t)\< \varepsilon \) at all times \(t\geq 0\). This means that for these initial conditions, solutions never leave the ball \(\mathcal{B}_{\varepsilon }\), and hence \(T=+\infty \). Thus Eq. (30) ensures the exponential stability on this set of initial conditions.
6.2 Proof of Corollary 1
Assumption described by Eq. (13) is equivalent to requiring \(\bar{w}_{l}<1\), with \(\bar{w}_{l}\) defined in Eq. (25). Since the Lipschitz constant of the rectification is \(\ell _{l}=1\), this makes Eq. (9) fulfilled.
Moreover, we claim that the solution \(u^{*}\) of the implicit Eq. (12a) and (12b) is necessarily positive on some subset of Ω of nonzero measure. To see this, assume on the contrary that \(u^{*}(r)\leq 0\) for almost all \(r\in \Omega \). Then \(\operatorname{rect}(u^{*}(r))=0\) for almost all \(r\in \Omega \), which implies that
In view of Eq. (12a) and (12b), this implies that \(u^{*}(r)=1\) for all \(r\in \Omega \), thus leading to a contradiction. Consequently, as claimed, \(u^{*}\) is necessarily positive on some subset \(\Omega ^{+}\) of Ω of nonzero measure. Recalling that here Ω is assumed to be a compact set, it follows that
which makes Eq. (10) satisfied. The conclusion then follows from Theorem 1.
6.3 Proof of Corollary 2
The following onedimensional relation holds:
To compute its twodimensional counterpart, let \(r=(r_{1},r_{2})\) and \(r'=(r_{1}',r_{2}')\). Then, for \(\Omega =[a,b]\times [a,b]\),
Using Fubini’s theorem, it follows that
By Eq. (31) this gives
The lefthand term of Eq. (13) then reads
which concludes the proof.
7 PRNG seed
We ran both stable and nonstable experiments ten times with different PRNG seeds. All the PRNG seeds we used are 10, 74, 433, 721, 977, 1330, 3433, 5677, 9127, 7659.
Availability of data and materials
The source code used in this work for running the simulations, analyzing the results, and plotting the figures is freely distributed under the GPL3 License and can be found at https://github.com/gdetor/som_stability.
Abbreviations
 SOM:

Selforganizing map
 DSOM:

Dynamic selforganizing map
 FFT:

Fast Fourier transform
 PRNG:

Pseudorandom number generator
References
Abbott LF, Nelson SB. Synaptic plasticity: taming the beast. Nat Neurosci. 2000;3:1178–83.
Abramowitz M, Stegun IA. Handbook of mathematical functions with formula, graphs and mathematical tables. New York: Dover; 1983.
Amari S. Dynamics of pattern formation in lateralinhibition type neural fields. Biol Cybern. 1977;27(2):77–87.
Bressloff PC. Spatiotemporal dynamics of continuum neural fields. J Phys A, Math Theor. 2012;45(3):033001.
Coombes S. Waves, bumps, and patterns in neural field theories. Biol Cybern. 2005;93(2):91–108.
Cover TM, Thomas JA. Elements of information theory. New York: Wiley; 2012.
Demartines P. Organization measures and representations of Kohonen maps. In: First IFIP Working Group. vol. 10. Citeseer; 1992.
Detorakis GI, Rougier NP. A neural field model of the somatosensory cortex: formation, maintenance and reorganization of ordered topographic maps. PLoS ONE. 2012;7(7):e40257.
Detorakis GI, Rougier NP. Structure of receptive fields in a computational model of area 3b of primary sensory cortex. Front Comput Neurosci. 2014;8:76.
Erwin E, Obermayer K, Schulten K. Selforganizing maps: ordering, convergence properties and energy functions. Biol Cybern. 1992;67(1):47–55.
Faugeras O, Grimbert F, Slotine JJ. Absolute stability and complete synchronization in a class of neural fields models. SIAM J Appl Math. 2008;69(1):205–50.
Faugeras O, Veltz R, Grimbert F. Persistent neural states: stationary localized activity patterns in nonlinear continuous npopulation, qdimensional neural networks. Neural Comput. 2009;21(1):147–87.
Grajski KA, Merzenich M. Neural network simulation of somatosensory representational plasticity. In: Advances in neural information processing systems. 1990. p. 52–9.
Grossberg S. Physiological interpretation of the selforganizing map algorithm. 1994.
Hebb DO. The organization of behavior: a neuropsychological theory. Psychology Press; 2002.
Kaski S, Kangas J, Kohonen T. Bibliography of selforganizing map (SOM) papers: 1981–1997. Neural Comput Surv. 1998;1(3–4):1–176.
Khalil H. Nonlinear systems. 2nd ed. New York: Macmillan Publishing Co.; 1996.
Knight RT, Staines WR, Swick D, Chao LL. Prefrontal cortex regulates inhibition and excitation in distributed neural networks. Acta Psychol. 1999;101(2):159–78.
Kohonen T. Selforganized formation of topologically correct feature maps. Biol Cybern. 1982;43(1):59–69.
Kohonen T. Selforganizing maps. vol. 30. Berlin: Springer; 2001.
Krubitzer LA, Kaas JH. The organization and connections of somatosensory cortex in marmosets. J Neurosci. 1990;10(3):952–74.
Galtier MN, Faugeras OD, Bressloff PC. Hebbian learning of recurrent connections: a geometrical perspective. Neural Comput. 2012;24(9):2346–83.
Mease RA, Famulare M, Gjorgjieva J, Moody WJ, Fairhall AL. Emergence of adaptive computation by single neurons in the developing cortex. J Neurosci. 2013;33(30):12154–70.
Miikkulainen R, Bednar JA, Choe Y, Sirosh J. Computational maps in the visual cortex. Berlin: Springer; 2006.
Nasrabadi NM, Feng Y. Vector quantization of images based upon the kohonen selforganizing feature maps. In: Proc. IEEE int. conf. Neural networks. vol. 1. 1988. p. 101–5.
Oja E. Simplified neuron model as a principal component analyzer. J Math Biol. 1982;15(3):267–73.
Oja M, Kaski S, Kohonen T. Bibliography of selforganizing map (som) papers: 19982001 addendum. Neural Comput Surv. 2003;3(1):1–156.
Qi HX, Preuss TM, Kaas JH. Somatosensory areas of the cerebral cortex: architectonic characteristics and modular organization. Senses Compr Ref. 2008;6:143.
Rougier N, Boniface Y. Dynamic selforganising map. Neurocomputing. 2011;74(11):1840–7.
Schaefer M, Heinze HJ, Rotte M. Taskrelevant modulation of primary somatosensory cortex suggests a prefrontal–cortical sensory gating system. NeuroImage. 2005;27(1):130–5.
Sirosh J, Miikkulainen R. Ocular dominance and patterned lateral connections in a selforganizing model of the primary visual cortex. In: Advances in neural information processing systems. 1995. p. 109–16.
Tucci M, Raugi M. Stability analysis of selforganizing maps and vector quantization algorithms. In: The 2010 international joint conference on neural networks (IJCNN). Los Alamitos: IEEE Comput. Soc.; 2010. p. 1–5.
Veltz R, Faugeras O. Local/global analysis of the stationary solutions of some neural field equations. SIAM J Appl Dyn Syst. 2010;9(3):954–98.
Wilson SP, Law JS, Mitchinson B, Prescott TJ, Bednar JA. Modeling the emergence of whisker direction maps in rat barrel cortex. PLoS ONE. 2010;5(1):e8778.
Xing J, Gerstein GL. Networks with lateral connectivity. I. dynamic properties mediated by the balance of intrinsic excitation and inhibition. J Neurophysiol. 1996;75(1):184–99.
Xing J, Gerstein GL. Networks with lateral connectivity. II. Development of neuronal grouping and corresponding receptive field changes. J Neurophysiol. 1996;75(1):200–16.
Xing J, Gerstein GL. Networks with lateral connectivity. III. Plasticity and reorganization of somatosensory cortex. J Neurophysiol. 1996;75(1):217–32.
Yin H. Learning nonlinear principal manifolds by selforganising maps. In: Principal manifolds for data visualization and dimension reduction. Berlin: Springer; 2008. p. 68–95.
Acknowledgements
Not applicable.
Funding
This work was partially funded by grant ANR17CE240036.
Author information
Authors and Affiliations
Contributions
GID conceived the idea, wrote the code, designed, and ran the numerical experiments, AC performed the mathematical derivation and analysis. GID, AC, and NPR contributed to preparing, writing, and revising the manuscript. All authors read and approved the final manuscript.
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Consent for publication
Not applicable.
Additional information
Georgios Detorakis and Antoine Chaillet contributed equally to this work.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Detorakis, G., Chaillet, A. & Rougier, N.P. Stability analysis of a neural field selforganizing map. J. Math. Neurosc. 10, 20 (2020). https://doi.org/10.1186/s13408020000976
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13408020000976
Keywords
 Selforganizing maps
 Neural fields
 Lyapunov function
 Asymptotic stability
 Neural networks