Online spike-based recognition of digits with ultrafast microlaser neurons

Masominia, Amir; Calvet, Laurie E.; Thorpe, Simon; Barbay, Sylvain

doi:10.3389/fncom.2023.1164472

ORIGINAL RESEARCH article

Front. Comput. Neurosci., 03 July 2023
Volume 17 - 2023 | https://doi.org/10.3389/fncom.2023.1164472

Online spike-based recognition of digits with ultrafast microlaser neurons

Amir Masominia¹^*

Laurie E. Calvet²

Simon Thorpe³

Sylvain Barbay¹

¹Université Paris-Saclay, CNRS, Centre de Nanosciences et de Nanotechnologies, Palaiseau, France
²LPICM, CNRS-Ecole Polytechnique, Palaiseau, France
³CERCO UMR5549, CNRS—Université Toulouse III, Toulouse, France

Classification and recognition tasks performed on photonic hardware-based neural networks often require at least one offline computational step, such as in the increasingly popular reservoir computing paradigm. Removing this offline step can significantly improve the response time and energy efficiency of such systems. We present numerical simulations of different algorithms that utilize ultrafast photonic spiking neurons as receptive fields to allow for image recognition without an offline computing step. In particular, we discuss the merits of event, spike-time and rank-order based algorithms adapted to this system. These techniques have the potential to significantly improve the efficiency and effectiveness of optical classification systems, minimizing the number of spiking nodes required for a given task and leveraging the parallelism offered by photonic hardware.

1. Introduction

Photonic artificial neural networks can open great prospects for the realization of fast and energy efficient image recognition tasks. The advantages of photonic systems include integration, very small dissipation during information transport, ultra-fast response times (sub nanosecond) and, due to the number of controllable nonlinearities in optical materials, a wealth of physical properties useful for information processing. In particular, spiking or excitable photonic systems (Nahmias et al., 2013; Feldmann et al., 2019; Skalli et al., 2022) are exceptional candidates for building third generation neural networks, which are predicted to signficantly improve power consumption and augment computational capabilities (Maass, 1997; Thorpe et al., 2001; Stöckl and Maass, 2021).

Recent research has generated great interest in the use of physical reservoir computing, where the complex dynamics of a physical system are exploited to project input data into a larger dimensional space. Its output is then typically classified in an offline computation using a relatively simple method like a ridge regression (Tanaka et al., 2019; Nakajima, 2020). Such methods include a wide variety of different systems such as nano-electronic spin-torque nanoscillators (Torrejon et al., 2017), organic electrochemical networks (Cucchi et al., 2021), and photonics (Lugnan et al., 2020) using, e.g., optoelectronic oscillators (Larger et al., 2017) or spiking vertical cavity surface emitting lasers (Robertson et al., 2022; Owen-Newns et al., 2023). One exception to resorting to a software based step involved all-optical time-series prediction using passive devices (Bueno et al., 2018) and emulated spiking-based recognition of four different letters in the optical domain (Feldmann et al., 2019). Removing this offline step can significantly improve the response time and energy efficiency of these systems. In this study, we present an algorithmic approach to image recognition that utilizes ultrafast photonic spiking neurons, allowing for a recognition by solely observing the responses of these neurons. This work represents an important step toward the development of more efficient and effective methods for image classification.

We investigate a model task of digit classification utilizing simplified 5 × 5 binary pixel images of the 10 digits, as depicted in Figure 1A. The photonic nodes used here are semiconductor-based micropillar lasers with integrated saturable absorber. Our technique is based on tuning the key physical parameters of photonics microlaser neurons, such as the bias pump, input bit time and intensity, so that their response consists of a single spike that is sensitive to certain features of the data. The optical spiking neurons exhibit the fundamental properties of biological neurons, such as excitability, relaxation, and a refractory period, but are six orders of magnitude faster (Pammi et al., 2020).

FIGURE 1

Figure 1. (A) The 10 digits used in this study. Digits consist of 5 × 5 binary pixels in each image. (B) Horizontal and vertical receptive fields define input bit sequences, which are fed into the neuron through the time varying pump μ₁(t). Depending on the neurons internal parameters, the neurons can fire or not. (C) Illustration of the neuron response to the V₄ receptive field with the input bit sequence [0, 0, 1, 1, 1]. (Top): Corresponding input pump μ₁(t) (Equation 2) with bit duration τ_b, pump pulse duration τ_p and pump amplitude c. (Middle): Net gain of the micropillar R(t). (Bottom) Output intensity I(t).

The inspiration for this technique is found in the biological concept of a receptive field (RF), where neurons exhibit a selectivity to only certain stimuli, resulting in a very sparse and therefore energy efficient encoding of information. Our method is also related to temporal encoding schemes known as time to first spike (Bonilla et al., 2022), which are gaining increasing interest due to its low energy consumption while maintaining an excellent computational performance (Abderrahmane et al., 2020; Kheradpisheh and Masquelier, 2020; Park et al., 2020; Gardner and Grüning, 2021; Guo et al., 2021). Such encoding is believed to play a crucial role in several cognitive processes, such as memory retention and decision-making. It is particularly well-suited for applications where speed is required. While we explore here a simple classification task, our approach may also be used for ultra-fast feature selection prior to a larger classification task, which will benefit the system by reducing its dimensionality. What makes our results unique in the literature is not just the consideration of different temporal schemes based on an optically spiking neuron, but the use of a more complex neuron model for hardware compared to the integrate and fire neurons previously explored.

2. Methods

2.1. Model for spiking microlaser neuron

The model we use to describe the microlaser neurons derives from the Yamada (1993) model, which has proven to adequately predict the response of such systems in various configurations (Barbay et al., 2011; Terrien et al., 2020). It consists of three dimensionless coupled ordinary differential equations:

\begin{array}{l} \dot{I} = I (G - Q - 1) + β (G + η) 2 \\ \dot{G} = γ_{G} (μ_{1} (t) - G (1 + I)) \\ \dot{Q} = γ_{Q} (μ_{2} - Q (1 + s I)) & (1) \end{array}

where I is the intracavity intensity, G is the gain, Q is the saturable absorption, and γ_G and γ_Q are the gain and the SA relaxation rates, respectively. All the parameters are scaled to the cavity photon lifetime, which is on the order of 1–2 ps in physical units. The parameter β models the amout of spontaneous emission coupled to the laser cavity mode. We define the important quantity, net gain, as R = G−Q−1. The microlaser is pumped with a strength μ₁, which can be adapted to either electrical or optical pumps, and μ₂ is the linear unsaturated absorption. The saturation parameter is s, which, for semiconductor materials here, takes the value s≃10. We use typical parameters for semiconductor materials, such that γ_G = γ_Q = 0.005, μ₂ = 2, η = 1.4, and β = 10⁻⁴. Note that the gain and SA recombination rates are small (γ_{G, Q}≪1), resulting in a slow-fast nonlinear system. The intensity dynamics are essentially governed by the net gain (at least in the linear regime): if R>0 laser intensity increases, and if R < 0 it decreases. The nonlinear terms in the gain and SA equations describe stimulated emission and absorption processes, leading either to light amplification or (saturable) absorption. The main physically controllable parameter is the pump μ₁(t).

These equations have been studied theoretically in Dubbeldam and Krauskopf (1999) in the limit β → 0. When the pump μ₁ is increased from 0, the laser starts in its off state and no light is emitted until the laser threshold is reached. Above the laser threshold (μ₁>μ_{1, th} = 1+μ₂), the microlaser is in the self-pulsing regime and emits a train of short pulses. Just below the laser threshold, the microlaser is in the excitable regime: it has a stable quiet state corresponding to no laser emission; if it is perturbed above a certain threshold (the excitable threshold), it emits a fast calibrated pulse in response and returns back to its quiet state in a time corresponding to the absolute and relative refractory periods. Experimentally, the pulse duration is about 200 ps and the relative refractory period is of the order of a few hundred piocseconds, typically 350 ps or more (Selmi et al., 2014). The spiking microlaser also displays spike latency (Selmi et al., 2016) and temporal summation (Selmi et al., 2015). It thus has all the main ingredients of a biological neuron from a computational point of view.

In this study, we focus on the excitable regime, where the microlaser can be considered as a photonic spiking neuron. Input signals can act on I or G, and are defined, respectively as coherent or incoherent. We consider the latter case, which is more practical experimentally, and input the information in the system on the pump parameter μ₁(t), with a coding scheme that will be explained below.

2.2. Input coding

The data to be classified consists of images of digits, each of which is made up of 25 binary pixels (Figure 1A). These images are input into the micropillar by translating the value of each pixel in a row (H_i) or column (V_i) into a corresponding time varying pump value relative to the base pump value (μ₀). This process is illustrated in Figure 1B). The input pump coding is defined by each input bit p from the given receptive field (H_i or V_i) such that:

\begin{array}{l} μ_{1} (t) = μ_{0} + \sum_{i} c p_{i} Π_{τ_{p}} (t - i τ_{b}) & (2) \end{array}

where Π_{τ_p}(t) is a boxcar function of duration τ_p. The time varying pump value is calculated as the sum of the base pump value μ₀ and of the translated bit sequence, each bit having duration τ_b≥τ_p and the added pump amplitude corresponding to the bit “1” being c (0 otherwise). The control parameters τ_b and τ_p are determined by physical considerations. If τ_b is much larger than the relative refractory period of the microlaser, the input bits will not interact since the system can reach steady state after each input pump pulse. By choosing the bit time τ_b slightly less than the relative refractory time, consecutive bits can be summed up due to temporal summation and the system can be made to fire only after a certain sequence is met. This is illustrated in Figure 1B) where the three consecutive “1” part of the input bit sequence “00111” allows the net gain R(t) to increase slightly above zero and thus elicit a spike in the laser intensity. Note that in this case, the system only fires after at least three consecutive input “1”s. The pump pulse width τ_p is chosen in conjunction with the pump amplitude c since, in the small pump pulse duration limit, the physically relevant quantity for the system is the pump pulse energy, i.e., the product τ_pc.

3. Results

3.1. Horizontal and vertical receptive fields

Our approach to digit recognition is based on the temporal summation property of neurons (Koch, 2004), which results in receptive fields for vertical and horizontal features in the input image. Temporal summation, a.k.a. integration, occurs when the neuron integrates the individual stimuli into a single, stronger signal. It has been shown to be important biologically for numerous cognitive functions such as sensing pain (Price et al., 1977), seeing in dim light (Warrant, 1999), and auditory perception (Heil and Neubauer, 2003). It has been demonstrated in a photonic spiking neuron in Selmi et al. (2015) and used for ultrafast image processing using VCSELs in Hejda et al. (2022) and Robertson et al. (2022).

As shown in Figure 1, the majority of the features of each digit are oriented horizontally and vertically, and often consist of aligned blocks of pixels. This observation will drive our approach to the classification task. Our strategy is based on the manipulation of the physical parameters of the micropillar neurons such that they respond to specific input patterns. As shown in Figure 1, by encoding the pixels into pump values and sending them row by row or column by column to the corresponding micropillar neurons, we can elicit a response in the form of a spike if a certain pattern is detected (in this case, three perturbations in a row). This approach allows us to effectively classify the digits based on their distinctive features.

A salient feature observable in all the images in Figure 1 is the presence of either three or four perturbations arranged in a contiguous row or column. Furthermore, the digits 2 and 5 possess symmetrical features, requiring the implementation of neurons with new sets of parameters to detect the patterns “11101” or “10111.” By exploiting the principle of temporal summation, the parameters of the micropillar can be optimized such that the arrival of either of these perturbation patterns results in a net gain above the threshold (R≥0), leading to the generation of a spike. This can be used subsequently as an indicator of the detection of a specific pattern, making the classification in an experimental implementation relatively straightforward. Alternatively, the latency of the resultant spike can be utilized as a means of differentiating between the two primary patterns, as shown below.

3.2. Feature detection

We aim to investigate the parameter regime in which micropillars exhibit a spike response to a specific pattern while remaining insensitive to other patterns. To accomplish this objective, a comprehensive sweep of the relevant parameter space was conducted. The parameters that were varied in this sweep include the pump value (μ₁), as well as the intensity of the perturbation, represented by the coefficient c. Furthermore, the effects of two additional parameters, namely the bit- and perturbation-times were also considered. The bit-time represents the duration of time allocated for each pixel to influence the system, while the perturbation-time denotes the duration of the perturbation when it is present. Through systematic variation of these parameters, we were able to determine the optimal parameter values that lead to the desired behavior in the micropillars. As demonstrated in Figures 2A, B, the distinction between various features is possible by a proper choice of the parameters (μ) and (c). Interestingly, the summation being a time dependant process, the input sequences “11101” and “10111” can be distinguished in a sizeable parameter range. Here, the bit-time and perturbation-time were fixed at 50 and 30 units of simulation time, respectively. These values serve to broaden the parameter regions for each pattern, thus enabling greater accuracy in identification.

FIGURE 2

Figure 2. (A) Minimum number of consecutive “1” bits in the input sequence required to excite the photonic neuron. The region μ₀ ⪆ 2.75 for the pump value corresponds to the self-pulsing regime and does not require any additional perturbations to produce a spike. (B) Parameter region where the input sequence 10111 is distinguishable. (C) Spike latency difference (in log₁₀ scale) for sequences of three and four consecutive “1” bits, as a function of pump values and coefficients. The other parameters are τ_b = 50 and τ_p = 30.

3.3. Output coding with spike latency

Spike latency, also known as the temporal delay, is a nonlinear interval between the presentation of a stimulus to a neuron and the subsequent generation of an action potential as a response. This latency period is of paramount importance in neuron activation as it enables encoding the strength of the stimulus in the time domain (Fujii et al., 1996) a process known as temporal coding. In general, spike latency is a critical aspect of neuronal function and plays a pivotal role in a plethora of brain processes.

As an alternative to the previous coding scheme, we can take advantage of the fact that the temporal stamp for each spike, in the event of receiving any of the two predominant patterns, will be unique to that specific pattern. Through this approach, we can further diminish the feature space of the original dataset. In order to effectively implement this method experimentally, it is imperative to have a substantial disparity between the spike latencies of the two dominant patterns (three or four perturbations in a row). As depicted in Figure 2C, we conducted a comprehensive sweep over the same parameters, while maintaining the values of τ_b and τ_p as previously established. The spike latency diverges to infinity at the excitable threshold and decreases strongly when the excitation increases, until saturating to some non-zero value. This strong variation can be seen in Figure 2C where the spike latency difference varies over several orders of magnitude. It shows that a proper choice of parameters can allow for a large spike latency when needed to ease the feature distinction. In the vicinity of the excitable threshold, we can find the parameter region where the difference grows exponentially toward infinity, thus providing a window where the detection of two patterns will be eased. Indeed, from an experimental point of view, the system would also be subject to noise and large latencies would also be accompanied by large fluctuations of the latency time. Thus, a large latency difference can be useful to differentiate among input features.

3.4. Classification

We present three classification methods based on the encoding of the output spikes. One relies only on the presence or absence of a spike related to a specific feature (event based). The second incorporates the timing information of the spike arrival (spike-time based). The last one depends on the arrival order of the spikes (rank order based). As we will show, this choice can have a significant impact on the performance of the model and on the reduction of the overall computational cost. In the present classification methodology, the vertical and horizontal receptive fields (V and H, respectively) allow the retrieval of particular features at specific positions in the input image. This is rendered possible by fixing the pair of parameters, μ₀ and c, for each considered feature.

3.4.1. Event base coding

A time-independent classification approach is employed, relying on an event-driven coding strategy. In this method, each receptive field is input into a photonic neuron with different parameters μ₀ and c, which are adjusted to detect the desired input feature corresponding to the presence or absence of a specific sequence, as calculated in Figure 2A. As shown in Table 1a, the three most significant sequences that result in a successful classification of the 10 digits are found to be “111,” “1111,” and “10111.” The parameters chosen to detect the sequences “111” and “1111” are μ₀ = 1.25 and c = 6 and c = 5.5, respectively. In order to separate the classes corresponding to digits 2 and 5, it is necessary to introduce a feature to differentiate between the sequences “10111” and “11101.” Based on Figure 2B, we choose a feature with μ₀ = 1.25 and c = 5.5. As a result, we show in Table 1a the codes associated to the different features tested. From these, it is clear that only 10 features are necessary to distinguish between the digits: {H1, H3, H5, V2, V4} associated to “111,” {V2, V3, V4} associated to “1111” and {V2} or {V4} associated to “10111.” We can eventually add {V3} tuned to detect “1111” if we exclude the empty code for digit “1.” This method can thus classify the digits using 10 neurons in parallel, provided one inputs the data of chosen receptive fields. The classification time taken will be on the order of 5τ_b+τ_l, τ_l being the largest latency time expected for the response. In physical units, this can lead to a few nanoseconds for the process.

TABLE 1

Table 1. Minimal output code (see text) without and with spike latency, respectively (a, b), and with arrival time (c) for each input digit.

3.4.2. Spike-time coding

In this second method, we make use of spike latency, one of the fundamental properties of biological neurons that also exists in our artificial photonic neuron. As shown in Figure 2C, the theoretical spike latency difference (δτ_l) between the two input sequences, namely “111” and “1111,” can approach infinity. This difference in latency derives from the evolution of net-gain [R(t)] and laser intensity [I(t)] in the different input sequences. To surmount this, we consider the temporal position of the resulting spikes. As illustrated in Figure 3, by choosing a parameter region (μ₀, c, τ_b, and τ_p), where the time latency difference $δ τ_{l} = τ^{(3)} - τ^{(4)}$ is greater than τ_b, we can introduce a coding scheme based on spike time. This spike-time coding scheme, which is shown in Table 1b, assigns a temporal position stamp k to each receptive field (Hn_k/Vn_k), k ranging from 1 to 5, fastest to arrive (lowest latency) to slowest, respectively. This leads to a unique code for each digit class.

FIGURE 3

Figure 3. Spike time based coding. Input sequences in chosen receptive fields are sent to micropillar lasers tuned to spike in response to at least three consecutive “1” bits. τ⁽³⁾ and τ⁽⁴⁾ are the latency times for input patterns “111” and “1111,” respectively. Possible spike timings are separated by τ_b (temporal length of the pixel) and are indicated by labels 1–5. The correct output sequence can be identified and results in the prediction of a given class.

This method of coding removes the need for the redundant check of receptive fields {V2, V3, V4}, for presence of different patterns, where the temporal position of the output spike (k) contains the information of type and location of the pattern all at once. This reduces to seven the number of artificial neurons needed for this coding, while increasing the collected features to 12. By choosing μ₀ = 2.65 and c = 0.5 we can be assured of a delay difference $δ τ_{l} \approx 1 0^{3}$ , which is an order of magnitude greater than τ_b = 50, therefore satisfying the criteria of no overlap of temporal position stamps, regardless of input pattern disposition in receptive fields.

As shown in Figure 3, the output spike time of each receptive field, can be utilized in different ways to decode. The simplest way to realize the decoding scheme consists in the use of a set of ten micropillars, each tuned to respond to a certain pattern of output codes. The system still takes advantage of parallel processing, and therefore, keeps the feature extraction and prediction time to a few nanoseconds.

3.4.3. Rank coding

We can extend the framework of spike-time coding to align with the principles of rank order coding (Thorpe and Gautrais, 1998), where only the order of arrival of spike times is taken into account. To ensure that each the spike resulting from a receptive field is unique and remove accidental degeneracies of spike-times in different receptive fields, we assign a random delay τ_d∈[1, 100] to each receptive field. The range of τ_d was decided in relation to the delays of other patterns so that it can alter the arrival time effectively. Here we chose τ_b = [24, 47, 20, 60, 91, 9, 67] for receptive fields {V2, V3, V4, H1, H3, H4, H5}, respectively. We use the same number of receptive fields, i.e., 7. We use the parameters μ₀ = 2.6, c = 0.7, and keep τ_b = 50, such as to have a clear distinction between the three patterns : “111,” “1111,” and “11111.”

As illustrated in Table 1c, by considering only the arrival order of the first three spikes, we can accurately predict the class to which it belongs. During the learning process, the output connection of each class is made to a particular receptive field that has been chosen and optimized to respond to a specific spike arrival order. In the case of digits with fewer than three output spikes, such as 1 and 7, the order is still unique and distinguishable from other classes. Using a more comprehensive dataset would result in a wider range of spike combinations.

It is worth noting that the random delay can be easily implemented by, e.g., adding a small random difference in the pump parameter μ₀ of each neuron, or could even be “naturally” implemented by the inherent fabrication inhomogeneities of the different microlasers considered. All the random choices are not equivalent and may not lead to an efficient classification. They can thus be considered as hyperparameters for our problem. However, our simulations show that it is rather easy to select a set of random delays suitable for a given task.

4. Discussion and conclusion

In this numerical study, we demonstrate the efficacy of incorporating a nonlinear element, specifically laser micropillars, into the classification task. The incorporation of this fundamental nonlinearity serves to elevate the feature space, thereby enabling the utilization of fewer features for the classification task. By using the concept of a receptive field, we can significantly reduce the number of spikes from one spike per one bit, to one spike per multiple bits. This shows how more complex neuron models can be used to advantage to improve the energy consumption of a spiking neural network.

In the event-based method, we observe that utilizing 10 artificial neurons (laser micropillar neurons)—as opposed to the 25 features (of which 15 are distincts) in the original feature space consisting of 5 × 5 images—enabled successful classification of the entire dataset. The classification is based on choosing the right parameters for the micropillar laser, enabling the separation of different input sequences. We find that it can be rather easy to obtain a correct classification requiring a resolution of 10 percent in both μ₀ and C.

We also implemented methods using time coding, which brings the functionality of our system closer to that of biological systems. In the spike-time based and rank based methods, the number of artificial neurons needed is further reduced to 7, representing an absolute minimum for the dataset in question. The reduction in the number of features and subsequently, the number of neurons required, is of significant importance as it allows for offline or extra computation to be avoided, thereby enabling the system to naturally identify distinguishable features and minimized the time and energy required for a classification. The parameters range required in this case is slightly smaller than the one identified in Figure 2A, as can be seen in Figure 2C and still represents a sizable range. Moreover, the rank order classification algorithm is particularly appealing because it only relies on the spike arrival order. This is unlike spike-time based methods, where spike arrival times must be compared to a reference. The classification is effective within the time needed to receive the third fastest spike, which can be on the order of only a few nanoseconds in physical systems (Selmi et al., 2016).

In light of the findings of this theoretical study, the implementation in an experimental setup would serve as a clear demonstration of the computational and learning capabilities of a neuromorphic system based on micropillars. As depicted in Figure 3, this implementation can be achieved using multiple micropillars or by dividing the task into a manageable process for a single pillar through adjustments of the necessary parameters. Using semiconductor sources to pump the micropillars optically and multiple optical modulators to produce optical pulses as input data into the pillars are experimentally feasible approaches for this implementation. Electrical biasing of the micropillar lasers is also a viable and promising technique. We also note that the algorithms presented here are not restricted to these specific setups and could be used in the experimental platforms of Ma et al. (2017) and Skalli et al. (2022).

Scaling up of the network much further or generalization to larger datasets is an open question. However, we would like to stress that our system in its current configuration, in addition to being used as the output of a physical reservoir for a classification task, may also be used prior to a larger classification as an ultra-fast feature selector, in the spirit of convolutional neural networks. It is worth also noting that a multilayer structure can in principle be adopted in this approach since the timing information would simply flow between the layers. In the case of the rank order code, it occurs at the network's final stage and it is no necessary to determine the timing at any other point in the network.

Furthermore, the biological relevance of the temporal models introduced in this study allows for the exploration of more complex, real-world datasets as was used in Van Rullen et al. (1998) for facial recognition using one spike per neuron. Such an algorithm relies on detecting basic facial features and their relative positions which could be performed in hardware with our ultra fast system. The fundamental principle remains the same, identifying the dominant features that define the dataset and allowing the micropillar to reduce the feature space through its nonlinearity. The ranking of the output is also a crucial factor in reducing the time and energy required for more complex tasks where, only specific connections are capable of extracting the necessary information for classification. The potential applications are diverse and, owing to the rapid response time of our artificial neurons, highly convenient. In addition, we point out that the ingredients used by our algorithms stem from very general properties found in many other excitable systems, namely temporal summation and spike latency. Thus, we expect that the algorithms introduced here can be implemented in many other systems.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors on reasonable request.

Author contributions

AM performed the numerical simulations. AM, LC, and SB analyzed the results and wrote the article. All authors contributed to the development of the ideas and results presented in this work and approved the submitted version of the manuscript.

Funding

The authors acknowledge the support of the French Agence Nationale de la Recherche (ANR), under grant ANR-19-CE24-0006 (project ANACONDA).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abderrahmane, N., Lemaire, E., and Miramond, B. (2020). Design space exploration of hardware spiking neurons for embedded artificial intelligence. Neural Netw. 121, 366–386. doi: 10.1016/j.neunet.2019.09.024

PubMed Abstract | CrossRef Full Text | Google Scholar

Barbay, S., Kuszelewicz, R., and Yacomotti, A. M. (2011). Excitability in a semiconductor laser with saturable absorber. Opt. Lett. 36, 4476–4478. doi: 10.1364/OL.36.004476

PubMed Abstract | CrossRef Full Text | Google Scholar

Bonilla, L., Gautrais, J., Thorpe, S., and Masquelier, T. (2022). Analyzing time-to-first-spike coding schemes: a theoretical approach. Front. Neurosci. 16:971937. doi: 10.3389/fnins.2022.971937

PubMed Abstract | CrossRef Full Text | Google Scholar

Bueno, J., Maktoobi, S., Froehly, L., Fischer, I., Jacquot, M., Larger, L., et al. (2018). Reinforcement learning in a large-scale photonic recurrent neural network. Optica 5:756. doi: 10.1364/OPTICA.5.000756

CrossRef Full Text | Google Scholar

Cucchi, M., Gruener, C., Petrauskas, L., Steiner, P., Tseng, H., Fischer, A., et al. (2021). Reservoir computing with biocompatible organic electrochemical networks for brain-inspired biosignal classification. Sci. Adv. 7:eabh0693. doi: 10.1126/sciadv.abh0693

PubMed Abstract | CrossRef Full Text | Google Scholar

Dubbeldam, J. L. A., and Krauskopf, B. (1999). Self-pulsations of lasers with saturable absorber : dynamics and bifurcations. Opt. Commun. 159:325. doi: 10.1016/S0030-4018(98)00568-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Feldmann, J., Youngblood, N., Wright, C. D., Bhaskaran, H., and Pernice, W. H. P. (2019). All-optical spiking neurosynaptic networks with self-learning capabilities. Nature 569, 208–214. doi: 10.1038/s41586-019-1157-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Fujii, H., Ito, H., Aihara, K., Ichinose, N., and Tsukada, M. (1996). Dynamical cell assembly hypothesis-theoretical possibility of spatio-temporal coding in the cortex. Neural Netw. 9, 1303–1350. doi: 10.1016/S0893-6080(96)00054-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Gardner, B., and Grüning, A. (2021). Supervised learning with first-to-spike decoding in multilayer spiking neural networks. Front. Comput. Neurosci. 15:617862. doi: 10.3389/fncom.2021.617862

PubMed Abstract | CrossRef Full Text | Google Scholar

Guo, W., Fouda, M. E., Eltawil, A. M., and Salama, K. N. (2021). Neural coding in spiking neural networks: a comparative study for robust neuromorphic systems. Front. Neurosci. 15:638474. doi: 10.3389/fnins.2021.638474

PubMed Abstract | CrossRef Full Text | Google Scholar

Heil, P., and Neubauer, H. (2003). A unifying basis of auditory thresholds based on temporal summation. Proc. Natl. Acad. Sci. 100, 6151–6156. doi: 10.1073/pnas.1030017100

PubMed Abstract | CrossRef Full Text | Google Scholar

Hejda, M. C. V., Alanis, J. A., Ortega-Piwonka, I., Figueiredo, J., Javaloyes, J., Romeira, B., et al. (2022). Resonant tunneling diode nano-optoelectronic excitable nodes for neuromorphic spike-based information processing. Phys. Rev. Appl. 17:024072. doi: 10.1103/PhysRevApplied.17.024072

CrossRef Full Text | Google Scholar

Kheradpisheh, S. R., and Masquelier, T. (2020). Temporal backpropagation for spiking neural networks with one spike per neuron. Int. J. Neural Syst. 30:2050027. doi: 10.1142/S0129065720500276

PubMed Abstract | CrossRef Full Text | Google Scholar

Koch, C. (2004). Biophysics of Computation: Information Processing in Single Neurons. New York, NY: Oxford University Press.

Google Scholar

Larger, L., Baylón-Fuentes, A., Martinenghi, R., Udaltsov, V. S., Chembo, Y. K., and Jacquot, M. (2017). High-speed photonic reservoir computing using a time-delay-based architecture: million words per second classification. Phys. Rev. X 7:011015. doi: 10.1103/PhysRevX.7.011015

CrossRef Full Text | Google Scholar

Lugnan, A., Katumba, A., Laporte, F., Freiberger, M., Sackesyn, S., Ma, C., et al. (2020). Photonic neuromorphic information processing and reservoir computing. APL Photon. 5:020901. doi: 10.1063/1.5129762

PubMed Abstract | CrossRef Full Text | Google Scholar

Ma, P. Y., Shastri, B. J., de Lima, T. F., Tait, A. N., Nahmias, M. A., and Prucnal, P. R. (2017). All-optical digital-to-spike conversion using a graphene excitable laser. Opt. Exp. 25, 33504–33513. doi: 10.1364/OE.25.033504

CrossRef Full Text | Google Scholar

Maass, W. (1997). Networks of spiking neurons: the third generation of neural network models. Neural Netw. 10, 1659–1671. doi: 10.1016/S0893-6080(97)00011-7

CrossRef Full Text | Google Scholar

Nahmias, M., Shastri, B., Tait, A., and Prucnal, P. (2013). A leaky integrate-and-fire laser neuron for ultrafast cognitive computing. IEEE J. Sel. Topics Quantum Electron. 19, 1–12. doi: 10.1109/JSTQE.2013.2257700

CrossRef Full Text | Google Scholar

Nakajima, K. (2020). Physical reservoir computing-an introductory perspective. Jpn. J. Appl. Phys. 59:060501. doi: 10.35848/1347-4065/ab8d4f

CrossRef Full Text | Google Scholar

Owen-Newns, D., Robertson, J., Hejda, M., and Hurtado, A. (2023). GHz rate neuromorphic photonic spiking neural network with a single vertical-cavity surface-emitting laser (VCSEL). IEEE J. Sel. Top. Quant. Electron. 29, 1–10. doi: 10.1109/JSTQE.2022.3205716

CrossRef Full Text | Google Scholar

Pammi, V. A., Alfaro-Bittner, K., Clerc, M. G., and Barbay, S. (2020). Photonic computing with single and coupled spiking micropillar lasers. IEEE J. Sel. Top. Quantum Electron. 26, 1–7. doi: 10.1109/JSTQE.2019.2929187

CrossRef Full Text | Google Scholar

Park, S., Kim, S., Na, B., and Yoon, S. (2020). “T2FSNN: deep spiking neural networks with time-to-first-spike coding,” in Proceedings of the 57th ACM/EDAC/IEEE Design Automation Conference, DAC 9220, Virtual Event USA (IEEE Press), 1–6. doi: 10.1109/DAC18072.2020.9218689

CrossRef Full Text | Google Scholar

Price, D. D., Hu, J. W., Dubner, R., and Gracely, R. H. (1977). Peripheral suppression of first pain and central summation of second pain evoked by noxious heat pulses. Pain 3, 57–68. doi: 10.1016/0304-3959(77)90035-5

PubMed Abstract | CrossRef Full Text | Google Scholar

Robertson, J., Kirkland, P., Alanis, J. A., Hejda, M., Bueno, J., Caterina, G. D., et al. (2022). Ultrafast neuromorphic photonic image processing with a VCSEL neuron. Sci. Rep. 12:4874. doi: 10.1038/s41598-022-08703-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Selmi, F., Braive, R., Beaudoin, G., Sagnes, I., Kuszelewicz, R., and Barbay, S. (2014). Relative refractory period in an excitable semiconductor laser. Phys. Rev. Lett. 112:183902. doi: 10.1103/PhysRevLett.112.183902

PubMed Abstract | CrossRef Full Text | Google Scholar

Selmi, F., Braive, R., Beaudoin, G., Sagnes, I., Kuszelewicz, R., and Barbay, S. (2015). Temporal summation in a neuromimetic micropillar laser. Opt. Lett. 40, 5690–5693. doi: 10.1364/OL.40.005690

PubMed Abstract | CrossRef Full Text | Google Scholar

Selmi, F., Braive, R., Beaudoin, G., Sagnes, I., Kuszelewicz, R., Erneux, T., et al. (2016). Spike latency and response properties of an excitable micropillar laser. Phys. Rev. E 94:042219. doi: 10.1103/PhysRevE.94.042219

PubMed Abstract | CrossRef Full Text | Google Scholar

Skalli, A., Robertson, J., Owen-Newns, D., Hejda, M., Porte, X., Reitzenstein, S., et al. (2022). Photonic neuromorphic computing using vertical cavity semiconductor lasers. Opt. Mater. Exp. 12:2395. doi: 10.1364/OME.450926

PubMed Abstract | CrossRef Full Text | Google Scholar

Stöckl, C., and Maass, W. (2021). Optimized spiking neurons can classify images with high accuracy through temporal coding with two spikes. Nat. Mach. Intell. 3, 230–238. doi: 10.1038/s42256-021-00311-4

CrossRef Full Text | Google Scholar

Tanaka, G., Yamane, T., Héroux, J. B., Nakane, R., Kanazawa, N., Takeda, S., et al. (2019). Recent advances in physical reservoir computing: a review. Neural Netw. 115, 100–123. doi: 10.1016/j.neunet.2019.03.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Terrien, S., Pammi, V. A., Broderick, N. G. R., Braive, R., Beaudoin, G., Sagnes, I., et al. (2020). Equalization of pulse timings in an excitable microlaser system with delay. Phys. Rev. Res. 2:023012. doi: 10.1103/PhysRevResearch.2.023012

CrossRef Full Text | Google Scholar

Thorpe, S., Delorme, A., and Rullen, R. V. (2001). Spike-based strategies for rapid processing. Neural Netw. 14, 715–725. doi: 10.1016/S0893-6080(01)00083-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Thorpe, S., and Gautrais, J. (1998). Rank Order Coding. Boston, MA: Springer US, 113–118. doi: 10.1007/978-1-4615-4831-7_19

CrossRef Full Text | Google Scholar

Torrejon, J., Riou, M., Araujo, F. A., Tsunegi, S., Khalsa, G., Querlioz, D., et al. (2017). Neuromorphic computing with nanoscale spintronic oscillators. Nature 547, 428–431. doi: 10.1038/nature23011

PubMed Abstract | CrossRef Full Text | Google Scholar

Van Rullen, R., Gautrais, J., Delorme, A., and Thorpe, S. (1998). Face processing using one spike per neurone. Biosystems 48, 229–239. doi: 10.1016/S0303-2647(98)00070-7

PubMed Abstract | CrossRef Full Text | Google Scholar

Warrant, E. J. (1999). Seeing better at night: life style, eye design and the optimum strategy of spatial and temporal summation. Vision Res. 39, 1611–1630. doi: 10.1016/S0042-6989(98)00262-4

PubMed Abstract | CrossRef Full Text | Google Scholar

Yamada, M. (1993). A theoretical analysis of self-sustained pulsation phenomena in narrow-stripe semiconductor lasers. IEEE J. Quant. Electron. 29, 1330–1336. doi: 10.1109/3.236146

CrossRef Full Text | Google Scholar

Keywords: photonic hardware, temporal coding, rank-order code, spiking neurons, microlasers, receptive fields

Citation: Masominia A, Calvet LE, Thorpe S and Barbay S (2023) Online spike-based recognition of digits with ultrafast microlaser neurons. Front. Comput. Neurosci. 17:1164472. doi: 10.3389/fncom.2023.1164472

Received: 12 February 2023; Accepted: 13 June 2023;
Published: 03 July 2023.

Edited by:

Qian Lou, University of Central Florida, United States

Reviewed by:

Matej Hejda, University of Strathclyde, United Kingdom
Luke Theogarajan, University of California, Santa Barbara, United States
Bruno Romeira, International Iberian Nanotechnology Laboratory (INL), Portugal

Copyright © 2023 Masominia, Calvet, Thorpe and Barbay. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Amir Masominia, amir-hossein.masominia@c2n.upsaclay.fr

ORIGINAL RESEARCH article

Online spike-based recognition of digits with ultrafast microlaser neurons

1. Introduction

2. Methods

2.1. Model for spiking microlaser neuron

2.2. Input coding

3. Results

3.1. Horizontal and vertical receptive fields

3.2. Feature detection

3.3. Output coding with spike latency

3.4. Classification

3.4.1. Event base coding

3.4.2. Spike-time coding

3.4.3. Rank coding

4. Discussion and conclusion

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher's note

References

This article is part of the Research Topic

People also looked at