Abstract
Computation of the direction of motion and the detection of collisions are important features of autonomous robotic systems for course steering and avoidance manoeuvres. Current approaches typically rely on computing these features in software using algorithms implemented on a microprocessor. However, the power consumption, computational latency and form factor limit their applicability. In this work we take inspiration from motion detection studied in the Drosophila visual system to implement an alternative. The nervous system of the Drosophila contains 150000 neurons [1] and computes information in a parallel fashion. We propose a topology comprising a dynamic vision sensor (DVS) which provides input to spiking neural networks (SNN). The network is realised through interconnecting leaky-integrate and fire (LIF) complementary metal oxide semiconductor (CMOS) neurons with hafnium dioxide (HfO2) based resistive random access memories (RRAM) acting as the synaptic connections between them. A genetic algorithm (GA) is used to optimize the parameters of the network, within an experimentally determined range of RRAM conductance values, and through simulation it is demonstrated that the system can compute the direction of motion of a grating. Finally, we demonstrate that by modulating RRAM conductances and adjusting network component time constants the range of grating velocities to which it is most sensitive can be adapted. It is also shown that this allows for the system to reduce power consumption when sensitive to lower velocity stimulus. This mimics the behavior observed in Drosophila whereby the neuromodulator octopamine adjusts the response of the motion detection system when the insect is resting or flying.
Access provided by CONRICYT-eBooks. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
In recent years detailed partial connectomes of insect neural networks have been produced. An example is the elementary motion detection (EMD) network of Drosophila’s visual system [2,3,4]. The insect has two large eyes composed of repeating hexagonal columns called ommatidia. Each ommatidia has an identical structure which processes visual information from a small region of the full visual field. It is composed of four distinct layers of neuropil - the retina, lamina, medulla and the lobula as shown in Fig. 1(a). The neural pathway for detecting elementary motion within this structure is depicted schematically in Fig. 1(b). It begins at the retina where cells transduce light into electrical signals. These then synapse onto L1 and L2 cells in the lamina. At this stage the information is rectified into ON and OFF pathways - ON pathways carry information on luminescence increments and the OFF pathways luminescence decrements. The L1 and L2 cells in the lamina synapse predominantly onto the Mi1, Tm3 and Tm2, Tm1 cells in the medulla. These cells are implicated in implementing temporal delays between adjacent activity in spatial regions in the visual field [5]. The pairs of cells in the medulla synapse onto T4 and T5 cells in the lobula which are then excited if a spatiotemporal correlation exists between it’s presynaptic cells indicating motion. Subsets of T4 and T5 cells are sensitive to motion in one of the four cardinal directions up, down, left and right and terminate independently in one of four distinct layers in the lobula plate (LP). Groups of lobula plate tangential cells (LPTC) are excited by activity in one layer of the LP and fire to indicate motion in a specific direction. Another group of lobula plate/lobula columnar type two (LPLC2) cells are excited only by diverging motion in each layer of the LP such that they indicate a looming stimulus on a collision course [6]. Furthermore, it has also been found that the neuromodulator octopamine tunes dynamics of the EMDs in the Drosophila visual system as a function of whether the insect is resting or flying [7,8,9]. This allows the insect to adapt its sensitivity to different velocities of stimulus as well as reduce power consumption whilst in a resting state. In Fig. 1(c) tshe response of an LTPC cell is reported for Drosophila stimulated with a moving grating when it is in resting and flying states. The area under the curve for the insect in its resting state is greatly reduced relative to that of its flying state which is thought to be an evolutionary adaptation to optimize its consumption of energy.
The Hassenstein-Reichardt EMD (HR-EMD) is a popular model which reproduces experimental observations of Drosophila’s EMDs [10]. Photoexcitation at adjacent regions in the visual field propogates signals through crossing low pass filters before being recombined at a multiplication unit to detect spatiotemporal correlations. The output of the unit can be either positive or negative indicating motion in one of two directions as in Fig. 1(d). A number of previous works have implemented the HR-EMD in analogue very large scale integrated (aVLSI) systems for point [11], 1D [12], 2D [13] and rotational motion [14] detection but the platform is yet to be adopted for applications. Another perspective on hardware based motion detection are token and feature based EMDs [15,16,17,18]. Here we propose an alternative approach which stays true to the guiding principle of detecting spatiotemporal correlations but takes advantage of the ability to integrate resistive memory with CMOS technology and recent insights into the structural organization of Drosophila’s elementary motion detection neural network. We use a dynamic vision sensor (DVS) to provide input for a SNN composed of CMOS LIF neurons interconnected by RRAM synapses to perform direction detection and detect collisions. First we present the network topology followed by relevant electrical measurements of RRAM matrices and finally demonstrate through simulation the ability of the network to compute the direction of motion of a grating.
2 The Network
This section introduces the sub-units of the full system individually before outlining how they fit together.
2.1 Delay and Correlate Network
To detect motion in one direction along a single dimension a temporal delay can be implemented between two spatially adjacent inputs followed by a downstream mechanism for detecting spatiotemporal correlations. In Drosophila this delay is thought to be implemented between pairs of neurons in the medulla [5] and the correlation performed by the postsynaptic neurons in the lobula. Delay can be implemented in a LIF CMOS neuron through controlling the input time constant of the neuron and a correlation operation can be performed by parameterizing a neuron to fire only when two presynaptic spikes arrive within a short time window. These features are captured by the one dimensional delay and correlate spiking neural network shown in Fig. 2(a). Neurons in the input layer fire to denote spatial activity and synapse with excitatory connections onto a second delay layer. In this layer the input time constant of the B1 and B3 neurons are larger than the central B2 neuron. B1 and B2 synapse with excitatory connections onto the correlator neuron C1 in the output layer and B3 synapses with an inhibitory connection. As depicted in Fig. 2(b) if the input layer is excited in the sequence A1, A2 then A3 along its preferred direction the firing times of B1 and B2 should be similar and the firing time of B3 will be delayed. This allows for the two excitatory spikes from B1 and B2 to arrive at C1 within a short time window such that it fires. For motion against the preferred direction, excitation in the sequence A3, A2 then A1, the firing times of B3 and B2 will be similar and that of B1 will be delayed. Consequently C1 will not be sufficiently excited given the combination of inhibition and temporal separation of the excitatory input. In a third case where all input elements are excited simultaneously B1 and B3 will fire at the same time, negating each others contribution, such that C1 does not receive sufficient excitation from B2 alone to fire. For detection of motion in two dimensions four identical 1D delay and correlate SNNs, sharing the same central low input time constant node, can be arranged orthogonally. If the input neurons are spatially arranged in a cross then spatiotemporal correlations can be detected corresponding to UP, DOWN, LEFT and RIGHT motion within a small region of the visual field. A 2D delay and correlate SNN and the spatial organization of its input is shown in Fig. 3. To extract information from an entire visual field it then follows that an array of 2D delay and correlate SNNs can be connected to an input such that they receive spatially corresponding excitation from across the visual field.
2.2 Readout Network
Inspired by the connectivity pattern observed in Drosophila which allows for motion and collision detection we connect the 2D delay and correlate SNN array to a readout network of five neurons in a similar manner. The connection pattern between the 2D delay and correlate SNNs across the visual field and the readout network is depicted in Fig. 4. To readout the direction of motion four neurons, modelling the LPTCs, corresponding to UP, DOWN, LEFT and RIGHT motion are excited by the corresponding directional output of each of the 2D delay and correlate SNNs across the visual field. Additionally, one further neuron, modelling the LPLC2 neuron is included. This neuron splits the visual field into four quadrants. From each quadrant excitatory connections are made between the 2D delay and correlate SNN directions which diverge from the centre of the visual field and inhibitory connections from directions corresponding to movement converging towards the centre. A looming stimulus on a collision course is defined as an expanding dark edge from the centre to the edges of the visual field and therefore this neuron will fire to denote looming.
2.3 System Architecture
Activity in the visual field excites a DVS camera [19, 20] which propagates spikes presynaptically to spatially corresponding 2D delay and correlate SNNs using the address event representation (AER) protocol [21]. Two dimensional spatiotemporal correlations of the input data are detected by the 2D delay and correlate SNNs before their outputs are integrated together in a readout layer which computes the direction of motion and detects impending collisions.
3 Relevant Properties of RRAM Synapses
In oxide based resistive random access memories (RRAM) the storage element consists of a transition metal oxide (TMO) material - sandwiched between metal electrodes - which can be switched to two distinct stable conductance values. The low resistance state (LRS) or the high resistance state (HRS) can be achieved through SET or RESET operations respectively by applying electric fields of opposite polarities between the electrodes. RRAM devices can be integrated in matrices [22] whereby each device in the matrix is connected in series at the drain of a transistor (1T1R). This allows for each memory device to be read from and written to individually. Each 1T1R structure in the matrix is addressed using the bit line (BL) and source line (SL) which connect to the top electrode of the device and the source of the transistor respectively and a wordline (WL) which connects to the gate of the transistor. The WL voltage therefore regulates the transistor drain source current, in this case termed the programming current (ICC), which determines device conductance and variability in this value after a SET operation. The method of connecting LIF CMOS neurons together through a RRAM matrix is depicted in Fig. 5. Connections between neurons are defined by a device in LRS whereas non-connections are defined by a device in HRS and specific devices can be programmed individually. In this manner specific feed-forward network topologies can be realised. The data plotted in Fig. 6 corresponds to measurements from a 4kbit matrix of HfO2 based TMO RRAM devices with titanium nitride (TiN) electrodes integrated in a 130 nm CMOS technology node [23]. In Fig. 6(a) the cumulative distribution function (CDF) of the conductance values per RRAM device state in the matrix is plotted whereby the LRS and HRS conductance value of each device is shown. Additionally, in Fig. 6(b) the effect of varying the gate voltage of the transistor, therefore the programming current, on the mean value of the LRS conductance CDF is shown. Crucially, the ability to modulate RRAM device conductance allows emulation of neural plasticity [24] - an essential property of neural networks for learning, adapting and maintaining homeostasis.
4 Simulation Results
A simulated DVS of resolution 10 \(\times \) 10 is stimulated with a horizontal grating moving upwards vertically over its input in time providing OFF pathway spikes for the network topology. Noise is also simulated through setting each DVS pixel to spike with a probability of 0.025 per step of the grating. The frequency of the grating is defined as pixels crossed per second of simulation time. Twenty 2D delay and correlate SNNs are used to span the visual field each containing thirteen neurons and twenty one synapses. In total the topology requires 256 neurons and 580 synapses for a DVS of resolution 10 \(\times \) 10. The simulation advances in discrete time steps at which the synaptic currents and neuron input and output voltages are updated with respect to their values at the previous timestep in accordance with the LIF neuron model and the first order dynamic synapse model [25]. The desired outcome is for the readout layer neuron denoting upward motion to fire at an elevated rate relative to the others. UP, DOWN, LEFT and RIGHT correspond to the number of times each readout neuron fires within one vertical sweep of the grating. The F1 score corresponding to correctly identified upward motion, defined in Eq. 1, is used to assess the network performance. F1 score ranges from a minimum value of zero to a maximum value of one. The grating is swept over the input at a range of frequencies and the F1 score is calculated per frequency. We refer to the plot of F1 score with grating frequency as the sensitivity tuning curve (STC).
Genetic algorithms (GA) are bio-inspired approaches to multi-parameter optimization problems simulating the process of natural selection to arrive at a near optimal solution [26] and are readily applied to optimize neural networks [27]. Here a GA is used to set the free parameters of the network. Therein, the conductances and time constants of the synapses and the threshold voltage and the input time constants of the neurons for the repeating 2D delay and correlate SNN and the readout network. The parameters for each of the four orthogonal 1D delay and correlate SNNs are the same such that each EMD is composed of four identical rotations of the same network sharing a common central node. Similarly, parameters for all readout layer neurons and synapses are set equal such that the full parameter space has twenty-four dimensions. The values for RRAM conductance are bounded between \(5\times 10^{-5}\)S and \(3\times 10^{-4}\)S corresponding to the range of values obtained through measurement of RRAM LRS with different values of ICC as in Fig. 6(b). Sixty networks are created per generation. First generation parameters are assigned by sampling from a uniform distribution between a lower and upper bound per parameter. A STC is produced for each of the sixty networks and the ten with the largest area under the curve (AUC) in addition to two randomly selected ones are recombined in pairs to produce the next generation. Parameters are assigned with a certain probability of mutation. Hard mutations occur with a probability of 0.05 whereby the parameter is randomly assigned a value from a uniform distribution between an upper and lower bound. A soft mutation occurs with a probability of 0.5 whereby the parameter is reassigned from a normal distribution around the value inherited from the parent with a standard deviation of 1%. The optimization terminates when the F1 score plateaus. Using the parameters of the best performing network after termination of the genetic optimization ten sensitivity tuning curves were obtained with ten instances of the topology and the means were plotted since the performance of each instance differs slightly due to the modelled inherent RRAM variability and simulated DVS noise. In total two network configurations were found inspired by Drosophila using octopamine to adapt its elementary motion detection neural network between a flying and resting state. One configuration is optimized to detect lower frequency gratings and the other to detect higher frequency gratings - they are referred to as the Slow and Fast network states respectively. The Slow and Fast configurations are plotted with a red dashed and green trace in Fig. 7(a) which shows that the Slow state accurately detects the direction of motion within a range of grating frequencies of 0.7−3 Hz while the Fast state does so within a range between 2–20 Hz. Further, with greater correspondence to experimental results reported in Fig. 1(c), a firing-rate tuning curve (FTC) is plotted in Fig. 7(b) corresponding to the frequency at which the UP neuron fires in the Slow and Fast network states. Since the area under the FTCs of Fig. 7(b) is greater over the range of grating frequencies when the network is in the Fast state than the Slow state the power consumed by the readout network in the Slow state is reduced by 42% relative to that of the Fast state. In Fig. 8(a) the schematic of the repeating structure across the topology is depicted. Figure 8(b) and (c) report the synapse and neuron parameters used to achieve the results of Fig. 7. To arrive at these values for the Slow state the network was optimized with the described genetic algorithm to have the greatest area under the STC within the range of 1–3 Hz. On this occasion values of the RRAM conductances were multiplied by one and a half times and a second genetic optimization to maximize the area under the STC between 10–25 Hz was performed with only the synapse and neuron input time constants as free parameters. The result of Fig. 7 shows that, like Drosophila, the range of velocities of stimulus to which the system is most sensitive to can be adapted by tuning the RRAM synapse conductances and the network component time constants. Additionally when the network is switched to the Slow state the power consumption of the system is reduced since it no longer responds to non-relevant higher velocity stimulus.
5 Conclusions
Making use of resistive memory and spiking neural networks an alternative approach to elementary motion detection was proposed. A DVS camera provided input on the visual field to an array of delay and correlate spiking neural networks composed of simulated LIF CMOS neurons and RRAM synapses based on HfO2 technology to compute the direction of motion and to detect collisions. The network was parameterized using a genetic algorithm and then through simulation sensitivity tuning curves demonstrated that it was able to detect the motion of a grating over a range of frequencies. As observed in Drosophila through tuning RRAM conductances and network component time constants the range of grating frequencies to which it is most sensitive to can be adapted. In addition it was seen that, like Drosophila, when the topology was parameterized such that it was sensitive to lower velocity stimulus the power consumption was reduced relative to the configuration for higher velocity stimulus.
References
Chiang, A.S., Lin, C.Y., Chuang, C.C., Chang, H.M., Hsieh, C.H., Yeh, C.W., Shih, C.T., Wu, J.J., Wang, G.T., Chen, Y.C., Wu, C.C., Chen, G.Y., Ching, Y.T., Lee, P.C., Lin, C.Y., Lin, H.H., Wu, C.C., Hsu, H.W., Huang, Y.A., Chen, J.Y., Chiang, H.J., Lu, C.F., Ni, R.F., Yeh, C.Y., Hwang, J.K.: Three-dimensional reconstruction of brain-wide wiring networks in drosophila at single-cell resolution. Curr. Biol. 21(1), 1–11 (2011)
Takemura, S., Bharioke, A., Lu, Z., Nern, A., Vitaladevuni, S., Rivlin, P.K., Katz, W.T., Olbris, D.J., Plaza, S.M., Winston, P., Zhao, T., Horne, J.A., Fetter, R.D., Takemura, S., Blazek, K., Chang, L.A., Ogundeyi, O., Saunders, M.A., Shapiro, V., Sigmund, C., Rubin, G.M., Scheffer, L.K., Meinertzhagen, I.A., Chklovskii, D.B.: A visual motion detection circuit suggested by drosophila connectomics. Nature 500, 175 (2013)
Maisak, M.S., Haag, J., Ammer, G., Serbe, E., Meier, M., Leonhardt, A., Schilling, T., Bahl, A., Rubin, G.M., Nern, A., Dickson, B.J., Reiff, D.F., Hopp, E., Borst, A.: A directional tuning map of drosophila elementary motion detectors. Nature 500, 212 (2013)
Haag, J., Arenz, A., Serbe, E., Gabbiani, F., Borst, A.: Complementary mechanisms create direction selectivity in the fly. eLife 5, e17421 (2016)
Behnia, R., Clark, D.A., Carter, A.G., Clandinin, T.R., Desplan, C.: Processing properties of ON and OFF pathways for drosophila motion detection. Nature 512, 427 (2014)
Klapoetke, N.C., Nern, A., Peek, M.Y., Rogers, E.M., Breads, P., Rubin, G.M., Reiser, M.B., Card, G.M.: Ultra-selective looming detection from radial motion opponency. Nature 551, 237 (2017)
Jung, S.N., Borst, A., Haag, J.: Flight activity alters velocity tuning of fly motion-sensitive neurons. J. Neurosci. 31(25), 9231–9237 (2011)
Suver, M., Mamiya, A., Dickinson, M.: Octopamine neurons mediate flight-induced modulation of visual processing in drosophila. Curr. Biol. 22(24), 2294–2302 (2012)
Arenz, A., Drews, M.S., Richter, F.G., Ammer, G., Borst, A.: The temporal tuning of the drosophila motion detectors is determined by the dynamics of their input elements. Curr. Biol. 27(7), 929–944 (2017)
Hassenstein, V., Reichardt, W.: Systemtheoretische Analyse der Zeit-, Reihenfolgen- und Vorzeichenauswertung bei der Bewegungsperzeption des Rüsselkäfers Chlorophanus. Z. Naturforsch. B 11, 513 (1956)
Harrison, R.R., Koch, C.: An analog VLSI model of the fly elementary motion detector. In: Jordan, M.I., Kearns, M.J., Solla, S.A. (eds.) Advances in Neural Information Processing Systems 10, pp. 880–886. MIT Press, Cambridge (1998)
Liu, S.C.: A neuromorphic a VLSI model of global motion processing in the fly. IEEE Trans. Circ. Syst. II Analog Digital Signal Proces. 47(12), 1458–1467 (2000)
Harrison, R.R.: A biologically inspired analog IC for visual collision detection. IEEE Trans. Circ. Syst. I Regul. Pap. 52(11), 2308–2318 (2005)
Plett, J., Bahl, A., Buss, M., Kühnlenz, K., Borst, A.: Bio-inspired visual ego-rotation sensor for MAVs. Biol. Cybern. 106(1), 51–63 (2012)
Krammer, J.: Compact integrated motion sensor with three-pixel interaction. IEEE Trans. Pattern Anal. Mach. Intell. 44(2), 86–101 (1996)
Krammer, J., Koch, C.: Pulse-based analog VLSI velocity sensors. IEEE Trans. Cir. Syst. II Analog Digit. Signal Process. 44(2), 86–101 (1997). https://doi.org/10.1109/82.554431
Sarpeshkar, R., Kramer, J., Indiveri, G., Koch, C.: Analog VLSI architectures for motion processing: from fundamental limits to system applications. Proc. IEEE 84(7), 969–987 (1996). https://doi.org/10.1109/5.503298
Shoemaker, P.A.: Implementation of Visual motion detection in analog neuromorphic circuitry - a case study of the issue of circuit precision. Proc. IEEE 102(10), 1557–1570 (2014)
Lichtsteiner, P., Posch, C., Delbruck, T.: A 128 \(\times \) 128 120 dB 15 \(\upmu \)s latency asynchronous temporal contrast vision sensor. IEEE J. Solid-State Circ. 43(2), 566–576 (2008)
Serrano-Gotarredona, T., Linares-Barranco, B.: A 128 \(\times \)128 1.5% contrast sensitivity 0.9% FPN 3 \(\upmu \)s latency 4 mW asynchronous frame-free dynamic vision sensor using transimpedance preamplifiers. IEEE J. Solid-State Circ. 48(3), 827–838 (2013)
Chan, V., Liu, S.C., van Schaik, A.: AER EAR: a matched silicon cochlea pair with address event representation interface. IEEE Trans. Circ. Syst. I Regul. Pap. 54(1), 48–59 (2007). https://doi.org/10.1109/TCSI.2006.887979
Grossi, A., Vianello, E., Zambelli, C., Royer, P., Noel, J.P., Giraud, B., Perniola, L., Olivo, P., Nowak, E.: Experimental investigation of 4kbit RRAM arrays programming conditions suitable for TCAM. IEEE Trans. Very Large Scale Integr. (VLSI) Syst. PP(99), 1–9 (2018)
Grossi, A., Nowak, E., Zambelli, C., Pellissier, C., Bernasconi, S., Cibrario, G., Hajjam, K.E., Crochemore, R., Nodin, J.F., Olivo, P., Perniola, L.: Fundamental variability limits of filament-based RRAM. In: 2016 IEEE International Electron Devices Meeting (IEDM), pp. 4.7.1–4.7.4, December 2016. https://doi.org/10.1109/IEDM.2016.7838348
Garbin, D., Vianello, E., Bichler, O., Rafhay, Q., Gamrat, C., Ghibaudo, G., DeSalvo, B., Perniola, L.: HfO\(_{2}\)-based OxRAM devices as synapses for convolutional neural networks. IEEE Trans. Electron Devices 62(8), 2494–2501 (2015). https://doi.org/10.1109/TED.2015.2440102
Brette, R., Rudolph, M., Carnevale, T., Hines, M., Beeman, D., Bower, J.M., Diesmann, M., Morrison, A., Goodman, P.H., Harris, F.C., Zirpe, M., Natschläger, T., Pecevski, D., Ermentrout, B., Djurfeldt, M., Lansner, A., Rochel, O., Vieville, T., Muller, E., Davison, A.P., El Boustani, S., Destexhe, A.: Simulation of networks of spiking neurons: a review of toolsand strategies. J. Comput. Neurosci. 23(3), 349–398 (2007)
Simon, D.: Evolutionary Optimization Algorithms. Wiley, Hoboken (2013)
Montana, D.J., Davis, L.: Training feedforward neural networks using genetic algorithms. In: IJCAI 1989 Proceedings of the 11th International Joint Conference on Artificial Intelligence - Volume 1, pp. 762–767. Morgan Kaufmann Publishers Inc., San Francisco (1989)
Acknowledgments
The authors would like to acknowledge the support of J. Casas through the CARNOT chair of excellency in bio-inspired technologies. In addition this work was also partially supported by the h2020 NeuRAM3 project.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Dalgaty, T. et al. (2018). Insect-Inspired Elementary Motion Detection Embracing Resistive Memory and Spiking Neural Networks. In: Vouloutsi , V., et al. Biomimetic and Biohybrid Systems. Living Machines 2018. Lecture Notes in Computer Science(), vol 10928. Springer, Cham. https://doi.org/10.1007/978-3-319-95972-6_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-95972-6_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-95971-9
Online ISBN: 978-3-319-95972-6
eBook Packages: Computer ScienceComputer Science (R0)