
1 Summary

Neural activity is modulated through learning, i.e., long-term adaptation of synaptic weights. However, it remains unresolved how weights are adapted across the cortex to effectively solve a given task. A key question is how to assign credit to synapses that are situated deep within a hierarchical network. In deep learning, backpropagation (BP) is the current state of the art for solving this issue and may potentially serve as an inspiration for neuroscience. Applying BP to cortical processing is, however, non-trivial, as it entails several biologically implausible requirements. For example, it requires information to be buffered for use at different stages of processing. Additionally, errors are propagated through weights that must be mirrored at synapses in different layers, giving rise to the weight transport problem. Furthermore, artificial neural networks (ANNs) operate in separate forward and backward phases, with inference and learning alternating strictly.

We introduce Phaseless Alignment Learning (PAL) [4], a biologically plausible technique for learning effective top-down weights across layers in cortical hierarchies. We propose that cortical networks can learn useful backward weights by exploiting a ubiquitous resource of the brain: noise. Although usually treated as a disruptive factor, noise can be leveraged by the feedback pathway as an additional carrier of information for synaptic plasticity.

PAL describes a fully dynamic system that effectively addresses all of the aforementioned problems: it models the dynamics of biophysical substrates, all computations are carried out using information locally available at the synapses, learning occurs in a completely phase-less manner, and plasticity is always on for all synapses, both forward and backward. Our approach is consistent with biological observations and facilitates efficient learning without the need for wake-sleep phases or other forms of phased plasticity found in many other models of cortical learning.

PAL can be applied to a broad range of models and represents an improvement over previously known biologically plausible methods of credit assignment. For instance, when compared to feedback alignment (FA), PAL can solve complex tasks with fewer neurons and more effectively learn useful latent representations. We illustrate this by conducting experiments on various classification tasks using a cortical dendrite microcircuit model [7], which leverages the complexity of neuronal morphology and is capable of prospective coding [2].

2 Theory

PAL utilises the noise present in physical neurons as information is sent across the cortical hierarchy (see Fig. 1 (a)). Neuronal dynamics are described in a rate-based coding scheme of a network with \(\ell ~=~1 \, \ldots \, N\) layers,

$$\begin{aligned} \tau \, \boldsymbol{\dot{u}}_\ell = - \boldsymbol{u}_\ell + \boldsymbol{W}_{\ell ,\ell -1} \boldsymbol{r}_{\ell -1} + \boldsymbol{e}_\ell + \boldsymbol{\xi }_\ell \,, \end{aligned}$$
(1)

with bottom-up input \(\boldsymbol{W}_{\ell ,\ell -1} \boldsymbol{r}_{\ell -1}\), and noise \(\boldsymbol{\xi }_\ell \); the local error signal \(\boldsymbol{e}_\ell \) is used to update forward weights through \(\boldsymbol{\dot{W}}_{\ell ,\ell -1} \propto \boldsymbol{e}_\ell \, \boldsymbol{r}_{\ell -1}^T\). Errors are passed down from higher layers through top-down synapses \(\boldsymbol{B}_{\ell ,\ell +1} \) via \(\boldsymbol{e}_\ell = \varphi ' \cdot \, \boldsymbol{B}_{\ell ,\ell +1} \, \boldsymbol{e}_{\ell +1} \).

As suggested in [7], the different terms in Eq. (1) correspond to the different compartments of a pyramidal neuron, and the error is transported as the difference in firing rates of pairs of pyramidal and interneurons.
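To make the dynamics concrete, the following minimal sketch integrates Eq. (1) with an explicit Euler step and applies the forward plasticity rule at every time step. The noise model (Gaussian white noise), the rate function (tanh), the layer sizes, and the learning rates are illustrative assumptions, and the compartmental structure of the microcircuit model in [7] is abstracted away here.

```python
# Minimal sketch of Eq. (1) and the forward plasticity rule (illustrative constants).
import numpy as np

rng = np.random.default_rng(0)
tau, dt, eta_w, sigma = 10.0, 0.1, 1e-3, 0.1   # membrane time constant, Euler step, learning rate, noise amplitude
phi = np.tanh                                   # rate function r = phi(u) (assumption)

sizes = [5, 20, 10]                             # example layer sizes
W = [rng.normal(0.0, 0.1, size=(sizes[l + 1], sizes[l])) for l in range(len(sizes) - 1)]
u = [np.zeros(n) for n in sizes]                # membrane potentials per layer

def euler_step(r_in, err):
    """One Euler step of Eq. (1) plus the forward weight update.

    r_in -- input rates r_0
    err  -- list of per-layer error signals e_l (err[0] is unused)
    """
    r = [r_in] + [phi(u[l]) for l in range(1, len(sizes))]
    for l in range(1, len(sizes)):
        xi = sigma * rng.standard_normal(sizes[l])         # intrinsic noise xi_l
        du = -u[l] + W[l - 1] @ r[l - 1] + err[l] + xi     # Eq. (1)
        u[l] += (dt / tau) * du
        W[l - 1] += eta_w * np.outer(err[l], r[l - 1])     # dW ∝ e_l r_{l-1}^T

# example call: euler_step(np.ones(5), [None, np.zeros(20), np.zeros(10)])
```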

Fig. 1.

PAL aligns weight updates with backpropagation in hierarchical cortical networks. (a) Cortical pyramidal cells as functional units of sensory processing and credit assignment. Bottom-up (\(\boldsymbol{W}_{\ell +1,\ell }\)) and top-down (\(\boldsymbol{B}_{\ell ,\ell +1}\)) projections preferentially target different dendrites. Due to the stochastic dynamics of individual neurons, noise is added to the signal. (b) We train the backward projections in a deep, dendritic microcircuit network of multi-compartment neurons with layer sizes [5-20-10-20-5] using our method PAL. All backward weights \(\boldsymbol{B}_{\ell ,\ell +1}\) are learned simultaneously, while forward weights are fixed. Forward weights are initialised s.t. neurons are activated in their linear regime. (c) Same as (b), but with weights initialised in the non-linear regime. (d) In a simple teacher-student task with a neuron chain [1-1-1] of dendritic microcircuits, PAL is able to flip the sign of backward weights, which is crucial for successful reproduction of the teaching signal. (e) PAL solves a teacher-student task where feedback alignment fails. The teaching signal (red dashed) requires positive forward weights, whereas all student networks are initialised with negative \(\boldsymbol{W}_{1,0}\). Note that PAL only learns the correct forward weights once the backward weights have flipped sign (at epoch \(\sim \) 500). (f-h) PAL learns useful latent representations on the MNIST autoencoder task, whereas FA leads to poor feature separation. We train a network [784-200-2-200-784] using leaky-integrator neurons on the MNIST autoencoder task: (f) Shown are the activations after training in the two-neuron layer for all samples in the test set; colors encode the corresponding label. BP and PAL show improved feature separation compared to FA. (g) Linear separability of latent activations. (h) Alignment angle of top-down weights to all layers for networks trained with PAL. PAL is able to adapt top-down weights while forward weights are learned at the same time. All curves show mean and standard deviation over 5 seeds.

PAL learns from the noise \(\boldsymbol{\xi }_\ell \) accumulated on top of a stimulus signal as it passes through the network. Backprojections are learned using high-pass-filtered rates \(\widehat{\boldsymbol{r}}_{\ell +1}\) through the rule

$$\begin{aligned} {\boldsymbol{\dot{B}}}_{\ell ,\ell +1} \propto \boldsymbol{\xi }_\ell \; \big ( {\widehat{\boldsymbol{r}}}_{\ell +1} \big )^T - \alpha \, {\boldsymbol{B}}_{\ell ,\ell +1} \;. \end{aligned}$$
(2)

By exploiting the autocorrelation properties of neuronal noise, this learning rule dynamically achieves approximate alignment \(\boldsymbol{B}_{\ell ,\ell +1} \, || \, \boldsymbol{W}_{\ell +1,\ell }^T\) for all layers simultaneously, without interrupting the learning of forward weights (see Fig. 1 (b,c)). This allows networks that implement PAL to efficiently learn all weights (feedforward and feedback) without phases, in contrast to many bio-inspired learning rules found in the literature (e.g., Difference Target Propagation and variants [1, 3], AGREL [5, 6], Equilibrium Propagation [8]).
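A corresponding sketch of the backprojection update in Eq. (2) is given below. Here the high-pass-filtered rates \(\widehat{\boldsymbol{r}}_{\ell +1}\) are approximated by subtracting a running low-pass from the instantaneous rates, and the locally injected noise sample \(\boldsymbol{\xi }_\ell \) is assumed to be available at the synapse; names and constants are illustrative, not the exact filter used in [4].

```python
# Minimal sketch of the backward-weight rule in Eq. (2) (illustrative constants).
import numpy as np

eta_b, alpha = 1e-3, 1e-4      # learning rate and weight decay (assumptions)
dt, tau_hp = 0.1, 10.0         # time step and high-pass time constant (assumptions)

def pal_backward_step(B, xi_l, r_up, r_up_lp):
    """One plasticity step for B_{l,l+1} following Eq. (2).

    B       -- backward weights, shape (n_l, n_{l+1})
    xi_l    -- noise injected into layer l at this time step, shape (n_l,)
    r_up    -- current rates of layer l+1, shape (n_{l+1},)
    r_up_lp -- running low-pass of r_{l+1}, carried between calls
    """
    r_up_lp = r_up_lp + (dt / tau_hp) * (r_up - r_up_lp)   # track slow component of the rate
    r_hat = r_up - r_up_lp                                  # high-pass-filtered rate r_hat_{l+1}
    B = B + eta_b * (np.outer(xi_l, r_hat) - alpha * B)     # Eq. (2)
    return B, r_up_lp
```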

3 Results

We have evaluated PAL on various tasks; for an excerpt of the results, see Fig. 1 (b-h). Additionally, we benchmark PAL on standard tests such as the MNIST digit classification task, where the dendritic microcircuit model (network size [784-100-10]) achieves a final test error of \(3.9 \pm 0.2\) % with PAL and \(4.7 \pm 0.1\) % with FA. We emphasize that our results were achieved through simulation of a fully dynamic, recurrent system that is biologically plausible. Weight and voltage updates were applied at every time step, and populations of multi-compartment neurons were used as a bio-plausible error transport mechanism. Our findings demonstrate that PAL can efficiently learn all weights and outperforms FA on tasks involving classification and latent space separation.

We argue that PAL can be realized in biological and, more generally, physical components. Specifically, it capitalizes on the inherent noise present in physical systems and leverages simple filtering techniques to distinguish between signal and noise where necessary. A realization of PAL (or a variant thereof) in physical form, whether in the cortex or on neuromorphic systems, constitutes an elegant solution to the weight transport problem while enabling efficient learning with purely local computations.