
Learning in large-scale spiking neural networks

Trevor Bekolay

Center for Theoretical Neuroscience, University of Waterloo

Master’s thesis presentation, August 17, 2011

Outline

Unsupervised learning

Supervised learning

Reinforcement learning

Unsupervised learning

The problem

How does the brain learn without feedback?

Neurons connect at synapses

[Figure: synapse diagram — axon terminal (presynaptic), synaptic cleft, and dendritic spine (postsynaptic), with neurotransmitter, neurotransmitter receptors, and a voltage-gated calcium channel labelled]

Synaptic strengths change

Definitions

This is called synaptic plasticity.

When a synapse gets stronger, it is potentiated.

When a synapse gets weaker, it is depressed.

Modelling LTP/LTD

[Figure: left — % change in population EPSP amplitude over time (hours) following two ~100 Hz stimulations for 10 s; right — % change in EPSP slope over time (minutes) following 3 Hz stimulation for 5 min]

BCM rule

$\Delta\omega_{ij} = \kappa\, a_i a_j (a_j - \theta)$
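As a reading aid, here is a minimal rate-based sketch of that update (NumPy; the function names, parameter values, and the squared-activity sliding threshold are illustrative assumptions, not details from the thesis):

```python
import numpy as np

def bcm_update(w, a_pre, a_post, theta, kappa=1e-4):
    """One BCM step: dw_ij = kappa * a_i * a_j * (a_j - theta_j).

    w      : (n_post, n_pre) weight matrix
    a_pre  : (n_pre,) presynaptic activities a_i
    a_post : (n_post,) postsynaptic activities a_j
    theta  : (n_post,) modification thresholds
    """
    return w + kappa * np.outer(a_post * (a_post - theta), a_pre)

def update_threshold(theta, a_post, tau=100.0, dt=1.0):
    # Sliding threshold: theta tracks a running average of squared postsynaptic
    # activity, so sustained high activity raises the threshold and favours depression.
    return theta + (dt / tau) * (a_post ** 2 - theta)
```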


Modelling LTP/LTD with timing

[Figure: presynaptic neuron i and postsynaptic neuron j connected by weight w_ij, with spike times t_pre^i and t_post^j; STDP curves show the change in w_ij over a ±40 ms window for pre-post and post-pre pairings]
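For comparison, a minimal pair-based STDP window of the standard exponential form (the amplitudes and time constants below are generic illustrative values, not parameters from the thesis):

```python
import numpy as np

def stdp_dw(dt, A_plus=0.01, A_minus=0.012, tau_plus=0.020, tau_minus=0.020):
    """Weight change for one spike pair, with dt = t_post - t_pre in seconds.

    Pre-before-post (dt > 0) potentiates; post-before-pre (dt < 0) depresses.
    """
    if dt >= 0:
        return A_plus * np.exp(-dt / tau_plus)
    return -A_minus * np.exp(dt / tau_minus)
```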

What’s the difference?

Theorem: If spike trains have Poisson statistics, STDP can be mapped onto a BCM rule (Izhikevich, 2003; Pfister & Gerstner, 2006).


Test with a simulation

$\Delta\omega_{ij} = \kappa\, a_i\, \phi(a_j, \theta)$

[Figure: simulation protocol — a presynaptic pulse and a postsynaptic pulse separated by a controlled pulse delay and repeated at a given pulse rate]

Recreated STDP curve

[Figure: % change in ω_ij vs t_post − t_pre (seconds) at a 20 Hz pulse rate, for θ = 0.30, 0.33, and 0.36]

Exhibits frequency dependence

[Figure: % change in ω_ij vs t_post − t_pre (seconds) at a 40 Hz pulse rate]

Frequency dependence details

[Figure: % change in ω_ij vs stimulation frequency (1–100 Hz). Panels: simulation with post-pre pairs (θ = 0.30 and θ = 0.33); experiment (Kirkwood), light-reared vs dark-reared; simulation with pre-post pairs (θ = 0.30 and θ = 0.33)]

Supervised learning

The problem

Given inputs x and outputs y, find G such that G(x) = y

Learning nonlinear functions

Function                              Input dims   Output dims   Neurons (learning network)   Neurons (control network)
f(x) = x1x2                           2            1             420                          420
f(x) = x1x2 + x3x4                    4            1             756                          756
f(x) = [x1x2, x1x3, x2x3]             3            3             756                          756
f(x) = [x1, x2] ⊗ [x3, x4]            4            2             700                          704 (2-layer), 1500 (3-layer)
f(x) = [x1, x2, x3] ⊗ [x4, x5, x6]    6            3             800                          804 (2-layer), 1700 (3-layer)
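Assuming ⊗ above denotes circular convolution (the usual binding operation in NEF models; this is an interpretation, not stated on the slide), it can be computed with FFTs:

```python
import numpy as np

def circ_conv(a, b):
    # Circular convolution via the FFT: a (x) b = IFFT(FFT(a) * FFT(b)).
    return np.real(np.fft.ifft(np.fft.fft(a) * np.fft.fft(b)))

# Example: binding two 2-D vectors yields a 2-D vector, matching the table above.
print(circ_conv(np.array([1.0, 0.5]), np.array([0.2, -0.3])))
```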

Simulation results

[Figure: relative error (learning vs. analytic) over learning time (seconds) for f(x) = x1x2, f(x) = x1x2 + x3x4, f(x) = [x1x2, x1x3, x2x3], f(x) = [x1, x2] ⊗ [x3, x4], and f(x) = [x1, x2, x3] ⊗ [x4, x5, x6]; the last two are compared against 2-layer and 3-layer controls]

Simulation results

[Figure: time to learn (seconds) as a function of input dimensions (1–7), output dimensions (1–3), and input + output dimensions (3–9)]

~2.25 hours to learn a 500-D → 500-D function

Large-scale models

Neural Engineering Framework

1. Representation (encoding and decoding)

2. Transformation

Representation

[Figure: encoding and decoding of a time-varying signal over 0.5 seconds — spike trains encode the signal and a decoded estimate is reconstructed from them]

Encoding vectors represent neuron sensitivity

[Figure: LIF tuning curves, and the same tuning curves projected in 2-D space]
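A minimal sketch of both halves of representation for a 1-D value: tuning curves encode x into rates, and regularized least-squares decoders recover an estimate of x. For brevity a rectified-linear response stands in for the LIF curve, and the gains, biases, and regularization constant are illustrative choices, not the thesis's:

```python
import numpy as np

rng = np.random.default_rng(0)
n_neurons, n_points = 50, 200

# Encoding: rate_i(x) = G[alpha_i * e_i * x + b_i], with a rectified-linear G
# standing in for the LIF response curve.
encoders = rng.choice([-1.0, 1.0], size=n_neurons)   # e_i (in 1-D, just a sign)
gains = rng.uniform(0.5, 2.0, size=n_neurons)        # alpha_i
biases = rng.uniform(-1.0, 1.0, size=n_neurons)      # b_i

x = np.linspace(-1, 1, n_points)
A = np.maximum(0.0, gains[:, None] * encoders[:, None] * x[None, :] + biases[:, None])

# Decoding: find decoders d minimizing ||d^T A - x||^2 with L2 regularization.
reg = 0.1 * A.max()
gram = A @ A.T + (reg ** 2) * n_points * np.eye(n_neurons)
decoders = np.linalg.solve(gram, A @ x)

x_hat = decoders @ A   # decoded estimate of x across the sampled range
```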

Plasticity in the NEF

$\Delta\omega_{ij} = \kappa\, a_i\, \alpha_j\, e_j \cdot E$

where

- $e_j$ is the encoding vector and
- $E$ is the error to be minimized.
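A minimal sketch of applying that rule for a vector error E, assuming the presynaptic activities, per-neuron gains α_j, and encoders e_j are available (the names and the learning rate are illustrative):

```python
import numpy as np

def nef_error_update(w, a_pre, gains, encoders, error, kappa=1e-4):
    """dw_ij = kappa * a_i * alpha_j * (e_j . E)

    w        : (n_post, n_pre) connection weights
    a_pre    : (n_pre,) presynaptic activities a_i
    gains    : (n_post,) gains alpha_j
    encoders : (n_post, dims) encoding vectors e_j
    error    : (dims,) error signal E to be minimized
    """
    ej_dot_E = encoders @ error   # e_j . E for each postsynaptic neuron
    return w + kappa * np.outer(gains * ej_dot_E, a_pre)
```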

Calculating E online

[Figure: network diagram for calculating E online — input x, output y, an error population, a connection computing f(y) = −y, and a plastic connection computing G(x) = ?; legend: excitatory/inhibitory, modulatory, and plastic connections]

Our solution

[Figure: the same network diagram — input x, output y, error population, f(y) = −y, and the plastic connection G(x) = ?]

Network provides E

+ EM-rule: $\Delta\omega_{ij} = \kappa\, a_i\, \alpha_j\, e_j \cdot E$

Reinforcement learning

[Figure: reinforcement learning task diagram — states D, G/Go, and A, a left/right (L, R) choice, and outcomes Rw and Rt]


Behavioural results

[Figure: probability of choosing each direction over trials, between "always move left" and "always move right", as reward contingencies change (L:21% R:63%, L:63% R:21%, L:12% R:72%, L:72% R:12%); shown as an average over 200 runs, one experimental run, and one simulation run]

Action selection (basal ganglia)

$\pi(s) = \arg\max_a Q(s_{t+1}, a_{t+1})$

Changing Q-values

Ventral striatum calculates reward prediction error
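A minimal tabular sketch of the computation this corresponds to: the reward prediction error drives the Q-value update, and action selection takes the highest-valued action. The learning rate, discount factor, and greedy policy are standard textbook choices, not details of the thesis model:

```python
import numpy as np

def select_action(Q, s):
    # pi(s) = argmax_a Q(s, a)
    return int(np.argmax(Q[s]))

def q_update(Q, s, a, reward, s_next, alpha=0.1, gamma=0.9):
    # Reward prediction error: delta = r + gamma * max_a' Q(s', a') - Q(s, a)
    delta = reward + gamma * np.max(Q[s_next]) - Q[s, a]
    Q[s, a] += alpha * delta
    return delta
```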

Neural results

[Figure: experimental and simulation spike rasters (neuron × trial) across the Delay, Approach, Reward, and Delay task phases]

Contributions

- Shown evidence that BCM captures qualitative features of STDP

- Proposed a biologically plausible solution to the supervised learning problem

- Created a model that solves a stochastic reinforcement learning task and matches biology

A unifying learning rule

$\Delta\omega_{ij} = \alpha_j a_i \left[ \kappa_u a_j (a_j - \theta) + \kappa_s\, e_j \cdot E \right]$
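A minimal sketch of applying the unified rule above, combining the unsupervised (BCM-like) term weighted by κ_u with the supervised (error-modulated) term weighted by κ_s (names and values are illustrative):

```python
import numpy as np

def unified_update(w, a_pre, a_post, gains, encoders, error, theta,
                   kappa_u=1e-5, kappa_s=1e-4):
    """dw_ij = alpha_j * a_i * [kappa_u * a_j * (a_j - theta) + kappa_s * (e_j . E)]"""
    unsup = kappa_u * a_post * (a_post - theta)   # BCM-like term, per postsynaptic neuron
    sup = kappa_s * (encoders @ error)            # error-modulated term, e_j . E
    return w + np.outer(gains * (unsup + sup), a_pre)
```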

Acknowledgements

Thank you for listening and (possibly) reading!
