11 1 Backpropagation. 11 2 Multilayer Perceptron R – S 1 – S 2 – S 3 Network

Backpropagation

Multilayer Perceptron

R – S1 – S2 – S3 Network

Example

Elementary Decision Boundaries

First Subnetwork

First Boundary:a1

1hardlim 1– 0 p 0.5+ =

Second Boundary:

hardlim 0 1– p 0.75+ =

Elementary Decision Boundaries

Third Boundary:

Fourth Boundary:

Second Subnetwork

hardlim 1 0 p 1.5– =

hardlim 0 1 p 0.25– =

Total Network

1– 0

0 1–1 0

0.50.75

1.5–0.25–

W2 1 1 0 0

0 0 1 1= b2 1.5–

1.5–=

1 1= b30.5–=

Function Approximation Example

1 en–

+-----------------=

10= w 2 11

10= b11

10–= b21

1= w1 22

Nominal Parameter Values

Nominal Response

-2 -1 0 1 2-1

Parameter Variations

-2 -1 0 1 2-1

1– w 1 12

1– w1 22

1– b2

Multilayer Network

m 1++ = m 0 2 M 1– =

Performance Index

p1 t1{ , } p2 t2{ , } pQ tQ{ , }

Training Set

F x E e2 = E t a– 2 =

Mean Square Error

F x E eTe = E t a–

Tt a– =

Vector Case

F̂ x t k a k – T t k a k – eT k e k = =

Approximate Mean Square Error (Single Sample)

w i jm

k 1+ wi jm

w i jm

------------–= bim

k 1+ bim

---------–=

Approximate Steepest Descent

Chain Rule

f n w dwd

-----------------------f n d

nd--------------

n w dwd

---------------=

f n n cos= n e2w

= f n w e2w cos=

f n w dwd

-----------------------f n d

nd--------------

n w dwd

--------------- n sin– 2e2w e

2w sin– 2e2w = = =

Example

Application to Gradient Calculation

w i jm

------------

---------ni

------------= F̂

---------F̂

---------ni

---------=

Gradient Calculation

ajm 1–

Sm 1–

------------ a jm 1–

--------- 1=

sim F̂

---------

Sensitivity

w i jm

------------ sim

ajm 1–

--------- si

Gradient

Steepest Descent

k 1+ wi jm

a jm 1–

–= bim

k 1+ bim

k 1+ Wm

m 1–

T–= bm

k 1+ bmk sm–=

sm F̂

----------

---------

-----------

Next Step: Compute the Sensitivities (Backpropagation)

Jacobian Matrix

-----------------

n1m 1+

----------------

n1m 1+

----------------

n1m 1+

----------------

n2m 1+

----------------

n2m 1+

----------------

n2m 1+

----------------

Sm 1+m 1+

----------------

nSm 1+m 1+

----------------

nSm 1+m 1+

----------------

nim 1+

----------------

wi lm 1+

bim 1+

----------------------------------------------------------- wi jm 1+ a j

---------= =

nim 1+

---------------- wi jm 1+ f m n j

--------------------- wi jm 1+

n jm = =

f m n jm

---------------------=

nm----------------- Wm 1+ FÝ

mnm = FÝ

n1m 0 0

0 fÝm

0 0 fÝm

Backpropagation (Sensitivities)

sm F̂

nm---------- n

-----------------

----------------- FÝ

mnm Wm 1+

-----------------= = =

m( ) W

The sensitivities are computed by starting at the last layer, andthen propagating backwards through the network to the first layer.

sM 1–

Initialization (Last Layer)

siM F̂

----------t a–

Tt a–

---------------------------------------

tj a j– 2

----------------------------------- 2 ti ai– –ai

----------= = = =

2FÝMn

M( ) t a– –=

----------ai

----------f

----------------------- fÝM

n iM = = =

2 ti ai– – fÝM

n iM =

Summary

m 1++ = m 0 2 M 1– =

2FÝMn

M( ) t a– –=

m( ) W

m 1+= m M 1– 2 1 =

k 1+ Wm

m 1–

T–= b

mk 1+ b

Forward Propagation

Backpropagation

Weight Update

Example: Function Approximation

g p 14---p

1-2-1Network

Network

1-2-1Network

Initial Conditions

W10 0.27–

0.41–= b1

0 0.48–

0.13–= W2

0 0.09 0.17–= b20 0.48=

Network ResponseSine Wave

-2 -1 0 1 2-1

Forward Propagation

p 1= =

a1 f1 W1a0 b1+ l ogsig 0.27–

0.41–1

0.48–

0.13–+

logsig 0.75–

0.54–

1 e0.75+--------------------

1 e0.54+--------------------

0.368= =

f2 W2a1 b2

+ purelin 0.09 0.17–0.321

0.3680.48+( ) 0.446= = =

e t a– 1 4---p

a2– 1 4---1

0.446– 1.261= = = =

Transfer Function Derivatives

1 en–

+----------------- e

1 en–

------------------------ 11

1 en–

+-----------------–

1 en–

+-----------------

– a1 = = = =

dn 1= =

Backpropagation

2FÝ2n

2( ) t a– – 2 fÝ

2 1.261 – 2 1 1.261 – 2.522–= = = =

s 1 FÝ1n1

( ) W2 Ts 2 1 a1

1– a1

0 1 a21

– a21

0.17–2.522–= =

s1 1 0.321– 0.321 0

0 1 0.368– 0.368 0.090.17–

2.522–=

s 1 0.218 0

0 0.233

0.227–

0.0495–

0.0997= =

Weight Update

W21 W2

0 s2 a1 T

– 0.09 0.17– 0.1 2.522– 0.321 0.368–= =

W21 0.171 0.0772–=

b21 b2

0 s2– 0.48 0.1 2.522–– 0.732= = =

W11 W1

0 s 1 a0 T

– 0.27–

0.41–0.1 0.0495–

0.09971– 0.265–

0.420–= = =

b11 b1

0 s1– 0.48–

0.13–0.1 0.0495–

0.0997– 0.475–

0.140–= = =

Choice of Architecture

g p 1i 4----- p sin+=

-2 -1 0 1 2-1

1-3-1 Network

i = 1 i = 2

i = 4 i = 8

Choice of Network Architecture

g p 16 4

------ p sin+=

-2 -1 0 1 2-1

1-2-1 1-3-1

Convergence

g p 1 p sin+=

-2 -1 0 1 2-1

Generalization

p1 t1{ , } p2 t2{ , } pQ tQ{ , }

g p 14---p

sin+= p 2– 1.6– 1.2– 1.6 2 =

-2 -1 0 1 2-1

1-2-1 1-9-1

11 1 Backpropagation. 11 2 Multilayer Perceptron R – S 1 – S 2 – S 3 Network

Documents

1 Review – Backpropagation Backpropagation is the most well know and widely used neural network system It is a multi-layered, feedfoward, perceptron-like

Artificial Neural Networks for The Perceptron, Madaline ... · Artificial Neural Networks for The Perceptron, Madaline, and Backpropagation Family Bernard Widrow and Michael A. Lehr

Neural Networks. Plan Perceptron Linear discriminant Associative memories Hopfield networks Chaotic networks Multilayer perceptron Backpropagation

JST Backpropagation

ANALISIS PERBANDINGAN PENGENALAN TANDA · PDF fileANALISIS PERBANDINGAN PENGENALAN TANDA TANGAN DENGAN MENGGUNAKAN METODE PERCEPTRON DAN BACKPROPAGATION Skripsi Sebagai Salah Satu

Introduction to AI · · 2018-03-22환경 구축 / python tutorial 2. python / numpy tutorial 3. perceptron 구현 4. MLP & backpropagation ... 실습1 Assignment: python codecademy(8-Loops까지)

Learning strategies for neuronal nets - the ... · The backpropagation learning algorithm • The class map is done by a multi-layer perceptron, which produces a 1 on the output in

Lecture 4 Backpropagation - ttic.uchicago.eduttic.uchicago.edu/~shubhendu/Pages/Files/Lecture4_flat.pdf · Lecture 4 Backpropagation CMSC 35246. A General View of Backpropagation

PRESENTACIÓN DEL CURSO - ocw.ehu.eus · Red Multicapa Redes Neuronales ... Redes Perceptron –Tres Capas Redes Neuronales Artificiales y sus Aplicaciones. ... Construcción Backpropagation

Multilayer Perceptron Backpropagation Hagan

Adaptive SP & Machine Intelligence Advanced Learning ... networks (NNs), multilayer perceptron, the backpropagation algorithm, and nonlinear separation of patterns From feedforward

Multilayer Perceptron perceptron.pdf · Multilayer Perceptron ... input x belongs to C 1. Perceptron is cosmetically similar to logistic ... Learning Boolean XOR A simple perceptron

EASy Summer 2004Non-Symbolic AI lec 81 Non-Symbolic AI lecture 8 Backpropagation in a multi-layer perceptron

Linear Classification: The Perceptron - Penn Engineeringcis519/fall2017/...Improving the Perceptron • The Perceptron produces many θ‘s during training • The standard Perceptron

Artificial Neural Networks (Cont.) Chapter 4 Perceptron Gradient Descent Multilayer Networks Backpropagation Algorithm 1

Multilayer Perceptron (MLP): the Backpropagation (BP) Algorithm

−Multi-layer Perceptron −Forward Pass −Backpropagation

DISEÑO DEL REGISTRO SONICO SINTETICO (RSS) …tangara.uis.edu.co/biblioweb/tesis/2008/125938.pdf · 2.8 backpropagation ... esquema perceptron simple. 68 figura 24 ... esquema perceptron

Backpropagation - elvex.ugr.eselvex.ugr.es/decsai/deep-learning/slides/NN3 Backpropagation.pdf · Backpropagation Fernando Berzal, berzal@acm.org Backpropagation Redes neuronales

Introduction to Backpropagation Networks · Backpropagation Networks Introduction to Backpropagation - In 1986 a method for learning in multi-layer network, Backpropagation, was invented