Classification III
Lecturer: Dr. Bo Yuan
E-mail: yuanb@sz.tsinghua.edu.cn
Overview
Artificial Neural Networks
Biological Motivation
10^11: the number of neurons in the human brain
10^4: the average number of connections of each neuron
10^-3 s: the fastest switching time of neurons
10^-10 s: the switching speed of computers
10^-1 s: the time required to visually recognize your mother
Biological Motivation
The power of parallelism
The information processing abilities of biological neural systems follow from highly parallel processes operating on representations that are distributed over many neurons
The motivation of ANN is to capture this kind of highly parallel computation based on distributed representations
Sequential machines vs Parallel machines
Group A: using ANN to study and model biological learning processes.
Group B: obtaining highly effective machine learning algorithms, regardless of how closely these algorithms mimic biological processes.
Neural Network Representations
Robot vs. Human
Perceptrons
[Diagram: a perceptron — inputs x1, x2, …, xn with weights w1, w2, …, wn, a bias weight w0 on the fixed input x0 = 1, a summation unit, and a threshold.]

$$o = \begin{cases} 1 & \text{if } \sum_{i=0}^{n} w_i x_i > 0 \\ 0 & \text{otherwise} \end{cases}$$

Equivalently, written out over the individual inputs:

$$o(x_1, \dots, x_n) = \begin{cases} 1 & \text{if } w_0 + w_1 x_1 + \cdots + w_n x_n > 0 \\ 0 & \text{otherwise} \end{cases}$$
Power of Perceptrons
AND (w0 = −0.8, w1 = 0.5, w2 = 0.5):

x1 x2 | sum  | output
 0  0 | −0.8 | 0
 0  1 | −0.3 | 0
 1  0 | −0.3 | 0
 1  1 |  0.2 | 1

OR (w0 = −0.3, w1 = 0.5, w2 = 0.5):

x1 x2 | sum  | output
 0  0 | −0.3 | 0
 0  1 |  0.2 | 1
 1  0 |  0.2 | 1
 1  1 |  0.7 | 1
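These weight settings can be checked directly against the decision rule above; a minimal Python sketch (the function name is an assumption, not from the slides):

```python
# Threshold unit from the previous slide: fires iff w0 + sum(wi*xi) > 0.
def perceptron(x, w):
    s = w[0] + sum(wi * xi for wi, xi in zip(w[1:], x))
    return 1 if s > 0 else 0

AND_W = [-0.8, 0.5, 0.5]   # slide's AND weights: w0=-0.8, w1=w2=0.5
OR_W  = [-0.3, 0.5, 0.5]   # slide's OR weights:  w0=-0.3, w1=w2=0.5

for x in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(x, "AND:", perceptron(x, AND_W), "OR:", perceptron(x, OR_W))
# Only (1, 1) fires the AND unit; every input but (0, 0) fires the OR unit.
```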
Error Surface
[Figure: the error E plotted as a surface over the weight space (w1, w2).]
Gradient Descent
$$E(\vec{w}) = \frac{1}{2} \sum_{d \in D} (t_d - o_d)^2$$

$$\nabla E(\vec{w}) = \left[ \frac{\partial E}{\partial w_0}, \frac{\partial E}{\partial w_1}, \dots, \frac{\partial E}{\partial w_n} \right]$$

$$\vec{w} \leftarrow \vec{w} + \Delta \vec{w}, \quad \text{where } \Delta \vec{w} = -\eta \nabla E(\vec{w})$$

$$w_i \leftarrow w_i + \Delta w_i, \quad \text{where } \Delta w_i = -\eta \frac{\partial E}{\partial w_i}$$
Learning Rate
Batch Learning
Delta Rule
$$\begin{aligned}
\frac{\partial E}{\partial w_i} &= \frac{\partial}{\partial w_i} \frac{1}{2} \sum_{d \in D} (t_d - o_d)^2 \\
&= \frac{1}{2} \sum_{d \in D} 2\,(t_d - o_d) \frac{\partial}{\partial w_i} (t_d - o_d) \\
&= \sum_{d \in D} (t_d - o_d) \frac{\partial}{\partial w_i} (t_d - \vec{w} \cdot \vec{x}_d) \\
&= \sum_{d \in D} (t_d - o_d)(-x_{id})
\end{aligned}$$

so, for a linear unit $o(\vec{x}) = \vec{w} \cdot \vec{x}$, the weight update is

$$\Delta w_i = \eta \sum_{d \in D} (t_d - o_d)\, x_{id}$$
Batch Learning
GRADIENT_DESCENT(training_examples, η)
Initialize each wi to some small random value.
Until the termination condition is met, Do
  Initialize each Δwi to zero.
  For each <x, t> in training_examples, Do
    • Input the instance x to the unit and compute the output o.
    • For each linear unit weight wi, Do
      – Δwi ← Δwi + η(t − o)xi
  For each linear unit weight wi, Do
    • wi ← wi + Δwi
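For concreteness, a runnable sketch of GRADIENT_DESCENT for a linear unit o = w · x; the toy dataset, η, and the epoch count are illustrative assumptions:

```python
import random

def train_linear_unit(examples, eta=0.05, epochs=500):
    """Batch delta rule: accumulate eta*(t - o)*xi over all examples, then update."""
    n = len(examples[0][0])                      # inputs per example (excluding bias)
    w = [random.uniform(-0.05, 0.05) for _ in range(n + 1)]
    for _ in range(epochs):
        delta = [0.0] * (n + 1)                  # initialize each dwi to zero
        for x, t in examples:
            xb = [1.0] + list(x)                 # prepend the bias input x0 = 1
            o = sum(wi * xi for wi, xi in zip(w, xb))   # linear unit output
            for i in range(n + 1):
                delta[i] += eta * (t - o) * xb[i]
        w = [wi + dwi for wi, dwi in zip(w, delta)]     # one update per pass
    return w

# Toy target t = 1 + 2*x1 - x2 (an assumption for illustration)
data = [((x1, x2), 1 + 2 * x1 - x2) for x1 in (0.0, 1.0) for x2 in (0.0, 1.0)]
print(train_linear_unit(data))                   # approaches [1.0, 2.0, -1.0]
```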
Stochastic Learning
$$w_i \leftarrow w_i + \Delta w_i, \quad \text{where } \Delta w_i = \eta (t - o)\, x_i$$

For example, if xi = 0.8, η = 0.1, t = 1 and o = 0:
Δwi = η(t − o)xi = 0.1 × (1 − 0) × 0.8 = 0.08
[Figure: sets of + and − training examples in the plane.]
Stochastic Learning: NAND

Each row shows: input (x0, x1, x2) and target t; the weights (w0, w1, w2) before the update; the per-input contributions Ci = wi · xi; the sum S = C0 + C1 + C2; the output o (1 if S > threshold, else 0); the error E = t − o; the correction R = LR × E; and the final (updated) weights. Threshold = 0.5, learning rate LR = 0.1.

x0 x1 x2  t |  w0   w1   w2 |  C0   C1   C2 |  S   | o |  E |   R  |  w0   w1   w2
 1  0  0  1 |  0    0    0  |  0    0    0  | 0    | 0 |  1 | +0.1 |  0.1  0    0
 1  0  1  1 |  0.1  0    0  |  0.1  0    0  | 0.1  | 0 |  1 | +0.1 |  0.2  0    0.1
 1  1  0  1 |  0.2  0    0.1|  0.2  0    0  | 0.2  | 0 |  1 | +0.1 |  0.3  0.1  0.1
 1  1  1  0 |  0.3  0.1  0.1|  0.3  0.1  0.1| 0.5  | 0 |  0 |  0   |  0.3  0.1  0.1
 1  0  0  1 |  0.3  0.1  0.1|  0.3  0    0  | 0.3  | 0 |  1 | +0.1 |  0.4  0.1  0.1
 1  0  1  1 |  0.4  0.1  0.1|  0.4  0    0.1| 0.5  | 0 |  1 | +0.1 |  0.5  0.1  0.2
 1  1  0  1 |  0.5  0.1  0.2|  0.5  0.1  0  | 0.6  | 1 |  0 |  0   |  0.5  0.1  0.2
 1  1  1  0 |  0.5  0.1  0.2|  0.5  0.1  0.2| 0.8  | 1 | −1 | −0.1 |  0.4  0    0.1
 1  0  0  1 |  0.4  0    0.1|  0.4  0    0  | 0.4  | 0 |  1 | +0.1 |  0.5  0    0.1
 …
 1  1  0  1 |  0.8 −0.2 −0.1|  0.8 −0.2  0  | 0.6  | 1 |  0 |  0   |  0.8 −0.2 −0.1
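This table can be reproduced in a few lines; the sketch below applies the stochastic rule above with threshold 0.5 and learning rate 0.1 until an error-free pass:

```python
THRESHOLD, LR = 0.5, 0.1
# NAND samples: (x0, x1, x2) with the fixed bias input x0 = 1, and target t
samples = [((1, 0, 0), 1), ((1, 0, 1), 1), ((1, 1, 0), 1), ((1, 1, 1), 0)]

w = [0.0, 0.0, 0.0]
for epoch in range(10):
    errors = 0
    for x, t in samples:
        s = sum(wi * xi for wi, xi in zip(w, x))    # column S = C0 + C1 + C2
        o = 1 if s > THRESHOLD else 0               # column o
        e = t - o                                   # column E = t - o
        if e != 0:
            errors += 1
            w = [wi + LR * e * xi for wi, xi in zip(w, x)]  # correction R = LR * E
    if errors == 0:                                 # a perfect pass: training done
        break
print([round(wi, 1) for wi in w])   # [0.8, -0.2, -0.1], as in the table's last row
```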
Multilayer Perceptron
XOR
$$p \oplus q = (p \lor q) \land \lnot (p \land q) = (p \land \lnot q) \lor (\lnot p \land q)$$

p q | output
0 0 | 0
0 1 | 1
1 0 | 1
1 1 | 0

Cannot be separated by a single line.

[Figure: the two + points and two − points of XOR in the (p, q) plane.]
XOR
$$p \oplus q = (p \lor q) \land \lnot (p \land q)$$

XOR can therefore be computed by a two-layer network: a hidden layer with an OR unit and a NAND unit, feeding an AND output unit.

[Figure: the OR and NAND decision lines jointly separating the + and − points in the (p, q) plane.]
Hidden Layer Representations
Input | Hidden  | Output
p q   | OR NAND | AND
0 0   |  0   1  |  0
0 1   |  1   1  |  1
1 0   |  1   1  |  1
1 1   |  1   0  |  0

[Figure: the network computing XOR — inputs p, q; hidden units OR and NAND; output unit AND.]
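A sketch of this decomposition with explicit threshold units; the OR and AND weights are the ones from the "Power of Perceptrons" slide, and the NAND weights are one arbitrary choice that realizes its truth table:

```python
def unit(x, w):
    """Threshold unit: fires iff w0 + w1*x1 + w2*x2 > 0."""
    return 1 if w[0] + sum(wi * xi for wi, xi in zip(w[1:], x)) > 0 else 0

OR_W   = [-0.3,  0.5,  0.5]
NAND_W = [ 0.8, -0.5, -0.5]   # assumed weights; any setting realizing NAND works
AND_W  = [-0.8,  0.5,  0.5]

def xor(p, q):
    hidden = (unit((p, q), OR_W), unit((p, q), NAND_W))  # hidden layer: OR, NAND
    return unit(hidden, AND_W)                           # output layer: AND

for p, q in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(p, q, xor(p, q))    # 0, 1, 1, 0 -- matches the table above
```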
The Sigmoid Threshold Unit
[Diagram: same structure as the perceptron — inputs x1, …, xn, weights w1, …, wn, bias weight w0 on x0 = 1 — but with a sigmoid in place of the hard threshold.]

$$net = \sum_{i=0}^{n} w_i x_i$$

$$o = \sigma(net) = \frac{1}{1 + e^{-net}}$$

Sigmoid function: $\sigma(y) = \frac{1}{1 + e^{-y}}$, with the convenient derivative

$$\frac{d\sigma(y)}{dy} = \sigma(y)\,(1 - \sigma(y))$$
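The derivative identity is easy to verify numerically; a small sketch comparing it with a finite difference:

```python
import math

def sigmoid(y):
    return 1.0 / (1.0 + math.exp(-y))

y, h = 0.7, 1e-6
analytic = sigmoid(y) * (1.0 - sigmoid(y))               # sigma(y) * (1 - sigma(y))
numeric = (sigmoid(y + h) - sigmoid(y - h)) / (2.0 * h)  # central difference
print(analytic, numeric)                                 # both ~0.2217
```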
Backpropagation Rule
$$\Delta w_{ji} = -\eta \frac{\partial E_d}{\partial w_{ji}}, \qquad E_d(\vec{w}) = \frac{1}{2} \sum_{k \in outputs} (t_k - o_k)^2$$

• xji = the i-th input to unit j
• wji = the weight associated with the i-th input to unit j
• netj = Σi wji xji (the weighted sum of inputs for unit j)
• oj = the output of unit j
• tj = the target output of unit j
• σ = the sigmoid function
• outputs = the set of units in the final layer
• Downstream(j) = the set of units directly taking the output of unit j as inputs

By the chain rule, the derivative factors through netj:

$$\frac{\partial E_d}{\partial w_{ji}} = \frac{\partial E_d}{\partial net_j} \frac{\partial net_j}{\partial w_{ji}} = \frac{\partial E_d}{\partial net_j}\, x_{ji}$$
Training Rule for Output Units
$$\frac{\partial E_d}{\partial net_j} = \frac{\partial E_d}{\partial o_j} \frac{\partial o_j}{\partial net_j}$$

$$\frac{\partial E_d}{\partial o_j} = \frac{\partial}{\partial o_j} \frac{1}{2} \sum_{k \in outputs} (t_k - o_k)^2 = \frac{1}{2}\, 2\,(t_j - o_j) \frac{\partial (t_j - o_j)}{\partial o_j} = -(t_j - o_j)$$

$$\frac{\partial o_j}{\partial net_j} = \frac{\partial \sigma(net_j)}{\partial net_j} = o_j (1 - o_j)$$

$$\frac{\partial E_d}{\partial net_j} = -(t_j - o_j)\, o_j (1 - o_j)$$

$$\Delta w_{ji} = -\eta \frac{\partial E_d}{\partial w_{ji}} = \eta\, (t_j - o_j)\, o_j (1 - o_j)\, x_{ji}$$
Training Rule for Hidden Units
$$\frac{\partial E_d}{\partial net_j} = \sum_{k \in Downstream(j)} \frac{\partial E_d}{\partial net_k} \frac{\partial net_k}{\partial net_j} = \sum_{k \in Downstream(j)} -\delta_k \frac{\partial net_k}{\partial o_j} \frac{\partial o_j}{\partial net_j} = \sum_{k \in Downstream(j)} -\delta_k\, w_{kj}\, o_j (1 - o_j)$$

where $\delta_k = -\frac{\partial E_d}{\partial net_k}$. Defining the hidden error term

$$\delta_j = o_j (1 - o_j) \sum_{k \in Downstream(j)} \delta_k w_{kj}$$

the update rule is

$$\Delta w_{ji} = \eta\, \delta_j\, x_{ji}$$
BP Framework
BACKPROPAGATION(training_examples, η, nin, nout, nhidden)
Create a network with nin inputs, nhidden hidden units and nout output units.
Initialize all network weights to small random numbers.
Until the termination condition is met, Do
  For each <x, t> in training_examples, Do
    • Input the instance x to the network and compute the output o of every unit.
    • For each output unit k, calculate its error term: δk ← ok(1 − ok)(tk − ok)
    • For each hidden unit h, calculate its error term: δh ← oh(1 − oh) Σk∈outputs wkh δk
    • Update each network weight: wji ← wji + Δwji, where Δwji = η δj xji
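Putting the error terms and the update rule together, a compact sketch of this procedure on the XOR task from earlier. The 2-2-1 architecture, η = 0.5, the epoch count, and the seed are illustrative assumptions; a run can occasionally stall in a local minimum (see the next slide), in which case a different seed helps.

```python
import math
import random

random.seed(1)
sig = lambda y: 1.0 / (1.0 + math.exp(-y))

n_in, n_hid = 2, 2
# weight index 0 is the bias weight (fixed input x0 = 1)
w_hid = [[random.uniform(-0.5, 0.5) for _ in range(n_in + 1)] for _ in range(n_hid)]
w_out = [random.uniform(-0.5, 0.5) for _ in range(n_hid + 1)]

data = [((0, 0), 0), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)]
eta = 0.5

for _ in range(20000):
    for x, t in data:
        # forward pass
        xb = [1.0] + list(x)
        h = [sig(sum(w * v for w, v in zip(w_hid[j], xb))) for j in range(n_hid)]
        hb = [1.0] + h
        o = sig(sum(w * v for w, v in zip(w_out, hb)))
        # error terms: dk = ok(1-ok)(tk-ok); dh = oh(1-oh) * sum_k wkh dk
        d_out = o * (1 - o) * (t - o)
        d_hid = [h[j] * (1 - h[j]) * w_out[j + 1] * d_out for j in range(n_hid)]
        # weight updates: wji <- wji + eta * dj * xji
        w_out = [w + eta * d_out * v for w, v in zip(w_out, hb)]
        for j in range(n_hid):
            w_hid[j] = [w + eta * d_hid[j] * v for w, v in zip(w_hid[j], xb)]

for x, t in data:
    xb = [1.0] + list(x)
    hb = [1.0] + [sig(sum(w * v for w, v in zip(w_hid[j], xb))) for j in range(n_hid)]
    print(x, t, round(sig(sum(w * v for w, v in zip(w_out, hb))), 3))  # outputs near t
```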
More about BP Networks …
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (e.g., Genetic Algorithms)
Usually better accuracy
Can do some advanced training (e.g., structure + parameters)
Xin Yao (1999), "Evolving Artificial Neural Networks", Proceedings of the IEEE, pp. 1423–1447
Representational Power
Deep Learning
More about BP Networks …
Overfitting
Tends to occur during later iterations.
Use a validation dataset to terminate training when necessary.
Practical Considerations
Momentum
Adaptive learning rate:
• Small: slow convergence, easy to get stuck
• Large: fast convergence, unstable
$$\Delta w_{ji}(n) = \eta\, \delta_j\, x_{ji} + \alpha\, \Delta w_{ji}(n-1)$$
[Figure: training vs. validation error over time — validation error starts rising while training error keeps falling.]
[Figure: error plotted against a weight, showing local minima.]
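A sketch of how the momentum term accumulates over successive updates; the values of η, α, and the constant gradient are illustrative assumptions:

```python
def momentum_step(w, delta_j, x_ji, prev_dw, eta=0.1, alpha=0.9):
    """One update: the current step plus a fraction alpha of the previous step."""
    dw = eta * delta_j * x_ji + alpha * prev_dw
    return w + dw, dw

w, dw = 0.0, 0.0
for n in range(5):
    w, dw = momentum_step(w, delta_j=0.2, x_ji=1.0, prev_dw=dw)
    print(n, round(w, 4), round(dw, 4))  # steps grow while the gradient keeps its sign
```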
Beyond BP Networks
Elman Network
Temporal XOR: the input stream consists of bit pairs followed by their XOR (0 1 1, 0 0 0, 1 1 0, 1 0 1, …); the network sees one bit at a time and must predict the next bit, so only every third bit (1, 0, 0, 1, …) is predictable.
In: 0 1 1 0 0 0 1 1 0 1 0 1 …
Out: 1 0 0 1 …
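A wiring-only sketch of an Elman step (random, untrained weights; the hidden size is an assumption): the hidden activations are copied into context units and fed back alongside the next input, which is what lets the network carry previous bits forward:

```python
import math
import random

random.seed(0)
sig = lambda y: 1.0 / (1.0 + math.exp(-y))

H = 3                                      # hidden units (assumed)
w_in = [[random.uniform(-1, 1) for _ in range(1 + 1 + H)] for _ in range(H)]
w_out = [random.uniform(-1, 1) for _ in range(1 + H)]

context = [0.0] * H                        # copy of the previous hidden state
for bit in [0, 1, 1, 0, 0, 0, 1, 1]:
    inp = [1.0, float(bit)] + context      # bias + current bit + context units
    hidden = [sig(sum(w * v for w, v in zip(w_in[j], inp))) for j in range(H)]
    out = sig(sum(w * v for w, v in zip(w_out, [1.0] + hidden)))
    context = hidden                       # feed the hidden state back next step
    print(bit, round(out, 3))
```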
Beyond BP Networks
[Figures: a Hopfield network and the energy landscape of a Hopfield network.]
When does ANN work?
Instances are represented by attribute-value pairs; input values can be any real values.
The target output may be discrete-valued, real-valued, or a vector of several real- or discrete-valued attributes.
The training samples may contain errors.
Long training times are acceptable (can range from a few seconds to several hours).
Fast evaluation of the learned target function may be required.
The ability to understand the learned function is not important; weights are difficult for humans to interpret.
Reading Materials
Textbook
Richard O. Duda et al., Pattern Classification, Chapter 6, John Wiley & Sons, Inc.
Tom Mitchell, Machine Learning, Chapter 4, McGraw-Hill.
http://page.mi.fu-berlin.de/rojas/neural/index.html.html
Online Demo
http://neuron.eng.wayne.edu/software.html
http://www.cbu.edu/~pongai/hopfield/hopfield.html
Online Tutorial
http://www.autonlab.org/tutorials/neural13.pdf
http://www.cs.cmu.edu/afs/cs.cmu.edu/user/mitchell/ftp/faces.html
Wikipedia & Google
Review
What is the biological motivation of ANN?
When does ANN work?
What is a perceptron?
How to train a perceptron?
What is the limitation of perceptrons?
How does ANN solve non-linearly separable problems?
What is the key idea of the Backpropagation algorithm?
What are the main issues of BP networks?
What are examples of other types of ANN?
Next Week's Class Talk
Volunteers are required for next week's class talk.
Topic 1: Applications of ANN
Topic 2: Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length: 20 minutes plus question time
Assignment
Topic: Training Feedforward Neural Networks
Technique: BP Algorithm
Task 1: XOR Problem
4 input samples:
• 0 0 → 0
• 1 0 → 1
• …
Task 2: Identity Function
8 input samples:
• 10000000 → 10000000
• 00010000 → 00010000
• …
Use 3 hidden units.
Deliverables:
Report
Code (any programming language, with detailed comments)
Due: Sunday, 14 December
Credit: 15
Overview
Artificial Neural Networks
2
Biological Motivation
3
1011 The number of neurons in the human brain
104 The average number of connections of each neuron
10-3 The fastest switching times of neurons
10-10 The switching speeds of computers
10-1 The time required to visually recognize your mother
Biological Motivation
The power of parallelism
The information processing abilities of biological neural systems follow from highly parallel processes operating on representations that are distributed over many neurons
The motivation of ANN is to capture this kind of highly parallel computation based on distributed representations
Sequential machines vs Parallel machines
Group A
Using ANN to study and model biological learning processes
Group B
Obtaining highly effective machine learning algorithms regardless of how
closely these algorithms mimic biological processes4
Neural Network Representations
5
Robot vs Human
6
Perceptrons
7
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xw
0
0
010
otherwise
xwifo
n
iii
0
01)( 110
1 otherwise
xwxwwifxxo nnn
Power of Perceptrons
8
-08
05 05
-03
05 05
Input sum Output
0 0 -08 0
0 1 -03 0
1 0 -03 0
1 1 03 1
Input sum Output
0 0 -03 0
0 1 02 1
1 0 02 1
1 1 07 1
AND OR
Error Surface
9
Error
w1
w2
Gradient Descent
10
2)(
2
1)(
Dd
dd otwE
nw
E
w
E
w
EwE )(
10
iiiii w
Ewwherewww
Learning Rate
Batch Learning
Delta Rule
11
)()(
)()(
)()(22
1
)(2
1
)(2
1
2
2
idDd
dd
ddiDd
dd
ddiDd
dd
Dddd
i
ddDdii
xot
xwtw
ot
otw
ot
otw
otww
E
iddDd
di xotw )(
xwxo )(
Batch Learning
GRADIENT_DESCENT (training_examples η)
Initialize each wi to some small random value
Until the termination condition is met Do
Initialize each Δwi to zero
For each ltx tgt in training_examples Do
bull Input the instance x to the unit and compute the output o
bull For each linear unit weight wi Do
ndash Δwi larr Δwi + η(t-o)xi
For each linear unit weight wi Do
bull wi larr wi + Δwi
12
Stochastic Learning
13
iiiii xotwwherewww )(
For example if xi=08 η=01 t=1 and o=0
Δwi = η(t-o)xi = 01times(1-0) times 08 = 008
+
-
++
+
--
-+
+
-
-
Stochastic Learning NAND
14
InputTarget
Initial
Weights
Output
Error CorrectionFinal
WeightsIndividual SumFinal
Output
x0 x1 x2 t w0 w1 w2 C0 C1 C2 S o E R w0 w1 w2
x0 w0
x1 w1
x2 w2
C0+C1+C2
t-o LR x E
1 0 0 1 0 0 0 0 0 0 0 0 1 +01 01 0 0
1 0 1 1 01 0 0 01 0 0 01 0 1 +01 02 0 01
1 1 0 1 02 0 01 02 0 0 02 0 1 +01 03 01 01
1 1 1 0 03 01 01 03 01 01 05 0 0 0 03 01 01
1 0 0 1 03 01 01 03 0 0 03 0 1 +01 04 01 01
1 0 1 1 04 01 01 04 0 01 05 0 1 +01 05 01 02
1 1 0 1 05 01 02 05 01 0 06 1 0 0 05 01 02
1 1 1 0 05 01 02 05 01 02 08 1 -1 -01 04 0 01
1 0 0 1 04 0 01 04 0 0 04 0 1 +01 05 0 01
1 1 0 1 08 -2 -1 08 -2 0 06 1 0 0 08 -2 -1threshold=05 learning rate=01
Multilayer Perceptron
15
XOR
16
-
-
))(( pqqpqpqpqp
Input Output
0 0 0
0 1 1
1 0 1
1 1 0
Cannot be separated by a single line
+
+
p
q
XOR
17
)()( qpqpqp
OR NAND
AND
OR
NAND-
+
+
-
p
q
p q
Hidden Layer Representations
18
p q OR NAND AND
0 0 0 1 0
0 1 1 1 1
1 0 1 1 1
1 1 1 0 0
Input Hidden Output
- + +
-
AND
OR
NA
ND
The Sigmoid Threshold Unit
19
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xwnet
0
neteneto
1
1)(
))(1()()(
1
1)( yy
dy
yd
ey
FunctionSigmoid
y
Backpropagation Rule
20
ji
dji w
Ew
outputsk
kkd otwE 2)(2
1)(
bull xji = the i th input to unit j
bull wji = the weight associated with the i th input to unit j
bull netj = sumwjixji (the weighted sum of inputs for unit j )
bull oj= the output of unit j
bull tj= the target output of unit j
bull σ = the sigmoid function
bull outputs = the set of units in the final layer
bull Downstream (j ) = the set of units directly taking the output of unit j as inputs
jij
d
ji
j
j
d
ji
d xnet
E
w
net
net
E
w
E
j
i
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Biological Motivation
3
1011 The number of neurons in the human brain
104 The average number of connections of each neuron
10-3 The fastest switching times of neurons
10-10 The switching speeds of computers
10-1 The time required to visually recognize your mother
Biological Motivation
The power of parallelism
The information processing abilities of biological neural systems follow from highly parallel processes operating on representations that are distributed over many neurons
The motivation of ANN is to capture this kind of highly parallel computation based on distributed representations
Sequential machines vs Parallel machines
Group A
Using ANN to study and model biological learning processes
Group B
Obtaining highly effective machine learning algorithms regardless of how
closely these algorithms mimic biological processes4
Neural Network Representations
5
Robot vs Human
6
Perceptrons
7
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xw
0
0
010
otherwise
xwifo
n
iii
0
01)( 110
1 otherwise
xwxwwifxxo nnn
Power of Perceptrons
8
-08
05 05
-03
05 05
Input sum Output
0 0 -08 0
0 1 -03 0
1 0 -03 0
1 1 03 1
Input sum Output
0 0 -03 0
0 1 02 1
1 0 02 1
1 1 07 1
AND OR
Error Surface
9
Error
w1
w2
Gradient Descent
10
2)(
2
1)(
Dd
dd otwE
nw
E
w
E
w
EwE )(
10
iiiii w
Ewwherewww
Learning Rate
Batch Learning
Delta Rule
11
)()(
)()(
)()(22
1
)(2
1
)(2
1
2
2
idDd
dd
ddiDd
dd
ddiDd
dd
Dddd
i
ddDdii
xot
xwtw
ot
otw
ot
otw
otww
E
iddDd
di xotw )(
xwxo )(
Batch Learning
GRADIENT_DESCENT (training_examples η)
Initialize each wi to some small random value
Until the termination condition is met Do
Initialize each Δwi to zero
For each ltx tgt in training_examples Do
bull Input the instance x to the unit and compute the output o
bull For each linear unit weight wi Do
ndash Δwi larr Δwi + η(t-o)xi
For each linear unit weight wi Do
bull wi larr wi + Δwi
12
Stochastic Learning
13
iiiii xotwwherewww )(
For example if xi=08 η=01 t=1 and o=0
Δwi = η(t-o)xi = 01times(1-0) times 08 = 008
+
-
++
+
--
-+
+
-
-
Stochastic Learning NAND
14
InputTarget
Initial
Weights
Output
Error CorrectionFinal
WeightsIndividual SumFinal
Output
x0 x1 x2 t w0 w1 w2 C0 C1 C2 S o E R w0 w1 w2
x0 w0
x1 w1
x2 w2
C0+C1+C2
t-o LR x E
1 0 0 1 0 0 0 0 0 0 0 0 1 +01 01 0 0
1 0 1 1 01 0 0 01 0 0 01 0 1 +01 02 0 01
1 1 0 1 02 0 01 02 0 0 02 0 1 +01 03 01 01
1 1 1 0 03 01 01 03 01 01 05 0 0 0 03 01 01
1 0 0 1 03 01 01 03 0 0 03 0 1 +01 04 01 01
1 0 1 1 04 01 01 04 0 01 05 0 1 +01 05 01 02
1 1 0 1 05 01 02 05 01 0 06 1 0 0 05 01 02
1 1 1 0 05 01 02 05 01 02 08 1 -1 -01 04 0 01
1 0 0 1 04 0 01 04 0 0 04 0 1 +01 05 0 01
1 1 0 1 08 -2 -1 08 -2 0 06 1 0 0 08 -2 -1threshold=05 learning rate=01
Multilayer Perceptron
15
XOR
16
-
-
))(( pqqpqpqpqp
Input Output
0 0 0
0 1 1
1 0 1
1 1 0
Cannot be separated by a single line
+
+
p
q
XOR
17
)()( qpqpqp
OR NAND
AND
OR
NAND-
+
+
-
p
q
p q
Hidden Layer Representations
18
p q OR NAND AND
0 0 0 1 0
0 1 1 1 1
1 0 1 1 1
1 1 1 0 0
Input Hidden Output
- + +
-
AND
OR
NA
ND
The Sigmoid Threshold Unit
19
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xwnet
0
neteneto
1
1)(
))(1()()(
1
1)( yy
dy
yd
ey
FunctionSigmoid
y
Backpropagation Rule
20
ji
dji w
Ew
outputsk
kkd otwE 2)(2
1)(
bull xji = the i th input to unit j
bull wji = the weight associated with the i th input to unit j
bull netj = sumwjixji (the weighted sum of inputs for unit j )
bull oj= the output of unit j
bull tj= the target output of unit j
bull σ = the sigmoid function
bull outputs = the set of units in the final layer
bull Downstream (j ) = the set of units directly taking the output of unit j as inputs
jij
d
ji
j
j
d
ji
d xnet
E
w
net
net
E
w
E
j
i
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Biological Motivation
The power of parallelism
The information processing abilities of biological neural systems follow from highly parallel processes operating on representations that are distributed over many neurons
The motivation of ANN is to capture this kind of highly parallel computation based on distributed representations
Sequential machines vs Parallel machines
Group A
Using ANN to study and model biological learning processes
Group B
Obtaining highly effective machine learning algorithms regardless of how
closely these algorithms mimic biological processes4
Neural Network Representations
5
Robot vs Human
6
Perceptrons
7
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xw
0
0
010
otherwise
xwifo
n
iii
0
01)( 110
1 otherwise
xwxwwifxxo nnn
Power of Perceptrons
8
-08
05 05
-03
05 05
Input sum Output
0 0 -08 0
0 1 -03 0
1 0 -03 0
1 1 03 1
Input sum Output
0 0 -03 0
0 1 02 1
1 0 02 1
1 1 07 1
AND OR
Error Surface
9
Error
w1
w2
Gradient Descent
10
2)(
2
1)(
Dd
dd otwE
nw
E
w
E
w
EwE )(
10
iiiii w
Ewwherewww
Learning Rate
Batch Learning
Delta Rule
11
)()(
)()(
)()(22
1
)(2
1
)(2
1
2
2
idDd
dd
ddiDd
dd
ddiDd
dd
Dddd
i
ddDdii
xot
xwtw
ot
otw
ot
otw
otww
E
iddDd
di xotw )(
xwxo )(
Batch Learning
GRADIENT_DESCENT (training_examples η)
Initialize each wi to some small random value
Until the termination condition is met Do
Initialize each Δwi to zero
For each ltx tgt in training_examples Do
bull Input the instance x to the unit and compute the output o
bull For each linear unit weight wi Do
ndash Δwi larr Δwi + η(t-o)xi
For each linear unit weight wi Do
bull wi larr wi + Δwi
12
Stochastic Learning
13
iiiii xotwwherewww )(
For example if xi=08 η=01 t=1 and o=0
Δwi = η(t-o)xi = 01times(1-0) times 08 = 008
+
-
++
+
--
-+
+
-
-
Stochastic Learning NAND
14
InputTarget
Initial
Weights
Output
Error CorrectionFinal
WeightsIndividual SumFinal
Output
x0 x1 x2 t w0 w1 w2 C0 C1 C2 S o E R w0 w1 w2
x0 w0
x1 w1
x2 w2
C0+C1+C2
t-o LR x E
1 0 0 1 0 0 0 0 0 0 0 0 1 +01 01 0 0
1 0 1 1 01 0 0 01 0 0 01 0 1 +01 02 0 01
1 1 0 1 02 0 01 02 0 0 02 0 1 +01 03 01 01
1 1 1 0 03 01 01 03 01 01 05 0 0 0 03 01 01
1 0 0 1 03 01 01 03 0 0 03 0 1 +01 04 01 01
1 0 1 1 04 01 01 04 0 01 05 0 1 +01 05 01 02
1 1 0 1 05 01 02 05 01 0 06 1 0 0 05 01 02
1 1 1 0 05 01 02 05 01 02 08 1 -1 -01 04 0 01
1 0 0 1 04 0 01 04 0 0 04 0 1 +01 05 0 01
1 1 0 1 08 -2 -1 08 -2 0 06 1 0 0 08 -2 -1threshold=05 learning rate=01
Multilayer Perceptron
15
XOR
16
-
-
))(( pqqpqpqpqp
Input Output
0 0 0
0 1 1
1 0 1
1 1 0
Cannot be separated by a single line
+
+
p
q
XOR
17
)()( qpqpqp
OR NAND
AND
OR
NAND-
+
+
-
p
q
p q
Hidden Layer Representations
18
p q OR NAND AND
0 0 0 1 0
0 1 1 1 1
1 0 1 1 1
1 1 1 0 0
Input Hidden Output
- + +
-
AND
OR
NA
ND
The Sigmoid Threshold Unit
19
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xwnet
0
neteneto
1
1)(
))(1()()(
1
1)( yy
dy
yd
ey
FunctionSigmoid
y
Backpropagation Rule
20
ji
dji w
Ew
outputsk
kkd otwE 2)(2
1)(
bull xji = the i th input to unit j
bull wji = the weight associated with the i th input to unit j
bull netj = sumwjixji (the weighted sum of inputs for unit j )
bull oj= the output of unit j
bull tj= the target output of unit j
bull σ = the sigmoid function
bull outputs = the set of units in the final layer
bull Downstream (j ) = the set of units directly taking the output of unit j as inputs
jij
d
ji
j
j
d
ji
d xnet
E
w
net
net
E
w
E
j
i
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Neural Network Representations
5
Robot vs Human
6
Perceptrons
7
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xw
0
0
010
otherwise
xwifo
n
iii
0
01)( 110
1 otherwise
xwxwwifxxo nnn
Power of Perceptrons
8
-08
05 05
-03
05 05
Input sum Output
0 0 -08 0
0 1 -03 0
1 0 -03 0
1 1 03 1
Input sum Output
0 0 -03 0
0 1 02 1
1 0 02 1
1 1 07 1
AND OR
Error Surface
9
Error
w1
w2
Gradient Descent
10
2)(
2
1)(
Dd
dd otwE
nw
E
w
E
w
EwE )(
10
iiiii w
Ewwherewww
Learning Rate
Batch Learning
Delta Rule
11
)()(
)()(
)()(22
1
)(2
1
)(2
1
2
2
idDd
dd
ddiDd
dd
ddiDd
dd
Dddd
i
ddDdii
xot
xwtw
ot
otw
ot
otw
otww
E
iddDd
di xotw )(
xwxo )(
Batch Learning
GRADIENT_DESCENT (training_examples η)
Initialize each wi to some small random value
Until the termination condition is met Do
Initialize each Δwi to zero
For each ltx tgt in training_examples Do
bull Input the instance x to the unit and compute the output o
bull For each linear unit weight wi Do
ndash Δwi larr Δwi + η(t-o)xi
For each linear unit weight wi Do
bull wi larr wi + Δwi
12
Stochastic Learning
13
iiiii xotwwherewww )(
For example if xi=08 η=01 t=1 and o=0
Δwi = η(t-o)xi = 01times(1-0) times 08 = 008
+
-
++
+
--
-+
+
-
-
Stochastic Learning NAND
14
InputTarget
Initial
Weights
Output
Error CorrectionFinal
WeightsIndividual SumFinal
Output
x0 x1 x2 t w0 w1 w2 C0 C1 C2 S o E R w0 w1 w2
x0 w0
x1 w1
x2 w2
C0+C1+C2
t-o LR x E
1 0 0 1 0 0 0 0 0 0 0 0 1 +01 01 0 0
1 0 1 1 01 0 0 01 0 0 01 0 1 +01 02 0 01
1 1 0 1 02 0 01 02 0 0 02 0 1 +01 03 01 01
1 1 1 0 03 01 01 03 01 01 05 0 0 0 03 01 01
1 0 0 1 03 01 01 03 0 0 03 0 1 +01 04 01 01
1 0 1 1 04 01 01 04 0 01 05 0 1 +01 05 01 02
1 1 0 1 05 01 02 05 01 0 06 1 0 0 05 01 02
1 1 1 0 05 01 02 05 01 02 08 1 -1 -01 04 0 01
1 0 0 1 04 0 01 04 0 0 04 0 1 +01 05 0 01
1 1 0 1 08 -2 -1 08 -2 0 06 1 0 0 08 -2 -1threshold=05 learning rate=01
Multilayer Perceptron
15
XOR
16
-
-
))(( pqqpqpqpqp
Input Output
0 0 0
0 1 1
1 0 1
1 1 0
Cannot be separated by a single line
+
+
p
q
XOR
17
)()( qpqpqp
OR NAND
AND
OR
NAND-
+
+
-
p
q
p q
Hidden Layer Representations
18
p q OR NAND AND
0 0 0 1 0
0 1 1 1 1
1 0 1 1 1
1 1 1 0 0
Input Hidden Output
- + +
-
AND
OR
NA
ND
The Sigmoid Threshold Unit
19
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xwnet
0
neteneto
1
1)(
))(1()()(
1
1)( yy
dy
yd
ey
FunctionSigmoid
y
Backpropagation Rule
20
ji
dji w
Ew
outputsk
kkd otwE 2)(2
1)(
bull xji = the i th input to unit j
bull wji = the weight associated with the i th input to unit j
bull netj = sumwjixji (the weighted sum of inputs for unit j )
bull oj= the output of unit j
bull tj= the target output of unit j
bull σ = the sigmoid function
bull outputs = the set of units in the final layer
bull Downstream (j ) = the set of units directly taking the output of unit j as inputs
jij
d
ji
j
j
d
ji
d xnet
E
w
net
net
E
w
E
j
i
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Robot vs Human
6
Perceptrons
7
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xw
0
0
010
otherwise
xwifo
n
iii
0
01)( 110
1 otherwise
xwxwwifxxo nnn
Power of Perceptrons
8
-08
05 05
-03
05 05
Input sum Output
0 0 -08 0
0 1 -03 0
1 0 -03 0
1 1 03 1
Input sum Output
0 0 -03 0
0 1 02 1
1 0 02 1
1 1 07 1
AND OR
Error Surface
9
Error
w1
w2
Gradient Descent
10
2)(
2
1)(
Dd
dd otwE
nw
E
w
E
w
EwE )(
10
iiiii w
Ewwherewww
Learning Rate
Batch Learning
Delta Rule
11
)()(
)()(
)()(22
1
)(2
1
)(2
1
2
2
idDd
dd
ddiDd
dd
ddiDd
dd
Dddd
i
ddDdii
xot
xwtw
ot
otw
ot
otw
otww
E
iddDd
di xotw )(
xwxo )(
Batch Learning
GRADIENT_DESCENT (training_examples η)
Initialize each wi to some small random value
Until the termination condition is met Do
Initialize each Δwi to zero
For each ltx tgt in training_examples Do
bull Input the instance x to the unit and compute the output o
bull For each linear unit weight wi Do
ndash Δwi larr Δwi + η(t-o)xi
For each linear unit weight wi Do
bull wi larr wi + Δwi
12
Stochastic Learning
13
iiiii xotwwherewww )(
For example if xi=08 η=01 t=1 and o=0
Δwi = η(t-o)xi = 01times(1-0) times 08 = 008
+
-
++
+
--
-+
+
-
-
Stochastic Learning NAND
14
InputTarget
Initial
Weights
Output
Error CorrectionFinal
WeightsIndividual SumFinal
Output
x0 x1 x2 t w0 w1 w2 C0 C1 C2 S o E R w0 w1 w2
x0 w0
x1 w1
x2 w2
C0+C1+C2
t-o LR x E
1 0 0 1 0 0 0 0 0 0 0 0 1 +01 01 0 0
1 0 1 1 01 0 0 01 0 0 01 0 1 +01 02 0 01
1 1 0 1 02 0 01 02 0 0 02 0 1 +01 03 01 01
1 1 1 0 03 01 01 03 01 01 05 0 0 0 03 01 01
1 0 0 1 03 01 01 03 0 0 03 0 1 +01 04 01 01
1 0 1 1 04 01 01 04 0 01 05 0 1 +01 05 01 02
1 1 0 1 05 01 02 05 01 0 06 1 0 0 05 01 02
1 1 1 0 05 01 02 05 01 02 08 1 -1 -01 04 0 01
1 0 0 1 04 0 01 04 0 0 04 0 1 +01 05 0 01
1 1 0 1 08 -2 -1 08 -2 0 06 1 0 0 08 -2 -1threshold=05 learning rate=01
Multilayer Perceptron
15
XOR
16
-
-
))(( pqqpqpqpqp
Input Output
0 0 0
0 1 1
1 0 1
1 1 0
Cannot be separated by a single line
+
+
p
q
XOR
17
)()( qpqpqp
OR NAND
AND
OR
NAND-
+
+
-
p
q
p q
Hidden Layer Representations
18
p q OR NAND AND
0 0 0 1 0
0 1 1 1 1
1 0 1 1 1
1 1 1 0 0
Input Hidden Output
- + +
-
AND
OR
NA
ND
The Sigmoid Threshold Unit
19
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xwnet
0
neteneto
1
1)(
))(1()()(
1
1)( yy
dy
yd
ey
FunctionSigmoid
y
Backpropagation Rule
20
ji
dji w
Ew
outputsk
kkd otwE 2)(2
1)(
bull xji = the i th input to unit j
bull wji = the weight associated with the i th input to unit j
bull netj = sumwjixji (the weighted sum of inputs for unit j )
bull oj= the output of unit j
bull tj= the target output of unit j
bull σ = the sigmoid function
bull outputs = the set of units in the final layer
bull Downstream (j ) = the set of units directly taking the output of unit j as inputs
jij
d
ji
j
j
d
ji
d xnet
E
w
net
net
E
w
E
j
i
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Perceptrons
7
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xw
0
0
010
otherwise
xwifo
n
iii
0
01)( 110
1 otherwise
xwxwwifxxo nnn
Power of Perceptrons
8
-08
05 05
-03
05 05
Input sum Output
0 0 -08 0
0 1 -03 0
1 0 -03 0
1 1 03 1
Input sum Output
0 0 -03 0
0 1 02 1
1 0 02 1
1 1 07 1
AND OR
Error Surface
9
Error
w1
w2
Gradient Descent
10
2)(
2
1)(
Dd
dd otwE
nw
E
w
E
w
EwE )(
10
iiiii w
Ewwherewww
Learning Rate
Batch Learning
Delta Rule
11
)()(
)()(
)()(22
1
)(2
1
)(2
1
2
2
idDd
dd
ddiDd
dd
ddiDd
dd
Dddd
i
ddDdii
xot
xwtw
ot
otw
ot
otw
otww
E
iddDd
di xotw )(
xwxo )(
Batch Learning
GRADIENT_DESCENT (training_examples η)
Initialize each wi to some small random value
Until the termination condition is met Do
Initialize each Δwi to zero
For each ltx tgt in training_examples Do
bull Input the instance x to the unit and compute the output o
bull For each linear unit weight wi Do
ndash Δwi larr Δwi + η(t-o)xi
For each linear unit weight wi Do
bull wi larr wi + Δwi
12
Stochastic Learning
13
iiiii xotwwherewww )(
For example if xi=08 η=01 t=1 and o=0
Δwi = η(t-o)xi = 01times(1-0) times 08 = 008
+
-
++
+
--
-+
+
-
-
Stochastic Learning NAND
14
InputTarget
Initial
Weights
Output
Error CorrectionFinal
WeightsIndividual SumFinal
Output
x0 x1 x2 t w0 w1 w2 C0 C1 C2 S o E R w0 w1 w2
x0 w0
x1 w1
x2 w2
C0+C1+C2
t-o LR x E
1 0 0 1 0 0 0 0 0 0 0 0 1 +01 01 0 0
1 0 1 1 01 0 0 01 0 0 01 0 1 +01 02 0 01
1 1 0 1 02 0 01 02 0 0 02 0 1 +01 03 01 01
1 1 1 0 03 01 01 03 01 01 05 0 0 0 03 01 01
1 0 0 1 03 01 01 03 0 0 03 0 1 +01 04 01 01
1 0 1 1 04 01 01 04 0 01 05 0 1 +01 05 01 02
1 1 0 1 05 01 02 05 01 0 06 1 0 0 05 01 02
1 1 1 0 05 01 02 05 01 02 08 1 -1 -01 04 0 01
1 0 0 1 04 0 01 04 0 0 04 0 1 +01 05 0 01
1 1 0 1 08 -2 -1 08 -2 0 06 1 0 0 08 -2 -1threshold=05 learning rate=01
Multilayer Perceptron
15
XOR
16
-
-
))(( pqqpqpqpqp
Input Output
0 0 0
0 1 1
1 0 1
1 1 0
Cannot be separated by a single line
+
+
p
q
XOR
17
)()( qpqpqp
OR NAND
AND
OR
NAND-
+
+
-
p
q
p q
Hidden Layer Representations
18
p q OR NAND AND
0 0 0 1 0
0 1 1 1 1
1 0 1 1 1
1 1 1 0 0
Input Hidden Output
- + +
-
AND
OR
NA
ND
The Sigmoid Threshold Unit
19
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xwnet
0
neteneto
1
1)(
))(1()()(
1
1)( yy
dy
yd
ey
FunctionSigmoid
y
Backpropagation Rule
20
ji
dji w
Ew
outputsk
kkd otwE 2)(2
1)(
bull xji = the i th input to unit j
bull wji = the weight associated with the i th input to unit j
bull netj = sumwjixji (the weighted sum of inputs for unit j )
bull oj= the output of unit j
bull tj= the target output of unit j
bull σ = the sigmoid function
bull outputs = the set of units in the final layer
bull Downstream (j ) = the set of units directly taking the output of unit j as inputs
jij
d
ji
j
j
d
ji
d xnet
E
w
net
net
E
w
E
j
i
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Power of Perceptrons
8
-08
05 05
-03
05 05
Input sum Output
0 0 -08 0
0 1 -03 0
1 0 -03 0
1 1 03 1
Input sum Output
0 0 -03 0
0 1 02 1
1 0 02 1
1 1 07 1
AND OR
Error Surface
9
Error
w1
w2
Gradient Descent
10
2)(
2
1)(
Dd
dd otwE
nw
E
w
E
w
EwE )(
10
iiiii w
Ewwherewww
Learning Rate
Batch Learning
Delta Rule
11
)()(
)()(
)()(22
1
)(2
1
)(2
1
2
2
idDd
dd
ddiDd
dd
ddiDd
dd
Dddd
i
ddDdii
xot
xwtw
ot
otw
ot
otw
otww
E
iddDd
di xotw )(
xwxo )(
Batch Learning
GRADIENT_DESCENT (training_examples η)
Initialize each wi to some small random value
Until the termination condition is met Do
Initialize each Δwi to zero
For each ltx tgt in training_examples Do
bull Input the instance x to the unit and compute the output o
bull For each linear unit weight wi Do
ndash Δwi larr Δwi + η(t-o)xi
For each linear unit weight wi Do
bull wi larr wi + Δwi
12
Stochastic Learning
13
iiiii xotwwherewww )(
For example if xi=08 η=01 t=1 and o=0
Δwi = η(t-o)xi = 01times(1-0) times 08 = 008
+
-
++
+
--
-+
+
-
-
Stochastic Learning NAND
14
InputTarget
Initial
Weights
Output
Error CorrectionFinal
WeightsIndividual SumFinal
Output
x0 x1 x2 t w0 w1 w2 C0 C1 C2 S o E R w0 w1 w2
x0 w0
x1 w1
x2 w2
C0+C1+C2
t-o LR x E
1 0 0 1 0 0 0 0 0 0 0 0 1 +01 01 0 0
1 0 1 1 01 0 0 01 0 0 01 0 1 +01 02 0 01
1 1 0 1 02 0 01 02 0 0 02 0 1 +01 03 01 01
1 1 1 0 03 01 01 03 01 01 05 0 0 0 03 01 01
1 0 0 1 03 01 01 03 0 0 03 0 1 +01 04 01 01
1 0 1 1 04 01 01 04 0 01 05 0 1 +01 05 01 02
1 1 0 1 05 01 02 05 01 0 06 1 0 0 05 01 02
1 1 1 0 05 01 02 05 01 02 08 1 -1 -01 04 0 01
1 0 0 1 04 0 01 04 0 0 04 0 1 +01 05 0 01
1 1 0 1 08 -2 -1 08 -2 0 06 1 0 0 08 -2 -1threshold=05 learning rate=01
Multilayer Perceptron
15
XOR
16
-
-
))(( pqqpqpqpqp
Input Output
0 0 0
0 1 1
1 0 1
1 1 0
Cannot be separated by a single line
+
+
p
q
XOR
17
)()( qpqpqp
OR NAND
AND
OR
NAND-
+
+
-
p
q
p q
Hidden Layer Representations
18
p q OR NAND AND
0 0 0 1 0
0 1 1 1 1
1 0 1 1 1
1 1 1 0 0
Input Hidden Output
- + +
-
AND
OR
NA
ND
The Sigmoid Threshold Unit
19
sum
x1
x2
xn
w1
w2
wn
w0
x0=1
n
iii xwnet
0
neteneto
1
1)(
))(1()()(
1
1)( yy
dy
yd
ey
FunctionSigmoid
y
Backpropagation Rule
20
ji
dji w
Ew
outputsk
kkd otwE 2)(2
1)(
bull xji = the i th input to unit j
bull wji = the weight associated with the i th input to unit j
bull netj = sumwjixji (the weighted sum of inputs for unit j )
bull oj= the output of unit j
bull tj= the target output of unit j
bull σ = the sigmoid function
bull outputs = the set of units in the final layer
bull Downstream (j ) = the set of units directly taking the output of unit j as inputs
jij
d
ji
j
j
d
ji
d xnet
E
w
net
net
E
w
E
j
i
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
33
Topic: Training Feedforward Neural Networks
Technique: BP Algorithm
Task 1: XOR Problem
4 input samples:
• 0 0 → 0
• 1 0 → 1
Task 2: Identity Function
8 input samples:
• 10000000 → 10000000
• 00010000 → 00010000
• …
Use 3 hidden units
Deliverables:
Report
Code (any programming language, with detailed comments)
Due: Sunday, 14 December
Credit: 15%
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Backpropagation Rule
20
ji
dji w
Ew
outputsk
kkd otwE 2)(2
1)(
bull xji = the i th input to unit j
bull wji = the weight associated with the i th input to unit j
bull netj = sumwjixji (the weighted sum of inputs for unit j )
bull oj= the output of unit j
bull tj= the target output of unit j
bull σ = the sigmoid function
bull outputs = the set of units in the final layer
bull Downstream (j ) = the set of units directly taking the output of unit j as inputs
jij
d
ji
j
j
d
ji
d xnet
E
w
net
net
E
w
E
j
i
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Training Rule for Output Units
21
j
j
j
d
j
d
net
o
o
E
net
E
outputskkk
jj
d otoo
E 2)(2
1
)(
)()(2
2
1
)(2
1 2
jj
j
jjjj
jjjj
d
ot
o
otot
otoo
E
)1()(
jjj
j
j
j oonet
net
net
o
)1()( jjjjj
d oootnet
E
jijjjjji
dji xooot
w
Ew )1()(
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Training Rule for Hidden Units
22
)()(
)( )(
)1( jDownstreamk
jjkjkj
j
jDownstreamkkjk
jDownstreamk j
j
jDownstreamk j
kk
j
k
k
d
j
d
oownet
ow
net
o
o
net
net
net
net
E
net
E
)(
)1(jDownstreamkkjkjjj woo
jijji xw
k
dk net
E
k
j
jnet
k
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
BP Framework
BACKPROPAGATION (training_examples η nin nout nhidden)
Create a network with nin inputs nhidden hidden units and nout output units
Initialize all network weights to small random numbers
Until the termination condition is met Do
For each ltx tgt in training_examples Do
bull Input the instance x to the network and computer the output o of every unit
bull For each output unit k calculate its error term δk
bull For each hidden unit h calculate its error term δh
bull Update each network weight wji 23
))(1( kkkkk otoo
outputsk
kkhhhh woo )1(
jijjijijiji xwwww
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
More about BP Networks hellip
Convergence and Local Minima
The search space is likely to be highly multimodal
May easily get stuck at a local solution
Need multiple trials with different initial weights
Evolving Neural Networks
Black-box optimization techniques (eg Genetic Algorithms)
Usually better accuracy
Can do some advanced training (eg structure + parameter)
Xin Yao (1999) ldquoEvolving Artificial Neural Networksrdquo Proceedings of the IEEE
pp 1423-1447
Representational Power
Deep Learning
24
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
More about BP Networks hellip
Overfitting
Tend to occur during later iterations
Use validation dataset to terminate the training when necessary
Practical Considerations
Momentum
Adaptive learning ratebull Small slow convergence easy to get stuck
bull Large fast convergence unstable
25
)1()( nwxnw jijijji
Time
Error
Training
Validation
Weight
Error
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Beyond BP Networks
26
Elman Network
XOR
0 1 1 0 0 0 1 1 0 1 0 1 hellip
1 0 0 1 hellip
In
Out
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Beyond BP Networks
27
Hopfield Network Energy Landscape of Hopfield Network
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Beyond BP Networks
28
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
When does ANN work
Instances are represented by attribute-value pairs Input values can be any real values
The target output may be discrete-valued real-valued or a vector of several real- or discrete-valued attributes
The training samples may contain errors
Long training times are acceptable Can range from a few seconds to several hours
Fast evaluation of the learned target function may be required
The ability to understand the learned function is not important Weights are difficult for humans to interpret
29
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Reading Materials
Text Book
Richard O Duda et al Pattern Classification Chapter 6 John Wiley amp Sons Inc Tom Mitchell Machine Learning Chapter 4 McGraw-Hill httppagemifu-berlinderojasneuralindexhtmlhtml
Online Demo
httpneuronengwayneedusoftwarehtml httpwwwcbuedu~pongaihopfieldhopfieldhtml
Online Tutorial
httpwwwautonlaborgtutorialsneural13pdf httpwwwcscmueduafscscmueduusermitchellftpfaceshtml
Wikipedia amp Google
30
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Review
What is the biological motivation of ANN
When does ANN work
What is a perceptron
How to train a perceptron
What is the limitation of perceptrons
How does ANN solve non-linearly separable problems
What is the key idea of Backpropogation algorithm
What are the main issues of BP networks
What are the examples of other types of ANN31
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Next Weekrsquos Class Talk
Volunteers are required for next weekrsquos class talk
Topic 1 Applications of ANN
Topic 2 Recurrent Neural Networks
Hints
Robot Driving
Character Recognition
Face Recognition
Hopfield Network
Length 20 minutes plus question time
32
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33
Assignment
Topic Training Feedforward Neural Networks
Technique BP Algorithm
Task 1 XOR Problem
4 input samplesbull 0 0 0
bull 1 0 1
Task 2 Identity Function
8 input samplesbull 10000000 10000000
bull 00010000 00010000
bull hellip
Use 3 hidden units
Deliverables
Report
Code (any programming language with detailed comments)
Due Sunday 14 December
Credit 15 33