Neural Networks: Capabilities and Examples
L. Manevitz
Computer Science Department
HIACS Research Center
University of Haifa
What Are Neural Networks? What Are They Good For? How Do We Use Them?
• Definitions and some history
• Basics
– Basic Algorithms
– Examples
• Recent Examples
• Future Directions
Natural versus Artificial Neuron
• Natural Neuron vs. McCulloch-Pitts Neuron [side-by-side figure]
Definitions and History
• McCulloch-Pitts Neuron
• Perceptron
• Adaline
• Linear Separability
• Multi-Level Neurons
• Neurons with Loops
Sample Feed-Forward Network (No Loops)
[Figure: a feed-forward network with an input layer, a hidden layer, and an output layer. Weights $W_{ji}$ connect the inputs to the hidden layer and weights $V_{ik}$ connect the hidden layer to the outputs; each neuron computes $F\left(\sum_j w_{ji} x_j\right)$.]
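A minimal sketch of the forward pass this figure depicts, in Python/NumPy; the function names, layer sizes, and random weights are illustrative choices, not from the slides:

    import numpy as np

    def sigmoid(s):
        return 1.0 / (1.0 + np.exp(-s))

    def forward(x, W, V):
        h = sigmoid(W @ x)      # hidden layer: F(sum_j W[i, j] * x[j])
        return sigmoid(V @ h)   # output layer, weights V[k, i]

    rng = np.random.default_rng(0)
    W = rng.normal(size=(4, 3))   # input (3) -> hidden (4)
    V = rng.normal(size=(2, 4))   # hidden (4) -> output (2)
    print(forward(np.array([1.0, 0.5, -0.2]), W, V))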
Replacement of Threshold Neurons with Sigmoid or Differentiable Neurons
[Figure: threshold (step) activation vs. smooth sigmoid activation.]
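For reference, a minimal statement of the two activations (notation mine); the derivative identity on the right is what makes gradient-based training possible:

\[
F(s) = \begin{cases} 1 & s > \theta \\ 0 & \text{otherwise} \end{cases}
\qquad
\sigma(s) = \frac{1}{1+e^{-s}}, \quad \sigma'(s) = \sigma(s)\bigl(1-\sigma(s)\bigr)
\]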
Reason for Explosion of Interest
• Two coincident developments (around 1985 – 87)
– (Re-)discovery of mathematical tools and algorithms for handling large networks
– Availability (hurray for Intel and company!) of sufficient computing power to make experiments practical.
Some Properties of NNs
• Universal: Can represent and accomplish any task.
• Uniform: “Programming” is changing weights
• Automatic: Algorithms for Automatic Programming; Learning
Networks are Universal
• All logical functions can be represented by a three-level (loop-free) network (McCulloch-Pitts)
• All continuous (and more) functions can be represented by three-level feed-forward networks (Cybenko et al.)
• Networks can self-organize (without a teacher)
• Networks serve as associative memories
Universality
• McCulloch-Pitts: adaptive logic gates; can represent any logic function
• Cybenko: any continuous function is representable by a three-level NN
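In symbols, a paraphrase of Cybenko's theorem for a sigmoidal $\sigma$: any continuous $f$ on the unit cube can be approximated to any accuracy by a single hidden layer of sigmoid units,

\[
\left| f(x) - \sum_{i=1}^{N} \alpha_i \,\sigma(w_i \cdot x - \theta_i) \right| < \varepsilon
\quad \text{for all } x \in [0,1]^n,
\]

for some number of units $N$, output weights $\alpha_i$, input weights $w_i$, and thresholds $\theta_i$.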
Networks can “LEARN” and Generalize (Algorithms)
• One Neuron (Perceptron and Adaline): very popular in the 1960s – early 70s
– Limited by representability (only linearly separable functions)
• Feed-forward networks (Back Propagation)
– Currently the most popular network (1987 – now)
• Kohonen Self-Organizing Network (1980s – now) (loops)
• Attractor Networks (loops)
Learnability (Automatic Programming)
• One neuron: Perceptron and Adaline algorithms (Rosenblatt and Widrow-Hoff) (1960s – now); the perceptron rule is sketched after this list
• Feed-forward networks: Backpropagation (1987 – now)
• Associative memories and looped networks ("attractors") (1990s – now)
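A minimal sketch of the perceptron rule for one threshold neuron; the learning rate and the toy AND data are my choices for illustration:

    import numpy as np

    def train_perceptron(X, t, lr=0.1, epochs=100):
        Xb = np.hstack([X, np.ones((len(X), 1))])   # append a bias input
        w = np.zeros(Xb.shape[1])
        for _ in range(epochs):
            for x, target in zip(Xb, t):
                y = 1 if w @ x > 0 else 0           # threshold activation
                w += lr * (target - y) * x          # no change when correct
        return w

    # AND is linearly separable, so the rule converges.
    X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
    t = np.array([0, 0, 0, 1])
    print(train_perceptron(X, t))

The Adaline (Widrow-Hoff) rule is the same loop with the raw weighted sum w @ x used in the update in place of the thresholded output.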
Generalizability
• Typically train a network on a sample set of examples
• Use it on the general class
• Training can be slow, but execution is fast.
Perceptron
• Pattern Identification (Note: the neuron is trained)
[Figure: a single neuron with weighted inputs from a receptive field; $\sum_i w_i x_i > \text{threshold}$ exactly when the letter A is in the receptive field.]
Feed Forward Network
[Figure: the same receptive field feeding a feed-forward network through two layers of weights.]
Classical Applications (1986 – 1997)
• “Net Talk” : text to speech
• ZIP codes: handwriting analysis
• Glovetalk: Sign Language to speech
• Data and Picture Compression: “Bottleneck”
• Steering of Automobile (up to 55 m.p.h.)
• Market Predictions
• Associative Memories
• Cognitive Modeling: (especially reading, …)
• Phonetic Typewriter (Finnish)
Neural Network
• Once the architecture is fixed, the only free parameters are the weights
• Thus: Uniform Programming
• Potentially: Automatic Programming
• Search for Learning Algorithms
Programming: Just find the weights!
• AUTOMATIC PROGRAMMING
• One Neuron: Perceptron or Adaline
• Multi-Level: Gradient Descent on Continuous Neuron (Sigmoid instead of step function).
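A minimal sketch of one gradient-descent step on a single sigmoid neuron under squared error (all names mine); chaining this derivative through hidden layers is exactly backpropagation:

    import numpy as np

    def sigmoid(s):
        return 1.0 / (1.0 + np.exp(-s))

    def gradient_step(w, x, target, lr=0.5):
        # One step down E = (y - target)^2 / 2 with y = sigmoid(w . x).
        y = sigmoid(w @ x)
        delta = (y - target) * y * (1.0 - y)   # dE/ds, using sigmoid' = y(1 - y)
        return w - lr * delta * x              # w <- w - lr * dE/dw

    w, x, target = np.zeros(3), np.array([1.0, 0.0, 1.0]), 1.0
    for _ in range(1000):
        w = gradient_step(w, x, target)
    print(sigmoid(w @ x))   # climbs toward the target 1.0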
Prediction
[Figure: prediction loop. The input/output stream passes through a delay into the NN, and the NN's output is compared with the value that actually arrives.]
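A minimal sketch of the delay arrangement (window length and data are illustrative): the network is shown a window of past values, and its output is compared with the value that actually arrives next:

    import numpy as np

    def make_prediction_pairs(series, window=3):
        # The delay buffers `window` past values; the target is the next one.
        X = [series[i:i + window] for i in range(len(series) - window)]
        t = [series[i + window] for i in range(len(series) - window)]
        return np.array(X), np.array(t)

    series = np.sin(np.linspace(0, 10, 200))
    X, t = make_prediction_pairs(series)
    print(X.shape, t.shape)   # (197, 3) (197,)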
Training NN to Predict
Finite Element Method
• A numerical method for solving PDEs
• Many user chosen parameters
• Replace user expertise with NNs.
FEM Flow chart
Problems and Methods
Finite Element Method and Neural Networks
• Place mesh on body
• Predict where to adapt mesh
Placing Mesh on Body (Manevitz, Givoli and Yousef)
• Need to place geometry on topology
• Method: Use Kohonen algorithm
• Idea: Identify neurons with FEM nodes
– Identify weights of nodes with geometric location
– Identify topology with adjacency
– RESULT: equi-probable placement
Kohonen Placement for FEM
[Placeholder: slide from Malik's work to be included.]
Self-Organizing Network
• Weights from input to neurons
• Topology between neurons
Self-Organizing Network
• Weights from the input give a "location" to each neuron
• The Kohonen algorithm selects a "winner" neuron for each input
• After training, close input patterns have topologically close winners
• Results in an equi-probable, continuous mapping (without a teacher); one update step is sketched below
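A minimal sketch of one Kohonen update (the chain topology, Gaussian neighborhood, and rates are illustrative choices): find the winner, then pull it and its topological neighbors toward the input:

    import numpy as np

    def kohonen_step(weights, x, lr=0.1, radius=1.0):
        # weights: (n_neurons, dim); neurons sit on a one-dimensional chain.
        winner = np.argmin(np.linalg.norm(weights - x, axis=1))
        for j in range(len(weights)):
            h = np.exp(-((j - winner) ** 2) / (2.0 * radius ** 2))  # neighborhood
            weights[j] += lr * h * (x - weights[j])   # pull toward the input
        return weights

    rng = np.random.default_rng(0)
    weights = rng.uniform(size=(10, 2))   # 10 neurons with 2-D "locations"
    for _ in range(5000):
        weights = kohonen_step(weights, rng.uniform(size=2))

After many samples, neighboring neurons hold nearby locations: the equi-probable mapping above. For FEM, the neuron "locations" become node coordinates and the chain becomes the mesh topology.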
Placement of Mesh via Self-Organizing NNs
Placement of Mesh via Self-Organizing NNs (2)
[Figure: mesh snapshots during training. Iteration 0; Iteration 500, Quality = 288; Iteration 2000, Quality = 238; Iteration 6000, Quality = 223; Iteration 12000, Quality = 208; Iteration 30000, Quality = 202.]
Comparison of NN and PLTMG
PLTMG (249 nodes) vs. NN (225 nodes; Quality = 279)
Test problem: $f(x,y) = u_{xx} + u_{yy}$, where $u(x,y) = e^{(x-2)^2}\, e^{(y-2)^2}$

         Node Error   Value Error
PLTMG    2.4 E-02     4.51 E-02
NN       7.5 E-03     9.09 E-03
FEM Temporal Adaptive Meshes
Prediction of Refinement of Elements
• The method simulates time
• The current adaptive method uses the gradient
• It can simply MISS all the action
• We use NNs to PREDICT the gradient (the resulting refinement rule is sketched below)
• Under development by Manevitz, Givoli and Bitar
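A minimal sketch of the resulting rule (the Element record and the stand-in trend predictor are hypothetical; the method described here would put a trained NN in the predictor's place): refine where the predicted next-step gradient is large, not where the current one is:

    from dataclasses import dataclass

    @dataclass
    class Element:
        ident: int
        features: tuple   # e.g. (current gradient, gradient one step back)

    def refine_plan(elements, predictor, threshold=1.0):
        # Refine on the PREDICTED gradient, so the mesh adapts before the
        # action arrives instead of after it has passed.
        return [e.ident for e in elements if predictor(e.features) > threshold]

    # Hypothetical stand-in predictor: extrapolate the gradient by its trend.
    predictor = lambda f: f[0] + (f[0] - f[1])
    elements = [Element(0, (0.2, 0.1)), Element(1, (0.9, 0.4))]
    print(refine_plan(elements, predictor))   # -> [1]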
Training NN to Predict (2)
Refinement Predictors
• Need to choose features
• Need to identify kinds of elements
Other Predictions?
• Stock Market (really!)
• Credit Card Fraud (Master Card, USA)
Surfer’s Apprentice Program
• Manevitz and Yousef
• Make a "model" of the user for retrieving information from the internet.
• Many issues; here the focus is on retrieving new pages similar to other pages of interest to the user. Note: ONLY POSITIVE DATA.
Bottleneck Network
• Train toward the identity function on sample data
• Should compute the identity only on similar data
• NOVELTY FILTER: inputs that reconstruct poorly are flagged as novel (sketch below)
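A minimal sketch of the novelty filter (architecture, rates, threshold, and toy data are illustrative): train a bottleneck network toward the identity on the positive examples, then flag anything it fails to reconstruct:

    import numpy as np

    def sigmoid(s):
        return 1.0 / (1.0 + np.exp(-s))

    def train_bottleneck(X, hidden=2, lr=0.1, epochs=3000, seed=0):
        # Train a d -> hidden -> d network toward the identity on X.
        rng = np.random.default_rng(seed)
        d = X.shape[1]
        W = rng.normal(0.0, 0.5, (hidden, d))   # encoder
        V = rng.normal(0.0, 0.5, (d, hidden))   # decoder (linear output)
        for _ in range(epochs):
            for x in X:
                h = sigmoid(W @ x)
                err = V @ h - x                                 # reconstruction error
                gW = np.outer((V.T @ err) * h * (1.0 - h), x)   # backprop to encoder
                V -= lr * np.outer(err, h)
                W -= lr * gW
        return W, V

    def is_novel(x, W, V, threshold=0.1):
        # Poor reconstruction => the input is unlike the training data.
        return np.linalg.norm(V @ sigmoid(W @ x) - x) > threshold

    rng = np.random.default_rng(1)
    X = rng.uniform(size=(20, 4)) * np.array([1.0, 1.0, 0.0, 0.0])  # "similar" data
    W, V = train_bottleneck(X)
    print(is_novel(X[0], W, V))                             # typically False
    print(is_novel(np.array([0.0, 0.0, 1.0, 1.0]), W, V))   # typically True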
How well does it work?
• Tested on the standard Reuters database
• Used 25% for training
• Withholding information on representation
• The best method for retrieval using only positive training (better than SVM, etc.)
How to help Intel? (Make Billions? Reset the NASDAQ?)
• Branch prediction?
• (Note similarity to FEM refinement.)
• Perhaps this can be used to build a branch predictor that is user- or application-dependent; a minimal sketch follows below.
• (Note: neural architectures are, I am told, natural for VLSI design, and several such chips have been produced.)
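A minimal sketch of what such a predictor could look like, in the spirit of published perceptron branch predictors (Jiménez and Lin); the table size, history length, and simplified training rule are my assumptions:

    class PerceptronPredictor:
        # One small perceptron per branch address, fed by global history.
        def __init__(self, table_size=256, history_len=8):
            self.table = [[0] * (history_len + 1) for _ in range(table_size)]
            self.history = [1] * history_len       # +1 taken, -1 not taken

        def predict(self, pc):
            w = self.table[pc % len(self.table)]
            s = w[0] + sum(wi * hi for wi, hi in zip(w[1:], self.history))
            return s >= 0                          # True => predict "taken"

        def update(self, pc, taken):
            w = self.table[pc % len(self.table)]
            t = 1 if taken else -1
            if self.predict(pc) != taken:          # train on mispredictions
                w[0] += t
                for i, hi in enumerate(self.history):
                    w[i + 1] += t * hi
            self.history = self.history[1:] + [t]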
Other Directions
• Modify basic model to handle temporal adaptivity. (Occurs in real neurons according to latest biological information.)
• Apply to model human diseases, etc.