53
Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error Law of Effect Thorndike’s principle that responses are “stamped in” by rewards and “stamped out” by punishments.

Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Embed Size (px)

Citation preview

Page 1: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Edward L. Thorndike• Animals placed in

puzzle boxes• String pulled, latch

released, animal jumps out and receives food

• Learning by random trial and error

• Law of Effect – Thorndike’s principle that responses are “stamped in” by rewards and “stamped out” by punishments.

Page 2: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

B.F. Skinner• “Project Pigeon” –WWII Skinner

suggested that pigeons be used to guide missiles to their targets

• Operant – voluntary responses that are reinforced– Pecking at targets projected onto a

screen

• Reinforce – To follow a response with a stimulus that increases the frequency of the response– Food pellets

• Defense Department concluded that Project Pigeon was “for the birds”

Page 3: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Skinner Box

Page 4: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

CC vs. OC• Operant conditioning investigates the influence of

consequences on subsequent behavior.

• Classical conditioning looks at conditioned involuntary responses such as eyeblinks or salivation, whereas Operant conditioning investigates the learning of voluntary responses such as pecking at a target or pressing a lever.

• Where classical conditioning illustrates S-->R learning, operant conditioning is often viewed as R-->S learning.

• It is the consequence that follows the response that influences whether the response is likely or unlikely to occur again.

Page 5: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

B.F. Skinner• Skinner Box – experimental

conditions carefully introduced and removed and results on laboratory animals carefully observed.

• Rat deprived of food and placed in Skinner box with lever at one end

• Rat accidentally presses lever and food pellet is released.

• After trial and error the rat begins pressing the lever to receive food.

• What is the operant?– Pressing the lever

• What reinforces the behavior?– Food pellet

Page 6: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error
Page 7: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error
Page 8: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error
Page 9: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Types of Reinforcers

• Positive Reinforcers – A reinforcer that when presented increases the frequency of an operant.

• Ex. A hungry rat presses a bar in its cage and receives food. The food is a positive condition for the hungry rat. The rat presses the bar again, and again receives food. The rat's behavior of pressing the bar is strengthened by the consequence of receiving food.

Page 10: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Types of Reinforcement

• Negative Reinforcers – A reinforcer that when removed increases the frequency of an operant.

• Ex. A rat is placed in a cage and immediately receives a mild electrical shock on its feet. The shock is a negative condition for the rat. The rat presses a bar and the shock stops. The rat receives another shock, presses the bar again, and again the shock stops. The rat's behavior of pressing the bar is strengthened by the consequence of stopping the shock.

Page 11: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error
Page 12: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error
Page 13: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error
Page 14: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Positive or Negative Reinforcement?

1. Police pulling drivers over and giving prizes for buckling up.

2. A child snaps her fingers until her teacher calls on her.

3. A hospital patient is allowed extra visiting time after eating a complete meal.

4. Receiving a city utility discount for participating in a recycling program.

5. Grounding a teenager until his/her homework is finished.

6. A parent nagging a child to clean up her room.7. A rat presses a lever to terminate a shock or a loud

tone.8. A professor gives extra credit to students with

perfect attendance.

1. PR

2. NR

3. PR

4. PR

5. NR6. NR7. NR

8. PR

Page 15: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Types of Reinforcers

• Primary Reinforcers – An unlearned reinforcer.

• Ex. Food, water, adequate warmth (positive reinforcers) and pain (negative reinforcer)

• Secondary Reinforcers – A stimulus that gains reinforcement value through association with established reinforcers.

• Ex. Money, attention, social approval

Page 16: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

POSITIVE REINFORCERS NEGATIVE REINFORCERS

PRIMARY SECONDARY PRIMARY SECONDARY

Food Money Electric shock Rejection

Water Grades Intense heat Failure

Sex Status Pain of any sort

Criticism

Warmth Praise Suffocation Avoidance

Page 17: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Rewards and Punishments

• Reward – A pleasant stimulus that increases the frequency of the behavior it follows.

• Considered to synonymous with positive reinforcement.

• Punishment – An unpleasant stimulus that suppresses the behavior it follows.

• Usually better to reward children for desirable behavior.

• Pg. 160

Page 18: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Potential Negative Effects of Punishment

Fear ofmanager

Recurrenceof undesirable

employee behavior

Undesirableemotional reaction

Aggressive,disruptivebehavior

Apathetic,noncreativeperformance

High turnoverand absenteeism

But leads to

long-term

Short-termdecrease infrequency

ofundesirable

employeebehavior

Punishmentby

manager

Undesirableemployeebehavior

Antecedent

Which tendsto reinforce

Page 19: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error
Page 20: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Schedules of Reinforcement

• Continuous Reinforcement – A schedule of reinforcement in which every correct response is reinforced.

• Partial Reinforcement – One of several reinforcement schedules in which not every correct response is reinforced.

• Which method do you think is used more in real life?

Page 21: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

An Example of Continuous Reinforcement

• Each instance of a smile is reinforced

Page 22: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Schedules of Reinforcement

• Continuous reinforcement refers to reinforcement being administered to each instance of a response

• Partial reinforcement lies between continuous reinforcement and extinction

Page 23: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Schedules of Reinforcement

• Ratio Version – having to do with instances of the behavior.

• Ex. – Reinforce or reward the behavior after a set number or x many times that an action or behavior is demonstrated.

• Interval Version – having to do with the passage of time.

• Ex. – Reinforce the student after a set number or x period of time that the behavior is displayed.

Page 24: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

4 Basic Schedules of Reinforcement

• Fixed-interval schedule

• Variable-interval schedule

• Fixed-ratio schedule

• Variable-ratio schedule

Page 25: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Fixed-Interval Schedule

• Fixed-interval schedule – A schedule in which a fixed amount of time must elapse between the previous and subsequent times that reinforcement will occur.

• No response during the interval is reinforced.• The first response following the interval is

reinforced.• Produces an overall low rate of responding• Ex. I get one pellet of food every 5 minutes

when I press the lever

Page 26: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Fixed Interval Reinforcement

Page 27: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Variable-Interval Schedule

• Variable-interval Schedule – A schedule in which a variable amount of time must elapse between the previous and subsequent times that reinforcement is available.

• Produces an overall low consistent rate of responding.

• Ex. – I get a pellet of food on average every 5 minutes when I press the bar.

Page 28: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Variable Interval Reinforcement

Page 29: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Fixed-Ratio Schedule

• Fixed-ratio Schedule – A schedule in which reinforcement is provided after a fixed number of correct responses.

• These schedules usually produce rapid rates of responding with short post-reinforcement pauses

• The length of the pause is directly proportional to the number of responses required

• Ex. – For every 5 bar presses, I get one pellet of food

Page 30: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

An Example of Fixed Ratio Reinforcement

• Every fourth instance of a smile is reinforced

Page 31: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Fixed Ratio Reinforcement

Page 32: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Variable-Ratio Schedule

• Variable-ratio Schedule – A schedule in which reinforcement is provided after a variable number of correct responses.

• Produce an overall high consistent rate of responding.

• Ex. – On average, I press the bar 5 times for one pellet of food.

Page 33: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

An Example of Variable Ratio Reinforcement

• Random instances of the behavior are reinforced

Page 34: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Variable Ratio Reinforcement

Page 35: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error
Page 36: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

TYPE MEANING OUTCOME

Fixed Ratio

Reinforcement depends on a definite number of responses

Activity slows after reinforcement and then picks up

Variable Ratio

Number of responses needed for reinforcement varies

Greatest activity of all schedules

Fixed Interval

Reinforcement depends on a fixed time

Activity increases as deadline nears

Variable Interval

Time between reinforcement varies

Steady activity results

Page 37: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Comparisons of Schedules of Reinforcement

Fixed interval Reward on fixed time basis

Leads to average and irregular performance

Fast extinction of behavior

Fixed ratio

Variable ratio

Variable interval

Reward tied to specific number of responses

Leads quickly to very high and stable performance

Moderately fast extinction of behavior

SCHEDULEFORM OF REWARD

Reward given after varying periods of time

Leads to moderately high and stable performance

Slow extinction of behavior

Reward given for some behaviors

Leads to very high performance

Very slow extinction of behavior

INFLUENCE ON PERFORMANCE

EFFECTS ON BEHAVIOR

Page 38: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

FI, VI, FR, or VR?1. When I bake cookies, I can only put one set in at a

time, so after 10 minutes my first set of cookies is done. After another ten minutes, my second set of cookies is done. I get to eat a cookie after each set is done baking.

2. After every 10 math problems that I complete, I allow myself a 5 minute break.

3. I look over my notes every night because I never know how much time will go by before my next pop quiz.

4. When hunting season comes around, sometimes I’ll spend all day sitting in the woods waiting to get a shot at a big buck. It’s worth it though when I get a nice 10 point.

5. Today in Psychology class we were talking about Schedules of Reinforcement and everyone was eagerly raising their hands and participating. Miranda raised her hand a couple of times and was eventually called on.

1. FI

2. FR

3. VI

4. VI

5. VR

Page 39: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

FI, VI, FR, or VR?6. Madison spanks her son if she has to ask him three

times to clean up his room.7. Emily has a spelling test every Friday. She usually

does well and gets a star sticker.8. Steve’s a big gambling man. He plays the slot

machines all day hoping for a big win.9. Snakes get hungry at certain times of the day. They

might watch any number of prey go by before they decide to strike.

10. Mr. Vora receives a salary paycheck every 2 weeks. t11. Christina works at a tanning salon. For every 2 bottles

of lotion she sells, she gets 1 dollar in commission.

12. Mike is trying to study for his upcoming Psychology quiz. He reads five pages, then takes a break. He resumes reading and takes another break after he has completed 5 more pages.

6. FR7. FI

8. VR

9. VI

10. FI

11. FR

12. FR

Page 40: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

FI, VI, FR, or VR?13. Megan is fundraising to try to raise money so she can

go on the annual band trip. She goes door to door in her neighborhood trying to sell popcorn tins. She eventually sells some.

14. Kylie is a business girl who works in the big city. Her boss is busy, so he only checks her work periodically.

15. Mark is a lawyer who owns his own practice. His customers makes payments at irregular times.

16. Jessica is a dental assistant and gets a raise every year at the same time and never in between.

17. Andrew works at a GM factory and is in charge of attaching 3 parts. After he gets his parts attached, he gets some free time before the next car moves down the line.

18. Brittany is a telemarketer trying to sell life insurance. After so many calls, someone will eventually buy.

13. VR

14. VI

15. VI

16. FI

17. FR

18. VR

Page 41: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Applications of Operant Conditioning

• Programmed Learning – A method of learning in which complex tasks are broken down into simple steps, each of which is reinforced. Errors are not reinforced.

• Ex. – Hooked on Phonics

Page 42: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Applications of Operant Conditioning

• Socialization – Guidance of people into socially desirable behavior by means of verbal messages, the systematic use of rewards and punishments, and other methods of teaching.

• Ex. – Children play with other children who are generous and non-aggressive and avoid those who are not.

Page 43: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Applications of Operant Conditioning

• Behavior Modification

Praise this Ignore this

Page 44: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Applications of Operant Conditioning

• Token Economy – An environmental setting that fosters desired behavior by reinforcing it with tokens (secondary reinforcers) that can be exchanged for other reinforcers.

• Ex. – Book It!

Page 45: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Other Types of Learning

• Insight Learning – In Gestalt psychology, a sudden perception of relationships among elements of the “perceptual field”, permitting the solution of a problem.

• Latent Learning – Learning that is hidden or concealed.

• Observational Learning – Acquiring operants by observing other engage in them.

Page 46: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Insight Learning

• Wolfgang Kohler – German Gestalt psychologist

• Experiments with Chimpanzees

• Sultan learned to use a stick to rake in bananas placed outside his cage.

• When the bananas were placed outside of Sultan’s reach, he fitted two poles together to make a single pole long enough to reach the food

Page 47: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Insight Learning• “Aha” Learning• Cognitive Map – A mental

representation or “picture” of the elements in a learning situation, such as a maze.– Ex. – if someone pukes in

the hall that you usually take to your next class you will still be able to find your way because of your mental representation of this school.

Page 48: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Insight Learning

Page 49: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Latent Learning• E.C. Tolman – experiment with

rats.• Rats learn about their

environments in the absence of reinforcement.

• Some rats went through maze for food goals, while others were given no reinforcement for several days.

• After 10 days, rewards were put in with the rats that had previously been given no rewards for 2 or 3 trials.

• Those rats reached the food box as quickly as the rats that had been getting reinforcement for over a week.

• Rats learn about mazes in which they roam even if they are unrewarded for doing so.

Page 50: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

Observational Learning• Albert Bandura• Bobo doll experiment• Assistant placed in room with doll• Beat doll with hammer and hit doll• Kindergarten children watched this

display of aggression in a separate room.

• When placed in the room, they too were extremely aggressive with the doll

• Happened without ever being rewarded for the behavior.

• http://www.npr.org/templates/story/story.php?storyId=1187559

Page 51: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

•ModelingMonkey see, monkey do!

•DisinhibitionGet away with or are rewarded for violence

They always “get the girl/guy/money/car - etc.

•Increased arousalWorks the audience up

Watch the fans at a sporting event Watch your friends watch the WWF

•HabituationWe become used to - desensitization

Page 52: Edward L. Thorndike Animals placed in puzzle boxes String pulled, latch released, animal jumps out and receives food Learning by random trial and error

1. Violent behavior is not representative of most people

2. Violent behaviors are not real - special effects and technology

3. Most people use other than non-violent means to resolve conflict

Mediation

Arbitration

Negotiation

Other•Positive modeling

•Encourage other activities

•Monitor - watch first, watch with, explain

•Limit time

•Watch educational programs

•Act - call legislators

•Abiding by rating system