43
PSYCHOLOGYUnit 3: Learning“Operant Conditioning”

PSYCHOLOGYUnit 3: Learning “Operant Conditioning”

  • Upload
    zia

  • View
    36

  • Download
    0

Embed Size (px)

DESCRIPTION

PSYCHOLOGYUnit 3: Learning “Operant Conditioning”. What is Learning?. Most learning is... A ssociative L earning : R ealization that certain events occur together . Learning itself refers to a relatively durable change in behavior or knowledge that is due to experience . - PowerPoint PPT Presentation

Citation preview

Page 1: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

PSYCHOLOGYUnit 3: Learning“Operant Conditioning”

Page 2: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

What is Learning?Most learning is... Associative Learning: Realization that certain events occur together.Learning itself refers to a relatively durable change in behavior or knowledge that is due to experience.★ Classical Conditioning★ Operant Conditioning★ Observational Learning

(Latent, Abstract, Insight)

Page 3: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

BehaviorismEverything you know, everything you are is the result of human behavior.

In other words, psychology is the study of behavior, not of the mind!

Picked up steam in the late 1960s and during the 1970s. A reaction to the non-scientific work of Freud.

Page 4: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

B.F. SkinnerInstead of antecedents of behavior (what comes before) a new focus on consequences of behavior.

BF Skinner argued that, CC did not explain complex behavior.

2 categories of consequences: Reinforcement & Punishment

Reinforcement is designed to increase the probability that a behavior will occur again.

Punishment is designed to decrease the probability that a behavior will occur again.

Page 5: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Operant ConditioningA type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Trial & Error---------------->Trial & Reward---------->Operant Conditioning Operant Response - Reinforcement - Learned Behavior

in rats:★ trial and error learning★ allows acquisition of motor programs that are not

instinctive★ behavior shaped by rewards★ develops as a result of the association of

reinforcement with a particular response★ on a proportion of occasions

Page 6: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive reinforcement - when something is given (apply an aversive stimulus).

Negative reinforcement - when something is removed (remove an aversive stimulus).Skinner - punishment should be judicious, immediate, consistent, & severe enough actually to be a punishment.

Page 7: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

http://www.youtube.com/watch?v=guroaQRFsX4

Page 8: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive ReinforcementStrengthens a response by presenting a stimulus after a response.

$$$ Getting Paid!

We may continue to go to work each day because we receive a paycheck on a weekly or monthly basis.  

***AWARDS***If we receive awards for writing short stories, we may be more likely to increase the frequency of writing short stories. 

"PRAISE!"Receiving praise for our karaoke performances can increase how often we sing. 

Page 9: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Negative ReinforcementStrengthens a response by reducing or removing an aversive stimulus.Driving in heavy traffic is a negative condition for most of us. You leave home earlier than usual one morning, and don't run into heavy traffic. You leave home earlier again the next morning and again you avoid heavy traffic. Your behavior of leaving home earlier is strengthened by the consequence of the avoidance of heavy traffic.

The concept of Negative Reinforcement is difficult to learn because of the word negative. Negative Reinforcement is often confused with Punishment. They are very different, however.Negative Reinforcement strengthens a behavior because a negative condition is stopped or avoided as a consequence of the behavior.

Page 10: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

PunishmentPunishment, on the other hand, weakens a behavior because a negative condition is introduced or experienced as a consequence of the behavior.

Punishment is often mistakenly confused with negative reinforcement.

Remember, reinforcement always increases the chances that a behavior will occur

and Punishment always decreases the chances that a behavior will occur.

Page 11: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

PunishmentPositive Punishment: This type of punishment is also known as "punishment by application." Positive punishment involves presenting an aversive stimulus after a behavior as occurred.

For example, when a student talks out of turn in the middle of class, the teacher might scold the child for interrupting her.

Page 12: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

PunishmentNegative Punishment: This type of punishment is also known as "punishment by removal." Negative punishment involves taking away a desirable stimulus after a behavior as occurred.

For example, when the student from the previous example talks out of turn again, the teacher promptly tells the child that he will have to miss recess because of his behavior.

Page 13: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Punishment also has some notable drawbacks. First, any behavior changes that result from

punishment are often temporary. "Punished behavior is likely to reappear after the punitive consequences are withdrawn," Skinner explained in his book About Behaviorism. Perhaps the greatest drawback is the fact that

punishment does not actually offer any information about more appropriate or desired behaviors. While subjects might be learning to not perform certain actions, they are not really learning anything about what they should be doing.Another thing to consider about punishment is that it

can have unintended and undesirable consequences. For example, while approximately 75 percent of

parents in the United States report spanking their children on occasion, researchers have found that this type of physical punishment can lead to antisocial behavior, aggressiveness and delinquency among children. For this reason, Skinner and other psychologists

suggest that any potential short-term gains from using punishment as a behavior modification tool need to be weighed again the potential long-term consequences.

Page 14: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

PunishmentAn event that DECREASES the behavior that it

follows.

Does punishment work?

Page 15: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Tardies & D-HALLS The Breakfast Club was released in 1985.Saturday, March 24, 1984. Shermer High School, Shermer, Illinois. 60062.

Dear Mr. Vernon,

We accept the fact that we had to sacrifice a whole Saturday in detention for whatever it was that we did wrong…

what we did was wrong, but we think you’re crazy to make us write this essay telling you who we think we are. What do you care? You see us as you want to see us…

in the simplest terms & the most convenient definitions.

You see us as a brain, an athlete, a basket case, a princess & a criminal.

Correct?

That’s the way we saw each other at seven o’clock this morning. We were brainwashed.

Page 16: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

http://www.youtube.com/watch?v=FpHWRzGSnUs

Page 17: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”
Page 18: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

http://www.youtube.com/watch?v=nMGRck-fVJ0

Page 19: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

A lot of students are confused about negative reinforcement. What's the difference between that and punishment? Remember, it's "reinforcement" so the behavior increases, and because it's "negative," the reinforcer is removed after the response.

Page 20: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement? Cleaning the house to get rid of the disgusting mess and/or to stop your mother from nagging

Page 21: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement? Cleaning the house to get rid of the disgusting mess and/or to stop your mother from nagging

NEGATIVE REINFORCEMENTStrengthens a response by reducing or removing an aversive stimulus.Nagging/Mess as negative reinforcer to cleaning.

Page 22: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement?

Taking aspirin to relieve a headache

Page 23: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement?Taking aspirin to relieve a headache

NEGATIVE REINFORCEMENTStrengthens a response by reducing or removing an aversive stimulus.headache as negative reinforcer to taking medication.

Page 24: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement?

Listening to your favorite music after studying for an hour

Page 25: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement? Listening to your favorite music after studying for an hourPOSITIVE REINFORCEMENT: Strengthens a response by presenting a stimulus after a response.

Page 26: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement?Leaving the movie theater if the movie is bad

Page 27: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement? -- Leaving the movie theater if the movie is bad

Negative Reinforcement strengthens a behavior because a negative condition is stopped or avoided as a consequence of the behavior.

Page 28: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement? Giving in to an argument or to a child or dog’s begging

Page 29: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Positive or Negative Reinforcement? Giving in to an argument or to a child or dog’s begging

Negative Reinforcement strengthens a behavior because a negative condition is stopped or avoided as a consequence of the behavior.Negative reinforcement is NOT the same as punishment! Negative reinforcers, like all reinforcers, increase the frequency of the responses that they follow. (Punishment, in contrast, decreases the frequency of responses.)

Page 30: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

http://www.youtube.com/watch?v=i0ad2NSwGb0

Page 31: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Fixed-ratio SchedulesA schedule that reinforces a response only after a specified number of responses. Examples in natural environments:Jobs that pay based on units delivered.

Employees often find this schedule undesirable because it produces a rate of response that leaves them nervous and exhausted at the end of the day.

They may feel pressured not to slow down or take rest breaks, since they feel that such will costs them money. This is an example of how a schedule can produce a high rate of response even though the response rate is aversive to the subject.

Examples in video games:Collecting tokens.

Many games require the player to collect a fixed number of tokens to advance to the next level, obtain a new life point, or receive some other reinforcers.

Attaining a new level in an RPG. Some RPG's clearly indicate how much experience is required to achieve the next level.

A high degree of certainty as to the level of work that will be required to achieve the next level puts the player on a fixed ratio schedule.

Page 32: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

A schedule of reinforcement that reinforces a response after an unpredictable number of responses.

Variable-ratio Schedule

Slot machines: Slot machines are programmed on VR schedule. The gambler has no way of predicting how many

times he must put a coin in the slot and pull the lever to hit a payoff but the more times a coin is inserted the greater the chance of a payout.

People who play slot machines are often reluctant to leave them, especially when they have had a large number of un-reinforced responses. They are concerned that someone else will win the moment they leave.

Playing golf:It only takes a few good shots to encourage the

player to keep playing or play again. The player is uncertain how good each shot will be, but the more often they play, the more likely they are to get a good shot.

Door to door salesmen: It is uncertain how many houses they will have to

visit to make a sale, but the more houses they try, the more likely that they will succeed.

Page 33: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Fixed-interval ScheduleA schedule of reinforcement that reinforces a response only after a specified time has elapsed.

An example might be getting a raise every year and not in between. A major issue with this schedule is that people tend to improve their performance right before the time period expires so as to "look good" when the review comes around.Example: I give Bart a Butterfinger every ten minutes after he moons someone.

"HAHA!"In the Real World: A weekly paycheck is a good example of a fixed-interval schedule. The employee receives reinforcement every seven days, which may result in a higher response rate as payday approaches.

Page 34: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Variable-interval ScheduleA schedule of reinforcement that reinforces a response at unpredictable time intervals.Reinforcing someone after a variable amount of time is the final schedule.

This may not be as true for punishment since consistency in the application is so important, but for all other types of reinforcement they tend to result in stronger responses.

If you have a boss who checks your work periodically, you understand the power of this schedule. Because you don’t know when the next ‘check-up’ might come, you have to be working hard at all times in order to be ready.In this sense, the variable schedules are more powerful and result in more consistent behaviors.

Page 35: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”
Page 36: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

http://www.youtube.com/watch?v=7k47hLVBYuY

Page 37: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

ShapingA procedure in Operant Conditioning - reinforces & guides behavior closer and closer towards a goal.Reinforcers guide behavior, step-by-step. Closer and closer to the target behavior through successive approximations. “Baby Steps”ReinforcersAny event that STRENGTHENS the behavior it follows.There are + and – reinforcers.+ Positive Reinforcers: Strengthens a response by presenting a stimulus after a response.- Negative Reinforcers: Strengthens a response by reducing or removing an aversive stimulus.

Page 38: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

http://www.youtube.com/watch?v=OCUWHP4YDgU

Page 39: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

http://www.youtube.com/watch?v=ncFCdCjBqcE

Page 40: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”
Page 41: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”
Page 42: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”

Classical vs. OperantThey both use acquisition, discrimination, SR, generalization and extinction.Classical Conditioning: automatic (respondent behavior). Ex.) Your dog gets sick and requires several painful trips to the vet. Now he hides every time he hears you rattle your keys. Automatic.Operant Conditioning: behavior where one can influence their environment with behaviors which have consequences (operant behavior).Ex.) Teacher comments on test.

Page 43: PSYCHOLOGYUnit 3:  Learning “Operant  Conditioning”