42

Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Embed Size (px)

Citation preview

Page 1: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment
Page 2: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Operant Conditioning• A type of learning in which behavior is

strengthened if followed by reinforcement or diminished if followed by punishment.

Page 3: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Edward Thorndike

• Law of Effect: rewarded behavior is likely to recur.

• 1905

Page 4: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Thorndike’s Puzzle Box

Page 5: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

B.F. Skinner(March 20, 1904 – August 18, 1990)

Page 6: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Operant Conditioning ChamberSkinner Box 1948

Page 7: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment
Page 8: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Skinner Reinforcement Schedules

• Continuous Reinforcement: every time rat does a behavior he gets food.

• Fixed ratio: if rat presses lever three times he gets food.

• Fixed interval: if rat presses lever once during a fixed time period – 30 seconds, he gets food.

Even if he presses lever 100 times he get only one food pellet.

Page 9: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Skinner Reinforcement Schedules

• Variable ratio: first it take 3 lever presses to get food, then 10, then 1, then 15, and so on.

(focus on animal’s behavior)• Variable interval: the administrator of the food

keeps changing the time period to receive food. First 20 seconds, then 45 seconds, then 10 seconds and so on.

(focus on time period for rewards)

Page 10: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Classical v. Operant• They both use acquisition, discrimination, SR,

generalization and extinction.•Classical Conditioning is automatic (respondent behavior). Dogs automatically salivate over meat, then bell- no thinking involved.

•Operant Conditioning involves behavior where one can influence his/her environment with behaviors which have consequences (operant behavior).

Page 11: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Shaping

• A procedure in Operant Conditioning in which reinforcers guide behavior closer and closer towards a goal.

Page 12: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

REINFORCEMENT PUNISHMENTPo

sitiv

eN

egati

ve

Page 13: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Reinforcer• Any event that STRENGTHENS the behavior it follows. A

stimulus or event that affects the likelihood that a behavior will REPEATED

Two Types of Reinforcement:

Positive and Negative

Page 14: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Types of Reinforcements:Positive Reinforcement

• Anything that increases the likelihood of a behavior by following it with a response.

• Something ADDED

Page 15: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Types of Reinforcements:Negative Reinforcement

• Anything that increases the likelihood of a behavior by following it with the removal of an undesirable event or state

• Taking away something you don’t like in order to make a behavior more likely

Page 16: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Wearing seat belt to stop dinging

Faking sick to leave school

Sleep if tiredRunning inside with cold weather

“I won’t make you do your chores IF you come to your grandmother’s this weekend instead of hang with your friends”NOT THE SAME AS PUNISHMENT!!

Page 17: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Punishment

Adding or subtracting something you don’t like in order to make a behavior less likely.

Unpleasant and negative consequence (stimulus) is added or removed to stop behavior

DECREASE an undesirable behavior.

Two types of PunishmentsPositive and Negative

Page 19: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Type of PunishmentNegative Punishment

Omission Training/Response Cost

Removing (stimulus) something enjoyable with intent to DECREASE behavior.

A positive event or object is usually removed

Time outsTaking car awayTaking phone awayTake away privileges.

.

Page 20: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

TRY IT!!a. Billy likes to campout in the backyard. He camped-

out on every Friday during the month of June. The last time he camped out, some older kids snuck up to his tent while he was sleeping and threw a bucket of cold water on him. Billy has not camped-out for three weeks

1. What behavior was changed? 2. Was the behavior strengthened or weakened? 3. What was the consequence? 4. Was the consequence added or subtracted?Positive Reinforcement, Negative Reinforcement, Response

Cost/Negative Punishment, Positive Punishment

Page 21: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

l. What behavior was changed? camping-out

2. Was the behavior strengthened or weakened? weakened (eliminate positive and negative reinforcement)

3. What was the consequence? having water thrown on him

4. Was the consequence added or subtracted? Added

ANSWER: Since a consequence was added and the behavior was weakened, the process was Positive Punishment.

Page 22: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Try It!b. Every time Madge raises her hand in class she is called on.

She raised her hand 3 time during the first class, 3 times in the second and 4 times during the last class.

l. What behavior was changed? 2. Was the behavior strengthened or weakened?3. What was the consequence? 4. Was the consequence added or subtracted? • Positive Reinforcement, Negative Reinforcement, Negative

Punishment /Response Cost, Positive Punishment

Page 23: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

1. What behavior was changed? Hand raising

2. Was the behavior strengthened or weakened? Strengthened

3. What was the consequence? Being called on

4. Was the consequence added or subtracted? Added

ANSWER: Since the consequence was added and the behavior was strengthened, the process is positive reinforcement.

Page 24: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

TRY IT!!c. When Greg follows a direction / command he earns a

point. At the end of each day, he can "buy" free time, TV. privileges, etc. with his points. When he misbehaves or doesn't follow a command, he loses points. Greg used to call his mom names. Since he has been on the point system, his name calling has been reduced to almost zero.

1. What behavior was changed? 2. Was the behavior strengthened or weakened? 3. What was the consequence? 4. Was the consequence added or subtracted?Positive Reinforcement, Negative Reinforcement, Response

Cost/Negative Punishment, Positive Punishment

Page 25: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

1. What behavior was changed? Calling his mom names

2. Was the behavior strengthened or weakened? Weakened (eliminate positive and negative reinforcement)

3. What was the consequence? Losing points

4. Was the consequence added or subtracted? Subtracted

ANSWER: Since the consequence was subtracted and the behavior was weakened, the process is negative punishment / response cost.

Page 26: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

TRY IT!!d. John does not go to the dentist every 6-months

for a checkup. Instead, he waited until a tooth really hurts, then goes to the dentist. After two emergency trips to the dentist, John now goes every 6-months.

1. What behavior was changed? 2. Was the behavior strengthened or weakened? 3. What was the consequence? 4. Was the consequence added or subtracted?Positive Reinforcement, Negative Reinforcement, Response

Cost/Negative Punishment, Positive Punishment

Page 27: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

1. What behavior was changed? going to the dentist

2. Was the behavior strengthened or weakened? strengthened (eliminate response cost / negative punishment)

3. What was the consequence? tooth no longer hurting

4. Was the consequence added or subtracted? Subtracted

ANSWER: Since the consequence was subtracted and the behavior was strengthened, the process is negative reinforcement.

Page 28: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment
Page 29: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Reinforcement SchedulesTiming and Frequency are important

Page 30: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Continuous Reinforcement• Reinforcing the desired response every time it occurs.

Rewards follow every correct response!!• + = great for new behaviors• - = easy to extinguish

Quick Acquisition

Quick Extinction

Page 31: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Partial Reinforcement

• Reinforcing a response only part of the time. Rewards follow only some correct responses

• The acquisition process is slower.

• Greater resistance to extinction.

• Ratio and Interval

Page 32: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Skinner’s Reinforcement Schedules(Partial Reinforcements)

• Ratio - refers to the amount of times a behavior must be performed before it is reinforced.

• Interval- refers to the amount of time that has passed before reinforcement occurs

Page 33: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Fixed Variable

Ratio

Interval

.

Page 34: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Fixed-ratio Schedules• A schedule that reinforces where a response is given only after a

specified number of responses. Reinforcement is given after the subject performs a behavior a fixed # of times.

Example: I give cookie monster a cookie every FIVE times he sings “C is for cookie”.

Page 35: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Variable-ratio Schedule• A schedule of reinforcement that

reinforces a response after anunpredictable number of responses. The number of

times the subject must

perform a behavior

before being rewarded is

unknown or random.

Example: I give Homer a donut at random times when he says “DOH!!!”

Page 36: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Fixed-interval Schedule• A schedule of

reinforcement that reinforces a response only after a specified time has elapsed.

Example: Waiting for a bus. Buses are supposed to arrive at stops at fixed intervals, and the passenger will miss the bus if they are not there at the time the bus arrives.

Page 37: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Variable-interval Schedule• A schedule of

reinforcement that reinforces a response at unpredictable time intervals. A behavior is only reinforced if performed after an unknown amount of time has passed.

Pop Quizzes

Page 38: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Fixed(predictable)

Variable(changes, random)

Ratio

paying a worker every time they make 10 items.

Collecting Tokens in a Video Game: In many video games, you have to collection so many tokens in order to receive some type of prize.

slot machine at a casino.

Sales bonuses: Call centers often offer random bonuses to employees. Workers never know how many calls they need to make in order to receive the bonus, but they know that they increase their chances the more calls or sales they make

Interval

A weekly paycheck is a good example of a fixed-interval schedule. The employee receives reinforcement every seven days

Quiz every Friday

Rats get food every 60 seconds with bar push

Don’t study until Thursday night

Fishing requires a great deal of patience because the fisherman never knows when a fish will get caught on the hook.

Pop quiz

Checking Your Email: Typically, you check your email at random times throughout the day instead of checking every time a single message is delivered

Study little all week

Page 39: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment
Page 40: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment
Page 41: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment

Reinforcement

Page 42: Operant Conditioning A type of learning in which behavior is strengthened if followed by reinforcement or diminished if followed by punishment