28
Decisions to be made in developing an adaptive testing system for K12 education G. Gage Kingsbury March 9, 2012

Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Decisions to be made in developing an adaptive testing system for K–12 education

G. Gage Kingsbury

March 9, 2012

Page 2: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Welcome and Introduction

5

Page 3: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Presenter

G. Gage Kingsbury

Vice President for the International Association

for Computerized Adaptive Testing (IACAT) and

Senior Research Fellow at the Northwest

Evaluation Association (NWEA)

6

Page 4: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Decisions to be made in developing an adaptive testing system for K–12 education

7

Page 5: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

The Idea

An adaptive test is a test that

adjusts its characteristics based

on the performance of a test taker.

8

Page 6: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Questions and Answers

9

Page 7: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Computerized Adaptive Testing

175

191

202

210

216

221

228 231

229 228 230

232 234 235 234 233 234 235

150

160

170

180

190

200

210

220

230

240

250

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Test Questions

Achie

vem

ent S

core

Basic

Proficient

Advanced

20 Item Test

225 226

10

Page 8: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Pioneers of adaptive testing

• Alfred Binet

• Frederick Lord

• David J. Weiss

• Fumiko Samejima

• Mark Reckase

11

Page 9: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

First implementers

• David Foster

• Jim McBride

• Tony Zara

• Gage Kingsbury

12

Page 10: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

You have chosen to use an adaptive test

because …

• It can be more efficient than a fixed-form test

• It provides good information across a broader

spectrum of student performance

• It can provide immediate scoring and

reporting

• It can provide better security than a fixed-form

test

• It can be designed to measure growth

13

Page 11: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Since the first implementations

• We have seen international growth in the use

of CAT for

– Educational testing

– Medical outcomes assessment

– Certification and licensure

14

Page 12: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Accuracy of adaptive tests

• Compared to a fixed-form test

• As a function of test length

• Depending on termination procedure

15

Page 13: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Relationship between Spring and Fall Reading Scores

150

160

170

180

190

200

210

220

230

240

250

150 160 170 180 190 200 210 220 230 240 250

Spring RIT

Fa

ll R

IT

PP to CAT PP to PP

16

Page 14: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Students' Mean = 211.7 s.d. = 192 Proficiency = 205 11.11 Basic =

Test Information Functions for Grade 4 Mathematics

.00

.02

.04

.06

.08

.10

.12

165 175 185 195 205 215 225 235 245

RIT

Info

rma

tion

17

Page 15: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Choosing to use an adaptive test requires

making a series of decisions in the areas of…

• Psychometrics

• Interface (including accommodations)

• Item designs

• Test designs

• Test distribution

• Item usage

• Item and test security

• Proctor training

• Reporting

18

Page 16: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Basics of a theoretical CAT

• IRT model

• Item pool

• Select first item

• Select next item

• Terminate test

• Score

19

Page 17: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Decision areas for an operational CAT for

measuring student achievement

• Before the test (Test stuff)

– How will we develop the measurement scale?

– What mix of item styles will we need?

– Which IRT model is appropriate?

– What depth do we need in our item bank?

– How will we choose an operational item pool?

– What will our test blueprint include?

– How will we QA everything involved?

20

Page 18: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Questions and Answers

21

Page 19: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Decision areas for an operational CAT for

measuring student achievement

• Before the test (School stuff)

– School, teacher, and student identification

– Establishing a testing environment

– Teacher training

– Software/hardware setup

– Proctor training

– Student familiarization

– Student scheduling

– QA

22

Page 20: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Decision areas for an operational CAT for

measuring student achievement

• Test administration

– Student verification process

– Test selection

– Proctor throughout

– Identify previously used items

23

Page 21: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Decision areas for an operational CAT for

measuring student achievement

• Test event

– Apply test blueprint

– Select first item or set of items

– Check for effort

– Update item selection theta hat

– Update constraints

– Select next item

– Terminate test

24

Page 22: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Decision areas for an operational CAT for

measuring student achievement

• After the test

– Calculate final score

– Calculate growth

– Terminate test session

– Store data

– Identify student as completing test

– Compare to norms, growth norms, content, etc.

– Create individual student report

– Add information to teacher/administrator reports

25

Page 23: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Measuring growth and adaptive testing

• Measuring at multiple points in time

• The standard deviation of growth

• The standard error of growth

• Reduction of uncertainty

• Growth and instruction

26

Page 24: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Adaptive testing and idiosyncratic knowledge

patterns

• Can there be multiple thetas without

multidimensionality?

• Selecting items to reveal knowledge patterns

• A simple algorithm

• The impact on instruction

27

Page 25: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Field testing within an adaptive testing

system

• Calibration differences from paper to CAT

• Random sampling for calibration in CAT

• Using provisional calibrations in CAT field

tests

28

Page 26: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Cautionary notes

• Adaptive testing needs to be well tuned to

avoid bad tests.

• The item pool must support the stakes.

• Adaptive testing changes, but doesn’t

eliminate, security issues.

– Brain dump sites

• Limit desire. No test can do everything.

• Adaptive test development is never done.

29

Page 27: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Have fun

• The decisions to be made should consider the

good of the students for whom the test is

designed.

• Don’t try to build the perfect test—it won’t be.

• Consider a ―dry eye‖ policy—making kids cry

isn’t the purpose of the test.

30

Page 28: Decisions to be made in developing an adaptive testing ... · for Computerized Adaptive Testing (IACAT) and ... Decisions to be made in developing an adaptive testing system for K–12

Thank you

Gage Kingsbury

[email protected]

31