27
Chapter 9 Model Calibration John Hourdakis Center for Transportation Studies, U of Mn [email protected]

Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

Chapter 9

Model Calibration

John HourdakisCenter for Transportation Studies, U of Mn

[email protected]

Page 2: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

2

Why Calibrate?

• Computers Cannot Magically Replicate Reality!• Simulation Models Are Designed to be General• Driver Behavior and Road Characteristics Depend on

Location i.e. Minnesota vs CaliforniaVehicle Characteristics (Horsepower, Size, etc.)Weather Conditions (Dry, Wet, Ice, etc.)Day or Night

• Microscopic Simulators Can Adapt and Replicate Almost Any Condition if the Model Parameters Are Properly Adjusted

• What is Realistic and What is Not?

Page 3: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

3

Example of a Disaster

• Half of an Ordinary Double Cloverleaf• Freeway Speed 65 mph, Ramps 45 mph• Turning Warnings More Than 2000 Feet Away• Lot’s of Traffic: 6500 veh/hour on Mainline• Three Vehicle Types: Car, Truck, and Semi-Trailer

Page 4: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

4

Simulation Results From 5 Scenarios• Same Demands, Speed Limits, Turnings, Most of

Model Parameters.• Changed Car and Semi-Trailer :

• Acceleration • Normal Deceleration• Maximum Deceleration (Emergency Stop)

F57 sec/veh536 hours5500 vph27.4 mph-2f/s2

E30 sec/veh393 hours6152 vph35.3 mph-1f/s2

D9 sec/veh234 hours6396 vph46.7 mphBASE

F41 sec/veh484 hours6200 vph29.1 mph+1f/s2

D9 sec/veh257 hours6831 vph45.5 mph+2f/s2

LOSAv. DelayTotal Travel TimeAverage FlowAverage Speed

Page 5: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

5

Calibration ProcedureH is t o r ic a l

s y s t e m in p u td a t a

A c t u a ls y s t e m

S y s t e mo u t p u t d a t a

H is t o r ic a l s y s t e min p u t d a t a

S im u la t io nM o d e l

M o d e l o u t p u td a t a

C o m p a r e

V a l id ?C a l ib r a t e

N o

Page 6: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

6

REALITYCurrent Traffic Measurements, etc.

• Need to KNOW the Network to be Replicated• Depending on the Scope of the Project, Two

or More of the Following Are Needed :• Mainline Volumes (Every X Feet)• Mainline Speeds (Every X Feet)• Travel Times (Link or Between O/D Pairs)• Bottleneck Capacity (Measured, Not Theoretical)• Entrance Ramp Queues• Intersection Queues and Queue Discharge Rates

Page 7: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

7

Modeling Parameters• Global Parameters

• Vehicle Parameters• Size and Power• Driver Behavior

• Car-Following Model Parameters• Reaction Time• Speed Distibution Among Lanes (Overtaking

Manuevers)

• Local Section Parameters• Curvature and Grade• Speed Limit or Free Flow Speed• Lane Changing Distances• Headway/Hesitation Factor

Page 8: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

8

Model Parameter Issues

• Driver/Vehicle Characteristics• Literature Does Not Provide Adequate Information• Not Common to All Simulators• No Info on Effects of Weather/Pavement/Ambient

Conditions

• Car-Following Parameters• Depend on the Simulator’s Car-Following Model

Employed.• Most Simulators Do Not Adequately Describe the

Modeling Process (if at all).• User Has No Sense of Model Parameter Effects on Results.

Page 9: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

9

Calibration Issues

• Very Important For Model Accuracy and Robustness

• Accuracy Depends on Measurement Granularity• Averages Over Several Days is a Bad Choice• Might Need Additional Information to be Collected in

Turbulent Sections (Bottlenecks, Weaving Areas, etc.)

• Simulation Objective Affects Calibration• When Adaptive Control Strategies Are Simulated,

Stricter Validation is Needed• Modeling of an Isolated Interchange in Rural

Minnesota Will be Restrictive

Page 10: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

10

Calibration Issues

• VERY TIME CONSUMMING PROCESS• Currently Simulators Do Not Provide a Methodology or

Tools to Assist in Calibration• Often Users End Up in Endless Trial-and-Error Cycles• Sporadic Attempts Made in Literature to Streamline the

Process But:• Focused on a Particular Simulator• Too Complex or too Naive to be Effectively Used in

Practice• No Widely Accepted Methods/Standards Currently

Available

Page 11: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

11

Before Calibration!

• Check Geometry For Correctness• Disjoined Sections• Stuck Vehicles (Sizes of Accel/Decel Lanes)• Verify Location of Detectors

• Check Input For Accuracy• Entrance Volume Comparison (Perfect Match)• Exit Volume Comparison (Match Sum Over All

Hours)• Volume Totals on Mainline Stations Should Match

Page 12: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

12

Practical Calibration Methodology(Employed on Twin Cities Freeways)

• Need Simultaneous Boundary and Mainline Station Measurements.• Technique:

upstream mainline detector stations

• Objective is to Match the Simulated and Actual Mainline Traffic Measurements

• Traffic Measurements Used: Volume and Speed • Occupancy Affected by Detector Sensitivity (Unknown)

• Perform Calibration in Stages:• First 2 Stages Based on Volume and Speed in That Order• Further Improvements in Optional 3rd Stage:

Depending on Objective i.e. For Ramp Control -> Queue Length

Page 13: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

13

Goodness-of-Fit Test Measures

• Recommended Goodness-of-Fit Measures:

1. RMS Percent Error =(Measures Overall % Error)

2. Correlation Coefficient =(Measures Linear Association)

Where xi is the Simulated Traffic Measurement at Time i x’ is the Mean of the Simulated Traffic Measurementsyi is the Actual Traffic Measurement at Time iy’ is the Mean of the Actual Traffic Measurementsσx is the Standard Deviation of the Simulated Traffic Measurementσy is the Standard Deviation of the Actual Traffic Measurementsn is the Number of Traffic Measurement Observations

∑=

−−−

=n

i yx

ii yyxxn

CORL1

))((1

1σσ

∑=

−n

i i

ii

yyx

n 1

21

Page 14: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

14

Goodness-of-Fit Measures (Cont.)3. Theil’s U (Considers the Disproportionate Weight of

Large Errors)• 3 Components of Theil’s U

Us (Measure of Variance Proportion, Close to 1 Satisfactory)

Uc (Measure of Covariance Proportionor Unsystematic Error, Close to 0 Satisfactory)

Um (Measure of Bias Proportion orSystematic Error, Close to 0 Satisfactory)

Where σy and σx are the Standard Deviations of the Actual and Simulated Series

r is the Correlation Coefficient Between the Two Series

∑∑

==

=

+

n

ii

n

ii

n

iii

xn

yn

xyn

1

2

1

2

1

2

11

)(1

∑=

−= n

iii

xys

xy

nU

1

2

2

)(

)( σσ

∑=

−= n

iii

xyc

xy

nrU

1

2)(

)1(2 σσ

∑=

−= n

iii

m

xy

xynU

1

2

2

)(

)(

Page 15: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

15

Examples

50

100

150

200

1 7 13 19 25

31

37 43 49 55

61

67

tim e in 5-m in increm ents from m idnight

volu

me

(veh

icle

s/5-

min

)

ac tual volume

simulated volume

50

100

150

200

1 7 13

19

25 31 37

43 49 55

61 67

tim e in 5-m in increm e nts from midnight

volu

me

(veh

icle

s/5-

min

)

actual volume

s imulated volume

50

100

150

200

1 7 13

19

25 31 37

43 49 55

61 67

tim e in 5-m in increm e nts from midnight

volu

me

(veh

icle

s/5-

min

)

actual volume

s imulated volume

50

100

150

200

1 7 13 19

25

31 37 43 49

55

61 67

t ime in 5-m in in cr em ents from m idnigh t

volu

me

(veh

icle

s/5-

min

)s imulated volume

actual v olume

Figure 1(a): Illustration of unsatisfactory Um Figure 1(b): Illustration of unsatisfactory Uc

Figure 1(c): Illustration of unsatisfactory Uc Figure 1(d): Illustration of unsatisfactory Us

Page 16: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

16

Stage 1: Volume-Based Calibration

• Objective is to Match Simulated and Actual Mainline Station Volumes

• Simulation Model Calibrated Beginning Upstream and Proceeding Downstream

• Global Parameters Are Calibrated First:• Usually Accomplished in First Few Stations• Trial & Error Iterative Process For Each Parameter• RMSP, r and U are the Metrics Used in Each Iterations

• Local Parameters Calibrated at All Stations• Um, Uc and Us are the Metrics Used at This Point.

Page 17: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

17

Stage 2: Speed-based Calibration• Objective is to Match Simulated and Actual Mainline

Station Speeds and Bottleneck Locations• Actual Speeds Derived From Volume, Occupancy,

and Effective Vehicle Length (For Single Loop Detectors)

• Speed Contour Graphs Used For Comparing (Visually) Simulated and Actual Speeds

1. If Speed Contours Exhibit Significant Discrepancy, Revise Global Parameters From Stage 1

2. Beginning Upstream and Proceeding Downstream, Calibrate Local Parameters Until Mainline Speeds and All Bottleneck Locations in the 2 Contour Graphs Match

Page 18: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

18

Stage 3: Application Dependent

• For Adaptive Ramp control:• Compare Queue Lengths• Local Simulation Parameters Affecting Detector Output

Need Further Calibration in This Stage

• Simulated Entrance Ramp Queues Should Match Actual Ones• Queue Measurements From 30 sec Detector Counts• Queue = Metered Demand - Actual Demand (Upstream)

• Over-Calibration to be Avoided to Ensure Generality• Repeat Simulation With Different Random Seeds• Simulate Additional Days

Page 19: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

19

Example of Calibration

• AIMSUN Microsimulator• Mn/DOT Ramp Metering Evaluation.• 2 Test Sites (Only One Presented)

• TH 169 Northbound in Minneapolis, MN12 Miles Long: From I-494 to 63rd Avenue N24 Entrance Ramps, 25 Exit Ramps30 Detector Stations.5-Minute Volume and OccupancyMarch 21st to 23rd, 200014:00 to 20:00 hrs

Page 20: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

20

Test site 1: TH 169NBAA

AA

BB

BB

Page 21: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

21

Stage 1 Results (Volume)

• 500 Simulator Iterations Required, 2 Months (!)• Irregularities in Input Data Observed Due to

Sensor Misplacement

0.881Uc0.031Us0.088Um0.002U7.39%RMSP0.960r

ValuesGoodness-of-Fit Measure

Page 22: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

22

Actual Speed Contour16

9

171

173

175

177

179

181

183

185

187

189

191

193

195

197

199

201

203

205

207

209

211

213

215

217

219

221

223

225

227

229

231

233

235

237

239

426

428

430

432

434

437

439

441

443

769

767

40-50

50-6060-7070-8080-90

90-100100-110

TIMETIME UpstreamUpstream

DownstreamDownstream

Page 23: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

23

169

171

173

175

177

179

181

183

185

187

189

191

193

195

197

199

201

203

205

207

209

211

213

215

217

219

221

223

225

227

229

231

233

235

237

239

426

428

430

432

434

437

439

441

443

769

767

60-70

70-80

80-90

90-100

100-110

16

9

172

175

178

181

184

187

190

193

196

199

202

205

208

211

214

217

220

223

226

229

232

235

238

4 26

4 28

4 30

4 32

4 34

4 37

4 39

4 41

4 43

7 69

7 67

Spe e d (s im u la tion )40- 50

50- 60

60- 70

70- 80

80- 90

90- 10 0

100 -1 10

169

171

173

175

177

179

181

183

185

187

189

191

193

195

197

199

201

203

205

207

209

211

213

215

217

219

221

223

225

227

229

231

233

235

237

239

426

428

430

432

434

437

439

441

443

769

767

40-50

50-6060-7070-8080-90

90-100100-110

Stage 2 ResultsStage 2 Results(Speed)(Speed)

Simulated Speed After Stage 2 (About 200 Iterations)

Actual Speed

Simulated Speed After Stage 1

Page 24: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

24

Stage 3 Results (Queues)• Simulated and Actual Queues Did Not Match Before This Stage• 3rd Stage Required About 100 Iterations

Example Ramp

-10

0

10

2 0

3 0

4 0

50

6 0

70

8 0

t ime ( 5 - min inc re me nt s f ro m mid nig ht )

real queue

simulation queue afterstage 3simulation queue beforestage 3

Simulation vs Real Queues

Page 25: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

25

Validation Accuracy(Volume)

0.8810.0310.0880.0020.9607.390March 23rd

(Calibration)

0.8230.0540.1240.0010.9706.420March 22nd

(validation)

0.6810.0110.3090.0040.98010.620March 21st

(validation)

UcUsUmUrRMS%

Over All Stations.

Page 26: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

26

Results (Calibrated Parameters)AIMSUN Microscopic Simulator

55.00055.00060.00060.000Av. section speed (ramp section, kmph)

70.00070.00075.00090.000Av. section speed (weaving section, kmph)

105.000105.000100.000110.000Av. section speed (regular section, kmph)

70.00070.00050.00050.000Max. speed difference on-ramp (kmph)

60.00060.00040.00040.000Max. speed difference (kmph)

0.9900.9901.0001.000Percent recover

0.9400.9400.9500.950Percent overtake

0.6100.6100.5900.700Reaction time (sec)

-5.500-5.500-5.500-5.000Max. dec. rate (m/s2)

-5.000-5.000-5.000-4.500Normal dec. rate (m/s2)

3.0003.0003.0004.500Max. acc. rate (m/s2)

110.000110.000110.000105.000Max. desiredspeed (kmph)

After stage 3After stage 2After stage 1InitialParameter

Page 27: Model Calibration WHITE · 2010-02-01 · 9 Calibration Issues • Very Important For Model Accuracy and Robustness • Accuracy Depends on Measurement Granularity • Averages Over

27

Conclusion

• Garbage In >>> Garbage Out

• Simulation Useless/Dangerous Without Calibration

Questions?