22
© 2008 EMB. All rights reserved. Vehicle Symbol Development Serhat Guven Senior Consultant September 2009

Vehicle Symbol Development

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Vehicle Symbol Development

© 2008 EMB. All rights reserved.

Vehicle Symbol DevelopmentSerhat GuvenSenior Consultant

September 2009

Page 2: Vehicle Symbol Development

Vehicle Symbol Analysis

© 2008 EMB. All rights reserved. Slide 2

OUTLINEBackgroundVehicle Estimator

Initial EstimatorDiagnosticsVehicle Spatial Correction

Vehicle SymbolsVehicle RelativitiesSummary

PURPOSE: To discuss techniques for creating proprietary symbols within the context of multivariate review, including proper tools and diagnostics

Page 3: Vehicle Symbol Development

Background

© 2008 EMB. All rights reserved. Slide 3

Response Variable

Systematic Component

Random Component

= +

Signal:Function of the rating

factors/predictors

Noise:Reflects stochastic

process

Overall Mean“Best”Model

1 parameter for each

observation

Model Complexity

(Number of Parameters)

Basics of Predictive Modeling:

Page 4: Vehicle Symbol Development

Background

© 2008 EMB. All rights reserved. Slide 4

Vehicle Symbol/Relativity Analysis:

Vehicle is a major risk driver and accounts for much rate variation

Two elements:

Symbols

Relativities

Historically, focus on relativities and not symbols

Symbols on limited data, anecdotal evidence, competitors, bureaus, and judgment

Liability symbols relatively recent phenomenon

Regular reviews of relativities with “tweaking” symbols

Page 5: Vehicle Symbol Development

Background

© 2008 EMB. All rights reserved. Slide 5

Significant Variation in Pricing by Vehicle:

Page 6: Vehicle Symbol Development

Background

© 2008 EMB. All rights reserved. Slide 6

Issues with Symbol Analysis:

High-dimensionality

Vehicle symbol requires a large number of small units as building blocks

Companies have limited data, so many building blocks have little or no claims experience

High correlation

Symbols tends to be highly correlated with other rating variables (e.g., age, credit, tier, etc)

Multivariate framework required to handle highly correlated variables

Page 7: Vehicle Symbol Development

Background

© 2008 EMB. All rights reserved. Slide 7

Vehicle Symbol/Relativity Analysis

Vehicle symbol analysis is a multi-stage process.

Initial Estimator

Vehicle Symbols

Vehicle Risk Estimator

Symbol Relativities

Standardized fitted value from multivariate model

By coverage or cause of loss.

Frequency and severity determined separately

Spatial corrections

Component estimators combined

Overall estimators clustered to form symbols

Relativities calculated for each symbol

Page 8: Vehicle Symbol Development

Initial Estimator:

Component models built using vehicle characteristics

© 2008 EMB. All rights reserved. Slide 8

Base Price

Curb Weight

EngineSize

ModelYear

AirbagFeatures

Theft Deterrents

Vehicle Estimator

Modeled Vehicle Signal

Vehicle Component Relativities

Vehicle characteristics are used as proxies for VIN

Use diagnostics from statistical routine to assess performance

Page 9: Vehicle Symbol Development

Issues with raw residual

VINs are very small building blocks with limited data

Unordered categorical data creates interpretation problems

Use ‘spatial smoothing’ concepts to investigate signals

Vehicle Estimator

© 2008 EMB. All rights reserved. Slide 9

Diagnostics:

Need to identify additional signal in residual that can be attributed to the vehicle dimension

Basic residual definition

∑∑

∈=

VINii

VINii

VIN Expected

ActualResidual

Page 10: Vehicle Symbol Development

© 2008 EMB. All rights reserved. Slide 10

Vehicle Estimator

Similar issue for boundary analysis solved via smoothing of results

Distance-based smoothing

Adjacency-based smoothingDistance Adjacency

Smoothing techniques can be applied to vehicle residuals to identify signal

Diagnostics:

High number of dimensions (e.g. make/model)

May by only a few observations for any one dimension

Page 11: Vehicle Symbol Development

“Neighbor” Vehicles

Once the “neighbors” are determined, the same techniques used for territorial analysis can be applied

Instead of using latitude/longitude to build adjacency relationships, use vehicle dimensions

© 2008 EMB. All rights reserved. Slide 11

Page 12: Vehicle Symbol Development

© 2008 EMB. All rights reserved. Slide 12

Vehicle Estimator

Diagnostics:

Smoothing and diagnostics used to identify signal in the vehicle residualQQ Plot: P-Values vs Uniform

0.0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1.0

1.1

0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 1.1

Uniform

P-Va

lues

P-Values Smoothing 1 (20)

P-Values Smoothing 2 (40)

P-Values Smoothing 3 (60)

P-Values Smoothing 4 (80)

P-Values Smoothing 5 (100)

P-Values Time Period Selector Node 1

Uniform Line

QQ plot on smoothed residual indicates potential signal

If no signal, then all estimated cdfs would be above uniform cdf

Smoothing 4 (80)

0.6

0.8

1.0

1.2

1.4

1.6

1.8

2.0

2.2

2.4

0

20,000

40,000

60,000

80,000

100,000

120,000

140,000

160,000

180,000

200,000

0 2 4 6 8 10 12 14 16 18 20 22

Smoothing 4 (80) Band

Nod

e

Wei

ght

Consistency tests further support signal identification

Both time and data consistency tests should be performed

Page 13: Vehicle Symbol Development

Vehicle Spatial Correction:

Residual signal used to adjust scores from multivariate model

© 2008 EMB. All rights reserved. Slide 13

Base Price

Curb Weight

EngineSize

ModelYear

AirbagFeatures

Theft Deterrents

Vehicle Estimator

Modeled Vehicle Signal

Vehicle Component Relativities

SmoothedResidual

Spatial CorrectionLow

$

High$

Page 14: Vehicle Symbol Development

Vehicle Symbols

© 2008 EMB. All rights reserved. Slide 14

Clustering Basics

Clustering used to produce groupings that are predictive of the future:

Minimize within-group heterogeneity.

Maximize cross-group heterogeneity.

Commonly-used clustering methods:

Quantiles

Equal Weight

Similarity Methods

- Average Linkage

- Centroid

Page 15: Vehicle Symbol Development

Vehicle Symbols

© 2008 EMB. All rights reserved. Slide 15

Clustering Methodologies:

Quantiles

Create groups with an equal number of observations

Equal Weight

Create groups which have an equal amount of weight

Similarity Methods:

Rank the data set by the statistic you wish to cluster

Decide on which pair of records are the ‘most similar’

Group these records

Repeat until left with the desired number of groups

Page 16: Vehicle Symbol Development

Vehicle Symbols

© 2008 EMB. All rights reserved. Slide 16

Similarity MethodsAverage Linkage

Difference between clusters is the average difference between pairs of observations, one in each clusterTends to join clusters with small variances

CentroidDifference between clusters is the difference between the mean values of the clusters squaredVINs allocated to more extreme groups

0.04

0.06

0.08

0.1

0.12

0.14

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Band

Stat

istic

0

0.05

0.1

0.15

0.2

0.25

Perc

enta

ge o

f Wei

ght

Percentageo f Weight

Statistic

0.04

0.06

0.08

0.1

0.12

0.14

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

Band

Stat

istic

0

0.02

0.04

0.06

0.08

0.1

0.12

0.14

0.16

Perc

enta

ge o

f Wei

ght

Percentageof Weight

Statistic

Average Linkage Centroid

Page 17: Vehicle Symbol Development

Clustering:

Vehicle estimator is clustered into new symbols

© 2008 EMB. All rights reserved. Slide 17

Base Price

Curb Weight

EngineSize

ModelYear

AirbagFeatures

Theft Deterrents

Vehicle Symbols

Modeled Vehicle Signal

Vehicle Component Relativities

SmoothedResidual

Spatial CorrectionLow

$

High$

1

1 71 81 92 0

1 4

1 5

1 6

4

5

1 0

1 1

1 2

1 3

6

7

8

9

2

3

Page 18: Vehicle Symbol Development

Symbol Relativities

© 2008 EMB. All rights reserved. Slide 18

Model fit using new symbol definitions instead of vehicle proxies

Test relativities using standard tests

Lift statistics used to judge explanatory power

Consistency tests used to judge predictive power

Refine symbols/relativities as appropriate

Incorporate rule-based restrictions

Apply actuarial knowledge

Investigate “neighbors” with very different relativities

Vehicle Symbols

Symbol Relativities

Page 19: Vehicle Symbol Development

Symbol Relativities

© 2008 EMB. All rights reserved. Slide 19

Source Data

ETL

Vehicle Proxy Model

Combine Components

Indications

Symbols

Component Symbol Model

Vehicle Residual Analysis

Page 20: Vehicle Symbol Development

Summary

© 2008 EMB. All rights reserved. Slide 20

Vehicle Symboling Overview

Accurate estimation of underlying risk associated with the vehicle

Step 1Obtain a separate estimator by claim type and by frequency and severity for each vehicle building block. Combine estimators, as appropriate.

BIFrequency

Severity

Estimator

Estimator

PDFrequency

Severity

Estimator

Estimator

CompFrequency

Severity

Estimator

Estimator

CollFrequency

Severity

Estimator

Estimator

BI Estimator

PD Estimator

Comp Estimator

Coll Estimator

Vehicle Symbols

Symbol Relativities

Step 2Cluster vehicle building blocks to develop symbols separately by coverage(s) and component

Step 3Determine by-coverage relativities for each vehicle group

Page 21: Vehicle Symbol Development

© 2008 EMB. All rights reserved. Slide 21

Summary

Technique has proven very successful based on proper hold-out sampling validation

Proven Results

US insurer

Canadian insurer

German insurance dataset

Page 22: Vehicle Symbol Development

Summary

© 2008 EMB. All rights reserved. Slide 22

Vehicle groupings are a major driver of risk, thus it is critical that companies review symbols and relativities regularly.

Issues exist that create special challenges with regards to symbol analysis.

High-dimensionality

Heavily correlated

Vehicle symbols require a range of different approaches and tools (as there are different loss drivers)

Diagnostics needed to ensure best model possible