19
Local Control Page 1 of 19 Analysis of Observational Health Care Data Using SAS Douglas Faries, Robert L. Obenchain, Josep Maria Haro and Andrew C. Leon Carey, NC: SAS Press Anticipated publication date: January 2010 Outline of Contents of Chapter 7 The Local Control Approach using JMP Robert L. Obenchain, PhD, FASA Principal Consultant, Risk Benefit Statistics LLC, Carmel, Indiana Adjunct Professor, Biostatistics, IU Medical School, Indianapolis Abstract The local control approach to adjustment for treatment selection bias and confounding in observational studies is illustrated here using JMP because local control is best implemented and applied in highly visual ways. The local control approach is also unique because it hierarchically clusters patients in baseline covariate x-space; applies simple nested analysis of variance (ANOVA) models (treatment within cluster); and ends up being highly flexible, non-parametric, and robust. Although the local control approach is classical rather than Bayesian, its primary output is a full distribution of local treatment differences (LTDs) that contains all potential information relevant to patient differential responses to treatment. All concepts are illustrated using freely distributable data on 10,000 patients that were generated to be like those from a published cardiovascular registry containing 996 patients. 7.1 Introduction 7.1.1 Fundamental Problems with Randomization in Human Studies 7.1.2 Fundamental Local Control Concepts 7.1.3 Statistical Methods most useful in Local Control 7.1.4 Contents of the remaining Sections of Chapter 7

Analysis of Observational Health Care Data Using SASlocalcontrolstatistics.org/other/LCdraft.pdfAnalysis of Observational Health Care Data Using SAS Douglas Faries, Robert L. Obenchain,

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Local Control Page 1 of 19

Analysis of Observational Health Care Data Using SAS Douglas Faries, Robert L. Obenchain, Josep Maria Haro and Andrew C. Leon

Carey, NC: SAS Press Anticipated publication date: January 2010

Outline of Contents of Chapter 7

The Local Control Approach using JMP Robert L. Obenchain, PhD, FASA

Principal Consultant, Risk Benefit Statistics LLC, Carmel, Indiana Adjunct Professor, Biostatistics, IU Medical School, Indianapolis

Abstract

The local control approach to adjustment for treatment selection bias and confounding in observational studies is illustrated here using JMP because local control is best implemented and applied in highly visual ways. The local control approach is also unique because it hierarchically clusters patients in baseline covariate x-space; applies simple nested analysis of variance (ANOVA) models (treatment within cluster); and ends up being highly flexible, non-parametric, and robust. Although the local control approach is classical rather than Bayesian, its primary output is a full distribution of local treatment differences (LTDs) that contains all potential information relevant to patient differential responses to treatment. All concepts are illustrated using freely distributable data on 10,000 patients that were generated to be like those from a published cardiovascular registry containing 996 patients.

7.1 Introduction

7.1.1 Fundamental Problems with Randomization in Human Studies 7.1.2 Fundamental Local Control Concepts 7.1.3 Statistical Methods most useful in Local Control 7.1.4 Contents of the remaining Sections of Chapter 7

Local Control Page 2 of 19

7.2 Some Traditional Analyses of Hypothetical Patient Registry Data

7.2.1 Introduction to the LSIM10K Data Set 7.2.2 Analyses of Mortality Rates and Costs using Covariate Adjustment 7.2.3 Analyses of Mortality Rates Using Estimated Propensity Score Deciles 7.2.4 Analyses of Mortality Rates and Costs using Inverse Probability Weighting

7.3 The Four Phases of a Local Control Analysis

7.3.1 Introduction to the Four Tactical Phases of a Local Control Analysis 7.3.2 Phase One: Revealing Bias in Global Estimates by Making Local Comparisons 7.3.3 Phase Two: Determining Whether an LTD Distribution is Salient 7.3.4 Phase Three: Performing Systematic Sensitivity Analyses

7.3.4.1 Alternative Clusterings that Readers Can Try on Their Own 7.3.4.2 Review of Clustering Concepts Useful in Sensitivity Analysis

7.3.5 Phase Four: Identifying Baseline Patient Characteristics Predictive of Differential Treatment Response

7.4 Conclusions

Acknowledgments

References Appendix: Propensity Scores and Blocking/Balancing Scores

A.1 Review of the Fundamental Theorem of Propensity Scoring A.2 Practical Problems with Estimated Propensity Scores A.3 Range of Blocking/Balancing Scores: From Most Fine to Most Coarse A.4 Cluster Membership is an Asymptotic Blocking/Balancing Score

Local Control Page 3 of 19

Selected Chapter 7 Figures in Color (In the SAS Press publication, all figures are in Gray Scale.)

Figure 7.1 Marginal and Within Propensity-Score-Decile Distributions of ejfract by treatment (0 or 1)

20

30

40

50

60

70

ejfr

act

0 1

trtm

20

30

40

50

60

70

ejfr

act

10 11 20 21 30 31 40 41 50 51 60 61 70 71 80 81 90 91 100

101

PSdec By Tr t

Local Control Page 4 of 19

Figure 7.2 Estimated within Decile propensity score Means vs. Observed Fractions of Patients Treated

Local Fraction Treated = 0.0101 + 1.0223 * Mean Estimated PS within Decile

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Trt

Fra

ctio

n

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8

MeanPS

Local Control Page 5 of 19

Figure 7.3. Numbers of Treated and Untreated Patients by Propensity Score Decile

Figure 7.4 Detailed Histograms of Estimated Propensity Scores

0

1 0 0

2 0 0

3 0 0

4 0 0

5 0 0

6 0 0

7 0 0

8 0 0

9 0 0

1 0 0 0

Cou

nt

1 2 3 4 5 6 7 8 9 1 0

PS d e c ile

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Untreated

Treated

= 0.397

= 0.521

Local Control Page 6 of 19

Figure 7.5 LTDs for 6-month Mortality (Treated minus Untreated) by PScore Decile

LTD Main Effect

= .03389

Figure 7.6 IPWs for Treated or Untreated Patients derived from PS Estimates

-0.1200

-0.1000

-0.0800

-0.0600

-0.0400

-0.0200

0.0000

0.0200

LTD

mor

t

0 1 2 3 4 5 6 7 8 9 10 11

PSdec ile

2.0

4.0

6.0

8.0

10.0

IPW

0 1

trtm

Local Control Page 7 of 19

Figure 7.7 IPW versus Unweighted Predictions of 6-Month Risk of Mortality

Treated or Untreated

RiskCA = 0.00259 + 0.8421*RiskIPW

0.00

0.10

0.20

0.30

Ris

kCA

0.0 0.1 0.2 0.3

RiskIPW

Local Control Page 8 of 19

Figure 7.8 Customized JMP Analyze Menu

Figure 7.9 Open Data File Dialog Box with the LSIM10K Data Set highlighted

Local Control Page 9 of 19

Figure 7.10 JMP Select Columns Dialog Box for Local Control

Figure 7.4 Vertical Cuts of the JMP Clustering Dendrogram Produce a Requested Number of Clusters from the Hierarchy

100Clusters

50Clusters

2Clusters

25Clusters

Local Control Page 10 of 19

Figure 7.6 LC Unbiasing TRACE for the LTD Main Effect in the First Outcome

First Outcome: Y1 = Mortality within Six Months Y1 = Across Cluster Average LTD Outcome

Y1 plus Two Sigma = Upper Limit for Average LTD Y1 minus Two Sigma = Lower Limit for Average LTD

NCreq = Number of Clusters Requested

Table showing the across cluster summary statistics displayed in Figure 7.6

NCreq NCinfo Y1 LTD MAIN

Y1 Local Std Err

Y1 Lower Limit

Y1 Upper Limit

1 1 0.0250 0.00313 0.0313 0.0188 5 5 -0.0270 0.00319 -0.0333 -0.0206 10 10 -0.0306 0.00326 -0.0371 -0.0241 20 20 -0.0315 0.00338 -0.0383 -0.0248 50 50 -0.0351 0.00340 -0.0419 -0.0283 100 100 -0.0372 0.00346 -0.0441 -0.0302 200 199 -0.0393 0.00351 -0.0463 -0.0322 500 492 -0.0398 0.00360 -0.0470 -0.0326 700 660 -0.0383 0.00363 -0.0456 -0.0311 900 816 -0.0365 0.00363 -0.0438 -0.0293

NCreg = Number of Clusters Requested

NCinfo = Number of Informative Clusters Found

-0.05

-0.045

-0.04

-0.035

-0.03

-0.025

-0.02

-0.015

Y

1 10

532

100503020

100050

0

300

200

NCreq

Local Control Page 11 of 19

Figure 7.7 LC Unbiasing TRACE for the LTD Main Effect in the Second Outcome

Second Outcome: Six Month Cumulative Cardiac Related Cost Y2 = Across Cluster Average LTD Outcome

Y2 plus Two Sigma = Upper Limit for Average LTD Y2 minus Two Sigma = Lower Limit for Average LTD

NCreq = Number of Clusters Requested

Table showing the across cluster summary statistics displayed in Figure 7.7

NCreq

NCinfo Y2 LTD

MAIN Y2 Loc Std Err

Y2 Low Limit

Y2 Upr Limit

1 1 255.08 206.21 -157.33 667.50 5 5 664.94 208.99 246.96 1082.92 10 10 698.80 214.19 270.42 1127.19 20 20 341.77 220.48 -99.19 782.74 50 50 -71.92 214.84 -501.59 357.75 100 100 -73.59 212.71 -499.02 351.83 200 199 -24.28 209.32 -442.93 394.37 500 492 127.11 184.59 -242.08 496.30 700 660 248.93 183.97 -119.01 616.87 900 816 312.96 181.35 -49.74 675.66

All results expressed in 1998 US Dollars ($)

-600

-400

-200

0

200

400

600

800

Y

1 10

532

100503020

100050

0

300

200

NCreq

Local Control Page 12 of 19

Figure 7.8 LTD Graphics and Summary Stats for 816 Informative Clusters

LTD Distribution for Y1 = mort6mo LTD Distribution for Y2 = cardcost Mean LTD 0.0385 Mean LTD 168.74 Std Dev 0.14882 Std Dev 4962.92 Std Err Mean 0.00150 Std Err Mean 50.09 Upper 95% Mean 0.0356 Upper 95% Mean 266.93 Lower 95% Mean 0.0415 Lower 95% Mean 70.55 Inform Clust Pats 9816 Inform Clust Pats 9816 Nonparametric Density Nonparametric Density Kernel Std 0.02131 Kernel Std 710.55

Note also that neither of the above LTD Distributions is very much like the “best fitting” Normal distributions displayed in RED on the above histograms and probability plots.

-4

-3

-2

-1

0

1

2

3

4

Nor

mal

Qua

ntile

Plo

t

-1

-0.8

-0.6

-0.4

-0.2 0

0.2

0.4

0.6

0.8 1

-4

-3

-2

-1

0

1

2

3

4

Nor

mal

Qua

ntile

Plo

t

-400

00

-300

00

-200

00

-100

00 0

1000

0

2000

0

3000

0

4000

0

5000

0

6000

0

Local Control Page 13 of 19

Figure 7.9 Bubble Plot of LTD Scatter for 816 Informative Clusters

Bubble Area = Total Number of Patients within Cluster

Local Control Page 14 of 19

Figure 7.10 Artificial LTD Distribution Dialog Box

Local Control Page 15 of 19

Figure 7.11 JMP Side-by-Side Comparison using the Spread Option of the Artificial and Observed LTD Distributions

-1.0

-0.8

-0.6

-0.4

-0.2

0.0

0.2

0.4

0.6

0.8

1.0

LTD

Artificial Observed

LTDtype

Local Control Page 16 of 19

Figure 7.14 JMP Multivariable Model for Predicting mort6mo LTDs from Seven Baseline Patient x-Characteristics

Summary of Fit RSquare 0.144396RSquare Adj 0.141948Root Mean Square Error 0.188451

Analysis of Variance

Source DF Sum of Squares

Mean Square

F Ratio

Model 28 58.65786 2.09492 58.9892

Error 9787 347.57256 0.03551 Prob > F

C. Total 9815 406.23042 <.0001

-1.0

-0.7

-0.4-0.2

0.1

0.4

0.7

1.0

Y1L

TD

Act

ual

-1.0 -0.7 -0.4 -0.1 0.2 0.4 0.6 0.8 1.0

Y1LTD Predicted P<.0001

RSq=0.14 RMSE=0.1885

Local Control Page 17 of 19

Figure A.1. The x-space geometry of linear functionals from logistic regression

Figure A.2 The x-space geometry of small, compact, and numerous clusters that intersect a thin slab of rounded propensity score estimates from logistic regression

Constant PS Estimate Calipersfrom Logistic Regression

x2

x3

x1

LinearLinearFunctionalFunctional

3-D Slab

UnsupervisedNo PS Estimates Needed

x2

x3

x1

3-D Clusters (Informative orUninformative)

Local Control Page 18 of 19

Table 7.1. Nested ANOVA table with effects for treatment within cluster

Source

Degrees of Freedom

Interpretation

Clusters

(Subgroups)

K = Number of Clusters

Cluster Means are Local Average

Treatment Effects (LATEs) when Xs are Instrumental Variables (IVs)

Treatment within Cluster

I = Number of Informative Clusters K

Local Treatment Differences (LTDs)

are of interest when X Variables either are or are not IVs.

Error

Number of

Patients K I

Outcome Uncertainty

and/or Model Lack of Fit

Table 7.3. Five general approaches to analysis of observational data

Covariate Adjustment (CA) using Multivariable Models

History: Generalization of ANOVA and regression models. Advantages: Ubiquitous; widely taught and well accepted; implemented in all statistical analysis packages. Disadvantages: Essentially ignores imbalance (for example, always uses all available data); global, parametric models are difficult to visualize and thus may be unrealistic; results are frustratingly sensitive to model specification details; p-values can be small simply due to large sample sizes.

Inverse Probability Weighting (IPW)

History: Heuristic modification of CA somewhat similar to Horvitz-Thompson adjustment in sample surveys. Advantages: As easy to perform as CA; does attempt to adjust for local variation in treatment selection fraction (imbalance); requires software for (diagonally) weighted regression. Disadvantages: Basically the same as CA; IPW focuses on up-weighting rarely observed outcomes (never really ignores observations from uninformative clusters); basic variance assumptions somewhat counterintuitive (least frequently observed outcomes are treated as being more precise)

Local Control Page 19 of 19

Propensity Score (PS) Matching and Subgrouping

History: Fundamental PS theory has attracted more and more attention over the last 25 some years; motivates use of traditional matching and subclassifying approaches. Advantages: Intuitive, weak assumptions; widely applicable; many results easily displayed using histograms. Disadvantages: Results may be less precise than they appear (are reported) to be; no built-in sensitivity analyses; not a standard method implemented in current commercial statistical software.

Instrumental Variable (IV) Methods

History: Adding IV variable(s) to structural equation models can identify causal effects when the given x-covariates are correlated with model error terms due to endogenous effects, omitted covariates, or errors in variables. Newest IV approaches use patient clustering and nonparametric, local PS estimates. Advantages: Near the top of the theoretical pecking order. Disadvantages: IV assumption is very strong and not testable; implemented only in some commercial statistical packages.

Local Control Methods using Patient x-clusterings (Unsupervised Learning)

History: Generalization of nested ANOVA (treatment within cluster) and hierarchical models. Advantages: Intuitive, weak assumptions; widely applicable; guaranteed asymptotic balancing scores finer than propensity scores; inferences based upon bootstrap confidence or tolerance intervals; built-in sensitivity analyses. Disadvantages: Quite new; completely different focus from that of traditional parametric models; not a standard method implemented in current statistical software packages.