36
1 Abstract Claims coming from human medical observational studies, when tested rigorously, most often fail to replicate. Whereas randomized clinical trials replicate over 80% of the time, medical observational studies replicate only 10 to 20% of the time. Multiple re-test studies reported JAMA failed to replicate. For example in the early 1990s, Vitamin E was reported to protect against heart attacks. Large, well-conducted randomized clinical trials did not replicate this claim. The claim that Type A Personality leads to heart attacks failed to replicate in two separate studies, yet the myth still lives. Clearly, there are systematic problems with how observational studies are conducted and analyzed that need to be identified and fixed. Edwards Deming, the most famous quality expert ever, says that any problem with a failed process is not the fault of the workers, scientists conducting observational studies, but of management. Funding agencies and journal editors need to fix a clearly broken process. Technical problems are identified. Tough management solution are proposed. A simple statistical analysis strategy is presented. Many human health problems can only be examined using observational data. Our proposals, technical and managerial, should lead to more reliable claims along with fair ways to judge their reliability. NISS

02 young vpi lecture 2014

Embed Size (px)

DESCRIPTION

Stan Young, PhD,. slides from Phil 6334 guest presentation.

Citation preview

Page 1: 02 young vpi lecture 2014

1

Abstract

Claims coming from human medical observational studies, when tested rigorously, most often fail to replicate. Whereas randomized clinical trials replicate over 80% of the time, medical observational studies replicate only 10 to 20% of the time. Multiple re-test studies reported JAMA failed to replicate. For example in the early 1990s, Vitamin E was reported to protect against heart attacks. Large, well-conducted randomized clinical trials did not replicate this claim. The claim that Type A Personality leads to heart attacks failed to replicate in two separate studies, yet the myth still lives. Clearly, there are systematic problems with how observational studies are conducted and analyzed that need to be identified and fixed. Edwards Deming, the most famous quality expert ever, says that any problem with a failed process is not the fault of the workers, scientists conducting observational studies, but of management. Funding agencies and journal editors need to fix a clearly broken process. Technical problems are identified. Tough management solution are proposed. A simple statistical analysis strategy is presented. Many human health problems can only be examined using observational data. Our proposals, technical and managerial, should lead to more reliable claims along with fair ways to judge their reliability.

NISS

Page 2: 02 young vpi lecture 2014

22

Contact Information

Stan YoungNational Institute of Statistical Scienceswww.niss.org [email protected] 685 9328

NISS

Page 3: 02 young vpi lecture 2014

NISS 3

Hayek, 1974, Nobel Lecture

It is often difficult enough for the expert, and certainly in many instances impossible for the layman, to distinguish between

legitimate and illegitimate claims advanced in the name of science.

It is often difficult enough for the expert, and certainly in many instances impossible for the layman, to distinguish between

legitimate and illegitimate claims advanced in the name of science.

Page 4: 02 young vpi lecture 2014

NISS 4

Hayek (2)

...much effort will have to be directed toward debunking

such arrogations*, some of which have by now become

the vested interests of established university departments.

*Claims without proper foundation

Page 5: 02 young vpi lecture 2014

5

Reliability of Literature Claims

S. Stanley Young

National Institute of Statistical [email protected], 919 685 9328

VIP Lecture

NISS

Page 6: 02 young vpi lecture 2014

66

Science point of view

What is the meaning of life?

What is real?

What is reproducible?

Fooled (fooling) by randomness?

NISS

Page 7: 02 young vpi lecture 2014

77

The Players

1. The workers – scientists2. The communicators –

a. PR peopleb. Bloggers c. Reporters d. Science writers

3. The consumers – public, regulatory agencies, trial lawyers

4. The management – funding agencies, journal editors

NISS

Page 8: 02 young vpi lecture 2014

88

The Worker is not the Problem.

W. Edwards Deming,

the most visionary innovator ever on quality control, said

The worker is not the problem. The problem is at the top! Management!

To Deming, blaming the workers—individual researchers— is as incorrect as it is useless.

Bringing the system under control is the responsibility of those managing it.

NISS

Page 9: 02 young vpi lecture 2014

9

Problems with observational studies “Everything is dangerous”

1. Data staging2. No written analysis protocol3. Multiple testing4. Multiple modeling5. Uncorrected bias6. Self-serving paper writing 7. Self-serving press release8. Actually believe the claims

9NISS

Page 10: 02 young vpi lecture 2014

Assertion : Every study is positive

Data Staging

Bias

Multiple testing

Multiple model searching

Any or all will lead to essentially all observational studies being positive!

10NISS

Page 11: 02 young vpi lecture 2014

11

First, data staging

Stan:

Why do you think data staging is a big issue?

Because it can be done in myriad ways, is rarely documented, and is usually not reproducible?

David Madigan

11NISS

Page 12: 02 young vpi lecture 2014

12

Multiple Testing: P-value, t-test

Population, real or theoretical

Two samples,random

NISS

Page 13: 02 young vpi lecture 2014

10-sided dice experiment

12/25/12 NISS

Page 14: 02 young vpi lecture 2014

14

How do you get a “p < 0.05”? Answer: Ask lots of questions.

61 questions95% chance of a positive study!

NISS

Page 15: 02 young vpi lecture 2014

1515

Let’s run an epidemiology study! 10-sided dice simulation: Coffee causes X.

NISS

Page 16: 02 young vpi lecture 2014

1616

P-value plot – 60 p-values.

NISS

Page 17: 02 young vpi lecture 2014

17

Cereal determines human gender Really?????

17NISS

Page 18: 02 young vpi lecture 2014

NISS

Page 19: 02 young vpi lecture 2014

19

2 Cancer types, 48 pesticies, 96 questions

Three claims made, only one appears valid.

Page 20: 02 young vpi lecture 2014

NISS

Multiple Modeling/Bias

Take a simple difference

Page 21: 02 young vpi lecture 2014

Paper, data, claim

American Cancer Society Cancer Prevention Study II

No association with CV deaths, corrected for PM2.5.

Ozone associated with respiratory deaths.21

Page 22: 02 young vpi lecture 2014

Jerrett et al. Large Search Space

22

Page 23: 02 young vpi lecture 2014

Covariate Adjustment

27 = 128

23

Page 24: 02 young vpi lecture 2014

Large search space

32 x 128 = 4,096

The data used in this paper is not available.

We are asked to trust that analysis decisions were good and claims are robust.

Any adjustment for multiple testing and/or multiple modeling renders p-values NS.

24

Page 25: 02 young vpi lecture 2014

2525

Crisis in science? 2011, 2012

Nature, 2012

Significance, 2011

NISS

Page 26: 02 young vpi lecture 2014

26

Claims from observational studies tested in RCTs

Page 27: 02 young vpi lecture 2014

27

What can funding agencies do?

Fund data generation and analysis separately.

Fund replication studies.

Require data used in publication be posted on publication.

Page 28: 02 young vpi lecture 2014

28

What can journal editors do?

Quality by inspection, p-value < 0.05, is not working. (Many workers are gaming the system.)

Management needs to re-design the system to build quality into the product.

Papers following good manufacturing procedures and addressing important questions, should be accepted without regard to statistical significance.

Require data used in publication be posted on publication.

Page 29: 02 young vpi lecture 2014

29

What can you, the consumer, do? (not much)

1. Be skeptical of observational study claims.2. Read the actual paper.3. Count the claims under consideration.4. Ask for the data set.5. Letter to editor : voodoo stats and trust me

science. (Educate editors.)6. Write to funding agency.7. Write to congressman.

29NISS

Page 30: 02 young vpi lecture 2014

30

“New” p-value plot, - log10 (p-value), Dmitri Zaykin

Page 31: 02 young vpi lecture 2014

31

Conclusions

Most science claims do not replicate.

Deming: Don't blame the worker (or expect them to adopt different methods).

Funding agencies and journal editors have been AWOL.

Require data to be placed in depository on publication.

Page 32: 02 young vpi lecture 2014

32

One irate study evaluator, 2012

Mens Sana Monograph, 2012

Page 33: 02 young vpi lecture 2014

3333

Contact Information

Stan YoungNational Institute of Statistical Scienceswww.niss.org [email protected] 685 9328

NISS

Page 34: 02 young vpi lecture 2014

Ozone/PM2.5 Acute Deaths LA

34

Page 35: 02 young vpi lecture 2014

3535

Suggestions for effective management of observational studies

No funding / publication without:

1. Public posting protocol before study initiation.

2. Public posting of data set on publication.

3. Clear statement of questions under consideration.

4. Conform to “Reproducible Research” guidelines.

5. Any claims must be independently replicated.

NISS

Page 36: 02 young vpi lecture 2014

3636

Congressional Management:True Science Transparency Act

Any federal agency proposing rule-making or legislation shall specifically name each document used to support the proposed rule-making or legislation and provide all data used in said document for viewing by the public.

See also OSTP memorandum, 22Feb2013.

NISS