32
Confidence Intervals Confidence Intervals First ICFA Instrumentation First ICFA Instrumentation School/Workshop School/Workshop At Morelia, Mexico, November 18-29, 2002 At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Harrison B. Prosper Florida State University Florida State University

Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

Embed Size (px)

Citation preview

Page 1: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

Confidence IntervalsConfidence Intervals

First ICFA Instrumentation School/WorkshopFirst ICFA Instrumentation School/WorkshopAt Morelia, Mexico, November 18-29, 2002At Morelia, Mexico, November 18-29, 2002

Harrison B. ProsperHarrison B. ProsperFlorida State UniversityFlorida State University

Page 2: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

2

OutlineOutline

• Lecture 1 Lecture 1 – Introduction– Confidence Intervals - Frequency Interpretation– Poisson Example– Summary

• Lecture 2 Lecture 2 – Deductive and Inductive Reasoning– Confidence Intervals - Bayesian Interpretation– Poisson Example– Summary

Page 3: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

3

IntroductionIntroduction

• We physicists often talk We physicists often talk about calculating about calculating “errors”, but what we “errors”, but what we really mean, of course, really mean, of course, isis– quantifying ourour

uncertainty

• A measurement is A measurement is notnot uncertain, but it has an uncertain, but it has an errorerror εε about which about which wewe are uncertain!are uncertain!

valuetrue

tmeasuremenˆ

ˆ

m

m

mm

Page 4: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

4

Introduction - iIntroduction - i

• One way to quantify One way to quantify uncertainty is the uncertainty is the standard deviationstandard deviation or, or, even better, the even better, the root root mean square deviationmean square deviation of the distribution of of the distribution of measurements. measurements.

• In 1937 Jerzy Neyman In 1937 Jerzy Neyman invented another invented another measure of uncertainty measure of uncertainty called a called a confidence confidence intervalinterval. .

222

22

ˆ

ˆˆ

)ˆ(

mm

mm

mm

mm

mm

ˆ bias

ˆˆdev. std.

rms

22

2

Page 5: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

5

Introduction - iiIntroduction - ii

• Consider the following questions Consider the following questions – What is the mass of the top quark?– What is the mass of the tau neutrino?– What is the mass of the Higgs boson?

• Here are possible answersHere are possible answers– mt = 174.3 ± 5.1 GeV – mν < 18.2 MeV– mH > 114.3 GeV

Page 6: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

6

Introduction – iiiIntroduction – iii

• These answers are unsatisfactoryThese answers are unsatisfactory– because they do not specify how much

confidenceconfidence we should place in them.

• Here are better answersHere are better answers– mmtt = 174.3 = 174.3 ± ± 5.1 GeV, 5.1 GeV, with CL = with CL = 0.6830.683– mmνν < 18.2 MeV, < 18.2 MeV, with CL = with CL = 0.9500.950– mmHH > 114.3 GeV, > 114.3 GeV, with CL = with CL = 0.9500.950

CL = Confidence LevelConfidence Level

Page 7: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

7

Introduction - ivIntroduction - iv

• Note that the statementsNote that the statements– mmtt = 174.3 = 174.3 ± ± 5.1 GeV, 5.1 GeV, CL = CL = 0.6830.683

– mmνν < 18.2 MeV, < 18.2 MeV, CL = CL = 0.9500.950

– mmHH > 114.3 GeV, > 114.3 GeV, CL = CL = 0.9500.950

are just an asymmetric way of writingare just an asymmetric way of writing– mmtt lies in lies in [169.2, 179.4][169.2, 179.4] GeV, CL =CL = 0.6830.683

– mmνν lies inlies in [0, 18.2][0, 18.2] MeV, CL =CL = 0.9500.950

– mmHH lies in lies in [114.3, [114.3, ∞)∞) GeV, CL =CL = 0.9500.950

Page 8: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

8

Introduction - vIntroduction - v

• The goal of these lectures is to explain the The goal of these lectures is to explain the precise meaning of statements of the formprecise meaning of statements of the form

θθ lies in [LL, UU], with CL = ββ• L = lower limitL = lower limit• U = upper limitU = upper limit

• For exampleFor example

mmtt lies in [169.2169.2, 179.4179.4] GeV, with CL = 0.6830.683

Page 9: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

9

What is a Confidence Level?What is a Confidence Level?

• A confidence level is a A confidence level is a probabilityprobability that that quantifies quantifies in some wayin some way the reliability of a the reliability of a given statementgiven statement

• But, what exactly is probability?But, what exactly is probability?

– BayesianBayesian: The degree of beliefdegree of belief in, or plausibility ofplausibility of, a statement (Bayes, Laplace, Jeffreys, Jaynes)

– FrequentistFrequentist: The relative frequencyrelative frequency with which something happens (Boole, Venn, Fisher, Neyman)

Page 10: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

10

Probability: An ExampleProbability: An Example

• Consider the statementConsider the statement– SS = “It will rain in Morelia on Monday”

• And the probability assignmentAnd the probability assignment– Pr{SS} = 0.01

• Bayesian interpretationBayesian interpretation– The plausibility ofplausibility of the statement SS is 0.01

• Frequentist interpretationFrequentist interpretation– The relative frequencyrelative frequency with which it rains

on Mondays in Morelia is 0.01

Page 11: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

11

Confidence Level: InterpretationConfidence Level: Interpretation

• Since probability can be interpreted in (at Since probability can be interpreted in (at least) two different ways, the interpretation of least) two different ways, the interpretation of statements such asstatements such as

– mmtt lies in [169.2169.2, 179.4179.4] GeV, with CL = 0.6830.683

depends on which interpretation of probability depends on which interpretation of probability is being used. is being used.

• A great deal of confusion arises in our field A great deal of confusion arises in our field because of our tendency to forget this factbecause of our tendency to forget this fact

Page 12: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

Confidence Intervals Confidence Intervals Frequency InterpretationFrequency Interpretation

Page 13: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

13

Confidence IntervalsConfidence Intervals

• The basic ideaThe basic idea– Imagine a set of ensemblesset of ensembles of experiments,

each member of which is associated with a fixed value of the parameter to be measured θθ (for example, the top quark mass).

– Each experiment EE, within an ensemble, yields an interval [l(E), u(E)][l(E), u(E)], which either contains or does not contain θθ.

Page 14: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

14

Coverage Probability Coverage Probability

– For a given ensemble, the fraction of experiments with intervals containing the θθ value associated with that ensemble is called the coverage probabilitycoverage probability of the ensemble.

– In general, the coverage probability will vary from one ensemble to another.

Page 15: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

15

ExampleExample

EE11 E3

EE55

E4

E2

EE11 E3

EE55

EE44

EE22

EE11

E3

E5

EE44

EE22

Ensemble with θ = θ1 with Pr = 0.4Pr = 0.4

Ensemble with θ = θ2 with Pr = 0.8Pr = 0.8

Ensemble with θ = θ3 with Pr = 0.6Pr = 0.6

Page 16: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

16

– If our experiment is selected at random from the ensemble to which it belongs (presumably the one associated with the true value of θθ) then the probability that its interval [l(E), u(E)] [l(E), u(E)] contains θθ is equal to the coverage probability of that ensemble.

– The crucial point is thisThe crucial point is this: We try to construct the set of ensembles so that the coverage probability over the set is never less than some pre-specified value ββ, called the confidence levelconfidence level.

Confidence Level – Frequency InterpretationConfidence Level – Frequency Interpretation

Page 17: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

17

Confidence Level - iiConfidence Level - ii

• Points to NotePoints to Note– In the frequency interpretation, the

confidence levelconfidence level is a property of the set of set of ensemblesensembles; In fact, it is the minimum coverage probability over the set.

– Consequently, if the set of ensembles is unspecified or unknown the confidence level is undefined.

Page 18: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

18

AnyAny set of intervals

with a minimum coverage probability equal to ββ is a set of confidence intervalsconfidence intervals at 100 β % confidence levelconfidence level (CL). (Neyman,1937)

Confidence intervals are defined not by how theyare constructed, but by their frequency properties.

)](),([ EuEl

Confidence Intervals – Formal DefinitionConfidence Intervals – Formal Definition

E Experimentl(E) Lower limitu(E) Upper limit

Page 19: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

19

Confidence Intervals: An ExampleConfidence Intervals: An Example

• Experiment:Experiment:– To measure the mean rate θθ of UHECRs

above 1020 eV per unit solid angle.

• Assume the probability of N events to Assume the probability of N events to be a given by a Poisson distributionbe a given by a Poisson distribution

dev. std.,,!

)|( nn

enP

n

Page 20: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

20

Confidence Intervals – Example - iiConfidence Intervals – Example - ii

• Goal: Compute a set of intervals Goal: Compute a set of intervals

for for NN = 0, 1, 2, … = 0, 1, 2, … with with CL = 0.683CL = 0.683 for a for a set of ensembles, each member of set of ensembles, each member of which is characterized by a different which is characterized by a different mean event count mean event count θθ. .

)](),([ nunl

Page 21: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

21

Why 68.3%?Why 68.3%?

• It is just a useful convention! It is just a useful convention! – It comes from the fact that for a Gaussian

distribution the confidence intervals given by [x-[x-σσ, x+, x+σσ]] are associated with a set of ensembles whose confidence level is 0.6830.683. (x x = measurement, σσ = std. dev.)

• The main reason for this convention is The main reason for this convention is the the Central Limit TheoremCentral Limit Theorem– Most sensible distributions become more

and more Gaussian as the data increase.

Page 22: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

22

NCount

Par

amet

er s

pace

Confidence Interval – General AlgorithmConfidence Interval – General Algorithm

)(Nu

)(Nl

RL 1 L R

For each value θfind an interval in N with probabilitycontent of at least β

Page 23: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

25

Example: Interval in N for Example: Interval in N for θθ = 10= 10

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 200

0.05

0.1

0.15Poisson Distribution (Mean = 10)

Count

Pro

babi

lity

Page 24: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

26

Confidence Intervals – Specific AlgorithmsConfidence Intervals – Specific Algorithms

• Neyman Neyman – Region: fixed probabilities on either side

• Feldman – CousinsFeldman – Cousins– Region: containing largest likelihood ratios

P(n| P(n| θθ)/ P(n|n))/ P(n|n)

• Mode – CenteredMode – Centered– Region: containing largest probabilities

P(n| P(n| θθ))

Page 25: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

27

Define

Nz

L zPNC )|()|(

Nz

R zPNC )|()|(

N

N

Left cumulative distribution functionLeft cumulative distribution function

Right cumulative distribution functionRight cumulative distribution function

Valid for both continuous and discrete distributions.

Neyman ConstructionNeyman Construction

Page 26: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

28

Solve

LL uNC )|(

RR lNC )|(where

1 L R

N

L

N

R

Neyman Construction - iiNeyman Construction - ii

Remember: Left is UP and Right is LOW!

Page 27: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

29

L R ( . ) /1 0 683 2Choose

n

iL uiP

0

)|(

1

0

)|(1)|(n

iniR liPliP

and solve

for the interval )](),([ nunl

Central Confidence IntervalsCentral Confidence Intervals

Page 28: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

30

Central Confidence Intervals - ii Central Confidence Intervals - ii

!)|(

n

enP

n

Poisson Distribution

)(nl

)(nu

n0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

0

5

10

15

20

Upper LimitsLower Limits

Central Intervals - Poisson

Count

Par

amet

er

Page 29: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

31

Comparison of Confidence IntervalsComparison of Confidence Intervals

θ

0 2 4 6 8 10 12 14 16 18 200

5

10

15

20Intervals - Poisson Distribution

Count

Par

amet

er

Central

Feldman-Cousins

Mode-Centered

N±√N

Page 30: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

32

Comparison of Confidence Interval WidthsComparison of Confidence Interval Widths

Central

Feldman-Cousins

Mode-Centered

N±√N0 2 4 6 8 10 12 14 16 18 20

0

2

4

6

8

10

CentralFeldman-CousinsModeRoot(N)

Width of Intervals

Count

Inte

rval

Wid

th

Page 31: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

33

Comparison of Coverage ProbabilitiesComparison of Coverage Probabilities

θ

Central

Feldman-Cousins

Mode-Centered

N±√N 0 5 10 15 200

0.2

0.4

0.6

0.8

1

CentralFeldman-CousinsModeRoot(N)68.3%

Coverage Probability

Poisson Parameter

Pro

babi

lity

Page 32: Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University

November 21, 2002 First ICFA Instrumentation

School/Workshop Harrison B. Prosper

34

SummarySummary

• The interpretation of The interpretation of confidence intervalsconfidence intervals and and confidence levelsconfidence levels depends on which interpretation of depends on which interpretation of probability one is usingprobability one is using

• The The coverage probabilitycoverage probability of an ensemble of of an ensemble of experiments is the fraction of experiments that experiments is the fraction of experiments that produce intervals containing the value of the produce intervals containing the value of the parameter associated with that ensembleparameter associated with that ensemble

• The confidence level is the The confidence level is the minimumminimum coverage coverage probability over a set of ensembles.probability over a set of ensembles.

• The confidence level is The confidence level is undefinedundefined if the set of if the set of ensembles is unspecified or unknownensembles is unspecified or unknown