Upload
bernice-parsons
View
217
Download
1
Tags:
Embed Size (px)
Citation preview
Confidence IntervalsConfidence Intervals
First ICFA Instrumentation School/WorkshopFirst ICFA Instrumentation School/WorkshopAt Morelia, Mexico, November 18-29, 2002At Morelia, Mexico, November 18-29, 2002
Harrison B. ProsperHarrison B. ProsperFlorida State UniversityFlorida State University
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
2
OutlineOutline
• Lecture 1 Lecture 1 – Introduction– Confidence Intervals - Frequency Interpretation– Poisson Example– Summary
• Lecture 2 Lecture 2 – Deductive and Inductive Reasoning– Confidence Intervals - Bayesian Interpretation– Poisson Example– Summary
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
3
IntroductionIntroduction
• We physicists often talk We physicists often talk about calculating about calculating “errors”, but what we “errors”, but what we really mean, of course, really mean, of course, isis– quantifying ourour
uncertainty
• A measurement is A measurement is notnot uncertain, but it has an uncertain, but it has an errorerror εε about which about which wewe are uncertain!are uncertain!
valuetrue
tmeasuremenˆ
ˆ
m
m
mm
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
4
Introduction - iIntroduction - i
• One way to quantify One way to quantify uncertainty is the uncertainty is the standard deviationstandard deviation or, or, even better, the even better, the root root mean square deviationmean square deviation of the distribution of of the distribution of measurements. measurements.
• In 1937 Jerzy Neyman In 1937 Jerzy Neyman invented another invented another measure of uncertainty measure of uncertainty called a called a confidence confidence intervalinterval. .
222
22
ˆ
ˆˆ
)ˆ(
mm
mm
mm
mm
mm
ˆ bias
ˆˆdev. std.
rms
22
2
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
5
Introduction - iiIntroduction - ii
• Consider the following questions Consider the following questions – What is the mass of the top quark?– What is the mass of the tau neutrino?– What is the mass of the Higgs boson?
• Here are possible answersHere are possible answers– mt = 174.3 ± 5.1 GeV – mν < 18.2 MeV– mH > 114.3 GeV
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
6
Introduction – iiiIntroduction – iii
• These answers are unsatisfactoryThese answers are unsatisfactory– because they do not specify how much
confidenceconfidence we should place in them.
• Here are better answersHere are better answers– mmtt = 174.3 = 174.3 ± ± 5.1 GeV, 5.1 GeV, with CL = with CL = 0.6830.683– mmνν < 18.2 MeV, < 18.2 MeV, with CL = with CL = 0.9500.950– mmHH > 114.3 GeV, > 114.3 GeV, with CL = with CL = 0.9500.950
CL = Confidence LevelConfidence Level
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
7
Introduction - ivIntroduction - iv
• Note that the statementsNote that the statements– mmtt = 174.3 = 174.3 ± ± 5.1 GeV, 5.1 GeV, CL = CL = 0.6830.683
– mmνν < 18.2 MeV, < 18.2 MeV, CL = CL = 0.9500.950
– mmHH > 114.3 GeV, > 114.3 GeV, CL = CL = 0.9500.950
are just an asymmetric way of writingare just an asymmetric way of writing– mmtt lies in lies in [169.2, 179.4][169.2, 179.4] GeV, CL =CL = 0.6830.683
– mmνν lies inlies in [0, 18.2][0, 18.2] MeV, CL =CL = 0.9500.950
– mmHH lies in lies in [114.3, [114.3, ∞)∞) GeV, CL =CL = 0.9500.950
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
8
Introduction - vIntroduction - v
• The goal of these lectures is to explain the The goal of these lectures is to explain the precise meaning of statements of the formprecise meaning of statements of the form
θθ lies in [LL, UU], with CL = ββ• L = lower limitL = lower limit• U = upper limitU = upper limit
• For exampleFor example
mmtt lies in [169.2169.2, 179.4179.4] GeV, with CL = 0.6830.683
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
9
What is a Confidence Level?What is a Confidence Level?
• A confidence level is a A confidence level is a probabilityprobability that that quantifies quantifies in some wayin some way the reliability of a the reliability of a given statementgiven statement
• But, what exactly is probability?But, what exactly is probability?
– BayesianBayesian: The degree of beliefdegree of belief in, or plausibility ofplausibility of, a statement (Bayes, Laplace, Jeffreys, Jaynes)
– FrequentistFrequentist: The relative frequencyrelative frequency with which something happens (Boole, Venn, Fisher, Neyman)
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
10
Probability: An ExampleProbability: An Example
• Consider the statementConsider the statement– SS = “It will rain in Morelia on Monday”
• And the probability assignmentAnd the probability assignment– Pr{SS} = 0.01
• Bayesian interpretationBayesian interpretation– The plausibility ofplausibility of the statement SS is 0.01
• Frequentist interpretationFrequentist interpretation– The relative frequencyrelative frequency with which it rains
on Mondays in Morelia is 0.01
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
11
Confidence Level: InterpretationConfidence Level: Interpretation
• Since probability can be interpreted in (at Since probability can be interpreted in (at least) two different ways, the interpretation of least) two different ways, the interpretation of statements such asstatements such as
– mmtt lies in [169.2169.2, 179.4179.4] GeV, with CL = 0.6830.683
depends on which interpretation of probability depends on which interpretation of probability is being used. is being used.
• A great deal of confusion arises in our field A great deal of confusion arises in our field because of our tendency to forget this factbecause of our tendency to forget this fact
Confidence Intervals Confidence Intervals Frequency InterpretationFrequency Interpretation
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
13
Confidence IntervalsConfidence Intervals
• The basic ideaThe basic idea– Imagine a set of ensemblesset of ensembles of experiments,
each member of which is associated with a fixed value of the parameter to be measured θθ (for example, the top quark mass).
– Each experiment EE, within an ensemble, yields an interval [l(E), u(E)][l(E), u(E)], which either contains or does not contain θθ.
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
14
Coverage Probability Coverage Probability
– For a given ensemble, the fraction of experiments with intervals containing the θθ value associated with that ensemble is called the coverage probabilitycoverage probability of the ensemble.
– In general, the coverage probability will vary from one ensemble to another.
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
15
ExampleExample
EE11 E3
EE55
E4
E2
EE11 E3
EE55
EE44
EE22
EE11
E3
E5
EE44
EE22
Ensemble with θ = θ1 with Pr = 0.4Pr = 0.4
Ensemble with θ = θ2 with Pr = 0.8Pr = 0.8
Ensemble with θ = θ3 with Pr = 0.6Pr = 0.6
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
16
– If our experiment is selected at random from the ensemble to which it belongs (presumably the one associated with the true value of θθ) then the probability that its interval [l(E), u(E)] [l(E), u(E)] contains θθ is equal to the coverage probability of that ensemble.
– The crucial point is thisThe crucial point is this: We try to construct the set of ensembles so that the coverage probability over the set is never less than some pre-specified value ββ, called the confidence levelconfidence level.
Confidence Level – Frequency InterpretationConfidence Level – Frequency Interpretation
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
17
Confidence Level - iiConfidence Level - ii
• Points to NotePoints to Note– In the frequency interpretation, the
confidence levelconfidence level is a property of the set of set of ensemblesensembles; In fact, it is the minimum coverage probability over the set.
– Consequently, if the set of ensembles is unspecified or unknown the confidence level is undefined.
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
18
AnyAny set of intervals
with a minimum coverage probability equal to ββ is a set of confidence intervalsconfidence intervals at 100 β % confidence levelconfidence level (CL). (Neyman,1937)
Confidence intervals are defined not by how theyare constructed, but by their frequency properties.
)](),([ EuEl
Confidence Intervals – Formal DefinitionConfidence Intervals – Formal Definition
E Experimentl(E) Lower limitu(E) Upper limit
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
19
Confidence Intervals: An ExampleConfidence Intervals: An Example
• Experiment:Experiment:– To measure the mean rate θθ of UHECRs
above 1020 eV per unit solid angle.
• Assume the probability of N events to Assume the probability of N events to be a given by a Poisson distributionbe a given by a Poisson distribution
dev. std.,,!
)|( nn
enP
n
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
20
Confidence Intervals – Example - iiConfidence Intervals – Example - ii
• Goal: Compute a set of intervals Goal: Compute a set of intervals
for for NN = 0, 1, 2, … = 0, 1, 2, … with with CL = 0.683CL = 0.683 for a for a set of ensembles, each member of set of ensembles, each member of which is characterized by a different which is characterized by a different mean event count mean event count θθ. .
)](),([ nunl
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
21
Why 68.3%?Why 68.3%?
• It is just a useful convention! It is just a useful convention! – It comes from the fact that for a Gaussian
distribution the confidence intervals given by [x-[x-σσ, x+, x+σσ]] are associated with a set of ensembles whose confidence level is 0.6830.683. (x x = measurement, σσ = std. dev.)
• The main reason for this convention is The main reason for this convention is the the Central Limit TheoremCentral Limit Theorem– Most sensible distributions become more
and more Gaussian as the data increase.
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
22
NCount
Par
amet
er s
pace
Confidence Interval – General AlgorithmConfidence Interval – General Algorithm
)(Nu
)(Nl
RL 1 L R
For each value θfind an interval in N with probabilitycontent of at least β
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
25
Example: Interval in N for Example: Interval in N for θθ = 10= 10
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 200
0.05
0.1
0.15Poisson Distribution (Mean = 10)
Count
Pro
babi
lity
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
26
Confidence Intervals – Specific AlgorithmsConfidence Intervals – Specific Algorithms
• Neyman Neyman – Region: fixed probabilities on either side
• Feldman – CousinsFeldman – Cousins– Region: containing largest likelihood ratios
P(n| P(n| θθ)/ P(n|n))/ P(n|n)
• Mode – CenteredMode – Centered– Region: containing largest probabilities
P(n| P(n| θθ))
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
27
Define
Nz
L zPNC )|()|(
Nz
R zPNC )|()|(
N
N
Left cumulative distribution functionLeft cumulative distribution function
Right cumulative distribution functionRight cumulative distribution function
Valid for both continuous and discrete distributions.
Neyman ConstructionNeyman Construction
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
28
Solve
LL uNC )|(
RR lNC )|(where
1 L R
N
L
N
R
Neyman Construction - iiNeyman Construction - ii
Remember: Left is UP and Right is LOW!
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
29
L R ( . ) /1 0 683 2Choose
n
iL uiP
0
)|(
1
0
)|(1)|(n
iniR liPliP
and solve
for the interval )](),([ nunl
Central Confidence IntervalsCentral Confidence Intervals
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
30
Central Confidence Intervals - ii Central Confidence Intervals - ii
!)|(
n
enP
n
Poisson Distribution
)(nl
)(nu
n0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
0
5
10
15
20
Upper LimitsLower Limits
Central Intervals - Poisson
Count
Par
amet
er
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
31
Comparison of Confidence IntervalsComparison of Confidence Intervals
θ
0 2 4 6 8 10 12 14 16 18 200
5
10
15
20Intervals - Poisson Distribution
Count
Par
amet
er
Central
Feldman-Cousins
Mode-Centered
N±√N
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
32
Comparison of Confidence Interval WidthsComparison of Confidence Interval Widths
Central
Feldman-Cousins
Mode-Centered
N±√N0 2 4 6 8 10 12 14 16 18 20
0
2
4
6
8
10
CentralFeldman-CousinsModeRoot(N)
Width of Intervals
Count
Inte
rval
Wid
th
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
33
Comparison of Coverage ProbabilitiesComparison of Coverage Probabilities
θ
Central
Feldman-Cousins
Mode-Centered
N±√N 0 5 10 15 200
0.2
0.4
0.6
0.8
1
CentralFeldman-CousinsModeRoot(N)68.3%
Coverage Probability
Poisson Parameter
Pro
babi
lity
November 21, 2002 First ICFA Instrumentation
School/Workshop Harrison B. Prosper
34
SummarySummary
• The interpretation of The interpretation of confidence intervalsconfidence intervals and and confidence levelsconfidence levels depends on which interpretation of depends on which interpretation of probability one is usingprobability one is using
• The The coverage probabilitycoverage probability of an ensemble of of an ensemble of experiments is the fraction of experiments that experiments is the fraction of experiments that produce intervals containing the value of the produce intervals containing the value of the parameter associated with that ensembleparameter associated with that ensemble
• The confidence level is the The confidence level is the minimumminimum coverage coverage probability over a set of ensembles.probability over a set of ensembles.
• The confidence level is The confidence level is undefinedundefined if the set of if the set of ensembles is unspecified or unknownensembles is unspecified or unknown