Upload
muhammad-nauman-usmani
View
219
Download
0
Embed Size (px)
Citation preview
8/13/2019 Confidence Level
1/31
CONFIDENCE LEVELS
We warn the reader that there is no universal convention for theterm confidence level
(The Review of Particles Properties, 1986)
1
8/13/2019 Confidence Level
2/31
Confidence levels
Part of descriptive statistics
Goal of an experiment: measure a theoretical parameter a
Quoting the result usually involves giving some interval [a,b]:
! Expresses probability that the true value is in this interval
! Allows information consumer to draw conclusions from the
result
! Set upper / lower limit on the true value of a parameter
2
8/13/2019 Confidence Level
3/31
Confidence level definition
Let some measured quantity bedistributed according to some p.d.f.P(x), we can determine the probability
that x lies within some interval, withsome confidence C
Prob(x x x+) =
x+
x
P(x)dx= C
We say:x lies in the interval [x- , x+] with confidenceC
Note: C is a probability according to the frequency limit3
8/13/2019 Confidence Level
4/31
Gaussian confidence intervals
If P(x) = Gaussian distribution with mean "and variance #2:some examples of confidence intervals:
x = 1 C= 68%x = 2 C = 95.4%
x = 1.64 C= 90%
x = 1.96 C= 95%
4
8/13/2019 Confidence Level
5/31
Types of confidence intervals
3 conventional ways to choose an interval around the center:
1. Symmetric interval: x-and x+equidistant from the mean2. Shortest interval: minimizes (x+- x-)
3. Central interval:
x
P(x) dx=
+x+
P(x) dx=1C
2
Prob(x x x+) =
x+
x
P(x)dx= C
For Gaussian (and any symmetric distribution):3 definitions are equivalent
5
8/13/2019 Confidence Level
6/31
One-tailed
confidence intervals
So far, we considered two-tailed intervals.
Useful as well: one-tailed limits
! Upper limit: x lies below x+at confidence level C:
! Lower Limit: x lies above x-at confidence level C:
x+
P(x) dx= C
+x
P(x) dx= C
6
8/13/2019 Confidence Level
7/31
Confidence intervals
in estimationIn a measurement two things involved:
! Physical parameter(s) X: mass, lifetime, ...
! Measurement of this parameter x
Given X,there is a p.d.f. for measuring x(resolution, QM,...)
But what you want to know:
! Given measurement x!x, what can I say about X ?
7
8/13/2019 Confidence Level
8/31
Can I say that
X lies within [x-!x, x+!x] with 68% probability?
Not in the sense of a frequency:X is not a random variable!!!
8
8/13/2019 Confidence Level
9/31
8/13/2019 Confidence Level
10/31
Possible experimental valuesx
parameter
! x2!!"# !
2!x"
x1!!"# !
1!x"
x1!!
0" x2!!0"
D(")
!0
Confidence belt Construction
10
Neyman Construction:
1. For each $find D($) with
probability C
2. Confidence interval includes
all $with observation at x0
NOTE: this is not a statementabout the probability of "butabout the interval!
8/13/2019 Confidence Level
11/31
Lower / Upper limits using the
confidence beltGiven measurement x0find X-and X+from confidence belt:
X+ upper limit at C.L. 1-%:
i.e. if X &X+: Probability to measure x 'x0is less than %
X-lower limit at C.L. 1-%:
i.e. if X ' X-: Probability to measure x &x0is less than %
11
+x0
P(x|X+) dx= 1
x0
P(x|X) dx= 1
8/13/2019 Confidence Level
12/31
12
8/13/2019 Confidence Level
13/31
Gaussian confidence levels
P(x|X): Gaussian with standard deviation #
Apply method to determine a 90% C.L. interval for X given ameasurement x0:
Equation for X-: requires that x0lies some number of standard deviations
above X-, which is the same as saying that X-lies the same number of #
below x0
(k: depends on the desired C.L.)
Confidence belt limited by two straight lines
13
x0
12
e (xX+)
2
22 dx = 0.05 =
+x0
12
e (xX)
2
22 dx
X = x0 k
8/13/2019 Confidence Level
14/31
Confidence levels near
a physical boundaryAssume a mass measurement with resolution 20 MeV
The true mass is 10 MeV
Use a 2#(95.4%) C.I. to quote the result: x 40 MeV
Consider cases:
(2.3 % probability that measurement > 50 MeV
(Measurement in range 40-50 MeV: limits will be true
(x = 0.2 40 MeV: correct lower limit to 0 and OK
(BUT what if x = - 50 MeV 40 MeV : X < -10 MeV @ 95% C.L. !!!???It is strictly speaking correct but ridiculous!
Only means of escape: BAYES TO THE RESCUE!
14
8/13/2019 Confidence Level
15/31
Bayesian Confidence Intervals
Bayes theorem:
P(theory): assume all positive masses equally likely
Now apply Bayes theorem:
For x = - 50 MeV 20 MeV: Denominator is one-sided 2.5 #Gaussian tail: 0.0062
Look for 90 % C.L. upper limit: Integral of numerator must be ~0.0006: 3.24 #
Results: mass < -50 MeV + 3.24 * 20 MeV = 15 MeV @ 90 % C.L.
15
P(theory|data) =P(data|theory)P(theory)
P(data)
P(true mass) = 0, m
8/13/2019 Confidence Level
16/31
Bayesian Confidence Interval:
ExampleMass measurementMeasurement x = - 50 MeV 20 MeV
Prior : assume all positive masses equally likely:
Denominator is one-sided 2.5 #Gaussian tail: 0.0062
Integral of numerator must be ~0.0006: 3.24 #
Result: mass < -50 MeV + 3.24 * 20 MeV = 15 MeV @ 90 % C.L.
For comparison: Frequentist 90 % Upper Limit: - 10 MeV
16
P(true mass) = 0, m
8/13/2019 Confidence Level
17/31
Confidence Intervals for
Discrete distributions Physical parameter: real
Measured variable discrete:
number of counts (Poisson)
Number of successes (Binomial)
May be unable to select region
with exact confidence C (e.g.90%):
Play safe: gives overcoverage
17
8/13/2019 Confidence Level
18/31
Binomial Confidence Intervals
Observed value: Number of successes (out of N trials) - DISCRETE
True value: single trial probability R - CONTINUOUS
If m successes found in N trials:
! Limits on the individual probability p: Find p+and p-such that (Using 95%central limit):
(CLOPPER-PEARSON COEFFICIENTS)
! In words: Were p&p+: Probability to get m counts or less is '2.5%
Were p'p-: Probability to get m counts or more is '2.5%
18
r=m+1
P(r;p+, N) = 0.975m1
r=0
P(r;p, N) = 0.975
8/13/2019 Confidence Level
19/31
19
From the archives:
8/13/2019 Confidence Level
20/31
Poisson
confidence intervalsn events observed from Poisson process of unknown mean "
The 90 % Poisson upper limit is the value "+such that:
i.e.: if true value of "is really "+probability for getting a number ofcounts n or smaller is 10%
Similarly, the 90 % Poisson lower limit is the value "-such that:
20
r=n+1
P(r; +) = 0.90, or equivalently
n
r=0
P(r; +) = 0.10
n1
r=0
P(r; ) = 0.90, or equivalently
r=n
P(r; ) = 0.10
8/13/2019 Confidence Level
21/31
Some Poisson Limits
21
8/13/2019 Confidence Level
22/31
Confidence intervals using
likelihood functionFor non-Gaussian estimators, still possible to determine confidenceinterval with a simple approximate technique using the likelihoodfunction, or equivalently the !2function
For a ML estimator for a parameter aIn the large sampleapproximation:
! The p.d.f. g(,a) becomes Gaussian:
! The likelihood function itself becomes Gaussian with the same #
Can extract #from likelihood scan!
22
g(a; a) = 1
2aexp
(a a)2
22a
L(a) = Lmaxexp
(a a)2
22a
8/13/2019 Confidence Level
23/31
Prescription for setting confidence
intervals using the likelihood1. Extract #from log-likelihood scan using:
2. Use fact that g(,a) is Gaussian to set confidence intervals:! e.g.: [c, d] = [ - #, + #] : 68% central confidence interval
Can be shown that this procedure can be used even if thelikelihood function is not Gaussian
! Exact only in the large sample limit
! May need to use asymmetric intervals around
23
logL(aNa) =logLmax N2
2
8/13/2019 Confidence Level
24/31
Example: lifetime fit
24
0 0.5 1 1.5 2 2.5 30
0.2
0.4
0.6
0.8
1h_tau
Entries 5
Mean 1.201
RMS 0.5881
0.5 1 1.5 2 2.5 3 3.5
-8
-7.5
-7
-6.5
-6
-5.5
-5
!"
-!#"
-!"
+!#"
+!"
0 0.5 1 1.5 2 2.5 30
0.5
1
1.5
2
2.5
3
3.5
4
h_tau
Entries 50
Mean 0.815
RMS 0.6943
0.8 0.9 1 1.1 1.2 1.3 1.4
-54
-53.5
-53
-52.5
-52
-51.5 !"
-!#"
-!"
+!#"
+!"
0 0.5 1 1.5 2 2.5 30
2
4
6
8
10
12
14
16
18
20
22h_tau
Entries 500
Mean 0.8455
RMS 0.7131
0.94 0.96 0.98 1 1.02 1.04 1.06 1.08 1.1 1.12 1.14
-519
18.5
-518
17.5
-517
16.5!"
-!#"
-!"
+!#"
+!"
Using estimator : = 1n
n
i=1
ti
logLmax
logLmax- 1/2
True )=1
8/13/2019 Confidence Level
25/31
Several variables:
Confidence regions1-D case: look for single parameter "
constructed interval [a,b] which contains true value with someprobability C
N-D case: Look for parameters "=("1, ..., "n)
In general, cannot find ai,biso that ai
8/13/2019 Confidence Level
26/31
Multidimensional Confidence
regionsUse the properties of the likelihood function in the large sample limit:
1. The ML estimator p.d.f. is Gaussian:
2. The Likelihood function is Gaussian with the same V:
3. If p.d.f. described by n-dim. Gaussian: Q(, a) distributed according to !2distribution with n d.o.f. :
26
g(a;a) =(2)n|V|1/2
exp2Q(a;a)
, Q(a;a) = (a a)TV1(a a)
V1 : inverse covariance matrix
L(a) =Lmaxexp
1
2Q(a, a)
,
Prob(Q(a, a) Q) =
Q0
f2(z, n) 2 distr. f o r n d.o.f.
dz
8/13/2019 Confidence Level
27/31
Constructing n-dimensional
Confidence regionPrescription for findingconfidence region at @ C.L. 1-#:
1. Determine value of Q#(tablesor numerically) so that:
2. Find contour at which
1. *logL = - Q#/ 2
27
Q0
f2(z, n)dz= 1
8/13/2019 Confidence Level
28/31
Q#values for some values of coverage and numbers of
parameters:
28
8/13/2019 Confidence Level
29/31
Upper limit on the mean of
Poisson variable with backgroundObserved number of events is sum of signal + background:n = ns+ nb(expectation values +s, +b)
Goal: construct upper limit for +s
n is Poisson distributed with mean +s+ +b:
ML estimate for +s: n - +b
Lower limit:
Upper limit:
Upper and lower limits are related to the limits without background
Problem if number of counts smaller than expected background, upper limit is < 0!
BAYES TO THE RESCUE!
29
= P(s obss ;
los ) =
nnobs
(los + b)n
n! e
(los +b)
= P(s obss ;
ups ) =
nnobs
(ups + b)n
n! e
(ups +b)
lo
s = lo
s (no background)
b
ups =
ups (no background) b
8/13/2019 Confidence Level
30/31
Upper limit with background:
Bayesian approach Bayesian approach (flat prior)
Solution
Reduces to classical solution for no background
30
8/13/2019 Confidence Level
31/31
Upper limit with background
31
Bayesian solution