Transcript
Page 1: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 SummaryG. Cowan 1

PHYSTAT2011

WorkshopSummary

CERN, 17-19 January, 2011

Glen CowanPhysics DepartmentRoyal Holloway, University of London

Page 2: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 SummaryG. Cowan 2

Outline

Frequentist methods

Bayesian methods

In practice: Tevatron, CMS, ATLAS, LHCb

Combining results

Look-elsewhere effect

Software tools

Applications: partons, gravity, astro

Banff Challenge 2a

Outlook

Page 3: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 SummaryG. Cowan 3

Frequentist methods

Order statistics for discovery (D. Cox)

Limits, etc. (L. Demortier, D. van Dyk)

Likelihood ratio tests (GDC, C. Roever, J. Conway)

More on p-values (F. Beaujean)

Page 4: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 4G. Cowan

Order statistics for discovery

Can HEP use this to look for a bump in a histogram? Need to use modified version where signal smeared over several bins.

D. Cox

Page 5: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 5G. Cowan

Limits, etc. L. Demortier

PCL, CLs, “unconstrained”limits for measurement ofX ~ Gaussian(μ)

Page 6: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 6G. Cowan

Limits, etc. (2) D. van Dyk

Proposed procedure:

But with PCL it should be easy to communicate where the limit(bound) is the observed one, and where it is the power threshold.

Page 7: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 7G. Cowan

Improving the model: template morphing

J. Conway

Often shift in nuisance parameters causes complicated changein a distribution – parametrize with e.g. template morphing:

Need to do this e.g. to properlyinclude systematics due to jetenergy scale, which affects different distributions in different ways.

Page 8: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 8G. Cowan

Marginalize vs. maximize J. Conway,C. Roever

The point was raised as to whether it is better in some sense toconstruct a ratio of marginalized or profile likelihoods.

Conway, Roever see little difference:

Page 9: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 9G. Cowan

Page 10: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 10G. Cowan

Page 11: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 11G. Cowan

One more comment on Asimov... GDC

n ~ Poisson(μs+b), median significance, assuming μ = 1,with which one would reject μ = 0.

Page 12: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 12G. Cowan

Bayesian methods

Reference priors (L. Demortier, J. Bernardo, M. Pierini)

Bayes factors (J. Berger)

Application to sparse spectra (A. Caldwell)

Page 13: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 13G. Cowan

Bayes factorsJ. Berger

E.g. apply to Poisson counting problem: N ~ Poisson(s+b)

(Not easy for HEP applications)(1)

(2)

→→

Page 14: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 14G. Cowan

Bayes factors (2)

(3) Lower bound on Bayes factor: make π(s) a delta function at s.

J. Berger

(4?) Can we not simply plot the Bayes factor vs. s (i.e. report B0s, not B01)?

So even lowest Bayes factors substantially bigger than the p-values.

Page 15: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 15G. Cowan

Bayes factors (3)

“Bayes factor more intuitive than a p-value.” (“5σ” ↔ B01 = ?)

“Automatic” inclusion of look-elsewhere effect (through prior).

Can we use e.g.

But, marginal likelihoods can be difficult to compute:

(K. Cranmer/R. Trotta)

Page 16: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 16G. Cowan

K. Cranmer/R. Trotta

Page 17: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 17G. Cowan

Reference priorsJ. Bernardo,L. Demortier,M. PieriniMaximize the expected Kullback–Leibler

divergence of posterior relative to prior:

This maximizes the expected posterior information about θ when the prior density is π(θ).

Finding reference priors “easy” for one parameter:

Page 18: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 18G. Cowan

Reference priors (2)J. Bernardo,L. Demortier,M. Pierini

Actual recipe to find reference prior nontrivial;see references from Bernardo’s talk, website ofBerger (www.stat.duke.edu/~berger/papers) and also Demortier, Jain, Prosper, PRD 82:33, 34002 arXiv:1002.1111:

Prior depends on order of parameters. (Is order dependence important? Symmetrize? Sample result from different orderings?)

Page 19: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 19G. Cowan

L. Demortier

Page 20: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 20G. Cowan

Bayesian discovery with sparse data A. Caldwell

Meaningful elicitation of priorfrom community consensus, here:

Consensus priors doable in practice? (Committee?)

Page 21: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 21G. Cowan

More on p-values (model validation) F. Beaujean

Suppose we’re only given p-values – how should thisinfluence a Bayesian’s degree of belief?

OK but... not same as P(M0|data), rather a justification of p-value.

Or, justify by saying that a bizarre result prompts one to comment on (and quantify) just how bizarre it is.

Page 22: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 22G. Cowan

Decision theory

J. Bernardo,D. van Dyk

Statistical methods can be formulated in elegant but (for particlephysicists) not fully familiar language of decision theory:

Specify loss function, minimize expected loss, etc.

How does this map onto the usual ways that physicists viewdiscovery/limits?

What are specific benefits for HEP of this approach?

“It’s a tool to be aware of.” – D. van Dyk

Page 23: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 23G. Cowan

Combining results N. Krasnikov,K. Cranmer

Given p-values p1,..., pN of H, what is combined p?

Rather, given the results of N (usually independent) experiments, what inferences can one draw from their combination?

Full combination is difficult but worth the effort for e.g.combined ATLAS/CMS Higgs search.

Form full likelihood function of the joint experiment;

Use to construct statistical tests (e.g. likelihood ratio)

Single common parameter of interest: μ = σ/σSM

Also (in principle) common nuisance parameters (e.g.,luminosity, parton uncerainties,...)

New software: RooStats (RooFit/ROOT).

Page 24: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 24G. Cowan

Combined ATLAS/CMS Higgs searchK. Cranmer

Given p-values p1,..., pN of H, what is combined p?

Better, given the results of N (usually independent) experiments, what inferences can one draw from their combination?

Full combination is difficult but worth the effort for e.g. Higgs search.

Page 25: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 25G. Cowan

Look-elsewhere effect O. Vitells,G. Ranucci,A. Caldwell

Page 26: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 26G. Cowan

Look-elsewhere effect (2) O. Vitells

Correction to p-value from theory of random fields; related to mean number of “upcrossings” of likelihood ratio(Davies, 1987).

estimate with MCat low referencelevel

Page 27: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 27G. Cowan

Look-elsewhere effect (3) O. Vitells

Generalization to multiple dimensions: number of upcrossingsreplaced by expectation of Euler characteristic:

Applications: astrophysics (coordinates on sky), search forresonance of unknown mass and width, ...

Page 28: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 28G. Cowan

Look-elsewhere effect in time series G. Ranucci

Page 29: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 29G. Cowan

Bayesian Analysis Toolkit (BAT)S. Pashapour

General framework specifically for Bayesian computation,especially MCMC for marginalizing posterior probabilities.

E.g. convergence diagnostics à la Gelman & Rubin,

Wish list(?): computation of marginal likelihoods; support for important types of priors (reference,...)

Page 30: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 30G. Cowan

RooStats G. Schott

Page 31: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 31G. Cowan

RooFit Workspaces G. Schott

Able to construct full likelihood for combination of channels(or experiments).

Page 32: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 32G. Cowan

RooStats G. Schott

Example of tools: Bayesian priors from formal rules:

Page 33: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 33G. Cowan

Examples of using RooStatsV. Zhukov,K. Cranmer

E.g. studying effect of correlated systematics in combined fitusing various methods with RooStats (V. Zhukov):

And of course the previously mentioned ATLAS/CMS Higgs combination (K. Cranmer).

Page 34: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 34G. Cowan

Lessons from the TevatronH. Prosper

Example: search for single top with multivariate analysis.Need accurate understanding of background (mature experiments).Would community accept MVA as readily if this were SUSY?

Page 35: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 35G. Cowan

Bayesian analysis for cross sectionof single top

H. Prosper

Page 36: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 36G. Cowan

Converted to frequentist p-valuefor discovery of single top

H. Prosper

Page 37: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 37G. Cowan

In practice: CMS, ATLAS, LHCbA. Harel,D. Casadei,J. Morata

Publications from LHC have started to arrive!!!!!

A. Harel (CMS)

D. Casadei (ATLAS)

Page 38: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 38G. Cowan

Example: CMS W’ searchA. Harel

Page 39: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 39G. Cowan

Example: ATLAS dijet resonance search D. Casadei

Page 40: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 40G. Cowan

Example: LHCb search for Bs → μμ J. Morata

Multivariate methods used: MLP, “Δχ2”, BDT, etc.

Page 41: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 41G. Cowan

Applications: parton densitiesS. Forte

Fits to parton densities(e.g., MSTW, CTEQ) have found “reasonable” χ2, but the standard procedure for errors, Δχ2 = 1, leads to unrealistically small variation of parton densities.

Could be consequence of: inconsistent data; inadequate model, e.g., parametric functions in fit (MSTW):

NNPDF approach: parameterize using neural network; much larger number (37×7 = 259) of parameters compared to MSTW (20) or CTEQ (26); regularize using cross validation.

Page 42: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 42G. Cowan

Partons (2)S. Forte

NNPDF predictions quote uncertainties using “standard”Bayesian propagation of experimental uncertainties as obtainedby CTEQ and MSTW when using very large Δχ2 (50 – 100).

I.e. limited parametric form was important source of the “Δχ2=?” problem?

Still need to address othersystematics (e.g. theory erorrs),data compatibility, etc.

Page 43: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 43G. Cowan

Applications: gravitational waves C. Roever,S. Sardy

Wavelet transforms to search for blips in time series:

Regularization to suppress noisy wavelet coefficients,→ bias-variance trade-off; mathematics similar to unfolding.

Page 44: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 44G. Cowan

Applications: astrophysicsO. Lahav

Wide variety of problems – mostly Bayesian approach.

Parameters can be: fundamental (e.g., curvature of universe); “geographical” (eccentricity of an orbit).

Wide use of Bayes factors for model selection.

HEP should look at astro community’s tools for computing these (e.g. MultiNest).

Page 45: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 45G. Cowan

Bayesian exoplanet search O. Lahav

← priors

Page 46: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 46G. Cowan

The Banff Challenge 2aT. Junk

Two problems:

The winners:

Page 47: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 47G. Cowan

Some thoughts on 5σ (van Dyk)D. van Dyk

Page 48: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 48G. Cowan

Some more thoughtsD. van Dyk

Need to develop more ways to insert nuisance parametersinto models in clever, physically motivated, controlled ways.

And figure out how to constrain these parameters withcontrol measurements; assign meaningful priors to them.

Page 49: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 49G. Cowan

OutlookGreat progress in methods/software/sophistication since the first“Confidence Limits” workshop at CERN in 2000.

And many areas where progress still being made:Spurious exclusion when no sensitivity (CLs, PCL, other?) Bayesian priors (reference and otherwise)Look-elsewhere effect (solved?)Software tools (RooStats, BAT)Ways to improve models......

The LHC data floodgates are open – we need to continue to improve and develop our analysis methods, taking full advantageof the lessons from the statistics community and other fields.

Page 50: PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary 50G. Cowan

Thanks

Thanks to all the speakers for contributing highly interesting talks.

Thanks to the non-particle physicists and especially the professionalstatisticians for sharing their insights and expertise.

Thanks to the organisers, Louis, Albert, Michelangelo, for arrangingsuch a stimulating and productive meeting.


Recommended