Transcript
Page 1: Unfolding of multivariate tools and statistical analysis ... · Unfolding of multivariate tools and statistical analysis for Higgs boson pair production searches in the ATLAS detector

Unfolding of mul t ivar iate tools and stat ist ical analysis for Higgs boson pair product ion searches in the ATLAS detector at the

Large Hadron Col l ider

Christina DimitriadiPartikeldagarna 2019

Linköping

Page 2: Unfolding of multivariate tools and statistical analysis ... · Unfolding of multivariate tools and statistical analysis for Higgs boson pair production searches in the ATLAS detector

Motivation

§ This study uses the results of the search for SMpair production of Higgs bosons in the 𝑏"𝑏𝜏$𝜏%final state, where both 𝜏 -leptons decay in ahadronic mode (𝜏'()𝜏'()).

§ Why di-Higgs?

v The Higgs boson has the unique ability tocouple to itself, resulting in the simultaneousproduction of two Higgs bosons

v The Higgs self-coupling is promptly linked tothe Higgs potential, whose actual shape hasnot been measured experimentally yet

Page 3: Unfolding of multivariate tools and statistical analysis ... · Unfolding of multivariate tools and statistical analysis for Higgs boson pair production searches in the ATLAS detector

Purpose of study

§ Rescoping of the multivariate analysis(MVA) (Phys. Rev. Lett. 121, 191801 (2018)) byintroducing a cut-based analysis.

§ MVA techniques are widely used in HighEnergy Physics, where there is an ever-increasing production of large data, whichtypically involve multiple variables includingtheir correlations.

§ But is MVA always the best option? What isthe gain by using MVA? Can a cut-basedanalysis perform equally well?

BDT score1- 0.8- 0.6- 0.4- 0.2- 0 0.2 0.4 0.6 0.8 1

Even

ts /

Bin

1

10

210

310

410Data NR HH at exp limitTop-quark

fakes (Multi-jets)hadt ®jet +(bb,bc,cc)tt ®Z

)t fakes (thadt ®jet OtherSM HiggsUncertaintyPre-fit background

ATLAS -113 TeV, 36.1 fb

2 b-tagshadthadt

BDT score1- 0.8- 0.6- 0.4- 0.2- 0 0.2 0.4 0.6 0.8 1D

ata/

Pred

.

0.5

1

1.5

Page 4: Unfolding of multivariate tools and statistical analysis ... · Unfolding of multivariate tools and statistical analysis for Higgs boson pair production searches in the ATLAS detector

Looking forward to answering your questions in the Poster Session later 😀

Unfolding of multivariate tools and statistical analysis for Higgs boson pair production searches in the ATLAS detector at the

Large Hadron Collider

MotivationSearches for Higgs boson pair production provide insight into

the Higgs boson self-coupling, promptly linked to the shape of

the Higgs potential in and beyond the Standard Model (SM).

BDT score1- 0.8- 0.6- 0.4- 0.2- 0 0.2 0.4 0.6 0.8 1

Even

ts /

Bin

1

10

210

310

410Data NR HH at exp limitTop-quark

fakes (Multi-jets)hadt ®jet +(bb,bc,cc)tt ®Z

)t fakes (thadt ®jet OtherSM HiggsUncertaintyPre-fit background

ATLAS -113 TeV, 36.1 fb

2 b-tagshadthadt

BDT score1- 0.8- 0.6- 0.4- 0.2- 0 0.2 0.4 0.6 0.8 1Da

ta/P

red.

0.5

1

1.5

Ø SM Higgs boson pair production at the LHC

Ø Higgs boson pair decay processes

Purpose of studyRescoping of multivariate analysis (MVA) by introducing a

cut-based analysis, so that the gain of using the MVA can be

estimated.

Phys. Rev. Lett. 121, 191801 (2018)

Ø Optimal cuts on these variables are found, so that mostof the background is removed while a large fraction ofthe signal survives. The aim is to find the combination ofcuts giving the highest significance.

Cut 1 Cut 2

Cut 3

Cut-based analysisWHY?

Better re-interpreted

by theorists

M E T H O D S

ConclusionNeither using the BDT nor fitting its score distribution

worsens the sensitivity to !! → ##$$ by 50%. This is

a logical outcome since the BDT is by construction the

best final discriminant and has the best separation.

However, it was of great importance to suggest a cut-

based analysis that could be easier re-interpreted by

theorists compared to the BDT analysis.

MVA – Boosted decision tree (BDT)⟹ to separate signal from background

Ø Expected upper limits on the HH production crosssection (without systematic uncertainties) @ 95% CL,when fitting &!! in SRs and CRs based on the optimalcombination of cuts

Ø Definition of variables associating the mass and '( of di-$ and di-# objects

Decay channel of interest for this study:

!! → #)#$*$+ → ##$,-.$,-.

• relatively clean final state

• relatively large branching ratio (7.4%)

• both /-leptons decay hadronically

• Look at the distributions of the BDT input variables

• Get expected upper limits on the HH production cross section

(without systematic uncertainties)

§ Fit the BDT score⇒ exp.: 15.2×234 @ 95% CL

§ Fit 566 in signal (SR) and control regions (CR) based on the

last 2 bins of the BDT score⇒ exp.: 21.7×234 @ 95% CL