1
N EURAL N ETWORKS FOR CALIBRATING ATLAS JETS M URTAZA S AFDARI ( MURTAZAS @ STANFORD . EDU ) AND N ICOLE H ARTMAN ( NICOLE 22@ STANFORD . EDU ) M OTIVATION ATLAS is a particle detector analyzing proton-proton collisions from the LHC. Jets are collimated sprays of particles in the detector. Other low energy collisions in a beam cross- ing can add extra energy to the jet degrad- ing our accuracy in reconstructing the jet’s transverse momentum, p T . Goal: Take energy depositions in the detec- tor and use regression and neural networks to improve the p T reconstruction that is uni- form over a range of p T values. D ATASET Our dataset consists of 6 million jets of detector level jets which contain: The true jet p T The (η,φ) coordinates with the correspond- ing p T for the clusters in a jet. Jet’s transverse area: A T Number of primary vertices in event, N PV The event weights. We divided our data into three samples: 80% as training set, 10% as CV set, and 10% as test set. Since we had vastly more low p T events, we re- vised the event weights by flattening the truth p T distribution in 4 GeV bins. Pixel image of a jet in a 8 × 8 matrix binned in the (η,φ) coordinates relative to the jet’s center with the bin’s greyness set by the p T . F EATURES p T (j no area sub)= clusters p T (cluster ) p T (j 0) = clusters p T (cluster ) - ρA T Rings of p T summed in annuli of ΔR =0.5 (Jet radius: R = p η 2 + φ 2 =0.4) Currently ATLAS accounts for pile-up using p reco T = p deposited T - ρA T - α(N PV - 1). Then numerical inversion accounts for the non- uniformities of the detector. C OST F UNCTION In initial linear regression studies we tried cost functions: J α (θ )= i w i (y -h(x)) 2 y β , β =2, 1, 0, -1. Optimize performance over entire p T range by combining β =0 and β =2 cost functions: J (θ )= i w i (y - h(x)) 2 1+ λ y 2 , where λ = 100 to appropriately scale J 2 to J 0 . R ESULTS I To ensure uniform performance over the entire p T range, we looked at the closure h θ (x)/y in 4 GeV bins and calculated the mean and standard deviation. R EFERENCES [1] The ATLAS Collaboration. Performance of pile-up mitigation tech- niques for jets in pp collisions at s =8 tev using the atlas detec- tor. arXiv 1510.03823, 2015. [2] Aviv Cukierman and Benjamin Nachman. Mathematical proper- ties of numerical inversion for jet calibrations. arXiv 1609.05195v1, 2016. F UTURE S TEPS Use a series of classifiers to predict which truth p T bin a given jet is in, then train a regression model on the combined outputs. Improve and understand CNN architecture to better adapt to ATLAS jet reconstruction. D ISCUSSION Every NN that we derived performs better than the current ATLAS standard, numerical inversion. The NN with annuli info does best, showing that jet substructure helps. All other NNs perform comparably. The j no area sub with ρA T contains the same info as j 0. No new info appears to be gained by including the N PV variable. R ESULTS II Since the neural networks aren’t expected to do well at the high and low p T limits, evaluate mod- els using the error in p T range [45,120] GeV. Models CV error Inversion on j 0 0.0327 ± 0.0006 Inversion on j no area sub 0.0311 ± 0.0006 NN with j 0 input 0.0151 ± 0.0007 NN with jj no area sub 0.0138 ± 0.0007 NN: j no area sub, ρA T 0.0153 ± 0.0007 NN: j no area sub, ρA T , N PV 0.0141 ± 0.0007 NN with 8 annuli and j 0 0.0129 ± 0.0007 All the neural networks are sequential and have 1 hid- den layer with 5 input nodes. We prototyped a CNN on 50,000 jets with (2 × 8 × 8) + (5 × 5 × 5) + (2 × 2) + 5 + 1 architecture, with a CV err over p T [20, 140] GeV of 0.076 ± 0.005

R II - Machine Learningcs229.stanford.edu/proj2016/poster/Safdari-Hartman...2016. FUTURE STEPS Use a series of classifiers to predict which truth p T bin a given jet is in, then train

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: R II - Machine Learningcs229.stanford.edu/proj2016/poster/Safdari-Hartman...2016. FUTURE STEPS Use a series of classifiers to predict which truth p T bin a given jet is in, then train

NEURAL NETWORKS FOR CALIBRATING ATLAS JETSMURTAZA SAFDARI ([email protected]) AND NICOLE HARTMAN ([email protected])

MOTIVATION

• ATLAS is a particle detector analyzingproton-proton collisions from the LHC.

• Jets are collimated sprays of particles in thedetector.

• Other low energy collisions in a beam cross-ing can add extra energy to the jet degrad-ing our accuracy in reconstructing the jet’stransverse momentum, pT .

• Goal: Take energy depositions in the detec-tor and use regression and neural networksto improve the pT reconstruction that is uni-form over a range of pT values.

DATASET

Our dataset consists of ∼ 6 million jets of detectorlevel jets which contain:

• The true jet pT• The (η, φ) coordinates with the correspond-

ing pT for the clusters in a jet.• Jet’s transverse area: AT• Number of primary vertices in event, NPV• The event weights.

We divided our data into three samples: 80% astraining set, 10% as CV set, and 10% as test set.Since we had vastly more low pT events, we re-vised the event weights by flattening the truth pTdistribution in 4 GeV bins.

Pixel image of a jet in a 8 × 8 matrix binned in the(η, φ) coordinates relative to the jet’s center withthe bin’s greyness set by the pT .

FEATURES• pT (j no area sub) =

∑clusters pT (cluster)

• pT (j0) =∑clusters pT (cluster) − ρAT

• Rings of pT summed in annuli of ∆R = 0.5

(Jet radius: R =√η2 + φ2 = 0.4)

Currently ATLAS accounts for pile-up usingprecoT = pdepositedT − ρAT − α(NPV − 1).

Then numerical inversion accounts for the non-uniformities of the detector.

COST FUNCTIONIn initial linear regression studies we tried costfunctions: Jα(θ) =

∑i wi

(y−h(x))2yβ

, β = 2, 1, 0,−1.

Optimize performance over entire pT range bycombining β = 0 and β = 2 cost functions:J(θ) =

∑i wi(y− h(x))2

(1 + λ

y2

), where λ = 100

to appropriately scale J2 to J0.

RESULTS ITo ensure uniform performance over the entire pT range, we looked at the closure hθ(x)/y in 4 GeV binsand calculated the mean and standard deviation.

REFERENCES

[1] The ATLAS Collaboration. Performance of pile-up mitigation tech-niques for jets in pp collisions at

√s = 8 tev using the atlas detec-

tor. arXiv 1510.03823, 2015.

[2] Aviv Cukierman and Benjamin Nachman. Mathematical proper-ties of numerical inversion for jet calibrations. arXiv 1609.05195v1,2016.

FUTURE STEPS• Use a series of classifiers to predict which truth

pT bin a given jet is in, then train a regressionmodel on the combined outputs.

• Improve and understand CNN architecture tobetter adapt to ATLAS jet reconstruction.

DISCUSSION

• Every NN that we derived performs betterthan the current ATLAS standard, numericalinversion.

• The NN with annuli info does best, showingthat jet substructure helps.

• All other NNs perform comparably.– The j no area subwith ρAT contains the

same info as j0.– No new info appears to be gained by

including the NPV variable.

RESULTS IISince the neural networks aren’t expected to dowell at the high and low pT limits, evaluate mod-els using the error in pT range [45,120] GeV.

Models CV error

Inversion on j0 0.0327 ± 0.0006Inversion on j no area sub 0.0311 ± 0.0006NN with j0 input 0.0151 ± 0.0007NN with jj no area sub 0.0138 ± 0.0007NN: j no area sub, ρAT 0.0153 ± 0.0007NN: j no area sub, ρAT , NPV 0.0141 ± 0.0007NN with 8 annuli and j0 0.0129 ± 0.0007

All the neural networks are sequential and have 1 hid-den layer with 5 input nodes.We prototyped a CNN on 50,000 jets with (2× 8× 8) +(5× 5× 5) + (2× 2) + 5 + 1 architecture, with a CV errover pT ∈ [20, 140] GeV of 0.076± 0.005