23
Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1 , T. Basaglia 2 , Z. W. Bell 3 , P. V. Dressendorfer 4 1 INFN Genova, Genova, Italy 2 CERN, Geneva, Switzerland 3 ORNL, Oak Ridge, TN, USA 4 IEEE, Piscataway, NJ, USA NSS 2012 Anaheim, CA

Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Embed Size (px)

Citation preview

Page 1: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 1

Publication patterns in HEP computing

M. G. Pia1, T. Basaglia2, Z. W. Bell3, P. V. Dressendorfer4

1INFN Genova, Genova, Italy2CERN, Geneva, Switzerland3ORNL, Oak Ridge, TN, USA4IEEE, Piscataway, NJ, USA

NSS 2012Anaheim, CA

Page 2: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 2

Analysis topicsGeneral tools−Geant4−ROOT

HEP experiments−LEP

ALEPH, DELPHI, L3, OPAL

−CDF

−BaBar−LHC

ALICE, ATLAS, CMS, LHCb, TOTEM

Grid computing −LCG

What they publish

How much

Where

Citations

Technology vs physics

Software vs hardware

Software/DAQ-trigger

Representative

Not exhaustiveNo time to report all the results

Page 3: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 3

Data sourcesThomson-Reuters: ISI Web of Knowledge− CERN subscription: since 1970, conference database not included− Search by keywords, collaboration name

Journal web sites− IEEE TNS− NIM, Comp. Phys. Comm. (Elsevier)− JINST (IOP/SISSA)➤ Full-text searches

CERN databases− CERN Document System− Greybook

Years: 1982-2011 (LEP), 1992-2011 (BaBar, LHC)− Reproducible sample for citation analysis

Publication analysis extended to 30 September 2012

Page 4: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 4

Data sampleContamination−Non-pertinent entries in the data sample

Omission− Pertinent papers are not included in the data sample

➩ Cross-checks−WoS/CDS, WoS/publishers’ web sites

WoS inconsistencies and errors− Total number of citations includes Conference database− Proceedings papers: false classifications and omissions ➩Manually corrected whenever possible

Automated analysis (whenever possible)

Manual evaluation: abstracts and full-text papers − Some degree of subjectivity

Page 5: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 5

S. Agostinelli et al.

Geant4: a simulation toolkitNIM A, vol. 506, no. 3, pp. 250-303, 2003

J. Allison et al.

Geant4 Developments and ApplicationsIEEE Trans. Nucl. Sci., vol. 53, no. 1, pp. 270-278, 2006

3301 citations (20 October2012)

Most cited CERN publication in WoS(excluding Rev. Part. Properties)

665 citations (20 October2012)

Many papers cite the NIM paper, but they omit citing the TNS one, even though both are indicated in http://cern.ch/geant4

Many papers that use Geant4 do not cite either reference

Citation analysis: until end 2011

Page 6: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 6

2003 2004 2005 2006 2007 2008 2009 2010 20110

100

200

300

400

500 Geant4 NIM Geant4 TNS

Year

Cit

ati

on

sRadiat. Prot. Dosim.

J. Korean Phys. Soc.Radiat. Meas.

Appl. Radiat. Isot.J. Phys. G

JHEPAstrop. Phys.

EPJCJINSTNIM B

Phys. Lett. BPhys. Rev. C

Phys. Med. Biol.Med. Phys.

Phys. Rev. Lett.TNS

Phys. Rev. DNIM A

0 50 100 150 200 250 300 350 400

Geant4 NIM: Citing Journals

Citations

30% Physics

75% citations(plot)

ISOLDE

ALICE

JET EFDA

BES III

N TOF

MiniBooNE

LUNA

CDF

HARP

LHCb

CMS

ATLAS

BaBar

0 20 40 60 80 100 120 140 160 180 200

G4 NIM: Citing Collaborations

Citations

LHCHEPOther

16% citations (plot)19% citations from collaborations

Born from LHC experimental requirementsMultidisciplinary sources of citations

Page 7: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 7

R. Brun and F. Rademakers

ROOT - An object oriented data analysis framework NIM A, vol. 389, no. 1-2, pp. 81-86, 1997

I. Antcheva et al.

ROOT - A C++ framework for petabyte data storage, statistical analysis and visualizationComp. Phys Comm., vol. 180, no. 12, pp. 2499-2512, 2009

584 citations (20 October 2012)

32 citations (20 October 2012)

AIHENP Workshop proceedings paper

Citation analysis: until end 2011

Page 8: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 8

1997

1998

1999

2000

2001

2002

2003

2004

2005

2006

2007

2008

2009

2010

2011

0

10

20

30

40

50

60 ROOT Proc. ROOT CPC

Year

Cit

ati

on

s

NIM ATNS

Comp. Phys. Comm.Phys. Rev. CPhys. Rev. D

JINSTPhys. Med. Biol.

EPJCMed. Phys.

JHEPNIM B

Lect. Notes Comp.Astropart. Phys.

0 20 40 60 80 100 120

ROOT Proc.: Citing Journals

Citations

AUGER

D0

H1

JET-EFDA

PHOBOS

RISING

ALICE

T2K

D0

CDF

0 1 2 3

Citations

75% citations

8% of all citations from collaborationsGeant4 % ROOT %

Technology 30.3 49.6

Physics 29.9 18.2

BioMedical 13.9 6.0

Field of citing journals

Page 9: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 9

HEP experiments

LEP• ALEPH• DELPHI• L3• OPAL

CDFBaBarLHC

• ALICE• ATLAS• CMS• LHCb• TOTEM

CDF: 1985/1988LEP: 1989BaBar: 1999LHC: 2008

Start of run

Page 10: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 10

Time distribution

Publication year Rescaled w.r.t. year of start run

CDF: 1985/1988LEP: 1989BaBar: 1999LHC: 2008

Start of run

Page 11: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 11

Time distribution

Same as previous slide, rescaled by the number of experiment members

CDF: 1985/1988LEP: 1989BaBar: 1999LHC: 2008

Start of run

Page 12: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 12

Publications Share of hardware, software and DAQ-trigger

publications

Page 13: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 13

Physics publications

LEP experiments completed their life-cycleLHC experiments: at an early stage of their physics production

Page 14: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 14

Technological publications

Roughly constant trends, once the number of publications is normalized to the number of collaborators

Page 15: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 15

Software vs. hardware

Hardware publications: approximately 4 times more than softwareDAQ-trigger publications: approximately 1.3 times more than software

Page 16: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 16

Journals: LEP and LHC

Still dominated by technological publications

LHCLEP

Dominated by physics publications

Page 17: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 17

CitationsThe most cited papers are often the general reference papers about the detector published by each experiment

Citations of the most cited paperALEPH: 340DELPHI: 309L3: 509OPAL: 473BaBar: 859ALICE: 116CMS: 129LHCb: 101TOTEM: 35

ATLAS: ATLAS pixel detector electronics and sensors: 185

0 citations: 6% 0 citations: 20%

0 citations: 27% 0 citations: 28%

Page 18: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 18

More references

more citations

ReferencesPhysics papers cite

more references than technological

papers

Bibliographical entries in software papers are often

web sites

Page 19: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 19

Sources of citations of physics papers

Nucl. Phys. A

Phys. Atom. Nucl.

Mod. Phys. Lett. A

Phys. Rep.

NIM A

J. Phys. G

Acta Phys. Pol. B

Int. J. Mod. Phys. A

Z. Phys. C

JHEP

Phys. Rev. Lett.

Nucl. Phys. B Proc. Suppl.

Nucl. Phys. B

EPJC

Phys. Lett. B

Phys. Rev. D

0 5 10 15 20 25

DELPHI ALEPH

Citations (%)

Ann. Rev. Nucl. Part. Sci.

J. Cosm. Astrop. Phys.

JINST

New J. Phys.

Progr. Theor. Phys. Suppl.

Int. J. Mod. Phys. A

J. Phys. G Nucl.

Mod. Phys. Lett. A

Nucl. Phys. A

Phys. Rev. C

Acta Phys. Pol. B

Phys. Rev. Lett.

EPJC

Phys. Lett. B

JHEP

Phys. Rev. D

0 5 10 15 20 25 30

CMS ATLAS

Citations (%)

LHCLEP

Samples in plots account for >90% of citations

Citations to HEP physics papers mostly come from journals specialized in HEP and a few related fields (astroparticle and nuclear physics)

Page 20: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 20

Sources of citations of technological papers

Int. J. Mod. Phys A

Comp.Phys. Comm.

Phys. Lett. B

Nucl. Phys. B Proc. Suppl.

JHEP

Phys. Rev. D

EPJC

JINST

TNS

NIM A

0 10 20 30 40 50 60

CMS ATLAS

Citations (%)

Rep. Prog. Phys.

Rev. Mod. Phys.

Ann. Rev. Nucl. Part. Sci.

Phys. Rep.

Int. J. Mod. Phys. A

Acta Phys. Pol. B

JHEP

Phys. Rev. D

Nucle. Phys. B

Comp. Phys. Comm.

Z. Phys. C

Nucl. Phys. B Proc. Suppl.

TNS

Phys. Lett. B

EPJC

NIM A

0 5 10 15 20 25 30 35 40

DELPHI ALEPH

Citations (%)

Citations from HEP physics and technology journals

LHCLEP

Page 21: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 21

2008-2011 More refined analysis of technological papers published since the start of LHC run

ATLAS CMS LHCb ALICE TOTEM LHC 0

5

10

15

20

25

TNS 2008-2011

Hardware Software DAQ-trigger

Nu

mb

er o

f p

aper

s

ATLAS CMS LHCb ALICE TOTEM LHC 0

5

10

15

20

25

30

35

40

45

50

NIM 2008-2011

Hardware Software

Nu

mb

er

of

pa

pe

rs

Page 22: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 22

ATLAS CMS LHCb ALICE TOTEM LHC 0

5

10

15

20

25

30

35

40 Hardware Software DAQ-trigger

Nu

mb

er

of

se

lf-c

ita

tio

ns

ATLAS CMS LHCb ALICE TOTEM LHC 0

5

10

15

20

25

30

35

40Hardware Software DAQ-trigger

Nu

mb

er

of

ou

tsid

e c

ita

tio

ns

TNS TNS

NIM ANIM A

Self-citations Outside citations

ATLAS CMS LHCb ALICE TOTEM LHC 0

10

20

30

40

50

60

70

80 Hardware Software

Nu

mb

er

of

se

lf-c

ita

tio

ns

ATLAS CMS LHCb ALICE TOTEM LHC 0

10

20

30

40

50

60

70

80 Hardware Software

Nu

mb

er

of

ou

tsid

e c

ita

tio

ns

2008-2011

Page 23: Maria Grazia Pia, INFN Genova 1 Publication patterns in HEP computing M. G. Pia 1, T. Basaglia 2, Z. W. Bell 3, P. V. Dressendorfer 4 1 INFN Genova, Genova,

Maria Grazia Pia, INFN Genova 23

ConclusionsSoftware is largely underrepresented in HEP scholarly literature w.r.t. hardware

Publication patterns appear similar in the LEP and LHC era

Citation patterns are different for publications by HEP experiments and about general software tools

Publish!…and don’t forget to cite