performance.netlib.orgperformance.netlib.org/tex/bib/pvm.pdfA Bibliography of Publications about PVM...

Preview:

Citation preview

A Bibliography of Publications about PVM (ParallelVirtual Machine) and MPI (Message Passing

Interface)

Nelson H. F. BeebeUniversity of Utah

Department of Mathematics, 110 LCB155 S 1400 E RM 233

Salt Lake City, UT 84112-0090USA

Tel: +1 801 581 5254FAX: +1 801 581 4148

E-mail: beebe@math.utah.edu, beebe@acm.org, beebe@computer.org (Internet)WWW URL: http://www.math.utah.edu/~beebe/

23 April 2018Version 3.171

Title word cross-reference

+ [BDV03, Cha02, HDB+13, Lee12]. 0[ICC02]. 1 [ICC02, LRQ01, VDL+15].$19.95 [Ano95b]. 2 [Bha98, BAS13, CGU12,ES11, KRKS11, KO14, WMRR17]. $24.95[Ano95c]. $27.50 [Ano96a]. 3 [And98,BCL00, BAS13, CP15, DYN+06, EFR+05,GCN+13, HF14a, HF14b, JR10, KO14,KD13, KHS01, KLR16, MSZG17, NSM12,SSS99, SH14, TPD15, WR01, YSL+12]. $35[Ano00a, Ano00b]. $35.00[Ano99a, Ano99c, Ano99b, Ano99d]. 3D[KA13]. $60 [Ano00a, Ano00b]. 3 [PBC+01].A [ARYT17]. α [JMdVG+17]. Ax = b[BG95]. D [UZC+12]. H2/H∞ [GWC95]. k

[She95, TK16]. M3 [JSH+05]. PVM+

[Wil94]. N[IHM05, Per99, Rol08b, SP99, SRK+12].SU(3) [BW12]. τ [RGDM15, RGDML16].XY [KO14].

-body [IHM05, Per99, SP99, SRK+12]. -D[DYN+06, SSS99, SH14, Bha98, ES11,KHS01, NSM12]. -Dimensional [LRQ01].-Lop [RGDM15, RGDML16]. -Means[TK16]. -Queens [Rol08b]. -set [She95].-stable [JMdVG+17].

. [Wil94].

/Fortran [TBG+02]. /many [KSG13]./OpenMP [VDL+15].

1

2

1 [HMKV94, SOHL+98]. 10-Gigabit[HcF05]. 100 [Str94]. 10th [DLO03, IEE96e].’11 [ACM11]. 11th [IEE97b, KKD04]. ’12[Hol12]. 128-processor [LL01]. 12th[DKD05, Bil95]. 13th[Ano95d, MTWD06, PSB+94]. 14th[CHD07, CHD09]. 15-18 [SL94a]. 15th[IEE95i, LKD08]. 16th [RWD09]. 17th[KGRD10, MC94]. 18-21 [DKD07]. 18th[CDND11]. 1990 [ACM90]. 1991[DE91, EJL92, IEE91]. 1992[KG93, R+92, VW92]. 1993[Ano94c, GGK+93, IEE93a, IEE93e,JPTE94, MMH93]. 1994 [Ano94a, Ano94e,DSZ94, DT94, GN95, GT94, HK95, IEE94h,PSB+94, SPE95, SPH95, VV95]. 1995[ACM95a, ACM96a, AGH+95, BH95, Gat95,Ham95a, IEE95b, IEE95a, IEE95d, IEE95h,IEE95i, JB96, NM95, Nar95, Ten95, UCW95,ZL96]. 1996 [ACM96b, Abr96, Boi97,ERS96, IEE96f, IEE96e, IEE96i, Ree96].1998 [ACM98b]. 1999 [ACM99]. 19th[TBD12, IEE05]. 1st [Abr96, BR95a,CGB+10, Kum94, Van95, Fer92].

2 [AKL99, BCAD06, BHS+02, BMPZ94a,CwCW+11, CD96, DPSD08, FST98a,FST98b, GFD03, GGHL+96, GT01,GHLL+98, GLT99, GLT00b, GLT00a,HGMW12, Jon96, LC97b, LSK04, MS02a,MK04, PS00a, SS99, SSL97, TRH00, VAT95,bT01a]. 2-D [BMPZ94a]. 2.0[BO01, LPD+11, LW97, Mat00b, NSM12].2.2 [HRR+11]. 2.X [KS96]. 2000[ACM00, LL01, LSK04, NU05, ZSnH01].2001 [ACM01, Old02]. 2003[ACM03, AS14, Don06, OL05]. 2004[ACM04]. 2005 [ACM05, DKD07]. 2006[ACM06a, MTW07]. 2007 [SM07]. 2008[SMCH15]. 2010 [CGB+10]. 2011 [LCK11].2012 [Hol12, TB14]. 2015 [IS16]. 21st[IEE95a]. 25nm [Ano03]. 26th[Ano93a, SL94a]. 27th [Ano94h]. 28th[ZL96]. 2D [ZZZ+15]. 2D-DWT [ZZZ+15].

2nd [FK95, IEE93c, Nag05, YM97].

3 [Bri95, Che10, GBH14, GPL+96, GLT12,Gro12, HDT+15]. 3-D [Bri95]. 3.0[Ano97, Bra97, BMR02, BRM03, DBB+16,KaM10, OP10]. 3.06 [Ano03]. 3.1 [WCC12].3.4 [Gei97, GKPS97]. 3.X [KS96]. 3000[HWM02]. 33rd [ACM95a]. 37th [ACM06a].3D [GAP97, Gra97, LO96]. 3D-Fall [Gra97].3rd [ACM06b, CZG+08, Ano95a, IEE96a].

4 [Ano03, HRZ97, KSHS01, NU05, SD13,SBT04]. 4.0 [DSGS17, JCP15, dOSMM+16].43 [UZC+12]. 45-degree [CT13]. 48th[IEE94e]. 4th[BDW97, EdS08, FF95, USE00].

5 [TRH00]. 512 [RBB97c]. 5th[AD98, Cha05, IEE94a, MdSC09].

600 [LSK04]. 6000 [AL93, NMW93]. 64[dCZG06]. 64-bit [Wil93]. 6th [ACDR94,DLM99, GT94, PW95, SHM+10, Sin93].

7th [ACM95b, CGKM11, DKP00, GN95,PBG+95].

857 [SMSW06]. 897 [HWS09]. 8th[CMMR12, CD01].

90 [Ben95, SM03]. 9076 [Bri95]. ’91[BG91, EJL92, IEE91]. ’92[Sie92a, Sie92b, VW92]. ’93 [Ano93f,GGK+93, GHH+93, IEE93a, IEE93e].93SC038 [FS93]. 93SC041 [Gle93]. ’94[BS94, DW94, GT94, IEE94b, IEE94h,PSB+94, SPE95, WPH94, dGJM94]. 947[LTDD14]. ’95[ACM95b, AH95, BH95, CLM+95, CJNW95,DMW96, FF95, HAM95b, IEE95l, Lev95,NM95, Van95, Ano98, FD97, KaM10].95/NT [FD97]. ’96[ACM96b, ACM96c, BDLS96, BFMR96,CH96, IEE96g, IEE96e, IEE96d, LHHM96,

3

Li96, Sil96, Was96, YH96]. ’97 [ACM97a].978 [Che10, SD13]. 978-0-12-415933-4[SD13]. 978-0-13-138768-3 [Che10]. 981[Riz17]. 9th [IEE95f, Kra02, YH96].

Aachen [Ano93a, GHH+93]. Abortable[CAWL17]. Abortable-locking [CAWL17].Abstract [MKW11, Wel94, BG94b, HTA08].Abstraction [SW12, YWTC15]. Abstracts[IS16]. ACC [APJ+16]. accelerate[SdM10, TBB12]. Accelerated[AB13, KA13, SCSL12, CP15, DCD+14,KM10, PTMF18, Sai10, iSYS12, SKM15,ZWL+17, ARYT17]. Accelerating[HF14a, HF14b, HKOO11, JK10, JLS+14,JNL+15, LSSZ15, LSVMW08, LSMW11,LAFA15, TMP16, TS12b, UZC+12,YEG+13, vdLJR11, HWX+13].Acceleration[CGBS+15, TK16, HE13, PRS16, SWS+12].Accelerator [APJ+16, SSAS12].Accelerator-Aware [APJ+16].Accelerators [AKL16, NTR16, SHM+10,MSZG17, UGT09]. Access[Bri10, HDT+15, IFA+16, JJPL17, LB98,SGH12, WTR03, CG99b, GBH14,HGMW12, LOHA01, MN91, SFL+94].accesses [TGL02]. accessible [BHW+12].Accident [Smi93a, SBR95]. According[LGM00]. ACCT [FVD00]. Accumulated[KS15b]. Accumulative [IH04]. Accurate[HD00, MLA+14, RSPM98]. Accurately[BGdS09]. Achieving[CBPP02, Gro01a, KKLL11, RH01]. ACM[ACM90, ACM95a, ACM95b, ACM97b,ACM98b, ACM04, ACM05, IEE02].ACM/IEEE [ACM97b, ACM98b, ACM05].ACO [Tsu12]. ACPC [Bos96, Vol93].Across [NE98, AL96, CZ95b]. ACSCI[Van95]. Active [CSAGR98, Pla02, SKH96].Activities [MSS97, CMV+94]. activity[Vet02]. Ad [IBC+10, ITT02]. Ad-Hoc[IBC+10]. Ada [Tou96, KP96, Tou96].Adam [Ano95b]. adaptable [BCM+16].

Adaptation [WST95]. Adapted [Uhl95a].Adapting [VFD02]. Adaptive[Ano94b, BCMR00, BKdSH01, Bir94,CKO+94, FSC+11, HWX+13, KK98, KT02,LFL11, MKC+12, MBES94, MRB17,MAGR01, OKW95, Ran05, RA09, SHM+12,SGZ00, SS09, STY99, Sta95a, TMW17,ZSG12, BDP+10, CLSP07, DLR94, EZBA16,EASS95, IDS16, LCL+12, SLGZ99, TCBV10,Was95a, Wil94, FSC+11].Adaptive-CoMPI [FSC+11]. Adas[HHC+18]. Adding[CB00, GRV01, PSM+14]. Address[SS01, DO96]. addresses [CGL+93].ADDT [SR96]. ADI [Sch01]. adjacent[Kan12]. adjoint [RMNM+12]. Adjusting[GSHL02]. ADOL [BGK08]. ADOL-C[BGK08]. adoption [CMV+94]. Adsmith[LKL96]. Advanced [Ano98, Ano00a, D+95,Gei96, Gei97, GLT99, GLT00b, GLT00a,GLT12, KG93, SSAS12, TG94, Ben95].Advances [Bha93, BBH+08, CHD07,CDND11, KGRD10, KKDV03, KKD04,KKD05, LKD08, LK10, MTWD06, RWD09,TBD12, AD98, BC14, BDW97, CD01,DKD05, DLM99, DKP00, DLO03, HPS+12,Kra02, HPS+13, IEE97a]. Advection[AKK+94, CT94a, TC94, CT94b].Advection-Chemistry [AKK+94].Aerospace [MAB05]. Affine [DMB16].Affinity [ETWaM12, AGG+95, NAAL01].Affordable [Rol94]. again [Har94]. against[GHD12]. Age[MdSC09, Ano94f, GJLT11, HK95]. AGEB[SAS01]. Agent[Mat01b, MCB05, ZWZ+95]. agent-based[MCB05]. agents [KBA02]. Aging[LRBG15]. Aging-Aware [LRBG15].AIMS [Yan94]. Air [AKK+94, BZ97,MPD04, MSML10, BTC+17, SH94, Syd94].airspace [TCP15]. Aix [GA96, Ano01a].Aix-les-Bains [GA96]. Al [Ano95b].Alamos [Old02]. Albuquerque[IEE91, IEE95d]. ALDY [GS96]. ALE

4

[HAA+11]. Algebra[BDT08, CDD+13, Coo95b, IS16, MGMH97,Neu94, van97, BKvH+14, Cal94, Coo95a,PMZM16, dCH93]. Algebraic[CGPR98, Lev95]. Algorithm[ACMR14, BST+13, BP99, BT01b, DYN+06,FJBB+00, HA10, HD02b, ITT02, MW98,PKD95, PB12, RDMB99, SAS01, Sch96a,SWH15, Sta95b, TK16, WHDB05, ART17,AAAA16, ARL+94, AD95, BB95, BAV08,BY12, BCM+16, CCU95, CT13, CSW99,GM94, GCN+13, GGL+08, GKK09, GP95,HWS09, IM95, JR13, KDSO12, KY10,KWEF18, Kan12, KBP16, KN17, KO14,KRC17, LYZ13, MM92, MLVS16, MK00,NB96, NAJ99, OKW95, OMK09, PGBF+07,PSLT99, Ram07, RJC95, RAGJ95, Sch96b,SOA11, Sur95a, TNIB17, Was95a,YULMTS+17, ZSK15, ZWL+17, dH94,van93, HWS09, LTDD14, Riz17, SMSW06].Algorithm-based [PKD95].Algorithm-Dependant [BP99].algorithmic [RJDH14]. Algorithms[ACM95b, ATC94, ADRCT98, ASA97,CCSM97, DALD18, DAK98, DK06, FB94,GAMR00, GK10, HO14, HHK94, IEE96d,KK02a, LHHM96, Li96, LAD16, MTSS94,MGMH97, MBS15, Nar95, Pet97, PBK00,SG15, VRS00, AK99, AL92, BHJ96, BMS+17,BID95, DDLM95, FR95, FP92, GWC95,HL17, HPLT99, HKOO11, HS95b, Jou94,JRM+94, KL95, KRG13, LFL11, LNW+12,MTK16, MJG+12, NP12, Ols95, PP16,Pan95b, PBK99, PD11, PCS94, RHG+96,SPE95, Sur95b, TSZC94, WCVR96, YLZ13].alias [SOA11]. alias-free [SOA11]. aligned[AGIS94]. Aligners [SMM+16]. Alignment[dOSMM+16, AMHC11]. all-port[RJMC93]. All-to-All[LZH17, LZH18, Tra02b]. Allocation[AGS97, BS01, DGG+12, RFRH96]. alloy[TG94]. ALM [PZ12]. Altera [TK16].Alternative[EM94, SWHP05, Tra12a, EKTB99].

ALWAN [HB96a, HB96b, MSB97].Amazon [ZLZ+11]. AMBER [SL95].AMBER4 [VM95]. American [Ara95].AMIP [Gat95]. Among [CB16]. AMPI[ZHK06]. AMPIC [CCHW03]. amplified[EZBA16]. AMR [NLRH07]. AN2 [HBT95].analogue [WWZ+96]. analyses [ANS95].Analysis[BHW+17, BR02, BGG+02, BBC+00,BDL98, CGLD01, EML00, FK01, FJK+17,Hol12, JF95, KL94, KNT02, KRG13, LCK11,MCLD01, NAW+96, NMS+14, Ost94, PZ12,PGAB+05, SPL+12, SBR95, SN01,TFGM02, Whi04, WM01, BB93, BBDH14,BBH+15, Che99, DSGS17, EPP+17, GR95,GFB+14, GKS+11, GE95, GE96, GT07,JB96, LC07, LLG12, LL16, LBH12,MMB+94, MMW96, MLA+14, MJPB16,Pat93, PHJM11, PGAB+07, SdSCP13,iSYS12, SS94, SDJ17, SPH95, Shi94, Sil96,SWL+01, SSG95, TMC09, TW12, TFZZ12,Uhl95a, Uhl95c, VM94, YCL14]. analytical[BHW+12, HK09, JS13, KN17]. Analyzer[JJPL17, KKM15]. Analyzers [Ano01a].Analyzing[BRU05, DF17, FM09, HG12, HcF05, PFG97].anasslich [Ano94c]. Anatomy [KWEF18].Andrew [Ano99c, Ano99d]. animal [LM99].anisotropic [LBB+16, SSB+16, YSVM+16].’Annai [CEF+95]. Annapolis [IEE96c].Annealing [FH97]. Annecy [VW92].Anniversary [Ano92, Ano93e]. annotated[GGH99]. Annotation [MGA+17].Announcements [Ano98]. Annual[ACM95b, Ano93b, Ano94h, IEE95b, USE00,Van95, Y+93, ACM95a, Eng00, IEE94e,IEE95l]. Ant [ITT02]. ante [Ano03].antenna [DSOF11]. Anthony[Ano95c, Ano00b]. Antonio[Ano95d, IEE95g, IEE97c]. Any[Gro02a, Mar07]. AP [PBC+01, SMTW96].AP/ [SMTW96]. AP1000[SH96, IM94, SWJ95]. AP3000 [TD99].API [DM98, LPD+11]. APIs [WCS+13].

5

APOLLO [Sta95b]. APOLLO-II [Sta95b].Appendix [Ano01a]. Appendixes[Ano01a]. APPL [AB93b, AB93a].Application[AKE00, BSN95, BGdS09, BS07, BFM97,BBH+15, Cha02, CRGM14, DFMD94,FDG97a, FDG97b, FSC+11, GB98, HT08,JFY00, JCH+08, KNT02, LD01, LMRG14,Mal01, MTSS94, MBB+12, NSLV16, NS16,PSSS01, Riz17, SBF+04, ST02a, SCL97,UTY02, ZZ04, ABC+00, ADMV05, ADR+05,BvdB94, BFLL99, BL97, BMPS03, CRM14,CRGM16, EPML99, FMFM15, GWVP+14,HZ96, KME09, LSG12, LCMG17, MMW96,MM03, MLA+14, MvWL+10, NMW93,RBAI17, Rol08b, SSS99, SFSV13, SL00,TCP15, Wor96, ZZZ+15, CG99a].application-centric [SFSV13].Application-Level[CRGM14, LMRG14, SBF+04, SCL97,BMPS03, CRM14, CRGM16, LCMG17].Applications[APJ+16, AGS97, Ano89, Ano96c, AZG17,BCLN97, BHV12, BBH+06, BRU05,BFMT96b, BFBW01, CBL10, CGLD01,Cha05, CJNW95, CRGM14, Cot98, CTK00,Cot04, Cza02, Cza03, DW02, DLM+17,DERC01, DHK97, DGF97, DGMJ93, EV01,EML00, FLD98, FD00, FGRD01, Fer92,FK95, Fin00, FC05, FM09, GKP97, GK10,HMK09, Hus98, IEE95l, ITT02, Jes93b,JJPL17, KB98, KBS04, KGK+03, KKP01,KK02b, Kuh98, Laf01, LAdS+15, LRG14,kLCCW07, LMRG14, dLR04, MSOGR01,MS02a, Mar02, Mat01b, MAB05, MC98,MG15, MANR09, PSM+14, Rei01, RPM+08,RBB15, RRBL01, SPL+12, SG12, SC04,SSB+17, TTSY00, TFGM02, VdS00, VY02,Vos03, Wal96a, WC09, Wis96a, WSN99,WBH97, WM01, dGJM94, ACH+11, ACJ12,Ano93a, Ano94f, Ano03, Ara95, Arn95,AGMJ06, BKH+13, BR04, BDV03].applications[BAG17, BFM96, BFMT96a, CGBS+15,

CDMS15, CLSP07, CBM+08, CIJ+10,CFPS95, CCHW03, CCM+06, DZ98a, DSZ94,D+95, DCH02, EKTB99, EGH99, EDSV09,FE17, FNSW99, FCS+12, Fin94, Fin95, FF95,GBR15, GS02, GHD12, GS96, GHH+93,HZ99, HAJK01, JC17, JPTE94, LMG17,LCMG17, LS08, MA09, MBKM12, MLC04,MSMC15, MS96b, NSBR07, NCB+12,NFG+10, PK05, PTL+16, Rab99, RS95,SJLM14, SPE95, SBG+12, SDJ17, SGH12,SG05, SLG95, SB01, SD16, TMC09, TBB12,Vet02, Wis96b, Wol92, WMP14, XLW+09,YZ14, ZLZ+11, BP93, TDBEE11, ATC94].Applied[FGRD01, HC06, KaM10, HMKV94, MM92,NF94, PGK+10, DMW96, Was96].Approach[AZG17, BHM94, BJ93, BHNW01, CRGM14,CD98, DLM+17, FFP03, GCBL12, HD00,KBA02, KK02a, LGM00, Mar06, PPR01,Pet00a, Pet00b, RGD13, Ros13, TJPF12,BK11, Bis04, BTC+17, CLYC16, CDP99,CRGM16, DiN96, EO15, FMS15, HDB+13,JS13, KPL+12, KSSS07, KJEM12, LSG12,MGG05, MS99b, NEM17, OW92, SVC+11,SEC15, TWFO09, WO09]. Approaches[JCH+08, Ney00, SWHP05, SM02, BFLL99,CB11, PS00b]. Approximate[Huc96, MM02, GGC+07, GG09, MM03].Approximation [SLJ+14, SJLM14]. April[ANS95, AH95, Ano93g, Ano94h, CH96,DR94, GH94, Ham95a, IEE92, IEE93b,IEE95f, IEE96e, IEE97b, IEE05, LCHS96,MC94, Nar95, Sie94, SW91, Ten95]. APS[GT94]. AQUAgpusph [CP15]. arbitrary[HP11]. ARCH [Ada97, Ada98].architectural [GGC+07]. Architecture[BG94a, CGC+11, EBKG01, EM02, FD97,Fuj08, HRZ97, IEE97c, ITKT00, LSZL02,PT01, PS01b, SMM+16, SC04, WKP11,YTH+12, BBCR99, BG94c, CSPM+96,CS96, DiN96, FHC+95, HK09, MRH+96,PWD+12, SWYC94, SSGF00, Squ03, SP11,WCC+07, YAJG+15, YEG+13, ZWZ+95].

6

architecture-independent [DiN96].Architectures [ACM95b, BDT08, BFG+10,CHPP01, HD02a, HD02b, HHK94, IEE96d,KDT+12, LHHM96, Li96, LZH17, LAD16,MS02b, MTSS94, MCS00, NO02b, Nar95,PZ12, TSCaM12, BDP+10, BN00, BKML95,CLM+95, CDZ+98, DM93, DZZY94, GDC15,GP95, Hos12, LCL+12, LDJK13, MLC04,NO02a, PY95, RFH+95, RMNM+12, SPL99,TDG13, TSZC94, Uhl95a, VDL+15, WST95,dlAMC11]. Area [CDHL95, Fis01, BHW+12,FGT96, FGG+98, KHB+99, Qu95].area-based [Qu95]. arising [ARvW03].Aristotle [FSV14]. Arithmetic[Ano98, JPT14, Sur95a]. Arithmetics[HD00]. Arizona [IEE95b, JB96]. ARM[MGL+17]. Array[DDPR97, HD02b, WG17, CCM12, DK13,HSE+17, JKN+13, Ott93, Wal02]. arrays[HCL05, RBS94]. Arrival[FPY08, MLVS16]. art [LF93b]. artifact[ZZZ+15]. Artificial [BPG94]. ARTUR[FJBB+00]. ARVO [BHW+12]. ARVO-CL[BHW+12]. ary [Pan95a]. Ascona [DR94].Ashes [Thr99]. ASL [FGRT00]. ASME[LF+93a]. aspects [CG99a]. assembly[TPD15]. Assessing [LMG17, dLR04,MABG96, TSCaM12, CMV+94].Assessment[Mat01b, TAH+01, Boi97, LH98].Assignment [Cza13, CK99]. assist [Kik93].Assisted [GTH96, GM13, MBBD13]. Astro[CC17]. Astronomical [JB96, SPH95].asymmetric [GCN+10]. Asynchronous[Ada97, Cav93, CZ95a, CDP99, HE02,BBDH14, BCK+09, CZ95b, DDYM99, Sch99].Athapascan [CP98]. Atlanta[AGH+95, Ara95, USE00, UCW95]. ATM[GFV99, HBT95, Jon96, LHD+94, LHD+95].Atmosphere [BS93]. Atmospheric[HK93, RSBT95]. atom [MGG05]. Atomic[LRT07, LAFA15, SYF96, DS13, Hin11,SY95, XF95]. atomics [BDW16]. atoms[JLS+14]. Attacks [PV97, GHD12].

Attraction [GB96]. audio [BJ13]. August[ATC94, Agr95a, BFMR96, DMW96, GT94,HAM95b, IEE94g, IEE95k, IEE95l, IEE96f,LF+93a, Ost94, PSB+94, PBG+95, Ree96,VV95, Was96]. Austin [IEE94b].Australasian [Bil95]. Australia[GN95, Nar95, ACDR94, Bil95]. Australian[ACDR94, GN95]. Austria[Bos96, BH95, Kra02, TBD12, Vol93].Austrian [Fer92, FK95].Austrian-Hungarian [Fer92, FK95]. Auto[CC17, DWM12, DBLG11, RDLQ12, WG17,FE17, SH14, TWFO09]. Auto-Generation[CC17, DWM12]. auto-parallelization[TWFO09]. Auto-scoping [RDLQ12].Auto-Tuning[WG17, DBLG11, FE17, SH14]. AutoLink[GMPD98]. AutoMap [GMPD98].Automata [Car07, BBK+94]. Automated[BMPS03, MVY95, LLG12, RFRH96, Yan94].Automatic [BVML12, BBH+08, BGK08,BHK+06, CBL10, Cza03, DW02, EML98,EML00, FAFD15, FFM11, GKCF13, HZ99,JFY00, JJY+03, JJPL17, KOI01, KHS12,MGA+17, NCB+17, OWSA95, Rab99,RGD13, SZ11, SR96, SSB+17, TJPF12,WC15, WM01, APBcF16, AMuHK15,AGG+95, BR04, BHRS08, CHKK15,CdGM96, CPR+95, HZ96, LME09, LF93b,WMP14, ZHK06, FVD00]. Automatically[WBSC17]. automation [Ano93a].automotive [Ano93a, Ano93a].Autotuning [BAG17]. Auxiliary[STMK97]. Available [Bak98, BF98].Avoidance [CRGM14]. AVTP [FHC+95].award [Str94]. Awards [Str94]. Aware[APJ+16, BHP+03, EGR15, HVA+16,LRBG15, MJB15, Pan14, ZLP17, CGH+14,GHZ12, HJYC10, HG12, JKN+13, KBG16,MBBD13, MSMC15, SHM+12, SPK+12,WRSY16]. awareness [HK09, VGS14].AXAF [NH95].

B [Ano01a]. Back [BIC+10]. Backend

7

[IOK00]. Backup [Gua16]. Bains [GA96].Balance [HE02]. balanced [EZBA16].Balancing [BKdSH01, DBA97, DI02, DK06,GCBL12, MM02, PT01, Pus95, ST97,Wal01a, Bir94, BS05, DZ96, DLR94,DvdLVS94, DR95, FMBM96, FH97, Hum95,JH97, MM03, NP94, SGS95, SY95].Balatonfured [DKP00]. balls [BBH+15].Baltimore [IEE02, SPH95]. Bamboo[NCB+12]. banded [DG95]. Bandwidth[NE01, RK01]. Bangalore[Kum94, PBPT95]. Barbara[ACM95b, AH95, IEE95f]. Barcelona[DLM99]. BARRACUDA [EPP+17].Barrier [CLdJ+15, SDB+16, YLZ13].Based [Ada97, AHD12, AAB+17, AP96,BHW+17, BDG+91b, BoFBW00, CAM12,CGC+02, CLP+99, CDPM03, DW02,DBK+09, FSC+11, FC05, For95, FSLS98,GSxx, HF14a, HF14b, HM01, Hus00, KLR16,LSZL02, LZH18, kL11, LWP04, LAFA15,MDM17, MGL+17, MMH98, NSLV16, NE01,NHT02, NPS12, PPT96a, PCY14, PFG97,PSSS01, RDMB99, SPL+12, SM03, Smi93a,ST02b, ST97, SJK+17a, SJK+17b, THS+15,TD98, WTTH17, WC09, WZHZ16, Wis96a,WM01, WJB14, YG96, YTH+12, ZWJK05,Ada98, AASB08, AAAA16, AVA+16, Ano03,BLPP13, BDG+92a, BCH+03, Bri95,BFMT96a, CwCW+11, CC10, CKmWH16,CRM14, CXB+12, DXB96, FE17, FFB99,FJZ+14, FNSW99, FSTG99, FFFC99,FWS+17, GS91a, GS92, GKS+11, Gra97,Gra09, GFPG12, HZ94, HWX+13, IM95,ITT99, JKM+17, KPL+12, KPNM16, LV12,LRW01, LKL96, LNW+12, LGG16]. based[LMM+15, MYB16, MMO+16, MKP+96,MCB05, MT96, MS99a, MS99b, MFPP03,Neu94, NHT06, OLG+16, OP98, PARB14,PES99, PPT96b, PK05, PAdS+17, PGK+10,PSHL11, PKD95, PSK+10, PSLT99, Qu95,Rag96, SJLM14, SS09, SG05, SSS99, SZ11,SVC+11, SLS96, SKB+14, Sto98, Str96,SLN+12, TBB12, TY14, TBD96, TWFO09,

TMPJ01, WO09, WTFO14, Wis96b, WCS99,YC98, YL09, YWC11, YSL+12, ZAFAM16,ZLP17, ZHK06, ZZG+14, ZWZ+95, vHKS94,BFMT96b, FH97, KSJ95, WAS95b, FO94,GK97, KSJ96, PY95, Sut96, TSZC94,ZPLS96]. Basel [Ano94i]. Basic[PGC02, BKvH+14, BR94]. basierte[Gra97]. Basis [OMK09, RB01]. Bath[BP93]. Bayesian [Fer10]. BC [IEE95i].BCS [FFP03]. BCS-MPI [FFP03]. be[CB00]. Beach [IEE93b]. beam[OIH10, RCFS96]. bearings [NF94].Beguelin [Ano95b]. Behavior [BFM97,DeP03, Ros13, LLG12, PPF89, YMYI11].behaviour [EPML99]. Beijing[CZG+08, LHHM96, Li96]. Beitrage[Ano94c]. Belgium [LCHS96]. Benard[TVV96]. Benchmark[BWV+12, DS16, HC10, Luo99, Mul02,MBB+12, RSPM98, RTH00, SGJ+03,Tra12b, UTY02, Ano03, BKML95, DWM12,DH95, DHS96, Mul03, MvWL+10, PHJM11,Reu01, RST02, Wor96, YSWY14].Benchmarking [GC05, HCA16, LCY96,MMU99, MCS00, WRA02, RST02].Benchmarks[CRE99, KS96, KAC02, MM07, NA01, RK01,TSB02, TSB03, WAS95b, ZSnH01, CDD+96,MMH99, Ste94, WT11, CE00, WT12].Beneficial [CB00]. benefit [SBG+12].Benefits [PSM+14, SIRP17].Benutzerprofile [Wil94].Benutzertreffens [Ano94c]. Beowulf[CMM03, Ste00, UP01]. Beowulf-Class[Ste00]. Berlin [PW95]. Bessel [KT10].Betriebssystemkern [Sei99]. Better[Str94]. Between[AAB+17, BS07, ASS+17, AKE00, BID95,GFV99, JAT97, LDCZ97, MSP93]. Beverly[IEE93f]. Beyond[Gei93a, GKPS97, Gei98, Gro12, Olu14,Gei93b, LSG12, Sch93, SHM+10].Biconjugate [GFPG12]. bidirectional[HE15]. Big

8

[GTS+15, LK14, VPS17, ASS+17, Str94].Biharmonic [RB01]. Bill [Ano99c, Ano99d].billion [KTJT03]. Billions [MRB17].binary [CG93, EPP+17, SGS95, TCBV10].binary-level [EPP+17]. binary-splitting[TCBV10]. Binding[CLL03, Coo95b, MG97, Coo95a]. Bindings[Ano98, VGRS16]. Bioinformatics[BBH12]. Biological [CNM11, BA06].Biomolecular [BCGL97, PZKK02]. BIP[CDP99, Tou00]. BIP-Myrinet [Tou00].BIP/Myrinet [CDP99]. bit[HLO+16, Wil93]. bit-parallel [HLO+16].bitonic [PSHL11]. Black[FSXZ14, Kha13, van93]. BLACS[DSW96, DS96a, Wal95]. blame [DSGS17].BLAS [Add01, ARvW03, FMFM15].BLASTP [LSMW11]. Block[DDPR97, SMM+16, WO95, ZB97, ADDR95,DR18, GP95, HKMCS94, HC08, WO96].Block-Cyclic[DDPR97, WO95, HKMCS94, HC08, WO96].block-tridiagonal [DR18]. Blocking[FH98, BCH+08, HKT+12, Nak03, HTA08].Blood [Pat93]. Blue [KMH+14, AAC+05,BGH+05, EFR+05, LM13, MV17, MSW+05].blurred [Wil94]. BMMC [CC99]. bodies[AGIS94, LHLK10]. Body[RB01, RTRG+07, IHM05, NS16, Per99,SP99, SRK+12, ADB94]. BOF [Mat00a].Boltzmann[MS95, Pri14, SJK+17a, SJK+17b]. Bonn[MTWD06]. Book [Ano95b, Ano95c,Ano96a, Ano99a, Ano99c, Ano99b, Ano99d,Ano00a, Ano00b, Che10, Mar06, Nag05,Per97, SD13, Vog13, Vre04, YM97]. books[YM97, Nov95]. Boosting [LRG14, SFO95].Boston [IEE94e]. Both [BGD12, KP96].Bottleneck [MWG97]. bottlenecks[DSG17, JKHK08]. Boulevard [ACM99].Bound [ASA97, MBKM12, ADMV05].boundaries [KGB+09]. boundary[PTT94, SBQZ14, SP11, SD99].boundary-value [SP11]. bounded

[MdSAS+18, PAdS+17]. BowMapCL[NTR16]. Box [JR13, JPP95].Box-counting [JR13]. brackets [GSMK17].Braga [IEE96g]. Branch[ASA97, ADMV05]. Breaking [OS97].breast [Str94]. Brest [IEE94c]. Bridge[VDL+15]. Bridges [DSS00]. Bridging[ACM04, AAB+17, ASS+17]. Bringing[FKKC96]. Brisbane [ACDR94, Nar95].Bristol [MC94]. British [IEE95a, IEE95e].Broadband [OIS+06, CLLASPDP99].Broadcast [PSM+14, YSP+05, MTK16].Broadcasts [SE02]. Brownian [SKM15].Brussels [LCHS96]. BSGP [HZG08]. BSP[Mar06, Bis04, GRRM99, Mar09, Roh00].BSP2OMP [Mar09]. BT [WT11, WT12].Budapest [FK95, KKD04]. Buffer[SEF+16, Tsu07]. buffers [MR96]. Build[HRSA97]. Building [FD04, Gei01, Gro02a,LBD+96, LVP04, WADC99, Arn95, HS95b,MSL12, PW95, Sur95b, Kos95b]. Bulk[Cer99, DLRR99, HZG08, TNIB17].bulk-synchronous [HZG08]. Burrows[NTR16]. Burst [SEF+16]. BUS [ITT99].BUSTER [XWZS96]. Butterfly [ST17].Butterfly-Patterned [ST17].

C [Gal97, Pri14, SSL97, TBG+02, VDL+15,Vre04, BGK08, BB00, CNC10, CCHW03,DARG13, Don06, FLMR17, FHK01, GTH96,GSI97, Gor01, KK02a, KPO00, LYSS+16,MHSK16, Qui03, SSB+17, SC95, TNIB17,UZC+12, YULMTS+17, YSVM+16, ZT17].C# [WLR05]. C-to-CUDA [UZC+12].C/C [KPO00]. C11 [BDW16]. C2CU[TNIB17]. CA [ACM95b, Ano89, BBG+95].Cache [LZH17, LZH18, MM07, NIO+02,NIO+03, SS01, SVC+11]. Cache-Coherent[SS01]. cache-friendly [SVC+11].Cache-Oblivious [LZH17, LZH18].Caching [kLCCW07, DO96, WMRR17].CAE [KDL+95a, KDL+95b]. CAF[GBR15, Mar05]. Caffe [AHHP17].calculating [EZBA16, KD12]. Calculation

9

[QRMG96, GSMK17, KN17, MM95, NS16,SR11]. Calculations[RB01, Sta95b, ART17, WH96]. calculus[PQ07]. Calif [IEE93f]. California[ACM97b, Gat95, IEE93a, NM95, USE94,AH95, GE95, GE96, Has95, IEE93b, IEE93f,IEE94g, IEE95c, IEE95f, LF+93a]. Call[DW02, MCP17]. Call-Graph [DW02].Calls [FHK01, AGLv96]. CALPHAD[TKP15]. Cambridge[Ano95b, Ano95c, Ano96a, Ano99a, Ano99c,Ano99b, Ano99d, Ano00a, Ano00b].CAMeL [KDL+95a, KDL+95b].CAMeL/PVM [KDL+95a, KDL+95b].CAMP [CLM+95]. Can [Gro02a, SBG+12].Canada [BG91, GGK+93, IEE95a, IEE95i,Vos03, IEE95e, Lev95]. Cancellation[TBS12]. cancer [Str94]. Cancun [Sie94].CAP [GTH96, MGMH97]. CAP-Specified[MGMH97]. Capabilities [Gei97, CG99a].capability [BBH+13b]. capable [KYL03].capacity [RCG95]. Capture [DW02].Capturing [FM09]. card [SR11]. Cardiac[ORA12]. cards [KY10, KME09]. Carlo[ADRCT98, AK99, DAK98, HJBB14,NSLV16, RR00, RP95, SK00, SKM15,WH96, ZZ04]. Carnegie [IEE94d].Carolina [ACM95a]. cars [Str94].CASCON [GGK+93]. Case[AIM97, BF01, BWW+12, BfDA94,BHLS+95, CML04, DARG13, DHP97,GL97a, GMdMBD+07, HHC+18, KCR+17,LSB15, RRBL01, SCL01, Tha98, BJ13,BJS99, Bri00, FO94, MS96b, PGK+10, Pri14,SIRP17, TPD15, Wal01b, ZSK15, LPD+11].casting [KGB+09]. CATCH [DW02].Cavanaugh [IEE93c]. CAVE [BBH+15].CAVE-CL [BBH+15]. cavities [BBH+15].Cavity [PKYW95, RM99]. CBFEM[OMK09]. CC [GB96, KYL03]. CC-COMA[GB96]. ccNUMA[CHPP01, CBPP02, MCS00, SSGF00]. CCp[BB00]. CE2014 [MBS15]. CEBAF[DZDR95]. Celebrating [EO15]. Cell

[DBK+09, JMS14, VDL+15, OOS+08,OIS+06]. Cells [MRB17]. Cellular [Car07].Cenju [GPL+96, KSHS01]. Cenju-3[GPL+96]. Cenju-4 [KSHS01]. Center[ACM98b, ACM99, ACM00, Hol12, IEE94b].centered [JPOJ12]. Centers [EGR15].Centre [IEE95e]. centric [SFSV13].century [IEE95a]. CERN [VV95]. Cesena[CH96]. Cetraro [D+95, KG93]. cf4ocl[FLMR17]. CFD[SPE95, AMS94, ADT14, CP97, HAJK01,HT01, JR10, DK02, PBK00, YPAE09].CFD-DEM [ADT14]. CG [ABF+17]. Ch[CNC10]. Chain [FK01]. Challenge[DGMJ93, LB96]. Challenges[Agr95a, Gro01a, Gro12, Ree96, Ten95,Wit16, BDG+92c, GScFM13]. Chamfer[YPZC95]. Chandra [Stp02]. Channel[GK97, LBD+96, SG05]. CHAOS [BLW98].Characteristic [OMK09]. Characteristics[WR01, WT12, BN00, GL99, WT11].Characterization [KB98, MM07, Wor96].Characterizing[BCM11, BGdS09, GScFM13, OdSSP12].Charge [BL95]. Charm [ZHK06]. Charts[DSS00]. Check [MC17, LCC+03].checkerboard [BW12]. Checking[CGZQ13, Gro00, HMK09, LCC+03,MdSAS+18, PAdS+17, RAS16, SMAC08,YYW+12]. Checkpoint [SSB+05, SBF+04,CRM14, ZWZ05, ZHK06, BDB+13].checkpoint-based [CRM14, ZHK06].Checkpoint-on-Failure [BDB+13].Checkpoint-Recovery [SBF+04].Checkpoint/Restart [SSB+05].Checkpointing [DCH02, LMRG14,SSB+05, TSS00b, BMPS03, BCH+08, CG96,LCMG17, PKD95, SSCC95, Ste96].chemical [NMW93]. Chemistry[AKK+94, BR95a, DMW96, SSGF00].Chemkin [Ano97, Bra97]. CHEMPI[RR01]. Chicago [CGKM11]. China[CZG+08, IEE97a, LHHM96, Li96]. Chip[Jes93b, URKG12, TDG13, dCZG06].

10

Cholesky [DG95, LC97b]. Chromosome[BM97, dOSMM+16]. Chromosome-Wide[dOSMM+16]. CICADA [MK94]. Circuit[WPC07, BJ95]. Circuits [GJN97].Circular [Tsu07]. Circulation[GAM+02, Nes10, RSBT95]. CIS [AH00].citation [Squ03]. City [Hol12]. civil[PW95]. CL [BHW+12, BBH+15, LW95].CL-PVM [LW95]. CL ARRAY [ZT17].clarified [WBBD15]. CLAS [DZDR95].Class[DFN12, Ste00, Dem96, MSL96, RFH+95].Classes [DeP03, GG09, Ott93]. classic[HL17]. Classical [BCGL97]. clauses[WC15]. Clemson [ACM95a]. Client[Ano93e, FSLS98, KS97, kLCCW07, Mat01b,Sch93, Sto98, Vis95]. Client-Agent-Server[Mat01b]. Client-Server[FSLS98, Sto98, Vis95]. Client-Side[kLCCW07]. Client/Server[Ano93e, Sch93]. climate [Str94]. CLIPS[Ano95a, Ano95e]. clMAGMA [CDD+13].clock [NB96]. CLOMP [BGdS09]. clone[ZWL+17]. Closer [HCZ16]. Closure[CGPR98, KH15, PPR01]. Cloud[SIS17, URKG12, ZLZ+11, ZLP17, GHZ12,GWVP+14]. Cluster[AUR01, BKGS02, BL95, BM97, CRE99,CMM03, HD02a, ES11, GGGC99, Gei94,Gei00, GSN+01, GT01, GC05, HD02b,ITKT00, IDD94, KKH03, KS96, KS01,KHS01, LR01, MFTB95, MM01, NO02b,OF00, PFG97, RB01, RsT06, RLL01,SCR92, SHHI01, SHTS01, ST02a, TOTH99,Tra02b, bT01a, AL93, BLP93, BALU95,BTC+17, BID95, CCF+94, Cou93, ED94,GK97, GMU95, Heb93, KEGM10, KO14,LC07, Liu95, MW93, MM03, NO02a,PDY14, RJDH14, SS94, SR95, ST02b,SLS96, SY95, SSN94, Tho94, THM+94,Tsu95, UH96, YWO95, ZLZ+11, MS04].cluster-based [SLS96]. Cluster-enabled[SHHI01]. clustered [KHB+99]. Clustering[BBH12, HA10, RJC95, GGL+08, YCL14].

Clustern [MS04]. Clusters[AH00, AHHP17, BDH+95, BDH+97,BWV+12, CSC96, DK06, GMdMBD+07,GSY+13, HPP02, HSMW94, HVA+16,Hus00, JNL+15, LC97a, LH95, LVP04, MS98,MFPP03, Pan14, PKB01, PT01, PS00a,Pus95, Rei01, dOSMM+16, SFG98, SvL99,Ste00, Tou00, UP01, WLNL03, WT12,YWCF15, YKI+96, AB95, ALR94, ADB94,ABG+96, ADMV05, BWT96, BDV03, Bru95,CRE01, EKTB99, GBF95, HCL05, Hus99,JKHK08, Jon96, JR10, JRM+94, KYL03,KYL05, KSL+12, KJEM12, LBD+96, Lee12,LLC13, LL95, LKYS04, NMW93, NN95,PS07, PRS+14, PM95, PR94c, PRS16, PL96,RCFS96, RGDML16, Slo05, SC96a, SL95,TFZZ12, WLNL06, WLYC12, YST08, YL09,YHL11, YWC11, ZHS99, dCH93]. CM[SBG+02]. CMMD [Har94, Har95]. CMPI[GHZ12]. CMS [FMS15]. CNF[IKM+01, IKM+02]. CO[ACM01, AHHP17, HJ98, Wal02].Co-Array [Wal02]. Co-designing[AHHP17]. Co-processed [HJ98]. Coarray[GBR15, YBMCB14]. coarrays [SMCH15].Coarse [ADRCT98, IOK00, KOI01, LGM00,NIO+02, NIO+03, Heb93, RJC95].Coarse-Grain [IOK00]. coarse-grained[Heb93, RJC95]. coarsening [PSLT99].Coast [IS16]. Coastal [GAM+02].CoCheck [MS96b, Ste96]. Code[AHP01, And98, BCGL97, CB00, CP97,CCK12, CCBPGA15, DDL00, DZDR95,HE02, KaM10, KAMAMA17, KHS01, LD01,MS02b, MM07, PBC+01, RGD13, SM03,SZBS95a, Sta95b, TGBS05, AMS94, ADB94,AFST95, BCAD06, BADC07, BW12, Bha98,Bri95, Cou93, DLR94, EZBA16, FMFM15,GSMK17, Heb93, IJM+05, KPL+12, KH10,MRH+96, MWO95, PKE+10, PSK+10,RP95, SZBS95b, SK00, SFLD15, SMSW06,TBD96, VBLvdG08, VDL+15, Wor96, YL09].codebooks [PMM95]. Codes [FAFD15,JFY00, SWH15, HWS09, HASnP00, JPP95,

11

KBG+09, LRW01, Mal01, OLG+16, WB96].Coding [Uhl94, Uhl95b, SCC96].Coefficients [MW98, ARYT17]. cognitive[PWD+12]. Coherence [MM07]. Coherent[SS01]. Collaborative [DCPJ12, DCPJ14].Collapse [PKYW95]. Collecting [BMR01].Collection [LTRA02, DH95, MGC+15].collection-oriented [MGC+15].Collections [JFGRF12]. Collective [BIL99,BIC05, CCA00, FVD00, FCLG07, FPY08,GLB00, GMdMBD+07, Hus99, KH96,MJG+12, PGAB+05, SG15, TRG05, VFD02,WRA02, HS12, HG12, HWW97, KHB+99,KBHA94, KMH+14, MBBD13, Pan95b,PGBF+07, PGAB+07, RJMC93, SCB14,SCB15, SS99, TD99, Tra12a, TFZZ12].Collectives [CSW12, SvL99, Zah12].Collector[GTS+15, WK08a, WK08c, WK08b].College [AGH+95, Ano94h]. Collision[QRMG96, Sta95b, ART17, FFFC99,LHLK10]. Collocative [MKW11]. Colony[ITT02]. Colorado [R+92, IEE05]. Colt[WN10]. Columbia[IEE95a, IEE95e, MAB05]. column[HSP+13]. column-stores [HSP+13].COMA [GB96]. Combined[CBHH94, TJPF12]. Combining[DP94, Rab98, SCB14, Sch96a, SMAC08,YPAE09, Bor99, Sch96b]. comes [Ano94f].Coming [HK95]. Commands [OLG01].comments [Str94]. commerce [Ano94f].commercial [Ano93g]. commodity[GGL+08]. Common[HEH98, DK13, WLR05]. Communicating[FKK+96b, GMPD98, FKK96a].Communication[ABF+17, BCG+10, BIL99, BIC05, DCPJ12,DZZY94, EM02, FST98a, FJK+17, FGKT97,FBSN01, GFD03, GFB+03, GGS99, GFV99,GLB00, GC05, HB96b, HC10, HDB+12,HC06, HIP02, KB98, KV98, KBG16, LRT07,LC93, LCVD94a, MH01, MMH98, MR96,Nit00, PLK+04, RK01, RRAGM97, RsT06,

SWHP05, SCP97, SGH12, SBG+02, SJ02,ST02b, SGL+00, SKH96, Sum12, TRG05,TGT05, TRH00, Tra02b, UMK97, WBH97,XH96, YC98, ZSG12, FH98, BHJ96,BVML12, BBH+13b, BS94, BMG07,CAHT17, CGL+93, Dem96, DWM12,DCPJ14, DGB+14, DBB+16, DS96b, GK97,GM13, Gra97, GL94, GB94, HB96a,HWX+13, Hus99, HWW97, KH96, KB01,KYL03, KYL05, KHB+99, LR06b, LFL11,MLAV10, MMU99, MABG96, Pan95b,Par93, PGK+10, PM95, PKE+10, PSK+10,PS00b, SH14, SC95, TG09].communication[Tra12a, Vet02, Wu99, WMP14].communication-based [PGK+10].Communication-buffers [MR96].Communication/Computation [HIP02].Communications [BPS01, CP98, CDHL95,CDH+95, FVD00, FST98b, GT01, GBS+07,GMdMBD+07, IEE95b, IEE95e, LZH17,LZH18, MB00, VFD02, YTH+12, bT01a,ADLL03a, ADLL03b, CDP99, HS12,KBHA94, MBBD13, McR92, MN91, MS99c,RGDML16, SCB14, SCB15, TD99, WLYC12].Communicators [DFKS01, GFD03,GFD05, FKS96, KH96, MJG+12].communities [ACM04]. Community[BHW+17, FCP+01]. Como [CLM+95].COMOPS [Luo99]. Compact[Uhl94, Uhl95b, Wor96]. compaction[VSW+13, WK08a, WK08b, WK08c].Compactly [KLR16]. Comparative [KB98,PSK08, SN01, AGR+95b, ED94, YCL14].Comparing[BF01, Fin97, GBR15, HVSH95, ICC02,LKJ03, ORA12, SSG95, WBSC17].Comparison [BvdB94, BS07, HC10,KBM97, LCW+03, Mat94, Mat95, Ney00,OP10, OF00, PPJ01, Pok96, RS93, RBB97a,SS01, SHH94b, VS00, Wal02, ZBd12, Ahm97,AB93b, BLP93, BID95, GMU95, Har94,Har95, JS13, KDSO12, KC06, MSP93, Ols95,PS07, PSHL11, Pri14, SdM10, SYR+09,

12

SWS+12, SHH94a, TSZC94].comparison-based [PSHL11].Comparisons [GGS99, PGC02, CLYC16].Compass [PWD+12]. Compatible[MM14, LBH12, OIH10]. Compcon[IEE93a]. compete [Ano96a]. CoMPI[FSC+11, FCS+12]. Compilation[FSSD17, HKMCS94, LRBG15, SBW91,Coe94, FM90, PGS+13, SHM+12]. Compile[GB94, TSY99, JE95]. Compile-time[GB94]. Compile/run [TSY99].Compile/run-time [TSY99]. compiled[KYL03, KYL05]. Compiler[Ano98, Dan12, IOK00, KSS00, KSHS01,MB12, Mar09, MKW11, SSE12, SKS01,TJPF12, TBG+02, TGBS05, BAG17,HEHC09, LME09, LHC+07, LLCD15, MA09,Mul03, PP16, RKBA+13, SHHI01, THH+05].Compilers[Ano01a, CFF+94, LZ97, MKV+01, SBT04,SS96, Hos12, PBG+95, ZT17]. Compiling[DMB16, Hos12, CGK11]. Complete[BdS07, GHLL+98, Nag05, Per97, SOHL+98,YM97, Ano99a, Ano99c, Ano99b, Ano99d,PRS+14, SOHL+96]. Completed [PTT94].Complex [BCGL97, GMPD98, MBS15].Complexity [NPS12]. component[HLP10, KRKS11, Squ03]. Components[BT01b, CT02, Fin00, Gro02a, Lus00, Wis01,LRW01]. Composed [Wel94]. Composing[PHA10]. composite [MALM95, YPA94].Compositing [GPC+17]. Composition[CTK00, Cot04, DLB07, FC05, KH15,CFP96]. compound [LLC13, SAP16].comprehensive [RST02]. Compression[FSC+11, KBS04, VPS17, AAAA16, HE15,UH96, Wu99]. compression-based[AAAA16]. COMPSAC [IEE95l].Compton [BCD96]. Computation[BKGS02, B+05, Cer99, DSM94, DSS00,EMO+93, ESM+94, Fer10, FF95, GS91b,HIP02, IEE94a, IEE96c, KS15b, Mar06,MR12, MSCW95, Nag05, PPR01, Sie92a,Sie92b, SMOE93, WTTH17, ACM97a,

ABDP15, Bis04, BALU95, Bos96, BHKR95,CL93, CMH99, CKP+93, DZZY94, HLM+17,HK94, KB01, KJJ+16, KG93, Lev95,MLAV10, Neu94, NZZ94, NCKB12, PF05,PKE+10, Roh00, Shi94, SH14, TBB12,TPD15, TW12, Vol93, Wan97, Was96, SM07].computation-communication [SH14].Computational[ALR94, CMM03, DFMD94, JFY00, KH15,Liv00, MBS15, R+92, SZBS95a, SM07, SN01,TDBEE11, TGEM09, WPH94, Whi04,AGMJ06, BvdB94, BDG+92c, BR95a,HVSC11, KBG+09, PBK99, RBB15, SPE95,SZBS95b, STT96, Str94, VDL+15, BR95a,CCHW03, R+92, SL94a, WPH94].Computationally [DFN12].Computations [AGH+95, ACGR97,CGU12, CGPR98, IH04, PBK00,PMvdG+13, WJ12, ANS95, AASB08, BL99,CG93, DMW96, EGDK92, HJYC10, KD13,MRRP11, MR96, Smi93b, SAP16, TS12b].Compute [DBK+09, KKLL11, ZLZ+11].computed [FWS+17, SSS99]. Computer[ACM06a, Ano94a, GTH96, IEE95l, IEE96h,IEE97c, IS16, KCR+17, Neu94, Old02,PSB+94, ST02a, Sum12, Ten95, URKG12,YTH+12, BN00, BS94, BKML95, BFM96,Cal94, CLM+95, GRTZ10, JWB96, Str94].Computer-Assisted [GTH96]. Computers[Ano89, BP99, BCL00, DGMJ93, FFP03,GC05, IEE95b, IEE95e, ITKT00, LF+93a,MFTB95, PSZE00, SPM+10, SS96, BvdB94,BB93, BBK+94, DLR94, Duv92, ESB13,GBF95, KOS+95a, LR06a, MMB+94, NF94,POL99, PBK99, Wal94a, Wal94b].Computing[ACM97b, ACM98b, ACM00, ACM01,ACM04, ACM06b, ACDR94, AIM97, BJ93,BBG+95, BDG+93a, BGR97a, BL95,BCP+97, BRST94, BDH+95, BDH+97,BHNW01, BBH12, CZ95a, CGB+10, CLL03,CNC10, Cze16, DDS+94, DERC01, DPP01,DKM+92, DGMS93, DT94, FTVB00,Fer98b, FGKT97, Fos98, FS93, GLN+08,

13

GS92, Gei93a, GBD+94, GSxx, Gei00, GN95,GL97a, GT94, Gua16, Hol12, HT01, IEE92,IEE93d, IEE93c, IEE94g, IEE95c, IEE95k,IEE95i, IEE96a, IEE96f, IFI95, KK02a,KS97, LCK11, LRG14, LC93, LR01, Lus00,dlFMBdlFM02, ME17, Mat94, Mat95, MS04,Nov95, PKYW95, PR94b, SHTS01, SCSL12,Sin93, SSSS97, Ste00, SGS10, SW91, Sun90a,Sun90b, Sun92, Sun93, Sun94a, Ten95,VV95, VW92, WN10, YH96, YG96,ACGdT02, ARYT17, AL92, AH95, ASCS95,Ano93g, Ano94e, Ano94h, Ano03, ADDR95].computing[AMV94, BPG94, BDG+92a, BDG+94,BKML95, Bru95, BHW+12, CZ95b, CZ96,CHKK15, DLRR99, DKD08, DW94, D+95,DMW96, DE91, EKTB99, EJL92, FBD01a,FGRD01, FO94, FS95, Fer98a, FS98,FME+12, FHC+95, GGGC99, GS02, GS91a,GS93, Gei93b, Gei94, GH94, GkLyCY97,HP05, HW11, HH14, HPY+93, HS95a,HH95, mH12, IEE97a, IM95, JPOJ12, JY95,JJM+11, JPTE94, KO14, Kos95b, KSSS07,LV12, LH98, LCHS96, LHD+94, LHD+95,LM13, Maf94, MZK93, Mal95, Mar07,PGS+13, PKB06, Pen95, PGK+10, PTT94,PBG+95, PNV01, PWD+12, RBS94,RJDH14, Sch93, SGS95, SMS00, STT96,Sti94, SP11, Sun94b, SGDM94, Sun95, SD99,TJD09, TKP15, TDB00, Tho94, TSS98,VM94, Vis95, Was96, YULMTS+17, YLC16,YSL+12, Zem94, ZWL13, ZGC94, ZHS99,ZKRA14, ACM98a, Kon00]. Computing[PW95, Per96, SCR92, TGEM09, Ano95b].Concept [KaM10, LTR00, SB95]. concern[Ano94i]. Concurrency[ME17, NPS12, DGB+14, PTG13].Concurrent[Ano89, BDG+91b, BRS92, BHV12, BKH+13,DG95, GS91b, GS92, GSxx, Gre94, HS93,Sun92, Sun93, ZDR01, BDG+92a, FS95,GS91a, GS93, LPD+11, NP12, RGDML16,RCG95, Sun94b, SGDM94, Wal94a, Wal94b,WK08a, WK08b, WK08c, ZWZ+95].

condensed [MC99]. Condition [GK10].Condor [CF01, PL96]. conduction[iSYS12]. Cone [RCFS96, OIH10].Conference[ACM90, ACM94, ACM96b, ACM96c,ACM97b, ACM98b, ACM04, Abr96, ATC94,AGH+95, Ano89, Ano93f, Ano94a, Ano94e,Ano94i, ACDR94, BBG+95, B+05, Boi97,Bos96, BFMR96, BH95, CGB+10, CH96,DSM94, DSZ94, DKD07, DKM+92, ERS95,ERS96, EJL92, FF95, Gat95, GN95, GT94,Ham95a, HAM95b, HS95a, HS94, Hol12,IEE92, IEE94f, IEE95b, IEE95a, IEE95e,IEE95i, IEE95l, IEE95j, IEE96a, IEE96d,IEE96h, IEE96i, IEE02, LCK11, LF+93a,MMH93, Nar95, OL05, PR94b, Ree96, R+92,SPE95, Sil96, SM07, Sin93, SW91, USE95,USE00, VW92, Vol93, WPH94, Y+93, YH96,ACM95a, ACM05, ACM06b, ANS95,Ano93b, Ano93c, Ano95a, BR95a, Bil95,BDLS96, DR94, Eng00, GH94, JPTE94,LCHS96, Mal95, PW95, Van95, ZL96,ACM94, Ano94g, IEE95b, KKDV03].Configurable [IEE94d, PKB+16, BB94].configurations [PTL+16]. conflict[TCP15]. conformational [MK94].Congress [CJNW95, GHH+93, PSB+94,BH95, dGJM94]. Congressi [GT94].Conjugate [BG95, GFPG12, MM92, Ols95].Connected [BT01b, KRKS11, OF00, Pet01].Connectivity [Whi94]. Conquer[CTK01, Cza02, Cza03]. conscious [ZA14].consistency [WBSC17, YYW+12].Consistent [TGT10, CG96, CG99a].Console [PES99]. Consortium [BRST94].Constrained [BSH15, EGR15]. Construct[DP94, EM94]. Constructing [DM93].construction [ART17]. Constructs[KDT+12, PGC02, BKH+13, BN00].consumer [ACJ12]. Contact [Nak03].CONTAIN [SBR95]. containers[Str12, ZT17]. content [GFB+14].contention [ALW+15, DSG17, Zah12].Context [DGG+12, DR18, MdSAS+18,

14

OLG+16, PAdS+17, SCB15].context-bounded [MdSAS+18, PAdS+17].Contexts [CS14]. Contiguous [WTR03].continual [NS16]. continuation [VY15].Continuous [TA14]. Contract [KPNM16].Contract-based [KPNM16]. contrarian[KSSS07]. Contrasts [GGS99]. Control[FLD98, FM09, IEE94e, MSS97, MBKM12,SFL+94, SHPT00]. controller [GWC95].convection [CEGS07, TVV96].Convention[ACM98b, ACM99, ACM00, Hol12, IEE94b].Converse [BK96]. Conversion [ZG95b].convex [GCN+13]. convolutions[DZZY94]. Cook [SD13]. Cooperation[Wis01, Str94]. Cooperative[DGF97, DiN96, HRSA97, kLCCW07,Pet00a, Pet00b, JKN+13, SHLM14].Coordinate [OP98]. coordinated[BCH+08]. COORDINATION[CH96, KAHS96, FKK96a, CH96]. Copley[IEE94e]. Copperhead [CGK11]. Copy[SWHP05]. copying [SH96]. CORBA[DPP01, Fin97, LRW01]. Core[ABB+10, Bri10, CZG+08, LZH17,SOHL+98, YGH+14, YTH+12, ACMZR11,BBG+14, BL99, FHB+13, HTA08, JR13,JJM+11, JR10, KSG13, LLCD15, LLH+14,MBBD13, PZ12, SFSV13, SVC+11, TFZZ12,VDL+15, WCC+07, WYLC12, dCZG06,MMH98, Nag05, Ano99a, Ano99b]. Cores[BBG+11, DT17, BMS+17, WO09]. Corfu[SM07]. correct [DM93]. correction[BCD96, FME+12]. Corrections [BL95].Correctness [HMK09]. Correlated[MM07]. corruption [FME+12].Coscheduling [GRV01, SGHL01]. Cosenza[KG93]. cosmological [BADC07, Sai10].Cost[KS15b, RLL01, GK97, GWVP+14, Wu99].costs [GB94]. Cots [HHC+18]. count[KVGH11]. counters [Rab99]. counting[JR13]. County [ACM98b]. Coupled[MBS15, SS01, SBR95, Gra97]. Coupling

[BS93, KR09, SB95, WB96]. course[STT96]. CoW [KMG99]. CPPvm [Gor01].CPS [Mat94]. CPU[DF17, JR13, KSL+12, Lee12, LRG14, LLC13,LFL11, OFA+15, PDY14, Pri14, SSB+17].CPU/GPU[KSL+12, Lee12, LLC13, OFA+15, SSB+17].CPU/multi [SAP16]. CPUs[KH12, LNK+15, ON12, SFSV13, YSWY14].CPVM [CG96]. Cracow [BDW97]. cranial[NAJ99]. CRANIUM [MBES94]. Crash[LCVD94b]. Crash-simulation [LCVD94b].crashworthiness [LCVD94a]. Crawler[Wal01a]. Cray [BL94, GRRM99, MP95,Sch96a, Sch96b, ABG+96, AZ95, AFST95,CCSM97, LKJ03, LSK04, MWO95, Oed93,RBB97c, SWS+12, SCC95]. CRAY-T3D[Sch96a, Sch96b]. CRAY-T3E [Che99].Creation [Hat98, MFC98, PS00a]. Crew[GHL97]. CRI [MSCW95]. CRI-MAP[MSCW95]. Critical[DSGS17, SLN+12, SDJ17]. Critical-blame[DSGS17]. critical-path [SDJ17]. cross[JR13]. cross-platform [JR13].cryptanalysis [BSN95]. Cryptographic[PV97, ABDP15]. cryptosystem [WLC07].CS [FST98a, FST98b, Jon96]. CS-2[FST98a, FST98b]. CS/2 [Jon96]. CT[DYN+06, NAJ99]. CT-scans [NAJ99].cube [Pan95a]. Cubes [DERC01]. CUDA[Pri14, AMuHK15, AAAA16, ACMZR11,Ano12, BHS18, BY12, BTC+17, BAG17,BSH15, BBH12, CAM12, CGU12, CNM11,CLYC16, CBM+08, CSV12, CB11, Cza13,DCD+14, DS13, DR18, DARG13, DLV16,DWL+10, DWL+12, DM12, EPP+17, ER12,FJZ+14, Fer10, FMFM15, FFM11, FWS+17,Fuj08, GDC15, GScFM13, GLN+08,GML+16, GFPG12, GWVP+14, GRTZ10,HE13, HJBB14, HVA+16, HLM+17, HD11,HLP10, HP11, HLP11, Hog13, HF14a,HF14b, HKOO11, HT08, HLO+16, JK10,JC17, JLS+14, JFGRF12, KRKS11, KD12,KAMAMA17, Kha13, KS13, KmWH10,

15

KVGH11, KME09, KO14, KH15, KD13,KA13, Lan09, LRG14, LGKQ10, LLG12,LSSZ15, LBH12, LSVMW08, LSMW11,LAD16, LBB+16, LYSS+16, LYZ13,MMO+16, MR12, MSML10, MdSAS+18,MGL+17, MM14, NSLV16, NS16, NBGS08,OIH10, ORA12, PGS+13, PRS+14, PHJM11].CUDA[PAdS+17, PSHL11, PTMF18, PRS16,RBAI17, Ros13, SSE12, SK10, iSYS12,SDJ17, STK08, SS09, Seg10, SKM15, SP11,SR11, SJK+17a, SJK+17b, TNIB17, TS12b,TA14, TCP15, Tsu12, UZC+12, WG17,WJ12, WMRR17, WWFT11, WJB14,XXL13, YULMTS+17, YHL11, YZ14,YMYI11, ZSK15, ZAFAM16, ZZG+14,ZBd12, ZLS+15, ZZZ+15, dlAMC11,dlAMCFN12, vdLJR11, Che10, SD13, Vog13].CUDA-Aware [HVA+16]. CUDA-based[AAAA16]. CUDA-BLASTP [LSMW11].CUDA-C [YULMTS+17].CUDA-compatible [LBH12].CUDA-Enabled[LSMW11, DS13, SR11, ZLS+15].CUDA-NP [YZ14]. CUDA-quicksort[MMO+16]. CUDA-sharing [PRS+14].CUDA-to-OpenCL [GScFM13].CUDA/MPI [LYSS+16]. cudaBayesreg[Fer10]. CUDAEASY [Sai10]. CUDAlign[SdM10, dOSMM+16]. culling [LHLK10].CUMODP [HLM+17]. CUMULVS[GKP97]. CURAND [Ano12]. Current[Bak98, GFD05, IFI95, BDG+93b, FK94,FHP+95]. Curse [OS97]. Customization[GSY+13]. cut [CG99a, CXB+12].cut-through [CXB+12]. CVL [Har94].Cybernetics [IEE95a]. cycles [PL96].Cyclic[DDPR97, WO95, HKMCS94, HC08, WO96].Cyclops [dCZG06]. Cyclops-64 [dCZG06].

D [And98, DYN+06, SSS99, SH14, VDL+15,Bha98, BCL00, Bri95, BMPZ94a, BAS13,CGU12, CP15, EFR+05, ES11, GCN+13,

HF14a, HF14b, JR10, KRKS11, KO14,KD13, KHS01, KLR16, MK94, MSZG17,NSM12, TPD15, WMRR17, WR01, YSL+12,vHKS94]. D-CICADA [MK94]. DAC[Cza02, Cza03]. Daemon [LB98]. Dagum[Stp02]. d’Aix [GA96]. d’Aix-Marlioz[GA96]. Dallas [ACM00, IEE95l]. Dame[IEE96i]. damping [YPA94]. DAMPVM[Cza02, Cza03]. DAMPVM/DAC[Cza02, Cza03]. DAMS [CD98]. Dangers[BCP+97]. DaReL [KN95]. Data[AJF16, BMR01, BCG+10, BGD12,CKmWH16, DERC01, DiN96, EGR15,EASS95, GTS+15, GB98, GMPD98, Gua16,HA10, HB96b, HC06, JDB+14, KA13, LK14,LDJK13, MV17, Man01, ME17, MGA+17,MJB15, NJ01, NPP+00b, NPP+00c, NA01,NLRH07, PCY14, Rei01, SGH12, SPK96,SR96, Str12, THS+15, WO95, Wel94,ZDR01, ZG95b, AB95, ASS+17, AGG+95,BK11, Ben95, BR12, BID95, CFKL00,CGK11, CGL+93, DRUC12, EP96, FB97,Fan98, FVLS15, FME+12, FKK+96b,FWS+17, GE95, GE96, HB96a, HC08, JB96,JCP15, JE95, JPOJ12, KN95, KJJ+16,KRG13, LOHA01, LF+93a, LL16, MA09,MMB+94, MMM13, MR96, NCB+12,NCB+17, NPP+00a, OPP00, PDY14,RJMC93, SJLM14, SSS99, SPH95, SK92,TW12, WO96, YCL14, YWO95, ZRQA11].data-centered [JPOJ12]. Data-Driven[ME17, NCB+12, NCB+17].Data-Intensive [Rei01]. Data-Parallel[AJF16, GB98, CKmWH16, SPK96, CGL+93,FKK+96b, MMB+94, MR96, SK92].data-parallelism [BR12].data-privatization [KRG13].Data-Structures [GMPD98]. Databank[FCP+01]. Database [AR01, BFZ97, EK97,MWG97, MM14, PPT96a, MN91, PPT96b,PPT96c, PMZM16]. databases[BA06, Bos96, ZWL13]. Dataflow[DT17, CSPM+96]. Datasets[VPS17, KGB+09]. Datatype

16

[Gro00, SWHP05, KHS12]. Datatypes[JDB+14, RTH00, SGH12, Tha98, CAHT17,THRZ99]. Dave [Stp02]. David[Ano96a, Ano99a, Ano99b, Nag05].DawnCC [MGA+17]. DAWNING[HWM02]. DAWNING-3000 [HWM02].Day [IS16]. dbx [NE98, NE01]. DC[B+05, IEE94h, IEE95k]. DCE[Sch93, FLD96, RS93, Sch93]. DDL [FB97].Deadlock[LZC+02, SG12, HPS+12, HPS+13].Deadlocks [FJK+17]. Debbuger [WCS99].Debugger [HM01, NE01, CH94, CG99b,MT96, XWZS96]. Debuggers [Ano01a].Debugging[BDGS93, GKP96, KKV01, KV98, Mor95,NE98, Wis97, ZLL+12, BL97, BS96a,DKF93, HLOC96, KCD+97, MLA+14].December [Bil95, Eng00, HHK94, IEE96a,Kum94, NM95, PBPT95, Y+93].Decimation [PCY14]. decoder [MC17].Decomposition [BJS97, CP97, EGH+14,DBVF01, ETV94, OMK09, SHHC18].decompositions [NZZ94]. deconfliction[TCP15]. Dedicated[WLNL03, Hus99, WLNL06]. Deep[AHHP17, SEC15]. Defined [Gua16].Defining [GAML01]. Deformable [STK08].Deforming [GAP97]. degree [CT13].degrees [KTJT03]. Delegation [YTH+12].Delegation-Based [YTH+12]. Delft[DSZ94]. Delivering [Hus98]. Delphi[ACGdT02]. Demand [CTK00]. Denmark[DW94, DMW96, Was96]. Dense[AKL16, BDT08, CDD+13, Fuj08, Hog13,PMvdG+13, ZBd12, BRR99]. Densities[MW98]. Density[BL95, MC17, CBHH94, ZWHS95]. Denver[ACM01, IEE05, R+92]. Dependable[GM95]. Dependant [BP99]. Dependence[LAdS+15]. Dependency [PPR01].Dependent [DFA+09, HO14, MFTB95,DM12, LBB+16, LYSS+16, ON12, SSB+16,TVV96, YPA94, YSVM+16, YSMA+17].

DEPICT [HM01]. Deploying[PKB01, CLLASPDP99]. depth [SSS99].Derivation [GB98]. Derived[JDB+14, RTH00, SWHP05, Tha98,CAHT17, Jou94, THRZ99]. Descent[Sch01]. description [TKP15]. descriptors[LNW+12]. Design[AS92, AAC+05, Ano01b, ACD+09, BCD+15,BBH+13b, BS96b, BMR02, BRM03, CLP+99,ETWaM12, FD02a, FFP03, GG09, HWM02,JSH+05, KVGH11, kLCC+06, kL11, LVP04,Man94, MMSW02, NPS12, OFA+15, Pan14,PLK+04, PCS94, SBG+02, SWYC94, SSL97,SPK+12, Sum12, THM+94, USE94, VGRS16,BR91, CARB10, CSS95, DS96b, FD02b,GL94, GkLyCY97, KA95, LC07, MAS06,OA17, PGK+10, PTW99, SL94b, Sep93,Sil96, SSD+94, SWL+01, Wal94a, Wal94b].design-pattern [MAS06]. designed[BSH15]. Designing[GKZ12, LAD16, SWHP05, SH14, WYLC12,ZLP17, AHHP17, DSOF11, Pan95b].Designs [HVA+16, AAAA16, MC17, Shi94].desktop [Mar07]. Detailed[DLV16, RSPM98, BTC+17, LR06b]. detect[Str94]. Detecting[AGG+95, PPJ01, ZRQA11]. Detection[BHW+17, CSW12, CBL10, CFMR95,DMMV97, EML98, FME+12, HHC+18,KSJ14, SG12, ZDD97, BBH+15, DKF94a,HDDG09, HGMW12, HPS+12, HPS+13,LZC+02, RAGJ95, TCP15, TDG13,TWFO09, WTFO14, YULMTS+17].Detector [DZDR95]. Determination[LAFA15]. Determine [BP99].Deterministic [CFMR95, DK02, ZLL+12].Develop [PD98]. Developer [IEE96i].developers [Str94]. Developing[BFZ97, CCSM97, Cot98, DDLM95, Reu03].Development [Ano01a, BDG+91b, BR95c,CHPP01, Cha02, Cot97, Cza02, DeP03,PS01a, SK00, SB01, TBD96, TDBEE11,ARvW03, ABC+00, BL97, BDG+92a,DSZ94, DHP97, KCD+97, LLC13, MMW96,

17

PES99, TBB12, ZL96, Sei99].Developments [Mat00a]. device[KKLL11, LS10, SBQZ14, YWTC15].Devices [GJN97]. DFB [WWZ+96]. DFN[RS93]. DFN-RPC [RS93]. Diagnosis[AP96, LAdS+15]. diagnostic [RSBT95].dictionary [LSSZ15]. Diego[Has95, LF+93a, NM95]. Difference[UZC+12, GFPG12, HE13, NZZ94, NB96,Pri14, Ram07, Str94, VM94]. Differences[AKE00, LDCZ97]. Different[AIM97, GL97b, JCH+08, Ney00, Rab98,RBB97a, BN00, PY95]. Differential[MFTB95, Riz17, JK10, NF94, RBB15, SP11].Differentiating [Cer99]. Differentiation[BBH+08, BGK08, CdGM96]. Diffusion[HF14a, HF14b, MW98, CEGS07, DM93,MM92]. Digest [IEE93a, IEE95c]. Digit[DALD18, LAD16]. Digital[KLR16, CIJ+10]. Dijon [YH96]. Dimemas[GLB00]. Dimensional[Car07, GA96, HD02b, KD12, LRQ01,MW98, SJK+17a, SJK+17b, AL93, KT02,LSSZ15, Ols95, PR94c, Ram07, RG18].Dimensions [SAS01, Ano93g, HP11].dipolar [LBB+16, LYSS+16]. DIPORSI[GGCGO01]. DipSystem [SPL99]. Direct[Bri10, GPC+17, LB98, WJB14, BCM+16,Gra09, HWS09, MM11, SWH15]. direction[BDG+93b]. Directions[IFI95, FK94, FHP+95, Sun96]. directive[LV12, NO02a, YL09]. directive-based[LV12, YL09]. directive/MPI [NO02a].Directives [BBG+01, BKO00, CCBPGA15,JFY00, LOHA01, VGS14]. directory[JCP15]. Discovering [FJK+17]. discovery[BK11, GWVP+14]. Discrete [ST17].diskless [PKD95]. Disks [dlFMBdlFM02].Dispersion [RSV+05]. Displacement[BJS97, PSSS01]. Dissemination [GL97a].Distance [MR12]. Distances [LAFA15].Distributed[AGS97, Ano95e, BMS+17, BME02,BGR97a, BL95, Bha93, BJ95, BRST94,

BT01b, BHKR95, CGB+10, CLL03, CSW97,CC99, DMB16, DBA97, DFMD94, DGF97,DHHW92, DHHW93a, EMO+93, ESM+94,FH95, Fan98, FTVB00, FK01, Fos98, FS93,FFFC99, GGCM99, GGCGO01, GCGS98,GCBM97, GWC95, GM95, HJ98, HC10,HRSA97, IEE93d, IEE93c, IEE94d, IEE94g,IEE95h, IEE95k, IEE95i, IEE95g, IEE96b,IEE96g, IEE96f, IEE05, JML01, KBA02,KP96, KDL+95b, KL95, KK02b, KSHS01,LC93, LHD+94, LHD+95, MZK93, MB12,MFTB95, MSCW95, Mat95, MBE03,NSBR07, NZZ94, NH95, Pen95, PKYW95,Pet00a, Pet00b, PTT94, PMM95, PBK00,PD98, PMvdG+13, RGD97, Sch94, SA93,SMOE93, SW91, Sun90a, Sun90b, TSS00b,THN00, Wil93, WO97, WCS99, YH96,ZDD97, ZDR01, AMBG93, AGR+95b, AB95,Ano94e]. distributed[Arn95, ADMV05, BSC99, BB95, Bir94,BMPZ94a, CBPP02, CH94, CEF+95,CBHH94, CLLASPDP99, CPR+95, CK99,DLR94, DR94, DHHW93b, DR95, EGH99,FB97, FS95, FS98, FHC+95, FHB+13,GBR97, GCN+10, GKK09, GkLyCY97,GP95, HPY+93, HHA95, IEE97a, JWB96,KN95, KSG13, KJJ+16, KDL+95a, LR06b,LFS93a, LFS93b, LH98, LKL96, Liu95,Maf94, MVTP96, Man98, MLC04, NAJ99,OLG+16, PK05, POL99, Par93, PR94c,RAGJ95, RFH+95, SSH08, SHHI01, SL94b,Sch93, SFL+94, SSC96, SPL99, Smi93b,SD99, TSP95, THM+94, Uhl95a, VM94,VB99, Vet02, Vis95, Wal94a, Wal94b, WPL95,Wan97, YLC16, YWO95, YX95, YPZC95,YZPC95, ZL96, ZGC94, ZHS99, Pet01].distributed-data [FB97].Distributed-Memory[CSW97, CC99, KN95, SSH08].distributed-shared [ADMV05].Distributing [AL92]. Distribution[HB96b, MJB15, NPP+00b, NPP+00c,NA01, SR96, AGG+95, CSW99, GS96,HB96a, JMdVG+17, KRC17, NPP+00a,

18

RJMC93, Wil94]. Distributions[ST17, WO95, HKMCS94, WO96, vHKS94].Divergence [SdSCP13, VSW+13].diversity [EO15]. Divide[CTK01, Cza02, Cza03].Divide-and-Conquer[CTK01, Cza02, Cza03]. DMMP [BB93].DMPI [HWM02, ZLL+12]. DNAml[CDZ+98]. DNMR [SR11]. docking[ESB13, ZWL13]. Document[MHSK16, AD95]. Documentation[BDG+xx]. Documents [Ano98]. does[KC94]. dog [LK14]. Domain[BMR01, CP97, EGH+14, kL11, ETV94,HE13, Nel93, NZZ94, Olu14, OMK09,Ram07, SHHC18, VM94]. Domaine [GA96].Domains [KR09]. Dongarra[Ano95b, Ano96a, Ano99a, Ano99b, Nag05].dOpenCL [KSG13]. Double[FKKC96, PTT94]. down [Str94].Downloadable [Ano98]. DP[Arn95, KLR+15]. DPVM [IHvA+00].draft [DHHW93b, GL92]. Draw [ST17].Dresden [MdSC09]. Driven[AIM97, ME17, PCY14, Hin11, NCB+12,NCB+17, Qu95, SIS17, TWFO09, WTFO14].Dror [Stp02]. drug [GWVP+14]. drugs[Str94]. DSIR [LTR00, RTL99]. DSM[KBVP07]. DSMPI [SSC96, SSC97]. DTM[PS07]. DTS [BHKR95]. Dual[BBC+00, GAM+02, DK02, CT13, LSSZ15].dual-dictionary [LSSZ15]. Dual-Level[BBC+00, GAM+02, DK02]. dual-scanline[CT13]. Dublin [LKD08]. During [DeP03].Dust [dlFMBdlFM02]. DVFS [PTL+16].DWT [ZZZ+15]. Dyn [WLNL03, WLNL06].Dyn-MPI [WLNL03, WLNL06]. Dynamic[ACGR97, AGS97, AUR01, CGLD01,CKmWH16, CML04, CK99, CTK01,DMB16, DBA97, DFMD94, FMBM96, FD00,GFD03, GFD05, GRV01, GCBL12,GMPD98, GL95a, KFL05, NPP+00c,NLRH07, PK98, PLK+04, PT01, Ran05,Smi93b, SY95, TS12a, VdS00, Vet02,

Wal01a, Wil94, YST08, Zel95, DDLM95,EO15, FH97, FCS+12, FKLB08, JC17,MSMC15, NSBR07, NF94, OKW95, RBAI17,RCG95, SCB14, SCB15, SKK+12, SKB+14,WRSY16, YPA94, DvdLVS94, FCS+12].dynamically [SSS99]. DynamicPVM[DvdLVS94]. Dynamics[BST+13, BCGL97, DR97, JFY00, KBM97,dlFMBdlFM02, MH01, OS97, SZBS95a,SA93, TDBEE11, TGEM09, YWCF15, ZB94,ALR94, ABG+96, AGMJ06, BvdB94, BHS18,BvdSvD95, BBK+94, BMPZ94b, BMPZ94a,CC00b, FHSO99, HVSC11, JAT97, JMS14,KFA96, KPK13, KRG13, LSVMW08,OKM12, PARB14, PBK99, RBB15, SPE95,SZBS95b, SKM15, TG94, WPH94].Dynamische [Wil94]. dynamite[IvdLH+00, IHvA+00]. Dynamite/DPVM[IHvA+00]. DySel [CKmWH16].

E-scale [Gua16]. each [Ano00a, Ano00b].Early[CD96, LV12, SLG95, EFR+05, KJA+93].Earth[KTJT03, Nak03, Nak05a, Nak05b, UTY02].Earthquake [UZC+12, KTJT03, KME09].Easily [PKB01]. East [IS16]. Easy[HCA16, TDG13, MJPB16, SBF94].EasyGrid [BR04]. EASYPVM [Saa94].ECMWF [HK93, HK95]. ed [Nag05].EDEM [Tsu95]. Edge[ZDD97, Gra97, RAGJ95]. edition[Ano99a, Ano99b, Ano00b]. Editors[AM07, GSA08]. education [ACM06a].EDV [Ano94c]. EDV-Benutzertreffens[Ano94c]. Edward [Che10]. Effect [DK06].Effective [MLAV10, RK01, TMC09, Tsu95,Cza13, JH97, KS15a]. Effects [SSE12].efficacy [GScFM13]. Efficiency[KS96, MTU+15, CZ96, MMU99, RS95].Efficient [ADT14, Att96, BHW+17,BGBP01, BCK+09, BHLS+95, BFG+10,BGD12, Bru95, BDH+95, BDH+97,BMPZ94b, CAWL17, CFP96, DZ98a,

19

DGG+12, FHPS94a, FHPS94b, HBT95,HKT+12, HT08, HC06, HLO+16, KGK+03,KD13, LAD16, MDM17, MB12, MRB17,NBK99, PGS+13, RJMC93, RRBL01,TGBS05, WSN99, WWFT11, YPZC95,ZWHS95, BfDA94, BHW+12, CGH+14,FM90, FNSW99, FHB+13, HCL05,KVGH11, LKL96, LA06, Pan95b, PRS+14,RR01, SOA11, TPD15, TDG13, YLC16,dCZG06, CRD99, THRZ99]. Efficiently[CC99, CCM+06, PHA10]. effortless[ITT99]. eigenproblem [BV99, GG99].eigensolvers [DR18]. Eigenvalue[DAK98, BSC99, THM+94]. Eighth[ERS95, Sie94, IEE96b]. Eilean [CSS95].einem [BL94]. Einfluß [Gra97].Einfuhrung [MS04]. Einstein [ARYT17].Einstein- [ARYT17]. Ejector[CCBPGA15]. elastic [PTG13]. elasticity[PTT94]. Elastodynamic [MAIVAH14].electric [BALU95, Ano03]. electrical[Sil96]. electroabsorption [WWZ+96].electromagnetic[DSOF11, NZZ94, OMK09]. electron[ART17]. electron-molecule [ART17].Electronic [GJN97]. Electronics [IEE95d].Electrosoft [Sil96]. electrostatic[VDL+15]. Element[MS02b, OD01, OMK09, SM02, VRS00,BB93, BCM+16, Gra09, HMKV94, KME09,KEGM10, Nak05a, Nak05b, PTT94].Elemental [PMvdG+13]. elements [KB13].Eliminating [DSG17]. elimination[ACMZR11]. elision [CLdJ+15]. elliptic[AGIS94, PR94c]. ELLPACK[BBH12, MKP+96]. ELLPACK-R[BBH12]. Else [Gei00]. elucidation [MK94].Embedded [YGH+14, ACJ12, CGK11,NEM17, TMW17, WCS+13]. Embedding[FS97, SML17, MS96a]. Embodying[Ser97]. emerging [RMNM+12]. Emission[Pat93, EZBA16]. emphasis [Bos96]. eMPI[MS96a]. eMPI/eMPICH [MS96a].eMPICH [MS96a]. Empirical

[SS94, VY02]. Employing [AGMJ06].emulation [MS99b]. emulator [LTLC94].enable [SPK+12]. Enabled[Fos98, GSY+13, LSMW11, Pan14, ZLP17,DS13, GLM+08, HJBB14, KTF03, RA09,SHHI01, SR11, ZLS+15]. Enabling[APBcF16, BGG+15, CLSP07, DGB+14,GBH14, HJYC10, NPS12, TY14, ZPI06,BR04, MA09, SHHC18]. encapsulation[DRUC12]. encoding [AAAA16, PGBF+07].endpoint [LLH+14]. endpoints [DGB+14].energies [TKP15]. Energy[BPG94, EGR15, KFL05, RBAI17, VW92,FKLB08, KN17, PTL+16, TDG13].Energy-Aware [EGR15]. energy-efficient[TDG13]. Engine[Wal01a, NPP+00a, Wal01b]. Engineering[Ano98, BPG94, BP93, EGH+14, IEE96h,KaM10, LSB15, LF+93a, MS02a, MBS15,Nag05, SM07, Str94, DMW96, IEE94c,PW95, Sil96, LF+93a]. engineers [HW11].Engines [SLJ+14, HSW+12, SHM+12].EngineTM [OIS+06]. English [Wil94].Enhance [AR01]. Enhanced[Ano98, CDHL95, CDH+95, FMSG17, KY10,PLR02, Saa94, BR95b, FE17].enhancement [ARL+94, Boi97].Enhancements[BDG+95, BCKP00, DM95b, DM95a].Enhancing [BFIM99, FSC+11, MVTP96,MSMC15, OFA+15]. Ensemble[Cot97, Cot98, BY12, FH97].Ensemble-Based [FH97]. ENSOLV[AMS94]. Entwicklung [Sei99].Environment [BDGS93, BFG+10, BFM97,BGL00, CHPP01, CTK01, DLB07, DI02,DHHW92, DHHW93a, DDL00, FTVB00,FWR+95, GJN97, GL97a, HRSA97, KBA02,KKH03, KDL+95b, KVH97, LC93, Lus00,MSOGR01, MM02, MFG+08, MSS97, NJ01,Ong02, Rol94, SDN99, SGL+00, SGHL01,TTP97, WL96a, ABG+96, BDG+92b,BDG+94, BK96, BT96, CEF+95,CLLASPDP99, DZ96, DL10, DHHW93b,

20

EASS95, FMBM96, FB95, Fan98, Fra95,GBR97, GGH99, GPL+96, GkLyCY97,HZ94, IJM+05, IvdLH+00, KCD+97, Kat93,KDL+95a, Kos95b, KFSS94, wL94, MSL12,MK97, NP94, PES99, PVKE01, PQ07,RNPM13, SSKF95, Sch93, SPK96, SBF94,SWYC94, Skj93, SSG95, TJD09, Tho94,WCC+07, WL96b, WLC07, ZPLS96].environmental [ANS95]. Environments[Ano95e, Ano01a, Bak98, BF98, DT94,GFB+03, Laf01, Mat94, Mat95, MFC98,PS01a, RB01, SHH94b, SSSS97, SCL00,TAH+01, ACGdT02, ARL+94, ALR94,ADDR95, AMV94, Bon96, BFIM99,CDH+94, CK99, DR94, DR95, EO15, HS93,HVSH95, LC07, MSP93, SS94, SHH94a,SAP16, TSS98, VB99, YS93, ZL96].environments-the [CDH+94]. EPS[GT94]. EPS-APS [GT94]. Epstein [BL95].Epstein-Nesbet [BL95]. Equation[ES11, LZ97, SAS01, VRS00, DM12, LBB+16,LYSS+16, MS95, NP94, ON12, Ols95, Pri14,iSYS12, SSB+16, YSVM+16, YSMA+17].Equations [And98, BG95, GK10, Huc96,LLY93, MFTB95, ORA12, ZB97, BHW+12,Che99, IM95, JK10, Jou94, MM11, NF94,RBB15, SP11, SMSW06, ZZG+14, dH94].Equi [LTRA02]. Equi-Join [LTRA02].equivalencing [LLG12]. Era[ABB+10, CZG+08, CGKM11, EdS08].Erratum [Ano01b, HF14b, Wal94b]. Error[DFC+07, HPS+12, HPS+13]. Errors[FCLG07, SD16]. Erweiterung [GBR97].ESA [Whi94]. ESBMC [MdSAS+18].ESBMC-GPU [MdSAS+18]. Espoo[RWD09]. ESPRIT [CDH+94]. Estimation[GK10, AMHC11, CCU95, GB94,JMdVG+17, KS13, ZWHS95]. Estuarine[LRQ01]. Ethernet [CC00a, Fin97, HcF05,KYL03, KYL05, OF00, PFG97]. EU[Ano03]. Eugene [MCdS+08]. Euler[DLR94, IDD94]. Euler/Navier[DLR94, IDD94]. EURO[HAM95b, BFMR96, HAM95b, BFMR96].

Euro-Par [BFMR96, HAM95b, BFMR96].Euromicro [IEE95h, IEE96g]. EuroMPI[CDND11, KGRD10, TBD12, TB14].EUROPE [LCHS96, Ano92, Ano93e,Ano93f, Ano94g, Tou96]. European[AD98, Ano94i, BR95a, BDLS96, BC00,BDW97, CHD07, CHD09, CD01, CDND11,DKD05, DLM99, DKP00, DLO03, KGRD10,Kra02, KKD04, LKD08, MTWD06, RWD09,TBD12, WPH94, DHK97]. EuroPVM[BDLS96, OL05, DKD07, MTW07].EUROPVM/MPI[OL05, DKD07, MTW07]. EuroPVMMPI[KKDV03]. EUROSIM[BH95, DSZ94, BH95]. Eurospace [Tou96].Eurospace-Ada-Europe [Tou96].Evaluate [MW98]. Evaluating[BWV+12, FVLS15, FST98a, GFD03,GFD05, GGCGO01, GB96, HWW97, LH95,SSSS97, ZSnH01, GScFM13, LTLC94, TG09,ZLZ+11]. Evaluation[ATM01, BF98, BIC+10, BFM97, BEG+10,CLP+99, DI02, FST98b, FSSD17, Han98,JCH+08, KS96, KK02b, KSS00, LGCH99,LNK+15, LZ97, kL11, LVP04, MH01,MGC12, NNON00, OM96, Pan14, Par93,RB01, SWHP05, SCP97, SEF+16, SBF+04,SM02, Sou01, SJK+17a, SJK+17b, TOTH99,TSB02, TSB03, TTSY00, UMK97, VY02,AB13, BBG+14, BBH. . . 13a, BMG07, CB11,DBB+16, HPR+95, HASnP00, HPS95, IM94,JC17, JMdVG+17, LV12, LNW+12,MKP+96, MM03, MT96, MMH99, NN95,PSK08, SL94b, SWS+12, SWYC94, SFSV13,TSP95, THM+94, TMPJ01, Wor96, YWO95,YS93, ZHK06]. Evaluations [MM14].Event [KKV01, NSLV16, THS+15, WM01].Event-Based [NSLV16]. everything[CCM+06]. everything-shared [CCM+06].Evolution [Mat01a, PS01a, RBB17, SSL97,SGDM94, GS93, SSD+94]. Evolutionary[B+05, DSM94, Rag96]. Evolving[Bad16, ER12, MdSC09]. Ewing[Ano95c, Ano99c, Ano99d, Ano00a, Ano00b].

21

EWOMP’99 [BC00]. Exact [dOSMM+16].Example [Che10, NB96, Pat93, SK10].Exascale [Bad16, LV12, LSG12].Exception [FMSG17]. exchange[MMM13, Pan95a]. excluded [BHW+12].executable [WMP14]. Execution[AHD12, BME02, DT17, FC05, FM09, GR07,KGK+03, Mar05, MFG+08, MAGR01,Ney00, STY99, SAP16, EPML99, Mor95,SMAC08, TNIB17, TSY99, TSY00, UGT09].Executions [GAML01]. Exhibition[HS95a, GH94, LCHS96]. Existing [CB00].EXOCHI [WCC+07]. Expand [CGC+02].Expanding [LA02]. expected [CAHT17].Experience[BCP+97, BT96, CP98, PS01a, Tou00,AMS94, CARB10, KJA+93, RSC+15].Experiences [AHP01, BFZ97, CMV+94,CLLASPDP99, GLN+08, GS91a, GSI97,GB96, GL95d, ITT02, JR10, KS97, Mar02,TGEM09, ZPLS96, ZKRA14, AL92,CCF+94, Sch94, SGDM94, BDG+93b].Experiment [Luo99]. Experimental[BIL99, BIC05, EGC02, Ser97, UMK97].Experiments [BPMN97, Coe94, LGM00,OS97, RR00, ZB97, RHG+96, HAJK01].Expert [BPG94]. experts [EO15].ExpEther [NMS+14]. Explicit[BHV12, GFPG12, SGHL01, LC97b].Explicitly [Mai12, SYR+09]. exploit[ZPI06]. Exploitation[GGL+08, GAM+02, BK11, GAM+00].Exploiting[Add01, Bri10, FKLB08, HEHC09, KFL05,NAAL01, Nob08, THH+05]. Exploration[AMuHK15, OFA+15, ABDP15, GE95,GE96, PDY14]. Explorations [BGG+15].Exploring [IFA+16, MBKM12, MTU+15].Expose [SAL+17]. Exposing [SD16].Exposition [IEE95d, LF+93a]. EXPRESS[KS96, Ahm97, FK94, LH95, SHH94a,SHH94b]. Expression[BN12, KH15, Sur95a]. expressions[SFLD15]. expressive [Tra12a, YLC16].

Extend [DFA+09]. Extended[BR02, HTA08, SS99]. Extending[ABB+10, BCC+00a, BCC+00b, BDB+13,CS96, CG99a, KDT+12, LMRG14, Mar03,OFA+15, RGDML16, SDV+95, TMTP96,CG96, GGHL+96]. Extensible[BL97, GS94]. Extension[AELGE16, BGR97a, CSAGR98, VAT95,Hum95, JH97, SG14, SC95, ZT17, GBR97].Extensions[Fis01, GOM+01, GHLL+98, HVA+16, HE15,DPSD08, HP05, Kat93, Ano99c, Ano99d].Extent [kL11]. Extent-Based [kL11].exterior [HMKV94]. external [BBB+94].Extraction [CBL10, HLO+16, dAT17].Extreme [MdSC09, ZKRA14].Extreme-scale [ZKRA14]. eyes [Str94].

F [FHPS94b, FHP+94]. F90 [DP94]. face[HDDG09]. Faces [Gro12]. facilitate[PKB06]. Facilitating[MC99, ZLL+12, ESB13]. Facilities[MMH98, MN91]. Facility[KG96, SHTS01, KZCS96, LHCT96].factorization [AZ95, BSvdG91, BRS92,DG95, KBP16, WLC07]. Factorizations[TD98, LC97b]. Fail[LFS92, LFS93a, LFS93b]. Fail-safe[LFS92, LFS93a, LFS93b]. Failure[BBH. . . 13a, CRGM14, BBH+13b, CGH+14,BDB+13]. failure-aware [CGH+14].failures [JS13]. Faithful [KLR16]. Fall[Gra97]. false [JE95]. family [AVA+16].farming [Str94]. Fast [Ben01, BHS+02,BBH12, CS14, DFN12, EM02, Hog13,JFGRF12, JMdVG+17, PSHL11, PR94c,PBC+01, RB01, SE02, SS09, STY99, SR11,UP01, WTR03, Lan09, LCL+12, NYNT12,TDG13, YULMTS+17, YLZ13, YBZL03,ZA14, AAB+17, DBLG11, PFG97]. Faster[Tsu12, ZG95a, ZG96]. Fat [Zah12].Fat-tree [Zah12]. FATCOP [CF01]. Fault[BBC+02, BCH+03, BHK+06, CF01,CFDL01, FBD01a, FBVD02, FD02a, FD04,

22

GFB+03, GKP97, GJR09, GL04, Gua16,IEE95c, JSH+05, LMRG14, LNLE00, dLR04,MSF00, RPM+08, TS12a, WC09, Wil93,BCH+08, FBD01b, FD02b, HG12, LMG17,LS08, PKD95, SG05, ZHK06, FD00].Fault-Management [GJR09].Fault-Tolerant [BHK+06, FD04, GFB+03,IEE95c, JSH+05, LMG17, LS08]. Faults[LAdS+15]. FCRC [ACM96b]. FD [And98].FD-TD [And98]. FDDI [LC93]. FDTD[DSOF11, VM94]. Fe [Old02, BJS99].feasibility [KBG16]. Feature[Qu95, ZWL+17]. Feature-driven [Qu95].Features[GLT99, GLT00b, GLT00a, GLT12, KAHS96,Ano00a, CRD99, WKS96, ZKRA14, dAT17].February [Ano95d, GE95, GE96, IEE93a,IEE94a, IEE97c]. FEM [GEW98].FEM-Systeme [GEW98]. Fermi[SP11, WKP11]. FETI [KLR+15]. few[NS16]. few-body [NS16]. Feynman[NS16]. FFT [DALD18, GB98, JKM+17,NSM12, SH14, WJB14]. FFT-Based[WJB14]. FFTs [EFR+05]. FFTW [KT10].FHP [BMS94a]. Field[KNT02, Goe02, TKP15]. fields[BALU95, RSBT95]. Fifth[DKM+92, HK93, IEE96f, SM07, IEE95c].filamentary [YPA94]. File[BIC+10, CGC+02, LRT07, kLCCW07, kL11,PLR02, RK01, TSS00b, Tsu07, WTR03,DL10, LL95, SBQZ14, iSYS12]. File-I[PLR02, RK01]. File-I/O [PLR02, RK01].film [SL00]. filter [BY12, CCU95]. Finding[FCLG07, GAVRRL17, PCS94]. Fine[AZG17, BBG+10, JCP15, SFL+94, YSS+17,BK11, KW14]. Fine-Grain[AZG17, JCP15, SFL+94, BK11, KW14].Fine-Grained [BBG+10, YSS+17]. Finite[DFN12, MS02b, MAIVAH14, OD01,OMK09, Pri14, SM02, UZC+12, VM94,VRS00, BB93, Gra09, GFPG12, HE13,HMKV94, KME09, KEGM10, KB13,Nak05a, Nak05b, NZZ94, NB96, Ram07].

Finite-Difference[UZC+12, VM94, HE13, NZZ94, Ram07].Finite-Element [MS02b, BB93, KME09,KEGM10, Nak05a, Nak05b]. Finland[RWD09]. Fire [JML01, SJ02]. First[AGH+95, BCD96, BC00, CH96, Dem96,DFN12, DW94, Gat95, HAM95b, Kum94,Nar95, PBPT95, SSP+94, USE94, AH95,BS94, PTMF18, PBPT95]. Fix [DLV16].FLAME [VBLvdG08]. flat [Nak05b].Flattening [THRZ99]. Flexibility [KK02b].Flexible [CS14, GR95, GBS+07, SHPT00,CARB10, DGB+14, GAM+00, HC08]. Flink[KWEF18]. flip [KO14]. Florida [ACM98b].Flow[BHW+17, BGD12, CGZQ13, CCBPGA15,FM09, Pat93, AMS94, AFST95, EP96, ED94,HK94, HTHD99, JAT97, LL16, MBKM12,Ols95, PTT94, RM99, SCC95, SU96, TS12b].Flow-Based [BHW+17]. Flows[GAP97, BCM+16, BTC+17, Heb93, LLG12].flowshop [CB11]. Fluid [DFMD94, GAP97,JFY00, SZBS95a, TDBEE11, TGEM09,ALR94, ATL+12, AGMJ06, BvdB94, BHS18,Bil95, HVSC11, MRRP11, PBK99, SPE95,SZBS95b, WPH94]. fluid-particulate[ATL+12]. fluids [HK94, WB96]. Flux[QRMG96, QRG95]. fly[KSJ14, THRZ99, BCAD06, BADC07]. FM[LC97a]. FMA [LO96]. Fock [CBHH94].Focus [Cla98]. foolish [Rol08a]. footprint[TS12b]. force [Goe02]. Forecast [AHP01].forecasting [Bjo95, KOS+95a]. Forest[JML01, NCKB12]. ForestGOMP[BFG+10]. Foreword [CHD09]. FORGE[WCVR96]. Fork [BGD12, SML17].Fork-Join [BGD12, SML17]. form[NCB+12, NCB+17]. Formal[BG94a, BdS07, GKS+11, GB98, LPD+11,PGK+10, VVD+09, BG94c, SZ11].Formalizing [FGRT00]. Format[BBH12, MDM17]. Forschung [Ano94c].Fortran[Ano97, Ben95, Bra97, GBR15, Ano98, AS14,

23

BW12, DZ98b, Don06, GML+16, HE13,HH14, HZ99, KaM10, Kuh98, LC97b,LCC+03, MWO95, iSYS12, SM03, SMCH15,TBG+02, Wal02, YBMCB14, YSVM+16,YSMA+17, vHKS94]. Fortran/PVM[MWO95]. Forum [Str94]. Forward[RMNM+12, BDB+13]. forwarding[CXB+12]. Foundation [Gei01]. four[GSMK17, MGG05]. four-atom [MGG05].four-particle [GSMK17]. Fourier[DBLG11, BCM+16]. Fourteenth [IEE95b].Fourth [Ano89, IEE93d, IEE95k, Sie92a,Sie92b, Ano94i, IEE96g]. FPGA[MTU+15, PWP+16, WTTH17].FPGA-Platform [WTTH17]. FPGAs[MC17, OFA+15, PGS+13, WZHZ16, Roh00].fractal [Wu99]. fragment [KS15a].fragments [OA17]. Framework [DGMS93,FC05, GGCGO01, GR07, MGL+17, NSZS13,PMvdG+13, SSB+05, SSAS12, Sun90a,Sun90b, WZHZ16, Ano93c, BA06, BR04,BAG17, EFR+05, FLMR17, GM13, KKM15,KJJ+16, KKJ+08, KH10, LME09, LGG16,LCMG17, LS08, PTL+16, RSC+15, SL00,TDB00, YLC16, YWTC15, ZT17, dAT17].Frameworks [OP10, ASS+17, KDSO12].France [ACM90, BR95a, BFMR96, CHD07,DE91, FR95, JPTE94, MCdS+08, VW92,YH96, GA96, IEE94c]. Francisco[BBG+95, IEE93a, IEE94g]. Frankfurt[Tou96]. Frankfurt/Main [Tou96].Fredericton [BG91]. Free[PKYW95, CP15, SOA11, Zah12]. freedom[KTJT03]. Frequency [IEE94e]. friendly[SVC+11]. Frontiers [ACM06b, IEE94a,IEE96c, Sie92a, Sie92b, Sie92a].Frontiers’95 [IEE94a]. Frontiers’96[IEE96c]. FSI [HAA+11]. FT[FD00, LNLE00]. FT-MPI [FD00]. Fujitsu[Ano98, AKL99, BHS+02, SWJ95, SH96].Fully [GA96, SSB+16]. Function[AGS97, Bri02, MCP17, RB01, SW12, HE15,JMdVG+17, KRC17]. Functional [ACM90,AJF16, CNM11, NW98, Ser97, CBHH94,

EP96, HSE+17, SFLD15, WZWS08].functionality [BFIM99]. Functions[BKGS02, Bru12, Hat98, MDM17, CdGM96,HWX+13, PNV01]. Fundamentals[Wal96a]. fused [TW12]. Fusion[FHK01, FMFM15, PKE+10]. fusions[FFM11]. Futhark [HSE+17]. Future[Dar01, IEE93d, Mat00a, BDG+93b, FK94,FHP+95, Gei94]. Futures [Kuh98]. fuzzing[LLCD15]. Fuzzy [MDM17].

G [OPM06]. G2 [Cot04, KTF03, OPM06].GA [Ara95]. GAIN [ARYT17].GAIN-MPI [ARYT17]. Gains [CMM03].Gallipoli [Ano93b]. GAMMA [CC00a].Gap [AAB+17, ASS+17]. Garbage[GTS+15]. Gas[BMS94b, BBK+94, BMS94a]. gather[MTK16]. gauge [BW12]. Gauss[BG95, LM99, Ols95]. GCel[SHH94a, SHH94b]. GECCO [B+05]. Geist[Ano95b]. Gemini [SWS+12]. gems[Fer04, mH12, Ngu08, PF05]. gene[PCS94, AAC+05, BGH+05, EFR+05,KMH+14, LM13, MV17, MSW+05].gene-finding [PCS94]. Gene/L[AAC+05, BGH+05, EFR+05, MSW+05].Gene/Q [KMH+14, LM13, MV17].General [Che10, IH04, MW98, SZBS95a,Sun94a, ABDP15, ADLL03a, ADLL03b,CBM+08, FLD96, KPNM16, PF05, RSBT95,SK10, SZBS95b, SMSW06, YPA94].General-Purpose [Che10, ABDP15,CBM+08, KPNM16, PF05, SK10].Generalized[DFKS01, FKS96, BSC99, SD99, van93].Generating [AZG17, CGL+93, ER12,IJM+05, PKB+16, SFLD15]. Generation[AB93a, CC17, FAFD15, Gei98, GTH96,HT08, JFY00, LTDD14, RGD13, SSB+17,TGBS05, VPS17, AB93b, CPR+95,DCD+14, DWM12, KHS12, KPL+12, KH10,SP11, WKS96, WMP14, ZKRA14].generational [WK08a, WK08b, WK08c].

24

generative [MAS06]. generator[Lan09, TNIB17, YL09]. Generic [ARS89,AKL99, GB98, BAS13, GM13, ZT17].Genetic[FTVB00, MTSS94, MSCW95, PB12,WKS96, Wal01a, WHDB05, AB13, BB95,FSTG99, HPLT99, RJC95, Wal01b, B+05].genetics [LM99]. Geneva [IEE97b].genomic [SdM10]. GeoComputation[Abr96, Abr96]. GeoFEM[NO02b, NO02a, Nak03]. geomechanics[BJS99]. geometrical [FMS15]. Geometry[STK08, STT96]. geophysical [Has95].Georeferring [GCGS98]. Georgia[USE00, UCW95]. German [EGH99,GBR97, Gra97, GEW98, Sei99, Wer95].Germany [BDLS96, GH94, KGRD10,MTWD06, MdSC09, PSB+94, Sch93, Tou96,Ano93a, BPG94, Cal94, GHH+93, WPH94].Gesellschaft [Ano94c]. get [Str94].Getting [Nob08]. GF100 [WKP11]. gHull[GCN+13]. GHz [Ano03]. Gibbs [TKP15].Gigabit [CC00a, HcF05, EGH99, OF00].Giganet [GT01, Tra02b, bT01a]. GIS[CFPS95, CCSM97]. Give [DZ98b]. Glenda[SBF94, Bic95]. Global [BSG00, DSS00,Pan95a, Ros13, SHTS01, STK08, SWH15,TTP97, HWS09, HCL05, HEHC09, LF+93a,Str94, Wan02, YLZ13, Zah12, ZWHS95].Globally [BHS+02]. GLUE [Rab98].GMRES [dH94]. Gmunden [Vol93]. GNU[YSMA+17]. go [KC94]. good [Mat03].Gottingen [Ano94c]. GP [LRBG15].GP-GPUs [LRBG15]. GPFS[AHP01, BIC+10, PTH+01a, PTH+01b].GPGPU [BGG+15, HA11, HCZ16,JKN+13, LME09, LDJK13, LYZ13,MBKM12, PTG13, TY14, YZ14, YEG+13].GPGPUs [JMdVG+17, LSB15]. gprof[GJLT11]. GPU [Che10, KA13, AKL16,AHHP17, BDP+10, BR12, BCD+12,BCD+15, BTC+17, BWV+12, BBH12,CCBPGA15, DF17, DS16, DK13, DALD18,DSOF11, DWL+10, DWL+12, ER12, Fer04,

FFM11, FSSD17, GCN+13, HVA+16,HSE+17, HK09, HK10, HZG08, mH12,JDB+14, JLS+14, JR13, JNL+15, JJPL17,JPT14, KDSO12, Kha13, KSL+12, KPL+12,KI17, KPNM16, KEGM10, KO14, LV12,Lee12, LRG14, LLC13, LAD16, MMO+16,MdSAS+18, MGL+17, Ngu08, NMS+14,NSM12, OFA+15, Pan14, PDY14, PF05,Pri14, RSC+15, RMNM+12, Sai10, SK10,SdM10, dOSMM+16, iSYS12, SS09, SCSL12,SIRP17, SAP16, SD16, SSB+17, SKM15,SKB+14, SG14, TBB12, TS12b, WKP11,YULMTS+17, YHL11, YCL14, YSS+17,ZRQA11, ZZG+14, ARYT17].GPU-Accelerated [KA13, SCSL12].GPU-Aware [Pan14]. GPU-based[MMO+16, SS09]. GPU-code [EZBA16].GPU-programming [HSE+17].GPU-Resident [JDB+14]. GPUDirect[YWCF15]. GPUMP [ZC10]. GPUrpc[IFA+16]. GPUs [BY12, DS13, DS16,GML+16, GFPG12, GPC+17, HLP10, HP11,HLP11, Hos12, IFA+16, JKM+17, JAK17,KGB+09, KKM15, KKLL11, KVGH11,LBH12, LRBG15, MA09, ON12, OIH10,PP16, PB12, SHLM14, SDB+16, SKK+12,Tsu12, VY15, WRSY16, WJ12, WJB14,YLZ13, YSWY14, ZC10, ZZZ+15].gpuSPHASE [WMRR17]. GPUVerify[BCD+12]. GQ [RFG+00]. GRACE[YKI+96, ZRQA11]. GRADE [DDL00].Gradient[BG95, GFPG12, KN17, MM92, Ols95].Grain[AZG17, IOK00, KOI01, MJPB16, NIO+02,NIO+03, BK11, JCP15, KW14, SFL+94].Grained [ADRCT98, BBG+10, LGM00,YSS+17, Heb93, RJC95]. Grammatical[RBB17]. Grand[DGMJ93, Ten95, BDG+92c]. Graph[BHW+17, DW02, MM14, NPS12, PPR01,STV97, HLP10, HKOO11, PP16, PD11].Graph-Based [NPS12].Graph-Partitioning [STV97]. Graphic

25

[HJBB14]. Graphical[BDG+91b, DDL00, BDG+92a, KCD+97,KFSS94, SSKF95, VDL+15]. Graphics[KS15b, LSVMW08, LSMW11, SLJ+14,vdLJR11, ABDP15, BHS18, CBM+08,DBLG11, Fer04, GKL95, HTA08, HSW+12,KFA96, KY10, KME09, LHLK10, MSZG17,PF05, SHM+12, SR11, WWFT11, ZLS+15,MSML10]. graphics-scalable [GKL95].Graphs[LGM00, OP10, EP96, MC99, MJPB16].Gravitational [ZSK15, KM10]. Greece[CD01, CDND11, SM07, TG94]. green[PTL+16]. Grenoble [JPTE94]. Grid[AB93a, CGB+10, CLL03, DPP01, Fos98,KT02, Laf01, Liv00, MRB17, PLK+04,Rei01, TGEM09, AB93b, Eng00, GLM+08,KRKS11, WYLC12, AASB08, BR04,CCHW03, DKD08, FC05, GFB+03, GL02,KTF03, KGK+03, KSSS07, LC07, LS08,NSBR07, RPM+08, RTRG+07, SHTS01].Grid-Adaptive [KT02]. Grid-Enabled[Fos98, GLM+08, KTF03]. Grids[NO02b, ACH+11, CC10, KBG+09, NO02a,NB96, BBH+06, GR07, Ram07, SN01].GROMACS [BvdSvD95]. Gropp[Ano95c, Ano99c, Ano99d, Ano00a, Ano00b].Gross [LBB+16, LYSS+16, SSB+16,YSVM+16, YSMA+17]. Ground[HTHD99, NS16]. groundwater[AFST95, EGDK92]. Group[AD98, Ano98, Ara95, ACDR94, CHD07,CHD09, CD01, CDND11, DKD05, DLM99,DKP00, GN95, KGRD10, Kra02, KKD04,LKD08, MC94, MTWD06, RWD09, TBD12,UMK97, BDW97, DLO03, MMU99].grouping [WPL95]. Groups [GOM+01].Grover [LYZ13]. Growth[PKYW95, BB95]. GTS [PKE+10]. Guest[AM07, GSA08]. GUI [VGS14].GUI-awareness [VGS14]. guidance[SDJ17]. Guide[Ano12, D+91, GBD+94, Lad04, Nov95,Per96, Ano95b, BDG+91a, McK94].

Guideline [Tra12b]. Guidelines [TGT10].GVirtuS [MGL+17].

Hack [DLV16]. Hague [Ano93e]. Halide[RKBA+13]. Hamburg [PSB+94].Hamiltonian [ART17]. Handling[DFC+07, FMSG17, LSB15, LGM00, RC97,FFFC99, LNW+12, THRZ99]. hands[KmWH10]. hands-on [KmWH10]. Harbor[BBC+00]. Hardware[BGG+15, BWW+12, Bru12, BCKP00,CDPM03, DW02, HSP+13, LSMW11,MFC98, PSM+14, PKB+16, vdLJR11, ER12,GGL+08, PMZM16, Rab99, SBG+12, SH94,SWS+12, YAJG+15, ZLS+15].Hardware-Based [CDPM03].Hardware-oblivious [HSP+13]. harmonic[GSMK17]. Harness[EBKG01, MS99b, PL96, FBD01a, FBD01b,FBVD02, FD02a, FD02b, MSF00, Gei98].Harrogate [CJNW95]. Hartree [CBHH94].HASEonGPU [EZBA16]. Haskell [WO97].Hate [Dan12]. Hawaii[ERS95, ERS96, HS94, MMH93, ZL96].HCA [KBG16]. HDL [Kat93, KMK16].HDMR [KD12]. Heading [Sch99]. Heat[SAS01, NP94, iSYS12]. Hector[RFRH96, RRG+99]. Heijen [Van95]. held[AGH+95, GA96, JB96, KG93, MMH93,Old02, R+92, SPH95, TG94]. Helios[SPK96]. Helmholtz [HMKV94]. Helps[Stp02]. HeNCE[BDG+92a, BDG+92b, BDG+93a, BDG+94].Henon [JPT14]. Herzliya [IEE96h].HeSSE [MRV00]. Heterogeneous[ABB+10, BDG+93a, BDGS93, BL95,BCP+97, BGR97b, BCKP00, CMMR12,DGMS93, DGMJ93, FDG97a, FDG97b,FLD98, Fos98, GS91b, IEE93f, KR09,KCR+17, LC93, MRV00, MM01, MM02,NTR16, PD98, SMS00, SGS10, TQDL01,VLO+08, ACGdT02, ADB94, ADDR95,AMV94, BDG+92c, BDG+94, BALU95,BRR99, BAG17, CCM12, CFPS95,

26

FMBM96, GKZ12, GCN+10, GKCF13,HK94, KSG13, KSL+12, Kos95b, LCL+12,LR06a, Lee12, Mai12, MSL12, MM03, NP94,NEM17, Pen95, RCFS96, Skj93, Smi93b,Sun94b, Sun95, TBB12, TMW17, TKP15,TDG13, VB99, WCC+07, YST08, YSL+12].HeteroMPI [LR06a, VLO+08]. Heuristic[BHM96, STV97, WH94]. HI[ERS96, HS94, IEE96e, ACM97a]. HICSS[ERS96, MMH93]. HICSS-26 [MMH93].HICSS-29 [ERS96]. hiCUDA [HA11].Hierarchical[BMR01, FBSN01, HA10, HL17, MALM95,RR02, ADMV05, BDV03, OKM12, YPZC95].hierarchies [SYR+09]. High[ACM97b, ACM98a, ACM98b, ACM00,ACM01, ACM04, BPG94, BRST94, BS07,CDD+13, CNM11, CDHL95, CS14, DPP01,DDL00, DE91, FGKT97, GSHL02, GBH99,GBS+07, GLDS96, HVA+16, HA11, Hol12,IEE92, IEE93c, IEE94g, IEE95k, IEE96a,IEE96f, IEE97c, IFI95, JJM+11, Kha13,KMK16, KEGM10, KH15, Laf01, LCK11,LC97a, LkLC+03, LBH12, LWP04, MW98,MPD04, ME17, MAB05, NU05, OIH10,OLG01, PKB01, PR94b, PTH+01b, Rab98,RH01, SPM+10, SCSL12, SJ02, Slo05,SVC+11, SSSS97, Tou00, Tsu07, VW92,WN10, YCL14, YWCF15, YSP+05, AH95,Ano03, BADC07, Ber96, BWT96, BID95,CHKK15, DL10, Duv92, EZBA16, ESB13,FME+12, GS02, GGC+07, GL96, GL97c,HDDG09, HW11, Hos12, KBP16, KME09,Lan09, LBD+96, MSL12, MSZG17, NS91,NFG+10, Old02, PGS+13, PGK+10, PF05].high [PTW99, Reu03, RJDH14, SG14,SFLD15, ZSK15, ZWL13, dAT17, CDH+95,DZ98b, D+95, DE91, GH94, HS95a, KD12,LCHS96, LC97b, SSH08, Ten95].High-Dimensional [MW98]. High-Level[CS14, DDL00, HA11, Hos12, SG14, SFLD15].High-order [KEGM10, KME09].High-Performance [ACM98a, FGKT97,IEE97c, LkLC+03, OLG01, PKB01, PR94b,

PTH+01b, Rab98, RH01, SPM+10, SCSL12,WN10, GLDS96, OIH10, SVC+11, Ano03,ESB13, FME+12, GL96, GL97c, HDDG09,KBP16, LBD+96, Old02, PGS+13, PGK+10,PF05, Reu03, RJDH14, SFLD15, ZSK15,HS95a, GH94, LCHS96, SSH08].High-Precision [Kha13]. High-Scalability[BS07]. High-Speed [CDHL95, KMK16,AH95, BWT96, CDH+95].high-throughput [ESB13]. Higher[MYB16, KB13, wL94]. higher-level [wL94].Higher-order [MYB16]. Highly [MM95,PV97, TMP16, CARB10, GBH14, VM95].highly-scalable [GBH14]. Hills [IEE93f].HiNet [AH95]. HIRLAM[Bjo95, HE02, KOS+95a]. histogramming[KRC17]. History [OWSA95]. Hitachi[Ano03, NNON00, TSB02, TSB03]. HLA[RTRG+07]. Hoare [KI17]. Hoc[IBC+10, ITT02]. Hogskolan [Eng00]. Hole[Kha13]. holistic [TWFO09].Homomorphisms [RG18]. homotopy[GWC95, SMSW06, VY15]. Honolulu[IEE96e]. honor [Str94]. Host[Ano95e, LLRS02]. Host-Parasite[LLRS02]. HOTB [GSMK17]. Hotel[IEE94e]. Hotel-Copley [IEE94e]. Hough[YULMTS+17]. house [ZLZ+11]. Houston[ACM06a, Ano95a, Cha05, DKM+92, Y+93].HP [CGB+10, BCM+16]. HPC[ASS+17, CGBS+15, GDC15, GKK09,LCVD94b, OLG+16, PRS+14, ZLP17].HPC2002 [Ano03]. HPCN [LCHS96].HPF[BP98, BF01, BID95, Bri00, BDV03, CM98,CDD+96, Coe94, FKK+96b, FKKC96,FKK96a, LZ97, OP98, OPP00, SM02, Str94].HPF-MPI [BP98]. HPL [Lee12]. HPVM[BCKP00, CLP+99]. HPVM-Based[CLP+99]. hull [GCN+13]. Hungarian[Fer92, FK95]. Hungary[DKP00, KKD04, VV95, FK95]. hunting[JPP95]. Husky [YLC16]. Huss [Ano96a,Ano99a, Ano99c, Ano99b, Ano99d, Nag05].

27

Huss-Lederman[Ano96a, Ano99a, Ano99c, Ano99b, Ano99d].Hybrid[BBG+10, BBH+06, CGC+11, CNM11,Cha02, DR97, GPC+17, HVSC11, IDS16,KS15a, KLR+15, LLRS02, LRG14, MS02b,NO02b, PZ12, SSB+16, VPS17, WT12,YHL11, YPAE09, YTH+12, ADR+05,BBG+14, CSPM+96, FMS15, GAVRRL17,GKK09, HDB+13, JR10, JMS14, KN17,KRG13, KJEM12, LLC13, LLH+14,MLAV10, MRRP11, NO02a, Nak05a, Nak05b,PARB14, PHJM11, SDJ17, SVC+11, WT11,WYLC12, WLYC12, YWC11, ZWL13].hybrid-core [BBG+14]. Hybridizing[LSG12]. HYDRA MPI [PBC+01]. Hyper[CSW99, SBT04, TBG+02, ZAT+07].Hyper-Rectangle [CSW99].Hyper-Threading[SBT04, TBG+02, ZAT+07]. hypercube[HS95b, Sur95b]. Hypercubes[Ano89, RJMC93, She95]. Hypercubic[HP11]. hyperelastic [OKW95].hypersonic [BTC+17]. Hyperspectral[VLO+08].

I-SPAN [LHHM96, Li96]. I-WAY [FGT96].I/O [Bos96, CFF+96, DRUC12, IRU01,IBC+10, LkLC+03, kLCC+06, MV17,MGC12, MG15, PSK08, PLR02, RK01,SBQZ14, Tha98, Tsu07, WSN99]. IASTED[Ham95a]. IBM [AL93, Ano03, BBB+94,BGBP01, BR95c, BR95b, Bri95, CE00,CDM93, FHPS94b, FHP+94, FHP+95,Fra95, FWR+95, GL95d, HSMW94,HMKV94, Heb93, JF95, KB98, KAC02,KHS01, KMH+14, LC97b, MP95, MW93,MABG96, NMW93, WZWS08, XH96].IBM-SP1 [FHPS94b]. ICA [IEE96d].ICAPP [Nar95]. ICCMSE [SM07]. ICIP[IEE94b]. ICPP [Agr95a]. ID [DGG+12].Idaho [Str94]. Ideas [IEE95d].identification [HPLT99]. identity [KN17].IEEE [ACM97b, ACM98b, ACM04, ACM05,

Bha93, IEE94e, IEE94g, IEE95b, IEE95a,IEE95k, IEE95g, IEE96b, IEE96f, IEE96d,IEE02, Nar95]. IEEE/ACM [ACM04].IFIP [Boi97, DR94, PSB+94]. IFS [AHP01].Igniting [ACM03]. II[DE91, GE95, HS94, BPS01, BWW+12,EM00b, GAVRRL17, Sta95b]. III[BPG94, BP93, DSM94, GE96, Has95,OKW95, SSGF00]. ILDJIT [CARB10]. I’ll[Har94]. Illumination [STK08, ZWHS95].ILU [ABF+17]. ILU-preconditioned[ABF+17]. im [Gra97]. Image [DYN+06,FJBB+00, GA96, GPC+17, KBA02, KS01,LSZL02, NJ01, PLR02, RRBL01, WN10,ARL+94, DZZY94, GDC15, JC96, KKLL11,RKBA+13, SLS96, UH96, Wu99,YULMTS+17, YPZC95, YZPC95, dAT17].Imagery[GGCM99, GGCGO01, GCGS98, GGGC99].Images [Uhl94, Uhl95b, VLO+08, NAJ99].Imaging [NH95, Has95, LM13, Pat93].imbalances [MLVS16].immunodominance [ZWL+17]. Impact[ADLL03a, ADLL03b, BRU05, Bru12,TSS00a, WHDB05, DO96, FSV14, SHHC18].impacts [Str94]. Implement[GM95, PPT96c]. Implementation[AB93a, AKL99, BGG+15, BGBP01, BPS01,BG95, BHP+03, BBS99, Ben01, BP98,BCD+15, Bjo95, BJS97, BIC+10, BMR02,BRM03, BMS94b, BMG07, CGC+02,CFMR95, DYN+06, DAK98, EFR+05, ES11,FH97, FD04, FHSO99, FSXZ14, FJBB+00,FHPS94a, FHPS94b, FHP+94, FSLS98,GBH99, GB98, GBS+07, Gro02a, HPP02,HRZ97, HKT+12, Huc96, HHA95, HAA+11,IBC+10, ITT02, IM94, JSS+15, JSH+05,LSZL02, LTRA02, LZ97, LWP04, MS02b,MW98, MN91, MT96, MRH+96, NSS12,NNON00, OLG01, Pan14, PLK+04, PS00a,Pet97, PBK99, PTH+01a, PTH+01b, PB12,RDMB99, RG18, RSV+05, SH94, SBF+04,SBG+02, Ser97, SCC96, SSC97, SZBS95a,SWJ95, SYF96, Sum12, Sur95a, TOTH99,

28

TBG+02, TRH00, TMPJ01, USE94, VT97,WH94, WPC07, YGH+14, YWO95, ZZG+14,ACGdT02, AS92, AAAA16, AAC+05].implementation [ADLL03a, ADLL03b,AB93b, BR91, BvdSvD95, BR95b, Ber96,BBCR99, BK96, BCK+09, BS01, BS05,Bor99, BRR99, BS96b, BDV03, Bri95, BB00,BAS13, CDZ+98, CEGS07, CG99a,CdGM96, CBHH94, CD96, DSW96, DS96a,DL10, DBB+16, DSOF11, DM12, FFB99,FWNK96, FGT96, FGG+98, GCC99, GG99,GG09, GAVRRL17, GL92, GL94, GL96,GLDS96, GL97c, GT07, GkLyCY97, HBT95,HCL05, HS95b, ITT99, IvdLH+00, JRM+94,JC96, KY10, KTF03, KBVP07, KL95,KVGH11, KB13, Lee12, LC07, LO96,MMO+16, Man94, MAIVAH14, MS95,MSZG17, ON12, OKW95, OA17, PHJM11,PR94a, PTW99, PCS94, Ram07, RRFH96,Sep93, SZBS95b, SCL97, Sto98, SNMP10,Sur95b, SL95, TKP15, TPD15, TS12b,TA14, TCP15, Tsu95, TVV96, VDL+15,VGRS16, VM95, Was95a, WMRR17].implementation[YPA94, ZLS+15, dH94, dlAMCFN12, van93].Implementations[AKK+94, Ano01a, ACMR14, AJF16, BM00,BS07, BEG+10, FB94, Gro02b, kLCC+06,LCW+03, Mar02, ORA12, Sap97, TSCaM12,TGEM09, VS00, WT12, ZDD97, CLSP07,ER12, ED94, GML+16, ICC02, KWEF18,MKP+96, NN95, Pri14, WT11, YCL14].implemented [BBDH14, EP96].Implementing[DPZ97, Fin94, Fin95, GL95b, HB96a,HB96b, LRT07, MMH98, MS99c, MSB97,SSC96, SS99, SMTW96, SGHL01, SCC95,Tra02a, Wil93, BWT96, LHZ97, YX95].Implementor [GL95b]. Implicit [MS02b,NA01, SGHL01, Bjo95, TSP95, WADC99].Importance [BCG+10, PCY14].Importance-Driven [PCY14]. Improve[KBS04, SKH96, Tha98, GK97, RHG+96].Improved

[Tra02b, MMO+16, dlAMCFN12].improvements [DPSD08]. Improving[CGZQ13, DZ96, DCPJ12, DCPJ14,GSY+13, HE02, IRU01, KH12, KK02b,LB98, MK97, PTG13, RSC+15, SCL00,XF95, CZ96, JKN+13]. in-house [ZLZ+11].In-memory [CRM14, HSP+13]. in-place[HSE+17, PSHL11]. Including[BWW+12, GLT12]. incompressible[BCM+16, Lou95, RM99, TS12b].Incorporating [LM94, LYZ13, TKP15].Incremental [dOSMM+16]. Independent[BCL00, BRU05, CSW12, CDMS15, DiN96,MV17, YBZL03]. Index [DALD18, LAD16].Index-Digit [DALD18, LAD16]. Indexers[Wal01a]. Indexers/Crawler [Wal01a].Indexing [LTR00]. India[CGB+10, IEE96a, Kum94, PBPT95].indicator [FSV14]. Industrial[BPMN97, DHK97, ALR94, ABCI95a,ABCI95b, BT96, EKTB99, Was96, Kon00].industries [Ano93a]. Industry[DM98, Ano94f]. Industry-Standard[DM98]. inefficiency [HGMW12]. Inertial[Str97]. Inference [LAdS+15]. Infiniband[SWHP05, LCW+03, LVP04, LWP04, PK05,PRS16, SPK+12, ZLP17].InfiniBand-based [PK05]. inflation[OdSSP12]. influence [Gra97].Information [Ano98, CGB+10, Ano93c,CG99b, MMR99, WADC99, PSB+94].Infrastructure [WLR05]. infrastructures[GWVP+14]. Initial[LLH+14, VDL+15, AL96, LSR95].Initiated [SSB+05]. initiatives [Sun95].initio [SSGF00, SEC15]. Injection[RRAGM97, SAL+17]. Inn [IEE93c].Innovation [ACM03]. Input[CFF+94, SHM+12, JWB96]. input-aware[SHM+12]. Input-Output [CFF+94].Input/output [JWB96]. Insight [IEE02].Inspection [BPMN97]. inspired[NEM17, TDB00]. instances[RBAI17, ZLZ+11]. Institute

29

[Old02, TG94]. Instrumentation[MVY95, Yan94]. Insurance [PZ12].Integer [ASA97, CF01, WLC07, ZC10,BHJ96, KVGH11]. InteGrade [CC10].integral [HK94]. Integrals [FBSN01, NS16].Integrate [GLRS01]. Integrated[CFDL01, DGMS93, HKN+01, KSV01,WL96a, DF17, HK10, KW14, VDL+15,WWZ+96, WL96b, XWZS96]. Integrating[BCLN97, CM98, Fin00, GJP01, KJA+93,KAHS96, wL94, WTFO14, TWFO09].Integration[CGC+11, CSW97, FD96, FB94, MAIVAH14,Sei99, AL96, CSW99, KB13, RBB15].Integrator [Per99, SP99]. Intel[Ano96c, Ano03, DSGS17, MP95, URKG12,VDL+15, YSMA+17]. Intelligence[BPG94]. intelligent [IEE95a, ZWZ+95].Intel(R) [TBG+02, SBT04]. INtensities[ARYT17]. Intensive[Rei01, BFLL99, BKML95, SL94a]. Inter[KFL05, LAFA15, FKLB08, LFL11, SDB+16].Inter-Atomic [LAFA15]. Inter-Node[KFL05, FKLB08, LFL11].inter-workgroup [SDB+16]. Interaction[DMMV97, GFV99, NSLV16, Sou01].interactions [PARB14]. Interactive[Coo95b, KPK13, KA13, NE98, RTRG+07,STK08, Coo95a, IJM+05].Intercommunication [TMP16].Interconnect[Bru12, SJ02, BWT96, SWS+12, TBD96].Interconnected [Hus00]. Interconnecting[MC98]. Interconnection[MANR09, SB95, AVA+16]. Interconnects[RA09]. Interface[Ano01b, BCFK99, BDH+97, CHD07, Cer99,CGH94, CDND11, DFKS01, DHHW92,DHHW93a, DBK+09, FKKC96, FSLS98,Gle93, GLS94, GL95c, GLDS96, GLT00b,HDB+12, HRSA97, KSJ95, KGRD10,KKDV03, KKD04, LKD08, LkLC+03, LW97,MPI98, MS98, MSS98, MBES94, MMSW02,MTWD06, PS01b, RWD09, SSL97, TDB00,

TW01, TBD12, WD96, Wer95, YHGL01,Ada98, AD98, Ano93d, Ano94d, BBB+94,BBCR99, Bru95, BDW97, BR94, CFKL00,CFF+96, CD01, CG99b, DKD05, DBB+16,DS96b, DLM99, DKP00, DLO03, HPY+93,HRR+11, KOB01, KSJ96, KBHA94, Kra02,NS91, Pie94, PR94a, SL94a, SWJ95,SDV+95, VM95, Wal94a, Wal94b, ZWL13,ZKRA14, AMHC11, BC14, BBH+06, BRU05,BDH+95, Cot04, DKD08, DiN96, FKS96,FGT96, FGG+98, GGHL+96, GLT99,GLS99, GLT00a, GL04, Han98, IBC+10].Interface [KTF03, KKD05, LK10, MSL96,RRFH96, SWHP05, SLG95, SWL+01,TGT05, YGH+14, Ano95c, Ano00a, Ano00b].InterfaceArchitecture [Sei99]. Interfaces[MGC12, Wit16, RJDH14, Tra12a].Interfacing [Lus00, PL96]. Intermediate[SML17]. internal [BBH+15].International [ACM94, ACM96b, ANS95,Abr96, ATC94, AGH+95, Ano93a, Ano94a,Ano94e, BPG94, Bos96, BFMR96, Cha05,CZG+08, CGKM11, CMMR12, CGB+10,CH96, DSM94, DW94, EV01, EdS08, ERS95,ERS96, EJL92, Gat95, GA96, GT94,Ham95a, HAM95b, HS95a, HS94, Hol12,IEE93c, IEE93b, IEE94d, IEE94g, IEE95b,IEE95c, IEE95a, IEE95k, IEE95i, IEE95f,IEE95l, IEE96a, IEE96f, IEE96e, IEE96d,IEE97b, IEE97c, IEE05, Kum94, LCK11,LF+93a, Lev95, LHHM96, Li96, MMH93,MCdS+08, MdSC09, Nar95, Ost94, PW95,PBG+95, PBPT95, Ree96, R+92, SHM+10,Sie94, Sil96, SM07, Tou96, VW92, Vol93,Vos03, Was96, YH96, ACM97a, AH95, BS94,DMW96, FR95, GH94, JPTE94, LCHS96,Mal95, ZL96, Ano93b, HHK94, Sch93].Internet [NE98]. Interoperabilitat[GBR97]. Interoperability[BoFBW00, Don06, PLR02, GBR97].Interoperable [Rab98, MSL12, YBMCB14].Interoperation[FDG97a, FDG97b, FLD98]. Interpolants[RB01]. interpolation [BAS13].

30

Interpretative [MKW11]. Interpreted[FSSD17]. Interpretive [CNC10].interprocess [SC95]. interprocessor[DS96b]. interrupts [CXB+12, SH96].Intervals [MDM17]. intra[GM13, VSW+13]. intra-node [GM13].intra-warp [VSW+13]. Introduccion[VP00]. Introducing [JKM+17, TBS12].Introduction[Ano96b, AM07, Che10, Cze16, DOSW95,GSA08, HW11, Mar02, Mat00b, SK10, VP00].Invasive [URKG12]. Inverse[Huc96, BV99, GGC+07, GG09, Wan02].Inversion [ACMR14, Kan12].Investigating [GMdMBD+07, Ros13].investigation [PHW+13]. Invisible[Wis97]. Invited [Gei93a]. IO[AHP01, BIC+10, CGC+02, CFF+96, DL10,FGRD01, FWNK96, FSLS98, LRT07,LGG16, PSK08, PTH+01a, PTH+01b,SW12, Sto98, TGL02, ZZ04]. IO/GPFS[PTH+01a]. IOMMU [YWCF15]. IOV[YWCF15, ZLP17]. IP [CCA00]. IPCC[SC95]. IPPS [IEE96e]. Ireland [LKD08].IRREGULAR[FR95, BMR01, Cza02, Cza03, BL99,HASnP00, LOHA01, MR96, NP12].irregularly [FR95, Smi93b]. ISA [Wit16].ISBN [Che10, SD13]. ISBN-13 [Che10].ISCA [Ano94e, YH96]. Ischia [ACM06b].Iserver [SHH94a, SHH94b].Iserver-Occam [SHH94a, SHH94b]. Ising[AL93, KO14]. Isolating [Lus00].Isosurface [PCY14]. ISPAN [HHK94].Israel [DSM94, IEE96h]. Israeli [IEE96h].ISSAC [Lev95]. ISSTA [Ost94]. Issue[AM07, BDB+13, BC00, GSA08, MPI98,CHD09, DKD07, Mar02, Old02]. Issues[BDT08, FD02a, KGK+03, MW98, Pan95b,PS01b, ZDD97, ARvW03, EGH99, FD02b,HHA95, PBK99]. Italy[CMMR12, CH96, DKD05, DKD07, D+95,DLO03, HS95a, IEE95h, KG93, OL05,ACM06b, Ano93b, CLM+95, DR94, Sil96].

Iteration [HF14a, HF14b]. iterations[Lou95, YST08]. Iterative[CCSM97, DK06, NO02b, Nak03, SC04,ADDR95, EDSV09, LSR95, MGG05, NO02a,Nak05a, Nak05b, OMK09, dH94]. Ithaca[PBG+95, Ree96]. IV [SPH95]. IWOMP[CZG+08, CGKM11, CMMR12, EdS08,MCdS+08, MdSC09, SHM+10]. IWPP[Kum94, PBPT95]. IWPP-94[Kum94, PBPT95]. IWWP [Kum94]. IX[R+92].

Jack[Ano95b, Ano96a, Ano99a, Ano99b, Nag05].Jacobi [BBDH14, CGU12, LM99]. JaMP[KBVP07]. January [ERS96, GE96, HS94,IEE95h, IEE96g, MMH93, USE95]. Janus[GJP01]. Japan[SHM+10, SPE95, HHK94, IFI95]. Jason[Che10]. Java[ACM98a, Ano97, BCFK99, BDY99, Bra97,BKO00, CGJ+00, CFKL00, CLL03, DeP03,Fer98b, Fer98a, GGS99, KOB01, KBVP07,LRW01, MSS98, MG97, NE98, RAS16,SMS00, SZ99, TDB00, VGRS16, VGS14,WN10, WCS99, YC98, YHGL01].Java-based [WCS99]. Java-MPI [GGS99].Java/CORBA [LRW01]. JavaNOW[TDB00]. Jaypee [CGB+10]. Jeff [Stp02].Jersey [Bha93]. Jerusalem [DSM94].Jiang [Ano95b]. Job [NSS12]. Jobs[GSHL02, OPM06, ZA14]. Join[BGD12, LTRA02, SML17, BMS+17, She95].Joint [GT94, Ano03, YHGL01, Ano93c].Jose [ACM97b, GE95, GE96]. JPEG[NU05]. JPT [BDY99]. JPVM[Fer98b, Fer98a, LGCH99]. Jr [ACM99].July[ACM95b, ACM97a, Boi97, EV01, GA96,Has95, IEE93c, IEE96i, Lev95, PW95, TG94].Jumpshot [ZLGS99]. June[ACM90, Ano94f, B+05, BG91, CZG+08,CGKM11, CMMR12, DSZ94, DW94, D+95,IEE94e, IEE95c, IEE95i, IEE96d, IEE96h,

31

KG93, LHHM96, Li96, MCdS+08, MdSC09,R+92, SL94a, SHM+10, TG94, Vos03].Jupiter [Str94]. Just[FKLB08, FSSD17, KFL05, FK94].Just-In-Time [FSSD17, FKLB08]. JVMPI[DeP03].

k-ary [Pan95a]. Kalman [BY12].Kanazawa [HHK94]. Kandrot [Che10].Karlsruhe [Cal94, Sch93]. Karlsruher[Reu01]. Katsevich [DYN+06]. Kaufmann[SD13]. KBLAS [AKL16]. Keele [Ano93c].KENO [RP95]. KENO-Va [RP95]. Kernel[CKmWH16, CFDL01, EBKG01, HKT+12,MBBD13, PWP+16, TY14, FMFM15,GM13, MMW96, SAP16, YBZL03, AKL99].Kernel-assisted [MBBD13, GM13].Kernel-based [CKmWH16, TY14].kernel-independent [YBZL03].Kernel-Level [HKT+12]. Kernels[BCD+15, KI17, KAC02, Pet01, Ros13,SSB+17, ARS89, BCD+12, FSV14, FVLS15,FFM11, KKM15, PTG13, PGS+13, TBB12].Kerr [Kha13]. key [LF+93a]. kind [SP11].Kinect [KPK13]. Kinetics[LD01, BTC+17]. King [ACM99].Kingdom [Boi97]. Kirchhoff [SSS99].Klagenfurt [Bos96]. Knapsack [ICC02].KNEM [GM13]. knowledge [FNSW99].knowledge-based [FNSW99]. Knoxville[PR94b]. Kohr [Stp02]. Kolmogorov[Str97]. KOP3D [KR09].Koppelrandkommunikation [Gra97]. Kpi[EML00]. KPN2GPU [BK11]. Kremlin[GJLT11]. Kronecker [LNW+12]. KSIX[AUR01]. KSR1 [BL94]. KU [IM94].Kungl [Eng00]. Kyoto[IFI95, SPE95, IFI95].

L [AAC+05, BGH+05, EFR+05, MSW+05].LA-MPI [YSP+05]. Lab [Str94]. Labeling[PPJ01, KRKS11]. labelling [HLP10].laboratory [JY95]. Lafayette[EV01, EdS08]. Lagrangian

[CT94a, CT94b, RSV+05, TC94]. Lahey[Ano98]. Lake [Hol12]. LAM[OF00, RsT06, SSB+05, Squ03, ZWZ05].LAM/MPI[OF00, RsT06, SSB+05, Squ03, ZWZ05].lambda [PQ07]. lambda-calculus [PQ07].LAMGAC [MSOGR01, MS02a]. LAN[CCU95, CDH+95, MSOGR01, MTSS94,TSZC94, ZGC94]. LAN-based [TSZC94].LAN-Message [MTSS94]. Lanczos[GP95, Sch96a, Sch96b]. Landing [dCZG06].Landsat [GGCM99, GCGS98].Landsat-TM [GGCM99, GCGS98]. Lane[HHC+18]. Language [ACM96a, NM95,PD98, TA14, WLR05, Ben95, CGK11, Hos12,Nob08, RKBA+13, Roh00]. Languages[CFF+94, FMSG17, FSSD17, CH96, Mar05,Olu14, SWS+12, PBG+95, SS96]. LANs[Fin97]. LAPACK [Add01, ARvW03].LaPerm [WRSY16]. LAPI [BGBP01].Laplace [ACMR14]. Large[AKE00, BHW+17, BZ97, BJS99, BHNW01,CGC+11, DALD18, FFP03, Huc96,JFGRF12, LLY93, MKC+12, MFPP03,PCY14, SGJ+03, SM03, SvL99, TGEM09,WT12, ZWJK05, AASB08, AMS94,BCA+06, BA06, BCH+08, Che99, CCHW03,DZZY94, FME+12, GG99, IM95, JLS+14,KEGM10, Kos95b, KA95, LS10, MLA+14,NFG+10, PTL+16, PD11, RMNM+12,SC96a, TBB12, WT11, ZWL13, ZA14].Large-Scale[AKE00, BHW+17, BZ97, FFP03, MFPP03,SM03, WT12, BJS99, SvL99, AASB08,BCH+08, Che99, FME+12, LS10, MLA+14,PD11, RMNM+12, WT11, ZA14].large-sized [JLS+14]. Larger [NB96].LargeScale [LAdS+15]. laser[EZBA16, WWZ+96]. Lastverteilung[Wil94]. Latency [Jes93a, Jon96, KBHA94,NCB+12, NCB+17, TBD96].latency-tolerant [NCB+12, NCB+17].Lattice [BBK+94, BMS94b, HLP11,SJK+17a, SJK+17b, BW12, BMS94a, Sai10,

32

SVC+11, BLPP13]. launches [Ano03].Layer [CSAGR98, HEH98, FKK96a, PTT94,dlAMC11, dlAMCFN12]. layered [DiN96].Layering [Hus01]. layers [KC94]. Layout[WG17, BGH+05, HP11, LDJK13, Str12].Lazy [TCBV10]. Leaks [DLV16]. Learned[GKPS97, MWO95]. Learning[AHHP17, Gro01b, FE17, KWEF18, LSSZ15,SEC15, TWFO09, WO09, WTFO14].learning-based [FE17]. Least[PWP+16, VRS00, DK13]. Least-Squares[VRS00]. Lecture [Gei93a]. Lederman[Ano96a, Ano99a, Ano99c, Ano99b, Ano99d,Nag05]. Leeds [Abr96]. legacy[BR04, LP00, LRW01]. Lemon [DRUC12].Lengths [GSHL02]. LEO [CCBPGA15].Leonardo [Stp02]. Lessons [MWO95].Level [AELGE16, BGG+15, BBC+00, CS14,CRGM14, DHHW92, DHHW93a, DDL00,GS91b, GAM+02, HA11, HKT+12, DK02,KCP+94b, KOW97, LVP04, LMRG14,NPP+00c, SHM+10, SBF+04, TS12a, TW01,XF95, BMPS03, CAWL17, CRM14,CRGM16, EPP+17, GGS99, HE15, HK09,Hos12, KCP+94a, wL94, LCMG17, LM13,MALM95, NS91, Nak05b, STY99, SCL97,SG14, SFLD15, YZ14, ZWZ05, ZZZ+15,BBH. . . 13a]. Leveraging[HDB+12, NPP+00c, SHLM14, LFL11]. LIB[NPP+00d]. libefp [KS15a]. libOMP[BGD12]. Libraries [BHLS+95, BWV+12,CGZQ13, DARG13, GFD05, IEE94f, IEE95j,MM14, ARvW03, BCM11, BfDA94, CRD99,GS94, PS07, Skj93, SDB94, SSG95, DHK97].Library[AKL16, Ada97, Boo01, BLW98, Coo95b,DHP97, EM02, FHK01, For95, GFB+03,GSI97, Gro02a, HB96b, ITKT00, JPT14,KBG16, OD01, PLK+04, PS01a, RR02,Saa94, SBG+02, Sta95b, SKH96, TD98,UTY02, WN10, YKLD17, ZC10, Ada98,AMHC11, Arn95, CSS95, CGG10, Coo95a,DRUC12, DXB96, FB97, Fan98, FKK+96b,GDC15, GLM+08, GL94, HB96a, HLM+17,

Har94, Har95, JKM+17, JC96, KS15a, KN95,LR06a, MSL96, PKB06, PS00b, RFH+95,SSC96, SH96, ZT17, CC95, McD96, Sum12].Life [PZ12, Str94]. Lifting [vdLJR11].Lightweight [CKmWH16, DT17, FLB+05,KMK16, FS95, Ott93]. Like [BST+13,BKO00, CGJ+00, KOB01, VGS14, CSS95].Likelihoods [MSCW95]. LIME [DRUC12].Limits [GB96, MBKM12]. Linda[Mat94, KS96, MSP93, BLP93, CSS95,Gal97, Mat95, TDB00]. Linda-like [CSS95].Line [BoFBW00, Wis98, Bor99]. Linear[ASA97, BDT08, BG95, CDD+13, Gao03,Huc96, LLY93, LZ97, MGMH97, MSB97,van97, BSN95, BKvH+14, BAV08, BRR99,CEGS07, DR18, Gra09, GFPG12, Jou94,MW98, MM11, OKW95, SCC96, SMSW06,dCH93, dH94]. Linear-scaling [Gao03].Lines [NE01, YULMTS+17]. Link[BGR97b, SJ02]. Linked [WJ12].Linkoping [FF95]. LINPACK [JNL+15].Linux[Sei99, SMTW96, USE00, SSSS97, Ano01a,GSN+01, MK04, OF00, PS07, PKB01,RsT06, Sei99, Slo05, SGL+00, YL09]. Linz[Kra02]. lipid [FHSO99]. Liquid[DSS00, JLS+14]. Lisbon [IEE93d]. LISP[ACM90]. List [Tra98, WJ12]. Lithe[PHA10]. Lithography [RDMB99].Liverpool [AD98]. LLVM [SML17]. Load[Ano94b, BKdSH01, BS05, DI02, DR95,DK06, GCBL12, HE02, MM02, NP94, PT01,Pus95, SGS95, ST97, Wal01a, Bir94,CKO+94, DZ96, DLR94, DvdLVS94,EZBA16, FMBM96, FH97, GS96, Hum95,JH97, MM03, SCL97, SY95, Wil94].load-balanced [EZBA16]. Local[BSG00, CDHL95, CCSM97, IKM+01,AMHC11, BY12, CGL+93, FSV14, IKM+02,LHD+94, LHD+95]. Locality[MJB15, ZLP17, BHRS08, HJYC10,RKBA+13, WRSY16]. Locality-Aware[MJB15, HJYC10]. localization [HC08].Locally [BHS+02]. Locating [PNV01].

33

Lockheed [Str94]. Locking[kL11, CAWL17, PGK+10]. Logging[BCH+03]. Logic[KI17, BJ95, KMC96, KMC97, POL99].LogP [CKP+93]. London[EJL92, Ano93g, Ano94f]. Look [HCZ16].lookup [BJ13]. Loop[DMB16, SHM+10, TJPF12, SHLM14,WYLC12, WLYC12, YST08, YWC11].Loops [AHD12, LOHA01]. Loosely[Ada97]. Lop [RGDML16, RGDM15].Louisiana [USE95, IEE96b]. Love [Dan12].Love-Hate [Dan12]. Low [BGG+15,GGS99, Jon96, MC17, NE01, RLL01, Str94,GK97, KBHA94, TBD96, ZRQA11].Low-Bandwidth [NE01]. Low-Cost[RLL01, GK97]. Low-Density [MC17].Low-Level [BGG+15, GGS99]. Low-life[Str94]. low-overhead [ZRQA11]. LPVM[ZG98]. LSS [BCAD06, BADC07]. LU[AZ95, BRS92, LC97b]. Lugano [GT94].Luminous [KNT02]. Lumsdaine[Ano99c, Ano99d]. Lusk[Ano95c, Ano99c, Ano99d, Ano00a, Ano00b].Lustre [DL10]. Luther [ACM99]. Lyngby[DW94, DMW96, Was96]. Lyon[BFMR96, FR95].

M [PBC+01]. M-SPH [PBC+01]. M6A[EM00a]. M6B [EM00b]. MA[Ano95b, Ano95c, Ano96a, Ano99a, Ano99c,Ano99b, Ano99d, Ano00a, Ano00b].Machine[AS92, AGIS94, BJ93, BS93, CHD07, D+91,FE17, Fis01, GBD+94, Gre94, KNT02,KKDV03, KKD04, LKD08, MTWD06,Nov95, Pat93, Per96, RWD09, TY14, VS00,Wel94, AD98, AL92, Ano95b, BR91,BDG+91a, BPC94, Bir94, BDLS96, BDW97,CARB10, CLM+95, Cav93, Cha96, Che99,CD01, CC00b, DM93, DKD05, DLM99,DKP00, DLO03, FM90, KWEF18, KMC97,Kra02, LG93, MN91, MRH+96, NB96, Sch94,SK92, SCC96, SL00, TW12, TWFO09,

WO09, WTFO14, ARL+94, BG94b, JPP95,KKD05, LK10, QRG95, SSSS96].machine-learning [TWFO09].machine-learning-based [WTFO14].Machines[BP99, BZ97, BCC+00a, BT01b, DR97,EGR15, GB96, GTS+15, HC10, MGL+17,STY99, SCSL12, ZWJK05, BCA+06, BSC99,BCC+00b, DDS+94, DCH02, GKZ12, KN95,PRS16, SL94b, TSY99, TSY00, WPL95,ZWL13, Gei01, YC98]. made [MJPB16].MAFFT [ZLS+15]. Magnetic[Y+93, PKE+10]. Magnetism [Y+93].Magnetohydrodynamic[KT02, WWFT11]. Magnetostatic [BB93].MagPIe [KHB+99]. Main [Tou96].Maintaining [PKB01]. maintenance[ZDR04, ZDR01]. Makes [ZG95b, Str94].Malleable [EDSV09, MSMC15]. Mambo[WZWS08]. Man [IEE95a]. Manageable[PKB01]. Managed [KCR+17, SYR+09].Management [AJ97, AUR01, BGR97b,BGL00, EK97, FDG97a, FDG97b, GJR09,PPT96a, PS00a, SIS17, STY99, THS+15,ARS89, DZ96, DF17, FLD96, GL95a, JCP15,LF+93a, PPT96b, PPT96c, YWTC15].manager [Sep93]. managers [FLD96].Managing[FLD98, FGKT97, Liv00, NPS12, Obe96].Manchek [Ano95b]. Manipulation[KKV01]. Manual[CSW12, NSLV16, Reu01]. Many[DT17, LZH17, LLCD15, RB01, YTH+12,ACMZR11, VDL+15, dCZG06]. Many-Core[LZH17, YTH+12, LLCD15, ACMZR11,KSG13, MBBD13, dCZG06]. Many-Cores[DT17]. Manycore [MJB15, KGB+09].Map [JPT14, FFM11, FJBB+00, MSCW95].MAPA [JJPL17]. Maple[Pet00a, Pet00b, Pet01]. Mapping[GAMR00, HC06, NTR16, RRBL01,TSZC94, WO09, DDLM95, EO15, HC08,TWFO09, WCS+13, WTFO14, WK08a,WK08c, dCZG06, WK08b]. MapReduce

34

[JS13, MMM13, PD11, WZHZ16]. Maps[BM97, KRC17]. Marc [Ano96a, Ano99a,Ano99c, Ano99b, Ano99d, Nag05]. March[ACM95a, ACM06a, Ano89, Ano93c, Cal94,DKM+92, IEE93f, IEE94d, IEE95b, IEE97a].Marine [LLRS02]. market [LF+93a].Markov [BBH12, FK01]. Marlioz [GA96].marshaling [CFKL00]. MARTE [RGD13].Martin [ACM99]. Maryland[IEE96c, SPH95]. MASA [SMM+16].MasPar [ARL+94]. Massachusetts[IEE94e]. masses [Cla98]. Massive[Sie92a, MALM95, OLG+16]. Massively[BJ93, BHS18, BBH12, DSZ94, IEE94a,IEE96c, Oed93, Sie92b, Sta95b, CS96, DR94,HVSC11, KN17, KmWH10, LCL+12,MYB16, RBB17, SRK+12, DSZ94].massively-parallel [MYB16]. Master[FH98, EML00, LTR00, HP05].master-slave [HP05].Master-Workerproblem [FH98].Master/Slave [LTR00]. Master/Worker[EML00]. Matching[GGC+07, KS01, MM02, OWSA95, WH94,MM03, Qu95, YPZC95, YZPC95].Materials [Y+93, SSP+94]. mathematical[Wan97, Has95]. Mathematics[Whi04, ANS95]. MATLAB[BKGS02, Whi04, Ano97, Bra97, ZZG+14].MATLAB-MPI [BKGS02]. MatlabMPI[KA04, Kep05]. MATOG [WG17].matrices [DR18, GG99, GSMK17, Kan12].Matrix [AKL16, BSvdG91, Cha96, DS13,Fuj08, GK10, PMvdG+13, TQDL01, TD98,ART17, CMH99, ER12, FAF16, FJZ+14,KBP16, PKD95, TPD15, XXL13].Matrix-Vector[AKL16, DS13, Fuj08, XXL13]. Maui[ACM97a]. Max [Ano94c].Max-Planck-Gesellschaft [Ano94c].maximisation [CCU95]. maximum[HKOO11]. Maxwell [And98]. May[ACM96b, ACM06b, AGH+95, BR95a, BS94,Cha05, DT94, EdS08, Gat95, HS95a, IEE95e,

IEE95d, IEE95i, PR94b, SPE95, SW91,SS96, Van95]. Maydan [Stp02]. MBCF[MMH99]. MCA [WCS+13]. McDonald[Stp02]. MCHF [SYF96]. McLean[IEE94a, Sie92a, Sie92b]. MCNP[MW93, McK94, WH96]. MD[IEE02, TMPJ01]. mdb [DKF94a]. MDE[RGD13]. Means [TK16]. Measurement[BFBW01, BFIM99, KRS99, Shi94, TMC09].Measurements [IHvA+00, EFR+05, GL99].mechanics [Bil95, MGG05, SL95].Mechanism[CGLD01, KSV01, MH01, THS+15, TSS00b,Tra02a, HWX+13, SIRP17, ZRQA11, ZA14].Mechanisms[Wal01a, CGBS+15, Ott93, TMTP96].Mechatronic [KDL+95b, KDL+95a].mEDA [VAT95]. mEDA-2 [VAT95].media [EZBA16, MAIVAH14]. Medicine[GA96]. medium [WLNL06].medium-scale [WLNL06]. Meeting[AD98, Ano93e, CHD07, CD01, CDND11,DKD05, DLM99, DKP00, DLO03, GA96,KGRD10, Kra02, KKD04, LKD08, MC94,MTWD06, RWD09, TBD12, BDW97, JB96,SPH95, Ano92, CHD09]. megabase[SdM10]. Meiko [FST98a, FST98b, Jon96].Melia [WZHZ16]. Mellon [IEE94d].Membership [MDM17]. membrane[FHSO99]. Memory[Att96, BME02, BWW+12, Bri10, BdS07,BT01b, CSW97, CC99, DM98, DMB16,DR97, DHHW92, DHHW93a, FB94,GCBM97, GB96, GSN+01, GSHL02,GLRS01, HC10, HDB+12, HDT+15, HT01,JJPL17, KB98, KS13, KSHS01, LSB15,Luo99, MB12, MRB17, MBE03, MMH98,MCdS+08, Mul02, NPP+00d, PBK00, Pok96,PMvdG+13, Ros13, STY99, ST02b, SW91,Thr99, VS00, VT97, ARS89, ABCI95a,ABCI95b, ADMV05, BCA+06, BVML12,BSC99, BMG07, CBPP02, Cha05, Cha96,CBHH94, CRM14, CC00b, DF17, DLR94,DBVF01, DS96b, DHHW93b, DPZ97, EV01,

35

FSV14, FHB+13, GCN+10, GBH14, GKK09,GL96, GL97c, GP95, HSP+13, HGMW12,HDB+13, HK09, JC17, JE95, KN95,KJA+93, KC06, LKL96, MLC04, NAJ99,NAAL01, OLG+16, PK05, PS00b, RGDM15,SSH08, SHHI01, SL94b, SBG+12, SYR+09].memory[SFL+94, SSC96, SPL99, SD16, TSY99,TSY00, Uhl95a, Vos03, Wal94a, Wal94b,WPL95, WK08a, WK08b, WK08c, WBSC17,WMRR17, YX95, LBD+96, GK97, SG05].Memory-Based [MMH98].Memory-Efficient [MRB17].memory-level [HK09]. Memory/Message[ST02b]. MemTo [GSN+01]. Menon[Stp02]. Mesh [HAA+11, MRB17, Ran05,BAS13, CLSP07, Cou93, GBR15, IDS16].mesh-particle [BAS13]. Meshes[MRB17, TPD15]. Message[AKL99, Att96, BZ97, BCH+03, BBG+01,BDH+97, BGR97b, BFM97, CHD07, Cer99,CGZQ13, CGH94, Cot97, Cot98, CTK00,CDND11, DFKS01, DHHW92, DHHW93a,DDL00, FKKC96, Fos98, FB94, GR07,GB96, Gle93, GLRS01, GLS94, GL95c,GLT00b, Hem94, KGRD10, KS97, KSV01,KKDV03, KKD04, LKD08, Luo99, MPI98,MP95, MS98, MBES94, MG97, MTWD06,MSS97, NW98, PBK00, Pok96, RC97,RRBL01, RWD09, RFG+00, SAL+17,ST02b, TBD12, WD96, Wer95, Wis97,YHGL01, ZWL13, ZG95a, ZG96, ZLL+12,Ada98, AD98, AAC+05, Ano93d, Ano94d,Ano95c, Ano00a, Ano00b, BBG+14, BL97,BvdSvD95, Bjo95, Bru95, BDW97, BFIM99,CGJ+00, CDZ+98, CRD99, CD01, CG99b,DKF93, DM93, DKD05, DS96b, DHHW93b,DOSW96, DLM99, DKP00, DLO03, FK94,GL92, HP05, HPY+93]. message[Hem96, KJA+93, Kra02, LR06a, LBD+96,wL94, LCY96, LMM+15, LC97b, NS91,PS07, PKB06, Pie94, PR94a, PS00b, Sei99,SWJ95, SDV+95, SZ99, SSG95, Sti94,TSZC94, VM95, Wal94a, Wal94b, ZKRA14,

ZA14, AMHC11, BC14, BBH+06, BRU05,BDH+95, Cot04, DKD08, DiN96, FKS96,FGT96, FGG+98, GGHL+96, GLDS96,GLT99, GLS99, GLT00a, GL04, Han98,IBC+10, KTF03, KKD05, LK10, MTSS94,MSL96, PS01b, RRFH96, SWHP05, SLG95,SWL+01, TGT05, TDB00, Wer95, YGH+14].Message-Passing [Att96, Cot97, Cot98,DHHW92, DDL00, GLS94, GL95c, GLT00b,MPI98, PBK00, Pok96, RRBL01, AAC+05,Ano94d, Ano95c, Ano00a, Ano00b,BvdSvD95, CDZ+98, GL92, Hem96,KJA+93, LR06a, LBD+96, wL94, LMM+15,PS00b, SSG95, Sti94, DiN96, GGHL+96,Han98, RRFH96, SLG95, Wer95, YGH+14].Message-Passing-Interface [Wer95].MessagePassing [Sei99]. Messages[KBS04, SKH96]. Messaging[HEH98, KC94]. Meta[BCLN97, FBD01a, FGRD01].Meta-Applications [BCLN97].Meta-computing [FBD01a, FGRD01].Metacomputer [OS97]. Metacomputing[Fin00, MSF00, MS99b, FBVD02].MetaHaskell [Mai12]. metaheuristics[ZSK15]. metal [JLS+14]. MetaMP[OW92]. metaprogramming [Mai12].meteorological [RSBT95]. Meteorology[HK93, HK95]. Method [ACMR14, BP99,BJS97, CGU12, FCLG07, GSI97, HC06,KMK16, OMK09, Riz17, TSS00a, ARYT17,BBDH14, BCM+16, DSOF11, ETV94, HE13,HMKV94, HJBB14, HPLT99, JMS14,KS15a, KD12, LCL+12, Nak05b, NS16,PTT94, Pri14, Qu95, SHHC18, TKP15,YBZL03, dlAMCFN12, AAB+17].Methodologies [Sun94b]. Methodology[MOL05, WTTH17, HPR+95, LM94,WMP14]. Methods [BCMR00, CMK00,DFN12, EGH+14, FGKT97, GFPG12,KLR+15, kL11, NA01, Sch01, SM07,TDBEE11, Whi04, ZB97, CEGS07, DF17,D+95, Gra09, Has95, LSR95, MM11, Nak05a,PGK+10, R+92, SL94a, SGS95]. Metrics

36

[DW02, PARB14]. Metropolis [HJBB14].Mexico [IEE91, Sie94]. MGCG [TSS00a].MGF [GLM+08]. MIAOW [BGG+15].MIC [CCBPGA15]. MICE [BK96]. Micro[Ano03, BWV+12, SGH12, YSWY14].Micro-applications [SGH12].Micro-Benchmark [BWV+12, YSWY14].microbenchmark [BO01]. Microcoded[PWP+16]. microtask [OIS+06]. MIDAS[BFZ97]. Middleware[AUR01, CLL03, CC10]. Middlewares[DPP01]. Midpoint [JMS14]. Migol [LS08].Migratable [KOW97]. Migrating[VSRC94, VSRC95, IvdLH+00, KBG+09].Migration [Ano94b, CCK+95, CLL03,CML04, CCBPGA15, CTK01, NPP+00c,NLRH07, Ott94, OS97, ST97, AMBG93,BBGL96, CKO+94, CRM14, CRGM16,CK99, DDYM99, HZ99, LCVD94b, LM13,RRFH96, SSS99, SCL97, Ste96]. Milan[HS95a]. million [LHLK10]. Millions[BBG+11]. MIMD[BvdB94, BB93, BCL00, Uhl95a, WST95].MIMD/DMMP [BB93]. MiMPI [GCC99].MINIME [DS16]. MINIME-GPU [DS16].minimization [POL99]. Minimum[KA95, Wu99, NCKB12]. mining [MA09].Mississippi[IEE94f, IEE95j, IEE94f, IEE95j].mitigating [OdSSP12]. Mitigation[BBH. . . 13a]. Mitsubishi [Ano03]. mittels[Wil94]. Mixed [ASA97, BEG+10, CF01,OPP00, ST02a, MRH+96, SK00, SB01].Mixed-Mode [BEG+10]. Mixing[CP98, GAP97]. mixture [EO15]. MK[NS91]. mm par2.0 [OKM12]. MN[Ano94h]. Mob [STV97]. Mobile [ITT02].Mode [BGK08, Bri02, BEG+10, LRT07,SB01, YX95]. Model[AP96, BGG+02, BdS07, CKmWH16, Cha02,CZG+08, Dar01, DFA+09, FSXZ14, FBSN01,GLB00, GLRS01, HLP11, KD12, LGG16,LA02, LRQ01, MKW11, NSLV16, NO02b,Ran05, RSV+05, RRBL01, SPM+10, SB95,

THN00, VT97, Wal01a, AL93, BSC99, Bir94,BG94b, BDV03, CMV+94, CL93, CKP+93,ED94, GKZ12, GCN+10, GkLyCY97,GWVP+14, GRTZ10, HPLT99, HK09,HK10, KOS+95a, KSL+12, LR06b, LA06,LLH+14, Mar05, MdSAS+18, MSZG17,MGC+15, NO02a, Nak05a, PAdS+17,RAS16, RGDML16, RCG95, Sch93, SH94,Sch99, SMAC08, Str94, VBLvdG08, Vis95,Wan02, WC15, WYLC12, YX95, TA14].Model-Based [AP96, LGG16]. Modeling[ACM96a, ATM01, BS07, CSC96, CDM93,FST98a, GAM+02, MOL05, NM95,RGDM15, SEF+16, TD99, VFD02, XH96,BDP+10, Bic95, KM10, KME09, KEGM10,MS99a, XXL13, YMYI11]. Modelling[FST98b, GC05, Ham95a, KDL+95b, BJS99,HTHD99, KDL+95a, MSML10]. Models[AKK+94, BS93, BZ97, CMK00, Cer99,CNM11, DK06, EMO+93, ESM+94, GJN97,PPF89, SS01, SMOE93, Whi04, BB95,CH96, Duv92, KO14, LV12, MCB05, Nes10,RSBT95, RBAI17, SYR+09, Wal00,WBSC17]. moderate [Uhl95a]. Modern[AHHP17, DARG13, KDT+12, LNK+15,SM07, HH14, PMZM16]. modes [WZWS08].Modified [Riz17, GP95, KD12]. Modular[CT02, HPP02, FWS+17, HLM+17].modulator [WWZ+96]. modulator/DFB[WWZ+96]. Module [Ano98]. Modules[AKK+94, DS96b]. modules-design[DS96b]. Molecular [ABG+96, BST+13,BCGL97, BL95, BS07, DR97, DI02, KBM97,LAFA15, MH01, SA93, YWCF15, ZB94,BvdSvD95, BBK+94, BMPZ94b, BMPZ94a,CC00b, DCD+14, FHSO99, JAT97, JMS14,KFA96, KRG13, LSVMW08, OKM12,PARB14, SL95, ZWL13]. molecule [ART17].Møller [BL95, KN17]. Monito [SGL+00].Monitor [KRS99, Whi94]. Monitoring[AH00, BCLN97, Beg93b, BFM96,BFMT96b, CD98, DBK+09, GSN+01, LY93,LW97, MWG97, MVY95, SGL+00, UP01,Wis98, Wis01, Yan94, Beg92, Beg93c, Beg93a,

37

BB94, BS96a, BFMT96a, FLB+05, LC07].Monodomain [ORA12]. Monte [HJBB14,RP95, WH96, ADRCT98, AK99, DAK98,NSLV16, RR00, SK00, SKM15, ZZ04].Monterey [Ano89, Gat95, USE94].Montpellier [DE91]. Montreal [Lev95].MOPS [GJN97]. Morehouse [AGH+95].Morgan [SD13]. Morton [LZH18].MOSIX [BBGL96]. motif [FMS15].motors [SKM15]. movement [MV17].Moving [HAA+11, LSG12]. MPE[GKL95, KFA96]. MPEG [NU05].MPEG-4 [NU05]. MPI [ARYT17, AD98,Ano95c, Ano99a, Ano99c, Ano99b, Ano99d,Ano00a, Ano00b, BDW97, CHD07, CHD09,CD01, CDND11, DKD05, DLM99, DKP00,DLO03, GBR97, GEW98, IEE96i, JMS14,KGRD10, Kra02, KKD04, LKD08,MTWD06, Nag05, Per97, PS01b, RWD09,RLVRGP12, ST02a, TDB00, TBD12, Vre04,WSN99, YM97, ST02b, ACGdT02, Ada97,Ada98, ACH+11, APJ+16, AASB08, ART17,ATM01, ACGR97, AK99, ABF+17, AHP01,ACMZR11, ALW+15, ADLL03a, ADLL03b,And98, FH98, AVA+16, Ano93d, Ano94d,Ano98, Ano01a, Ano03, AKE00, AKL99,AJF16, AIM97, ADR+05, AHHP17, Bad16,BV99, BCMR00, Bak98, BF98, BCFK99,BBG+10, BCG+10, BBG+11, BGBP01,BBS99, BBG+14, BA06, BCAD06, BADC07,BGR97a, BKGS02, Ben01, BW12, BHV12,BKH+13, BIL99, BIC05, BP98, BF01]. MPI[BBCR99, BBDH14, BK96, BKdSH01,Bha98, BfDA94, BHLS+95, BHS+02, Bis04,BBH. . . 13a, BBH+13b, BDB+13, BIC+10,BR04, BCM+16, BTC+17, BM00, Boo01,BBC+02, BCH+03, BHK+06, BBC+00,BS96b, BMR02, Bri02, BRM03, Bri10,BMPS03, BS07, BDL98, Bru95, BDH+95,BDH+97, Bru12, BLW98, BFBW01,BEG+10, BCH+08, BWV+12, CGC+02,CSW12, CGC+11, CwCW+11, CRE99,CE00, CRE01, CC10, CP98, CAHT17,CGJ+00, CFKL00, CSS95, CGBS+15,

CGG10, CB00, CDMS15, CBL10, Cha02,CEGS07, CDP99, CCA00, CFDL01, CLL03,CGZQ13, CC17, CSAGR98, CNC10, CC00a,CGH94, CCSM97, CFMR95, CDD+96,Coo95a, Coo95b, CFF+96, CRGM14,CRM14, CRGM16, CC99, CT02, CD96,CG99b, DPS05, DPSD08, Dan12, DSG17,DZ96, DZ98a, DR18, DW02, DLM+17,DZ98b, Dem96, DPP01, DLB07, DSW96].MPI [DS96a, DRUC12, DKD07, DI02,DL10, DCPJ12, DCPJ14, DAK98, DGG+12,DGB+14, DBB+16, HD02a, DXB96,DOSW95, DCH02, DBK+09, EZBA16,EGH99, EDSV09, ES11, FH97, FD96,FDG97a, FDG97b, FLD98, FD00, FBD01a,FBD01b, FGRD01, FBVD02, FD02a,FD02b, FD04, FCLG07, FB95, FB96, FB97,Fan98, FPY08, FFB99, FNSW99, FTVB00,FFP03, FMS15, FHK01, FKH02, FSC+11,FCS+12, Fin97, Fin94, Fin95, FWNK96,Fin00, FLB+05, FC05, FST98a, FST98b,FJK+17, FKK+96b, FKK96a, FGT96, Fos98,FHPS94a, FHPS94b, FHP+94, FHP+95,Fra95, FWR+95, FKLB08, FBSN01, FSLS98,GBR97, GFD03, GFD05, GDC15, GGGC99,GGCM99, Gao03, GBR15, GCGS98, GCC99,GCBL12, GGHL+96, Gei00, GR07, GGL+08,GJR09, GSI97, GBH14, GGS99, GR95,GLB00, Gle93, GM13, GT01, GBH99,GHZ12, GAVRRL17, GRRM99]. MPI[GAMR00, GKS+11, GB98, GMPD98,GPL+96, Gra97, GEW98, GBS+07,GLM+08, GL92, GL94, GLS94, GL95a,GL95b, GKL95, GL95c, GL96, GLDS96,GL97c, GL97b, GHLL+98, GL99, GLT99,GLS99, Gro00, GLT00b, GLT00a, Gro01a,Gro01b, Gro02a, GL02, Gro02b, GT07,GLT12, Gro12, GPC+17, GC05, GSY+13,Gua16, HJ98, HC10, Har94, Har95, HL17,Hat98, HO14, HD02b, HE02, Hem94, HZ96,Hem96, HRZ97, HZ99, HEH98, HGMW12,HMK09, HPS+12, HPS+13, Hin11, HRR+11,HDB+12, HDB+13, HDT+15, HKN+01,HLOC96, HKT+12, HVSC11, HWX+13,

38

HM01, HCA16, HG12, HcF05, Hus98, Hus00,Hus01, HWW97, IDS16, IRU01, ITKT00,ICC02, JF95, JDB+14, Jes93b, JJM+11,JS13, JNL+15, Jon96, JR10, JSH+05, KB01,KFA96, KS15a, KPW05, KW14, KWEF18,KD12, Kan12, KFL05, KB98, KK02a, KL94].MPI [KYL03, KYL05, KSJ95, KSJ96,KN17, KBS04, KGK+03, KHB+99, KBM97,KLR+15, KR09, KMG99, KEGM10, KRC17,KV98, KAC02, KC06, KBG16, KMH+14,KRG13, LK14, LAdS+15, LLRS02, LTDD14,LGM00, LRT07, LC97a, LR06b, LTRA02,Lee12, LZ97, LRW01, LPD+11, LLC13,LZH17, LZH18, kLCC+06, kLCCW07, kL11,LFL11, LS10, LCY96, LCW+03, LVP04,LWP04, LGG16, LYSS+16, LB96, LMG17,LCMG17, LNLE00, LO96, dLR04, LS08,LL01, LZC+02, LKJ03, LCC+03, LKYS04,LSK04, LLH+14, MBBD13, MMR99, MS02a,MS02b, MV17, MTK16, Man01, Man98,MLVS16, MLAV10, MKP+96, MSMC15,MSL12, MH01, MSL96, MS96a, MC98,MGG05, MAS06, MM02, MM03, MOL05,MCS00, MANR09, MRRP11, MG97,MMM13, MTW07, MK04, MCLD01,MMH98, MMH99, MS99c, MB00, MvWL+10,NAW+96, NO02b, NO02a, Nak05a]. MPI[Nak05b, NSBR07, NE98, NE01, Nes10,NSS12, NH95, NCB+12, NCB+17, NAJ99,NW98, Nit00, NHT02, NHT06, NFG+10,NN95, OM96, OLG+16, OKM12, OIS+06,OD01, OF00, Ong02, OP98, OL05, OMK09,Pac97, PARB14, Pan14, PK98, PES99,PLK+04, PSK08, PDY14, PS00a, PS01a,PHJM11, PTL+16, Per99, PZ12, PGK+10,PFG97, PLR02, PGAB+05, PGBF+07,PGAB+07, Pla02, PD11, PSSS01, PSK+10,PTH+01a, PTH+01b, PS00b, PTW99,QB12, Qui03, Rab98, Rab99, RDMB99,RR01, Ram07, RSBT95, Ran05, RA09,RAS16, RCFS96, RBB97a, RBB97b,RBB97c, RSPM98, RTH00, RH01, Reu01,RST02, Reu03, RGDM15, RGDML16,RNPM13, RPM+08, Roh00, Rol08b, RsT06,

RFRH96, RRG+99, RTRG+07, SE02,SCB14, SCB15, SPM+10, SSB+05, Sap97,SSB+16, SDJ17, SGH12]. MPI[SBF+04, SW12, SBG+02, SG05, Ser97,SS01, SWS+12, SG12, STY99, SM02, SM03,SP99, SZ11, SC04, SSC96, SS99, SZBS95a,SZBS95b, SDN99, SvL99, SJ02, SWJ95,SMTW96, SH96, SDB94, SLG95, SDV+95,SPH96, Slo05, SVC+11, SK00, SB01,SOHL+96, SOHL+98, SHHC18, SSL97,Squ03, Ste96, ST97, Sto98, SU96, Str96,Sum12, SN01, TOTH99, TAH+01, TSY99,TSY00, TKP15, Tha98, TGL02, TG09,TW01, TD99, Tra98, THRZ99, TRH00,Tra02b, Tra02a, TGT10, Tra12a, Tra12b,TMPJ01, TFGM02, Tsu07, TFZZ12, UTY02,URKG12, VFD02, VS00, VPS17, VSRC94,VSRC95, VGRS16, VdS00, VP00, VVD+09,WH96, Wal95, WO95, Wal96a, WD96, WO96,Wal01a, Wal01b, Wal00, WC09, WLNL03,WLNL06, Wer95, WST95, Whi04, WLR05,WWZ+96, Wis98, WB96, WM01, WADC99].MPI[Wor96, WRA02, WCS99, WT11, WYLC12,WT12, WLYC12, WMP14, XH96, XLW+09,YM97, YL09, YHL11, YWC11, YCL14,YBMCB14, YPAE09, YTH+12, YSP+05,Zah12, ZZ04, ZLZ+11, ZWZ05, ZLP17,ZLL+12, ZZ95, ZSnH01, ZKRA14, ZA14,bT01a, dlAMCFN12, KH96, Mar06, YM97,Ano96a, Ano99a, Ano99c, Ano99b, Ano99d].MPI-1 [SOHL+98]. MPI-2[Ano99c, Ano99d, Ano00a, AKL99, BCAD06,BHS+02, CwCW+11, CD96, DPSD08,GFD03, GGHL+96, GT01, GHLL+98,GLT99, GLT00b, GLT00a, HGMW12,LSK04, MS02a, MK04, PS00a, SS99, SSL97,TRH00, bT01a, BADC07]. MPI-3[GBH14, GLT12, HDT+15]. MPI-ACC[APJ+16]. MPI-Based[Ada97, FSC+11, RDMB99, SM03, Ada98,AVA+16, GKS+11, Gra97, LRW01, OLG+16,OP98, SZ11, TMPJ01]. MPI-basierte[Gra97]. MPI-benchmark [Reu01].

39

MPI-CHECK [LCC+03]. MPI-CUDA[DR18, dlAMCFN12]. MPI-DDL [FB97].MPI-Delphi [ACGdT02]. MPI-driven[Hin11]. MPI-F [FHPS94b, FHP+94].MPI-FM [LC97a]. MPI-FT [LNLE00].MPI-GLUE [Rab98]. MPI-Hybrid[CGC+11]. MPI-I [IRU01, Tsu07].MPI-I/O [IRU01, Tsu07].MPI-interoperable [YBMCB14]. MPI-IO[BIC+10, CGC+02, CFF+96, DL10,FWNK96, FSLS98, LRT07, LGG16, PSK08,PTH+01a, SW12, Sto98, TGL02, ZZ04].MPI-IO/GPFS [PTH+01a]. MPI-LAPI[BGBP01]. MPI-Level [LVP04]. MPI-like[CGJ+00]. MPI-only [LS10].MPI-OpenCL [JNL+15]. MPI-OpenMP[MS02b]. MPI-parallelized [KMG99].MPI-StarT [Hus98]. MPI-The[Ano99c, Ano99d]. MPI-thread [IDS16].MPI-Umgebung [GBR97]. MPI/CUDA[PHJM11]. MPI/GAMMA [CC00a].MPI/GPU [EZBA16]. MPI/GPU-code[EZBA16]. MPI/MBCF [MMH99].MPI/OpenMP[ADR+05, GAVRRL17, HKN+01, JR10,KS15a, KN17, KLR+15, KRG13, LLRS02,PZ12, SB01, WT11, WT12]. MPI/PVM[ES11]. MPI/RT [SKD+04]. MPI/RT-1.1[SKD+04]. MPI/SMPSs [MLAV10]. MPI1[Sti94]. MPI2 [MPI98, Wal96b]. MPI2007[MvWL+10]. MPI Allgather[GMdMBD+07]. MPI Connect [FGRD01].MPICH[BBC+02, BCH+03, BHK+06, Cot98, Cot04,GL97a, KTF03, LKJ03, OPM06, OF00,RFG+00, RsT06, SBG+02, TRG05].MPICH-CM [SBG+02]. MPICH-G2[Cot04, KTF03, OPM06]. MPICH-GQ[RFG+00]. MPICH-V [BBC+02, BHK+06].MPICH-V2 [BCH+03]. MPICH2[BMG07, Gro02b, ZSG12]. MPIConnect[FLD98]. mpicroscope [Tra12b]. mpiJava[BCFK99]. MPINE [Sou01]. MPIPOV[FFB99]. MPIT [HIP02]. MPIWiz

[XLW+09]. MPJ [CGJ+00]. MPL [XH96].MPL0* [CRD99]. MPP[CDJ95, DOSW96, GBR97]. MPP-Systeme[GBR97]. MPPs [BGR97a, RBB97a].MPSoC [KKJ+08, KH10, PSM+14].MPSoCs [MB12, NEM17]. MPVM[CCK+95]. MRI [LSSZ15]. MRO[MMM13]. MRO-MPI [MMM13]. Multi[Ada98, ABB+10, Bri10, BCKP00, CAWL17,CZG+08, DWL+10, EBKG01, FSXZ14,HD02b, HRZ97, JCH+08, JNL+15, KBA02,KT02, LM13, MG15, MB00, NMS+14, PZ12,RG18, RR02, Smi93a, ST02a, ST02b,SSB+17, WBH97, YGH+14, ACMZR11,AGMJ06, BCK+09, DCH02, DWL+12,Fin94, Fin95, FHB+13, HTA08, HE15, JR13,JJM+11, JR10, KSG13, KO14, LSG12, LS10,LLH+14, MALM95, NSM12, SCB15, SFSV13,SVC+11, SAP16, Str12, TS12b, TFZZ12,WCC+07, WO09, WADC99, WYLC12,ZAFAM16, ZWZ+95, ZZZ+15, SAP16, SG14].multi- [ACMZR11, KSG13].multi-/many-core [KSG13]. multi-agent[ZWZ+95]. Multi-agents [KBA02].Multi-cluster [ST02b, KO14]. Multi-Core[ABB+10, Bri10, CZG+08, YGH+14, PZ12,FHB+13, HTA08, JR13, JJM+11, JR10,LLH+14, SFSV13, SVC+11, TFZZ12,WCC+07, WYLC12]. multi-cores [WO09].multi-CPU [SAP16].multi-CPU/multi-GPU [SAP16].Multi-Dimensional [HD02b, KT02, RG18].multi-endpoint [LLH+14]. Multi-GPU[JNL+15, NMS+14, NSM12, TS12b, SAP16,SG14]. multi-kernel [SAP16]. Multi-level[CAWL17, LM13, HE15, MALM95, ZZZ+15].Multi-Network [BCKP00]. Multi-Node[HRZ97]. multi-petaflops [LSG12].multi-phase [ZAFAM16]. Multi-Physics[WBH97]. multi-place [BCK+09].Multi-platform [DWL+10, DWL+12].Multi-Processor [RR02, Smi93a, DCH02].multi-programming [WADC99].Multi-protocol [MB00]. multi-socket

40

[LS10]. Multi-Stage [FSXZ14].Multi-Threaded[MG15, Ada98, EBKG01, SCB15].multi-valued [Str12]. Multi-versioned[SSB+17]. multi-zonal [Fin94, Fin95].Multi-Zone [JCH+08, AGMJ06].Multiblock [IDD94, DLR94]. Multicast[CCA00, CDPM03, ZGN94]. Multicasting[SE02]. multicenter [CwCW+11].MultiCL [APBcF16]. multicomputer[SWJ95, TD99]. multicomputers[HWW97, Yan94, YX95]. Multiconference[Ten95]. Multicore[BDT08, CGC+11, CB16, DS16, KDT+12,LNK+15, WT12, CLYC16, GJLT11,HWX+13, JPOJ12, KN17, LS10, MBBD13,MM11, Nob08, OPW+12, PDY14, QB12,RGDML16, WCS+13, WT11, WLYC12,YHL11, YWC11, dlAMC11].multicore/many [MBBD13].multicore/many-core [MBBD13].multicores [UGT09]. multidestination[Pan95a]. multidimensional[CSW99, PDY14, ZT17]. multidisciplinary[Fin94, Fin95]. multifrontal [IM95].Multigrain [AZG17, IOK00]. Multigrid[BCMR00, AGIS94, IHM05, Lou95, Mic93,Mic95, PSLT99, RM99, Sta95a, ZZG+14].Multigroup [QRG95, QRMG96].Multilevel[PSSS01, BAV08, ETV94, GAM+00, JJY+03].multimedia [GFB+14]. multimethod[FGT96]. Multiobjective [RLVRGP12].Multiparadigm [FS98]. Multiphysics[NPS12]. Multiplatform [SMM+16].Multiple[BSG00, CB16, FGKT97, FBSN01, JPT14,JSH+05, LTR00, NTR16, Pet01, Tsu12,ZC10, ESB13, KGB+09, KKLL11, SHHC18].Multiple-Precision [ZC10, JPT14].Multiplication [AKL16, DS13, Fuj08,TQDL01, FAF16, FJZ+14, XXL13].Multipole [AAB+17, LCL+12, YBZL03].Multiported [SG15]. Multiprocessing

[MW93, VGS14]. Multiprocessor[Pet97, ABCI95a, ABCI95b, ADMV05].MultiProcessors[BDV03, CC99, HPP02, NPP+00d, SBW91,SS01, Tra98, JE95, KC06, SYR+09, AGIS94].multiprogrammed [TSY99].Multiprogramming [BHP+03].Multiprotocol [BHK+06]. Multirail[LVP04]. multiscale [CwCW+11].multiservice [CLLASPDP99]. multisource[ZDR04]. multistage [ZGN94]. Multistart[Cza13]. multitasking [FH95].multithread [GCC99, SWYC94, ZG98].multithread-safe [GCC99].Multithreaded [AZG17, DGG+12, PS01b,RBAA05, TGBS05, WJ12, DSG17, TMC09,TG09, WCC+07]. Multithreading[BBG+10, ZWL13]. Munich[BDLS96, GH94]. Mushy [Wit16]. MUST[HPS+12, HPS+13]. mutual [She95].MVICH [OF00]. Myocardial [Pat93].Myrinet [GBH99, CDP99, JSH+05,LCW+03, PTW99, Tou00].

n [Pan95a, ADB94, RTRG+07]. N-body[ADB94, RTRG+07]. n-cube [Pan95a].NAG [DHP97, For95, McD96]. NAMD[PZKK02]. Naming [MSF00]. Nancy[BR95a]. NanosCompiler [GAM+00].Narrow [YSS+17]. NAS[CRE99, CE00, CCF+94, CDD+96, KS96,KAC02, MMH99, WAS95b, WT11, WT12].NASA [MAB05]. NASLU [PHJM11].National [Str94, BRST94]. Native [SZ99].NATO [KG93, TG94]. NATUG [Ara95].NATUG-7 [Ara95]. nature [DSM94].Navier [Che99, DLR94, HSMW94, IDD94,Lou95, SCC95]. NB [BG91]. NC[Agr95a, SL94a]. NCS [AL92]. nCUBE2[BL94]. Near [PKYW95]. Nearest [DI02].Nearest-Neighbor [DI02]. Nebelung[MFG+08]. NEC[GPL+96, HRZ97, TRH00]. Necessary[NPP+00b]. Needed [Gei00]. Neighbor

41

[DI02]. neighborhood [HS12]. Nekbone[GML+16]. Nemesis [BMG07]. Nesbet[BL95]. Nested[AHD12, BR12, BS01, DLRR99, GLP+00,HA10, MMS07, TTSY00, ZLP17, aMST07,AGMJ06, BS05, HSE+17, THH+05, YZ14].Nests [DMB16]. Net[CNM11, NE98, NE01, PES99].Net-Console [PES99]. Net-dbx[NE98, NE01]. netCDF [LkLC+03].Netherlands [DSZ94, Ano93e, Van95].Nets [Sou01, Str94]. Network[ACM98a, AR01, BDG+91b, BDG+93a,BCKP00, CZ95a, CDHL95, CSC96, DM95b,DM95a, DBA97, DFMD94, DGMS93,DGMJ93, EK97, Fer98b, Fis01, GS91b,GS92, Gei93a, GSxx, Hus98, ITT02, LB98,LH95, MSCW95, MANR09, OF00, OWSA95,TW01, AL92, AH95, AVA+16, BDG+92a,BDG+92c, BDG+94, BSvdG91, BJ95,Bon96, BBK+94, BID95, BFM96, Coe94,CLLASPDP99, Fer98a, GS91a, Gei93b,GK97, GHZ12, HBT95, HK94, HH95, IM95,KMC96, KMC97, KA95, LH98, LHD+94,LHD+95, MK94, MRH+96, POL99, PR94c,PTW99, Rag96, SEC15, SPK+12, TSS98,YS93, ZPLS96, GK97].Network-Balancing [DBA97].Network-Based[BDG+91b, GS92, BDG+92a, IM95].Network-Specific [DM95b, DM95a].network-topology-aware [SPK+12].Networked [FGKT97, GBD+94, Nov95,Per96, Ano95b, BMPZ94b, BMS94a,BMPZ94a, GM94, HS93, RRG+99].Networking [ACM97b, ACM98b, ACM00,ACM01, ACM04, Hol12, LCK11, CXB+12,GH94, HS95a, ITT99, LCHS96, MZK93].Networks[CSV12, CDM93, DDPR97, GFV99, GHL97,HHK94, HLCZ00, HIP02, LHHM96, Li96,LHZ98, MBES94, QMGR00, SG15, TQDL01,Tou00, VLO+08, WAS95b, BK11, BRS92,CZ95b, CFPS95, DG95, DZ98a, Jou94,

LR06a, LTLC94, LHD+94, LHD+95,NFG+10, Pan95a, TDB00, ZGN94]. Neural[AGH+95, CAM12, CSV12, QMGR00, Str94,GkLyCY97, Rag96]. Neurocomputing[PSZE00]. Neutron[LD01, RS97, VRS00, WR01, MM92].Nevada [Ano94e]. never [Har94]. Neville[ACMZR11]. Newport [IEE93b]. News[Ano97, Ano03, Bra97, ESB13, KS15a, Str94].Newton [ZB97]. Next [GKPS97, Gei98,Gei01, VPS17, SP11, ZKRA14].Next-Generation [VPS17, ZKRA14]. NFS[CGC+02]. NHPDCC [BRST94]. NIC[MFPP03]. NIC-based [MFPP03]. Nice[ACM90]. nineteenth [IEE95l]. Ninth[ERS96, R+92]. NIST [SNMP10]. Nitzberg[Ano99c, Ano99d]. NLP [VB99]. NM[IEE95d, Old02]. NoC [HWX+13].NoC-based [HWX+13]. Node[HRZ97, KFL05, FKLB08, GM13, JR10,LFL11, Zah12]. Nodes [BBC+02, BCH+03,DBK+09, JNL+15, MKC+12]. Noise[SAL+17]. Non[BCG+10, CCSM97, Gua16, HTA08, MW98,Man01, WLNL03, WTR03, FH98, BCH+08,OKW95, OMK09, WLNL06]. Non-blocking[HTA08, FH98, BCH+08]. Non-Contiguous[WTR03]. Non-Data-Communication[BCG+10]. non-dedicated [WLNL06].non-iterative [OMK09]. Non-linear[MW98, OKW95]. Non-Local [CCSM97].Non-persistent [Man01]. Non-stop[Gua16]. nonaligned [AGIS94].Noncontiguous [JDB+14, TGL02].Nondeterminacy [DKF93].nondeterminism [Obe96].Nondeterministic [KSV01, CRD99].Nonlinear[Nak03, Was95a, ZB97, CEGS07, Jou94].nonnegative [KBP16]. nonsymmetric[dH94]. Nordic [FF95]. Norfolk [Sin93].normalized [Gra09]. North [CJNW95].Note [BR02, SGHL01]. Notre [IEE96i].novel [DDYM99, GKK09, MLVS16, MSL12].

42

November [ACM96c, ACM97b, ACM98b,ACM99, ACM00, ACM01, ACM03, ACM04,ACM05, Ano94c, ACDR94, BDW97, GN95,HK95, Hol12, IEE91, IEE93e, IEE94b,IEE94h, IEE02, LCK11, USE94]. novice[CGG10]. Novices [Stp02]. NOWs[SLGZ99]. NP [YZ14]. NPACI [PKB01].NPB [EGC02]. NR [Gua16]. NR-MPI[Gua16]. NRC [LD01]. NSGA[GAVRRL17]. NSW [GN95]. NT[Ano01a, Bak98, BF98, CLP+99, FD97,GGGC99, PS00a, SFG98, TAH+01].NTRUEncrypt [KY10]. NTUG [FF95].Nuclear [BPG94, GA96]. nuclei [NS16].NUMA [BCC+00a, BCC+00b, BFG+10,CAWL17, GTS+15, MKC+12, MJB15,OPW+12, SLN+12, TSCaM12, ZLP17].NumaGiC [GTS+15]. Number[BP99, HT08, WHDB05, Lan09]. Numerical[ACMR14, BS93, BCP+97, CSW97, DHK97,DHP97, FK01, For95, FB94, HH14, Hus98,IFI95, KM10, Kha13, McD96, NHT02,PKYW95, TDBEE11, YKLD17, AL92, Boi97,BCM+16, CSW99, FP92, GS94, JK10, KB13,Nob08, NHT06, Pri14, SMAC08, SU96].Numerically [BKML95, BFLL99]. nur[BL94]. Nutzung [GEW98]. NVIDIA[KME09, Seg10, XXL13, KKM15, Lan09].NX [Pie94, PR94a]. NY[IEE96f, PBG+95, Ree96, SS96].

O [Bos96, CFF+96, DRUC12, IRU01,IBC+10, LkLC+03, kLCC+06, MV17,MGC12, MG15, PSK08, PLR02, RK01,SBQZ14, Tha98, Tsu07, WSN99]. O2000[CML04]. O2WebCL [CHKK15].Oberammergau [BPG94]. Object[Ada97, BCFK99, CFKL00, FMSG17,MSL96, PD98, SWL+01, YHGL01, YX95,Ada98, BR91, DM12, LKL96, OKM12,RFH+95, SL94b, TDG13]. object-based[LKL96]. Object-Oriented[BCFK99, PD98, SWL+01, Ada98, DM12,OKM12, RFH+95]. Objects

[KH15, Man01, MFC98, HS93, SOA11, SC95,YWO95, ZPLS96]. Oblivious[LZH17, LZH18, UALK17, HSP+13].observations [ZKRA14]. observed[CAHT17]. Occam [ACDR94, GN95, MC94,EM94, SHH94a, SHH94b]. Ocean[BS93, GAM+02, Bic95, Mal01, Nes10,Sch99, Wal00]. Oceans [IEE94c, IEE94c].OCLoptimizer [FAFD15]. OCM[BoFBW00]. OCM-Based [BoFBW00].October [Ano93e, Ano94e, Ano94i, Ara95,BPG94, Bha93, BDLS96, CHD07, CGB+10,DSM94, DLO03, DE91, FK95, GGK+93,IEE94f, IEE95a, IEE95g, IEE95j, IEE96b,IEE96c, IFI95, JB96, Kra02, Old02, OL05,Sch93, Sie92a, Sie92b, Tou96, USE00,UCW95, Vol93]. ODE [Ano97, Bra97].ODEs [Pet97]. OdinMP [BB00].OdinMP/CCp [BB00]. Offering [EK97].Official [Ano98]. Offload [BRU05].Offloading [MGA+17, DSGS17, KBG16].oft [Rol08a]. Oil [FSXZ14, ZAFAM16].OKs [Ano03]. old [LK14]. OMB[BWV+12]. OMB-GPU [BWV+12]. OMIS[LW97]. Omni [KSS00, KSHS01].OmniRPC [SHTS01]. OMP [SGJ+03].OMP2001 [TSB03]. OMP2012 [MBB+12].OMPI [ACH+11, OM96]. OmpSs[ABF+17, YAJG+15]. on-chip [TDG13].On-Demand [CTK00]. On-Line[BoFBW00, Wis98]. On-the-fly [KSJ14].ONC [RS93]. One [BPS01, GFD03, GFD05,GBH14, GT01, HDB+12, LRT07, MH01,TGT05, TRH00, ZSG12, bT01a, DBB+16,LSK04, MS99c, Ols95, PGK+10, dlAMC11].one-dimensional [Ols95]. one-layer[dlAMC11]. One-Sided[BPS01, GFD03, GFD05, GT01, HDB+12,LRT07, MH01, TGT05, TRH00, ZSG12,bT01a, DBB+16, LSK04, MS99c, PGK+10].only [LS10, Squ03]. Ontario [GGK+93].onto [OFA+15]. OOMPI [MSL96]. OOPS[RFH+95]. OPAL [CwCW+11, NW98].OPAL-MPI [NW98]. opaque [SOA11].

43

Open [BGG+15, KDL+95b, AVA+16,KDL+95a, Nob08, GBS+07, VGRS16].Open-Source [BGG+15, AVA+16, Nob08].OpenACC[CCBPGA15, GML+16, JCP15, LSG12].OpenCL [ABDP15, APBcF16, AB13,BLPP13, BDW16, BN12, BHW+12,BBH+15, BAS13, CDD+13, CP15, CIJ+10,CHKK15, CCK12, CS14, DARG13, Di 14,DWL+10, DWL+12, FAFD15, FLMR17,FE17, FSV14, FVLS15, GScFM13, HD11,HE15, HHC+18, JSS+15, JKM+17, JR13,JNL+15, JMdVG+17, KKM15, KH12, KM10,KKLL11, KSL+12, KJJ+16, KB13, KPK13,Lee12, LNK+15, LL16, LAFA15, MC17,MAIVAH14, MTU+15, MSZG17, MHSK16,ON12, ORA12, PCY14, PHW+13, PB12,RG18, RGD13, RBB15, RBB17, SFSV13,SAP16, SSB+17, SG14, SFLD15, SGS10,Str12, THS+15, TK16, TMW17, TKP15,TY14, WTTH17, WZHZ16, YSWY14,YWTC15, YSL+12, ZWL+17, ZT17, dAT17].OpenCL-accelerated [ZWL+17].OpenCL-Based[WTTH17, WZHZ16, JKM+17].OpenCL-to-WebCL [CHKK15]. OpenGL[Ano98, LHZ97, ORA12]. openMosix[Slo05]. OpenMP [Cha05, CZG+08,CGKM11, CMMR12, EV01, JMS14,MdSC09, SHM+10, Vos03, OKM12, ST02a,ST02b, Add01, ARvW03, ABC+00, AHD12,AAB+17, AELGE16, ACMZR11, ATL+12,ADT14, ACJ12, Ano97, Ano01b, Ano03,AKE00, ADMV05, ADR+05, AGMJ06,AM07, ACD+09, ABB+10, BST+13, BR02,BHP+03, BME02, BN00, BF01, BBDH14,BWW+12, BCC+00a, BCC+00b, BGK08,BGG+02, BS01, BS05, BBC+00, Bra97,Bri00, BDV03, BdS07, BGdS09, BFG+10,BGD12, BC00, BS07, BB00, BKO00, BO01,BEG+10, CRE99, CE00, Car07, CB00,CGLD01, CDK+01, CLYC16, CM98,CHPP01, CBPP02, Cha02, CM05, CGKM11,CMMR12, Cla98, CCM+06, CCBPGA15,

CC00b, DM98, DW02, DBVF01, DSGS17,HD02a, DFC+07, DFA+09, ETWaM12,EM00a, EM00b, EV01, EdS08, FGRT00,FMSG17, FSXZ14, FM09, GSA08, GJP01].OpenMP[GSMK17, GG09, Goe02, GAVRRL17,GAM+00, GAML01, GOM+01, GAM+02,Gra09, HPP02, HP05, HDDG09, HA10,HO14, HD02b, HMK09, HASnP00, HKN+01,HAJK01, HVSC11, HLCZ00, HT01, HCL05,HEHC09, HJYC10, HAA+11, IJM+05,ICC02, IOK00, ITT02, JCP15, JKHK08,JPOJ12, JFY00, JJY+03, JCH+08, JJM+11,JR10, KB01, KS15a, KOB01, KaM10, KOI01,KN17, KKH03, KT02, KSJ14, KLR+15,KBVP07, KBG+09, KKV01, KT10, KH15,KAC02, KC06, Kuh98, KPO00, KRG13,KSS00, KSHS01, KJEM12, LOHA01, LP00,LLRS02, LD01, LME09, LLC13, LHC+07,LNW+12, LYSS+16, LA02, LA06, LMRG14,LHZ98, LL01, LLH+14, MKC+12, MS02b,Mal01, MM07, MB12, Mar02, Mar03,MLC04, Mar05, Mar09, MPD04, MCB05,Mat00a, Mat00b, Mat01a, Mat03, MGG05,MGC12, MG15, MM11, MFG+08, MKV+01].OpenMP [MBE03, MRRP11, MMSW02,MKW11, MM14, MMS07, MJB15, MJPB16,MCdS+08, Mul01, Mul02, Mul03, MBB+12,NO02b, Nak05a, NIO+02, NIO+03, NEM17,NPP+00b, NPP+00c, NPP+00a, NPP+00d,NAAL01, NA01, NNON00, Nob08, NU05,NHT02, NHT06, OOS+08, OP10, OPW+12,PARB14, PPJ01, PVKE01, PK05, PZ12,PGC02, PKE+10, Qui03, Ran05, RDLQ12,RLVRGP12, RBAA05, SSE12, SSB+16,SHHI01, SHTS01, SKS01, SLGZ99, SGZ00,SPL+12, SHPT00, SSAS12, SK00, SB01,Stp02, TBS12, TS12a, TSB02, TTSY00,TSS00a, TSCaM12, TJPF12, Thr99,TBG+02, THH+05, TGBS05, VDL+15,VPS17, VGS14, Vos03, Vre04, Wal00, Wal02,Wan02, WCC12, WC15, WPC07, WT11,WYLC12, WT12, WLYC12, YHL11,YWC11, YCL14, YKLD17, YPAE09,

44

YSVM+16, YSMA+17, YYW+12, ZAT+07,ZSnH01, aMST07, dCZG06, RM99, SSGF00,WCS+13]. OpenMP* [KDT+12].OpenMP-based [LNW+12].OpenMP-like [BKO00, KOB01, VGS14].OpenMP-oriented [MLC04].OpenMP-style [JPOJ12]. OpenMP/MPI[BEG+10, HMK09, LLC13, LYSS+16,MGG05, NO02b, Nak05a, SSB+16, SK00].OpenSHMEM [HVA+16]. OpenTuner[BAG17]. OpenUH [HEHC09, LHC+07].Operating [MMH98, RGD97, USE94,Wil93, ARS89, Sei99]. operational[KOS+95a]. Operations[BIL99, BIC05, CCA00, FCLG07, FPY08,GFD05, GLB00, PSM+14, PGAB+05,TRG05, TGT05, WRA02, BMG07, DS13,IDS16, KHB+99, KMH+14, PGAB+07,PKD95, SS99, TFZZ12]. Operators[NHT02, NHT06]. opportunistic [CC10].optical [MRH+96]. Optimal[BP99, GAMR00, ZGN94, BB95, ER12,PQ07, PTL+16, Sur95a]. optimiertes[Sei99]. optimisation [AMuHK15].Optimising [Boo01, FKH02]. Optimistic[SCL00, CXB+12, PY95]. Optimization[BSG00, BHNW01, DBA97, Goe02, HS12,Hus00, ITT02, KGK+03, KMH+14, MC17,MBS15, Mul01, NIO+02, NIO+03, PSSS01,SM03, SvL99, SWH15, TRG05, WTTH17,WJ12, Cou93, DSOF11, FCS+12, HWS09,KHS12, LME09, LDJK13, MALM95, PP16,PMM95, SKS01, SDJ17, Str12, TMW17,TFZZ12, VSW+13, Was96, XXL13].Optimizations [NSLV16, SSE12, iSYS12,TSS00a, BVML12, HEHC09, LL16, MV17].optimize [WLYC12]. Optimized[AKL16, Bri02, FAFD15, MAIVAH14, PM95,PTH+01a, THS+15, WJB14, BKvH+14,MMM13, Sei99]. optimizer[BHRS08, Rag96]. Optimizing[BGH+05, CXB+12, FMFM15, KKP01,MBE03, NSZS13, OM96, SSAS12, TGL02,TGT05, GS02, LHC+07, RKBA+13].

Options [RR00]. Orange [ACM98b]. orbit[SSN94]. Order [BL95, DFN12, LZH18,KN17, KME09, KEGM10, KB13, MYB16].ordering [Zah12]. ordinary[NF94, RBB15, SP11]. Oregon[ACM99, IEE93e, SW91]. Organization[BPC94, JFGRF12]. Oriented[Ada97, BCFK99, FMSG17, MSL96, PD98,YHGL01, Ada98, BR91, DM12, MGC+15,OKM12, RFH+95, SWL+01, MLC04].Origin [LL01, LSK04, ZSnH01].Origin2000 [Bri00, MH01]. original[RNPM13]. Orlando [ACM98b]. Orleans[IEE96b, USE95]. ORNL [Bor99]. OSCAR[IOK00, Slo05]. oscillator [BJ13, GSMK17].OSDI [USE94]. OSF [Sch93]. Other[OP10]. OtOt [DKF94b]. Otto[Ano96a, Ano99a, Ano99b, Nag05].out-of-core [BL99]. Output[CFF+94, HE02, JWB96]. Outstanding[LSB15]. Overcoming [JKHK08].Overhauling [BDW16]. Overhead[BR02, FST98a, XH96, CRGM16, KC94,KRS99, ZRQA11]. Overheads[BCG+10, BGdS09, BCM11, SS94]. Overlap[BRU05, DCPJ12, DCPJ14, MLAV10,PSK08, SH14]. Overlapped [GPC+17].Overlapping [KB01, kLCC+06, PKE+10,BBH+15, MMM13]. overlay [CXB+12].overlay-based [CXB+12]. Overview[CFF+96, Gre95, GL95c, Zol93, GHZ12,GPL+96, Wer95]. OWL [JKN+13].Ownership [FHB+13]. Oxford [Boi97].

P [CAM12, WHDB05]. P-RnaPredict[WHDB05]. P03M [BJ93]. P2P[GR07, GGL+08, GJR09, SBG+02].P2P-MPI [GGL+08, GJR09]. P4[KS96, Mat94, Mat95]. PA[ACM04, Ham95a, ACM96c]. Pablo[BFMT96a, BFMT96b]. Pablo-based[BFMT96a, BFMT96b]. Pacific [IEE95e].Package [BS93, KCP+94b, KOW97, LW95,OD01, SYF96, van97, BHW+12, BBH+15,

45

CwCW+11, Gao03, KCP+94a, LFS93a,LFS93b, SL95]. Packet [MBES94]. Packets[Uhl94, Uhl95b]. PaCT [Mal95]. PaCT-95[Mal95]. PACX [FGRD01, KR09, RBB97b].PACX-MPI [KR09, RBB97b]. Page[CML04, NPP+00c]. pages[Ano95b, Ano95c, Ano96a, Ano99a, Ano99c,Ano99b, Ano99d, Ano00a, Ano00b]. Pagoda[YSS+17]. pairwise [AMHC11]. Palazzo[GT94]. PALLAS [KVH97]. Papers[BDB+13, OL05, TB14, ACM90, CHD09,DKD07, IEE93a, IEE95c, KKDV03,MTW07, Old02, Ano93f, Cha05]. PARA[DW94, DMW96, Was96, CD96].parabolized [SCC95]. ParADE [KKH03].Paradigm [HIP02]. Paradigms[BGD12, CM98, HD02a, HD02b]. Paradyn[MHC94a, MHC94b]. Paragon[Ano96c, HWW97, MP95, PR94a]. Parallel[ACM95b, Ada97, ATC94, Agr95a,AMHC11, AGH+95, AS92, ADRCT98,AK99, AMBG93, ASA97, AL96, AP96,Ano95b, ACMR14, AB93a, AJF16, BHM94,BJ93, BBG+95, BCGL97, BFLL99, BP99,BG95, BS93, BDG+91a, BKGS02, Ben01,BP98, Bha93, Bic95, BGK08, Bis04,BALU95, BCL00, BSG00, BBC+00,BBG+01, BFZ97, BDL98, BDH+95,BDH+97, BT01b, BMS94b, BMPZ94a,BFM97, BKO00, BBH12, BGL00, CGC+02,CHD07, Cer99, CDZ+98, CCU95, CDK+01,Cha02, CGB+10, CNC10, CFF+94, CSW97,CMH99, CFPS95, CCSM97, Coo95b, CT94a,CT94b, CC00b, Cze16, DSM94, DERC01,DYN+06, DK13, Di 14, DI02, DSS00, D+91,DKM+92, DGMJ93, DT94, DZDR95, DK06,EKTB99, EGR15, EM00a, EM00b, EGDK92,EJL92, ES11, FGRD01, FHSO99, FJBB+00,FFP03, Fer98b, FHK01, Fis01, For95, FP92].Parallel [FB94, FS93, FF95, GCBM97,GLN+08, GBD+94, GKP97, GR07, GSI97,GSMK17, GB98, GHL97, GK10, GFPG12,GJN97, Gre94, GLS94, GL97a, GLS99,GkLyCY97, HJ98, HLP10, HO14, HK94,

HK93, HK95, HHK94, HT01, HAA+11,IEE93b, IEE94a, IEE94f, IEE95h, IEE95f,IEE95g, IEE95j, IEE96b, IEE96c, IEE96g,IEE96e, IEE96d, IEE97b, IEE05, ITKT00,IBC+10, IOK00, IDD94, IH04, IHM05,JAT97, JML01, Jou94, JRM+94, KFA96,Kan12, KK02a, KOI01, KNT02, Kat93,KBS04, Kep05, KR09, Kon00, KKP01,KMC96, KMC97, KS96, KKDV03, KKD04,KS01, KVH97, KHS01, Kuh98, KBG16,Kum94, Lad04, LTDD14, LTR00, LKD08,LSZL02, LTRA02, LHHM96, Li96, LZ97,LHZ97, kLCC+06, LO96, Lus00, MSOGR01,MS02b, MM92, MWG97, dlFMBdlFM02,Mar06, Mar07, MFTB95, MSCW95, Mat94].Parallel[Mat95, MBS15, MGC12, MG15, MRB17,MM11, Mic93, Mic95, MTWD06, MCLD01,MS95, MCdS+08, MBB+12, MSB97, NO02b,NO02a, Nak03, Nak05a, Nak05b, NSZS13,Nar95, NSS12, NAJ99, NJ01, Nov95, Oed93,OP10, OLG01, Ong02, Ott93, OWSA95,Pac97, PPT96a, PVKE01, Pat93, PSZE00,PV97, Per99, Per96, PLR02, PKB+16,PBC+01, Qui03, RR00, RDMB99, RBS94,Ree96, RS95, RC97, RSV+05, Roh00, Rol94,RWD09, RTL99, RLL01, SCP97, SPE95,SGZ00, Sch01, Sch96a, Sch96b, Seg10, Ser97,Sev98, She95, SM03, SP99, Sie94, Sie92a,Sie92b, Sin93, STV97, SWH15, Sou01,Sta95b, Ste94, SSN94, SGS10, Str96, Str97,Str94, SNMP10, Sun90a, Sun90b, Sun94a,Syd94, TMP16, TSS00b, TTP97, TC94,TCP15, TQDL01, THN00, TDBEE11].Parallel[Tsu07, TVV96, Uhl94, Uhl95b, UH96,UCW95, VLO+08, VRS00, VB99, WH96,Wal01a, Wel94, WAS95b, WHDB05, WO97,WSN99, WTR03, WT12, YM97, YHGL01,YH96, YPA94, YG96, YTH+12, YZPC95,YSL+12, ZB94, ZZ04, ZDR04, ZWJK05,ZAT+07, ZLS+15, ZZZ+15, ZGC94, ZB97,van97, ACM97a, ARvW03, APBcF16,ART17, AAAA16, AD98, AL92, ABF+17,

46

ASCS95, ADT14, AD95, ACJ12, Ano93g,Ano95c, Ano00b, ADB94, ADDR95, AB93b,AFST95, AB13, AGIS94, ADMV05, BHJ96,BBB+94, BR91, BA06, BHS18, BB95,BCAD06, BB93, BDG+92b, BB94, BPC94,Ben95, BvdSvD95, BKH+13, BAV08, BN00,Bir94, BCM+16, BKML95, Bos96, BFMR96,BID95, Bri95, Bru95, BDW97, BSH15,CARB10, CL93, CGK11, Cav93, CLdJ+15,CLSP07, CT13, CLYC16, CKmWH16,Cha05, Cha96, CGL+93, CEGS07, CH94].parallel[CZ96, Che99, CIJ+10, CS96, CSW99, Cla98,CEF+95, CDD+96, CdGM96, CBHH94,Coo95a, CCHW03, CLLASPDP99, CFF+96,CPR+95, CD01, CDH+94, CKP+93, CB11,DKF93, DKF94b, DR18, DLR94, DLRR99,DDS+94, DR94, DSZ94, DM93, DRUC12,DBVF01, DKD05, DvdLVS94, DXB96,DMW96, DLM99, DKP00, DLO03, Duv92,DZZY94, EASS95, EV01, FB96, FFB99,FM90, FO94, FSTG99, Fer98a, FMS15,FCS+12, FKK+96b, FFM11, FHC+95,GG99, GCN+10, GGL+08, GBF95, GG09,GFB+14, GAVRRL17, GKS+11, GEW98,GKK09, GKCF13, Gra09, GP95, HAM95b,HPY+93, HWS09, Heb93, HPS+96, HZ94,HZ99, HPLT99, HDB+13, HVSH95, HH95,HLOC96, HVSC11, HLO+16, IEE97a, IM95,JWB96, JC17, JY95, JJM+11, JC96,JMdVG+17, KCD+97, KOB01, KBP16,KN17, KOS+95a, KmWH10, KL95, Kos95b].parallel [KRC17, KG93, KFSS94, Kra02,KKJ+08, KH10, LM99, LCL+12, LH98,LS10, LCVD94a, LMM+15, Lou95, LG93,LM13, LL95, LC97b, LSR95, MMR99,MYB16, MMB+94, MZK93, MM95, Mar05,MSP93, MK00, MN91, MHC94a, MRRP11,MALM95, MLA+14, MRH+96, MMH99,Mor95, MC99, MR96, MvWL+10, NSBR07,Neu94, NB96, NBGS08, NCKB12, NF94,OdSSP12, Ols95, Olu14, OW92, PHA10,PPT96b, PPT96c, PKB06, PBG+95, PNV01,PBK99, PPF89, PY95, PBPT95, PSLT99,

PCS94, Ram07, RJC95, RBB15, Rol08b,RBB17, SJLM14, SSKF95, SH94, Sch94,Sch99, SPK96, SBF94, SWYC94, SK92,SCC96, SL00, SMAC08, SZ11, SPL99,SMS00, SVC+11, Smi93b, STT96, SH14,SRK+12, SLS96, Sta95a, Sti94, SMSW06,Sun95, Sur95a, Sut96, SL95, TJD09, TDB00,TMPJ01, Uhl95a, Uhl95c, VM95]. parallel[Vis95, Vos03, Wan97, Was96, Was95a,WK08a, WK08b, WK08c, Wol92, WT11,WYLC12, WLYC12, WMP14, YULMTS+17,YHL11, YWC11, YBZL03, YYW+12, ZL96,ZWHS95, ZAFAM16, ZWL13, ZWL+17,dH94, ARL+94, Ano94e, Ano94f, ACDR94,BDLS96, BS94, BG94b, Bos96, CC95, Cza13,DSM94, DHK97, DW94, EJL92, FR95,FF95, GN95, JPTE94, JPP95, KKD05,Kum94, LK10, LkLC+03, Mal95, MKP+96,OKW95, PQ07, QRG95, SSSS96, SPE95,Stp02, TDBEE11, TGEM09, Vol93, Vre04,WN10, YC98, ZPLS96, ZDR01, ZHS99].parallel-programming [KKJ+08].parallel/distributed [FHC+95, Wan97].parallele [GEW98]. paralleles [BL94].Parallelisation[SJK+17a, SJK+17b, WCVR96, LF93b].Parallelism[CGC+11, EdS08, EK97, FKKC96, GLP+00,GAM+02, GPC+17, DK02, KT02, Mar03,MGA+17, MMS07, MdSC09, RBAA05,SHM+10, SML17, SGZ00, TTSY00, Thr99,YPAE09, ATL+12, BK11, BR12, BS01, BS05,CCM12, GAM+00, HSP+13, HSE+17, HK09,JC17, JPOJ12, Kos95b, OPP00, RKBA+13,SLGZ99, SHPT00, THH+05, TWFO09,WO09, WTFO14, WRSY16, YZ14].Parallelization [AL93, And98, AIM97,BCM11, BS07, CRE99, CP97, Cou93, Cza03,ETV94, HA10, JR10, Kik93, KLR+15, LP00,OD01, Pok96, QMGR00, Rag96, RP95,RM99, RS97, SAS01, WPL95, WZWS08,WR01, aMST07, AGMJ06, BW12, BDY99,BJS99, CDD+96, Gao03, Goe02, IDS16,IJM+05, JJY+03, JMS14, KS15a, KD12,

47

KRG13, MCB05, MGG05, Nes10, NEM17,OLG+16, TWFO09, VBLvdG08].Parallelized[FBSN01, OMK09, KMG99, OKM12].parallelizer [BHRS08]. Parallelizing[BST+13, Car07, GGH99, IOK00, IKM+01,IKM+02, SR95, ZZ95, AMS94, BY12].Parallelldatorcentrum [Eng00].Parallizing [LRQ01]. parameter[HPLT99, JMdVG+17]. parameterized[CT13]. Parameters [GFV99, BAG17].Parametric [LLG12, Pat93]. Paramid[Ste94]. Paraperm [LTDD14]. Paraprox[SJLM14]. Parasite [LLRS02].paravirtualization [SBQZ14]. ParCo93[JPTE94]. PARCOACH [SCB14]. PARCS[LD01]. Paris [CHD07, Har94, Har95].Parity [MC17]. Parix[HVSH95, RS95, SHH94a, SHH94b]. Park[SL94a, IEE93c]. PARKBENCH[DHS96, DH95]. PARMACS[GR95, HZ96, HZ99]. PARMACS-to-MPI[HZ96]. ParNSS [HSMW94]. PARRAY[CCM12]. parsing [Sur95a]. Parsytec[SHH94a, SHH94b]. part[VSRC95, EM00a, EM00b, GK10]. Partial[DERC01, DLV16, FSSD17, KK02b,MFTB95, OM96, ST17]. partially[CdGM96]. Particle [GSI97, KHS01,NSLV16, ZZ04, BAS13, FFFC99, GSMK17,KPK13, RFH+95, VDL+15]. particle-based[FFFC99]. particle-in-cell [VDL+15].particle-mesh [BAS13]. particulate[ATL+12]. Partitionierung [Gra97].Partitioning [CTK01, kL11, STV97, CT13,Cha96, Gra97, GKCF13, YST08]. partners[Str94]. Pasadena [IEE95c]. PASCO[ACM97a]. passage [PTMF18]. Passing[AMHC11, AKL99, Att96, BZ97, BC14,BBH+06, BBG+01, BRU05, BDH+95,BDH+97, BGR97b, BFM97, CHD07, Cer99,CGH94, Cot97, Cot98, CTK00, Cot04,CDND11, DFKS01, DKD08, DHHW92,DHHW93a, DDL00, FKKC96, FKS96,

FGT96, Fos98, FGG+98, FB94, GR07,GB96, Gle93, GLRS01, GLS94, GL95c,GLDS96, GLT99, GLS99, GLT00b, GLT00a,GL04, IBC+10, KTF03, KGRD10, KS97,KSV01, KKDV03, KKD04, KKD05, LKD08,LK10, Luo99, MPI98, MTSS94, MS98,MSL96, MBES94, MG97, MTWD06, MSS97,NW98, PBK00, Pok96, PS01b, RRBL01,RWD09, RFG+00, SWHP05, SWL+01,ST02b, TGT05, TDB00, TBD12, WD96,Wer95, Wis97, YHGL01, ZG95a, ZG96,ZLL+12, Ada98, AD98, AAC+05, Ano93d,Ano94d, Ano95c, Ano00a, Ano00b, BL97,BvdSvD95, Bjo95, Bru95, BDW97, BFIM99].passing [CGJ+00, CDZ+98, CRD99, CD01,DKF93, DM93, DKD05, DS96b, DHHW93b,DOSW96, DLM99, DKP00, DLO03, FK94,FHB+13, GL92, HP05, HPY+93, Hem96,KJA+93, Kra02, LR06a, LBD+96, wL94,LCY96, LMM+15, LC97b, MP95, NS91,PS07, PKB06, Pie94, PR94a, PS00b, Sei99,SWJ95, SDV+95, SZ99, SSG95, Sti94,TSZC94, VM95, Wal94a, Wal94b, ZWL13,ZKRA14, DiN96, GGHL+96, Han98, Hem94,RRFH96, SLG95, Wer95, YGH+14]. Past[Dar01]. Path[CGPR98, GAMR00, SDJ17, SLN+12, Zel95].path-based [SLN+12]. Pathway [CNM11].PATOP [BFBW01]. Pattern [CSW12,CC17, JJPL17, RDMB99, MAS06, SJLM14].pattern-based [SJLM14].Pattern-Independent [CSW12].Patterned [ST17]. Patterns[DMMV97, FPY08, KB98, PKB+16,RRAGM97, SGH12, DZZY94, GAVRRL17,HGMW12, PM95, PSK+10]. PC[AH00, EKTB99, KS01, LKYS04, RLL01,Ste00, WLYC12, YST08, YL09, MMB+94].PC-Cluster [RLL01]. PCAT[ACDR94, GN95]. PCAT-93 [ACDR94].PCAT-94 [GN95]. PCG [BJS97]. PCI[GK97]. PCI-based [GK97]. PCRCW[BS94]. PCs [CRE99]. PCSC [LM94].PCTE [HZ94]. PCTRAN [KHS01]. PDCS

48

[YH96]. PDE[GBR15, NHT02, NHT06, NPS12]. PDES[PT01, SCL00, SCL01, HO14, HHA95].PDGC [CGB+10]. PDP [IEE96g]. Peer[GR07]. Peer-to-Peer [GR07]. PELCR[PQ07]. PEMPI [FB95]. PEMPIs[MOL05]. Pennsylvania[ACM96b, IEE94d]. pentadiagonal[Kan12]. Pentium [Ano03]. Pentium(R)[SBT04]. PENTRAN [KHS01]. people[ASCS95, Ano94i]. per-triangle [SOA11].perception [CLM+95]. perceptual[WPL95]. Performance[ACM97b, ACM98a, ACM98b, ACM00,ACM01, ACM04, ATM01, AR01, Ano01a,Ano01b, ADR+05, Bak98, BBGL96, BN00,BBDH14, BGG+02, BY12, BRM03,BRST94, BS07, BDL98, BCKP00, BHNW01,BFMT96b, BFBW01, BEG+10, CDD+13,CRE99, CDJ95, CGLD01, CNM11, Che99,CSC96, CCBPGA15, DPSD08, DM95b,DW02, DZ98b, DPP01, DWL+10, DBK+09,EGH99, EGC02, EML98, EML00, FD02a,FGRT00, FCP+01, FSC+11, FST98b,FGKT97, GFD03, GKP96, GGS99, GBH99,GRRM99, GBS+07, GC05, GMdMBD+07,GSY+13, HVA+16, HKN+01, Hol12, HF14a,HF14b, HPS95, Hus98, IEE92, IEE93c,IEE94g, IEE95k, IEE96a, IEE96f, IEE97c,IFI95, IRU01, IHvA+00, JSS+15, JC17,JCH+08, JS13, KDSO12, KaM10, KL94,KH12, KBS04, KBM97, KKP01, KH15,KC06, KK02b, KHS01, KSS00, Laf01,LAdS+15, LCK11, LC97a, LB98].Performance[LGCH99, LNK+15, LH98, LC93, LkLC+03,LNW+12, LS10, LCW+03, LVP04, LWP04,LDCZ97, LC97b, LKYS04, MMB+94,MKP+96, MPD04, ME17, MGMH97,MGC12, MM02, MM03, MOL05, MS99a,MHC94b, MMSW02, MK04, MCLD01,MMH99, MM14, MMS07, NSLV16, NMW93,NPP+00d, NMS+14, NN95, OF00, OLG01,PARB14, PKB01, PHJM11, PZ12, PR94b,

PFG97, PGAB+05, PGAB+07, PGC02,PY95, PTH+01b, PS01b, QB12, Rab98,RBB97a, RBB97c, RH01, RRAGM97, Ros13,RsT06, SGJ+03, SPM+10, SLJ+14,SWHP05, SCP97, SEF+16, SPL+12,SCSL12, SM02, SM03, SSC97, SJ02, SSSS97,SC96b, SKH96, SJK+17a, SJK+17b, TSB02,TSB03, TTSY00, Ten95, Tha98, TBG+02,TGT10, Tra12b, TFGM02, TFZZ12, VFD02,VY02, WN10, WAS95b, WM01, WT11,WT12, XF95, XH96, XXL13, YC98, Yan94,YWC11, YS93, YWCF15]. Performance[YSP+05, ZLGS99, ZWJK05, ZHK06,ZSnH01, ABDP15, Ahm97, ADLL03a,ADLL03b, Ano03, AFST95, BDP+10, Ber96,BDV03, BFM96, BFMT96a, BFIM99,CRE01, CAHT17, CLYC16, CBPP02,CBM+08, CHKK15, DM95a, DL10, DO96,D+95, DWL+12, DE91, Duv92, EFR+05,ESB13, FAF16, FD02b, FE17, FSV14,FME+12, Fin97, GS02, GGC+07, GK97,GR95, GHZ12, GML+16, GL96, GLDS96,GL97c, GL99, GWVP+14, HDDG09, HW11,HASnP00, HAJK01, HK10, HVSC11,HHA95, HG12, HcF05, JKHK08, JJM+11,JKN+13, KBP16, KKM15, KS13, LBD+96,LTLC94, LC07, LBH12, LCY96, LB96, LL01,LKJ03, LSK04, MC17, MP95, MSMC15,MSW+05, MSL12, MABG96, MHC94a,MSZG17, MJPB16, MGC+15, NU05,NFG+10, OIH10, Old02, PGS+13, PHW+13,PGK+10, PF05, PMZM16, PTW99, Rab99,Reu03, RGDM15, RJDH14, Sep93, SFO95].performance[SWJ95, Slo05, SVC+11, SK00, SFLD15,TMC09, TSP95, TG09, THM+94, VDL+15,Wor96, YCL14, ZSK15, ZWL13, dAT17,HS95a, GH94, LCHS96, SSH08].performance-aware [MSMC15].Performance-based [YWC11].Performance-Portable[JSS+15, DWL+10, DWL+12, FAF16].performance-prediction [BDV03].performance/cost [GWVP+14].

49

Performances [GFV99, DS96b, IM94].Performing [CC99]. Peridynamic[MSZG17]. Periscope [LGG16].Permutations [CC99, LTDD14].Persistent [Man01, SG12]. Persistent-Sets[SG12]. Personal [SSSS97]. personalized[BHJ96]. perturbation [KN17]. Perverse[Rol08a]. PES [MK94]. Pessimistic[BCH+03]. petaflops [LSG12]. Petascale[CGKM11, ZWL13, Gei01]. Petersburg[Mal95]. Petri [CNM11]. PFSLib [LL95].PGAS [SWS+12, SJK+17a, SJK+17b].Phase[CBL10, ED94, TKP15, TG94, ZAFAM16].phase-field [TKP15]. Phi[DSGS17, MTK16]. Philadelphia[ACM96b]. PHOENICS[SZBS95b, SZBS95a]. Phoenix[ACM03, IEE95b, Ten95]. Photo[JFGRF12]. Phylogenetic [MR12, LBH12].Physical [BM97, GJN97, GWVP+14].Physics [GT94, KH15, VW92, WBH97,ANS95, BPG94, DMW96]. PIC [BDV03].Picos [YAJG+15]. Pilot [OS97, CGG10].PINEAPL [DHK97]. Pinhole [NH95].Pipe [MTU+15]. Pipeline [GAMR00].Pipelined [GAML01]. Pipelines[MAGR01, FWS+17, RKBA+13].pipelining [MM11]. Pisa [Sil96].Pitaevskii [LBB+16, LYSS+16, SSB+16,YSVM+16, YSMA+17]. Pittsburgh[ACM96c, ACM04, Ham95a, IEE94d]. Place[IEE94e, BCK+09, HSE+17, PSHL11].placement [SLN+12, SPK+12]. Planck[Ano94c]. Planing [GAMR00]. planning[Zel95]. plant [FO94]. PLAPACK [van97].PLASMA [YKLD17].Plasmafusionsforschung [BL94].Platform [BKGS02, NO02b, WTTH17,BSH15, CB11, Cza13, DWL+10, DWL+12,HHA95, JR13, NO02a, XXL13, YSL+12].Platforms[AIM97, HD00, JML01, ZB97, GGC+07,GFB+14, MBBD13, TKP15, TS12b].

Plesset [BL95, KN17]. PLIERS [MMR99].plug [MS99b]. plug-in [MS99b]. plus[HDB+13]. PMaC [PTL+16]. PMD[Che99]. PML [Ram07]. PMPIO[FWNK96]. PMPIO-a [FWNK96]. pocl[JSS+15]. Point [GBS+07, HC10, KV98,ADLL03a, ADLL03b]. Point-to-Point[GBS+07, HC10, KV98, ADLL03a,ADLL03b]. Pointers [LRT07]. Poisson[BP98, WJB14]. Poland [BDW97]. Polder[OS97]. Policies [CML04, PZ12]. policy[MMM13]. Polling[DCPJ12, Pla02, DCPJ14, SH96]. Pollutant[RSV+05]. Pollution [AKK+94, BZ97,MPD04, MSML10, SH94, Syd94].POLSYS GLP [SMSW06].polygonization [TSP95]. polygons [CT13].polyhedral [BHRS08, KGB+09]. polymers[JAT97]. Polynomial[VY15, HLM+17, SMSW06]. port[CCHW03, Har94, RJMC93]. Portability[KaM10, RS95, RH01, ABDP15, FE17,MGC+15, PHW+13, Reu03]. Portable[Ano95c, Ano00b, BHV12, BHLS+95,CDH+94, DHK97, Di 14, FCLG07, FSLS98,GLS94, GL97a, GLS99, JSS+15, LNLE00,Man98, MKV+01, MG97, PPT96a, PBC+01,SSCC95, SDB+16, Sti94, Tra98, WCS+13,YBMCB14, Arn95, BCK+09, BfDA94, BB00,BL99, BAS13, CH94, CEF+95, DWL+10,DWL+12, FAF16, FWNK96, GR95, GL94,GS94, GLDS96, HZ94, HSW+12, JC96,KN95, LFS93a, LFS93b, LHC+07, MMB+94,PPT96b, PPT96c, PMZM16, SFLD15, Sto98,VM95]. portal [AASB08]. portals[BS96b, BMR02, BRM03]. Portfolio[SIS17]. Portfolio-driven [SIS17]. Porting[Ano96c, BSC99, BLW98, EM02, Har94,Har95, HASnP00, KGK+03, KME09, SR96,YKLD17, dCH93, BvdB94, HD11, MWO95,ZPLS96]. Portland[ACM99, ANS95, IEE93e, SW91]. Portugal[IEE93d, IEE96g]. Positron [Pat93].POSIX [LD01]. Post

50

[BBH+13b, Wit16, ABC+00]. Post-failure[BBH+13b]. Post-ISA [Wit16]. Poster[JJPL17, LZH17]. POSYBL [Mat94].Potential [EGC02, Gro01a, KS15a]. Potts[KO14]. POV [FFB99]. POV-Ray [FFB99].Power[LB96, EZBA16, FO94, HK10, Nel93, Bri95].Powered [NE98]. PP [IEE96d]. PPARDB[PPT96b, PPT96a, PPT96c].PPARDB/PVM [PPT96b, PPT96c].PPPE [CDH+94]. PPSN [DSM94].Practical [BHJ96, BCP+97, CZG+08,RHG+96, TGBS05, AMS94, BHRS08,LPD+11, McK94, Pan95b, VVD+09].Practice [ACM11, GN95]. Praktische[MS04]. Precedence [EGR15].Precedence-Constrained [EGR15].Precise [FJK+17]. Precision[Ano98, Kha13, ZC10, JPT14].Preconditioned[GFPG12, ABF+17, MM92].Preconditioner [BBS99, FSXZ14].Preconditioners [Huc96].Preconditioning [Nak03, GGC+07].predictability [GRRM99]. Predicting[RRAGM97]. Prediction[MOL05, WHDB05, ZWJK05, ADR+05,BDV03, CMV+94, HHA95, RBAI17, SEC15,SC96b, SSN94, Was95a, ZAT+07].Predictive [FJK+17]. Preemptive[BBH+06, BBGL96]. Preface[DKD07, OL05]. Prefetching [BIC+10].Prefix [WJ12, DK13, MYB16].Preliminary[BF98, Wal01a, RJC95, SWS+12].Preprocessors [Ano01a]. prescription[MRH+96]. Present [Dar01]. presented[ACM90]. preservation [IEE94c].Preserving [RNPM13]. Press[Ano95b, Ano95c, Ano96a, Ano99a, Ano99c,Ano99b, Ano99d, Ano00a, Ano00b]. Pricing[RR00]. Primitives[DDL00, FST98a, ABDP15, CIJ+10].Princeton [Bha93]. principles

[BSC99, HS12, SSP+94]. printing [YM97].priority [DR95, Man98]. Prism [SDN99].private [Str94]. privatization [KRG13].Probabilistic [LAdS+15]. Probability[QRMG96, Sta95b]. Problem[BSH15, DALD18, DAK98, GAMR00, ICC02,Lee06, MTSS94, RLVRGP12, ZSnH01,AB93b, DSM94, GM94, GKCF13, HMKV94,IHM05, MM92, SL00, SP11, Cza13].Problems[ASA97, BHM94, BHM96, BMR01, BPMN97,CGPR98, EML98, HAA+11, DK02, MBS15,Nak03, Riz17, AL96, CEGS07, FR95, LSR95,NZZ94, OMK09, SC96a, SD99]. procedure[AGLv96]. Proceedings[ACM94, ACM96c, ACM97a, ACM97b,ACM98b, ACM04, ACDR94, CJNW95,GN95, Hol12, IEE93f, IEE95d, IEE02, KG93,LCK11, MC94, R+92, SM07, Ten95, TG94,dGJM94, ACM96b, Ano94e, Ano94i, BPG94,Boi97, BH95, CLM+95, DSZ94, DE91,EJL92, FF95, GHH+93, HK95, HHK94,IEE94a, IEE94b, IEE94c, IEE95b, IEE95e,IEE96a, IEE97c, IEE05, JPTE94, Kum94,LF+93a, Li96, PSB+94, PBPT95, SPE95,SW91, WPH94, ACM90, ACM95a, ACM05,ACM06b, ACM06a, ATC94, Agr95a,AGH+95, AH95, Ano89, Ano92, Ano94a,BBG+95, Bha93, CHD07, CZG+08,CGKM11, CMMR12, CGB+10, CDND11,DKM+92, DT94, DLO03, EV01, EdS08,ERS95, ERS96, Fer92, FK95, Gat95,GGK+93, GA96, GT94, Ham95a, HS94,HK93, IEE91, IEE92, IEE93d, IEE93c,IEE93b, IEE93e, IEE94e, IEE94d, IEE94f,IEE94h, IEE94g, IEE95h, IEE95k].Proceedings[IEE95i, IEE95f, IEE95l, IEE95g, IEE95j,IEE96g, IEE96f, IEE96e, IEE96d, IEE96h,KGRD10, LKD08, MTWD06, MMH93,MCdS+08, MdSC09, Ost94, PR94b, Ree96,RWD09, SCR92, SHM+10, Sie94, TBD12,USE94, USE95, USE00, VW92, Vos03, Y+93,YH96, AD98, BG91, BDLS96, BS94, Bos96,

51

BFMR96, BDW97, CH96, CD01, DSM94,DKD05, DW94, DMW96, DLM99, DKP00,Eng00, FR95, GH94, HAM95b, HS95a,IEE96c, IEE97a, Kra02, KKD04, LCHS96,Mal95, PBG+95, Sch93, Tou96, VV95, Vol93,Was96]. Proceedings.[Ano93e, Ano94g, IEE96i, IEE97b, LHHM96].Process [AUR01, BGL00, CLL03, DeP03,DK06, FDG97a, FDG97b, FLD98, FPY08,KCP+94b, KOW97, PS00a, SC04, ST97,Tra02a, BK11, BBGL96, CK99, FLD96,GL95a, HRR+11, HG12, JLS+14, KCP+94a,MLVS16, MK00, SHHC18, Ste96].Process-Management [BGL00].processed [HJ98]. Processes [CB16,MW98, Pet00a, Pet00b, FS95, SPK+12].Processing [ATC94, Agr95a, AR01,BBG+95, DKM+92, GGCM99, GGCGO01,HJBB14, IEE93b, IEE93f, IEE95e, IEE95h,IEE95f, IEE95g, IEE96b, IEE96g, IEE96e,IEE96d, IEE97b, IEE05, IOK00, JDB+14,KOI01, KS15b, LSVMW08, MSML10, Nar95,NH95, NJ01, PLR02, PD98, Ree96, RRBL01,Rol94, SCP97, Sev98, Sie94, Sin93, VLO+08,WN10, AB95, Ano94f, BJ13, BHS18,BFMR96, CFPS95, CLLASPDP99, DSZ94,FWS+17, GDC15, GGGC99, Gre94,HAM95b, HPS+96, JC96, Kat93, Kum94,LHLK10, LG93, PSB+94, PBPT95,RKBA+13, Roh00, RCG95, SSS99, SLS96,VDL+15, Wol92, WWFT11]. Processor[HC06, Oed93, Ott94, PWP+16, RR02,Smi93a, SBT04, UALK17, ABDP15, DCH02,HC08, LL01, OIS+06, RNPM13].Processor-Oblivious [UALK17].Processors[AJ97, Bri10, HK93, HK95, MJB15, OLG01,PZKK02, BBG+14, CBM+08, DBLG11,HTA08, HWX+13, KmWH10]. Producing[HAJK01]. product[CMH99, ER12, SMSW06]. production[CLdJ+15, SL00]. productive [LV12].Productivity [BS07, KaM10, Wit16].products [Ano97, Bra97]. profile

[TWFO09, WTFO14]. profile-driven[TWFO09, WTFO14]. profiler [AS92].profiles [Wil94]. profiling[GPL+96, Rab99, Vet02]. Program[Ano96d, AB93a, BMS94b, CHPP01, Cot97,EML98, MM95, MRV00, Ney00, PS01b,TSY00, THN00, UTY02, CDZ+98, JF95,LP00, LLC13, OKM12, PPF89, Sai10,TNIB17, TMPJ01, ZL96]. programacion[VP00]. Programmable [OA17].Programmcode [BL94]. Programmer[Gua16, Wit16]. programmers [CGG10].Programming[ACM90, Ada97, ACGR97, ASA97, ACJ12,Ano96b, BBG+10, BLP93, BHV12, BF01,BBG+01, BKO00, CMK00, CDK+01,CKmWH16, Cha02, CZG+08, CF01, Cza03,DM98, DARG13, DDL00, DK06, DWL+10,EM00a, EM00b, FTVB00, FWR+95,GLRS01, GLS94, GLS99, HA11, HDB+12,HDT+15, KKH03, Kep05, KP96, KmWH10,KVH97, Lad04, Laf01, LLRS02, MSOGR01,Mat94, Mat95, MCdS+08, NO02b, SPM+10,SS01, SDN99, SHH94b, ST02a, ST02b,SGS10, Stp02, TTP97, VT97, Vre04, Wal01a,Wal02, WO97, YM97, YHGL01, ACGdT02,AMuHK15, Ano95c, Ano00b, AB13, BJ13,BCA+06, BB94, BS96a, BKH+13, CLYC16,Cha05, CEF+95, CDH+94, CGH+14,DWL+12, Duv92, EASS95, EV01, FB95,FB96, Fan98, FSTG99, Fer04, Fra95,FHB+13, FF95, GKZ12, Gei96, GBH14,GRTZ10, HTA08, HS93, HZ94].programming[HDB+13, HVSH95, HSW+12, HZG08,KDSO12, KOB01, KSG13, KSL+12,KPNM16, KFSS94, KKJ+08, LV12, LFS93a,LFS93b, LH98, LPD+11, LLH+14, MMB+94,MVTP96, MSP93, MC99, MGC+15, NO02a,Nak05a, NYNT12, NBGS08, OIS+06, Olu14,OW92, Pac97, PVKE01, PF05, Qui03,RJDH14, SK10, iSYS12, SSKF95, SYR+09,Seg10, SPK96, SBF94, SPL99, SHH94a,SD99, VP00, Vos03, Wal01b, Wan02,

52

WCC+07, WADC99, WYLC12, WLYC12,YHL11, YWC11, YX95, YS93, ZGC94,DR94, HSE+17, Che10, SD13]. Programs[AJF16, Beg93b, BKdSH01, BGK08,BGG+02, BDL98, BGL00, CSW12, CRE99,CHPP01, CD98, DLB07, DMMV97, Di 14,FKH02, FJK+17, GR07, GTH96, GL04,GC05, HC10, HKN+01, HM01, KFL05,KL94, KSJ14, KKV01, KSV01, Mar09,MVY95, MOL05, MBE03, MKW11,MCLD01, MJB15, NSZS13, NE98, NE01,NPP+00d, OM96, PPJ01, RH01, RFG+00,SGZ00, SBF+04, SR96, TGBS05, Wel94,Wis97, ZLL+12, Beg92, Beg93c, Beg93a,BCK+09, BMPS03, CRE01, CLdJ+15,CGL+93, CH94, CRM14, CFP96, DKF93,DKF94b, EP96, EPP+17, FLB+05, FKLB08,GGH99, GRRM99, GKS+11, GB94, HD11,HZ96, HLOC96, HEHC09, KCD+97, KS13,KO14, LGKQ10, LLG12, LL16, LBB+16,LYSS+16, LMM+15, LZC+02, LCC+03,MT96, MdSAS+18, Mor95, NBK99, Obe96,OdSSP12, PES99, PAdS+17, RAS16, Reu03].programs [RRG+99, SSB+16, SKS01,SMAC08, SZ11, SR95, SY95, SC96b,TMW17, THH+05, UGT09, VVD+09,YSVM+16, YSMA+17, YYW+12, ZRQA11].Progress[BRU05, LAdS+15, MLA+14, MC94].Progress-Dependence [LAdS+15].Project [BHK+06, BSH15, DHK97, MRV00,ABC+00, CDH+94]. Promise [Ano93e].Promotion [OCY+15, WBBD15].Propagation [EMO+93, ESM+94, JML01,SMOE93, KEGM10, RMNM+12].Properties [FGRT00, MS96b, SSP+94].Proposal [DHHW92, DHHW93a, DFC+07,DFA+09, ZKRA14]. Proposals [Wal96b].protected [GHD12]. protein[GAVRRL17, SEC15, ZAT+07]. proteins[BHW+12, BBH+15, FMS15]. Protocol[CAWL17, GSY+13, kL11, LMM+15, RA09,XF95, BDB+13, CwCW+11, DDYM99,MN91, MB00, ZPI06]. Protocol-based

[LMM+15]. Protocols[BCH+08, DM93, LH98]. Protoplanetary[dlFMBdlFM02]. Prototype[Ano01b, FHP+94, MMSW02, BK96,CCF+94, KYL03, KYL05]. prover [Sut96].Provide [Add01, LMRG14]. Provides[Ano98, Nel93]. Providing [GKP97, Zah12].Proving [MS96b]. PRS [UCW95].Pruning [SMM+16]. PS [AMV94]. Pseudo[Wal01a, Lan09]. Pseudo-search [Wal01a].Pseudorandom [WHDB05].Pseudospectra [BKGS02].pseudospectral [Bri95, MRRP11].PSPVM [BWT96]. Pthread [ZAT+07].Pthreads [AS14, TS12b]. PTX [iSYS12].Public [Str94, GWVP+14, Nel93, RST02].Public-private [Str94]. Puma [BS96b].purely [HSE+17]. Purpose[BDT08, Che10, SZBS95a, Sun94a, ABDP15,CBM+08, KPNM16, PF05, SK10, SZBS95b].PVaniM [BCLN97, TSS98]. PVFS [IRU01].PVM [AD98, BL94, BDLS96, BDW97,CHD07, CHD09, CD01, DKD05, DLM99,DKP00, DLO03, Kra02, KKD04, LKD08,McD96, MTWD06, RWD09, Wil94, AJ97,Ahm97, AS92, ACGR97, ADRCT98, AL92,AGR+95b, AB95, ASA97, AL96, ARL+94,AKK+94, AP96, Ano94b, Ano95e, Ano96b,Ano96c, ABCI95a, ABCI95b, ABG+96,AGLv96, AB93b, AB93a, ADMV05, BSN95,BLP93, BFLL99, BBGL96, BG95, BS93,BDG+91a, BDG+92b, Beg92, BDG+93b,BDG+93a, Beg93b, Beg93c, Beg93a,BDG+95, BS96a, BDG+xx, BL95, BR95b,Ber96, BJS97, BT96, BWT96, BG94a,Bon96, BG94b, BG94c, Bor99, BCD96,BRR99, BFZ97, BID95, BMS94b, BFM96,BFMT96a, BFMT96b, CMV+94, CP97,CDJ95, CKO+94, CCK+95, CSPM+96,CZ95a, CGPR98, CG93, CDHL95, CDH+95,CF01, CZ96, CS96, CG96, CG99a]. PVM[CSC96, CDM93, CdGM96, CPR+95, CT94a,CT94b, CFP96, CT02, CD98, CTK01, DG95,DKF94a, DDYM99, DM95b, DM95a, DP94,

53

DMMV97, DGF97, DFN12, D+91, DGMS93,DGMJ93, DHP97, DPZ97, EP96, EM94,EGDK92, ED94, EM02, EML98, EML00,ES11, EMO+93, ESM+94, EK97, FMBM96,FD96, FLD96, FH95, FHSO99, FO94,FSTG99, FJBB+00, Fin97, FD97, FS97,For95, FS93, GRV01, Gal97, GCBM97,GS91a, GS91b, GS92, GS93, Gei93a, Gei93b,GDB+93, GBD+94, Gei96, GKP96, Gei97,GKPS97, Gei98, GSxx, Gei00, Gei01,GTH96, GB96, GM95, GSHL02, GFV99,GGH99, GS96, Gor01, GHL97, Gre95, Gre94,GL97b, GMU95, GkLyCY97, HB96a, HB96b,HSMW94, HJ98, Har94, Har95, HBT95,HPS+96, Hem96, HEH98, HTHD99, HVSH95,HH95, HRSA97, Huc96, Hum95, HS95b].PVM [ITT99, IvdLH+00, IDD94, IKM+01,IKM+02, JAT97, JH97, JML01, JW96, JC96,KBA02, Kat93, KK98, KP96, KBM97,KDL+95a, KDL+95b, KG96, KCP+94a,KCP+94b, KOW97, KMC96, KS96, KZCS96,KS97, KV98, KAHS96, KK02b, LGM00,LB98, LSZL02, LHCT96, wL94, LFS92,LFS93a, LFS93b, LH95, LC93, LY93, LLY93,LW95, LHZ97, LKL96, LDCZ97, MW98,Man94, MVTP96, Man01, MP95,dlFMBdlFM02, MTSS94, MFTB95,MSCW95, MSP93, Mat94, Mat95, MMU99,Mat01b, MRV00, MK97, McK94, MC98,MFC98, MVY95, MS96b, Mic93, Mic95,MT96, MS99a, MS99b, MHC94a, MHC94b,MRH+96, MS95, MC99, MWO95, Nel93,NP94, Neu94, NBK99, Ney00, NB96, NAJ99,Nov95, Obe96, Ols95, OPP00, Ott94,OWSA95, PPR01, PK98, PPT96b, PPT96a,PPT96c, POL99, PT01, PKYW95]. PVM[Per96, Pet97, PTT94, Pla02, PNV01, PD98,PY95, PL96, Pus95, QRG95, QRMG96,Qu95, QMGR00, RR00, RS93, Rag96, RS95,RHG+96, RRAGM97, Rol94, RGD97, Saa94,SAS01, Sch94, Sch96a, Sch96b, SB95, SFG98,SGS95, SSS99, SPK96, Sep93, Sev98, Shi94,SA93, SR96, SHH94a, SHH94b, Smi93a,SBR95, SC96a, STT96, SMOE93, SGL+00,

SGHL01, SCL97, SSSS97, Sta95b, SY95,SYF96, SC96b, Str94, SKH96, Sun90a,Sun90b, Sun92, Sun93, Sun94a, SGDM94,Sun96, STMK97, SN01, SCL00, Sur95b,Sut96, SL95, TMTP96, TC94, TBD96,TD98, Tsu95, Uhl94, Uhl95b, UH96,UMK97, VSRC94, VSRC95, VB99, VAT95,WKS96, WH94, WCVR96, WAS95b, WO97,Wis96a, WL96a, Wis98, Wis96b, WL96b,WCS99, Wu99, WLC07, XWZS96, XF95,YG96, YKI+96, ZPLS96]. PVM[ZPI06, ZB94, Zem94, ZDR01, ZG95a,ZG95b, ZG96, ZG98, Zol93, van93, Ano95b].PVM-AMBER [SL95]. PVM-Based[WAS95b, FO94, PY95, Sut96, ZPLS96,LSZL02, TD98]. PVM-GRACE [YKI+96].PVM-Implementation [BJS97, Huc96].PVM-RPC [KS97]. PVM/C [GTH96].PVM/MPI [AD98, BDW97, CHD07,CHD09, CD01, DKD05, DLM99, DKP00,DLO03, Kra02, KKD04, LKD08, MTWD06,RWD09, ACGR97, SN01]. PVM3 [IM94].PVM3/AP1000 [IM94]. PVMaple[Pet00a, Pet00b, Pet01]. PVMe[BR95c, BR95b]. PVMGeant [DZDR95].PVMPI [FD96, FDG97a, FDG97b].PyCUDA [KPL+12]. PyOpenCL[KPL+12]. Python [BL97, DPS05, DPSD08,Di 14, GFB+14, SSH08]. PyTrilinos[SSH08].

Q [KMH+14, LM13, MV17]. QAPs [Tsu12].QCD [BLPP13, SVC+11]. QCG [ACH+11].QCG-OMPI [ACH+11]. QCMPI [TJD09].QR [GKK09, LC97b]. QSATS [Hin11].Quadratic [Cza13]. Quadrics[YSP+05, LCW+03]. quadtree[HS95b, PGBF+07, SCC96, Sur95b].qualitative [BLP93]. Quality [Boi97,RFG+00, WHDB05, Ano94i, Lan09, Boi97].Quality-of-Service [RFG+00].Quantifying [AKE00, LDCZ97].quantitative [BLP93, BBH+15].quantization [HE15]. Quantum

54

[BCGL97, BCL00, GRTZ10, Hin11, MGG05,NMW93, SK00, SSGF00, TJD09]. Quasi[DDYM99, Pla02, ZB97]. Quasi- [Pla02].Quasi-asynchronous [DDYM99].Quasi-Newton [ZB97]. Queens [Rol08b].Queensland [ACDR94]. Query [AR01].Quest [MWG97]. Queue[NSS12, CG99b, PTL+16, Sep93, ZA14].queues [Man98]. quicksort[MMO+16, MMO+16].

R [BBH12, JPOJ12, LR01]. R&D [Str94].R&D-100 [Str94]. Race[CFMR95, KSJ14, DKF94a]. Races[PPJ01, SAL+17, DKF94b, LLG12,ZRQA11, EPP+17]. Radial [RB01, KRC17].Radiance [GCBM97, KMG99, RC97].Radiology [GA96]. Rajeev [Ano00a].Raleigh [Agr95a]. Ramesh [Stp02].Random [HT08, LTDD14, Lan09].Randomized [Tra98]. Range[KBM97, MH01, BMPZ94a, PARB14, She95].range-join [She95]. Rank [Hat98].Ranking [Tra98]. Rapid [FWS+17].RASC [YCL14]. rate [BBG+14, YPA94].rationale [BBH+13b]. Ray [CG93, DP94,KGB+09, FWS+17, SGS95, FFB99].Ray-Tracing [DP94]. Rayleigh [TVV96].Rayleigh-Benard [TVV96]. rCUDA[PRS16, RSC+15, SIRP17]. RDMA[GSY+13, LWP04, Pan14, RA09].RDMA-Based [LWP04].RDMA-Enabled [GSY+13, Pan14, RA09].Re [MCP17]. Re-Vectorization [MCP17].Reaching [BHS+02]. Reaction[HF14a, HF14b]. Reactive [BCL00, Heb93].reactor [ANS95]. Reading [HK95]. Ready[Bri02, DZ98b]. Ready-Mode [Bri02]. Real[LHLK10, NSLV16, Tho94, UP01, YGH+14,Ano94f, Fer04, FLB+05, JR10, ZWZ+95,SKD+04]. Real-Time [UP01, YGH+14,LHLK10, Fer04, ZWZ+95, SKD+04].Real-World [NSLV16]. Realistic[YMYI11, ZSnH01, CKP+93]. Reality

[ACM96a, Ano93e, NM95, Wit16]. realizing[YZ14]. rebooting [GJLT11]. Receive[Bri02]. Receiver [ZG95b]. receptor[ESB13]. Rechnen [Ano94c, BL94, MS04].Recognition [CC17]. recomputation[RKBA+13]. Reconfigurable[MFC98, SPM+10, NYNT12].Reconfiguration [CS14, MSMC15].Reconstruction [BM97, DYN+06, GA96,LSSZ15, OIH10, RAGJ95]. Record[UALK17, CRD99]. Record&Replay[KSV01]. record/replay [CRD99].Recovery [SBF+04, BBH+13b, BDB+13,LFS93a, LFS93b, SSCC95, ZWZ05].Rectangle [CSW99]. rectified [WBBD15].Recurrences [ACGR97]. Recursive[DSS00, PWP+16, SD99]. Red [van93].redesign [HL17]. Redistribution[DDPR97, HC06, WO95, WO96, HC08,KN95]. Reduce [PSM+14]. Reduced[SW12]. Reducing[CRGM16, JE95, BCM11]. Reduction[FKH02, MFPP03, SG12, HL17, Jes93a,MLVS16, Pan95a, PQ07]. Redundancy[TS12a]. redundant [KJJ+16]. Reference[GHLL+98, Nag05, SOHL+98, YM97,Ano99a, Ano99c, Ano99b, Ano99d,SOHL+96, Per97, Ano96a]. Refinement[MRB17, Ran05, CLSP07, DLR94]. regions[LFL11]. regression [RBAI17]. Regular[HLP11, NHT02, NHT06]. Reims[MCdS+08]. RELAP5 [SBR95]. related[SD16]. Relating [EPML99]. relation[DO96, Hem96]. Relationship [Dan12].relativistic [BHS18]. relaxation [OKW95].Reliability [CGZQ13]. Reliable[SE02, Arn95]. Remark [SWH15].remedies [ALW+15]. Remo [IEE95h].Remote[BMR01, HDT+15, IFA+16, OCY+15,Tsu07, WBBD15, AGLv96, FHC+95,GBH14, HGMW12, RSC+15, SIRP17, SH96].Remote-Scope [OCY+15, WBBD15].Remotely [GGCM99, GGCGO01, GCGS98,

55

VLO+08, GGGC99]. Remoting [MGL+17].removal [ZZZ+15]. Rendering[GCBM97, LSZL02, SU96, UCW95].Rendezvous [RA09]. Reordering [Hat98].Reparallelization [KBG+09]. Repeated[WH94, Shi94]. Replacement [GHD12].Replay [CFMR95, HLOC96, UALK17,CRD99, MT96, NBK99, XLW+09].replay-based [MT96]. Replication[WC09, KJJ+16]. Replication-Based[WC09]. Report [DZ98b]. Reports[Ano98, ACM11]. Representation[BMR01, KD12, MDM17, SML17, CCM12].reproduce [AVA+16]. Reproducible[GL99, HCA16, XLW+09]. Requirements[GSHL02, GT07, Ber96, KBG16, LCVD94a].Research [Ano96d, BR02, MC94, SL94a,SGHL01, Ara95, BPG94, LP00, Oed93].Reservoir[OWSA95, ZAFAM16, ZZ95, Ano95d].Resident [JDB+14]. Resilient [CGH+14,Gua16, LCMG17, LMG17, MLVS16].Resolution[MAB05, Str94, BADC07, KN17].Resolving [Str97]. Resource[BGR97b, BSH15, KK98, SIS17, YSS+17,DZ96, FLD96, NEM17, ZA14].resource-conscious [ZA14].resource-restricted [NEM17]. Resources[LSB15, NAW+96, Kos95b, R+92].Response [BBC+00]. Restart[SSB+05, LMG17]. restarted [dH94].Restoration [FJBB+00]. Restore [Gua16].restricted [NEM17]. Restructuring[KAMAMA17]. Results[BIL99, BIC05, HSMW94, Wal01a, BR95c,DHS96, VDL+15]. retargetable [KKJ+08].rethinking [GJLT11]. Retrieval[RLL01, MMR99, MRH+96, RTL99].reusable [LTLC94]. reuse[BVML12, LM94, NAAL01]. Reverse[BGK08, LSB15, LM13]. Review[Ano95b, Ano95c, Ano96a, Ano99a, Ano99c,Ano99b, Ano99d, Ano00a, Ano00b, BDL98,

Che10, Mar06, MCLD01, Nag05, Per96,Per97, SD13, Vre04, Stp02, Vog13]. Reviews[Ano97, Bra97, YM97]. Revised [Cha05].Revision [MHSK16]. rewrite [SFLD15].REYES [LSZL02]. RFSA [SW12]. Rhine[Cal94]. Rhodes [TG94]. RHODOS[RGD97]. Rich [MKW11]. Right [ZG95b].Rim [IEE95e]. ring [ZZZ+15]. RISC[AL93, NMW93, BSvdG91]. RNA[WHDB05]. RnaPredict [WHDB05].Robert [Ano95b]. robotic [ZWZ+95].Robust [Att96, GR07, PSLT99]. Rocks[PKB01, Slo05]. Roe [dlAMCFN12]. Rohit[Stp02]. rolling [NF94]. Rome [CMMR12].roots [PNV01]. routed[Pan95b, RJMC93, ZGN94]. routers[Jes93a]. Routines[Add01, Sch96a, LSK04, Sch96b]. Routing[BHM94, BHM96, MTSS94, MBES94,WH94, BS94, Zah12]. RPC[KZCS96, KS97, RS93, SHTS01]. RPVM[CMM03, LR01]. RS[BGBP01, Cou93, Heb93, MW93]. RS/[Cou93, Heb93, MW93]. RS/6000[BGBP01]. RS6000 [CDM93]. RSA[WLC07]. RT [KAMAMA17]. RT-1.1[SKD+04]. RT-CUDA [KAMAMA17].RTL [BGG+15]. RUBIS [BR94]. Ruby[Ong02]. rules [SFLD15]. Run [DLR94,DGMJ93, FHK01, GOM+01, OP98, SBW91,SS96, KPL+12, RRG+99, Str94, TCBV10].Run-Time [FHK01, GOM+01, OP98, SS96,DLR94, SBW91, KPL+12, TSY99, TCBV10].Running[BZ97, CCM+06, YKI+96, CRE01, ZLZ+11].Runtime[AAB+17, BGD12, CFF+94, DMB16, DT17,Gro00, KBS04, KCR+17, NPP+00d, TJPF12,ZLP17, ALW+15, BL99, BR94, EPP+17,EO15, HPS+12, HPS+13, KW14, LLH+14,MA09, NPP+00a, TSY00, YAJG+15].Runtimes [AHHP17]. Russia [Mal95].RWA [RLVRGP12].

56

S [AHHP17, Roh00]. S-Caffe [AHHP17].S-language [Roh00]. S1 [GLT00b]. S3D[LSG12]. Safe [Pla02, GCC99, LFS92,LFS93a, LFS93b, NYNT12]. safety [GT07].salesman [GM94]. Salt [Hol12]. San[ACM97b, Ano95d, BBG+95, GE95, GE96,Has95, IEE93a, IEE94g, IEE95h, IEE95g,IEE97c, LF+93a, NM95]. Sanders [Che10].Sandy [VDL+15]. Santa[ACM95b, AH95, IEE95f, Old02]. Santorini[CD01, CDND11]. Santorini/Thera[CD01]. Saphir [Ano99c, Ano99d]. SAR[AB95]. Satellite [Uhl94, Uhl95b, SSN94].Satisfiability [IKM+01, IKM+02].Saturday [B+05]. Saturday-Wednesday[B+05]. Save [KFL05, FKLB08]. SBS[MSB97, WWZ+96]. SBS-Type [MSB97].SC’11 [LCK11]. SC2000 [ACM00].SC2001 [ACM01]. SC2002 [IEE02].SC2003 [ACM03]. SC97[ACM97b, ACM97b]. SC98[ACM98b, ACM98b]. SC’99 [ACM99].Scalability [BS07, FSC+11, KBS04, LL01,LKYS04, LSK04]. Scalable[Add01, AHHP17, BHW+17, BBC+02,BHNW01, BGL00, CDPM03, EFR+05,GFB+14, GS94, HGMW12, IEE92, IEE94f,IEE95j, IBC+10, KK98, kLCC+06, MFPP03,NBGS08, NPP+00d, NCKB12, NSM12,OLG01, PPJ01, PR94b, PBK00, SDJ17,SBF+04, Skj93, SS96, TPD15, UP01,VBLvdG08, VY02, ZLGS99, BBB+94, Bri95,CLSP07, FWS+17, GBH14, GM13, GKL95,HRR+11, HAJK01, KRC17, KRG13, LM99,LTLC94, MMB+94, MRRP11, PWD+12,SPK+12, Tra12a]. ScaLAPACK[BV99, BRR99, DHP97]. Scale[AKE00, BHW+17, BZ97, BHNW01, FFP03,MFPP03, SM03, TGEM09, WT12, AASB08,BCA+06, BJS99, BCH+08, Che99, DZZY94,FME+12, Gua16, Kos95b, LS10, MLA+14,PTL+16, PD11, RMNM+12, SvL99, TBB12,WLNL06, WT11, ZKRA14, ZA14].SCALEA [TFGM02]. Scaling

[CC17, KFL05, SLJ+14, FKLB08, Gao03,LFL11, PDY14]. scan [AAAA16, YLZ13].scanline [CT13]. scans [NAJ99]. SCASH[SHHI01]. SCATCI [ART17]. scatter[BCD96, MTK16]. Scattering[BCL00, NZZ94, OMK09]. SCF [MM95].schedule [NAAL01]. scheduler[ADDR95, TCBV10, WRSY16]. schedulers[NP12]. Scheduling[BBH+06, BSH15, CML04, DMB16, EGR15,GSHL02, GHL97, HC06, JW96, MJB15,NIO+02, NIO+03, TJPF12, APBcF16,DZ98a, JKN+13, LHCT96, MBKM12,NSBR07, OPW+12, Smi93b, SKK+12,SKB+14, WYLC12, WLYC12, YWC11].Scheme [CTK01, LNLE00, MW98, SBF+04,BBGL96, Bjo95, MRRP11, OKM12, SCC96,YPZC95, FM90]. Schemes[PPJ01, WYLC12, WLYC12, ZAT+07].School [VV95]. Schrodinger[DM12, ON12]. SCI[FS97, HEH98, Hus00, RR01, ZHS99].SCIDDLE [ABG+96, AGLv96].SCIDDLE-PVM [ABG+96]. Science[EGH+14, IEE95d, MMH93, Old02, SM07,ACM06a, DMW96, HK93]. Sciences[ERS96, HS94, ZL96, ERS95]. Scientific[AGH+95, APJ+16, BBG+95, DKM+92,DT94, Gat95, GL97a, HJ98, KK02a,LkLC+03, Mar06, Nag05, Sin93, SSB+17,VY02, WN10, Bis04, DW94, SBG+12,TBB12, Ano97, Bra97]. scientists[HW11, Str94]. SciPAL [KH15]. SCIPVM[ZHS99]. Scope[OCY+15, BDB+13, WBBD15]. scoping[RDLQ12, WC15]. Scottsdale [IEE95b].Scratchpad [JAK17, MB12]. Scripting[Ong02, KPL+12, Nob08]. scripting-based[KPL+12]. SCTP [KPW05, ZPI06]. SDK[TK16]. SDSM [CCM+06]. Seamless[KK02a]. Search[BSH15, Cza13, IKM+01, Wal01b, FMS15,IKM+02, Wal01a, ZSK15, CB11]. Searches[BSG00]. Searching

57

[JPT14, MM01, BA06, Wal01b]. Seattle[ACM05, BS94, LCK11, Ost94]. Second[Ano00b, BL95, DT94, DE91, IEE94d,IEE96d, IEE96i, LHHM96, Tou96, Vol93,WPH94, ACM97a, Ano99a, Ano99b,BFMR96, DMW96, FR95, KN17, Li96].Second-Order [BL95, KN17]. Secondary[WHDB05, SEC15, ZAT+07]. section[Ano93b, DKD08]. segment [FJZ+14].segment-based [FJZ+14]. Segmentation[KBA02, AD95, CCU95]. Seidel[BG95, LM99, Ols95]. seismic[AMBG93, KL95, KEGM10, LM13,RMNM+12, SSS99, WCVR96].Seismograms [DP94]. Select [KKDV03].Selected [DHS96, MTW07, OL05, TB14,CHD09, Cha05, DKD07, JC17]. selecting[PTL+16]. Selection[CKmWH16, PGBF+07, WKS96, ZWL+17].Selective [Nak03]. Self[NSS12, SLJ+14, TGT10, VFD02, NSBR07,WYLC12, WLYC12, YWC11].Self-Consistent [TGT10]. self-scheduling[NSBR07, WYLC12, WLYC12, YWC11].Self-Submitting [NSS12]. Self-Tuning[SLJ+14]. Semantic[MTU+15, DKF94a, OA17]. Semantically[MKW11]. semantics [RNPM13].Semaphores [TTP97]. Semi[CT94a, Bjo95, PSLT99, TC94, CT94b].semi-coarsening [PSLT99]. semi-implicit[Bjo95]. Semi-Lagrangian[CT94a, TC94, CT94b]. Semiconductor[GJN97, Ano03, LS10]. Seminar[Ano94f, Ano93g]. Send [GPC+17]. Sender[BCH+03]. Sensed [GGCM99, GGCGO01,GCGS98, VLO+08, GGGC99]. sensitive[GKCF13]. Sensitivity [dLR04]. Separable[Ben01, CdGM96]. September[Abr96, AD98, Ano93a, Ano93b, Ano95a,Bos96, BP93, BH95, CLM+95, CHD07,CJNW95, CD01, CDND11, DKD05, DKD07,DLM99, DKP00, DLO03, EJL92, FK95,FR95, GHH+93, IEE93d, IEE94c, JPTE94,

KGRD10, Kra02, KKD04, LKD08, Mal95,MTWD06, OL05, PSB+94, RWD09, SPH95,SM07, TBD12, VV95, VW92, WPH94, YH96].Sequence[GMU95, SMM+16, AMHC11, TSZC94].sequences [GAVRRL17, SdM10].Sequencing [VPS17]. Sequential [EK97,RPM+08, GGH99, SR95, TNIB17, TSZC94].Serial [SWH15, HPS+96, HWS09].serialization [CFKL00]. Serialized [KH10].Serielles [BL94]. Series [Nag05, BR94].Server [Ano93e, FSLS98, KS97, Mat01b,Sch93, Sto98, Vis95]. Servers[CGC+02, SIS17, GK97]. Service[RFG+00, LS08, SPK+12]. Services[FC05, AAC+05, ZKRA14]. Session[NYNT12, ZL96]. Set [SW12, WL96a,Ano00a, Ano00b, She95, WL96b]. Sets[SG12, CGL+93]. setting [GL95a]. Setup[NSLV16]. Seventh [BBG+95, HS94,IEE93b, IEE95g, IEE96h, Eng00, Y+93].several [GBR15]. SGI[Che99, CML04, KMG99, LB96, LL01,LKJ03, LSK04, TW12, ZSnH01].SGI/CRAY [Che99]. SGI/CRAY-T3E[Che99]. shadow [SOA11]. shallow[dlAMC11, dlAMCFN12]. Shane [SD13].Shanghai [IEE97a]. SHARE[Ano92, Ano93e, Ano94g]. Shared[BCA+06, BME02, Bri10, DM98, DMB16,FKH02, FB94, GB96, GLRS01, HC10,HDB+12, HT01, KB98, KSHS01, LRT07,Luo99, MBE03, MCdS+08, Mul02, NPP+00d,PBK00, Pok96, PS00b, Ros13, SS01, STY99,ST02b, Thr99, VS00, VT97, ABCI95a,ABCI95b, ADMV05, BMG07, CBPP02,Cha96, CCM+06, CC00b, DBVF01, DS96b,DPZ97, EV01, GCN+10, GL96, GL97c,HS93, HDB+13, JE95, KJA+93, KC06,LKL96, MLC04, PK05, RGDM15, SHHI01,SL94b, SFL+94, SSC96, TSY99, TSY00,Vos03, WMRR17, YWO95, YX95, Cha05].Shared-Memory[DM98, HDB+12, NPP+00d, Pok96, Thr99,

58

PS00b, ABCI95a, ABCI95b, BMG07, GL96,GL97c, KJA+93, PK05, TSY00]. Sharing[Att96, CML04, CB16, DiN96, JAK17, KK98,JE95, Ott93, PRS+14]. shear [JAT97].ShearLab [KLR16]. Shearlet [KLR16].Shearlets [KLR16]. SHMEM [BBDH14,Hus01, LSK04, Sch96a, Sch96b, SS01].Short [KBM97, MH01, BMPZ94a, PARB14].Short-Range[KBM97, MH01, BMPZ94a, PARB14].shorter [NB96]. Showcase [USE00].SHPCC [IEE92]. SHPCC-92 [IEE92].SIAM [BBG+95, DKM+92, Sin93]. Side[kLCCW07]. Sided[BPS01, GFD03, GFD05, GT01, HDB+12,LRT07, MH01, MB00, TGT05, TRH00,ZSG12, bT01a, BM00, DBB+16, LSK04,MS99c, PGK+10, GBH14]. SIGCSE[ACM06a]. Signal [IEE95e]. signals[Uhl95c]. Signatures [Gro00]. significance[AMHC11]. silent [FME+12]. silicon[Ano03, Goe02]. SIMD [BvdB94, HS95b,KDT+12, LL16, Sur95b, VSW+13]. Simple[MSF00, Mul01, SC04, ITT99, JH97, Nes10,PNV01]. simulate [Heb93]. Simulated[BHM94, BHM96, FH97, RSBT95].Simulating[DLM+17, KDL+95b, KDL+95a, NFG+10].Simulation[CDMS15, CCBPGA15, DMMV97, DZDR95,GSI97, GM95, GJN97, Ham95a, JML01,KBM97, KMK16, LLRS02, MFTB95,MPD04, MANR09, PCY14, PKYW95,PZKK02, RR00, RDMB99, SSAS12, Str97,Ten95, UZC+12, ZZ04, ZWJK05, dlAMC11,Ano95d, ADR+05, BJ95, BCM+16, BH95,BMPZ94b, CwCW+11, CSPM+96, DSOF11,FHSO99, FO94, FFFC99, GRTZ10, JAT97,JLS+14, KTJT03, KMC96, KMC97,LCVD94b, LCVD94a, LYZ13, MMW96,MALM95, NB96, NF94, OKM12, PARB14,PY95, RFH+95, SWYC94, SSP+94, SKM15,Str96, Syd94, Tho94, YPA94, YEG+13,YSL+12, Eng00]. Simulation-Based

[ZWJK05]. Simulations [CNM11, DFMD94,DI02, GAP97, HLP11, HF14a, HF14b, KT02,Kha13, NH95, RTRG+07, SM02, YPAE09,ADT14, ABG+96, BHS18, BADC07, Hin11,JMS14, LS10, LSVMW08, RMNM+12, SU96,WWFT11]. Simulator[CAM12, MRV00, UTY02, WPC07, AMV94,LS10, PWD+12, WZWS08, ZAFAM16, ZZ95,KTJT03, Nak03, Nak05a, Nak05b].Simulators [SB95, AVA+16]. Singapore[IEE96d]. Single [BM00, HF14a, HF14b,MB00, URKG12, AGIS94, KKLL11].Single-Chip [URKG12]. Single-sided[BM00]. single/multigrid [AGIS94]. Sinks[JPT14]. Sites [Ano98]. Sixth[HK95, IEE96c, MMH93, SW91]. size[GKCF13]. sized [JLS+14]. Sizes[DALD18, ZSnH01]. SKaMPI [KRS99,RSPM98, RH01, Reu01, RST02, Reu03].SkelCL [SG14]. Skeleton[GB98, IH04, RJDH14]. Skeletons [Ser97].Skjellum [Ano95c, Ano00b]. Slack[KFL05, FKLB08]. SLAE[ADRCT98, AK99]. Slave [LTR00, HP05].SLEPc [DR18]. SLICC [KBHA94]. Slices[GSHL02]. Small [HLP11, TS12b, Ano94h].small-footprint [TS12b]. Small-World[HLP11]. Smith [KDSO12]. Smithsonian[Str94]. smoking [YSL+12]. SMP[Add01, CRE99, CRE01, CCBPGA15,HD02a, DK06, GT01, GMdMBD+07,HD02b, Hus00, HIP02, JKHK08, KOI01,KKH03, KMG99, KAC02, NO02b, NO02a,ST02a, TOTH99, Tra02b, YWC11, bT01a].SMPCkpt [DCH02]. SMPI [DLM+17].SMPs [HLCZ00, NU05, SvL99]. SMPSs[MLAV10]. SMPSuperscalar [GCBL12].SMT [PAdS+17]. SMT-based [PAdS+17].snake [JPP95]. snake-in-the-box [JPP95].Snir [Ano96a, Ano99a, Ano99c, Ano99b,Ano99d, Nag05]. SnuCL [Lee12]. soccer[YMYI11]. socket [LS10]. Softshell[SKK+12]. Software[Ano94i, BME02, BPG94, BDG+xx, CZ95b,

59

ESB13, FFP03, GBF95, Gre95, HPR+95,HS94, HHA95, IEE95l, IEE96h, IFI95,KS15a, KC94, KAMAMA17, KG93, MBE03,NPS12, Ost94, PZ12, Sil96, TDBEE11,VdS00, Wis01, Wol92, Ano97, BSC99, Boi97,Bra97, BR94, CMV+94, CBPP02, DPZ97,Hum95, JH97, JB96, LM94, MK94, Neu94,Old02, PHA10, PK05, PGK+10, RAS16,SHHI01, Sch94, Sei99, SPH95, Str94, ZGN94,Ano94i, KG93, Sil96]. Solan [CGB+10].Solaris [Ano01a]. solidification [JLS+14].solids [Hin11]. Solution [DWL+10,FBSN01, HO14, RPM+08, SEF+16, Tsu12,VRS00, DWL+12, IM95, JK10, LSR95,MALM95, ON12, PRS+14, SC96a].solutions [AGIS94, LMG17]. Solve[Hog13, Riz17, BAV08, Che99, GGGC99].Solver [Ben01, BP98, CF01, HSMW94,IDD94, LZ97, SJK+17a, SJK+17b, WJB14,AMS94, CP15, DM12, JR10, LM99, Lou95,RM99, SRK+12, SCC95, THM+94, ZZG+14].Solvers [DFN12, DALD18, GK10, MSB97,NO02b, Nak03, NHT02, NLRH07, QRMG96,RS97, WR01, ABF+17, ADLL03a,ADLL03b, ADDR95, BRR99, CL93, DR18,MKP+96, MS95, NO02a, Nak05a, Nak05b,NHT06, PR94c, QRG95, SSH08]. Solving[ADRCT98, BHM94, BHM96, BV99, BG95,BDG+92c, BSH15, DALD18, GFPG12,Huc96, LLY93, MS02a, NF94, SAS01, SP11,SD99, BB95, DSM94, HHA95, LBB+16,LYSS+16, MM11, SSB+16, SMSW06,YSVM+16, YSMA+17]. SOM [GkLyCY97].Some [BDT08, Mul01, Pet97, AL92, NN95,RSBT95]. Sopron [VV95]. Sorrento[DKD05, DKD07]. sort [KVGH11, PSHL11].sorting [BHJ96, PSHL11]. Sound [SG12].Source [BGG+15, MM07, AVA+16,NCB+17, Nob08, PSK+10].Source-Code-Correlated [MM07].Sources [ZDR01, KM10]. South [ACM95a].southeast [ACM95a]. Sowing [GL97a]. SP[BGBP01, CE00, HMKV94, LC97b, WT11,WT12]. SP-1 [HMKV94]. SP-2 [LC97b].

SP1 [BR95c, FHPS94b, FHP+94, FHP+95,Fra95, FWR+95, GL95d, HSMW94, MP95].SP1/SP2 [FHP+95, Fra95, FWR+95]. SP2[BR95b, FHP+95, Fra95, FWR+95, HWW97,JF95, KB98, KHS01, MABG96, XH96].SPAA [ACM95b]. Space[CML04, CB16, HO14, MSF00, OFA+15,SAS01, SS01, TA14, SRK+12].Space-Sharing [CML04]. Space-Time[HO14, SRK+12]. SPAI [BBS99]. Spain[DLM99]. SPAN [LHHM96, Li96]. Spanish[VP00]. spanning [NCKB12]. Spark[KWEF18]. Sparse[AZ95, BBH12, DS13, Huc96, NHT02, TD98,ZB97, AK99, ADLL03a, ADLL03b, ER12,FJZ+14, GG99, Gra09, NHT06, XXL13].SPEC [Ano03, MvWL+10, MBB+12, NA01,SGJ+03, TSB03]. Special[AM07, BDT08, BDB+13, BC00, CHD09,DKD07, DKD08, GSA08, MPI98, Bos96,Mar02, PNV01, Reu01, Old02]. Specific[DM95b, DM95a, Olu14]. Specification[BG94a, BdS07, MGC12, MHSK16, BG94c,LPD+11]. Specifications[OFA+15, WMP14]. Specified [MGMH97].specifying [LPD+11]. specimen [Rol08b].SPECT [BCD96]. spectator [YMYI11].Spectra [Str97, SR11]. Spectral[MW98, BCM+16]. spectral/hp [BCM+16].Speculation [AELGE16, SHLM14].Speculative [RA09, dOSMM+16]. Speed[CDHL95, Tou00, AH95, Ano03, BWT96,BID95, KMK16, CDH+95]. Speeding[CSV12]. Speedup [VPS17]. SPH[CP15, OLG+16, PBC+01, WMRR17].Sphere [CT94a, CT94b]. spherical [KT10].SPICE3 [WPC07]. Spiking [CAM12]. Spin[HLP11, KO14]. splitting [TCBV10].SPMD[BST+13, Dar01, KAC02, Wal00, Wal02].SPMD-Like [BST+13]. Spokane [IEE93c].Sponge [HSW+12]. spontaneous[EZBA16]. Spring [Ano94g, IEE93a].SPTHEO [Sut96]. SPY [SSG95]. Squares

60

[PWP+16, VRS00]. SR [YWCF15, ZLP17].SR-IOV [YWCF15]. SR8000[NNON00, TSB02, TSB03]. SS7 [LTLC94].SSGM [HPS+96]. SSS [MMH98].SSS-CORE [MMH98]. St [Mal95].Stability [DSS00]. stable [JMdVG+17].Stage [FSXZ14]. Stampi [ITKT00].Standard [DM98, GSI97, GLP+00, GL95c,Hem94, MPI98, NH95, SKD+04, SGS10,Wer95, YKLD17, Ano94d, BDB+13, Bor99,Cla98, CG99b, DHHW93b, DOSW96, FB95,GK97, GL92, Hem96, Sti94, VM95, Wal94a,Wal94b, WD96, Ano97, Bra97, CGH94,DOSW95, GLDS96]. Standards[FKKC96, Thr99]. Star[CDM93, Coo95a, Coo95b]. STAR/MPI[Coo95a, Coo95b]. Start [Gro02b, Hus98].Startup [PS07]. State [ACM11, IEE94f,IEE95j, Wis96a, Wis96b, BTC+17, LF93b].state-to-state [BTC+17]. states [NS16].Static [NIO+02, NIO+03, RLVRGP12,SCB15, SCB14]. Static/dynamic [SCB15].Statics [TG94, TG94]. Stationary [MW98].Statistical [LR01, SNMP10, AMHC11,KKM15, Roh00, SL94a, Vet02]. Status[Bak98, DZ98b, GL95c, BDG+93b, FHP+95,Hem96, Sun96]. stealing [TCBV10].Steepest [Sch01]. Steering [GKP97, PK98].Stencil [CGU12, WTTH17, KD13, TBB12].stencil-based [TBB12]. step[Kos95b, ZG98]. Stereo [ZBd12, Qu95].Steve [Ano96a, Ano99a, Ano99b, Nag05].Steven [Ano96a, Ano99a, Ano99c, Ano99b,Ano99d, Nag05]. Still [HCA16]. Stochastic[DK02, LLRS02, MW98, PTMF18, RSV+05,JK10]. Stockholm [Eng00, HAM95b].Stokes [Che99, DLR94, HSMW94, IDD94,Lou95, PTT94, SCC95, ZZG+14]. stop[Gua16, LMG17]. stop-and-restart[LMG17]. Storage[ACM04, Hol12, LCK11, HP11, NFG+10].stores [HSP+13]. straight [YULMTS+17].Strategies[MM02, BVML12, CG99a, DBVF01, MM03,

OPW+12, PSK08, TSZC94, VB99].Strategy [AIM97, DI02, Hat98, VPS17,ZB94, ZSG12, DKF94b, DR95, MSL12].strayed [Rol08a]. stream[HSW+12, UGT09]. Streamline [CGC+11].StreamScan [YLZ13]. Strength [Kon00].String [MM02, MM03]. striped [KDSO12].Strongly [GAP97, ZZG+14]. Structural[PSSS01]. Structure[CBL10, LAFA15, SYF96, WHDB05,EPML99, SEC15, SY95, ZAT+07].Structured [FB96, Mar06, MRB17,NLRH07, Ran05, Bis04, CLSP07, FR95,GBR15, JAT97, Smi93b]. Structures[GMPD98, JY95, KA95, OKW95, SHPT00,WB96, YPA94]. studies [DHP97]. Study[AIM97, BF01, BHLS+95, DARG13, EGC02,FPY08, GL97a, HHC+18, KCR+17, LSB15,MM02, NSLV16, NA01, PK05, RRBL01,SCL01, TG94, AGR+95b, BJ13, BfDA94,BJS99, BY12, Bri00, CBM+08, DXB96,ED94, FO94, JR13, KBG16, LPD+11,LLH+14, MS96b, PSK08, PGK+10, PSHL11,RSBT95, RJC95, TPD15, Wal01b, ZSK15].Stuttgart [KGRD10, WPH94]. style[JPOJ12]. sub [MJG+12].sub-communicators [MJG+12].subcircuit [HLO+16]. subdomain[CEGS07]. subdomains [SHHC18].subgroup [XLW+09]. Submitting [NSS12].Subrange [Str97]. Subroutine [Saa94].subroutines [dCH93]. subsurface [ED94].subsystem [BMG07, MABG96].Subsystems [STMK97]. Subtle [SAL+17].Success [Gro01b, LF+93a]. Successes[Gro01a]. Successful [Gro12]. suffix[DK13]. Suitability [Mat01b]. suitable[MAS06]. Suite [ACMR14, AKE00,BWV+12, MBB+12, Riz17, Ano03, BO01,MvWL+10, TG09, YSWY14, SNMP10].Suites [MCS00, SGJ+03]. summation[IHM05]. Sums [ST17, MYB16]. SUN[BM00, SJ02, WSN99]. Sunderam[Ano95b]. Super [Gua16, YX95].

61

Super-Object [YX95]. Supercomputer[Ano93a, CLP+99, Str94, AAC+05, BGH+05,EFR+05, GL96, GL97c, KMH+14, NSM12,Ste94, GS91b, MAB05]. Supercomputers[BP93, BDG+92c, EKTB99, KN17, WT11].Supercomputing[ACM96b, ACM04, ACM05, BDG+91b,HK93, IEE91, IEE93e, IEE94h, Liu95, Sch94,ACM94, ACM96c, Ano93f, BG91].superlattice [Pri14]. superscalar [ACJ12].Supersonic [CCBPGA15]. Support[Ano98, BBG+10, BFBW01, CFF+94,DMMV97, FGRD01, GRV01, GOM+01,HRSA97, LMRG14, MK04, OP98, PSM+14,RR02, SDN99, SBT04, TW01, Wis98, Wis01,YSP+05, BBH. . . 13a, BL99, CC10, CZ95b,DLR94, Hos12, Maf94, TSY99, TSY00,TY14, WK08a, WK08b, WK08c, YAJG+15].Supported [KLR16, CDD+96].Supporting[FD00, FMSG17, GAML01, Gua16, MMS07,OOS+08, WLNL03, WLNL06, WCS99,YWCF15, FLD96, GAM+00]. Supports[AELGE16, CLL03, DGMS93]. suppression[WWZ+96]. Surface [KS15b, PKYW95,BHW+12, DCD+14, RAGJ95, TSP95].Survey [Sap97]. Survive [ABB+10].sustainable [CGBS+15]. SVD [CMH99].Swan [HD11]. Swapping [SC04]. Sweden[Eng00, HAM95b, FF95]. Swendsen[KO14]. Switch [SCL01, TBD96]. Switched[LC93, KYL03, KYL05]. SWITCHES[DT17]. Switzerland[GT94, Ano94i, IEE97b]. SX[HRZ97, TRH00]. SX-4 [HRZ97]. SX-5[TRH00]. Sydney [Bil95]. Sylvester[GK10]. Sylvester-Type [GK10].Symbolic [CCK12, Coo95b, Ste00,YYW+12, ACM97a, BHKR95, Coo95a,Lev95, LGKQ10, LLG12, SMAC08].Symmetric[BDV03, MDM17, BAV08, DCH02, GG99].Symposium [ACM95b, ACM96a, Ano94a,Ano95d, BG91, DE91, HHK94, IEE93c,

IEE93b, IEE94a, IEE94e, IEE94g, IEE95c,IEE95d, IEE95k, IEE95f, IEE95g, IEE96b,IEE96c, IEE96f, IEE96e, IEE97b, IEE97c,IEE05, LHHM96, Li96, NM95, Ost94, SL94a,Sie94, Sie92a, Sie92b, Ten95, Tou96, USE94,UCW95, ACM97a, ACM06a, Ano93a,Ano94h, Lev95, Old02]. synchronisation[SDB+16]. Synchronization[LA02, OCY+15, TGT05, BMG07, LA06,TMTP96, YLZ13]. Synchronizing [VT97].Synchronous[Ada97, BJ13, Cer99, DLRR99, HZG08].Synergia [SSAS12]. Synergistic [UGT09].Synthesis [CS14, GWC95]. synthesized[MC17]. Synthesizer [DS16]. Synthesizing[AJF16, NP12]. Synthetic [CC17, DP94].Syracuse [IEE96f]. SYSMO [MM95].System[Ada97, AJ97, AH00, BG95, BDG+xx, BL95,BFZ97, BGD12, CAM12, CGC+02, DBA97,DALD18, ERS95, ERS96, EK97, FBD01a,FBVD02, FFP03, Fis01, Gal97, GCBM97,GS91b, GS92, GSxx, GM95, Gre95, HS94,KBA02, LLRS02, LTR00, LLY93, Maf94,MRV00, MM02, MSF00, MMH98, MMS07,MMH93, NPP+00d, NMS+14, Oed93,PPT96a, RGD97, SGJ+03, SSB+05, SCP97,SA93, ST02b, Sun93, TSS00b, Tsu07, UP01,Wil93, ARS89, AS92, AL92, BB94, Bri95,BBH+15, DL10, FNSW99, FK94, GS91a,GS93, GS96, GMU95, GkLyCY97, HDDG09,Hum95, HS95b, IBC+10, ITT99, JH97,JLS+14, KW14, Kik93, LBD+96, LKL96,LL95, MA09, MMR99, MMB+94, MAS06,MM11, MS99b, MALM95, NAJ99, PPT96b,PPT96c, PK05, RJDH14, RTL99, SHHI01,SL94b, Sei99, SPL99]. system[SGDM94, Sun96, Sur95b, VSRC94,VSRC95, WCC+07, WZWS08, YPZC95,YZPC95, ZL96, ZPLS96, ZWZ+95, dCZG06,AL93, NMW93, Yan94]. System-Initiated[SSB+05]. system-on-a-chip [dCZG06].System/6000 [AL93, NMW93]. Systeme[GBR97, GEW98]. Systems [AAB+17,

62

Ano94b, Att96, BCGL97, BGBP01, BME02,BPG94, Bha93, CDJ95, CAWL17, CFF+94,CSW97, CJNW95, Coo95b, FD96, FGKT97,Fos98, Gua16, HRSA97, IEE93d, IEE94d,IEE95a, IEE95i, IEE96h, KKH03, KP96,KDL+95b, KCR+17, KS97, LY93, LW97,MWG97, MBE03, MJB15, MBB+12, SM03,SGS10, SS96, TMP16, THN00, USE94,YGH+14, YH96, ZB97, dGJM94, AGR+95b,ACMZR11, ATL+12, Ano94e, BBB+94,BAV08, CKO+94, CLYC16, CBPP02,Coo95a, CPR+95, DF17, DR94, DBVF01,DvdLVS94, FHB+13, GBR97, GCN+10,GEW98, GKK09, GKCF13, Gra09, GFPG12,GHH+93, HHA95, IM95, JB96, JJM+11,KSG13, KHB+99, KDL+95a, KFSS94,LR06b, LH98, LCVD94b, LLH+14, MSL12,MvWL+10, Old02, OPW+12, Pan95b, Par93,QB12, SSKF95, SPH95, SVC+11, Smi93b,SG14, SMSW06, SLN+12, Sun94b, TBB12,TMW17]. systems[TSP95, WCS+13, WWZ+96, WADC99,WYLC12, ZL96, ZGC94, dH94, dlAMC11,dlAMCFN12, JWB96]. Systemsoftware[Sei99]. systolic [BSC99].

T3D[AZ95, AFST95, CCSM97, HWW97, MP95,MWO95, Oed93, Sch96a, Sch96b, SCC95].T3E [BBS99, Boo01, Che99, GRRM99,LSK04, RBB97c]. T3E-512 [RBB97c].T3E-600 [LSK04]. T9000 [BR94]. table[BJ13]. Tabu [BSH15, Cza13, CB11]. Tags[Wis97]. Tails [Kha13]. takes [GDB+93].Talbot [ACMR14, Riz17]. Tapir [SML17].targeting [JKM+17]. Task[AHD12, AAB+17, FKKC96, GPC+17,IOK00, KOI01, LHCT96, Mar03, MJB15,NIO+02, NIO+03, NSZS13, NJ01, OP10,OS97, SGZ00, SPL+12, TBS12, TS12a,APBcF16, ABF+17, BGH+05, GKCF13,OdSSP12, OPW+12, OPP00, RRFH96,RFRH96, SKB+14, WC15]. Task-Based[AHD12, AAB+17, SPL+12, SKB+14].

Task-Overlapped [GPC+17].Task-Parallel[NSZS13, APBcF16, ABF+17]. Taskers[FLD96]. Tasking [DFA+09, KaM10,SHM+10, TSCaM12, WC15]. Tasks[ACD+09, DT17, DFA+09, JW96, OP98,RR02, RDLQ12, YSS+17, BS01, DDYM99,DR95, FKK+96b, FKK96a, IvdLH+00,PKE+10]. TAU [MMS07]. taxonomy[SPH96]. TBSCM [BP98]. TC2 [Boi97].TC2/WG2.5 [Boi97]. TCGMSG[GB96, Mat94, Mat95]. TCP [KPW05]. TD[And98]. Teaching[MK00, JY95, MK97, PKB06]. Technical[Ano93c, Ano98, MC94, USE95, ACM06a].Technique [BCD+15, HC06, HAA+11,HC08, Nes10, RBB17, MAIVAH14].Techniques [CP97, GS02, Mul01, SAL+17,SPL+12, TGBS05, Wis01, BPG94, Fer04,FCS+12, HKMCS94, JKN+13, KBG+09,NFG+10, PF05, SKS01, WST95].technologies [Mal95]. Technology[Ano97, Bra97, CGB+10, CSV12, Dan12,GN95, HS94, PWP+16, SBT04, TBG+02,Ano93a, Ano93c, D+95, DM12, IEE94c,NS16, ZAT+07]. Tekniska [Eng00].Telegraphic [ES11]. TELMAT [BR94].temperature [Hin11]. Template[GSI97, PKB06]. Templates [BN12, KH15].Tennessee [PR94b]. terabyte [KTJT03].Terabytes [IEE02]. teraflops [KTJT03].Terms [KD12]. Tessellation [SS09]. Test[SNMP10, TG09, AAAA16, CPR+95, GL92].Testbed [Mat01b, EGH99, PY95]. Testing[CCK12, DKF94b, Ost94, VdS00, CMV+94,DKF93]. Testsuite [WCC12]. Texas[ACM06a, IEE94b, IEE95l, IEE95g, IEE97c,Y+93]. Text[LTR00, MM01, RLL01, RTL99]. Textbook[Ano98]. textural [WKS96]. texture[HE15]. TFETI [SHHC18]. TH [CFDL01].TH-MPI [CFDL01]. Thakur [Ano00a].Their [Bru12, GOM+01, RG18, GSMK17].theorem [Sut96]. Theory

63

[GK10, BW12, CBHH94]. Thera [CD01].Think [HCA16]. Third[BPG94, Bos96, DSM94, GA96, IEE94g,Sil96, Was96, BDLS96, Mal95, IEE97c].Thirty [Y+93]. Thirty-seventh [Y+93].Thousands [PZKK02, BMS+17]. Thread[AELGE16, ETWaM12, GOM+01, GT07,Nit00, Pla02, STY99, HK09, IDS16,JKN+13, SPH96, SLN+12, YZ14].Thread-Level [AELGE16, HK09, YZ14].Thread-Safe [Pla02]. Thread-safety[GT07]. Threaded [BBG+10, MG15, Ada98,EBKG01, SCB15, SVC+11, TSY99, TSY00].threaded-MPI [SVC+11]. Threading[BHV12, SBT04, TBG+02, KPO00, KRG13,QB12, ZAT+07]. Threads [CP98, LD01,Lee06, BS01, MVTP96, ALW+15]. Three[Car07, GA96, Nak05b, Ram07, SAS01,GSMK17, LSSZ15, Mar05, PR94c]. three-[GSMK17]. Three-Dimensional[GA96, LSSZ15, PR94c]. Three-level[Nak05b]. Throughput[Tsu07, ESB13, PP16]. Tightly [SS01].Tightly-Coupled [SS01]. Tilewise[KS15b]. Time [BCL00, FHK01, FSSD17,GSHL02, GOM+01, HO14, KFL05,MFTB95, OP98, SCL01, SS96, TSP95,UP01, YGH+14, AL96, CDMS15, DLR94,DM12, Fer04, FLB+05, FKLB08, GB94,HE13, JE95, KC94, KPL+12, LHLK10,LBB+16, LYSS+16, LM13, MMW96, NZZ94,ON12, OdSSP12, PTMF18, Ram07, SBW91,SSB+16, SK92, SRK+12, TSY99, Tho94,TVV96, TCBV10, Uhl95c, VM94,YSVM+16, YSMA+17, ZWZ+95, SKD+04].time-dependent[DM12, LBB+16, LYSS+16, ON12, SSB+16,YSVM+16, YSMA+17]. time-domain[HE13, NZZ94, Ram07, VM94].time-independent [CDMS15].time-varying [Uhl95c]. times[MLVS16, NB96, SSS99]. timing [Ols95].tips [Fer04]. TLM [SC96a]. TM[GGCM99, GCGS98, KHS01]. TN

[DT94, BR94]. TOD [GPC+17]. TOD-Tree[GPC+17]. today [IEE94c]. Toeplitz[BV99, BAV08]. Tolerance[GKP97, GL04, LMRG14, LNLE00, RPM+08,TS12a, WC09, Wil93, SG05, ZHK06].Tolerant [BBC+02, BCH+03, BHK+06,CF01, CFDL01, FD00, FBD01a, FBVD02,FD02a, FD04, GFB+03, IEE95c, JSH+05,MSF00, BCH+08, FBD01b, FD02b, HG12,LMG17, LS08, NCB+12, NCB+17, PKD95].Tomographic [Pat93]. tomography[FWS+17, RCFS96]. tomorrow [IEE94c].Tool [Ano01b, Beg93b, BFMT96b, DW02,GSN+01, KAMAMA17, KSJ14, KKP01,LMRG14, MMSW02, MK04, NE98, SR96,SGL+00, Tra12b, WL96a, AGG+95,BDP+10, Beg92, Beg93c, Beg93a, BDY99,BFMT96a, BHW+12, CPR+95, DKF94a,FSTG99, HPR+95, HD11, LCC+03,MdSAS+18, TSS98, WL96b, WL96b].Tool-Set [WL96a]. Toolbox[Ano97, Bra97]. Toolkit[Ano12, LC07, LLC13, SLS96]. Tools[ABC+00, BDG+91b, BDG+93a, BS96a,BDL98, BoFBW00, Cha05, CDD+96, DT94,EV01, GMPD98, MHC94b, MCLD01,PKB01, STMK97, Vos03, Wan97, AVA+16,BDG+92a, BFIM99, Fan98, GBF95, LH98,MSW+05, MHC94a, ZL96].Tools-supported [CDD+96]. Top[AHP01, Gal97, Hus01, Man01, PTH+01b,Ser97, BBCR99, PTH+01a, SSC96, SCL97,CCHW03]. TOP-C [CCHW03]. ToPe[JKM+17]. topologies [BCM+16, MK00].Topology [DK06, Hat98, HM01, Tra02a,HRR+11, MBBD13, SPK+12].topology-aware [MBBD13].Topology-Based [HM01]. TOPPER[KKP01]. Toronto [GGK+93, Vos03].Torus [SG15]. Townsend [DT94]. TPVM[FS95, FS98]. Trace [Ney00]. Traceback[dOSMM+16]. Tracefiles [FCP+01]. Traces[CC17, MANR09, WM01, CDMS15,DWM12]. Tracing [CGLD01, DP94, KG96,

64

CG93, Mor95, SGS95]. Tracking[GAP97, HD02b]. Trading[BHM94, BHM96]. traffic [Zah12].Training [CSV12]. Transactional[BWW+12, MFG+08, SBG+12].Transactions [BWW+12]. Transfer[BKGS02]. Transfers [THS+15].Transform [YULMTS+17, KT10, DBLG11].Transformation[EP96, NSZS13, GSMK17, HZ96, TSY00].transformations [JE95, TG94].transformed [BY12]. Transforming[PSK+10]. Transforms[ACMR14, KLR16, HP11, Uhl95c, Zem94].Transient [SIS17]. transistor [Ano03].transistors [Ano03]. Transition [MRV00].Transitive [CGPR98, PPR01]. Translating[Mar09, NCB+12]. Translation[DDL00, SSE12, HCL05, LME09, NCB+17].Translator[KMK16, UZC+12, CHKK15, GScFM13].transmitters [WWZ+96]. Transparent[CCK+95, IFA+16, NPP+00c, SLGZ99,LFS93a, LFS93b, LFL11, NPP+00a, SOA11].Transparently [CB16]. Transport[KHS01, RS97, VRS00, WR01, ZZ04, Pri14,SH94, WH96]. Transporter [Fer92].transpose [Bha98]. Transposition[HD02b]. Transputer[Ara95, ACDR94, CJNW95, FK95, FF95,GN95, GHH+93, MC94, dGJM94, ZPLS96,Ara95, CJNW95, GHH+93, dGJM94].Transputers [ACDR94, AGR+95b, dCH93].Transtech [Ste94]. trap[LBB+16, SSB+16, YSVM+16]. TRAPPER[KFSS94, SSKF95]. travel [SSS99].travel-times [SSS99]. traveling [GM94].traversing [BDG+92b]. TreadMarks[LDCZ97]. Tree [GPC+17, ADB94, AB13,BCAD06, CG93, SGS95, Zah12]. Trees[CDPM03]. Trends [Duv92, IEE93d,MBS15, JPTE94, SGDM94, Sun96].Triangle [SL94a, SOA11]. Triangular[Hog13, MRB17]. tricks [Fer04, LK14].

Tridiagonal [DALD18, DR18]. Triolet[RJDH14]. Trivandrum [IEE96a]. Troy[SS96]. Truncated [ZB97]. truncating[Ram07]. TSMC [Ano03]. TSUBAME[NSM12]. Tsukuba [SHM+10]. TTIG[RRBL01]. Tucson [JB96]. Tuning [Cza02,Cza03, NPP+00d, SLJ+14, WG17, DBLG11,FE17, LGG16, SH14, Yan94, FVD00]. tuple[MYB16]. tuple-based [MYB16].Turbulence [Str97, MRRP11, Str96].turbulent [BCM+16]. Tutorial[EM00a, EM00b, GBD+94, GLT00b, Nov95,Per96, Ano95b]. TV [CIJ+10]. Twenty[ERS95, ERS96, HS94, IEE95c, MMH93].Twenty-Eighth [ERS95]. Twenty-fifth[IEE95c]. Twenty-Ninth [ERS96].Twenty-Seventh [HS94]. Twenty-Sixth[MMH93]. Two [CM98, STY99, SJK+17a,SJK+17b, YM97, AGR+95b, AL93,ADLL03a, ADLL03b, CB11, ED94, HAJK01,MSP93, dlAMCFN12]. Two-Dimensional[SJK+17a, SJK+17b, AL93]. two-layer[dlAMCFN12]. Two-level [STY99].two-phase [ED94]. TX[ACM00, Cha05, DKM+92, Ano95a, Ano95d].Type [GK10, MSB97, FVLS15, GFPG12].Types [Wel94, NYNT12]. typy [OA17].

U.S. [LD01]. U.S.A [Ano94e]. Uberblick[Wer95]. UK [Abr96, AD98, EJL92, HK95,BP93, CJNW95, MC94]. UKMO [RSBT95].ULFM [LCMG17]. Ultra [SJ02].Ultra-High [SJ02]. Ultrafast[KRC17, FWS+17]. Umgebung [GBR97].UML [RGD13]. UML/MARTE [RGD13].Umpire [VdS00]. Unbalanced [OP10].Uncertainty [MBS15]. Understand[DeP03]. Understanding [CRE01]. Unibus[KSSS07]. UNICOM [Ano93g]. unified[GKZ12, JC17, KSL+12]. unifies [RJDH14].uniform [KSG13]. uniformly [Tra12a].Unify [VSRC94, VSRC95]. unifying[CCM12]. Unintended [SAL+17]. unit[VDL+15, MSML10]. United [Boi97]. Units

65

[KS15b, LSVMW08, ABDP15, BHS18,LHLK10, WWFT11, HJBB14]. Universal[LW97, DDLM95]. University[CGB+10, IEE94d, IEE95j, R+92]. Unix[OLG01, RBS94]. unscharfer [Wil94].Unstructured [AB93a, NO02b, SM02,SM03, AB93b, NO02a, TPD15]. unveils[Ano03]. UPC [EGC02, MTK16, Mar05,SJK+17a, SJK+17b]. Update[KT10, GSMK17]. Updates[ESB13, KS15a, ZDR01, HSE+17]. UPM[NPP+00d]. ups [Ano03]. USA[ACM96b, ACM98b, ACM00, ACM06a,AGH+95, BBG+95, BS94, Cha05, CGKM11,DT94, EV01, EdS08, ERS96, Gat95,Ham95a, Hol12, IEE95b, IEE95d, IEE96f,IEE96e, IEE96i, MCdS+08, Old02, PBG+95,Ree96, Sin93, Ten95, ACM95b, ACM97b,Agr95a, Ano89, B+05, DKM+92, HS94,IEE94e, IEE95k, IEE02, Ost94, SL94a, SS96,USE94, USE95, USE00]. Usage[FD02a, FCLG07, FD02b, FVLS15]. Use[FJBB+00, Gro02a, HK93, HK95, MB12,PSZE00, Shi94, AB95, GEW98]. USENIX[USE94, USE95]. User[AD98, ACDR94, BDG+91a, CHD07, CD01,CDND11, DKD05, D+91, DHHW92,DHHW93a, DLM99, DKP00, DLO03,FCLG07, GBD+94, GN95, KGRD10,KCP+94b, KOW97, Kra02, KKD04, LKD08,MC94, MTWD06, NPP+00c, Nov95, Per96,RWD09, TBD12, XF95, ZWZ05, Ano95b,BBB+94, BDW97, KCP+94a, RSC+15,Reu01, Wil94, BBH. . . 13a]. User-Level[DHHW92, DHHW93a, KCP+94b, KOW97,NPP+00c, XF95, ZWZ05, KCP+94a,BBH. . . 13a]. Users [Ara95, CHD09]. uses[SH96]. Using[AR01, ADRCT98, AHP01, And98, AP96,Ano95e, AKE00, AZG17, AB93a, BST+13,BPMN97, BG95, BS93, BKGS02, BM97,Bon96, BBC+00, BBH12, CGC+11, CRE99,CMM03, CP97, CSPM+96, CC17, Che99,CCSM97, CDM93, CCHW03, CRGM14,

CT94a, CCBPGA15, CD98, DeP03, DARG13,DAK98, DGMJ93, EM02, EMO+93, ESM+94,EK97, FAFD15, FD04, FTVB00, FS93,GGCM99, GCGS98, GTH96, GM95, GK97,GS96, GMPD98, GHL97, GJN97, GLS94,GLT99, GLS99, GLT00b, GLT00a, HB96b,HSMW94, HJ98, HLP11, HT08, HRSA97,HT01, IOK00, IDD94, IKM+01, JFGRF12,JPP95, KB98, KOI01, KKV01, KS96, KA13,LLRS02, LTR00, LRT07, LTRA02, LY93,LLY93, LZ97, LAFA15, MTSS94, MPD04,MR12, MSCW95, MANR09, MBB+12,MSB97, NO02b, NIO+02, NIO+03, Neu94,NH95, NA01, OM96]. Using[OCY+15, OWSA95, PWP+16, PK98,PPT96c, POL99, PT01, Per99, Pet97,PBK00, PD98, Pus95, QRMG96, QMGR00,RR00, Reu03, RRBL01, RLVRGP12, RLL01,RRG+99, SAS01, Sev98, SSAS12, SP99,SA93, Smi93a, SBR95, STV97, SMOE93,Sta95b, ST17, SKH96, SCL01, SJK+17a,SJK+17b, TS12a, TSB02, TSB03, TK16,TBB12, Tha98, Tra98, Tsu07, VLO+08,WO95, Wal01a, WJ12, WLR05, Wis97,Wis01, WLYC12, ZBd12, van97, vdLJR11,AMHC11, AK99, ABF+17, AL96, ADT14,ABG+96, AB93b, AGIS94, AGG+95, BV99,BFLL99, BSC99, BDG+92c, Bic95, Bis04,BCM+16, BTC+17, BCD96, BID95, BAG17,BSH15, BMG07, CG93, CBM+08, CdGM96,CS14, CT94b, CC00b, DG95, DS13, DRUC12,DSOF11, DCH02, DM12, EGDK92, FB96,FSV14, FSC+11, Fin94, Fin95, FHC+95,FWS+17, GGGC99, GSMK17]. using[GG09, Goe02, GFB+14, GMU95, GRTZ10,HB96a, HDDG09, HP11, HPS+96, HPLT99,HASnP00, HLO+16, HAA+11, IJM+05,IM95, IKM+02, JF95, JKHK08, JLS+14,JJY+03, JJM+11, JPT14, JR10,JMdVG+17, KFA96, KRKS11, KY10, Kat93,KJJ+16, KR09, KMK16, KME09, KMC96,KMC97, KRC17, KD13, KPK13, LP00,LSG12, LSSZ15, LCY96, LSVMW08,LCMG17, LO96, MMR99, MP95, Mar06,

66

MSMC15, MAB05, McK94, MM11, Mic93,Mic95, MRH+96, MMM13, MSML10, MS95,MM14, MC99, MvWL+10, NO02a, Nak05a,NZZ94, NB96, NAJ99, NU05, OKM12,OIH10, Ols95, Pat93, PDY14, PNV01,PKE+10, QRG95, RJC95, RAS16, RCFS96,RBAI17, RM99, RCG95, SHLM14, SdM10,SLGZ99, SGS95, SSS99, SMS00, SOA11,SVC+11, SSGF00, SFLD15, SSN94, SU96,SP11, TC94, Tsu95, Uhl94, Uhl95b]. using[UH96, VM94, VB99, VGS14, VM95, WO96,Wal01b, WCS+13, WCVR96, WST95,WMRR17, WADC99, Wor96, WYLC12,XF95, YULMTS+17, YWC11, YWCF15,ZWHS95, ZSK15, ZAT+07, ZZ95, Ano95c,Ano00a, Ano00b]. UT [Hol12]. UTE [JF95].Utilising [SC96a]. Utilities [CC95]. UV2[TW12]. UVM [NSLV16].

V [JB96, BBC+02, BHK+06]. V2 [BCH+03].VA [Sin93, RP95]. Vacancy [HD02b].Vaidy [Ano95b]. Validation [BDV03,GLB00, WCC12, CMV+94, SCB14, SCB15].Value [vHKS94, AL96, LSR95, SP11, SD99].Value-based [vHKS94]. valued [Str12].VAMPIR [BHNW01, NAW+96].Vancouver [IEE95a, IEE95i]. Vapour[PKYW95]. Variable [Ano98, ZZG+14].Variables [FKH02]. Various [LH95].varying [Uhl95c]. VCMON [Whi94].vCUDA [SCSL12]. Vector [AKL16, DS13,Fuj08, KDT+12, LL16, Uhl95c, ER12,FVLS15, FJZ+14, GL96, GL97c, Har94,Har95, HE15, PMZM16, XXL13].Vectorization [IKM+01, MCP17, IKM+02].Vectorized [KB13]. vectors [AAAA16].Vegas [Ano94e]. Vehicle[BHM94, BHM96, WH94, BKvH+14].Vendor [Rab98, Bor99]. Venice[DLO03, OL05]. venture [Ano03].Verification [BCD+15, RAS16, Tra12b,LMM+15, SZ11, VVD+09]. verified[WBBD15]. verifier [BCD+12, LGKQ10].verify [MdSAS+18, SMAC08]. Verilog

[Kat93, KMK16]. Versatile [KSJ14].Version[BCGL97, CCK+95, MHSK16, Bjo95,BHW+12, BBH+15, Man94, Str94, Wal95].versioned [SSB+17]. Versions [Ano98].Versus [RTRG+07, Ahm97, CE00, KPW05,KAC02, KPO00, LMG17, LC97b, MFTB95,NSLV16, NHT02, NHT06, RS95, SZ99,Wal00, ZLZ+11]. verteilter [GBR97].VGRIDSG [AB93a]. VIA[Sei99, FKKC96, BHW+12, CGZQ13, DS96b,GB96, Hos12, HCL05, LAdS+15, LSSZ15,NPP+00c, SLJ+14, Sti94, VBLvdG08,YPZC95, ZLL+12, EM02, RR01]. VIA/SCI[RR01]. viable [Ano03]. Victoria [IEE95e].Video [KSJ95, KSJ96]. videogames[YMYI11]. Vienna [BH95, TBD12, Ben95].View [ZDR01, ZDR04]. ViMPIOS [Sto98].VinaMPI [ESB13]. ViPIOS [Sto98].Virginia [IEE92, IEE94a, Sie92a, Sie92b].VirtCL [YWTC15]. Virtual [ACM96a,AS92, ARL+94, BJ93, BP99, BS93, BG94b,CHD07, D+91, EGR15, Fis01, GBD+94,Gei01, Gre94, ITT99, JPP95, KNT02,KKDV03, KKD04, KKD05, LKD08, LK10,MTWD06, NM95, Nov95, Pat93, Per96,QRG95, RWD09, SSSS96, Sei99, SCSL12,TY14, Tsu07, Wel94, YC98, ARS89, AD98,AL92, Ano95b, BR91, BDG+91a, BPC94,BBCR99, Bir94, BDLS96, BCM+16, BFM96,BDW97, CARB10, Cav93, Cha96, CD01,CXB+12, DDS+94, DM93, DKD05, DLM99,DKP00, DLO03, DPZ97, ESB13, FM90,KMC97, Kra02, LG93, MN91, MRH+96,NB96, PRS16, Sch94, SK92, SCC96, SL00,WK08a, WK08b, WK08c, AGIS94, Sei99].virtual-time [SK92]. Virtualization[FC05, MGL+17, Ott94, YSS+17, ZLP17,RSC+15, SIRP17]. Virtualized[EGR15, YWCF15, RNPM13]. viruses[Str94]. viscoelastic [HK94, MAIVAH14].viscosity [ZZG+14]. viscous [RM99].Vision [KCR+17, JRM+94]. VISPAT[HPS95]. Visual

67

[BPMN97, FNSW99, PDY14, Ros13,ACGdT02, LC07, GE95, GE96].Visualization[BDGS93, GKP96, GKP97, HJ98, KA13,MVY95, NAW+96, PK98, PCY14, Wis96a,ZLGS99, Bor99, Eng00, FHC+95, HPS95,KFA96, TSS98, WST95, Wis96b].Visualizer [HKN+01]. VLSI [Jes93a]. VM[GHD12, McR92, Whi94]. VM-protected[GHD12]. VM/ESA [Whi94]. VMPP[LG93]. VOBLA [BKvH+14]. Vol[ATC94, HS94, Nag05]. Volatile[BBC+02, BCH+03]. Voltage[KFL05, FKLB08]. Volume[Ano99a, Ano99c, Ano99b, Ano99d, DFN12,GHLL+98, SOHL+98, BHW+12, WST95].Volumes [GAP97, SOA11]. Volumetric[KA13, KGB+09]. Voodoo [PMZM16].VOOM [BR91]. VORD [KSJ14]. VR[DBA97]. VRML[ACM96a, NM95, KSJ95, KSJ96].VRML-Based [KSJ95, KSJ96]. vs[FH98, BCH+08, Luo99, Nak05b]. VTC[NU05]. VTDIRECT95 [HWS09, SWH15].VxWorks [YGH+14].

WA [ACM05, LCK11]. Wailea[ERS96, HS94, MMH93]. Waknaghat[CGB+10]. Walker[Ano96a, Ano99a, Ano99b, Nag05]. wall[NB96]. wall-clock [NB96]. walls [JAT97].WAMM [BCLN97]. Wang [KO14].Warehousing [DERC01]. Warp[SCL01, HKOO11, MMW96, VSW+13].WARPED [MMW96]. WARPmemory[SFO95]. Washington [B+05, BS94,IEE93c, IEE94h, IEE95k, Ost94]. water[HTHD99, R+92, dlAMC11, dlAMCFN12].Waterman [KDSO12]. watershed [NAJ99].Wave [BBC+00, EMO+93, ESM+94,NSLV16, SMOE93, Gei94, KM10, KEGM10,Mal01, NB96, RMNM+12]. Wave-Particle[NSLV16]. Waveform [LSR95]. Wavelet[Uhl94, Uhl95b, Zem94, vdLJR11, Uhl95a,

Uhl95c]. Way [Vog13, FGT96]. ways[CZ96]. weak [SD16]. Weather[AHP01, HE02, Bjo95, KOS+95a, Mal01].web[CHKK15, AASB08, NE01, PES99, Wal01b].Web-Based [NE01, PES99]. WebCL[CHKK15]. WebCom [OPM06].WebCom-G [OPM06]. Wednesday [B+05].Weicheng [Ano95b]. weight [KA95].welcomes [Str94]. West [EV01, EdS08].Westin [IEE94e]. We’ve [GKPS97].WG10.3 [DR94]. WG2.5 [Boi97]. Wheeler[NTR16]. where [KC94]. which [SH96].Whippletree [SKB+14]. Wide[FGG+98, dOSMM+16, FGT96, KHB+99].Wide-area [FGG+98, FGT96]. WIEN[Gao03]. Will [CB00]. William[Ano95c, Ano99c, Ano99d, Ano00a, Ano00b].Williamsburg [IEE92]. Win32 [MS98].windows [QB12, Ano01a, CLP+99, FD97,GGGC99, PS01a, SFG98, SSSS97, TAH+01].Windows95 [SSSS96]. Winona [Ano94h].wireless [Bon96]. wissenschaftliche[MS04]. wissenschaftliches [Ano94c].without [BW12, Pla02, YLZ13]. WLAN[MSOGR01]. WMPI [BPS01, MS98,MSS98, MS99c, PS01a, SMS00]. WOMPAT[Cha05, EV01, Vos03]. Woollongong[GN95]. Work [HRSA97, Pet00a, Pet00b,OdSSP12, TCBV10]. work-stealing[TCBV10]. Worker [EML00, YG96].Worker-Based [YG96]. Workerproblem[FH98]. Workflow [LYZ13]. Workforce[Liv00]. workgroup [SDB+16]. Working[Ano98, Boi97, MCS00, Pet01, DR94].Workload [AGS97, DBVF01]. Workloads[CC17, APBcF16, AVA+16, SKB+14].WorkPlace [Ano97, Bra97]. workqueuing[VBLvdG08]. Workshop [ACM98a, Agr95a,BPG94, Bha93, BC00, Cha05, CZG+08,CGKM11, CMMR12, DW94, DT94, EV01,EdS08, Fer92, FK95, FF95, HK93, HK95,IEE93d, IEE93f, IEE94d, IEE95h, IEE96g,IFI95, KG93, Kuh98, Kum94, MdSC09,

REFERENCES 68

PBG+95, PBPT95, SCR92, SHM+10, Sch93,Vos03, Was96, AH95, BS94, Cal94, D+95,DMW96, FR95, GL95b, IEE93f].Workshops [MCdS+08]. Workstation[GHL97, HSMW94, KS96, LC97a, MFTB95,Pus95, YKI+96, AB95, ALR94, BLP93,BSvdG91, BRS92, BALU95, BWT96, CCU95,DG95, ED94, GBF95, Heb93, JRM+94,LL95, NMW93, NN95, PM95, PL96, RBS94,RCFS96, SC96a, SSN94, SL95, THM+94,Tsu95, UH96, YWO95, ZHS99, MS04].workstation-cluster [Heb93].Workstation-Clustern [MS04].Workstations[AR01, BL94, BL95, BM97, BDH+95,BDH+97, BMS94b, DDPR97, EK97, GS91b,HIP02, IDD94, Liu95, LHZ98, MSCW95,MM01, OWSA95, PFG97, TQDL01, VLO+08,AL93, BJ95, BID95, Bru95, BMPZ94b,BMS94a, BMPZ94a, CCF+94, Coe94, DZ98a,DOSW96, GM94, GMU95, HK94, Hus99,KMC96, KMC97, KA95, MK94, MM03,RRG+99, SFO95, SR95, TDB00, dCH93].World [CMMR12, CJNW95, FD00,GHH+93, HLP11, MC94, NSLV16, PSB+94,Wit16, dGJM94, GDB+93, JR10]. Worlds[Rab98]. wormhole[Pan95a, Pan95b, RJMC93, ZGN94].wormhole-routed[Pan95b, RJMC93, ZGN94]. worms[Pan95a]. WoTUG [MC94]. WoTUG-17[MC94]. WPVM [ASCS95, BPMN97].Wrapper [AS14]. Wrapping [LRW01].Write [BIC+10]. Write-Back [BIC+10].Writing [FAF16, SDB94, FNSW99].Written [KaM10]. WWW [KSJ95, KSJ96].

X [Bad16, FWS+17]. X-ray [FWS+17].X10 [CGH+14]. X11 [GKL95]. x86[MGL+17]. Xab[Beg92, Beg93b, Beg93c, Beg93a]. Xen[PRS16]. Xeon [DSGS17, MTK16]. XPVM[KG96]. XXI [EGH+14].

YLC [Gal97]. YMP [BL94]. Yorkshire[CJNW95].

Zero [SWHP05, Hin11]. Zero-Copy[SWHP05]. ZEUS [FF95]. Zipcode [wL94,SSD+94]. zonal [Fin94, Fin95]. Zone[JCH+08, AGMJ06]. zum [Wer95]. zur[GBR97, Sei99].

ReferencesAlQuraishi:2016:CBP

[AAAA16] Eman AlQuraishi, EmanAlDwaisan, Alaa AlSaqaa,and Imtiaz Ahmad. ACUDA-based parallel imple-mentation of a test vec-tors encoding algorithm incompression-based scan de-signs. International Jour-nal of Parallel, Emer-gent and Distributed Sys-tems: IJPEDS, 31(3):280–293, 2016. CODEN ????ISSN 1744-5760 (print),1744-5779 (electronic).

Agullo:2017:BGB

[AAB+17] Emmanuel Agullo, OlivierAumage, Berenger Bra-mas, Olivier Coulaud, andSamuel Pitoiset. Bridgingthe gap between OpenMPand task-based runtime sys-tems for the Fast MultipoleMethod. IEEE Transac-tions on Parallel and Dis-tributed Systems, 28(10):2794–2807, October 2017.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic)ITDSEO. URL

REFERENCES 69

https://www.computer.org/csdl/trans/td/2017/10/07912335-abs.html.

Almasi:2005:DIM

[AAC+05] G. Almasi, C. Archer, J. G.Castanos, J. A. Gunnels,C. C. Erway, P. Heidel-berger, X. Martorell, J. E.Moreira, K. Pinnow, J. Rat-terman, B. D. Steinmacher-Burow, W. Gropp, andB. Toonen. Design andimplementation of message-passing services for theBlue Gene/L supercom-puter. IBM Journal of Re-search and Development, 49(2/3):393–406, ???? 2005.CODEN IBMJAE. ISSN0018-8646 (print), 2151-8556(electronic). URL http://www.research.ibm.com/journal/rd/492/almasi.pdf.

Akzhalova:2008:WPL

[AASB08] Assel Zh. Akzhalova, Da-niar Y. Aizhulov, GalymzhanSeralin, and Gulnar Bal-akayeva. Web portalfor large-scale computationsbased on Grid and MPI.Scalable Computing: Prac-tice and Experience, 9(2):135–142, June 2008. CO-DEN ???? ISSN 1895-1767.URL http://www.scpe.org/vols/vol09/no2/SCPE_9_2_06.pdf; http://www.scpe.org/vols/vol09/no2/SCPE_9_2_06.zip.

Arthur:1993:PIU

[AB93a] T. Arthur and M. Bockelie.A parallel implementation ofthe unstructured grid gen-eration program VGRIDSGusing PVM and APPL. InSincovec [Sin93], pages 899–902. ISBN 0-89871-315-3.LCCN QA 76.58 S55 1993.Two volumes.

Arthur:1993:CUA

[AB93b] Trey Arthur and Michael J.Bockelie. A comparison ofusing APPL and PVM fora parallel implementation ofan unstructured grid gener-ation problem. TechnicalReport NASA CR-191425,National Aeronautics andSpace Administration, Lan-gley Research Center; Na-tional Technical InformationService, distributor, Hamp-ton, VA, USA, 1993. ?? pp.

Aloisio:1995:UPW

[AB95] G. Aloisio and M. A. Bochic-chio. The use of PVM withworkstation clusters for dis-tributed SAR data process-ing. In Hertzberger and Ser-azzi [HS95a], pages 570–581.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Augusto:2013:APG

[AB13] Douglas A. Augusto andHelio J. C. Barbosa. Ac-celerated parallel genetic

REFERENCES 70

programming tree evalua-tion with OpenCL. Jour-nal of Parallel and Dis-tributed Computing, 73(1):86–100, January 2013. CO-DEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S074373151200024X.

Ayguade:2010:EOS

[ABB+10] Eduard Ayguade, Rosa M.Badia, Pieter Bellens, DanielCabrera, Alejandro Du-ran Roger Ferrer, MarcGonzalez, Francisco Igual,Daniel Jimenez-Gonzalez,Jesus Labarta, Luis Mar-tinell, Xavier Martorell,Rafael Mayo, Josep M.Perez, Judit Planas, and En-rique S. Quintana-Ortı. Ex-tending OpenMP to survivethe heterogeneous multi-coreera. International Journal ofParallel Programming, 38(5–6):440–459, October 2010.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=38&issue=5&spage=440.

Adhianto:2000:TOA

[ABC+00] L. Adhianto, F. Bodin,B. Chapman, L. Hascoet,A. Kneer, D. Lancaster,I. Wolton, and M. Wirtz.Tools for OpenMP appli-cation development: the

POST project. Concur-rency: practice and ex-perience, 12(12):1177–1191,October 2000. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500357/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500357&PLACEBO=IE.pdf.

Appiani:1995:PSI

[ABCI95a] E. Appiani, M. Bologna,M. Corvi, and M. Iardella.PVM in a shared-memoryindustrial multiprocessor.In Hertzberger and Ser-azzi [HS95a], pages 588–593.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Appiani:1995:PSM

[ABCI95b] E. Appiani, M. Bologna,M. Corvi, and M. Iardella.PVM in a shared-memoryindustrial multiprocessor.In Hertzberger and Ser-azzi [HS95a], pages 588–593.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Agosta:2015:OPP

[ABDP15] Giovanni Agosta, Alessan-dro Barenghi, AlessandroDi Federico, and Ger-ardo Pelosi. OpenCL per-formance portability for

REFERENCES 71

general-purpose computa-tion on graphics proces-sor units: an explorationon cryptographic primitives.Concurrency and Compu-tation: Practice and Ex-perience, 27(14):3633–3660,September 25, 2015. CO-DEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Aliaga:2017:CTP

[ABF+17] Jose I. Aliaga, MarıaBarreda, Goran Flegar,Matthias Bollhofer, andEnrique S. Quintana-Ortı.Communication in task-parallel ILU-preconditionedCG solvers using MPI +OmpSs. Concurrency andComputation: Practice andExperience, 29(21):??, Nov-ember 10, 2017. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Arbenz:1996:MDS

[ABG+96] P. Arbenz, M. Billeter,P. Guntert, P. Luginbuhl,M. Taufer, and U. von Matt.Molecular dynamics simula-tions on Cray clusters us-ing the SCIDDLE-PVM en-vironment. In Bode et al.[BDLS96], pages 142–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Abrahart:1996:GIC

[Abr96] R. J. Abrahart, editor. Geo-Computation 96. 1st Inter-national Conference on Geo-Computation: Leeds, UK,17–19 September 1996. ????,????, 1996. ISBN ????LCCN ????

Ayguade:2009:DOT

[ACD+09] Eduard Ayguade, NawalCopty, Alejandro Duran,Jay Hoeflinger, Yuan Lin,Federico Massaioli, XavierTeruel, Priya Unnikrishnan,and Guansong Zhang. Thedesign of OpenMP tasks.IEEE Transactions on Par-allel and Distributed Sys-tems, 20(3):404–418, March2009. CODEN ITDSEO.ISSN 1045-9219 (print),1558-2183 (electronic).

Arnold:1994:PCT

[ACDR94] D. Arnold, R. Christie,J. Day, and P. Roe, edi-tors. Parallel Computingand Transputers. PCAT-93.Proceedings of the 6th Aus-tralian Transputer and Oc-cam User Group Conference,November 3–4, 1993, Bris-bane, Queensland, Australia,volume 37 of Transputer andOccam Engineering Series.IOS Press, Postal Drawer10558, Burke, VA 2209-0558,USA, 1994. ISBN 90-5199-149-5. LCCN ????

Acacio:2002:MDM

[ACGdT02] M. Acacio, O. Canovas,

REFERENCES 72

J. M. Garcıa, and P. E. Lopezde Teruel. MPI-Delphi:an MPI implementationfor visual programming en-vironments and heteroge-neous computing. FutureGeneration Computer Sys-tems, 18(3):317–333, Jan-uary 2002. CODEN FG-SEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic). URL http://www.elsevier.com/gej-ng/10/19/19/60/32/28/abstract.html.

Alexandrov:1997:PMC

[ACGR97] V. Alexandrov, K. Chan,A. Gibbons, and W. Ryt-ter. On the PVM/MPI com-putations of dynamic pro-gramming recurrences. Lec-ture Notes in Computer Sci-ence, 1332:305–312, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Agullo:2011:QOM

[ACH+11] Emmanuel Agullo, CamilleCoti, Thomas Herault,Julien Langou, Sylvain Pey-ronnet, Ala Rezmerita,Franck Cappello, and JackDongarra. QCG-OMPI:MPI applications on grids.Future Generation Com-puter Systems, 27(4):357–369, April 2011. CODENFGSEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic).

Andersch:2012:PPE

[ACJ12] Michael Andersch, Chi ChingChi, and Ben Juurlink. Pro-gramming parallel embed-ded and consumer appli-cations in OpenMP super-scalar. ACM SIGPLAN No-tices, 47(8):281–282, August2012. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). PPOPP ’12conference proceedings.

ACM:1990:PAC

[ACM90] ACM, editor. Proceedingsof the 1990 ACM Confer-ence on LISP and Func-tional Programming: pa-pers presented at the confer-ence, Nice, France, June 27–29, 1990. ACM Press, NewYork, NY 10036, USA, 1990.ISBN 0-89791-368-X. LCCNQA 76.73 L23 A24 1990.ACM order no. 552900.

ACM:1994:CPI

[ACM94] ACM, editor. ConferenceProceedings. 1994 Interna-tional Conference on Super-computing. ACM Press, NewYork, NY 10036, USA, 1994.ISBN 0-89791-665-4. LCCN???? URL http://www.acm.org/pubs/contents/proceedings/supercomputing/181181/.

ACM:1995:PAS

[ACM95a] ACM, editor. Proceedingsof the 33rd annual southeastconference [ACM]: Clemson,

REFERENCES 73

South Carolina, March 17–18, 1995. ACM Press, NewYork, NY 10036, USA, 1995.ISBN 0-89791-747-2. LCCN????

ACM:1995:SAA

[ACM95b] ACM, editor. SPAA ’95,7th Annual ACM Sympo-sium on Parallel Algorithmsand Architectures: July 17–19, 1995, Santa Barbara,CA, USA, volume 7. ACMPress, New York, NY 10036,USA, 1995. ISBN 0-89791-717-0. LCCN QA76.642.A25 1995.

ACM:1996:SVR

[ACM96a] ACM, editor. 1995 Sympo-sium on the Virtual RealityModeling Language (VRML‘95). ACM Press, New York,NY 10036, USA, 1996.ISBN 0-89791-818-5. LCCN???? URL http://www.acm.org/pubs/contents/proceedings/graph/217306/.

ACM:1996:FCP

[ACM96b] ACM, editor. FCRC ’96:Conference proceedings ofthe 1996 International Con-ference on Supercomputing:Philadelphia, Pennsylvania,USA, May 25–28, 1996.ACM Press, New York, NY10036, USA, 1996. ISBN 0-89791-803-7. LCCN QA76.5I61 1996. ACM order num-ber 415961.

ACM:1996:SCP

[ACM96c] ACM, editor. Supercom-puting ’96 Conference Pro-ceedings: November 17–22, Pittsburgh, PA. ACMPress and IEEE ComputerSociety Press, New York,NY 10036, USA and 1109Spring Street, Suite 300,Silver Spring, MD 20910,USA, 1996. ISBN 0-89791-854-1. LCCN QA 76.88S8573 1996. URL http://www.supercomp.org/sc96/proceedings/. ACM Or-der Number: 415962, IEEEComputer Society Press Or-der Number: RS00126.

ACM:1997:PPS

[ACM97a] ACM, editor. PASCO ’97.Proceedings of the second in-ternational symposium onparallel symbolic computa-tion, July 20–22, 1997,Maui, HI. ACM Press, NewYork, NY 10036, USA, 1997.ISBN ???? LCCN ????

ACM:1997:SHP

[ACM97b] ACM, editor. SC’97: HighPerformance Networkingand Computing: Proceed-ings of the 1997 ACM/IEEESC97 Conference: Novem-ber 15–21, 1997, San Jose,California, USA. ACM Pressand IEEE Computer Soci-ety Press, New York, NY10036, USA and 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1997. ISBN 0-89791-985-

REFERENCES 74

8. LCCN QA76.9.A25 A2651997. URL http://www.acm.org/pubs/contents/proceedings/commsec/266741/; http://www.supercomp.org/sc97/proceedings/.ACM SIGARCH order num-ber 415972. IEEE ComputerSociety Press order numberRS00160.

ACM:1998:AWJ

[ACM98a] ACM, editor. ACM 1998Workshop on Java for High-Performance Network Com-puting. ACM Press, NewYork, NY 10036, USA, 1998.ISBN ???? LCCN ????URL http://www.cs.ucsb.edu/conferences/java98/program.html. Possibly un-published, except electroni-cally.

ACM:1998:SHP

[ACM98b] ACM, editor. SC’98: HighPerformance Networkingand Computing: Proceed-ings of the 1998 ACM/IEEESC98 Conference: OrangeCounty Convention Cen-ter, Orlando, Florida, USA,November 7–13, 1998. ACMPress and IEEE ComputerSociety Press, New York,NY 10036, USA and 1109Spring Street, Suite 300,Silver Spring, MD 20910,USA, 1998. ISBN ????LCCN ???? URL http://www.supercomp.org/sc98/papers/.

ACM:1999:SPO

[ACM99] ACM, editor. SC’99: Ore-gon Convention Center 777NE Martin Luther King Jr.Boulevard, Portland, Ore-gon, November 11–18, 1999.ACM Press and IEEE Com-puter Society Press, NewYork, NY 10036, USA and1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 1999.

ACM:2000:SHP

[ACM00] ACM, editor. SC2000:High Performance Network-ing and Computing. Dal-las Convention Center, Dal-las, TX, USA, November4–10, 2000. ACM Pressand IEEE Computer Soci-ety Press, New York, NY10036, USA and 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,2000. URL http://www.sc2000.org/proceedings/info/fp.pdf.

ACM:2001:SHP

[ACM01] ACM, editor. SC2001:High Performance Network-ing and Computing. Denver,CO, November 10–16, 2001.ACM Press and IEEE Com-puter Society Press, NewYork, NY 10036, USA and1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 2001. ISBN 1-58113-293-X. LCCN ????

REFERENCES 75

ACM:2003:SII

[ACM03] ACM, editor. SC2003: Ig-niting Innovation. Phoenix,AZ, November 15–21, 2003.ACM Press and IEEE Com-puter Society Press, NewYork, NY 10036, USA and1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 2003. ISBN 1-58113-695-1. LCCN ????

ACM:2004:SHP

[ACM04] ACM, editor. SC 2004:High Performance Comput-ing, Networking and Stor-age: Bridging communities:Proceedings of the IEEE/ACM Supercomputing 2004Conference, Pittsburgh, PA,November 6–12, 2004. ACMPress and IEEE ComputerSociety Press, New York,NY 10036, USA and 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,2004. ISBN 0-7695-2153-3.LCCN ????

ACM:2005:PAI

[ACM05] ACM, editor. Proceedingsof the 2005 ACM/IEEE con-ference on Supercomputing2005, Seattle, WA, Novem-ber 12–18 2005. ACM Pressand IEEE Computer SocietyPress, New York, NY 10036,USA and 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 2005. ISBN 1-59593-061-2. LCCN ????

ACM:2006:PST

[ACM06a] ACM, editor. Proceed-ings of the 37th SIGCSEtechnical symposium onComputer science education2006, Houston, Texas, USA,March 03–05, 2006. ACMPress, New York, NY 10036,USA, 2006. ISBN 1-59593-259-3. ACM order number457060.

ACM:2006:PCC

[ACM06b] ACM, editor. Proceedings ofthe 3rd conference on Com-puting Frontiers, May 3–5,2006, Ischia, Italy. ACMPress, New York, NY 10036,USA, 2006. ISBN 1-59593-302-6. ACM order number104060.

ACM:2011:SSP

[ACM11] ACM, editor. SC ’11 State ofthe Practice Reports. ACMPress, New York, NY 10036,USA, 2011. ISBN 1-4503-1139-3. LCCN ????

Antonelli:2014:ATS

[ACMR14] Laura Antonelli, StefaniaCorsaro, Zelda Marino, andMariarosaria Rizzardi. Al-gorithm 944: Talbot suite:Parallel implementations ofTalbot’s method for the nu-merical inversion of Laplacetransforms. ACM Transac-tions on Mathematical Soft-ware, 40(4):29:1–29:18, June2014. CODEN ACMSCU.ISSN 0098-3500 (print),1557-7295 (electronic).

REFERENCES 76

Alonso:2011:NEM

[ACMZR11] P. Alonso, R. Cortina,F. J. Martınez-Zaldıvar, andJ. Ranilla. Neville elimina-tion on multi- and many-coresystems: OpenMP, MPIand CUDA. The Journalof Supercomputing, 58(2):215–225, November 2011.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=58&issue=2&spage=215.

Ancona:1995:PAD

[AD95] M. Ancona and M. DeBenedetto. A parallel algo-rithm for ‘document segmen-tation’. In IEEE [IEE95h],pages 516–521. ISBN 0-8186-7031-2, 0-8186-7032-0.LCCN QA76.58 .E97 1995.

Alexandrov:1998:RAP

[AD98] Vassil Alexandrov and J. J.Dongarra, editors. Re-cent advances in parallel vir-tual machine and messagepassing interface: 5th Eu-ropean PVM/MPI User’sGroup Meeting, Liverpool,UK, September 7–9, 1998:proceedings, volume 1497of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 1998.ISBN 3-540-65041-5 (soft-cover). ISSN 0302-9743

(print), 1611-3349 (elec-tronic). LCCN QA267.A1L43 no.1497. Jointly spon-sored by the Computer Sci-ence Dept., University ofLiverpool and Oak RidgeNational Laboratory.

Adamo:1997:AOO

[Ada97] J.-M. Adamo. ARCH, an ob-ject oriented MPI-based li-brary for asynchronous andloosely synchronous paral-lel system programming.Lecture Notes in ComputerScience, 1332:67–74, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Adamo:1998:MTO

[Ada98] Jean-Marc Adamo. Multi-threaded object-oriented MPI-based message passing in-terface: the ARCH library,volume SECS 446 of TheKluwer international seriesin engineering and com-puter science. Kluwer Aca-demic Publishers Group,Norwell, MA, USA, andDordrecht, The Nether-lands, 1998. ISBN 0-7923-8165-3. xiv + 185 pp.LCCN TK5102.5.A293 1998.US$120.00.

Antonuccio-Delogu:1994:PTN

[ADB94] V. Antonuccio-Delogu andU. Becciani. A parallel treeN-body code for heteroge-neous clusters. In Dongarraand Wasniewski [DW94],

REFERENCES 77

pages 17–32. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8 (New York). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58.P35 1994. DM104.00.

Addison:2001:EOP

[Add01] Cliff Addison. Exploit-ing OpenMP to providescalable SMP BLAS andLAPACK routines. Lec-ture Notes in ComputerScience, 2073:3–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2073/20730003.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2073/20730003.pdf.

Arioli:1995:PSB

[ADDR95] M. Arioli, A. Drummond,I. S. Duff, and D. Ruiz. Aparallel scheduler for blockiterative solvers in hetero-geneous computing environ-ments. In Bailey et al.[BBG+95], pages 460–465.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

Amestoy:2003:IIMa

[ADLL03a] Patrick R. Amestoy, Iain S.Duff, Jean-Yves L’Excellent,and Xiaoye S. Li. Impact ofthe implementation of MPIpoint-to-point communica-tions on the performance of

two general sparse solvers.Report TR/PA/03/14 andRR-4372 and LBNL-48968and RT/APO/01/4, CER-FACS, Toulouse, France,2003. ???? pp.

Amestoy:2003:IIMb

[ADLL03b] Patrick R. Amestoy, Iain S.Duff, Jean-Yves L’Excellent,and Xiaoye S. Li. Im-pact of the implementationof MPI point-to-point com-munications on the perfor-mance of two general sparsesolvers. Parallel Computing,29(7):833–849, July 2003.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Aversa:2005:HDS

[ADMV05] Rocco Aversa, BeniaminoDi Martino, Nicola Maz-zocca, and Salvatore Ven-ticinque. A hierarchicaldistributed-shared memoryparallel Branch & Boundapplication with PVM andOpenMP for multiprocessorclusters. Parallel Comput-ing, 31(10–12):1034–1047,October/December 2005.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Aversa:2005:PPT

[ADR+05] Rocco Aversa, Beniamino DiMartino, Massimiliano Rak,Salvatore Venticinque, andUmberto Villano. Perfor-mance prediction through

REFERENCES 78

simulation of a hybrid MPI/OpenMP application. Par-allel Computing, 31(10–12):1013–1033, October/December 2005. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic).

Alexandrov:1998:CGP

[ADRCT98] V. Alexandrov, F. Dehne,A. Rau-Chaplin, and K. Taft.Coarse grained parallelMonte Carlo algorithms forsolving SLAE using PVM.Lecture Notes in ComputerScience, 1497:323–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Amritkar:2014:EPC

[ADT14] Amit Amritkar, Surya Deb,and Danesh Tafti. Ef-ficient parallel CFD-DEMsimulations using OpenMP.Journal of ComputationalPhysics, 256(??):501–519,January 1, 2014. CO-DEN JCTPAH. ISSN0021-9991 (print), 1090-2716(electronic). URL http://www.sciencedirect.com/science/article/pii/S0021999113006128.

Aldea:2016:OES

[AELGE16] Sergio Aldea, Alvaro Este-banez, Diego R. Llanos, andArturo Gonzalez-Escribano.An OpenMP extension thatsupports thread-level spec-ulation. IEEE Transac-tions on Parallel and Dis-

tributed Systems, 27(1):78–91, January 2016. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://www.computer.org/csdl/trans/td/2016/01/07014262-abs.html.

Ashby:1995:PPG

[AFST95] S. F. Ashby, R. D. Falgout,S. G. Smith, and A. F. B.Tompson. The parallel per-formance of a groundwa-ter flow code on the CrayT3D. In Bailey et al.[BBG+95], pages 131–136.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

Ayguade:1995:DUA

[AGG+95] E. Ayguade, J. Garcia,M. Girones, J. Labarta,J. Torres, and M. Valero.Detecting and using affinityin an automatic data dis-tribution tool. In Pingaliet al. [PBG+95], pages 61–75. ISBN 3-540-58868-X.LCCN QA76.58 .W656 1994.

Aityan:1995:PFI

[AGH+95] S. K. Aityan, L. T. Grujic,R. J. Hathaway, G. S. Ladde,N. Medhin, and M. Sam-bandham, editors. Pro-ceedings of the First In-ternational Conference onNeural, Parallel and Scien-tific Computations held atMorehouse College, Atlanta,USA, May 28–31, 1995, Pro-ceedings of Neural Parallel

REFERENCES 79

and Scientific Computations1995. Dynamic Publishers,Atlanta, GA, USA, 1995.ISBN 0-9640398-9-3 (hard-back) 0-9640398-8-5 (paper-back). LCCN QA76.87 .I581995.

Averbuch:1994:PES

[AGIS94] A. Averbuch, E. Gab-ber, S. Itzikowitz, andB. Shoham. On the par-allel elliptic single/multigridsolutions about aligned andnonaligned bodies usingthe Virtual Machine forMultiprocessors. Scien-tific Programming, 3(1):13–32, Spring 1994. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Arbenz:1996:SRP

[AGLv96] P. Arbenz, W. Gander, H. P.Luthi, and U. von Matt.Sciddle 4.0, or, remote pro-cedure calls in PVM. In Lid-dell et al. [LCHS96], pages820–?? ISBN 3-540-61142-8 (paperback). LCCNQA76.88 .H52 1996.

Ayguade:2006:ENO

[AGMJ06] Eduard Ayguade, MarcGonzalez, Xavier Martorell,and Gabriele Jost. Em-ploying nested OpenMP forthe parallelization of multi-zone computational fluid dy-namics applications. Jour-nal of Parallel and Dis-tributed Computing, 66(5):

686–697, May 2006. CODENJPDCER. ISSN 0743-7315(print), 1096-0848 (elec-tronic).

Agrawal:1995:PIW

[Agr95a] D. P. Agrawal, editor. Pro-ceedings of the 1995 ICPPWorkshop on Challenges forParallel Processing, August14, 1995, Raleigh, NC, USA.CRC Press, 2000 N.W. Cor-porate Blvd., Boca Raton,FL 33431-9868, USA, 1995.ISBN 0-8493-2618-4. LCCNQA76.58.I34 1995.

Almeida:1995:CST

[AGR+95b] F. Almeida, F. Garcia,J. Roda, D. Morales, Ro-driguez, and C. A com-parative study of two dis-tributed systems: PVMand transputers. In Cooket al. [CJNW95], pages 244–258. ISBN 90-5199-235-1(IOS Press), 4-274-90062-2(Ohmsha). LCCN ????

Alfaro:1997:FDW

[AGS97] F. J. Alfaro, J. A. Gallud,and J. L. Sanchez. A func-tion to dynamic workload al-location in distributed ap-plications. Lecture Notesin Computer Science, 1332:219–225, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

REFERENCES 80

Alnuweiri:1995:PHF

[AH95] Hussein M. Alnuweiri andMounir Hamdi, editors. Pro-ceedings of HiNet ’95: firstinternational workshop onhigh-speed network com-puting, April 25, 1995,Santa Barbara, California.IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring,MD 20910, USA, 1995.ISBN 0-8186-7124-6. LCCNTK5105.5 .H56 1995.

Astalos:2000:CMS

[AH00] Jan Astalos and LadislavHluchy. CIS — a monitor-ing system for PC clusters.Lecture Notes in ComputerScience, 1908:225–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080225.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080225.pdf.

Agathos:2012:TBE

[AHD12] Spiros N. Agathos, Pana-giotis E. Hadjidoukas, andVassilios V. Dimakopou-los. Task-based execu-tion of nested OpenMPloops. Lecture Notes inComputer Science, 7312:210–222, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-

tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_16/.

Awan:2017:CCD

[AHHP17] Ammar Ahmad Awan, KhaledHamidouche, Jahanzeb Maq-bool Hashmi, and Dha-baleswar K. Panda. S-Caffe:Co-designing MPI runtimesand Caffe for scalable deeplearning on modern GPUclusters. ACM SIGPLANNotices, 52(8):193–205, Au-gust 2017. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Ahmad:1997:EVP

[Ahm97] Ishfaq Ahmad. Expressversus PVM: a perfor-mance comparison. Par-allel Computing, 23(6):783–812, June 20, 1997. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1997&volume=23&issue=6&aid=1138.

Allsopp:2001:EUM

[AHP01] Nicholas K. Allsopp, John F.Hague, and Jean-PierreProst. Experiences in us-ing MPI–IO on top ofGPFS for the IFS weatherforecast code. LectureNotes in Computer Sci-ence, 2150:380–??, 2001.

REFERENCES 81

CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2150/21500380.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2150/21500380.pdf.

Aversa:1997:MDP

[AIM97] R. Aversa, G. Iannello, andN. Mazzocca. An MPIdriven parallelization strat-egy for different computingplatforms: a case study.Lecture Notes in ComputerScience, 1332:401–408, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Aguilar:1997:PMS

[AJ97] J. Aguilar and T. Jimenez.A processors managementsystem for PVM. Lec-ture Notes in Computer Sci-ence, 1300:158–??, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Aubrey-Jones:2016:SMI

[AJF16] Tristan Aubrey-Jones andBernd Fischer. Synthe-sizing MPI implementa-tions from functional data-parallel programs. Inter-national Journal of Paral-lel Programming, 44(3):552–573, June 2016. CODENIJPPE5. ISSN 0885-7458

(print), 1573-7640 (elec-tronic). URL http://link.springer.com/article/10.1007/s10766-015-0359-4.

Alexandrov:1999:PMC

[AK99] V. Alexandrov and A. Karaivanova.Parallel Monte Carlo al-gorithms for sparse SLAEusing MPI. In Dongarraet al. [DLM99], pages 283–290. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Armstrong:2000:QDB

[AKE00] Brian Armstrong, Seon WookKim, and Rudolf Eigen-mann. Quantifying dif-ferences between OpenMPand MPI using a large-scale application suite. Lec-ture Notes in Computer Sci-ence, 1940:482–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400482.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400482.pdf.

Andersen:1994:PIA

[AKK+94] B. S. Andersen, P. Kaae,C. Keable, W. Owczarz,J. Wasniewski, and Z. Zlatev.PVM implementations ofadvection-chemistry mod-ules of air pollution mod-

REFERENCES 82

els. In Dongarra and Was-niewski [DW94], pages 11–16. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8(New York). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.P35 1994. DM104.00.

Asai:1999:MIF

[AKL99] Noboru Asai, Thomas Ken-temich, and Pierre Lagier.MPI-2 implementation ona Fujitsu Generic MessagePassing Kernel. In ACM[ACM99], page ??

Abdelfattah:2016:KOL

[AKL16] Ahmad Abdelfattah, DavidKeyes, and Hatem Ltaief.KBLAS: an optimized li-brary for dense matrix-vector multiplication onGPU accelerators. ACMTransactions on Mathemat-ical Software, 42(3):18:1–18:31, May 2016. CODENACMSCU. ISSN 0098-3500(print), 1557-7295 (elec-tronic).

Alfano:1992:DNA

[AL92] M. Alfano and G. Lo Re.Distributing numerical al-gorithms: some experienceswith network computingsystem (NCS) and paral-lel virtual machine (PVM).In SCRI WCC’92 [SCR92],page ?? ISBN ????LCCN ???? Proceed-ings available via anonymousftp from ftp.scri.fsu.edu

in directory pub/parallel-workshop.92.

Altevogt:1993:PTD

[AL93] P. Altevogt and A. Linke.Parallelization of the two-dimensional Ising model ona cluster of IBM RISCSystem/6000 workstations.Parallel Computing, 19(9):1041–1052, September 1993.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Alt:1996:PIA

[AL96] R. Alt and J. L. Lamotte.Parallel integration acrosstime of initial value problemsusing PVM. In Bode et al.[BDLS96], pages 323–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Alund:1994:CFD

[ALR94] A. Alund, P. Lotstedt, andR. Ryden. Computationalfluid dynamics on work-station clusters in indus-trial environments. InDongarra and Wasniewski[DW94], pages 1–10. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8 (New York). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58.P35 1994. DM104.00.

Amer:2015:MRC

[ALW+15] Abdelhalim Amer, HuiweiLu, Yanjie Wei, Pavan

REFERENCES 83

Balaji, and Satoshi Mat-suoka. MPI+Threads: run-time contention and reme-dies. ACM SIGPLAN No-tices, 50(8):239–248, Au-gust 2015. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Ayguade:2007:SIO

[AM07] Eduard Ayguade and Matthias S.Mueller. Special issue onOpenMP — Guest Edi-tors’ introduction. Inter-national Journal of Paral-lel Programming, 35(4):331–333, August 2007. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=35&issue=4&spage=331.

Almasi:1993:PDS

[AMBG93] G. S. Almasi, T. McLuckie,J. Bell, and A. Gordon. Par-allel distributed seismic mi-gration. Concurrency: prac-tice and experience, 5(2):105–131, April 1993. CO-DEN CPEXEI. ISSN 1040-3108.

Agrawal:2011:PPS

[AMHC11] Ankit Agrawal, SanchitMisra, Daniel Honbo, andAlok Choudhary. Paral-lel pairwise statistical sig-nificance estimation of lo-cal sequence alignment using

Message Passing Interfacelibrary. Concurrency andComputation: Practice andExperience, 23(17):2269–2279, December 10, 2011.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Amato:1994:PEP

[AMS94] M. Amato, A. Matrone, andP. Schiano. A practical expe-rience in parallelizing a largeCFD code: the ENSOLVflow solver. In Gentzschand Harms [GH94], pages508–513. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

anMey:2007:NPO

[aMST07] Dieter an Mey, SamuelSarholz, and Christian Ter-boven. Nested paralleliza-tion with OpenMP. In-ternational Journal of Par-allel Programming, 35(5):459–476, October 2007.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=35&issue=5&spage=459.

Al-Mouhamed:2015:EAO

[AMuHK15] Mayez Al-Mouhamed andAyaz ul Hassan Khan.Exploration of automaticoptimisation for CUDA

REFERENCES 84

programming. Interna-tional Journal of Paral-lel, Emergent and Dis-tributed Systems: IJPEDS,30(4):309–324, 2015. CO-DEN ???? ISSN 1744-5760 (print), 1744-5779(electronic). URL http://www.tandfonline.com/doi/abs/10.1080/17445760.2014.953158.

Aversa:1994:PSH

[AMV94] R. Aversa, N. Mazzocca, andU. Villano. PS: a simulatorfor heterogeneous computingenvironments. In Dekkeret al. [DSZ94], pages 335–343. ISBN 0-444-81784-0.LCCN QA76.58.E98 1994.

Andersson:1998:PFT

[And98] U. Andersson. Paralleliza-tion of a 3D FD-TD code forthe Maxwell equations us-ing MPI. Lecture Notes inComputer Science, 1541:12–19, 1998. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

Anonymous:1989:PFC

[Ano89] Anonymous, editor. Pro-ceedings of the Fourth Con-ference on Hypercubes, Con-current Computers and Ap-plications, 6–8 March 1989,Monterey, CA, USA. GoldenGate Enterprises, Los Al-tos, CA, USA, 1989. LCCNQA76.5.C619215 1989. Twovolumes.

Anonymous:1992:PSE

[Ano92] Anonymous, editor. Pro-ceedings SHARE Europe An-niversary Meeting. SHAREEur. Assoc, Geneva, Switzer-land, 1992.

Anonymous:1993:ATA

[Ano93a] Anonymous, editor. Auto-motive technology and au-tomation: Supercomputerapplications in the automo-tive industries: 26th In-ternational symposium —September 1993, Aachen,Germany, ISATA — Pro-ceedings — 26th. Automo-tive Automation Ltd, Croy-don, UK, 1993. ISBN 0-947719-62-8. LCCN ????

Anonymous:1993:ISA

[Ano93b] Anonymous, editor. In-ternational section: An-nual conference — Septem-ber 1993, Gallipoli, Italy,Atti del Congresso Annuale— Associazione Italiana perl’Informatica ed il CalcoloAutomatico 1993. AICA,????, 1993. ISBN ????LCCN ????

Anonymous:1993:JFI

[Ano93c] Anonymous, editor. Jointframework for informationtechnology: Technical con-ference — March 1993,Keele, JFIT Technical Con-ference Digest. Dept. ofTrade and Industry, Infor-mation and Manufacturing

REFERENCES 85

Division, London, UK, 1993.ISBN ???? LCCN ????Anonymous:1993:MMP

[Ano93d] Anonymous. MPI: a mes-sage passing interface. Pro-ceedings of the Supercomput-ing Conference, pages 878–883, ???? 1993. CODEN???? ISBN 0-8186-4340-4.ISSN 1063-9535.

Anonymous:1993:PSE

[Ano93e] Anonymous, editor. Proceed-ings. SHARE Europe An-niversary Meeting. Client/Server— the Promise and the Re-ality: October 25–28, 1993,the Hague, the Netherlands.SHARE Europe, Geneva,Switzerland, 1993. ISBN???? ISSN 0254-6213.LCCN ????

Anonymous:1993:SEC

[Ano93f] Anonymous, editor. Super-computing Europe ’93. Con-ference Papers. Royal DutchFairs, Utrecht, Netherlands,1993. ISBN ???? LCCN ????

Anonymous:1993:CDP

[Ano93g] Anonymous, editor. Thecommercial dimensions ofparallel computing: UNI-COM seminar — April1993, London. Unicom Sem-inars Ltd, ????, 1993. ISBN???? LCCN ????

Anonymous:1994:ICS

[Ano94a] Anonymous, editor. 1994 In-ternational Computer Sym-posium Conference Proceed-ings. Nat. Chiao Tung Univ,

Hsinchu, Taiwan, 1994.ISBN ???? LCCN ???? 2vol.

Anonymous:1994:ALM

[Ano94b] Anonymous. Adaptive loadmigration systems for PVM.In IEEE [IEE94h], pages390–399. ISBN 0-8186-6607-2, 0-8186-6605-6, 0-8186-6606-4. ISSN 1063-9535. LCCN QA76.5 .S8941994. IEEE catalog number94CH34819.

Anonymous:1994:FWR

[Ano94c] Anonymous, editor. Forschungund wissenschaftliches Rech-nen: Beitrage anasslich des10. EDV-Benutzertreffensder Max-Planck-Gesellschaftin Gottingen, November1993, number 1 in Berichteund Mitteilungen — MaxPlanck Gesellschaft. Max-Planck-Gesellschaft, Munchen,Germany, 1994. ISBN ????ISSN 0341-7778. LCCNQ180.55.E4 M39 1993.

Anonymous:1994:MMP

[Ano94d] Anonymous. MPI: amessage-passing interfacestandard. InternationalJournal of SupercomputerApplications and High Per-formance Computing, 8(3/4):159–416, Fall-Winter1994. CODEN IJSAE9.ISSN 0890-2720.

Anonymous:1994:PDC

[Ano94e] Anonymous, editor. Paral-lel and distributed comput-

REFERENCES 86

ing systems: proceedings ofthe ISCA International Con-ference, Las Vegas, Nevada,U.S.A., October 6–8, 1994.ISCA, Raleigh, NC, USA,1994. ISBN 1-880843-09-9.LCCN QA76.58.I543 1994.

Anonymous:1994:PPC

[Ano94f] Anonymous, editor. Paral-lel processing comes of age:real applications from indus-try and commerce: Seminar— June 1994, London. Uni-com Seminars, ????, 1994.ISBN ???? LCCN ????

Anonymous:1994:PSE

[Ano94g] Anonymous, editor. Proceed-ings. SHARE Europe SpringConference. SHARE Europe(SEAS), Carouge/Geneva,Switzerland, 1994. ISBN???? LCCN ????

Anonymous:1994:SCC

[Ano94h] Anonymous, editor. Smallcollege computing: 27th An-nual symposium — April1994, Winona, MN, SCCS— Proceedings — 27th.SCCS, ????, 1994. ISBN???? LCCN ????

Anonymous:1994:SQC

[Ano94i] Anonymous, editor. Soft-ware quality concern forpeople: proceedings of thefourth European Confer-ence on Software Qual-ity, October 17–20, 1994,Basel, Switzerland. vdf Ver-lag der Fachvereine, Zurich,

Switzerland, 1994. ISBN 3-7281-2153-3. LCCN ????

Anonymous:1995:CCS

[Ano95a] Anonymous, editor. 3rdCLIPS conference — Septem-ber 1994, Houston, TX,NASA Publications N N95-19625-647, N95-19747-768.National Aeronautics andSpace Administration, Wash-ington, DC, USA, 1995.ISBN ???? LCCN ????

Anonymous:1995:BRPb

[Ano95b] Anonymous. Book review:PVM: Parallel virtual ma-chine: a users’ guide andtutorial for networked par-allel computing: By AlGeist, Adam Beguelin, JackDongarra, Weicheng Jiang,Robert Manchek and VaidySunderam. MIT Press, Cam-bridge, MA. (1994). 279pages. $19.95. Computersand Mathematics with Ap-plications, 30(9):122, Nov-ember 1995. CODENCMAPDK. ISSN 0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/0898122195901973.

Anonymous:1995:BRU

[Ano95c] Anonymous. Book review:Using MPI: Portable par-allel programming with themessage-passing interface:By William Gropp, EwingLusk and Anthony Skjellum.MIT Press, Cambridge, MA.(1994). 307 pages. $24.95.

REFERENCES 87

Computers and Mathemat-ics with Applications, 30(9):122, November 1995. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/089812219590199X.

Anonymous:1995:RSS

[Ano95d] Anonymous, editor. Reser-voir simulation: 13th Sym-posium — February 1995,San Antonio, TX, Papers— Society of PetroleumEngineers of AIME. Soci-ety of Petroleum Engineers,Richardson, TX, USA, 1995.ISBN ???? LCCN ????

Anonymous:1995:UPH

[Ano95e] Anonymous. Using PVMto host CLIPS in dis-tributed environments. In3rd CLIPS conference —September 1994, Houston,TX [Ano95a], pages 203–211. ISBN ???? LCCN ????

Anonymous:1996:BRMh

[Ano96a] Anonymous. Book re-view: MPI: the competereference: By Marc Snir,Steve Otto, Steven Huss-Lederman, David Walker,and Jack Dongarra. MITPress, Cambridge, MA.(1996). 336 pages. $27.50.Computers and Mathemat-ics with Applications, 31(11):140, June 1996. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668

(electronic). URL http://www.sciencedirect.com/science/article/pii/0898122196873494.

Anonymous:1996:IPP

[Ano96b] Anonymous. An intro-duction to PVM program-ming. World-Wide Web,1996. URL http://www.epm.ornl.gov/pvm/intro.html.

Anonymous:1996:PPA

[Ano96c] Anonymous. Porting PVMapplications to the In-tel Paragon. World-WideWeb, 1996. URL http://www.ccs.ornl.gov/news/guide/xps_pvm.html.

Anonymous:1996:RP

[Ano96d] Anonymous. Research pro-gram. World-Wide Web,1996. URL http://www.epm.ornl.gov/networking/.

Anonymous:1997:TNR

[Ano97] Anonymous. Technologynews & reviews: Chemkinsoftware; OpenMP FortranStandard; ODE toolbox forMatlab; Java products; Sci-entific WorkPlace 3.0. IEEEComputational Science &Engineering, 4(4):75–??, Oc-tober/December 1997. CO-DEN ISCEE4. ISSN 1070-9924 (print), 1558-190X(electronic). URL http://dlib.computer.org/cs/books/cs1997/pdf/c4075.pdf.

REFERENCES 88

Anonymous:1998:ANO

[Ano98] Anonymous. Announce-ments: New official Fortrantechnical reports; workinggroup 5 documents; OpenGLFortran 95 bindings; MPImodule provides enhancedFortran support; variableprecision arithmetic; Fortraninformation sites; new For-tran compiler versions fromLahey and Fujitsu; down-loadable advanced Fortrantextbook; Fortran engineer-ing textbook. ACM FortranForum, 17(3):1–2, December1998. CODEN ???? ISSN1061-7264 (print), 1931-1311(electronic).

Anonymous:1999:BRMa

[Ano99a] Anonymous. Book re-view: MPI — The com-plete reference: Volume1, the MPI core, secondedition: By Marc Snir,Steve Otto, Steven Huss-Lederman, David Walkerand Jack Dongarra. MITPress, Cambridge, MA.(1998). 426 pages. $35.00.Computers and Mathemat-ics with Applications, 37(3):130, February 1999. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S0898122199903590.

Anonymous:1999:BRMf

[Ano99b] Anonymous. Book re-view: MPI — The com-

plete reference: Volume1, the MPI core, secondedition: By Marc Snir,Steve Otto, Steven Huss-Lederman, David Walkerand Jack Dongarra. MITPress, Cambridge, MA(1998). 426 pages. $35.00.Computers and Mathemat-ics with Applications, 37(6):130, March 1999. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S0898122199902237.

Anonymous:1999:BRMb

[Ano99c] Anonymous. Book re-view: MPI-The completereference: Volume 2, theMPI-2 extensions: ByWilliam Gropp, StevenHuss-Lederman, AndrewLumsdaine, Ewing Lusk, BillNitzberg, William Saphirand Marc Snir. MIT Press,Cambridge, MA. (1998).344 pages. $35.00. Com-puters and Mathematicswith Applications, 37(3):130, February 1999. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S0898122199903619.

Anonymous:1999:BRMg

[Ano99d] Anonymous. Book re-view: MPI-The completereference: Volume 2, theMPI-2 extensions: By

REFERENCES 89

William Gropp, StevenHuss-Lederman, AndrewLumsdaine, Ewing Lusk, BillNitzberg, William Saphirand Marc Snir. MIT Press,Cambridge, MA. (1998).344 pages. $35.00. Com-puters and Mathematicswith Applications, 37(6):130, March 1999. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S0898122199902250.

Anonymous:2000:BRUd

[Ano00a] Anonymous. Book review:Using MPI-2: Advanced fea-tures of the message-passinginterface: By WilliamGropp, Ewing Lusk andRajeev Thakur. The MITPress, Cambridge, MA.(1999). 382 pages. $35(each); $60 (set). Comput-ers and Mathematics withApplications, 40(2–3):419,July/August 2000. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S0898122100902098.

Anonymous:2000:BRUe

[Ano00b] Anonymous. Book re-view: Using MPI: Portableparallel programming withthe message-passing inter-face: Second edition. ByWilliam Gropp, Ewing Luskand Anthony Skjellum. The

MIT Press, Cambridge,MA. (1999). 371 pages. $35(each); $60 (set). Comput-ers and Mathematics withApplications, 40(2–3):419,July/August 2000. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S0898122100902074.

Anonymous:2001:AAL

[Ano01a] Anonymous. Appendixes:Appendix A: Linux, Win-dows NT, AIX, Solaris; ap-pendix B: Compilers andpreprocessors, MPI imple-mentations, development en-vironments, debuggers, per-formance analyzers. TheInternational Journal ofHigh Performance Comput-ing Applications, 15(2):191–194, Summer 2001. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic).

Anonymous:2001:EDP

[Ano01b] Anonymous. Erratum: De-sign and prototype of aperformance tool interfacefor OpenMP. The Jour-nal of Supercomputing, 23(1):105–128, May 2001. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=23&issue=1&spage=105.

REFERENCES 90

Anonymous:2003:MNIc

[Ano03] Anonymous. Micro news:IBM ups the ante in sili-con transistor speed; newbenchmark suite based onhigh-performance comput-ing applications, MPI andOpenMP [SPEC HPC2002];EU OKs Hitachi, MitsubishiElectric semiconductor jointventure; Intel launches Pen-tium 4 at 3.06 GHz; TSMCunveils viable 25nm transis-tors. IEEE Micro, 23(1):6–6,87, January/February 2003.CODEN IEMIDZ. ISSN0272-1732 (print), 1937-4143(electronic). URL http://dlib.computer.org/mi/books/mi2003/pdf/m1006.pdf.

Anonymous:2012:CTC

[Ano12] Anonymous. CUDA Toolkit5.0 CURAND guide. Webdocument, 2012. URL http://docs.nvidia.com/cuda/pdf/CURAND_Library.pdf.

ANS:1995:MCR

[ANS95] ANS, editor. Mathematicsand computations, reactorphysics, and environmentalanalyses: International con-ference — April 1995, Port-land, OR. American NuclearSociety, La Grange Park, IL,USA, 1995. ISBN 0-89448-198-3. LCCN TK9006.M371995. Two volumes.

Anglano:1996:PMB

[AP96] C. Anglano and L. Porti-nale. Parallel model-baseddiagnosis using PVM. Lec-ture Notes in Computer Sci-ence, 1156:331–334, 1996.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Aji:2016:MEA

[APBcF16] Ashwin M. Aji, Antonio J.Pena, Pavan Balaji, andWu chun Feng. MultiCL:Enabling automatic schedul-ing for task-parallel work-loads in OpenCL. Paral-lel Computing, 58(??):37–55, October 2016. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819116300357.

Aji:2016:MAA

[APJ+16] Ashwin M. Aji, Lokendra S.Panwar, Feng Ji, KarthikMurthy, Milind Chabbi, Pa-van Balaji, Keith R. Bis-set, James Dinan, Wu chunFeng, John Mellor-Crummey,Xiaosong Ma, and Ra-jeev Thakur. MPI-ACC:Accelerator-aware MPI forscientific applications. IEEETransactions on Paral-lel and Distributed Sys-tems, 27(5):1401–1414, May2016. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http:

REFERENCES 91

//www.computer.org/csdl/trans/td/2016/05/07127020-abs.html.

AlHaddad:2001:UNW

[AR01] Mohammed Al Haddad andJerome Robinson. Using anetwork of workstations toenhance database query pro-cessing performance. Lec-ture Notes in Computer Sci-ence, 2131:352–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310352.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310352.pdf.

Arabnia:1995:TRA

[Ara95] Hamid Arabnia, editor.Transputer research andapplications 7: AmericanTransputer Users Group,October 23–25, 1994, At-lanta, GA (NATUG-7), vol-ume 42 of Transputer andoccam engineering series.IOS Press, Postal Drawer10558, Burke, VA 2209-0558,USA, 1995. ISBN 90-5199-187-8 (IOS Press), 4-274-90017-7 (Ohmsha). ISSN0925-4986. LCCN ????

Altas:1994:NIE

[ARL+94] I. Altas, M. Rezny, J. Louis,K. Burrage, R. Moore, andJ. Belward. A new im-age enhancement algorithm

on MasPar and Parallel Vir-tual Machine (PVM) en-vironments. In Dekkeret al. [DSZ94], pages 819–826. ISBN 0-444-81784-0.LCCN QA76.58.E98 1994.

Arnow:1995:DLB

[Arn95] D. M. Arnow. DP: a li-brary for building portable,reliable distributed applica-tions. In USENIX [USE95],pages 235–247. ISBN 1-880446-67-7. LCCN QA76.76 O63 U88 1995.

Abrossimov:1989:GVM

[ARS89] V. Abrossimov, M. Rozier,and M. Shapiro. Genericvirtual memory managementfor operating system ker-nels. Operating Systems Re-view, 23(5):123–136, 1989.CODEN OSRED8. ISSN0163-5980 (print), 1943-586X (electronic).

Al-Refaie:2017:PAH

[ART17] Ahmed F. Al-Refaie andJonathan Tennyson. A par-allel algorithm for Hamil-tonian matrix constructionin electron-molecule colli-sion calculations: MPI–SCATCI. Computer PhysicsCommunications, 221(??):53–62, December 2017. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465517302436.

REFERENCES 92

Addison:2003:OIA

[ARvW03] C. Addison, Y. Ren, andM. van Waveren. OpenMPissues arising in the devel-opment of parallel BLASand LAPACK libraries. Sci-entific Programming, 11(2):95–104, 2003. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Al-Refaie:2017:PCT

[ARYT17] Ahmed F. Al-Refaie, Sergei N.Yurchenko, and JonathanTennyson. GPU AcceleratedINtensities MPI (GAIN-MPI): a new method ofcomputing Einstein-A coef-ficients. Computer PhysicsCommunications, 214(??):216–224, May 2017. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465517300255.

Al-Salman:1992:DIP

[AS92] Abdulmalik Salman Al-Salman. Design and imple-mentation of a profiler forthe parallel virtual machine(PVM) system. M.s. the-sis, University of Georgia,Athens, GA, USA, 1992. vi+ 51 pp. Directed by StevenC. Cater.

Awile:2014:PWF

[AS14] Omar Awile and Ivo F.Sbalzarini. A Pthreadswrapper for Fortran 2003.

ACM Transactions on Math-ematical Software, 40(3):19:1–19:15, April 2014. CO-DEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic).

Alonso:1997:PBB

[ASA97] J. L. Alonso, H. Schmidt,and V. N. Alexandrov. Par-allel branch and bound algo-rithms for integer and mixedinteger linear programmingproblems under PVM. Lec-ture Notes in Computer Sci-ence, 1332:313–320, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Alves:1995:WPC

[ASCS95] A. Alves, L. Silva, J. Car-reira, and J. G. Silva.WPVM: parallel comput-ing for the people. InHertzberger and Serazzi[HS95a], pages 582–587.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Anderson:2017:BGB

[ASS+17] Michael Anderson, ShadenSmith, Narayanan Sun-daram, Mihai Capota, ZheguangZhao, Subramanya Dul-loor, Nadathur Satish, andTheodore L. Willke. Bridg-ing the gap between HPCand big data frameworks.Proceedings of the VLDBEndowment, 10(8):901–912,

REFERENCES 93

April 2017. CODEN ????ISSN 2150-8097.

Agrawal:1994:PIC

[ATC94] Dharma P. Agrawal, K. C.(Kuo Chung) Tai, andJagdish Chandra, editors.Proceedings of the 1994 In-ternational Conference onParallel Processing, August15–19, 1994. Vol 3: Al-gorithms and applications.CRC Press, 2000 N.W. Cor-porate Blvd., Boca Raton,FL 33431-9868, USA, 1994.ISBN 0-8493-2496-3, 0-8493-2495-5. ISSN 0190-3918.LCCN QA 76.58 I55 1994.Three volumes.

Amritkar:2012:OPF

[ATL+12] Amit Amritkar, DaneshTafti, Rui Liu, Rick Kufrin,and Barbara Chapman.OpenMP parallelism forfluid and fluid-particulatesystems. Parallel Comput-ing, 38(9):501–517, Septem-ber 2012. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0167819112000476.

Al-Tawil:2001:PME

[ATM01] Khalid Al-Tawil and Csaba An-dras Moritz. Performancemodeling and evaluation ofMPI. Journal of Paralleland Distributed Computing,61(2):202–223, February 1,2001. CODEN JPDCER.ISSN 0743-7315 (print),

1096-0848 (electronic). URLhttp://www.idealibrary.com/links/doi/10.1006/jpdc.2000.1677; http://www.idealibrary.com/links/doi/10.1006/jpdc.2000.1677/pdf; http://www.idealibrary.com/links/doi/10.1006/jpdc.2000.1677/ref.

Attiya:1996:ERS

[Att96] H. Attiya. Efficient androbust sharing of mem-ory in message-passing sys-tems. Lecture Notes inComputer Science, 1151:56–??, ???? 1996. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Angskun:2001:DPM

[AUR01] Thara Angskun, PutchongUthayopas, and ArnonRungsawang. Dynamic pro-cess management in KSIXcluster middleware. Lec-ture Notes in Computer Sci-ence, 2131:209–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310209.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310209.pdf.

Andujar:2016:OSF

[AVA+16] Francisco J. Andujar, Juan A.

REFERENCES 94

Villar, Francisco J. Alfaro,Jose L. Sanchez, and Je-sus Escudero-Sahuquillo. Anopen-source family of toolsto reproduce MPI-basedworkloads in interconnectionnetwork simulators. TheJournal of Supercomputing,72(12):4601–4628, Decem-ber 2016. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic).

Asenjo:1995:SLF

[AZ95] R. Asenjo and E. L. Za-pata. Sparse LU factor-ization of the Cray T3D.In Hertzberger and Ser-azzi [HS95a], pages 690–696.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Arteaga:2017:GFG

[AZG17] Jaime Arteaga, StephaneZuckerman, and Guang R.Gao. Generating fine-grainmultithreaded applicationsusing a multigrain approach.ACM Transactions on Ar-chitecture and Code Opti-mization, 14(4):47:1–47:??,December 2017. CODEN???? ISSN 1544-3566(print), 1544-3973 (elec-tronic).

Beyer:2005:GEC

[B+05] Hans-Georg Beyer et al., ed-itors. Genetic and Evolu-tionary Computation Con-ference: GECCO 2005,

June 25–29, 2005 (Saturday-Wednesday) Washington,DC, USA. ACM Press, NewYork, NY 10036, USA, 2005.ISBN 1-59593-010-8 (paper-back). LCCN QA76.623.G44 2005. ACM order num-ber 910050.

Battre:2006:MFP

[BA06] Dominic Battre and David SigfredoAngulo. MPI framework forparallel searching in largebiological databases. Jour-nal of Parallel and Dis-tributed Computing, 66(12):1503–1511, December 2006.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Bader:2016:EMT

[Bad16] David A. Bader. Evolv-ing MPI+X toward exas-cale. Computer, 49(8):10,August 2016. CODEN CP-TRB4. ISSN 0018-9162(print), 1558-0814 (elec-tronic). URL http://csdl.computer.org/csdl/mags/co/2016/08/mco2016080010.html.

Becciani:2007:FMH

[BADC07] U. Becciani, V. Antonuccio-Delogu, and M. Com-parato. FLY: MPI-2 highresolution code for LSScosmological simulations.Computer Physics Commu-nications, 176(3):211–217,February 1, 2007. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944

REFERENCES 95

(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465506003687.

Bruel:2017:ACC

[BAG17] Pedro Bruel, Marcos Amarıs,and Alfredo Goldman. Auto-tuning CUDA compiler pa-rameters for heterogeneousapplications using the Open-Tuner framework. Con-currency and Computation:Practice and Experience, 29(22):??, November 25, 2017.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Baker:1998:MNC

[Bak98] M. Baker. MPI on NT: Thecurrent status and perfor-mance of the available envi-ronments. Lecture Notes inComputer Science, 1497:63–??, 1998. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

Blaszczyk:1995:PCE

[BALU95] A. Blaszczyk, Z. Andjelic,P. Levin, and A. Ustundag.Parallel computation of elec-tric fields in a heteroge-neous workstation cluster.In Hertzberger and Ser-azzi [HS95a], pages 606–611.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Buyukkececi:2013:POI

[BAS13] Ferit Buyukkececi, OmarAwile, and Ivo F. Sbalzarini.

A portable OpenCL im-plementation of genericparticle-mesh and mesh-particle interpolation in 2Dand 3D. Parallel Comput-ing, 39(2):94–111, Febru-ary 2013. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0167819112000920.

Bernabeu:2008:MPA

[BAV08] Miguel O. Bernabeu, PedroAlonso, and Antonio M. Vi-dal. A multilevel parallel al-gorithm to solve symmetricToeplitz linear systems. TheJournal of Supercomputing,44(3):237–256, June 2008.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=44&issue=3&spage=237.

Bedrosian:1993:MFA

[BB93] G. Bedrosian and R. W.Benway. Magnetostaticfinite-element analysis onMIMD/DMMP parallel com-puters. In Yelon et al.[Y+93], pages 6772–6777.CODEN JAPIAU. ISBN1-56396-212-8. ISSN 0021-8979 (print), 1089-7550(electronic), 1520-8850. LCCNQC753 .C748 1990. Two vol-umes.

REFERENCES 96

Beguelin:1994:CMS

[BB94] A. Beguelin and B. Bruegge.A configurable monitoringsystem for parallel program-ming. In IEEE [IEE94d],page 206. ISBN 0-8186-5390-6. LCCN QA76.9.D5I5951994. IEEE catalog no.94TH0651-0.

Beaumont:1995:DPG

[BB95] P. M. Beaumont and P. T.Bradshaw. A distributedparallel genetic algorithmfor solving optimal growthmodels. ComputationalEconomics, 8(3):159–179,August 1995. CODENCNOMEL. ISSN 0927-7099.

Brunschen:2000:OCP

[BB00] Christian Brunschen andMats Brorsson. OdinMP/CCp — a portable imple-mentation of OpenMP forC. Concurrency: practiceand experience, 12(12):1193–1203, October 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500347/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500347&PLACEBO=IE.pdf.

Bala:1994:IEU

[BBB+94] V. Bala, J. Bruck, R. Bryant,R. Cypher, and P. De Jong.The IBM external user inter-face for scalable parallel sys-tems. Parallel Computing,

20(4):445–??, April 1994.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Bova:2000:DLP

[BBC+00] Steve W. Bova, Clay P. Bres-hears, Christine E. Cuic-chi, Zeki Demirbilek, andHenry A. Gabb. Dual-levelparallel analysis of harborwave response using MPIand OpenMP. The Interna-tional Journal of High Per-formance Computing Appli-cations, 14(1):49–64, Spring2000. CODEN IHPCFL.ISSN 1094-3420 (print),1741-2846 (electronic).

Bosilca:2002:MVT

[BBC+02] George Bosilca, AurelienBouteiller, Franck Cappello,Samir Djilali, Gilles Fedak,Cecile Germain, ThomasHerault, Pierre Lemarinier,Oleg Lodygensky, Fred-eric Magniette, VincentNeri, and Anton Selikhov.MPICH-V: Toward a scal-able fault tolerant MPI forvolatile nodes. In IEEE[IEE02], page ?? ISBN0-7695-1524-X. LCCN???? URL http://www.sc-2002.org/paperpdfs/pap.pap298.pdf.

Bertozzi:1999:MIT

[BBCR99] M. Bertozzi, F. Boselli,G. Conte, and M. Reg-giani. An MPI implementa-tion on the top of the vir-

REFERENCES 97

tual interface architecture.In Dongarra et al. [DLM99],pages 199–206. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Bethune:2014:PAA

[BBDH14] Iain Bethune, J. Mark Bull,Nicholas J. Dingle, andNicholas J. Higham. Per-formance analysis of asyn-chronous Jacobi’s methodimplemented in MPI, SHMEMand OpenMP. The Interna-tional Journal of High Per-formance Computing Ap-plications, 28(1):97–111,February 2014. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/28/1/97.full.pdf+html.

Bailey:1995:PSS

[BBG+95] D. H. Bailey, P. E. Bjorstad,J. R. Gilbert, M. V.Mascagni, R. S. Schreiber,H. D. Simon, V. J. Torczon,and L. T. Watson, editors.Proceedings of the SeventhSIAM Conference on Paral-lel Processing for ScientificComputing (San Francisco,CA, USA). Society for In-dustrial and Applied Math-ematics, Philadelphia, PA,USA, 1995. ISBN 0-89871-344-7. LCCN QA76.58.S551995.

Bova:2001:PPM

[BBG+01] Steve W. Bova, Clay P. Bres-hears, Henry Gabb, BobKuhn, Bill Magro, RudolfEigenmann, Greg Gaertner,Stefano Salvini, and HowardScott. Parallel program-ming with message pass-ing and directives. Com-puting in Science and Engi-neering, 3(5):22–37, Septem-ber/October 2001. CODENCSENFA. ISSN 1521-9615(print), 1558-366X (elec-tronic). URL http://computer.org/cise/cs2001/c5022abs.htm; http://dlib.computer.org/cs/books/cs2001/pdf/c5022.pdf.

Balaji:2010:FGM

[BBG+10] Pavan Balaji, Darius Bunti-nas, David Goodell, WilliamGropp, and Rajeev Thakur.Fine-grained multithreadingsupport for hybrid threadedMPI programming. TheInternational Journal ofHigh Performance Comput-ing Applications, 24(1):49–57, February 2010. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/24/1/49.full.pdf+html.

Balaji:2011:MMC

[BBG+11] Pavan Balaji, Darius Bunti-nas, David Goodell, WilliamGropp, Torsten Hoefler,Sameer Kumar, Ewing Lusk,

REFERENCES 98

Rajeev Thakur, and Jes-per Larsson Traff. MPI onmillions of cores. ParallelProcessing Letters, 21(1):45–60, March 2011. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Barrett:2014:EMM

[BBG+14] Brian W. Barrett, RonBrightwell, Ryan Grant, Si-mon D. Hammond, andK. Scott Hemmert. An eval-uation of MPI message rateon hybrid-core processors.The International Journalof High Performance Com-puting Applications, 28(4):415–424, November 2014.CODEN IHPCFL. ISSN1094-3420 (print), 1741-2846 (electronic). URLhttp://hpc.sagepub.com/content/28/4/415.

Barak:1996:PPM

[BBGL96] A. Barak, A. Braverman,I. Gilderman, and O. Laden.Performance of PVM withthe MOSIX preemptive pro-cess migration scheme. InIEEE [IEE96h], pages 38–45.ISBN 0-8186-7536-5. LCCNQA75.5 .I75 1996. IEEEComputer Society Press Or-der Number PR07536.

Bouteiller:2006:HPS

[BBH+06] Aurelien Bouteiller, Hinde-Lilia Bouziane, ThomasHerault, Pierre Lemarinier,and Franck Cappello. Hy-brid preemptive scheduling

of Message Passing Inter-face applications on Grids.The International Journal ofHigh Performance Comput-ing Applications, 20(1):77–90, Spring 2006. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/20/1/77.full.pdf+html.

Bischof:2008:AAD

[BBH+08] Christian H. Bischof, H. Mar-tin Bucker, Paul Hovland,Uwe Naumann, and JeanUtke, editors. Advances inAutomatic Differentiation,volume 64 of Lecture Notesin Computational Scienceand Engineering. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2008. CO-DEN LNCSA6. ISBN 3-540-68935-4 (print), 3-540-68942-7 (e-book). ISSN1439-7358. LCCN QA304.I58 2008. URL http://link.springer.com/book/10.1007/978-3-540-68942-3; http://www.springerlink.com/content/978-3-540-68942-3.

Bustamam:2012:FPM

[BBH12] Alhadi Bustamam, KevinBurrage, and Nicholas A.Hamilton. Fast paral-lel Markov clustering inbioinformatics using mas-sively parallel computingon GPU with CUDA andELLPACK-R sparse for-

REFERENCES 99

mat. IEEE/ACM Trans-actions on ComputationalBiology and Bioinformat-ics, 9(3):679–692, May 2012.CODEN ITCBCY. ISSN1545-5963 (print), 1557-9964(electronic).

Bland:2013:EUL

[BBH. . . 13a] Wesley Bland, AurelienBouteiller, Thomas Her-ault, and Joshua Hursey. . . . An evaluation of User-Level Failure Mitigation sup-port in MPI. Comput-ing, 95(12):1171–1184, De-cember 2013. CODENCMPTA2. ISSN 0010-485X(print), 1436-5057 (elec-tronic). URL http://link.springer.com/article/10.1007/s00607-013-0331-3.

Bland:2013:PFR

[BBH+13b] Wesley Bland, AurelienBouteiller, Thomas Herault,George Bosilca, and JackDongarra. Post-failure re-covery of MPI communi-cation capability: Designand rationale. The Interna-tional Journal of High Per-formance Computing Appli-cations, 27(3):244–254, Au-gust 2013. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/27/3/244.full.pdf+html.

Busa:2015:CCO

[BBH+15] Jan Busa, Jr., Jan Busa,

Shura Hayryan, Chin-KunHu, and Ming-Chya Wu.CAVE-CL: an OpenCL ver-sion of the package for detec-tion and quantitative anal-ysis of internal cavities ina system of overlappingballs: Application to pro-teins. Computer PhysicsCommunications, 190(??):224–227, May 2015. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465514004378.

Boryczko:1994:LGA

[BBK+94] K. Boryczko, M. Bubak,J. Kitowski, J. Moscinski,and R. Slota. Lattice gasautomata and molecular dy-namics on a network ofcomputers. In Gentzschand Harms [GH94], pages177–180. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Barnard:1999:MIS

[BBS99] Stephen T. Barnard, Luis M.Bernardo, and Horst D. Si-mon. An MPI implemen-tation of the SPAI pre-conditioner on the T3E.The International Journal ofHigh Performance Comput-ing Applications, 13(2):107–123, Summer 1999. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic).

REFERENCES 100

Brorsson:2000:SIE

[BC00] Mats Brorsson and BarbaraChapman. Special issue:EWOMP’99 — First Euro-pean Workshop on OpenMP.Concurrency: practice andexperience, 12(12):1117–1119, October 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500352/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500352&PLACEBO=IE.pdf.

Blas:2014:RAM

[BC14] Javier Garcia Blas and Je-sus Carretero. Recent ad-vances in the Message Pass-ing Interface. The Interna-tional Journal of High Per-formance Computing Appli-cations, 28(4):387–389, Nov-ember 2014. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/28/4/387.

Barton:2006:SMP

[BCA+06] Christopher Barton, CClinCascaval, George Almasi,Yili Zheng, Montse Far-reras, Siddhartha Chatterje,and Jose Nelson Amaral.Shared memory program-ming for large scale ma-chines. ACM SIGPLAN No-tices, 41(6):108–117, June2006. CODEN SINODQ.

ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic).

Becciani:2006:FMP

[BCAD06] U. Becciani, M. Com-parato, and V. Antonuccio-Delogu. FLY MPI-2: aparallel tree code for LSS.Computer Physics Commu-nications, 174(7):605–606,April 1, 2006. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465506000713.

Bircsak:2000:EONa

[BCC+00a] John Bircsak, Peter Craig,RaeLyn Crowell, Zarka Cve-tanovic, Jonathan Har-ris, C. Alexander Nel-son, and Carl D. Offner.Extending OpenMP forNUMA machines. InACM [ACM00], pages 68–69. URL http://www.sc2000.org/proceedings/techpapr/papers/pap226.pdf.

Bircsak:2000:EONb

[BCC+00b] John Bircsak, Peter Craig,RaeLyn Crowell, et al. Ex-tending OpenMP for NUMAmachines. Scientific Pro-gramming, 8(3):163–181,2000. CODEN SCIPEV.ISSN 1058-9244 (print),1875-919X (electronic).

REFERENCES 101

Bouchard:1996:FCS

[BCD96] V. Bouchard, P. Cinquin,and L. Desbat. FirstCompton scatter correc-tion in SPECT using PVM.In Grangeat and Amans[GA96], pages 109–111.ISBN 0-7923-4129-5. LCCNR857.T47 T485 1996.

Betts:2012:GVG

[BCD+12] Adam Betts, Nathan Chong,Alastair Donaldson, ShazQadeer, and Paul Thomson.GPUVerify: a verifier forGPU kernels. ACM SIG-PLAN Notices, 47(10):113–132, October 2012. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Betts:2015:DIV

[BCD+15] Adam Betts, Nathan Chong,Alastair F. Donaldson,Jeroen Ketema, Shaz Qadeer,Paul Thomson, and JohnWickerson. The design andimplementation of a verifi-cation technique for GPUkernels. ACM Transac-tions on Programming Lan-guages and Systems, 37(3):10:1–10:??, June 2015.CODEN ATPSDT. ISSN0164-0925 (print), 1558-4593(electronic).

Baker:1999:MOO

[BCFK99] M. Baker, B. Carpenter,G. Fox, and Sung Hoon Koo.mpiJava: An object-oriented

Java interface to MPI. Lec-ture Notes in Computer Sci-ence, 1586:748–??, 1999.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Balaji:2010:IND

[BCG+10] Pavan Balaji, AnthonyChan, William Gropp, Ra-jeev Thakur, and EwingLusk. The importanceof non-data-communicationoverheads in MPI. TheInternational Journal ofHigh Performance Comput-ing Applications, 24(1):5–15,February 2010. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/24/1/5.full.pdf+html.

Bala:1997:PVQ

[BCGL97] P. Bala, T. Clark, P. Gro-chowski, and B. Lesyng. Par-allel version of a quantumclassical molecular dynamicscode for complex molecularand biomolecular systems.Lecture Notes in ComputerScience, 1332:409–416, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Bouteiller:2003:MVF

[BCH+03] Aurelien Bouteiller, FranckCappello, Thomas Herault,Geraud Krawezik, PierreLemarinier, and FredericMagniette. MPICH-V2:

REFERENCES 102

a fault tolerant MPI forvolatile nodes based on pes-simistic sender based mes-sage logging. In ACM[ACM03], page ?? ISBN1-58113-695-1. LCCN???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10696#1; http://www.sc-conference.org/sc2003/paperpdfs/pap209.pdf.

Buntinas:2008:BVN

[BCH+08] Darius Buntinas, CamilleCoti, Thomas Herault,Pierre Lemarinier, LaurencePilard, Ala Rezmerita, EricRodriguez, and Franck Cap-pello. Blocking vs. non-blocking coordinated check-pointing for large-scale faulttolerant MPI protocols. Fu-ture Generation ComputerSystems, 24(1):73–84, Jan-uary 2008. CODEN FG-SEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic).

Bikshandi:2009:EPI

[BCK+09] Ganesh Bikshandi, Jose G.Castanos, Sreedhar B. Ko-dali, V. Krishna Nandi-vada, Igor Peshansky, Vi-jay A. Saraswat, Sayan-tan Sur, Pradeep Varma,and Tong Wen. Effi-cient, portable implementa-tion of asynchronous multi-place programs. ACM SIG-PLAN Notices, 44(4):271–

282, April 2009. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Bruno:2000:PEH

[BCKP00] G. Bruno, A. A. Chien,M. J. Katz, and P. M. Pa-padopoulos. Performanceenhancements for HPVM inmulti-network and heteroge-neous hardware. In Engquist[Eng00], pages 17–32. ISBN3-540-67264-8. ISSN 1439-7358. LCCN QA76.9.C65S535 2000.

Bolloni:2000:TIQ

[BCL00] Alessandro Bolloni, Ste-fano Crocchianti, and An-tonio Lagana. Time inde-pendent 3D quantum reac-tive scattering on MIMDparallel computers. Lec-ture Notes in Computer Sci-ence, 1908:338–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080338.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080338.pdf.

Baraglia:1997:IPW

[BCLN97] R. Baraglia, M. Cosso,D. Laforenza, and M. Nicosia.Integrating PVaniM intoWAMM for monitoringmeta-applications. Lecture

REFERENCES 103

Notes in Computer Sci-ence, 1332:226–233, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Bhattacharjee:2011:PLC

[BCM11] Abhishek Bhattacharjee,Gilberto Contreras, andMargaret Martonosi. Paral-lelization libraries: Charac-terizing and reducing over-heads. ACM Transactionson Architecture and CodeOptimization, 8(1):5:1–5:??,April 2011. CODEN ????ISSN 1544-3566 (print),1544-3973 (electronic).

Bolis:2016:APA

[BCM+16] A. Bolis, C. D. Cantwell,D. Moxey, D. Serson, andS. J. Sherwin. An adapt-able parallel algorithm forthe direct numerical simu-lation of incompressible tur-bulent flows using a Fourierspectral/hp element methodand MPI virtual topolo-gies. Computer PhysicsCommunications, 206(??):17–25, September 2016. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S001046551630100X.

Baiardi:2000:AMM

[BCMR00] Fabrizio Baiardi, SarahChiti, Paolo Mori, and LauraRicci. Adaptive multigridmethods in MPI. Lec-

ture Notes in ComputerScience, 1908:80–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080080.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080080.pdf.

Blackford:1997:PEN

[BCP+97] L. S. Blackford, A. Cleary,A. Petitet, R. C. Wha-ley, J. Demmel, I. Dhillon,H. Ren, K. Stanley, J. Don-garra, and S. Hammarling.Practical experience in thenumerical dangers of hetero-geneous computing. ACMTransactions on Mathemat-ical Software, 23(2):133–147, June 1997. CODENACMSCU. ISSN 0098-3500(print), 1557-7295 (elec-tronic). URL http://www.acm.org/pubs/citations/journals/toms/1997-23-2/p133-blackford/.

Bland:2013:SIP

[BDB+13] Wesley Bland, Peng Du, Au-relien Bouteiller, ThomasHerault, George Bosilca, andJack J. Dongarra. Special is-sue papers: Extending thescope of the Checkpoint-on-Failure protocol for forwardrecovery in standard MPI.Concurrency and Computa-tion: Practice and Experi-

REFERENCES 104

ence, 25(17):2381–2393, De-cember 10, 2013. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Beguelin:1991:UGP

[BDG+91a] A. Beguelin, J. Dongarra,A. Geist, R. Manchek, andV. Sunderam. A user’sguide to PVM: Parallel vir-tual machine. TechnicalReport ORNL/TM-11826,Mathematical Sciences Sec-tion, Oak Ridge NationalLaboratory, Knoxville, TN,USA, September 1991.

Beguelin:1991:GDT

[BDG+91b] Adam Beguelin, Jack J.Dongarra, A. Geist, RobertManchek, and V. S. Sun-deram. Graphical develop-ment tools for network-basedconcurrent supercomputing.In IEEE [IEE91], pages 435–444. ISBN 0-8186-9158-1(IEEE: case), 0-8186-2158-3(IEEE: paper), 0-8186-6158-5 (IEEE: microfiche), 0-89791-459-7 (ACM). LCCNQA76.5 .S894 1991. IEEEcatalog no. 91CH3058-5.

Beguelin:1992:HGD

[BDG+92a] A. Beguelin, J. Dongarra,A. Geist, R. Manchek,K. Moore, R. Wade, andV. Sunderam. HeNCE:graphical development toolsfor network-based con-current computing. InIEEE [IEE92], pages 129–

136. ISBN 0-8186-2775-1. LCCN QA76.76.A65S331992. IEEE catalog no.92TH0432-5.

Beguelin:1992:PHT

[BDG+92b] A. Beguelin, J. Dongarra,A. Geist, R. Manchek, andV. Sunderam. PVM andHeNCE: traversing the par-allel environment. CRAYChannels, 14(4):22–25, Fall1992. CODEN CRCHE8.

Beguelin:1992:SCG

[BDG+92c] A. Beguelin, J. Dongarra,A. Geist, R. Manchek, andV. Sunderam. Solvingcomputational grand chal-lenges using a network ofheterogeneous supercomput-ers. In Dongarra et al.[DKM+92], pages 596–601.ISBN 0-89871-303-X. LCCNQA76.58.P76 1992.

Beguelin:1993:PHT

[BDG+93a] A. Beguelin, J. Dongarra,A. Geist, R. Manchek,K. Moore, and V. Sun-deram. PVM and HeNCE:Tools for heterogeneous net-work computing. In Kowa-lik and Grandinetti [KG93],page ?? ISBN 3-540-56451-9 (Berlin), 0-387-56451-9(New York). LCCN QA76.58.S629 1993.

Beguelin:1993:PEC

[BDG+93b] A. Beguelin, J. Dongarra,A. Geist, R. Manchek,S. Otto, and J. Walpole.PVM: Experiences, current

REFERENCES 105

status and future direction.In IEEE [IEE93e], pages765–766. ISBN 0-8186-4340-4 (paperback), 0-8186-4341-2 (microfiche), 0-8186-4342-0 (hardback), 0-8186-4346-3 (CD-ROM). ISSN 1063-9535. LCCN QA76.5 .S961993.

Beguelin:1994:HHN

[BDG+94] A. Beguelin, J. J. Dongarra,G. Al Geist, R. Manchek,and K. Moore. HeNCE: aheterogeneous network com-puting environment. Scien-tific Programming, 3(1):49–60, Spring 1994. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Beguelin:1995:REP

[BDG+95] Adam Beguelin, Jack Don-garra, Al Geist, RobertManchek, and Vaidy Sun-deram. Recent enhance-ments to PVM. Interna-tional Journal of Supercom-puter Applications and HighPerformance Computing, 9(2):108–127, Summer 1995.CODEN IJSCFG. ISSN1078-3482.

Beguelin:19xx:PSS

[BDG+xx] A. Beguelin, J. J. Dongarra,G. A. Geist, R. Manchek,and V. S. Sunderam. PVMsoftware system and doc-umentation. Email tonetlib@ornl.gov, ???? 19xx.

Beguelin:1993:VDH

[BDGS93] Adam Beguelin, Jack Don-garra, Al Geist, and V. Sun-deram. Visualization and de-bugging in a heterogeneousenvironment. Computer, 26(6):88–95, June 1993. CO-DEN CPTRB4. ISSN0018-9162 (print), 1558-0814(electronic).

Bruck:1995:EMPb

[BDH+95] Jehoshua Bruck, DannyDolev, Ching-Tien Ho,Marcel-Catalin Rosu, andRay Strong. Efficient Mes-sage Passing Interface (MPI)for parallel computing onclusters of workstations. InACM [ACM95b], pages 64–73. ISBN 0-89791-717-0.LCCN QA76.642 .A25 1995.

Bruck:1997:EMP

[BDH+97] Jehoshua Bruck, DannyDolev, Ching-Tien Ho,Marcel-Catalin Rosu, andRay Strong. Efficient mes-sage passing interface (MPI)for parallel computing onclusters of workstations.Journal of Parallel and Dis-tributed Computing, 40(1):19–34, January 10, 1997.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1267/production;http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1267/production/

REFERENCES 106

pdf; http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1267/production/ref.

Browne:1998:RPA

[BDL98] Shirley Browne, Jack Don-garra, and Kevin London.Review of performance anal-ysis tools for MPI paral-lel programs. NHSE Re-view, 3, 1998. CODEN ????ISSN ???? URL http://www.cs.utk.edu/~browne/perftools-review/. Ac-cepted, to appear.

Bode:1996:PVM

[BDLS96] Arndt Bode, Jack Dongarra,T. Ludwig, and V. Sun-deram, editors. Parallelvirtual machine, EuroPVM’96: third European PVMconference, Munich, Ger-many, October 7–9, 1996:proceedings, volume 1156of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 1996.ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Baghsorkhi:2010:APM

[BDP+10] Sara S. Baghsorkhi, MatthieuDelahaye, Sanjay J. Patel,William D. Gropp, and Wenmei W. Hwu. An adaptiveperformance modeling toolfor GPU architectures. ACM

SIGPLAN Notices, 45(5):105–114, May 2010. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Bronevetsky:2007:CFS

[BdS07] Greg Bronevetsky and Bro-nis R. de Supinski. Com-plete formal specificationof the OpenMP memorymodel. International Jour-nal of Parallel Programming,35(4):335–392, August 2007.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=35&issue=4&spage=335.

Baboulin:2008:SID

[BDT08] Marc Baboulin, Jack J. Don-garra, and Stanimire To-mov. Some issues in denselinear algebra for multicoreand special purpose archi-tectures. LAPACK Work-ing Note 200, Department ofComputer Science, Univer-sity of Tennessee, Knoxville,Knoxville, TN 37996, USA,May 2008. URL http://www.netlib.org/lapack/lawnspdf/lawn200.pdf.

Briguglio:2003:PPM

[BDV03] Sergio Briguglio, BeniaminoDi Martino, and Grego-rio Vlad. A performance-prediction model for PICapplications on clusters of

REFERENCES 107

symmetric multiprocessors:Validation with hierarchicalHPF + OpenMP implemen-tation. Scientific Program-ming, 11(2):159–176, 2003.CODEN SCIPEV. ISSN1058-9244 (print), 1875-919X (electronic).

Bubak:1997:RAP

[BDW97] Marian Bubak, J. J. Don-garra, and Jerzy Was-niewski, editors. Recentadvances in parallel virtualmachine and message pass-ing interface: 4th EuropeanPVM/MPI user’s groupmeeting Cracow, Poland,November 3–5, 1997: pro-ceedings, volume 1332 ofLecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 1997. CO-DEN LNCSD9. ISBN 3-540-63697-8 (paperback). ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E973 1997.

Batty:2016:OSA

[BDW16] Mark Batty, Alastair F.Donaldson, and John Wick-erson. Overhauling SCatomics in C11 and OpenCL.ACM SIGPLAN Notices, 51(1):634–648, January 2016.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Beyls:1999:JJP

[BDY99] K. Beyls, E. D’Hollander,and Y. Yu. JPT: a Javaparallelization tool. InDongarra et al. [DLM99],pages 173–180. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Beguelin:1992:XTM

[Beg92] Adam Louis Beguelin. Xab:a tool for monitoring PVMprograms. Technical report,School of Computer Science,Carnegie Mellon University,Pittsburgh, PA, USA, June5, 1992.

Beguelin:1993:XTMb

[Beg93a] A. L. Beguelin. Xab: a toolfor monitoring PVM pro-grams. In Mudge et al.[MMH93], pages 102–103(vol. 2) (or 4–??). ISBN0-8186-3230-5. LCCN ????Four volumes. IEEE catalognumber 93TH0501-7.

Beguelin:1993:XAT

[Beg93b] Adam Beguelin. Xab: a toolfor monitoring PVM pro-grams. In IEEE [IEE93f],pages 92–97. ISBN 0-8186-2702-6. LCCN QA76.58.W654 1992.

Beguelin:1993:XTMa

[Beg93c] Adam L. Beguelin. Xab:a tool for monitoring PVMprograms. Research paperCMU-CS-93-164, School of

REFERENCES 108

Computer Science, CarnegieMellon University, Pitts-burgh, PA, USA, 1993. 8 pp.

Bull:2010:PEM

[BEG+10] J. Mark Bull, James En-right, Xu Guo, Chris May-nard, and Fiona Reid.Performance evaluation ofmixed-mode OpenMP/MPIimplementations. Inter-national Journal of Par-allel Programming, 38(5–6):396–417, October 2010.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=38&issue=5&spage=396.

Benkner:1995:VFA

[Ben95] S. Benkner. Vienna Fortran90 — an advanced data par-allel language. In Malyshkin[Mal95], pages 142–156.ISBN 3-540-60222-4. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I547 1995.

Bencheva:2001:MPI

[Ben01] G. Bencheva. MPI par-allel implementation of afast separable solver. Lec-ture Notes in Computer Sci-ence, 2179:454–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/

bibs/2179/21790454.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2179/21790454.pdf.

Bernaschi:1996:RHP

[Ber96] Massimo Bernaschi. The re-quirements of a high per-formance implementation ofPVM. Future GenerationComputer Systems, 12(1):3–11, May 1996. CODENFGSEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic).

Baker:1998:MNP

[BF98] M. Baker and G. Fox. MPIon NT: a preliminary eval-uation of the available en-vironments. Lecture Notesin Computer Science, 1388:549–??, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Berthou:2001:COH

[BF01] Jean-Yves Berthou andEric Fayolle. ComparingOpenMP, HPF, and MPIprogramming: a study case.The International Journal ofHigh Performance Comput-ing Applications, 15(3):297–309, Fall 2001. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic).

REFERENCES 109

Bubak:2001:PMS

[BFBW01] Marian Bubak, W lodzimierzFunika, Bartosz Bali, andRoland Wismuller. Per-formance measurement sup-port for MPI applicationswith PATOP. LectureNotes in Computer Sci-ence, 1947:288–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1947/19470288.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1947/19470288.pdf.

Bischof:1994:CSM

[BfDA94] Christian Bischof and In-stitute for Defense Anal-yses. A case study ofMPI: portable and efficientlibraries. Technical re-port SRC-TR-94-130, Super-computing Research Center:IDA, Lanham, MD, USA,1994. 6 pp.

Broquedis:2010:FEO

[BFG+10] Francois Broquedis, NathalieFurmento, Brice Goglin,Pierre-Andre Wacrenier,and Raymond Namyst.ForestGOMP: An efficientOpenMP environment forNUMA architectures. In-ternational Journal of Par-allel Programming, 38(5–6):418–439, October 2010.CODEN IJPPE5. ISSN

0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=38&issue=5&spage=418.

Bubak:1999:EFP

[BFIM99] M. Bubak, W. Funika,K. Iskra, and R. Maruszewski.Enhancing the functional-ity of performance mea-surement tools for messagepassing environments. InDongarra et al. [DLM99],pages 67–74. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Baraglia:1999:PAN

[BFLL99] R. Baraglia, R. Ferrini,D. Laforenza, and A. La-gana. Parallel approachesto a numerically intensiveapplication using PVM. InDongarra et al. [DLM99],pages 364–371. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Bubak:1996:MPP

[BFM96] M. Bubak, W. Funika, andJ. Moscinski. Monitoringof performance of PVM ap-plications on virtual net-work computer. In Was-niewski [Was96], pages 147–156. ISBN 3-540-62095-8.LCCN QA76.58 .P35 1996.

REFERENCES 110

Bubak:1997:EPA

[BFM97] M. Bubak, W. Funika, andJ. Moscinski. Evaluation ofparallel application’s behav-ior in message passing en-vironment. Lecture Notesin Computer Science, 1332:234–241, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Bouge:1996:EPP

[BFMR96] Luc Bouge, P. Fraigniaud,A. Mignotte, and Y. Robert,editors. Euro-Par ’96 par-allel processing: second In-ternational Euro-Par Con-ference, Lyon, France, Au-gust 26–29, 1996: pro-ceedings, volume 1123–1124of Lecture notes in com-puter science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 1996. ISBN3-540-61626-8 (vol. 1), 3-540-61627-6 (vol. 2). ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I554 1996, QA267.A1L43 no.1123-1124. Two vol-umes.

Bubak:1996:PBP

[BFMT96a] M. Bubak, W. Funika,J. Moscinski, and D. Tasak.Pablo-based performancemonitoring tool for PVM ap-plications. In Dongarra et al.[DMW96], pages 69–78.ISBN 3-540-60902-4. ISSN0302-9743 (print), 1611-

3349 (electronic). LCCNQA76.58.P35 1995.

Bubak:1996:PPM

[BFMT96b] M. Bubak, W. Funika,J. Moscinski, and D. Tasak.Pablo-Based performancemonitoring tool for PVM ap-plications. In Dongarra et al.[DMW96], pages 69–78.ISBN 3-540-60902-4. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.P35 1995.

Bozas:1997:PED

[BFZ97] G. Bozas, M. Fleischhauer,and S. Zimmermann. PVMexperiences in develop-ing the MIDAS paralleldatabase system. LectureNotes in Computer Sci-ence, 1332:427–434, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Bhavsar:1991:SSJ

[BG91] Virendrakumar ChhabulalBhavsar and Uday Govind-das Gujar, editors. Super-computing Symposium ’91,June 3–5, 1991, Frederic-ton, NB, Canada: sympo-sium proceedings. Universityof New Brunswick Press,Fredericton, NB, Canada,1991. ISBN 0-920114-14-8.LCCN QA76.88.S87 1991.

Boerger:1994:FSP

[BG94a] E. Boerger and U. Glaesser.A formal specification of

REFERENCES 111

the PVM architecture. InPehrson et al. [PSB+94],pages 402–409. CODENITATEC. ISBN 0-444-81990-8, 0-444-81989-4. ISSN 0926-5473. LCCN QA75.5.I37851994. Three volumes.

Borger:1994:AMP

[BG94b] E. Borger and U. Glasser.An abstract model of theParallel Virtual Machine(PVM). In Anonymous[Ano94e], pages 308–309.ISBN 1-880843-09-9. LCCNQA76.58.I543 1994.

Borger:1994:FSP

[BG94c] E. Borger and U. Glasser.A formal specification of thePVM architecture. IFIPTransactions. A. ComputerScience and Technology, A-51:402–409, ???? 1994. CO-DEN ITATEC. ISSN 0926-5473.

Barbour:1995:PIG

[BG95] A. E. Barbour and M. F.Gabre. Parallel implemen-tation of Gauss–Seidel andconjugate gradient for solv-ing system of linear equa-tions Ax = b using PVM.In Aityan et al. [AGH+95],pages 33–36. ISBN 0-9640398-9-3 (hardback) 0-9640398-8-5 (paperback).LCCN QA76.87 .I58 1995.

Banikazemi:2001:MLE

[BGBP01] Mohammad Banikazemi,Rama K. Govindaraju,

Robert Blackmore, and Dha-baleswar K. Panda. MPI-LAPI: An efficient im-plementation of MPI forIBM RS/6000 SP systems.IEEE Transactions on Par-allel and Distributed Sys-tems, 12(10):1081–1093, Oc-tober 2001. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic). URL http://dlib.computer.org/td/books/td2001/pdf/l1081.pdf;http://www.computer.org/tpds/td2001/l1081abs.htm.

Broquedis:2012:LEO

[BGD12] Francois Broquedis, ThierryGautier, and Vincent Dan-jean. libOMP, an efficientOpenMP runtime system forboth fork-join and data flowparadigms. Lecture Notesin Computer Science, 7312:102–115, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_8/.

Bronevetsky:2009:CAC

[BGdS09] Greg Bronevetsky, JohnGyllenhaal, and Bronis R.de Supinski. CLOMP:Accurately characterizingOpenMP application over-heads. International Jour-nal of Parallel Programming,37(3):250–265, June 2009.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640

REFERENCES 112

(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=37&issue=3&spage=250.

Blanco:2002:PMA

[BGG+02] V. Blanco, L. Garcıa, J. A.Gonzalez, C. Rodrıguez, andG. Rodrıguez. A perfor-mance model for the analysisof OpenMP programs. Par-allel and Distributed Com-puting Practices, 5(2):139–151, June 2002. CODEN???? ISSN 1097-2803.

Balasubramanian:2015:EGL

[BGG+15] Raghuraman Balasubrama-nian, Vinay Gangadhar, Zil-iang Guo, Chen-Han Ho,Cherin Joseph, JaikrishnanMenon, Mario Paulo Dru-mond, Robin Paul, SharathPrasad, Pradip Valathol,and Karthikeyan Sankar-alingam. Enabling GPGPUlow-level hardware explo-rations with MIAOW: anopen-source RTL implemen-tation of a GPGPU. ACMTransactions on Architec-ture and Code Optimiza-tion, 12(2):21:1–21:??, July2015. CODEN ???? ISSN1544-3566 (print), 1544-3973(electronic).

Bhanot:2005:OTL

[BGH+05] G. Bhanot, A. Gara, P. Hei-delberger, E. Lawless, J. C.Sexton, and R. Walkup.Optimizing task layout on

the Blue Gene/L supercom-puter. IBM Journal of Re-search and Development, 49(2/3):489–500, ???? 2005.CODEN IBMJAE. ISSN0018-8646 (print), 2151-8556(electronic). URL http://www.research.ibm.com/journal/rd/492/bhanot.pdf.

Bischof:2008:PRM

[BGK08] Christian Bischof, NielsGuertler, and AndreasKowarz. Parallel reversemode automatic differen-tiation for OpenMP pro-grams with ADOL-C. InBischof et al. [BBH+08],pages 163–173. CO-DEN LNCSA6. ISBN 3-540-68935-4 (print), 3-540-68942-7 (e-book). ISSN1439-7358. LCCN QA304.I58 2008. URL http://link.springer.com/content/pdf/10.1007/978-3-540-68942-3_15.

Butler:2000:SPM

[BGL00] Ralph Butler, WilliamGropp, and Ewing Lusk. Ascalable process-managementenvironment for parallel pro-grams. Lecture Notes inComputer Science, 1908:168–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080168.htm;

REFERENCES 113

http://link.springer-ny.com/link/service/series/0558/papers/1908/19080168.pdf.

Beisel:1997:EMD

[BGR97a] T. Beisel, E. Gabriel, andM. Resch. An extensionto MPI for distributed com-puting on MPPs. LectureNotes in Computer Science,1332:75–82, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Brune:1997:HMP

[BGR97b] Matthias Brune, Jorn Gehring,and Alexander Reinefeld.Heterogeneous message pass-ing and a link to resourcemanagement. The Jour-nal of Supercomputing, 11(4):355–369, December 1997.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=11&issue=4&spage=355; http://www.wkap.nl/oasis.htm/147011.Breitenecker:1995:ESC

[BH95] Felix Breitenecker and Ir-mgard Husinsky, editors.EUROSIM ’95: simula-tion congress: proceedings ofthe EUROSIM Conference,EUROSIM ’95, Vienna,Austria, 11–15 September1995. Elsevier, Amster-dam, The Netherlands, 1995.

ISBN 0-444-82241-0. LCCNA76.9.C65E966 1995.

Bhargava:1993:PIW

[Bha93] Bharat Bhargava, editor.Proceedings of the IEEEWorkshop on Advances inParallel and Distributed Sys-tems, October 6, 1993,Princeton, New Jersey.IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 1993. ISBN 0-8186-5250-0, 0-8186-5251-9.LCCN QA76.58.I444 1993.

Bhanot:1998:DTM

[Bha98] Gyan Bhanot. A 2-d trans-pose MPI code. Research re-port RC 21217, T. J. Wat-son Research Center, IBMCorporation, Almaden, CA,USA, 1998.

Bader:1996:PPA

[BHJ96] David A. Bader, David R.Helman, and Joseph JaJa.Practical parallel algorithmsfor personalized communi-cation and integer sorting.ACM Journal of Experimen-tal Algorithmics, 1:3:1–3:??,???? 1996. CODEN ????ISSN 1084-6654.

Bouteiller:2006:MVP

[BHK+06] A. Bouteiller, T. Herault,G. Krawezik, P. Lemarinier,and F. Cappello. MPICH-Vproject: a multiprotocol au-tomatic fault-tolerant MPI.The International Journal

REFERENCES 114

of High Performance Com-puting Applications, 20(3):319–333, Fall 2006. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/20/3/319.full.pdf+html.

Bubeck:1995:DSC

[BHKR95] T. Bubeck, M. Hiller,W. Kuchlin, and W. Rosen-stiel. Distributed sym-bolic computation withDTS. In Ferreira andRolim [FR95], pages 231–248. ISBN 3-540-60321-2.LCCN QA76.642.I59 1995.

Bischof:1995:CSM

[BHLS+95] C. Bischof, S. Huss-Lederman,Xiaobai Sun, A. Tsao, andT. Turnbull. A case study ofMPI: Portable and efficientlibraries. In Bailey et al.[BBG+95], pages 728–733.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

Bachem:1994:PCT

[BHM94] A. Bachem, W. Hochstattler,and M. Malich. Simulatedtrading — a new parallelapproach for solving vehiclerouting problems. In Jou-bert et al. [JPTE94], pages471–475. ISBN 0-444-81841-3. LCCN QA76.58 .P37941993.

Bachem:1996:STH

[BHM96] A. Bachem, Hochstattler,and M. Malich. The sim-ulated trading heuristic for

solving vehicle routing prob-lems. Discrete Applied Math-ematics, 65(1-3):47–72, ????1996. CODEN DAMADU.ISSN 0166-218X (print),1872-6771 (electronic).

Brunst:2001:POL

[BHNW01] Holger Brunst, Hans-ChristianHoppe, Wolfgang E. Nagel,and Manuela Winkler. Per-formance optimization forlarge scale computing: Thescalable VAMPIR approach.Lecture Notes in ComputerScience, 2074:751–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2074/20740751.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2074/20740751.pdf.

Barekas:2003:MAO

[BHP+03] Vasileios K. Barekas, Pana-giotis E. Hadjidoukas, Eleft-herios D. Polychronopoulos,et al. A multiprogrammingaware OpenMP implemen-tation. Scientific Program-ming, 11(2):133–141, 2003.CODEN SCIPEV. ISSN1058-9244 (print), 1875-919X (electronic).

Bondhugula:2008:PAP

[BHRS08] Uday Bondhugula, AlbertHartono, J. Ramanujam,and P. Sadayappan. A

REFERENCES 115

practical automatic polyhe-dral parallelizer and local-ity optimizer. ACM SIG-PLAN Notices, 43(6):101–113, June 2008. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Bisseling:2002:FMF

[BHS+02] Georg Bißeling, Hans-ChristianHoppe, Alexander Supalov,Pierre Lagier, and Jean La-tour. Fujitsu MPI-2: Fastlocally, reaching globally.Lecture Notes in ComputerScience, 2474:401–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740401.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740401.pdf.

Bazow:2018:MPS

[BHS18] Dennis Bazow, Ulrich Heinz,and Michael Strickland.Massively parallel simula-tions of relativistic fluiddynamics on graphics pro-cessing units with CUDA.Computer Physics Com-munications, 225(??):92–113, April 2018. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465517300279.

Berka:2012:PET

[BHV12] Tobias Berka, Helge Ha-genauer, and Marian Va-jtersic. Portable explicitthreading and concurrentprogramming for MPI ap-plications. Lecture Notesin Computer Science, 7204:81–90, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-31500-8_9/.

Busa:2012:ACO

[BHW+12] Jan Busa, Jr., ShuraHayryan, Ming-Chya Wu,Jan Busa, and Chin-KunHu. ARVO-CL: the OpenCLversion of the ARVO pack-age — an efficient toolfor computing the accessi-ble surface area and theexcluded volume of pro-teins via analytical equa-tions. Computer PhysicsCommunications, 183(11):2494–2497, November 2012.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465512001580.

Bae:2017:SEF

[BHW+17] Seung-Hee Bae, DanielHalperin, Jevin D. West,Martin Rosvall, and BillHowe. Scalable and efficientflow-based community de-tection for large-scale graph

REFERENCES 116

analysis. ACM Transac-tions on Knowledge Discov-ery from Data (TKDD), 11(3):32:1–32:??, April 2017.CODEN ???? ISSN 1556-4681 (print), 1556-472X(electronic).

Bickham:1995:POM

[Bic95] J. L. Bickham. Paral-lel ocean modeling usingGlenda. In ACM [ACM95a],pages 58–63. ISBN 0-89791-747-2. LCCN ????

Bernaschi:2005:ERA

[BIC05] Massimo Bernaschi, GiulioIannello, and Saverio Crea.Experimental results aboutMPI collective communica-tion operations. ParallelProcessing Letters, 15(1/2):223–236, March/June 2005.CODEN PPLTEE. ISSN0129-6264 (print), 1793-642X (electronic).

Blas:2010:IEF

[BIC+10] Javier Garcia Blas, FlorinIsaila, Jesus Carretero,David Singh, and FelixGarcia-Carballeira. Imple-mentation and evaluation offile write-back and prefetch-ing for MPI-IO over GPFS.The International Journal ofHigh Performance Comput-ing Applications, 24(1):78–92, February 2010. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/24/1/78.full.pdf+html.

Branca:1995:CBH

[BID95] A. Branca, M. Ianigro, andA. Distante. A comparisonbetween HPF and PVM fordata parallel algorithms ona cluster of workstations us-ing a high speed network.In Hertzberger and Ser-azzi [HS95a], pages 930–931.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Bilger:1995:AFM

[Bil95] R. W. Bilger, editor. 12thAustralasian fluid mechan-ics conference: – Decem-ber 1995, Sydney, Australia,Australasian Fluid Mechan-ics Conference 1995; EDIT12//V2. University of Syd-ney, ????, 1995. ISBN 0-86934-034-4. LCCN ????

Bernaschi:1999:ERA

[BIL99] M. Bernaschi, G. Iannello,and M. Lauria. Experimen-tal results about MPI col-lective communication op-erations. Lecture Notesin Computer Science, 1593:774–??, 1999. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Biradar:1994:ADL

[Bir94] Umesh V. Biradar. Adap-tive distributed load balanc-ing model for parallel virtualmachine. Master of science

REFERENCES 117

in computer science, De-partment of Computer Sci-ence, College of Engineer-ing, Lamar University, Beau-mont, TX, USA, 1994. viii +44 pp.

Bisseling:2004:PSC

[Bis04] Rob H. Bisseling. Paral-lel scientific computation:a structured approach us-ing BSP and MPI. Ox-ford University Press, Wal-ton Street, Oxford OX26DP, UK, 2004. ISBN 0-19-852939-2. xviii + 305 pp.LCCN QA76.58 .B57 2004.URL http://www.loc.gov/catdir/enhancements/fy0617/2004046141-d.html; http://www.loc.gov/catdir/enhancements/fy0617/2004046141-t.html.

Baiardi:1993:PVM

[BJ93] F. Baiardi and M. Jazayeri.P03M: a virtual machine ap-proach to massively parallelcomputing. Proceedings ofthe International Conferenceon Parallel Processing, pagesI–340–??, ???? 1993. CO-DEN PCPADL. ISSN 0190-3918.

Boianov:1995:DLC

[BJ95] L. Boianov and I. Jelly. Dis-tributed logic circuit simula-tion on a network of work-stations. In IEEE [IEE95h],pages 304–310. ISBN 0-8186-7031-2, 0-8186-7032-0.LCCN QA76.58 .E97 1995.

Barkati:2013:SPA

[BJ13] Karim Barkati and PierreJouvelot. Synchronous pro-gramming in audio process-ing: a lookup table os-cillator case study. ACMComputing Surveys, 46(2):24:1–24:??, November 2013.CODEN CMSVAN. ISSN0360-0300 (print), 1557-7341(electronic).

Bjorge:1995:ISS

[Bjo95] D. Bjorge. Implementationof the semi-implicit schemein a message passing versionof HIRLAM (weather fore-casting). In Hoffmann andKreitz [HK95], pages 75–90.ISBN 981-02-2211-4. LCCNQC866.E26 1994.

Blaheta:1997:PIP

[BJS97] R. Blaheta, O. Jakl, andJ. Stary. PVM-implementationof the PCG method withdisplacement decomposition.Lecture Notes in ComputerScience, 1332:321–328, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Blaheta:1999:LFM

[BJS99] R. Blaheta, O. Jakl, andJ. Stary. Large-scale FEmodelling in geomechanics:a case study in paralleliza-tion. In Dongarra et al.[DLM99], pages 299–306.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743

REFERENCES 118

(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Bhandarkar:1996:MPM

[BK96] M. A. Bhandarkar and L. V.Kale. MICE: a proto-type MPI implementation inConverse environment. InIEEE [IEE96i], pages 26–31.ISBN 0-8186-7533-0. LCCNQA76.642 .M67 1996.

Balevic:2011:KAD

[BK11] Ana Balevic and Bart Kien-huis. KPN2GPU: an ap-proach for discovery andexploitation of fine-graindata parallelism in pro-cess networks. ACMSIGARCH Computer Archi-tecture News, 39(4):66–71,September 2011. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic).

Bhandarkar:2001:ALB

[BKdSH01] Milind Bhandarkar, L. V.Kale, Eric de Sturler, andJay Hoeflinger. Adap-tive load balancing forMPI programs. LectureNotes in Computer Sci-ence, 2074:108–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2074/20740108.htm;http://link.springer-

ny.com/link/service/series/

0558/papers/2074/20740108.pdf.

Bekas:2002:PCP

[BKGS02] Constantine Bekas, EfrosiniKokiopoulou, Efstratios Gal-lopoulos, and Valeria Si-moncini. Parallel compu-tation of pseudospectra us-ing transfer functions on aMATLAB-MPI cluster plat-form. Lecture Notes inComputer Science, 2474:199–??, 2002. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740199.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740199.pdf.

Berka:2013:CPC

[BKH+13] Tobias Berka, Giorgos Kol-lias, Helge Hagenauer, Mar-ian Vajtersic, and AnanthGrama. Concurrent pro-gramming constructs forparallel MPI applications.The Journal of Super-computing, 63(2):385–406,February 2013. CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-011-0739-5.

Boryczko:1995:NIC

[BKML95] I. Boryczko, J. Kitowski,J. Moscinski, and A. Leszczyn-

REFERENCES 119

ski. Numerically intensivecomputing as a benchmarkfor parallel computer archi-tectures. In Hertzbergerand Serazzi [HS95a], pages118–123. ISBN 3-540-59393-4. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.88 .I571995.

Bull:2000:PPJ

[BKO00] J. Mark Bull, Mark E.Kambites, and Jan Obdrza-lek. Parallel programming inJava with OpenMP-like di-rectives. In ACM [ACM00],page 150. URL http://www.sc2000.org/proceedings/info/fp.pdf.

Beaugnon:2014:VVO

[BKvH+14] Ulysse Beaugnon, AlexeyKravets, Sven van Haas-tregt, Riyadh Baghdadi,David Tweed, Javed Ab-sar, and Anton Lokhmo-tov. VOBLA: a vehi-cle for optimized basic lin-ear algebra. ACM SIG-PLAN Notices, 49(5):115–124, May 2014. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Ballico:1994:PSP

[BL94] M. Ballico and H. Lederer.Plasmafusionsforschung: Se-rielles und paralleles Rech-nen mit nur einem Pro-grammcode auf Cray YMP,nCUBE2, Workstations mit

PVM und KSR1. In Anony-mous [Ano94c], pages 232–234. ISBN ???? ISSN 0341-7778. LCCN Q180.55.E4M39 1993.

Bendrider:1995:SME

[BL95] M. Bendrider and J.-M.Leclercq. Second-orderMøller–Plesset and Epstein-Nesbet corrections to themolecular charge density:Distributed computing ona cluster of heterogeneousworkstations with the PVMsystem. In Bernardi andRivail [BR95a], pages 73–?? ISBN 1-56396-457-0. ISSN 0094-243X (print),1551-7616 (electronic), 1935-0465. LCCN QD39.3.E46E15 1995.

Beazley:1997:EMP

[BL97] D. M. Beazley and P. S.Lomdahl. Extensible mes-sage passing applicationdevelopment and debug-ging with Python. InIEEE [IEE97b], pages 650–655. ISBN 0-8186-7793-7. LCCN QA76.58 .I561997. IEEE catalog number97TB100107. IEEE Com-puter Society Press ordernumber PR07792.

Bubak:1999:TPR

[BL99] M. Bubak and P. Luszczek.Towards portable runtimesupport for irregular andout-of-core computations. InDongarra et al. [DLM99],pages 59–66. ISBN 3-540-

REFERENCES 120

66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Baraglia:1993:PWC

[BLP93] R. Baraglia, D. Laforenza,and R. Perego. Program-ming a workstation clus-ter with PVM and Linda:a qualitative and quantita-tive comparison. In Anony-mous [Ano93b], pages 101–114. ISBN ???? LCCN ????

Bach:2013:LQB

[BLPP13] Matthias Bach, Volker Lin-denstruth, Owe Philipsen,and Christopher Pinke. Lat-tice QCD based on OpenCL.Computer Physics Com-munications, 184(9):2042–2052, September 2013. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465513001288.

Bubak:1998:PCL

[BLW98] M. Bubak, P. Luszczek, andA. Wierzbowska. Port-ing CHAOS library to MPI.Lecture Notes in ComputerScience, 1497:131–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Bhandarkar:1997:CRP

[BM97] Suchendra M. Bhandarkarand Salem Machaka. Chro-mosome reconstruction from

physical maps using a clus-ter of workstations. TheJournal of Supercomputing,11(1):61–86, March 1997.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=11&issue=1&spage=61; http://www.wkap.nl/oasis.htm/141471.

Booth:2000:SSM

[BM00] S. Booth and E. Mourao.Single-sided MPI implemen-tations for SUN MPI. InACM [ACM00], page 46.URL http://www.sc2000.org/proceedings/techpapr/papers/pap182.pdf.

Basumallik:2002:TOE

[BME02] Ayon Basumallik, Seung-Jai Min, and Rudolf Eigen-mann. Towards OpenMPexecution on software dis-tributed shared memory sys-tems. Lecture Notes inComputer Science, 2327:457–??, 2002. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270457.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270457.pdf.

REFERENCES 121

Buntinas:2007:IES

[BMG07] Darius Buntinas, GuillaumeMercier, and William Gropp.Implementation and evalua-tion of shared-memory com-munication and synchroniza-tion operations in MPICH2using the Nemesis communi-cation subsystem. ParallelComputing, 33(9):634–644,September 2007. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic).

Bronevetsky:2003:AAL

[BMPS03] Greg Bronevetsky, DanielMarques, Keshav Pingali,and Paul Stodghill. Au-tomated application-levelcheckpointing of MPI pro-grams. ACM SIGPLANNotices, pages 84–94, 2003.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Bubak:1994:PDS

[BMPZ94a] M. Bubak, J. Mosciniski,M. Pogoda, and W. Zdech-likiewicz. Parallel dis-tributed 2-D short-rangemolecular dynamics on net-worked workstations. InDongarra and Wasniewski[DW94], pages 127–135.ISBN 3-540-58712-8 (Berlin),0-387-58712-8 (New York).ISSN 0302-9743 (print),1611-3349 (electronic). LCCNQA76.58 .P35 1994. DM104.00.

Bubak:1994:EMD

[BMPZ94b] M. Bubak, J. Moscinski,M. Pogoda, and W. Zdech-likiewicz. Efficient molec-ular dynamics simulationon networked workstations.In Gruber and Tomassini[GT94], pages 191–194.ISBN 2-88270-011-3. LCCNQC20.7.E4I58 1994.

Baiardi:2001:CRD

[BMR01] Fabrizio Baiardi, PaoloMori, and Laura Ricci. Col-lecting remote data in ir-regular problems with hi-erarchical representation ofthe domain. Lecture Notesin Computer Science, 2131:304–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310304.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310304.pdf.

Brightwell:2002:DIM

[BMR02] Ron Brightwell, Arthur B.Maccabe, and Rolf Riesen.Design and implementationof MPI on Portals 3.0. Lec-ture Notes in Computer Sci-ence, 2474:331–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/

REFERENCES 122

2474/24740331.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740331.pdf.

Bubak:1994:FLG

[BMS94a] M. Bubak, J. Moscinski,and R. Slota. FHP lat-tice gas on networked work-stations. In Gruber andTomassini [GT94], pages427–430. ISBN 2-88270-011-3. LCCN QC20.7.E4I581994.

Bubak:1994:IPL

[BMS94b] M. Bubak, J. Moscinski, andR. Slota. Implementationof parallel lattice gas pro-gram on workstations un-der PVM. In Dongarraand Wasniewski [DW94],pages 136–146. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8 (New York). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58.P35 1994. DM104.00.

Barthels:2017:DJA

[BMS+17] Claude Barthels, Ingo Muller,Timo Schneider, GustavoAlonso, and Torsten Hoe-fler. Distributed join al-gorithms on thousands ofcores. Proceedings of theVLDB Endowment, 10(5):517–528, January 2017. CO-DEN ???? ISSN 2150-8097.

Berrendorf:2000:PCO

[BN00] Rudolf Berrendorf and GuidoNieken. Performance char-acteristics for OpenMP con-

structs on different par-allel computer architec-tures. Concurrency: practiceand experience, 12(12):1261–1273, October 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500355/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500355&PLACEBO=IE.pdf.

Bawidamann:2012:ETO

[BN12] Uwe Bawidamann and MarcoNehmeier. Expression tem-plates and OpenCL. LectureNotes in Computer Science,7204:71–80, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-31500-8_8/.

Bull:2001:MSO

[BO01] J. Mark Bull and DarraghO’Neill. A microbenchmarksuite for OpenMP 2.0. ACMSIGARCH Computer Archi-tecture News, 29(5):41–48,December 2001. CODENCANED2. ISSN 0163-5964(ACM), 0884-7495 (IEEE).

Bubak:2000:IOB

[BoFBW00] Marian Bubak, W. odz-imierz Funika, Bartosz Balis,and Roland Wismuller. In-teroperability of OCM-basedon-line tools. Lecture

REFERENCES 123

Notes in Computer Sci-ence, 1908:242–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080242.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080242.pdf.

Boisvert:1997:QNS

[Boi97] R. F. Boisvert, editor. Qual-ity of numerical software:assessment and enhance-ment / proceedings of theIFIP TC2/WG2.5 WorkingConference on the Qualityof Numerical Software, As-sessment and Enhancement,Oxford, United Kingdom, 8–12 July 1996. Chapman andHall, Ltd., London, UK,1997. ISBN 0-412-80530-8.LCCN QA297 .I35 1996.

Bonnet:1996:UPW

[Bon96] C. Bonnet. Using PVMin wireless network envi-ronments. In Bode et al.[BDLS96], pages 296–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Booth:2001:OML

[Boo01] Stephen Booth. Optimisingthe MPI library for the T3E.Lecture Notes in ComputerScience, 2150:80–??, 2001.

CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2150/21500080.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2150/21500080.pdf.

Borkowski:1999:LVC

[Bor99] J. Borkowski. On line vi-sualization or combining thestandard ORNL PVM witha vendor PVM implemen-tation. In Dongarra et al.[DLM99], pages 157–164.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Boszormenyi:1996:PCT

[Bos96] Laszlo Boszormenyi, edi-tor. Parallel computation:Third International ACPCConference with special em-phasis on parallel databasesand parallel I/O, Klagenfurt,Austria, September 23–25,1996: proceedings, volume1127 of Lecture notes in com-puter science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / London,UK / etc., 1996. ISBN 3-540-61695-0. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA267.A1L43 no.1127.

REFERENCES 124

Brebbia:1993:ASE

[BP93] C. A. Brebbia and H. Power,editors. Applications of Su-percomputers in Engineer-ing III, 27–29 September1993, Bath, UK. Compu-tational Mechanics Publica-tion, London, UK, 1993.ISBN 1-85312-236-X. LCCNTA345.I556 1993.

Berthou:1998:PHM

[BP98] J.-Y. Berthou and L. Plagne.Parallel HPF-MPI imple-mentation of the TBSCMPoisson solver. Lecture Notesin Computer Science, 1401:252–??, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Barbosa:1999:ADM

[BP99] J. Barbosa and A. Padilha.Algorithm-dependant methodto determine the optimalnumber of computers in par-allel virtual machines. Lec-ture Notes in Computer Sci-ence, 1573:508–521, 1999.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Beletsky:1994:OPV

[BPC94] V. Beletsky, T. Popova,and A. Chemeris. Orga-nization of a parallel vir-tual machine. In Horiguchiet al. [HHK94], pages 421–426. ISBN 0-8186-6507-6(case), 0-8186-6506-8 (mi-crofiche). LCCN QA76.58

.I5673 1994 Bar. IEEE cata-log number 94TH0697-3.

Becks:1994:NCT

[BPG94] K.-H. Becks and D. Perret-Gallix, editors. New comput-ing techniques in physics re-search III: proceedings of theThird International Work-shop on Software Engi-neering, Artificial Intelli-gence and Expert Systemsfor High Energy and Nu-clear Physics: October 4–8, 1993, Oberammergau,Germany. World ScientificPublishing Co. Pte. Ltd.,P. O. Box 128, FarrerRoad, Singapore 9128, 1994.ISBN 981-02-1699-8. LCCNQC793.47.E4I58 1993.

Barbosa:1997:EUW

[BPMN97] J. G. Barbosa, A. J. Padilha,J.-P. Madier, and T. Neu-bert. Experiments on us-ing WPVM for industrialvisual inspection problems.Lecture Notes in ComputerScience, 1300:828–??, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Baptista:2001:IOS

[BPS01] Tiago Baptista, Hernani Pe-droso, and Joao GabrielSilva. The implementa-tion of one-sided communi-cations for WMPI II. Lec-ture Notes in ComputerScience, 2131:61–??, 2001.CODEN LNCSD9. ISSN

REFERENCES 125

0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310061.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310061.pdf.

Balou:1991:DIV

[BR91] A. T. Balou and A. N.Refenes. The design and im-plementation of VOOM: aparallel virtual object ori-ented machine. Micro-processing and Micropro-gramming, 32(1-5):289–296,August 1991. CODENMMICDT. ISSN 0165-6074(print), 1878-7061 (elec-tronic).

Burrer:1994:RRB

[BR94] C. Burrer and P. Remy. RU-BIS: a runtime basic inter-face software on TELMATT9000 TN series. In de Glo-ria et al. [dGJM94], pages63–78. ISBN ???? LCCN????

Bernardi:1995:CCE

[BR95a] Francesco Bernardi andJean-Louis Rivail, editors.Computational chemistry:1st European conferenceon computational chemistry(May 1994, Nancy, France),number 330 in AIP Con-ference Proceedings. Amer-ican Institute of Physics,Woodbury, NY, USA, 1995.

ISBN 1-56396-457-0. ISSN0094-243X (print), 1551-7616 (electronic), 1935-0465.LCCN QD39.3.E46 E151995.

Bernaschi:1995:PEI

[BR95b] M. Bernaschi and G. Richelli.PVMe: an enhanced im-plementation of PVM forthe IBM 9076 SP2. InHertzberger and Serazzi[HS95a], pages 461–471.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Bernaschi:1995:DRP

[BR95c] Massimo Bernaschi andGiorgio Richelli. Develop-ment and results of PVMeon the IBM 9076 SP1. Jour-nal of Parallel and Dis-tributed Computing, 29(1):75–83, August 15, 1995.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.idealibrary.com/links/doi/10.1006/jpdc.1995.1107/production;http://www.idealibrary.com/links/doi/10.1006/jpdc.1995.1107/production/pdf.

Bane:2002:EOA

[BR02] M. K. Bane and G. D. Riley.Extended overhead analysisfor OpenMP (research note).Lecture Notes in ComputerScience, 2400:162–??, 2002.CODEN LNCSD9. ISSN

REFERENCES 126

0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2400/24000162.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2400/24000162.pdf.

Boeres:2004:ETF

[BR04] Cristina Boeres and VinodE. F. Rebello. EasyGrid:towards a framework forthe automatic Grid en-abling of legacy MPI ap-plications. Concurrencyand Computation: Prac-tice and Experience, 16(5):425–432, April 25, 2004.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Bergstrom:2012:NDP

[BR12] Lars Bergstrom and JohnReppy. Nested data-parallelism on the GPU.ACM SIGPLAN Notices,47(9):247–258, September2012. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic).

Bramley:1997:TNR

[Bra97] Randall Bramley. Tech-nology news & reviews:Chemkin software; OpenMPFortran Standard; ODEtoolbox for Matlab; Javaproducts; Scientific Work-Place 3.0. IEEE Compu-

tational Science & Engi-neering, 4(4):75–78, Octo-ber/December 1997. CO-DEN ISCEE4. ISSN 1070-9924 (print), 1558-190X(electronic). URL http://dlib.computer.org/cs/books/cs1997/pdf/c4075.pdf.

Briscolini:1995:PID

[Bri95] M. Briscolini. A parallel im-plementation of a 3-D pseu-dospectral based code on theIBM 9076 scalable POWERparallel system. Paral-lel Computing, 21(11):1849–1862, November 29, 1995.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1995&volume=21&issue=11&aid=1027.

Brieger:2000:HOO

[Bri00] Leesa Brieger. HPF toOpenMP on the Origin2000:a case study. Concur-rency: practice and ex-perience, 12(12):1147–1154,October 2000. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500351/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500351&PLACEBO=IE.pdf.

REFERENCES 127

Brightwell:2002:RMR

[Bri02] Ron Brightwell. Ready-mode receive: An optimizedreceive function for MPI.Lecture Notes in ComputerScience, 2474:385–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740385.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740385.pdf.

Brightwell:2010:EDA

[Bri10] Ron Brightwell. Exploitingdirect access shared mem-ory for MPI on multi-coreprocessors. The Interna-tional Journal of High Per-formance Computing Appli-cations, 24(1):69–77, Febru-ary 2010. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/24/1/69.full.pdf+html.

Brightwell:2003:DIP

[BRM03] Ron Brightwell, Rolf Riesen,and Arthur B. Maccabe. De-sign, implementation, andperformance of MPI on Por-tals 3.0. The Interna-tional Journal of High Per-formance Computing Appli-cations, 17(1):7–20, Spring2003. CODEN IHPCFL.ISSN 1094-3420 (print),1741-2846 (electronic).

Boudet:1999:PIH

[BRR99] V. Boudet, F. Rastello, andY. Robert. PVM imple-mentation of heterogeneousScaLAPACK dense linearsolvers. In Dongarra et al.[DLM99], pages 333–340.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Benzoni:1992:CLF

[BRS92] A. Benzoni, G. Richelli, andV. S. Sunderam. ConcurrentLU factorization on work-station networks. In Evanset al. [EJL92], pages 159–166. ISBN 0-444-89212-5.LCCN QA76.58.I545 1991.

Briley:1994:NNH

[BRST94] W. R. Briley, D. S. Reese,A. Skjellum, and L. H.Turcotte. NHPDCC: TheNational High PerformanceDistributed Computing Con-sortium. In IEEE [IEE94f],pages 2–9. ISBN 0-8186-4980-1. LCCN QA76.58.S341993.

Bruck:1995:EMPa

[Bru95] Jehoshua Bruck. Efficientmessage passing interface(MPI) for parallel comput-ing on clusters of work-stations. Research reportRJ 9925 (87305), IBM T.J. Watson Research Cen-ter, Yorktown Heights, NY,USA, 1995. 31 pp.

REFERENCES 128

Brightwell:2005:AIO

[BRU05] Ron Brightwell, Rolf Riesen,and Keith D. Underwood.Analyzing the impact ofoverlap, offload, and inde-pendent progress for Mes-sage Passing Interface ap-plications. The Interna-tional Journal of High Per-formance Computing Appli-cations, 19(2):103–117, Sum-mer 2005. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/19/2/103.full.pdf+html.

Bruning:2012:MFT

[Bru12] Ulrich Bruning. MPI func-tions and their impact on in-terconnect hardware. Lec-ture Notes in Computer Sci-ence, 7490:10, 2012. CO-DEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/accesspage/chapter/10.1007/978-3-642-33518-1_2.

Barth:1993:CNM

[BS93] N. H. Barth and S. L. Smith.Coupling numerical modelsof the atmosphere and oceanusing the parallel virtual ma-chine (PVM) package. InSincovec [Sin93], pages 71–75. ISBN 0-89871-315-3.LCCN QA 76.58 S55 1993.Two volumes.

Bolding:1994:PCR

[BS94] Kevin Bolding and LawrenceSnyder, editors. Parallelcomputer routing and com-munication: first interna-tional workshop, PCRCW’94, Seattle, Washington,USA, May 16–18, 1994:proceedings, number 853in Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / London,UK / etc., 1994. ISBN 3-540-58429-3. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.P391994.

Beguelin:1996:TMD

[BS96a] A. Beguelin and V. Sun-deram. Tools for monitor-ing, debugging, and pro-gramming in PVM. In Bodeet al. [BDLS96], pages 7–13.ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Brightwell:1996:DIM

[BS96b] R. Brightwell and L. Shuler.Design and implementationof MPI on Puma portals. InIEEE [IEE96i], pages 18–25.ISBN 0-8186-7533-0. LCCNQA76.642 .M67 1996.

Blikberg:2001:NPA

[BS01] Ragnhild Blikberg and TorSørevik. Nested parallelism:Allocation of threads to

REFERENCES 129

tasks and OpenMP imple-mentation. Scientific Pro-gramming, 9(2–3):185–194,Spring–Summer 2001. CO-DEN SCIPEV. ISSN 1058-9244 (print), 1875-919X(electronic). URL http://iospress.metapress.com/app/home/contribution.asp%3Fwasp=7pab6qgbaf8vxg991rwy%26referrer=parent%26backto=issue%2C11%2C11%3Bjournal%2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

Blikberg:2005:LBO

[BS05] R. Blikberg and T. Sørevik.Load balancing and OpenMPimplementation of nestedparallelism. Parallel Com-puting, 31(10–12):984–998,October/December 2005.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Brown:2007:HSP

[BS07] Russell Brown and IlyaSharapov. High-scalabilityparallelization of a molec-ular modeling application:Performance and produc-tivity comparison betweenOpenMP and MPI im-plementations. Interna-tional Journal of Paral-lel Programming, 35(5):441–458, October 2007. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&

issn=0885-7458&volume=35&issue=5&spage=441.

Bassomo:1999:PGE

[BSC99] P. Bassomo, I. Sakho, andA. Corbel. Porting gen-eralized eigenvalue softwareon distributed memory ma-chines using systolic modelprinciples. In Dongarraet al. [DLM99], pages 396–403. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Bolton:2000:MPL

[BSG00] Hermanus P. J. Bolton,Jaco F. Schutte, and Al-bert A. Groenwold. Mul-tiple parallel local searchesin global optimization. Lec-ture Notes in ComputerScience, 1908:88–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080088.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080088.pdf.

Bukata:2015:SRC

[BSH15] Libor Bukata, PremyslSucha, and Zdenek Hanzalek.Solving the resource con-strained project schedulingproblem using the paral-lel tabu search designed for

REFERENCES 130

the CUDA platform. Jour-nal of Parallel and Dis-tributed Computing, 77(??):58–68, March 2015. CO-DEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731514002226.

Bakhtiari:1995:APL

[BSN95] S. Bakhtiari and R. Safavi-Naini. Application of PVMto linear cryptanalysis. InGray and Naghdy [GN95],pages 278–279. ISBN ????LCCN ????

Bai:2013:SLA

[BST+13] Mingze Bai, Shixin Sun,Hong Tang, Yusheng Dou,and Glenn V. Lo. An SPMD-like algorithm for paralleliz-ing molecular dynamics us-ing OpenMP. Comput-ing in Science and Engi-neering, 15(4):48–56, July/August 2013. CODENCSENFA. ISSN 1521-9615(print), 1558-366X (elec-tronic).

Benzoni:1991:MFR

[BSvdG91] A. Benzoni, V. S. Sunderam,and R. van de Guijn. Ma-trix factorization on a RISCworkstation network. In Du-rand and El Dabaghi [DE91],pages 207–218. ISBN 0-444-89224-9. LCCN QA75.5.I5851991.

Blaszczyk:1996:EPI

[BT96] A. Blaszczyk and C. Trini-tis. Experience with PVMin an industrial environ-ment. In Bode et al.[BDLS96], pages 174–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

biewski:2001:MOS

[bT01a] Maciej Go biewski and Jes-per Larsson Traff. MPI-2 one-sided communicationson a Giganet SMP cluster.Lecture Notes in ComputerScience, 2131:16–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310016.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310016.pdf.

Bu:2001:PAC

[BT01b] Libor Bu and Pavel Tvrdık.A parallel algorithm for con-nected components on dis-tributed memory machines.Lecture Notes in ComputerScience, 2131:280–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310280.htm;

REFERENCES 131

http://link.springer-ny.com/link/service/series/0558/papers/2131/21310280.pdf.

Bonelli:2017:MCA

[BTC+17] Francesco Bonelli, MicheleTuttafesta, Gianpiero Colonna,Luigi Cutrone, and GiuseppePascazio. An MPI–CUDAapproach for hypersonicflows with detailed state-to-state air kinetics usinga GPU cluster. Com-puter Physics Communi-cations, 219(??):178–195,October 2017. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465517301613.

Badia:1999:SIT

[BV99] J. M. Badia and A. M. Vi-dal. Solving the inverseToeplitz eigenproblem usingScaLAPACK and MPI. InDongarra et al. [DLM99],pages 372–379. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Baltas:1994:CPC

[BvdB94] N. D. Baltas and C. S.van den Berghe. Com-parison of the porting ofa computational fluid dy-namics application to SIMDand MIMD computers. InDekker et al. [DSZ94], pages

761–767. ISBN 0-444-81784-0. LCCN QA76.58.E98 1994.

Berendsen:1995:GMP

[BvdSvD95] H. J. C. Berendsen, D. van derSpoel, and R. van Drunen.GROMACS: a message-passing parallel moleculardynamics implementation.Computer Physics Com-munications, 91(1-3):43–56,September 1995. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic).

Baskaran:2012:ACO

[BVML12] Muthu Manikandan Baskaran,Nicolas Vasilache, BenoitMeister, and Richard Lethin.Automatic communicationoptimizations through mem-ory reuse strategies. ACMSIGPLAN Notices, 47(8):277–278, August 2012. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPOPP ’12 confer-ence proceedings.

Berg:2012:FCL

[BW12] Bernd A. Berg and HaoWu. Fortran code forSU(3) lattice gauge the-ory with and without MPIcheckerboard parallelization.Computer Physics Com-munications, 183(10):2145–2157, October 2012. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http:/

REFERENCES 132

/www.sciencedirect.com/science/article/pii/S0010465512001269.

Blum:1996:PIP

[BWT96] J. M. Blum, T. M. Warschko,and W. F. Tichy. PSPVM:implementing PVM on ahigh-speed interconnect forworkstation clusters. InBode et al. [BDLS96], pages235–?? ISBN 3-540-61779-5. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.E9751996.

Bureddy:2012:OGM

[BWV+12] D. Bureddy, H. Wang,A. Venkatesh, S. Potluri, andD. K. Panda. OMB-GPU:a micro-benchmark suite forevaluating MPI libraries onGPU clusters. Lecture Notesin Computer Science, 7490:110–120, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_16/.

Bihari:2012:CIT

[BWW+12] Barna L. Bihari, MichaelWong, Amy Wang, Bronis R.de Supinski, and WangChen. A case for includ-ing transactions in OpenMPII: Hardware transactionalmemory. Lecture Notesin Computer Science, 7312:44–58, 2012. CODENLNCSD9. ISSN 0302-9743

(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_4/.

Blattner:2012:PSC

[BY12] Timothy Blattner and Shim-ing Yang. Performance studyon CUDA GPUs for par-allelizing the local ensem-ble transformed Kalman fil-ter algorithm. Concurrencyand Computation: Prac-tice and Experience, 24(2):167–177, February 2012.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Bendtsen:1997:RLS

[BZ97] C. Bendtsen and Z. Zlatev.Running large-scale air pol-lution models on messagepassing machines. Lec-ture Notes in Computer Sci-ence, 1332:417–426, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Carpen-Amarie:2017:EOC

[CAHT17] Alexandra Carpen-Amarie,Sascha Hunold, and Jes-per Larsson Traff. On ex-pected and observed commu-nication performance withMPI derived datatypes. Par-allel Computing, 69(??):98–117, November 2017. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http:/

REFERENCES 133

/www.sciencedirect.com/science/article/pii/S0167819117301217.

Calmet:1994:RWC

[Cal94] J. Calmet, editor. Rhineworkshop on computer alge-bra — March 22–24, 1994,Karlsruhe, Germany. Uni-versitat Karlsruhe, Karl-sruhe, Germany, 1994. ISBN???? LCCN ????

Cabarle:2012:SNP

[CAM12] Francis George C. Cabarle,Henry Adorna, and Miguel A.Martınez. A spiking neu-ral P system simulator basedon CUDA. Lecture Notesin Computer Science, 7184:87–103, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-28024-5_8/.

Carbajal:2007:PTD

[Car07] Santiago Garcia Carbajal.Parallelizing three dimen-sional cellular automatawith OpenMP. Parallel Pro-cessing Letters, 17(4):349–361, December 2007. CO-DEN PPLTEE. ISSN 0129-6264 (print), 1793-642X(electronic).

Campanoni:2010:HFP

[CARB10] Simone Campanoni, Gio-vanni Agosta, Stefano CrespiReghizzi, and Andrea Di Bi-agio. A highly flexible, par-

allel virtual machine: de-sign and experience of ILD-JIT. Software—Practice andExperience, 40(2):177–207,February ??, 2010. CODENSPEXBL. ISSN 0038-0644(print), 1097-024X (elec-tronic).

Cavender:1993:APV

[Cav93] Mark Edward Cavender.Asynchronous parallel vir-tual machine. M.s. thesis,University of Texas at SanAntonio. Division of Math-ematics and Computer Sci-ence and Statistics, San An-tonio, TX, USA, 1993. vi +228 pp.

Chabbi:2017:EAL

[CAWL17] Milind Chabbi, AbdelhalimAmer, Shasha Wen, andXu Liu. An efficientabortable-locking protocolfor multi-level NUMA sys-tems. ACM SIGPLANNotices, 52(8):61–74, Au-gust 2017. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Cartwright:2000:AOE

[CB00] Keith L. Cartwright and Jo-seph D. Blahovec. AddingOpenMP to an existing MPIcode: Will it be benefi-cial? In ACM [ACM00],page 145. URL http://www.sc2000.org/proceedings/info/fp.pdf.

REFERENCES 134

Czapinski:2011:TST

[CB11] Michal Czapinski and Stu-art Barnes. Tabu Searchwith two approaches to par-allel flowshop evaluation onCUDA platform. Jour-nal of Parallel and Dis-tributed Computing, 71(6):802–811, June 2011. CO-DEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731511000384.

Creech:2016:TSS

[CB16] Timothy Creech and Ra-jeev Barua. Transparentlyspace sharing a multicoreamong multiple processes.ACM Transactions on Par-allel Computing (TOPC),3(3):17:1–17:??, December2016. CODEN ???? ISSN2329-4949 (print), 2329-4957(electronic).

Cooper:1994:CHF

[CBHH94] M. D. Cooper, N. A. Bur-ton, R. J. Hall, and I. H.Hillier. Combined Hartree–Fock and density functionaltheory: a distributed mem-ory parallel implementation.Journal of molecular struc-ture. Theochem, 121:97–107,December 1994. CODENTHEODJ. ISSN 0166-1280(print), 1872-7999 (elec-tronic).

Casas:2010:APD

[CBL10] Marc Casas, Rosa M. Badia,and Jesus Labarta. Auto-matic phase detection andstructure extraction of MPIapplications. The Interna-tional Journal of High Per-formance Computing Appli-cations, 24(3):335–360, Au-gust 2010. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/24/3/335.full.pdf+html.

Che:2008:PSG

[CBM+08] Shuai Che, Michael Boyer,Jiayuan Meng, David Tar-jan, Jeremy W. Sheaffer,and Kevin Skadron. A per-formance study of general-purpose applications ongraphics processors usingCUDA. Journal of Par-allel and Distributed Com-puting, 68(10):1370–1380,October 2008. CODENJPDCER. ISSN 0743-7315(print), 1096-0848 (elec-tronic).

Chapman:2002:APU

[CBPP02] B. Chapman, F. Bregier,A. Patil, and A. Prabhakar.Achieving performance un-der OpenMP on ccNUMAand software distributedshared memory systems.Concurrency and Compu-tation: Practice and Ex-perience, 14(8–9):713–739,July/August 2002. CODEN

REFERENCES 135

CCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic). URL http://www3.interscience.wiley.com/cgi-bin/abstract/95016122/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=95016122{\&}PLACEBO=IE.pdf.

Chapple:1995:PUL

[CC95] S. R. Chapple and L. J.Clarke. The Parallel UtilitiesLibrary. In IEEE [IEE95j],pages 21–30. ISBN 0-8186-6895-4. LCCN QA76.58 .S341994.

Cormen:1999:PBP

[CC99] Thomas H. Cormen andJames C. Clippinger. Per-forming BMMC permuta-tions efficiently on distributed-memory multiprocessorswith MPI. Algorithmica, 24(3–4):349–370, August 1999.CODEN ALGOEJ. ISSN0178-4617 (print), 1432-0541(electronic). URL http://link.springer.de/link/service/journals/00453/bibs/24n3p349.html; http://www.springerlink.com/openurl.asp?genre=article&issn=0178-4617&volume=24&issue=3&spage=349.

Ciaccio:2000:GMG

[CC00a] Giuseppe Ciaccio and Gio-vanni Chiola. GAMMAand MPI/GAMMA on gi-gabit ethernet. LectureNotes in Computer Sci-ence, 1908:129–??, 2000.

CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080129.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080129.pdf.

Couturier:2000:PMD

[CC00b] Raphael Couturier andChristophe Chipot. Paral-lel molecular dynamics us-ing OpenMP on a sharedmemory machine. Com-puter Physics Communica-tions, 124(1):49–59, Jan-uary 15, 2000. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465599004324.

Cardoso:2010:MSO

[CC10] M. C. Cardoso and F. M.Costa. MPI support onopportunistic grids basedon the InteGrade middle-ware. Concurrency andComputation: Practice andExperience, 22(3):343–357,March 10, 2010. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Chen:2017:AAG

[CC17] Jian Chen and Russell M.Clapp. Astro: Auto-generation of synthetic

REFERENCES 136

traces using scaling pat-tern recognition for MPIworkloads. IEEE Trans-actions on Parallel andDistributed Systems, 28(8):2159–2171, August 2017.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic)ITDSEO. URLhttps://www.computer.org/csdl/trans/td/2017/08/07809142-abs.html.

Chen:2000:MCO

[CCA00] Hsiang Ann Chen, Yvette O.Carrasco, and Amy W.Apon. MPI collective op-erations over IP multicast.Lecture Notes in ComputerScience, 1800:51–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1800/18000051.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1800/18000051.pdf.

Couder-Castaneda:2015:PCM

[CCBPGA15] C. Couder-Castaneda, H. Barrios-Pina, I. Gitler, and M. Ar-royo. Performance of acode migration for the sim-ulation of supersonic ejec-tor flow to SMP, MIC,and GPU using OpenMP,OpenMP+LEO, and Ope-nACC directives. Scien-tific Programming, 2015(??):739107:1–739107:20, ????2015. CODEN SCIPEV.

ISSN 1058-9244 (print),1875-919X (electronic). URLhttps://www.hindawi.com/journals/sp/2015/739107/.

Castagnera:1994:NEP

[CCF+94] K. Castagnera, D. Cheng,R. Fatoohi, E. Hook,B. Kramer, C. Manning,J. Musch, C. Niggley,W. Saphir, D. Sheppard,M. Smith, I. Stockdale,S. Welch, R. Williams, andD. Yip. NAS experienceswith a prototype clusterof workstations. In IEEE[IEE94h], pages 410–419.ISBN 0-8186-6607-2, 0-8186-6605-6, 0-8186-6606-4. ISSN1063-9535. LCCN QA76.5.S894 1994. IEEE catalognumber 94CH34819.

Cooperman:2003:UTC

[CCHW03] Gene Cooperman, HenriCasanova, Jim Hayes, andThomas Witzel. Using TOP-C and AMPIC to port largeparallel applications to theComputational Grid. FutureGeneration Computer Sys-tems, 19(4):587–596, May2003. CODEN FGSEVI.ISSN 0167-739X (print),1872-7115 (electronic).

Casas:1995:MMT

[CCK+95] Jeremy Casas, Dan L. Clark,Ravi Konuru, Steve W.Otto, Robert M. Prouty, andJonathan Walpole. MPVM:a migration transparent ver-sion of PVM. Computing

REFERENCES 137

systems: the journal of theUSENIX Association, 8(2):171–216, Spring 1995. CO-DEN CMSYE2. ISSN 0895-6340.Collingbourne:2012:STO

[CCK12] Peter Collingbourne, Cris-tian Cadar, and Paul H. J.Kelly. Symbolic testing ofOpenCL code. Lecture Notesin Computer Science, 7261:203–218, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-34188-5_18/.

Costa:2006:ROA

[CCM+06] J. J. Costa, T. Cortes,X. Martorell, E. Ayguade,and J. Labarta. RunningOpenMP applications effi-ciently on an everything-shared SDSM. Journalof Parallel and DistributedComputing, 66(5):647–658,May 2006. CODEN JPD-CER. ISSN 0743-7315(print), 1096-0848 (elec-tronic).

Chen:2012:PUA

[CCM12] Yifeng Chen, Xiang Cui,and Hong Mei. PARRAY:a unifying array representa-tion for heterogeneous paral-lelism. ACM SIGPLAN No-tices, 47(8):171–180, August2012. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160

(electronic). PPOPP ’12conference proceedings.

Clematis:1997:DNL

[CCSM97] A. Clematis, A. Coda,M. Spagnuolo, and M. Mineter.Developing non-local itera-tive parallel algorithms forGIS on Cray T3D usingMPI. Lecture Notes inComputer Science, 1332:435–442, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Chamaret:1995:PFE

[CCU95] B. Chamaret, H. Cherefi,and S. Ubeda. Parallel fil-ter estimation maximisationalgorithm for segmentationon a LAN of workstation.In Bailey et al. [BBG+95],pages 68–69. ISBN 0-89871-344-7. LCCN QA76.58.S551995.

Coulaud:1996:EIP

[CD96] O. Coulaud and E. Dil-lon. Early implementationof Para++ with MPI-2. InIEEE [IEE96i], pages 95–101. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

Cunha:1998:MPP

[CD98] J. C. Cunha and V. Duarte.Monitoring PVM programsusing the DAMS approach.Lecture Notes in ComputerScience, 1497:273–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

REFERENCES 138

Cotronis:2001:RAP

[CD01] Yiannis Cotronis and J. J.Dongarra, editors. Recentadvances in parallel virtualmachine and message pass-ing interface: 8th EuropeanPVM/MPI Users’ GroupMeeting, Santorini/Thera,Greece, September 23–26,2001: proceedings, volume2131 of Lecture Notes inComputer Science and Lec-ture Notes in Artificial In-telligence. Springer-Verlag,Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2001.ISBN 3-540-42609-4 (paper-back). LCCN QA76.58E975 2001; QA267.A1 L43no.2131. URL http://link.springer-ny.com/link/service/series/0558/tocs/t2131.htm.

Clemencon:1996:THM

[CDD+96] C. Clemencon, K. M.Decker, V. R. Deshpande,A. Endo, J. Fritscher,P. A. R. Lorenzo, N. Ma-suda, A. Muller, R. Ruhl,W. Sawyer, B. J. N. Wylie,and F. Zimmermann. Tools-supported HPF and MPIparallelization of the NASparallel benchmarks. InIEEE [IEE96c], pages 309–318. ISBN 0-8186-7551-9. LCCN QA76.58 .S951996. IEEE catalog number96TB100062.

Cao:2013:CHP

[CDD+13] Chongxiao Cao, Jack Don-garra, Peng Du, Mark Gates,Piotr Luszczek, and Stan-imire Tomov. clMAGMA:High performance dense lin-ear algebra with OpenCL.LAPACK Working Note275, Department of Com-puter Science, Universityof Tennessee, Knoxville,Knoxville, TN 37996, USA,March 2013. URL http://www.netlib.org/lapack/lawnspdf/lawn275.pdf.

Conforti:1996:PIA

[CdGM96] D. Conforti, L. de Luca,L. Grandinetti, and R. Mus-manno. A parallel imple-mentation of automatic dif-ferentiation for partially sep-arable functions using PVM.Parallel Computing, 22(5):643–656, August 8, 1996.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1996&volume=22&issue=5&aid=1065.

Cownie:1994:PPP

[CDH+94] J. Cownie, A. Dunlop,S. Hellberg, A. J. G.Hey, and D. Pritchard.Portable parallel program-ming environments-the ES-PRIT PPPE project. InDekker et al. [DSZ94], pages

REFERENCES 139

135–142. ISBN 0-444-81784-0. LCCN QA76.58.E98 1994.

Chang:1995:EPCb

[CDH+95] Sheue-Ling Chang, DavidHung-Chang Du, JenweiHsieh, Rose P. Tsang, andMengjou Lin. EnhancedPVM communications overa High-Speed LAN. IEEEparallel and distributed tech-nology: systems and applica-tions, 3(3):20–32, Fall 1995.CODEN IPDTEX. ISSN1063-6552 (print), 1558-1861(electronic).

Chang:1995:EPCa

[CDHL95] S.-L. Chang, D. H. C. Du,J. Hsieh, and M. Lin. En-hanced PVM communica-tions over a high-speed localarea network. In Alnuweiriand Hamdi [AH95], pages37–46. ISBN 0-8186-7124-6.LCCN TK5105.5 .H56 1995.

Casanova:1995:PPM

[CDJ95] Henri Casanova, Jack Don-garra, and Weicheng Jiang.The performance of PVMon MPP systems. Techni-cal report, University of Ten-nessee, Knoxville, Knoxville,TN 37996, USA, August1995. URL http://www.netlib.org/utk/papers/pvmmpp.ps; http://www.netlib.org/utk/papers/pvmmpp/pvmmpp.html; http://www.netlib.org/utk/people/JackDongarra/pdf/pvmmpp.pdf.

Chandra:2001:PPO

[CDK+01] Rohit Chandra, LeonardoDagum, David Kohr, DrorMaydan, Jeff McDonald,and Ramesh Menon. ParallelProgramming in OpenMP.Morgan Kaufmann Publish-ers, Los Altos, CA 94022,USA, 2001. ISBN 1-55860-671-8. xvi + 230pp. LCCN QA76.642.P38 2001. US$39.95.URL http://www.mkp.com/books_catalog/catalog.asp?ISBN=1-55860-671-8.

Colombet:1993:SMI

[CDM93] L. Colombet, L. Desbat, andF. Menard. Star modelingon IBM RS6000 networks us-ing PVM. In IEEE [IEE93c],pages 121–128. ISBN 0-8186-3900-8, 0-8186-3901-6.LCCN QA76.9.D5I593 1993.IEEE catalog no. 93TH0550-4.

Casanova:2015:SMA

[CDMS15] Henri Casanova, FredericDesprez, George S. Marko-manolis, and Frederic Suter.Simulation of MPI applica-tions with time-independenttraces. Concurrency andComputation: Practice andExperience, 27(5):1145–1168,April 10, 2015. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Cotronis:2011:RAM

[CDND11] Yiannis Cotronis, Anthony

REFERENCES 140

Danalis, Dimitrios S. Nikolopou-los, and Jack Dongarra,editors. Recent Advancesin the Message PassingInterface: 18th EuropeanMPI Users’ Group Meeting,EuroMPI 2011, Santorini,Greece, September 18–21,2011. Proceedings, volume6960 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2011. CO-DEN LNCSD9. ISBN 3-642-24448-3 (print), 3-642-24449-1 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-642-24449-0.

Chaussumier:1999:ACM

[CDP99] F. Chaussumier, F. Desprez,and L. Prylli. Asynchronouscommunications in MPI —the BIP/Myrinet approach.In Dongarra et al. [DLM99],pages 485–492. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Coll:2003:SHB

[CDPM03] Salvador Coll, Jose Duato,Fabrizio Petrini, and Fran-cisco J. Mora. Scalablehardware-based multicasttrees. In ACM [ACM03],page ?? ISBN 1-58113-695-1. LCCN ???? URL http:/

/www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10702#2; http://www.sc-conference.org/sc2003/paperpdfs/pap300.pdf.

Ceron:1998:PID

[CDZ+98] C. Ceron, J. Dopazo, E. L.Zapata, J. M. Carazo, andO. Trelles. Parallel imple-mentation of DNAml pro-gram on message-passing ar-chitectures. Parallel Com-puting, 24(5–6):701–716,June 1, 1998. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.elsevier.com/cas/tree/store/parco/sub/1998/24/5-6/1279.pdf.

Cappello:2000:MVM

[CE00] Franck Cappello and DanielEtiemble. MPI versusMPI+OpenMP on the IBMSP for the NAS Bench-marks. In ACM [ACM00],page 51. URL http://www.sc2000.org/proceedings/techpapr/papers/pap214.pdf.

Clemencon:1995:AEP

[CEF+95] C. Clemencon, A. Endo,J. Fritscher, A. Muller,R. Ruhl, and B. J. N. Wylie.The ’annai’ environment forportable distributed parallelprogramming. In El-Rewiniand Shriver [ERS95], pages

REFERENCES 141

242–251 (vol. 2). ISBN 0-8186-6935-7. LCCN ????

Chau:2007:MIP

[CEGS07] Ming Chau, Didier El Baz,Ronan Guivarch, and PierreSpiteri. MPI implementationof parallel subdomain meth-ods for linear and nonlinearconvection–diffusion prob-lems. Journal of Paralleland Distributed Computing,67(5):581–591, May 2007.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Cerin:1999:DMP

[Cer99] C. Cerin. Differentiatingmessage passing interfaceand bulk synchronous par-allel computation models.Lecture Notes in ComputerScience, 1662:477–??, 1999.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Chen:2001:FFT

[CF01] Qun Chen and Michael C.Ferris. FATCOP: a fault tol-erant Condor–PVM mixedinteger programming solver.SIAM Journal on Opti-mization, 11(4):1019–1036,March/May 2001. CODENSJOPE8. ISSN 1052-6234(print), 1095-7189 (elec-tronic). URL http://epubs.siam.org/sam-bin/dbq/article/35391.

Chen:2001:TMK

[CFDL01] Yu Chen, Qian Fang, Zhi-hui Du, and Sanli Li. TH-MPI: OS kernel integratedfault tolerant MPI. Lec-ture Notes in ComputerScience, 2131:75–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310075.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310075.pdf.

Choudhary:1994:LCR

[CFF+94] Alok Choudhary, Ian Foster,Geoffrey Fox, Ken Kennedy,Carl Kesselman, CharlesKoelbel, Joel Saltz, andMarc Snir. Languages, com-pilers, and runtime systemssupport for parallel input-output, 1994. URL http://www.ccsf.caltech.edu/SIO/SIO.html. ScalableI/O Initiative Working Pa-per Number 3. On WWW athttp://www.ccsf.caltech.edu/SIO/SIO.html.

Corbett:1996:OMP

[CFF+96] P. Corbett, D. Feitelson,S. Fineberg, Yarsun Hsu,B. Nitzberg, J.-P. Prost,M. Snir, B. Traversat, andParkson Wong. Overviewof the MPI-IO parallel I/Ointerface. In Jain et al.[JWB96], pages 127–146.

REFERENCES 142

ISBN 0-7923-9735-5. LCCNQA76.58.I485 1996.

Carpenter:2000:OSM

[CFKL00] Bryan Carpenter, GeoffreyFox, Sung Hoon Ko, andSang Lim. Object se-rialization for marshalingdata in a Java interface toMPI. Concurrency: prac-tice and experience, 12(7):539–553, May 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/72516217/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=72516217&PLACEBO=IE.pdf.

Clemencon:1995:IRD

[CFMR95] C. Clemencon, J. Fritscher,M. J. Meehan, and R. Ruhl.An implementation of racedetection and determinis-tic replay with MPI. InHaridi et al. [HAM95b],pages 155–166. ISBN 3-540-60247-X. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.I5531995.

Cotronis:1996:ECP

[CFP96] J. Y. Cotronis, E. Floros,and N. Papazis. Efficientcomposition of PVM pro-grams. In Liddell et al.[LCHS96], pages 919–??ISBN 3-540-61142-8 (paper-back). LCCN QA76.88 .H521996.

Clematis:1995:PPH

[CFPS95] A. Clematis, B. Falcidieno,D. F. Prieto, and M. Spag-nuolo. Parallel process-ing on heterogeneous net-works for GIS applications.In Hertzberger and Ser-azzi [HS95a], pages 67–72.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Chandrasekharan:1993:RTB

[CG93] N. Chandrasekharan andV. Goel. Ray tracing and bi-nary tree computations us-ing PVM. In Mudge et al.[MMH93], pages 104–105(vol. 2). ISBN 0-8186-3230-5. LCCN ???? Four vol-umes. IEEE catalog number93TH0501-7.

Clematis:1996:CEP

[CG96] A. Clematis and V. Gi-anuzzi. CPVM — extendingPVM for consistent check-pointing. In IEEE [IEE96g],pages 67–76. ISBN 0-8186-7376-1. LCCN QA76.58 .E971996. IEEE order numberPR07376.

Clematis:1999:EPC

[CG99a] A. Clematis and V. Gi-anuzzi. Extending PVMwith consistent cut capabil-ities: Application aspectsand implementation strate-gies. In Dongarra et al.[DLM99], pages 101–108.ISBN 3-540-66549-8 (soft-

REFERENCES 143

cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Cownie:1999:SID

[CG99b] J. Cownie and W. Gropp.A standard interface for de-bugger access to messagequeue information in MPI.In Dongarra et al. [DLM99],pages 51–58. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Chaudhuri:2010:PIC

[CGB+10] Pranay Chaudhuri, Suku-mar Ghosh, Raj KumarBuyya, Jian-Nong Cao, andOeepak Oahiya, editors.Proceedings of the 20101st International Conferenceon Parallel Distributed andGrid Computing (PDGC),Jaypee University of In-formation Technology Wak-naghat, Solan, HP, India,28–30 October, 2010. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 2010. ISBN 1-4244-7675-5. LCCN ????

Carretero:2015:AMM

[CGBS+15] Jesus Carretero, JavierGarcia-Blas, David E. Singh,Florin Isaila, Alexey Las-tovetsky, Thomas Fahringer,Radu Prodan, Peter Zangerl,Christi Symeonidou, Af-shin Fassihi, and Horacio

Perez-Sanchez. Accelera-tion of MPI mechanismsfor sustainable HPC ap-plications. Supercomput-ing Frontiers and Innova-tions, 2(2):28–45, ???? 2015.CODEN ???? ISSN2409-6008 (print), 2313-8734(electronic). URL http://superfri.org/superfri/article/view/35.

Calderon:2002:IMI

[CGC+02] Alejandro Calderon, FelixGarcıa, Jesus Carretero,Jose M. Perez, and JavierFernandez. An implemen-tation of MPI-IO on ex-pand: a parallel file sys-tem based on NFS servers.Lecture Notes in ComputerScience, 2474:306–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740306.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740306.pdf.

Camp:2011:SIU

[CGC+11] David Camp, ChristophGarth, Hank Childs, DavePugmire, and Kenneth I.Joy. Streamline integra-tion using MPI-hybrid par-allelism on a large multicorearchitecture. IEEE Trans-actions on Visualization andComputer Graphics, 17(11):1702–1713, November 2011.

REFERENCES 144

CODEN ITVGEA. ISSN1077-2626 (print), 1941-0506(electronic), 2160-9306.

Carter:2010:PLN

[CGG10] John D. Carter, William B.Gardner, and Gary Grewal.The Pilot library for noviceMPI programmers. ACMSIGPLAN Notices, 45(5):351–352, May 2010. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Clarke:1994:MMP

[CGH94] L. Clarke, I. Glendinning,and R. Hempel. The MPIMessage Passing InterfaceStandard. In Decker andRehmann [DR94], pages213–218. ISBN 0-8176-5090-3 (Boston), 3-7643-5090-3 (Basel). LCCNQA76.58.P767 1994.

Cunningham:2014:RXE

[CGH+14] David Cunningham, DavidGrove, Benjamin Herta,Arun Iyengar, KiyokuniKawachiya, Hiroki Mu-rata, Vijay Saraswat, MikioTakeuchi, and Olivier Tardieu.Resilient X10: efficientfailure-aware programming.ACM SIGPLAN Notices,49(8):67–80, August 2014.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Carpenter:2000:MML

[CGJ+00] Bryan Carpenter, VladimirGetov, Glenn Judd, An-thony Skjellum, and Ge-offrey Fox. MPJ: MPI-like message passing forJava. Concurrency: practiceand experience, 12(11):1019–1038, September 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76000188/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76000188&PLACEBO=IE.pdf.

Catanzaro:2011:CCE

[CGK11] Bryan Catanzaro, MichaelGarland, and Kurt Keutzer.Copperhead: compiling anembedded data parallel lan-guage. ACM SIGPLAN No-tices, 46(8):47–56, August2011. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). PPoPP ’11Conference proceedings.

Chapman:2011:OPE

[CGKM11] Barbara M. Chapman, William D.Gropp, Kalyan Kumaran,and Matthias S. Muller,editors. OpenMP in thePetascale Era: 7th In-ternational Workshop onOpenMP, IWOMP 2011,Chicago, IL, USA, June 13–15, 2011. Proceedings, vol-ume 6665 of Lecture Notes in

REFERENCES 145

Computer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2011. CO-DEN LNCSD9. ISBN 3-642-21486-X (print), 3-642-21487-8 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-642-21487-5.

Chatterjee:1993:GLA

[CGL+93] S. Chatterjee, J. R. Gilbert,F. J. E. Long, R. Schreiber,and S.-H. Teng. Generat-ing local addresses and com-munication sets for data-parallel programs. ACMSIGPLAN Notices, 28(7):149–158, July 1993. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Caubet:2001:DTM

[CGLD01] Jordi Caubet, Judit Gimenez,Jesus Labarta, and LuizDeRose. A dynamic trac-ing mechanism for perfor-mance analysis of OpenMPapplications. Lecture Notesin Computer Science, 2104:53–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040053.htm;http://link.springer-

ny.com/link/service/series/

0558/papers/2104/21040053.pdf.

Chan:1998:PCT

[CGPR98] K. J. Chan, A. M. Gib-bons, M. Pias, and W. Ryt-ter. On the PVM compu-tations of transitive closureand algebraic path problems.Lecture Notes in ComputerScience, 1497:338–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Cecilia:2012:CSC

[CGU12] Jose Marıa Cecilia, Jose ManuelGarcıa, and Manuel Ujaldon.CUDA 2D stencil com-putations for the Jacobimethod. Lecture Notes inComputer Science, 7133:173–183, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-28151-8_17/.

Chen:2013:IRM

[CGZQ13] Zhezhe Chen, Qi Gao, Wen-bin Zhang, and Feng Qin.Improving the reliability ofMPI libraries via messageflow checking. IEEE Trans-actions on Parallel and Dis-tributed Systems, 24(3):535–549, March 2013. CODENITDSEO. ISSN 1045-9219.

Cheng:1994:PDP

[CH94] D. Cheng and R. Hood. Aportable debugger for par-

REFERENCES 146

allel and distributed pro-grams. In IEEE [IEE94h],pages 723–732. ISBN 0-8186-6607-2, 0-8186-6605-6,0-8186-6606-4. ISSN 1063-9535. LCCN QA76.5 .S8941994. IEEE catalog number94CH34819.

Ciancarini:1996:CLM

[CH96] Paolo Ciancarini and ChrisHankin, editors. Coordina-tion languages and models:First International Confer-ence COORDINATION ’96,Cesena, Italy, April 15–17,1996: proceedings, number1061 in Lecture Notes inComputer Science. Spring-er-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 1996.ISBN 3-540-61052-9. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I52 1996.

Charny:1996:MPV

[Cha96] B. Charny. Matrix partition-ing on a virtual shared mem-ory parallel machine. IEEETransactions on Paralleland Distributed Systems,7(4):343–355, April 1996.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic).

Chapman:2002:PAD

[Cha02] Barbara Chapman. Par-allel application develop-ment with the hybrid MPI+ OpenMP programming

model. Lecture Notes inComputer Science, 2474:13–??, 2002. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740013.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740013.pdf.

Chapman:2005:SMP

[Cha05] Barbara M. Chapman, edi-tor. Shared memory parallelprogramming with OpenMP:5th International Workshopon OpenMP Applicationsand Tools, WOMPAT 2004,Houston, TX, USA, May17–18, 2004: Revised se-lected papers, volume 3349of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2005. CO-DEN LNCSD9. ISBN 3-540-24560-X. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76 .A1L42 NO.3349. URL http://www.springerlink.com/openurl.asp?genre=issue&issn=0302-9743&volume=3349; http://www.springerlink.com/openurl.asp?genre=volume&id=doi:10.1007/b105895.

Cappello:2007:RAP

[CHD07] Franck Cappello, ThomasHerault, and Jack Dongarra,

REFERENCES 147

editors. Recent Advancesin Parallel Virtual Machineand Message Passing Inter-face: 14th European PVM/MPI User’s Group Meet-ing, Paris, France, Septem-ber 30 — October 3, 2007.Proceedings, volume 4757of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2007. CO-DEN LNCSD9. ISBN 3-540-75415-6 (print), 3-540-75416-4 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-540-75416-9.

Cappello:2009:FSI

[CHD09] Franck Cappello, ThomasHerault, and Jack Don-garra. Foreword: Specialissue: selected papers fromthe 14th European PVM/MPI Users Group Meeting.Parallel Computing, 35(12):571, 2009. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). Held in Paris,September 30–October 3,2007.

Chergui:1999:UPP

[Che99] J. Chergui. Using PMDto parallel solve large-scaleNavier–Stokes equations.performance analysis onSGI/CRAY-T3E machine.

In Dongarra et al. [DLM99],pages 341–348. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Cheng:2010:BRBb

[Che10] Jie Cheng. Book re-view: CUDA by Exam-ple: An Introduction toGeneral-Purpose GPU Pro-gramming, by Jason Sandersand Edward Kandrot, ISBN-13 978-0-13-138768-3. Scal-able Computing: Prac-tice and Experience, 11(4):401, December 2010.CODEN ???? ISSN1895-1767. URL http://www.scpe.org/index.php/scpe/article/view/663.See [SK10].

Cho:2015:OAO

[CHKK15] Myeongjin Cho, YoungsunHan, Minseong Kim, andSeon Wook Kim. O2WebCL:an automatic OpenCL-to-WebCL translator for highperformance web comput-ing. The Journal of Su-percomputing, 71(6):2050–2065, June 2015. CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-014-1260-4.

Chapman:2001:PDE

[CHPP01] B. Chapman, O. Hernan-dez, A. Patil, and A. Prab-hakar. Program devel-

REFERENCES 148

opment environment forOpenMP programs on cc-NUMA architectures. Lec-ture Notes in Computer Sci-ence, 2179:210–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2179/21790210.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2179/21790210.pdf.

Cho:2010:OPP

[CIJ+10] S. M. Cho, D. W. Im, O. Y.Jang, H. J. Song, B. D.Paulovicks, V. Sheinin, andH. Yeo. OpenCL and paral-lel primitives for digital TVapplications. IBM Journal ofResearch and Development,54(5):7:1–7:14, ???? 2010.CODEN IBMJAE. ISSN0018-8646 (print), 2151-8556(electronic).

Cook:1995:TAS

[CJNW95] B. M. Cook, M. R. Jane,P. Nixon, and P. M. Welch,editors. Transputer Applica-tions and Systems ’95. Pro-ceedings of the 1995 WorldTransputer Congress, 4–6September 1995, Harrogate,North Yorkshire, UK. IOSPress, Postal Drawer 10558,Burke, VA 2209-0558, USA,1995. ISBN 90-5199-235-1(IOS Press), 4-274-90062-2(Ohmsha). LCCN ????

Czarnul:1999:DAP

[CK99] P. Czarnul and H. Krawczyk.Dynamic assignment withprocess migration in dis-tributed environments. InDongarra et al. [DLM99],pages 509–516. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Chang:2016:DLD

[CKmWH16] Li-Wen Chang, Hee-SeokKim, and Wen mei W. Hwu.DySel: Lightweight dy-namic selection for kernel-based data-parallel program-ming model. ACM SIG-PLAN Notices, 51(4):667–680, April 2016. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Casas:1994:ALM

[CKO+94] J. Casas, R. Konuru,S. W. Otto, R. Prouty,and J. Walpole. Adaptiveload migration systems forPVM. In IEEE [IEE94h],pages 390–399. ISBN 0-8186-6607-2, 0-8186-6605-6, 0-8186-6606-4. ISSN1063-9535. LCCN QA76.5.S894 1994. URL http://sc94.ameslab.gov/AP/contents.html. IEEE cata-log number 94CH34819.

Culler:1993:LTR

[CKP+93] David E. Culler, Richard M.Karp, David A. Patterson,

REFERENCES 149

Abhijit Sahay, Klaus E.Schauser, Eunice Santos,Ramesh Subramonian, andThorsten von Eicken. LogP:towards a realistic model ofparallel computation. ACMSIGPLAN Notices, 28(7):1–12, July 1993. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Castro-Leon:1993:MCP

[CL93] E. Castro-Leon. A modelof computation with par-allel solvers. In Anony-mous [Ano93f], pages 189–198. ISBN ???? LCCN ????

Clark:1998:FOP

[Cla98] David Clark. Focus:OpenMP: a parallel stan-dard for the masses. IEEEConcurrency, 6(1):10–12,January/March 1998. CO-DEN IECMFX. ISSN1092-3063 (print), 1558-0849(electronic). URL http://dlib.computer.org/pd/books/pd1998/pdf/p1010.pdf.

Chabbi:2015:BEP

[CLdJ+15] Milind Chabbi, Wim Lavri-jsen, Wibe de Jong, KoushikSen, John Mellor-Crummey,and Costin Iancu. Barrierelision for production paral-lel programs. ACM SIG-PLAN Notices, 50(8):109–119, August 2015. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Chen:2003:GMD

[CLL03] L. Chen, C. LiWang, andF. C. M. Lau. A grid mid-dleware for distributed Javacomputing with MPI bind-ing and process migrationsupports. Journal of com-puter science and technology,18(4):505–514, 2003. CO-DEN JCTEEM. ISSN 1000-9000.

Corbacho-Lozano:1999:EDD

[CLLASPDP99] J. Corbacho-Lozano, O.-I. Lepe-Aldama, J. Sole-Pareta, and J. Domingo-Pascual. Experiences de-ploying a distributed paral-lel processing environmentover a broadband multiser-vice network. In Dongarraet al. [DLM99], pages 477–484. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Cantoni:1995:CCA

[CLM+95] Virginio Cantoni, L. Lom-bardi, M. Mosconi, M. Savini,and A. Setti, editors. CAMP’95, computer architec-tures for machine percep-tion: proceedings, Septem-ber 18–20, 1995, Como,Italy. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1995. ISBN 0-8186-7134-3. LCCN QA76.9.A73W675

REFERENCES 150

1995. IEEE catalog no.95TB8093.

Chien:1999:DEH

[CLP+99] A. Chien, M. Lauria, R. Pen-nington, M. Showerman,G. Iannello, M. Buchanan,K. Connelly, L. Giannini,G. Koenig, S. Krishna-murthy, Q. Liu, S. Pakin,and G. Sampemane. Designand evaluation of an HPVM-based Windows NT super-computer. The Interna-tional Journal of High Per-formance Computing Appli-cations, 13(3):201–219, Fall1999. CODEN IHPCFL.ISSN 1094-3420 (print),1741-2846 (electronic).

Chandra:2007:ESP

[CLSP07] Sumir Chandra, Xiaolin Li,Taher Saif, and ManishParashar. Enabling scal-able parallel implementa-tions of structured adap-tive mesh refinement ap-plications. The Journalof Supercomputing, 39(2):177–203, February 2007.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=39&issue=2&spage=177.

Chang:2016:APC

[CLYC16] Chih-Hung Chang, Chih-Wei Lu, Chao-Tung Yang,and Tzu-Chieh Chang. An

approach of performancecomparisons with OpenMPand CUDA parallel pro-gramming on multicore sys-tems. Concurrency andComputation: Practice andExperience, 28(16):4230–4245, November 2016. CO-DEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Chapman:1998:OHI

[CM98] B. Chapman and P. Mehro-tra. OpenMP and HPF:Integrating two paradigms.Lecture Notes in ComputerScience, 1470:650–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Chapman:2005:O

[CM05] Barbara M. Chapman andFederico Massaioli. OpenMP.Parallel Computing, 31(10–12):957–959, October/December 2005. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic).

Claver:1999:PCS

[CMH99] J. M. Claver, M. Mollar,and V. Hernandez. Paral-lel computation of the SVDof a matrix product. InDongarra et al. [DLM99],pages 388–395. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

REFERENCES 151

Cahir:2000:PMM

[CMK00] Margaret Cahir, RobertMoench, and Alice E.Koniges. Programmingmodels and methods. InKoniges [Kon00], chapter 3,pages 27–54. ISBN 1-55860-540-1. LCCN QA76.58.I483 2000. DiscussesPVM, MPI, SHMEM, High-Performance Fortran, andPOSIX threads.

Corbalan:2004:PMD

[CML04] Julita Corbalan, XavierMartorell, and Jesus Labarta.Page migration with dy-namic space-sharing schedul-ing policies: The case ofthe SGI O2000. Inter-national Journal of Paral-lel Programming, 32(4):263–288, August 2004. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=32&issue=4&spage=263.

Carson:2003:CGU

[CMM03] Brett Carson, Robert Muri-son, and Ian A. Mason.Computational gains usingRPVM on a Beowulf clus-ter. R News: the Newslet-ter of the R Project, 3(1):21–26, June 2003. CO-DEN ???? ISSN 1609-3631. URL http://CRAN.R-project.org/doc/Rnews/.

Chapman:2012:OHW

[CMMR12] Barbara M. Chapman, Fed-erico Massaioli, Matthias S.Muller, and Marco Rorro,editors. OpenMP in a Het-erogeneous World: 8th In-ternational Workshop onOpenMP, IWOMP 2012,Rome, Italy, June 11–13,2012. Proceedings, volume7312 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2012. CO-DEN LNCSD9. ISBN 3-642-30960-7 (print), 3-642-30961-5 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-642-30961-8.

Campanai:1994:EAS

[CMV+94] M. Campanai, O. Morales,S. Viti, R. Trotta, P. Vil-iani, and M. Lo Moro. Ex-periences assessing softwaretesting activities: the adop-tion of PVM, a predictionand validation model. InAnonymous [Ano94i], pages491–500. ISBN 3-7281-2153-3. LCCN ????

Chou:2010:CMI

[CNC10] Yu-Cheng Chou, Stephen S.Nestinger, and Harry H.Cheng. Ch MPI: Inter-pretive parallel computingin C. Computing in Sci-

REFERENCES 152

ence and Engineering, 12(2):54–67, March/April 2010.CODEN CSENFA. ISSN0740-7475 (print), 1558-1918(electronic).

Chalkidis:2011:HPH

[CNM11] Georgios Chalkidis, MasaoNagasaki, and Satoru Miyano.High performance hybridfunctional Petri net simu-lations of biological path-way models on CUDA.IEEE/ACM Transactionson Computational Biologyand Bioinformatics, 8(6):1545–1556, November 2011.CODEN ITCBCY. ISSN1545-5963 (print), 1557-9964(electronic).

Coelho:1994:EHC

[Coe94] F. Coelho. Experimentswith HPF compilation fora network of worksta-tions. In Gentzsch andHarms [GH94], pages 423–428. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Cooperman:1995:SBP

[Coo95a] G. Cooperman. STAR/MPI:binding a parallel library tointeractive symbolic algebrasystems. In Levelt [Lev95],pages 126–132. ISBN 0-89791-699-9. LCCN QA76.95 I59 1995.

Cooperman:1995:SMB

[Coo95b] Gene Cooperman. STAR/MPI: Binding a parallel li-brary to interactive symbolicalgebra systems. In Lev-elt [Lev95], pages 126–132.ISBN 0-89791-699-9. LCCNQA 76.95 I59 1995.

Cotronis:1997:MPP

[Cot97] J. Y. Cotronis. Message-passing program develop-ment by ensemble. Lec-ture Notes in Computer Sci-ence, 1332:242–249, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Cotronis:1998:DMP

[Cot98] Y. Cotronis. Develop-ing message-passing appli-cations on MPICH underensemble. Lecture Notesin Computer Science, 1497:145–??, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Cotronis:2004:CMP

[Cot04] Yiannis Cotronis. Compo-sition of Message PassingInterface applications overMPICH-G2. The Interna-tional Journal of High Per-formance Computing Ap-plications, 18(3):327–339,Fall 2004. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.

REFERENCES 153

sagepub.com/content/18/3/327.full.pdf+html.

Coussement:1993:PMO

[Cou93] G. Coussement. Paralleliza-tion of a mesh optimizationcode on a RS/ 6000 clus-ter. In Anonymous [Ano93e],pages 185–212. ISBN ????ISSN 0254-6213. LCCN ????

Carvalho:1997:PCC

[CP97] L. M. R. Carvalho and J. M.L. M. Palma. Paralleliza-tion of a CFD code usingPVM and domain decom-position techniques. Lec-ture Notes in Computer Sci-ence, 1215:247–??, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Carissimi:1998:AEM

[CP98] A. Carissimi and M. Pasin.Athapascan: An experienceon mixing MPI communi-cations and threads. Lec-ture Notes in Computer Sci-ence, 1497:137–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Cercos-Pita:2015:ANF

[CP15] J. L. Cercos-Pita. AQUAg-pusph, a new free 3DSPH solver accelerated withOpenCL. Computer PhysicsCommunications, 192(??):295–312, July 2015. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944

(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465515000909.

Corno:1995:PTA

[CPR+95] F. Corno, P. Prinetto,M. Rebaudengo, M. SonzaReorda, and E. Veiluva. APVM tool for automatictest generation on paral-lel and distributed systems.In Hertzberger and Ser-azzi [HS95a], pages 39–44.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

ChassindeKergommeaux:1999:MER

[CRD99] J. Chassin de Kergom-meaux, M. Ronsse, andK. De Bosschere. MPL0*:Efficient record/replay ofnondeterministic features ofmessage passing libraries. InDongarra et al. [DLM99],pages 141–148. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Cappello:1999:PNB

[CRE99] F. Cappello, O. Richard, andD. Etiemble. Performanceof the NAS benchmarks ona cluster of SMP PCs usinga parallelization of the MPIprograms with OpenMP.Lecture Notes in ComputerScience, 1662:339–??, 1999.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

REFERENCES 154

Cappello:2001:UPS

[CRE01] Franck Cappello, OlivierRichard, and Daniel Etiem-ble. Understanding per-formance of SMP clus-ters running MPI programs.Future Generation Com-puter Systems, 17(6):711–720, April 2001. CODENFGSEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic). URL http://www.elsevier.com/gej-ng/10/19/19/45/33/30/abstract.html.

Cores:2014:FAM

[CRGM14] Ivan Cores, Gabriel Rodrıguez,Patricia Gonzalez, andMarıa J. Martın. Failureavoidance in MPI applica-tions using an application-level approach. The Com-puter Journal, 57(1):100–114, January 2014. CO-DEN CMPJA6. ISSN0010-4620 (print), 1460-2067(electronic). URL http://comjnl.oxfordjournals.org/content/57/1/100.full.pdf+html.

Cores:2016:ROM

[CRGM16] Ivan Cores, Monica Rodrıguez,Patricia Gonzalez, andMarıa J. Martın. Reduc-ing the overhead of an MPIapplication-level migrationapproach. Parallel Comput-ing, 54(??):72–82, May 2016.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http:/

/www.sciencedirect.com/science/article/pii/S0167819116000429.

Cores:2014:MAL

[CRM14] Ivan Cores, Gabriel Rodrıguez,and Marıa J. Martın. In-memory application-levelcheckpoint-based migrationfor MPI programs. TheJournal of Supercomput-ing, 70(2):660–670, Novem-ber 2014. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-014-1120-2.

Ciampolini:1996:EPM

[CS96] A. Ciampolini and C. Ste-fanelli. Extending PVM toa massively parallel archi-tecture. Future GenerationComputer Systems, 12(1):13–23, May 1996. CODENFGSEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic).

Coole:2014:FFH

[CS14] James Coole and GregStitt. Fast, flexible high-level synthesis from OpenCLusing reconfiguration con-texts. IEEE Micro, 34(1):42–53, January/February2014. CODEN IEMIDZ.ISSN 0272-1732.

Chetlur:1998:ALE

[CSAGR98] M. Chetlur, G. D. Sharma,N. Abu-Ghazaleh, andU. K. V. Rajasekaran. An

REFERENCES 155

active layer extension toMPI. Lecture Notes in Com-puter Science, 1497:97–??,1998. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

Clement:1996:NPM

[CSC96] Mark J. Clement, Michael R.Steed, and Phyllis E. Cran-dall. Network performancemodeling for PVM clus-ters. In ACM [ACM96c],page ?? ISBN 0-89791-854-1. LCCN QA 76.88S8573 1996. URL http://www.supercomp.org/sc96/proceedings/SC96PROC/CLEMENT/INDEX.HTM. ACM OrderNumber: 415962, IEEEComputer Society Press Or-der Number: RS00126.

Cavenaghi:1996:UPS

[CSPM+96] M. A. Cavenaghi, R. Spolon,J. E. M. Perea-Martins,S. G. Domingues, andA. Garcia Neto. UsingPVM in the simulation ofa hybrid dataflow archi-tecture. In Bode et al.[BDLS96], pages 343–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Carreira:1995:DEL

[CSS95] J. Carreira, L. Silva, andJ. G. Silva. On the design ofEilean: a Linda-like libraryfor MPI. In IEEE [IEE95j],pages 175–184. ISBN 0-8186-

6895-4. LCCN QA76.58 .S341994.

Chevitarese:2012:STN

[CSV12] Daniel Salles Chevitarese,Dilza Szwarcman, and Mar-ley Vellasco. Speeding upthe training of neural net-works with CUDA tech-nology. Lecture Notes inComputer Science, 7267:30–38, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-29347-4_4/.

Ciegis:1997:NID

[CSW97] R. Ciegis, R. Sablinskas, andJ. Wasniewski. Numericalintegration on distributed-memory parallel systems.Lecture Notes in ComputerScience, 1332:329–336, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Ciegis:1999:HDA

[CSW99] R. Ciegis, R. Sablinskas,and J. Wasniewski. Hyper-rectangle distribution algo-rithm for parallel multidi-mensional numerical integra-tion. In Dongarra et al.[DLM99], pages 275–282.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

REFERENCES 156

Calotoiu:2012:PID

[CSW12] Alexandru Calotoiu, Chris-tian Siebert, and Felix Wolf.Pattern-independent detec-tion of manual collectivesin MPI programs. LectureNotes in Computer Science,7484:28–39, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-32820-6_5/.

Cote:1994:PSA

[CT94a] J. Cote and S. J. Thomas.Parallel semi-Lagrangian ad-vection on the sphere usingPVM. In Pierce and Reg-nier [PR94b], pages 470–477.ISBN 0-8186-5680-8, 0-8186-5681-6. LCCN QA76.58.S321994. IEEE catalog no.94TH0637-9.

Cote:1994:PSL

[CT94b] J. Cote and S. J. Thomas.Parallel semi-Lagrangian ad-vection on the sphere us-ing PVM. In Dekker et al.[DSZ94], pages 801–808.ISBN 0-444-81784-0. LCCNQA76.58.E98 1994.

Cotronis:2002:MMP

[CT02] Yiannis Cotronis and ZachariasTsiatsoulis. Modular MPIand PVM components. Lec-ture Notes in Computer Sci-ence, 2474:252–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349

(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740252.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740252.pdf.

Chang:2013:PDS

[CT13] Yao-Lin Chang and I-LunTseng. A parallel dual-scanline algorithm for par-titioning parameterized 45-degree polygons. ACMTransactions on Design Au-tomation of Electronic Sys-tems, 18(4):59:1–59:??, Oc-tober 2013. CODENATASFO. ISSN 1084-4309(print), 1557-7309 (elec-tronic).

Cotronis:2000:CMP

[CTK00] J. Y. Cotronis, Z. Tsi-atsoulis, and C. Kouni-akis. Composition of mes-sage passing applicationson-demand. Lecture Notesin Computer Science, 1908:192–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080192.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080192.pdf.

Czarnul:2001:DPD

[CTK01] Pawel Czarnul, Karen Tomko,

REFERENCES 157

and Henryk Krawczyk. Dy-namic partitioning of thedivide-and-conquer schemewith migration in PVM en-vironment. Lecture Notesin Computer Science, 2131:174–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310174.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310174.pdf.

Cao:2011:OMM

[CwCW+11] Chao Cao, Yun wen Chen,Yuning Wu, Erik Deumens,and Hai-Ping Cheng. OPAL:a multiscale multicenter sim-ulation package based onMPI-2 protocol. Interna-tional Journal of QuantumChemistry, 111(15):4020–4029, December 2011. CO-DEN IJQCB2. ISSN 0020-7608 (print), 1097-461X(electronic).

Cui:2012:OOB

[CXB+12] Zheng Cui, Lei Xia, Patrick G.Bridges, Peter A. Dinda,and John R. Lange. Op-timizing overlay-based vir-tual networking through op-timistic interrupts and cut-through forwarding. InHollingsworth [Hol12], pages99:1–99:?? ISBN 1-4673-0804-8. URL http:

//conferences.computer.org/sc/2012/papers/1000a029.pdf.

Cavender:1995:APN

[CZ95a] M. E. Cavender and Xi-aodong Zhang. Asyn-chronous PVM networkcomputing. In Bailey et al.[BBG+95], pages 772–773.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

Cavender:1995:SSA

[CZ95b] Mark E. Cavender and Xi-aodong Zhang. Softwaresupport for asynchronouscomputing across networks.In IEEE [IEE95l], pages376–382. CODEN PSICD2.ISBN 0-8186-7119-X. ISSN0730-6512. LCCN QA 76.6C6295 1995. IEEE catalognumber 95CB35838.

Chengqing:1996:WIP

[CZ96] Ye Chengqing and Cui Zhen-qian. The ways of improv-ing parallel computing effi-ciency in PVM. Mini-MicroSystems, 17(4):12–16, April1996. CODEN XWJXEH.ISSN 1000-1220.

Czarnul:2002:DTI

[Cza02] Pawel Czarnul. Develop-ment and tuning of irregulardivide-and-conquer applica-tions in DAMPVM/DAC.Lecture Notes in ComputerScience, 2474:208–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349

REFERENCES 158

(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740208.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740208.pdf.

Czarnul:2003:PTA

[Cza03] Pawel Czarnul. Program-ming, tuning and automaticparallelization of irregulardivide-and-conquer applica-tions in DAMPVM/DAC.The International Journal ofHigh Performance Comput-ing Applications, 17(1):77–93, Spring 2003. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic).

Czapinski:2013:EPM

[Cza13] Michal Czapinski. An ef-fective Parallel MultistartTabu Search for QuadraticAssignment Problem onCUDA platform. Jour-nal of Parallel and Dis-tributed Computing, 73(11):1461–1468, November 2013.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S074373151200175X.

Czech:2016:IPC

[Cze16] Zbigniew J. Czech. In-troduction to Parallel Com-puting. Cambridge Univer-sity Press, Cambridge, UK,2016. ISBN 1-107-17439-

2 (hardcover), 1-316-79583-7 (e-book). xvii + 354 pp.LCCN QA76.58 .C975 2016.

Chapman:2008:PPM

[CZG+08] Barbara Chapman, Weim-ing Zheng, Guang R. Gao,Mitsuhisa Sato, EduardAyguade, and DongshengWang, editors. A Practi-cal Programming Model forthe Multi-Core Era: 3rdInternational Workshop onOpenMP, IWOMP 2007,Beijing, China, June 3–7,2007 Proceedings, volume4935 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2008. CO-DEN LNCSD9. ISBN 3-540-69302-5 (print), 3-540-69303-3 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-540-69303-1.

Dongarra:1991:UGP

[D+91] Jack Dongarra et al. AUsers’ Guide to PVM Par-allel Virtual Machine. OakRidge National Laboratory,Knoxville, TN, USA, July1991.

Dongarra:1995:HPC

[D+95] J. J. Dongarra et al., edi-tors. High performance com-puting: technology, meth-ods, and applications (Ad-

REFERENCES 159

vanced workshop, June 1994,Cetraro, Italy), volume 10of Advances in ParallelComputing. Elsevier, Am-sterdam, The Netherlands,1995. ISBN 0-444-82163-5. ISSN 0927-5452. LCCNQA76.88.H55 1995.

Dimov:1998:IMC

[DAK98] I. Dimov, V. Alexandrov,and A. Karaivanova. Im-plementation of Monte Carloalgorithms for eigenvalueproblem using MPI. Lec-ture Notes in Computer Sci-ence, 1497:346–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Dieguez:2018:SLP

[DALD18] Adrian Perez Dieguez, Mar-garita Amor, Jacobo Lobeiras,and Ramon Doallo. Solvinglarge problem sizes of index-digit algorithms on GPU:FFT and tridiagonal sys-tem solvers. IEEE Trans-actions on Computers, 67(1):86–101, January 2018.CODEN ITCOB4. ISSN0018-9340 (print), 1557-9956(electronic). URL http://ieeexplore.ieee.org/document/7970194/.

Danalis:2012:MCT

[Dan12] Anthony Danalis. MPI andcompiler technology: a love-hate relationship. LectureNotes in Computer Science,7490:12–13, 2012. CODEN

LNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/accesspage/chapter/10.1007/978-3-642-33518-1_4.

Darema:2001:SMP

[Dar01] Frederica Darema. TheSPMD model: Past, presentand future. Lecture Notesin Computer Science, 2131:1–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310001.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310001.pdf.

Demidov:2013:PCO

[DARG13] Denis Demidov, Karsten Ah-nert, Karl Rupp, and PeterGottschling. ProgrammingCUDA and OpenCL: a casestudy using modern C++libraries. SIAM Journalon Scientific Computing, 35(5):C453–C472, ???? 2013.CODEN SJOCE3. ISSN1064-8275 (print), 1095-7197(electronic).

deAndrade:2017:OFH

[dAT17] Douglas Coimbra de An-drade and Luıs GonzagaTrabasso. An OpenCLframework for high perfor-mance extraction of image

REFERENCES 160

features. Journal of Par-allel and Distributed Com-puting, 109(??):75–88, Nov-ember 2017. CODEN JPD-CER. ISSN 0743-7315(print), 1096-0848 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0743731517301624.

Demuynck:1997:DOD

[DBA97] K. Demuynck, J. Broeck-hove, and F. Arickx. Dy-namic optimization of adistributed VR system bynetwork-balancing. Lec-ture Notes in Computer Sci-ence, 1332:443–450, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Dinan:2016:IEM

[DBB+16] James Dinan, Pavan Bal-aji, Darius Buntinas, DavidGoodell, William Gropp,and Rajeev Thakur. Animplementation and evalu-ation of the MPI 3.0 one-sided communication inter-face. Concurrency and Com-putation: Practice and Ex-perience, 28(17):4385–4404,December 10, 2016. CO-DEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Dursun:2009:MPM

[DBK+09] Hikmet Dursun, Kevin J.Barker, Darren J. Kerbyson,Scott Pakin, Richard Sey-mour, Rajiv K. Kalia, Ai-

ichiro Nakano, and PriyaVashishta. An MPI per-formance monitoring inter-face for cell based computenodes. Parallel Process-ing Letters, 19(4):535–552,December 2009. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Dotsenko:2011:ATF

[DBLG11] Yuri Dotsenko, Sara S.Baghsorkhi, Brandon Lloyd,and Naga K. Govindaraju.Auto-tuning of Fast FourierTransform on graphics pro-cessors. ACM SIGPLANNotices, 46(8):257–266, Au-gust 2011. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic). PPoPP’11 Conference proceedings.

DiMartino:2001:WDS

[DBVF01] Beniamino Di Martino,Sergio Briguglio, GregorioVlad, and Giuliana Fogaccia.Workload decompositionstrategies for shared mem-ory parallel systems withOpenMP. Scientific Pro-gramming, 9(2–3):109–122,Spring–Summer 2001. CO-DEN SCIPEV. ISSN 1058-9244 (print), 1875-919X(electronic). URL http://iospress.metapress.com/app/home/contribution.asp%3Fwasp=7pab6qgbaf8vxg991rwy%26referrer=parent%26backto=issue%2C5%2C11%3Bjournal%

REFERENCES 161

2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

DAgostino:2014:CAM

[DCD+14] Daniele D’Agostino, AndreaClematis, Sergio Decherchi,Walter Rocchia, Luciano Mi-lanesi, and Ivan Merelli.CUDA accelerated molecu-lar surface generation. Con-currency and Computation:Practice and Experience, 26(10):1819–1831, July 2014.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

daCunha:1993:PLA

[dCH93] R. D. da Cunha and T. Hop-kins. Porting linear alge-bra subroutines from trans-puters to clusters of work-stations. In Grebe et al.[GHH+93], pages 660–667.ISBN 90-5199-140-1. LCCN????

Dow:2002:CMA

[DCH02] Chyi-Ren Dow, Jong-ShinChen, and Min-Chang Hsieh.Checkpointing MPI applica-tions on symmetric multi-processor machines usingSMPCkpt. The Journal ofSystems and Software, 63(2):137–150, August 15, 2002.CODEN JSSODM. ISSN0164-1212 (print), 1873-1228(electronic).

Didelot:2012:IMC

[DCPJ12] Sylvain Didelot, PatrickCarribault, Marc Perache,

and William Jalby. Im-proving MPI communica-tion overlap with collab-orative polling. LectureNotes in Computer Science,7490:37–46, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_9/.

Didelot:2014:IMC

[DCPJ14] Sylvain Didelot, PatrickCarribault, Marc Perache,and William Jalby. Im-proving MPI communicationoverlap with collaborativepolling. Computing, 96(4):263–278, April 2014. CO-DEN CMPTA2. ISSN 0010-485X (print), 1436-5057(electronic). URL http://link.springer.com/article/10.1007/s00607-013-0327-z.

delCuvillo:2006:LOC

[dCZG06] Juan del Cuvillo, WeirongZhu, and Guang Gao. Land-ing OpenMP on Cyclops-64: an efficient mappingof OpenMP to a many-core system-on-a-chip. InACM [ACM06b], pages 41–50. ISBN 1-59593-302-6.ACM order number 104060.

Dozsa:2000:THL

[DDL00] Gabor Dozsa, Daniel Drotos,and Robert Lovas. Transla-tion of a high-level graphi-cal code to message-passing

REFERENCES 162

primitives in the GRADEprogramming environment.Lecture Notes in ComputerScience, 1908:258–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080258.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080258.pdf.

Decker:1995:TDU

[DDLM95] T. Decker, R. Diekmann,R. Luling, and B. Monien.Towards developing univer-sal dynamic mapping algo-rithms. In IEEE [IEE95g],pages 456–459. ISBN 0-8186-7195-5. LCCN QA 76.58 I421995. IEEE catalog number95TB8131.

Dongarra:1997:BCA

[DDPR97] J. J. Dongarra, F. Desprez,A. Petitet, and C. Ran-driamaro. Block-cyclic ar-ray redistribution on net-works of workstations. Lec-ture Notes in Computer Sci-ence, 1332:343–350, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Dean:1994:CPV

[DDS+94] C. E. Dean, R. C. Denny,P. C. Stephenson, G. J.Milne, and E. Pantos. Com-puting with parallel vir-tual machines. Journal de

physique. IV, Colloque, 4(C9):C9/445–448, November1994. CODEN JPICEI.ISSN 1155-4339.

Dan:1999:QAM

[DDYM99] Pei Dan, Wang Dong-sheng, Zhang Youhui, andShen Meiming. Quasi-asynchronous migration: anovel migration protocol forPVM tasks. Operating Sys-tems Review, 33(2):5–14,April 1999. CODEN OS-RED8. ISSN 0163-5980(print), 1943-586X (elec-tronic).

Durand:1991:HPC

[DE91] M. Durand and F. ElDabaghi, editors. Highperformance computing, II:proceedings of the SecondSymposium on High Perfor-mance Computing, Montpel-lier, France, 7–9 October,1991. North-Holland, Am-sterdam, The Netherlands,1991. ISBN 0-444-89224-9.LCCN QA75.5.I585 1991.

Demaine:1996:FCC

[Dem96] E. Demaine. First classcommunication in MPI. InIEEE [IEE96i], pages 189–194. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

DePasquale:2003:UJU

[DeP03] C. J. DePasquale. Usingthe JVMPI to understandthe behavior of Java classesduring the development pro-

REFERENCES 163

cess. Cmg, 2(??):821–832,2003. CODEN ????

Dehne:2001:CPD

[DERC01] Frank Dehne, Todd Eavis,and Andrew Rau-Chaplin.Computing partial datacubes for parallel data ware-housing applications. Lec-ture Notes in Computer Sci-ence, 2131:319–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310319.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310319.pdf.

Dashti:2017:AMM

[DF17] Mohammad Dashti andAlexandra Fedorova. An-alyzing memory manage-ment methods on integratedCPU–GPU systems. ACMSIGPLAN Notices, 52(9):59–69, September 2017. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Duran:2009:PEO

[DFA+09] Alejandro Duran, RogerFerrer, Eduard Ayguade,Rosa M. Badia, and JesusLabarta. A proposal toextend the OpenMP task-ing model with dependenttasks. International Jour-nal of Parallel Programming,

37(3):292–305, June 2009.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=37&issue=3&spage=292.

Duran:2007:PEH

[DFC+07] Alejandro Duran, Roger Fer-rer, Juan Jose Costa, MarcGonzalez, Xavier Martorell,Eduard Ayguade, and JesusLabarta. A proposal for er-ror handling in OpenMP.International Journal ofParallel Programming, 35(4):393–416, August 2007.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=35&issue=4&spage=393.

Demaine:2001:GCM

[DFKS01] E. D. Demaine, I. Foster,C. Kesselman, and M. Snir.Generalized communicatorsin the message passing in-terface. IEEE Transac-tions on Parallel and Dis-tributed Systems, 12(6):610–616, June 2001. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic). URL http://dlib.computer.org/td/books/td2001/pdf/l0610.pdf;http://www.computer.org/tpds/td2001/l0610abs.htm.

REFERENCES 164

Deshpande:1994:ADN

[DFMD94] Manish Deshpande, JinzhangFeng, Charles L. Merkle, andAshish Deshpande. Appli-cation of a distributed net-work in computational fluiddynamic simulations. TheInternational Journal of Su-percomputer Applications, 8(1):64–67, Spring 1994. CO-DEN IJSAE9. ISSN 0890-2720.

Diaz:2012:CCF

[DFN12] M. J. Castro Dıaz andE. Fernandez-Nieto. Aclass of computationallyfast first order finite vol-ume solvers: PVM meth-ods. SIAM Journal on Sci-entific Computing, 34(4):A2173–A2196, ???? 2012.CODEN SJOCE3. ISSN1064-8275 (print), 1095-7197(electronic).

DAmbra:1995:CBC

[DG95] P. D’Ambra and G. Giunta.Concurrent banded Choleskyfactorization on worksta-tion networks using PVM.Parallel Computing, 21(3):487–494, March 10, 1995.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Dinan:2014:ECC

[DGB+14] James Dinan, Ryan E.Grant, Pavan Balaji, DavidGoodell, Douglas Miller,Marc Snir, and Rajeev

Thakur. Enabling communi-cation concurrency throughflexible MPI endpoints. TheInternational Journal ofHigh Performance Com-puting Applications, 28(4):390–405, November 2014.CODEN IHPCFL. ISSN1094-3420 (print), 1741-2846 (electronic). URLhttp://hpc.sagepub.com/content/28/4/390.

DiNapoli:1997:DCA

[DGF97] C. Di Napoli, M. Gior-dano, and M. M. Furnari.Distributed and coopera-tive applications in PVM.Lecture Notes in ComputerScience, 1332:83–90, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Dinan:2012:EMC

[DGG+12] James Dinan, David Good-ell, William Gropp, Ra-jeev Thakur, and PavanBalaji. Efficient multi-threaded context ID allo-cation in MPI. LectureNotes in Computer Science,7490:57–66, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_11/.

deGloria:1994:TAS

[dGJM94] A. de Gloria, M. R. Jane,and D. Marini, editors.

REFERENCES 165

Transputer Applications andSystems ’94. Proceedings ofthe 1994 World TransputerCongress. IOS Press, PostalDrawer 10558, Burke, VA2209-0558, USA, 1994. ISBN???? LCCN ????

Dongarra:1993:UPR

[DGMJ93] J. J. Dongarra, A. Geist,R. Manchek, and W. Jiang.Using PVM 3.0 to rungrand challenge applicationson a heterogeneous networkof parallel computers. In Sin-covec [Sin93], pages 873–877.ISBN 0-89871-315-3. LCCNQA 76.58 S55 1993. Two vol-umes.

Dongarra:1993:IPF

[DGMS93] Jack Dongarra, G. A. Geist,Robert Manchek, and V. S.Sunderam. Integrated PVMframework supports hetero-geneous network comput-ing. Computers in Physics, 7(2):166–174 (or 166–175??),March-April 1993. CODENCPHYE2. ISSN 0894-1866(print), 1558-4208 (elec-tronic).

daCunha:1994:PIR

[dH94] Rudnei Dias da Cunha andTim Hopkins. A paral-lel implementation of therestarted GMRES iterativealgorithm for nonsymmet-ric systems of linear equa-tions. Advances in compu-tational mathematics, 2(3):261–277, ???? 1994. CO-

DEN ACMHEX. ISSN 1019-7168.

Dongarra:1995:PBC

[DH95] J. J. Dongarra and T. Hey.The ParkBench benchmarkcollection. Supercomputer,11(2-3):94–114, June 1995.CODEN SPCOEL. ISSN0168-7875.

Dongarra:1992:PUL

[DHHW92] Jack J. Dongarra, RolfHempel, Anthony J. G.Hey, and David W. Walker.A proposal for a user-level message-passing inter-face in a distributed mem-ory environment. Techni-cal Report TM-12231, OakRidge National Laboratory,Knoxville, TN, USA, Octo-ber 1992.

Dongarra:1993:PUM

[DHHW93a] J. Dongarra, R. Hempel,A. Hay, and D. Walker.A proposal for a user-levelmessage passing interface ina distributed memory en-vironment. Technical Re-port ORNL/TM-12231, OakRidge National Laboratory,Knoxville, TN, USA, Febru-ary 1993.

Dongarra:1993:DSM

[DHHW93b] J. J. Dongarra, R. Hempel,A. J. G. Hey, and D. W.Walker. A draft standardfor message passing in adistributed memory environ-ment. In Hoffmann and Kau-ranne [HK93], pages 465–

REFERENCES 166

481. ISBN 981-02-1429-4.LCCN QA76.58 E354 1992.

Derakhshan:1997:PEP

[DHK97] M. Derakhshan, S. Ham-marling, and A. Krom-mer. PINEAPL: a Euro-pean project on Parallel In-dustrial Numerical Applica-tions and Portable Libraries.Lecture Notes in ComputerScience, 1332:337–342, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Dongarra:1997:CSD

[DHP97] J. J. Dongarra, S. Ham-marling, and A. Petitet.Case studies on the de-velopment of ScaLAPACKand the NAG numeri-cal PVM library. InBoisvert [Boi97], pages 236–248. ISBN 0-412-80530-8. LCCN QA297 .I35 1996.URL http://www.netlib.org/utk/papers/woco96/woco96.html; http://www.netlib.org/utk/papers/woco96/woco96.ps; http://www.netlib.org/utk/people/JackDongarra/pdf/woco96.pdf.

Dongarra:1996:SRP

[DHS96] J. J. Dongarra, T. Hey, andE. Strohmaier. Selectedresults from the PARK-BENCH benchmark. InBouge et al. [BFMR96],pages 251–254. ISBN 3-540-61626-8 (vol. 1), 3-540-

61627-6 (vol. 2). ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I554 1996, QA267.A1L43 no.1123-1124. Two vol-umes.

DiPierro:2014:PPP

[Di 14] Massimo Di Pierro. Portableparallel programs with Pythonand OpenCL. Computingin Science and Engineer-ing, 16(1):34–40, January/February 2014. CODENCSENFA. ISSN 1521-9615.

DiSerio:2002:ENN

[DI02] Angela Di Serio and Marıa B.Ibanez. Evaluation of anearest-neighbor load bal-ancing strategy for paral-lel molecular simulations inMPI environment. Lec-ture Notes in Computer Sci-ence, 2474:226–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740226.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740226.pdf.

DiNucci:1996:CDS

[DiN96] D. C. DiNucci. Co-operative data sharing:a layered approach toan architecture-independentMessage-Passing Interface.In IEEE [IEE96i], pages 58–65. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

REFERENCES 167

Karniadakis:2002:DLP

[DK02] Suchuan Dong and George Em.Karniadakis. Dual-level par-allelism for deterministicand stochastic CFD prob-lems. In IEEE [IEE02],page ?? ISBN 0-7695-1524-X. LCCN ???? URLhttp://www.sc-2002.org/paperpdfs/pap.pap137.pdf.

Drosinos:2006:EPT

[DK06] Nikolaos Drosinos and Nec-tarios Koziris. The ef-fect of process topologyand load balancing on par-allel programming modelsfor SMP clusters and it-erative algorithms. TheJournal of Supercomputing,35(1):65–91, January 2006.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=35&issue=1&spage=65.

Deo:2013:PSA

[DK13] Mrinal Deo and Sean Keely.Parallel suffix array and leastcommon prefix for the GPU.ACM SIGPLAN Notices, 48(8):197–206, August 2013.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPoPP ’13 Confer-ence proceedings.

DiMartino:2005:RAP

[DKD05] Beniamino Di Martino, Di-eter Kranzlmuller, and J. J.Dongarra, editors. Recentadvances in parallel virtualmachine and message pass-ing interface: 12th EuropeanPVM/MPI User’s GroupMeeting, Sorrento, Italy,September 18–21, 2005:proceedings, volume 3666of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2005. CO-DEN LNCSD9. ISBN 3-540-29009-5 (paperback). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58.E973 2005. URL http://springerlink.metapress.com/openurl.asp?genre=issue&issn=0302-9743&volume=3666.

DiMartino:2007:SIS

[DKD07] Beniamino Di Martino, Di-eter Kranzlmuller, and JackDongarra. Special issue onselected papers from the Eu-roPVM/MPI 2005 Confer-ence, Sorrento, Italy, 18-21September 2005 — preface.The International Journal ofHigh Performance Comput-ing Applications, 21(2):129–131, Summer 2007. ISSN1094-3420 (print), 1741-2846(electronic).

DiMartino:2008:SSG

[DKD08] Beniamino Di Martino, Di-

REFERENCES 168

eter Kranzlmuller, and JackDongarra. Special sec-tion: Grid computing andthe Message Passing In-terface. Future Genera-tion Computer Systems, 24(2):119–120, February 2008.CODEN FGSEVI. ISSN0167-739X (print), 1872-7115 (electronic).

Damodaran-Kamal:1993:NTD

[DKF93] S. K. Damodaran-Kamaland J. M. Francioni. Non-determinacy: testing anddebugging in message pass-ing parallel programs. ACMSIGPLAN Notices, 28(12):118–128, December 1993.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Damodaran-Kamal:1994:MSR

[DKF94a] S. K. Damodaran-Kamaland J. M. Francioni. mdb: asemantic race detection toolfor PVM. In Pierce and Reg-nier [PR94b], pages 702–709.ISBN 0-8186-5680-8, 0-8186-5681-6. LCCN QA76.58.S321994. IEEE catalog no.94TH0637-9.

Damodaran-Kamal:1994:TRP

[DKF94b] S. K. Damodaran-Kamaland J. M. Francioni. Test-ing races in parallel pro-grams with an OtOt strat-egy. In Ostrand [Ost94].CODEN SFENDP. ISBN0-89791-683-2. ISSN 0163-

5948. LCCN QA76.76.T48I58 1994.

Dongarra:1992:PFS

[DKM+92] J. Dongarra, P. Kennedy,P. Messina, D. C. Sorensen,and R. G. Voigt, editors.Proceedings of the FifthSIAM Conference on Par-allel Processing for Sci-entific Computing, 25–27March 1991, Houston, TX,USA. Society for Indus-trial and Applied Mathemat-ics, Philadelphia, PA, USA,1992. ISBN 0-89871-303-X.LCCN QA76.58.P76 1992.

Dongarra:2000:RAP

[DKP00] J. J. Dongarra, Peter Kac-suk, and Norbert Pod-horszki, editors. Recentadvances in parallel vir-tual machine and messagepassing interface: 7th Eu-ropean PVM/MPI Users’Group Meeting, Balaton-fured, Hungary, Septem-ber 10–13, 2000: proceed-ings, volume 1908 of Lec-ture Notes in ComputerScience. Springer-Verlag,Berlin, Germany / Heidel-berg, Germany / London,UK / etc., 2000. ISBN 3-540-41010-4 (softcover). ISSN0302-9743 (print), 1611-3349(electronic).

Dickens:2010:HPI

[DL10] Phillip M. Dickens andJeremy Logan. A high per-formance implementation of

REFERENCES 169

MPI-IO for a Lustre filesystem environment. Con-currency and Computation:Practice and Experience, 22(11):1433–1449, August 10,2010. CODEN CCPEBO.ISSN 1532-0626 (print),1532-0634 (electronic).

delaAsuncion:2011:SOL

[dlAMC11] Marc de la Asuncion,Jose M. Mantas, andManuel J. Castro. Sim-ulation of one-layer shal-low water systems on mul-ticore and CUDA archi-tectures. The Journal ofSupercomputing, 58(2):206–214, November 2011. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=58&issue=2&spage=206.

delaAsuncion:2012:MCI

[dlAMCFN12] Marc de la Asuncion,Jose M. Mantas, Manuel J.Castro, and E. D. Fernandez-Nieto. An MPI-CUDA im-plementation of an improvedRoe method for two-layershallow water systems. Jour-nal of Parallel and Dis-tributed Computing, 72(9):1065–1072, September 2012.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S074373151100147X.

Desai:2007:CEM

[DLB07] Narayan Desai, Ewing Lusk,and Rick Bradshaw. Acomposition environment forMPI programs. The Interna-tional Journal of High Per-formance Computing Ap-plications, 21(2):166–173,May 2007. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/21/2/166.full.pdf+html.

Marcos:2002:DDP

[dlFMBdlFM02] Carlos de la Fuente Mar-cos, Pierre Barge, and Raulde la Fuente Marcos. Dustdynamics in protoplanetarydisks: Parallel computingwith PVM. Journal of Com-putational Physics, 176(2):276–294, March 1, 2002.CODEN JCTPAH. ISSN0021-9991 (print), 1090-2716(electronic). URL http://www.sciencedirect.com/science/article/pii/S0021999101969785.

Dongarra:1999:RAP

[DLM99] J. J. Dongarra, E. Luque,and Tomas Margalef, ed-itors. Recent advancesin parallel virtual machineand message passing inter-face: 6th European PVM/MPI Users’ Group Meeting,Barcelona, Spain, Septem-ber 26–29, 1999: proceed-ings, volume 1697 of Lec-ture Notes in ComputerScience. Springer-Verlag,

REFERENCES 170

Berlin, Germany / Heidel-berg, Germany / London,UK / etc., 1999. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Degomme:2017:SMA

[DLM+17] Augustin Degomme, Ar-naud Legrand, George S.Markomanolis, Martin Quin-son, Mark Stillwell, andFrederic Suter. Simulat-ing MPI applications: TheSMPI approach. IEEETransactions on Parallel andDistributed Systems, 28(8):2387–2400, August 2017.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic)ITDSEO. URLhttps://www.computer.org/csdl/trans/td/2017/08/07855780-abs.html.

Dongarra:2003:RAP

[DLO03] Jack Dongarra, DomenicoLaforenza, and Salvatore Or-lando, editors. Recent ad-vances in parallel virtualmachine and message pass-ing interface: 10th Eu-ropean PVM/MPI User’sgroup Meeting, Venice, Italy,September 29–October 2,2003: Proceedings, volume2840 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2003. CO-DEN LNCSD9. ISBN 3-540-20149-1. ISSN 0302-9743

(print), 1611-3349 (elec-tronic). LCCN QA76.58.E973 2003. URL http://link.springer-ny.com/link/service/series/0558/tocs/t2840.htm.

DeKeyser:1994:RTL

[DLR94] J. DeKeyser, K. Lust, andD. Roose. Run-time loadbalancing support for aparallel multiblock Euler/Navier–Stokes code withadaptive refinement on dis-tributed memory comput-ers. Parallel Computing, 20(8):1069–1088, August 1994.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Lu:2004:AFS

[dLR04] Charng da Lu and Daniel A.Reed. Assessing fault sen-sitivity in MPI applications.In ACM [ACM04], page 37.ISBN 0-7695-2153-3. LCCN????

DeSande:1999:NBS

[DLRR99] F. De Sande, C. Leon, C. Ro-driguez, and J. Roda. Nestedbulk synchronous parallelcomputing. In Dongarraet al. [DLM99], pages 189–198. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

DiPietro:2016:CLD

[DLV16] Roberto Di Pietro, FlavioLombardi, and Antonio Vil-

REFERENCES 171

lani. CUDA leaks: adetailed hack for CUDAand a (partial) fix. ACMTransactions on EmbeddedComputing Systems, 15(1):15:1–15:??, February 2016.CODEN ???? ISSN1539-9087 (print), 1558-3465(electronic).

Despons:1993:CCP

[DM93] R. Despons and T. Muntean.Constructing correct proto-cols for a diffusion virtualmachine in message pass-ing parallel architectures.In Grebe et al. [GHH+93],pages 465–480. ISBN 90-5199-140-1. LCCN ????

Davies:1995:NSP

[DM95a] G. Davies and N. Mat-loff. Network-specific per-formance enhancements forPVM. In IEEE [IEE95k],pages 205–210. ISBN 0-8186-7088-6. LCCN QA76.9.D5I328 1995. IEEE catalog no.95TB8075.

Davies:1995:NPE

[DM95b] Gregory Davies and NormanMatloff. Network-specificperformance enhancementsfor PVM. In IEEE [IEE95k],pages 205–210. ISBN 0-8186-7088-6. LCCN QA76.9.D5I328 1995. IEEE catalog no.95TB8075.

Dagum:1998:OIS

[DM98] Leonardo Dagum and RameshMenon. OpenMP: Anindustry-standard API for

shared-memory program-ming. IEEE Computa-tional Science & Engineer-ing, 5(1):46–55, January/March 1998. CODEN IS-CEE4. ISSN 1070-9924(print), 1558-190X (elec-tronic). URL http://dlib.computer.org/cs/books/cs1998/pdf/c1046.pdf;http://www.computer.org/cse/cs1998/c1046abs.htm.

Dziubak:2012:OOI

[DM12] Tomasz Dziubak and JacekMatulewski. An object-oriented implementation of asolver of the time-dependentSchrodinger equation us-ing the CUDA technol-ogy. Computer PhysicsCommunications, 183(3):800–812, March 2012. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465511003948.

Dathathri:2016:CAL

[DMB16] Roshan Dathathri, Ravi TejaMullapudi, and Uday Bond-hugula. Compiling affineloop nests for a dynamicscheduling runtime on sharedand distributed memory.ACM Transactions on Par-allel Computing (TOPC), 3(2):12:1–12:??, August 2016.CODEN ???? ISSN2329-4949 (print), 2329-4957(electronic).

REFERENCES 172

DiMartino:1997:IPD

[DMMV97] B. Di Martino, A. Mazzeo,N. Mazzocca, and U. Vil-lano. Interaction patternsdetection in PVM programsto support simulation. Lec-ture Notes in Computer Sci-ence, 1332:250–256, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Dongarra:1996:APC

[DMW96] Jack J. Dongarra, Kay Mad-sen, and Jerzy Wasniewski,editors. Applied parallelcomputing: computationsin physics, chemistry, andengineering science: sec-ond international workshop,PARA ’95, Lyngby, Den-mark, August 21–24, 1995:proceedings, volume 1041of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / London,UK / etc., 1996. ISBN 3-540-60902-4. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.P351995.

Dinda:1996:PIA

[DO96] P. A. Dinda and D. R.O’Hallaron. The perfor-mance impact of address re-lation caching. In Szymanskiand Sinharoy [SS96], pages213–226. ISBN 0-7923-9635-9. LCCN QA76.58.L37 1996.

Donev:2006:ICF

[Don06] Aleksander Donev. Interop-erability with C in Fortran2003. ACM Fortran Forum,25(1):8–12, April 2006. ISSN1061-7264 (print), 1931-1311(electronic).

Sandes:2016:CIS

[dOSMM+16] Edans Flavius de Oliveira Sandes,Guillermo Miranda, XavierMartorell, Eduard Ayguade,George Teodoro, and AlbaCristina Magalhaes Melo.CUDAlign 4.0: Incrementalspeculative traceback for ex-act chromosome-wide align-ment in GPU clusters. IEEETransactions on Parallel andDistributed Systems, 27(10):2838–2850, October 2016.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic)ITDSEO. URLhttps://www.computer.org/csdl/trans/td/2016/10/07374729-abs.html.

Dongarra:1995:IMS

[DOSW95] Jack Dongarra, Steve W.Otto, Marc Snir, andDavid Walker. An in-troduction to the MPIStandard. Technical re-port CS-95-274, Univer-sity of Tennessee, Knoxville,Knoxville, TN 37996, USA,January 1995. URLhttp://www.netlib.org/tennessee/ut-cs-95-274.ps; http://www.netlib.org/utk/papers/intro-mpi/intro-mpi.html; http:

REFERENCES 173

//www.netlib.org/utk/people/JackDongarra/pdf/ut-cs-95-274.pdf. Appears inCACM [DOSW96].

Dongarra:1996:MPS

[DOSW96] Jack J. Dongarra, Steve W.Otto, Marc Snir, and DavidWalker. A message pass-ing standard for MPP andworkstations. Communica-tions of the ACM, 39(7):84–90, July 1996. CO-DEN CACMA2. ISSN0001-0782 (print), 1557-7317(electronic). URL http://www.acm.org/pubs/toc/Abstracts/cacm/234000.html.

DeRoeck:1994:CFP

[DP94] Y. H. De Roeck and R. E.Plessix. Combining F90and PVM to construct syn-thetic seismograms by ray-tracing. In IEEE [IEE94c],pages II–653–II–658. ISBN0-7803-2057-3, 0-7803-2056-5, 0-7803-2058-1. ISSN0197-7385. LCCN TC 1505O33197 1994. Three vol-umes. IEEE catalog no.94CH3472-8.

Denis:2001:THP

[DPP01] Alexandre Denis, ChristianPerez, and Thierry Priol.Towards high performanceCORBA and MPI middle-wares for grid computing.Lecture Notes in ComputerScience, 2242:14–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349

(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2242/22420014.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2242/22420014.pdf.

Dalcin:2005:MP

[DPS05] Lisandro Dalcın, RodrigoPaz, and Mario Storti.MPI for Python. Jour-nal of Parallel and Dis-tributed Computing, 65(9):1108–1115, September 2005.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Dalcin:2008:MPP

[DPSD08] Lisandro Dalcın, RodrigoPaz, Mario Storti, and JorgeD’Elıa. MPI for Python:Performance improvementsand MPI-2 extensions. Jour-nal of Parallel and Dis-tributed Computing, 68(5):655–662, May 2008. CODENJPDCER. ISSN 0743-7315(print), 1096-0848 (elec-tronic).

Dou:1997:ISV

[DPZ97] Yong Dou, Zhengbing Pang,and Xingming Zhou. Imple-menting a software virtualshared memory on PVM. InIEEE [IEE97a]. ISBN 0-8186-7876-3 (paperback andcase), 0-8186-7878-X (mi-crofiche). LCCN QA76.58.A4 1997.

REFERENCES 174

Decker:1994:PEM

[DR94] K. M. (Karsten M.) Deckerand R. M. (Rene M.)Rehmann, editors. Pro-gramming environments formassively parallel distributedsystems: working confer-ence of the IFIP WG10.3,April 25–29, 1994, Ascona,Italy. Birkhauser, Cam-bridge, MA, USA; Berlin,Germany; Basel, Switzer-land, 1994. ISBN 0-8176-5090-3 (Boston), 3-7643-5090-3 (Basel). LCCNQA76.58.P767 1994.

Dowaji:1995:LBS

[DR95] S. Dowaji and C. Roucairol.Load balancing strategy andpriority of tasks in dis-tributed environments. InIEEE [IEE95b], pages 15–22. ISBN 0-7803-2493-5,0-7803-2492-7, 0-7803-2494-3. LCCN TK7885.A1 I5671995. IEEE catalog no.95CH35751.

DiMartino:1997:MDH

[DR97] V. Di Martino and G. Ruocco.Molecular dynamics on hy-brid memory machines. Lec-ture Notes in Computer Sci-ence, 1332:451–456, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Davina:2018:MCP

[DR18] A. Lamas Davina and J. E.Roman. MPI-CUDA paral-lel linear solvers for block-

tridiagonal matrices in thecontext of SLEPc’s eigen-solvers. Parallel Computing,74(??):118–135, ???? 2018.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819117301874.

Deuzeman:2012:LMP

[DRUC12] Albert Deuzeman, SiebrenReker, Carsten Urbach, andETM Collaboration. Lemon:An MPI parallel I/O libraryfor data encapsulation usingLIME. Computer PhysicsCommunications, 183(6):1321–1335, June 2012. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465512000318.

Deshpande:1996:MIBb

[DS96a] V. Deshpande and W. Sawyer.An MPI implementation ofthe BLACS. In IEEE[IEE96a], pages 463–468.ISBN 0-8186-7557-8. LCCNQA76.88.I575 1996. IEEEcatalog number 96TB100074.

Djordjevic:1996:ICI

[DS96b] G. L. Djordjevic and M. K.Stojcev. An interprocessorcommunication interface formessage passing via sharedmemory modules-design andperformances. Computersand Artificial Intelligence= Vychislitel’nye mashiny

REFERENCES 175

i iskusstvennyi intellekt, 15(1):1–34, ???? 1996. CO-DEN CARIDY. ISSN 0232-0274.

Dang:2013:CES

[DS13] Hoang-Vu Dang and BertilSchmidt. CUDA-enabledsparse matrix-vector mul-tiplication on GPUs usingatomic operations. Par-allel Computing, 39(11):737–750, November 2013.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819113001178.

Deniz:2016:MGM

[DS16] Etem Deniz and Alper Sen.MINIME-GPU: Multicorebenchmark synthesizer forGPUs. ACM Transactionson Architecture and CodeOptimization, 12(4):34:1–34:??, January 2016. CO-DEN ???? ISSN 1544-3566(print), 1544-3973 (elec-tronic).

Dang:2017:ECB

[DSG17] Hoang-Vu Dang, Marc Snir,and William Gropp. Elim-inating contention bottle-necks in multithreaded MPI.Parallel Computing, 69(??):1–23, November 2017. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819117301187.

Dietrich:2017:CBA

[DSGS17] Robert Dietrich, Felix Schmitt,Alexander Grund, and JonasStolle. Critical-blame anal-ysis for OpenMP 4.0 of-floading on Intel Xeon Phi.The Journal of Systemsand Software, 125(??):381–388, March 2017. CO-DEN JSSODM. ISSN0164-1212 (print), 1873-1228 (electronic). URL //www.sciencedirect.com/science/article/pii/S0164121215002940.

Davidor:1994:PPS

[DSM94] Yuval Davidor, Hans-PaulSchwefel, and ReinhardManner, editors. Paral-lel problem solving from na-ture — PPSN III: Interna-tional Conference on Evo-lutionary Computation, theThird Conference on Par-allel Problem Solving fromNature, Jerusalem, Israel,October 9–14, 1994: pro-ceedings, number 866 inLecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / London,UK / etc., 1994. ISBN 3-540-58484-6. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.I535 1994.

Dohi:2011:GIO

[DSOF11] Keisuke Dohi, Yuichiro Shi-bata, Kiyoshi Oguri, andTakafumi Fujimoto. GPUimplementation and opti-

REFERENCES 176

mization of electromagneticsimulation using the FDTDmethod for antenna design-ing. ACM SIGARCH Com-puter Architecture News, 39(4):26–31, September 2011.CODEN CANED2. ISSN0163-5964 (print), 1943-5851(electronic).

Domokos:2000:PRC

[DSS00] Gabor Domokos, Imre Sze-berenyi, and Paul H. Steen.Parallel, recursive compu-tation of global stabilitycharts for liquid bridges.Lecture Notes in ComputerScience, 1908:64–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080064.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080064.pdf.

Deshpande:1996:MIBa

[DSW96] V. Deshpande, W. Sawyer,and D. W. Walker. AnMPI implementation of theBLACS. In IEEE [IEE96i],pages 195–198. ISBN 0-8186-7533-0. LCCN QA76.642.M67 1996.

Dekker:1994:MPP

[DSZ94] L. (Leendert) Dekker, W. Smit,and J. C. Zuidervaart, ed-itors. Massively paral-lel processing applications

and development: proceed-ings of the 1994 EUROSIMConference on MassivelyParallel Processing Appli-cations and Development,Delft, The Netherlands, 21–23 June 1994. Elsevier, Am-sterdam, The Netherlands,1994. ISBN 0-444-81784-0.LCCN QA76.58.E98 1994.

Dongarra:1994:PSW

[DT94] Jack J. Dongarra andBernard Tourancheau, edi-tors. Proceedings of the Sec-ond Workshop on Environ-ments and Tools for Par-allel Scientific Computing:Townsend, TN, USA, 25–27May 1994. Society for Indus-trial and Applied Mathemat-ics, Philadelphia, PA, USA,1994. ISBN 0-89871-343-9.LCCN QA76.58.I568 1994.

Diavastos:2017:SLR

[DT17] Andreas Diavastos and Pe-dro Trancoso. SWITCHES:a lightweight runtime fordataflow execution of taskson many-cores. ACM Trans-actions on Architecture andCode Optimization, 14(3):31:1–31:??, September 2017.CODEN ???? ISSN1544-3566 (print), 1544-3973(electronic).

Duval:1992:TPP

[Duv92] D. Duval. Trends in paral-lel programming models forhigh performance comput-ers. In Ferenczi [Fer92],

REFERENCES 177

page 33. ISBN ???? LCCN????

Dikken:1994:DDL

[DvdLVS94] L. Dikken, F. van derLinden, J. Vesseur, andP. Sloot. DynamicPVM: Dy-namic load balancing on par-allel systems. In Gentzschand Harms [GH94], pages273–277. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Dongarra:1994:PSC

[DW94] Jack Dongarra and JerzyWasniewski, editors. Par-allel scientific computing:First International Work-shop, PARA ’94, Lyngby,Denmark, June 20–23, 1994:proceedings, volume 879of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / London,UK / etc., 1994. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8 (New York). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58.P35 1994. DM104.00.

DeRose:2002:CCG

[DW02] L. DeRose and F. Wolf.CATCH — a call-graphbased automatic tool forcapture of hardware per-formance metrics for MPIand OpenMP applications.Lecture Notes in ComputerScience, 2400:167–??, 2002.

CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2400/24000167.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2400/24000167.pdf.

Du:2010:COT

[DWL+10] Peng Du, Rick Weber, Pi-otr Luszczek, Stanimire To-mov, Gregory Peterson,and Jack Dongarra. FromCUDA to OpenCL: Towardsa performance-portable so-lution for multi-platformGPU programming. LA-PACK Working Note 228,Department of ComputerScience, University of Ten-nessee, Knoxville, Knoxville,TN 37996, USA, Septem-ber 6, 2010. URL http://www.netlib.org/lapack/lawnspdf/lawn228.pdf. UT-CS-10-656.

Du:2012:COT

[DWL+12] Peng Du, Rick Weber, Pi-otr Luszczek, Stanimire To-mov, Gregory Peterson,and Jack Dongarra. FromCUDA to OpenCL: Towardsa performance-portable so-lution for multi-platformGPU programming. Par-allel Computing, 38(8):391–407, August 2012. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http:/

REFERENCES 178

/www.sciencedirect.com/science/article/pii/S0167819111001335.

Deshpande:2012:AGC

[DWM12] Vivek Deshpande, Xing Wu,and Frank Mueller. Auto-generation of communica-tion benchmark traces. ACMSIGMETRICS PerformanceEvaluation Review, 40(2):99–105, September 2012.CODEN ???? ISSN0163-5999 (print), 1557-9484(electronic).

Dong:1996:SPL

[DXB96] Li Dong, Li Xiaoming, andFang Binxing. The study onthe parallel library based onMPI. Mini-Micro Systems,17(12):17–19, 1996. CODENXWJXEH. ISSN 1000-1220.

Deng:2006:PIK

[DYN+06] Junjun Deng, Hengyong Yu,Jun Ni, Tao He, Shiy-ing Zhao, Lihe Wang, andGe Wang. A parallel im-plementation of the katse-vich algorithm for 3-D CTimage reconstruction. TheJournal of Supercomputing,38(1):35–47, October 2006.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=38&issue=1&spage=35.

Dantas:1996:ILB

[DZ96] M. A. R. Dantas and E. J.Zaluska. Improving load

balancing in an MPI envi-ronment with resource man-agement. In Liddell et al.[LCHS96], pages 959–960.ISBN 3-540-61142-8 (paper-back). LCCN QA76.88 .H521996.

Dantas:1998:ESM

[DZ98a] M. A. R. Dantas and E. J.Zaluska. Efficient schedul-ing of MPI applications onnetworks of workstations.Future Generation Com-puter Systems, 13(6):489–499, May 20, 1998. CODENFGSEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic). URL http://www.elsevier.com/gej-ng/10/19/19/28/20/21/abstract.html.

Delves:1998:HPF

[DZ98b] M. Delves and H. Zima.High Performance Fortran:a status report or: Arewe ready to give up MPI?Lecture Notes in ComputerScience, 1497:161–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Dragovitsch:1995:PPS

[DZDR95] P. Dragovitsch, X. Zhao,L. C. Dennis, and G. A.Riccardi. PVMGeant — aparallel simulation code forthe CLAS detector at CE-BAF. International Jour-nal of Supercomputer Ap-plications and High Perfor-mance Computing, 9(2):128–

REFERENCES 179

137, Summer 1995. CODENIJSCFG. ISSN 1078-3482.

Dykes:1994:CCP

[DZZY94] S. G. Dykes, XiaodongZhang, Yan Zhou, andHaixu Yang. Communica-tion and computation pat-terns of large scale imageconvolutions on parallel ar-chitectures. In Siegal [Sie94],pages 926–931. ISBN 0-8186-5602-6. LCCN QA76.58.I581994. IEEE catalog no.94CH34819.

Edjlali:1995:DPP

[EASS95] G. Edjlali, G. Agrawal,A. Sussman, and J. Saltz.Data parallel programmingin an adaptive environment.In IEEE [IEE95f], pages827–832. ISBN 0-8186-7074-6. LCCN QA 76.58 I56 1995.IEEE catalog no. 95TH8052.

Elwasif:2001:AMT

[EBKG01] Wael R. Elwasif, David E.Bernholdt, James A. Kohl,and G. A. Geist. An archi-tecture for a multi-threadedharness kernel. LectureNotes in Computer Sci-ence, 2131:126–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310126.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310126.pdf.

Eppstein:1994:CSP

[ED94] M. J. Eppstein and D. E.Dougherty. A compara-tive study of PVM work-station cluster implementa-tions of a two-phase sub-surface flow model. Ad-vances in water resources,17(3):181–??, ???? 1994.CODEN AWREDI. ISSN0309-1708 (print), 1872-9657(electronic).

Eigenmann:2008:ONE

[EdS08] Rudolf Eigenmann and Bro-nis R. de Supinski, editors.OpenMP in a New Era ofParallelism: 4th Interna-tional Workshop, IWOMP2008 West Lafayette, IN,USA, May 12–14, 2008Proceedings, volume 5004of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2008. CO-DEN LNCSD9. ISBN 3-540-79560-X (print), 3-540-79561-8 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-540-79561-2.

ElMaghraoui:2009:MIM

[EDSV09] K. El Maghraoui, Travis J.Desell, Boleslaw K. Szyman-ski, and Carlos A. Varela.Malleable iterative MPI ap-plications. Concurrency

REFERENCES 180

and Computation: Prac-tice and Experience, 21(3):393–413, March 10, 2009.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Eleftheriou:2005:SFF

[EFR+05] M. Eleftheriou, B. G. Fitch,A. Rayshubskiy, T. J. C.Ward, and R. S. Germain.Scalable framework for 3DFFTs on the Blue Gene/Lsupercomputer: Implemen-tation and early perfor-mance measurements. IBMJournal of Research andDevelopment, 49(2/3):457–464, ???? 2005. CO-DEN IBMJAE. ISSN0018-8646 (print), 2151-8556(electronic). URL http://www.research.ibm.com/journal/rd/492/eleftheriou.pdf.

El-Ghazawi:2002:UPP

[EGC02] Tarek El-Ghazawi and FrancoisCantonnet. UPC perfor-mance and potential: aNPB experimental study.In IEEE [IEE02], page ??ISBN 0-7695-1524-X. LCCN???? URL http://www.sc-2002.org/paperpdfs/pap.pap316.pdf.

Eppstein:1992:PGC

[EGDK92] Margaret J. Eppstein, Jo-seph F. Guarnaccia, David EmeryDougherty, and Robert S.Kerr. Parallel groundwatercomputations using PVM. In

Russell et al. [R+92], pages713–720. ISBN 1-85166-871-3 (set), 1-85312-169-X (set:Computational MechanicsPublications, Southamp-ton), 1-56252-098-9 (set:Computational MechanicsPublications, Boston), 1-85166-791-1 (v. 1: Else-vier Applied Science), 1-85312-197-5 (v. 1: Com-putational Mechanics Pub-lications, Southampton), 1-56252-123-3 (v. 1: Compu-tational Mechanics Publica-tions, New York), 1-85166-870-5 (v. 2), 1-85312-198-3 (v. 2), 1-56252-124-1 (v.2). LCCN GB656.2.E42 C651992 v.1-2 (c1992). Two vol-umes.

Eickermann:1999:PID

[EGH99] T. Eickermann, H. Grund,and J. Henrichs. Perfor-mance issues of distributedMPI applications in a Ger-man gigabit testbed. InDongarra et al. [DLM99],pages 3–10. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Erhel:2014:DDM

[EGH+14] Jocelyne Erhel, Martin J.Gander, Laurence Halpern,Geraldine Pichot, TaoufikSassi, and Olof Widlund, ed-itors. Domain Decomposi-tion Methods in Science andEngineering XXI, volume 98of Lecture Notes in Com-

REFERENCES 181

putational Science and En-gineering. Springer-Verlag,Berlin, Germany / Heidel-berg, Germany / London,UK / etc., 2014. ISBN 3-319-05788-X (paperback), 3-319-05789-8 (e-book). ISSN1439-7358 (print), 2197-7100(electronic). LCCN QA71-90. URL http://0-dx.doi.org.fama.us.es/10.1007/978-3-319-05789-7.

Ebrahimirad:2015:EAS

[EGR15] Vahid Ebrahimirad, MaziarGoudarzi, and AboozarRajabi. Energy-awarescheduling for precedence-constrained parallel vir-tual machines in virtual-ized data centers. Journalof Grid Computing, 13(2):233–253, June 2015. CO-DEN ???? ISSN 1570-7873(print), 1572-9184 (elec-tronic). URL http://link.springer.com/article/10.1007/s10723-015-9327-x.

Evans:1992:PCP

[EJL92] D. J. Evans, G. R. Jou-bert, and H. Liddell, editors.Parallel computing ’91: pro-ceedings of the InternationalConference on Parallel Com-puting ’91, London, UK, 3–6September 1991, volume 4 ofAdvances in parallel comput-ing. North-Holland, Amster-dam, The Netherlands, 1992.ISBN 0-444-89212-5. LCCNQA76.58.I545 1991.

Exbrayat:1997:OPS

[EK97] M. Exbrayat and H. Kosch.Offering parallelism to a se-quential database manage-ment system on a networkof workstations using PVM.Lecture Notes in ComputerScience, 1332:457–435, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Eberl:1999:PCP

[EKTB99] M. Eberl, W. Karl, C. Trini-tis, and A. Blaszczyk. Par-allel computing on PC clus-ters — an alternative to su-percomputers for industrialapplications. In Dongarraet al. [DLM99], pages 493–498. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Elamvazuthi:1994:OPA

[EM94] C. Elamvazuthi and G. A.Manson. Occam, PVM andthe alternative construct. InMiles and Chalmers [MC94],pages 56–68. ISBN 90-5199-163-0. LCCN ????

Eigenmann:2000:TMPa

[EM00a] Rudolf Eigenmann and TimMattson. Tutorial M6A:Parallel programming withOpenMP: Part I. In ACM[ACM00], page 21. URLhttp://www.sc2000.org/proceedings/info/fp.pdf.

REFERENCES 182

Eigenmann:2000:TMPb

[EM00b] Rudolf Eigenmann and TimMattson. Tutorial M6B:Parallel programming withOpenMP: Part II. In ACM[ACM00], page 23. URLhttp://www.sc2000.org/proceedings/info/fp.pdf.

Espenica:2002:PPA

[EM02] Roberto Espenica and PedroMedeiros. Porting PVM tothe VIA architecture using afast communication library.Lecture Notes in ComputerScience, 2474:341–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740341.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740341.pdf.

Espinosa:1998:ADP

[EML98] A. Espinosa, T. Margalef,and E. Luque. Auto-matic detection of PVM pro-gram performance problems.Lecture Notes in ComputerScience, 1497:19–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Espinosa:2000:APA

[EML00] Antonio Espinosa, TomasMargalef, and Emilio Luque.Automatic performance anal-ysis of master/worker PVM

applications with Kpi. Lec-ture Notes in ComputerScience, 1908:47–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080047.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080047.pdf.

Ewing:1993:DCW

[EMO+93] R. E. Ewing, D. Mitchum,P. O’Leary, R. C. Sharp-ley, and J. S. Sochacki.Distributed computation ofwave propagation models us-ing PVM. In IEEE [IEE93e],pages 22–31. ISBN 0-8186-4340-4 (paperback), 0-8186-4341-2 (microfiche), 0-8186-4342-0 (hardback), 0-8186-4346-3 (CD-ROM). ISSN1063-9535. LCCN QA76.5.S96 1993.

Engquist:2000:SVG

[Eng00] Bjorn Engquist, editor. Sim-ulation and visualization onthe grid: Parallelldator-centrum, Kungl. TekniskaHogskolan, seventh annualconference, Stockholm, Swe-den, December 1999: pro-ceedings, volume 13 of Lec-ture Notes in Computa-tional Science and Engineer-ing. Springer-Verlag, Berlin,Germany / Heidelberg, Ger-many / London, UK / etc.,2000. ISBN 3-540-67264-

REFERENCES 183

8. ISSN 1439-7358. LCCNQA76.9.C65 S535 2000.

Emani:2015:CDM

[EO15] Murali Krishna Emani andMichael O’Boyle. Celebrat-ing diversity: a mixture ofexperts approach for run-time mapping in dynamicenvironments. ACM SIG-PLAN Notices, 50(6):499–508, June 2015. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Ebner:1996:TFP

[EP96] R. Ebner and A. Pfaffin-ger. Transformation of func-tional programs into dataflow graphs implementedwith PVM. In Bode et al.[BDLS96], pages 251–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Espinosa:1999:REB

[EPML99] A. Espinosa, F. Parcerisa,T. Margalef, and E. Luque.Relating the execution be-haviour with the structureof the application. InDongarra et al. [DLM99],pages 91–100. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Eizenberg:2017:BBL

[EPP+17] Ariel Eizenberg, YuanfengPeng, Toma Pigli, William

Mansky, and Joseph Devi-etti. BARRACUDA: binary-level analysis of runtimeRAces in CUDA programs.ACM SIGPLAN Notices,52(6):126–140, June 2017.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

ElZein:2012:GOC

[ER12] Ahmed H. El Zein and Al-istair P. Rendell. Generat-ing optimal CUDA sparsematrix–vector product im-plementations for evolv-ing GPU hardware. Con-currency and Computation:Practice and Experience,24(1):3–13, January 2012.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

El-Rewini:1995:PTE

[ERS95] H. El-Rewini and B. D.Shriver, editors. Proceed-ings of the Twenty-EighthHawaii International Con-ference on System Sciences.IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 1995. ISBN 0-8186-6935-7. LCCN ????

El-Rewini:1996:PTN

[ERS96] Hesham El-Rewini andBruce D. Shriver, editors.Proceedings of the Twenty-Ninth Hawaii InternationalConference on System Sci-ences (HICSS-29): Wailea,

REFERENCES 184

HI, USA, 3–6 January 1996.IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 1996. ISBN0-8186-7324-9. ISSN 1060-3425. LCCN ???? Five vol-umes.

Ewedafe:2011:PID

[ES11] Simon Uzezi Ewedafe andRio Hirowati Shariffudin.Parallel implementation of2-D telegraphic equation onMPI/PVM cluster. Inter-national Journal of Par-allel Programming, 39(2):202–231, April 2011. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=39&issue=2&spage=202.

Ellingson:2013:SNU

[ESB13] Sally R. Ellingson, Jeremy C.Smith, and Jerome Baudry.Software news and up-dates: VinaMPI: Facilitat-ing multiple receptor high-throughput virtual dockingon high-performance com-puters. Journal of Com-putational Chemistry, 34(25):2212–2221, September30, 2013. CODEN JC-CHDD. ISSN 0192-8651(print), 1096-987X (elec-tronic).

Ewing:1994:DCW

[ESM+94] Richard E. Ewing, Robert C.

Sharpley, Derek Mitchum,P. O’Leary, and J. S.Sochacki. Distributed com-putation of wave propaga-tion models using PVM.IEEE parallel and dis-tributed technology: systemsand applications, 2(1):26–31, Spring 1994. CODENIPDTEX. ISSN 1063-6552(print), 1558-1861 (elec-tronic).

Escaig:1994:PMD

[ETV94] Y. Escaig, G. Touzot, andM. Vayssade. Parallelizationof a multilevel domain de-composition method. Com-puting systems in engi-neering: an internationaljournal, 5(3):253–263, June1994. CODEN COSEEO.ISSN 0956-0521.

Eichenberger:2012:DOT

[ETWaM12] Alexandre E. Eichenberger,Christian Terboven, MichaelWong, and Dieter an Mey.The design of OpenMPthread affinity. LectureNotes in Computer Science,7312:15–28, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_2/.

Eigenmann:2001:OSM

[EV01] Rudolf Eigenmann andMichael J. Voss, editors.OpenMP shared memory

REFERENCES 185

parallel programming: In-ternational Workshop onOpenMP Applications andTools, WOMPAT 2001,West Lafayette, IN, USA,July 30–31, 2001: Pro-ceedings, volume 2104 ofLecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2001. CO-DEN LNCSD9. ISBN 3-540-42346-X (paperback). ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.642 .I589 2001; QA267.A1L43 no.2104. URL http://link.springer-ny.com/link/service/series/0558/tocs/t2104.htm.

Eckert:2016:HAL

[EZBA16] C. H. J. Eckert, E. Zenker,M. Bussmann, and D. Al-bach. HASEonGPU —an adaptive, load-balancedMPI/GPU-code for calcu-lating the amplified spon-taneous emission in highpower laser media. Com-puter Physics Communi-cations, 207(??):362–374,October 2016. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465516301436.

Fabeiro:2016:WPP

[FAF16] Jorge F. Fabeiro, DiegoAndrade, and Basilio B.

Fraguela. Writing a performance-portable matrix multiplica-tion. Parallel Computing, 52(??):65–77, February 2016.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819115001611.

Fabeiro:2015:AGO

[FAFD15] Jorge F. Fabeiro, Diego An-drade, Basilio B. Fraguela,and Ramon Doallo. Au-tomatic generation of op-timized OpenCL codes us-ing OCLoptimizer. TheComputer Journal, 58(11):3057–3073, November 2015.CODEN CMPJA6. ISSN0010-4620 (print), 1460-2067(electronic).

Fang:1998:DDL

[Fan98] Niandong Fang. Distributeddata library and tools for anMPI programming environ-ment, volume 1 of Researchreports in computer science.Shaker, Aachen, Germany,1998. ISBN 3-8265-4101-4. xx + 195 pp. LCCN???? Also published as dis-sertation of the University ofBasel.

Freeman:1994:SMM

[FB94] T. L. Freeman and J. M.Bull. Shared memory andmessage passing implemen-tations of parallel algorithmsfor numerical integration.Lecture Notes in Computer

REFERENCES 186

Science, 879:219–228, 1994.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Fang:1995:PMS

[FB95] Niandong Fang and H. Burkhart.PEMPI — from MPI stan-dard to programming envi-ronment. In IEEE [IEE95j],pages 31–38. ISBN 0-8186-6895-4. LCCN QA76.58 .S341994.

Fang:1996:SPP

[FB96] N. Fang and H. Burkhart.Structured parallel program-ming using MPI. In Lid-dell et al. [LCHS96], pages840–847. ISBN 3-540-61142-8 (paperback). LCCNQA76.88 .H52 1996.

Fang:1997:MDD

[FB97] Niandong Fang and Hel-mar Burkhart. MPI-DDL: a distributed-data li-brary for MPI. FutureGeneration Computer Sys-tems, 12(5):407–419, April1, 1997. CODEN FG-SEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic). URL http://www.elsevier.com/gej-ng/10/19/19/27/17/23/abstract.html.

Fagg:2001:FTM

[FBD01a] Graham E. Fagg, AntoninBukovsky, and Jack J. Don-garra. Fault tolerant MPIfor the HARNESS meta-

computing system. Lec-ture Notes in Computer Sci-ence, 2073:355–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2073/20730355.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2073/20730355.pdf.

Fagg:2001:HFT

[FBD01b] Graham E. Fagg, AntoninBukovsky, and Jack J. Don-garra. HARNESS and faulttolerant MPI. Parallel Com-puting, 27(11):1479–1495,October 2001. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.elsevier.com/gej-ng/10/35/21/47/41/32/abstract.html; http://www.elsevier.nl/gej-ng/10/35/21/47/41/32/article.pdf; http://www.netlib.org/utk/people/JackDongarra/PAPERS/harness-ftmpi-pc.pdf.

Friedel:2001:HMC

[FBSN01] Peter Friedel, Jorg Bergmann,Stephan Seidl, and Wolf-gang E. Nagel. An hierar-chical MPI communicationmodel for the parallelized so-lution of multiple integrals.Lecture Notes in ComputerScience, 2110:474–??, 2001.CODEN LNCSD9. ISSN

REFERENCES 187

0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2110/21100474.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2110/21100474.pdf.

Fagg:2002:FTM

[FBVD02] Graham E. Fagg, AntoninBukovsky, Sathish Vadhi-yar, and Jack J. Dongarra.Fault tolerant MPI for theHARNESS MetaComputingsystem. Technical report????, University of Ten-nessee, Knoxville, Knoxville,TN 37996, USA, 2002. 14 pp.URL http://www.netlib.org/netlib/utk/people/JackDongarra/PAPERS/ft-mpi-iccs-gef.pdf.

Floros:2005:TGS

[FC05] Evangelos Floros and Yian-nis Cotronis. Towards a Gridservices based framework forthe virtualization, executionand composition of MPI ap-plications. Parallel Process-ing Letters, 15(1/2):85–98,March/June 2005. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Falzone:2007:PMF

[FCLG07] Christopher Falzone, An-thony Chan, Ewing Lusk,and William Gropp. Aportable method for finding

user errors in the usage ofMPI collective operations.The International Journal ofHigh Performance Comput-ing Applications, 21(2):155–165, May 2007. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/21/2/155.full.pdf+html.

Ferschweiler:2001:CDP

[FCP+01] Ken Ferschweiler, Mari-acarla Calzarossa, CherriPancake, Daniele Tessera,and Dylan Keon. A com-munity databank for per-formance tracefiles. Lec-ture Notes in Computer Sci-ence, 2131:233–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310233.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310233.pdf.

Filgueira:2012:DCD

[FCS+12] Rosa Filgueira, Jesus Car-retero, David E. Singh, Ale-jandro Calderon, and Al-berto Nunez. Dynamic–CoMPI: dynamic optimiza-tion techniques for MPI par-allel applications. The Jour-nal of Supercomputing, 59(1):361–391, January 2012.CODEN JOSUED. ISSN

REFERENCES 188

0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=59&issue=1&spage=361.

Fagg:1996:PIP

[FD96] Graham Fagg and JackDongarra. PVMPI: Anintegration of PVM andMPI systems. Calcula-teurs Paralleles, 8(2):151–166, 1996. CODEN ????ISSN 1260-3198. URLhttp://www.netlib.org/utk/papers/pvmpi/paper.html; http://www.netlib.org/utk/papers/pvmpi/pvmpi.ps; http://www.netlib.org/utk/people/JackDongarra/pdf/pvmpi.pdf.

Fischer:1997:AAP

[FD97] Markus Fischer and JackDongarra. Another architec-ture: PVM on Windows 95/NT. In ????, editor, Concur-rent Computing Conference,Atlanta, GA, March 10–11,1994, page ?? ????, ????,1997. URL http://www.netlib.org/utk/people/JackDongarra/PAPERS/nt-paper.ps; http://www.netlib.org/utk/people/JackDongarra/pdf/nt-paper.pdf.

Fagg:2000:FMF

[FD00] Graham E. Fagg and Jack J.Dongarra. FT-MPI: FaultTolerant MPI, supporting

dynamic applications in adynamic world. LectureNotes in Computer Sci-ence, 1908:346–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080346.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080346.pdf.

Fagg:2002:HFTa

[FD02a] Graham E. Fagg and Jack J.Dongarra. HARNESS faulttolerant MPI design, us-age and performance is-sues. Technical report????, University of Ten-nessee, Knoxville, Knoxville,TN 37996, USA, 2002.URL http://www.netlib.org/netlib/utk/people/JackDongarra/PAPERS/ft-mpi-fgcs-grid-se.pdf.

Fagg:2002:HFTb

[FD02b] Graham E. Fagg and Jack J.Dongarra. HARNESS faulttolerant MPI design, us-age and performance is-sues. Future GenerationComputer Systems, 18(8):1127–1142, October 2002.CODEN FGSEVI. ISSN0167-739X (print), 1872-7115 (electronic).

Fagg:2004:BUF

[FD04] Graham E. Fagg and Jack J.

REFERENCES 189

Dongarra. Building and us-ing a fault-tolerant MPI im-plementation. The Interna-tional Journal of High Per-formance Computing Ap-plications, 18(3):353–361,Fall 2004. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/18/3/353.full.pdf+html.

Fagg:1997:HMAa

[FDG97a] G. Fagg, J. Dongarra, andA. Geist. HeterogeneousMPI application interop-eration and process man-agement under PVMPI.Technical report CS-97-???, University of Ten-nessee, Knoxville, Knoxville,TN 37996, USA, June1997. URL http://www.netlib.org/utk/papers/pvmmpi97.ps; http://www.netlib.org/utk/people/JackDongarra/pdf/pvmmpi97.pdf.

Fagg:1997:HMAb

[FDG97b] G. E. Fagg, J. J. Don-garra, and A. Geist. Het-erogeneous MPI applicationinteroperation and processmanagement under PVMPI.Lecture Notes in ComputerScience, 1332:91–98, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Falch:2017:RAM

[FE17] Thomas L. Falch andAnne C. Elster. Machinelearning-based auto-tuningfor enhanced performanceportability of OpenCL ap-plications. Concurrencyand Computation: Prac-tice and Experience, 29(8):??, April 25, 2017. CO-DEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Ferenczi:1992:AHW

[Fer92] S. Ferenczi, editor. 1stAustrian-Hungarian Work-shop on Transporter Appli-cations. Proceedings. Hun-garian Acad.of Sci, Bu-dapest, Hungary, 1992.ISBN ???? LCCN ????

Ferrari:1998:JNPb

[Fer98a] Adam Ferrari. JPVM:network parallel comput-ing in Java. Concur-rency: practice and expe-rience, 10(11–13):985–992,September 1998. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract?ID=10050413;http://www3.interscience.

wiley.com/cgi-bin/fulltext?ID=10050413&PLACEBO=IE.pdf. Special Issue: Java forHigh-performance NetworkComputing.

Ferrari:1998:JNPa

[Fer98b] Adam J. Ferrari. JPVM:

REFERENCES 190

Network parallel comput-ing in Java. In ACM[ACM98a], page ?? ISBN???? LCCN ????URL http://www.cs.ucsb.edu/conferences/java98/papers/jpvm.pdf; http://www.cs.ucsb.edu/conferences/java98/papers/jpvm.ps.Possibly unpublished, exceptelectronically.

Fernando:2004:GGP

[Fer04] Randima Fernando, editor.GPU gems: programmingtechniques, tips, and tricksfor real-time graphics, vol-ume 1 of GPU gems. Ad-dison-Wesley, Reading, MA,USA, 2004. ISBN 0-321-22832-4. xvv + 765 pp.LCCN T385 .G6879 2004.US$45.99.

FerreiradaSilva:2010:PBC

[Fer10] Adelino Ferreira da Silva.cudaBayesreg: Bayesiancomputation in CUDA.The R Journal, 2(2):48–55, December 2010. CO-DEN ???? ISSN 2073-4859. URL http://journal.r-project.org/archive/2010-2/RJournal_2010-2_Ferreira~da-Silva.pdf.

Fritzson:1995:PPA

[FF95] Peter Fritzson and LeifFinmo, editors. Paral-lel programming and ap-plications: proceedings ofthe Workshop on Paral-

lel Programming and Com-putation (ZEUS ’95) andthe 4th Nordic TransputerConference (NTUG ’95):Linkoping, Sweden. IOSPress, Postal Drawer 10558,Burke, VA 2209-0558, USA,1995. ISBN 90-5199-229-7(IOS Press), 4-274-90056-8(Ohmsha). LCCN ????

Fava:1999:MPI

[FFB99] A. Fava, M. Fava, andM. Bertozzi. MPIPOV: aparallel implementation ofPOV-Ray based on MPI. InDongarra et al. [DLM99],pages 426–433. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Frugoli:1999:DCH

[FFFC99] G. Frugoli, A. Fava, E. Fava,and G. Conte. Dis-tributed collision handlingfor particle-based simula-tion. In Dongarra et al.[DLM99], pages 410–417.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Fousek:2011:AFC

[FFM11] Jan Fousek, Jiri Filipovic,and Matus Madzin. Au-tomatic fusions of CUDA–GPU kernels for parallelmap. ACM SIGARCH Com-puter Architecture News, 39(4):98–99, September 2011.

REFERENCES 191

CODEN CANED2. ISSN0163-5964 (print), 1943-5851(electronic).

Fernandez:2003:BMN

[FFP03] Juan Fernandez, EitanFrachtenberg, and Fab-rizio Petrini. BCS-MPI: anew approach in the sys-tem software design forlarge-scale parallel comput-ers. In ACM [ACM03],page ?? ISBN 1-58113-695-1. LCCN ???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10716#1; http://www.sc-conference.org/sc2003/paperpdfs/pap306.pdf.

Foster:1998:WAI

[FGG+98] Ian Foster, Jonathan Geisler,William Gropp, NicholasKaronis, Ewing Lusk, GeorgeThiruvathukal, and StevenTuecke. Wide-area imple-mentation of the MessagePassing Interface. Par-allel Computing, 24(12–13):1735–1749, November1, 1998. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.elsevier.com/cas/tree/store/parco/sub/1998/24/12-13/1352.pdf.

Foster:1997:MMC

[FGKT97] Ian Foster, Jonathan Geisler,Carl Kesselman, and StevenTuecke. Managing multi-

ple communication meth-ods in high-performance net-worked computing systems.Journal of Parallel and Dis-tributed Computing, 40(1):35–48, January 10, 1997.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1266/production;http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1266/production/pdf; http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1266/production/ref.

Fagg:2001:PIS

[FGRD01] Graham E. Fagg, EdgarGabriel, Michael Resch,and Jack J. Dongarra.Parallel IO support formeta-computing applica-tions: MPI Connect IO ap-plied to PACX–MPI. Lec-ture Notes in Computer Sci-ence, 2131:135–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310135.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310135.pdf.

Fahringer:2000:FOP

[FGRT00] Thomas Fahringer, MichaelGerndt, Graham Riley, and

REFERENCES 192

Jesper Larsson Traff. For-malizing OpenMP perfor-mance properties with ASL.Lecture Notes in ComputerScience, 1940:428–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400428.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400428.pdf.

Foster:1996:MIW

[FGT96] I. Foster, J. Geisler, andS. Tuecke. MPI on theI-WAY: a wide-area, mul-timethod implementation ofthe Message Passing Inter-face. In IEEE [IEE96i],pages 10–17. ISBN 0-8186-7533-0. LCCN QA76.642.M67 1996.

Fan:1995:DMP

[FH95] W. C. Fan and J. A. Hal-bleib, Sr. Distributed mul-titasking ITS with PVM.Transactions of the Amer-ican Nuclear Society, 72(????):146–147, ???? 1995.CODEN TANSAO. ISSN0003-018X.

Fachat:1997:IEB

[FH97] Andre Fachat and Karl HeinzHoffmann. Implementationof Ensemble-Based Simu-lated Annealing with dy-namic load balancing un-der MPI. Computer Physics

Communications, 107(1–3):49–53, December 1997. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465597000969.

Andre:1998:BVN

[FH98] Andre Fachat and Karl HeinzHoffmann. Blocking vs.non-blocking communica-tion under MPI on a master-workerproblem. Preprint-Reihe des Chemnitzer SFB393 Sonderforschungsbere-ich NumerischeSimulationauf Massiv Parallelen Rech-nern 98,18, UniversitatChemnitz-Zwickau, Chem-nitz, Germany, 1998.

Friedley:2013:OPE

[FHB+13] Andrew Friedley, TorstenHoefler, Greg Bronevetsky,Andrew Lumsdaine, andChing-Chen Ma. Owner-ship passing: efficient dis-tributed memory program-ming on multi-core systems.ACM SIGPLAN Notices, 48(8):177–186, August 2013.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPoPP ’13 Confer-ence proceedings.

Franke:1995:AAV

[FHC+95] E. A. Franke, S. D. Huffman,W. M. Carter, J. P. Baum-gartner, and D. J. Wen-zel. AVTP — an architec-

REFERENCES 193

ture for visualization usingremote parallel/distributedcomputing. In Grinsteinand Erbacher [GE95], pages230–237. CODEN PSISDG.ISBN 0-8194-1757-2. ISSN0277-786X (print), 1996-756X (electronic). LCCNTS510.S63 v.2410.

Field:2001:RTF

[FHK01] Antony J. Field, Thomas L.Hansen, and Paul H. J.Kelly. Run-time fusionof MPI calls in a par-allel C++ library. Lec-ture Notes in Computer Sci-ence, 2017:363–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2017/20170363.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2017/20170363.pdf.

Franke:1994:MMP

[FHP+94] H. Franke, P. Hochschild,P. Pattnaik, J.-P. Prost, andM. Snir. MPI-F: an MPI pro-totype implementation onIBM SP1. In Dongarra andTourancheau [DT94], pages43–55. ISBN 0-89871-343-9.LCCN QA76.58.I568 1994.

Franke:1995:MIS

[FHP+95] H. Franke, P. Hochschild,P. Pattnaik, J.-P. Prost, andM. Snir. MPI on IBM

SP1/SP2: current statusand future directions. InIEEE [IEE95j], pages 39–48.ISBN 0-8186-6895-4. LCCNQA76.58 .S34 1994.

Franke:1994:EIM

[FHPS94a] H. Franke, P. Hochschild,P. Pattnaik, and M. Snir.An efficient implementa-tion of MPI. In Deckerand Rehmann [DR94], pages219–230. ISBN 0-8176-5090-3 (Boston), 3-7643-5090-3 (Basel). LCCNQA76.58.P767 1994.

Franke:1994:MEI

[FHPS94b] H. Franke, P. Hochschild,P. Pattnaik, and M. Snir.MPI-F: An efficient imple-mentation of MPI on IBM-SP1. In Agrawal et al.[ATC94], pages III–197–III–201. ISBN 0-8493-2496-3,0-8493-2495-5. ISSN 0190-3918. LCCN QA 76.58 I551994. Three volumes.

Fang:1999:PMD

[FHSO99] Zhiwu Fang, A. D. J.Haymet, Wataru Shinoda,and Susumu Okazaki. Par-allel molecular dynamicssimulation: Implementationof PVM for a lipid mem-brane. Computer PhysicsCommunications, 116(2–3):295–310, February 1999.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465598000897.

REFERENCES 194

Fineberg:1994:IMM

[Fin94] S. A. Fineberg. Implement-ing multidisciplinary andmulti-zonal applications us-ing MPI. In IEEE [IEE94a],pages 496–503. ISBN 0-8186-6965-9. LCCN QA76.58.S951994. IEEE catalog no.95TH8024.

Fineberg:1995:IMM

[Fin95] Samuel A. Fineberg. Im-plementing multidisciplinaryand multi-zonal applicationsusing MPI. Frontiers ofMassively Parallel Computa-tion — Conference Proceed-ings, pages 496–503, ????1995. IEEE catalog number95TH8024.

Fin:1997:CPM

[Fin97] Torsten Fin. Comparing theperformance of MPI, PVM,and CORBA on EthernetLANs. Berichte zur Rech-nerarchitektur 3(4), Insti-tut fur Informatik, Lehrstuhlfur Rechnerarchitektur und-kommunikation, Friedrich-Schiller-Universitat Jena,Jena, Germany, 1997. 12 pp.

Fink:2000:IMC

[Fin00] Torsten Fink. IntegratingMPI components into meta-computing applications. Lec-ture Notes in Computer Sci-ence, 1908:208–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/

link/service/series/0558/bibs/1908/19080208.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080208.pdf.

Fischer:2001:SAN

[Fis01] Markus Fischer. Systemarea network extensions tothe parallel virtual machine.Lecture Notes in ComputerScience, 2131:98–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310098.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310098.pdf.

Fernandez:2000:UPM

[FJBB+00] Gustavo J. Fernandez, JulioJacobo-Berlles, Patricia Boren-sztejn, Marisa Bauza, andMarta Mejail. Use of PVMfor MAP image restoration:a parallel implementationof the ARTUR algorithm.Lecture Notes in ComputerScience, 1908:113–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080113.htm;http://link.springer-

ny.com/link/service/series/

REFERENCES 195

0558/papers/1908/19080113.pdf.

Forejt:2017:PPA

[FJK+17] Vojtach Forejt, SaurabhJoshi, Daniel Kroening,Ganesh Narayanaswamy,and Subodh Sharma. Pre-cise predictive analysis fordiscovering communicationdeadlocks in MPI programs.ACM Transactions on Pro-gramming Languages andSystems, 39(4):15:1–15:??,September 2017. CODENATPSDT. ISSN 0164-0925(print), 1558-4593 (elec-tronic).

Feng:2014:SBS

[FJZ+14] Xiaowen Feng, Hai Jin, RanZheng, Zhiyuan Shao, andLei Zhu. A segment-basedsparse matrix–vector multi-plication on CUDA. Con-currency and Computation:Practice and Experience, 26(1):271–286, January 2014.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Flower:1994:EJM

[FK94] Jon Flower and Adam Ko-lawa. Express is not justa message passing system:current and future direc-tions in Express. Paral-lel Computing, 20(4):597–614, April 31, 1994. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http:

//www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1994&volume=20&issue=4&aid=860.

Ferenczi:1995:PAH

[FK95] Szabolcs Ferenczi and Pe-ter Kacsuk, editors. Pro-ceedings of the 2nd Austrian-Hungarian Workshop onTransputer Applications:September 29–October 1,1994, Budapest, Hungary.Hungarian Academy of Sci-ences, Central Research In-titute for Physics, Budapest,Hungary, 1995. ISBN ????LCCN ???? Technical re-port KFKI-1995-2/M,N.

Fischer:2001:DNM

[FK01] Markus Fischer and PeterKemper. Distributed numer-ical Markov chain analysis.Lecture Notes in ComputerScience, 2131:272–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310272.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310272.pdf.

Field:2002:OSR

[FKH02] A. J. Field, P. H. J. Kelly,and T. L. Hansen. Op-timising shared reductionvariables in MPI programs.

REFERENCES 196

Lecture Notes in ComputerScience, 2400:630–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2400/24000630.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2400/24000630.pdf.

Foster:1996:MCL

[FKK96a] I. T. Foster, D. R. Kohr, Jr.,and R. Krishnaiyer. MPI asa coordination layer for com-municating HPF tasks. InIEEE [IEE96i], pages 68–78.ISBN 0-8186-7533-0. LCCNQA76.642 .M67 1996.

Foster:1996:CDT

[FKK+96b] I. T. Foster, D. R. Kohr,Jr., R. Krishnaiyer, Choud-hary, and A. Communicat-ing data-parallel tasks: anMPI library for HPF. InIEEE [IEE96a], pages 433–438. ISBN 0-8186-7557-8. LCCN QA76.88.I5751996. IEEE catalog number96TB100074.

Foster:1996:DSB

[FKKC96] Ian Foster, David R. Kohr,Jr., Rakesh Krishnaiyer, andAlok Choudhary. Dou-ble standards: Bringingtask parallelism to HPF viathe message passing inter-face. In ACM [ACM96c],page ?? ISBN 0-89791-854-1. LCCN QA 76.88

S8573 1996. URL http://www.supercomp.org/sc96/proceedings/SC96PROC/FOSTER2/INDEX.HTM. ACM OrderNumber: 415962, IEEEComputer Society Press Or-der Number: RS00126.

Freeh:2008:JTD

[FKLB08] Vincent W. Freeh, NandiniKappiah, David K. Lowen-thal, and Tyler K. Bletsch.Just-in-time dynamic volt-age scaling: Exploiting inter-node slack to save energyin MPI programs. Jour-nal of Parallel and Dis-tributed Computing, 68(9):1175–1185, September 2008.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Foster:1996:GCM

[FKS96] I. Foster, C. Kesselman,and M. Snir. Generalizedcommunicators in the Mes-sage Passing Interface. InIEEE [IEE96i], pages 42–49.ISBN 0-8186-7533-0. LCCNQA76.642 .M67 1996.

Florez:2005:LMM

[FLB+05] German Florez, Zhen Liu,Susan M. Bridges, AnthonySkjellum, and Rayford B.Vaughn. Lightweight mon-itoring of MPI programsin real time. Concurrencyand Computation: Prac-tice and Experience, 17(13):1547–1578, November 2005.CODEN CCPEBO. ISSN

REFERENCES 197

1532-0626 (print), 1532-0634(electronic).

Fagg:1996:TGR

[FLD96] G. E. Fagg, K. S. Lon-don, and J. J. Dongarra.Taskers and general resourcemanagers: PVM support-ing DCE process manage-ment. In Bode et al.[BDLS96], pages 180–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Fagg:1998:MMH

[FLD98] G. E. Fagg, K. S. London,and J. J. Dongarra. MPI-Connect: Managing hetero-geneous MPI applicationsinteroperation and processcontrol. Lecture Notes inComputer Science, 1497:93–??, 1998. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

Fachada:2017:CCF

[FLMR17] Nuno Fachada, Vitor V.Lopes, Rui C. Martins,and Agostinho C. Rosa.cf4ocl: a C framework forOpenCL. Science of Com-puter Programming, 143(??):9–19, September 1, 2017.CODEN SCPGD4. ISSN0167-6423 (print), 1872-7964(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167642317300540.

Feeley:1990:PVM

[FM90] Marc Feeley and James S.Miller. A parallel vir-tual machine for efficientScheme compilation. InACM [ACM90], pages 119–130. ISBN 0-89791-368-X.LCCN QA 76.73 L23 A241990. URL http://www.acm.org/pubs/citations/proceedings/lfp/91556/p119-feeley/. ACM orderno. 552900.

Furlinger:2009:CAE

[FM09] Karl Furlinger and ShirleyMoore. Capturing and an-alyzing the execution con-trol flow of OpenMP appli-cations. International Jour-nal of Parallel Programming,37(3):266–276, June 2009.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=37&issue=3&spage=266.

Fabero:1996:DLB

[FMBM96] J. C. Fabero, I. Martin,A. Bautista, and S. Molina.Dynamic load balancing ina heterogeneous environ-ment under PVM. In IEEE[IEE96g], pages 414–419.ISBN 0-8186-7376-1. LCCNQA76.58 .E97 1996. IEEEorder number PR07376.

Fiala:2012:DCS

[FME+12] David Fiala, Frank Mueller,

REFERENCES 198

Christian Engelmann, RolfRiesen, Kurt Ferreira, andRon Brightwell. Detec-tion and correction of silentdata corruption for large-scale high-performance com-puting. In Hollingsworth[Hol12], pages 78:1–78:??ISBN 1-4673-0804-8. URLhttp://conferences.computer.org/sc/2012/papers/1000a046.pdf.

Filipovic:2015:OCC

[FMFM15] Jirı Filipovic, Matus Madzin,Jan Fousek, and LudekMatyska. Optimizing CUDAcode by kernel fusion: ap-plication on BLAS. TheJournal of Supercomput-ing, 71(10):3934–3957, Oc-tober 2015. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-015-1483-z.

Ferretti:2015:MCH

[FMS15] Marco Ferretti, Mirto Musci,and Luigi Santangelo. MPI–CMS: a hybrid parallel ap-proach to geometrical mo-tif search in proteins. Con-currency and Computation:Practice and Experience,27(18):5500–5516, Decem-ber 25, 2015. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Fan:2017:SEE

[FMSG17] Xing Fan, Mostafa Mehrabi,Oliver Sinnen, and NasserGiacaman. Supporting en-hanced exception handlingwith OpenMP in object–oriented languages. In-ternational Journal of Par-allel Programming, 45(6):1366–1389, December 2017.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic).

Ferenc:1999:VMK

[FNSW99] D. Ferenc, J. Nabrzyski,M. Stroinski, and P. Wierze-jewski. Visual MPI, aknowledge-based system forwriting efficient MPI ap-plications. In Dongarraet al. [DLM99], pages 257–266. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Femminella:1994:PBP

[FO94] A. Femminella and A. Omodeo.PVM-based parallel com-puting: a case study onpower plant simulation. Mi-croprocessing and Micropro-gramming, 40(10-12):875–878, December 1994. CO-DEN MMICDT. ISSN0165-6074 (print), 1878-7061(electronic).

Ford:1995:NNN

[For95] Brian Ford. The newNAG numerical PVM li-

REFERENCES 199

brary (or A new parallelnumerical library based onPVM). In IFIP WorkingGroup 2.5 [IFI95], page ??ISBN ???? LCCN???? URL http://www.nsc.liu.se/~boein/ifip/kyoto/workshop-info/proceedings/ford/ford1.html.

Foster:1998:GEM

[Fos98] Ian Foster. A grid-enabledMPI: Message passing inheterogeneous distributedcomputing systems. InACM [ACM98b], page ??ISBN ???? LCCN???? URL http://www.supercomp.org/sc98/papers/.

Freeman:1992:PNA

[FP92] T. L. (Len) Freeman andC. (Christopher) Phillips.Parallel numerical algo-rithms. Prentice Hall Inter-national Series in ComputerScience. Prentice-Hall Inter-national, Englewood Cliffs,NJ 07632, USA, 1992. ISBN0-13-651597-5. xii + 315pp. LCCN QA76.9.A43 F741992. US$40.00. Chapter 5discusses HPF and PVM.

Faraj:2008:SPA

[FPY08] Ahmad Faraj, Pitch Patara-suk, and Xin Yuan. A studyof process arrival patternsfor MPI collective opera-tions. International Journalof Parallel Programming, 36(6):543–570, December 2008.CODEN IJPPE5. ISSN

0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=36&issue=6&spage=543.

Ferreira:1995:PAI

[FR95] Afonso Ferreira and JoseRolim, editors. Parallelalgorithms for irregularlystructured problems: sec-ond international workshop,IRREGULAR 95, Lyon,France, September, 4–6,1995: proceedings. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 1995.ISBN 3-540-60321-2. LCCNQA76.642.I59 1995.

Franke:1995:MPEa

[Fra95] Hubertus Franke. MPI pro-gramming environment forIBM SP1/SP2. Research re-port RC 19991 (88480), IBMT. J. Watson Research Cen-ter, Yorktown Heights, NY,USA, 1995. 9 pp.

Fritscher:1993:PDC

[FS93] J. F. Fritscher and F. Sukup.93SC038 parallel distributedcomputing using PVM. InAnonymous [Ano93a], pages221–228. ISBN 0-947719-62-8. LCCN ????

Ferrari:1995:TDC

[FS95] A. J. Ferrari and V. S. Sun-deram. TPVM: distributedconcurrent computing with

REFERENCES 200

lightweight processes. InIEEE [IEE95k], pages 211–218. ISBN 0-8186-7088-6. LCCN QA76.9.D5 I3281995. IEEE catalog no.95TB8075.

Fischer:1997:ESP

[FS97] M. Fischer and J. Simon.Embedding SCI into PVM.Lecture Notes in ComputerScience, 1332:177–184, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Ferrari:1998:MDC

[FS98] Adam Ferrari and V. S.Sunderam. Multiparadigmdistributed computing withTPVM. Concurrency: prac-tice and experience, 10(3):199–228, March 1998. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract?ID=5374;http://www3.interscience.

wiley.com/cgi-bin/fulltext?ID=5374&PLACEBO=IE.pdf.

Filgueira:2011:ACE

[FSC+11] Rosa Filgueira, David E.Singh, Jesus Carretero, Ale-jandro Calderon, and FelixGarcıa. Adaptive-CoMPI:Enhancing MPI-based ap-plications’ performance andscalability by using adap-tive compression. TheInternational Journal ofHigh Performance Comput-ing Applications, 25(1):93–114, February 2011. CODEN

IHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/25/1/93.full.pdf+html.

Fuerle:1998:IPC

[FSLS98] T. Fuerle, E. Schikuta,C. Loeffelhardt, and K. Stockinger.On the implementation of aportable, client-server basedMPI-IO interface. LectureNotes in Computer Science,1497:172–??, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Fumero:2017:JTG

[FSSD17] Juan Fumero, Michel Steuwer,Lukas Stadler, and ChristopheDubach. Just-in-time GPUcompilation for interpretedlanguages with partial eval-uation. ACM SIGPLAN No-tices, 52(7):60–73, July 2017.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Folino:1998:EMC

[FST98a] G. Folino, G. Spezzano,and D. Talia. Evaluatingand modeling communica-tion overhead of MPI prim-itives on the Meiko CS-2.Lecture Notes in ComputerScience, 1497:27–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

REFERENCES 201

Folino:1998:PEM

[FST98b] G. Folino, G. Spezzano,and D. Talia. Perfor-mance evaluation and mod-elling of MPI communica-tions on the Meiko CS-2.Lecture Notes in ComputerScience, 1401:932–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Fernandez:1999:PGP

[FSTG99] F. Fernandez, J. M. Sanchez,M. Tomassini, and J. A.Gomez. A parallel geneticprogramming tool based onPVM. In Dongarra et al.[DLM99], pages 241–248.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Fang:2014:API

[FSV14] Jianbin Fang, Henk Sips,and Ana Lucia Varbanescu.Aristotle: A performanceimpact indicator for theOpenCL kernels using lo-cal memory. Scientific Pro-gramming, 22(3):239–257,???? 2014. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Feng:2014:MSP

[FSXZ14] Chunsheng Feng, Shi Shu,Jinchao Xu, and Chen-SongZhang. A multi-stage pre-conditioner for the black

oil model and its OpenMPimplementation. In Er-hel et al. [EGH+14], pages141–153. ISBN 3-319-05788-X (paperback), 3-319-05789-8 (e-book). ISSN1439-7358 (print), 2197-7100(electronic). LCCN QA71-90. URL http://link.springer.com/chapter/10.1007/978-3-319-05789-7_11/.

Fernandez:2000:DCE

[FTVB00] Francisco Fernandez, MarcoTomassini, Leonardo Van-neschi, and Laurent Bucher.A distributed computing en-vironment for genetic pro-gramming using MPI. Lec-ture Notes in Computer Sci-ence, 1908:322–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080322.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080322.pdf.

Fujimoto:2008:DMV

[Fuj08] Noriyuki Fujimoto. Densematrix-vector multiplicationon the CUDA architec-ture. Parallel ProcessingLetters, 18(4):511–530, De-cember 2008. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

REFERENCES 202

Fagg:2000:AAC

[FVD00] Graham E. Fagg, Sathish S.Vadhiyar, and Jack J.Dongarra. ACCT: Auto-matic Collective Commu-nications Tuning. LectureNotes in Computer Sci-ence, 1908:354–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080354.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080354.pdf.

Fang:2015:EVD

[FVLS15] Jianbin Fang, Ana LuciaVarbanescu, Xiangke Liao,and Henk Sips. Evaluat-ing vector data type us-age in OpenCL kernels.Concurrency and Computa-tion: Practice and Experi-ence, 27(17):4586–4602, De-cember 10, 2015. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Fineberg:1996:PPI

[FWNK96] S. A. Fineberg, P. Wong,B. Nitzberg, and C. Kusz-maul. PMPIO-a portableimplementation of MPI-IO.In IEEE [IEE96c], pages188–195. ISBN 0-8186-7551-9. LCCN QA76.58 .S951996. IEEE catalog number96TB100062.

Franke:1995:MPEb

[FWR+95] Hubertus Franke, C. EricWu, Michel Riviere, PratapPattnaik, and Marc Snir.MPI programming environ-ment for IBM SP1/SP2. InIEEE [IEE95i], pages 127–135. ISBN 0-8186-7025-8.LCCN ???? IEEE catalognumber 95CH35784.

Frust:2017:RDP

[FWS+17] Tobias Frust, Michael Wag-ner, Jan Stephan, GuidoJuckeland, and Andre Bieberle.Rapid data processing forultrafast X-ray computedtomography using scalableand modular CUDA basedpipelines. Computer PhysicsCommunications, 219(??):353–360, October 2017. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465517301674.

Grangeat:1996:PTI

[GA96] Pierre Grangeat and Jean-Louis Amans, editors. Pro-ceedings of the Third Inter-national Meeting on FullyThree-Dimensional ImageReconstruction in Radiol-ogy and Nuclear Medicine,held July 4–6, 1995 at Do-maine d’Aix-Marlioz, Aix-les-Bains, France. KluwerAcademic Publishers Group,Norwell, MA, USA, and Dor-drecht, The Netherlands,

REFERENCES 203

1996. ISBN 0-7923-4129-5.LCCN R857.T47 T485 1996.

Galibert:1997:YCL

[Gal97] O. Galibert. YLC, A C++Linda system on top ofPVM. Lecture Notes inComputer Science, 1332:99–106, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Gonzalez:2000:NSF

[GAM+00] Marc Gonzalez, EduardAyguade, Xavier Martorell,Jesus Labarta, Nacho Navarro,and Jose Oliver. NanosCom-piler: supporting flexiblemultilevel parallelism ex-ploitation in OpenMP. Con-currency: practice and ex-perience, 12(12):1205–1218,October 2000. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500358/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500358&PLACEBO=IE.pdf.

Gonzalez:2002:DLP

[GAM+02] Marc Gonzalez, EduardAyguade, Xavier Martorell,Jesus Labarta, and Phu V.Luong. Dual-level par-allelism exploitation withOpenMP in coastal oceancirculation modeling. Lec-ture Notes in Computer Sci-ence, 2327:469–??, 2002.

CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270469.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270469.pdf.

Gonzalez:2001:DSP

[GAML01] M. Gonzalez, E. Ayguade,X. Martorell, and J. Labarta.Defining and supportingpipelined executions inOpenMP. Lecture Notesin Computer Science, 2104:155–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040155.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040155.pdf.

Gonzalez:2000:PAM

[GAMR00] Daniel Gonzalez, Fran-cisco Almeida, Luz Ma-rina Moreno, and CasianoRodrıguez. Pipeline al-gorithms on MPI: Opti-mal mapping of the pathplaning problem. LectureNotes in Computer Sci-ence, 1908:104–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http:

REFERENCES 204

//link.springer-ny.com/link/service/series/0558/bibs/1908/19080104.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080104.pdf.

Gao:2003:LSP

[Gao03] Shiwu Gao. Linear-scalingparallelization of the WIENpackage with MPI. Com-puter Physics Communi-cations, 153(2):190–198,June 15, 2003. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465503002248.

Galaktionov:1997:MST

[GAP97] A. S. Galaktionov, P. D. An-derson, and G. W. M. Pe-ters. Mixing simulations:Tracking strongly deformingfluid volumes in 3D flows.Lecture Notes in ComputerScience, 1332:436–469, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Gates:1995:PFI

[Gat95] W. Lawrence (William Lawrence)Gates, editor. Proceedingsof the First InternationalAMIP Scientific Conference:Monterey, California, USA,15–19 May 1995, number732 in World Meteorologi-cal Organization — Publi-cations — WMO TD 1995.

World Meteorological Orga-nization, Geneva, Switzer-land, 1995. ISBN ????LCCN SIO 1 WO326 v.92.

Gonzalez-Alvarez:2017:HMO

[GAVRRL17] David L. Gonzalez-Alvarez,Miguel A. Vega-Rodrıguez,and Alvaro Rubio-Largo.A hybrid MPI/OpenMPparallel implementation ofNSGA–II for finding pat-terns in protein sequences.The Journal of Supercom-puting, 73(6):2285–2312,June 2017. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic).

Gupta:1994:CTE

[GB94] M. Gupta and P. Banerjee.Compile-time estimation ofcommunication costs of pro-grams. Journal of Program-ming Languages, 2(3):191–225, September 1994. CO-DEN JPLAER. ISSN 0963-9306.

Ghosh:1996:ELM

[GB96] K. Ghosh and S. Breit. Eval-uating the limits of mes-sage passing via the sharedattraction memory on CC-COMA machines: Experi-ences with TCGMSG andPVM. In ACM [ACM96b],pages 173–180. ISBN 0-89791-803-7. LCCN QA76.5I61 1996. ACM order num-ber 415961.

REFERENCES 205

Gorlatch:1998:GMI

[GB98] Sergei Gorlatch and Hol-ger Bischof. A genericMPI implementation for adata-parallel skeleton: For-mal derivation and applica-tion to FFT. Parallel Pro-cessing Letters, 8(4):447–??,December 1998. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Geist:1994:PPV

[GBD+94] Al Geist, Adam Beguelin,Jack Dongarra, WeichengJiang, Robert Manchek, andVaidyalingam S. Sunderam.PVM: Parallel Virtual Ma-chine: a Users’ Guide andTutorial for Networked Par-allel Computing. Scien-tific and engineering com-putation. MIT Press, Cam-bridge, MA, USA, 1994.ISBN 0-262-57108-0 (paper-back). xvii + 279 pp.LCCN QA76.58 .P85 1994.US$27.50. URL http://www.mitpress.com/book-home.tcl?isbn=0262571080.

Gentzsch:1995:STP

[GBF95] W. Gentzsch, U. Block, andF. Ferstl. Software toolsfor parallel computers andworkstation clusters. In Fer-enczi and Kacsuk [FK95],pages 23–42. ISBN ????LCCN ???? Technical reportKFKI-1995-2/M,N.

Golebiewski:1999:HPI

[GBH99] M. Golebiewski, M. Baum,and R. Hempel. High per-formance implementation ofMPI for myrinet. Lec-ture Notes in Computer Sci-ence, 1557:510–521, 1999.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Gerstenberger:2014:EHS

[GBH14] Robert Gerstenberger, Ma-ciej Besta, and TorstenHoefler. Enabling highly-scalable remote memory ac-cess programming with MPI-3 One Sided. ScientificProgramming, 22(2):75–91,???? 2014. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Gabriel:1997:EMU

[GBR97] Edgar Gabriel, ThomasBeisel, and Michael Resch.Erweiterung einer MPI-Umgebung zur Interoper-abilitat verteilter MPP-Systeme. (German) [Exten-sion of an MPI environ-ment for interoperabilitywith distributed MPI sys-tems]. Studienarbeit ange-wandte Informatik RUS 37,Rechenzentrum UniversitatStuttgart, Stuttgart, Ger-many, 1997.

Garain:2015:CCF

[GBR15] Sudip Garain, Dinshaw S.Balsara, and John Reid.

REFERENCES 206

Comparing Coarray For-tran (CAF) with MPI forseveral structured meshPDE applications. Journalof Computational Physics,297(??):237–253, Septem-ber 15, 2015. CODENJCTPAH. ISSN 0021-9991(print), 1090-2716 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S002199911500354X.

Graham:2007:OMH

[GBS+07] Richard L. Graham, Brian W.Barrett, Galen M. Ship-man, Timothy S. Woodall,and George Bosilca. OpenMPI: a high performance,flexible implementation ofMPI point-to-point commu-nications. Parallel Pro-cessing Letters, 17(1):79–88, March 2007. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Grove:2005:CBP

[GC05] D. A. Grove and P. D.Coddington. Communi-cation benchmarking andperformance modelling ofMPI programs on clustercomputers. The Journalof Supercomputing, 34(2):201–217, November 2005.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&

issn=0920-8542&volume=34&issue=2&spage=201.

Garcia:2012:DLB

[GCBL12] Marta Garcia, Julita Cor-balan, Rosa Maria Badia,and Jesus Labarta. A dy-namic load balancing ap-proach with SMPSuper-scalar and MPI. LectureNotes in Computer Science,7174:10–23, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30397-5_2/.

GarciaSalcines:1997:PRR

[GCBM97] E. Garcia Salcines, G. Cer-ruela Garcia, J. I. Bena-vides Benitez, and F. MunozGarcia. Parallel renderingof radiance on distributedmemory system by PVM.Lecture Notes in ComputerScience, 1332:502–507, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Garcia:1999:MMI

[GCC99] F. Garcia, A. Calderon, andJ. Carretero. MiMPI: amultithread-safe implemen-tation of MPI. In Dongarraet al. [DLM99], pages 207–214. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

REFERENCES 207

Garcia-Consuegra:1998:DGR

[GCGS98] J. D. Garcia-Consuegra,J. A. Gallud, and G. Se-bastian. Distributed geore-ferring of remotely sensedLandsat-TM imagery us-ing MPI. Lecture Notesin Computer Science, 1541:161–166, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Gelado:2010:ADS

[GCN+10] Isaac Gelado, Javier Cabezas,Nacho Navarro, John E.Stone, Sanjay Patel, andWen mei W. Hwu. An asym-metric distributed sharedmemory model for hetero-geneous parallel systems.ACM SIGPLAN Notices, 45(3):347–358, March 2010.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Gao:2013:GGA

[GCN+13] Mingcen Gao, Thanh-TungCao, Ashwin Nanjappa,Tiow-Seng Tan, and ZhiyongHuang. gHull: a GPU al-gorithm for 3D convex hull.ACM Transactions on Math-ematical Software, 40(1):3:1–3:19, September 2013.CODEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic).

Geist:1993:PTW

[GDB+93] A. Geist, J. Dongarra,A. Beguelin, B. Manchek,and Weicheng Jiang. PVMtakes over the world. InIEEE [IEE93e], page 618.ISBN 0-8186-4340-4 (paper-back), 0-8186-4341-2 (mi-crofiche), 0-8186-4342-0 (hard-back), 0-8186-4346-3 (CD-ROM). ISSN 1063-9535.LCCN QA76.5 .S96 1993.

Galizia:2015:MCL

[GDC15] Antonella Galizia, DanieleD’Agostino, and AndreaClematis. An MPI–CUDAlibrary for image process-ing on HPC architectures.Journal of Computationaland Applied Mathemat-ics, 273(??):414–427, Jan-uary 1, 2015. CODENJCAMDI. ISSN 0377-0427(print), 1879-1778 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0377042714002374.

Grinstein:1995:VDE

[GE95] Georges G. Grinstein andRobert F. Erbacher, ed-itors. Visual data ex-ploration and analysis II:8–10 February 1995, SanJose, California, volume2410 of Proceedings of theSPIE — The InternationalSociety for Optical Engi-neering. Society of Photo-optical Instrumentation En-gineers (SPIE), Bellingham,WA, USA, 1995. CODEN

REFERENCES 208

PSISDG. ISBN 0-8194-1757-2. ISSN 0277-786X(print), 1996-756X (elec-tronic). LCCN TS510.S63v.2410.

Grinstein:1996:VDE

[GE96] Georges G. Grinstein andRobert F. Erbacher, editors.Visual data exploration andanalysis III: 31 January–2February, 1996, San Jose,California, volume 2421 (or2656??) of Proceedings of theSPIE — The InternationalSociety for Optical Engi-neering. Society of Photo-optical Instrumentation En-gineers (SPIE), Bellingham,WA, USA, 1996. CODENPSISDG. ISBN 0-8194-2030-1. ISSN 0277-786X(print), 1996-756X (elec-tronic). LCCN TS510.S63v.2656.

Geist:1993:ILP

[Gei93a] G. A. Geist. Invited lec-ture: PVM 3 beyond net-work computing. In Volk-ert [Vol93], pages 194–203. ISBN 3-540-57314-3 (Berlin), 0-387-57314-3(New York). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA267.A1L43 no.734. DM58.00.

Geist:1993:PBN

[Gei93b] G. A. Geist. PVM 3 be-yond network computing. InVolkert [Vol93], pages 194–203. ISBN 3-540-57314-3 (Berlin), 0-387-57314-3

(New York). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA267.A1L43 no.734. DM58.00.

Geist:1994:CCW

[Gei94] G. A. Geist. Cluster com-puting: the wave of thefuture? In Dongarraand Wasniewski [DW94],pages 236–246. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8 (New York). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58.P35 1994. DM104.00.

Geist:1996:APP

[Gei96] G. A. Geist. Advanced pro-gramming in PVM. In Bodeet al. [BDLS96], pages 1–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Geist:1997:ACP

[Gei97] G. A. Geist. Advanced ca-pabilities in PVM 3.4. Lec-ture Notes in Computer Sci-ence, 1332:107–115, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Geist:1998:HNG

[Gei98] G. A. Geist. Harness:The next generation beyondPVM. Lecture Notes inComputer Science, 1497:74–??, 1998. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

REFERENCES 209

Geist:2000:PMW

[Gei00] Al Geist. PVM and MPI:What else is needed forcluster computing? Lec-ture Notes in ComputerScience, 1908:1–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080001.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080001.pdf.

Geist:2001:BFN

[Gei01] G. Al Geist. Building a foun-dation for the next PVM:Petascale Virtual Machines.Lecture Notes in ComputerScience, 2131:2–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310002.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310002.pdf.

Grabowsky:1998:NMP

[GEW98] Lothar Grabowsky, ThomasErmer, and Jorg Werner.Nutzung von MPI fur par-allele FEM-Systeme. (Ger-man) [Use of MPI for paral-lel FEM systems]. Preprint-Reihe des Chemnitzer SFB393 Sonderforschungsbereich

NumerischeSimulation aufMassiv Parallelen Rechnern97,08; RA-TR 02-97, Uni-versitat Chemnitz-Zwickau,Chemnitz, Germany, 1998.

Gabriel:2003:FTC

[GFB+03] Edgar Gabriel, Graham E.Fagg, Antonin Bukovsky,Thara Angskun, and Jack J.Dongarra. A fault-tolerantcommunication library forGrid environments. In????, editor, 17th AnnualACM International Con-ference on Supercomput-ing (ICS’03) InternationalWorkshop on Grid Com-puting and e-Science, June21, 2003, San Francisco,page ?? ????, ????, 2003.ISBN ???? LCCN ????URL http://www.netlib.org/netlib/utk/people/JackDongarra/PAPERS/FTMPI-SF-gabriel.pdf.

Gonina:2014:SMC

[GFB+14] Ekaterina Gonina, Ger-ald Friedland, Eric Bat-tenberg, Penporn Koanan-takool, Michael Driscoll,Evangelos Georganas, andKurt Keutzer. Scalablemultimedia content analysison parallel platforms usingPython. ACM Transactionson Multimedia Computing,Communications, and Ap-plications, 10(2):18:1–18:??,February 2014. CODEN???? ISSN 1551-6857

REFERENCES 210

(print), 1551-6865 (elec-tronic).

Gabriel:2003:EPM

[GFD03] Edgar Gabriel, GrahamFagg, and Jack Dongarra.Evaluating the performanceof MPI-2 dynamic commu-nicators and one-sided com-munication. In Dongarraet al. [DLO03], page ?? CO-DEN LNCSD9. ISBN 3-540-20149-1. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.E973 2003. URL http://www.netlib.org/netlib/utk/people/JackDongarra/PAPERS/europvm-mpi-2003-mpi2.pdf.

Gabriel:2005:EDC

[GFD05] Edgar Gabriel, Graham E.Fagg, and Jack J. Don-garra. Evaluating dynamiccommunicators and one-sided operations for cur-rent MPI libraries. TheInternational Journal ofHigh Performance Comput-ing Applications, 19(1):67–79, Spring 2005. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/19/1/67.full.pdf+html.

Gravvanis:2012:SFD

[GFPG12] G. A. Gravvanis, C. K.Filelis-Papadopoulos, andK. M. Giannoutakis. Solv-ing finite difference linear

systems on GPUs: CUDAbased parallel explicit pre-conditioned biconjugate con-jugate gradient type meth-ods. The Journal of Su-percomputing, 61(3):590–604, September 2012. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=61&issue=3&spage=590.

Giordano:1999:IBP

[GFV99] M. Giordano, M. M. Furnari,and F. Vitobello. Interactionbetween PVM parametersand communication perfor-mances on ATM networks.Lecture Notes in ComputerScience, 1557:586–587, 1999.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Garzon:1999:PIE

[GG99] E. M. Garzon and I. Gar-cia. A parallel imple-mentation of the eigenprob-lem for large, symmetricand sparse matrices. InDongarra et al. [DLM99],pages 380–387. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Giannoutakis:2009:DIP

[GG09] Konstantinos M. Gian-noutakis and George A.

REFERENCES 211

Gravvanis. Design and im-plementation of parallel ap-proximate inverse classes us-ing OpenMP. Concurrencyand Computation: Prac-tice and Experience, 21(2):115–131, February 2009.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Giannoutakis:2007:MHP

[GGC+07] K. M. Giannoutakis, G. A.Gravvanis, B. Clayton,A. Patil, T. Enright, andJ. P. Morrison. Match-ing high performance ap-proximate inverse precon-ditioning to architecturalplatforms. The Journalof Supercomputing, 42(2):145–163, November 2007.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=42&issue=2&spage=145.

Gallud:2001:EDF

[GGCGO01] J. A. Gallud, J. Garcıa-Consuegra, J. M. Garcıa,and L. Orozco. Evaluatingthe DIPORSI framework:Distributed processing of re-motely sensed imagery. Lec-ture Notes in Computer Sci-ence, 2131:401–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/

link/service/series/0558/bibs/2131/21310401.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310401.pdf.

Gallud:1999:DPR

[GGCM99] J. A. Gallud, J. Garcia-Consuegra, and A. Mar-tinez. Distributed pro-cessing of remotely sensedLandsat-TM imagery usingMPI. Parallel and Dis-tributed Computing Prac-tices, 2(2):??, ???? 1999.CODEN ???? ISSN 1097-2803. URL http://www.cs.okstate.edu/~pdcp/vols/vol02/vol02no2abs.html#gallud.

Gallud:1999:CCU

[GGGC99] J. A. Gallud, J. M. Garcia,and J. Garcia-Consuegra.Cluster computing usingMPI and Windows NT tosolve the processing of re-motely sensed imagery. InDongarra et al. [DLM99],pages 442–449. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Godlevsky:1999:PSA

[GGH99] A. Godlevsky, M. Gazak,and L. Hluchy. Parallelizingof sequential annotated pro-grams in PVM environment.In Dongarra et al. [DLM99],pages 517–524. ISBN 3-540-66549-8 (softcover). ISSN

REFERENCES 212

0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Geist:1996:MEM

[GGHL+96] A. Geist, W. Gropp, S. Huss-Lederman, A. Lumsdaine,E. Lusk, W. Saphir, T. Skjel-lum, and M. Snir. MPI-2: extending the Message-Passing Interface. In Bougeet al. [BFMR96], pages128–135. ISBN 3-540-61626-8 (vol. 1), 3-540-61627-6 (vol. 2). ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I554 1996, QA267.A1L43 no.1123-1124. Two vol-umes.

Gawman:1993:PCT

[GGK+93] Ann Gawman, W. MorvenGentleman, E. Kidd, Per-Ake Larson, and J. Slonim,editors. Proceedings CAS-CON ’93: Toronto, On-tario, Canada, 24–28 Octo-ber 1993. Nat. Res. Coun-cil of Canada, Ottawa, Ont.,Canada, 1993. ISBN ????LCCN QA76.76.S64 C3781993 v.1-2. Two volumes.

Genaud:2008:EPC

[GGL+08] Stephane Genaud, PierreGancarski, Guillaume Latu,Alexandre Blansche, ChoopanRattanapoka, and DamienVouriot. Exploitation ofa parallel clustering algo-rithm on commodity hard-ware with P2P-MPI. TheJournal of Supercomputing,

43(1):21–41, January 2008.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=43&issue=1&spage=21.

Getov:1999:MJM

[GGS99] Vladimir Getov, Paul Gray,and Vaidy Sunderam. MPIand Java-MPI: Contrastsand comparisons of low-level communication perfor-mance. In ACM [ACM99],page ??

Gentzsch:1994:HPC

[GH94] Wolfgang Gentzsch and UweHarms, editors. High-performance computing andnetworking: internationalconference and exhibition,Munich, Germany, April18–20, 1994: proceedings,volume 797 of Lecture notesin computer science. Spring-er-Verlag, Berlin, Ger-many / Heidelberg, Ger-many / London, UK /etc., 1994. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Ghosh:2012:RAA

[GHD12] Sudeep Ghosh, Jason Hiser,and Jack W. Davidson. Re-placement attacks againstVM-protected applications.ACM SIGPLAN Notices,

REFERENCES 213

47(7):203–214, July 2012.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). VEE ’12 conferenceproceedings.

Grebe:1993:TAS

[GHH+93] R. Grebe, J. Hektor, S. C.Hilton, M. R. Jane, andP. H. Welch, editors. Trans-puter applications and sys-tems ’93: proceedings ofthe 1993 World TransputerCongress, 20–22 Septem-ber 1993, Aachen, Germany.IOS Press, Postal Drawer10558, Burke, VA 2209-0558,USA, 1993. ISBN 90-5199-140-1. LCCN ????Goumopoulos:1997:PCS

[GHL97] C. Goumopoulos, E. Housos,and O. Liljenzin. Parallelcrew scheduling on worksta-tion networks using PVM.Lecture Notes in ComputerScience, 1332:470–477, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Gropp:1998:MCR

[GHLL+98] William Gropp, StevenHuss-Lederman, AndrewLumsdaine, Ewing Lusk, BillNitzberg, William Saphir,and Marc Snir. MPI: TheComplete Reference. Volume2, The MPI-2 Extensions.Scientific and EngineeringComputation. MIT Press,Cambridge, MA, USA, sec-ond edition, 1998. ISBN

0-262-57123-4 (vol. 2), 0-262-69216-3 (set). 350pp. LCCN QA76.642 .M651998. US$30 (paperback).URL http://mitpress.mit.edu/book-home.tcl?isbn=0262571234. See alsovolume 1 [SOHL+98].

Gong:2012:OCN

[GHZ12] Yifan Gong, Bingsheng He,and Jianlong Zhong. Anoverview of CMPI: networkperformance aware MPI inthe cloud. ACM SIG-PLAN Notices, 47(8):297–298, August 2012. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPOPP ’12 confer-ence proceedings.

Garcia:2011:KRR

[GJLT11] Saturnino Garcia, Dongh-wan Jeon, Christopher M.Louie, and Michael BedfordTaylor. Kremlin: rethinkingand rebooting gprof for themulticore age. ACM SIG-PLAN Notices, 46(6):458–469, June 2011. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Grecki:1997:MPE

[GJN97] M. Grecki, G. Jablonski,and A. Napieralski. MOPS— parallel environment forsimulation of electronic cir-cuits using physical mod-els of semiconductor devices.

REFERENCES 214

Lecture Notes in ComputerScience, 1332:478–485, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Gerlach:2001:IOJ

[GJP01] Jens Gerlach, Zheng-YuJiang, and Hans-WernerPohl. Integrating OpenMPinto Janus. Lecture Notesin Computer Science, 2104:101–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040101.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040101.pdf.

Genaud:2009:FMP

[GJR09] Stephane Genaud, Em-manuel Jeannot, and ChoopanRattanapoka. Fault-managementin P2P-MPI. Interna-tional Journal of Paral-lel Programming, 37(5):433–461, October 2009. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=37&issue=5&spage=433.

Gillett:1997:UMC

[GK97] Richard Gillett and RichardKaufmann. Using the Mem-

ory Channel Network —using a cluster of stan-dard PCI-based servers witha low-cost network to im-prove communication perfor-mance. IEEE Micro, 17(1):19–25, January/February1997. CODEN IEMIDZ.ISSN 0272-1732 (print),1937-4143 (electronic).

Granat:2010:PSS

[GK10] Robert Granat and Bo Kagstrom.Parallel solvers for Sylvester-type matrix equations withapplications in conditionestimation, Part I: The-ory and algorithms. ACMTransactions on Mathemat-ical Software, 37(3):32:1–32:32, September 2010. CO-DEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic).

Grasso:2013:APS

[GKCF13] Ivan Grasso, Klaus Kofler,Biagio Cosenza, and ThomasFahringer. Automatic prob-lem size sensitive task par-titioning on heterogeneousparallel systems. ACM SIG-PLAN Notices, 48(8):281–282, August 2013. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPoPP ’13 Confer-ence proceedings.

Granat:2009:NPQ

[GKK09] Robert Granat, Bo Kagstrom,and Daniel Kressner. A

REFERENCES 215

novel parallel QR algorithmfor hybrid distributed mem-ory HPC systems. LAPACKWorking Note 216, Depart-ment of Computing Scienceand HPC2N, Umea Univer-sity, S-901 Umea, Sweden,April 2009. URL http://www.netlib.org/lapack/lawnspdf/lawn216.pdf.

Gropp:1995:MGX

[GKL95] W. Gropp, E. Karrels, andE. Lusk. MPE graphics-scalable X11 graphics inMPI. In IEEE [IEE95j],pages 49–54. ISBN 0-8186-6895-4. LCCN QA76.58 .S341994.

Guan:1997:PDI

[GkLyCY97] Huiwei Guan, Chi kwongLi, To yat Cheung, andSongnian Yu. Parallel de-sign and implementationof SOM neural computingmodel in PVM environmentof a distributed system. InIEEE [IEE97a], pages 26–31. ISBN 0-8186-7876-3 (pa-perback and case), 0-8186-7878-X (microfiche). LCCNQA76.58 .A4 1997.

Geist:1996:VDP

[GKP96] G. A. Geist, James Kohn,and Philip Papadopou-los. Visualization, debug-ging, and performance inPVM. Technical report, OakRidge National Laboratory,Knoxville, TN, USA, 1996.11 pp. URL http://www.

epm.ornl.gov/~geist/CapeCod.ps.

Geist:1997:CPF

[GKP97] G. A. Geist, II, James ArthurKohl, and Philip M. Pa-padopoulos. CUMULVS:Providing fault tolerance, vi-sualization, and steering ofparallel applications. Inter-national Journal of Super-computer Applications andHigh Performance Com-puting, 11(3):224–235, Fall1997. CODEN IJSCFG.ISSN 1078-3482.

Geist:1997:BPW

[GKPS97] G. A. Geist, J. A. Kohl,P. M. Papadopoulos, andS. L. Scott. Beyond PVM3.4: What we’ve learned,what’s next, and why. Lec-ture Notes in Computer Sci-ence, 1332:116–126, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Gopalakrishnan:2011:FAM

[GKS+11] Ganesh Gopalakrishnan,Robert M. Kirby, StephenSiegel, Rajeev Thakur,William Gropp, Ewing Lusk,Bronis R. De Supinski, Mar-tin Schulz, and Greg Bron-evetsky. Formal analysisof MPI-based parallel pro-grams. Communicationsof the ACM, 54(12):82–91,December 2011. CODENCACMA2. ISSN 0001-0782(print), 1557-7317 (elec-tronic).

REFERENCES 216

Garland:2012:DUP

[GKZ12] Michael Garland, Man-junath Kudlur, and YiliZheng. Designing a uni-fied programming modelfor heterogeneous machines.In Hollingsworth [Hol12],pages 67:1–67:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a064.pdf.

Gropp:1992:TIM

[GL92] Bill Gropp and Ewing Lusk.A test implementation of theMPI draft message-passingstandard. Technical report,Mathematics and ComputerScience Division, ArgonneNational Laboratory, 9700South Cass Avenue, Ar-gonne, IL 60439-4801, USA,1992.

Gropp:1994:MCL

[GL94] W. Gropp and E. Lusk.The MPI communication li-brary: its design and aportable implementation. InIEEE [IEE94f], pages 160–165. ISBN 0-8186-4980-1.LCCN QA76.58.S34 1993.

Gropp:1995:DPM

[GL95a] W. Gropp and E. Lusk. Dy-namic process managementin an MPI setting. InIEEE [IEE95g], pages 530–533. CODEN PSPDF8.ISBN 0-8186-7195-5. ISSN1063-6374. LCCN QA 76.58

I42 1995. IEEE catalog num-ber 95TB8131.

Gropp:1995:IMM

[GL95b] W. Gropp and E. Lusk. Im-plementing MPI: the 1994MPI Implementors’ Work-shop. In IEEE [IEE95j],pages 55–59. ISBN 0-8186-6895-4. LCCN QA76.58 .S341994.

Gropp:1995:MMI

[GL95c] W. Gropp and E. Lusk. TheMPI message-passing inter-face standard: Overviewand status. In Dongarraet al. [D+95], pages 265–270. ISBN 0-444-82163-5. ISSN 0927-5452. LCCNQA76.88.H55 1995.

Gropp:1995:EIS

[GL95d] W. D. Gropp and E. Lusk.Experiences with the IBMSP1. IBM Systems Jour-nal, 34(2):249–262, 1995.CODEN IBMSA7. ISSN0018-8670. URL http://www.research.ibm.com/journal/sj34-2.html#seven.

Gropp:1996:HPM

[GL96] W. Gropp and E. Lusk. Ahigh-performance MPI im-plementation on a shared-memory vector supercom-puter. Parallel Computing,22(11):1513–??, ???? 1996.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

REFERENCES 217

Gropp:1997:SMC

[GL97a] W. Gropp and E. Lusk. Sow-ing MPICH: a case studyin the dissemination of aportable environment forparallel scientific comput-ing. International Journal ofSupercomputer Applicationsand High Performance Com-puting, 11(2):103–114, Sum-mer 1997. CODEN IJSCFG.ISSN 1078-3482.

Gropp:1997:WPM

[GL97b] W. Gropp and E. Lusk.Why are PVM and MPIso different? LectureNotes in Computer Science,1332:3–10, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Gropp:1997:HPM

[GL97c] William Gropp and EwingLusk. A high-performanceMPI implementation on ashared-memory vector su-percomputer. Parallel Com-puting, 22(11):1513–1526,January 26, 1997. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1997&volume=22&issue=11&aid=1113.

Gropp:1999:RMM

[GL99] W. Gropp and E. Lusk.Reproducible measurements

of MPI performance char-acteristics. In Dongarraet al. [DLM99], pages 11–18. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Gropp:2002:MG

[GL02] William Gropp and EwingLusk. MPI on the Grid.Lecture Notes in ComputerScience, 2474:12–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740012.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740012.pdf.

Gropp:2004:FTM

[GL04] William Gropp and EwingLusk. Fault tolerance inMessage Passing Interfaceprograms. The Interna-tional Journal of High Per-formance Computing Ap-plications, 18(3):363–372,Fall 2004. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/18/3/363.full.pdf+html.

Girona:2000:VDC

[GLB00] Sergi Girona, Jesus Labarta,and Rosa M. Badia. Val-idation of dimemas com-munication model for MPI

REFERENCES 218

collective operations. Lec-ture Notes in ComputerScience, 1908:39–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080039.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080039.pdf.

Gropp:1996:HPP

[GLDS96] William Gropp, Ewing Lusk,Nathan Doss, and AnthonySkjellum. High-performance,portable implementation ofthe MPI Message PassingInterface Standard. Par-allel Computing, 22(6):789–828, September 20, 1996.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1996&volume=22&issue=6&aid=1075.

Glendinning:1993:MMP

[Gle93] I. Glendinning. 93SC041the MPI message passinginterface. In Anonymous[Ano93a], pages 229–236.ISBN 0-947719-62-8. LCCN????

Gregoretti:2008:MGE

[GLM+08] F. Gregoretti, G. Laccetti,A. Murli, G. Oliva, and

U. Scafuri. MGF: a grid-enabled MPI library. FutureGeneration Computer Sys-tems, 24(2):158–165, Febru-ary 2008. CODEN FGSEVI.ISSN 0167-739X (print),1872-7115 (electronic).

Garland:2008:PCE

[GLN+08] Michael Garland, ScottLe Grand, John Nickolls,Joshua Anderson, Jim Hard-wick, Scott Morton, Ev-erett Phillips, Yao Zhang,and Vasily Volkov. Parallelcomputing experiences withCUDA. IEEE Micro, 28(4):13–27, July/August 2008.CODEN IEMIDZ. ISSN0272-1732 (print), 1937-4143(electronic).

Gonzalez:2000:TSN

[GLP+00] J. A. Gonzalez, C. Leon,F. Piccoli, M. Printista,J. L. Roda, C. Rodrıguez,and F. Sande. Towardsstandard nested parallelism.Lecture Notes in ComputerScience, 1908:96–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080096.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080096.pdf.

Gonzalez:2001:MIM

[GLRS01] J. A. Gonzalez, C. Leon,

REFERENCES 219

C. Rodrıguez, and F. Sande.A model to integrate mes-sage passing and sharedmemory programming. Lec-ture Notes in Computer Sci-ence, 2131:114–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310114.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310114.pdf.

Gropp:1994:UMP

[GLS94] William Gropp, EwingLusk, and Anthony Skjel-lum. Using MPI: PortableParallel Programming withthe Message-Passing Inter-face. Scientific and engi-neering computation. MITPress, Cambridge, MA,USA, 1994. ISBN 0-262-57104-8. xx + 307 pp.LCCN QA76.642 G76 1994.US$24.95. URL http://www.mitpress.com/book-home.tcl?isbn=0262571048.

Gropp:1999:UMP

[GLS99] William Gropp, Ewing Lusk,and Anthony Skjellum. Us-ing MPI: Portable Paral-lel Programming with theMessage Passing Interface.Scientific and EngineeringComputation. MIT Press,Cambridge, MA, USA, sec-ond edition, November 1999.ISBN 0-262-57132-3 (vol. 1),

0-262-57134-X (set). 350 pp.LCCN QA76.642.G76 1999.US$32.50. URL http://www.mitpress.com/book-home.tcl?isbn=0262571323.

Gropp:1999:UMA

[GLT99] William Gropp, Ewing Lusk,and Rajeev Thakur. Us-ing MPI-2: Advanced Fea-tures of the Message Pass-ing Interface. Scientificand Engineering Computa-tion. MIT Press, Cambridge,MA, USA, November 1999.ISBN 0-262-57133-1. 275pp. LCCN QA76.642 .G7621999. US$32.50. URL http://www.mitpress.com/book-home.tcl?isbn=0262571331.

Gropp:2000:UMA

[GLT00a] William Gropp, Ewing Lusk,and Rajeev Thakur. Us-ing MPI-2: Advanced Fea-tures of the Message Pass-ing Interface. Scientificand engineering computa-tion. MIT Press, Cambridge,MA, USA, 2000. ISBN 0-262-57133-1. xxi + 382pp. LCCN QA76.642 .G7621999.

Gropp:2000:TSU

[GLT00b] William Gropp, Ewing (Rusty)Lusk, and Rajeev S. Thakur.Tutorial S1: Using MPI-2:a tutorial on advanced fea-tures of the message-passinginterface. In ACM [ACM00],page 11. URL http://www.sc2000.org/proceedings/info/fp.pdf.

REFERENCES 220

Gropp:2012:AMI

[GLT12] William Gropp, Ewing Lusk,and Rajeev Thakur. Ad-vanced MPI including newMPI-3 features. LectureNotes in Computer Science,7490:14, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/accesspage/chapter/10.1007/978-3-642-33518-1_5.

Gajecki:1994:NAT

[GM94] M. Gajecki and J. Moscin-ski. A new algorithm forthe traveling salesman prob-lem on networked worksta-tions. In Dongarra andWasniewski [DW94], pages229–235. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8(New York). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.P35 1994. DM104.00.

Gianuzzi:1995:UPI

[GM95] V. Gianuzzi and F. Merani.Using PVM to implement adistributed dependable sim-ulation system. In IEEE[IEE95h], pages 529–535.ISBN 0-8186-7031-2, 0-8186-7032-0. LCCN QA76.58 .E971995.

Goglin:2013:KGS

[GM13] Brice Goglin and StephanieMoreaud. KNEM: a genericand scalable kernel-assistedintra-node MPI communi-

cation framework. Jour-nal of Parallel and Dis-tributed Computing, 73(2):176–188, February 2013.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731512002316.

Gu:2007:IPC

[GMdMBD+07] Feng Long Gu, Hyacinthe NzigouM., Guilherme de Melo Bap-tista Domingues, TakeshiNanri, and Kazuaki Mu-rakami. Investigatingthe performance of col-lective communications onSMP clusters: a case forMPI Allgather. In Simosand Maroulis [SM07], pages52–56. ISBN 0-7354-0476-3(set), 0-7354-0477-1 (vol. 1),0-7354-0478-X (vol. 2). ISSN0094-243X (print), 1551-7616 (electronic), 1935-0465.LCCN Q183.9 .I524 2007.URL http://proceedings.aip.org/getpdf/servlet/GetPDFServlet?filetype=pdf& id=APCPCS000963000002000052000001& idtype=cvips.

Gong:2016:NPG

[GML+16] Jing Gong, Stefano Markidis,Erwin Laure, Matthew Ot-ten, Paul Fischer, and MisunMin. Nekbone performanceon GPUs with OpenACCand CUDA Fortran imple-mentations. The Journalof Supercomputing, 72(11):4160–4180, November 2016.

REFERENCES 221

CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic).

Goujon:1998:AAT

[GMPD98] D. S. Goujon, M. Michel,J. Peeters, and J. E. De-vaney. AutoMap and Au-toLink: Tools for communi-cating complex and dynamicdata-structures using MPI.Lecture Notes in ComputerScience, 1362:98–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Guan:1995:SCC

[GMU95] Xiaojun Guan, Richard J.Mural, and Edward C. Uber-bacher. Sequence compari-son on a cluster of worksta-tions using the PVM system.In IEEE [IEE95f], pages190–195. CODEN PSPDF8.ISBN 0-8186-7074-6. ISSN1063-6374. LCCN QA 76.58I56 1995. IEEE catalog no.95TH8052.

Gray:1995:PCT

[GN95] J. P. Gray and F. Naghdy,editors. Parallel Comput-ing: Technology and Prac-tice. PCAT-94. Proceed-ings of the 7th AustralianTransputer and Occam UserGroup Conference: Wool-longong, NSW, Australia, 8–9 November 1994. IOS Press,Postal Drawer 10558, Burke,VA 2209-0558, USA, 1995.ISBN ???? LCCN ????

Goedecker:2002:OPF

[Goe02] Stefan Goedecker. Optimiza-tion and parallelization of aforce field for silicon usingOpenMP. Computer PhysicsCommunications, 148(1):124–135, October 1, 2002.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465502004666.

Gonzalez:2001:OET

[GOM+01] Marc Gonzalez, Jose Oliver,Xavier Martorell, EduardAyguade, Jesus Labarta,and Nacho Navarro. OpenMPextensions for thread groupsand their run-time support.Lecture Notes in ComputerScience, 2017:324–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2017/20170324.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2017/20170324.pdf.

Gorzig:2001:CCP

[Gor01] Steffen Gorzig. CPPvm— C++ and PVM. Lec-ture Notes in ComputerScience, 2131:83–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/

REFERENCES 222

bibs/2131/21310083.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310083.pdf.

Guarracino:1995:PMB

[GP95] M. R. Guarracino andF. Perla. A parallel modifiedblock Lanczos algorithm fordistributed memory archi-tectures. In IEEE [IEE95h],pages 424–431. ISBN 0-8186-7031-2, 0-8186-7032-0.LCCN QA76.58 .E97 1995.

Grosset:2017:TTT

[GPC+17] A. V. Pascal Grosset, Man-asa Prasad, Cameron Chris-tensen, Aaron Knoll, andCharles Hansen. TOD-tree: Task-overlapped directsend tree image composit-ing for hybrid MPI paral-lelism and GPUs. IEEETransactions on Visualiza-tion and Computer Graph-ics, 23(6):1677–1690, June2017. CODEN ITVGEA.ISSN 1077-2626 (print),1941-0506 (electronic), 2160-9306. URL https://www.computer.org/csdl/trans/tg/2017/06/07433468-abs.html.

Govindan:1996:OMP

[GPL+96] V. Govindan, Y. Park, X. Li,S. Crear, and O. Johnson.An overview of a MPI profil-ing environment for the NECCenju-3. In IEEE [IEE96i],pages 185–188. ISBN 0-8186-

7533-0. LCCN QA76.642.M67 1996.

Gillich:1995:FPP

[GR95] S. Gillich and B. Ries. Flex-ible, portable performanceanalysis for PARMACSand MPI. In Hertzbergerand Serazzi [HS95a], pages937–?? ISBN 3-540-59393-4. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.88 .I571995.

Genaud:2007:PMP

[GR07] Stephane Genaud and ChoopanRattanapoka. P2P–MPI: apeer-to-peer framework forrobust execution of messagepassing parallel programs onGrids. Journal of Grid Com-puting, 5(1):27–42, March2007. CODEN ???? ISSN1570-7873 (print), 1572-9184(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=1570-7873&volume=5&issue=1&spage=27.

Grabowsky:1997:MBK

[Gra97] Lothar Grabowsky. MPI-basierte Koppelrandkom-munikation und Einflußder Partitionierung im 3D-Fall. (German) [MPI-basedcoupled edge communi-cation and influence ofpartitioning in 3D-Fall].Preprint-Reihe des Chem-nitzer SFB 393 97,17, Uni-versitat Chemnitz-Zwickau,

REFERENCES 223

Chemnitz, Germany, 1997.13 pp.

Gravvanis:2009:OBP

[Gra09] George A. Gravvanis. OpenMPbased parallel normalized di-rect methods for sparse finiteelement linear systems. TheJournal of Supercomputing,47(1):44–52, January 2009.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=47&issue=1&spage=44.

Grengbondai:1994:CPU

[Gre94] Jules Crephat Grengbondai.Concurrent processing un-der parallel virtual machine(PVM). M.s. thesis, Depart-ment of Computer Science,Southern Illinois Universityat Carbondale, Carbondale,IL, USA, 1994. vi + 97 pp.

Greenfield:1995:OPS

[Gre95] J. Greenfield. An overviewof the PVM software system.In IEEE [IEE95d], pages 17–23. ISBN ???? LCCN ????

Gropp:2000:RCD

[Gro00] William D. Gropp. Run-time checking of datatypesignatures in MPI. Lec-ture Notes in Computer Sci-ence, 1908:160–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http:

//link.springer-ny.com/link/service/series/0558/bibs/1908/19080160.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080160.pdf.

Gropp:2001:CSA

[Gro01a] William D. Gropp. Chal-lenges and successes inachieving the potential ofMPI. Lecture Notes inComputer Science, 2131:7–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310007.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310007.pdf.

Gropp:2001:LSM

[Gro01b] William D. Gropp. Learn-ing from the success of MPI.Lecture Notes in ComputerScience, 2228:81–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2228/22280081.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2228/22280081.pdf.

REFERENCES 224

Gropp:2002:BLC

[Gro02a] William Gropp. Build-ing library components thatcan use any MPI imple-mentation. Lecture Notesin Computer Science, 2474:280–??, 2002. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740280.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740280.pdf.

Gropp:2002:MNS

[Gro02b] William Gropp. MPICH2:a new start for MPI im-plementations. LectureNotes in Computer Sci-ence, 2474:7–??, 2002. CO-DEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740007.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740007.pdf.

Gropp:2012:MBW

[Gro12] William Gropp. MPI 3and beyond: Why MPI issuccessful and what chal-lenges it faces. LectureNotes in Computer Science,7490:1–9, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.

springer.com/chapter/10.1007/978-3-642-33518-1_1/.

Gonzalez:1999:PPM

[GRRM99] J. A. Gonzalez, C. Ro-driguez, J. L. Roda, andD. G. Morales. Perfor-mance and predictability ofMPI and BSP programson the CRAY T3E. InDongarra et al. [DLM99],pages 27–34. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Gutierrez:2010:QCS

[GRTZ10] Eladio Gutierrez, SergioRomero, Marıa A. Trenas,and Emilio L. Zapata. Quan-tum computer simulation us-ing the CUDA programmingmodel. Computer PhysicsCommunications, 181(2):283–300, February 2010.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465509003117.

Gaito:2001:ADC

[GRV01] A. Gaito, M. Rak, andU. Villano. Adding dy-namic coscheduling sup-port to PVM. LectureNotes in Computer Sci-ence, 2131:106–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/

REFERENCES 225

link/service/series/0558/bibs/2131/21310106.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310106.pdf.

Geist:1991:ENB

[GS91a] G. A. Geist and V. S.Sunderam. Experienceswith network based con-current computing on thePVM system. Technical Re-port ORNL/TM-11760, OakRidge National Laboratory,Knoxville, TN, USA, Jan-uary 1991.

Geist:1991:PSS

[GS91b] G. A. Geist and V. S. Sun-deram. The PVM sys-tem: Supercomputer levelconcurrent computation ona heterogeneous network ofworkstations. In Stout andWolfe [SW91], pages 258–261. ISBN 0-8186-2291-1.LCCN QA76.5 .D58 1991.

Geist:1992:NBC

[GS92] G. A. Geist and V. S. Sun-deram. Network-based con-current computing on thePVM system. Concur-rency: practice and experi-ence, 4(4):293–312 (or 293–311??), June 1992. CODENCPEXEI. ISSN 1040-3108.

Geist:1993:EPC

[GS93] G. A. Geist and V. S. Sun-deram. The evolution of thePVM concurrent computing

system. In IEEE [IEE93a],pages 549–557. ISBN 0-8186-3400-6. LCCN QA75.5.C581993. IEEE catalog no.93CH3251-6.

Gropp:1994:SEP

[GS94] W. Gropp and B. Smith.Scalable, extensible, andportable numerical libraries.In IEEE [IEE94f], pages 87–93. ISBN 0-8186-4980-1.LCCN QA76.58.S34 1993.

Gold:1996:UAL

[GS96] C. Gold and T. Schneken-burger. Using the ALDYload distribution systemfor PVM applications. InBode et al. [BDLS96], pages278–?? ISBN 3-540-61779-5. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.E9751996.

Geist:19xx:NBC

[GSxx] G. A. Geist and V. S. Sun-deram. Network based con-current computing on thePVM system. Technical re-port, Oak Ridge NationalLaboratory and Emory Uni-versity, Knoxville, TN, USAand Atlanta, GA, USA,19xx.

Garg:2002:TOA

[GS02] Rajat P. Garg and IlyaSharapov. Techniques foroptimizing applications: highperformance computing. SunBluePrints Program. SunMicrosystems Press, Palo

REFERENCES 226

Alto, CA, USA, 2002. ISBN0-13-093476-3. xliii + 616pp. LCCN QA76.88 .G372002. URL http://www.sun.com/books/catalog/garg.html/index.html;http://www.sun.com/solutions/blueprints/tools/.

Gao:2008:GEI

[GSA08] Guang R. Gao, MitsuhisaSato, and Eduard Ayguade.Guest Editors introduction:Special issue on OpenMP.International Journal ofParallel Programming, 36(3):287–288, June 2008.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=36&issue=3&spage=287.

Gardner:2013:CCE

[GScFM13] Mark Gardner, Paul Sathre,Wu chun Feng, and GabrielMartinez. Characterizingthe challenges and evaluat-ing the efficacy of a CUDA-to-OpenCL translator. Par-allel Computing, 39(12):769–786, December 2013.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819113001075.

Gine:2002:ALT

[GSHL02] Francesc Gine, Francesc Sol-sona, Porfidio Hernandez,

and Emilio Luque. Ad-justing the lengths of timeslices when scheduling PVMjobs with high memory re-quirements. Lecture Notesin Computer Science, 2474:156–??, 2002. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740156.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740156.pdf.

Gerlach:1997:ECS

[GSI97] J. Gerlach, M. Sato, andY. Ishikawa. Experienceswith the C++ standard tem-plate library and MPI fora parallel particle simula-tion method. Lecture Notesin Computer Science, 1225:961–??, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Germanas:2017:HUP

[GSMK17] D. Germanas, A. Stepsys,S. Mickevicius, and R. K.Kalinauskas. HOTB up-date: Parallel code for cal-culation of three- and four-particle harmonic oscilla-tor transformation brack-ets and their matrices us-ing OpenMP. ComputerPhysics Communications,215(??):259–264, June 2017.CODEN CPHCBZ. ISSN

REFERENCES 227

0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465517300401.

Gine:2001:MMM

[GSN+01] Francesc Gine, Francesc Sol-sona, Xavi Navarro, Por-fidio Hernandez, and EmilioLuque. MemTo: a mem-ory monitoring tool for aLinux cluster. LectureNotes in Computer Sci-ence, 2131:225–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310225.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310225.pdf.

Gu:2013:PCI

[GSY+13] Zheng Gu, Matthew Small,Xin Yuan, Aniruddha Marathe,and David K. Lowenthal.Protocol customization forimproving MPI performanceon RDMA-enabled clus-ters. International Jour-nal of Parallel Program-ming, 41(5):682–703, Oc-tober 2013. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL http://link.springer.com/article/10.1007/s10766-013-0242-0.

Gruber:1994:PJE

[GT94] Ralf Gruber and MarcoTomassini, editors. Proceed-ings of the 6th Joint EPS-APS International Con-ference on Physics Com-puting: Physics Comput-ing ’94, Palazzo dei Con-gressi, Lugano, Switzer-land, 22–26 August 1994.European Physical Society,Geneva, Switzerland, 1994.ISBN 2-88270-011-3. LCCNQC20.7.E4I58 1994.

Golbiewski:2001:MOS

[GT01] Maciej Go lbiewski and Jes-per Larsson Traff. MPI-2 one-sided communicationson a Giganet SMP cluster.Lecture Notes in ComputerScience, 2131:16–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310016.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310016.pdf.

Gropp:2007:TSM

[GT07] William Gropp and Ra-jeev Thakur. Thread-safetyin an MPI implementation:Requirements and analysis.Parallel Computing, 33(9):595–604, September 2007.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

REFERENCES 228

Gennart:1996:CAG

[GTH96] B. A. Gennart, J. TarragaGimenez, and R. D. Her-sch. Computer-assisted gen-eration of PVM/C++ pro-grams using CAP. LectureNotes in Computer Science,1156:259–269, ???? 1996.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Gidra:2015:NGC

[GTS+15] Lokesh Gidra, Gael Thomas,Julien Sopena, Marc Shapiro,and Nhan Nguyen. Nu-maGiC: a garbage collec-tor for big data on bigNUMA machines. ACMSIGARCH Computer Ar-chitecture News, 43(1):661–673, March 2015. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic).

Guang:2016:NMN

[Gua16] Suo Guang. NR-MPI: Anon-stop and fault resilientMPI supporting program-mer defined data backupand restore for E-scale su-per computing systems. Su-percomputing Frontiers andInnovations, 3(1):4–21, ????2016. CODEN ???? ISSN2409-6008 (print), 2313-8734(electronic). URL http://superfri.org/superfri/article/view/89.

Ge:1995:DHA

[GWC95] Yuzhen Ge, L. T. Wat-son, and E. G. Collins,Jr. Distributed homotopyalgorithms for H2/H∞ con-troller synthesis. In Bai-ley et al. [BBG+95], pages84–89. ISBN 0-89871-344-7.LCCN QA76.58.S55 1995.

Guerrero:2014:PCM

[GWVP+14] Gines D. Guerrero, Richard M.Wallace, Jose L. Vazquez-Poletti, Jose M. Cecilia,Jose M. Garcıa, Daniel Mo-zos, and Horacio Perez-Sanchez. A performance/cost model for a CUDAdrug discovery applicationon physical and public cloudinfrastructures. Concur-rency and Computation:Practice and Experience, 26(10):1787–1798, July 2014.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Hadjidoukas:2010:NOP

[HA10] Panagiotis E. Hadjidoukasand Laurent Amsaleg. NestedOpenMP parallelization ofa hierarchical data cluster-ing algorithm. Parallel Pro-cessing Letters, 20(2):187–208, June 2010. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Han:2011:HHL

[HA11] Tianyi David Han andTarek S. Abdelrahman.

REFERENCES 229

hiCUDA: High-level GPGPUprogramming. IEEE Trans-actions on Parallel and Dis-tributed Systems, 22(1):78–90, January 2011. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic).

Hussain:2011:PIA

[HAA+11] Masroor Hussain, Muham-mad Abid, Mushtaq Ahmad,Ashfaq Khokhar, and ArifMasud. A parallel imple-mentation of ALE movingmesh technique for FSI prob-lems using OpenMP. In-ternational Journal of Par-allel Programming, 39(6):717–745, December 2011.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=39&issue=6&spage=717.

Hoeflinger:2001:PSP

[HAJK01] Jay Hoeflinger, Prasad Alav-illi, Thomas Jackson, andBob Kuhn. Producingscalable performance withOpenMP: Experiments withtwo CFD applications. Par-allel Computing, 27(4):391–413, March 2001. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336 (electronic). URLhttp://www.elsevier.nl/gej-ng/10/35/21/47/28/26/abstract.html; http:

//www.elsevier.nl/gej-ng/10/35/21/47/28/26/article.pdf.

Hamza:1995:PII

[Ham95a] M. H. Hamza, editor. Pro-ceedings of the IASTEDInternational Conference.Modelling and Simulation:Pittsburgh, PA, USA, 27–29April 1995. IASTEC-ActaPress, Anaheim, CA, USA,1995. ISBN 0-88986-218-4. LCCN QA76.9.C65 I2951995.

Haridi:1995:EPP

[HAM95b] Seif Haridi, Khayri Ali,and Peter Magnusson, edi-tors. EURO-PAR ’95 par-allel processing: First Inter-national EURO PAR Con-ference, Stockholm, Swe-den, August 29–31, 1995:proceedings, number 966in Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 1995.ISBN 3-540-60247-X. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I553 1995.

Hansen:1998:EMP

[Han98] Per Brinch Hansen. Anevaluation of the Message-Passing Interface. ACMSIGPLAN Notices, 33(3):65–72, March 1998. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),

REFERENCES 230

1558-1160 (electronic). Theauthor criticizes MPI, andremarks “MPI . . . lack[s] theelegance and security thatcan only by checked by aparallel programming lan-guage.”.

Hardwick:1994:PVL

[Har94] Jonathan C. Hardwick.Porting a vector library: acomparison of MPI, paris,CMMD and PVM (or, “I’llnever have to port CVLagain”). Research paperCMU-CS-94-200, School ofComputer Science, CarnegieMellon University, Pitts-burgh, PA, USA, 1994. 16pp.

Hardwick:1995:PVL

[Har95] J. C. Hardwick. Porting avector library: a compari-son of MPI, Paris, CMMDand PVM. In IEEE [IEE95j],pages 68–77. ISBN 0-8186-6895-4. LCCN QA76.58 .S341994.

Hassanzadeh:1995:MMG

[Has95] Siamak Hassanzadeh, edi-tor. Mathematical meth-ods in geophysical imagingIII: 12–13 July 1995, SanDiego, California, volume2571 of Proceedings of theSPIE — The InternationalSociety for Optical Engi-neering. Society of Photo-optical Instrumentation En-gineers (SPIE), Bellingham,WA, USA, 1995. CODEN

PSISDG. ISBN 0-8194-1930-3. ISSN 0277-786X(print), 1996-756X (elec-tronic). LCCN TS510.S63v.2571.

Hisley:2000:PPE

[HASnP00] Dixie Hisley, Gagan Agrawal,Punyam Satya-narayana,and Lori Pollock. Port-ing and performance eval-uation of irregular codesusing OpenMP. Concur-rency: practice and ex-perience, 12(12):1241–1259,October 2000. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500349/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500349&PLACEBO=IE.pdf.

Hatazaki:1998:RRS

[Hat98] T. Hatazaki. Rank re-ordering strategy for MPItopology creation functions.Lecture Notes in ComputerScience, 1497:188–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Hachler:1996:IAC

[HB96a] G. Hachler and H. Burkhart.Implementing the ALWANcommunication and datadistribution library usingPVM. In Bode et al.[BDLS96], pages 243–250.ISBN 3-540-61779-5. ISSN

REFERENCES 231

0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Haechler:1996:IAC

[HB96b] G. Haechler and H. Burkhart.Implementing the ALWANcommunication and data dis-tribution library using PVM.Lecture Notes in ComputerScience, 1156:243–??, ????1996. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

Hausner:1995:EIP

[HBT95] M. Hausner, M. Burrows,and C. A. Thekkath. Ef-ficient implementation ofPVM on the AN2 ATMnetwork. In Hertzbergerand Serazzi [HS95a], pages562–569. ISBN 3-540-59393-4. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.88 .I571995.

Huang:2006:ECS

[HC06] Jih-Woei Huang and Chih-Ping Chu. An efficientcommunication schedulingmethod for the processormapping technique applieddata redistribution. TheJournal of Supercomput-ing, 37(3):297–318, Septem-ber 2006. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://www.springerlink.com/openurl.asp?genre=article&

issn=0920-8542&volume=37&issue=3&spage=297.

Huang:2008:FPM

[HC08] Jih-Woei Huang and Chih-Ping Chu. A flexible pro-cessor mapping technique to-ward data localization forblock-cyclic data redistri-bution. The Journal ofSupercomputing, 45(2):151–172, August 2008. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=45&issue=2&spage=151.

Hamid:2010:CMB

[HC10] Nor Asilah Wati AbdulHamid and Paul Codding-ton. Comparison of MPIbenchmark programs onshared memory and dis-tributed memory machines(point-to-point communica-tion). The InternationalJournal of High Perfor-mance Computing Applica-tions, 24(4):469–483, Nov-ember 2010. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/24/4/469.full.pdf+html.

Hunold:2016:RMB

[HCA16] Sascha Hunold and Alexan-dra Carpen-Amarie. Re-producible MPI benchmark-

REFERENCES 232

ing is still not as easy asyou think. IEEE Transac-tions on Parallel and Dis-tributed Systems, 27(12):3617–3630, December 2016.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic)ITDSEO. URLhttps://www.computer.org/csdl/trans/td/2016/12/07426807-abs.html.

Hurwitz:2005:AMP

[HcF05] Justin (Gus) Hurwitz andWu chun Feng. Analyz-ing MPI performance over10-gigabit Ethernet. Jour-nal of Parallel and Dis-tributed Computing, 65(10):1253–1260, October 2005.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Huang:2005:TME

[HCL05] Lei Huang, Barbara Chap-man, and Zhenying Liu. To-wards a more efficient im-plementation of OpenMPfor clusters via translationto global arrays. ParallelComputing, 31(10–12):1114–1139, October/December2005. CODEN PACOEJ.ISSN 0167-8191 (print),1872-7336 (electronic).

Hu:2016:CLG

[HCZ16] Liang Hu, Xilong Che, andSi-Qing Zheng. A closerlook at GPGPU. ACMComputing Surveys, 48(4):60:1–60:??, May 2016. CO-DEN CMSVAN. ISSN

0360-0300 (print), 1557-7341(electronic).

He:2000:PAA

[HD00] Yun (Helen) He and ChrisH. Q. Ding. Platforms:An accurate arithmetics ap-proach. In ACM [ACM00],page 150. URL http://www.sc2000.org/proceedings/info/fp.pdf.

Ding:2002:MOP

[HD02a] Yun He and Chris H. Q.Ding. MPI and OpenMPparadigms on cluster of SMParchitectures. In IEEE[IEE02], page ?? ISBN0-7695-1524-X. LCCN???? URL http://www.sc-2002.org/paperpdfs/pap.pap325.pdf.

He:2002:MOP

[HD02b] Yun He and Chris H. Q.Ding. MPI and OpenMPparadigms on cluster of SMParchitectures: The vacancytracking algorithm for multi-dimensional array transpo-sition. Parallel and Dis-tributed Computing Prac-tices, 5(2):117–128, June2002. CODEN ???? ISSN1097-2803.

Harvey:2011:STP

[HD11] M. J. Harvey and G. DeFabritiis. Swan: a tool forporting CUDA programs toOpenCL. Computer PhysicsCommunications, 182(4):1093–1099, April 2011. CO-DEN CPHCBZ. ISSN

REFERENCES 233

0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465511000117.

Hoefler:2012:LMO

[HDB+12] Torsten Hoefler, James Di-nan, Darius Buntinas, PavanBalaji, and Brian W. Bar-rett. Leveraging MPI’s one-sided communication inter-face for shared-memory pro-gramming. Lecture Notesin Computer Science, 7490:132–141, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_18/.

Hoefler:2013:MMN

[HDB+13] Torsten Hoefler, James Di-nan, Darius Buntinas, Pa-van Balaji, and Brian Bar-rett . . . . MPI + MPI: anew hybrid approach to par-allel programming with MPIplus shared memory. Com-puting, 95(12):1121–1136,December 2013. CODENCMPTA2. ISSN 0010-485X(print), 1436-5057 (elec-tronic). URL http://link.springer.com/article/10.1007/s00607-013-0324-2.

Hadjidoukas:2009:HPF

[HDDG09] P. E. Hadjidoukas, V. V.Dimakopoulos, M. Delakis,and C. Garcia. A high-performance face detection

system using OpenMP. Con-currency and Computation:Practice and Experience,21(15):1819–1837, October2009. CODEN CCPEBO.ISSN 1532-0626 (print),1532-0634 (electronic).

Hoefler:2015:RMA

[HDT+15] Torsten Hoefler, JamesDinan, Rajeev Thakur,Brian Barrett, Pavan Balaji,William Gropp, and KeithUnderwood. Remote mem-ory access programming inMPI-3. ACM Transac-tions on Parallel Computing(TOPC), 2(2):9:1–9:??, July2015. CODEN ???? ISSN2329-4949 (print), 2329-4957(electronic).

Heikonen:2002:ILB

[HE02] Jussi Heikonen and KalleEerola. Improving load bal-ance in a weather code:Asynchronous output inHIRLAM with MPI. Lec-ture Notes in Computer Sci-ence, 2367:567–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2367/23670567.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2367/23670567.pdf.

Hadi:2013:CFA

[HE13] Mohammed F. Hadi and

REFERENCES 234

Seyed A. Esmaeili. CUDAFortran acceleration for thefinite-difference time-domainmethod. Computer PhysicsCommunications, 184(5):1395–1400, May 2013. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465513000118.

Havran:2015:EBT

[HE15] Vlastimil Havran and PetrEgert. Extensions to bidi-rectional texture functioncompression with multi-level vector quantizationin OpenCL. Comput-ers and Graphics, 48(??):1–10, May 2015. CO-DEN COGRD2. ISSN0097-8493 (print), 1873-7684(electronic). URL http://www.sciencedirect.com/science/article/pii/S0097849315000060.

Hebeker:1993:CPC

[Heb93] F.-K. Hebeker. On a coarse-grained parallel code to sim-ulate reactive flows on anIBM RS/ 6000 workstation-cluster. In Brebbia andPower [BP93], pages 253–262. ISBN 1-85312-236-X.LCCN TA345.I556 1993.

Herland:1998:CML

[HEH98] B. G. Herland, M. Eberl,and H. Hellwagner. Acommon messaging layer forMPI and PVM over SCI.Lecture Notes in Computer

Science, 1401:576–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Huang:2009:EGO

[HEHC09] Lei Huang, Deepak Eachempati,Marcus W. Hervey, andBarbara Chapman. Ex-ploiting global optimiza-tions for OpenMP programsin the OpenUH compiler.ACM SIGPLAN Notices,44(4):289–290, April 2009.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Hempel:1994:MSM

[Hem94] R. Hempel. The MPIStandard for Message Pass-ing. In Gentzsch andHarms [GH94], pages 247–252. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Hempel:1996:SMM

[Hem96] R. Hempel. The statusof the MPI message-passingstandard and its relationto PVM. In Bode et al.[BDLS96], pages 14–21.ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Holmen:2014:ASI

[HF14a] John K. Holmen and David L.Foster. Accelerating sin-

REFERENCES 235

gle iteration performance ofCUDA–based 3D reaction–diffusion simulations. Inter-national Journal of Paral-lel Programming, 42(2):343–363, April 2014. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL http://link.springer.com/article/10.1007/s10766-013-0251-z.See erratum [HF14b].

Holmen:2014:EAS

[HF14b] John K. Holmen and David L.Foster. Erratum to: Acceler-ating single iteration perfor-mance of CUDA–based 3Dreaction–diffusion simula-tions. International Journalof Parallel Programming, 42(2):364, April 2014. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL http://link.springer.com/content/pdf/10.1007/s10766-014-0305-x.pdf. See [HF14a].

Hursey:2012:AFA

[HG12] Joshua Hursey and Richard L.Graham. Analyzing faultaware collective performancein a process fault toler-ant MPI. Parallel Com-puting, 38(1–2):15–25, Jan-uary/February 2012. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819111001414.

Hermanns:2012:SDM

[HGMW12] Marc-Andre Hermanns, MarkusGeimer, Bernd Mohr, andFelix Wolf. Scalable detec-tion of MPI-2 remote mem-ory access inefficiency pat-terns. The InternationalJournal of High Perfor-mance Computing Applica-tions, 26(3):227–236, Au-gust 2012. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/26/3/227.full.pdf+html.

Hong:1995:PNP

[HH95] Lin Hong and Chen Huap-ing. PVM and networkparallel computing. Mini-Micro Systems, 16(2):53–58,February 1995. CODENXWJXEH. ISSN 1000-1220.

Hanson:2014:NCM

[HH14] Richard J. Hanson and TimHopkins. Numerical com-puting with modern For-tran. Applied mathemat-ics. Society for Industrialand Applied Mathemat-ics, Philadelphia, PA, USA,2014. ISBN 1-61197-311-2 (paperback), 1-61197-312-0 (e-book). xv + 244 pp.LCCN QA76.73.F25 H3672013.

Hui:1995:SPS

[HHA95] Chi-Chung Hui, MounirHamdi, and Ishfaq Ahmad.

REFERENCES 236

Software platform for solv-ing PDEs on distributed sys-tems: Implementation is-sues and performance pre-diction. In IEEE [IEE95l],pages 383–388. CODENPSICD2. ISBN 0-8186-7119-X. ISSN 0730-6512. LCCNQA 76.6 C6295 1995. IEEEcatalog number 95CB35838.

Huang:2018:ACO

[HHC+18] Kai Huang, Biao Hu, LongChen, Alois Knoll, and Zhi-hua Wang. Adas on Cotswith OpenCL: A case studywith lane detection. IEEETransactions on Computers,67(4):559–565, ???? 2018.CODEN ITCOB4. ISSN0018-9340 (print), 1557-9956(electronic). URL http://ieeexplore.ieee.org/document/8057795/.

Horiguchi:1994:ISP

[HHK94] S. Horiguchi, D. FrankHsu, and M. Kimura, ed-itors. International Sym-posium on Parallel Archi-tectures, Algorithms, andNetworks (ISPAN): proceed-ings of the 1994, Decem-ber 14–16, 1994, Kanazawa,Japan. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1994. ISBN 0-8186-6507-6 (case), 0-8186-6506-8 (mi-crofiche). LCCN QA76.58.I5673 1994 Bar. IEEE cata-log number 94TH0697-3.

Hinde:2011:QMD

[Hin11] Robert J. Hinde. QSATS:MPI-driven quantum simu-lations of atomic solids atzero temperature. Com-puter Physics Communi-cations, 182(11):2339–2349,November 2011. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465511001615.

Huttunen:2002:MCC

[HIP02] Pentti Huttunen, JouniIkonen, and Jari Porras.MPIT — communication/computation paradigm fornetworks of SMP worksta-tions. Lecture Notes inComputer Science, 2367:160–??, 2002. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2367/23670160.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2367/23670160.pdf.

Haimes:1998:UPM

[HJ98] R. Haimes and K. E. Jor-dan. Using PVM andMPI for co-processed, dis-tributed and parallel sci-entific visualization. Lec-ture Notes in Computer Sci-ence, 1388:1098–??, 1998.CODEN LNCSD9. ISSN

REFERENCES 237

0302-9743 (print), 1611-3349(electronic).

Hall:2014:MMC

[HJBB14] Clifford Hall, Weixiao Ji,and Estela Blaisten-Barojas.The Metropolis Monte Carlomethod with CUDA en-abled Graphic ProcessingUnits. Journal of Com-putational Physics, 258(??):871–879, February 1, 2014.CODEN JCTPAH. ISSN0021-9991 (print), 1090-2716(electronic). URL http://www.sciencedirect.com/science/article/pii/S0021999113007626.

Huang:2010:ELA

[HJYC10] Lei Huang, Haoqiang Jin,Liqi Yi, and Barbara Chap-man. Enabling locality-aware computations in OpenMP.Scientific Programming, 18(3–4):169–181, ???? 2010.CODEN SCIPEV. ISSN1058-9244 (print), 1875-919X (electronic).

Hoffmann:1993:PFE

[HK93] Geerd-R. Hoffmann andTuomo Kauranne, editors.Proceedings of the FifthECMWF Workshop on theUse of Parallel Processors inMeteorology. Parallel Super-computing in AtmosphericScience. World ScientificPublishing Co. Pte. Ltd.,P. O. Box 128, FarrerRoad, Singapore 9128, 1993.ISBN 981-02-1429-4. LCCNQA76.58 E354 1992.

Henriksen:1994:PCF

[HK94] P. Henriksen and R. Ke-unings. Parallel compu-tation of the flow of in-tegral viscoelastic fluids ona heterogeneous network ofworkstations. InternationalJournal for Numerical Meth-ods in Fluids, 18(12):1167–1183, June 1994. CODENIJNFDW. ISSN 0271-2091.

Hoffmann:1995:CAP

[HK95] Geerd-R. Hoffmann andNorbert Kreitz, editors.Coming of age: proceedingsof the Sixth ECMWF Work-shop on the Use of Paral-lel Processors in Meteorol-ogy, Reading, UK, Novem-ber 21–25, 1994. World Sci-entific Publishing Co. Pte.Ltd., P. O. Box 128, FarrerRoad, Singapore 9128, 1995.ISBN 981-02-2211-4. LCCNQC866.E26 1994.

Hong:2009:AMG

[HK09] Sunpyo Hong and HyesoonKim. An analytical modelfor a GPU architecture withmemory-level and thread-level parallelism awareness.ACM SIGARCH ComputerArchitecture News, 37(3):152–163, June 2009. CO-DEN CANED2. ISSN0163-5964 (ACM), 0884-7495 (IEEE).

Hong:2010:IGP

[HK10] Sunpyo Hong and Hye-soon Kim. An inte-

REFERENCES 238

grated GPU power and per-formance model. ACMSIGARCH Computer Ar-chitecture News, 38(3):280–289, June 2010. CODENCANED2. ISSN 0163-5964(ACM), 0884-7495 (IEEE).

Hiranandani:1994:CTB

[HKMCS94] S. Hiranandani, K. Kennedy,J. Mellor-Crummey, andA. Sethi. Compilationtechniques for block-cyclicdistributions. In ACM[ACM94], pages 392–403.ISBN 0-89791-665-4. LCCN???? URL http://www.acm.org/pubs/contents/proceedings/supercomputing/181181/.

Hoeflinger:2001:IPV

[HKN+01] Jay Hoeflinger, Bob Kuhn,Wolfgang Nagel, Paul Pe-tersen, Hrabri Rajic, SanjivShah, Jeff Vetter, MichaelVoss, and Renee Woo. Anintegrated performance vi-sualizer for MPI/OpenMPprograms. Lecture Notesin Computer Science, 2104:40–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040040.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040040.pdf.

Hong:2011:ACG

[HKOO11] Sungpack Hong, Sang KyunKim, Tayo Oguntebi, andKunle Olukotun. Accel-erating CUDA graph algo-rithms at maximum warp.ACM SIGPLAN Notices, 46(8):267–276, August 2011.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPoPP ’11 Confer-ence proceedings.

Hori:2012:EKL

[HKT+12] Atsushi Hori, ToyohisaKameyama, Yuichi Tsujita,Mitaro Namiki, and YutakaIshikawa. An efficient kernel-level blocking MPI imple-mentation. Lecture Notesin Computer Science, 7490:153–162, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_20/.

Hasanov:2017:HRC

[HL17] Khalid Hasanov and AlexeyLastovetsky. Hierarchical re-design of classic MPI reduc-tion algorithms. The Jour-nal of Supercomputing, 73(2):713–725, February 2017.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic).

Hu:2000:ONS

[HLCZ00] Y. Charlie Hu, Honghui Lu,

REFERENCES 239

Alan L. Cox, and WillyZwaenepoel. OpenMP fornetworks of SMPs. Journalof Parallel and DistributedComputing, 60(12):1512–1530, December 1, 2000.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848 (electronic). URLhttp://www.idealibrary.com/links/doi/10.1006/jpdc.2000.1658; http://www.idealibrary.com/links/doi/10.1006/jpdc.2000.1658/pdf; http://www.idealibrary.com/links/doi/10.1006/jpdc.2000.1658/ref.

Haque:2017:CCL

[HLM+17] S. Anisul Haque, X. Li,F. Mansouri, M. MorenoMaza, D. Mohajerani, andW. Pan. CUMODP: aCUDA library for mod-ular polynomial computa-tion. ACM Communica-tions in Computer Alge-bra, 51(3):89–91, September2017. CODEN ???? ISSN1932-2232 (print), 1932-2240(electronic).

Hung:2016:EBP

[HLO+16] Che-Lun Hung, Chun-YuanLin, Chia-Shin Ou, Yuan-Hong Tseng, Po-Yen Hung,Ship-Peng Li, and Chun-Ting Fu. Efficient bit-parallel subcircuit extractionusing CUDA. Concurrencyand Computation: Prac-tice and Experience, 28(16):

4326–4338, November 2016.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Hong:1996:RDM

[HLOC96] Chul-Eui Hong, Bum-SikLee, Gi-Won On, and Dong-Hae Chi. Replay for de-bugging MPI parallel pro-grams. In IEEE [IEE96i],pages 156–160. ISBN 0-8186-7533-0. LCCN QA76.642.M67 1996.

Hawick:2010:PGC

[HLP10] K. A. Hawick, A. Leist,and D. P. Playne. Parallelgraph component labellingwith GPUs and CUDA.Parallel Computing, 36(12):655–678, December 2010.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Hawick:2011:RLS

[HLP11] K. A. Hawick, A. Leist,and D. P. Playne. Regu-lar lattice and small-worldspin model simulations us-ing CUDA and GPUs. In-ternational Journal of Par-allel Programming, 39(2):183–201, April 2011. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=39&issue=2&spage=183.

REFERENCES 240

Huband:2001:DTB

[HM01] Simon Huband and ChrisMcDonald. DEPICT: atopology-based debugger forMPI programs. LectureNotes in Computer Sci-ence, 2026:109–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2026/20260109.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2026/20260109.pdf.

Hilbrich:2009:MCC

[HMK09] Tobias Hilbrich, Matthias S.Muller, and Bettina Kram-mer. MPI correctness check-ing for OpenMP/MPI appli-cations. International Jour-nal of Parallel Programming,37(3):277–291, June 2009.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=37&issue=3&spage=277.

Hakula:1994:FEM

[HMKV94] H. Hakula, J. Malinen,P. Kallberg, and P. Valve.The finite element methodapplied to the exteriorHelmholtz problem on theIBM SP-1. In Dongarraand Wasniewski [DW94],

pages 262–269. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8 (New York). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58.P35 1994. DM104.00.

Haynes:2014:MOA

[HO14] Ronald D. Haynes and Ben-jamin W. Ong. MPI–OpenMP algorithms for theparallel space-time solutionof time dependent PDEs.In Erhel et al. [EGH+14],pages 179–187. ISBN 3-319-05788-X (paperback), 3-319-05789-8 (e-book). ISSN1439-7358 (print), 2197-7100(electronic). LCCN QA71-90. URL http://link.springer.com/chapter/10.1007/978-3-319-05789-7_14/.

Hogg:2013:FDT

[Hog13] J. D. Hogg. A fastdense triangular solve inCUDA. SIAM Journal onScientific Computing, 35(3):C303–C322, ???? 2013.CODEN SJOCE3. ISSN1064-8275 (print), 1095-7197(electronic).

Hollingsworth:2012:SPI

[Hol12] Jeffrey Hollingsworth, edi-tor. SC ’12: Proceedings ofthe International Conferenceon High Performance Com-puting, Networking, Storageand Analysis, Salt Lake Con-vention Center, Salt LakeCity, UT, USA, November

REFERENCES 241

10–16, 2012. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,2012. ISBN 1-4673-0804-8.

Hosking:2012:CHL

[Hos12] Tony Hosking. Compilinga high-level language forGPUs: (via language sup-port for architectures andcompilers). ACM SIGPLANNotices, 47(6):1–12, June2012. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). PLDI ’12 pro-ceedings.

Hadjidoukas:2005:OEM

[HP05] P. E. Hadjidoukas and T. S.Papatheodorou. OpenMPextensions for master-slavemessage passing comput-ing. Parallel Computing,31(10–12):1155–1167, Octo-ber/December 2005. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Hawick:2011:HSL

[HP11] K. A. Hawick and D. P.Playne. Hypercubic stor-age layout and transformsin arbitrary dimensions us-ing GPUs and CUDA. Con-currency and Computation:Practice and Experience, 23(10):1027–1050, July 2011.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Hidalgo:1999:MMP

[HPLT99] J. I. Hidalgo, M. Prieto,J. Lanchares, and F. Tirado.A method for model param-eter identification using par-allel genetic algorithms. InDongarra et al. [DLM99],pages 291–298. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Hadjidoukas:2002:MOI

[HPP02] Panagiotis E. Hadjidoukas,Eleftherios D. Polychronopou-los, and Theodore S. Pa-patheodorou. A modu-lar OpenMP implementa-tion for clusters of multipro-cessors. Parallel and Dis-tributed Computing Prac-tices, 5(2):153–168, June2002. CODEN ???? ISSN1097-2803.

Hariri:1995:STE

[HPR+95] S. Hariri, Sung-Yong Park,R. Reddy, M. Subramanyan,R. Yadav, G. C. Fox, andM. Parashar. Software toolevaluation methodology. InIEEE [IEE95i], pages 3–10.ISBN 0-8186-7025-8. LCCN???? IEEE catalog number95CH35784.

Hondroudakis:1995:PEV

[HPS95] A. Hondroudakis, R. Proc-ter, and K. Shanmugam.Performance evaluation andvisualization with VISPAT.In Malyshkin [Mal95], pages

REFERENCES 242

180–185. ISBN 3-540-60222-4. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.I5471995.

Heckathorn:1996:SSP

[HPS+96] H. Heckathorn, B. Popp,W. Smith, D. Conklin, D. A.Newman, and F. Wieland.SSGM: from serial to parallelprocessing using PVM. Pro-ceedings of the SPIE — TheInternational Society for Op-tical Engineering, 2741:267–277, ???? 1996. CODENPSISDG. ISSN 0277-786X(print), 1996-756X (elec-tronic).

Hilbrich:2012:MRE

[HPS+12] Tobias Hilbrich, JoachimProtze, Martin Schulz, Bro-nis R. de Supinski, andMatthias S. Muller. MPIruntime error detectionwith MUST: advances indeadlock detection. InHollingsworth [Hol12], pages30:1–30:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a010.pdf.

Hilbrich:2013:MRE

[HPS+13] Tobias Hilbrich, JoachimProtze, Martin Schulz, Bro-nis R. de Supinski, andMatthias S. Muller. MPIruntime error detection withMUST: Advances in dead-lock detection. Scientific

Programming, 21(3–4):109–121, ???? 2013. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Hariri:1993:MPI

[HPY+93] S. Hariri, J. B. Park, F.-K. Yu, M. Parashar, andG. C. Fox. A messagepassing interface for paral-lel and distributed comput-ing. In IEEE [IEE93c], pages84–91. ISBN 0-8186-3900-8, 0-8186-3901-6. LCCNQA76.9.D5I593 1993. IEEEcatalog no. 93TH0550-4.

Hoefler:2011:SPT

[HRR+11] Torsten Hoefler, Rolf Raben-seifner, Hubert Ritzdorf,Bronis R. de Supinski,Rajeev Thakur, and Jes-per Larsson Traff. The scal-able process topology in-terface of MPI 2.2. Con-currency and Computation:Practice and Experience, 23(4):293–310, March 25, 2011.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Hoyos-Rivera:1997:UPB

[HRSA97] G. J. Hoyos-Rivera andV. G. Sanchez-Arias. Us-ing PVM to build an in-terface to support cooper-ative work in a distributedsystems environment. Lec-ture Notes in Computer Sci-ence, 1332:127–134, 1997.CODEN LNCSD9. ISSN

REFERENCES 243

0302-9743 (print), 1611-3349(electronic).

Hempel:1997:IMN

[HRZ97] R. Hempel, H. Ritzdorf, andF. Zimmermann. Implemen-tation of MPI on NEC’s SX-4 multi-node architecture.Lecture Notes in ComputerScience, 1332:185–193, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Hartley:1993:CPS

[HS93] C. L. Hartley and V. S. Sun-deram. Concurrent program-ming with shared objects innetworked environments. InIEEE [IEE93b], pages 471–478. ISBN 0-8186-3442-1.LCCN QA 76.58 I56 1993.IEEE catalog no. 93TH0513-2.

Hesham:1994:PTS

[HS94] E.-R. Hesham and B. D.Shriver, editors. Proceed-ings of the Twenty-SeventhHawaii International Con-ference on System Sciences.Vol. II: Software Technol-ogy, January 4–7, 1994,Wailea, HI, USA, volume 27.IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 1994. ISBN0-8186-5060-5. ISSN 1060-3425. LCCN ???? IEEEcatalog no. 94TH0607-2.

Hertzberger:1995:HPM

[HS95a] Bob Hertzberger and GiuseppeSerazzi, editors. High-Performance computing andnetworking: InternationalConference and Exhibition,Milan, Italy, May 3–5, 1995:proceedings, number 919in Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / London,UK / etc., 1995. ISBN 3-540-59393-4. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.88 .I571995.

Hungenahally:1995:PIQ

[HS95b] A. Hungenahally and A. Suresh.PVM implementation ofquadtree building algorithmson SIMD hypercube sys-tem. IEEE InternationalConference on Algorithmsand Architectures for Par-allel Processing, 2:855–858,???? 1995. IEEE catalognumber 95TH0682-5.

Hoefler:2012:OPC

[HS12] Torsten Hoefler and TimoSchneider. Optimizationprinciples for collectiveneighborhood communica-tions. In Hollingsworth[Hol12], pages 98:1–98:??ISBN 1-4673-0804-8. URLhttp://conferences.computer.org/sc/2012/papers/1000a028.pdf.

REFERENCES 244

Henriksen:2017:FPF

[HSE+17] Troels Henriksen, NielsG. W. Serup, Martin Els-man, Fritz Henglein, andCosmin E. Oancea. Futhark:purely functional GPU-programming with nestedparallelism and in-place ar-ray updates. ACM SIG-PLAN Notices, 52(6):556–571, June 2017. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Haeuser:1994:RNS

[HSMW94] J. Haeuser, M. Spel, J. Muy-laert, and R. D. Williams.Results for the Navier–Stokes solver ParNSS onworkstation clusters andIBM SP1 using PVM. InWagner et al. [WPH94],pages 432–442. ISBN 0-471-95063-7. LCCN QA911.E951994.

Heimel:2013:HOP

[HSP+13] Max Heimel, Michael Saecker,Holger Pirk, Stefan Mane-gold, and Volker Markl.Hardware-oblivious paral-lelism for in-memory column-stores. Proceedings of theVLDB Endowment, 6(9):709–720, July 2013. CODEN???? ISSN 2150-8097.

Hormati:2012:SPS

[HSW+12] Amir H. Hormati, MehrzadSamadi, Mark Woh, TrevorMudge, and Scott Mahlke.Sponge: portable stream

programming on graph-ics engines. ACM SIG-PLAN Notices, 47(4):381–392, April 2012. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Hu:2001:PCC

[HT01] Hong Hu and Edward L.Turner. Parallel CFD com-puting using shared mem-ory OpenMP. LectureNotes in Computer Sci-ence, 2073:1137–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2073/20731137.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2073/20731137.pdf.

Howes:2008:U

[HT08] L. Howes and D. B. Thomas.Efficient random numbergeneration and applicationusing CUDA. In Nguyen[Ngu08], chapter 37, pages805–830. ISBN 0-321-51526-9. LCCN T385.G6882 2008. URL http://www.loc.gov/catdir/toc/ecip0720/2007023985.html.

Ha:2008:NBP

[HTA08] Phuong Hoai Ha, PhilippasTsigas, and Otto J. Anshus.Non-blocking programmingon multi-core graphics pro-

REFERENCES 245

cessors: (extended abstract).ACM SIGARCH ComputerArchitecture News, 36(5):19–28, December 2008. CODENCANED2. ISSN 0163-5964(ACM), 0884-7495 (IEEE).

Hluchy:1999:GWF

[HTHD99] L. Hluchy, V. D. Tran,L. Halada, and M. Do-brucky. Ground water flowmodelling in PVM. InDongarra et al. [DLM99],pages 450–460. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Huckle:1996:PIS

[Huc96] T. Huckle. PVM-implementationof sparse approximate in-verse preconditioners forsolving large sparse lin-ear equations. LectureNotes in Computer Science,1156:166–173, ???? 1996.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Humphres:1995:LBE

[Hum95] Christopher Wade Humphres.A load balancing extensionfor the PVM software sys-tem. M.e.e. thesis, Depart-ment of Electrical Engineer-ing, University of Alabama,Tuscaloosa, AL, USA, 1995.viii + 98 pp.

Husbands:1998:MSD

[Hus98] Parry J. Husbands. MPI-StarT: Delivering network

performance to numer-ical applications. InACM [ACM98b], page ??ISBN ???? LCCN???? URL http://www.supercomp.org/sc98/papers/.

Huse:1999:CCD

[Hus99] L. P. Huse. Collectivecommunication on dedicatedclusters of workstations. InDongarra et al. [DLM99],pages 469–476. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Huse:2000:MOS

[Hus00] Lars Paul Huse. MPI op-timization for SMP basedclusters interconnected withSCI. Lecture Notes inComputer Science, 1908:56–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080056.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080056.pdf.

Huse:2001:LST

[Hus01] Lars Paul Huse. LayeringSHMEM on top of MPI.Lecture Notes in ComputerScience, 2131:44–??, 2001.CODEN LNCSD9. ISSN

REFERENCES 246

0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310044.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310044.pdf.

Hamidouche:2016:CAO

[HVA+16] Khaled Hamidouche, Ak-shay Venkatesh, Ammar Ah-mad Awan, Hari Subra-moni, Ching-Hsiang Chu,and Dhabaleswar K. Panda.CUDA-aware OpenSHMEM:Extensions and designs forhigh performance OpenSH-MEM on GPU clusters. Par-allel Computing, 58(??):27–36, October 2016. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819116300345.

Houzeaux:2011:HMO

[HVSC11] G. Houzeaux, M. Vazquez,X. Saez, and J. M. Cela.Hybrid MPI–OpenMP per-formance in massively par-allel computational fluiddynamics. In Tromeur-Dervout et al. [TDBEE11],pages 293–297. CO-DEN LNCSA6. ISBN 3-642-14437-3 (print), 3-642-14438-1 (e-book). ISSN1439-7358. LCCN ???? URLhttp://link.springer.com/content/pdf/10.1007/978-

3-642-14438-7_31. Pro-ceedings of the twentiethmeeting, Parallel CFD 2008,held May 19–22, 2008 inLyon, France.

Hoekstra:1995:CPP

[HVSH95] A. G. Hoekstra, F. Van derLinden, P. M. A. Sloot, andL. O. Hertzberger. Com-paring the Parix and PVMparallel programming envi-ronments. In Fritzson andFinmo [FF95], pages 288–292. ISBN 90-5199-229-7(IOS Press), 4-274-90056-8(Ohmsha). LCCN ????

Hager:2011:IHP

[HW11] Georg Hager and GerhardWellein. Introduction tohigh performance comput-ing for scientists and engi-neers, volume 7 of Chap-man and Hall/CRC compu-tational science series. CRCPress, 2000 N.W. Corpo-rate Blvd., Boca Raton,FL 33431-9868, USA, 2011.ISBN 1-4398-1192-X. xxv +330 + 4 pp. LCCN QA76.88.H34 2011.

Huang:2002:DDD

[HWM02] Wei Huang, Zhe Wang, andJie Ma. Design of DMPIon DAWNING-3000. Lec-ture Notes in Computer Sci-ence, 2474:314–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/

REFERENCES 247

service/series/0558/bibs/2474/24740314.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740314.pdf.

He:2009:AVS

[HWS09] Jian He, Layne T. Wat-son, and Masha Sosonk-ina. Algorithm 897: VT-DIRECT95: Serial and par-allel codes for the globaloptimization algorithm di-rect. ACM Transactionson Mathematical Software,36(3):17:1–17:24, July 2009.CODEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic). See remark[SWH15].

Hwang:1997:EMC

[HWW97] Kai Hwang, Choming Wang,and Cho-Li Wang. Evaluat-ing MPI collective commu-nication on the SP2, T3D,and Paragon multicomput-ers. In IEEE [IEE97c], pages106–115. ISBN 0-8186-7764-3. LCCN QA76.9.A73I5661997. IEEE catalog number97TB100094.

Huang:2013:ACM

[HWX+13] Libo Huang, Zhiying Wang,Nong Xiao, Yongwen Wang,and Qiang Dou. Adap-tive communication mecha-nism for accelerating MPIfunctions in NoC-based mul-ticore processors. ACMTransactions on Architec-ture and Code Optimization,

10(3):18:1–18:??, September2013. CODEN ???? ISSN1544-3566 (print), 1544-3973(electronic).

Hellberg:1994:PPP

[HZ94] S. A. Hellberg and E. Za-luska. A portable parallelprogramming environmentbased around PCTE. Infor-mation and Software Tech-nology, 36(7):419–425, July1994. CODEN ISOTE7.ISSN 0950-5849 (print),1873-6025 (electronic).

Hempel:1996:APT

[HZ96] R. Hempel and F. Zim-mermann. On the au-tomatic PARMACS-to-MPItransformation in applica-tion programs. In Lid-dell et al. [LCHS96], pages1033–1034. ISBN 3-540-61142-8 (paperback). LCCNQA76.88 .H52 1996.

Hempel:1999:AMP

[HZ99] Rolf Hempel and Falk Zim-mermann. Automatic mi-gration from PARMACSto MPI in parallel For-tran applications. Scien-tific Programming, 7(1):39–46, ???? 1999. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic). URL http://iospress.metapress.com/app/home/contribution.asp%3Fwasp=64cr5a4mg33tuhcbdr02%26referrer=parent%26backto=issue%2C3%2C7%3Bjournal%

REFERENCES 248

2C8%2C9%3Blinkingpublicationresults%2C1%2C1.

Hou:2008:BBS

[HZG08] Qiming Hou, Kun Zhou,and Baining Guo. BSGP:bulk-synchronous GPU pro-gramming. ACM Transac-tions on Graphics, 27(3):19:1–19:??, August 2008.CODEN ATGRDF. ISSN0730-0301 (print), 1557-7368(electronic).

Isaila:2010:SMP

[IBC+10] Florin Isaila, FranciscoJavier Garcia Blas, JesusCarretero, Wei keng Liao,and Alok Choudhary. A scal-able Message Passing Inter-face implementation of anad-hoc parallel I/O system.The International Journal ofHigh Performance Comput-ing Applications, 24(2):164–184, May 2010. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/24/2/164.full.pdf+html.

Isabel:2002:CMO

[ICC02] Dorta Isabel, Leon Coro-moto, and Rodrıguez Casiano.Comparing MPI and OpenMPimplementations of the 0-1knapsack problem. Paral-lel and Distributed Comput-ing Practices, 5(2):129–137,June 2002. CODEN ????ISSN 1097-2803.

Issman:1994:PME

[IDD94] E. Issman, G. Degrez, andJ. De Keyser. A paral-lel multiblock Euler/Navier–Stokes solver on a clus-ter of workstations us-ing PVM. In Gentzschand Harms [GH94], pages157–162. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Ibanez:2016:HMT

[IDS16] Dan Ibanez, Ian Dunn, andMark S. Shephard. HybridMPI-thread parallelizationof adaptive mesh operations.Parallel Computing, 52(??):133–143, February 2016.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819116000041.

IEEE:1991:PSA

[IEE91] IEEE, editor. Proceedings,Supercomputing ’91: Albu-querque, New Mexico, Nov-ember 18–22, 1991. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 1991. ISBN0-8186-9158-1 (IEEE: case),0-8186-2158-3 (IEEE: pa-per), 0-8186-6158-5 (IEEE:microfiche), 0-89791-459-7(ACM). LCCN QA76.5.S894 1991. IEEE catalog no.91CH3058-5.

REFERENCES 249

IEEE:1992:PSH

[IEE92] IEEE, editor. Proceed-ings / Scalable High Per-formance Computing Con-ference, SHPCC-92, April26–29, 1992, Williamsburg,Virginia. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1992. ISBN 0-8186-2775-1. LCCN QA76.76.A65S331992. IEEE catalog no.92TH0432-5.

IEEE:1993:DPC

[IEE93a] IEEE, editor. Digest ofpapers: Compcon spring’93, San Francisco, Cal-ifornia, February 22–26,1993. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1993. ISBN 0-8186-3400-6. LCCN QA75.5.C58 1993.IEEE catalog no. 93CH3251-6.

IEEE:1993:PSI

[IEE93b] IEEE, editor. Proceed-ings / Seventh InternationalParallel Processing Sympo-sium, April 13–16, 1993,Newport Beach, Califor-nia. IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 1993. ISBN0-8186-3442-1. LCCN QA76.58 I56 1993. IEEE cat-alog no. 93TH0513-2.

IEEE:1993:PIS

[IEE93c] IEEE, editor. Proceedingsof the 2nd InternationalSymposium on High Per-formance Distributed Com-puting, July 20–23, 1993,Spokane, Washington, Ca-vanaugh’s Inn at the Park,Proceedings of the Interna-tional Symposium on HighPerformance DistributedComputing 2nd. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1993. ISBN 0-8186-3900-8, 0-8186-3901-6. LCCNQA76.9.D5I593 1993. IEEEcatalog no. 93TH0550-4.

IEEE:1993:PFW

[IEE93d] IEEE, editor. Proceed-ings of the Fourth Workshopon Future Trends of Dis-tributed Computing Systems,September 22–24, 1993, Lis-bon, Portugal. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1993. ISBN 0-8186-4430-3.LCCN QA76.9.D5I335 1993.IEEE catalog no. 93TH0574-4.

IEEE:1993:PSP

[IEE93e] IEEE, editor. Proceedings,Supercomputing ’93: Port-land, Oregon, November 15–19, 1993. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,

REFERENCES 250

1993. ISBN 0-8186-4340-4 (paperback), 0-8186-4341-2 (microfiche), 0-8186-4342-0 (hardback), 0-8186-4346-3 (CD-ROM). ISSN 1063-9535. LCCN QA76.5 .S961993.

IEEE:1993:WHP

[IEE93f] IEEE, editor. Workshopon Heterogeneous Process-ing (1992: Beverly Hills,Calif.) Proceedings / Work-shop on Heterogeneous Pro-cessing, March 23, 1992,Beverly Hills, California.IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring,MD 20910, USA, 1993.ISBN 0-8186-2702-6. LCCNQA76.58 .W654 1992.

IEEE:1994:FSF

[IEE94a] IEEE, editor. Frontiers’95,the 5th Symposium on theFrontiers of Massively Par-allel Computation: proceed-ings, February 6–9, 1995,McLean, Virginia. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 1994. ISBN0-8186-6965-9. LCCNQA76.58.S95 1994. IEEEcatalog no. 95TH8024.

IEEE:1994:IPN

[IEE94b] IEEE, editor. ICIP ’94:proceedings, November 13–16, 1994, Austin Conven-tion Center, Austin, Texas.IEEE Computer Society

Press, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 1994. ISBN0-8186-6952-7 (casebound),0-8186-6950-0 (paperback),0-8186-6951-9 (microfiche).LCCN TA1637.I25 1994.Three volumes. IEEE cata-log no. 94CH35708.

IEEE:1994:OOE

[IEE94c] IEEE, editor. Oceans 94:Oceans engineering for to-day’s technology and tomor-row’s preservation: proceed-ings, 13–16 September 13–16, 1994, Brest, France,Oceans. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1994. ISBN 0-7803-2057-3, 0-7803-2056-5, 0-7803-2058-1. ISSN 0197-7385.LCCN TC 1505 O331971994. Three volumes. IEEEcatalog no. 94CH3472-8.

IEEE:1994:PSI

[IEE94d] IEEE, editor. Proceedings /Second International Work-shop on Configurable Dis-tributed Systems, March 21–23, 1994, Carnegie Mel-lon University, Pittsburgh,Pennsylvania. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1994. ISBN 0-8186-5390-6.LCCN QA76.9.D5I595 1994.IEEE catalog no. 94TH0651-0.

REFERENCES 251

IEEE:1994:PIF

[IEE94e] IEEE, editor. Proceedingsof the 1994 IEEE FrequencyControl Symposium (the48th annual symposium),1–3 June 1994, WestinHotel-Copley Place, Boston,Massachusetts, USA. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 1994. ISBN0-7803-1945-1. LCCN TK7872 O7 I34 1994. IEEE cat-alog no. 94CH3446-2.

IEEE:1994:PSP

[IEE94f] IEEE, editor. Proceedingsof the Scalable Parallel Li-braries Conference, October6–8, 1993, Mississippi State,Mississippi. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1994. ISBN 0-8186-4980-1.LCCN QA76.58.S34 1993.

IEEE:1994:PTI

[IEE94g] IEEE, editor. Proceed-ings of the Third IEEEInternational Symposiumon High Performance Dis-tributed Computing, August2–5, 1994, San Francisco,California. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1994. ISBN 0-8186-6395-2.LCCN QA76.9.D5I328 1994.IEEE catalog no. 94TH0667-6.

IEEE:1994:PSW

[IEE94h] IEEE, editor. Proceed-ings, Supercomputing ’94:Washington, DC, November14–18, 1994, Supercomput-ing. IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 1994. ISBN 0-8186-6607-2, 0-8186-6605-6,0-8186-6606-4. ISSN 1063-9535. LCCN QA76.5 .S8941994. IEEE catalog number94CH34819.

IEEE:1995:IIC

[IEE95a] IEEE, editor. 1995 IEEEInternational Conference onSystems, Man, and Cyber-netics: intelligent systemsfor the 21st century: Van-couver, British Columbia,Canada, October 22–25,1995. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1995. ISBN 0-7803-2559-1.LCCN TA168.I19 1995. Fivevolumes. IEEE catalog no.95CH3576-7.

IEEE:1995:CPI

[IEE95b] IEEE, editor. Conferenceproceedings of the 1995 IEEEFourteenth Annual Interna-tional Phoenix Conferenceon Computers and Commu-nications: Scottsdale, Ari-zona, USA, March 28–31,1995, volume 14. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-

REFERENCES 252

ver Spring, MD 20910, USA,1995. ISBN 0-7803-2493-5,0-7803-2492-7, 0-7803-2494-3. LCCN TK7885.A1 I5671995. IEEE catalog no.95CH35751.

IEEE:1995:DPT

[IEE95c] IEEE, editor. Digest ofpapers / the Twenty-fifthInternational Symposiumon Fault-Tolerant Comput-ing, June 27–30, 1995,Pasadena, California. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 1995. ISBN 0-8186-7079-7. LCCN QA 76.9F38 I57 1995. IEEE catalogno. 95CB35823.

IEEE:1995:ISE

[IEE95d] IEEE, editor. Ideas in Sci-ence and Electronics Expo-sition and Symposium. Pro-ceedings: Albuquerque, NM,USA, 9–11 May 1995, vol-ume 17 of Annual Ideasin Science and ElectronicsExposition and SymposiumConference. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1995. ISBN ???? LCCN ????

IEEE:1995:IPR

[IEE95e] IEEE, editor. IEEE PacificRim Conference on Commu-nications, Computers, andSignal Processing: proceed-ings / May 17–19, 1995,Victoria Conference Centre,

Victoria, British Columbia,Canada. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1995. ISBN 0-7803-2553-2. LCCN TK 5101 A1 I341995. IEEE catalog no.95CH35765.

IEEE:1995:PIP

[IEE95f] IEEE, editor. Proceedings/ 9th International Paral-lel Processing Symposium,April 25–28, 1995, SantaBarbara, California. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 1995. ISBN0-8186-7074-6. LCCN QA76.58 I56 1995. IEEE cat-alog no. 95TH8052.

IEEE:1995:PSI

[IEE95g] IEEE, editor. Proceed-ings / Seventh IEEE Sym-posium on Parallel and Dis-tributed Processing, Octo-ber 25–28, 1995, San An-tonio, Texas. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1995. ISBN 0-8186-7195-5. LCCN QA 76.58 I421995. IEEE catalog number95TB8131.

IEEE:1995:PEW

[IEE95h] IEEE, editor. Proceed-ings: Euromicro Workshopon Parallel and DistributedProcessing, San Remo, Italy,

REFERENCES 253

January 25–27, 1995, Eu-romicro Workshop on Paral-lel and Distributed Process-ing 1995; 3rd. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1995. ISBN 0-8186-7031-2, 0-8186-7032-0. LCCNQA76.58 .E97 1995.

IEEE:1995:PIC

[IEE95i] IEEE, editor. Proceedings ofthe 15th International Con-ference on Distributed Com-puting Systems: Vancou-ver, BC, Canada, 30 May–2 June 1995. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1995. ISBN 0-8186-7025-8.LCCN ???? IEEE catalognumber 95CH35784.

IEEE:1995:PSP

[IEE95j] IEEE, editor. Proceedingsof the 1994 Scalable ParallelLibraries Conference: Oc-tober 12–14, 1994, Missis-sippi State University, Mis-sissippi. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1995. ISBN 0-8186-6895-4.LCCN QA76.58 .S34 1994.

IEEE:1995:PFI

[IEE95k] IEEE, editor. Proceed-ings of the Fourth IEEEInternational Symposiumon High Performance Dis-tributed Computing, August

2–4, 1995, Washington, DC,USA. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1995. ISBN 0-8186-7088-6. LCCN QA76.9.D5 I3281995. IEEE catalog no.95TB8075.

IEEE:1995:PNA

[IEE95l] IEEE, editor. Proceedings:the nineteenth annual In-ternational Computer Soft-ware and Applications Con-ference (COMPSAC ’95):August 9–11, 1995, Dal-las, Texas. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1995. ISBN 0-8186-7119-X. LCCN QA 76.6 C62951995. IEEE catalog no.95CB35838.

IEEE:1996:ICH

[IEE96a] IEEE, editor. 3rd In-ternational Conference onHigh Performance Comput-ing: proceedings, Decem-ber 19–22, 1996, Trivan-drum, India. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1996. ISBN 0-8186-7557-8. LCCN QA76.88.I5751996. IEEE catalog number96TB100074.

IEEE:1996:EIS

[IEE96b] IEEE, editor. Eighth IEEESymposium on Parallel and

REFERENCES 254

Distributed Processing: Oc-tober 23–26, 1996, NewOrleans, Louisiana. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 1996. ISBN0-8186-7683-3, 0-8186-7685-X (microfiche). LCCNQA76.58 .I42 1996. IEEEComputer Society Press or-der number PR07683. IEEEOrder Plan catalog number96TB100088.

IEEE:1996:FSS

[IEE96c] IEEE, editor. Frontiers’96,the Sixth Symposium onthe Frontiers of MassivelyParallel Computation: Oc-tober 27–31, 1996, An-napolis, Maryland: proceed-ings. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1996. ISBN 0-8186-7551-9. LCCN QA76.58 .S951996. IEEE catalog number96TB100062.

IEEE:1996:PIS

[IEE96d] IEEE, editor. Proceedingsof 1996 IEEE Second In-ternational Conference onAlgorithms and Architec-tures for Parallel Processing,ICA PP ’96: June 11–13,1996, Singapore. IEEE Com-puter Society Press, 1109Spring Street, Suite 300,Silver Spring, MD 20910,USA, 1996. ISBN 0-7803-3529-5 (softbound), 0-7803-

3530-9 (microfiche). LCCNQA76.58.I33 1996. IEEEcatalog number 96TH8204.

IEEE:1996:PII

[IEE96e] IEEE, editor. Proceed-ings of IPPS ’96. The 10thInternational Parallel Pro-cessing Symposium: Hon-olulu, HI, USA, 15–19 April1996. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1996. ISBN 0-8186-7255-2. LCCN QA76.58 .I5651996. IEEE catalog number96TB100038. IEEE Com-puter Society Press ordernumber PR07255.

IEEE:1996:PFI

[IEE96f] IEEE, editor. Proceedings ofthe Fifth IEEE InternationalSymposium on High Perfor-mance Distributed Comput-ing, Syracuse, NY, USA, 6–9 August 1996. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1996. ISBN 0-8186-7582-9. LCCN QA 76.88 I521996. IEEE catalog numberTB100069.

IEEE:1996:PFE

[IEE96g] IEEE, editor. Proceed-ings of the fourth EuromicroWorkshop on Parallel andDistributed Processing (PDP’96): January 24–26, 1996,Braga, Portugal. IEEE Com-puter Society Press, 1109

REFERENCES 255

Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1996. ISBN 0-8186-7376-1. LCCN QA76.58 .E971996. IEEE order numberPR07376.

IEEE:1996:PSI

[IEE96h] IEEE, editor. Proceedingsof the Seventh Israeli Con-ference on Computer Sys-tems and Software Engineer-ing: June 12–13, 1996, Her-zliya, Israel. IEEE Com-puter Society Press, 1109Spring Street, Suite 300,Silver Spring, MD 20910,USA, 1996. ISBN 0-8186-7536-5. LCCN QA75.5 .I751996. IEEE Computer So-ciety Press Order NumberPR07536.

IEEE:1996:PSM

[IEE96i] IEEE, editor. Proceed-ings. Second MPI Devel-oper’s Conference: NotreDame, IN, USA, 1–2 July1996. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1996. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

IEEE:1997:APD

[IEE97a] IEEE, editor. Advances inparallel and distributed com-puting: March 19–21, 1997,Shanghai, China: proceed-ings. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,

1997. ISBN 0-8186-7876-3 (paperback and case),0-8186-7878-X (microfiche).LCCN QA76.58 .A4 1997.

IEEE:1997:PIP

[IEE97b] IEEE, editor. Proceedings.11th International Paral-lel Processing Symposium,April 1–5, 1997, Geneva,Switzerland. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1997. ISBN 0-8186-7793-7. LCCN QA76.58 .I561997. IEEE catalog number97TB100107. IEEE Com-puter Society Press ordernumber PR07792.

IEEE:1997:TIS

[IEE97c] IEEE, editor. Third Interna-tional Symposium on High-Performance Computer Ar-chitecture: proceedings,February 1–5, 1997, San An-tonio, Texas. IEEE Com-puter Society Press, 1109Spring Street, Suite 300, Sil-ver Spring, MD 20910, USA,1997. ISBN 0-8186-7764-3. LCCN QA76.9.A73I5661997. IEEE catalog number97TB100094.

IEEE:2002:STI

[IEE02] IEEE, editor. SC2002: FromTerabytes to Insight. Pro-ceedings of the IEEE ACMSC 2002 Conference, Nov-ember 16–22, 2002, Bal-timore, MD, USA. IEEEComputer Society Press,

REFERENCES 256

1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 2002. ISBN 0-7695-1524-X. LCCN ????

IEEE:2005:IPD

[IEE05] IEEE, editor. 19th Inter-national Parallel and Dis-tributed Processing Sympo-sium: proceedings: April4–8, 2005, Denver, Col-orado. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,2005. ISBN 0-7695-2312-9.LCCN ???? IEEE Com-puter Society Order NumberP2312.

Iida:2016:GET

[IFA+16] Yuki Iida, Yusuke Fujii,Takuya Azumi, NobuhikoNishio, and Shinpei Kato.GPUrpc: Exploring trans-parent access to remoteGPUs. ACM Transac-tions on Embedded Com-puting Systems, 16(1):17:1–17:??, November 2016. CO-DEN ???? ISSN 1539-9087(print), 1558-3465 (elec-tronic).

IFIP:1995:KWC

[IFI95] IFIP Working Group 2.5,editor. Kyoto Workshop1995: Current Directionsin Numerical Software andHigh Performance Comput-ing, 19–20 October 1995,Kyoto, Japan. ????, ????,1995. ISBN ???? LCCN

???? URL http://www.nsc.liu.se/~boein/ifip/kyoto/kyoto.html#reid;http://www.nsc.liu.se/

~boein/ifip/kyoto/workshop-info/proceedings/.

Iwasaki:2004:NPS

[IH04] Hideya Iwasaki and Zhen-jiang Hu. A new paral-lel skeleton for general ac-cumulative computations.International Journal ofParallel Programming, 32(5):389–414, October 2004.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=32&issue=5&spage=389.

Izaguirre:2005:PMS

[IHM05] Jesus A. Izaguirre, Scott S.Hampton, and ThierryMatthey. Parallel multigridsummation for the N -bodyproblem. Journal of Paralleland Distributed Computing,65(8):949–962, August 2005.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Iskra:2000:PMD

[IHvA+00] K. A. Iskra, Z. W. Hen-drikse, G. D. van Al-bada, B. J. Overeinder,and P. M. A. Sloot. Per-formance measurements onDynamite/DPVM. Lec-ture Notes in Computer

REFERENCES 257

Science, 1908:27–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080027.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080027.pdf.

Ierotheou:2005:GOC

[IJM+05] C. S. Ierotheou, H. Jin,G. Matthews, S. P. Johnson,and R. Hood. GeneratingOpenMP code using an in-teractive parallelization en-vironment. Parallel Com-puting, 31(10–12):999–1012,October/December 2005.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Iwama:2001:PLS

[IKM+01] Kazuo Iwama, DaisukeKawai, Shuichi Miyazaki,Yasuo Okabe, and JunUmemoto. Parallelizing lo-cal search for CNF sat-isfiability using vectoriza-tion and PVM. LectureNotes in Computer Sci-ence, 1982:123–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1982/19820123.htm;http://link.springer-

ny.com/link/service/series/

0558/papers/1982/19820123.pdf.

Iwama:2002:PLS

[IKM+02] Kazuo Iwama, DaisukeKawai, Shuichi Miyazaki,Yasuo Okabe, and JunUmemoto. Parallelizing localsearch for CNF satisfiabil-ity using vectorization andPVM. ACM Journal of Ex-perimental Algorithmics, 7:2, ???? 2002. CODEN ????ISSN 1084-6654.

Iwashita:1994:IPE

[IM94] S. Iwashita and K. Mu-rakami. Implementation andperformances evaluation ofKU PVM3/AP1000. En-gineering Sciences Reports,Kyushu University, 16(3):345–352, December 1994.CODEN SRKHEK. ISSN0388-1717.

Ingle:1995:MAS

[IM95] N. K. Ingle and T. J.Mountziaris. A multifrontalalgorithm for the solution oflarge systems of equationsusing network-based parallelcomputing. Computers &Chemical Engineering, 19(6-7):671–681, June-July 1995.CODEN CCENDW. ISSN0098-1354.

Ishizaka:2000:CGT

[IOK00] Kazuhisa Ishizaka, MotokiObata, and Hironori Kasa-hara. Coarse-grain task par-allel processing using theOpenMP backend of the

REFERENCES 258

OSCAR multigrain paral-lelizing compiler. LectureNotes in Computer Sci-ence, 1940:457–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400457.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400457.pdf.

Ilroy:2001:IMP

[IRU01] Jonathan Ilroy, Cyrille Ran-driamaro, and Gil Utard.Improving MPI-I/O perfor-mance on PVFS. Lec-ture Notes in Computer Sci-ence, 2150:911–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2150/21500911.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2150/21500911.pdf.

Ilie:2016:AEC

[IS16] Silvana Ilie and Arne Stor-johann. Abstracts of the2015 East Coast ComputerAlgebra Day. ACM Commu-nications in Computer Al-gebra, 50(1):35–39, March2016. CODEN ???? ISSN1932-2232 (print), 1932-2240(electronic).

Satake:2012:OGA

[iSYS12] Shin ichi Satake, HajimeYoshimori, and TakayukiSuzuki. Optimizations of aGPU accelerated heat con-duction equation by a pro-gramming of CUDA For-tran from an analysis of aPTX file. Computer PhysicsCommunications, 183(11):2376–2385, November 2012.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465512002068.

Imamura:2000:ASM

[ITKT00] Toshiyuki Imamura, YuichiTsujita, Hiroshi Koide, andHiroshi Takemiya. An ar-chitecture of Stampi: MPIlibrary on a cluster of par-allel computers. LectureNotes in Computer Sci-ence, 1908:200–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080200.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080200.pdf.

Ishihara:1999:VBS

[ITT99] S. Ishihara, S. Tani, andA. Takahara. Virtual BUS:a simple implementation ofan effortless networking sys-tem based on PVM. In

REFERENCES 259

Dongarra et al. [DLM99],pages 461–468. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Islam:2002:IAC

[ITT02] Mohammad Towhidul Islam,Parimala Thulasiraman, andRuppa K. Thulasiram. Im-plementation of ant colonyoptimization algorithm formobile ad hoc network ap-plications: OpenMP expe-riences. Parallel and Dis-tributed Computing Prac-tices, 5(2):177–191, June2002. CODEN ???? ISSN1097-2803.

Iskra:2000:IDE

[IvdLH+00] K. A. Iskra, F. van der Lin-den, Z. W. Hendrikse, B. J.Overeinder, G. D. van Al-bada, and P. M. A. Sloot.The implementation of dy-namite: an environment formigrating PVM tasks. Oper-ating Systems Review, 34(3):40–55, July 2000. CODENOSRED8. ISSN 0163-5980(print), 1943-586X (elec-tronic).

Jatala:2017:SSG

[JAK17] Vishwesh Jatala, JayvantAnantpur, and Amey Karkare.Scratchpad sharing in GPUs.ACM Transactions on Ar-chitecture and Code Opti-mization, 14(2):15:1–15:??,July 2017. CODEN ????

ISSN 1544-3566 (print),1544-3973 (electronic).

Jabbarzadeh:1997:PSS

[JAT97] A. Jabbarzadeh, J. D. Atkin-son, and R. I. Tanner. Par-allel simulation of shearflow of polymers betweenstructured walls by molecu-lar dynamics simulation onPVM. Computer PhysicsCommunications, 107(1–3):123–136, December 1997.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S001046559700088X.

Jacoby:1996:ADA

[JB96] G. H. (George H.) Jacobyand Jeannette V. Barnes,editors. Astronomical dataanalysis software and sys-tems V: meeting held at Tuc-son, Arizona, 23–25 October1995, volume 101 of Astro-nomical Society of the Pa-cific Conference Series. As-tronomical Society of thePacific, San Francisco, CA,USA, 1996. ISBN ????ISSN 1080-7926. LCCNQB51.3.E43 A87 1995.

Juhasz:1996:PIP

[JC96] Z. Juhasz and D. Crookes.A PVM implementation of aportable parallel image pro-cessing library. In Bode et al.[BDLS96], pages 188–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-

REFERENCES 260

3349 (electronic). LCCNQA76.58.E975 1996.

Jarzabek:2017:PEU

[JC17] Lukasz Jarzabek and PawelCzarnul. Performance eval-uation of unified mem-ory and dynamic paral-lelism for selected parallelCUDA applications. TheJournal of Supercomputing,73(12):5378–5401, Decem-ber 2017. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/content/pdf/10.1007/s11227-017-2091-x.pdf.

Jin:2008:PEM

[JCH+08] Haoqiang Jin, BarbaraChapman, Lei Huang, Di-eter an Mey, and ThomasReichstein. Performanceevaluation of a multi-zone application in differentOpenMP approaches. In-ternational Journal of Par-allel Programming, 36(3):312–325, June 2008. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=36&issue=3&spage=312.

Jaeger:2015:FGD

[JCP15] Julien Jaeger, Patrick Car-ribault, and Marc Perache.Fine-grain data manage-

ment directory for OpenMP4.0 and OpenACC. Con-currency and Computation:Practice and Experience,27(6):1528–1539, April 25,2015. CODEN CCPEBO.ISSN 1532-0626 (print),1532-0634 (electronic).

Jenkins:2014:PMD

[JDB+14] John Jenkins, James Di-nan, Pavan Balaji, TomPeterka, Nagiza F. Sama-tova, and Rajeev Thakur.Processing MPI deriveddatatypes on noncontigu-ous GPU-resident data.IEEE Transactions on Par-allel and Distributed Sys-tems, 25(10):2627–2637, Oc-tober 2014. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://www.computer.org/csdl/trans/td/2014/10/06600679-abs.html.

Jeremiassen:1995:RFS

[JE95] T. E. Jeremiassen and S. J.Eggers. Reducing false shar-ing on shared memory mul-tiprocessors through com-pile time data transforma-tions. ACM SIGPLANNotices, 30(8):179–188, Au-gust 1995. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Jesshope:1993:LRV

[Jes93a] C. Jesshope. Latency reduc-tion in VLSI routers. Par-

REFERENCES 261

allel Processing Letters, 3(4):485–494, December 1993.CODEN PPLTEE. ISSN0129-6264 (print), 1793-642X (electronic).

Jesshope:1993:MCA

[Jes93b] C. Jesshope. The MPI chipand its applications. InAnonymous [Ano93c], pages47–54. ISBN ???? LCCN????

Jann:1995:AMP

[JF95] Joefon Jann and HubertusFranke. Analysis of an MPIprogram using UTE on theIBM SP2. Research re-port RC 20085 (88832), IBMT. J. Watson Research Cen-ter, Yorktown Heights, NY,USA, 1995. 11 pp.

Johnson:2012:FOL

[JFGRF12] Tim Johnson, Pierre Fite-Georgel, Rahul Raguram,and Jan-Michael Frahm.Fast organization of largephoto collections usingCUDA. Lecture Notes inComputer Science, 6554:463–476, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/content/pdf/10.1007/978-3-642-35740-4_36.

Jin:2000:AGO

[JFY00] Haoqiang Jin, MichaelFrumkin, and Jerry Yan.Automatic generation of

OpenMP directives and itsapplication to computa-tional fluid dynamics codes.Lecture Notes in ComputerScience, 1940:440–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400440.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400440.pdf.

Jackson:1997:SYE

[JH97] D. J. Jackson and C. W.Humphres. A simple yeteffective load balancing ex-tension to the PVM soft-ware system. Parallel Com-puting, 22(12):1647–1660,February 21, 1997. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1997&volume=22&issue=12&aid=1112.

Jin:2011:HPC

[JJM+11] Haoqiang Jin, Dennis Jes-persen, Piyush Mehrotra,Rupak Biswas, Lei Huang,and Barbara Chapman.High performance comput-ing using MPI and OpenMPon multi-core parallel sys-tems. Parallel Comput-ing, 37(9):562–575, Septem-ber 2011. CODEN PA-

REFERENCES 262

COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0167819111000159.

Jo:2017:PMA

[JJPL17] Gangwon Jo, Jaehoon Jung,Jiyoung Park, and JaejinLee. Poster: MAPA: an au-tomatic memory access pat-tern analyzer for GPU ap-plications. ACM SIGPLANNotices, 52(8):443–444, Au-gust 2017. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Jin:2003:AMP

[JJY+03] Haoqiang Jin, Gabriele Jost,Jerry Yan, et al. Auto-matic multilevel paralleliza-tion using OpenMP. Sci-entific Programming, 11(2):177–190, 2003. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Januszewski:2010:ANS

[JK10] M. Januszewski and M. Kos-tur. Accelerating numer-ical solution of stochasticdifferential equations withCUDA. Computer PhysicsCommunications, 181(1):183–188, January 2010. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465509002999.

Jeun:2008:OPB

[JKHK08] Woo-Chul Jeun, Yang-SukKee, Soonhoi Ha, andChangdon Kee. Overcomingperformance bottlenecks inusing OpenMP on SMP clus-ters. Parallel Computing, 34(10):570–592, October 2008.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Jan:2017:ITF

[JKM+17] Bilal Jan, Fiaz Gul Khan,Bartolomeo Montrucchio,Anthony Theodore Chronopou-los, Shahaboddin Shamshir-band, and Abdul NasirKhan. Introducing ToPe–FFT: An OpenCL-basedFFT library targeting GPUs.Concurrency and Computa-tion: Practice and Expe-rience, 29(21):??, Novem-ber 10, 2017. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Jog:2013:OCT

[JKN+13] Adwait Jog, Onur Kayiran,Nachiappan ChidambaramNachiappan, Asit K. Mishra,Mahmut T. Kandemir, OnurMutlu, Ravishankar Iyer,and Chita R. Das. OWL: co-operative thread array awarescheduling techniques forimproving GPGPU perfor-mance. ACM SIGPLAN No-tices, 48(4):395–406, April2013. CODEN SINODQ.ISSN 0362-1340 (print),

REFERENCES 263

1523-2867 (print), 1558-1160(electronic).

Jie:2014:ASP

[JLS+14] Liang Jie, KenLi Li, LinShi, RangSu Liu, and JingMei. Accelerating solidifica-tion process simulation forlarge-sized system of liquidmetal atoms using GPU withCUDA. Journal of Com-putational Physics, 257(??):521–535, January 15, 2014.CODEN JCTPAH. ISSN0021-9991 (print), 1090-2716(electronic). URL http://www.sciencedirect.com/science/article/pii/S0021999113006803.

Julian-Moreno:2017:FPA

[JMdVG+17] Guillermo Julian-Moreno,Jorge E. Lopez de Ver-gara, Ivan Gonzalez, Luisde Pedro, Javier Royuela delVal, and Federico Simmross-Wattenberg. Fast parallelα-stable distribution func-tion evaluation and pa-rameter estimation usingOpenCL in GPGPUs. Statis-tics and Computing, 27(5):1365–1382, September 2017.CODEN STACE3. ISSN0960-3174 (print), 1573-1375(electronic).

Jorba:2001:SFF

[JML01] Josep Jorba, Tomas Mar-galef, and Emilio Luque.Simulation of forest firepropagation on parallel &distributed PVM platforms.Lecture Notes in Computer

Science, 2131:386–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310386.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310386.pdf.

Jung:2014:MCM

[JMS14] Jaewoon Jung, TakaharuMori, and Yuji Sugita. Mid-point cell method for hy-brid (MPI + OpenMP)parallelization of molecu-lar dynamics simulations.Journal of ComputationalChemistry, 35(14):1064–1072, May 30, 2014. CODENJCCHDD. ISSN 0192-8651(print), 1096-987X (elec-tronic).

Jo:2015:ALM

[JNL+15] Gangwon Jo, Jeongho Nah,Jun Lee, Jungwon Kim,and Jaejin Lee. Acceler-ating LINPACK with MPI-OpenCL on clusters ofmulti-GPU nodes. IEEETransactions on Paral-lel and Distributed Sys-tems, 26(7):1814–1825, July2015. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://www.computer.org/csdl/trans/td/2015/07/06846313-abs.html.

REFERENCES 264

Jones:1996:LLM

[Jon96] Chris R. Jones. Low la-tency MPI for Meiko CS/2and ATM clusters. Thesis(m.a.), Department of Com-puter Science, University ofCalifornia, Santa Barbara,Santa Barbara, CA, USA,1996.

Joubert:1994:PAL

[Jou94] A. Joubert. Parallel algo-rithms for linear and nonlin-ear equations derived fromnetworks. In Joubert et al.[JPTE94], pages 145–152.ISBN 0-444-81841-3. LCCNQA76.58 .P3794 1993.

Jiang:2012:OSP

[JPOJ12] Lei Jiang, Pragneshku-mar B. Patel, George Os-trouchov, and FerdinandJamitzky. OpenMP-styleparallelism in data-centeredmulticore computing with R.ACM SIGPLAN Notices, 47(8):335–336, August 2012.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPOPP ’12 confer-ence proceedings.

Juric:1995:UPV

[JPP95] M. Juric, W. D. Potter, andM. Plaksin. Using the Paral-lel Virtual Machine for hunt-ing snake-in-the-box codes.In Arabnia [Ara95], pages97–102. ISBN 90-5199-187-8 (IOS Press), 4-274-90017-7

(Ohmsha). ISSN 0925-4986.LCCN ????

Joldes:2014:SSH

[JPT14] Mioara Joldes, ValentinaPopescu, and WarwickTucker. Searching for sinksfor the Henon map usinga multiple-precision GPUarithmetic library. ACMSIGARCH Computer Archi-tecture News, 42(4):63–68,September 2014. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic).

Joubert:1994:PCT

[JPTE94] G. R. Joubert, F. J. Pe-ters, D. Trystram, and D. J.Evans, editors. Parallelcomputing: trends and ap-plications: proceedings ofthe international conferenceParCo93, Grenoble, France,7–10 September 1993, vol-ume 9 of Advances in parallelcomputing. North-Holland,Amsterdam, The Nether-lands, 1994. ISBN 0-444-81841-3. LCCN QA76.58.P3794 1993.

Jost:2010:EUH

[JR10] Gabriele Jost and BobRobins. Experiences usinghybrid MPI/OpenMP in thereal world: Parallelization ofa 3D CFD solver for multi-core node clusters. ScientificProgramming, 18(3–4):127–138, ???? 2010. CODENSCIPEV. ISSN 1058-9244

REFERENCES 265

(print), 1875-919X (elec-tronic).

Jimenez:2013:BCA

[JR13] Jesus Jimenez and JuanRuiz de Miras. Box-countingalgorithm on GPU andmulti-core CPU: an OpenCLcross-platform study. TheJournal of Supercomputing,65(3):1327–1352, Septem-ber 2013. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-013-0885-z.

Judd:1994:PIV

[JRM+94] D. Judd, N. K. Ratha, P. K.McKinley, J. Weng, andA. K. Jain. Parallel im-plementation of vision algo-rithms on workstation clus-ters. In IEEE [IEE94e],pages 317–321 (vol. 3). ISBN0-7803-1945-1. LCCN TK7872 O7 I34 1994. IEEE cat-alog no. 94CH3446-2.

Jin:2013:PCU

[JS13] Hui Jin and Xian-He Sun.Performance comparison un-der failures of MPI andMapReduce: an analyti-cal approach. Future Gen-eration Computer Systems,29(7):1808–1815, Septem-ber 2013. CODEN FG-SEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic). URL http://

www.sciencedirect.com/science/article/pii/S0167739X13000290.

Jung:2005:DIM

[JSH+05] Hyungsoo Jung, DonginShin, Hyuck Han, Jai W.Kim, Heon Y. Yeom, andJongsuk Lee. Design andimplementation of multi-ple fault-tolerant MPI overMyrinet (M3). In ACM[ACM05], page 32. ISBN 1-59593-061-2. LCCN ????

Jaaskelainen:2015:PPP

[JSS+15] Pekka Jaaskelainen, CarlosSanchez de La Lama, ErikSchnetter, Kalle Raiskila,Jarmo Takala, and HeikkiBerg. pocl: A performance-portable OpenCL imple-mentation. InternationalJournal of Parallel Pro-gramming, 43(5):752–785,October 2015. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL http://link.springer.com/article/10.1007/s10766-014-0320-y.

Ju:1996:SPT

[JW96] Jiubin Ju and Yong Wang.Scheduling PVM tasks. Op-erating Systems Review, 30(3):22–31, July 1996. CO-DEN OSRED8. ISSN 0163-5980 (print), 1943-586X(electronic).

Jain:1996:IOP

[JWB96] Ravi Jain, John Werth,and James C. Browne, edi-

REFERENCES 266

tors. Input/output and par-allel and distributed com-puter systems. Kluwer Aca-demic Publishers Group,Norwell, MA, USA, and Dor-drecht, The Netherlands,1996. ISBN 0-7923-9735-5.LCCN QA76.58.I485 1996.

Jin:1995:LTP

[JY95] Lan Jin and Lan Yang. Alaboratory for teaching par-allel computing on parallelstructures. SIGCSE Bul-letin (ACM Special Inter-est Group on Computer Sci-ence Education), 27(1):71–75, March 1995. CODENSIGSD3. ISSN 0097-8418(print), 2331-3927 (elec-tronic).

Kumar:1995:MWD

[KA95] S. Kumar and H. Adeli. Min-imum weight design of largestructures on a network ofworkstations. Microcom-puters in Civil Engineering,10(6):423–432, November1995. CODEN MCENE7.ISSN 0885-9507.

Kepner:2004:M

[KA04] Jeremy Kepner and StanAhalt. MatlabMPI. Journalof Parallel and DistributedComputing, 64(8):997–1005,August 2004. CODENJPDCER. ISSN 0743-7315(print), 1096-0848 (elec-tronic).

Kumar:2013:GAI

[KA13] Piyush Kumar and AnupamAgrawal. Gpu-acceleratedinteractive visualization of3D volumetric data usingCUDA. International Jour-nal of Image and Graph-ics (IJIG), 13(2):??, April2013. CODEN ???? ISSN0219-4678. URL http://doi.acm.org/10.1142/S0219467813400032.

Krawezik:2002:SOV

[KAC02] Geraud Krawezik, Guil-laume Alleon, and FranckCappello. SPMD OpenMPversus MPI on a IBMSMP for 3 kernels of theNAS benchmarks. Lec-ture Notes in Computer Sci-ence, 2327:425–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270425.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270425.pdf.

Krone:1996:ICF

[KAHS96] O. Krone, M. Aguilar,B. Hirsbrunner, and V. Sun-deram. Integrating coor-dination features in PVM.In Ciancarini and Han-kin [CH96], pages 432–435.ISBN 3-540-61052-9. ISSN0302-9743 (print), 1611-

REFERENCES 267

3349 (electronic). LCCNQA76.58.I52 1996.

Kapinos:2010:PPP

[KaM10] Paul Kapinos and Dieteran Mey. Productivity andperformance portability ofthe OpenMP 3.0 taskingconcept when applied toan engineering code writ-ten in Fortran 95. In-ternational Journal of Par-allel Programming, 38(5–6):379–395, October 2010.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=38&issue=5&spage=379.

Khan:2017:RCS

[KAMAMA17] Ayaz H. Khan, MayezAl-Mouhamed, MuhammedAl-Mulhem, and Adel F.Ahmed. RT-CUDA: Asoftware tool for CUDAcode restructuring. Inter-national Journal of Paral-lel Programming, 45(3):551–594, June 2017. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic).

Kanal:2012:PAI

[Kan12] M. E. Kanal. Parallel al-gorithm on inversion for ad-jacent pentadiagonal matri-ces with MPI. The Journalof Supercomputing, 59(2):1071–1078, February 2012.

CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=59&issue=2&spage=1071.

Katamneni:1993:PPE

[Kat93] Sreevenu Katamneni. Paral-lel processing extensions toVerilog HDL using the PVMenvironment. M.s.e.e. the-sis, Department of Electri-cal Engineering, Universityof Alabama, Tuscaloosa, AL,USA, 1993. viii + 108 pp.

Karlsson:1998:CCC

[KB98] S. Karlsson and M. Brors-son. A comparative char-acterization of communica-tion patterns in applica-tions using MPI and sharedmemory on an IBM SP2.Lecture Notes in ComputerScience, 1362:189–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Kaiser:2001:OCC

[KB01] Timothy H. Kaiser andScott B. Baden. Overlappingcommunication and compu-tation with OpenMP andMPI. Scientific Program-ming, 9(2–3):73–81, Spring–Summer 2001. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic). URL http://iospress.metapress.com/

REFERENCES 268

app/home/contribution.asp%3Fwasp=7pab6qgbaf8vxg991rwy%26referrer=parent%26backto=issue%2C2%2C11%3Bjournal%2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

Kruzel:2013:VOI

[KB13] Filip Kruzel and KrzysztofBanas. Vectorized OpenCLimplementation of numeri-cal integration for higher or-der finite elements. Com-puters and Mathematics withApplications, 66(10):2030–2044, December 2013. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S089812211300521X.

Kabir:2002:DIS

[KBA02] Yacine Kabir and A. Belhadj-Aissa. Distributed imagesegmentation system by amulti-agents approach (un-der PVM environment). Lec-ture Notes in Computer Sci-ence, 2474:138–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740138.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740138.pdf.

Klemm:2009:RTM

[KBG+09] Michael Klemm, MatthiasBezold, Stefan Gabriel,

Ronald Veldema, and MichaelPhilippsen. Reparalleliza-tion techniques for migratingOpenMP codes in computa-tional grids. Concurrencyand Computation: Prac-tice and Experience, 21(3):281–299, March 10, 2009.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Kulkarni:2016:HAP

[KBG16] Kedar Kulkarni, ShreeyaBadhe, and GeetanjaliGadre. HCA aware paral-lel communication library:A feasibility study for of-floading MPI requirements.Supercomputing Frontiersand Innovations, 3(3):56–60, ???? 2016. CO-DEN ???? ISSN 2409-6008 (print), 2313-8734(electronic). URL http://superfri.org/superfri/article/view/109.

Knies:1994:SLL

[KBHA94] A. D. Knies, F. R. Barriuso,W. J. Harrod, and G. B.Adams, III. SLICC: a low la-tency interface for collectivecommunications. In IEEE[IEE94h], pages 89–96. ISBN0-8186-6607-2, 0-8186-6605-6, 0-8186-6606-4. ISSN 1063-9535. LCCN QA76.5 .S8941994. IEEE catalog number94CH34819.

Kitowski:1997:CPM

[KBM97] J. Kitowski, K. Boryczko,and J. Moscinski. Compari-

REFERENCES 269

son of PVM and MPI perfor-mance in short-range molec-ular dynamics simulation.Lecture Notes in ComputerScience, 1332:11–16, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Kannan:2016:HPP

[KBP16] Ramakrishnan Kannan, GreyBallard, and Haesun Park.A high-performance paral-lel algorithm for nonneg-ative matrix factorization.ACM SIGPLAN Notices, 51(8):9:1–9:??, August 2016.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Ke:2004:RCM

[KBS04] Jian Ke, Martin Burtscher,and Evan Speight. Runtimecompression of MPI mes-sages to improve the perfor-mance and scalability of par-allel applications. In ACM[ACM04], page 59. ISBN 0-7695-2153-3. LCCN ????

Klemm:2007:JIO

[KBVP07] Michael Klemm, MatthiasBezold, Ronald Veldema,and Michael Philippsen.JaMP: an implementation ofOpenMP for a Java DSM.Concurrency and Computa-tion: Practice and Experi-ence, 19(18):2333–2352, De-cember 25, 2007. CODENCCPEBO. ISSN 1532-0626

(print), 1532-0634 (elec-tronic).

Karamcheti:1994:SOM

[KC94] Vijay Karamcheti and An-drew A. Chien. Softwareoverhead in messaging lay-ers: where does the timego? ACM SIGPLAN No-tices, 29(11):51–60, Novem-ber 1994. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). URL http://www.acm.org:80/pubs/citations/proceedings/asplos/195473/p51-karamcheti/.

Krawezik:2006:PCM

[KC06] Geraud Krawezik and FranckCappello. Performance com-parison of MPI and OpenMPon shared memory multi-processors. Concurrencyand Computation: Practiceand Experience, 18(1):29–61, January 2006. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Kacsuk:1997:GDD

[KCD+97] Peter Kacsuk, Jose C.Cunha, Gabor Dozsa, JoaoLourenco, Tibor Fadgyas,and Tiago Antao. A graph-ical development and de-bugging environment forparallel programs. Paral-lel Computing, 22(13):1747–1770, February 28, 1997.CODEN PACOEJ. ISSN

REFERENCES 270

0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1997&volume=22&issue=13&aid=1126.

Konuru:1994:ULP

[KCP+94a] R. Konuru, J. Casas,R. Prouty, S. Otto, andJ. Walpole. A user-level pro-cess package for PVM. InPierce and Regnier [PR94b],pages 48–55. ISBN 0-8186-5680-8, 0-8186-5681-6.LCCN QA76.58.S32 1994.IEEE catalog no. 94TH0637-9.

Konuru:1994:UPP

[KCP+94b] R. Konuru, J. Casas,R. Prouty, S. Otto, andJ. Walpole. A user-level pro-cess package for PVM. InPierce and Regnier [PR94b],pages 48–55. ISBN 0-8186-5680-8, 0-8186-5681-6.LCCN QA76.58.S32 1994.IEEE catalog no. 94TH0637-9.

Kotselidis:2017:HMR

[KCR+17] Christos Kotselidis, JamesClarkson, Andrey Rod-chenko, Andy Nisbet, JohnMawer, and Mikel Lujan.Heterogeneous managed run-time systems: a computervision case study. ACMSIGPLAN Notices, 52(7):74–82, July 2017. CODENSINODQ. ISSN 0362-1340

(print), 1523-2867 (print),1558-1160 (electronic).

Kanal:2012:MMC

[KD12] M. E. Kanal and M. Demi-ralp. A modified methodof calculating High Dimen-sional Model Representa-tion (HDMR) Terms forparallelization with MPIand CUDA. The Jour-nal of Supercomputing, 62(1):199–213, October 2012.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=62&issue=1&spage=199.

Krotkiewski:2013:ESC

[KD13] Marcin Krotkiewski andMarcin Dabrowski. Effi-cient 3D stencil computa-tions using CUDA. Paral-lel Computing, 39(10):533–548, October 2013. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S016781911300094X.

Klingebiel:1995:COD

[KDL+95a] P. Klingebiel, R. Diekmann,U. Lefarth, M. Fischer, andJ. Seuss. CAMeL/PVM: anopen, distributed CAE envi-ronment for modelling andsimulating mechatronic sys-tems. In Breitenecker andHusinsky [BH95], pages 645–

REFERENCES 271

650. ISBN 0-444-82241-0.LCCN A76.9.C65E966 1995.

Klingebiel:1995:CPO

[KDL+95b] P. Klingebiel, R. Diekmann,U. Lefarth, M. Fischer, andJ. Seuss. CAMeL/PVM: Anopen, distributed CAE envi-ronment for modelling andsimulating mechatronic sys-tems. In Breitenecker andHusinsky [BH95], pages 645–650. ISBN 0-444-82241-0.LCCN A76.9.C65E966 1995.

Kakimoto:2012:PCG

[KDSO12] Takeshi Kakimoto, KeisukeDohi, Yuichiro Shibata, andKiyoshi Oguri. Perfor-mance comparison of GPUprogramming frameworkswith the striped Smith–Waterman algorithm. ACMSIGARCH Computer Archi-tecture News, 40(5):70–75,December 2012. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic). HEART ’12 confer-ence proceedings.

Klemm:2012:EOV

[KDT+12] Michael Klemm, AlejandroDuran, Xinmin Tian, HidekiSaito, and Diego Caballero.Extending OpenMP* withvector constructs for mod-ern multicore SIMD archi-tectures. Lecture Notesin Computer Science, 7312:59–72, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.

springer.com/chapter/10.1007/978-3-642-30961-8_5/.

Komatitsch:2010:HOF

[KEGM10] Dimitri Komatitsch, Gor-don Erlebacher, DominikGoddeke, and David Michea.High-order finite-elementseismic wave propagationmodeling with MPI on alarge GPU cluster. Journalof Computational Physics,229(20):7692–7714, October1, 2010. CODEN JCT-PAH. ISSN 0021-9991(print), 1090-2716 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0021999110003396.

Kepner:2005:PPM

[Kep05] Jeremy Kepner. Parallel pro-gramming with MatlabMPI.World-Wide Web site., 2005.URL http://www.ll.mit.edu/MatlabMPI/.

Kale:1996:PMD

[KFA96] R. P. Kale, M. E. Fleharty,and P. M. Alsing. Parallelmolecular dynamics visual-ization using MPI with MPEgraphics. In IEEE [IEE96i],pages 104–110. ISBN 0-8186-7533-0. LCCN QA76.642.M67 1996.

Kappiah:2005:JTD

[KFL05] Nandini Kappiah, Vin-cent W. Freeh, and David K.Lowenthal. Just in time dy-namic voltage scaling: Ex-ploiting inter-node slack to

REFERENCES 272

save energy in MPI pro-grams. In ACM [ACM05],page 33. ISBN 1-59593-061-2. LCCN ????

Kramer-Fuhrmann:1994:TGP

[KFSS94] O. Kramer-Fuhrmann, L. Schafers,and C. Scheidler. TRAP-PER — a graphical pro-gramming environment forparallel systems. In Becksand Perret-Gallix [BPG94],pages 3–15. ISBN 981-02-1699-8. LCCN QC793.47.E4I581993.

Kowalik:1993:SPC

[KG93] Janusz S. Kowalik and LucioGrandinetti, editors. Soft-ware for parallel computa-tion: Proceedings of theNATO Advanced Workshopon Software for ParallelComputation, held at Ce-traro, Cosenza, Italy, June22–26, 1992, volume 106of NATO ASI series. Se-ries F, Computer and sys-tems sciences. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / London,UK / etc., 1993. ISBN 3-540-56451-9 (Berlin), 0-387-56451-9 (New York). LCCNQA76.58 .S629 1993.

Kohl:1996:PTF

[KG96] J. A. Kohl and G. A. Geist.The PVM 3.4 tracing facilityand XPVM 1.1. In El-Rewiniand Shriver [ERS96], pages290–299. ISBN 0-8186-7324-9. ISSN 1060-3425. LCCN???? Five volumes.

Kainz:2009:RCM

[KGB+09] Bernhard Kainz, MarkusGrabner, Alexander Bornik,Stefan Hauswiesner, JudithMuehl, and Dieter Schmal-stieg. Ray casting of multi-ple volumetric datasets withpolyhedral boundaries onmanycore GPUs. ACMTransactions on Graphics,28(5):152:1–152:9, Decem-ber 2009. CODEN AT-GRDF. ISSN 0730-0301(print), 1557-7368 (elec-tronic).

Keller:2003:TEE

[KGK+03] Rainer Keller, Edgar Gabriel,Bettina Krammer, Matthias S.Muller, and Michael M.Resch. Towards efficientexecution of MPI applica-tions on the Grid: Port-ing and optimization is-sues. Journal of Grid Com-puting, 1(2):133–149, ????2003. CODEN ???? ISSN1570-7873 (print), 1572-9184(electronic). URL http://ipsapp008.kluweronline.com/IPS/content/ext/x/J/6160/I/4/A/4/abstract.htm.

Keller:2010:RAM

[KGRD10] Rainer Keller, Edgar Gabriel,Michael Resch, and JackDongarra, editors. RecentAdvances in the MessagePassing Interface: 17th Eu-ropean MPI Users’ GroupMeeting, EuroMPI 2010,Stuttgart, Germany, Septem-

REFERENCES 273

ber 12–15, 2010. Proceed-ings, volume 6305 of Lec-ture Notes in ComputerScience. Springer-Verlag,Berlin, Germany / Heidel-berg, Germany / London,UK / etc., 2010. CO-DEN LNCSD9. ISBN 3-642-15645-2 (print), 3-642-15646-0 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-642-15646-5.

Kafura:1996:CCC

[KH96] D. Kafura and L. Huang.Collective communicationand communicators in mpi++.In IEEE [IEE96i], pages 79–86. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

Kwon:2010:SPC

[KH10] Seongnam Kwon and Soon-hoi Ha. Serialized parallelcode generation frameworkfor MPSoC. ACM Transac-tions on Design Automationof Electronic Systems, 15(2):11:1–11:??, February 2010.CODEN ATASFO. ISSN1084-4309 (print), 1557-7309(electronic).

Karrenberg:2012:IPO

[KH12] Ralf Karrenberg and Se-bastian Hack. Improvingperformance of OpenCL onCPUs. Lecture Notes inComputer Science, 7210:

1–20, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-28652-0_1/.

Kramer:2015:SET

[KH15] Stephan C. Kramer and Jo-hannes Hagemann. SciPAL:Expression templates andcomposition closure objectsfor high performance compu-tational physics with CUDAand OpenMP. ACM Trans-actions on Parallel Com-puting (TOPC), 1(2):15:1–15:??, January 2015. CO-DEN ???? ISSN 2329-4949(print), 2329-4957 (elec-tronic).

Khanna:2013:HPN

[Kha13] Gaurav Khanna. High-precision numerical simula-tions on a CUDA GPU:Kerr black hole tails. Jour-nal of Scientific Comput-ing, 56(2):366–380, Au-gust 2013. CODEN JS-COEB. ISSN 0885-7474(print), 1573-7691 (elec-tronic). URL http://link.springer.com/article/10.1007/s10915-012-9679-3;http://link.springer.

com/content/pdf/10.1007/s10915-012-9679-3.pdf.

Kielmann:1999:MMC

[KHB+99] Thilo Kielmann, RutgerF. H. Hofman, Henri E.

REFERENCES 274

Bal, Aske Plaat, and RaoulA. F. Bhoedjang. Mag-PIe: MPI’s collective com-munication operations forclustered wide area systems.ACM SIGPLAN Notices, 34(8):131–140, August 1999.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). URL http://www.acm.org/pubs/citations/proceedings/ppopp/301104/p131-kielmann/.

Kucukboyaci:2001:PPT

[KHS01] Vefa Kucukboyaci, AlirezaHaghighat, and Glenn E.Sjoden. Performance ofPENTRAN TM 3-D paral-lel particle transport codeon the IBM SP2 and PC-TRAN cluster. LectureNotes in Computer Science,2131:36–??, 2001. CO-DEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310036.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310036.pdf.

Kjolstad:2012:ADG

[KHS12] Fredrik Kjolstad, TorstenHoefler, and Marc Snir.Automatic datatype gen-eration and optimization.ACM SIGPLAN Notices, 47(8):327–328, August 2012.

CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPOPP ’12 confer-ence proceedings.

Kojima:2017:HLG

[KI17] Kensuke Kojima and At-sushi Igarashi. A Hoarelogic for GPU kernels. ACMTransactions on Compu-tational Logic, 18(1):3:1–3:??, April 2017. CODEN???? ISSN 1529-3785(print), 1557-945X (elec-tronic).

Kikuchi:1993:PAS

[Kik93] S. Kikuchi. Paralleliza-tion assist system. Joho-Shori (J. Information Pro-cessing Soc. Japan), 34(9):1158–1169, September 1993.CODEN JOSHA4. ISSN0447-8053.

Kranz:1993:IMP

[KJA+93] David Kranz, Kirk L. John-son, Anant Agarwal, JohnKubiatowicz, and Beng-Hong Lim. Integratingmessage-passing and shared-memory: early experience.ACM SIGPLAN Notices, 28(7):54–63, July 1993. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Kwon:2012:HAO

[KJEM12] Okwan Kwon, Fahed Jubair,Rudolf Eigenmann, and

REFERENCES 275

Samuel Midkiff. A hybridapproach of OpenMP forclusters. ACM SIGPLANNotices, 47(8):75–84, August2012. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). PPOPP ’12conference proceedings.

Kim:2016:DOF

[KJJ+16] Junghyun Kim, GangwonJo, Jaehoon Jung, JungwonKim, and Jaejin Lee. A dis-tributed OpenCL frameworkusing redundant computa-tion and data replication.ACM SIGPLAN Notices,51(6):553–569, June 2016.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Kemelmakher:1998:SAR

[KK98] M. Kemelmakher and O. Kremien.Scalable and adaptive re-source sharing in PVM. Lec-ture Notes in Computer Sci-ence, 1497:196–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Karniadakis:2002:PSC

[KK02a] George Em Karniadakis andRobert M. Kirby. Paral-lel Scientific Computing inC++ and MPI: a Seam-less Approach to Parallel Al-gorithms. Cambridge Uni-versity Press, Cambridge,UK, 2002. ISBN 0-521-

52080-0 (paperback), 0-521-81754-4 (hardcover). xi +616 pp. LCCN QA76.58.K37 2003. US$50.00 (pa-perback), US$130.00 (hard-cover). URL ftp://uiarchive.cso.uiuc.edu/pub/etext/gutenberg/;http://www.loc.gov/catdir/description/cam031/2002034805.html; http://www.loc.gov/catdir/samples/cam033/2002034805.html; http://www.loc.gov/catdir/toc/cam031/2002034805.html.

Krysztop:2002:IFP

[KK02b] Bartosz Krysztop and Hen-ryk Krawczyk. Improvingflexibility and performanceof PVM applications by dis-tributed partial evaluation.Lecture Notes in ComputerScience, 2474:376–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740376.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740376.pdf.

Kranzlmuller:2004:RAP

[KKD04] Dieter Kranzlmuller, PeterKacsuk, and Jack J. Don-garra, editors. Recent Ad-vances in Parallel VirtualMachine and Message Pass-ing Interface: 11th Eu-ropean PVM/MPI Users’Group Meeting, Budapest,Hungary, September 19–22,

REFERENCES 276

2004: proceedings, volume3241 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2004. CO-DEN LNCSD9. ISBN 3-540-23163-3. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.E973 2004. URL http://www.springerlink.com/openurl.asp?genre=issue&issn=0302-9743&volume=3241; http://www.springerlink.com/openurl.asp?genre=volume&id=doi:10.1007/b100820.

Kranzlmuller:2005:RAP

[KKD05] Dieter Kranzlmuller, PeterKacsuk, and Jack Dongarra.Recent advances in Par-allel Virtual Machine andMessage Passing Interface.The International Journal ofHigh Performance Comput-ing Applications, 19(2):99–101, Summer 2005. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/19/2/99.full.pdf+html.

Kranzlmuller:2003:RAP

[KKDV03] Dieter Kranzlmuller, PeterKacsuk, Jack Dongarra, andJens Volkert. Recent ad-vances in parallel virtual ma-chine and message passinginterface (select papers fromthe EuroPVMMPI 2002Conference). The Interna-

tional Journal of High Per-formance Computing Appli-cations, 17(1):3–5, Spring2003. CODEN IHPCFL.ISSN 1094-3420 (print),1741-2846 (electronic).

Kee:2003:POP

[KKH03] Yang-Suk Kee, Jin-Soo Kim,and Soonhoi Ha. ParADE:An OpenMP programmingenvironment for SMP clustersystems. In ACM [ACM03],page ?? ISBN 1-58113-695-1. LCCN ???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10708#0; http://www.sc-conference.org/sc2003/paperpdfs/pap130.pdf.

Kwon:2008:RPP

[KKJ+08] Seongnam Kwon, YongjooKim, Woo-Chul Jeun, Soon-hoi Ha, and Yunheung Paek.A retargetable parallel-programming framework forMPSoC. ACM Transac-tions on Design Automa-tion of Electronic Systems,13(3):39:1–39:??, July 2008.CODEN ATASFO. ISSN1084-4309 (print), 1557-7309(electronic).

Kim:2011:ASC

[KKLL11] Jungwon Kim, HonggyuKim, Joo Hwan Lee, andJaejin Lee. Achieving a sin-gle compute device image inOpenCL for multiple GPUs.ACM SIGPLAN Notices, 46

REFERENCES 277

(8):277–288, August 2011.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPoPP ’11 Confer-ence proceedings.

Karami:2015:SPA

[KKM15] Ali Karami, Farshad Khun-jush, and Seyyed Ali Mir-soleimani. A statistical per-formance analyzer frame-work for OpenCL kernelson Nvidia GPUs. TheJournal of Supercomput-ing, 71(8):2900–2921, Au-gust 2015. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-014-1338-z.

Konstantinou:2001:TTO

[KKP01] Dimitris Konstantinou, Nec-tarios Koziris, and GeorgePapakonstantinou. TOP-PER: a tool for optimiz-ing the performance of par-allel applications. Lec-ture Notes in Computer Sci-ence, 2131:148–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310148.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310148.pdf.

Kobler:2001:DOP

[KKV01] Rene Kobler, Dieter Kran-zlmuller, and Jens Volk-ert. Debugging OpenMPprograms using event ma-nipulation. Lecture Notesin Computer Science, 2104:81–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040081.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040081.pdf.

Karrels:1994:PAM

[KL94] E. Karrels and E. Lusk. Per-formance analysis of MPIprograms. In Dongarra andTourancheau [DT94], pages195–200. ISBN 0-89871-343-9. LCCN QA76.58.I5681994.

Kofakis:1995:DPI

[KL95] P. Kofakis and J. Louis.Distributed parallel imple-mentation of seismic algo-rithms. In Hassanzadeh[Has95], pages 229–238. CO-DEN PSISDG. ISBN 0-8194-1930-3. ISSN 0277-786X(print), 1996-756X (elec-tronic). LCCN TS510.S63v.2571.

Liao:2011:DEM

[kL11] Wei keng Liao. Design andevaluation of MPI file do-

REFERENCES 278

main partitioning methodsunder extent-based file lock-ing protocol. IEEE Trans-actions on Parallel and Dis-tributed Systems, 22(2):260–272, February 2011. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic).

Liao:2006:SDI

[kLCC+06] Wei keng Liao, KeninColoma, Alok Choudhary,Lee Ward, Eric Russell, andNeil Pundit. Scalable de-sign and implementationsfor MPI parallel overlap-ping I/O. IEEE Transac-tions on Parallel and Dis-tributed Systems, 17(11):1264–1276, November 2006.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic).

Liao:2007:CCS

[kLCCW07] Wei keng Liao, KeninColoma, Alok Choudhary,and Lee Ward. Coopera-tive client-side file cachingfor MPI applications. TheInternational Journal ofHigh Performance Comput-ing Applications, 21(2):144–154, May 2007. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/21/2/144.full.pdf+html.

Klawonn:2015:HMO

[KLR+15] Axel Klawonn, Martin

Lanser, Oliver Rheinbach,Holger Stengel, and Ger-hard Wellein. HybridMPI/OpenMP paralleliza-tion in FETI–DP methods.In Mehl et al. [MBS15],pages 67–84. ISBN 3-319-22996-6, 3-319-22997-4 (e-book). LCCN QA71-90;TA329. URL http://link.springer.com/chapter/10.1007/978-3-319-22997-3_4/.

Kutyniok:2016:SFD

[KLR16] Gitta Kutyniok, Wang-QLim, and Rafael Reisen-hofer. ShearLab 3D: Faith-ful digital shearlet trans-forms based on compactlysupported shearlets. ACMTransactions on Mathemati-cal Software, 42(1):5:1–5:42,February 2016. CODENACMSCU. ISSN 0098-3500(print), 1557-7295 (elec-tronic).

Khanna:2010:NMG

[KM10] Gaurav Khanna and JustinMcKennon. Numericalmodeling of gravitationalwave sources accelerated byOpenCL. Computer PhysicsCommunications, 181(9):1605–1611, September 2010.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465510001682.

REFERENCES 279

Kormicki:1996:PLS

[KMC96] M. Kormicki, A. Mahmood,and B. S. Carlson. Paral-lel logic simulation on a net-work of workstations usingPVM. In IEEE [IEE96b],pages 2–9. ISBN 0-8186-7683-3, 0-8186-7685-X (mi-crofiche). LCCN QA76.58.I42 1996. IEEE Com-puter Society Press ordernumber PR07683. IEEE Or-der Plan catalog number96TB100088.

Kormicki:1997:PLS

[KMC97] Maciek Kormicki, AusifMahmood, and Bradley S.Carlson. Parallel logic sim-ulation on a network ofworkstations using paral-lel virtual machine. ACMTransactions on Design Au-tomation of Electronic Sys-tems, 2(2):123–134, Jan-uary 1997. CODENATASFO. ISSN 1084-4309(print), 1557-7309 (elec-tronic). URL http://www.acm.org/pubs/articles/journals/todaes/1997-2-2/p123-kormicki/p123-kormicki.pdf; http://www.acm.org/pubs/citations/journals/todaes/1997-2-2/p123-kormicki/.

Komatitsch:2009:PHO

[KME09] Dimitri Komatitsch, DavidMichea, and Gordon Er-lebacher. Porting a high-order finite-element earth-quake modeling applica-

tion to NVIDIA graphicscards using CUDA. Jour-nal of Parallel and Dis-tributed Computing, 69(5):451–460, May 2009. CODENJPDCER. ISSN 0743-7315(print), 1096-0848 (elec-tronic).

Koholka:1999:MPR

[KMG99] R. Koholka, H. Mayer, andA. Goller. MPI-parallelizedradiance on SGI CoW andSMP. Lecture Notes inComputer Science, 1557:549–558, 1999. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Kumar:2014:OMC

[KMH+14] Sameer Kumar, AmithMamidala, Philip Heidel-berger, Dong Chen, andDaniel Faraj. Optimizationof MPI collective operationson the IBM Blue Gene/Q su-percomputer. The Interna-tional Journal of High Per-formance Computing Appli-cations, 28(4):450–464, Nov-ember 2014. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/28/4/450.

Kobayashi:2016:HSV

[KMK16] Ryohei Kobayashi, Tomo-hiro Misono, and KenjiKise. A high-speed Ver-ilog HDL simulation method

REFERENCES 280

using a lightweight transla-tor. ACM SIGARCH Com-puter Architecture News, 44(4):26–31, September 2016.CODEN CANED2. ISSN0163-5964 (print), 1943-5851(electronic).

Kirk:2010:PMP

[KmWH10] David B. Kirk and Wenmei W. Hwu. Programmingmassively parallel processorshands-on with CUDA. Mor-gan Kaufmann Publishers,Los Altos, CA 94022, USA,2010. ISBN 0-12-381472-3. xviii + 258 pp. LCCNQA76.642 .K57 2010. Chap-ter 7 (pages 125–140) dis-cusses GPU floating-pointconsiderations.

Kalns:1995:DPD

[KN95] E. T. Kalns and L. M.Ni. DaReL: a portabledata redistribution libraryfor distributed-memory ma-chines. In IEEE [IEE95j],pages 78–87. ISBN 0-8186-6895-4. LCCN QA76.58 .S341994.

Katouda:2017:MOH

[KN17] Michio Katouda and TakahitoNakajima. MPI/OpenMPhybrid parallel algorithmfor resolution of identitysecond-order Møller–Plessetperturbation calculation ofanalytical energy gradientfor massively parallel multi-core supercomputers. Jour-nal of Computational Chem-istry, 38(8):489–507, March

30, 2017. CODEN JC-CHDD. ISSN 0192-8651(print), 1096-987X (elec-tronic).

Kasprzyk:2002:APV

[KNT02] Leszek Kasprzyk, RyszardNawrowski, and AndrzejTomczewski. Applicationof a parallel virtual ma-chine for the analysis ofa luminous field. Lec-ture Notes in Computer Sci-ence, 2474:122–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740122.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740122.pdf.

Komura:2014:CPG

[KO14] Yukihiro Komura and Yu-taka Okabe. CUDA pro-grams for the GPU com-puting of the Swendsen–Wang multi-cluster spin flipalgorithm: 2D and 3DIsing, Potts, and XY mod-els. Computer Physics Com-munications, 185(3):1038–1043, March 2014. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465513003743.

Kambites:2001:OLI

[KOB01] M. E. Kambites, J. Obdrzalek,

REFERENCES 281

and J. M. Bull. AnOpenMP-like interface forparallel programming inJava. Concurrency andComputation: Practice andExperience, 13(8–9):793–814, July/August 2001. CO-DEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic). URL http://www3.interscience.wiley.com/cgi-bin/abstract/84503220/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=84503220&PLACEBO=IE.pdf.

Kasahara:2001:ACG

[KOI01] Hironori Kasahara, Mo-toki Obata, and KazuhisaIshizaka. Automatic coarsegrain task parallel process-ing on SMP using OpenMP.Lecture Notes in ComputerScience, 2017:189–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2017/20170189.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2017/20170189.pdf.

Koniges:2000:ISP

[Kon00] Alice E. Koniges, editor.Industrial Strength ParallelComputing. Morgan Kauf-mann Publishers, Los Altos,CA 94022, USA, 2000. ISBN1-55860-540-1. xxv + 597

pp. LCCN QA76.58 .I4832000.

Kauranne:1995:OHM

[KOS+95a] T. Kauranne, J. Oinonen,S. Saarinen, O. Serimaa, andJ. Hietaniemi. The opera-tional HIRLAM 2 model onparallel computers (weatherforecasting). In Hoffmannand Kreitz [HK95], pages63–74. ISBN 981-02-2211-4.LCCN QC866.E26 1994.

Koski:1995:STL

[Kos95b] Kimmo Koski. A step to-wards large scale parallelism:Building a parallel comput-ing environment from het-erogeneous resources. FutureGeneration Computer Sys-tems, 11(4–5):491–498, Au-gust 1995. CODEN FG-SEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic).

Konuru:1997:MUL

[KOW97] Ravi B. Konuru, Steve W.Otto, and Jonathan Walpole.A migratable user-level pro-cess package for PVM. Jour-nal of Parallel and Dis-tributed Computing, 40(1):81–102, January 10, 1997.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1270/production;http://www.idealibrary.com/links/doi/10.1006/

REFERENCES 282

jpdc.1996.1270/production/pdf; http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1270/production/ref.

Kermarrec:1996:PDS

[KP96] Y. Kermarrec and L. Pautet.Programming distributedsystems with both Ada 95and PVM. In Toussaint[Tou96], pages 206–216.ISBN 3-540-60757-9. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.73.A35I57 1995.

Kuckuk:2013:IPD

[KPK13] Sebastian Kuckuk, To-bias Preclik, and HaraldKostler. Interactive parti-cle dynamics using OpenCLand Kinect. InternationalJournal of Parallel, Emer-gent and Distributed Sys-tems: IJPEDS, 28(6):519–536, 2013.

Klockner:2012:PPS

[KPL+12] Andreas Klockner, NicolasPinto, Yunsup Lee, BryanCatanzaro, Paul Ivanov,and Ahmed Fasih. Py-CUDA and PyOpenCL: ascripting-based approach toGPU run-time code gener-ation. Parallel Computing,38(3):157–174, March 2012.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819111001281.

Kolesnichenko:2016:CBG

[KPNM16] Alexey Kolesnichenko, Christo-pher M. Poskitt, Sebas-tian Nanz, and BertrandMeyer. Contract-basedgeneral-purpose GPU pro-gramming. ACM SIG-PLAN Notices, 51(3):75–84, March 2016. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Kuhn:2000:OVT

[KPO00] Bob Kuhn, Paul Petersen,and Eamonn O’Toole. OpenMPversus threading in C/C++.Concurrency: practice andexperience, 12(12):1165–1176, October 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500354/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500354&PLACEBO=IE.pdf.

Kamal:2005:SVT

[KPW05] Humaira Kamal, BradPenoff, and Alan Wagner.SCTP versus TCP for MPI.In ACM [ACM05], page 30.ISBN 1-59593-061-2. LCCN????

Klimach:2009:PCH

[KR09] Harald Klimach and Sabine P.Roller. Parallel couplingof heterogeneous domainswith KOP3D using PACX-MPI. In Tuncer et al.

REFERENCES 283

[TGEM09], pages 339–345.CODEN LNCSA6. ISBN 3-540-92743-3 (print), 3-540-92744-1 (e-book). ISSN1439-7358. LCCN ???? URLhttp://link.springer.com/content/pdf/10.1007/978-3-540-92744-0_42. ParallelCFD 2007 was held in An-talya, Turkey, from May 21to 24, 2007.

Kranzlmuller:2002:RAP

[Kra02] Dieter Kranzlmuller, editor.Recent advances in parallelvirtual machine and mes-sage passing interface: 9thEuropean PVM/MPI Users’Group Meeting, Linz, Aus-tria, September 29–October2, 2002: proceedings, vol-ume 2474 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2002. ISBN3-540-44296-0 (softcover).LCCN QA76.58 .E975 2002.Also available via the WorldWide Web.

Kouetcha:2017:USP

[KRC17] Daniella Nguemalieu Kou-etcha, Hamidreza Ramezani,and Nathalie Cohaut. Ul-trafast scalable parallel algo-rithm for the radial distri-bution function histogram-ming using MPI maps. TheJournal of Supercomputing,73(4):1629–1653, April 2017.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic).

Kunaseth:2013:ASD

[KRG13] Manaschai Kunaseth, David F.Richards, and James N.Glosli. Analysis of scalabledata-privatization thread-ing algorithms for hybridMPI/OpenMP paralleliza-tion of molecular dynamics.The Journal of Supercom-puting, 66(1):406–430, Oc-tober 2013. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-013-0915-x.

Kalentev:2011:CCL

[KRKS11] Oleksandr Kalentev, AbhaRai, Stefan Kemnitz, andRalf Schneider. Connectedcomponent labeling on a 2Dgrid using CUDA. Jour-nal of Parallel and Dis-tributed Computing, 71(4):615–620, April 2011. CO-DEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Kranzlmueller:1999:MOM

[KRS99] D. Kranzlmueller, R. Reuss-ner, and C. Schaubschlaeger.Monitor overhead measure-ment with SKaMPI. InDongarra et al. [DLM99],pages 43–50. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

REFERENCES 284

Kotsis:1996:EEP

[KS96] G. Kotsis and F. Sukup. Ef-ficiency evaluation of PVM2.X, PVM 3.X, P4, EX-PRESS and LINDA on aworkstation cluster using theNAS parallel benchmarks.In Zaky and Lewis [ZL96],pages 149–171. ISBN 0-7923-9675-8. LCCN QA76.58.T651996.

Krantz:1997:CSC

[KS97] A. T. Krantz and V. S.Sunderam. Client servercomputing on message pass-ing systems: Experienceswith PVM-RPC. LectureNotes in Computer Science,1300:110–??, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Krawczyk:2001:PIM

[KS01] Henryk Krawczyk and JamilSaif. Parallel image match-ing on PC cluster. Lec-ture Notes in Computer Sci-ence, 2131:312–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310312.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310312.pdf.

Kim:2013:MPE

[KS13] Yooseong Kim and Avi-

ral Shrivastava. Memoryperformance estimation ofCUDA programs. ACMTransactions on EmbeddedComputing Systems, 13(2):21:1–21:??, September 2013.CODEN ???? ISSN1539-9087 (print), 1558-3465(electronic).

Kaliman:2015:SNU

[KS15a] Ilya A. Kaliman and Lyud-mila V. Slipchenko. Softwarenews and updates: HybridMPI/OpenMP paralleliza-tion of the effective frag-ment potential method inthe libefp software library.Journal of ComputationalChemistry, 36(2):129–135,January 15, 2015. CODENJCCHDD. ISSN 0192-8651(print), 1096-987X (elec-tronic).

Kovanen:2015:TAC

[KS15b] Janne Kovanen and TapaniSarjakoski. Tilewise accumu-lated cost surface computa-tion with graphics process-ing units. ACM Transac-tions on Spatial Algorithmsand Systems (TSAS), 1(2):8:1–8:27, November 2015.CODEN ???? ISSN2374-0353 (print), 2374-0361(electronic). URL http://dl.acm.org/citation.cfm?id=2803172.

Kegel:2013:DTU

[KSG13] Philipp Kegel, Michel Steuwer,and Sergei Gorlatch. dOpenCL:

REFERENCES 285

Towards uniform program-ming of distributed hetero-geneous multi-/many-coresystems. Journal of Par-allel and Distributed Com-puting, 73(12):1639–1648,December 2013. CODENJPDCER. ISSN 0743-7315(print), 1096-0848 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0743731513001597.

Kusano:2001:OOC

[KSHS01] Kazuhiro Kusano, MitsuhisaSato, Takeo Hosomi, andYoshiki Seo. The OmniOpenMP compiler on thedistributed shared mem-ory of Cenju-4. LectureNotes in Computer Science,2104:20–??, 2001. CO-DEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040020.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040020.pdf.

Katkere:1995:VBW

[KSJ95] A. Katkere, J. Schlenzig,and R. Jain. VRML-BasedWWW interface to MPIvideo. In Nadeau and More-land [NM95], pages 25–31,137. ISBN 0-89791-818-5. LCCN QA76.76.H94 S951995. ACM order number434953.

Katkere:1996:VWI

[KSJ96] A. Katkere, J. Schlenzig,and R. Jain. VRML-based WWW interface toMPI video. In ACM[ACM96a], pages 25–31, 137.ISBN 0-89791-818-5. LCCN???? URL http://www.acm.org/pubs/contents/proceedings/graph/217306/.

Kim:2014:VVF

[KSJ14] Young-Joo Kim, Sejun Song,and Yong-Kee Jun. VORD:A versatile on-the-fly racedetection tool in OpenMPprograms. InternationalJournal of Parallel Program-ming, 42(6):900–930, De-cember 2014. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL http://link.springer.com/article/10.1007/s10766-013-0257-6.

Kim:2012:OUP

[KSL+12] Jungwon Kim, Sangmin Seo,Jun Lee, Jeongho Nah,Gangwon Jo, and Jaejin Lee.OpenCL as a unified pro-gramming model for hetero-geneous CPU/GPU clusters.ACM SIGPLAN Notices, 47(8):299–300, August 2012.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPOPP ’12 confer-ence proceedings.

REFERENCES 286

Kusano:2000:PEO

[KSS00] Kazuhiro Kusano, ShigehisaSatoh, and Mitsuhisa Sato.Performance evaluation ofthe omni OpenMP compiler.Lecture Notes in ComputerScience, 1940:403–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400403.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400403.pdf.

Kurzyniec:2007:UCA

[KSSS07] Dawid Kurzyniec, Mag-dalena Slawinska, JaroslawSlawinski, and Vaidy Sun-deram. Unibus: a con-trarian approach to Gridcomputing. The Jour-nal of Supercomputing, 42(1):125–144, October 2007.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=42&issue=1&spage=125.

Kranzlmuller:2001:IRM

[KSV01] Dieter Kranzlmuller, Chris-tian Schaubschlager, andJens Volkert. An inte-grated record&replay mech-anism for nondeterministicmessage passing programs.Lecture Notes in Computer

Science, 2131:192–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310192.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310192.pdf.

Keppens:2002:OPM

[KT02] R. Keppens and G. Toth.OpenMP parallelism formulti-dimensional grid-adaptivemagnetohydrodynamic sim-ulations. Lecture Notesin Computer Science, 2329:940–??, 2002. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2329/23290940.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2329/23290940.pdf.

Koval:2010:USB

[KT10] Peter Koval and J. D. Tal-man. Update of sphericalBessel transform: FFTWand OpenMP. Com-puter Physics Communi-cations, 181(12):2212–2213,December 2010. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://

REFERENCES 287

www.sciencedirect.com/science/article/pii/S0010465510003188.

Karonis:2003:MGG

[KTF03] Nicholas T. Karonis, BrianToonen, and Ian Foster.MPICH-G2: a Grid-enabledimplementation of the Mes-sage Passing Interface. Jour-nal of Parallel and Dis-tributed Computing, 63(5):551–563, May 2003. CODENJPDCER. ISSN 0743-7315(print), 1096-0848 (elec-tronic).

Komatitsch:2003:BDF

[KTJT03] Dimitri Komatitsch, SeijiTsuboi, Chen Ji, and JeroenTromp. A 14.6 billion de-grees of freedom, 5 teraflops,2.5 terabyte earthquake sim-ulation on the Earth Sim-ulator. In ACM [ACM03],page ?? ISBN 1-58113-695-1. LCCN ???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10711#1; http://www.sc-conference.org/sc2003/paperpdfs/pap124.pdf.

Kuhn:1998:FFW

[Kuh98] Bob Kuhn. Fortran Fu-tures: Workshop: OpenMPfor parallel Fortran applica-tions. ACM Fortran Fo-rum, 17(3):22, December1998. CODEN ???? ISSN1061-7264 (print), 1931-1311(electronic).

Kumar:1994:PPI

[Kum94] V. K. Prasanna Kumar, edi-tor. Parallel processing: 1stIWWP: proceedings of theFirst International Work-shop on Parallel Processing(IWPP-94), December 26–31, 1994, Bangalore, In-dia. Tata McGraw-Hill Pub.Co, New Delhi, India, 1994.ISBN 0-07-462332-X. LCCNQA 76.58 I587 1994.

Kranzlmueller:1998:DPP

[KV98] D. Kranzlmueller and J. Volk-ert. Debugging point-to-point communication in MPIand PVM. Lecture Notesin Computer Science, 1497:265–??, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Kolonias:2011:DIE

[KVGH11] Vasileios Kolonias, Artemios G.Voyiatzis, George Goulas,and Efthymios Housos. De-sign and implementation ofan efficient integer countsort in CUDA GPUs. Con-currency and Computation:Practice and Experience,23(18):2365–2381, Decem-ber 25, 2011. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Krotz-Vogel:1997:PPP

[KVH97] W. Krotz-Vogel and H.-C. Hoppe. The PALLAS

REFERENCES 288

parallel programming en-vironment. Lecture Notesin Computer Science, 1332:257–266, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Kamal:2014:IFG

[KW14] Humaira Kamal and AlanWagner. An integrated fine-grain runtime system forMPI. Computing, 96(4):293–309, April 2014. CODENCMPTA2. ISSN 0010-485X(print), 1436-5057 (elec-tronic). URL http://link.springer.com/article/10.1007/s00607-013-0329-x.

Kamburugamuve:2018:AML

[KWEF18] Supun Kamburugamuve,Pulasthi Wickramasinghe,Saliya Ekanayake, and Ge-offrey C. Fox. Anatomyof machine learning algo-rithm implementations inMPI, Spark, and Flink.The International Journal ofHigh Performance Comput-ing Applications, 32(1):61–73, January 2018. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic).

Kamal:2010:EIN

[KY10] A. A. Kamal and A. M.Youssef. Enhanced imple-mentation of the NTRUEn-crypt algorithm using graph-ics cards. In Chaudhuri et al.[CGB+10], pages 168–174.

ISBN 1-4244-7675-5. LCCN????

Karwande:2003:CMC

[KYL03] Amit Karwande, Xin Yuan,and David K. Lowen-thal. CC–MPI: a com-piled communication ca-pable MPI prototype forEthernet switched clusters.ACM SIGPLAN Notices, 38(10):95–106, October 2003.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Karwande:2005:MPC

[KYL05] Amit Karwande, Xin Yuan,and David K. Lowenthal. AnMPI prototype for compiledcommunication on Ethernetswitched clusters. Jour-nal of Parallel and Dis-tributed Computing, 65(10):1123–1133, October 2005.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Krantz:1996:RFP

[KZCS96] A. T. Krantz, A. Zadroga,S. E. Chodrow, and V. S.Sunderam. An RPC facil-ity for PVM. In Liddellet al. [LCHS96], pages 798–?? ISBN 3-540-61142-8 (pa-perback). LCCN QA76.88.H52 1996.

Lopez:2002:ESM

[LA02] Felix Cesar Garcıa Lopezand Nieves Luz Frıas Ar-rocha. Expanding the

REFERENCES 289

synchronization model forOpenMP. Parallel and Dis-tributed Computing Prac-tices, 5(2):169–175, June2002. CODEN ???? ISSN1097-2803.

Lopez:2006:ESM

[LA06] F. C. Garcıa Lopez andN. L. Frıas Arrocha. An effi-cient synchronization modelfor OpenMP. Journalof Parallel and DistributedComputing, 66(11):1359–1365, November 2006. CO-DEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Ladd:2004:GPP

[Lad04] Scott Ladd. Guide to Par-allel Programming. Spring-er-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2004. ISBN0-387-40577-1. 465 (est.) pp.LCCN ???? Includes CD-ROM.

Lobeiras:2016:DEI

[LAD16] Jacobo Lobeiras, Mar-garita Amor, and RamonDoallo. Designing effi-cient index-digit algorithmsfor CUDA GPU architec-tures. IEEE Transactionson Parallel and DistributedSystems, 27(5):1331–1343,May 2016. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://www.computer.org/csdl/

trans/td/2016/05/07138631-abs.html.

Laguna:2015:DPF

[LAdS+15] Ignacio Laguna, Dong H.Ahn, Bronis R. de Supin-ski, Saurabh Bagchi, andTodd Gamblin. Diagno-sis of performance faultsin LargeScale MPI appli-cations via probabilisticprogress-dependence infer-ence. IEEE Transactionson Parallel and DistributedSystems, 26(5):1280–1289,May 2015. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://csdl.computer.org/csdl/trans/td/2015/05/06803050-abs.html.

Laforenza:2001:PHP

[Laf01] Domenico Laforenza. Pro-gramming high performanceapplications in grid envi-ronments. Lecture Notesin Computer Science, 2131:8–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310008.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310008.pdf.

Lorentz:2015:AMS

[LAFA15] Istvan Lorentz, Razvan An-donie, and Levente Fabry-

REFERENCES 290

Asztalos. Acceleratingmolecular structure determi-nation based on inter-atomicdistances using OpenCL.IEEE Transactions on Par-allel and Distributed Sys-tems, 26(12):3250–3263, De-cember 2015. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://csdl.computer.org/csdl/trans/td/2015/12/06995963-abs.html.

Langdon:2009:FHQ

[Lan09] W. B. Langdon. A fasthigh quality pseudo randomnumber generator for nVidiaCUDA. In Franz Roth-lauf, editor, GECCO ’09Proceedings of the 11th An-nual Conference Companionon Genetic and EvolutionaryComputation Conference:Late Breaking Papers, pages2511–2513. ACM Press, NewYork, NY 10036, USA,2009. ISBN 1-60558-505-X.LCCN ???? URL http://www.cs.ucl.ac.uk/staff/W.Langdon/ftp/gp-code/random-numbers/cuda_park-miller.tar.gz.

Loos:1996:MPS

[LB96] T. Loos and R. Bramley.MPI performance on theSGI Power Challenge. InIEEE [IEE96i], pages 203–206. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

Lavi:1998:IPD

[LB98] R. Lavi and A. Barak. Im-proving the PVM daemonnetwork performance by di-rect network access. Lec-ture Notes in ComputerScience, 1497:44–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Loncar:2016:CPS

[LBB+16] Vladimir Loncar, AntunBalaz, Aleksandar Bo-gojevic, Srdjan Skrbic,Paulsamy Muruganandam,and Sadhan K. Adhikari.CUDA programs for solv-ing the time-dependent dipo-lar Gross–Pitaevskii equa-tion in an anisotropic trap.Computer Physics Com-munications, 200(??):406–410, March 2016. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465515004361.

Lawton:1996:BHP

[LBD+96] J. V. Lawton, J. J. Brosnan,M. P. Doyle, S. D. O. Ri-ordain, and T. G. Reddin.Building a high-performancemessage-passing system forMEMORY CHANNEL clus-ters. Digital Technical Jour-nal of Digital EquipmentCorporation, 8(2):96–116,October 1996. CODENDTJOEL. ISSN 0898-901X.

REFERENCES 291

URL http://www.digital.com:80/DTJM08/DTJM08P8.PS.

Ling:2012:HPP

[LBH12] Cheng Ling, Khaled Benkrid,and Tsuyoshi Hamada.High performance phyloge-netic analysis on CUDA-compatible GPUs. ACMSIGARCH Computer Archi-tecture News, 40(5):52–57,December 2012. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic). HEART ’12 confer-ence proceedings.

Lewis:1993:PCP

[LC93] M. J. Lewis and R. E.Cline, Jr. PVM com-munication performance ina switched FDDI heteroge-neous distributed comput-ing environment. In Bhar-gava [Bha93], pages 13–19.ISBN 0-8186-5250-0, 0-8186-5251-9. LCCN QA76.58.I4441993.

Lauria:1997:MFH

[LC97a] Mario Lauria and AndrewChien. MPI-FM: High per-formance MPI on work-station clusters. Jour-nal of Parallel and Dis-tributed Computing, 40(1):4–18, January 10, 1997.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1264/production;

http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1264/production/pdf; http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.1264/production/ref.

Luecke:1997:HPF

[LC97b] G. R. Luecke and J. J.Coyle. High PerformanceFortran versus explicit mes-sage passing on the IBMSP-2 for the parallel LU,QR, and Cholesky factoriza-tions. Supercomputer, 13(2):4–14, ???? 1997. CODENSPCOEL. ISSN 0168-7875.

Li:2007:DIV

[LC07] Kuan-Ching Li and Hsun-Chang Chang. The designand implementation of visualperformance monitoring andanalysis toolkit for clusterand Grid environments. TheJournal of Supercomputing,40(3):299–317, June 2007.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=40&issue=3&spage=299.

Luecke:2003:MCT

[LCC+03] Glenn Luecke, Hua Chen,James Coyle, Jim Hoek-stra, Marina Kraeva, andYan Zou. MPI-CHECK:a tool for checking Fortran90 MPI programs. Con-currency and Computation:

REFERENCES 292

Practice and Experience, 15(2):93–100, February 2003.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Liddell:1996:HPC

[LCHS96] Heather Mary Liddell, A. Col-brook, B. Hertzberger, andP. Sloot, editors. High-performance computing andnetworking: internationalconference and exhibition,HPCN EUROPE 1966,Brussels, Belgium, April 15–19, 1996: proceedings, vol-ume 1067 of Lecture notes incomputer science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 1996. ISBN3-540-61142-8 (paperback).LCCN QA76.88 .H52 1996.

Lathrop:2011:SPI

[LCK11] Scott Lathrop, Jim Costa,and William Kramer, ed-itors. SC’11: Proceed-ings of 2011 InternationalConference for High Per-formance Computing, Net-working, Storage and Anal-ysis, Seattle, WA, Novem-ber 12–18 2011. ACM Pressand IEEE Computer SocietyPress, New York, NY 10036,USA and 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 2011. ISBN 1-4503-0771-X. LCCN ????

Lashuk:2012:MPA

[LCL+12] Ilya Lashuk, Aparna Chan-dramowlishwaran, Harper

Langston, Tuan-Anh Nguyen,Rahul Sampath, AashayShringarpure, Richard Vuduc,Lexing Ying, Denis Zorin,and George Biros. A mas-sively parallel adaptive fastmultipole method on hetero-geneous architectures. Com-munications of the ACM,55(5):101–109, May 2012.CODEN CACMA2. ISSN0001-0782 (print), 1557-7317(electronic).

Losada:2017:RMA

[LCMG17] Nuria Losada, Ivan Cores,Marıa J. Martın, and Pa-tricia Gonzalez. ResilientMPI applications using anapplication-level checkpoint-ing framework and ULFM.The Journal of Supercom-puting, 73(1):100–113, Jan-uary 2017. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic).

Lonsdale:1994:CRP

[LCVD94a] G. Lonsdale, J. Clincke-maillie, S. Vlachoutsis, andJ. Dubois. Communica-tion requirements in par-allel crashworthiness simu-lation. In Gentzsch andHarms [GH94], pages 55–61. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Lonsdale:1994:CMH

[LCVD94b] G. Lonsdale, J. Clincke-

REFERENCES 293

maillie, S. Vlachoutsis, andJ. Dubois. Crash-simulationmigration to HPC sys-tems. In Dekker et al.[DSZ94], pages 439–446.ISBN 0-444-81784-0. LCCNQA76.58.E98 1994.

Liu:2003:PCM

[LCW+03] Jiuxing Liu, Balasubrama-nian Chandrasekaran, Jiesh-eng Wu, Weihang Jiang,Sushmitha Kini, WeikuanYu, Darius Buntinas, PeteWyckoff, and D. K. Panda.Performance comparison ofMPI implementations overInfiniBand, Myrinet andQuadrics. In ACM [ACM03],page ?? ISBN 1-58113-695-1. LCCN ???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10696#0; http://www.sc-conference.org/sc2003/paperpdfs/pap310.pdf.

Liu:1996:BMP

[LCY96] L. T. Liu, D. E. Culler, andC. Yoshikawa. Benchmark-ing message passing per-formance using MPI. InReeves [Ree96], pages 101–110. ISBN 0-8186-7623-X.LCCN QA76.58 .I34 1996.Three volumes.

Lee:2001:APT

[LD01] D. J. Lee and T. J. Downar.The application of POSIXthreads and OpenMP tothe U.S. NRC neutron ki-

netics code PARCS. Lec-ture Notes in ComputerScience, 2104:90–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040090.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040090.pdf.

Lu:1997:QPD

[LDCZ97] Honghui Lu, Sandhya Dwarkadas,Alan L. Cox, and WillyZwaenepoel. Quantifyingthe performance differencesbetween PVM and Tread-Marks. Journal of Paralleland Distributed Computing,43(2):65–78, June 15, 1997.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.idealibrary.com/links/doi/10.1006/jpdc.1997.1332/production;http://www.idealibrary.com/links/doi/10.1006/jpdc.1997.1332/production/pdf; http://www.idealibrary.com/links/doi/10.1006/jpdc.1997.1332/production/ref.

Liu:2013:DLO

[LDJK13] Jun Liu, Wei Ding, Ohy-oung Jang, and MahmutKandemir. Data layout op-timization for GPGPU ar-chitectures. ACM SIG-PLAN Notices, 48(8):283–

REFERENCES 294

284, August 2013. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPoPP ’13 Confer-ence proceedings.

Lee:2006:PT

[Lee06] Edward A. Lee. The problemwith threads. Computer, 39(5):33–42, May 2006. CO-DEN CPTRB4. ISSN0018-9162 (print), 1558-0814(electronic).

Lee:2012:SMO

[Lee12] Jaejin Lee. SnuCL and anMPI + OpenCL implemen-tation of HPL on heteroge-neous CPU/GPU clusters.In ????, editor, ATIP ’12:Proceedings of the ATIP/A*CRC Workshop on Ac-celerator Technologies forHigh-Performance Comput-ing: Does Asia Lead theWay?, page ?? ACM Press,New York, NY 10036, USA,2012. ISBN 1-4503-1644-1.LCCN ????

Levelt:1995:IIS

[Lev95] A. H. M. Levelt, editor.ISSAC ’95: Internationalsymposium on symbolic andalgebraic computation —July 10–12, 1995, Montreal,Canada, ISSAC — Proceed-ings. ACM Press, New York,NY 10036, USA, 1995. ISBN0-89791-699-9. LCCN QA76.95 I59 1995.

Law:1993:EDM

[LF+93a] K. H. Law, R. E. Ful-ton, et al., editors. Engi-neering data management:key to success in a globalmarket: proceedings of the1993 ASME InternationalComputers in Engineer-ing Conference and Expo-sition, August 8–12, SanDiego, California, COM-PUTERS IN ENGINEER-ING VOL COM. Ameri-can Society Mech. Engi-neers, United EngineeringCenter, 345 E. 47th St., NewYork, NY 10017, USA, 1993.ISBN 0-7918-1169-7. LCCNTA345.A86 1993.

Levesque:1993:SAA

[LF93b] J. M. Levesque and R. Fried-man. The state of the art inautomatic parallelisation. InAnonymous [Ano93f], pages95–107. ISBN ???? LCCN????

Lim:2011:ATC

[LFL11] Min Yeol Lim, Vincent W.Freeh, and David K. Lowen-thal. Adaptive, trans-parent CPU scaling al-gorithms leveraging inter-node MPI communicationregions. Parallel Comput-ing, 37(10–11):667–683, Oc-tober/November 2011. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819111000871.

REFERENCES 295

Leon:1992:FP

[LFS92] Juan Leon, Allan L. Fisher,and Peter Steenkiste. Fail-safe PVM. In SCRI WCC’92[SCR92], page ?? ISBN???? LCCN ???? Proceed-ings available via anonymousftp from ftp.scri.fsu.eduin directory pub/parallel-workshop.92.

Leon:1993:FPA

[LFS93a] J. Leon, A. L. Fisher,and P. Steenkiste. Fail-safe PVM: a portable pack-age for distributed program-ming with transparent re-covery. Technical ReportCMU-CS-93-124, Carnegie-Mellon University, Depart-ment of Computer Science,1993.

Leon:1993:FPP

[LFS93b] Juan Leon, Allan L. Fisher,and Peter Alfons Steenkiste.Fail-safe PVM: a portablepackage for distributed pro-gramming with transparentrecovery. Technical report,School of Computer Science,Carnegie Mellon University,Pittsburgh, PA, USA, 1993.22 pp.

Loyot:1993:VVM

[LG93] E. C. Loyot, Jr. and A. S.Grimshaw. VMPP: a virtualmachine for parallel process-ing. In IEEE [IEE93b], pages735–740. ISBN 0-8186-3442-1. LCCN QA 76.58 I56 1993.

IEEE catalog no. 93TH0513-2.

Lee:1999:PEJ

[LGCH99] Bu-Sung Lee, Yan Gu, Wen-tong Cai, and Alfred Heng.Performance evaluation ofJPVM. Parallel Process-ing Letters, 9(3):401–??,September 1999. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Liu:2016:MBM

[LGG16] Weifeng Liu, Michael Gerndt,and Bin Gong. Model-based MPI-IO tuning withPeriscope tuning framework.Concurrency and Computa-tion: Practice and Experi-ence, 28(1):3–20, January2016. CODEN CCPEBO.ISSN 1532-0626 (print),1532-0634 (electronic).

Li:2010:SVC

[LGKQ10] Guodong Li, Ganesh Gopalakr-ishnan, Robert M. Kirby,and Dan Quinlan. A sym-bolic verifier for CUDA pro-grams. ACM SIGPLAN No-tices, 45(5):357–358, May2010. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic).

Lassous:2000:HGA

[LGM00] Isabelle Guerin Lassous,Jens Gustedt, and MichelMorvan. Handling graphsaccording to a coarse grained

REFERENCES 296

approach: Experiments withPVM and MPI. Lec-ture Notes in ComputerScience, 1908:72–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080072.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080072.pdf.

Leung:1995:EPE

[LH95] K.-C. Leung and M. Hamdi.Evaluating PVM and Ex-press on various networkclusters. In Alnuweiri andHamdi [AH95], pages 57–66.ISBN 0-8186-7124-6. LCCNTK5105.5 .H56 1995.

Leung:1998:PAN

[LH98] Ka-Cheong Leung and MounirHamdi. Performance assess-ment of network protocolsand parallel programmingtools for distributed comput-ing systems. InternationalJournal of Computer Sys-tems Science and Engineer-ing, 13(1):67–80, January1998. CODEN CSSEEI.ISSN 0267-6192.

Liao:2007:OOP

[LHC+07] Chunhua Liao, Oscar Her-nandez, Barbara Chap-man, Wenguang Chen, andWeimin Zheng. OpenUH:an optimizing, portable

OpenMP compiler. Con-currency and Computation:Practice and Experience,19(18):2317–2332, Decem-ber 25, 2007. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Lee:1996:TSF

[LHCT96] Bu-Sung Lee, A. Heng,W. Cai, and Tai-Ann Tan.Task scheduling facility forPVM. Parallel Process-ing Letters, 6(4):563–574,December 1996. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Lin:1994:DNC

[LHD+94] Mengjou Lin, Jehwei Hsieh,D. H. C. Du, J. P. Thomas,and J. A. MacDonald. Dis-tributed network computingover local ATM networks. InIEEE [IEE94h], pages 154–163. ISBN 0-8186-6607-2,0-8186-6605-6, 0-8186-6606-4. ISSN 1063-9535. LCCNQA76.5 .S894 1994. IEEEcatalog number 94CH34819.

Lin:1995:DNC

[LHD+95] Mengjou Lin, J. Hsieh,D. H. C. Du, J. P. Thomas,and J. A. MacDonald. Dis-tributed network computingover local ATM networks.IEEE Journal on SelectedAreas in Communications,13(4):733–748, May 1995.CODEN ISACEM. ISSN

REFERENCES 297

0733-8716 (print), 1558-0008(electronic).

Li:1996:PSI

[LHHM96] G.-J. Li, D. F. Hsu,S. Horiguchi, and B. Maggs,editors. Proceedings. Sec-ond International Sympo-sium on Parallel Archi-tectures, Algorithms, andNetworks (I-SPAN ’96):June 12–14, 1996, Beijing,China. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1996. ISBN 0-8186-7460-1. LCCN QA76.58.I56731996. IEEE catalog number96TB100044.

Liu:2010:RTC

[LHLK10] Fuchang Liu, TakahiroHarada, Youngeun Lee, andYoung J. Kim. Real-timecollision culling of a mil-lion bodies on graphics pro-cessing units. ACM Trans-actions on Graphics, 29(6):154:1–154:??, December2010. CODEN ATGRDF.ISSN 0730-0301 (print),1557-7368 (electronic).

Li:1997:PIO

[LHZ97] Wei Li, Xiaohu Huang, andNanning Zheng. Parallelimplementing OpenGL onPVM. Parallel Comput-ing, 23(12):1839–1850, De-cember 15, 1997. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336

(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1997&volume=23&issue=12&aid=1248.

Lu:1998:ONW

[LHZ98] Honghui Lu, Y. CharlieHu, and Willy Zwaenepoel.OpenMP on networks ofworkstations. In ACM[ACM98b], page ?? ISBN???? LCCN ???? URLhttp://www.supercomp.org/sc98/TechPapers/sc98_FullAbstracts/Lu1105/index.htm.

Li:1996:SIS

[Li96] Guo-Jie Li, editor. SecondInternational Symposium onParallel Architectures, Algo-rithms, and Networks (I-SPAN ’96): proceedings,June 12–14, 1996, Beijing,China. IEEE Computer So-ciety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1996. ISBN 0-8186-7460-1. LCCN QA76.58.I5651996. IEEE catalog number94TH0697-3.

Liu:1995:WCD

[Liu95] Xiaomao Liu. Workstationscluster for distributed super-computing. Mini-Micro Sys-tems, 16(2):45–52, February1995. CODEN XWJXEH.ISSN 1000-1220.

REFERENCES 298

Livny:2000:MYW

[Liv00] Miron Livny. Manag-ing your workforce on acomputational grid. Lec-ture Notes in ComputerScience, 1908:3–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080003.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080003.pdf.

Lastovetsky:2010:RAP

[LK10] Alexey Lastovetsky andTahar Kechadi. Recent ad-vances in Parallel VirtualMachine and Message Pass-ing Interface. The Interna-tional Journal of High Per-formance Computing Appli-cations, 24(1):3–4, Febru-ary 2010. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/24/1/3.full.pdf+html.

LaSalle:2014:MBD

[LK14] Dominique LaSalle andGeorge Karypis. MPI forbig data: New tricks foran old dog. Parallel Com-puting, 40(10):754–767, De-cember 2014. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://

www.sciencedirect.com/science/article/pii/S0167819114000830.

Lastovetsky:2008:RAP

[LKD08] Alexey Lastovetsky, TaharKechadi, and Jack Don-garra, editors. Recent Ad-vances in Parallel VirtualMachine and Message Pass-ing Interface: 15th Eu-ropean PVM/MPI Users’Group Meeting, Dublin, Ire-land, September 7–10, 2008.Proceedings, volume 5205of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2008. CO-DEN LNCSD9. ISBN 3-540-87474-7 (print), 3-540-87475-5 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-540-87475-1.

Luecke:2003:CPM

[LKJ03] Glenn R. Luecke, MarinaKraeva, and Lili Ju. Com-paring the performance ofMPICH with Cray’s MPIand with SGI’s MPI. Con-currency and Computation:Practice and Experience,15(9):779–802, August 10,2003. CODEN CCPEBO.ISSN 1532-0626 (print),1532-0634 (electronic).

Liang:1996:AEO

[LKL96] Wen-Yew Liang, Chun-Ta

REFERENCES 299

King, and Feipei Lai. Ad-smith: an efficient object-based distributed sharedmemory system on PVM.In Li [Li96]. ISBN 0-8186-7460-1. LCCN QA76.58.I5651996. IEEE catalog number94TH0697-3.

Li:2003:PNH

[LkLC+03] Jianwei Li, Wei keng Liao,Alok Choudhary, RobertRoss, Rajeev Thakur, WilliamGropp, Rob Latham, An-drew Siegel, Brad Gal-lagher, and Michael Zingale.Parallel netCDF: a high-performance scientific I/Ointerface. In ACM [ACM03],page ?? ISBN 1-58113-695-1. LCCN ???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10722#1; http://www.sc-conference.org/sc2003/paperpdfs/pap258.pdf.

Luecke:2004:PSM

[LKYS04] Glenn R. Luecke, MarinaKraeva, Jing Yuan, andSilvia Spanoyannis. Per-formance and scalability ofMPI on PC clusters. Con-currency and Computation:Practice and Experience, 16(1):79–107, January 2004.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Ludwig:1995:PPF

[LL95] T. Ludwig and S. Lam-berts. PFSLib — a paral-

lel file system for worksta-tion clusters. In Malyshkin[Mal95], pages 246–251.ISBN 3-540-60222-4. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I547 1995.

Luecke:2001:SPO

[LL01] Glenn R. Luecke and Wei-Hua Lin. Scalability andperformance of OpenMPand MPI on a 128-processorSGI Origin 2000. Con-currency and Computa-tion: Practice and Experi-ence, 13(10):905–928, Au-gust 25, 2001. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic). URL http://www3.interscience.wiley.com/cgi-bin/abstract/85007180/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=85007180&PLACEBO=IE.pdf.

Lin:2016:VDF

[LL16] Yu-Te Lin and Jenq-KuenLee. Vector data flow anal-ysis for SIMD optimiza-tions on OpenCL programs.Concurrency and Compu-tation: Practice and Ex-perience, 28(5):1629–1654,April 10, 2016. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Li:2013:COM

[LLC13] Hung-Fu Li, Tyng-Yeu

REFERENCES 300

Liang, and Jun-Yao Chiu.A compound OpenMP/MPI program developmenttoolkit for hybrid CPU/GPU clusters. The Jour-nal of Supercomputing, 66(1):381–405, October 2013.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://link.springer.com/article/10.1007/s11227-013-0912-0.

Lidbury:2015:MCC

[LLCD15] Christopher Lidbury, An-drei Lascu, Nathan Chong,and Alastair F. Donald-son. Many-core compilerfuzzing. ACM SIGPLANNotices, 50(6):65–76, June2015. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic).

Li:2012:PFA

[LLG12] Peng Li, Guodong Li,and Ganesh Gopalakrish-nan. Parametric flows: auto-mated behavior equivalenc-ing for symbolic analysis ofraces in CUDA programs.In Hollingsworth [Hol12],pages 29:1–29:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a009.pdf.

Luo:2014:ISM

[LLH+14] Miao Luo, Xiaoyi Lu,Khaled Hamidouche, Kr-

ishna Kandalla, and Dha-baleswar K. Panda. Initialstudy of multi-endpoint run-time for MPI + OpenMPhybrid programming modelon multi-core systems. ACMSIGPLAN Notices, 49(8):395–396, August 2014. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Langlais:2002:SSM

[LLRS02] M. Langlais, G. Latu,J. Roman, and P. Silan.Stochastic simulation of amarine host-parasite sys-tem using a hybrid MPI/OpenMP programming. Lec-ture Notes in Computer Sci-ence, 2400:436–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2400/24000436.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2400/24000436.pdf.

Li:1993:SLL

[LLY93] Q. Li, J.-C. Liu, and T. G.Yip. Solving large linearequations using PVM sys-tem. In Law et al. [LF+93a],pages 685–690. ISBN 0-7918-1169-7. LCCN TA345.A861993.

Loh:1994:ISR

[LM94] B. C. Loh and G. A. Manson.

REFERENCES 301

Incorporating software reuseinto the PCSC methodology.In de Gloria et al. [dGJM94],pages 929–941. ISBN ????LCCN ????

Larsen:1999:SPG

[LM99] M. Larsen and P. Mad-sen. A scalable paral-lel Gauss–Seidel and Jacobisolver for animal genetics.In Dongarra et al. [DLM99],pages 356–363. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Lu:2013:MLP

[LM13] Ligang Lu and Karen Mager-lein. Multi-level paral-lel computing of reversetime migration for seismicimaging on Blue Gene/Q.ACM SIGPLAN Notices, 48(8):291–292, August 2013.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPoPP ’13 Confer-ence proceedings.

Lee:2009:OGC

[LME09] Seyong Lee, Seung-Jai Min,and Rudolf Eigenmann.OpenMP to GPGPU: a com-piler framework for auto-matic translation and op-timization. ACM SIG-PLAN Notices, 44(4):101–110, April 2009. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Losada:2017:ARV

[LMG17] Nuria Losada, Marıa J.Martın, and Patricia Gonzalez.Assessing resilient versusstop-and-restart fault-tolerantsolutions in MPI applica-tions. The Journal of Super-computing, 73(1):316–329,January 2017. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic).

Lopez:2015:PBV

[LMM+15] Hugo A. Lopez, EduardoR. B. Marques, FranciscoMartins, Nicholas Ng, CesarSantos, Vasco ThudichumVasconcelos, and NobukoYoshida. Protocol-based ver-ification of message-passingparallel programs. ACMSIGPLAN Notices, 50(10):280–298, October 2015. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Losada:2014:EAL

[LMRG14] N. Losada, M. J. Martın,G. Rodrıguez, and P. Gonzalez.Extending an application-level checkpointing tool toprovide fault tolerance sup-port to OpenMP appli-cations. J.UCS: Jour-nal of Universal Com-puter Science, 20(9):1351–??, ???? 2014. CO-DEN ???? ISSN 0948-695X (print), 0948-6968

REFERENCES 302

(electronic). URL http://www.jucs.org/jucs_20_9/extending_an_application_level.

Lee:2015:OPE

[LNK+15] Joo Hwan Lee, Nimit Niga-nia, Hyesoon Kim, KaushikPatel, and Hyojong Kim.OpenCL performance evalu-ation on modern multicoreCPUs. Scientific Program-ming, 2015(??):859491:1–859491:20, ???? 2015. CO-DEN SCIPEV. ISSN 1058-9244 (print), 1875-919X(electronic). URL https://www.hindawi.com/journals/sp/2015/859491/.

Louca:2000:MFP

[LNLE00] S. Louca, N. Neophytou,A. Lachanas, and P. Evripi-dou. MPI-FT: Portablefault tolerance scheme forMPI. Parallel ProcessingLetters, 10(4):371–??, De-cember 2000. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic). URL http://ejournals.wspc.com.sg/ppl/10/1004/S0129626400000342.html.

Lima:2012:PEO

[LNW+12] Antonio M. Lima, MarcoA. S. Netto, Thais Webber,Ricardo M. Czekster, CesarA. F. De Rose, and PauloFernandes. Performanceevaluation of OpenMP-based algorithms for han-dling Kronecker descrip-

tors. Journal of Paralleland Distributed Computing,72(5):678–692, May 2012.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731512000354.

Lu:1996:PIF

[LO96] E. J.-L. Lu and D. I. Okun-bor. Parallel implementationof 3D FMA using MPI. InIEEE [IEE96i], pages 119–124. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

Labarta:2001:NOD

[LOHA01] J. Labarta, J. Oliver,D. S. Henty, and EduardAyguade. New OpenMP di-rectives for irregular data ac-cess loops. Scientific Pro-gramming, 9(2–3):175–183,Spring–Summer 2001. CO-DEN SCIPEV. ISSN 1058-9244 (print), 1875-919X(electronic). URL http://iospress.metapress.com/app/home/contribution.asp%3Fwasp=7pab6qgbaf8vxg991rwy%26referrer=parent%26backto=issue%2C10%2C11%3Bjournal%2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

Lou:1995:PIN

[Lou95] J. Z. Lou. A parallel in-compressible Navier–Stokessolver with multigrid iter-ations. In Bailey et al.[BBG+95], pages 167–168.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

REFERENCES 303

Landman:2000:PLR

[LP00] Joseph Landman and PiotrPiecuch. Parallelization of alegacy research program us-ing OpenMP. ACM FortranForum, 19(2):16–23, August2000. CODEN ???? ISSN1061-7264 (print), 1931-1311(electronic).

Li:2011:FSM

[LPD+11] Guodong Li, Robert Palmer,Michael DeLisi, GaneshGopalakrishnan, and Robert M.Kirby. Formal specifica-tion of MPI 2.0: Casestudy in specifying a prac-tical concurrent program-ming API. Science of Com-puter Programming, 76(2):65–81, February 1, 2011.CODEN SCPGD4. ISSN0167-6423 (print), 1872-7964(electronic).

Li:2001:PCS

[LR01] Michael Na Li and A. J.Rossini. RPVM: Cluster sta-tistical computing in R.R News: the Newsletterof the R Project, 1(3):4–7, September 2001. CO-DEN ???? ISSN 1609-3631. URL http://CRAN.R-project.org/doc/Rnews/.

Lastovetsky:2006:HTM

[LR06a] Alexey Lastovetsky and RaviReddy. HeteroMPI: To-wards a message-passing li-brary for heterogeneous net-works of computers. Jour-

nal of Parallel and Dis-tributed Computing, 66(2):197–220, February 2006.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Le:2006:DMC

[LR06b] Thuy T. Le and Jalel Rejeb.A detailed MPI communi-cation model for distributedsystems. Future Genera-tion Computer Systems, 22(3):269–278, February 2006.CODEN FGSEVI. ISSN0167-739X (print), 1872-7115 (electronic).

Lotfi:2015:AAC

[LRBG15] Atieh Lotfi, Abbas Rahimi,Luca Benini, and Rajesh K.Gupta. Aging-aware compi-lation for GP-GPUs. ACMTransactions on Architec-ture and Code Optimiza-tion, 12(2):24:1–24:??, July2015. CODEN ???? ISSN1544-3566 (print), 1544-3973(electronic).

Lee:2014:BCA

[LRG14] Changmin Lee, Won WooRo, and Jean-Luc Gau-diot. Boosting CUDA ap-plications with CPU–GPUhybrid computing. Inter-national Journal of Paral-lel Programming, 42(2):384–404, April 2014. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL http://link.

REFERENCES 304

springer.com/article/10.1007/s10766-013-0252-y.

Luo:2001:PDE

[LRQ01] Jun Luo, Sanguthevar Ra-jasekaran, and Chenxia Qiu.Parallizing 1-dimensional es-tuarine model. LectureNotes in Computer Sci-ence, 2131:257–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310257.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310257.pdf.

Latham:2007:IMI

[LRT07] Robert Latham, RobertRoss, and Rajeev Thakur.Implementing MPI-IO atomicmode and shared file point-ers using MPI one-sidedcommunication. The In-ternational Journal of HighPerformance ComputingApplications, 21(2):132–143,May 2007. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/21/2/132.full.pdf+html.

Li:2001:WMB

[LRW01] Maozhen Li, Omer F. Rana,and David W. Walker.Wrapping MPI-based legacycodes as Java/CORBA com-

ponents. Future GenerationComputer Systems, 18(2):213–223, October 2001. CO-DEN FGSEVI. ISSN 0167-739X (print), 1872-7115(electronic). URL http://www.elsevier.com/gej-ng/10/19/19/60/31/29/abstract.html.

Luckow:2008:MFT

[LS08] Andre Luckow and Bet-tina Schnor. Migol: afault-tolerant service frame-work for MPI applications inthe Grid. Future Genera-tion Computer Systems, 24(2):142–152, February 2008.CODEN FGSEVI. ISSN0167-739X (print), 1872-7115 (electronic).

Lin:2010:TLS

[LS10] Paul T. Lin and John N.Shadid. Towards large-scalemulti-socket, multicore par-allel simulations: Perfor-mance of an MPI-only semi-conductor device simulator.Journal of ComputationalPhysics, 229(19):6804–6818,September 20, 2010. CO-DEN JCTPAH. ISSN0021-9991 (print), 1090-2716(electronic). URL http://www.sciencedirect.com/science/article/pii/S0021999110002846.

Lashgar:2015:CSR

[LSB15] Ahmad Lashgar, EbadSalehi, and Amirali Bani-asadi. A case study inreverse engineering GPG-PUs: Outstanding memory

REFERENCES 305

handling resources. ACMSIGARCH Computer Archi-tecture News, 43(4):15–21,September 2015. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic).

Levesque:2012:HEA

[LSG12] John M. Levesque, Ra-manan Sankaran, and RayGrout. Hybridizing S3Dinto an exascale applica-tion using OpenACC: anapproach for moving tomulti-petaflops and beyond.In Hollingsworth [Hol12],pages 15:1–15:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a040.pdf.

Luecke:2004:PSS

[LSK04] Glenn R. Luecke, SilviaSpanoyannis, and MarinaKraeva. The performanceand scalability of SHMEMand MPI-2 one-sided rou-tines on a SGI Origin 2000and a Cray T3E-600. Con-currency and Computation:Practice and Experience, 16(10):1037–1060, August 25,2004. CODEN CCPEBO.ISSN 1532-0626 (print),1532-0634 (electronic).

Liu:2011:CBA

[LSMW11] Weiguo Liu, Bertil Schmidt,and Wolfgang Muller-Wittig.CUDA-BLASTP: Acceler-ating BLASTP on CUDA-

enabled graphics hardware.IEEE/ACM Transactionson Computational Biologyand Bioinformatics, 8(6):1678–1684, November 2011.CODEN ITCBCY. ISSN1545-5963 (print), 1557-9964(electronic).

Lumsdaine:1995:WIM

[LSR95] A. Lumsdaine, J. M. Squyres,and M. W. Reichelt. Wave-form iterative methods forparallel solution of ini-tial value problems. InIEEE [IEE95j], pages 88–97.ISBN 0-8186-6895-4. LCCNQA76.58 .S34 1994.

Li:2015:AMR

[LSSZ15] Jiansen Li, Jianqi Sun, YingSong, and Jun Zhao. Ac-celerating MRI reconstruc-tion via three-dimensionaldual-dictionary learning us-ing CUDA. The Journal ofSupercomputing, 71(7):2381–2396, July 2015. CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-015-1386-z.

Liu:2008:AMD

[LSVMW08] Weiguo Liu, Bertil Schmidt,Gerrit Voss, and WolfgangMuller-Wittig. Accelerat-ing molecular dynamics sim-ulations using graphics pro-cessing units with CUDA.Computer Physics Commu-nications, 179(9):634–641,

REFERENCES 306

November 1, 2008. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465508002191.

Lazzarino:2002:PBP

[LSZL02] Oscar Lazzarino, AndreaSanna, Claudio Zunino, andFabrizio Lamberti. A PVM-based parallel implementa-tion of the REYES im-age rendering architecture.Lecture Notes in ComputerScience, 2474:165–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740165.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740165.pdf.

Langr:2014:APP

[LTDD14] Daniel Langr, Pavel Tvrdık,Tomas Dytrych, and Jerry P.Draayer. Algorithm 947:Paraperm — parallel gen-eration of random permu-tations with MPI. ACMTransactions on Mathemati-cal Software, 41(1):5:1–5:26,October 2014. CODENACMSCU. ISSN 0098-3500(print), 1557-7295 (elec-tronic).

Lazar:1994:SRE

[LTLC94] A. A. Lazar, K. H. Tseng,Koon Seng Lim, and W. Choe.

A scalable and reusable em-ulator for evaluating the per-formance of SS7 networks.IEEE Journal on SelectedAreas in Communications,12(3):395–404, April 1994.CODEN ISACEM. ISSN0733-8716 (print), 1558-0008(electronic).

Laohawee:2000:PDT

[LTR00] P. Laohawee, A. Tangpong,and A. Rungsawang. Paral-lel DSIR text indexing sys-tem: Using multiple mas-ter/slave concept. Lec-ture Notes in Computer Sci-ence, 1908:297–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080297.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080297.pdf.

Lee:2002:IPC

[LTRA02] Nung Kion Lee, DavidTaniar, J. Wenny Rahayu,and Mafruz Zaman Ashrafi.Implementation of paral-lel collection equi-join us-ing MPI. Lecture Notesin Computer Science, 2367:217–??, 2002. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/

REFERENCES 307

bibs/2367/23670217.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2367/23670217.pdf.

Luo:1999:SMV

[Luo99] Yong Luo. Shared mem-ory vs. message passing:The COMOPS benchmarkexperiment. The Journalof Supercomputing, 13(3):283–301, May 1999. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=13&issue=3&spage=283; http://www.wkap.nl/oasis.htm/206582.

Lusk:2000:IIC

[Lus00] Ewing Lusk. Isolatingand interfacing the com-ponents of a parallel com-puting environment. Lec-ture Notes in ComputerScience, 1908:5–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080005.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080005.pdf.

Lee:2012:EED

[LV12] Seyong Lee and Jeffrey S.Vetter. Early evaluation

of directive-based GPU pro-gramming models for pro-ductive exascale computing.In Hollingsworth [Hol12],pages 23:1–23:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a051.pdf.

Liu:2004:BMI

[LVP04] Jiuxing Liu, Abhinav Vishnu,and Dhabaleswar K. Panda.Building multirail Infini-Band clusters: MPI-level de-sign and performance eval-uation. In ACM [ACM04],page 33. ISBN 0-7695-2153-3. LCCN ????

Li:1995:CPP

[LW95] Liwei Li and Paul S.Wang. The CL-PVMpackage. SIGSAM Bul-letin (ACM Special Inter-est Group on Symbolic andAlgebraic Manipulation), 29(3–4):2–8, December 1995.CODEN SIGSBZ. ISSN0163-5824 (print), 1557-9492(electronic).

Ludwig:1997:OUI

[LW97] T. Ludwig and R. Wis-mueller. OMIS 2.0 — auniversal interface for mon-itoring systems. LectureNotes in Computer Sci-ence, 1332:267–276, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

REFERENCES 308

Liu:2004:HPR

[LWP04] Jiuxing Liu, Jiesheng Wu,and Dhabaleswar K. Panda.High performance RDMA-based MPI implementationover InfiniBand. Inter-national Journal of Par-allel Programming, 32(3):167–198, June 2004. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=32&issue=3&spage=167.

Li:1993:MSU

[LY93] Q. Li and T. G. Yip. Mon-itoring systems using PVM.In Law et al. [LF+93a], pages781–785. ISBN 0-7918-1169-7. LCCN TA345.A86 1993.

Loncar:2016:OOM

[LYSS+16] Vladimir Loncar, Luis E.Young-S., Srdjan Skrbic,Paulsamy Muruganandam,Sadhan K. Adhikari, andAntun Balaz. OpenMP,OpenMP/MPI, and CUDA/MPI C programs for solv-ing the time-dependent dipo-lar Gross–Pitaevskii equa-tion. Computer PhysicsCommunications, 209(??):190–196, December 2016.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465516302272.

Lu:2013:WGA

[LYZ13] Xiangwen Lu, Jiabin Yuan,and Weiwei Zhang. Work-flow of the Grover algo-rithm simulation incorpo-rating CUDA and GPGPU.Computer Physics Com-munications, 184(9):2035–2041, September 2013. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465513001148.

Li:1997:EHC

[LZ97] Konming Gary Li andNabil M. Zamel. Anevaluation of HPF compil-ers and the implementationof a parallel linear equa-tion solver using HPF andMPI. In ACM [ACM97b],page ?? ISBN 0-89791-985-8. LCCN QA76.9.A25A265 1997. URL http://www.supercomp.org/sc97/proceedings/TECH/LI/INDEX.HTM. ACM SIGARCH or-der number 415972. IEEEComputer Society Press or-der number RS00160.

Luecke:2002:DDM

[LZC+02] Glenn R. Luecke, Yan Zou,James Coyle, Jim Hoekstra,and Marina Kraeva. Dead-lock detection in MPI pro-grams. Concurrency andComputation: Practice andExperience, 14(11):911–932,August 25, 2002. CODENCCPEBO. ISSN 1532-0626

REFERENCES 309

(print), 1532-0634 (elec-tronic). URL http://www3.interscience.wiley.com/cgi-bin/abstract/97519209/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=97519209{\&}PLACEBO=IE.pdf.

Li:2017:PCO

[LZH17] Shigang Li, Yunquan Zhang,and Torsten Hoefler. Poster:Cache-oblivious MPI all-to-all communications onmany-core architectures.ACM SIGPLAN Notices, 52(8):445–446, August 2017.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Li:2018:COM

[LZH18] Shigang Li, Yunquan Zhang,and Torsten Hoefler. Cache-oblivious MPI all-to-all com-munications based on Mor-ton order. IEEE Transac-tions on Parallel and Dis-tributed Systems, 29(3):542–555, ???? 2018. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://ieeexplore.ieee.org/document/8091010/.

Ma:2009:CRS

[MA09] Wenjing Ma and GaganAgrawal. A compiler andruntime system for enablingdata mining applicationson GPUs. ACM SIG-

PLAN Notices, 44(4):287–288, April 2009. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Mavriplis:2005:HRAa

[MAB05] Dimitri J. Mavriplis, Michael J.Aftosmis, and Marsha Berger.High resolution aerospaceapplications using the NASAColumbia Supercomputer.In ACM [ACM05], page 61.ISBN 1-59593-061-2. LCCN????

Miguel:1996:APN

[MABG96] Jose Miguel, Agustin Ar-ruabarrena, Ramon Bei-vide, and Jose Angel Gre-gorio. Assessing the per-formance of the new IBMSP2 communication subsys-tem. IEEE parallel and dis-tributed technology: systemsand applications, 4(4):12–22, Winter 1996. CODENIPDTEX. ISSN 1063-6552(print), 1558-1861 (elec-tronic).

Maffeis:1994:SSD

[Maf94] S. Maffeis. System sup-port for distributed com-puting. In Gentzsch andHarms [GH94], pages 293–301. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

REFERENCES 310

Moreno:2001:AEP

[MAGR01] Luz Marina Moreno, Fran-cisco Almeida, Daniel Gonzalez,and Casiano Rodrıguez.Adaptive execution of pipelines.Lecture Notes in ComputerScience, 2131:217–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310217.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310217.pdf.

Mainland:2012:EHM

[Mai12] Geoffrey Mainland. Explic-itly heterogeneous metapro-gramming with MetaHaskell.ACM SIGPLAN Notices,47(9):311–322, September2012. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic).

Molero-Armenta:2014:OOI

[MAIVAH14] M. Molero-Armenta, UrsulaIturraran-Viveros, S. Apari-cio, and M. G. Hernandez.Optimized OpenCL imple-mentation of the Elasto-dynamic Finite IntegrationTechnique for viscoelasticmedia. Computer PhysicsCommunications, 185(10):2683–2696, October 2014.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944

(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465514001702.

Malyshkin:1995:PCT

[Mal95] Victor Malyshkin, editor.Parallel computing technolo-gies: third international con-ference, PaCT-95, St. Pe-tersburg, Russia, Septem-ber 12–25, 1995: pro-ceedings, number 964 inLecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 1995.ISBN 3-540-60222-4. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I547 1995.

Malfetti:2001:AOW

[Mal01] Paolo Malfetti. Appli-cation of OpenMP toweather, wave and oceancodes. Scientific Pro-gramming, 9(2–3):99–107,Spring–Summer 2001. CO-DEN SCIPEV. ISSN 1058-9244 (print), 1875-919X(electronic). URL http://iospress.metapress.com/app/home/contribution.asp%3Fwasp=7pab6qgbaf8vxg991rwy%26referrer=parent%26backto=issue%2C4%2C11%3Bjournal%2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

Mirvis:1995:HML

[MALM95] Y. Mirvis, F. Abdi, B. Laje-vardi, and P. Murthy. Hi-

REFERENCES 311

erarchical multi-level opti-mization solution for mas-sive parallel simulation ofcomposite system. AIAA/ASME/ASCE/AHSStructures, Structural Dy-namics & Materials Confer-ence — Collection of Tech-nical Papers, 4, ???? 1995.CODEN CPSCDO. ISSN0273-4508.

Manchek:1994:DIP

[Man94] Robert J. Manchek. De-sign and implementation ofPVM version 3. M.s. the-sis, University of Tennessee,Knoxville, Knoxville, TN37996, USA, 1994. viii + 81pp.

Mans:1998:PDP

[Man98] Bernard Mans. Portable dis-tributed priority queues withMPI. Concurrency: prac-tice and experience, 10(3):175–198, March 1998. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract?ID=5373;http://www3.interscience.

wiley.com/cgi-bin/fulltext?ID=5373&PLACEBO=IE.pdf.

Manis:2001:PNP

[Man01] G. Manis. Persistent andnon-persistent data objectson top of PVM and MPI.Lecture Notes in ComputerScience, 2131:91–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http:

//link.springer-ny.com/link/service/series/0558/bibs/2131/21310091.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310091.pdf.

Miguel-Alonso:2009:INS

[MANR09] J. Miguel-Alonso, J. Navari-das, and F. J. Ridruejo. In-terconnection network sim-ulation using traces ofMPI applications. Inter-national Journal of Par-allel Programming, 37(2):153–174, April 2009. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=37&issue=2&spage=153.

Marowka:2002:ISI

[Mar02] Ami Marowka. Introduc-tion to the special issue:OpenMP: Experiences, im-plementations and applica-tions. Parallel and Dis-tributed Computing Prac-tices, 5(2):v, June 2002. CO-DEN ???? ISSN 1097-2803.

Marowka:2003:EOT

[Mar03] Ami Marowka. Extend-ing OpenMP for task par-allelism. Parallel Process-ing Letters, 13(3):341–??,September 2003. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

REFERENCES 312

Marowka:2005:EMT

[Mar05] Ami Marowka. Executionmodel of three parallel lan-guages: OpenMP, UPC andCAF. Scientific Program-ming, 13(2):127–135, ????2005. CODEN SCIPEV.ISSN 1058-9244 (print),1875-919X (electronic).

Marowka:2006:BRP

[Mar06] Ami Marowka. Book review:Parallel Scientific Compu-tation: A Structured Ap-proach using BSP and MPI.Scalable Computing: Prac-tice and Experience, 7(2):107–108, June 2006. CO-DEN ???? ISSN 1895-1767.URL http://www.scpe.org/vols/vol07/no2/vol07no2bookreview.html.

Marowka:2007:PCD

[Mar07] Ami Marowka. Parallel com-puting on any desktop. Com-munications of the ACM, 50(9):74–78, September 2007.CODEN CACMA2. ISSN0001-0782 (print), 1557-7317(electronic).

Marowka:2009:BCT

[Mar09] Ami Marowka. BSP2OMP: acompiler for translating BSPprograms to OpenMP. In-ternational Journal of Par-allel, Emergent and Dis-tributed Systems: IJPEDS,24(4):293–310, 2009. CO-DEN ???? ISSN 1744-5760(print), 1744-5779 (elec-tronic).

Mehta:2006:MSG

[MAS06] Paras Mehta, Jose NelsonAmaral, and Duane Szafron.Is MPI suitable for a gener-ative design-pattern system?Parallel Computing, 32(7–8):616–626, September 2006.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Mattson:1994:PEP

[Mat94] T. G. Mattson. Program-ming environments for par-allel computing: a compari-son of CPS, linda, P4, PVM,POSYBL, and TCGMSG. InHesham and Shriver [HS94],pages 586–594. ISBN 0-8186-5060-5. ISSN 1060-3425.LCCN ???? IEEE catalogno. 94TH0607-2.

Mattson:1995:PEP

[Mat95] Timothy G. Mattson. Pro-gramming environments forparallel and distributed com-puting: a comparison ofP4, PVM, Linda, andTCGMSG. InternationalJournal of SupercomputerApplications and High Per-formance Computing, 9(2):138–161, Summer 1995. CO-DEN IJSCFG. ISSN 1078-3482.

Mattson:2000:BOF

[Mat00a] Tim Mattson. BOF:OpenMP and its futuredevelopments. In ACM[ACM00], page 106. URL

REFERENCES 313

http://www.sc2000.org/proceedings/info/fp.pdf.

Mattson:2000:IO

[Mat00b] Timothy G. Mattson. An in-troduction to OpenMP 2.0.Lecture Notes in ComputerScience, 1940:384–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400384.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400384.pdf.

Mattson:2001:EO

[Mat01a] Timothy Mattson. The evo-lution of OpenMP. Lec-ture Notes in ComputerScience, 1947:19–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1947/19470019.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1947/19470019.pdf.

Matuszek:2001:APS

[Mat01b] Mariusz R. Matuszek. As-sessment of PVM suitabil-ity to testbed client-agent-server applications. Lec-ture Notes in ComputerScience, 2131:69–??, 2001.CODEN LNCSD9. ISSN

0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310069.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310069.pdf.

Mattson:2003:HGO

[Mat03] Timothy G. Mattson. Howgood is OpenMP. Scien-tific Programming, 11(2):81–93, 2003. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Mourao:2000:SSC

[MB00] Elson Mourao and StephenBooth. Single sided commu-nications in multi-protocolMPI. Lecture Notes inComputer Science, 1908:176–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080176.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080176.pdf.

Marongiu:2012:OCE

[MB12] Andrea Marongiu and LucaBenini. An OpenMP com-piler for efficient use of dis-tributed scratchpad memoryin MPSoCs. IEEE Trans-

REFERENCES 314

actions on Computers, 61(2):222–236, February 2012.CODEN ITCOB4. ISSN0018-9340 (print), 1557-9956(electronic).

Muller:2012:SOA

[MBB+12] Matthias S. Muller, JohnBaron, William C. Brant-ley, Huiyu Feng, andDaniel Hackenberg. SPECOMP2012— an applicationbenchmark suite for paral-lel systems using OpenMP.Lecture Notes in Com-puter Science, 7312:223–236, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_17/.

Ma:2013:KAT

[MBBD13] Teng Ma, George Bosilca,Aurelien Bouteiller, andJack J. Dongarra. Kernel-assisted and topology-awareMPI collective communica-tions on multicore/many-core platforms. Jour-nal of Parallel and Dis-tributed Computing, 73(7):1000–1010, July 2013. CO-DEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731513000166.

Min:2003:OOP

[MBE03] Seung-Jai Min, Ayon Ba-

sumallik, and Rudolf Eigen-mann. Optimizing OpenMPprograms on software dis-tributed shared memorysystems. InternationalJournal of Parallel Pro-gramming, 31(3):225–249,June 2003. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL /ips/frames/Refs/referenceskapmain.asp?J=4773&I=33&A=5&LK=NM; http://ipsapp007.kluweronline.com/content/getfile/4773/33/5/abstract.htm; http://ipsapp007.kluweronline.com/content/getfile/4773/33/5/fulltext.pdf.

McKenzie:1994:CIM

[MBES94] N. R. McKenzie, K. Bolding,C. Ebeling, and L. Snyder.CRANIUM: An interface formessage passing on adaptivepacket routing networks. InBolding and Snyder [BS94],pages 266–280. ISBN 3-540-58429-3. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.P391994.

Malits:2012:ELG

[MBKM12] Roman Malits, EvgenyBolotin, Avinoam Kolodny,and Avi Mendelson. Explor-ing the limits of GPGPUscheduling in control flowbound applications. ACMTransactions on Architec-ture and Code Optimiza-tion, 8(4):29:1–29:??, Jan-

REFERENCES 315

uary 2012. CODEN ????ISSN 1544-3566 (print),1544-3973 (electronic).

Mehl:2015:RTC

[MBS15] Miriam Mehl, ManfredBischoff, and Michael Schafer,editors. Recent Trends inComputational Engineering— CE2014: Optimization,Uncertainty, Parallel Algo-rithms, Coupled and Com-plex Problems, volume 105 ofLecture Notes in Computa-tional Science and Engineer-ing. Springer-Verlag, Berlin,Germany / Heidelberg, Ger-many / London, UK / etc.,2015. ISBN 3-319-22996-6,3-319-22997-4 (e-book). 317(est.) pp. LCCN QA71-90; TA329. URL http://www.springerlink.com/content/978-3-319-22997-3.

Miles:1994:PTO

[MC94] Roger Miles and AlanChalmers, editors. Progressin Transputer and occam Re-search, WoTUG-17 Proceed-ings of the 17th World occamand Transputer User GroupTechnical Meeting, April 10–13, 1994, Bristol, UK, vol-ume 38 of Transputer andOccam Engineering Series.IOS Press, Postal Drawer10558, Burke, VA 2209-0558,USA, 1994. ISBN 90-5199-163-0. LCCN ????

Medeiros:1998:IPM

[MC98] P. D. Medeiros and J. C.

Cunha. InterconnectingPVM and MPI applications.Lecture Notes in ComputerScience, 1497:105–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Morrison:1999:FPP

[MC99] J. P. Morrison and R. W.Connolly. Facilitating par-allel programming in PVMusing condensed graphs. InDongarra et al. [DLM99],pages 181–188. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Maier:2017:OLD

[MC17] Andrew J. Maier andBruce F. Cockburn. Op-timization of low-densityparity check decoder per-formance for OpenCL de-signs synthesized to FPGAs.Journal of Parallel and Dis-tributed Computing, 107(??):134–145, September 2017.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731517301004.

Massaioli:2005:OPA

[MCB05] Federico Massaioli, FilippoCastiglione, and MassimoBernaschi. OpenMP par-allelization of agent-basedmodels. Parallel Computing,31(10–12):1066–1081, Octo-

REFERENCES 316

ber/December 2005. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

McDonald:1996:NNP

[McD96] K. McDonald. The NAG Nu-merical PVM Library. InDongarra et al. [DMW96],pages 419–428. ISBN 3-540-60902-4. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.P351995.

Mueller:2008:OSM

[MCdS+08] Matthias S. Mueller, Bar-bara M. Chapman, Bronis R.de Supinski, Allen D. Mal-ony, and Michael Voss, edi-tors. OpenMP Shared Mem-ory Parallel Programming:International Workshops,IWOMP 2005 and IWOMP2006, Eugene, OR, USA,June 1–4, 2005, Reims,France, June 12–15, 2006.Proceedings, volume 4315of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2008. CO-DEN LNCSD9. ISBN 3-540-68554-5 (print), 3-540-68555-3 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-540-68555-5.

McKinney:1994:PGU

[McK94] G. W. McKinney. A prac-tical guide to using MCNPwith PVM. Transactions ofthe American Nuclear Soci-ety, 71(????):397–398, ????1994. CODEN TANSAO.ISSN 0003-018X.

Moore:2001:RPA

[MCLD01] Shirley Moore, David Cronk,Kevin London, and JackDongarra. Review of per-formance analysis tools forMPI parallel programs. Lec-ture Notes in Computer Sci-ence, 2131:241–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310241.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310241.pdf.

Moreira:2017:FCR

[MCP17] Rubens E. A. Moreira, Syl-vain Collange, and FernandoMagno Quintao Pereira.Function call re-vectorization.ACM SIGPLAN Notices, 52(8):313–326, August 2017.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

McRae:1992:VC

[McR92] S. J. McRae. VM commu-nications. In Anonymous

REFERENCES 317

[Ano92], pages 439–453.

Mierendorff:2000:WMB

[MCS00] Hermann Mierendorff, KlareCassirer, and Helmut Schwamborn.Working with MPI bench-marking suites on ccNUMAarchitectures. Lecture Notesin Computer Science, 1908:18–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080018.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080018.pdf.

Marin:2017:ERF

[MDM17] Manuel Marin, David De-four, and Federico Milano.An efficient representationformat for fuzzy intervalsbased on symmetric mem-bership functions. ACMTransactions on Mathemat-ical Software, 43(3):23:1–23:??, January 2017. CO-DEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic). URL https://dl.acm.org/citation.cfm?id=2939364.

Monteiro:2018:EGC

[MdSAS+18] Felipe R. Monteiro, Erick-son H. da S. Alves, Is-abela S. Silva, Hussama I.Ismail, Lucas C. Cordeiro,and Eddie B. de Lima Filho.

ESBMC-GPU: a context-bounded model checkingtool to verify CUDA pro-grams. Science of Com-puter Programming, 152(??):63–69, January 15, 2018.CODEN SCPGD4. ISSN0167-6423 (print), 1872-7964(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167642317301934.

Muller:2009:EOA

[MdSC09] Matthias S. Muller, Bro-nis R. de Supinski, and Bar-bara M. Chapman, editors.Evolving OpenMP in anAge of Extreme Parallelism:5th International Workshopon OpenMP, IWOMP 2009Dresden, Germany, June 3–5, 2009 Proceedings, volume5568 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2009. CO-DEN LNCSD9. ISBN 3-642-02284-7 (print), 3-642-02303-7 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-642-02303-3.

Matheou:2017:DDC

[ME17] George Matheou and ParaskevasEvripidou. Data-driven con-currency for high perfor-mance computing. ACMTransactions on Architec-ture and Code Optimization,

REFERENCES 318

14(4):53:1–53:??, December2017. CODEN ???? ISSN1544-3566 (print), 1544-3973(electronic).

Megson:1998:CRH

[MFC98] G. M. Megson, R. S. Fish,and D. N. J. Clarke. Cre-ation of reconfigurable hard-ware objects in PVM en-vironments. Lecture Notesin Computer Science, 1497:215–??, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Milovanovic:2008:NEE

[MFG+08] Milos Milovanovic, RogerFerrer, Vladimir Gajinov,Osman S. Unsal, AdrianCristal, Eduard Ayguade,and Mateo Valero. Nebelung:Execution environment fortransactional OpenMP. In-ternational Journal of Par-allel Programming, 36(3):326–346, June 2008. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=36&issue=3&spage=326.

Moody:2003:SNB

[MFPP03] Adam Moody, Juan Fer-nandez, Fabrizio Petrini,and Dhabaleswar K. Panda.Scalable NIC-based reduc-tion on large-scale clus-ters. In ACM [ACM03],

page ?? ISBN 1-58113-695-1. LCCN ???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10716#2; http://www.sc-conference.org/sc2003/paperpdfs/pap316.pdf.

Martin:1995:DPC

[MFTB95] I. Martin, J. C. Fabero,F. Tirado, and A. Bautista.Distributed parallel comput-ers versus PVM on a work-station cluster in the simula-tion of time dependent par-tial differential equations. InIEEE [IEE95h], pages 20–26.ISBN 0-8186-7031-2, 0-8186-7032-0. LCCN QA76.58 .E971995.

Mintchev:1997:TPM

[MG97] S. Mintchev and V. Getov.Towards portable messagepassing in Java: BindingMPI. Lecture Notes inComputer Science, 1332:135–142, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Mehta:2015:MTP

[MG15] Kshitij Mehta and EdgarGabriel. Multi-threaded par-allel I/O for OpenMP ap-plications. InternationalJournal of Parallel Pro-gramming, 43(2):286–309,April 2015. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-

REFERENCES 319

tronic). URL http://link.springer.com/article/10.1007/s10766-014-0306-9.

Mendonca:2017:DAA

[MGA+17] Gleison Mendonca, BrenoGuimaraes, Pericles Alves,Marcio Pereira, Guido Araujo,and Fernando Magno QuintaoPereira. DawnCC: Auto-matic annotation for dataparallelism and offloading.ACM Transactions on Ar-chitecture and Code Opti-mization, 14(2):13:1–13:??,July 2017. CODEN ????ISSN 1544-3566 (print),1544-3973 (electronic).

Mehta:2012:SPE

[MGC12] Kshitij Mehta, Edgar Gabriel,and Barbara Chapman.Specification and perfor-mance evaluation of par-allel I/O interfaces forOpenMP. Lecture Notesin Computer Science, 7312:1–14, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_1/.

Muralidharan:2015:COP

[MGC+15] Saurav Muralidharan, MichaelGarland, Bryan Catanzaro,Albert Sidelnik, and MaryHall. A collection-orientedprogramming model for per-formance portability. ACMSIGPLAN Notices, 50(8):

263–264, August 2015. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Medvedev:2005:OMA

[MGG05] Dmitry M. Medvedev, Eve-lyn M. Goldfield, andStephen K. Gray. AnOpenMP/MPI approach tothe parallelization of itera-tive four-atom quantum me-chanics. Computer PhysicsCommunications, 166(2):94–108, March 1, 2005. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465504005260.

Montella:2017:VCB

[MGL+17] Raffaele Montella, GiulioGiunta, Giuliano Laccetti,Marco Lapegna, CarloPalmieri, Carmine Ferraro,Valentina Pelliccia, Cheol-Ho Hong, Ivor Spence, andDimitrios S. Nikolopoulos.On the virtualization ofCUDA based GPU remot-ing on ARM and x86 ma-chines in the GVirtuS frame-work. International Jour-nal of Parallel Programming,45(5):1142–1163, October2017. CODEN IJPPE5.ISSN 0885-7458 (print),1573-7640 (electronic).

Mazzariol:1997:PCS

[MGMH97] M. Mazzariol, B. A. Gen-

REFERENCES 320

nart, V. Messerli, and R. D.Hersch. Performance ofCAP-specified linear algebraalgorithms. Lecture Notesin Computer Science, 1332:351–358, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Matthey:2001:EMO

[MH01] T. Matthey and J. P.Hansen. Evaluation ofMPI’s one-sided communi-cation mechanism for short-range molecular dynamicson the Origin2000. Lec-ture Notes in Computer Sci-ence, 1947:356–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1947/19470356.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1947/19470356.pdf.

Hwu:2012:GCG

[mH12] Wen mei Hwu, editor. GPUcomputing gems. Appli-cations of GPU computingseries. Morgan Kaufmann,Boston, MA, jade edition,2012. ISBN 0-12-385963-8(hardback). xvi + 541 +16 pp. LCCN T385 .G68752012.

Miller:1994:PPP

[MHC94a] B. P. Miller, J. K. Hollingsworth,and M. D. Callaghan. The

Paradyn parallel perfor-mance tools and PVM. InDongarra and Tourancheau[DT94], pages 201–210.ISBN 0-89871-343-9. LCCNQA76.58.I568 1994.

Miller:1994:PPT

[MHC94b] B. P. Miller, J. K. Holling-worth, and M. D. Callaghan.The Paradyn performancetools and PVM. In Dongarraand Tourancheau [DT94],pages 201–210. ISBN0-89871-343-9. LCCNQA76.58.I568 1994.

Munshi:2016:OCS

[MHSK16] Aaftab Munshi, Lee Howes,Bartosz Sochacki, and KhronosOpenCL Working Group.The OpenCL C specificationversion: 2.0 document revi-sion: 33. Web document.,April 13, 2016. URL https://www.khronos.org/registry/OpenCL/specs/opencl-2.0-openclc.pdf.

Michielse:1993:PMU

[Mic93] P. Michielse. Parallel multi-grid using PVM. Super-computer, 10(6):10–23, ????1993. CODEN SPCOEL.ISSN 0168-7875.

Michielse:1995:PMU

[Mic95] Peter Michielse. Paral-lel multigrid using PVM.Applied Numerical Math-ematics: Transactions ofIMACS, 19(1-2):63–69, Nov-ember 1995. CODEN AN-MAEL. ISSN 0168-9274

REFERENCES 321

(print), 1873-5460 (elec-tronic).

Muddukrishna:2015:LAT

[MJB15] Ananya Muddukrishna, Pe-ter A. Jonsson, and MatsBrorsson. Locality-awaretask scheduling and datadistribution for OpenMPprograms on NUMA sys-tems and manycore proces-sors. Scientific Program-ming, 2015(??):981759:1–981759:16, ???? 2015. CO-DEN SCIPEV. ISSN 1058-9244 (print), 1875-919X(electronic). URL https://www.hindawi.com/journals/sp/2015/981759/.

Mittal:2012:CAS

[MJG+12] Anshul Mittal, Nikhil Jain,Thomas George, Yogish Sab-harwal, and Sameer Kumar.Collective algorithms forsub-communicators. ACMSIGPLAN Notices, 47(8):315–316, August 2012. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPOPP ’12 confer-ence proceedings.

Muddukrishna:2016:GGO

[MJPB16] Ananya Muddukrishna, Pe-ter A. Jonsson, ArturPodobas, and Mats Brors-son. Grain graphs: OpenMPperformance analysis madeeasy. ACM SIGPLAN No-tices, 51(8):28:1–28:??, Au-gust 2016. CODEN SIN-

ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Matyska:1994:DCS

[MK94] Ludek Matyska and JaroslavKoca. D-CICADA: a soft-ware for conformational PESelucidation on network ofworkstations. Journal ofComputational Chemistry,15(9):937–946, September1994. CODEN JCCHDD.ISSN 0192-8651 (print),1096-987X (electronic).

McDonald:1997:IPT

[MK97] Chris McDonald and Kam-ran Kazemi. Improvingthe PVM teaching envi-ronment. SIGCSE Bul-letin (ACM Special Inter-est Group on Computer Sci-ence Education), 29(1):219–223, March 1997. CODENSIGSD3. ISSN 0097-8418(print), 2331-3927 (elec-tronic).

McDonald:2000:TPA

[MK00] Chris McDonald and Kam-ran Kazemi. Teaching par-allel algorithm with processtopologies. SIGCSE Bul-letin (ACM Special Inter-est Group on Computer Sci-ence Education), 32(1):70–74, March 2000. CODENSIGSD3. ISSN 0097-8418(print), 2331-3927 (elec-tronic).

REFERENCES 322

Mohror:2004:PTS

[MK04] Kathryn Mohror and Karen L.Karavanic. Performance toolsupport for MPI-2 on Linux.In ACM [ACM04], page 28.ISBN 0-7695-2153-3. LCCN????

Maheo:2012:AOL

[MKC+12] Aurele Maheo, Souad Koliaı,Patrick Carribault, MarcPerache, and William Jalby.Adaptive OpenMP for largeNUMA nodes. Lecture Notesin Computer Science, 7312:254–257, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_20/.

Markus:1996:PEM

[MKP+96] S. Markus, S. B. Kim,K. Pantazopoulos, A. L.Ocken, E. N. Houstis, P. Wu,S. Weerawarana, and D. Ma-harry. Performance evalu-ation of MPI implementa-tions and MPI based Par-allel ELLPACK solvers. InIEEE [IEE96i], pages 162–169. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

Min:2001:PCO

[MKV+01] Seung Jai Min, Seon WookKim, Michael Voss, Sang IkLee, and Rudolf Eigen-mann. Portable compil-ers for OpenMP. Lec-ture Notes in Computer

Science, 2104:11–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040011.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040011.pdf.

Mokbel:2011:ASR

[MKW11] Mohammed F. Mokbel,Robert D. Kent, and MichaelWong. An abstract se-mantically rich compiler col-locative and interpretativemodel for OpenMP pro-grams. The Computer Jour-nal, 54(8):1325–1343, Au-gust 2011. CODEN CM-PJA6. ISSN 0010-4620(print), 1460-2067 (elec-tronic). URL http://comjnl.oxfordjournals.org/content/54/8/1325.full.pdf+html.

Mitra:2014:AAP

[MLA+14] Subrata Mitra, Ignacio La-guna, Dong H. Ahn, SaurabhBagchi, Martin Schulz, andTodd Gamblin. Accurateapplication progress anal-ysis for large-scale paral-lel debugging. ACM SIG-PLAN Notices, 49(6):193–203, June 2014. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

REFERENCES 323

Marjanovic:2010:ECC

[MLAV10] Vladimir Marjanovic, JesusLabarta, Eduard Ayguade,and Mateo Valero. Effectivecommunication and compu-tation overlap with hybridMPI/SMPSs. ACM SIG-PLAN Notices, 45(5):337–338, May 2010. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Marowka:2004:OOA

[MLC04] Ami Marowka, ZhenyingLiu, and Barbara Chapman.OpenMP-oriented applica-tions for distributed sharedmemory architectures. Con-currency and Computation:Practice and Experience, 16(4):371–384, April 10, 2004.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Marendic:2016:NMR

[MLVS16] P. Marendic, J. Lemeire,D. Vucinic, and P. Schelkens.A novel MPI reduction al-gorithm resilient to im-balances in process arrivaltimes. The Journal of Su-percomputing, 72(5):1973–2013, May 2016. CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-016-1707-x.

Majumdar:1992:PPC

[MM92] A. Majumdar and W. R.Martin. Parallel precondi-tioned conjugate gradient al-gorithm applied to neutrondiffusion problem. Transac-tions of the American Nu-clear Society, 65:209–210,1992. CODEN TANSAO.ISSN 0003-018X.

Mantovani:1995:HPS

[MM95] M. L. Mantovani andM. Malagoli. Highly par-allel SCF calculation: theSYSMO program. In IEEE[IEE95h], pages 502–507.ISBN 0-8186-7031-2, 0-8186-7032-0. LCCN QA76.58 .E971995.

Michailidis:2001:TSH

[MM01] Panagiotis D. Michailidisand Konstantinos G. Mar-garitis. Text searchingon a heterogeneous clus-ter of workstations. Lec-ture Notes in Computer Sci-ence, 2131:378–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310378.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310378.pdf.

Michailidis:2002:PSL

[MM02] Panagiotis D. Michailidisand Konstantinos G. Mar-

REFERENCES 324

garitis. A performance studyof load balancing strate-gies for approximate stringmatching on an MPI het-erogeneous system environ-ment. Lecture Notes inComputer Science, 2474:432–??, 2002. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740432.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740432.pdf.

Michailidis:2003:PEL

[MM03] Panagiotis D. Michailidisand Konstantinos G. Mar-garitis. Performance evalua-tion of load balancing strate-gies for approximate stringmatching application on anMPI cluster of heteroge-neous workstations. FutureGeneration Computer Sys-tems, 19(7):1075–1104, Oc-tober 2003. CODEN FG-SEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic).

Marathe:2007:SCC

[MM07] Jaydeep Marathe and FrankMueller. Source-code-correlated cache coherencecharacterization of OpenMPbenchmarks. IEEE Trans-actions on Parallel and Dis-tributed Systems, 18(6):818–834, June 2007. CODEN

ITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic).

Michailidis:2011:PDM

[MM11] Panagiotis D. Michailidisand Konstantinos G. Mar-garitis. Parallel direct meth-ods for solving the sys-tem of linear equations withpipelining on a multicoreusing OpenMP. Journalof Computational and Ap-plied Mathematics, 236(3):326–341, September 1, 2011.CODEN JCAMDI. ISSN0377-0427 (print), 1879-1778(electronic). URL http://www.sciencedirect.com/science/article/pii/S0377042711004183.

Morishima:2014:PEG

[MM14] Shin Morishima and Hi-roki Matsutani. Perfor-mance evaluations of graphdatabase using CUDA andOpenMP compatible li-braries. ACM SIGARCHComputer Architecture News,42(4):75–80, 2014. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic).

Malony:1994:PAP

[MMB+94] A. Malony, B. Mohr, P. Beck-man, D. Gannon, S. Yang,and F. Bodin. Perfor-mance analysis of pC++: aportable data-parallel pro-gramming system for scal-able parallel computers. InSiegal [Sie94], pages 75–84.

REFERENCES 325

ISBN 0-8186-5602-6. LCCNQA76.58.I58 1994. IEEEcatalog no. 94CH34819.

Mudge:1993:PTS

[MMH93] T. N. Mudge, V. Miluti-novic, and L. Hunter, ed-itors. Proceedings of theTwenty-Sixth Hawaii Inter-national Conference on Sys-tem Science (HICSS-26),held in Wailea, Hawaii inJanuary 5–8, 1993. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 1993. ISBN0-8186-3230-5. LCCN ????Four volumes. IEEE catalognumber 93TH0501-7.

Morimoto:1998:IMM

[MMH98] K. Morimoto, T. Mat-sumoto, and K. Hiraki. Im-plementing MPI with thememory-based communica-tion facilities on the SSS-CORE operating system.Lecture Notes in ComputerScience, 1497:223–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Morimoto:1999:PEM

[MMH99] K. Morimoto, T. Mat-sumoto, and K. Hiraki. Per-formance evaluation of theMPI/MBCF with the NASparallel benchmarks. InDongarra et al. [DLM99],pages 19–26. ISBN 3-540-66549-8 (softcover). ISSN

0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Mohamed:2013:MMM

[MMM13] Hisham Mohamed and StephaneMarchand-Maillet. MRO-MPI: MapReduce overlap-ping using MPI and an opti-mized data exchange policy.Parallel Computing, 39(12):851–866, December 2013.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819113001026.

Manca:2016:CQI

[MMO+16] Emanuele Manca, AndreaManconi, Alessandro Orro,Giuliano Armano, and Lu-ciano Milanesi. CUDA-quicksort: an improvedGPU-based implementationof quicksort. Concurrencyand Computation: Practiceand Experience, 28(1):21–43, January 2016. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

MacFarlane:1999:PPI

[MMR99] A. MacFarlane, J. A. Mc-Cann, and S. E. Robert-son. PLIERS: a parallel in-formation retrieval systemusing MPI. In Dongarraet al. [DLM99], pages 317–324. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-

REFERENCES 326

tronic). LCCN QA76.58E973 1999.

Morris:2007:SNO

[MMS07] Alan Morris, Allen D. Mal-ony, and Sameer S. Shende.Supporting nested OpenMPparallelism in the TAU per-formance system. Inter-national Journal of Paral-lel Programming, 35(4):417–436, August 2007. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=35&issue=4&spage=417.

Mohr:2002:DPP

[MMSW02] Bernd Mohr, Allen D. Mal-ony, Sameer Shende, and Fe-lix Wolf. Design and proto-type of a performance toolinterface for OpenMP. TheJournal of Supercomputing,23(1):105–128, August 2002.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://ipsapp008.kluweronline.com/content/getfile/5189/37/8/abstract.htm; http://ipsapp008.kluweronline.com/content/getfile/5189/37/8/fulltext.pdf.

Matuszek:1999:BPG

[MMU99] M. R. Matuszek, A. Mazurkiewicz,and P. W. Uminski. Bench-marking the PVM groupcommunication efficiency. In

Dongarra et al. [DLM99],pages 499–508. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Martin:1996:WTW

[MMW96] D. E. Martin, T. J. McBrayer,and P. A. Wilsey. WARPED:a time warp simulation ker-nel for analysis and applica-tion development. In H. El-Rewini and B. D. Shriver,editors, Proceedings of theTwenty-Ninth Hawaii In-ternational Conference onSystem Sciences, volume 1,pages 5–?? ????, ????, 1996.ISBN 0-8186-7324-9. LCCN????

Meleshchuk:1991:IPP

[MN91] S. B. Meleshchuk and A. N.Nedumov. Implementationof a protocol for paralleldatabase access with vir-tual machine communica-tions facilities. Program-mirovanie, 17(1):35–42, Jan-uary/February 1991. CO-DEN PCSODA. ISSN 0132-3474, 0361-7688. Englishtranslation in Programmingand Computer Software, vol.17, no. 1, pp. 27–32, Novem-ber 1991.

Midorikawa:2005:PNM

[MOL05] Edson Toshimi Midorikawa,Helio Marci Oliveira, andJean Marcos Laine. PEM-PIs: a new methodology formodeling and prediction of

REFERENCES 327

MPI programs performance.International Journal ofParallel Programming, 33(5):499–527, October 2005.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=33&issue=5&spage=499.

Mork:1995:DPP

[Mor95] P. Mork. Debugging par-allel programs with execu-tion tracing. In Ferenczi andKacsuk [FK95], pages 176–183. ISBN ???? LCCN???? Technical reportKFKI-1995-2/M,N.

Manke:1995:MPP

[MP95] J. W. Manke and J. C. Pat-terson. Message passing per-formance of Intel Paragon,IBM SP1 and CRAY T3Dusing PVM. In Bailey et al.[BBG+95], pages 768–769.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

Martin:2004:HPA

[MPD04] Marıa J. Martın, MartaParada, and Ramon Doallo.High performance air pol-lution simulation usingOpenMP. The Journal ofSupercomputing, 28(3):311–321, June 2004. CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://ipsapp008.kluweronline.

com/IPS/content/ext/x/J/5189/I/54/A/5/abstract.htm.

MPIForum:1998:SIM

[MPI98] MPI Forum. Special issue:MPI2: a message-passinginterface standard. Inter-national Journal of Super-computer Applications andHigh Performance Comput-ing, 12(1–2):1–299, Spring–Summer 1998. CODENIJSCFG. ISSN 1078-3482.

Muller:1996:CDI

[MR96] A. Muller and R. Ruhl.Communication-buffers fordata-parallel, irregular com-putations. In Szymanski andSinharoy [SS96], pages 295–298. ISBN 0-7923-9635-9.LCCN QA76.58.L37 1996.

Martins:2012:PDC

[MR12] Wellington S. Martins andThiago F. Rangel. Phyloge-netic distance computationusing CUDA. Lecture Notesin Computer Science, 7409:168–178, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-31927-3_15/.

Meister:2017:PME

[MRB17] Oliver Meister, Kaveh Rah-nema, and Michael Bader.Parallel memory-efficientadaptive mesh refinement onstructured triangular meshes

REFERENCES 328

with billions of grid cells.ACM Transactions on Math-ematical Software, 43(3):19:1–19:27, January 2017.CODEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic). URL https://dl.acm.org/citation.cfm?id=2947668.

Mo:1996:IOP

[MRH+96] J. Mo, F. Romelfanger, R. J.Hanisch, D. Redding, S. Sir-lin, and A. Boden. Imple-mentation of an optical pre-scription retrieval code usingPVM (parallel virtual ma-chine) in a mixed architec-ture network. In Jacoby andBarnes [JB96], pages 100–103. ISBN ???? ISSN 1080-7926. LCCN QB51.3.E43A87 1995.

Mininni:2011:HMO

[MRRP11] Pablo D. Mininni, DuaneRosenberg, Raghu Reddy,and Annick Pouquet. A hy-brid MPI–OpenMP schemefor scalable parallel pseu-dospectral computations forfluid turbulence. Par-allel Computing, 37(6–7):316–326, June/July 2011.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819111000512.

Mazzocca:2000:TPP

[MRV00] N. Mazzocca, M. Rak, andU. Villano. The tran-sition from a PVM pro-

gram simulator to a het-erogeneous system simula-tor: The HeSSE project.Lecture Notes in ComputerScience, 1908:266–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080266.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080266.pdf.

Morinishi:1995:PIB

[MS95] K. Morinishi and N. Sato-fuka. Parallel implemen-tation of the Boltzmannequation solvers using PVM.In Satofuka et al. [SPE95],pages 339–346. ISBN 0-444-82317-4. LCCN QA911 .P351994.

McMahon:1996:EEE

[MS96a] T. P. McMahon and A. Skjel-lum. eMPI/eMPICH: em-bedding MPI. In IEEE[IEE96i], pages 180–184.ISBN 0-8186-7533-0. LCCNQA76.642 .M67 1996.

Menden:1996:PPP

[MS96b] J. Menden and G. Stellner.Proving properties of PVMapplications — a case studywith CoCheck. In Bode et al.[BDLS96], pages 134–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

REFERENCES 329

Marinho:1998:WMP

[MS98] J. Marinho and J. G. Silva.WMPI — message passinginterface for Win32 clusters.Lecture Notes in ComputerScience, 1497:113–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Mierendorff:1999:PMB

[MS99a] H. Mierendorff and H. Schwamborn.Performance modeling basedon PVM. In Dongarraet al. [DLM99], pages 75–82. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Migliardi:1999:PEH

[MS99b] M. Migliardi and V. Sun-deram. PVM emulationin the harness metacom-puting system: a plug-in based approach. InDongarra et al. [DLM99],pages 117–124. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Mourao:1999:IMO

[MS99c] F. E. Mourao and J. G.Silva. Implementing MPI’sone-sided communicationsfor WMPI. In Dongarraet al. [DLM99], pages 231–240. ISBN 3-540-66549-8(softcover). ISSN 0302-9743

(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Macias:2002:SEA

[MS02a] Elsa M. Macıas and AlvaroSuarez. Solving engineer-ing applications with LAM-GAC over MPI-2. Lec-ture Notes in Computer Sci-ence, 2474:130–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740130.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740130.pdf.

Mahinthakumar:2002:HMO

[MS02b] G. Mahinthakumar andF. Saied. A hybrid MPI-OpenMP implementation ofan implicit finite-elementcode on parallel architec-tures. The InternationalJournal of High Perfor-mance Computing Applica-tions, 16(4):371–393, Win-ter 2002. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic).

Mertens:2004:CCP

[MS04] Stephan Mertens and Alexan-der Schinner. Cluster Com-puting: Praktische Einfuhrungin das wissenschaftlicheRechnen auf Workstation-Clustern. Springer-Verlag,

REFERENCES 330

Berlin, Germany / Heidel-berg, Germany / London,UK / etc., 2004. ISBN 3-540-42299-4. 300 (est.) pp.LCCN ???? Includes CD-ROM.

Mysliwiec:1997:IPS

[MSB97] G. Mysliwiec, J. Sipowicz,and H. Burkhart. Imple-menting parallel SBS-typelinear solvers using ALWAN.Lecture Notes in ComputerScience, 1332:359–366, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Matise:1995:PCG

[MSCW95] T. C. Matise, M. D.Schroeder, D. M. Chiarulli,and D. E. Weeks. Paral-lel computation of geneticlikelihoods using CRI-MAP,PVM, and a network of dis-tributed workstations. Hu-man heredity, 45(2):103–??, ???? 1995. CODENHUHEAS. ISSN 0001-5652.

Migliardi:2000:SFT

[MSF00] Mauro Migliardi, Vaidy Sun-deram, and Arrigo Frisiani.A simple, fault tolerant nam-ing space for the HARNESSmetacomputing system. Lec-ture Notes in Computer Sci-ence, 1908:152–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080152.htm;

http://link.springer-ny.com/link/service/series/0558/papers/1908/19080152.pdf.

McCandless:1996:OOM

[MSL96] B. C. McCandless, J. M.Squyres, and A. Lums-daine. Object oriented MPI(OOMPI): a class libraryfor the Message Passing In-terface. In IEEE [IEE96i],pages 87–94. ISBN 0-8186-7533-0. LCCN QA76.642.M67 1996.

Massetto:2012:NSB

[MSL12] Francisco Isidro Massetto,Liria Matsumoto Sato, andKuan-Ching Li. A novelstrategy for building inter-operable MPI environmentin heterogeneous high per-formance systems. TheJournal of Supercomputing,60(1):87–116, April 2012.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=60&issue=1&spage=87.

Martin:2015:EPM

[MSMC15] Gonzalo Martın, David E.Singh, Maria-Cristina Mari-nescu, and Jesus Carretero.Enhancing the performanceof malleable MPI applica-tions by using performance-aware dynamic reconfigu-ration. Parallel Comput-ing, 46(??):60–77, July 2015.

REFERENCES 331

CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819115000642.

Molnar:2010:APM

[MSML10] F. Molnar, Jr., T. Szakaly,R. Meszaros, and I. Lagzi.Air pollution modelling us-ing a Graphics Process-ing Unit with CUDA.Computer Physics Commu-nications, 181(1):105–112,January 2010. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465509002872.

Macias:2001:PPA

[MSOGR01] Elsa M. Macıas, AlvaroSuarez, C. N. Ojeda-Guerra,and E. Robayna. Pro-gramming parallel applica-tions with LAMGAC in aLAN–WLAN environment.Lecture Notes in ComputerScience, 2131:158–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310158.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310158.pdf.

Matrone:1993:LPC

[MSP93] A. Matrone, P. Schiano,

and V. Puoti. LINDA andPVM: a comparison betweentwo environments for par-allel programming. Paral-lel Computing, 19(8):949–957, August 1993. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic).

Mysliwiec:1997:CAM

[MSS97] G. Mysliwiec, J. Sipowicz,and R. Schaefer. Con-trol activities in messagepassing environment. Lec-ture Notes in Computer Sci-ence, 1332:143–150, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Martins:1998:JIW

[MSS98] P. Martins, L. M. Silva,and J. Silva. A Java in-terface for WMPI. Lec-ture Notes in Computer Sci-ence, 1497:121–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Martorell:2005:BGP

[MSW+05] X. Martorell, N. Smeds,R. Walkup, J. R. Brun-heroto, G. Almasi, J. A.Gunnels, L. DeRose, J. Labarta,F. Escale, J. Gimenez,H. Servat, and J. E. Mor-eira. Blue Gene/L perfor-mance tools. IBM Journal ofResearch and Development,49(2/3):407–424, ???? 2005.CODEN IBMJAE. ISSN

REFERENCES 332

0018-8646 (print), 2151-8556(electronic). URL http://www.research.ibm.com/journal/rd/492/martorell.pdf.

Mossaiby:2017:OIH

[MSZG17] F. Mossaiby, A. Shojaei,M. Zaccariotto, and U. Gal-vanetto. OpenCL implemen-tation of a high performance3D peridynamic model ongraphics accelerators. Com-puters and Mathematics withApplications, 74(8):1856–1870, October 15, 2017.CODEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S0898122117304030.

Miei:1996:IER

[MT96] T. Miei and N. Takahashi.Implementation and evalua-tion of a replay-based de-bugger for PVM programs.Transactions of the Infor-mation Processing Societyof Japan, 37(7):1308–1319,July 1996. CODEN JS-GRD5. ISSN 0387-5806.

Mallon:2016:MUB

[MTK16] Damian A. Mallon, Guillermo L.Taboada, and Lars Koesterke.MPI and UPC broadcast,scatter and gather algo-rithms in Xeon Phi. Con-currency and Computation:Practice and Experience,28(8):2322–2340, June 10,2016. CODEN CCPEBO.

ISSN 1532-0626 (print),1532-0634 (electronic).

Marin:1994:GAL

[MTSS94] F. J. Marin, O. Trelles-Salazar, and F. Sandoval.Genetic algorithms on LAN-Message passing architec-tures using PVM: Applica-tion to the routing prob-lem. In Davidor et al.[DSM94], pages 534–545 (or534–543??). ISBN 3-540-58484-6. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.I535 1994.

Momeni:2015:EEO

[MTU+15] Amir Momeni, HamedTabkhi, Yash Ukidave, Gu-nar Schirner, and DavidKaeli. Exploring the effi-ciency of the OpenCL pipesemantic on an FPGA. ACMSIGARCH Computer Archi-tecture News, 43(4):52–57,September 2015. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic).

Mohr:2007:SPE

[MTW07] Bernd Mohr, Jesper Lars-son Traff, and Joachim Wor-ringen. Selected papersfrom EuroPVM/MPI 2006.Parallel Computing, 33(9):593–594, September 2007.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

REFERENCES 333

Mohr:2006:RAP

[MTWD06] Bernd Mohr, Jesper LarssonTraff, Joachim Worringen,and Jack Dongarra, editors.Recent Advances in Par-allel Virtual Machine andMessage Passing Interface:13th European PVM/MPIUser’s Group Meeting Bonn,Germany, September 17–20,2006 Proceedings, volume4192 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2006. CO-DEN LNCSD9. ISBN 3-540-39110-X (print), 3-540-39112-6 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-540-39112-8.

Muller:2001:SSO

[Mul01] Matthias Muller. Somesimple OpenMP optimiza-tion techniques. LectureNotes in Computer Science,2104:31–??, 2001. CO-DEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040031.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040031.pdf.

Muller:2002:SMB

[Mul02] Matthias S. Muller. Ashared memory benchmarkin OpenMP. Lecture Notesin Computer Science, 2327:380–??, 2002. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270380.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270380.pdf.

Muller:2003:OCB

[Mul03] Matthias S. Muller. AnOpenMP compiler bench-mark. Scientific Program-ming, 11(2):125–131, 2003.CODEN SCIPEV. ISSN1058-9244 (print), 1875-919X (electronic).

Malakar:2017:DMO

[MV17] Preeti Malakar and Venka-tram Vishwanath. Datamovement optimizations forindependent MPI I/O onthe Blue Gene/Q. Paral-lel Computing, 61(??):35–51, January 2017. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S016781911630062X.

Manis:1996:EPT

[MVTP96] G. Manis, C. Voliotis,P. Tsanakas, and G. Pa-

REFERENCES 334

pakonstantinou. EnhancingPVM with threads in dis-tributed programming. InLiddell et al. [LCHS96],pages 1013–?? ISBN 3-540-61142-8 (paperback). LCCNQA76.88 .H52 1996.

Muller:2010:SMA

[MvWL+10] Matthias S. Muller, Matthijsvan Waveren, Ron Lieber-man, Brian Whitney, HidekiSaito, Kalyan Kumaran,John Baron, William C.Brantley, Chris Parrott,Tom Elken, Huiyu Feng,and Carl Ponder. SPECMPI2007 — an applicationbenchmark suite for paral-lel systems using MPI. Con-currency and Computation:Practice and Experience, 22(2):191–205, February 2010.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Mehra:1995:AIM

[MVY95] P. Mehra, B. Van Voorst,and J. Yan. Automatedinstrumentation, monitoringand visualization of PVMprograms. In Bailey et al.[BBG+95], pages 832–837.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

McKinney:1993:MMI

[MW93] G. W. McKinney and J. T.West. MultiprocessingMCNP on an IBM RS/ 6000cluster. Transactions of theAmerican Nuclear Society,

68(pt.A):212–214, 1993. CO-DEN TANSAO. ISSN 0003-018X.

Mamontov:1998:AES

[MW98] Y. V. Mamontov andM. Willander. An algo-rithm to evaluate spectraldensities of high-dimensionalstationary diffusion stochas-tic processes with non-linearcoefficients: The generalscheme and issues on imple-mentation with PVM. Lec-ture Notes in Computer Sci-ence, 1541:315–321, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Manegold:1997:QBM

[MWG97] S. Manegold, F. Waas, andD. Gudlat. In quest ofthe bottleneck — monitoringparallel database systems.Lecture Notes in ComputerScience, 1332:277–284, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Morton:1995:LLP

[MWO95] Don Morton, Kefei Wang,and David O. Ogbe. Lessonslearned in porting Fortran/PVM code to the CrayT3D. IEEE parallel anddistributed technology: sys-tems and applications, 3(1):4–11, Spring 1995. CODENIPDTEX. ISSN 1063-6552(print), 1558-1861 (elec-tronic).

REFERENCES 335

Maleki:2016:HOT

[MYB16] Sepideh Maleki, AnnieYang, and Martin Burtscher.Higher-order and tuple-based massively-parallel pre-fix sums. ACM SIG-PLAN Notices, 51(6):539–552, June 2016. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Maly:1993:DCP

[MZK93] K. Maly, M. Zubair, andS. Kelbar. Distributedcomputing with parallelnetworking. In IEEE[IEE93d], pages 375–379.ISBN 0-8186-4430-3. LCCNQA76.9.D5I335 1993. IEEEcatalog no. 93TH0574-4.

Nikolopoulos:2001:SID

[NA01] Dimitrios S. Nikolopoulosand Eduard Ayguade. Astudy of implicit data distri-bution methods for OpenMPusing the SPEC bench-marks. Lecture Notes inComputer Science, 2104:115–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040115.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040115.pdf.

Nikolopoulos:2001:EMA

[NAAL01] D. S. Nikolopoulos, E. Ar-tiaga, E. Ayguade, andJ. Labarta. Exploitingmemory affinity in OpenMPthrough schedule reuse.ACM SIGARCH ComputerArchitecture News, 29(5):49–55, December 2001. CODENCANED2. ISSN 0163-5964(ACM), 0884-7495 (IEEE).

Nagle:2005:BRM

[Nag05] Dan Nagle. Book review:MPI — The Complete Ref-erence, Vol. 1, The MPICore, 2nd ed., Scientificand Engineering Computa-tion Series, by Marc Snir,Steve Otto, Steven Huss–Lederman, David Walkerand Jack Dongarra. Scien-tific Programming, 13(1):57–63, ???? 2005. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Nicolescu:1999:PWA

[NAJ99] C. Nicolescu, B. Albers, andP. Jonker. Parallel water-shed algorithm on imagesfrom cranial CT-scans us-ing PVM and MPI on adistributed memory system.In Dongarra et al. [DLM99],pages 418–425. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

REFERENCES 336

Nakajima:2003:PIS

[Nak03] Kengo Nakajima. Paral-lel iterative solvers of Ge-oFEM with selective block-ing preconditioning for non-linear contact problems onthe Earth Simulator. InACM [ACM03], page ??ISBN 1-58113-695-1. LCCN???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10703#1; http://www.sc-conference.org/sc2003/paperpdfs/pap155.pdf.

Nakajima:2005:PIS

[Nak05a] Kengo Nakajima. Paralleliterative solvers for finite-element methods using anOpenMP/MPI hybrid pro-gramming model on theEarth Simulator. ParallelComputing, 31(10–12):1048–1065, October/December2005. CODEN PACOEJ.ISSN 0167-8191 (print),1872-7336 (electronic).

Nakajima:2005:TLH

[Nak05b] Kengo Nakajima. Three-level hybrid vs. flat MPI onthe Earth Simulator: Par-allel iterative solvers forfinite-element method. Ap-plied Numerical Mathemat-ics: Transactions of IMACS,54(2):237–255, July 2005.CODEN ANMAEL. ISSN0168-9274 (print), 1873-5460(electronic).

Narashimhan:1995:IIF

[Nar95] V. L. Narashimhan, editor.ICAPP 95. IEEE First In-ternational Conference onAlgorithms and Architec-tures for Parallel Process-ing, Brisbane, Australia, 19–21 April, 1995. IEEE Com-puter Society Press, 1109Spring Street, Suite 300,Silver Spring, MD 20910,USA, 1995. ISBN 0-7803-2018-2 (paperback), 0-7803-2019-0 (microfiche). LCCNQA76.6.I15 1995. Two vol-umes. IEEE catalog no.95TH0682-5.

Nagel:1996:VVA

[NAW+96] W. E. Nagel, A. Arnold,M. Weber, H. C. Hoppe, andK. Solchenbach. VAMPIR:Visualization and analysis ofMPI resources. Supercom-puter, 12(1):69–80, January1996. CODEN SPCOEL.ISSN 0168-7875.

NicCanna:1996:LGS

[NB96] C. Nic Canna and C. J.Bean. Larger grids andshorter wall-clock times ona parallel virtual machine(PVM) — an example us-ing a finite difference wavesimulation algorithm. InAbrahart [Abr96], pages 2–?? ISBN ???? LCCN ????

Nickolls:2008:SPP

[NBGS08] John Nickolls, Ian Buck,Michael Garland, and KevinSkadron. Scalable parallel

REFERENCES 337

programming with CUDA.ACM Queue: Tomorrow’sComputing Today, 6(2):40–53, March 2008. CO-DEN AQCUAE. ISSN1542-7730 (print), 1542-7749(electronic).

Neyman:1999:ERP

[NBK99] M. Neyman, M. Bukowski,and P. Kuzora. Efficient re-play of PVM programs. InDongarra et al. [DLM99],pages 83–90. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Nguyen:2012:BTM

[NCB+12] Tan Nguyen, Pietro Cicotti,Eric Bylaska, Dan Quin-lan, and Scott B. Baden.Bamboo: translating MPIapplications to a latency-tolerant, data-driven form.In Hollingsworth [Hol12],pages 39:1–39:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a032.pdf.

Nguyen:2017:ATM

[NCB+17] Tan Nguyen, Pietro Ci-cotti, Eric Bylaska, DanQuinlan, and Scott Baden.Automatic translation ofMPI source into a latency-tolerant, data-driven form.Journal of Parallel and Dis-tributed Computing, 106(??):1–13, August 2017. CO-DEN JPDCER. ISSN

0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731517300771.

Nobari:2012:SPM

[NCKB12] Sadegh Nobari, Thanh-TungCao, Panagiotis Karras, andStephane Bressan. Scal-able parallel minimum span-ning forest computation.ACM SIGPLAN Notices, 47(8):205–214, August 2012.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPOPP ’12 confer-ence proceedings.

Neophytou:1998:NDJ

[NE98] N. Neophytou and P. Evripi-dou. Net-dbx: a Javapowered tool for interac-tive debugging of MPI pro-grams across the Internet.Lecture Notes in ComputerScience, 1470:181–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Neophytou:2001:NDW

[NE01] Neophytos Neophytou andParaskevas Evripidou. Net-dbx: a Web-based debug-ger of MPI programs overlow-bandwidth lines. IEEETransactions on Paralleland Distributed Systems,12(9):986–995, September2001. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-

REFERENCES 338

tronic). URL http://dlib.computer.org/td/books/td2001/pdf/l0986.pdf;http://www.computer.org/tpds/td2001/l0986abs.htm.

Nelson:1993:PPP

[Nel93] M. L. Nelson. PVM pro-vides power in the public do-main. Parallelogram, 53:20–21, May-June 1993. CODENPRALEH. ISSN 0953-7252.

Neugebauer:2017:PAR

[NEM17] Olaf Neugebauer, MichaelEngel, and Peter Marwedel.A parallelization approachfor resource-restricted em-bedded heterogeneous MP-SoCs inspired by OpenMP.The Journal of Systemsand Software, 125(??):439–448, March 2017. CO-DEN JSSODM. ISSN0164-1212 (print), 1873-1228 (electronic). URL //www.sciencedirect.com/science/article/pii/S0164121216301534.

Nesterov:2010:SPT

[Nes10] Oleksandr Nesterov. A sim-ple parallelization techniquewith MPI for ocean circula-tion models. Journal of Par-allel and Distributed Com-puting, 70(1):35–44, January2010. CODEN JPDCER.ISSN 0743-7315 (print),1096-0848 (electronic).

Neun:1994:UPB

[Neu94] W. Neun. Using PVM basedsoftware for parallel compu-tation in computer algebra.

In Calmet [Cal94], pages 46–51. ISBN ???? LCCN ????

Neyman:2000:CDA

[Ney00] Marcin Neyman. Com-parison of different ap-proaches to trace PVM pro-gram execution. LectureNotes in Computer Sci-ence, 1908:274–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080274.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080274.pdf.

Nordling:1994:SOD

[NF94] P. Nordling and P. Fritz-son. Solving ordinary dif-ferential equations on par-allel computers — appliedto dynamic rolling bearingssimulation. In Dongarraand Wasniewski [DW94],pages 397–415. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8 (New York). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58.P35 1994. DM104.00.

Nunez:2010:NTS

[NFG+10] Alberto Nunez, Javier Fernandez,Jose D. Garcia, Felix Gar-cia, and Jesus Carretero.New techniques for simulat-ing high performance MPIapplications on large stor-age networks. The Jour-

REFERENCES 339

nal of Supercomputing, 51(1):40–57, January 2010.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=51&issue=1&spage=40.

Nguyen:2008:GG

[Ngu08] Hubert Nguyen, editor.GPU gems 3, volume 3 ofGPU gems. Addison-Wes-ley, Reading, MA, USA,2008. ISBN 0-321-51526-9.l + 942 pp. LCCN T385.G6882 2008. URL http://www.loc.gov/catdir/toc/ecip0720/2007023985.html.

Nguyen:1995:SPI

[NH95] D. Nguyen and B. Hill-berg. Simulations of pin-hole imaging for AXAF: Dis-tributed processing using theMPI standard. In Shawet al. [SPH95], pages 361–366 (or 361–363??). ISBN0-937707-96-1. ISSN 1080-7926. LCCN QB51.3.E43A87 1994.

Norden:2002:OVM

[NHT02] M. Norden, S. Holmgren,and M. Thune. OpenMPversus MPI for PDE solversbased on regular sparse nu-merical operators. Lec-ture Notes in Computer Sci-ence, 2331:681–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349

(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2331/23310681.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2331/23310681.pdf.

Norden:2006:OVM

[NHT06] Markus Norden, SverkerHolmgren, and MichaelThune. OpenMP versus MPIfor PDE solvers based onregular sparse numerical op-erators. Future GenerationComputer Systems, 22(1–2):194–203, January 2006.CODEN FGSEVI. ISSN0167-739X (print), 1872-7115 (electronic).

Nakano:2002:SCG

[NIO+02] Hirofumi Nakano, KazuhisaIshizaka, Motoki Obata,Keiji Kimura, and Hi-ronori Kasahara. Staticcoarse grain task schedul-ing with cache optimiza-tion using OpenMP. Lec-ture Notes in Computer Sci-ence, 2327:479–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270479.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270479.pdf.

REFERENCES 340

Nakano:2003:SCG

[NIO+03] Hirofumi Nakano, KazuhisaIshizaka, Motoki Obata,Keiji Kimura, and Hi-ronori Kasahara. Staticcoarse grain task schedul-ing with cache optimiza-tion using OpenMP. Inter-national Journal of Paral-lel Programming, 31(3):211–223, June 2003. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL /ips/frames/Refs/referenceskapmain.asp?J=4773&I=33&A=4&LK=NM; http://ipsapp007.kluweronline.com/content/getfile/4773/33/4/abstract.htm; http://ipsapp007.kluweronline.com/content/getfile/4773/33/4/fulltext.pdf.

Nitsche:2000:TCM

[Nit00] Thomas Nitsche. Threadcommunication over MPI.Lecture Notes in ComputerScience, 1908:145–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080145.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080145.pdf.

Nicolescu:2001:DTP

[NJ01] Cristina Nicolescu and PieterJonker. A data and

task parallel image pro-cessing environment. Lec-ture Notes in Computer Sci-ence, 2131:393–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310393.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310393.pdf.

Norden:2007:DDM

[NLRH07] Markus Norden, Henrik Lof,Jarmo Rantakokko, andSverker Holmgren. Dynamicdata migration for struc-tured AMR solvers. In-ternational Journal of Par-allel Programming, 35(5):477–491, October 2007.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=35&issue=5&spage=477.

Nadeau:1995:SVR

[NM95] David R. Nadeau andJohn L. Moreland, edi-tors. 1995 Symposium onthe Virtual Reality Model-ing Language, VRML ’95,San Diego, California, De-cember 14–15, 1995. ACMPress, New York, NY 10036,USA, 1995. ISBN 0-89791-818-5. LCCN QA76.76.H94

REFERENCES 341

S95 1995. ACM order num-ber 434953.

Nomura:2014:PAM

[NMS+14] Shimpei Nomura, TakujiMitsuishi, Jun Suzuki, YukiHayashi, Masaki Kan, andHideharu Amano. Perfor-mance analysis of the multi-GPU system with ExpEther.ACM SIGARCH ComputerArchitecture News, 42(4):9–14, September 2014. CO-DEN CANED2. ISSN0163-5964 (print), 1943-5851(electronic).

Nanayakkara:1993:PIR

[NMW93] A. Nanayakkara, D. Mon-crieff, and S. Wilson. Per-formance of IBM RISC Sys-tem/6000 workstation clus-ters in a quantum chem-ical application. Paral-lel Computing, 19(9):1053–1062, September 1993. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Nupairoj:1995:PES

[NN95] N. Nupairoj and L. M. Ni.Performance evaluation ofsome MPI implementationson workstation clusters. InIEEE [IEE95j], pages 98–105. ISBN 0-8186-6895-4.LCCN QA76.58 .S34 1994.

Nishitani:2000:IEO

[NNON00] Yasunori Nishitani, KiyoshiNegishi, Hiroshi Ohta, andEiji Nunohiro. Imple-mentation and evaluation

of OpenMP for HitachiSR8000. Lecture Notes inComputer Science, 1940:391–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400391.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400391.pdf.

Nakajima:2002:PISb

[NO02a] Kengo Nakajima and Hi-roshi Okuda. Parallel itera-tive solvers for unstructuredgrids using a directive/MPIhybrid programming modelfor the GeoFEM platform onSMP cluster architectures.Concurrency and Compu-tation: Practice and Ex-perience, 14(6–7):411–429,May/June 2002. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic). URL http://www3.interscience.wiley.com/cgi-bin/abstract/94515747/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=94515747{\&}PLACEBO=IE.pdf.

Nakajima:2002:PISa

[NO02b] Kengo Nakajima and Hi-roshi Okuda. Paralleliterative solvers for un-structured grids using anOpenMP/MPI hybrid pro-

REFERENCES 342

gramming model for the Ge-oFEM platform on SMPcluster architectures. Lec-ture Notes in Computer Sci-ence, 2327:437–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270437.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270437.pdf.

Noble:2008:GMY

[Nob08] Michael S. Noble. Get-ting more from your mul-ticore: exploiting OpenMPfrom an open-source nu-merical scripting language.Concurrency and Compu-tation: Practice and Ex-perience, 20(16):1877–1891,November 2008. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Novotny:1995:BPP

[Nov95] Mark Novotny. BOOKS:PVM — parallel virtual ma-chine: a users’ guide andtutorial for networked par-allel computing. Comput-ers in Physics, 9(6):607–??, ???? 1995. CODENCPHYE2. ISSN 0894-1866(print), 1558-4208 (elec-tronic).

Nemer-Preece:1994:LBH

[NP94] Nicole Anne Nemer-Preece.Load balancing the heatequation in a heterogeneousenvironment with PVM.M.s. thesis, University ofMissouri, Rolla, Rolla, MO,USA, 1994. viii + 52 pp.

Nguyen:2012:SCS

[NP12] Donald Nguyen and KeshavPingali. Synthesizing con-current schedulers for irreg-ular algorithms. ACM SIG-PLAN Notices, 47(4):333–344, April 2012. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Nikolopoulos:2000:TRD

[NPP+00a] Dimitrios S. Nikolopou-los, Theodore S. Pap-atheodorou, Constantine D.Polychronopoulos, et al. Atransparent runtime datadistribution engine for OpenMP.Scientific Programming, 8(3):143–162, 2000. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Nikolopoulos:2000:DDN

[NPP+00b] Dimitrios S. Nikolopou-los, Theodore S. Pap-atheodorou, Constantine D.Polychronopoulos, JesusLabarta, and Eduard Ayguade.Is data distribution neces-sary in OpenMP? In ACM[ACM00], page 68. URL

REFERENCES 343

http://www.sc2000.org/proceedings/techpapr/papers/pap192.pdf.

Nikolopoulos:2000:LTD

[NPP+00c] Dimitrios S. Nikolopou-los, Theodore S. Pap-atheodorou, Constantine D.Polychronopoulos, JesusLabarta, and Eduard Ayguade.Leveraging transparent datadistribution in OpenMP viauser-level dynamic page mi-gration. Lecture Notes inComputer Science, 1940:415–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400415.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400415.pdf.

Nikolopoulos:2000:ULR

[NPP+00d] Dimitrios S. Nikolopou-los, Theodore S. Pap-atheodorou, Constantine D.Polychronopoulos, JesusLabarta, and Eduard Ayguade.UPM LIB: a runtime systemfor tuning the memory per-formance of OpenMP pro-grams on scalable shared-memory multiprocessors.Lecture Notes in ComputerScience, 1915:85–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http:

//link.springer-ny.com/link/service/series/0558/bibs/1915/19150085.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1915/19150085.pdf.

Notz:2012:GBS

[NPS12] Patrick K. Notz, Roger P.Pawlowski, and James C.Sutherland. Graph-basedsoftware design for man-aging complexity and en-abling concurrency in multi-physics PDE software. ACMTransactions on Mathemati-cal Software, 39(1):1:1–1:21,November 2012. CODENACMSCU. ISSN 0098-3500(print), 1557-7295 (elec-tronic).

Nagaraj:1991:MHL

[NS91] U. Nagaraj and U. S. Shukla.MK: a high level interface formessage passing. In Bhavsarand Gujar [BG91], pages493–502. ISBN 0-920114-14-8. LCCN QA76.88.S87 1991.

Naumenko:2016:ACT

[NS16] Mikhail A. Naumenko andVyacheslav V. Samarin. Ap-plication of CUDA technol-ogy to calculation of groundstates of few-body nuclei byFeynman’s continual inte-grals method. Supercom-puting Frontiers and Inno-vations, 3(2):80–95, ????2016. CODEN ???? ISSN2409-6008 (print), 2313-8734

REFERENCES 344

(electronic). URL http://superfri.org/superfri/article/view/102.

Nascimento:2007:DDS

[NSBR07] Aline P. Nascimento, Alexan-dre C. Sena, Cristina Boeres,and Vinod E. F. Rebello.Distributed and dynamicself-scheduling of parallelMPI Grid applications. Con-currency and Computation:Practice and Experience,19(14):1955–1974, Septem-ber 25, 2007. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Nadal-Serrano:2016:PSC

[NSLV16] Jose M. Nadal-Serranoand Marisa Lopez-Vallejo.A performance study ofCUDA UVM versus man-ual optimizations in a real-world setup: Applicationto a Monte Carlo wave-particle event-based in-teraction model. IEEETransactions on Paral-lel and Distributed Sys-tems, 27(6):1579–1588, June2016. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://csdl.computer.org/csdl/trans/td/2016/06/07175058-abs.html.

Nukada:2012:SMG

[NSM12] Akira Nukada, Kento Sato,and Satoshi Matsuoka. Scal-able multi-GPU 3-D FFT

for TSUBAME 2.0 super-computer. In Hollingsworth[Hol12], pages 44:1–44:??ISBN 1-4673-0804-8. URLhttp://conferences.computer.org/sc/2012/papers/1000a044.pdf.

Neuberger:2012:MIS

[NSS12] John M. Neuberger, NandorSieben, and James W. Swift.An MPI implementation ofa self-submitting parallel jobqueue. International Jour-nal of Parallel Programming,40(4):443–464, August 2012.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=40&issue=4&spage=443.

Nandivada:2013:TFO

[NSZS13] V. Krishna Nandivada, JunShirako, Jisheng Zhao, andVivek Sarkar. A transfor-mation framework for op-timizing task-parallel pro-grams. ACM Transac-tions on Programming Lan-guages and Systems, 35(1):3:1–3:??, April 2013. CO-DEN ATPSDT. ISSN0164-0925 (print), 1558-4593(electronic).

Nogueira:2016:BBW

[NTR16] David Nogueira, PedroTomas, and Nuno Roma.BowMapCL: Burrows–Wheelermapping on multiple hetero-

REFERENCES 345

geneous accelerators. IEEE/ACMTransactions on Computa-tional Biology and Bioin-formatics, 13(5):926–938,September 2016. CODENITCBCY. ISSN 1545-5963(print), 1557-9964 (elec-tronic).

Norcen:2005:HPJ

[NU05] Roland Norcen and AndreasUhl. High performanceJPEG 2000 and MPEG-4 VTC on SMPs usingOpenMP. Parallel Comput-ing, 31(10–12):1082–1098,October/December 2005.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Nitsche:1998:FMP

[NW98] T. Nitsche and W. We-bers. Functional messagepassing with OPAL-MPI.Lecture Notes in ComputerScience, 1497:281–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Ng:2012:STT

[NYNT12] Nicholas Ng, Nobuko Yoshida,Xin Yu Niu, and Kuen HungTsoi. Session types: to-wards safe and fast reconfig-urable programming. ACMSIGARCH Computer Archi-tecture News, 40(5):22–27,December 2012. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-

tronic). HEART ’12 confer-ence proceedings.

Nguyen:1994:DCE

[NZZ94] S. T. Nguyen, B. J.Zook, and Xiaodong Zhang.Distributed computationof electromagnetic scatter-ing problems using finite-difference time-domain de-compositions. In IEEE[IEE94g], pages 85–93. ISBN0-8186-6395-2. LCCNQA76.9.D5I328 1994. IEEEcatalog no. 94TH0667-6.

Omar:2017:PSF

[OA17] Cyrus Omar and JonathanAldrich. Programmable se-mantic fragments: the de-sign and implementation oftypy. ACM SIGPLAN No-tices, 52(3):81–92, March2017. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic).

Oberhuber:1996:MNP

[Obe96] M. Oberhuber. Manag-ing nondeterminism in PVMprograms. In Bode et al.[BDLS96], pages 347–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Orr:2015:SUR

[OCY+15] Marc S. Orr, Shuai Che,Ayse Yilmazer, Bradford M.Beckmann, Mark D. Hill,

REFERENCES 346

and David A. Wood. Syn-chronization using remote-scope promotion. ACMSIGARCH Computer Ar-chitecture News, 43(1):73–86, March 2015. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic).

Okulicka-Dluzewska:2001:PFE

[OD01] Felicja Okulicka-D luzewska.Parallelization of finite el-ement package by MPI li-brary. Lecture Notes inComputer Science, 2131:427–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310427.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310427.pdf.

Olivier:2012:CMW

[OdSSP12] Stephen L. Olivier, Bro-nis R. de Supinski, MartinSchulz, and Jan F. Prins.Characterizing and miti-gating work time inflationin task parallel programs.In Hollingsworth [Hol12],pages 65:1–65:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a066.pdf.

Oed:1993:CRM

[Oed93] Wilfried Oed. The CrayResearch massively paral-lel processor system CRAYT3D. Technical report, CrayResearch GmbH, Munchen,Germany, November 151993.

Ong:2000:PCL

[OF00] Hong Ong and Paul A. Far-rell. Performance compar-ison of LAM/MPI, MPICH,and MVICH on a Linux clus-ter connected by a Giga-bit Ethernet network. InUSENIX [USE00], page ??ISBN 1-880446-17-0. LCCN???? URL http://www.usenix.org/publications/library/proceedings/als2000/ong.html.

Owaida:2015:EDS

[OFA+15] Muhsen Owaida, GabrielFalcao, Joao Andrade, Chris-tos Antonopoulos, NikolaosBellas, Madhura Purnapra-jna, David Novo, Geor-gios Karakonstantis, An-dreas Burg, and PaoloIenne. Enhancing designspace exploration by extend-ing CPU/GPU specificationsonto FPGAs. ACM Trans-actions on Embedded Com-puting Systems, 14(2):33:1–33:??, March 2015. CO-DEN ???? ISSN 1539-9087(print), 1558-3465 (elec-tronic).

REFERENCES 347

Okitsu:2010:HPC

[OIH10] Yusuke Okitsu, FumihikoIno, and Kenichi Hagi-hara. High-performancecone beam reconstructionusing CUDA compatibleGPUs. Parallel Comput-ing, 36(2–3):129–141, Febru-ary/March 2010. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic).

Ohara:2006:MMP

[OIS+06] M. Ohara, H. Inoue, Y. So-hda, H. Komatsu, andT. Nakatani. MPI micro-task for programming theCell Broadband EngineTM

processor. IBM SystemsJournal, 45(1):85–102, ????2006. CODEN IBMSA7.ISSN 0018-8670. URL http://www.research.ibm.com/journal/sj/451/ohara.html.

Oh:2012:MOO

[OKM12] Kwang Jin Oh, Ji HoonKang, and Hun Joo Myung.mm par2.0: An object-oriented molecular dynam-ics simulation program par-allelized using a hierar-chical scheme with MPIand OPENMP. Com-puter Physics Communi-cations, 183(2):440–441,February 2012. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465511003407.

Oakley:1995:ADR

[OKW95] D. R. Oakley, N. F. Knight,Jr., and D. D. Warner.Adaptive dynamic relax-ation algorithm for non-linear hyperelastic struc-tures. III. Parallel implemen-tation. Computer Methods inApplied Mechanics and En-gineering, 126(1-2):111–129,September 1995. CODENCMMECC. ISSN 0045-7825,0374-2830.

Orlando:2005:PSP

[OL05] Salvatore Orlando and DomenicoLaforenza. Preface: Se-lected papers from the EU-ROPVM/MPI 2003 Con-ference, Venice, Italy, 29September–2 October 2003.The International Journalof High Performance Com-puting Applications, 19(1):47, Spring 2005. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/19/1/47.full.pdf+html.

Oldehoeft:2002:SIS

[Old02] Rod Oldehoeft, editor. Spe-cial issue on software forhigh-performance systems:papers from the symposiumof the Los Alamos Com-puter Science Institute, heldin Santa Fe, NM, USA onOctober 15–18, 2001, vol-ume 23(1) of The journalof supercomputing. KluwerAcademic Publishers Group,

REFERENCES 348

Norwell, MA, USA, and Dor-drecht, The Netherlands,2002. CODEN JOSUED.ISSN 0920-8542 (print),1573-0484 (electronic).

Ong:2001:SUC

[OLG01] Emil Ong, Ewing Lusk, andWilliam Gropp. ScalableUnix commands for par-allel processors: a high-performance implementa-tion. Lecture Notes inComputer Science, 2131:410–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310410.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310410.pdf.

Oger:2016:DMM

[OLG+16] G. Oger, D. Le Touze,D. Guibert, M. de Leffe,J. Biddiscombe, J. Sou-magne, and J.-G. Picci-nali. On distributed mem-ory MPI-based paralleliza-tion of SPH codes in mas-sive HPC context. ComputerPhysics Communications,200(??):1–14, March 2016.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465515003070.

Olszewski:1995:TCC

[Ols95] Luke Olszewski. A tim-ing comparison of the con-jugate gradient and Gauss–Seidel parallel algorithms ina one-dimensional flow equa-tion using PVM. In ACM[ACM95a], pages 205–212.ISBN 0-89791-747-2. LCCN????

Olukotun:2014:BPP

[Olu14] Kunle Olukotun. Beyondparallel programming withdomain specific languages.ACM SIGPLAN Notices, 49(8):179–180, August 2014.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Ogawa:1996:OOM

[OM96] Hirotaka Ogawa and SatoshiMatsuoka. OMPI: Optimiz-ing MPI programs using par-tial evaluation. In ACM[ACM96c], page ?? ISBN0-89791-854-1. LCCN QA76.88 S8573 1996. URLhttp://www.supercomp.org/sc96/proceedings/SC96PROC/OGAWA/INDEX.HTM. ACMOrder Number: 415962,IEEE Computer SocietyPress Order Number: RS00126.

Ozgun:2009:PCB

[OMK09] Ozlem Ozgun, Raj Mittra,and Mustafa Kuzuoglu. Par-allelized characteristic ba-sis finite element method

REFERENCES 349

(CBFEM–MPI) — a non-iterative domain decompo-sition algorithm for electro-magnetic scattering prob-lems. Journal of Com-putational Physics, 228(6):2225–2238, April 1, 2009.CODEN JCTPAH. ISSN0021-9991 (print), 1090-2716(electronic). URL http://www.sciencedirect.com/science/article/pii/S0021999108006293.

OBroin:2012:OIS

[ON12] Cathal O Broin and L. A. A.Nikolopoulos. An OpenCLimplementation for the solu-tion of the time-dependentSchrodinger equation onGPUs and CPUs. Com-puter Physics Communi-cations, 183(10):2071–2080,October 2012. CODENCPHCBZ. ISSN 0010-4655(print), 1879-2944 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0010465512001774.

Ong:2002:MRS

[Ong02] Emil Ong. MPI Ruby:Scripting in a parallelenvironment. Comput-ing in Science and Engi-neering, 4(4):78–82, July/August 2002. CODENCSENFA. ISSN 1521-9615(print), 1558-366X (elec-tronic). URL http://csdl.computer.org/comp/mags/cs/2002/04/c4078abs.htm;http://csdl.computer.

org/dl/mags/cs/2002/04/

c4078.htm; http://csdl.computer.org/dl/mags/cs/2002/04/c4078.pdf.

OBrien:2008:SOC

[OOS+08] Kevin OBrien, KathrynOBrien, Zehra Sura, TongChen, and Tao Zhang.Supporting OpenMP onCell. International Jour-nal of Parallel Programming,36(3):289–311, June 2008.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=36&issue=3&spage=289.

Orlando:1998:MBR

[OP98] S. Orlando and R. Perego.An MPI-based run-time sup-port to coordinate HPFtasks. Lecture Notes in Com-puter Science, 1497:289–??,1998. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

Olivier:2010:COO

[OP10] Stephen L. Olivier andJan F. Prins. Comparison ofOpenMP 3.0 and other taskparallel frameworks on un-balanced task graphs. In-ternational Journal of Par-allel Programming, 38(5–6):341–360, October 2010.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/

REFERENCES 350

openurl.asp?genre=article&issn=0885-7458&volume=38&issue=5&spage=341.

ODowd:2006:WGM

[OPM06] Padraig J. O’Dowd, AdarshPatil, and John P. Mor-rison. WebCom-G andMPICH-G2 jobs. Scal-able Computing: Practiceand Experience, 7(3):75–86,September 2006. CODEN???? ISSN 1895-1767.URL http://www.scpe.org/vols/vol07/no3/SCPE_7_3_07.pdf; http://www.scpe.org/vols/vol07/no3/SCPE_7_3_07.zip.

Orlando:2000:MDT

[OPP00] S. Orlando, P. Palmerini,and R. Perego. Mixed dataand task parallelism withHPF and PVM. ClusterComputing, 3(3):201–213,2000. CODEN ???? ISSN1386-7857.

Olivier:2012:OTS

[OPW+12] Stephen L. Olivier, Al-lan K. Porterfield, Kyle B.Wheeler, Michael Spiegel,and Jan F. Prins. OpenMPtask scheduling strategies formulticore NUMA systems.The International Journal ofHigh Performance Comput-ing Applications, 26(2):110–124, May 2012. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/26/2/110.full.pdf+html.

Oliveira:2012:CCO

[ORA12] Rafael Sachetto Oliveira,Bernardo Martins Rocha,and Ronan Mendonca Amorim.Comparing CUDA, OpenCLand OpenGL implementa-tions of the cardiac mon-odomain equations. Lec-ture Notes in Computer Sci-ence, 7204:111–120, 2012.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/chapter/10.1007/978-3-642-31500-8_12/.

Overeinder:1997:BCD

[OS97] B. J. Overeinder and P. M. A.Sloot. Breaking the curseof dynamics by task migra-tion: Pilot experiments inthe Polder Metacomputer.Lecture Notes in ComputerScience, 1332:194–207, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Ostrand:1994:PIS

[Ost94] Thomas Ostrand, editor.Proceedings of the 1994 In-ternational Symposium onSoftware Testing and Anal-ysis (ISSTA): August 17–19, 1994, Seattle, Wash-ington, USA, ACM Sigsoft.ACM Press, New York, NY10036, USA, 1994. CO-DEN SFENDP. ISBN 0-89791-683-2. ISSN 0163-5948. LCCN QA76.76.T48I58 1994.

REFERENCES 351

Otto:1993:PAC

[Ott93] S. W. Otto. Parallel ar-ray classes and lightweightsharing mechanisms. Scien-tific Programming, 2(4):203–216, Winter 1993. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Otto:1994:PVM

[Ott94] S. W. Otto. Processor vir-tualization and migration forPVM. In Dongarra andTourancheau [DT94], pages66–75. ISBN 0-89871-343-9.LCCN QA76.58.I568 1994.

Otto:1992:MAP

[OW92] S. W. Otto and M. Wolfe.The MetaMP approach toparallel programming. InSiegel [Sie92a], pages 562–565. ISBN 0-8186-2772-7. LCCN QA76.58.S95 1992.IEEE catalog no. 92CH3185-6.

Ouenes:1995:PRA

[OWSA95] A. Ouenes, W. W. Weiss,J. A. Sultan, and J. An-war. Parallel reservoir au-tomatic history matching us-ing a network of worksta-tions and PVM. In Anony-mous [Ano95d], pages 125–134. ISBN ???? LCCN ????

Pacheco:1997:PPM

[Pac97] Peter S. Pacheco. Paral-lel programming with MPI.Morgan Kaufmann Publish-ers, Los Altos, CA 94022,

USA, 1997. ISBN 1-55860-339-5. xxii + 418 pp. LCCNQA76.642 .P3 1997.

Pereira:2017:SBC

[PAdS+17] Phillipe Pereira, Higo Albu-querque, Isabela da Silva,Hendrio Marques, FelipeMonteiro, Ricardo Ferreira,and Lucas Cordeiro. SMT-based context-bounded modelchecking for CUDA pro-grams. Concurrency andComputation: Practice andExperience, 29(22):??, Nov-ember 25, 2017. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Panda:1995:GRW

[Pan95a] D. K. Panda. Global re-duction in wormhole k-aryn-cube networks with multi-destination exchange worms.In IEEE [IEE95f], pages652–659. ISBN 0-8186-7074-6. LCCN QA 76.58 I56 1995.IEEE catalog no. 95TH8052.

Panda:1995:IDE

[Pan95b] D. K. Panda. Issues in de-signing efficient and prac-tical algorithms for col-lective communication onwormhole-routed systems. InAgrawal [Agr95a], pages 8–15. ISBN 0-8493-2618-4.LCCN QA76.58.I34 1995.

Panda:2014:GAM

[Pan14] Dhabaleswar K. Panda.GPU-aware MPI on RDMA-

REFERENCES 352

enabled clusters: Design,implementation and evalu-ation. IEEE Transactionson Parallel and DistributedSystems, 25(10):2595–2605,October 2014. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URL http://www.computer.org/csdl/trans/td/2014/10/06587715-abs.html.

Parsons:1993:EDC

[Par93] I. Parsons. Evaluation of dis-tributed communication sys-tems. In Gawman et al.[GGK+93], pages 956–970vol.2. ISBN ???? LCCNQA76.76.S64 C378 1993 v.1-2. Two volumes.

Pal:2014:PMH

[PARB14] Anirban Pal, AbhishekAgarwala, Soumyendu Raha,and Baidurya Bhattacharya.Performance metrics in a hy-brid MPI–OpenMP basedmolecular dynamics simula-tion with short-range inter-actions. Journal of Par-allel and Distributed Com-puting, 74(3):2203–2214,March 2014. CODEN JPD-CER. ISSN 0743-7315(print), 1096-0848 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0743731513002505.

Patterson:1993:PPE

[Pat93] Christopher S. Patterson.Parametric positron emis-sion tomographic imaging

using parallel virtual ma-chine: with an example us-ing myocardial blood flowanalysis. M.s. thesis, Univer-sity of Tennessee, Knoxville,Knoxville, TN 37996, USA,1993. x + 132 pp.

Puzniakowski:2012:TOI

[PB12] Tadeusz Puzniakowski andMarek A. Bednarczyk. To-wards an OpenCL imple-mentation of ‘genetic algo-rithms’ on GPUs. Lec-ture Notes in Computer Sci-ence, 7053:190–203, 2012.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/chapter/10.1007/978-3-642-25261-7_15/.

Pringle:2001:TPF

[PBC+01] Gavin J. Pringle, Steven P.Booth, Hugh M. P. Couch-man, Frazer R. Pearce, andAlan D. Simpson. To-wards a portable, fast par-allel AP3M-SPH code: HY-DRA MPI. Lecture Notesin Computer Science, 2131:360–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310360.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310360.pdf.

REFERENCES 353

Pingali:1995:LCP

[PBG+95] K. Pingali, U. Banerjee,D. Gelernter, A. Nico-lau, and D. Padua, edi-tors. Languages and com-pilers for parallel computing:7th International Workshop,Ithaca, NY, USA, August 8–10, 1994: proceedings, vol-ume 892 of Lecture notes incomputer science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 1995.ISBN 3-540-58868-X. LCCNQA76.58 .W656 1994.

Plazek:1999:IIC

[PBK99] J. Plazek, K. Banas, andJ. Kitowski. Implementa-tion issues of computationalfluid dynamics algorithmson parallel computers. InDongarra et al. [DLM99],pages 349–355. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Plazek:2000:SCC

[PBK00] Joanna P lazek, KrzysztofBanas, and Jacek Kitowski.Scalable CFD computationsusing message-passing anddistributed shared mem-ory algorithms. LectureNotes in Computer Sci-ence, 1908:282–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/

link/service/series/0558/bibs/1908/19080282.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080282.pdf.

Prasanna:1995:FIP

[PBPT95] Viktor K. Prasanna, V. P.Bhatkar, L. M. Patnaik,and S. K. Tripathi, edi-tors. First IWPP paral-lel processing: proceedingsof the First InternationalWorkshop on Parallel Pro-cessing (IWPP-94): Decem-ber 26–31, 1994, Banga-lore, India. Taka McGraw-Hill Pub. Co, New Delhi;New York, 1995. ISBN 0-07-462332-X. LCCN QA 76.58I587 1994.

Puthukattukaran:1994:DIP

[PCS94] J. Puthukattukaran, S. Cha-lasani, and P. Senapathy.Design and implementa-tion of parallel algorithmsfor gene-finding. In IEEE[IEE94g], pages 186–193.ISBN 0-8186-6395-2. LCCNQA76.9.D5I328 1994. IEEEcatalog no. 94TH0667-6.

Peng:2014:IDI

[PCY14] Yi Peng, Li Chen, and Jun-Hai Yong. Importance-driven isosurface decima-tion for visualization of largesimulation data based onOpenCL. Computing in Sci-ence and Engineering, 16(1):24–32, January/February

REFERENCES 354

2014. CODEN CSENFA.ISSN 1521-9615.

Poggi:1998:UPD

[PD98] Agostino Poggi and GiulioDestri. Using PVM to de-velop a distributed object-oriented language for het-erogeneous processing. TheJournal of Systems and Soft-ware, 40(2):139–150, Febru-ary 1998. CODEN JS-SODM. ISSN 0164-1212(print), 1873-1228 (elec-tronic).

Plimpton:2011:MML

[PD11] Steven J. Plimpton andKaren D. Devine. MapRe-duce in MPI for large-scalegraph algorithms. Paral-lel Computing, 37(9):610–632, September 2011. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819111000172.

Pawliczek:2014:VED

[PDY14] Piotr Pawliczek, WitoldDzwinel, and David A. Yuen.Visual exploration of databy using multidimensionalscaling on multicore CPU,GPU, and MPI cluster. Con-currency and Computation:Practice and Experience, 26(3):662–682, March 10, 2014.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Pennington:1995:DHC

[Pen95] R. L. Pennington. Dis-tributed and heterogeneouscomputing. In Vandoni andVerkerk [VV95], pages 25–57. ISBN 92-9083-069-7.CERN report 95-01.

Pernice:1996:RPP

[Per96] Michael Pernice. Review of“PVM: Parallel Virtual Ma-chine. A User’s Guide andTutorial for Networked Par-allel Computing”. IEEE par-allel and distributed technol-ogy: systems and applica-tions, 4(1):84, Spring 1996.CODEN IPDTEX. ISSN1063-6552 (print), 1558-1861(electronic). URL http://dlib.computer.org/pd/books/pd1996/pdf/p1084.pdf.

Pernice:1997:BRM

[Per97] Michael Pernice. Bookreview: MPI: The Com-plete Reference. IEEE Con-currency, 5(1):80–81, Jan-uary/March 1997. CO-DEN IECMFX. ISSN1092-3063 (print), 1558-0849(electronic). URL http://dlib.computer.org/pd/books/pd1997/pdf/p1080.pdf.

Pereira:1999:PBI

[Per99] N. S. A. Pereira. A par-allel N -body integrator us-ing MPI. Lecture Notesin Computer Science, 1573:

REFERENCES 355

627–639, 1999. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Papagapiou:1999:NWD

[PES99] A. Papagapiou, P. Evripi-dou, and G. Samaras. Net-Console: a Web-based de-velopment environment forMPI programs. In Dongarraet al. [DLM99], pages 249–256. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Petcu:1997:ISM

[Pet97] D. Petcu. Implementa-tion of some multiproces-sor algorithms for ODEs us-ing PVM. Lecture Notesin Computer Science, 1332:375–382, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Petcu:2000:PDAa

[Pet00a] Dana Petcu. PVMaple: adistributed approach to co-operative work of Mapleprocesses. Technical re-port, Westers University ofTimisoara, Timisoara, Ro-mania, May 2000. URLhttp://www.risc.uni-linz.ac.at/software/distmaple/misc/PVMaple.ps.gz.

Petcu:2000:PDAb

[Pet00b] Dana Petcu. PVMaple: adistributed approach to co-

operative work of Mapleprocesses. Lecture Notesin Computer Science, 1908:216–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080216.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080216.pdf.

Petcu:2001:WMM

[Pet01] Dana Petcu. Working withmultiple Maple kernels con-nected by Distributed Mapleor PVMaple. Technicalreport, Westers Universityof Timisoara, Timisoara,Romania, March 2001.URL http://www.risc.uni-linz.ac.at/software/distmaple/misc/petcu2001.ps.gz.

Pharr:2005:GGP

[PF05] Matt Pharr and RandimaFernando, editors. GPUgems 2: programming tech-niques for high-performancegraphics and general-purposecomputation, volume 2 ofGPU gems. Addison-Wes-ley, Reading, MA, USA,2005. ISBN 0-321-33559-7 (hardcover). xlix + 814pp. LCCN T385 .G688 2005.URL http://www-docs.tu-cottbus.de/bibliothek/public/katalog/420569.

REFERENCES 356

PDF; http://www.loc.gov/catdir/toc/ecip055/2004030181.html.

Piernas:1997:APM

[PFG97] J. Piernas, A. Flores, andJ. M. Garcia. Analyz-ing the performance of MPIin a cluster of workstationsbased on Fast Ethernet.Lecture Notes in ComputerScience, 1332:17–24, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Pjesivac-Grbovic:2005:PAM

[PGAB+05] J. Pjesivac-Grbovic, T. Angskun,G. Bosilca, G. E. Fagg,E. Gabriel, and J. J. Don-garra. Performance analy-sis of MPI collective oper-ations. In IEEE [IEE05],pages 272a–272a. ISBN 0-7695-2312-9. LCCN ????IEEE Computer Society Or-der Number P2312.

Pjesivac-Grbovic:2007:PAM

[PGAB+07] Jelena Pjesivac-Grbovic,Thara Angskun, GeorgeBosilca, Graham E. Fagg,Edgar Gabriel, and Jack J.Dongarra. Performanceanalysis of MPI collectiveoperations. The Journal ofNetworks, Software Tools,and Cluster Computing, 10(2):127–143, ???? 2007.ISSN 1386-7857.

Pjesivac-Grbovic:2007:MCA

[PGBF+07] Jelena Pjesivac-Grbovic,George Bosilca, Graham E.

Fagg, Thara Angskun, andJack J. Dongarra. MPIcollective algorithm selec-tion and quadtree encoding.Parallel Computing, 33(9):613–623, September 2007.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Prabhakar:2002:PCB

[PGC02] Achal Prabhakar, VladimirGetov, and Barbara Chap-man. Performance com-parisons of basic OpenMPconstructs. Lecture Notesin Computer Science, 2327:413–??, 2002. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270413.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270413.pdf.

Pervez:2010:FMA

[PGK+10] Salman Pervez, GaneshGopalakrishnan, Robert M.Kirby, Rajeev Thakur, andWilliam Gropp. For-mal methods applied tohigh-performance comput-ing software design: a casestudy of MPI one-sidedcommunication-based lock-ing. Software—Practiceand Experience, 40(1):23–43,January ??, 2010. CODENSPEXBL. ISSN 0038-0644

REFERENCES 357

(print), 1097-024X (elec-tronic).

Papakonstantinou:2013:ECC

[PGS+13] Alexandros Papakonstanti-nou, Karthik Gururaj,John A. Stratton, DemingChen, Jason Cong, and Wen-Mei W. Hwu. Efficient com-pilation of CUDA kernels forhigh-performance comput-ing on FPGAs. ACM Trans-actions on Embedded Com-puting Systems, 13(2):25:1–25:??, September 2013. CO-DEN ???? ISSN 1539-9087(print), 1558-3465 (elec-tronic).

Pan:2010:CPS

[PHA10] Heidi Pan, Benjamin Hind-man, and Krste Asanovic.Composing parallel soft-ware efficiently with Lithe.ACM SIGPLAN Notices,45(6):376–387, June 2010.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Pennycook:2011:PAH

[PHJM11] S. J. Pennycook, S. D. Ham-mond, S. A. Jarvis, andG. R. Mudalige. Perfor-mance analysis of a hy-brid MPI/CUDA implemen-tation of the NASLU bench-mark. ACM SIGMETRICSPerformance Evaluation Re-view, 38(4):23–29, March2011. CODEN ???? ISSN

0163-5999 (print), 1557-9484(electronic).

Pennycook:2013:IPP

[PHW+13] S. J. Pennycook, S. D. Ham-mond, S. A. Wright, J. A.Herdman, I. Miller, andS. A. Jarvis. An investi-gation of the performanceportability of OpenCL.Journal of Parallel and Dis-tributed Computing, 73(11):1439–1450, November 2013.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731512001669.

Pierce:1994:NMP

[Pie94] P. Pierce. The NX mes-sage passing interface. Par-allel Computing, 20(4):463–480, April 1994. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic).

Papadopoulos:1998:DVS

[PK98] P. M. Papadopoulos andJ. A. Kohl. Dynamic vi-sualization and steering us-ing PVM and MPI. Lec-ture Notes in Computer Sci-ence, 1497:297–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Park:2005:SOA

[PK05] Inho Park and Seon WookKim. Study of OpenMP ap-plications on the InfiniBand-

REFERENCES 358

based software distributedshared-memory system. Par-allel Computing, 31(10–12):1099–1113, October/December 2005. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic).

Papadopoulos:2001:NRC

[PKB01] Philip M. Papadopoulos,Mason J. Katz, and GregBruno. NPACI rocks clus-ters: Tools for easily deploy-ing and maintaining man-ageable high-performanceLinux clusters. LectureNotes in Computer Science,2131:10–??, 2001. CO-DEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310010.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310010.pdf.

Paul:2006:TLF

[PKB06] Jerome L. Paul, MichalKouril, and Kenneth A.Berman. A template libraryto facilitate teaching mes-sage passing parallel com-puting. In ACM [ACM06a],pages 464–468. ISBN 1-59593-259-3. ACM ordernumber 457060.

Prabhakar:2016:GCH

[PKB+16] Raghu Prabhakar, DavidKoeplinger, Kevin J. Brown,

HyoukJoong Lee, Christo-pher De Sa, Christos Kozyrakis,and Kunle Olukotun. Gen-erating configurable hard-ware from parallel patterns.ACM SIGPLAN Notices,51(4):651–665, April 2016.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Plank:1995:ADC

[PKD95] J. S. Plank, YoungbaeKim, and J. J. Dongarra.Algorithm-based disklesscheckpointing for fault tol-erant matrix operations. InIEEE [IEE95c], pages 351–360. ISBN 0-8186-7079-7. LCCN QA 76.9 F38I57 1995. IEEE catalog no.95CB35823.

Preissl:2010:OCC

[PKE+10] Robert Preissl, Alice Koniges,Stephan Ethier, WeixingWang, and Nathan Wich-mann. Overlapping commu-nication with computationusing OpenMP tasks on theGTS magnetic fusion code.Scientific Programming, 18(3–4):139–151, ???? 2010.CODEN SCIPEV. ISSN1058-9244 (print), 1875-919X (electronic).

Periyathamby:1995:NSG

[PKYW95] U. Periyathamby, B. C.Khoo, K. S. Yeo, and Q. X.Wang. A numerical simula-tion of the growth and col-lapse of vapour cavity near

REFERENCES 359

a free surface on distributedcomputing through PVM. InBilger [Bil95], pages 815–818. ISBN 0-86934-034-4.LCCN ????

Pruyne:1996:ICP

[PL96] Jim Pruyne and MironLivny. Interfacing Condorand PVM to harness thecycles of workstation clus-ters. Future GenerationComputer Systems, 12(1):67–85, May 1996. CODENFGSEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic).

Plachetka:2002:QTS

[Pla02] Tomas Plachetka. (quasi-)thread-safe PVM and (quasi-) thread-safe MPI with-out active polling. Lec-ture Notes in Computer Sci-ence, 2474:296–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740296.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740296.pdf.

Park:2004:DID

[PLK+04] K.-L. Park, H.-J. Lee, O.-Y. Kwon, S.-Y. Park, H.-W. Park, and S.-D. Kim.Design and implementationof a dynamic communica-tion MPI library for thegrid. International Journal

of Computer Applications,26(3):1–8, 2004. ISSN 1206-212X (print), 1925-7074(electronic). URL https://www.tandfonline.com/doi/full/10.1080/1206212X.2004.11441738.

Piriyakumar:2002:EFI

[PLR02] Douglas Antony Louis Piriyaku-mar, Paul Levi, and RolfRabenseifner. Enhanced fileinteroperability with par-allel MPI file-I/O in im-age processing. LectureNotes in Computer Sci-ence, 2474:174–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740174.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740174.pdf.

Pfenning:1995:OCP

[PM95] Jorg-Thomas Pfenning andChristoph Moll. Opti-mized communication pat-terns on workstation clus-ters. Parallel Computing, 21(3):373–388, March 10, 1995.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1995&volume=21&issue=3&aid=964.

REFERENCES 360

Piscaglia:1995:DOC

[PMM95] P. Piscaglia, B. Macq, andP. Maes. Distributed opti-mization of codebooks. Sig-nal Processing: Image Com-munication, 7(3):211–223,September 1995. CODENSPICEF. ISSN 0923-5965(print), 1879-2677 (elec-tronic).

Poulson:2013:ENF

[PMvdG+13] Jack Poulson, Bryan Marker,Robert A. van de Geijn,Jeff R. Hammond, andNichols A. Romero. Elemen-tal: a new framework for dis-tributed memory dense ma-trix computations. ACMTransactions on Mathemat-ical Software, 39(2):13:1–13:24, February 2013. CO-DEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic).

Pirk:2016:VVA

[PMZM16] Holger Pirk, Oscar Moll,Matei Zaharia, and SamMadden. Voodoo — avector algebra for portabledatabase performance onmodern hardware. Proceed-ings of the VLDB Endow-ment, 9(14):1707–1718, Oc-tober 2016. CODEN ????ISSN 2150-8097.

Plagianakos:2001:LCP

[PNV01] V. P. Plagianakos, N. K.Nousis, and M. N. Vra-hatis. Locating and comput-ing in parallel all the sim-

ple roots of special func-tions using PVM. Journalof Computational and Ap-plied Mathematics, 133(1–2):545–554, August 1, 2001.CODEN JCAMDI. ISSN0377-0427 (print), 1879-1778(electronic). URL http://www.sciencedirect.com/science/article/pii/S0377042700006750.

Pokorny:1996:CMP

[Pok96] S. Pokorny. A comparison ofmessage-passing paralleliza-tion to shared-memory par-allelization. Lecture Notesin Computer Science, 1156:22–??, ???? 1996. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Parrilia:1999:UPD

[POL99] L. Parrilia, J. Ortega, andA. Lloris. Using PVM fordistributed logic minimiza-tion in a network of com-puters. In Dongarra et al.[DLM99], pages 541–548.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Pai:2016:CTO

[PP16] Sreepathi Pai and KeshavPingali. A compiler forthroughput optimization ofgraph algorithms on GPUs.ACM SIGPLAN Notices,51(10):1–19, October 2016.CODEN SINODQ. ISSN

REFERENCES 361

0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Poplawski:1989:MPP

[PPF89] D. A. Poplawski, S. Pahwa,and J. M. Francioni. Mod-els of parallel program be-havior. In Anonymous[Ano89], pages 857–860 (vol.2). LCCN QA76.5.C6192151989. Two volumes.

Park:2001:CSL

[PPJ01] So-Hee Park, Mi-YoungPark, and Yong-Kee Jun. Acomparison of scalable la-beling schemes for detectingraces in OpenMP programs.Lecture Notes in ComputerScience, 2104:68–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040068.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040068.pdf.

Pagourtzis:2001:PCT

[PPR01] Aris Pagourtzis, Igor Potapov,and Wojciech Rytter. PVMcomputation of the transi-tive closure: The depen-dency graph approach. Lec-ture Notes in Computer Sci-ence, 2131:249–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http:

//link.springer-ny.com/link/service/series/0558/bibs/2131/21310249.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310249.pdf.

Papakostas:1996:PSP

[PPT96a] N. Papakostas, G. Papakon-stantinou, and P. Tsanakas.PPARDB / PVM: a portablePVM based parallel databasemanagement system. LectureNotes in Computer Science,1127:219–??, ???? 1996.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Papakostas:1996:PPP

[PPT96b] N. Papakostas, G. Papakon-stantinou, and P. Tsanakas.PPARDB/PVM: a portablePVM based parallel databasemanagement system. InBoszormenyi [Bos96]. ISBN3-540-61695-0. ISSN 0302-9743 (print), 1611-3349(electronic). LCCN QA267.A1L43 no.1127.

Papakostas:1996:UPI

[PPT96c] N. Papakostas, G. Papakon-stantinou, and P. Tsanakas.Using PVM to implementPPARDB/PVM, a portableparallel database manage-ment system. In Bode et al.[BDLS96], pages 108–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

REFERENCES 362

Pedicini:2007:PPE

[PQ07] Marco Pedicini and FrancescoQuaglia. PELCR: Paral-lel environment for opti-mal lambda-calculus reduc-tion. ACM Transactions onComputational Logic, 8(3):14:1–14:??, July 2007. CO-DEN ???? ISSN 1529-3785(print), 1557-945X (elec-tronic).

Pierce:1994:PIN

[PR94a] P. Pierce and G. Reg-nier. The Paragon imple-mentation of the NX mes-sage passing interface. InProceedings of the ScalableHigh-Performance Comput-ing Conference, May 23–25, 1994, Knoxville, Ten-nessee [PR94b], pages 184–190. ISBN 0-8186-5680-8, 0-8186-5681-6. LCCNQA76.58.S32 1994. IEEEcatalog no. 94TH0637-9.

Pierce:1994:PSH

[PR94b] P. Pierce and G. Regnier,editors. Proceedings of theScalable High-PerformanceComputing Conference, May23–25, 1994, Knoxville,Tennessee. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1994. ISBN 0-8186-5680-8, 0-8186-5681-6. LCCNQA76.58.S32 1994. IEEEcatalog no. 94TH0637-9.

Pozo:1994:FTE

[PR94c] R. Pozo and K. Reming-ton. Fast three-dimensionalelliptic solvers on distributednetwork clusters. In Jou-bert et al. [JPTE94], pages201–208. ISBN 0-444-81841-3. LCCN QA76.58 .P37941993.

Priimak:2014:FDN

[Pri14] Dmitri Priimak. Finite dif-ference numerical methodfor the superlattice Boltz-mann transport equationand case comparison ofCPU(C) and GPU(CUDA)implementations. Journalof Computational Physics,278(??):182–192, December1, 2014. CODEN JCT-PAH. ISSN 0021-9991(print), 1090-2716 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0021999114005828.

Pena:2014:CEC

[PRS+14] Antonio J. Pena, Car-los Reano, Federico Silla,Rafael Mayo, Enrique S.Quintana-Ortı, and Jose Du-ato. A complete and ef-ficient CUDA-sharing so-lution for HPC clusters.Parallel Computing, 40(10):574–588, December 2014.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819114001227.

REFERENCES 363

Prades:2016:CAX

[PRS16] Javier Prades, Carlos Reano,and Federico Silla. CUDAacceleration for Xen virtualmachines in InfiniBand clus-ters with rCUDA. ACMSIGPLAN Notices, 51(8):35:1–35:??, August 2016.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Pedroso:2000:MPC

[PS00a] Hernani Pedroso and Joao GabrielSilva. MPI-2 process cre-ation & management imple-mentation for NT clusters.Lecture Notes in ComputerScience, 1908:184–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080184.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080184.pdf.

Protopopov:2000:SMC

[PS00b] Boris V. Protopopov andAnthony Skjellum. Shared-memory communication ap-proaches for an MPI message-passing library. Concur-rency: practice and expe-rience, 12(9):799–820, Au-gust 10, 2000. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/

cgi-bin/abstract/72516482/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=72516482&PLACEBO=IE.pdf.

Pedroso:2001:WLE

[PS01a] Hernani Pedroso and Joao GabrielSilva. The WMPI libraryevolution: Experience withMPI development for Win-dows environments. Lec-ture Notes in Computer Sci-ence, 1900:1157–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1900/19001157.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1900/19001157.pdf.

Protopopov:2001:MMP

[PS01b] Boris V. Protopopov andAnthony Skjellum. A mul-tithreaded Message Pass-ing Interface (MPI) architec-ture: Performance and pro-gram issues. Journal of Par-allel and Distributed Com-puting, 61(4):449–466, April1, 2001. CODEN JPDCER.ISSN 0743-7315 (print),1096-0848 (electronic). URLhttp://www.idealibrary.com/links/doi/10.1006/jpdc.2000.1674; http://www.idealibrary.com/links/doi/10.1006/jpdc.2000.1674/pdf; http:

REFERENCES 364

//www.idealibrary.com/links/doi/10.1006/jpdc.2000.1674/ref.

Pandey:2007:SCM

[PS07] Nirved Pandey and G. K.Sharma. Startup com-parison for message pass-ing libraries with DTM onLinux clusters. The Jour-nal of Supercomputing, 39(1):59–72, January 2007.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=39&issue=1&spage=59.

Pehrson:1994:IPP

[PSB+94] Bjorn Pehrson, Imre Simon,Klaus Brunnstein, EckartRaubold, Karen Duncan,and Karl Krueger, edi-tors. Information process-ing ’94: proceedings of theIFIP 13th World ComputerCongress, Hamburg, Ger-many, 28 August–2 Septem-ber, 1994, volume A-51, A-52, A-53 of IFIP Trans-actions. A. Computer Sci-ence and Technology. North-Holland, Amsterdam, TheNetherlands, 1994. CODENITATEC. ISBN 0-444-81990-8, 0-444-81989-4. ISSN 0926-5473. LCCN QA75.5.I37851994. Three volumes.

Peters:2011:FPC

[PSHL11] Hagen Peters, Ole Schulz-Hildebrandt, and Norbert

Luttenberger. Fast in-place,comparison-based sortingwith CUDA: a study withbitonic sort. Concurrencyand Computation: Practiceand Experience, 23(7):681–693, May 2011. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Patrick:2008:CEO

[PSK08] Christina M. Patrick, Seung-Woo Son, and Mahmut Kan-demir. Comparative eval-uation of overlap strategieswith study of I/O over-lap in MPI-IO. OperatingSystems Review, 42(6):43–49, October 2008. CODENOSRED8. ISSN 0163-5980(print), 1943-586X (elec-tronic).

Preissl:2010:TMS

[PSK+10] Robert Preissl, MartinSchulz, Dieter Kranzlmuller,Bronis R. de Supinski, andDaniel J. Quinlan. Trans-forming MPI source codebased on communicationpatterns. Future Genera-tion Computer Systems, 26(1):147–154, January 2010.CODEN FGSEVI. ISSN0167-739X (print), 1872-7115 (electronic).

Prieto:1999:PRM

[PSLT99] M. Prieto, R. Santiago, I. M.Llorente, and F. Tirado.A parallel robust multigridalgorithm based on semi-

REFERENCES 365

coarsening. In Dongarraet al. [DLM99], pages 307–316. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Peng:2014:BAH

[PSM+14] Yuanxi Peng, Manuel Saldana,Christopher A. Madill, Xi-aofeng Zou, and Paul Chow.Benefits of adding hardwaresupport for broadcast andreduce operations in MP-SoC applications. ACMTransactions on Reconfig-urable Technology and Sys-tems (TRETS), 7(3):17:1–17:??, August 2014. CO-DEN ???? ISSN 1936-7406(print), 1936-7414 (elec-tronic).

Plunkett:2001:AMD

[PSSS01] Craig L. Plunkett, Alfred G.Striz, and J. Sobieszczanski-Sobieski. Application ofMPI in displacement basedmultilevel structural opti-mization. Lecture Notesin Computer Science, 2131:335–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310335.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310335.pdf.

Payrits:2000:UPC

[PSZE00] Szabolcs Payrits, ZoltanSzatmary, Laszlo Zalanyi,and Peter Erdi. Useof parallel computers inneurocomputing. LectureNotes in Computer Sci-ence, 1908:313–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080313.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080313.pdf.

Pears:2001:DLB

[PT01] Arnold N. Pears and NicolaThong. A dynamic loadbalancing architecture forPDES using PVM on clus-ters. Lecture Notes inComputer Science, 2131:166–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310166.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310166.pdf.

Pai:2013:IGC

[PTG13] Sreepathi Pai, Matthew J.Thazhuthaveetil, and R. Govin-darajan. Improving GPGPU

REFERENCES 366

concurrency with elastic ker-nels. ACM SIGPLAN No-tices, 48(4):407–418, April2013. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic).

Prost:2001:MIG

[PTH+01a] Jean-Pierre Prost, RichardTreumann, Richard Hedges,Bin Jia, and Alice Koniges.MPI-IO/GPFS, an opti-mized implementation ofMPI-IO on top of GPFS.In ACM [ACM01], page ??ISBN 1-58113-293-X. LCCN???? URL http://www.sc2001.org/papers/pap.pap186.pdf.

Prost:2001:THP

[PTH+01b] Jean-Pierre Prost, RichardTreumann, Richard Hedges,Alice Koniges, and AlisonWhite. Towards a high-performance implementationof MPI–IO on top of GPFS.Lecture Notes in ComputerScience, 1900:1253–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1900/19001253.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1900/19001253.pdf.

Peraza:2016:PGQ

[PTL+16] Joshua Peraza, Ananta Ti-

wari, Michael Laurenzano,Laura Carrington, and Al-lan Snavely. PMaC’s greenqueue: a framework forselecting energy optimalDVFS configurations in largescale MPI applications. Con-currency and Computation:Practice and Experience, 28(2):211–231, February 2016.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Pierro:2018:SFP

[PTMF18] Vincenzo Pierro, LuigiTroiano, Elena Mejuto, andGiovanni Filatrella. Stochas-tic first passage time acceler-ated with CUDA. Journal ofComputational Physics, 361(??):136–149, May 15, 2018.CODEN JCTPAH. ISSN0021-9991 (print), 1090-2716(electronic). URL http://www.sciencedirect.com/science/article/pii/S0021999118300494.

Phan-Thien:1994:CDL

[PTT94] N. Phan-Thien and D. Tul-lock. Completed dou-ble layer boundary elementmethod in elasticity andStokes flow: Distributedcomputing through PVM.Computational mechanics,14(4):370–383, July 1994.CODEN CMMEEE. ISSN0178-7675.

Prylli:1999:DHP

[PTW99] L. Prylli, B. Tourancheau,and R. Westrelin. The

REFERENCES 367

design for a high perfor-mance MPI implementationon the Myrinet network. InDongarra et al. [DLM99],pages 223–230. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Puskas:1995:LBW

[Pus95] Z. Puskas. Load balancingon workstation clusters us-ing PVM. In Ferenczi andKacsuk [FK95], pages 112–123. ISBN ???? LCCN???? Technical reportKFKI-1995-2/M,N.

Peinado:1997:HPC

[PV97] M. Peinado and R. Venkate-san. Highly parallel cryp-tographic attacks. Lec-ture Notes in Computer Sci-ence, 1332:367–374, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Park:2001:PPE

[PVKE01] Insung Park, Michael J.Voss, Seon Wook Kim, andRudolf Eigenmann. Parallelprogramming environmentfor OpenMP. Scientific Pro-gramming, 9(2–3):143–161,Spring–Summer 2001. CO-DEN SCIPEV. ISSN 1058-9244 (print), 1875-919X(electronic). URL http://iospress.metapress.com/app/home/contribution.asp%3Fwasp=7pab6qgbaf8vxg991rwy%

26referrer=parent%26backto=issue%2C8%2C11%3Bjournal%2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

Pahl:1995:CCB

[PW95] Peter Jan Pahl and HeinrichWerner, editors. Comput-ing in civil and building en-gineering: 6th Internationalconference — July 1995,Berlin, Computing in Civiland Building Engineering6th. A. A. Balkema, Brook-field, VT, USA, 1995. ISBN90-5410-556-9, 90-5410-557-7. LCCN TA345 .I565 1995v.1-2. Two volumes.

Preissl:2012:CSS

[PWD+12] Robert Preissl, Theodore M.Wong, Pallab Datta, My-ron Flickner, RaghavendraSingh, Steven K. Esser,William P. Risk, Horst D.Simon, and Dharmendra S.Modha. Compass: a scalablesimulator for an architec-ture for cognitive computing.In Hollingsworth [Hol12],pages 54:1–54:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a085.pdf.

Pang:2016:MKR

[PWP+16] Yeyong Pang, ShaojunWang, Yu Peng, XiyuanPeng, Nicholas J. Fraser, andPhilip H. W. Leong. Amicrocoded kernel recursiveleast squares processor us-ing FPGA technology. ACM

REFERENCES 368

Transactions on Reconfig-urable Technology and Sys-tems (TRETS), 10(1):5:1–5:??, December 2016. CO-DEN ???? ISSN 1936-7406(print), 1936-7414 (elec-tronic).

Prasad:1995:PPB

[PY95] S. K. Prasad and K. M.Yu. Performance of a PVM-based optimistic simulationtestbed on different paral-lel architectures. In Hamza[Ham95a], pages 511–514.ISBN 0-88986-218-4. LCCNQA76.9.C65 I295 1995.

Perla:2012:PAH

[PZ12] Francesca Perla and PaoloZanetti. Performance anal-ysis of an hybrid MPI/OpenMP ALM softwarefor life insurance poli-cies on multi-core architec-tures. Lecture Notes inComputer Science, 7312:250–253, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_19/.

Phillips:2002:NBS

[PZKK02] James C. Phillips, GengbinZheng, Sameer Kumar, andLaxmikant V. Kale. NAMD:Biomolecular simulation onthousands of processors.In IEEE [IEE02], page ??ISBN 0-7695-1524-X. LCCN

???? URL http://www.sc-2002.org/paperpdfs/pap.pap277.pdf.

Qiu:2012:PWM

[QB12] Judy Qiu and Seung-HeeBae. Performance of win-dows multicore systems onthreading and MPI. Con-currency and Computation:Practice and Experience, 24(1):14–28, January 2012.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Quoy:2000:PNN

[QMGR00] Mathias Quoy, Sorin Moga,Philippe Gaussier, and Ar-naud Revel. Paralleliza-tion of neural networks us-ing PVM. Lecture Notesin Computer Science, 1908:289–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080289.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080289.pdf.

Qaddouri:1995:MFS

[QRG95] A. Qaddouri, R. Roy, andB. Goulard. Multigroup fluxsolvers using PVM [ParallelVirtual Machine]. In ANS[ANS95], pages 1554–1562.ISBN 0-89448-198-3. LCCN

REFERENCES 369

TK9006.M37 1995. Two vol-umes.

Qaddouri:1996:CPC

[QRMG96] A. Qaddouri, R. Roy,M. Mayrand, and B. Goulard.Collision probability calcu-lation and multigroup fluxsolvers using PVM. Nu-clear Science and Engineer-ing, 123(3):392–402, July1996. CODEN NSENAO.ISSN 0029-5639.

Qu:1995:FAS

[Qu95] Su Qu. Feature-drivenarea-based stereo matchingmethod on PVM. M.s. the-sis, University of Georgia,Athens, GA, USA, 1995. x +110 pp. Directed by HamidR. Arabnia.

Quinn:2003:PPC

[Qui03] Michael J. (Michael Jay)Quinn. Parallel program-ming in C with MPI andOpenMP. McGraw-Hill,New York, NY, USA, 2003.ISBN 0-07-123265-6, 0-07-282256-2. xiv + 529 pp.LCCN QA76.73.C15 Q552003; QA76.73 .C15 Q552003.

Russell:1992:CMW

[R+92] Thomas F. Russell et al., ed-itors. Computational meth-ods in water resources IX:Proceedings of the NinthInternational Conferenceon Computational Methodsin Water Resources, held

at the University of Col-orado, Denver, in June1992. Elsevier Applied Sci-ence, London, UK, 1992.ISBN 1-85166-871-3 (set),1-85312-169-X (set: Com-putational Mechanics Pub-lications, Southampton), 1-56252-098-9 (set: Compu-tational Mechanics Publica-tions, Boston), 1-85166-791-1 (v. 1: Elsevier AppliedScience), 1-85312-197-5 (v.1: Computational Mechan-ics Publications, Southamp-ton), 1-56252-123-3 (v. 1:Computational MechanicsPublications, New York), 1-85166-870-5 (v. 2), 1-85312-198-3 (v. 2), 1-56252-124-1(v. 2). LCCN GB656.2.E42C65 1992 v.1-2 (c1992). Twovolumes.

Rashti:2009:SAM

[RA09] Mohammad J. Rashti andAhmad Afsahi. A spec-ulative and adaptive MPIrendezvous protocol overRDMA-enabled intercon-nects. International Jour-nal of Parallel Programming,37(2):223–246, April 2009.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=37&issue=2&spage=223.

Rabenseifner:1998:MGI

[Rab98] R. Rabenseifner. MPI-GLUE: Interoperable high-

REFERENCES 370

performance MPI combin-ing different vendor’s MPIworlds. Lecture Notes inComputer Science, 1470:563–??, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Rabenseifner:1999:APM

[Rab99] R. Rabenseifner. Auto-matic profiling of MPI ap-plications with hardwareperformance counters. InDongarra et al. [DLM99],pages 35–42. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Ragg:1996:PEN

[Rag96] T. Ragg. Parallelizationof an evolutionary neuralnetwork optimizer based onPVM. In Bode et al.[BDLS96], pages 351–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Ratha:1995:DED

[RAGJ95] N. K. Ratha, T. Acar,M. Gokmen, and A. K. Jain.A distributed edge detec-tion and surface reconstruc-tion algorithm. In Can-toni et al. [CLM+95], pages149–154. ISBN 0-8186-7134-3. LCCN QA76.9.A73W6751995. IEEE catalog no.95TB8093.

Ramadan:2007:TDM

[Ram07] Omar Ramadan. Three di-mensional MPI parallel im-plementation of the PMLalgorithm for truncatingfinite-difference time-domainGrids. Parallel Computing,33(2):109–115, March 2007.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Rantakokko:2005:DMO

[Ran05] Jarmo Rantakokko. A dy-namic MPI–OpenMP modelfor structured adaptive meshrefinement. Parallel Process-ing Letters, 15(1/2):37–47,March/June 2005. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Rehman:2016:VMJ

[RAS16] Waqas Ur Rehman, Muham-mad Sohaib Ayub, and Ju-naid Haroon Siddiqui. Verifi-cation of MPI Java programsusing software model check-ing. ACM SIGPLAN No-tices, 51(8):55:1–55:??, Au-gust 2016. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Roussos:2001:BMB

[RB01] George Roussos and B. J. C.Baxter. Biharmonic manybody calculations for fastevaluation of radial basisfunction interpolants in clus-

REFERENCES 371

ter environments. Lec-ture Notes in Computer Sci-ence, 2131:288–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310288.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310288.pdf.

Rufai:2005:MPO

[RBAA05] Raimi Rufai, Muslim Bozyigit,Jaralla Alghamdi, and MoatazAhmed. Multithreaded par-allelism with OpenMP. Par-allel Processing Letters, 15(4):367–378, December 2005.CODEN PPLTEE. ISSN0129-6264 (print), 1793-642X (electronic).

Rejitha:2017:EPC

[RBAI17] R. S. Rejitha, Shajulin Bene-dict, Suja A. Alex, andShany Infanto. Energy pre-diction of CUDA applicationinstances using dynamic re-gression models. Computing,99(8):765–790, August 2017.CODEN CMPTA2. ISSN0010-485X (print), 1436-5057 (electronic).

Resch:1997:CMP

[RBB97a] M. Resch, H. Berger, andT. Boenisch. A compar-ison of MPI performanceon different MPPs. Lec-ture Notes in Computer

Science, 1332:25–32, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Resch:1997:PM

[RBB97b] Michael Resch, ThomasBeisel, and Holger Berger.PACX-MPI. BI: Informatio-nen fur Nutzer des Rechen-zentrums 1997,11/12, Uni-versitat Stuttgart, ZentraleUniversitatseinrichtung, Stuttgart,Germany, 1997.

Resch:1997:PMC

[RBB97c] Michael Resch, Holger Berger,and Thomas Bonisch. Per-formance of MPI on a CrayT3E-512. BI: Informationenfur Nutzer des Rechenzen-trums 1997,5/6, UniversitatStuttgart, Zentrale Univer-sitatseinrichtung, Stuttgart,Germany, 1997. ?? pp.Third European CRAY-SGIMPP Workshop.

Rodriguez:2015:OPI

[RBB15] Marcos Rodrıguez, FernandoBlesa, and Roberto Bar-rio. OpenCL parallel inte-gration of ordinary differ-ential equations: Applica-tions in computational dy-namics. Computer PhysicsCommunications, 192(??):228–236, July 2015. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465515000703.

REFERENCES 372

Russo:2017:MPG

[RBB17] Igor L. S. Russo, Heder S.Bernardino, and Helio J. C.Barbosa. A massively par-allel grammatical evolutiontechnique with OpenCL.Journal of Parallel and Dis-tributed Computing, 109(??):333–349, November 2017.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S074373151730206X.

Reale:1994:PCU

[RBS94] F. Reale, F. Bocchino, andS. Sciortino. Parallel com-puting on Unix workstationarrays. Computer PhysicsCommunications, 83(2-3):130–140, December 1994.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic).

Reinhard:1997:MHP

[RC97] E. Reinhard and A. Chalmers.Message handling in paral-lel radiance. Lecture Notesin Computer Science, 1332:486–493, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Reimann:1996:CBT

[RCFS96] D. A. Reimann, V. Chaud-hary, M. J. Flynn, and I. K.Sethi. Cone beam tomog-raphy using MPI on het-erogeneous workstation clus-ters. In IEEE [IEE96i], pages

142–148. ISBN 0-8186-7533-0. LCCN QA76.642 .M671996.

Ross:1995:DCM

[RCG95] D. L. Ross, J. S. Collins, andJ. H. George. A dynamic ca-pacity model using concur-rent processing. Neural, Par-allel and Scientific Compu-tations, 3(2):249–262, June1995. CODEN NPACEM.ISSN 1061-5369.

Royuela:2012:ASO

[RDLQ12] Sara Royuela, AlejandroDuran, Chunhua Liao,and Daniel J. Quinlan.Auto-scoping for OpenMPtasks. Lecture Notes inComputer Science, 7312:29–43, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_3/.

Radhakrishna:1999:MBP

[RDMB99] H. Radhakrishna, S. Di-vakar, N. Magotra, andS. R. J. Brueck. MPI-based parallel implementa-tion of a lithography pat-tern simulation algorithm.Lecture Notes in ComputerScience, 1593:109–??, 1999.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

REFERENCES 373

Reeves:1996:PIC

[Ree96] A. Reeves, editor. Proceed-ings of the 1996 Interna-tional Conference on Chal-lenges for Parallel Process-ing, Ithaca, NY, USA, Au-gust 12, 1996, volume 1.IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring,MD 20910, USA, 1996.ISBN 0-8186-7623-X. LCCNQA76.58 .I34 1996. Threevolumes.

Reinefeld:2001:CDI

[Rei01] Alexander Reinefeld. Clus-ters for data-intensive ap-plications in the grid. Lec-ture Notes in ComputerScience, 2131:12–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310012.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310012.pdf.

Reussner:2001:SSK

[Reu01] Ralf H. Reussner. SKaMPI:the special Karlsruher MPI-benchmark: user man-ual. Interner Bericht 99,02,Fakultat fur Informatik,Universitat Karlsruhe, Karl-sruhe, Germany, 2001. 78pp.

Reussner:2003:USD

[Reu03] Ralf H. Reussner. Us-ing SKaMPI for develop-ing high-performance MPIprograms with performanceportability. Future Gen-eration Computer Systems,19(5):749–759, July 2003.CODEN FGSEVI. ISSN0167-739X (print), 1872-7115 (electronic).

Roy:2000:MGQ

[RFG+00] Alain J. Roy, Ian Foster,William Gropp, NicholasKaronis, Volker Sander, andBrian Toonen. MPICH-GQ:Quality-of-service for mes-sage passing programs. InACM [ACM00], page 54.URL http://www.sc2000.org/proceedings/techpapr/papers/pap234.pdf.

Reynders:1995:OOO

[RFH+95] John V. W. Reynders,David W. Forslund, Paul J.Hinker, Marydell Thol-burn, David G. Kilman,and William F. Humphrey.OOPS: an object-orientedparticle simulation class li-brary for distributed archi-tectures. Computer PhysicsCommunications, 87(1–2):212–224, May 2, 1995. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/001046559400172X.

REFERENCES 374

Russ:1996:HAT

[RFRH96] S. H. Russ, B. Flachs,J. Robinson, and B. Heckel.Hector: automated taskallocation for MPI. InIEEE [IEE96e], pages 344–348. ISBN 0-8186-7255-2. LCCN QA76.58 .I5651996. IEEE catalog number96TB100038. IEEE Com-puter Society Press ordernumber PR07255.

Rasch:2018:MDH

[RG18] Ari Rasch and Sergei Gor-latch. Multi-dimensional ho-momorphisms and their im-plementation in OpenCL.International Journal ofParallel Programming, 46(1):101–119, February 2018.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic).

Rough:1997:PRD

[RGD97] J. Rough, A. Goscinski, andD. De Paoli. PVM onthe RHODOS distributedoperating system. Lec-ture Notes in Computer Sci-ence, 1332:208–218, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Rodrigues:2013:MAA

[RGD13] A. Wendell O. Rodrigues,Frederic Guyomarc’h, andJean-Luc Dekeyser. AnMDE approach for au-tomatic code generationfrom UML/MARTE to

OpenCL. Computing in Sci-ence and Engineering, 15(1):46–55, January/February2013. CODEN CSENFA.ISSN 1521-9615.

Rico-Gallego:2015:ILM

[RGDM15] Juan-Antonio Rico-Gallegoand Juan-Carlos Dıaz-Martın.τ -Lop: Modeling perfor-mance of shared memoryMPI. Parallel Comput-ing, 46(??):14–31, July 2015.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819115000447.

Rico-Gallego:2016:EIL

[RGDML16] Juan-Antonio Rico-Gallego,Juan-Carlos Dıaz-Martın,and Alexey L. Lastovetsky.Extending τ -lop to modelconcurrent MPI communi-cations in multicore clus-ters. Future GenerationComputer Systems, 61(??):66–82, August 2016. CO-DEN FGSEVI. ISSN 0167-739X (print), 1872-7115(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167739X16300346.

Reussner:2001:APP

[RH01] Ralf Reussner and Gun-nar Hunzelmann. Achiev-ing performance portabil-ity with SKaMPI for high-performance MPI programs.Lecture Notes in ComputerScience, 2074:841–??, 2001.

REFERENCES 375

CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2074/20740841.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2074/20740841.pdf.

Roda:1996:PEI

[RHG+96] J. Roda, J. Herrera, J. Gon-zalez, C. Rodriguez, F. Almeida,and D. Gonzalez. Practi-cal experiments to improvePVM algorithms. In Bodeet al. [BDLS96], pages 30–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Rizzardi:2017:ATS

[Riz17] Mariarosaria Rizzardi. Al-gorithm 981: Talbot SuiteDE: Application of mod-ified Talbot’s method tosolve differential problems.ACM Transactions on Math-ematical Software, 44(2):18:1–18:23, September 2017.CODEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic). URL http://dl.acm.org/citation.cfm?id=3089248.

Ratha:1995:CUC

[RJC95] N. K. Ratha, A. K. Jain,and M. J. Chung. Clus-tering using a coarse-grainedparallel genetic algorithm: a

preliminary study. In Can-toni et al. [CLM+95], pages331–338. ISBN 0-8186-7134-3. LCCN QA76.9.A73W6751995. IEEE catalog no.95TB8093.

Rodrigues:2014:TPS

[RJDH14] Christopher Rodrigues, ThomasJablin, Abdul Dakkak, andWen-Mei Hwu. Triolet: aprogramming system thatunifies algorithmic skele-ton interfaces for high-performance cluster comput-ing. ACM SIGPLAN No-tices, 49(8):247–258, Au-gust 2014. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Robinson:1993:ECD

[RJMC93] D. F. Robinson, D. Judd,P. K. McKinely, and B. H. C.Cheng. Efficient collec-tive data distribution in all-port wormhole-routed hy-percubes. Proceedings of theSupercomputing Conference,pages 792–801, ???? 1993.CODEN ???? ISBN 0-8186-4340-4. ISSN 1063-9535.

Rabenseifner:2001:ECF

[RK01] Rolf Rabenseifner and Al-ice E. Koniges. Effec-tive communication and file-I/O bandwidth benchmarks.Lecture Notes in ComputerScience, 2131:24–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349

REFERENCES 376

(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310024.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310024.pdf.

Ragan-Kelley:2013:HLC

[RKBA+13] Jonathan Ragan-Kelley, Con-nelly Barnes, Andrew Adams,Sylvain Paris, Fredo Du-rand, and Saman Amaras-inghe. Halide: a languageand compiler for optimizingparallelism, locality, and re-computation in image pro-cessing pipelines. ACM SIG-PLAN Notices, 48(6):519–530, June 2013. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Rungsawang:2001:LCP

[RLL01] A. Rungsawang, A. Lao-hakanniyom, and M. Lert-prasertkune. Low-costparallel text retrieval us-ing PC-cluster. LectureNotes in Computer Sci-ence, 2131:419–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310419.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310419.pdf.

Rubio-Largo:2012:UMO

[RLVRGP12] Alvaro Rubio-Largo, Miguel A.Vega-Rodrıguez, and Juan A.Gomez-Pulido. Using a mul-tiobjective OpenMP+MPIDE for the static RWAproblem. Lecture Notesin Computer Science, 6927:224–231, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/content/pdf/10.1007/978-3-642-27549-4_29.

Roe:1999:PMI

[RM99] Kevin Roe and PiyushMehrotra. Parallelization ofa multigrid incompressibleviscous cavity flow solver us-ing openMP. NASA con-tractor report NASA/CR-1999-209551, NASA LangleyResearch Center, Hampton,VA, USA, 1999. ???? pp.Also ICASE report 99-36.

Rietmann:2012:FAS

[RMNM+12] Max Rietmann, Peter Mess-mer, Tarje Nissen-Meyer,Daniel Peter, Piero Basini,Dimitri Komatitsch, OlafSchenk, Jeroen Tromp, LapoBoschi, and Domenico Gi-ardini. Forward and ad-joint simulations of seismicwave propagation on emerg-ing large-scale GPU archi-tectures. In Hollingsworth[Hol12], pages 38:1–38:??ISBN 1-4673-0804-8. URL

REFERENCES 377

http://conferences.computer.org/sc/2012/papers/1000a104.pdf.

Rodrigues:2013:POM

[RNPM13] Eduardo R. Rodrigues,Philippe O. A. Navaux,Jairo Panetta, and Celso L.Mendes. Preserving the orig-inal MPI semantics in avirtualized processor envi-ronment. Science of Com-puter Programming, 78(4):412–421, April 1, 2013.CODEN SCPGD4. ISSN0167-6423 (print), 1872-7964(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167642312001335.

Rohrl:2000:PPS

[Roh00] Armin Rohrl. Parallel pro-cessing in statistical compu-tation: BSP, FPGas andMPI for the S-language.These sciences, EPF Lau-sanne, Lausanne, Switzer-land, 2000. 137 pp.

Rolfe:1994:PAP

[Rol94] T. J. Rolfe. PVM: An af-fordable parallel processingenvironment. In Anony-mous [Ano94h], pages 118–125. ISBN ???? LCCN ????

Rolfe:2008:PFO

[Rol08a] Timothy J. Rolfe. Per-verse and foolish oft Istrayed. SIGCSE Bul-letin (ACM Special Inter-est Group on ComputerScience Education), 40(2):

52–55, June 2008. CO-DEN SIGSD3. ISSN 0097-8418 (print), 2331-3927(electronic). URL ftp://ftp.math.utah.edu/pub/mirrors/ftp.ira.uka.de/bibliography/Misc/DBLP/2008.bib.

Rolfe:2008:SMA

[Rol08b] Timothy J. Rolfe. A spec-imen MPI application: N -queens in parallel. SIGCSEBulletin (ACM Special In-terest Group on ComputerScience Education), 40(4):42–45, December 2008. CO-DEN SIGSD3. ISSN 0097-8418 (print), 2331-3927(electronic).

Rosen:2013:PVA

[Ros13] Paul Rosen. Performance:A visual approach to inves-tigating shared and globalmemory behavior of CUDAkernels. Computer Graph-ics Forum, 32(3pt2):161–170, June 2013. CODENCGFODY. ISSN 0167-7055(print), 1467-8659 (elec-tronic).

Ramon:1995:PKV

[RP95] J. Ramon and P. Pena.Parallelization of KENO-VaMonte Carlo code. Com-puter Physics Communi-cations, 88(1):76–82, July1995. CODEN CPHCBZ.ISSN 0010-4655 (print),1879-2944 (electronic). URLhttp://www.sciencedirect.

REFERENCES 378

com/science/article/pii/001046559500025B.

Rodriguez:2008:FTS

[RPM+08] Gabriel Rodrıguez, Xoan C.Pardo, Marıa J. Martın, Pa-tricia Gonzalez, and DanielDıaz. A fault tolerance solu-tion for sequential and MPIapplications on the Grid.Scalable Computing: Prac-tice and Experience, 9(2):101–109, June 2008. CO-DEN ???? ISSN 1895-1767.URL http://www.scpe.org/vols/vol09/no2/SCPE_9_2_03.pdf; http://www.scpe.org/vols/vol09/no2/SCPE_9_2_03.zip.

Rabaea:2000:EPM

[RR00] Adrian Rabaea and Mon-ica Rabaea. Experimentswith parallel Monte Carlosimulation for pricing op-tions using PVM. Lec-ture Notes in Computer Sci-ence, 1908:330–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080330.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080330.pdf.

Rageb:2001:CEM

[RR01] Khaled Rageb and WolfgangRehm. CHEMPI: efficientMPI for VIA/SCI. Preprint-Reihe des Chemnitzer SFB

393, Technische UniversitatChemnitz, Chemnitz, Ger-many, 2001. 12 pp.

Rauber:2002:LSH

[RR02] Thomas Rauber and GudulaRunger. Library sup-port for hierarchical multi-processor tasks. In IEEE[IEE02], page ?? ISBN0-7695-1524-X. LCCN???? URL http://www.sc-2002.org/paperpdfs/pap.pap176.pdf.

Roda:1997:PPI

[RRAGM97] J. L. Roda, C. Rodriguez,F. Almeida, and D. Gonzalez-Morales. Predicting theperformance of injectioncommunication patterns onPVM. Lecture Notes inComputer Science, 1332:33–40, 1997. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

Roig:2001:EMM

[RRBL01] Concepcio Roig, Ana Ripoll,Javier Borras, and EmilioLuque. Efficient mappingfor message-passing appli-cations using the TTIGmodel: a case study inimage processing. Lec-ture Notes in Computer Sci-ence, 2131:370–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310370.htm;

REFERENCES 379

http://link.springer-ny.com/link/service/series/0558/papers/2131/21310370.pdf.

Robinson:1996:TMI

[RRFH96] J. Robinson, S. H. Russ,B. Flachs, and B. Heckel. Atask migration implementa-tion of the Message-PassingInterface. In IEEE [IEE96f],pages 61–68. ISBN 0-8186-7582-9. LCCN QA 76.88 I521996. IEEE catalog numberTB100069.

Russ:1999:UHR

[RRG+99] Samuel H. Russ, JonathanRobinson, Matt Gleeson,Brad Meyers, Laxman Ra-jagopalan, and Chun-HeongTan. Using Hector to runMPI programs over net-worked workstations. Con-currency: practice andexperience, 11(4):189–204,April 10, 1999. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract?ID=61004080;http://www3.interscience.

wiley.com/cgi-bin/fulltext?ID=61004080&PLACEBO=IE.pdf. Special Issue: Appli-cations of Distributed Com-puting Environments.

Rabenseifner:1993:CDR

[RS93] R. Rabenseifner and A. Schuch.Comparison of DCE RPC,DFN-RPC, ONC and PVM.In Schill [Sch93], pages 39–

46. ISBN 3-540-57306-2, 0-387-57306-2. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.9.C55I58 1993.

Reinefeld:1995:PVE

[RS95] A. Reinefeld and V. Sch-necke. Portability versus effi-ciency? parallel applicationson PVM and Parix. In Fritz-son and Finmo [FF95], pages35–49. ISBN 90-5199-229-7 (IOS Press), 4-274-90056-8(Ohmsha). LCCN ????

Roy:1997:PNT

[RS97] R. Roy and Z. Stankovski.Parallelization of neutrontransport solvers. Lec-ture Notes in Computer Sci-ence, 1332:494–501, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Rambu:1995:DSS

[RSBT95] N. Rambu, S. Stefan,D. Borsan, and S. Talpos. Adiagnostic study of some me-teorological fields simulatedwith UKMO and MPI atmo-spheric general circulationmodels. In Gates [Gat95],pages 493–498. ISBN ????LCCN SIO 1 WO326 v.92.

Reano:2015:IUE

[RSC+15] Carlos Reano, Federico Silla,Adrian Castello, Antonio J.Pena, Rafael Mayo, En-rique S. Quintana-Ortı, andJose Duato. Improving

REFERENCES 380

the user experience of therCUDA remote GPU virtu-alization framework. Con-currency and Computation:Practice and Experience,27(14):3746–3770, Septem-ber 25, 2015. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Reussner:1998:SDA

[RSPM98] R. Reussner, P. Sanders,L. Prechelt, and M. Mueller.SKaMPI: a detailed, accu-rate MPI benchmark. Lec-ture Notes in ComputerScience, 1497:52–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Reussner:2002:SCB

[RST02] Ralf Reussner, Peter Sanders,and Jesper Larsson Traff.SKaMPI: a comprehensivebenchmark for public bench-marking of MPI. Scien-tific Programming, 10(1):55–65, 2002. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic). URL http://iospress.metapress.com/app/home/contribution.asp%3Fwasp=9ejnuvwuvby9737jte27%26referrer=parent%26backto=issue%2C6%2C9%3Bjournal%2C2%2C12%3Blinkingpublicationresults%2C1%2C1.

Rozman:2006:CPL

[RsT06] Igor Rozman, Marjan sterk,

and Roman Trobec. Com-munication performance ofLAM/MPI and MPICH ona Linux cluster. ParallelProcessing Letters, 16(3):323–334, September 2006.CODEN PPLTEE. ISSN0129-6264 (print), 1793-642X (electronic).

Roberti:2005:PIL

[RSV+05] Debora R. Roberti, Roberto P.Souto, Haroldo F. CamposVelho, Gervasio A. Degrazia,and Domenico Anfossi. Par-allel implementation of a La-grangian stochastic modelfor pollutant dispersion. In-ternational Journal of Par-allel Programming, 33(5):485–498, October 2005.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=33&issue=5&spage=485.

Reussner:2000:BMD

[RTH00] Ralf Reussner, Jesper Lars-son Traff, and Gunnar Hun-zelmann. A benchmarkfor MPI derived datatypes.Lecture Notes in ComputerScience, 1908:10–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080010.htm;http://link.springer-

REFERENCES 381

ny.com/link/service/series/0558/papers/1908/19080010.pdf.

Rungsawang:1999:PDT

[RTL99] A. Rungsawang, A. Tang-pong, and P. Laohawee. Par-allel DSIR text retrieval sys-tem. In Dongarra et al.[DLM99], pages 325–332.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Rycerz:2007:IBS

[RTRG+07] Katarzyna Rycerz, AlfredoTirado-Ramos, Alessia Gua-landris, Simon F. PortegiesZwart, Marian Bubak, andPeter M. A. Sloot. Interac-tive N-body simulations onthe Grid: HLA versus MPI.The International Journal ofHigh Performance Comput-ing Applications, 21(2):210–221, May 2007. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/21/2/210.full.pdf+html.

Ropo:2009:RAP

[RWD09] Matti Ropo, Jan Wester-holm, and Jack Dongarra,editors. Recent Advancesin Parallel Virtual Machineand Message Passing Inter-face: 16th European PVM/MPI Users’ Group Meeting,Espoo, Finland, September

7–10, 2009. Proceedings, vol-ume 5759 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2009. CO-DEN LNCSD9. ISBN 3-642-03769-0 (print), 3-642-03770-4 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-642-03770-2.

Simonsen:1993:DMD

[SA93] H. H. Simonsen and J. Amund-sen. Distributed moleculardynamics using the PVMsystem. In Sincovec [Sin93],pages 183–186. ISBN 0-89871-315-3. LCCN QA76.58 S55 1993. Two vol-umes.

Saarinen:1994:EES

[Saa94] S. Saarinen. EASYPVM —an enhanced subroutine li-brary for PVM. In Gentzschand Harms [GH94], pages267–272. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Sainio:2010:CGA

[Sai10] J. Sainio. CUDAEASY— a GPU accelerated cos-mological lattice program.Computer Physics Com-munications, 181(5):906–912, May 2010. CO-

REFERENCES 382

DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465510000159.

Sato:2017:NIT

[SAL+17] Kento Sato, Dong H. Ahn,Ignacio Laguna, Gregory L.Lee, Martin Schulz, andChristopher M. Chambreau.Noise injection techniquesto expose subtle and unin-tended message races. ACMSIGPLAN Notices, 52(8):89–101, August 2017. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Saphir:1997:SMI

[Sap97] William Saphir. A sur-vey of MPI implementations.NHSE Review, 2(1):??, Nov-ember 1997.

Soldado:2016:ECM

[SAP16] Fabio Soldado, FernandoAlexandre, and Herve Paulino.Execution of compoundmulti-kernel OpenCL com-putations in multi-CPU/multi-GPU environments.Concurrency and Compu-tation: Practice and Ex-perience, 28(3):768–787,March 10, 2016. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Sahimi:2001:AAS

[SAS01] Mohd Salleh Sahimi, NormaAlias, and Elankovan Sun-dararajan. The AGEB al-gorithm for solving the heatequation in three space di-mensions and its paralleliza-tion using PVM. Lec-ture Notes in Computer Sci-ence, 2073:918–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2073/20730918.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2073/20730918.pdf.

Schuster:1995:CSM

[SB95] G. Schuster and F. Breit-enecker. Coupling simula-tors with the model intercon-nection concept and PVM.In Breitenecker and Husin-sky [BH95], pages 321–326.ISBN 0-444-82241-0. LCCNA76.9.C65E966 1995.

Smith:2001:DMM

[SB01] Lorna Smith and MarkBull. Development of mixedmode MPI/OpenMP appli-cations. Scientific Program-ming, 9(2–3):83–98, Spring–Summer 2001. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic). URL http://iospress.metapress.com/app/home/contribution.

REFERENCES 383

asp%3Fwasp=7pab6qgbaf8vxg991rwy%26referrer=parent%26backto=issue%2C3%2C11%3Bjournal%2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

Seyfarth:1994:GEE

[SBF94] B. R. Seyfarth, J. L. Bick-ham, and M. R. Fernan-dez. Glenda: an environ-ment for easy parallel pro-gramming. In Pierce andRegnier [PR94b], pages 637–641. ISBN 0-8186-5680-8, 0-8186-5681-6. LCCNQA76.58.S32 1994. IEEEcatalog no. 94TH0637-9.

Schulz:2004:IES

[SBF+04] Martin Schulz, Greg Bron-evetsky, Rohit Fernan-des, Daniel Marques, Ke-shav Pingali, and PaulStodghill. Implementa-tion and evaluation of ascalable application-levelcheckpoint-recovery schemefor MPI programs. In ACM[ACM04], page 38. ISBN 0-7695-2153-3. LCCN ????

Selikhov:2002:MCC

[SBG+02] Anton Selikhov, GeorgeBosilca, Cecile Germain,Gilles Fedak, and FranckCappello. MPICH-CM: acommunication library de-sign for a P2P MPI imple-mentation. Lecture Notesin Computer Science, 2474:323–??, 2002. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://

link.springer.de/link/service/series/0558/bibs/2474/24740323.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740323.pdf.

Schindewolf:2012:WSA

[SBG+12] Martin Schindewolf, BarnaBihari, John Gyllenhaal,Martin Schulz, Amy Wang,and Wolfgang Karl. Whatscientific applications canbenefit from hardware trans-actional memory? InHollingsworth [Hol12], pages90:1–90:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a073.pdf.

Sani:2014:PDF

[SBQZ14] Ardalan Amiri Sani, KevinBoos, Shaopu Qin, and LinZhong. I/O paravirtualiza-tion at the device file bound-ary. ACM SIGARCH Com-puter Architecture News, 42(1):319–332, March 2014.CODEN CANED2. ISSN0163-5964 (print), 1943-5851(electronic).

Smith:1995:CRC

[SBR95] K. A. Smith, A. J. Baratta,and G. E. Robinson. Cou-pled RELAP5 and CON-TAIN accident analysis us-ing PVM. Nuclear safety,36(1):94–108, January–June1995. CODEN NUSAAZ.ISSN 0029-5604.

REFERENCES 384

Smith:2004:SIP

[SBT04] Kevin B. Smith, Aart J. C.Bik, and Xinmin Tian. Sup-port for the Intel(R) Pen-tium(R) 4 processor withhyper-threading technologyin Intel(R) 8.0 compilers. In-tel Technology Journal, 8(1):19–31, February 2004. ISSN1535-766X. URL http://developer.intel.com/technology/itj/2004/volume08issue01/art02_compilers/p01_abstract.htm.

Saltz:1991:MRT

[SBW91] J. Saltz, H. Berryman, andJ. Wu. Multiprocessors andrun-time compilation. Con-currency: practice and ex-perience, 3(6):573–592, De-cember 1991. CODENCPEXEI. ISSN 1040-3108.

Stubbs:1995:ICE

[SC95] S. S. Stubbs and D. L.Carver. IPCC++: a C++extension for interprocesscommunication with objects.In IEEE [IEE95l], pages205–210. ISBN 0-8186-7119-X. LCCN QA 76.6 C62951995. IEEE catalog no.95CB35838.

Smith:1996:UWC

[SC96a] N. P. G. Smith and C. Christopou-los. Utilising workstationclusters with PVM for thesolution of large TLM prob-lems. In Silvester [Sil96],pages 3–11. ISBN 1-85312-395-1. LCCN TK5.I59 1996.

Steed:1996:PPP

[SC96b] M. R. Steed and M. J.Clement. Performance pre-diction of PVM programs.In IEEE [IEE96e], pages803–807. ISBN 0-8186-7255-2. LCCN QA76.58.I565 1996. IEEE catalognumber 96TB100038. IEEEComputer Society Press or-der number PR07255.

Sievert:2004:SMP

[SC04] Otto Sievert and HenriCasanova. A simple MPIprocess swapping architec-ture for iterative applica-tions. The InternationalJournal of High Perfor-mance Computing Applica-tions, 18(3):341–352, Fall2004. CODEN IHPCFL.ISSN 1094-3420 (print),1741-2846 (electronic). URLhttp://hpc.sagepub.com/content/18/3/341.full.pdf+html.

Saillard:2014:PCS

[SCB14] Emmanuelle Saillard, PatrickCarribault, and Denis Barthou.PARCOACH: Combiningstatic and dynamic valida-tion of MPI collective com-munications. The Interna-tional Journal of High Per-formance Computing Appli-cations, 28(4):425–434, Nov-ember 2014. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.

REFERENCES 385

sagepub.com/content/28/4/425.

Saillard:2015:SDV

[SCB15] Emmanuelle Saillard, PatrickCarribault, and Denis Barthou.Static/dynamic validation ofMPI collective communica-tions in multi-threaded con-text. ACM SIGPLAN No-tices, 50(8):279–280, Au-gust 2015. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Stagg:1995:IPN

[SCC95] A. K. Stagg, D. D. Cline,and G. F. Carey. Implement-ing a parabolized Navier–Stokes flow solver on theCray T3D. In Bailey et al.[BBG+95], pages 143–148.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

Shyu:1996:ILQ

[SCC96] Shyong Jian Shyu, H. K.-C. Chang, and K.-C. Chou.Implementation of a lin-ear quadtree coding schemeon the parallel virtual ma-chine. International Journalof High Speed Computing, 8(1):65–79, March 1996. CO-DEN IHSCEZ. ISSN 0129-0533.

Schill:1993:DOD

[Sch93] Alexander Schill, editor.DCE — the OSF dis-tributed computing environ-ment: client/server model

and beyond: InternationalDCE Workshop, Karlsruhe,Germany, October 7–8,1993: proceedings, number731 in Lecture Notes inComputer Science. Spring-er-Verlag, Berlin, Ger-many / Heidelberg, Ger-many / London, UK / etc.,1993. ISBN 3-540-57306-2, 0-387-57306-2. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.9.C55I58 1993.

Schneenman:1994:DSS

[Sch94] Richard D. Schneenman.Distributed supercomputingsoftware: experiences withthe parallel virtual machine— PVM. Technical ReportNISTIR 5381, U.S. Dept. ofCommerce, National Insti-tute of Standards and Tech-nology, Gaithersburg, MD,USA, 1994. vi + 18 pp.

Schuele:1996:PLA

[Sch96a] J. Schuele. Parallel Lanc-zos algorithm on a CRAY-T3D combining PVM andSHMEM routines. LectureNotes in Computer Science,1156:158–??, ???? 1996.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Schule:1996:PLA

[Sch96b] J. Schule. Parallel Lanc-zos algorithm on a CRAY-T3D combining PVM andSHMEM routines. In Bode

REFERENCES 386

et al. [BDLS96], pages158–165. ISBN 3-540-61779-5. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.E9751996.

Schuele:1999:HAP

[Sch99] J. Schuele. Heading for anasynchronous parallel oceanmodel. In Dongarra et al.[DLM99], pages 404–409.ISBN 3-540-66549-8 (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Schevtschenko:2001:PAS

[Sch01] I. V. Schevtschenko. Aparallel ADI and steep-est descent methods. Lec-ture Notes in Computer Sci-ence, 2131:265–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310265.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310265.pdf.

Song:1997:ALL

[SCL97] Jianjian Song, Heng KekChoo, and Kuok Ming Lee.Application-level load mi-gration and its implementa-tion on top of PVM. Con-currency: practice and ex-perience, 9(1):1–19, January

1997. CODEN CPEXEI.ISSN 1040-3108.

Suppi:2000:IOP

[SCL00] Remo Suppi, FernandoCores, and Emilio Luque.Improving optimistic PDESin PVM environments. Lec-ture Notes in Computer Sci-ence, 1908:304–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080304.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080304.pdf.

Suppi:2001:PCS

[SCL01] Remo Suppi, FernandoCores, and Emilio Luque.PDES: a case study us-ing the switch time warp.Lecture Notes in ComputerScience, 2131:327–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310327.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310327.pdf.

Santos:1997:ECP

[SCP97] L. P. Santos, V. Castro,and A. Proenca. Evalua-tion of the communication

REFERENCES 387

performance on a parallelprocessing system. Lec-ture Notes in ComputerScience, 1332:41–48, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

SCRI:1992:PWC

[SCR92] Proceedings of the Work-shop on Cluster Comput-ing. Supercomputing Com-putations Research Insti-tute, Florida State Univer-sity, Tallahassee, FL, USA,December 1992. ISBN ????LCCN ???? Proceed-ings available via anonymousftp from ftp.scri.fsu.eduin directory pub/parallel-workshop.92.

Shi:2012:VGA

[SCSL12] Lin Shi, Hao Chen, Jian-hua Sun, and Kenli Li.vCUDA: GPU-acceleratedhigh-performance comput-ing in virtual machines.IEEE Transactions on Com-puters, 61(6):804–816, June2012. CODEN ITCOB4.ISSN 0018-9340 (print),1557-9956 (electronic).

Szeberenyi:1999:SGB

[SD99] I. Szeberenyi and G. Domokos.Solving generalized bound-ary value problems with dis-tributed computing and re-cursive programming. InDongarra et al. [DLM99],pages 267–274. ISBN 3-540-66549-8 (softcover). ISSN

0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

SM-D:2013:BRC

[SD13] SM-D. Book review: CUDAProgramming, Shane Cook.Morgan Kaufmann. ISBN978-0-12-415933-4. Net-work Security, 2013(1):4,January 2013. CODENNTSCF5. ISSN 1353-4858(print), 1872-9371 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S1353485813700151.

Sorensen:2016:EER

[SD16] Tyler Sorensen and Alas-tair F. Donaldson. Exposingerrors related to weak mem-ory in GPU applications.ACM SIGPLAN Notices,51(6):100–113, June 2016.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Skjellum:1994:WLM

[SDB94] A. Skjellum, N. E. Doss,and P. V. Bangalore. Writ-ing libraries in MPI. InIEEE [IEE94f], pages 166–173. ISBN 0-8186-4980-1.LCCN QA76.58.S34 1993.

Sorensen:2016:PIW

[SDB+16] Tyler Sorensen, Alastair F.Donaldson, Mark Batty,Ganesh Gopalakrishnan,and Zvonimir Rakamaric.Portable inter-workgroup

REFERENCES 388

barrier synchronisation forGPUs. ACM SIGPLAN No-tices, 51(10):39–58, Octo-ber 2016. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Schmitt:2017:SCP

[SDJ17] Felix Schmitt, Robert Diet-rich, and Guido Juckeland.Scalable critical-path anal-ysis and optimization guid-ance for hybrid MPI–CUDAapplications. The Interna-tional Journal of High Per-formance Computing Appli-cations, 31(6):485–498, Nov-ember 2017. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic).

Sandes:2010:CUG

[SdM10] Edans Flavius O. Sandesand Alba Cristina M. A.de Melo. CUDAlign: us-ing GPU to accelerate thecomparison of megabase ge-nomic sequences. ACM SIG-PLAN Notices, 45(5):137–146, May 2010. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Sistare:1999:MSP

[SDN99] Steve Sistare, Erica Dorenkamp,and Nick Nevin. MPI sup-port in the Prism program-ming environment. In ACM[ACM99], page ??

Sampaio:2013:DA

[SdSCP13] Diogo Sampaio, Rafael Mar-tins de Souza, SylvainCollange, and FernandoMagno Quintao Pereira. Di-vergence analysis. ACMTransactions on Program-ming Languages and Sys-tems, 35(4):13:1–13:??, De-cember 2013. CODENATPSDT. ISSN 0164-0925(print), 1558-4593 (elec-tronic).

Skjellum:1995:EMP

[SDV+95] A. Skjellum, N. E. Doss,K. Viswanathan, A. Chow-dappa, and P. V. Banga-lore. Extending the messagepassing interface (MPI). InIEEE [IEE95j], pages 106–118. ISBN 0-8186-6895-4.LCCN QA76.58 .S34 1994.

Sack:2002:FMB

[SE02] Paul Sack and Anne C.Elster. Fast MPI broad-casts through reliable mul-ticasting. Lecture Notesin Computer Science, 2367:445–??, 2002. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2367/23670445.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2367/23670445.pdf.

REFERENCES 389

Spencer:2015:DLN

[SEC15] Matt Spencer, Jesse Eick-holt, and Jianlin Cheng. Adeep learning network ap-proach to ab initio proteinsecondary structure predic-tion. IEEE/ACM Transac-tions on Computational Bi-ology and Bioinformatics, 12(1):103–112, January 2015.CODEN ITCBCY. ISSN1545-5963 (print), 1557-9964(electronic).

Schenck:2016:EPM

[SEF+16] Wolfram Schenck, Salem ElSayed, Maciej Foszczynski,Wilhelm Homberg, and DirkPleiter. Evaluation and per-formance modeling of a burstbuffer solution. OperatingSystems Review, 50(3):12–26, December 2016. CODENOSRED8. ISSN 0163-5980(print), 1943-586X (elec-tronic).

Segovia:2010:PPN

[Seg10] Alejandro Segovia. Parallelprogramming with NVIDIACUDA. Linux Journal,2010(200):2:1–2:??, Decem-ber 2010. CODEN LI-JOFX. ISSN 1075-3583(print), 1938-3827 (elec-tronic).

Seifert:1999:ESI

[Sei99] Friedrich Seifert. Entwick-lung von Systemsoftware zurIntegration der Virtual In-terfaceArchitecture (VIA) in

den Linux Betriebssystemk-ern fur optimiertes Mes-sagePassing. (German) [De-velopment of system soft-ware for integration of theVirtual InterfaceArchitec-ture (VIA) in the Linux op-erating system for optimizedmessage passing]. Diplomar-beit, Technische UniversitatChemnitz-Zwickau, Chem-nitz, Germany, 1999. 115 pp.

Sept:1993:DIP

[Sep93] Doug Sept. The design,implementation and perfor-mance of a queue man-ager for PVM. M.s. the-sis, Computer Science De-partment, University of Ten-nessee, Knoxville, Knoxville,TN 37996, USA, 1993. viii +45 pp.

Serot:1997:EPF

[Ser97] J. Serot. Embodying paral-lel functional skeletons: Anexperimental implementa-tion on top of MPI. Lec-ture Notes in Computer Sci-ence, 1300:629–??, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Sevenich:1998:PPU

[Sev98] Richard Sevenich. Paral-lel processing using PVM.Linux Journal, 45:??, Jan-uary 1998. CODEN LI-JOFX. ISSN 1075-3583(print), 1938-3827 (elec-tronic).

REFERENCES 390

Scott:1998:PWN

[SFG98] S. L. Scott, M. Fischer,and A. Geist. PVM onWindows and NT clusters.Lecture Notes in ComputerScience, 1497:231–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Schoinas:1994:FGA

[SFL+94] Ioannis Schoinas, BabakFalsafi, Alvin R. Lebeck,Steven K. Reinhardt, James R.Larus, and David A. Wood.Fine-grain access control fordistributed shared memory.ACM SIGPLAN Notices,29(11):297–306, November1994. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). URL http://www.acm.org:80/pubs/citations/proceedings/asplos/195473/p297-schoinas/.

Steuwer:2015:GPP

[SFLD15] Michel Steuwer, ChristianFensch, Sam Lindley, andChristophe Dubach. Gener-ating performance portablecode using rewrite rules:from high-level functionalexpressions to high-performanceOpenCL code. ACM SIG-PLAN Notices, 50(9):205–217, September 2015. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Siegelin:1995:BPW

[SFO95] C. Siegelin, U. Finger, andC. O’Donnell. Boostingthe performance of work-stations through WARP-memory. In Haridi et al.[HAM95b], pages 703–706.ISBN 3-540-60247-X. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.I553 1995.

Shen:2013:ACE

[SFSV13] Jie Shen, Jianbin Fang,Henk Sips, and Ana LuciaVarbanescu. An application-centric evaluation of OpenCLon multi-core CPUs. Par-allel Computing, 39(12):834–850, December 2013.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819113001014.

Selikhov:2005:CMB

[SG05] A. Selikhov and C. Germain.A Channel Memory basedfault tolerance for MPI ap-plications. Future Gen-eration Computer Systems,21(5):709–715, May 2005.CODEN FGSEVI. ISSN0167-739X (print), 1872-7115 (electronic).

Sharma:2012:SRP

[SG12] Subodh Sharma and GaneshGopalakrishnan. A sound re-duction of persistent-sets fordeadlock detection in MPI

REFERENCES 391

applications. Lecture Notesin Computer Science, 7498:194–209, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33296-8_15/.

Steuwer:2014:SHL

[SG14] Michel Steuwer and SergeiGorlatch. SkelCL: a high-level extension of OpenCLfor multi-GPU systems. TheJournal of Supercomput-ing, 69(1):25–33, July 2014.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://link.springer.com/article/10.1007/s11227-014-1213-y.

Sack:2015:CAM

[SG15] Paul Sack and WilliamGropp. Collective algo-rithms for multiported torusnetworks. ACM Trans-actions on Parallel Com-puting (TOPC), 1(2):12:1–12:??, January 2015. CO-DEN ???? ISSN 2329-4949(print), 2329-4957 (elec-tronic).

Sunderam:1994:PCC

[SGDM94] V. S. Sunderam, G. A.Geist, J. Dongarra, andR. Manchek. The PVMconcurrent computing sys-tem: Evolution, experi-ences, and trends. Paral-

lel Computing, 20(4):531–545, March 31, 1994. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1994&volume=20&issue=4&aid=861.

Schneider:2012:MAC

[SGH12] Timo Schneider, RobertGerstenberger, and TorstenHoefler. Micro-applicationsfor communication dataaccess patterns and MPIdatatypes. Lecture Notesin Computer Science, 7490:121–131, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_17/.

Solsona:2001:IEI

[SGHL01] Francesc Solsona, FrancescGine, Porfidio Hernandez,and Emilio Luque. Imple-menting explicit and implicitcoscheduling in a PVM en-vironment (research note).Lecture Notes in ComputerScience, 1900:1165–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1900/19001165.htm;http://link.springer-

ny.com/link/service/series/

REFERENCES 392

0558/papers/1900/19001165.pdf.

Saito:2003:LSP

[SGJ+03] Hideki Saito, Greg Gaertner,Wesley Jones, Rudolf Eigen-mann, Hidetoshi Iwashita,Ron Lieberman, Matthijsvan Waveren, and BrianWhitney. Large system per-formance of SPEC OMPbenchmark suites. Inter-national Journal of Paral-lel Programming, 31(3):197–209, June 2003. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL /ips/frames/Refs/referenceskapmain.asp?J=4773&I=33&A=3&LK=NM; http://ipsapp007.kluweronline.com/content/getfile/4773/33/3/abstract.htm; http://ipsapp007.kluweronline.com/content/getfile/4773/33/3/fulltext.pdf.

Solsona:2000:MCM

[SGL+00] Francesc Solsona, FrancescGine, Josep Lerida, Por-fidio Hernandez, and EmilioLuque. Monito: a commu-nication monitoring tool fora PVM–Linux environment.Lecture Notes in ComputerScience, 1908:233–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080233.htm;

http://link.springer-ny.com/link/service/series/0558/papers/1908/19080233.pdf.

Sekharan:1995:LBM

[SGS95] Chandra N. Sekharan, Vi-neet Goel, and R. Srid-har. Load balancing meth-ods for ray tracing and bi-nary tree computing usingPVM. Parallel Comput-ing, 21(12):1963–1978, De-cember 12, 1995. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1995&volume=21&issue=12&aid=1028.

Stone:2010:OPP

[SGS10] John E. Stone, David Go-hara, and Guochun Shi.OpenCL: a parallel pro-gramming standard for het-erogeneous computing sys-tems. Computing in Sci-ence and Engineering, 12(3):66–73, May/June 2010.CODEN CSENFA. ISSN0740-7475 (print), 1558-1918(electronic).

Scherer:2000:APO

[SGZ00] Alex Scherer, Thomas Gross,and Willy Zwaenepoel.Adaptive parallelism forOpenMP task parallel pro-grams. Lecture Notes inComputer Science, 1915:

REFERENCES 393

113–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1915/19150113.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1915/19150113.pdf.

Schmidt:1994:IAP

[SH94] M. Schmidt and R. Hanisch.Implementation of an airpollution transport model onparallel hardware. In Dekkeret al. [DSZ94], pages 277–284. ISBN 0-444-81784-0.LCCN QA76.58.E98 1994.

Sitsky:1996:MLW

[SH96] D. Sitsky and E. Hayashi.An MPI library which usespolling, interrupts and re-mote copying for the Fu-jitsu AP1000+. In Liet al. [LHHM96], pages 43–49. ISBN 0-8186-7460-1. LCCN QA76.58.I56731996. IEEE catalog number96TB100044.

Song:2014:DAT

[SH14] Sukhyun Song and Jeffrey K.Hollingsworth. Designingand auto-tuning parallel 3-D FFT for computation-communication overlap. ACMSIGPLAN Notices, 49(8):181–192, August 2014. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867

(print), 1558-1160 (elec-tronic).

Shen:1995:PSM

[She95] H. Shen. Parallel k-set mu-tual range-join in hyper-cubes. Microprocessing andMicroprogramming, 41(7):443–448, November 1995.CODEN MMICDT. ISSN0165-6074 (print), 1878-7061(electronic).

Sloot:1994:CIO

[SHH94a] P. M. A. Sloot, A. G. Hoek-stra, and L. O. Hertzberger.A comparison of the Iserver-Occam, Parix, Express, andPVM programming envi-ronments on a ParsytecGCel. In Gentzsch andHarms [GH94], pages 253–259. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Sloot:1994:CIP

[SHH94b] P. M. A. Sloot, A. G. Hoek-stra, and L. O. Hertzberger.A comparison of the Iserver-Occam, Parix, Express, andPVM programming envi-ronments on a ParsytecGCel. In Gentzsch andHarms [GH94], pages 253–259. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

REFERENCES 394

Sojka:2018:IEM

[SHHC18] Radim Sojka, David Horak,Vaclav Hapla, and Mar-tin Cermak. The impactof enabling multiple sub-domains per MPI processin the TFETI domain de-composition method. Ap-plied Mathematics and Com-putation, 319(??):586–597,February 15, 2018. CO-DEN AMHCBQ. ISSN0096-3003 (print), 1873-5649(electronic). URL http://www.sciencedirect.com/science/article/pii/S0096300317304927.

Sato:2001:CEO

[SHHI01] Mitsuhisa Sato, HiroshiHarada, Atsushi Hasegawa,and Yutaka Ishikawa. Cluster-enabled OpenMP: An OpenMPcompiler for the SCASHsoftware distributed sharedmemory system. ScientificProgramming, 9(2–3):123–130, Spring–Summer 2001.CODEN SCIPEV. ISSN1058-9244 (print), 1875-919X (electronic). URLhttp://iospress.metapress.com/app/home/contribution.asp%3Fwasp=7pab6qgbaf8vxg991rwy%26referrer=parent%26backto=issue%2C6%2C11%3Bjournal%2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

Shing:1994:UPC

[Shi94] C.-C. Shing. Use PVMon computation of anal-ysis of repeated measure-ment designs. In Sall and

Lehman [SL94a], pages 139–142. ISBN 1-886658-00-5.LCCN QA276.4.S95 1994.

Samadi:2014:LGU

[SHLM14] Mehrzad Samadi, Amir Hor-mati, Janghaeng Lee, andScott Mahlke. LeveragingGPUs using cooperative loopspeculation. ACM Trans-actions on Architecture andCode Optimization, 11(1):3:1–3:??, February 2014.CODEN ???? ISSN1544-3566 (print), 1544-3973(electronic).

Sato:2010:BLL

[SHM+10] Mitsuhisa Sato, ToshihiroHanawa, Matthias S. Muller,Barbara M. Chapman, andBronis R. de Supinski, ed-itors. Beyond Loop LevelParallelism in OpenMP:Accelerators, Tasking andMore: 6th InternationalWorkshop on OpenMP,IWOMP 2010, Tsukuba,Japan, June 14–16, 2010Proceedings, volume 6132of Lecture Notes in Com-puter Science. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2010. CO-DEN LNCSD9. ISBN 3-642-13216-2 (print), 3-642-13217-0 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/

REFERENCES 395

content/978-3-642-13217-9.

Samadi:2012:AIA

[SHM+12] Mehrzad Samadi, Amir Hor-mati, Mojtaba Mehrara,Janghaeng Lee, and ScottMahlke. Adaptive input-aware compilation for graph-ics engines. ACM SIGPLANNotices, 47(6):13–22, June2012. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). PLDI ’12 pro-ceedings.

Shah:2000:FCS

[SHPT00] Sanjiv Shah, Grant Haab,Paul Petersen, and JoeThroop. Flexible con-trol structures for paral-lelism in OpenMP. Con-currency: practice and ex-perience, 12(12):1219–1239,October 2000. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500348/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500348&PLACEBO=IE.pdf.

Sato:2001:OGR

[SHTS01] Mitsuhisa Sato, MotonariHirano, Yoshio Tanaka, andSatoshi Sekiguchi. Om-niRPC: a Grid RPC facilityfor cluster and global com-puting in OpenMP. Lec-ture Notes in Computer Sci-ence, 2104:130–??, 2001.

CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2104/21040130.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2104/21040130.pdf.

Siegel:1992:FFS

[Sie92a] H. J. Siegel, editor. Frontiers’92, the Fourth Symposiumon the Frontiers of MassiveParallel Computation, Octo-ber 19–21, 1992, McLean,Virginia. IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1992. ISBN 0-8186-2772-7. LCCN QA76.58.S95 1992.IEEE catalog no. 92CH3185-6.

Siegel:1992:FSF

[Sie92b] H. J. Siegel, editor. TheFourth Symposium on theFrontiers of Massively Par-allel Computation: Fron-tiers ’92 / October 19–21, 1992, McLean Virginia.IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring,MD 20910, USA, 1992.ISBN 0-8186-2772-7. LCCNQA76.58.S95 1992. IEEEcatalog number 92CH3185-6.

Siegal:1994:PEI

[Sie94] Howard Jay Siegal, editor.

REFERENCES 396

Proceedings / Eighth Inter-national Parallel Process-ing Symposium, April 26–29, 1994, Cancun, Mex-ico. IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring,MD 20910, USA, 1994.ISBN 0-8186-5602-6. LCCNQA76.58.I58 1994. IEEEcatalog no. 94CH34819.

Silvester:1996:SEE

[Sil96] P. P. Silvester, editor. Soft-ware for electrical engineer-ing analysis and design:Third International Confer-ence on Software for Elec-trical Engineering Analy-sis and Design, Electrosoft’96, Pisa, Italy. Computa-tional Mechanics Publica-tions, Boston, MA, USA,1996. ISBN 1-85312-395-1.LCCN TK5.I59 1996.

Sincovec:1993:SCP

[Sin93] Richard F. Sincovec, editor.SIAM Conference on Par-allel Processing for Scien-tific Computing (6th: 1993:Norfolk, VA, USA). Soci-ety for Industrial and Ap-plied Mathematics, Philadel-phia, PA, USA, 1993. ISBN0-89871-315-3. LCCN QA76.58 S55 1993. Two vol-umes.

Silla:2017:BRG

[SIRP17] Federico Silla, Sergio Iserte,Carlos Reano, and JavierPrades. On the benefits

of the remote GPU virtu-alization mechanism: TherCUDA case. Concurrencyand Computation: Prac-tice and Experience, 29(13),July 10, 2017. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Sharma:2017:PDR

[SIS17] Prateek Sharma, David Ir-win, and Prashant Shenoy.Portfolio-driven resourcemanagement for transientcloud servers. Proceedingsof the ACM on Measurementand Analysis of ComputingSystems (POMACS), 1(1):5:1–5:??, June 2017. CO-DEN ???? ISSN 2476-1249.URL http://dl.acm.org/citation.cfm?id=3084442.

Sistare:2002:UHP

[SJ02] Steven J. Sistare and Christo-pher J. Jackson. Ultra-high performance communi-cation with MPI and the SunFire(TM ) link interconnect.In IEEE [IEE02], page ??ISBN 0-7695-1524-X. LCCN???? URL http://www.sc-2002.org/paperpdfs/pap.pap142.pdf.

Szo:2017:PET

[SJK+17a] Mate Szoke, Tamas IstvanJozsa, Adam Koleszar,Irene Moulitsas, and LaszloKonozsy. Performance eval-uation of a two-dimensionallattice Boltzmann solver

REFERENCES 397

using CUDA and PGASUPC based parallelisation.ACM Transactions on Math-ematical Software, 44(1):8:1–8:??, July 2017. CO-DEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic). URL https://dl.acm.org/citation.cfm?id=3085590.

Szoke:2017:PET

[SJK+17b] Mate Szoke, Tamas IstvanJozsa, Adam Koleszar,Irene Moulitsas, and LaszloKonozsy. Performance eval-uation of a two-dimensionallattice Boltzmann solver us-ing CUDA and PGAS UPCbased parallelisation. ACMTransactions on Mathemati-cal Software, 44(1):8:1–8:22,July 2017. CODEN ACM-SCU. ISSN 0098-3500(print), 1557-7295 (elec-tronic).

Samadi:2014:PPB

[SJLM14] Mehrzad Samadi, Davoud AnousheJamshidi, Janghaeng Lee,and Scott Mahlke. Para-prox: pattern-based ap-proximation for data par-allel applications. ACMSIGARCH Computer Ar-chitecture News, 42(1):35–50, March 2014. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic).

Shen:1992:VTD

[SK92] S. Shen and L. Klein-rock. The virtual-time

data-parallel machine. InSiegel [Sie92b], pages 46–53.ISBN 0-8186-2772-7. LCCNQA76.58.S95 1992. IEEEcatalog number 92CH3185-6.

Smith:2000:DPM

[SK00] Lorna Smith and Paul Kent.Development and perfor-mance of a mixed OpenMP/MPI quantum Monte Carlocode. Concurrency: practiceand experience, 12(12):1121–1129, October 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500350/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500350&PLACEBO=IE.pdf.

Sanders:2010:CEI

[SK10] Jason Sanders and EdwardKandrot. CUDA by ex-ample: an introduction togeneral-purpose GPU pro-gramming. Addison-Wes-ley, Reading, MA, USA,2010. ISBN 0-13-138768-5. xix + 290 pp. LCCNQA76.76.A65.

Steinberger:2014:WTB

[SKB+14] Markus Steinberger, MichaelKenzel, Pedro Boechat,Bernhard Kerbl, Mark Dok-ter, and Dieter Schmalstieg.Whippletree: task-basedscheduling of dynamic work-loads on the GPU. ACM

REFERENCES 398

Transactions on Graphics,33(6):228:1–228:??, Novem-ber 2014. CODEN AT-GRDF. ISSN 0730-0301(print), 1557-7368 (elec-tronic).

Skjellum:2004:RTM

[SKD+04] Anthony Skjellum, ArkadyKanevsky, Yoginder S. Dan-dass, Jerrell Watts, StevePaavola, Dennis Cottel,Greg Henley, L. ShaneHebert, Zhenqian Cui, AnnaRounbehler, and The Real-Time Message Passing Inter-face (Mpi and Rt) Forum.The Real-Time MessagePassing Interface Standard(MPI/RT-1.1). Concurrencyand Computation: Practiceand Experience, 16(S1):Si–S322, December 25, 2004.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Subramaniam:1996:CLU

[SKH96] Krishnan R. Subramaniam,Suraj C. Kothari, and DonHeller. A communicationlibrary using active mes-sages to improve perfor-mance of PVM. Jour-nal of Parallel and Dis-tributed Computing, 39(2):146–152, December 15, 1996.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.0162/production;

http://www.idealibrary.com/links/doi/10.1006/jpdc.1996.0162/production/pdf.

Skjellum:1993:SLH

[Skj93] A. Skjellum. Scalable li-braries in a heterogeneousenvironment. In IEEE[IEE93c], pages 13–20. ISBN0-8186-3900-8, 0-8186-3901-6. LCCN QA76.9.D5I5931993. IEEE catalog no.93TH0550-4.

Steinberger:2012:SDS

[SKK+12] Markus Steinberger, Bern-hard Kainz, Bernhard Kerbl,Stefan Hauswiesner, MichaelKenzel, and Dieter Schmal-stieg. Softshell: dynamicscheduling on GPUs. ACMTransactions on Graphics,31(6):161:1–161:??, Novem-ber 2012. CODEN AT-GRDF. ISSN 0730-0301(print), 1557-7368 (elec-tronic).

Spiechowicz:2015:GAM

[SKM15] J. Spiechowicz, M. Kostur,and L. Machura. GPU ac-celerated Monte Carlo sim-ulation of Brownian mo-tors dynamics with CUDA.Computer Physics Com-munications, 191(??):140–149, June 2015. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465515000417.

REFERENCES 399

Satoh:2001:COT

[SKS01] Shigehisa Satoh, KazuhiroKusano, and Mitsuhisa Sato.Compiler optimization tech-niques for OpenMP pro-grams. Scientific Pro-gramming, 9(2–3):131–142,Spring–Summer 2001. CO-DEN SCIPEV. ISSN 1058-9244 (print), 1875-919X(electronic). URL http://iospress.metapress.com/app/home/contribution.asp%3Fwasp=7pab6qgbaf8vxg991rwy%26referrer=parent%26backto=issue%2C7%2C11%3Bjournal%2C1%2C9%3Blinkingpublicationresults%2C1%2C1.

Sall:1994:CIS

[SL94a] J. Sall and A. Lehman, ed-itors. Computational inten-sive statistical methods: 26thSymposium on the interface— June 15-18, 1994, Re-search Triangle Park, NC,USA, volume 26 of Com-puting Science and Statis-tics Conference. Fairfax Sta-tion: Interface Foundation ofNorth America, ????, 1994.ISBN 1-886658-00-5. LCCNQA276.4.S95 1994.

Scales:1994:DES

[SL94b] D. J. Scales and M. S. Lam.The design and evaluationof a shared object systemfor distributed memory ma-chines. In USENIX [USE94],pages 101–114. ISBN 1-880446-66-9. LCCN QA76.76 O63 U87 1994.

Swanson:1995:PAP

[SL95] Eric Swanson and Terry P.Lybrand. PVM-AMBER: aparallel implementation ofthe AMBER molecular me-chanics package for work-station clusters. Journal ofComputational Chemistry,16(9):1131–1140, Septem-ber 1995. CODEN JC-CHDD. ISSN 0192-8651(print), 1096-987X (elec-tronic).

Shyu:2000:APV

[SL00] Shyong-Jian Shyu and B. M. T.Lin. An application ofparallel virtual machineframework to film pro-duction problem. Com-puters and Mathematicswith Applications, 39(12):53–62, June 2000. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic). URL http://www.sciencedirect.com/science/article/pii/S0898122100001292.

Skjellum:1995:EAM

[SLG95] Anthony Skjellum, EwingLusk, and William Gropp.Early applications in theMessage-Passing Interface(MPI). International Jour-nal of Supercomputer Ap-plications and High Perfor-mance Computing, 9(2):79–94, Summer 1995. CODENIJSCFG. ISSN 1078-3482.

REFERENCES 400

Scherer:1999:TAP

[SLGZ99] Alex Scherer, Honghui Lu,Thomas Gross, and WillyZwaenepoel. Transpar-ent adaptive parallelismon NOWs using OpenMP.ACM SIGPLAN Notices, 34(8):96–106, August 1999.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). URL http://www.acm.org/pubs/citations/proceedings/ppopp/301104/p96-scherer/.

Samadi:2014:SPS

[SLJ+14] Mehrzad Samadi, JanghaengLee, D. Anoushe Jamshidi,Scott Mahlke, and AmirHormati. Scaling perfor-mance via self-tuning ap-proximation for graphics en-gines. ACM Transactions onComputer Systems, 32(3):7:1–7:??, September 2014.CODEN ACSYEC. ISSN0734-2071 (print), 1557-7333(electronic).

Su:2012:CPB

[SLN+12] ChunYi Su, Dong Li,Dimitrios S. Nikolopou-los, Matthew Grove, KirkCameron, and Bronis R.de Supinski. Critical path-based thread placement forNUMA systems. ACMSIGMETRICS PerformanceEvaluation Review, 40(2):106–112, September 2012.CODEN ???? ISSN

0163-5999 (print), 1557-9484(electronic).

Sloan:2005:HPL

[Slo05] Joseph D. (Joseph Don-ald) Sloan. High perfor-mance Linux clusters withOSCAR, Rocks, openMosix,and MPI. O’Reilly & As-sociates, Inc., 981 ChestnutStreet, Newton, MA 02164,USA, 2005. ISBN 0-596-00570-9. xv + 350 pp.LCCN QA76.58; QA76.58.S56 2005eb; QA76.58 .S562005; QA76.58 .S58 2005;QA76.58 .S595 2005. URLhttp://www.oreilly.com/catalog/9780596005702.

Squyres:1996:CBP

[SLS96] J. M. Squyres, A. Lums-daine, and R. L. Steven-son. A cluster-based paral-lel image processing toolkit.In Grinstein and Erbacher[GE96], pages 228–239. CO-DEN PSISDG. ISBN 0-8194-2030-1. ISSN 0277-786X(print), 1996-756X (elec-tronic). LCCN TS510.S63v.2656.

Shires:2002:EHM

[SM02] D. Shires and R. Mohan.An evaluation of HPF andMPI approaches and perfor-mance in unstructured finiteelement simulations. Jour-nal of Mathematical Mod-elling and Algorithms, 1(3):153–167, 2002. CODEN ????ISSN 1570-1166.

REFERENCES 401

Shires:2003:OPF

[SM03] Dale Shires and Ram Mo-han. Optimization andperformance of a Fortran90 MPI-based unstructuredcode on large-scale paral-lel systems. The Journalof Supercomputing, 25(2):131–141, June 2003. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://ipsapp009.kluweronline.com/content/getfile/5189/44/4/abstract.htm; http://ipsapp009.kluweronline.com/content/getfile/5189/44/4/fulltext.pdf.

Simos:2007:CMS

[SM07] Theodore E. Simos andGeorge Maroulis, editors.Computation in ModernScience and Engineering:Proceedings of the [Fifth]International Conferenceon Computational Meth-ods in Science and En-gineering 2007 (ICCMSE2007), Corfu, Greece, 25–30 September 2007, volume2A, 2B of AIP ConferenceProceedings (#963). Amer-ican Institute of Physics,Woodbury, NY, USA, 2007.ISBN 0-7354-0476-3 (set),0-7354-0477-1 (vol. 1), 0-7354-0478-X (vol. 2). ISSN0094-243X (print), 1551-7616 (electronic), 1935-0465. LCCN Q183.9 .I5242007. URL http://www.

springer.com/physics/atoms/book/978-0-7354-0478-6.

Siegel:2008:CSE

[SMAC08] Stephen F. Siegel, Anas-tasia Mironova, George S.Avrunin, and Lori A. Clarke.Combining symbolic execu-tion with model checkingto verify parallel numeri-cal programs. ACM Trans-actions on Software Engi-neering and Methodology, 17(2):10:1–10:??, April 2008.CODEN ATSMER. ISSN1049-331X (print), 1557-7392 (electronic).

Shterenlikht:2015:FC

[SMCH15] Anton Shterenlikht, LeeMargetts, Luis Cebamanos,and David Henty. Fortran2008 coarrays. ACM FortranForum, 34(1):10–30, April2015. CODEN ???? ISSN1061-7264 (print), 1931-1311(electronic).

Smith:1993:MBA

[Smi93a] K. A. Smith. Multi-processor based accidentusing PVM. In Sin-covec [Sin93], pages 262–265.ISBN 0-89871-315-3. LCCNQA 76.58 S55 1993. Two vol-umes.

Smith:1993:DSI

[Smi93b] S. L. Smith. Dynamicscheduling of irregularlystructured parallel computa-tions in heterogeneous dis-tributed systems. ACM

REFERENCES 402

SIGPLAN Notices, 28(1):86, January 1993. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Schardl:2017:TEF

[SML17] Tao B. Schardl, William S.Moses, and Charles E. Leis-erson. Tapir: Embeddingfork-join parallelism intoLLVM’s intermediate rep-resentation. ACM SIG-PLAN Notices, 52(8):249–265, August 2017. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Sandes:2016:MMA

[SMM+16] Edans F. De O. Sandes,Guillermo Miranda, XavierMartorell, Eduard Ayguade,George Teodoro, and AlbaC. M. A. De Melo. MASA:a multiplatform architecturefor sequence aligners withblock pruning. ACM Trans-actions on Parallel Com-puting (TOPC), 2(4):28:1–28:??, March 2016. CO-DEN ???? ISSN 2329-4949(print), 2329-4957 (elec-tronic).

Sochacki:1993:DCW

[SMOE93] J. S. Sochacki, D. Mitchum,P. O’Leary, and R. E. Ewing.Distributed computation ofwave propagation models us-ing PVM. In IEEE [IEE93e],pages 22–33. ISBN 0-8186-4340-4 (paperback), 0-8186-

4341-2 (microfiche), 0-8186-4342-0 (hardback), 0-8186-4346-3 (CD-ROM). ISSN1063-9535. LCCN QA76.5.S96 1993.

Silva:2000:HPC

[SMS00] Luıs Moura Silva, PauloMartins, and Joao GabrielSilva. Heterogeneous par-allel computing using Javaand WMPI. Concur-rency: practice and ex-perience, 12(11):1077–1091,September 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76000189/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76000189&PLACEBO=IE.pdf.

Su:2006:APP

[SMSW06] Hai-Jun Su, J. MichaelMcCarthy, Masha Sosonk-ina, and Layne T. Wat-son. Algorithm 857: POL-SYS GLP—a parallel gen-eral linear product homo-topy code for solving poly-nomial systems of equa-tions. ACM Transactions onMathematical Software, 32(4):561–579, December 2006.CODEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic).

Sitsky:1996:IMU

[SMTW96] D. Sitsky, P. Mackerras,A. Tridgell, and D. Walsh.

REFERENCES 403

Implementing MPI underAP/ linux. In IEEE[IEE96i], pages 32–39. ISBN0-8186-7533-0. LCCNQA76.642 .M67 1996.

Sunderam:2001:CAP

[SN01] Vaidy Sunderam and ZsoltNemeth. A comparativeanalysis of PVM/MPI andcomputational Grids. Lec-ture Notes in ComputerScience, 2131:14–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310014.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310014.pdf.

Suciu:2010:PIN

[SNMP10] A. Suciu, I. Nagy, K. Mar-ton, and I. Pinca. Par-allel implementation of theNIST Statistical Test Suite.In Ioan Alfred Letia, editor,Proceedings, 2010 IEEE 6thInternational Conference onIntelligent Computer Com-munication and Processing:Cluj-Napoca, Romania, Au-gust 26–28, 2010, pages 363–368. IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 2010. ISBN 1-4244-8228-3 (print), 1-4244-8230-5 (electronic). LCCNQA76.76.E95. URL http:

//ieeexplore.ieee.org/servlet/opac?punumber=5598248. IEEE catalog num-ber CFP1009D-ART.

Sintorn:2011:EAF

[SOA11] Erik Sintorn, Ola Olsson,and Ulf Assarsson. Anefficient alias-free shadowalgorithm for opaque andtransparent objects usingper-triangle shadow vol-umes. ACM Transactionson Graphics, 30(6):153:1–153:??, December 2011. CO-DEN ATGRDF. ISSN0730-0301 (print), 1557-7368(electronic).

Snir:1996:MCR

[SOHL+96] Marc Snir, Steve W. Otto,Steven Huss-Lederman, David W.Walker, and Jack Dongarra.MPI: the complete reference.MIT Press, Cambridge, MA,USA, 1996. ISBN 0-262-69184-1. xii + 336 pp.LCCN QA76.642.M65 1996.US$27.50.

Snir:1998:MCR

[SOHL+98] Marc Snir, Steve W. Otto,Steven Huss-Lederman, David W.Walker, and Jack Don-garra. MPI: The Com-plete Reference. Volume 1,The MPI-1 Core. Scien-tific and Engineering Com-putation. MIT Press, Cam-bridge, MA, USA, secondedition, September 1998.ISBN 0-262-69215-5 (vol. 1),0-262-69216-3 (set). 450

REFERENCES 404

pp. LCCN QA76.642 .M651998. US$35 (paperback).URL http://mitpress.mit.edu/book-home.tcl?isbn=0262692155. See alsovolume 2 [GHLL+98].

SousaPinto:2001:PEI

[Sou01] Jorge Sousa Pinto. Par-allel evaluation of interac-tion nets with MPINE. Lec-ture Notes in Computer Sci-ence, 2051:353–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2051/20510353.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2051/20510353.pdf.

Sidonio:1999:PBI

[SP99] N. Sidonio and A. Pereira.A parallel N -body integra-tor using MPI. LectureNotes in Computer Science,1573:627–??, 1999. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Stpiczynski:2011:SKB

[SP11] Przemyslaw Stpiczynski andJoanna Potiopa. Solv-ing a kind of boundary-value problem for ordinarydifferential equations usingFermi — the next gen-eration CUDA computingarchitecture. Journal of

Computational and AppliedMathematics, 236(3):384–393, September 1, 2011.CODEN JCAMDI. ISSN0377-0427 (print), 1879-1778(electronic). URL http://www.sciencedirect.com/science/article/pii/S0377042711004237.

Satofuka:1995:PCF

[SPE95] N. Satofuka, Jacques Peri-aux, and Akin Ecer, ed-itors. Parallel computa-tional fluid dynamics: newalgorithms and applications:proceedings of the ParallelCFD ’94 Conference, Ky-oto, Japan, 16–19 May 1994.Elsevier, Amsterdam, TheNetherlands, 1995. ISBN 0-444-82317-4. LCCN QA911.P35 1994.

Shaw:1995:ADA

[SPH95] R. A. (Richard A.) Shaw,H. E. (Harry E.) Payne, andJ. J. E. (Jeffrey J. E.) Hayes,editors. Astronomical dataanalysis software and sys-tems IV: meeting held atBaltimore, Maryland, 25–28September 1994, volume 77of Astronomical Society ofthe Pacific Conference Se-ries. Astronomical Society ofthe Pacific, San Francisco,CA, USA, 1995. ISBN 0-937707-96-1. ISSN 1080-7926. LCCN QB51.3.E43A87 1994.

Skjellum:1996:TTM

[SPH96] A. Skjellum, B. Protopopov,and S. Hebert. A thread

REFERENCES 405

taxonomy for MPI. InIEEE [IEE96i], pages 50–57.ISBN 0-8186-7533-0. LCCNQA76.642 .M67 1996.

Sener:1996:DPP

[SPK96] C. Sener, Y. Paker, andA. Kiper. Data-parallel pro-gramming on Helios, paral-lel environment and PVM.In Yetongnon and Hariri[YH96], pages 2–?? ISBN???? LCCN ????

Subramoni:2012:DSI

[SPK+12] H. Subramoni, S. Potluri,K. Kandalla, B. Barth, J. Vi-enne, J. Keasler, K. Tomko,K. Schulz, A. Moody, andD. K. Panda. Designof a scalable InfiniBandtopology service to en-able network-topology-awareplacement of processes. InHollingsworth [Hol12], pages70:1–70:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a076.pdf.

Silva:1999:DPP

[SPL99] F. Silva, H. Paulino, andL. Lopes. DipSystem: aparallel programming sys-tem for distributed memoryarchitectures. In Dongarraet al. [DLM99], pages 525–532. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Schmidl:2012:PAT

[SPL+12] Dirk Schmidl, Peter Philip-pen, Daniel Lorenz, Chris-tian Rossel, and MarkusGeimer. Performance anal-ysis techniques for task-based OpenMP applica-tions. Lecture Notes inComputer Science, 7312:196–209, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_15/.

Saldana:2010:MPM

[SPM+10] Manuel Saldana, Arun Patel,Christopher Madill, DanielNunes, Danyao Wang, PaulChow, Ralph Wittig, HenryStyles, and Andrew Put-nam. MPI as a programmingmodel for high-performancereconfigurable computers.ACM Transactions on Re-configurable Technology andSystems (TRETS), 3(4):22:1–22:??, November 2010.CODEN ???? ISSN1936-7406 (print), 1936-7414(electronic).

Squyres:2003:CAL

[Squ03] Jeffrey M. Squyres. Acomponent architecture forLAM/MPI (citation only).ACM SIGPLAN Notices,page ??, 2003. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

REFERENCES 406

Sivaraman:1995:PSP

[SR95] H. Sivaraman and C. S.Raghavendra. Paralleliz-ing sequential programs to acluster of workstations. InAgrawal [Agr95a], pages 38–41. ISBN 0-8493-2618-4.LCCN QA76.58.I34 1995.

Sivaraman:1996:AAD

[SR96] H. Sivaraman and C. S.Raghavendra. ADDT: Au-tomatic data distributiontool for porting programsto PVM. In El-Rewini andShriver [ERS96], pages 557–564. ISBN 0-8186-7324-9.ISSN 1060-3425. LCCN ????Five volumes.

Szalay:2011:FCD

[SR11] Zsofia Szalay and Janos Ro-honczy. Fast calculation ofDNMR spectra on CUDA-enabled graphics card. Jour-nal of Computational Chem-istry, 32(7):1262–1270, May2011. CODEN JCCHDD.ISSN 0192-8651 (print),1096-987X (electronic).

Speck:2012:MST

[SRK+12] R. Speck, D. Ruprecht,R. Krause, M. Emmett,M. Minion, M. Winkel,and P. Gibbon. A mas-sively space-time paral-lel N -body solver. InHollingsworth [Hol12], pages92:1–92:?? ISBN 1-4673-0804-8. URL http://conferences.computer.

org/sc/2012/papers/1000a083.pdf.

Schmidt:1994:EAO

[SS94] B. K. Schmidt and V. S.Sunderam. Empirical anal-ysis of overheads in clus-ter environments. Concur-rency: practice and expe-rience, 6(1):1–32, February1994. CODEN CPEXEI.ISSN 1040-3108.

Szymanski:1996:LCR

[SS96] Boleslaw K. Szymanski andBalaram Sinharoy, editors.Languages, Compilers andRun-Time Systems for Scal-able Computers, 22–24 May1995, Troy, NY, USA.Kluwer Academic PublishersGroup, Norwell, MA, USA,and Dordrecht, The Nether-lands, 1996. ISBN 0-7923-9635-9. LCCN QA76.58.L371996.

Silva:1999:IME

[SS99] P. Silva and J. G. Silva. Im-plementing MPI-2 extendedcollective operations. InDongarra et al. [DLM99],pages 125–132. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Shan:2001:CMS

[SS01] Hongzhang Shan and Jaswinder PalSingh. A comparison of MPI,SHMEM and cache-coherentshared address space pro-gramming models on a

REFERENCES 407

tightly-coupled multiproces-sors. International Jour-nal of Parallel Programming,29(3):283–318, June 2001.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic). URL http://ipsapp009.lwwonline.com/content/getfile/4773/21/3/abstract.htm; http://ipsapp009.lwwonline.com/content/getfile/4773/21/3/fulltext.pdf.

Schwarz:2009:GFG

[SS09] Michael Schwarz and MarcStamminger. GPU: FastGPU-based adaptive tessel-lation with CUDA. Compu-ter Graphics Forum, 28(2):365–374, April 2009. CO-DEN CGFODY. ISSN0167-7055 (print), 1467-8659(electronic).

Shan:2012:OAA

[SSAS12] Hongzhang Shan, ErichStrohmaier, James Amund-son, and Eric G. Stern. Op-timizing the advanced accel-erator simulation frameworkSynergia using OpenMP.Lecture Notes in Com-puter Science, 7312:140–153, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_11/.

Sankaran:2005:LMC

[SSB+05] Sriram Sankaran, Jeffrey M.Squyres, Brian Barrett,Vishal Sahay, Andrew Lums-daine, Jason Duell, PaulHargrove, and Eric Roman.The LAM/MPI checkpoint/restart framework: System-initiated checkpointing. TheInternational Journal ofHigh Performance Comput-ing Applications, 19(4):479–493, Winter 2005. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/19/4/479.full.pdf+html.

Sataric:2016:HOM

[SSB+16] Bogdan Sataric, VladimirSlavnic, Aleksandar Belic,Antun Balaz, PaulsamyMuruganandam, and Sad-han K. Adhikari. HybridOpenMP/MPI programs forsolving the time-dependentGross–Pitaevskii equationin a fully anisotropic trap.Computer Physics Com-munications, 200(??):411–417, March 2016. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465515004440.

Sotomayor:2017:ACG

[SSB+17] Rafael Sotomayor, Luis MiguelSanchez, Javier GarciaBlas, Javier Fernandez, and

REFERENCES 408

J. Daniel Garcia. Au-tomatic CPU/GPU gen-eration of multi-versionedOpenCL kernels for C++scientific applications. In-ternational Journal of Paral-lel Programming, 45(2):262–282, April 2017. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-tronic). URL http://link.springer.com/article/10.1007/s10766-016-0425-6.

Silva:1996:IDS

[SSC96] L. M. Silva, J. G. Silva, andS. Chapple. Implementingdistributed shared memoryon top of MPI: the DSMPIlibrary. In IEEE [IEE96g],pages 50–57. ISBN 0-8186-7376-1. LCCN QA76.58 .E971996. IEEE order numberPR07376.

Silva:1997:IPD

[SSC97] Luis M. Silva, Joao GabrielSilva, and Simon Chapple.Implementation and perfor-mance of DSMPI. Scien-tific Programming, 6(2):201–214, Summer 1997. CODENSCIPEV. ISSN 1058-9244(print), 1875-919X (elec-tronic).

Silva:1995:PCR

[SSCC95] L. M. Silva, J. G. Silva,S. Chapple, and L. Clarke.Portable checkpointing andrecovery. In IEEE [IEE95k],pages 188–195. ISBN 0-8186-7088-6. LCCN QA76.9.D5

I328 1995. IEEE catalog no.95TB8075.

Skjellum:1994:DEZ

[SSD+94] A. Skjellum, S. G. Smith,N. E. Doss, A. P. Leung,and M. Morari. The de-sign and evolution of Zip-code. Parallel Computing, 20(4):565–596, March 31, 1994.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Sabne:2012:ECO

[SSE12] Amit Sabne, Putt Sakdhna-gool, and Rudolf Eigen-mann. Effects of compileroptimizations in OpenMPto CUDA translation. Lec-ture Notes in Computer Sci-ence, 7312:169–181, 2012.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_13/.

Stellner:1995:CMP

[SSG95] G. Stellner, M. Schumann,and M. Girnghuber. Com-paring message-passing li-braries with the SPY anal-ysis environment. Informa-tionstechnik und technischeInformatik: IT + TI, 37(2):46–52, April 1995. CODENITINEV. ISSN 0944-2774.

Sosa:2000:IQC

[SSGF00] C. P. Sosa, G. Scalmani,R. Gomperts, and M. J.Frisch. Ab initio quantum

REFERENCES 409

chemistry on a ccNUMA ar-chitecture using openMP.III. Parallel Computing, 26(7–8):843–856, July 2000.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336 (electronic). URLhttp://www.elsevier.nl/gej-ng/10/35/21/42/29/25/abstract.html; http://www.elsevier.nl/gej-ng/10/35/21/42/29/25/article.pdf.

Sala:2008:PHP

[SSH08] Marzio Sala, W. F. Spotz,and M. A. Heroux. PyTrili-nos: High-performancedistributed-memory solversfor Python. ACM Transac-tions on Mathematical Soft-ware, 34(2):7:1–7:33, March2008. CODEN ACMSCU.ISSN 0098-3500 (print),1557-7295 (electronic).

Schafers:1995:TGP

[SSKF95] L. Schafers, C. Scheidler,and O. Kramer-Fuhrmann.TRAPPER: a graphical pro-gramming environment forparallel systems. FutureGeneration Computer Sys-tems, 11(4-5):351–361, Au-gust 1995. CODEN FG-SEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic).

Squyres:1997:DEM

[SSL97] J. M. Squyres, B. Saphir,and A. Lumsdaine. The de-sign and evolution of the

MPI-2 C++ interface. Lec-ture Notes in ComputerScience, 1343:57–??, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Stone:1994:PSO

[SSN94] L. C. Stone, S. B. Shukla,and B. Neta. Parallelsatellite orbit prediction us-ing a workstation cluster.Computers and Mathemat-ics with Applications, 28(8):1–8, October 1994. CO-DEN CMAPDK. ISSN0898-1221 (print), 1873-7668(electronic).

Shelton:1994:FPS

[SSP+94] W. A. Shelton, G. M. Stocks,F. J. Pinski, R. G. Jor-dan, Y. Liu, L. Qui, J. B.Staunton, D. D. Johnson,and B. Ginatempo. Firstprinciples simulation of ma-terials properties. In Pierceand Regnier [PR94b], pages103–110. ISBN 0-8186-5680-8, 0-8186-5681-6. LCCNQA76.58.S32 1994. IEEEcatalog no. 94TH0637-9.

Sen:1999:PBD

[SSS99] Vikramaditya Sen, Mri-nal K. Sen, and Paul L.Stoffa. PVM based 3-DKirchhoff depth migrationusing dynamically computedtravel-times: an applica-tion in seismic data pro-cessing. Parallel Comput-ing, 25(3):231–248, March

REFERENCES 410

22, 1999. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.elsevier.com/cas/tree/store/parco/sub/1999/25/3/1389.pdf.

Santana:1996:PVM

[SSSS96] M. S. Santana, P. S.Souza, R. C. Santana, andS. S. Souzza. ParallelVirtual Machine for Win-dows95. In Bode et al.[BDLS96], pages 288–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Souza:1997:EPH

[SSSS97] P. S. Souza, L. J. Senger,M. J. Santana, and R. C.Santana. Evaluating per-sonal high performance com-puting with PVM on Win-dows and LINUX environ-ments. Lecture Notes inComputer Science, 1332:49–56, 1997. CODEN LNCSD9.ISSN 0302-9743 (print),1611-3349 (electronic).

Stellner:1997:LBB

[ST97] G. Stellner and J. Trini-tis. Load balancing based onprocess migration for MPI.Lecture Notes in ComputerScience, 1300:150–??, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Smyk:2002:AMM

[ST02a] Adam Smyk and MarekTudruj. Application ofmixed MPI OpenMP pro-gramming in a multi SMPcluster computer. Lec-ture Notes in Computer Sci-ence, 2328:288–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2328/23280288.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2328/23280288.pdf.

Smyk:2002:OMP

[ST02b] Adam Smyk and Marek Tu-druj. OpenMP / mpi pro-gramming in a multi-clustersystem based on sharedmemory/message passingcommunication. LectureNotes in Computer Sci-ence, 2326:241–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2326/23260241.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2326/23260241.pdf.

Steele:2017:UBP

[ST17] Guy L. Steele, Jr. and Jean-Baptiste Tristan. Usingbutterfly-patterned partial

REFERENCES 411

sums to draw from discretedistributions. ACM SIG-PLAN Notices, 52(8):341–355, August 2017. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Stals:1995:AMP

[Sta95a] L. Stals. Adaptive multi-grid in parallel. In Baileyet al. [BBG+95], pages 367–372. ISBN 0-89871-344-7.LCCN QA76.58.S55 1995.

Stankovski:1995:MPA

[Sta95b] Z. Stankovski. A massivelyparallel algorithm for thecollision probability calcu-lations in the APOLLO-IIcode using the PVM library.In ANS [ANS95], pages1573–1583. ISBN 0-89448-198-3. LCCN TK9006.M371995. Two volumes.

Stephens:1994:PBT

[Ste94] R. Stephens. Parallel bench-marks on the TranstechParamid supercomputer. Inde Gloria et al. [dGJM94],pages 136–146. ISBN ????LCCN ????

Stellner:1996:CCP

[Ste96] G. Stellner. CoCheck:checkpointing and processmigration for MPI. InIEEE [IEE96e], pages 526–531. ISBN 0-8186-7255-2. LCCN QA76.58 .I5651996. IEEE catalog number96TB100038. IEEE Com-

puter Society Press ordernumber PR07255.

Sterling:2000:SCB

[Ste00] Thomas Sterling. Symboliccomputing with Beowulf-class PC clusters. Lec-ture Notes in ComputerScience, 1908:7–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080007.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080007.pdf.

Still:1994:PPC

[Sti94] C. H. Still. Portable paral-lel computing via the MPI1message-passing standard.Computers in Physics, 8(5):533–536, 538–539, September-October 1994. CODENCPHYE2. ISSN 0894-1866(print), 1558-4208 (elec-tronic).

Schmitz:2008:IIG

[STK08] Arne Schmitz, MarkusTavenrath, and Leif Kobbelt.Illumination: Interactiveglobal illumination for de-formable geometry in CUDA.Computer Graphics Forum,27(7):1979–1986, October2008. CODEN CGFODY.ISSN 0167-7055 (print),1467-8659 (electronic).

REFERENCES 412

Sunderam:1997:TAS

[STMK97] V. Sunderam, B. Topol,S. Moyer, and A. Krantz.Tools and auxiliary sub-systems in PVM. Lec-ture Notes in Computer Sci-ence, 1332:285–294, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Stockinger:1998:VPC

[Sto98] Kurt Stockinger. ViMPIOS— a portable, client-serverbased implementation ofMPI-IO on ViPIOS. Diplom-Arbeit, Universitat Wien,Vienna, Austria, 1998. 155pp.

Stpiczynski:2002:PPO

[Stp02] Przemyslaw Stpiczynski.Parallel Programming inOpenMP helps novices: areview of Parallel Program-ming in OpenMP by RohitChandra, Leonardo Dagum,Dave Kohr, Dror May-dan, Jeff McDonald, andRamesh Menon. IEEEDistributed Systems On-line, 3(8), 2002. ISSN1541-4922 (print), 1558-1683(electronic). URL http://dsonline.computer.org/0208/d/bks_a.htm.

Strok:1994:NJI

[Str94] Dale C. Strok. In thenews: Jupiter impacts: Res-olution makes a big differ-ence. supercomputer farm-ing down under. HPF Forum

welcomes comments. Smith-sonian Awards honor com-putational scientists. low-life computer viruses. PVMdevelopers get R&D-100award. the eyes have it. neu-ral nets detect breast can-cer. better cars through co-operation. parallel version ofglobal climate model. Lock-heed to run Idaho NationalEngineering Lab. public-private partners: new drugs,new software. IEEE Compu-tational Science & Engineer-ing, 1(3):88–90, Fall 1994.CODEN ISCEE4. ISSN1070-9924 (print), 1558-190X (electronic).

Strietzel:1996:PTS

[Str96] M. Strietzel. Parallel tur-bulence simulation based onMPI. In Liddell et al.[LCHS96], pages 283–289.ISBN 3-540-61142-8 (paper-back). LCCN QA76.88 .H521996.

Strietzel:1997:PTS

[Str97] M. Strietzel. Parallel tur-bulence simulation: Resolv-ing the inertial subrange ofKolmogorov’s spectra. Lec-ture Notes in Computer Sci-ence, 1332:508–516, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Strzodka:2012:DLO

[Str12] Robert Strzodka. Datalayout optimization for

REFERENCES 413

multi-valued containers inOpenCL. Journal of Paralleland Distributed Computing,72(9):1073–1082, Septem-ber 2012. CODEN JPD-CER. ISSN 0743-7315(print), 1096-0848 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0743731511002115.

Soch:1996:PCG

[STT96] M. Soch, J. Trdlicka, andP. Tvrdik. PVM, computa-tional geometry, and parallelcomputing course. In Bodeet al. [BDLS96], pages 38–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Soch:1997:PGP

[STV97] M. Soch, P. Tvrdik, andM. Volf. Parallel graph-partitioning using the mobheuristic. Lecture Notesin Computer Science, 1332:383–389, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Shen:1999:ATL

[STY99] Kai Shen, Hong Tang, andTao Yang. Adaptive two-level thread managementfor fast MPI execution onshared memory machines. InACM [ACM99], page ??

Stone:1996:RNF

[SU96] J. Stone and M. Underwood.Rendering of numerical flowsimulations using MPI. InIEEE [IEE96i], pages 138–141. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

Sumimoto:2012:MCL

[Sum12] Shinji Sumimoto. The MPICommunication Library forthe K computer: Its designand implementation. LectureNotes in Computer Science,7490:11, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/accesspage/chapter/10.1007/978-3-642-33518-1_3.

Sunderam:1990:PFPa

[Sun90a] V. S. Sunderam. PVM: aframework for parallel dis-tributed computing. Tech-nical Report ORNL/TM-11375, Dept. of Math andComputer Science, EmoryUniversity, Atlanta, GA,USA, February 1990. Seealso [Sun90b].

Sunderam:1990:PFPb

[Sun90b] V. S. Sunderam. PVM: aframework for parallel dis-tributed computing. Con-currency: practice and ex-perience, 2(4):315–339, De-cember 1990. CODENCPEXEI. ISSN 1040-3108.See also the earlier technicalreport [Sun90a].

REFERENCES 414

Sunderam:1992:CCP

[Sun92] Vaidy Sunderam. Concur-rent computing with PVM.In SCRI WCC’92 [SCR92],page ?? ISBN ????LCCN ???? Proceed-ings available via anonymousftp from ftp.scri.fsu.eduin directory pub/parallel-workshop.92.

Sunderam:1993:PCC

[Sun93] V. Sunderam. The PVMconcurrent computing sys-tem. In Anonymous[Ano93g], pages 20–84. ISBN???? LCCN ????

Sunderam:1994:GPP

[Sun94a] V. Sunderam. Generalpurpose parallel computingwith PVM. In Anony-mous [Ano94f], pages 185–198. ISBN ???? LCCN ????

Sunderam:1994:MSH

[Sun94b] V. S. Sunderam. Method-ologies and systems for het-erogeneous concurrent com-puting. In Joubert et al.[JPTE94], pages 29–45.ISBN 0-444-81841-3. LCCNQA76.58 .P3794 1993.

Sunderam:1995:RIH

[Sun95] V. S. Sunderam. Recentinitiatives in heterogeneousparallel computing. In Grayand Naghdy [GN95], pages1–16. ISBN ???? LCCN ????

Sunderam:1996:PSS

[Sun96] V. Sunderam. The PVMsystem: status, trends,and directions. In Bodeet al. [BDLS96], pages 68–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Suresh:1995:IOP

[Sur95a] H. Suresh. Implementa-tion of an optimal par-allel algorithm for arith-metic expression parsing. InNarashimhan [Nar95], page925 vol.2. ISBN 0-7803-2018-2 (paperback), 0-7803-2019-0 (microfiche). LCCNQA76.6.I15 1995. Twovolumes. IEEE catalog no.95TH0682-5.

Suresh:1995:PIQ

[Sur95b] H. Suresh. PVM imple-mentation of quadtree build-ing algorithms on SIMDhypercube system. InNarashimhan [Nar95], pages855–858 (vol. 2). ISBN0-7803-2018-2 (paperback),0-7803-2019-0 (microfiche).LCCN QA76.6.I15 1995.Two volumes. IEEE catalogno. 95TH0682-5.

Suttner:1996:SPB

[Sut96] C. B. Suttner. SPTHEO— a PVM-based paralleltheorem prover. LectureNotes in Computer Science,1156:116–125, ???? 1996.CODEN LNCSD9. ISSN

REFERENCES 415

0302-9743 (print), 1611-3349(electronic).

Smelyanskiy:2011:HPL

[SVC+11] Mikhail Smelyanskiy, KarthikeyanVaidyanathan, Jee Choi,Balint Joo, Jatin Chhugani,Michael A. Clark, andPradeep Dubey. High-performance lattice QCD formulti-core based parallel sys-tems using a cache-friendlyhybrid threaded-MPI ap-proach. In Lathrop et al.[LCK11], pages 69:1–69:11.ISBN 1-4503-0771-X. LCCN????

Sistare:1999:OMC

[SvL99] Steve Sistare, Rolf vande-Vaart, and Eugene Loh. Op-timization of MPI collectiveson clusters of large-scaleSMPs. In ACM [ACM99],page ??

Stout:1991:SDM

[SW91] Quentin F. Stout andMichael Joseph Wolfe, edi-tors. The Sixth DistributedMemory Computing Confer-ence proceedings April 28–May 1, 1991, Portland, Ore-gon. IEEE Computer SocietyPress, 1109 Spring Street,Suite 300, Silver Spring, MD20910, USA, 1991. ISBN 0-8186-2291-1. LCCN QA76.5.D58 1991.

Sehrish:2012:RFS

[SW12] Saba Sehrish and Jun Wang.Reduced Function Set Ab-

straction (RFSA) for MPI-IO. The Journal of Su-percomputing, 59(1):131–146, January 2012. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=59&issue=1&spage=131.

Sosonkina:2015:RAV

[SWH15] Masha Sosonkina, Layne T.Watson, and Jian He. Re-mark on algorithm 897: VT-DIRECT95: Serial and par-allel codes for the globaloptimization algorithm DI-RECT. ACM Transactionson Mathematical Software,41(3):22:1–22:2, June 2015.CODEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic). See [HWS09].

Santhanaraman:2005:DZC

[SWHP05] Gopalakrishnan Santhanara-man, Jiesheng Wu, WeiHuang, and Dhabaleswar K.Panda. Designing zero-copy Message Passing Inter-face derived datatype com-munication over infiniband:Alternative approaches andperformance evaluation. TheInternational Journal ofHigh Performance Comput-ing Applications, 19(2):129–142, Summer 2005. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.

REFERENCES 416

sagepub.com/content/19/2/129.full.pdf+html.

Sitsky:1995:IPM

[SWJ95] D. Sitsky, D. Walsh, andC. Johnson. Implementationand performance of the MPImessage passing interface onthe Fujitsu AP1000 multi-computer. Australian Com-puter Science Communica-tions, 17(1):475–481, ????1995. CODEN ACSCDD.ISSN 0157-3055.

Skjellum:2001:OOA

[SWL+01] Anthony Skjellum, Diane G.Wooley, Ziyang Lu, MichaelWolf, Purushotham V. Ban-galore, Andrew Lumsdaine,Jeffrey M. Squyres, andBrian McCandless. Object-oriented analysis and designof the Message Passing In-terface. Concurrency andComputation: Practice andExperience, 13(4):245–292,April 10, 2001. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic). URL http://www3.interscience.wiley.com/cgi-bin/abstract/78502300/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=78502300&PLACEBO=IE.pdf.

Shan:2012:PEH

[SWS+12] Hongzhang Shan, Nicholas J.Wright, John Shalf, Kather-ine Yelick, Marcus Wagner,and Nathan Wichmann. A

preliminary evaluation of thehardware acceleration of theCray Gemini interconnectfor PGAS languages andcomparison with MPI. ACMSIGMETRICS PerformanceEvaluation Review, 40(2):92–98, September 2012. CO-DEN ???? ISSN 0163-5999(print), 1557-9484 (elec-tronic).

Shee:1994:DMA

[SWYC94] Jang Chung Shee, Chao ChinWu, Lin Wen You, andCheng Chen. Design of amultithread architecture andits parallel simulation andevaluation environment. InAnonymous [Ano94a], pages69–76 (vol. 1). ISBN ????LCCN ???? 2 vol.

Stathopoulos:1995:DLB

[SY95] A. Stathopoulos and A. Yn-nerman. Dynamic load bal-ancing of atomic structureprograms on a PVM clus-ter. In Hertzberger and Ser-azzi [HS95a], pages 384–391.ISBN 3-540-59393-4. ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.88.I57 1995.

Sydow:1994:PSA

[Syd94] A. Sydow. Parallel simu-lation of air pollution. InPehrson et al. [PSB+94],pages 605–612. CODENITATEC. ISBN 0-444-81990-8, 0-444-81989-4. ISSN 0926-5473. LCCN QA75.5.I37851994. Three volumes.

REFERENCES 417

Stathopoulos:1996:PIM

[SYF96] Andreas Stathopoulos, An-ders B. Ynnerman, andCharlotte Froese Fischer.A PVM implementation ofthe MCHF atomic structurepackage. International Jour-nal of Supercomputer Ap-plications and High Perfor-mance Computing, 10(1):41–61, Spring 1996. CODENIJSCFG. ISSN 1078-3482.

Schneider:2009:CPM

[SYR+09] Scott Schneider, Jae-SeungYeom, Benjamin Rose,John C. Linford, AdrianSandu, and Dimitrios S.Nikolopoulos. A compari-son of programming mod-els for multiprocessors withexplicitly managed mem-ory hierarchies. ACM SIG-PLAN Notices, 44(4):131–140, April 2009. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Stankovic:1999:NVJ

[SZ99] N. Stankovic and K. Zhang.Native versus Java mes-sage passing. In Dongarraet al. [DLM99], pages 165–172. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

Siegel:2011:AFV

[SZ11] Stephen F. Siegel and Tim-othy K. Zirkel. Automatic

formal verification of MPI-based parallel programs.ACM SIGPLAN Notices, 46(8):309–310, August 2011.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPoPP ’11 Confer-ence proceedings.

Simmunovic:1995:MIP

[SZBS95a] S. Simmunovic, T. Zacharia,N. Baltas, and D. B. Spald-ing. MPI implementation ofPhoenics: a general purposecomputational fluid dynam-ics code. In Tentner [Ten95],pages 122–127. ISBN 1-56555-078-1. LCCN ????

Simunovic:1995:MIP

[SZBS95b] S. Simunovic, T. Zacharia,N. Baltas, and D. B. Spald-ing. MPI implementationof PHOENICS: a generalpurpose computational fluiddynamics code. In Tent-ner [Ten95], pages 122–127.ISBN 1-56555-078-1. LCCN????

Thompson:2014:CIC

[TA14] Elizabeth A. Thompson andTimothy R. Anderson. ACUDA implementation ofthe Continuous Space Lan-guage Model. The Journalof Supercomputing, 68(1):65–86, April 2014. CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-013-1023-7.

REFERENCES 418

Takeda:2001:AME

[TAH+01] K. Takeda, N. K. Allsopp,J. C. Hardwick, P. C. Macey,D. A. Nicole, S. J. Cox, andD. J. Lancaster. An as-sessment of MPI environ-ments for Windows NT. TheJournal of Supercomputing,19(3):315–323, July 2001.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.wkap.nl/oasis.htm/338207.

Traff:2014:SPE

[TB14] Jesper Larsson Traff andSiegfried Benkner. Se-lected papers from EuroMPI2012. Computing, 96(4):259–261, April 2014. CODENCMPTA2. ISSN 0010-485X(print), 1436-5057 (elec-tronic). URL http://link.springer.com/article/10.1007/s00607-013-0335-z.

Tao:2012:UGA

[TBB12] Jian Tao, Marek Blazewicz,and Steven R. Brandt. UsingGPU’s to accelerate stencil-based computation kernelsfor the development of largescale scientific applicationson heterogeneous systems.ACM SIGPLAN Notices, 47(8):287–288, August 2012.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). PPOPP ’12 confer-ence proceedings.

Touhafi:1996:DPC

[TBD96] A. Touhafi, W. Brissinck,and E. F. Dirkx. Devel-opment of PVM code for alow latency switch based in-terconnect. In Bode et al.[BDLS96], pages 229–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Traff:2012:RAM

[TBD12] Jesper Larsson Traff, SiegfriedBenkner, and Jack J. Don-garra, editors. Recent Ad-vances in the Message Pass-ing Interface: 19th EuropeanMPI Users’ Group Meet-ing, EuroMPI 2012, Vienna,Austria, September 23–26,2012. Proceedings, volume7490 of Lecture Notes inComputer Science. Springer-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 2012. CO-DEN LNCSD9. ISBN 3-642-33517-9 (print), 3-642-33518-7 (e-book). ISSN0302-9743 (print), 1611-3349 (electronic). LCCN???? URL http://www.springerlink.com/content/978-3-642-33518-1.

Tian:2002:IOC

[TBG+02] Xinmin Tian, Aart Bik,Milind Girkar, Paul Grey,Hideki Saito, and ErnestoSu. Intel(R) OpenMPC++/Fortran compiler for

REFERENCES 419

hyper-threading technology:Implementation and per-formance. Intel Tech-nology Journal, 6(1):36–46, February 2002. ISSN1535-766X. URL http://developer.intel.com/technology/itj/2002/volume06issue01/vol6iss1_hyper_threading_technology.pdf.

Tahan:2012:ITC

[TBS12] Oussama Tahan, MatsBrorsson, and MohamedShawky. Introducing taskcancellation to OpenMP.Lecture Notes in ComputerScience, 7312:73–87, 2012.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_6/.

Thomas:1994:PSA

[TC94] S. J. Thomas and J. Cote.Parallel Semi-Lagrangianadvection using PVM. InDekker et al. [DSZ94], pages801–808. ISBN 0-444-81784-0. LCCN QA76.58.E98 1994.

Tzannes:2010:LBS

[TCBV10] Alexandros Tzannes, George C.Caragea, Rajeev Barua, andUzi Vishkin. Lazy binary-splitting: a run-time adap-tive work-stealing scheduler.ACM SIGPLAN Notices,45(5):179–190, May 2010.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867

(print), 1558-1160 (elec-tronic).

Thompson:2015:PCI

[TCP15] Elizabeth Thompson, NathanClem, and David A. Pe-ter. Parallel CUDA im-plementation of conflict de-tection for application toairspace deconfliction. TheJournal of Supercomput-ing, 71(10):3787–3810, Oc-tober 2015. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-015-1467-z.

Tourino:1998:PBL

[TD98] J. Tourino and R. Doallo.A PVM-based library forsparse matrix factorizations.Lecture Notes in ComputerScience, 1497:304–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Tourino:1999:MMC

[TD99] J. Tourino and R. Doallo.Modeling MPI collectivecommunications on theAP3000 Multicomputer. InDongarra et al. [DLM99],pages 133–140. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Thiruvathukal:2000:JNW

[TDB00] George K. Thiruvathukal,Phillip M. Dickens, and

REFERENCES 420

Shahzad Bhatti. Java on net-works of workstations (Ja-vaNOW): a parallel comput-ing framework inspired byLinda and the Message Pass-ing Interface (MPI). Con-currency: practice and ex-perience, 12(11):1093–1116,September 2000. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76000187/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76000187&PLACEBO=IE.pdf.

Tromeur-Dervout:2011:PCF

[TDBEE11] Damien Tromeur-Dervout,Gunther Brenner, David R.Emerson, and Jocelyne Er-hel, editors. Parallel Com-putational Fluid Dynamics2008: Parallel NumericalMethods, Software Develop-ment and Applications, vol-ume 74 of Lecture Notes inComputational Science andEngineering. Springer-Ver-lag, Berlin, Germany / Hei-delberg, Germany / Lon-don, UK / etc., 2011. CO-DEN LNCSA6. ISBN 3-642-14437-3 (print), 3-642-14438-1 (e-book). ISSN1439-7358. LCCN ???? URLhttp://link.springer.com/book/10.1007/978-3-642-14438-7; http://www.springerlink.com/content/978-3-642-14438-7. Pro-ceedings of the twentieth

meeting, Parallel CFD 2008,held May 19–22, 2008 inLyon, France.

Totoni:2013:EFE

[TDG13] Ehsan Totoni, Mert Dikmen,and Marıa Jesus Garzaran.Easy, fast, and energy-efficient object detection onheterogeneous on-chip archi-tectures. ACM Transac-tions on Architecture andCode Optimization, 10(4):45:1–45:??, December 2013.CODEN ???? ISSN1544-3566 (print), 1544-3973(electronic).

Tentner:1995:HPC

[Ten95] A. Tentner, editor. HighPerformance ComputingSymposium 1995 ‘GrandChallenges in ComputerSimulation’. Proceedings ofthe 1995 Simulation Mul-ticonference: Phoenix, AZ,USA, 9–13 April 1995. So-ciety for Computer Simula-tion, San Diego, CA, USA,1995. ISBN 1-56555-078-1.LCCN ????

Truong:2002:PAM

[TFGM02] Hong-Linh Truong, ThomasFahringer, Michael Geissler,and Georg Madsen. Per-formance analysis for MPIapplications with SCALEA.Lecture Notes in ComputerScience, 2474:421–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http:/

REFERENCES 421

/link.springer.de/link/service/series/0558/bibs/2474/24740421.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740421.pdf.

Tu:2012:PAO

[TFZZ12] Bibo Tu, Jianping Fan, Jian-feng Zhan, and XiaofangZhao. Performance anal-ysis and optimization ofMPI collective operationson multi-core clusters. TheJournal of Supercomputing,60(1):141–162, April 2012.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=60&issue=1&spage=141.

Turchi:1994:SDA

[TG94] Patrice E. A. Turchi and An-tonios Gonis, editors. Stat-ics and dynamics of alloyphase transformations: Pro-ceedings of a NATO Ad-vanced Study Institute onStatics and Dynamics of Al-loy Phase Transformations,held June 21–July 3, 1992,in Rhodes, Greece, volume319 of NATO ASI Series BPhysics. Plenum Press, NewYork, NY, USA, 1994. ISBN0-306-44626-X. ISSN 0258-1221. LCCN TN690.S771994.

Thakur:2009:TSE

[TG09] Rajeev Thakur and William

Gropp. Test suite for eval-uating performance of mul-tithreaded MPI communi-cation. Parallel Comput-ing, 35(12):608–617, Decem-ber 2009. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic).

Tian:2005:PCT

[TGBS05] Xinmin Tian, Milind Girkar,Aart Bik, and Hideki Saito.Practical compiler tech-niques on efficient multi-threaded code generation forOpenMP programs. TheComputer Journal, 48(5):588–601, September 2005.CODEN CMPJA6. ISSN0010-4620 (print), 1460-2067(electronic). URL http://comjnl.oxfordjournals.org/cgi/content/abstract/48/5/588; http://comjnl.oxfordjournals.org/cgi/reprint/48/5/588.

Tuncer:2009:PCF

[TGEM09] Ismail H. Tuncer, UlgenGulcat, David R. Emerson,and Kenichi Matsuno, edi-tors. Parallel ComputationalFluid Dynamics 2007: Im-plementations and Experi-ences on Large Scale andGrid Computing, volume 67of Lecture Notes in Com-putational Science and En-gineering. Springer-Verlag,Berlin, Germany / Heidel-berg, Germany / London,UK / etc., 2009. CO-

REFERENCES 422

DEN LNCSA6. ISBN 3-540-92743-3 (print), 3-540-92744-1 (e-book). ISSN1439-7358. LCCN ???? URLhttp://link.springer.com/book/10.1007/978-3-540-92744-0; http://www.springerlink.com/content/978-3-540-92744-0. Paral-lel CFD 2007 was held in An-talya, Turkey, from May 21to 24, 2007.

Thakur:2002:ONA

[TGL02] Rajeev Thakur, WilliamGropp, and Ewing Lusk. Op-timizing noncontiguous ac-cesses in MPI-IO. Par-allel Computing, 28(1):83–105, January 2002. CODENPACOEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.elsevier.com/gej-ng/10/35/21/60/27/32/abstract.html; http://www.elsevier.nl/gej-ng/10/35/21/60/27/32/00001686.pdf.

Thakur:2005:OSO

[TGT05] Rajeev Thakur, WilliamGropp, and Brian Too-nen. Optimizing the syn-chronization operations inMessage Passing Interfaceone-sided communication.The International Journal ofHigh Performance Comput-ing Applications, 19(2):119–128, Summer 2005. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.

sagepub.com/content/19/2/119.full.pdf+html.

Traff:2010:SCM

[TGT10] Jesper Larsson Traff, William D.Gropp, and Rajeev Thakur.Self-consistent MPI perfor-mance guidelines. IEEETransactions on Parallel andDistributed Systems, 21(5):698–709, May 2010. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic).

Thakur:1998:CUM

[Tha98] Rajeev S. Thakur. Acase for using MPI’s de-rived datatypes to im-prove I/O performance. InACM [ACM98b], page ??ISBN ???? LCCN???? URL http://www.supercomp.org/sc98/papers/.

Tian:2005:CEN

[THH+05] Xinmin Tian, Jay P. Hoe-flinger, Grant Haab, Yen-Kuang Chen, Milind Girkar,and Sanjiv Shah. A com-piler for exploiting nestedparallelism in OpenMP pro-grams. Parallel Comput-ing, 31(10–12):960–983, Oc-tober/December 2005. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic).

Trefftz:1994:DPE

[THM+94] C. Trefftz, C. C. Huang,P. K. McKinley, T. Y. Li,

REFERENCES 423

and Z. Zeng. Design andperformance evaluation of adistributed eigenvalue solveron a workstation cluster.In IEEE [IEE94b], pages608–615. ISBN 0-8186-6952-7 (casebound), 0-8186-6950-0 (paperback), 0-8186-6951-9 (microfiche). LCCNTA1637.I25 1994. Threevolumes. IEEE catalog no.94CH35708.

Tran:2000:PPM

[THN00] Viet D. Tran, LadislavHluchy, and Giang T.Nguyen. Parallel programmodel for distributed sys-tems. Lecture Notes inComputer Science, 1908:250–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080250.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080250.pdf.

Thomsen:1994:RTS

[Tho94] P. G. Thomsen. Realtime simulation in a clus-ter computing environment.In Dongarra and Was-niewski [DW94], pages 493–497. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8(New York). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.P35 1994. DM104.00.

Throop:1999:SOS

[Thr99] Joe Throop. Standards:OpenMP: Shared-memoryparallelism from the ashes.Computer, 32(5):108–109,May 1999. CODEN CP-TRB4. ISSN 0018-9162(print), 1558-0814 (elec-tronic). URL http://dlib.computer.org/co/books/co1999/pdf/r5108.pdf.

Traeff:1999:FFE

[THRZ99] J. L. Traeff, R. Hempel,H. Ritzdoff, and F. Zim-mermann. Flattening onthe fly: Efficient handlingof MPI derived datatypes.In Dongarra et al. [DLM99],pages 109–116. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Takizawa:2015:ODT

[THS+15] Hiroyuki Takizawa, ShoichiHirasawa, Makoto Sug-awara, Isaac Gelado, Hi-roaki Kobayashi, and Wenmei W. Hwu. Optimizeddata transfers based onthe OpenCL event manage-ment mechanism. Scien-tific Programming, 2015(??):576498:1–576498:16, ????2015. CODEN SCIPEV.ISSN 1058-9244 (print),1875-919X (electronic). URLhttps://www.hindawi.com/journals/sp/2015/576498/.

REFERENCES 424

Tabakin:2009:QPE

[TJD09] Frank Tabakin and BrunoJulia-Dıaz. QCMPI: a par-allel environment for quan-tum computing. ComputerPhysics Communications,180(6):948–964, June 2009.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465508004141.

Thoman:2012:AOL

[TJPF12] Peter Thoman, HerbertJordan, Simone Pellegrini,and Thomas Fahringer.Automatic OpenMP loopscheduling: a combinedcompiler and runtime ap-proach. Lecture Notes inComputer Science, 7312:88–101, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_7/.

Tang:2016:AKM

[TK16] Qing Y. Tang and Mo-hammed A. S. Khalid. Ac-celeration of k-means algo-rithm using Altera SDK forOpenCL. ACM Transactionson Reconfigurable Technol-ogy and Systems (TRETS),10(1):6:1–6:??, December2016. CODEN ???? ISSN1936-7406 (print), 1936-7414(electronic).

Tennyson:2015:MOI

[TKP15] P. Gerald Tennyson, G. M.Karthik, and G. Phaniku-mar. MPI + OpenCL im-plementation of a phase-field method incorporatingCALPHAD description ofGibbs energies on hetero-geneous computing plat-forms. Computer PhysicsCommunications, 186(??):48–64, January 2015. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465514003208.

Tallent:2009:EPM

[TMC09] Nathan R. Tallent andJohn M. Mellor-Crummey.Effective performance mea-surement and analysis ofmultithreaded applications.ACM SIGPLAN Notices,44(4):229–240, April 2009.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Tampouratzis:2016:AIH

[TMP16] Nikolaos Tampouratzis, Pav-los M. Mattheakis, and Ioan-nis Papaefstathiou. Accel-erating intercommunicationin highly parallel systems.ACM Transactions on Ar-chitecture and Code Opti-mization, 13(4):40:1–40:??,December 2016. CODEN???? ISSN 1544-3566

REFERENCES 425

(print), 1544-3973 (elec-tronic).

Trobec:2001:IEM

[TMPJ01] R. Trobec, M.Sterk, M. Praprot-nik, and D. Janezic. Im-plementation and evalua-tion of MPI-based paral-lel MD program. Inter-national Journal of Quan-tum Chemistry, 84(1):23–31, ???? 2001. CODENIJQCB2. ISSN 0020-7608(print), 1097-461X (elec-tronic). URL http://www3.interscience.wiley.com/cgi-bin/abstract/84002438/START; http://www3.interscience.wiley.com/cgi-bin/fulltext/84002438/FILE?TPL=ftx_start; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=84002438&PLACEBO=IE.pdf.

Theodoropoulos:1996:ESP

[TMTP96] P. Theodoropoulos, G. Ma-nis, P. Tsanakas, and G. Pa-pakonstantinou. Extendingsynchronization PVM mech-anisms. In Bode et al.[BDLS96], pages 315–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Taylor:2017:AOO

[TMW17] Ben Taylor, Vicent SanzMarco, and Zheng Wang.Adaptive optimization forOpenCL programs on em-bedded heterogeneous sys-

tems. ACM SIGPLANNotices, 52(4):11–20, May2017. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic).

Takafuji:2017:CCC

[TNIB17] Daisuke Takafuji, KojiNakano, Yasuaki Ito, andJacir Bordim. C2CU: aCUDA–C program genera-tor for bulk execution of asequential algorithm. Con-currency and Computation:Practice and Experience, 29(17), September 10, 2017.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Takahashi:1999:IEM

[TOTH99] T. Takahashi, F. O’Carroll,H. Tezuka, and A. Hori. Im-plementation and evaluationof MPI on an SMP cluster.Lecture Notes in ComputerScience, 1586:1178–??, 1999.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Toussaint:1996:AES

[Tou96] Marcel Toussaint, editor.Ada in Europe: Second In-ternational Eurospace-Ada-Europe Symposium, Frank-furt/Main, Germany, Oc-tober 2–6, 1995: proceed-ings, number 1031 in Lec-ture Notes in ComputerScience. Springer-Verlag,Berlin, Germany / Hei-

REFERENCES 426

delberg, Germany / Lon-don, UK / etc., 1996.ISBN 3-540-60757-9. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.73.A35I57 1995.

Tourancheau:2000:HSN

[Tou00] Bernard Tourancheau. Highspeed networks for clusters,the BIP-Myrinet experience.Lecture Notes in ComputerScience, 1908:9–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080009.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080009.pdf.

Thebault:2015:SEI

[TPD15] Loıc Thebault, Eric Petit,and Quang Dinh. Scal-able and efficient implemen-tation of 3D unstructuredmeshes computation: a casestudy on matrix assembly.ACM SIGPLAN Notices, 50(8):120–129, August 2015.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Tinetti:2001:HNW

[TQDL01] Fernando Tinetti, Anto-nio Quijano, Armando DeGiusti, and Emilio Luque.Heterogeneous networks of

workstations and the par-allel matrix multiplication.Lecture Notes in ComputerScience, 2131:296–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310296.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310296.pdf.

Traeff:1998:PRL

[Tra98] J. L. Traeff. Portablerandomized list ranking onmultiprocessors using MPI.Lecture Notes in ComputerScience, 1497:395–??, 1998.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Traff:2002:IMP

[Tra02a] Jesper Larsson Traff. Im-plementing the MPI pro-cess topology mechanism.In IEEE [IEE02], page ??ISBN 0-7695-1524-X. LCCN???? URL http://www.sc-2002.org/paperpdfs/pap.pap122.pdf.

Traff:2002:IMA

[Tra02b] Jesper Larsson Traff. Im-proved MPI all-to-all com-munication on a GiganetSMP cluster. Lecture Notesin Computer Science, 2474:392–??, 2002. CODENLNCSD9. ISSN 0302-9743

REFERENCES 427

(print), 1611-3349 (elec-tronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740392.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740392.pdf.

Traff:2012:AUE

[Tra12a] Jesper Larsson Traff. Al-ternative, uniformly expres-sive and more scalable in-terfaces for collective com-munication in MPI. Paral-lel Computing, 38(1–2):26–36, January/February 2012.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819111001402.

Traff:2012:MTM

[Tra12b] Jesper Larsson Traff. mpicroscope:Towards an MPI bench-mark tool for performanceguideline verification. Lec-ture Notes in Computer Sci-ence, 7490:100–109, 2012.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_15/.

Thakur:2005:OCC

[TRG05] Rajeev Thakur, Rolf Raben-seifner, and William Gropp.Optimization of collectivecommunication operationsin MPICH. The Inter-

national Journal of HighPerformance ComputingApplications, 19(1):49–66,Spring 2005. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/19/1/49.full.pdf+html.

Traff:2000:IMO

[TRH00] Jesper Larsson Traff, HubertRitzdorf, and Rolf Hempel.The implementation of MPI-2 one-sided communicationfor the NEC SX-5. InACM [ACM00], pages 45–46. URL http://www.sc2000.org/proceedings/techpapr/papers/pap181.pdf.

Tahan:2012:UDT

[TS12a] Oussama Tahan and Mo-hamed Shawky. Using dy-namic task level redun-dancy for OpenMP faulttolerance. Lecture Notesin Computer Science, 7179:25–36, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-28293-5_3/.

Thibault:2012:AIF

[TS12b] Julien C. Thibault and InancSenocak. Accelerating in-compressible flow compu-tations with a Pthreads–CUDA implementation on

REFERENCES 428

small-footprint multi-GPUplatforms. The Journalof Supercomputing, 59(2):693–719, February 2012.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=59&issue=2&spage=693.

Takahashi:2002:PEH

[TSB02] Daisuke Takahashi, Mit-suhisa Sato, and TaisukeBoku. Performance evalua-tion of the Hitachi SR8000using OpenMP benchmarks.Lecture Notes in ComputerScience, 2327:390–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270390.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270390.pdf.

Takahashi:2003:PEH

[TSB03] Daisuke Takahashi, Mit-suhisa Sato, and TaisukeBoku. Performance evalua-tion of the Hitachi SR8000using SPEC OMP2001benchmarks. Interna-tional Journal of Paral-lel Programming, 31(3):185–196, June 2003. CODENIJPPE5. ISSN 0885-7458(print), 1573-7640 (elec-

tronic). URL /ips/frames/Refs/referenceskapmain.asp?J=4773&I=33&A=2&LK=NM; http://ipsapp007.kluweronline.com/content/getfile/4773/33/2/abstract.htm; http://ipsapp007.kluweronline.com/content/getfile/4773/33/2/fulltext.pdf.

Terboven:2012:AOT

[TSCaM12] Christian Terboven, DirkSchmidl, Tim Cramer, andDieter an Mey. Assess-ing OpenMP tasking imple-mentations on NUMA ar-chitectures. Lecture Notesin Computer Science, 7312:182–195, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_14/.

Ten:1995:TPE

[TSP95] S. V. Ten, V. V. Savchenko,and A. A. Pasko. Time per-formance evaluation of im-plicit surface polygonizationon distributed systems. InGray and Naghdy [GN95],pages 183–193. ISBN ????LCCN ????

Topol:1998:PTV

[TSS98] Brad Topol, John T. Stasko,and Vaidy Sunderam. PVaniM:a tool for visualization innetwork computing envi-ronments. Concurrency:practice and experience,

REFERENCES 429

10(14):1197–1222, Decem-ber 10, 1998. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract?ID=40005932;http://www3.interscience.

wiley.com/cgi-bin/fulltext?ID=40005932&PLACEBO=IE.pdf.

Tatebe:2000:IOO

[TSS00a] Osamu Tatebe, MitsuhisaSato, and Satoshi Sekiguchi.Impact of OpenMP opti-mizations for the MGCGmethod. Lecture Notes inComputer Science, 1940:471–??, 2000. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1940/19400471.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1940/19400471.pdf.

Tavora:2000:DCM

[TSS00b] Vıtor N. Tavora, Luıs M.Silva, and Joao GabrielSilva. Distributed check-pointing mechanism for aparallel file system. Lec-ture Notes in Computer Sci-ence, 1908:137–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/

bibs/1908/19080137.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080137.pdf.

Tsunekawa:1995:EIE

[Tsu95] H. Tsunekawa. Effectiveimplementation of EDEMworkstation cluster usingPVM. In Pahl and Werner[PW95], pages 503–508.ISBN 90-5410-556-9, 90-5410-557-7. LCCN TA345.I565 1995 v.1-2. Two vol-umes.

Tsujita:2007:RMP

[Tsu07] Y. Tsujita. Remote MPI-I/O on a parallel vir-tual file system using acircular buffer for highthroughput. InternationalJournal of Computer Ap-plications, 29(3):291–299,2007. ISSN 1206-212X(print), 1925-7074 (elec-tronic). URL https://www.tandfonline.com/doi/full/10.1080/1206212X.2007.11441859.

Tsutsui:2012:AMG

[Tsu12] Shigeyoshi Tsutsui. ACOon multiple GPUs withCUDA for faster solutionof QAPs. Lecture Notesin Computer Science, 7492:174–184, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.

REFERENCES 430

1007/978-3-642-32964-7_18/.

Tang:1999:CRT

[TSY99] Hong Tang, Kai Shen, andTao Yang. Compile/run-time support for threadedMPI execution on multi-programmed shared mem-ory machines. ACM SIG-PLAN Notices, 34(8):107–118, August 1999. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic). URL http://www.acm.org/pubs/citations/proceedings/ppopp/301104/p107-tang/.

Tang:2000:PTR

[TSY00] Hong Tang, Kai Shen, andTao Yang. Program trans-formation and runtime sup-port for threaded MPI ex-ecution on shared-memorymachines. ACM Transac-tions on Programming Lan-guages and Systems, 22(4):673–700, 2000. CODENATPSDT. ISSN 0164-0925(print), 1558-4593 (elec-tronic). URL http://www.acm.org/pubs/citations/journals/toplas/2000-22-4/p673-tang/.

Trelles-Salazar:1994:MSS

[TSZC94] O. Trelles-Salazar, E. L.Zapata, and J.-M. Carazo.Mapping strategies for se-quential sequence compar-ison algorithms on LAN-

based message passing ar-chitectures. In Gentzschand Harms [GH94], pages197–202. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Theodoropoulos:1997:GSP

[TTP97] P. Theodoropoulos, P. Tsanakas,and G. Papakonstantinou.Global semaphores in a par-allel programming environ-ment. Lecture Notes inComputer Science, 1332:151–158, 1997. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Tanaka:2000:PEO

[TTSY00] Yoshizumi Tanaka, Ken-jiro Taura, Mitsuhisa Sato,and Akinori Yonezawa.Performance evaluation ofOpenMP applications withnested parallelism. Lec-ture Notes in Computer Sci-ence, 1915:100–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1915/19150100.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1915/19150100.pdf.

Twerda:1996:PIT

[TVV96] A. Twerda, A. P. Van denBerg, and A. J. Van der

REFERENCES 431

Steen. Parallel implemen-tation of time dependentRayleigh-Benard convection.Supercomputer, 12(2):36–47,March 1996. CODEN SP-COEL. ISSN 0168-7875.

Tourancheau:2001:SMN

[TW01] Bernard Tourancheau andRoland Westrelin. Sup-port for MPI at the net-work interface level. Lec-ture Notes in ComputerScience, 2131:52–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310052.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310052.pdf.

Thorson:2012:SUF

[TW12] Greg Thorson and MichaelWoodacre. SGI UV2:a fused computation anddata analysis machine. InHollingsworth [Hol12], pages105:1–105:?? ISBN 1-4673-0804-8. URL http://conferences.computer.org/sc/2012/papers/1000a080.pdf.

Tournavitis:2009:THA

[TWFO09] Georgios Tournavitis, ZhengWang, Bjorn Franke, andMichael F. P. O’Boyle. To-wards a holistic approachto auto-parallelization: inte-

grating profile-driven paral-lelism detection and machine-learning based mapping.ACM SIGPLAN Notices,44(6):177–187, June 2009.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Tien:2014:EOS

[TY14] Tsan-Rong Tien and Yi-Ping You. Enabling OpenCLsupport for GPGPU inkernel-based virtual ma-chine. Software—Practiceand Experience, 44(5):483–510, May 2014. CODENSPEXBL. ISSN 0038-0644(print), 1097-024X (elec-tronic).

Utterback:2017:POR

[UALK17] Robert Utterback, KunalAgrawal, I-Ting AngelinaLee, and Milind Kulkarni.Processor-oblivious recordand replay. ACM SIG-PLAN Notices, 52(8):145–161, August 2017. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Uselton:1995:PRS

[UCW95] Samuel P. Uselton, Michael BrianCox, and Craig M. Witten-brink, editors. 1995 Par-allel Rendering Symposium(PRS 95): Atlanta, Geor-gia, October 30–31, 1995.ACM Press, New York, NY10036, USA, 1995. ISBN

REFERENCES 432

0-89791-774-1 (softbound)[invalid checksum], 0-7803-3120-6 (microfiche). LCCNQA76.58.P3778 1995. ACMorder number 428957. IEEEComputer Society Press or-der number 95TB8134.

Udupa:2009:SES

[UGT09] Abhishek Udupa, R. Govin-darajan, and Matthew J.Thazhuthaveetil. Synergis-tic execution of stream pro-grams on multicores withaccelerators. ACM SIG-PLAN Notices, 44(7):99–108, July 2009. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Uhl:1996:PIC

[UH96] A. Uhl and J. Hammerle.Parallel image compressionon a workstation cluster us-ing PVM. In Bode et al.[BDLS96], pages 301–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-3349 (electronic). LCCNQA76.58.E975 1996.

Uhl:1994:PCC

[Uhl94] A. Uhl. Parallel compactcoding of satellite imageswith wavelet packets usingPVM. In Kumar [Kum94],pages 382–387. ISBN 0-07-462332-X. LCCN QA 76.58I587 1994.

Uhl:1995:AWA

[Uhl95a] A. Uhl. Adapted waveletanalysis on moderate par-

allel distributed memoryMIMD architectures. InFerreira and Rolim [FR95],pages 275–283. ISBN3-540-60321-2. LCCNQA76.642.I59 1995.

Uhl:1995:PCC

[Uhl95b] A. Uhl. Parallel compactcoding of satellite imageswith wavelet packets usingPVM. In Prasanna et al.[PBPT95], pages 382–387.ISBN 0-07-462332-X. LCCNQA 76.58 I587 1994.

Uhl:1995:VPW

[Uhl95c] A. Uhl. Vector and par-allel wavelet transformsfor the analysis of time-varying signals. In Baileyet al. [BBG+95], pages 9–14.ISBN 0-89871-344-7. LCCNQA76.58.S55 1995.

Uminski:1997:EEP

[UMK97] P. W. Uminski, M. R. Ma-tuszek, and H. Krawczyk.Experimental evaluation ofPVM group communication.Lecture Notes in ComputerScience, 1332:57–66, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Uthayopas:2001:FSR

[UP01] Putchong Uthayopas andSugree Phatanapherom. Fastand scalable real-time mon-itoring system for Beowulfclusters. Lecture Notes inComputer Science, 2131:

REFERENCES 433

201–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310201.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310201.pdf.

Urena:2012:IMI

[URKG12] Isaıas A. Compres Urena,Michael Riepen, MichaelKonow, and Michael Gerndt.Invasive MPI on Intel’ssingle-chip cloud computer.Lecture Notes in ComputerScience, 7179:74–85, 2012.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/chapter/10.1007/978-3-642-28293-5_7/.

USENIX:1994:PFU

[USE94] USENIX, editor. Proceedingsof the First USENIX Sympo-sium on Operating SystemsDesign and Implementation(OSDI), November 14–17,1994, Monterey, Califor-nia, USA. USENIX, Berke-ley, CA, USA, 1994. ISBN1-880446-66-9. LCCN QA76.76 O63 U87 1994.

USENIX:1995:PUT

[USE95] USENIX, editor. Proceedingsof the 1995 USENIX Tech-nical Conference, January16–20, 1995, New Orleans,

Louisiana, USA. USENIX,Berkeley, CA, USA, 1995.ISBN 1-880446-67-7. LCCNQA 76.76 O63 U88 1995.

USENIX:2000:PAL

[USE00] USENIX, editor. Pro-ceedings of the 4th AnnualLinux Showcase and Confer-ence, Atlanta, October 10–14, 2000, Atlanta, Geor-gia, USA. USENIX, Berke-ley, CA, USA, 2000. ISBN1-880446-17-0. LCCN ????URL http://www.usenix.org/publications/library/proceedings/als2000/.

Uehara:2002:MBP

[UTY02] Hitoshi Uehara, MasanoriTamura, and Mitsuo Yokokawa.An MPI benchmark pro-gram library and its applica-tion to the Earth simulator.Lecture Notes in ComputerScience, 2327:219–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2327/23270219.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2327/23270219.pdf.

Unat:2012:AFD

[UZC+12] Didem Unat, Jun Zhou,Yifeng Cui, Scott B. Baden,and Xing Cai. Accelerat-ing a 43D finite-differenceearthquake simulation with

REFERENCES 434

a C-to-CUDA translator.Computing in Science andEngineering, 14(3):48–59,May/June 2012. CODENCSENFA. ISSN 1521-9615(print), 1558-366X (elec-tronic).

vanderPas:1993:PIG

[van93] R. van der Pas. ThePVM implementation of aGeneralized Red Black algo-rithm. Supercomputer, 10(4-5):72–85, July-September1993. CODEN SPCOEL.ISSN 0168-7875.

VanKatwijk:1995:AAC

[Van95] Jan Van Katwijk, editor.ACSCI ’95: 1st Annual con-ference — May 1995, Hei-jen, The Netherlands, Pro-ceedings of the Annual Con-ference — Advanced Schoolfor Computing and Imag-ing, 1st. ASCI, Delft, TheNetherlands, 1995. ISBN 90-90-08344-8. LCCN QA75.5.A38x 1995.

vandeGeijn:1997:UPP

[van97] Robert A. van de Geijn.Using PLAPACK: Paral-lel Linear Algebra Pack-age. MIT Press, Cambridge,MA, USA, 1997. ISBN 0-262-72026-4. xvii + 194pp. LCCN QA185.D37 V361997. US$27.50. With con-tributions by Philip Alpatovand others.

Vlassov:1995:MEP

[VAT95] V. Vlassov, H. Ahmed, andL.-E. Thorelli. mEDA-2:An extension of PVM. InMalyshkin [Mal95], pages288–293. ISBN 3-540-60222-4. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.I5471995.

Vazquez:1999:PNS

[VB99] G. E. Vazquez and N. B.Brignole. Parallel NLPstrategies using PVM on het-erogeneous distributed en-vironments. In Dongarraet al. [DLM99], pages 533–540. ISBN 3-540-66549-8(softcover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58E973 1999.

VanZee:2008:SPF

[VBLvdG08] Field G. Van Zee, PaoloBientinesi, Tze Meng Low,and Robert A. van deGeijn. Scalable paralleliza-tion of FLAME code via theworkqueuing model. ACMTransactions on Mathemat-ical Software, 34(2):10:1–10:29, March 2008. CO-DEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic).

Vapirev:2015:IRC

[VDL+15] A. Vapirev, J. Deca, G. Lapenta,S. Markidis, I. Hur, and J.-L. Cambier. Initial results oncomputational performance

REFERENCES 435

of Intel many integratedcore, Sandy Bridge, andgraphical processing unitarchitectures: implementa-tion of a 1D C++/OpenMPelectrostatic particle-in-cellcode. Concurrency andComputation: Practice andExperience, 27(3):581–593,March 10, 2015. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

vanderLaan:2011:AWL

[vdLJR11] Wladimir J. van der Laan,Andrei C. Jalba, and JosB. T. M. Roerdink. Ac-celerating wavelet liftingon graphics hardware us-ing CUDA. IEEE Transac-tions on Parallel and Dis-tributed Systems, 22(1):132–146, January 2011. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic).

Vetter:2000:DST

[VdS00] Jeffrey S. Vetter and Bro-nis R. de Supinski. Dy-namic software testing ofMPI applications with Um-pire. In ACM [ACM00],page 70. URL http://www.sc2000.org/proceedings/techpapr/papers/pap208.pdf.

Vetter:2002:DSP

[Vet02] Jeffrey Vetter. Dynamicstatistical profiling of com-munication activity in dis-

tributed applications. ACMSIGMETRICS PerformanceEvaluation Review, 30(1):240–250, June 2002. CO-DEN ???? ISSN 0163-5999(print), 1557-9484 (elec-tronic).

Vadhiyar:2002:PMS

[VFD02] Sathish S. Vadhiyar, Gra-ham E. Fagg, and Jack J.Dongarra. Performancemodeling for self adapt-ing collective communica-tions for MPI. In Oldehoeft[Old02], page ?? CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://www.netlib.org/utk/people/JackDongarra/PAPERS/coll-lacsi-2001.pdf.

Vega-Gisbert:2016:DIJ

[VGRS16] Oscar Vega-Gisbert, Jose E.Roman, and Jeffrey M.Squyres. Design and im-plementation of Java bind-ings in Open MPI. Par-allel Computing, 59(??):1–20, November 2016. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167819116300758.

Vikas:2014:MGA

[VGS14] Vikas, Nasser Giacaman,and Oliver Sinnen. Multipro-cessing with GUI-awarenessusing OpenMP-like direc-tives in Java. Parallel Com-

REFERENCES 436

puting, 40(2):69–89, Febru-ary 2014. CODEN PA-COEJ. ISSN 0167-8191(print), 1872-7336 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0167819113001439.

vonHanxleden:1994:VDF

[vHKS94] R. von Hanxleden, K. Kennedy,and J. Saltz. Value-based distributions in For-tran D. In Gentzschand Harms [GH94], pages434–440. ISBN 0-387-57981-8 (New York), 3-540-57981-8 (Berlin). LCCNQA76.88.I57 1994. DM96.00.Two volumes.

Viswanathan:1995:PCM

[Vis95] Kishore Viswanathan. Aparallel client-server modelfor distributed computing.M.s. thesis, Departmentof Computer Science, Mis-sissippi State University,Starkville, MS, USA, 1995.vii + 79 pp.

Valencia:2008:PPR

[VLO+08] David Valencia, Alexey Las-tovetsky, Maureen O’Flynn,Antonio Plaza, and JavierPlaza. Parallel processingof remotely sensed hyper-spectral images on hetero-geneous networks of work-stations using HeteroMPI.The International Journalof High Performance Com-puting Applications, 22(4):386–407, November 2008.

CODEN IHPCFL. ISSN1094-3420 (print), 1741-2846 (electronic). URLhttp://hpc.sagepub.com/content/22/4/386.full.pdf+html.

Varadarajan:1994:FDT

[VM94] V. Varadarajan and R. Mit-tra. Finite-difference time-domain (FDTD) analysisusing distributed comput-ing. IEEE Microwave andGuided Wave Letters, 4(5):144–145, May 1994. CODENIMGLE3. ISSN 1051-8207(print), 1558-2329 (elec-tronic).

Vincent:1995:HPP

[VM95] James J. Vincent and Ken-neth M. Merz Jr. A highlyportable parallel implemen-tation of AMBER4 usingthe message passing inter-face standard. Journal ofComputational Chemistry,16(11):1420–1427, Novem-ber 1995. CODEN JC-CHDD. ISSN 0192-8651(print), 1096-987X (elec-tronic).

Vogel:2013:BWC

[Vog13] Thomas Vogel. All theWay to CUDA [book re-view]. Computing in Sci-ence and Engineering, 15(5):6–8, September/October2013. CODEN CSENFA.ISSN 1521-9615.

REFERENCES 437

Volkert:1993:PCS

[Vol93] Jens Volkert, editor. Parallelcomputation: Second Inter-national ACPC Conference,Gmunden, Austria, October4–6, 1993: proceedings, vol-ume 734 of Lecture Notes inComputer Science. Spring-er-Verlag, Berlin, Ger-many / Heidelberg, Ger-many / London, UK / etc.,1993. ISBN 3-540-57314-3 (Berlin), 0-387-57314-3(New York). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA267.A1L43 no.734. DM58.00.

Voss:2003:OSM

[Vos03] Michael J. Voss, editor.OpenMP shared memoryparallel programming: In-ternational Workshop onOpenMP Applications andTools, WOMPAT 2003,Toronto, Canada, June 26–27, 2003: Proceedings, vol-ume 2716 of Lecture Notes inComputer Science. Spring-er-Verlag, Berlin, Ger-many / Heidelberg, Ger-many / London, UK / etc.,2003. CODEN LNCSD9.ISBN 3-540-40435-X (soft-cover). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.642.I589 2003. URL http://link.springer-ny.com/link/service/series/0558/tocs/t2716.htm; http://www.springerlink.com/openurl.asp?genre=issue&

issn=0302-9743&volume=2716.

VidalMacia:2000:IPM

[VP00] Antonio Vidal Macia andJose Luis Perez Gomez.Introduccion a la progra-macion en MPI. (Spanish)[Introduction to program-ming in MPI]. Technical re-port SPUPV-2000.209, De-partamento de Sistemas In-formaticos y Computacion,Facultad de Informatica,Universidad Politecnica deValencia, Servicio de Pub-licaciones, Valencia, Spain,2000. 78 pp.

Vargas-Perez:2017:HMO

[VPS17] Sandino Vargas-Perez andFahad Saeed. A hybridMPI–OpenMP strategy tospeedup the compression ofbig next-generation sequenc-ing datasets. IEEE Trans-actions on Parallel and Dis-tributed Systems, 28(10):2760–2769, October 2017.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic)ITDSEO. URLhttps://www.computer.org/csdl/trans/td/2017/10/07895161-abs.html.

Vrenios:2004:PPC

[Vre04] A. Vrenios. Parallel Pro-gramming in C with MPIand OpenMP [book review].IEEE Distributed SystemsOnline, 5(1):7.1–7.3, ????2004. CODEN ???? ISSN1541-4922 (print), 1558-1683

REFERENCES 438

(electronic). URL http://ieeexplore.ieee.org/iel5/8968/28452/01270716.pdf?isnumber=28452&prod=JNL&arnumber=1270716&arSt=+7.1&ared=+7.3&arAuthor=Vrenios%2C+A.; http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=28452&arnumber=1270716&count=8&index=5.

Varin:2000:PAL

[VRS00] E. Varin, R. Roy, andG. Samba. Parallel algo-rithms for the least-squaresfinite element solution of theneutron transport equation.Lecture Notes in ComputerScience, 1908:121–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1908/19080121.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1908/19080121.pdf.

VanVoorst:2000:CMI

[VS00] Brian Van Voorst and StevenSeidel. Comparison of MPIimplementations on a sharedmemory machine. Lec-ture Notes in Computer Sci-ence, 1800:847–??, 2000.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1800/18000847.htm;

http://link.springer-ny.com/link/service/series/0558/papers/1800/18000847.pdf.

Vaughan:1994:MPM

[VSRC94] P. L. Vaughan, A. Skjel-lum, D. S. Reese, and Fei-Chen Cheng. Migrating fromPVM to MPI. I. the Unifysystem. In IEEE [IEE94a],pages 488–495. ISBN 0-8186-6965-9. LCCN QA76.58.S951994. IEEE catalog no.95TH8024.

Vaughan:1995:MPM

[VSRC95] Paula L. Vaughan, AnthonySkjellum, Donna S. Reese,and Fei-Chen Cheng. Mi-grating from PVM to MPI,part I: The Unify system.Frontiers of Massively Paral-lel Computation — Confer-ence Proceedings, pages 488–495, ???? 1995. IEEE cata-log number 95TH8024.

Vaidya:2013:SDO

[VSW+13] Aniruddha S. Vaidya, AnahitaShayesteh, Dong Hyuk Woo,Roy Saharoy, and Mani Az-imi. SIMD divergence op-timization through intra-warp compaction. ACMSIGARCH Computer Ar-chitecture News, 41(3):368–379, June 2013. CODENCANED2. ISSN 0163-5964(print), 1943-5851 (elec-tronic). ICSA ’13 conferenceproceedings.

REFERENCES 439

Vlassov:1997:SSM

[VT97] V. Vlassov and L.-E. Thorelli.A synchronizing sharedmemory: Model and pro-gramming implementation.Lecture Notes in ComputerScience, 1332:159–166, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Vandoni:1995:CSC

[VV95] C. E. Vandoni and C. Verk-erk, editors. 1994 CERNSchool of Computing: So-pron, Hungary, 28 August–10 September 1994: pro-ceedings. CERN, Geneva,Switzerland, 1995. ISBN 92-9083-069-7. CERN report95-01.

Vo:2009:FVP

[VVD+09] Anh Vo, Sarvani Vakkalanka,Michael DeLisi, GaneshGopalakrishnan, Robert M.Kirby, and Rajeev Thakur.Formal verification of prac-tical MPI programs. ACMSIGPLAN Notices, 44(4):261–270, April 2009. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Verkerk:1992:PIC

[VW92] C. Verkerk and W. Woj-cik, editors. Proceedingsof the International Confer-ence on Computing in HighEnergy Physics ’92, An-necy, France, 21–25 Septem-

ber 1992. CERN, Geneve,Switzerland, 1992. ISBN 92-9083-049-2. LCCN QC783.3C65 1992. CERN report 92-07.

Vetter:2002:EPE

[VY02] Jeffrey S. Vetter and AndyYoo. An empirical perfor-mance evaluation of scal-able scientific applications.In IEEE [IEE02], page ??ISBN 0-7695-1524-X. LCCN???? URL http://www.sc-2002.org/paperpdfs/pap.pap222.pdf.

Verschelde:2015:PHC

[VY15] Jan Verschelde and Xi-angcheng Yu. Polynomialhomotopy continuation onGPUs. ACM Communica-tions in Computer Algebra,49(4):130–133, December2015. CODEN ???? ISSN1932-2232 (print), 1932-2240(electronic).

Wong:1999:BMM

[WADC99] F. C. Wong, A. C. Arpaci-Dusseau, and D. E. Culler.Building MPI for multi-programming systems usingimplicit information. InDongarra et al. [DLM99],pages 215–222. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Walker:1994:DSM

[Wal94a] David W. Walker. Thedesign of a standard mes-

REFERENCES 440

sage passing interface fordistributed memory concur-rent computers. Paral-lel Computing, 20(4):657–673, March 31, 1994. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). URL http://www.elsevier.com/cgi-bin/cas/tree/store/parco/cas_sub/browse/browse.cgi?year=1994&volume=20&issue=4&aid=865; http://www.epm.ornl.gov/~walker/mpi/papers/parcomp94.ps.Z. See erratum [Wal94b].

Walker:1994:EDS

[Wal94b] David W. Walker. Erra-tum to: “The design ofa standard message pass-ing interface for distributedmemory concurrent comput-ers”. Parallel Computing,20(8):1215, August 1994.CODEN PACOEJ. ISSN0167-8191 (print), 1872-7336(electronic). See [Wal94a].

Walker:1995:MVB

[Wal95] D. W. Walker. An MPIversion of the BLACS. InIEEE [IEE95j], pages 129–146. ISBN 0-8186-6895-4.LCCN QA76.58 .S34 1994.

Walker:1996:MFA

[Wal96a] David W. Walker. MPI:from fundamentals to appli-cations. Technical report,Oak Ridge National Labo-ratory, Knoxville, TN, USA,1996. URL http://www.

epm.ornl.gov/~walker/mpi/SLIDES/mpi-tutorial.html.

Walker:1996:MP

[Wal96b] David W. Walker. MPI2 pro-posals. World-Wide Web,1996. URL http://www.epm.ornl.gov/~walker/mpi/mpi2-proposals.html.

Wallcraft:2000:SOV

[Wal00] Alan J. Wallcraft. SPMDOpenMP versus MPI forocean models. Concur-rency: practice and ex-perience, 12(12):1155–1164,October 2000. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract/76500353/START; http://www3.interscience.wiley.com/cgi-bin/fulltext?ID=76500353&PLACEBO=IE.pdf.

Walker:2001:DLB

[Wal01a] Reginald L. Walker. Dy-namic load balancing model:Preliminary results for paral-lel pseudo-search engine in-dexers/crawler mechanismsusing MPI and genetic pro-gramming. Lecture Notesin Computer Science, 1981:61–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1981/19810061.htm;http://link.springer-

REFERENCES 441

ny.com/link/service/series/0558/papers/1981/19810061.pdf.

Walker:2001:SEC

[Wal01b] Reginald L. Walker. Searchengine case study: searchingthe Web using genetic pro-gramming and MPI. Par-allel Computing, 27(1–2):71–89, January 2001. CO-DEN PACOEJ. ISSN0167-8191 (print), 1872-7336 (electronic). URLhttp://www.elsevier.nl/gej-ng/10/35/21/47/25/25/abstract.html; http://www.elsevier.nl/gej-ng/10/35/21/47/25/25/article.pdf.

Wallcraft:2002:CCA

[Wal02] Alan J. Wallcraft. A com-parison of Co-Array Fortranand OpenMP Fortran forSPMD programming. TheJournal of Supercomputing,22(3):231–250, July 2002.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://ipsapp008.kluweronline.com/content/getfile/5189/36/1/abstract.htm; http://ipsapp008.kluweronline.com/content/getfile/5189/36/1/fulltext.pdf.

Wang:1997:TPD

[Wan97] Paul S. Wang. Tools for par-allel/distributed mathemati-cal computation. In ACM

[ACM97a], pages 188–195.ISBN ???? LCCN ????

Wang:2002:OPG

[Wan02] Ping Wang. OpenMP pro-gramming for a global in-verse model. Scientific Pro-gramming, 10(3):253–261,2002. CODEN SCIPEV.ISSN 1058-9244 (print),1875-919X (electronic).

Wasniowski:1995:NAP

[Was95a] R. A. Wasniowski. Nonlin-ear adaptive prediction al-gorithm and its parallel im-plementation. Informatica(Ljubljana, Slovenia), 19(3):371–377, September 1995.CODEN INFOFF. ISSN0350-5596.

White:1995:PNP

[WAS95b] S. White, A. Alund, andV. S. Sunderam. Perfor-mance of the NAS parallelbenchmarks on PVM-Basednetworks. Journal of Paralleland Distributed Computing,26(1):61–71, April 1, 1995.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.idealibrary.com/links/doi/10.1006/jpdc.1995.1048/production;http://www.idealibrary.com/links/doi/10.1006/jpdc.1995.1048/production/pdf.

Wasniewski:1996:APC

[Was96] Jerzy Wasniewski, editor.Applied parallel computing:

REFERENCES 442

industrial computation andoptimization: Third Interna-tional Workshop, PARA ’96,Lyngby, Denmark, August18–21, 1996: proceedings,volume 1184 of Lecture notesin computer science. Spring-er-Verlag, Berlin, Germany /Heidelberg, Germany / Lon-don, UK / etc., 1996.ISBN 3-540-62095-8. LCCNQA76.58 .P35 1996.

Wolf:1996:CFS

[WB96] K. Wolf and E. Brakkee.Coupling fluids and struc-tures codes on MPI. InIEEE [IEE96i], pages 130–137. ISBN 0-8186-7533-0.LCCN QA76.642 .M67 1996.

Wickerson:2015:RSP

[WBBD15] John Wickerson, MarkBatty, Bradford M. Beck-mann, and Alastair F. Don-aldson. Remote-scope pro-motion: clarified, rectified,and verified. ACM SIG-PLAN Notices, 50(10):731–747, October 2015. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Wolf:1997:CMP

[WBH97] K. Wolf, E. Brakkee, andD. P. Ho. Communication inmulti-physics applications.Lecture Notes in ComputerScience, 1332:167–176, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Wickerson:2017:ACM

[WBSC17] John Wickerson, MarkBatty, Tyler Sorensen, andGeorge A. Constantinides.Automatically comparingmemory consistency mod-els. ACM SIGPLAN No-tices, 52(1):190–204, Jan-uary 2017. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Walters:2009:RBF

[WC09] John Paul Walters and VipinChaudhary. Replication-based fault tolerance forMPI applications. IEEETransactions on Paralleland Distributed Systems,20(7):997–1010, July 2009.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic).

Wang:2015:AST

[WC15] Chun-Kun Wang and Peng-Sheng Chen. Automaticscoping of task clausesfor the OpenMP taskingmodel. The Journal ofSupercomputing, 71(3):808–823, March 2015. CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-014-1326-3.

Wang:2007:EAP

[WCC+07] Perry H. Wang, Jami-son D. Collins, Gautham N.

REFERENCES 443

Chinya, Hong Jiang, XinminTian, Milind Girkar, Nick Y.Yang, Guei-Yuan Lueh, andHong Wang. EXOCHI: ar-chitecture and programmingenvironment for a hetero-geneous multi-core multi-threaded system. ACM SIG-PLAN Notices, 42(6):156–166, June 2007. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Wang:2012:OVT

[WCC12] Cheng Wang, Sunita Chan-drasekaran, and BarbaraChapman. An OpenMP3.1 validation testsuite.Lecture Notes in Com-puter Science, 7312:237–249, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-30961-8_18/.

Wu:1999:JBD

[WCS99] X. Wu, Q. Chen, and X.-H. Sun. A Java-baseddistributed debbuger sup-porting MPI and PVM.Parallel and DistributedComputing Practices, 2(4):??, ???? 1999. CO-DEN ???? ISSN 1097-2803. URL http://www.cs.okstate.edu/~pdcp/vols/vol02/vol02no4abs.html#wu.

Wang:2013:PMO

[WCS+13] Cheng Wang, Sunita Chan-drasekaran, Peng Sun, Bar-bara Chapman, and JimHolt. Portable mappingof openMP to multicoreembedded systems usingMCA APIs. ACM SIG-PLAN Notices, 48(5):153–162, May 2013. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Wedemeijer:1996:PSA

[WCVR96] H. Wedemeijer, H. L. H.Cox, D. J. Verschuur, andI. L. Ritsema. Paralleli-sation of seismic algorithmsusing PVM and FORGE.In Liddell et al. [LCHS96],pages 352–?? ISBN 3-540-61142-8 (paperback). LCCNQA76.88 .H52 1996.

Walker:1996:MSM

[WD96] D. W. Walker and J. J.Dongarra. MPI: a stan-dard message passing inter-face. Supercomputer, 12(1):56–68, January 1996. CO-DEN SPCOEL. ISSN 0168-7875.

Welch:1994:PVM

[Wel94] L. R. Welch. A paral-lel virtual machine for pro-grams composed of abstractdata types. IEEE Transac-tions on Computers, 43(11):1249–1261, November 1994.CODEN ITCOB4. ISSN

REFERENCES 444

0018-9340 (print), 1557-9956(electronic).

Werner:1995:UMP

[Wer95] Jorg Werner. Uberblick zumMessage-Passing-InterfaceStandard, MPI. (German)[Overview of the Message-Passing Interface Standard,MPI]. Parlab-Mitteilungen04/95, Technische Uni-versitat Chemnitz-Zwickau,Chemnitz, Germany, 1995.35 pp.

Weber:2017:MAL

[WG17] Nicolas Weber and MichaelGoesele. MATOG: Ar-ray layout auto-tuning forCUDA. ACM Transac-tions on Architecture andCode Optimization, 14(3):28:1–28:??, September 2017.CODEN ???? ISSN1544-3566 (print), 1544-3973(electronic).

Wark:1994:PIR

[WH94] P. Wark and J. Holt. PVMimplementation of a re-peated matching heuristicfor vehicle routing. InArnold et al. [ACDR94],pages 207–216 (or 207–214??). ISBN 90-5199-149-5.LCCN ????

Wagner:1996:PMM

[WH96] J. C. Wagner and A. Haghighat.Parallel MCNP Monte Carlotransport calculations withMPI. Transactions of theAmerican Nuclear Society,

75(??):338–339, ???? 1996.CODEN TANSAO. ISSN0003-018X.

Wiese:2005:IPN

[WHDB05] Kay C. Wiese, AndrewHendriks, Alain Desch-enes, and Belgacem BenYoussef. The impact ofpseudorandom number qual-ity on P-RnaPredict, aparallel genetic algorithmfor RNA secondary struc-ture prediction. In Beyeret al. [B+05], pages 479–480.ISBN 1-59593-010-8 (paper-back). LCCN QA76.623.G44 2005. URL http://www.cs.bham.ac.uk/~wbl/biblio/gecco2005lbp/papers/52-wiese.pdf. ACM ordernumber 910050.

White:1994:VVC

[Whi94] R. White. VCMON —the VM/ESA ConnectivityMonitor. In Anonymous[Ano94g], pages 783–792.ISBN ???? LCCN ????

White:2004:CMM

[Whi04] R. E. (Robert E.) White.Computational Mathemat-ics: Models, Methods, andAnalysis with MATLABand MPI. Chapman andHall/CRC, Boca Raton, FL,USA, 2004. ISBN 1-58488-364-2. xvi + 385 pp. LCCNQA297 .W495 2004.

Wilkinson:1993:IFT

[Wil93] Timothy James Wilkinson.Implementing Fault Toler-

REFERENCES 445

ance in a 64-bit DistributedOperating System. PhDthesis, Systems ArchitectureResearch Centre, City Uni-versity, London, UK, July1993.

Wilhelms:1994:DAL

[Wil94] Gerhard Wilhelms. Dy-namische adaptive Lastverteilungfur PVM mittels unschar-fer Benutzerprofile – PVM+

(English: Dynamic adaptiveload distribution for PVM byblurred user profiles – PVM+

). Dissertation, Math.-Naturwiss. Fakultat, Univer-sitat Augsburg, Augsburg,Germany, 1994. iv + 74 pp.

Wismueller:1996:SBV

[Wis96a] R. Wismueller. State basedvisualization of PVM appli-cations. Lecture Notes inComputer Science, 1156:91–??, ???? 1996. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Wismuller:1996:SBV

[Wis96b] R. Wismuller. State basedvisualization of PVM ap-plications. In Bode et al.[BDLS96]. ISBN 3-540-61779-5. ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.E9751996.

Wismueller:1997:DMP

[Wis97] R. Wismueller. Debuggingmessage passing programs

using invisible message tags.Lecture Notes in ComputerScience, 1332:295–304, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Wismueller:1998:LMS

[Wis98] R. Wismueller. On-linemonitoring support in PVMand MPI. Lecture Notesin Computer Science, 1497:312–??, 1998. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic).

Wismuller:2001:UMT

[Wis01] Roland Wismuller. Us-ing monitoring techniques tosupport the cooperation ofsoftware components. Lec-ture Notes in Computer Sci-ence, 2131:183–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310183.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310183.pdf.

Witchel:2016:PPW

[Wit16] Emmett Witchel. Program-mer productivity in a worldof mushy interfaces: Chal-lenges of the post-ISA real-ity. Operating Systems Re-view, 50(2):591, June 2016.CODEN OSRED8. ISSN

REFERENCES 446

0163-5980 (print), 1943-586X (electronic).

Wei:2012:OLL

[WJ12] Zheng Wei and JosephJaja. Optimization of linkedlist prefix computations onmultithreaded GPUs usingCUDA. Parallel Process-ing Letters, 22(4):1250012,December 2012. CODENPPLTEE. ISSN 0129-6264(print), 1793-642X (elec-tronic).

Wu:2014:OFB

[WJB14] Jing Wu, Joseph JaJa, andElias Balaras. An op-timized FFT-based directPoisson solver on CUDAGPUs. IEEE Transac-tions on Parallel and Dis-tributed Systems, 25(3):550–559, March 2014. CODENITDSEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic).

Wegiel:2008:MCVa

[WK08a] Michal Wegiel and ChandraKrintz. The mapping collec-tor: virtual memory supportfor generational, parallel,and concurrent compaction.ACM SIGARCH ComputerArchitecture News, 36(1):91–102, March 2008. CODENCANED2. ISSN 0163-5964(ACM), 0884-7495 (IEEE).

Wegiel:2008:MCVb

[WK08b] Michal Wegiel and ChandraKrintz. The Mapping Col-

lector: virtual memory sup-port for generational, par-allel, and concurrent com-paction. Operating Sys-tems Review, 42(2):91–102,March 2008. CODEN OS-RED8. ISSN 0163-5980(print), 1943-586X (elec-tronic).

Wegiel:2008:MCVc

[WK08c] Michal Wegiel and ChandraKrintz. The mapping collec-tor: virtual memory supportfor generational, parallel,and concurrent compaction.ACM SIGPLAN Notices,43(3):91–102, March 2008.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Wittenbrink:2011:FGG

[WKP11] Craig M. Wittenbrink, Em-mett Kilgariff, and ArjunPrabhu. Fermi GF100 GPUarchitecture. IEEE Micro,31(2):50–59, March/April2011. CODEN IEMIDZ.ISSN 0272-1732 (print),1937-4143 (electronic).

Wagner:1996:GSG

[WKS96] T. Wagner, C. Kueblbeck,and C. Schittko. Ge-netic selection and gener-ation of textural featureswith PVM. In Bode et al.[BDLS96], pages 305–??ISBN 3-540-61779-5. ISSN0302-9743 (print), 1611-

REFERENCES 447

3349 (electronic). LCCNQA76.58.E975 1996.

Lehman:1994:IZP

[wL94] Li wei Lehman. Integratingzipcode and PVM: towards ahigher-level message-passingenvironment. Technical re-port MSSU-EIRS-ERC 94-2, Engineering ResearchCenter for ComputationalField Simulation, MississippiState University, Starkville,MS, USA, 1994. 7 pp.

Wismueller:1996:TSI

[WL96a] R. Wismueller and T. Lud-wig. The tool-set — anintegrated tool environmentfor PVM. Lecture Notesin Computer Science, ??(1067):1029–??, ???? 1996.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Wismuller:1996:TSI

[WL96b] R. Wismuller and T. Lud-wig. The Tool Set —an integrated tool environ-ment for PVM. In Liddellet al. [LCHS96]. ISBN 3-540-61142-8 (paperback). LCCNQA76.88 .H52 1996.

Wu:2007:IFR

[WLC07] C.-L. Wu, D.-C. Lou, andS.-Y. Chen. Integer factor-ization for RSA cryptosys-tem under a PVM environ-ment. International Journalof Computer Systems Sci-ence and Engineering, 22(1–

2):??, January/March 2007.CODEN CSSEEI. ISSN0267-6192.

Weatherly:2003:DMS

[WLNL03] D. Brent Weatherly, David K.Lowenthal, Mario Nakazawa,and Franklin Lowenthal.Dyn-MPI: Supporting MPIon non dedicated clus-ters. In ACM [ACM03],page ?? ISBN 1-58113-695-1. LCCN ???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10708#1; http://www.sc-conference.org/sc2003/paperpdfs/pap126.pdf.

Weatherly:2006:DMS

[WLNL06] D. Brent Weatherly, David K.Lowenthal, Mario Nakazawa,and Franklin Lowenthal.Dyn-MPI: Supporting MPIon medium-scale, non-dedicated clusters. Jour-nal of Parallel and Dis-tributed Computing, 66(6):822–838, June 2006. CO-DEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Willcock:2005:UMC

[WLR05] Jeremiah Willcock, AndrewLumsdaine, and Arch Robi-son. Using MPI with C#and the Common LanguageInfrastructure. Concurrencyand Computation: Prac-tice and Experience, 17(7–8):895–917, June/July 2005.

REFERENCES 448

CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Wu:2012:UHM

[WLYC12] Chao-Chin Wu, Lien-Fu Lai,Chao-Tung Yang, and Po-Hsun Chiu. Using hybridMPI and OpenMP program-ming to optimize communi-cations in parallel loop self-scheduling schemes for mul-ticore PC clusters. TheJournal of Supercomputing,60(1):31–61, April 2012.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=60&issue=1&spage=31.

Wolf:2001:APA

[WM01] Felix Wolf and BerndMohr. Automatic perfor-mance analysis of MPI ap-plications based on eventtraces. Lecture Notes inComputer Science, 1900:123–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/1900/19000123.htm;http://link.springer-

ny.com/link/service/series/0558/papers/1900/19000123.pdf.

Wu:2014:MAG

[WMP14] Xing Wu, Frank Mueller,and Scott Pakin. A method-ology for automatic genera-tion of executable commu-nication specifications fromparallel MPI applications.ACM Transactions on Par-allel Computing (TOPC),1(1):6:1–6:??, September2014. CODEN ???? ISSN2329-4949 (print), 2329-4957(electronic).

Winkler:2017:GSM

[WMRR17] Daniel Winkler, MichaelMeister, Massoud Reza-vand, and Wolfgang Rauch.gpuSPHASE — a sharedmemory caching implemen-tation for 2D SPH usingCUDA. Computer PhysicsCommunications, 213(??):165–180, April 2017. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465516303666.

Wendykier:2010:PCH

[WN10] Piotr Wendykier and James G.Nagy. Parallel Colt: ahigh-performance Java li-brary for scientific comput-ing and image processing.ACM Transactions on Math-ematical Software, 37(3):31:1–31:22, September 2010.CODEN ACMSCU. ISSN0098-3500 (print), 1557-7295(electronic).

REFERENCES 449

Walker:1995:RBD

[WO95] David W. Walker andSteve W. Otto. Redistribu-tion of block-cyclic data dis-tributions using MPI. Tech-nical Report ORNL/TM-12999, Oak Ridge NationalLaboratory, Knoxville, TN,USA, June 1995. iii + 20pp. URL http://www.epm.ornl.gov/~walker/mpi/redistribution.ps.Z.

Walker:1996:RBC

[WO96] D. W. Walker and S. W.Otto. Redistribution ofblock-cyclic data distribu-tions using MPI. Concur-rency: practice and expe-rience, 8(9):707–728, Nov-ember 1996. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract?ID=23305.

Winstanley:1997:PDP

[WO97] N. Winstanley and J. O’Donnell.Parallel distributed pro-gramming with Haskell+PVM.Lecture Notes in ComputerScience, 1300:670–??, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Wang:2009:MPM

[WO09] Zheng Wang and MichaelF. P. O’Boyle. Map-ping parallelism to multi-cores: a machine learningbased approach. ACM SIG-

PLAN Notices, 44(4):75–84, April 2009. CODENSINODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Wolbers:1992:SPP

[Wol92] S. Wolbers. Software forparallel processing applica-tions. In Verkerk and Woj-cik [VW92], pages 111–116.ISBN 92-9083-049-2. LCCNQC783.3 C65 1992. CERNreport 92-07.

Worley:1996:MPE

[Wor96] P. H. Worley. MPI perfor-mance evaluation and char-acterization using a com-pact application benchmarkcode. In IEEE [IEE96i],pages 170–177. ISBN 0-8186-7533-0. LCCN QA76.642.M67 1996.

Weng:2007:OIS

[WPC07] Tien-Hsiung Weng, Ruey-Kuen Perng, and BarbaraChapman. OpenMP imple-mentation of SPICE3 cir-cuit simulator. Interna-tional Journal of Paral-lel Programming, 35(5):493–505, October 2007. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=35&issue=5&spage=493.

REFERENCES 450

Wagner:1994:CFD

[WPH94] S. (Siegfried) Wagner, J. (Jacques)Periaux, and E. H. (Ernst-Heinrich) Hirschel, editors.Computational fluid dynam-ics ’94: proceedings of theSecond European Compu-tational Fluid DynamicsConference, 5–8 September1994, Stuttgart, Germany.Wiley, New York, NY, USA,1994. ISBN 0-471-95063-7.LCCN QA911.E95 1994.

Wang:1995:PPG

[WPL95] Cho-Li Wang, V. K. Prasanna,and Young Won Lim. Par-allelization of perceptualgrouping on distributedmemory machines. In Can-toni et al. [CLM+95], pages323–330. ISBN 0-8186-7134-3. LCCN QA76.9.A73W6751995. IEEE catalog no.95TB8093.

Wu:2001:PCS

[WR01] Guang Jun Wu and RobertRoy. Parallelization of char-acteristics solvers for 3Dneutron transport. Lec-ture Notes in Computer Sci-ence, 2131:344–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2131/21310344.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2131/21310344.pdf.

Worsch:2002:BCM

[WRA02] Thomas Worsch, Ralf Reuss-ner, and Werner Augustin.On benchmarking collec-tive MPI operations. Lec-ture Notes in Computer Sci-ence, 2474:271–??, 2002.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.de/link/service/series/0558/bibs/2474/24740271.htm; http://link.springer.de/link/service/series/0558/papers/2474/24740271.pdf.

Wang:2016:LLA

[WRSY16] Jin Wang, Norm Rubin, Al-bert Sidelnik, and SudhakarYalamanchili. LaPerm:locality aware schedulerfor dynamic parallelism onGPUs. ACM SIGARCHComputer Architecture News,44(3):583–595, June 2016.CODEN CANED2. ISSN0163-5964 (print), 1943-5851(electronic).

Wisniewski:1999:SME

[WSN99] Len Wisniewski, Brad Smis-loff, and Nils Nieuwejaar.Sun MPI I/O: Efficient I/Ofor parallel applications. InACM [ACM99], page ??

West:1995:AVV

[WST95] J. E. West, M. M. Stephens,and L. H. Turcotte. Adap-tation of volume visualiza-tion techniques to MIMD ar-chitectures using MPI. In

REFERENCES 451

IEEE [IEE95j], pages 147–156. ISBN 0-8186-6895-4.LCCN QA76.58 .S34 1994.

Wu:2011:PCH

[WT11] Xingfu Wu and Valerie Tay-lor. Performance charac-teristics of hybrid MPI/OpenMP implementationsof NAS parallel benchmarksSP and BT on large-scalemulticore supercomputers.ACM SIGMETRICS Per-formance Evaluation Re-view, 38(4):56–62, March2011. CODEN ???? ISSN0163-5999 (print), 1557-9484(electronic).

Wu:2012:PCH

[WT12] Xingfu Wu and Valerie Tay-lor. Performance charac-teristics of hybrid MPI/OpenMP implementations ofNAS Parallel BenchmarksSP and BT on large-scalemulticore clusters. TheComputer Journal, 55(2):154–167, February 2012.CODEN CMPJA6. ISSN0010-4620 (print), 1460-2067(electronic). URL http://comjnl.oxfordjournals.org/content/55/2/154.full.pdf+html.

Wang:2014:IPD

[WTFO14] Zheng Wang, Georgios Tour-navitis, Bjorn Franke, andMichael F. P. O’boyle.Integrating profile-drivenparallelism detection andmachine-learning-based map-

ping. ACM Transactions onArchitecture and Code Op-timization, 11(1):2:1–2:??,February 2014. CODEN???? ISSN 1544-3566(print), 1544-3973 (elec-tronic).

Worringen:2003:FPN

[WTR03] Joachim Worringen, Jes-per Larson Traff, and Hu-bert Ritzdorf. Fast paral-lel non-contiguous file ac-cess. In ACM [ACM03],page ?? ISBN 1-58113-695-1. LCCN ???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10722#0; http://www.sc-conference.org/sc2003/paperpdfs/pap319.pdf.

Waidyasooriya:2017:OBF

[WTTH17] Hasitha Muthumala Waidya-sooriya, Yasuhiro Takei,Shunsuke Tatsumi, andMasanori Hariyama. OpenCL-based FPGA-platform forstencil computation andits optimization methodol-ogy. IEEE Transactionson Parallel and DistributedSystems, 28(5):1390–1402,May 2017. CODEN ITD-SEO. ISSN 1045-9219(print), 1558-2183 (elec-tronic)ITDSEO. URLhttps://www.computer.org/csdl/trans/td/2017/05/07582502-abs.html.

Wu:1999:MCC

[Wu99] P.-Y. Wu. Minimum com-

REFERENCES 452

munication cost fractal im-age compression on PVM.In Dongarra et al. [DLM99],pages 434–441. ISBN 3-540-66549-8 (softcover). ISSN0302-9743 (print), 1611-3349(electronic). LCCN QA76.58E973 1999.

Wong:2011:EMS

[WWFT11] Hon-Cheng Wong, Un-HongWong, Xueshang Feng, andZesheng Tang. Efficientmagnetohydrodynamic sim-ulations on graphics pro-cessing units with CUDA.Computer Physics Com-munications, 182(10):2132–2160, October 2011. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465511001676.

Wilson:1996:SMS

[WWZ+96] G. C. Wilson, T. H. Wood,J. L. Zyskind, J. W. Sulhoff,J. E. Johnson, T. Tanbun-Ek, and P. A. Morton.SBS and MPI suppressionin analogue systems withintegrated electroabsorptionmodulator/DFB laser trans-mitters. Electronics Let-ters, 32(16):1502–1504, ????1996. CODEN ELLEAK.ISSN 0013-5194 (print),1350-911X (electronic).

Wu:2012:DPL

[WYLC12] Chao-Chin Wu, Chao-TungYang, Kuan-Chou Lai, and

Po-Hsun Chiu. Designingparallel loop self-schedulingschemes using the hybridMPI and OpenMP program-ming model for multi-coregrid systems. The Jour-nal of Supercomputing, 59(1):42–60, January 2012.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=59&issue=1&spage=42.

Wang:2016:MMF

[WZHZ16] Zeke Wang, Shuhao Zhang,Bingsheng He, and WeiZhang. Melia: A MapRe-duce framework on OpenCL-based FPGAs. IEEE Trans-actions on Parallel and Dis-tributed Systems, 27(12):3547–3560, December 2016.CODEN ITDSEO. ISSN1045-9219 (print), 1558-2183(electronic)ITDSEO. URLhttps://www.computer.org/csdl/trans/td/2016/12/07425227-abs.html.

Wang:2008:PIM

[WZWS08] Kun Wang, Yu Zhang,Huayong Wang, and XiaoweiShen. Parallelization of IBMMambo system simulator infunctional modes. OperatingSystems Review, 42(1):71–76, January 2008. CODENOSRED8. ISSN 0163-5980(print), 1943-586X (elec-tronic).

REFERENCES 453

Xu:1995:IPP

[XF95] H. Xu and T. W. Fisher. Im-proving PVM performanceusing ATOMIC user-levelprotocol. In Alnuweiri andHamdi [AH95], pages 108–117. ISBN 0-8186-7124-6.LCCN TK5105.5 .H56 1995.

Xu:1996:MCO

[XH96] Zhiwei Xu and Kai Hwang.Modeling communicationoverhead: MPI and MPLperformance on the IBMSP2. IEEE parallel anddistributed technology: sys-tems and applications, 4(1):9–24, Spring 1996. CODENIPDTEX. ISSN 1063-6552(print), 1558-1861 (elec-tronic).

Xue:2009:MSR

[XLW+09] Ruini Xue, Xuezheng Liu,Ming Wu, Zhenyu Guo,Wenguang Chen, WeiminZheng, Zheng Zhang, andGeoffrey Voelker. MPIWiz:subgroup reproducible re-play of MPI applications.ACM SIGPLAN Notices,44(4):251–260, April 2009.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Xiong:1996:BID

[XWZS96] Jianxin Xiong, DingxingWang, Weimin Zheng, andMeiming Shen. BUSTER:an integrated debuggerfor PVM. In IEEE

[IEE96d]. ISBN 0-7803-3529-5 (softbound), 0-7803-3530-9 (microfiche). LCCNQA76.58.I33 1996. IEEEcatalog number 96TH8204.

Xu:2013:PMO

[XXL13] Shiming Xu, Wei Xue, andHai Xiang Lin. Perfor-mance modeling and opti-mization of sparse matrix-vector multiplication onNVIDIA CUDA platform.The Journal of Super-computing, 63(3):710–721,March 2013. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-011-0626-0;http://link.springer.

com/content/pdf/10.1007/s11227-011-0626-0.

Yelon:1993:PTS

[Y+93] W. B. Yelon et al., editors.Proceedings of the Thirty-seventh Annual Conferenceon Magnetism and Mag-netic Materials: December1–4, 1992, Houston, Texas,volume 73(10) of Journalof Applied Physics. Amer-ican Institute of Physics,Woodbury, NY, USA, May1993. CODEN JAPIAU.ISBN 1-56396-212-8. ISSN0021-8979 (print), 1089-7550 (electronic), 1520-8850.LCCN QC753 .C748 1990.Two volumes.

REFERENCES 454

Yazdanpanah:2015:PHR

[YAJG+15] Fahimeh Yazdanpanah, Car-los Alvarez, Daniel Jimenez-Gonzalez, Rosa M. Badia,and Mateo Valero. Picos:a hardware runtime archi-tecture support for OmpSs.Future Generation Com-puter Systems, 53(??):130–139, December 2015. CO-DEN FGSEVI. ISSN 0167-739X (print), 1872-7115(electronic). URL http://www.sciencedirect.com/science/article/pii/S0167739X14002702.

Yan:1994:PTA

[Yan94] J. C. Yan. Performance tun-ing with AIMS — an Auto-mated Instrumentation andMonitoring System for mul-ticomputers. In Hesham andShriver [HS94], pages 625–633. ISBN 0-8186-5060-5.ISSN 1060-3425. LCCN ????IEEE catalog no. 94TH0607-2.

Yang:2014:PMI

[YBMCB14] Chaoran Yang, WesleyBland, John Mellor-Crummey,and Pavan Balaji. Portable,MPI-interoperable CoarrayFortran. ACM SIGPLANNotices, 49(8):81–92, Au-gust 2014. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Ying:2003:NPK

[YBZL03] Lexing Ying, George Biros,Denis Zorin, and Harper

Langston. A new paral-lel kernel-independent fastmultipole method. InACM [ACM03], page ??ISBN 1-58113-695-1. LCCN???? URL http://www.sc-conference.org/sc2003/inter_cal/inter_cal_detail.php?eventid=10707#2; http://www.sc-conference.org/sc2003/paperpdfs/pap166.pdf.

Yalamanchilli:1998:CPJ

[YC98] Narendar Yalamanchilli andWilliam Cohen. Com-munication performance ofJava based Parallel Vir-tual Machines. In ACM[ACM98a], page ?? ISBN???? LCCN ????URL http://www.cs.ucsb.edu/conferences/java98/papers/passing.pdf; http://www.cs.ucsb.edu/conferences/java98/papers/passing.ps. Possibly unpublished,except electronically.

Yang:2014:HPD

[YCL14] Luobin Yang, Steve C. Chiu,and Wei-Keng Liao. Highperformance data cluster-ing: a comparative anal-ysis of performance forGPU, RASC, MPI, andOpenMP implementations.The Journal of Supercom-puting, 70(1):284–300, Oc-tober 2014. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.

REFERENCES 455

springer.com/article/10.1007/s11227-013-0906-y.

Yu:2013:AGA

[YEG+13] Zhibin Yu, Lieven Eeck-hout, Nilanjan Goswami,Tao Li, Lizy John, Hai Jin,and Chengzhong Xu. Ac-celerating GPGPU archi-tecture simulation. ACMSIGMETRICS PerformanceEvaluation Review, 41(1):331–332, June 2013. CO-DEN ???? ISSN 0163-5999(print), 1557-9484 (elec-tronic).

Yoon:1996:WBP

[YG96] D.-K. Yoon and J.-L. Gau-diot. Worker-based parallelcomputing on PVM. LectureNotes in Computer Science,1123:506–??, ???? 1996.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Yang:2014:IMP

[YGH+14] Xu Yang, Deyuan Guo,Hu He, Haijing Tang, andYanjun Zhang. An im-plementation of Message-Passing Interface over Vx-Works for real-time embed-ded multi-core systems. TheComputer Journal, 57(11):1756–1764, November 2014.CODEN CMPJA6. ISSN0010-4620 (print), 1460-2067(electronic). URL http://comjnl.oxfordjournals.org/content/57/11/1756.

Yetongnon:1996:PII

[YH96] K. Yetongnon and S. Hariri,editors. Proceedings of theISCA International Con-ference. Parallel and Dis-tributed Computing Sys-tems: Dijon, France, 25–27 September 1996 (PDCS’96: 9th). IEEE ComputerSociety Press, 1109 SpringStreet, Suite 300, SilverSpring, MD 20910, USA,1996. ISBN ???? LCCN????

Yero:2001:JOO

[YHGL01] Eduardo J. H. Yero, MarcoA. A. Henriques, Javier R.Garcıa, and Alina C. Leyva.JOINT: An object ori-ented message passing in-terface for parallel pro-gramming in Java. Lec-ture Notes in Computer Sci-ence, 2110:637–??, 2001.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2110/21100637.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2110/21100637.pdf.

Yang:2011:HCO

[YHL11] Chao-Tung Yang, Chih-LinHuang, and Cheng-Fang Lin.Hybrid CUDA, OpenMP,and MPI parallel program-ming on multicore GPU clus-ters. Computer Physics

REFERENCES 456

Communications, 182(1):266–269, January 2011. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465510002262.

Yuasa:1996:RPG

[YKI+96] F. Yuasa, S. Kawabata,T. Ishikawa, D. Perret-Gallix, and T. Kaneko. Run-ning PVM-GRACE on work-station clusters. LectureNotes in Computer Science,1156:335–??, ???? 1996.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

YarKhan:2017:PPN

[YKLD17] Asim YarKhan, JakubKurzak, Piotr Luszczek, andJack Dongarra. Portingthe PLASMA numerical li-brary to the OpenMP stan-dard. International Jour-nal of Parallel Programming,45(3):612–633, June 2017.CODEN IJPPE5. ISSN0885-7458 (print), 1573-7640(electronic).

Yang:2009:DBM

[YL09] Chao-Tung Yang and Kuan-Chou Lai. A directive-based MPI code genera-tor for Linux PC clus-ters. The Journal of Su-percomputing, 50(2):177–207, November 2009. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484

(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=50&issue=2&spage=177.

Yang:2016:HTM

[YLC16] Fan Yang, Jinfeng Li, andJames Cheng. Husky: to-wards a more efficient andexpressive distributed com-puting framework. Proceed-ings of the VLDB Endow-ment, 9(5):420–431, January2016. CODEN ???? ISSN2150-8097.

Yan:2013:SFS

[YLZ13] Shengen Yan, GuopingLong, and Yunquan Zhang.StreamScan: fast scan al-gorithms for GPUs withoutglobal barrier synchroniza-tion. ACM SIGPLAN No-tices, 48(8):229–238, August2013. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). PPoPP ’13Conference proceedings.

Yalamov:1997:BRT

[YM97] Plamen Y. Yalamov andSvetozar Margenov. Bookreviews: Two books on MPI:Parallel Programming withMPI; MPI: The CompleteReference (2nd printing).IEEE Concurrency, 5(4):81, October/December 1997.CODEN IECMFX. ISSN1092-3063 (print), 1558-0849(electronic). URL http:

REFERENCES 457

//dlib.computer.org/pd/books/pd1997/pdf/p4080.pdf.

Yilmaz:2011:RMS

[YMYI11] Erdal Yilmaz, Eray Molla,Cansin Yildiz, and VeysiIsler. Realistic model-ing of spectator behav-ior for soccer videogameswith CUDA. Computersand Graphics, 35(6):1063–1069, December 2011. CO-DEN COGRD2. ISSN0097-8493 (print), 1873-7684(electronic). URL http://www.sciencedirect.com/science/article/pii/S0097849311001476.

Yi:1994:PID

[YPA94] Sung Yi, K. H. Pierson,and M. F. Ahmad. Par-allel implementation of dy-namic simulation to filamen-tary composite structureswith general rate dependentdamping. Computing sys-tems in engineering: an in-ternational journal, 5(4-6):469–477, August-December1994. CODEN COSEEO.ISSN 0956-0521.

Yilmaz:2009:HPC

[YPAE09] E. Yilmaz, R. U. Payli, H. U.Akay, and A. Ecer. Hybridparallelism for CFD simula-tions: Combining MPI withOpenMP. In Tuncer et al.[TGEM09], pages 401–408.CODEN LNCSA6. ISBN 3-540-92743-3 (print), 3-540-92744-1 (e-book). ISSN

1439-7358. LCCN ???? URLhttp://link.springer.com/content/pdf/10.1007/978-3-540-92744-0_50. ParallelCFD 2007 was held in An-talya, Turkey, from May 21to 24, 2007.

You:1995:EIM

[YPZC95] J. You, E. Pissaloux, W. P.Zhu, and H. A. Cohen. Effi-cient image matching: a hi-erarchical Chamfer matchingscheme via distributed sys-tem. Real-Time Imaging, 1(4):245–259, October 1995.CODEN REIMFQ. ISSN1077-2014.

Young:1993:PEN

[YS93] Y.-H. Young and K. Siko-rski. Performance evaluationof network programming en-vironments. In Mudge et al.[MMH93], pages 106–107(vol. 2). ISBN 0-8186-3230-5. LCCN ???? Four vol-umes. IEEE catalog number93TH0501-7.

Yuan:2012:PCS

[YSL+12] Zhiyong Yuan, Weixin Si,Xiangyun Liao, ZhaoliangDuan, Yihua Ding, and Jian-hui Zhao. Parallel comput-ing of 3D smoking simula-tion based on OpenCL het-erogeneous platform. TheJournal of Supercomputing,61(1):84–102, July 2012.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/

REFERENCES 458

openurl.asp?genre=article&issn=0920-8542&volume=61&issue=1&spage=84.

Young-S:2017:OGI

[YSMA+17] Luis E. Young-S., PaulsamyMuruganandam, Sadhan K.Adhikari, Vladimir Lon-car, Dusan Vudragovic, andAntun Balaz. OpenMPGNU and Intel Fortran pro-grams for solving the time-dependent Gross–Pitaevskiiequation. Computer PhysicsCommunications, 220(??):503–506, November 2017.CODEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465517302321.

Yu:2005:HPB

[YSP+05] Weikuan Yu, Sayantan Sur,Dhabaleswar K. Panda,Rob T. Aulwes, and Rich L.Graham. High perfor-mance broadcast supportin LA-MPI over quadrics.The International Journal ofHigh Performance Comput-ing Applications, 19(4):453–463, Winter 2005. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/19/4/453.full.pdf+html.

Yeh:2017:PFG

[YSS+17] Tsung Tai Yeh, Amit Sabne,Putt Sakdhnagool, RudolfEigenmann, and Timothy G.

Rogers. Pagoda: Fine-grained GPU resource vir-tualization for narrow tasks.ACM SIGPLAN Notices, 52(8):221–234, August 2017.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Yang:2008:DPL

[YST08] Chao-Tung Yang, Wen-Chung Shih, and Shian-Shyong Tseng. Dynamicpartitioning of loop iter-ations on heterogeneousPC clusters. The Jour-nal of Supercomputing, 44(1):1–23, April 2008. CO-DEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0920-8542&volume=44&issue=1&spage=1.

Young-S:2016:OFP

[YSVM+16] Luis E. Young-S., Dusan Vu-dragovic, Paulsamy Muru-ganandam, Sadhan K. Ad-hikari, and Antun Balaz.OpenMP Fortran and C pro-grams for solving the time-dependent Gross–Pitaevskiiequation in an anisotropictrap. Computer PhysicsCommunications, 204(??):209–213, July 2016. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http:/

REFERENCES 459

/www.sciencedirect.com/science/article/pii/S001046551630073X.

Yan:2014:OMB

[YSWY14] Xin Yan, Xiaohua Shi, LinaWang, and Haiyan Yang. AnOpenCL micro-benchmarksuite for GPUs and CPUs.The Journal of Supercom-puting, 69(2):693–713, Au-gust 2014. CODEN JO-SUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-014-1112-2.

Yoshinaga:2012:DBM

[YTH+12] Kazumi Yoshinaga, YuichiTsujita, Atsushi Hori, MikikoSato, and Mitaro Namiki.Delegation-based MPI com-munications for a hy-brid parallel computer withmany-core architecture. Lec-ture Notes in ComputerScience, 7490:47–56, 2012.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_10/.

Yam-Uicab:2017:FHT

[YULMTS+17] R. Yam-Uicab, J. L. Lopez-Martinez, J. A. Trejo-Sanchez, H. Hidalgo-Silva,and S. Gonzalez-Segura. Afast Hough transform al-gorithm for straight linesdetection in an image us-ing GPU parallel computing

with CUDA-C. The Journalof Supercomputing, 73(11):4823–4842, November 2017.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic).

Yang:2011:PBP

[YWC11] Chao-Tung Yang, Chao-Chin Wu, and Jen-HsiangChang. Performance-basedparallel loop self-schedulingusing hybrid OpenMP andMPI programming on mul-ticore SMP clusters. Con-currency and Computation:Practice and Experience, 23(8):721–744, June 10, 2011.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Younge:2015:SHP

[YWCF15] Andrew J. Younge, John PaulWalters, Stephen P. Crago,and Geoffrey C. Fox. Sup-porting high performancemolecular dynamics in virtu-alized clusters using IOMMU,SR-IOV, and GPUDirect.ACM SIGPLAN Notices, 50(7):31–38, July 2015. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Yonezawa:1995:IED

[YWO95] Naoki Yonezawa, KoichiWada, and Motoko Obata.Implementation and evalu-ation of distributed shareddata objects on a work-

REFERENCES 460

station cluster. In IEEE[IEE95e], pages 319–322.ISBN 0-7803-2553-2. LCCNTK 5101 A1 I34 1995. IEEEcatalog number 95CH35765.

You:2015:VFO

[YWTC15] Yi-Ping You, Hen-Jung Wu,Yeh-Ning Tsai, and Yen-Ting Chao. VirtCL: aframework for OpenCL de-vice abstraction and man-agement. ACM SIGPLANNotices, 50(8):161–172, Au-gust 2015. CODEN SIN-ODQ. ISSN 0362-1340(print), 1523-2867 (print),1558-1160 (electronic).

Yong:1995:SOM

[YX95] Dou Yong and Zhou Xing-ming. Super-object model:implementing shared mem-ory programming mode ondistributed memory multi-computers. Chinese Jour-nal of Computers, 18(7):481–487, July 1995. CODENJIXUDT. ISSN 0254-4164.

Yu:2012:SCC

[YYW+12] Fang Yu, Shun-Ching Yang,Farn Wang, Guan-ChengChen, and Che-Chang Chan.Symbolic consistency check-ing of OpenMP parallel pro-grams. ACM SIGPLAN No-tices, 47(5):139–148, May2012. CODEN SINODQ.ISSN 0362-1340 (print),1523-2867 (print), 1558-1160(electronic). LCTES ’12 pro-ceedings.

Yang:2014:CNR

[YZ14] Yi Yang and Huiyang Zhou.CUDA-NP: realizing nestedthread-level parallelism inGPGPU applications. ACMSIGPLAN Notices, 49(8):93–106, August 2014. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

You:1995:PIM

[YZPC95] J. You, W. P. Zhu, E. Pissa-loux, and H. A. Cohen.Parallel image matching ona distributed system. InNarashimhan [Nar95], pages870–873 (vol. 2). ISBN0-7803-2018-2 (paperback),0-7803-2019-0 (microfiche).LCCN QA76.6.I15 1995.Two volumes. IEEE catalogno. 95TH0682-5.

Zounmevo:2014:FRC

[ZA14] Judicael A. Zounmevo andAhmad Afsahi. A fast andresource-conscious MPI mes-sage queue mechanism forlarge-scale jobs. FutureGeneration Computer Sys-tems, 30(??):265–290, Jan-uary 2014. CODEN FG-SEVI. ISSN 0167-739X(print), 1872-7115 (elec-tronic). URL http://www.sciencedirect.com/science/article/pii/S0167739X13001489.

Zaza:2016:CBP

[ZAFAM16] Ayham Zaza, Abeeb A.Awotunde, Faisal A. Fairag,

REFERENCES 461

and Mayez A. Al-Mouhamed.A CUDA based parallelmulti-phase oil reservoir sim-ulator. Computer PhysicsCommunications, 206(??):2–16, September 2016. CO-DEN CPHCBZ. ISSN0010-4655 (print), 1879-2944(electronic). URL http://www.sciencedirect.com/science/article/pii/S0010465516300996.

Zahavi:2012:FTR

[Zah12] Eitan Zahavi. Fat-tree rout-ing and node ordering pro-viding contention free traf-fic for MPI global collectives.Journal of Parallel and Dis-tributed Computing, 72(11):1423–1432, November 2012.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic). URL http://www.sciencedirect.com/science/article/pii/S0743731512000305.

Zhong:2007:PPS

[ZAT+07] Wei Zhong, Gulsah Altun,Xinmin Tian, Robert Har-rison, Phang C. Tai, andYi Pan. Parallel proteinsecondary structure predic-tion schemes using Pthreadand OpenMP over hyper-threading technology. TheJournal of Supercomput-ing, 41(1):1–16, July 2007.CODEN JOSUED. ISSN0920-8542 (print), 1573-0484(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&

issn=0920-8542&volume=41&issue=1&spage=1.

Zdetsis:1994:PMD

[ZB94] A. D. Zdetsis and R. Biswas.A parallel molecular dynam-ics strategy for PVM. InTurchi and Gonis [TG94],pages 713–718. ISBN 0-306-44626-X. ISSN 0258-1221.LCCN TN690.S77 1994.

Zilli:1997:TBN

[ZB97] G. Zilli and L. Bergam-aschi. Truncated block New-ton and quasi-Newton meth-ods for sparse systems ofnonlinear equations. experi-ments on parallel platforms.Lecture Notes in ComputerScience, 1332:390–400, 1997.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic).

Zhu:2012:CDS

[ZBd12] Ke Zhu, Matthias Butenuth,and Pablo d’Angelo. Com-parison of dense stereo us-ing CUDA. Lecture Notesin Computer Science, 6554:398–410, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/content/pdf/10.1007/978-3-642-35740-4_31.

Zhao:2010:GMP

[ZC10] Kaiyong Zhao and XiaowenChu. GPUMP: a multiple-precision integer library for

REFERENCES 462

GPUs. In IEEE, edi-tor, IEEE 10th Interna-tional Conference on Com-puter and Information Tech-nology (CIT), 2010: June29, 2010–July 1, 2010,Bradford, West Yorkshire,UK, pages 1164–1168. IEEEComputer Society Press,1109 Spring Street, Suite300, Silver Spring, MD20910, USA, 2010. ISBN0-7695-4108-9 (print), 1-4244-7547-3. LCCN ????IEEE Computer Society Or-der Number E4108. BMSPart Number: CFP10355-CDR.

Zhang:1997:DED

[ZDD97] Xiaodong Zhang, Sandra G.Dykes, and Hong Deng.Distributed edge detec-tion: Issues and implemen-tations. IEEE Computa-tional Science & Engineer-ing, 4(1):72–82, January/March 1997. CODEN IS-CEE4. ISSN 1070-9924(print), 1558-190X (elec-tronic). URL http://dlib.computer.org/cs/books/cs1997/pdf/c1072.pdf;http://www.computer.org/cse/cs1998/c1072abs.htm.

Zhang:2001:PPV

[ZDR01] Xin Zhang, Lingli Ding,and Elke A. Rundensteiner.PVM: Parallel View Main-tenance under concurrentdata updates of distributedsources. Lecture Notes inComputer Science, 2114:

230–??, 2001. CODENLNCSD9. ISSN 0302-9743 (print), 1611-3349(electronic). URL http://link.springer-ny.com/link/service/series/0558/bibs/2114/21140230.htm;http://link.springer-

ny.com/link/service/series/0558/papers/2114/21140230.pdf.

Zhang:2004:PMV

[ZDR04] Xin Zhang, Lingli Ding, andElke A. Rundensteiner. Par-allel multisource view main-tenance. VLDB Journal:Very Large Data Bases, 13(1):22–48, January 2004.CODEN VLDBFR. ISSN1066-8888 (print), 0949-877X (electronic).

Zelek:1995:DPP

[Zel95] J. S. Zelek. Dynamicpath planning. In IEEE[IEE95a], pages 1285–1290(vol. 2). ISBN 0-7803-2559-1. LCCN TA168.I19 1995.Five volumes. IEEE catalogno. 95CH3576-7.

Zemla:1994:WTC

[Zem94] A. Zemla. Wavelet trans-forms computing on PVM.In Dongarra and Was-niewski [DW94], pages 534–546. ISBN 3-540-58712-8 (Berlin), 0-387-58712-8(New York). ISSN 0302-9743(print), 1611-3349 (elec-tronic). LCCN QA76.58.P35 1994. DM104.00.

REFERENCES 463

Zhou:1995:FMP

[ZG95a] H. Zhou and A. Geist.Faster message passing inPVM. In Alnuweiri andHamdi [AH95], pages 67–73.ISBN 0-8186-7124-6. LCCNTK5105.5 .H56 1995.

Zhou:1995:RMR

[ZG95b] Honbo Zhou and Al Geist.“receiver makes right” dataconversion in PVM. InIEEE [IEE95b], pages 458–464. ISBN 0-7803-2493-5,0-7803-2492-7, 0-7803-2494-3. LCCN TK7885.A1 I5671995. IEEE catalog no.95CH35751.

Zhou:1996:FMP

[ZG96] Honbo Zhou and Al Geist.Faster message passing inPVM. Technical report,Mathematical Sciences Sec-tion, Oak Ridge NationalLaboratory, Knoxville, TN,USA, 1996. 7 pp. URL http://www.epm.ornl.gov/~zhou/patm.ps.

Zhou:1998:LST

[ZG98] Honbo Zhou and Al Geist.LPVM: a step towardsmultithread PVM. Con-currency: practice andexperience, 10(5):407–416,April 25, 1998. CODENCPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract?ID=5385;http://www3.interscience.

wiley.com/cgi-bin/fulltext?ID=5385&PLACEBO=IE.pdf.

Zielinski:1994:PPS

[ZGC94] K. Zielinski, M. Gajecki,and G. Czajkowski. Par-allel programming systemsfor LAN distributed com-puting. In IEEE [IEE94b],pages 600–607. ISBN 0-8186-6952-7 (casebound), 0-8186-6950-0 (paperback), 0-8186-6951-9 (microfiche). LCCNTA1637.I25 1994. Threevolumes. IEEE catalog no.94CH35708.

Zu:1994:OSM

[ZGN94] Hong Zu, Ya-Dong Gui,and L. M. Ni. Opti-mal software multicast inwormhole-routed multistagenetworks. In IEEE [IEE94h],pages 703–712. ISBN 0-8186-6607-2, 0-8186-6605-6,0-8186-6606-4. ISSN 1063-9535. LCCN QA76.5 .S8941994. IEEE catalog number94CH34819.

Zheng:2006:PEA

[ZHK06] Gengbin Zheng, Chao Huang,and Laxmikant V. Kale.Performance evaluation ofautomatic checkpoint-basedfault tolerance for AMPIand Charm++. OperatingSystems Review, 40(2):90–99, April 2006. CODENOSRED8. ISSN 0163-5980(print), 1943-586X (elec-tronic).

REFERENCES 464

Zoraja:1999:SPD

[ZHS99] Ivan Zoraja, Hermann Hell-wagner, and Vaidy Sun-deram. SCIPVM: Paral-lel distributed computingon SCI workstation clus-ters. Concurrency: prac-tice and experience, 11(3):121–138, March 1999. CO-DEN CPEXEI. ISSN 1040-3108. URL http://www3.interscience.wiley.com/cgi-bin/abstract?ID=61003667;http://www3.interscience.

wiley.com/cgi-bin/fulltext?ID=61003667&PLACEBO=IE.pdf.

Zounmevo:2014:ESC

[ZKRA14] Judicael A. Zounmevo, DriesKimpe, Robert Ross, andAhmad Afsahi. Extreme-scale computing servicesover MPI: Experiences, ob-servations and features pro-posal for next-generationmessage passing interface.The International Journalof High Performance Com-puting Applications, 28(4):435–449, November 2014.CODEN IHPCFL. ISSN1094-3420 (print), 1741-2846 (electronic). URLhttp://hpc.sagepub.com/content/28/4/435.

Zaky:1996:PDT

[ZL96] Amr Zaky and Ted Lewis,editors. Program devel-opment tools and environ-ments for parallel and dis-tributed systems: Session;

28th Hawaii internationalconference on system sci-ences — 1995, volume 2of Kluwer International Se-ries in Software Engineering.Kluwer Academic PublishersGroup, Norwell, MA, USA,and Dordrecht, The Nether-lands, 1996. ISBN 0-7923-9675-8. LCCN QA76.58.T651996.

Zaki:1999:TSP

[ZLGS99] Omer Zaki, Ewing Lusk,William Gropp, and Debo-rah Swider. Toward scal-able performance visualiza-tion with Jumpshot. TheInternational Journal ofHigh Performance Comput-ing Applications, 13(3):277–288, Fall 1999. CODENIHPCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic).

Zhou:2012:DFD

[ZLL+12] Xu Zhou, Kai Lu, Xi-cheng Lu, Xiaoping Wang,and Baohua Fan. dMPI:Facilitating debugging ofMPI programs via de-terministic message pass-ing. Lecture Notes inComputer Science, 7513:172–179, 2012. CODENLNCSD9. ISSN 0302-9743(print), 1611-3349 (elec-tronic). URL http://link.springer.com/chapter/10.1007/978-3-642-35606-3_20/.

REFERENCES 465

Zhang:2017:DLN

[ZLP17] Jie Zhang, Xiaoyi Lu,and Dhabaleswar K. (DK)Panda. Designing localityand NUMA aware MPI run-time for nested virtualiza-tion based HPC cloud withSR–IOV enabled InfiniBand.ACM SIGPLAN Notices,52(7):187–200, July 2017.CODEN SINODQ. ISSN0362-1340 (print), 1523-2867(print), 1558-1160 (elec-tronic).

Zhu:2015:PIM

[ZLS+15] Xiangyuan Zhu, Kenli Li,Ahmad Salah, Lin Shi, andKeqin Li. Parallel implemen-tation of MAFFT on CUDA-enabled graphics hardware.IEEE/ACM Transactions onComputational Biology andBioinformatics, 12(1):205–218, January 2015. CODENITCBCY. ISSN 1545-5963(print), 1557-9964 (elec-tronic).

Zhai:2011:CVH

[ZLZ+11] Yan Zhai, Mingliang Liu,Jidong Zhai, Xiaosong Ma,and Wenguang Chen. Cloudversus in-house cluster: eval-uating Amazon cluster com-pute instances for runningMPI applications. In ACM[ACM11], pages 11:1–11:10.ISBN 1-4503-1139-3. LCCN????

Zollweg:1993:OP

[Zol93] J. A. Zollweg. Overviewof PVM. In Anonymous[Ano93e], pages 981–986.ISBN ???? ISSN 0254-6213.LCCN ????

Zarrelli:2006:EPE

[ZPI06] Roberto Zarrelli, MarioPetrone, and Angelo Ian-naccio. Enabling PVM toexploit the SCTP protocol.Journal of Parallel and Dis-tributed Computing, 66(11):1472–1479, November 2006.CODEN JPDCER. ISSN0743-7315 (print), 1096-0848(electronic).

Zambonelli:1996:EPP

[ZPLS96] F. Zambonelli, M. Pu-gassi, L. Leonardi, andN. Scarabottolo. Experi-ences on porting a ParallelObjects environment froma transputer network to aPVM-based system. In IEEE[IEE96g]. ISBN 0-8186-7376-1. LCCN QA76.58 .E971996. IEEE order numberPR07376.

Zheng:2011:GLO

[ZRQA11] Mai Zheng, Vignesh T.Ravi, Feng Qin, and GaganAgrawal. GRace: a low-overhead mechanism for de-tecting data races in GPUprograms. ACM SIG-PLAN Notices, 46(8):135–146, August 2011. CO-DEN SINODQ. ISSN0362-1340 (print), 1523-2867

REFERENCES 466

(print), 1558-1160 (elec-tronic). PPoPP ’11 Confer-ence proceedings.

Zhao:2012:ASO

[ZSG12] Xin Zhao, Gopalakrish-nan Santhanaraman, andWilliam Gropp. Adap-tive strategy for one-sidedcommunication in MPICH2.Lecture Notes in ComputerScience, 7490:16–26, 2012.CODEN LNCSD9. ISSN0302-9743 (print), 1611-3349(electronic). URL http://link.springer.com/chapter/10.1007/978-3-642-33518-1_7/.

Zarrabi:2015:GSA

[ZSK15] Amirreza Zarrabi, Khairul-mizam Samsudin, and Et-tikan K. Karuppiah. Grav-itational search algorithmusing CUDA: a case studyin high-performance meta-heuristics. The Journal ofSupercomputing, 71(4):1277–1296, April 2015. CODENJOSUED. ISSN 0920-8542(print), 1573-0484 (elec-tronic). URL http://link.springer.com/article/10.1007/s11227-014-1360-1.

Zoltani:2001:EPO

[ZSnH01] Csaba K. Zoltani, PunyamSatya-narayana, and DixieHisley. Evaluating perfor-mance of OpenMP and MPIon the SGI Origin 2000with benchmarks of realis-tic problem sizes. Parallel

and Distributed ComputingPractices, 4(4):??, December2001. CODEN ???? ISSN1097-2803.

Zouaoui:2017:CNG

[ZT17] Chakib Mustapha AnouarZouaoui and NasreddineTaleb. CL ARRAY: a newgeneric library of multi-dimensional containers forC++ compilers with ex-tension for OpenCL frame-work. Computer Languages,Systems and Structures, 50(??):53–81, December 2017.CODEN ???? ISSN1477-8424 (print), 1873-6866(electronic). URL http://www.sciencedirect.com/science/article/pii/S147784241630135X.

Zareski:1995:EPG

[ZWHS95] D. Zareski, B. Wade, P. Hub-bard, and P. Shirley. Ef-ficient parallel global il-lumination using densityestimation. In Useltonet al. [UCW95], pages 47–54, 104–105. ISBN 0-89791-774-1 (softbound) [in-valid checksum], 0-7803-3120-6 (microfiche). LCCNQA76.58.P3778 1995. ACMorder number 428957. IEEEComputer Society Press or-der number 95TB8134.

Zheng:2005:SBP

[ZWJK05] Gengbin Zheng, Terry Wilmarth,Praveen Jagadishprasad,and Laxmikant V. Kale.Simulation-based perfor-mance prediction for large

REFERENCES 467

parallel machines. Inter-national Journal of Paral-lel Programming, 33(2–3):183–207, June 2005. CO-DEN IJPPE5. ISSN 0885-7458 (print), 1573-7640(electronic). URL http://www.springerlink.com/openurl.asp?genre=article&issn=0885-7458&volume=33&issue=2&spage=183.

Zhang:2013:MPI

[ZWL13] Xiaohua Zhang, Sergio E.Wong, and Felice C. Light-stone. Message passing in-terface and multithreadinghybrid for parallel moleculardocking of large databaseson petascale high perfor-mance computing machines.Journal of ComputationalChemistry, 34(11):915–927,April 30, 2013. CODENJCCHDD. ISSN 0192-8651(print), 1096-987X (elec-tronic).

Zhu:2017:OAP

[ZWL+17] Huming Zhu, Yanfei Wu, PeiLi, Peng Zhang, Zhe Ji, andMaoguo Gong. An OpenCL-accelerated parallel immun-odominance clone selectionalgorithm for feature selec-tion. Concurrency and Com-putation: Practice and Expe-rience, 29(9), May 10, 2017.CODEN CCPEBO. ISSN1532-0626 (print), 1532-0634(electronic).

Zhu:1995:RTC

[ZWZ+95] Miaoliang Zhu, ChunmingWu, Youjun Zhang, Yi Jin,and Jie Li. A real-time and concurrent intel-ligent robotic system basedon multi-agent architecture.High Technology Letters, 5(10):20–24, October 1995.CODEN GTONE8. ISSN1002-0470.

Zhang:2005:ULC

[ZWZ05] Youhui Zhang, DongshengWong, and Weimin Zheng.User-level checkpoint and re-covery for LAM/MPI. Oper-ating Systems Review, 39(3):72–81, July 2005. CODENOSRED8. ISSN 0163-5980(print), 1943-586X (elec-tronic).

Zhuang:1995:PRS

[ZZ95] Xinglai Zhuang and JianpingZhu. Parallelizing a reser-voir simulator using MPI. InIEEE [IEE95j], pages 165–174. ISBN 0-8186-6895-4.LCCN QA76.58 .S34 1994.

Zeyao:2004:AMI

[ZZ04] Mo Zeyao and HuangZhengfeng. Application ofMPI-IO in parallel parti-cle transport Monte–Carlosimulation. Parallel Algo-rithms and Applications, 19(4):227–236, ???? 2004. CO-DEN PAAPEC. ISSN 1063-7192. URL http://www.informaworld.com/smpp/content~content=a714592658.

REFERENCES 468

Zheng:2014:IMS

[ZZG+14] Liang Zheng, Huai Zhang,Taras Gerya, Matthew Kne-pley, David A. Yuen, andYaolin Shi. Implementa-tion of a multigrid solveron a GPU for Stokes equa-tions with strongly variableviscosity based on Matlaband CUDA. The Interna-tional Journal of High Per-formance Computing Appli-cations, 28(1):50–60, Febru-ary 2014. CODEN IH-PCFL. ISSN 1094-3420(print), 1741-2846 (elec-tronic). URL http://hpc.sagepub.com/content/28/1/50.full.pdf+html.

Zhu:2015:PML

[ZZZ+15] Leqing Zhu, Yadong Zhou,Daxing Zhang, DadongWang, Huiyan Wang, andXun Wang. Parallel multi-level 2D-DWT on CUDAGPUs and its applicationin ring artifact removal.Concurrency and Computa-tion: Practice and Experi-ence, 27(17):5188–5202, De-cember 10, 2015. CODENCCPEBO. ISSN 1532-0626(print), 1532-0634 (elec-tronic).

Recommended