57
1| Page Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy [email protected] What percentage of the facility CI was developed in-house versus by reusing existing solutions? Data systems and services, software/middleware and tools; Almost all data and software from Unidata are made available freely and openly and use open source licensing, so they can be reused. What external CI capabilities and services and/or externally developed tools (if any) does the facility use and who provides them? How were these tools identified and what criteria was used to select the tools? In addition to Unidata-developed software, we also provide externally developed software to our users. Such tools are identified based on the needs of the academic users and deliberated by our governing committees. List up to 3 of your most and least favorite CI components with a 1 sentence explanation for each. What aspects about the facility CI and its operation would you like to share as best practices? NetCDF is Unidata's most widely used software. The challenge is to provide support to a very large and diverse user base in almost every country in the world and all geoscience domains and sectors. The Local Data Manager and THREDDS Data Server applications also have a diverse user community in both operational and research settings. Providing support to an ever expanding community remains an ongoing challenge. Another challenge stems from the rapid growth in the volume of data, so a push approach will not not be sustainable. The increasing volume and diversity of data sources, coupled with the growing user base, also creates challenges in scaling and interoperability. What aspects of the facility CI and its operation do you see as challenges/gaps? Are there any pitfalls/mistakes you would like to share? What aspects would you be interested in outsourcing? As stated earlier, maintaining high quality of support to a growing and expanding user base in an era of shrinking or level budgets remains a challenge. There are also sociological and cultural challenges with changing technologies and adoption and use of new tools and services. Migration to cloud platforms poses challenges in developing business and cost recovery models. Key Risks

Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy [email protected] What percentage of the facility CI was developed in-house versus by reusing

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

1|P a g e

Affiliation Name E-mail

UCAR/Unidata MohanRamamurthy [email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

Datasystemsandservices,software/middlewareandtools;AlmostalldataandsoftwarefromUnidataaremadeavailablefreelyandopenlyanduseopensourcelicensing,sotheycanbereused.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

InadditiontoUnidata-developedsoftware,wealsoprovideexternallydevelopedsoftwaretoourusers.Suchtoolsareidentifiedbasedontheneedsoftheacademicusersanddeliberatedbyourgoverningcommittees.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

NetCDFisUnidata'smostwidelyusedsoftware.Thechallengeistoprovidesupporttoaverylargeanddiverseuserbaseinalmosteverycountryintheworldandallgeosciencedomainsand sectors. The Local DataManager and THREDDS Data Server applications also have adiverseusercommunity inbothoperationalandresearchsettings.Providingsupporttoaneverexpandingcommunityremainsanongoingchallenge.Anotherchallengestemsfromtherapid growth in the volume of data, so a push approachwill not not be sustainable. Theincreasingvolumeanddiversityofdata sources, coupledwith thegrowinguserbase,alsocreateschallengesinscalingandinteroperability.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Asstatedearlier,maintaininghighqualityofsupporttoagrowingandexpandinguserbaseinan era of shrinking or level budgets remains a challenge. There are also sociological andcultural challenges with changing technologies and adoption and use of new tools andservices. Migration to cloud platforms poses challenges in developing business and costrecoverymodels.

KeyRisks

Page 2: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

2|P a g e

The lackofNSF-fundedoperational cloud facilities forhostingdataanddelivering servicesremains a key gap. Also, most CI facilities are operating independently without muchcollaborationandpartnership.Inadditiontosharingknowledgeandexpertise,adiscussiononhowthefacilitiescanshareotherresourcesandinfrastructurewouldbevaluable.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Unidata provides education and training, through workshops in Boulder and at differentuniversities,onaregularbasistostudentsandfacultyonitsproductsandservices.Inaddition,Unidatahostsseveralinternsandmentorsthemeverysummer.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

ExplodingdatavolumesandscalingofCI tomeet thegrowingneeds remainsa challenge.Cybersecurityisanotherchallengingarea.EntrainingandretainingprofessionalsintoscientificCIareasisachallengegiventhatgraduatingstudentsandprofessionalsarepaidmuchmorebytheITandsoftwareindustrythatisthriving.

Doyouhaveanyothersuggestionsfortheworkshop?

Clearly stated goals for theworkshop andmore in-depth discussions on important issues(ratherthanmanyoverviewpresentations)islikelytoleadtomeaningfuloutcomes.

Page 3: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

3|P a g e

Affiliation Name E-mail

NEON TomGulbransen,Battelle [email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

3ingestionqueues,4transformationpipelines,2websites.Tailoredsounlikelytoreuse.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

6 external host partners for community distribution and limited data product creation.AeroNet,MG-Rast,SRA,BOLD,PhenoCam,AmeriFlux

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

Sensormessagingandcontrolchallengingatsitesinfrequentlyvisited.Ingestionqueueswhichcanaccommodatedozensofdatatypesandsources.APIswhichgreatlysimplypowerfuldataaccessandsharingoptions.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

ThefusionofclassicalITsystemsdevelopmentnowinntegralkyreliesoncodewrittenbynon-ITanalysts.Thevalueofthelatterwasunderestimatedinitially,andwillbeover-emphasizedgoingforwardduringcommunityengagement.

KeyRisks

Sensor unreliability is a risk addressed by engineering.User diversitywill create demandsbeyond the dev team capacity. Initial Ops period will reveal if/where/when/howcyberinfrastructuremayneedtoautomatemorechecksandeditsbility.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Page 4: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

4|P a g e

Lots of cyberinfrastructure recruitment and resultant learning curve climbing duringconstruction. Scientific cosers are being herded toward conventions to promote easierinteroperabilityandexpansionthroughexternalcontributionswhichcanbeevaluated.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Usercommunitytraceabilityandexpansionofuser'sdemands.

Doyouhaveanyothersuggestionsfortheworkshop?

Shareregistrantsinfo.

Page 5: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

5|P a g e

Affiliation Name E-mail

Ocean ObservatoryInitiative(OOI)

Ivan Rodero, RutgersUniversity

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

TheinfrastructureoftheCIhasbeendevelopedin-housefollowingindustrybestpractices.Itincludes thedata lifecyclemanagement system,and thenetworkand systemarchitecturedistributed across two geographically distributed data centers. The customized softwarestack,includingcoredatamanagementsystemanduserinterfacehasbeenalsodeveloped.TheCIarchitectureandbestpracticesareavailabletoothertoreuse.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

TheOOICIusesanumberofexternalservicesandtools,includinganApacheserverforrawdata delivery, a THREEDS server for asynchronous data product delivery, Alfresco fordocument configurationmanagementand shipboarddatadelivery, andanumberof toolssuchRedmineandConfluencefordocumentationandconfigurationmanagement,gerritandJenkinsforcontinuousintegration,andphpBBforforums.Thesetoolswereselectedbasedonrequirementsandprioritizingopensourcesolutions,whenneeded.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

1)On-demanddataproductdelivery:OOIprovidesuserswithagraphicaluserinterface(i.e.,OOINetdataportal)forplottinganddownloadingon-demanddataproducts.Theportalalsoprovidesaccesstolivevideoandotherdataproducts.2)Rawdataarchive:dataisavailablefordownloadin“raw”indicatesdataastheyarereceiveddirectlyfromtheinstrument,ininstrument-specificformat.3)Machine-to-machineAPI:aREFTfuluserinterfaceisavailabletoaccessOOICIprogrammaticallyusingauthenticationmechanisms.We’dliketosharethearchitectureoftheenterprise-levelinformationlifecyclemanagementsystem,includingnetworkingandmonitoringcomponentswhichuseindustrybestpractices.

Page 6: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

6|P a g e

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

TwoofthemostimportantchallengesoftheOOICIare1)evolvingrequirements(e.g.,datarates, services), 2) and integration of new components (e.g., new instruments). There arelessonslearntrelatedtotheimplementationof industrybestpracticesforthedeploymentandoperationofaproduction-levelCI.

KeyRisks

OneofthehighestrisksfortheOOICIisrelatedtotheuncertaintiesforkeepingthefundinglevel for operating and maintaining the core infrastructure, the software stack andfundamentalservices.Forexample, the lackofexpandingthestorage infrastructure in thefutureisarisk.Amitigationstepwasincludingexpandabletape-basestorageinfrastructureintheinformationlifecyclemanagementsystem.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

CI-relatedworkforcedevelopmentisatdifferentlevels.Ontheonehand,technicalpersonnelare engaged with continuous training on the technologies involved in CI (e.g. Palo Altotraining,DellCompellent,ApacheCassandra,etc.).Ontheotherhand,OOIengagedwithNSF-fundedCTSCforthedevelopmentofacomprehensivecyber-securityplan.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

New CI requirements/challenges in the next 5-10 are related to the expansion of the CInetworkwithnewinstruments,increasingdataratesandevolvingdatadeliverymechanisms.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 7: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

7|P a g e

Affiliation Name E-mail

NationalNanotechnologyCoordinatedInfrastructure(NNCI)

Azad Naeemi, GeorgiaInstituteofTechnology

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

Institutedeveloped components include a self-service firewallmanagement, and a sharedaccess model where institute purchased equipment is provided to faculty who in returnprovidesharedaccesstotheirpurchasedhardware.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

WeareactivelyimplementingtheOpenScienceGrid,Globus,scienceDMZ,andperfSONARfile and networking components. In addition,we are implementing Ohio SupercomputingCenter’sPBSTools,OpenXDMoDfromtheUniversityatBuffalo.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

1)Rapidlygrowingdatasources.Ourstoragesystemshavegrownexponentiallysince2009to8petabytes.2)Utilizationpatternsthataremanysmalljobs,i.e.highthroughputcomputing(HTC)vsthefewverylargemonolithicjobs(HPC).WeaimtofunnelthesetypesofworkloadstoOSG,andimplementhardwarededicatedtorunningOSGcomputation.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

KeyRisks

Notatthistime

Page 8: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

8|P a g e

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Wehireundergraduatestudents,contributetoLinuxClusterInstituteworkshopsandareintheprocessofdeployinganinstructionalcluster.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

As a major technological research institution, the Georgia Institute of Technology, whichincludesacademicunitsandtheGeorgiaTechResearchInstitute(GTRI),hasdirectexperiencewithmanyofthecurrentandemergingresearchchallengesfacingtoday's

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 9: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

9|P a g e

Affiliation Name E-mail

NHERI Tim Cockerill, University ofTexas - Texas AdvancedComputingCenter

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

NearlyalloftheCIcomponentsaredevelopedin-housebyTACCandaremadeavailableasopensourceingithub.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

WeusetheDjangowebframeworkbasedonourpreviousexperienceswiththisandotherframeworks.Wealsohavea local implementationof theFedoraDigitalObjectRepositoryManagementSystemforourarchivingourpublisheddata.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

TheDataDepotisourmostusedCIcomponent.Ourusershavealreadyuploadedmorethan16TBofdatainadditiontothe40TBwetransitionedinfromthepredecessorprojectNEES.Weallowallfiletypesandweencourageouruserstouploadanyandalldatatheyneedtodotheirresearch-wefeelthatnotrestrictingtheusersiskeytotheiradoptionofourCI.WeworkedwithMathworkstoacquireaMATLABlicensethatenablesallacademicuserstoaccessMATLABviaourCI.TheengineeringcommunityareheavyMATLABusers,andthishasalsohelpedwithadoption.WeimplementedJupyterNotebooksandareprovidingtrainingonhowtousethemalongwithbasicPythonscriptingskills.WeareseeingprettystronguptakeofJupyter.Itrunsprettyfastinthecloud,andusersarefindingittobeascapableasMATLAB.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Page 10: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

10|P a g e

Challenge: operation of a tightly-coupled operation across hemispheresIt is preliminary to speak of lessons lesson learned, as LSST is in construction. However,accurateanddetailedmodeltoeffectivelycommunicate,coordinateandmaintaintheabilitytotraceCIfeaturestotherequirementsandbusinessneed.IsanareaoffocuswhichLSSTfeelswillhelpmeetthischallenge.

KeyRisks

Forthisproject,sincetheCIisallatTACC,thereisnotmuchrisk.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

WeprovideroughlymonthlytrainingwebinarswhicharerecordedandthenmadeavailablepersistentlyonYouTube.Wealsohavesummerprogramsforhighschoolstudents-thisyeartheybuiltaninstrumentedmodel,experimentedwiththatmodelonashaketable,andthenanalyzedtheirresultsusingourCI.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Performanceisthepriority,sincewebdatatransferandremoteuseofinteractivetoolslikeMATLAB are slower than on a local laptop. Also expanded simulation and dataanalysis/visualizationcapabilitiesonthewebportalsothatwecaptureallresearchersinthiscommunity.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 11: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

11|P a g e

Affiliation Name E-mail

LSST DonPetraivck,NCSA-UIUCJeffKantor,WilliamO'Mullane

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

R:LSSTisinconstruction,butthefollowingareunderway,LSSThasfundedthedevelopmentof a significant, high bandwidth network between Chile and the United States. LSST isdevelopingQSERV,aspatiallyshareddatabasewhichisanticipatedtorequire40PBofdiskprovisioning,over250nodeby2025.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

-LSSTUsesHT-CONDORforthebasisofitsproductionsystem.HT-Condorisastandardinthoughputcomputing,isusedinLHCandtheDarkEnergysurvey.HTCondorsupportsthevariousbatchusecasesidentifiedinLSST.LSSThashadacollaborativeengagementwithHTCondorformanyyears.LSSThasusedXSEDEandBlueWatersduringitspre-constructionphasefordemonstrationsoffeasibilityofitsproductionsystem,andhasusedsimulationdatageneratedontheOpenScienceGrid.–Theseweretheobviouschoicesduestoagencysupportandavailability.LSSThasbuiltuponauthenticationandauthorizationsystemworkthatisalsoinuseinLIGO.Thereasonisthatthesystemsupportsavarietyofauthenticationandauthorizationprotocol,andinteroperatedwithIncommon.NationaleducationandresearchidentityfederationsareseenasusefulsourceofidentityinformationforLSST,wheretheclassofallUSandallChileanprofessionalastronomershavedatarights.LSST’sMasterInformationSecurityPlanwasdevelopedinConsultationwiththeCTSC.CTSCwasselecteddueitisknowledgeofcontemporarysecuritystandards,asappliedtoNSFprojects.LSST’sscienceuserinterfaceisbasedontheFireflyToolKitdevelopedatIPACatCaltech.ThisisacommonlyusedadvancedtoolkitusedwithinOpticalAstronomy.Rucio,acomponentdevelopedatCERNfortheLHCisbeingevaluatedforinternalfilesynchronization,asisPegasusfortheproductionworkflows.Bothofthesecomponentswereselectedduetotheirusewithsimilarusecasesinotherexperiments.

Page 12: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

12|P a g e

JupyterisafoundationalcomponenttosupportinternalqualityassessmentandtosupportexploitationofthedataattheUNandChileanLSSTDataAccessCenters.Jupyterisawell-supportedmethodofexposingaspectsofafacilityinastructuredwaytoalargegroupofusers.BROisuseforintrusiondetectionattheLSSTChileansites,andatNCSA.BROisselectedforusutilityinbeinganintrusiondetectionsystemwherelargevolumesofdataretransferredbetweensites,andsuetothebodyofexpertisewiththesystematNCSA

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

-LSSTUsesHT-CONDORforthebasisofitsproductionsystem.HT-Condorisastandardinthroughoutcomputing,isusedinLHCandtheDarkEnergysurvey.HTCondorsupportsthevariousbatchusecasesidentifiedinLSST.LSSThashadacollaborativeengagementwithHTCondorformanyyears.LSSThasusedXSEDEandBlueWatersduringitspre-constructionphasefordemonstrationsoffeasibilityofitsproductionsystem,andhasusedsimulationdatageneratedontheOpenScienceGrid.–Theseweretheobviouschoicesduestoagencysupportandavailability.LSSThasbuiltuponauthenticationandauthorizationsystemworkthatisalsoinuseinLIGO.Thereasonisthatthesystemsupportsavarietyofauthenticationandauthorizationprotocol,andinteroperatedwithIncommon.NationaleducationandresearchidentityfederationsareseenasusefulsourceofidentityinformationforLSST,wheretheclassofallUSandallChileanprofessionalastronomershavedatarights.LSST’sMasterInformationSecurityPlanwasdevelopedinConsultationwiththeCTSC.CTSCwasselecteddueitisknowledgeofcontemporarysecuritystandards,asappliedtoNSFprojects.LSST’sscienceuserinterfaceisbasedontheFireflyToolKitdevelopedatIPACatCaltech.ThisisacommonlyusedadvancedtoolkitusedwithinOpticalAstronomy.Rucio,acomponentdevelopedatCERNfortheLHCisbeingevaluatedforinternalfilesynchronization,asisPegasusfortheproductionworkflows.Bothofthesecomponentswereselectedduetotheirusewithsimilarusecasesinotherexperiments.JupyterisafoundationalcomponenttosupportinternalqualityassessmentandtosupportexploitationofthedataattheUNandChileanLSSTDataAccessCenters.Jupyterisawell-supportedmethodofexposingaspectsofafacilityinastructuredwaytoalargegroupofusers.BROisuseforintrusiondetectionattheLSSTChileansites,andatNCSA.BROisselectedforusutilityinbeinganintrusiondetectionsystemwherelargevolumesofdataretransferredbetweensites,andsuetothebodyofexpertisewiththesystematNCSA

Page 13: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

13|P a g e

1)UpgradingthenorthsouthnetworkfromLaSerena,ChiletoNCSAinthecontextofaMREFCproject.2) Dealingwith the evolution of processors, in particular the reduction of the amount ofmemory per core, and the need to increase the level of threading in LSST Codes.3)Selectingthetechnologiesneededtosupportendusersinthedataaccesscenter.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Challenge:operationofatightly-coupledoperationacrosshemispheresItispreliminarytospeakoflessonslessonlearned,asLSSTisinconstruction.However,accurateanddetailedmodeltoeffectivelycommunicate,coordinateandmaintaintheabilitytotraceCIfeaturestotherequirementsandbusinessneed.IsanareaoffocuswhichLSSTfeelswillhelpmeetthischallenge.

KeyRisks

Changes incomputingplatformsovertheremainingperiodofconstructionandoperationsthrough2034areaconcern.LSSThasdataprocessingaccessandarchivefacilities inthreecontinents. Foreachcontinentthepaceofsustainablechangewillvary. Forexample,weexpectcloudcomputingtolaginSouthAmerica.Theresponsetothesechallengesincludesprovidingsoftwareisolationlayers,forexampleKubernetes,whichcanbedeployedinlocallyprovisioned or in commercial systems.Wecurrentlyusecouldservicesforsoftwarebuildandtest.TheEPOcomponentofLSSThasa very large clouddeployment component. Our baseline thinking allows for use of cloudservicesfordisasterrecovery,foropportunisticbulkcomputing,andforelasticexpansionoftheUSDataAccesscenters.Ourbaselinemayevolveasconstructionproceeds.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Projectstaffattendworkshopsandconferences.AtNCSAsignificantworkinCIisperformedbyNCSAstaff.NCSAhasaprogramofworktodeveloptheHPCworkforce,includingrespondingtoNSFcallsforproposalsfortrainingCyberInfrastructureProfessionals.Additionally,NCSAhasaprogramofresearchandsupportingitsinfrastructure,includingoperationalsecuritygroup,supportfortheLinuxClusterInstitute(LCI),whichtrainsInfrastructureprofessionals.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Page 14: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

14|P a g e

KeepingtheCIeffortsinChileandtheintheUScoordinatedandwithaliketechnologybase.ChangesinCItechnologiesandhowCIisabsorbedbytheproject.LSSThasobligationstoprovidecomputingfacilitiesinChile,whereforexamplecloudfunctionalityisnotequivalenttothefunctionalityavailableintheUS.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 15: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

15|P a g e

Affiliation Name E-mail

NationalOpticalAstronomyObservatory(NOAO)

SeanMcManus

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

data reduction pipeline (DEC Community Pipeline); TADA (Telescope Automatic DataArchiver);yesthesetoolsaremostlyopen-source

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

Scientific Linux, IBM General Parallel File System, Puppet, Foreman, Libvirt, Django. Thecriteriausedtoselecttoolsvaries.Forsomeopen-sourcetools,thereisminimalinvestmentneededtotrysomething,andthereforedoesn'trequireaformalselectionprocess.Forpaidsoftware contracts, there is obviouslymore vetting by internal IT staff,management, andprocurement.Aspartofnormalvettingwetrytolookatwhatisworking/notworkingforotherpeerorganizationsinsideandoutsideofAURA.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

1) Mass storage: We require inexpensive storage on the multi-Petabyte scale to storeastronomydataproducts;2) Bandwidth: Reliable, fast bandwidth across continents is needed to move data fromtelescopetoarchive;3)Software:Thesoftwarestackmustmeetoperationalrequirementsbutalsobesustainableinsideflatorshrinkingbudgetenvelope.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Forsmalldepartments,itisdifficulttoachieveabalanceofexperienceversusmotivationandfamiliaritywithcuttingedgetools.Lowstaffturnovercanresultinstaffbeingsettledononeparticulartechnology,andlaggingbehindrecentdevelopmentsinIT.Ontheotherhand,it's

Page 16: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

16|P a g e

notcost-effectivetoreacttothelatest/greatestthingthatcomesouteveryyear.Abalanceofnewversusproventoolsmustbemade.

KeyRisks

workforcereductionduetobudgets,evenasmallone,couldhavesignificantimpact.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Webudgetforcontinuingeducation,butwhetherornotstaffparticipateisvoluntary

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

transitionfromNOAO/LSST/GeminitoNCOA

Doyouhaveanyothersuggestionsfortheworkshop?

n/a

Page 17: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

17|P a g e

Affiliation Name E-mail

LIGO StuartAnderson,Caltech [email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

Allofthefollowingin-houseCIcomponentsareavailableforreuse:*LIGODataReplicator(bulkdatatransfers)*MetadatadatabasesandtoolsdesignedforGWobservations*low-latencydatadistributiononlargeclusters*DataMonitoringTools*low-latencytransienteventalertsystem*NetworkDataServer*WebandMatlabbasedDataViewertools*GWDetectorstatusmonitoringservice*GWdetectionandparameterestimationpipelines*Libraryofgravitationalwavealgorithms*LIGOOpenScienceCenternotebooks*Jobaccountingsystem

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

*HTCondor/Pegasus/BOINC*OSG*Docker/Singularity/Shifter*CVMFS/StashCache/Xrootd/GridFTP*Shibboleth/Grouper/CILogon/Kerberos/LDAP/GSI*OracleHSM/ZFS/HDFS*GitHub/GitLab/Travis/Jenkins*JupyterHubThesetoolswherepredominantlyidentifiedbyfirstrecognizinganeedandthenchargingasmallgrouptoresearch(sometimesaself-forminggroup)toresearchwhatiscurrentlyavailable.Insomecasesthatgrouptakesasolutiontofullscaleprototype(builditandtheywillcome),andinothersthealternativesarepresentedtoaLIGOcomputingcommitteetoevaluatetheprosandconsfirst.andMatlabbasedDataViewertools*GWDetectorstatusmonitoringservice*GWdetectionandparameterestimationpipeline*Libraryofgravitationalwavealgorithms*LIGOOpenScienceCenternotebooks*Jobaccountingsystem

Page 18: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

18|P a g e

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

*IdentityandAccessManagementwasachallengeduringtheearlyphasesofLIGO,leadingtosignificantlossinproductivityduetounnecessarybarrierstoefficientaccesstoneededinformationandsystems.IntegratingShibboleth,Grouper,InCommon,andCILogonintoLIGO'sCIhasbeenagamechanger.InvestinginI&AMearlyoninaprojectishighlyrecommended.*IntheearlyyearsofLIGOattemptstouseOSGtorunLIGOdataanalysistasksfailed.Inthelastfewyearsthishasbecomeamajorsuccess,inpartduetomorematuretoolsformanagingdataintensiveworkflows(e.g.,Pegasus,CVMFS,andcontainerization),andinpartduetomorematuregravitationalwavedataanalysispipelines.*LIGOinitiallyinvestedinahomegrownjobexecutionenvironmentthatattemptedtominimizetheamountofcodeneededtobedevelopedbyscientistsperformingsearchesforgravitationalwaves..However,thatprovedinpracticetobeinsufficientlyflexibleandthependulumswungovertoallowingscientiststodeveloparbitrarya.outexecutablesmanagedbyHTCondor.Inhindsite,theoptimumwouldhavebeensomewherein-between.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

*IntegratingCIwithinternationalcollaboratorsremainsasignificantchallenge..OSGhasrecentlyprovidedamajorbreakthroughforprovidingauniforminterfacetoplanandexecuteLIGOworkflowsoninternationalcomputingresources.However,internationalfederatedI&AMremainsasignificantchallengeforLIGO.*FindingtherightsetofCItosupportbothtightlycontrolledproductiondataanalysisandallowingcreativenewideasbedevelopedisachallenge.

KeyRisks

* Funding for CI experts that support scientific personnel to use existing CI*SustainabilityofCIandbeingabletoeffectivelyidentifynewCIthatwillbeavailableinthelong-termbeforeinvestinglimitedinternalresources.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

*Sendingstudentstosummerschoolsandsimilartrainingopportunities.*Sendingprofessionalstafftoconferencesandworkshops.*Invitingexternalexpertstoprovidetrainingatinternalscientificmeetings.

Page 19: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

19|P a g e

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

*Inter-federationagreementsthatcomplywithinternationalprivacylawswhilestillreleasingenoughinformationtobeusefulforinternationalscientificcollaborations.*Trainingtheteachers.AsmostoftheworkforcecomesfromacademicresearchgroupshowdowetrainacademicfacultytobeabletotraintheirnewstudentstousemodernCI.*long-termstabilityofsoftwarepackaginganddistributionthatwillallowreproducibilityofscientificresultsonaninterestingtimescale.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 20: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

20|P a g e

Affiliation Name E-mail

LIGO AlbertLazzarini,Caltech

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

PleaseseewhitepapersubmittedbyStuartAndersonforallattendeesfromLIGO

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

PleaseseewhitepapersubmittedbyStuartAndersonforallattendeesfromLIGO

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

PleaseseewhitepapersubmittedbyStuartAndersonforallattendeesfromLIGO

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

PleaseseewhitepapersubmittedbyStuartAndersonforallattendeesfromLIGO

KeyRisks

PleaseseewhitepapersubmittedbyStuartAndersonforallattendeesfromLIGO

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

PleaseseewhitepapersubmittedbyStuartAndersonforallattendeesfromLIGO

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

PleaseseewhitepapersubmittedbyStuartAndersonforallattendeesfromLIGO

Doyouhaveanyothersuggestionsfortheworkshop?

Page 21: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

21|P a g e

What is the appropriate scale and relationship among large NSF computing facilities,computingfacilitiesthatarepartofe.g.,physicslargefacilitiesandMRIresourcesprovidedtoindividualcollaborationinstitutions?DoesNSFhaveapolicyonthese?

Page 22: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

22|P a g e

Affiliation Name E-mail

ARF JonC.Meyer,UCSanDiego

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

weareintheprocessofdevelopingdatadeliveryviamodernmessagequeueandwelcometheopportunitytocollaborateandhaveothersreuse.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

Some vendors' tools are used due the demand for certain types of data to be regularlyproducedduringaseagoingmission

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

Uninterrupted Internet connectivity. Research vessels at sea need consistent, reliablecommunicationpathstobeabletoproducescientificallyinterestingdatainneartorealtime.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

KeyRisks

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Somespecializedandgeneralcomputing-relatedtraining.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Page 23: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

23|P a g e

High-speed,realtimedeliveryofdatafromtheocean.Abilitytointeractwithfieldresearchersseamlesslyfrom

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 24: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

24|P a g e

Affiliation Name E-mail

Gemini Chris Morrison, GeminiObservatory

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

none(notethatwedonotincludesoftwareinourdefinitionofCI)

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

Googleappsforbusiness;Amazonwebservices;zoomconferencingservices.Identifiedinallcasesbyindustrysurveys&bestpractices;selectionviarequirementsanalysis,insomecasesusabilityanalyses,andvalueformoney.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

Challenges:1.Netappstorage.Largeimpactifthisredundantsystemfails.2.Backupstorageinfrastructure.Expensive,complexandrequiressignificantexpertise.3.Remoteaccessconnectivity.Bringsusermanagementandsecurityconcerns.Bestpractices:1.Geminiinfrastructurehassignificantredundancy,asaresultoflessonslearnedinpreviousfailures.2.Useofcloudservice(AWS)forlarge-scaledataarchivingandaccess.3.CIreplacementpolicyonequipmentatendofwarranty.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Challenges&gaps:seeabove.Lessonstoshare:Redundancy(storage,networking,VMclusters,connectivity).Lessonstolearninthemeeting:offsitestoragemethods&dataretention.

Page 25: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

25|P a g e

KeyRisks

Dependencies:AccesstoGoogle(forbusinessapplications);AWS(forarchivestorage)-lowlikelihood,highimpactrisks.Mitigation:RedundantnetworklinksinHawaiiandChile.BackupplanforanextendedoutageofAWSwouldbetobringthearchiveinhousetemporarilyuntilservicerestored.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Enterprisespecialisttrainingcoursesandcertifications.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Challenge:IntegrationofGeminiCIintoalargerCenter,andaligningserviceswithotherProgramsinthatCenter.WedonotseesignificantchangesinthetechnicalchallengeforGeminiCI,asthetelescopeswillnotfundamentallychangethewaytheyoperateatnight.

Doyouhaveanyothersuggestionsfortheworkshop?

1.FutureroleofNSFincoordinatingorprovidingCIthroughgrantfunding.2.Large-scalesciencedatastorageandaccessviacloudservices-bestpractices.

Page 26: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

26|P a g e

Affiliation Name E-mail

DKIST,NSO Steve Berukoff and EricCross,NSO

[email protected]@nso.edu

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

FortheDKISTtelescopeBuiltIn-House•InstrumentControlSystems•FacilityControlSystems•Telescope•Enclosure•Environmental•AdaptiveOptics,WavefrontControl•Coude•SafetySystems•AretheseusefultootherCIorganizations?Uncleariftheywouldbeusefulelsewhere.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

•OpenSourcesoftware;givenbudgetaryconstraintsDKISTCIisleveragingOpenSourcewhereapplicable.ThedeploymentofOpenSourceiscenteredwithintheInfrastructurelayers.•GlobusGridFTPwillbeustilizedtomovedatafromthetelescopeonMauitotheBoulderDataCenter.•CEPHobjectstorageforlong-termdatastorage

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

•ComplexityofDKISTInstrumentshasdrivenaflexiblebutcustomizableapproachtoinstrumentcontrols.•DatanetworkmanagementhasprovidedachallengetoDKIST.WehavenetworkInterconnectsbetweentheDKISTFacilityonMaui,theUniversityofHawaii,theUniversityofColorado,andalsoleveragingInternet2.•ComplexityofDKISTInstrumentshasdrivenaflexiblebutcustomizableapproachto

Page 27: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

27|P a g e

instrumentcontrols.•DatanetworkmanagementhasprovidedachallengetoDKIST.WehavenetworkInterconnectsbetweentheDKISTFacilityonMaui,theUniversityofHawaii,theUniversityofColorado,andalsoleveragingInternet2.•ThecombinationofPetascaledatavolumeunderaveryconstrainedbudgetchallengestheabilityoftheCItosupportitscommunity.BestPractices•BecauseofthedistributednatureoftheprogramwithmultipleproductownersfollowingSystemsEngineeringpracticesfordevelopingeffectiverequirementsandinterfacecontrols.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

• Ensuring the end to end CI design from Facility Control, Data Acquisition and end-userdistributionisbuilt-intotheoveralldesignandbudget.

KeyRisks

•Operationalfundinglevelsshouldallowappropriatemaintenancetobecompletedwithappropriatepersonnel.•Long-Termoperationallifetimesmandateavoidanceofmonolithicarchitectures.Mitigation•AbilitytobuildinfrastructurebuildingblocksbydevelopingaroadmapforDIBBSawards.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

•Professionaldevelopmentconferences

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

•Ensurewecandeliverthescopethatweneedtosupportourcommunity.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 28: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

28|P a g e

Affiliation Name E-mail

ARF Suzanne Carbotte,ColumbiaUniversity

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

R2Rhasdevelopedanetwork file system for storageof data anddocuments; a relationaldatabaseforstorageofassociatedmetadata;aWebportalforsearch,browse,anddownload;scriptedtoolsfordatacataloging,archiving,processing,andassessment;andasuiteofWebservices for interoperability. Most are built on existing open-source software such asPostgreSQL,ApacheHTTP/Tomcat,MapServer,etc.SelectedtoolsfordataprocessinghavebeenreleasedinthepublicdomainviaGitHub.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

R2Ruses commercialprovisioning in selected cases forWeb servicehosting (Linode.com),domainservices(Site5.com),anddeepstorage(AmazonGlacier).

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

1.R2R'snetworkfilesystemistheheartofitsdailyoperation,usedforbothinternalprocessingworkflowsandservingcontenttotheWeb.ThefilesystemisbuiltonasuiteofFibreChannelstoragearrays,switches,andLinuxservers.2.R2R's"NavManager"softwarepackageisusedroutinelytocreateasuiteofquality-controlledshiptracknavigationproducts,whicharereusedbydownstreamQAprocessesandWebservices.3.R2R's"LinkedData"serverdisseminatestheCruiseCataloginastandards-compliantformat,whichisharvestedbyothergeosciencedatarepositoriesaswellasbyglobalsearchindexessuchasGoogle.WhataspectsaboutthefacilityCIanditsoperationwouldyouliketoshareasbestpractices?Itisnotuncommontorevisitold(er)datapackages,inordertoextractadditionalinformationand/orrefinequalityassessment.Maintainingdatapackagesonspinningdiskfora5ormore-yearslidingwindowhasprovenadvantageous,andcanbesustainedusing(lessexpensive)HDDsratherthanSSDs.Everydigitalresourcepublishedonline(vessel,cruise,dataset,document,sample,person,

Page 29: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

29|P a g e

award,etc)shouldhaveagloballyuniquepersistentidentifier.Thisenablesinteroperabilitywithotherrepositories,reliablecitation,andlinkingtothescientificliterature.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Thevolumeofenvironmentalsensordatabeingproducedbymodernresearchvessels,isincreasingfasterthanthediskstoragecapacitythatcanbedeployedwithaffordableenterprise-gradelocalequipment.Commercialprovisioningprovidesanaffordablesolutionfordeepstorage,butnotforlocaldataprocessingoregress.AcademicprovisioningviasystemslikeXSEDEisdifficultbecausetheresourcesaredisjointedandconstantlyevolving,andcarrytheriskofabruptterminationwhenthegrantperiodends.Datatransferisalsohamperedbylocalcampusnetworkbandwidth.Whileprogresshasbeenmadetowardstandardization,theUS.academicfleetstillproducesdatainaveryheterogeneousmanner.Eachcruiseisunique.Significantmanpowerisstillrequiredtostayabreastofchangingdirectorystructuresandfileformats,andtorecoverfromoperatorerrors.

KeyRisks

Maintaininglocalserver,storage,andnetworkinfrastructureremainsanongoingchallenge,especially with the increased need to providemonitoring, metrics, and network security.Commercial provisioning shifts resources from a local to a remote location, but does noteliminatetheneedforasystemadministratoranddoesnotreducecosts.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

R2RstaffattendannualcommunitymeetingssuchasESIP,RDA,andRVTEC,tostayabreastofemergingtechnologies.Juniorstaffworkintandemwithseniorstaff,receivingon-the-jobtraining.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Theabilitytostoreandmovelargevolumesofdataasenvironmentalsensorscontinuetoevolvefasterthanstorage/networkresources;thelackof"smart"self-documentingsensors;andthelackofdesignatedlong-termarchivesforsomedatatypesremainsignificantchallenges.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 30: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

30|P a g e

Affiliation Name E-mail

NationalCenterforAtmosphericResearch(NCAR)

AaronAndersen,UCAR

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

AnumberofcomponentsoftheCIweredevelopedinhouse.Afewconcreteexamplesinclude:-ResearchDataArchiveservices-publicinterfacecanbefoundat:https://rda.ucar.edu/-ParallelPythontoolsforpostproductionofNetCDFfilesandspecificallyclimatedata:https://www2.cisl.ucar.edu/tdd/asap/parallel-python-tools-post-processing-climate-data-SystemAccountingManager(SAM)onHPCsystemshttps://www2.cisl.ucar.edu/user-support/systems-accounting-manager(currentlyNCARspecific)-VAPORistheVisualizationandAnalysisPlatformforOcean,Atmosphere,andSolarResearchers.VAPORprovidesaninteractive3Dvisualizationenvironmentthatcanalsoproduceanimationsandstillframeimageshttps.://www.vapor.ucar.edu/-NCARCommandLanguage-NCLisaninterpretedlanguagedesignedspecificallyforscientificdataanalysisandvisualization.AlltoolswereprimarilydevelopedwiththeneedsoftheAtmosphericsciencecommunityinmind.AllcomponentsareavailableforreuseexceptforSAM.SAMcouldbecustomizedandutilizedbyothersbutwouldrequiresomegeneralizationorsitespecificcustomization.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

AgoodnumberofexternalCIcapabilitiesand/orexternallydevelopedtoolsareinuseatNCARwithintheComputingandInformationSystemsLab(CISL)..Highlightsinclude:-NCARDataSharingService-GlobusToolkit-https://www.globus.org/-NCARalsoutilizesXDMoDaspartofthesuiteoftoolsusedtomanagetheHPCresources-http://open.xdmod.org/WithintheNCARWyomingsupercomputingcentertwocommercialpackagesareinusetocontrol,manageandmonitorthefacility.-ThecoreofthefacilityutilizesBuildingAutomation,hardware,softwareandsensorsfromJohnsonControlsInc.basedontheMetasysBuildingAutomationSystemhttp://www.johnsoncontrols.com/buildings/building-management/building-automation-systems-bas-MorerecentlyNCARhasdeployedanadvancedsystemtoallowhigherfidelitysamplingof

Page 31: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

31|P a g e

theelectricalinfrastructure.ThosecomponentswereprovidedbySchneiderElectricSoftwareLLC.undertheirWonderwarebrand.ThesetwocommercialpackageswerepurchasedutilizingaformalRFPprocessandwereevaluatedbyatechnicalteam,businessteamandpricingteam.Technicalrequirementsweredevelopedinpartnershipwithexternalengineeringfirms.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

ThethreemostusedCIcomponentsaretheHighPerformanceComputingsystems,HighPerformanceDiskStorage(GLADE)andthetapearchiveHPSS.TheHPCsystemsareregularlyseegreaterthan90%systemutilization.GLADEsimilarlyhasbeenexceptionallypopularprovidingcommonsharedspaceacrossHPC,dataanalysisandvisualizationplatforms.FinallytheHPSSbasedarchivesystemisstillthecornerstoneofdataarchivalatNCARandinsomerespectsistoopopular:-HPCsystemsutilizetestanddevelopmenthardwarethatismuchsmallerscalebutprovidescapabilitiestonotimpactproductionworkwhileupgrading,patchingoraddingnewtoolstotheuserenvironment.OncechangestothetestenvironmentsarestabletheteamscanthenupgradeorchangethelargeHPCenvironments.Herecomplexityandscaleprovidesignificantchallenges.-TheGLADEenvironmentistechnicallychallengingprovidingaverylarge(50PB)highperformanceInfiniBandstorageenvironment.Howeverthetechnicalchallengesareonlyonecomponentoftheenvironment,userretentionpoliciesandmanagementofquotasareequallyaschallenging.-HPSSpresentsamorefinancialchallenge.Historicalarchivalstoragepolicieswerepredicatedoncomputingbeingexpensivebutstoragebeingcheap.CurrentlythoseeconomicassumptionsarenolongervalidandCISLhasembarkedonmodificationstostoragepolicies.Thateffortistoonewbutmaybecomeabestpractice.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Weseehumancapitalaspossiblyoneofourmostchallengingareascurrently.ExpertiseinHPC,largedatastorageandITenvironmentsareinhighdemand.Weoftenfindrecruitingstaffachallengeespeciallywheresomeareaslikedataanalyticsanddatascienceareinsignificantdemandinthecommercialaswellasresearchsectors.Keepingpacewithsalariesinachallengingfederalenvironmentisprovingdifficult.ClosertothefacilityoperationlevelweareseeinghighlydynamicHPCenergyconsumptionbasedoncomputingworkloads.AllHPCvendorsareactivelypursuingpowersavingcapabilitiesallthewaydowntothechiplevel,turningdownclocksorcomponentsondemand.Overallthisisagoodthingascomputingsystemsofthepastwerenotoriously

Page 32: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

32|P a g e

wasteful.However,computingcomponentsthatturnupanddownoncomputingtimescales(subseconds)maynotbeamatchfortraditionalbuildingautomationsystemsormorebroadlyutilityproviders.Largechangesinelectricaldemandinfluencemechanicalcoolingsystemsaswellasthecapacityoftheutility.TheNWSChasahighlyenergyefficientdesignthatadaptstothedemandsoftheCIhousedinthefacility.

KeyRisks

Workforcedevelopment,recruitingandretentionareasignificantrisk.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

NCARhasanumberofeffortsunderwayasweseeworkforcedevelopmentascritical.TheNWSChasbeenutilizedasateachinglaboratorywith7summerinternsoverthelast5yearsworkingwithinthefacility.Withinthattimeframe,3womenand2minoritystudentshavebeenthroughthree-monthintensivesummerinternships.AllbuttwoofthosestudentshaveremainedinfieldsengagedwithlargeCI.CISlalsomanagestheSummerInternshipsinParallelComputationalScience(SIParCS).ThegoaloftheSIParCSprogramistomakealong-term,positiveimpactonthequalityanddiversityoftheworkforceneededtouseandoperate21stcenturysupercomputers.Graduatestudentsandundergraduatestudents(whohavecompletedtheirsophomoreyearbysummer2017)gainsignificanthands-onexperienceinhigh-performancecomputingandrelatedfieldsthatuseHPCforscientificdiscoveryandmodeling.MorerecentlytheOperationsManagerattheNWSChasbeenengagedaspartofthestateofWyomingWorkforceDevelopmentCouncil.Wyominginparticularislookingtodevelopgreaterinroadsspecifictolargecomputingfacilitieswithmoretraditionaltrades,communitycollegesandnon-traditionalstudents.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

SpecifictomodelingandsimulationweseeahighlydisruptiveCIenvironmentwithsignificantcomputing architecture diversity on the horizon and new clear winners. Heterogeneouscomputing architectures are now commonplace but the complexity and scale remainchallenging.Thereisalsoanexplosionofdataanddataresourcesthathaslongbeenpromisedbutwearestartingtoseewithgreaterclarity.Newmethodssuchasmachinelearningoffersomepromisebuttherearemanypathsandoptions.NCARcertainlydoesn'thavethecapabilitytoexploreallpossiblepathsandwillneedtopartneracrossmanydisciplinestofindanswers.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 33: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

33|P a g e

Affiliation Name E-mail

IncorporatedResearchInstitutionsforSeismology(IRIS)

Tim Ahern, University ofWashington

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

Mostcomponentshavebeendevelopedinhouseoverthe30yearslifeoftheDMC.Ofcoursecommercial andopen source software systems are usedwhen appropriate such asDBMSsoftware.Muchofourinfrastructureissomewhatdomainspecificsuchasreceptionofrealtimedataandtoolsthatworkwithdomainspecificdata.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

We use commercial software for virtualization (VmWare), PostgreSql for DBMS software,commercial geolocation software. All external tools were acquired using IRIS purchasingguidelines,multiplebidsetc.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

1)Webservices,methodstoabstract timeseriesandmetadataaccessboth internallyandexternally2)storageRAIDindexingschemetoimproveaccesstocommodityRAID3)Synchronizationofdataversionsacrossmultiplestoragesystems(1primaryand1secondaryateachoftheDMCandtheADC)

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Scalability.Access toseismologicaldatacanbeepisodicespeciallyafterearthquakes. Alsocertain preprocessing services can exceed our internal capabilities. The promise of cloudresourceshaspotentialbutnotyetrealized.

KeyRisks

Page 34: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

34|P a g e

Lossofkeypersonnelandtheirknowledge.NSFbudgetsaremakingfacilitieslikeourmoreandmorevulnerable.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

BothNSFandcommerciallysponsoredtrainingcourses.Weparticipateastimeandfinancialresourcesallow

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Reducingthecosttomaintainourinfrastructureandfindingexternalresourcesperhapscloud,thatcanmeetourdemandsandfitourwayofdoingbusinessnottheirs.

Doyouhaveanyothersuggestionsfortheworkshop?

Nothingatthistime,notabletospendmuchtimeonthis.....

Page 35: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

35|P a g e

Affiliation Name E-mail

UNAVCO FranBoler,UNAVCO [email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

EssentiallyallcomponentsofUNAVCO’sCIhavebeendevelopedinhouse.ThisincludesdatahandlingfordataarrivingatUNAVCOfrommultiplevarietiesfieldinstrumentationandfromavarietyofproviders,archiving,anddistributionfunctions.MostoftheCIthataidsindatahandling is not available for reuse since it is highly customized.An exception is theGNSSpreprocessing software tool called “teqc”, which is widely shared with the community.SelectedCIcomponentshavebeendevelopedinpartnershipwithotherinstitutionsandaresharedwiththemincludingSARwebservicesdevelopedviatheNASASSARAprojectissharedwith the Alaska Satellite Facility; and the Geodesy Seamless Archive Centers open sourcesoftware was developed with NASA ACCESS support by UNAVCO with UCSD and NASA’sCrustalDynamicsDataInformationSystems.GSACiswidelyshared.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

CertainproprietarysoftwareprovidedbysensormanufacturersforhandlingrawdataarepartofUNAVCO’sCI.Theseareprescribedwhenamanufacturerisselectedasasensorprovider.MuchofUNAVCO’sSARdatahandlinginfrastructureiscurrentlybeingmigratedtotheXSEDEcloud.Commercialcloudstorageisemployedasoneofourbackupstrategies.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

Thedatasystemsthatweoperate(softwareandhardware)thatreceive,handleanddeliverGNSSdatatoourexternalcustomerbasehavethelargestuserbaseandareused24/7.Wehavebeen“saved”manytimesoverbyhavingfailoversystemsatthereadyfortheinevitablehiccupsinsystems.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Agapislackofadequateresourcestokeepsoftwareandtoalesserextenthardwareuptodate.Functionalityisregularlyaddedthroughtimeasnewcomponentsoftwaresystems,andthis functionality is developed with technologies reflecting the era during which it was

Page 36: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

36|P a g e

developed,withsomeattempttoseeintothefuture;thesecomponentstendtoremainpartof operational infrastructure (we call them legacy components, but they are still key toaccomplishing our tasks). All along the way technical debt is incurred, and of coursetechnologymovesahead.Thisisafurtherchallengetomovingcapabilitiestothecloud.Wearetryingtoslowlyandonatrialbasismovecomponentstothecloud.Legacycomponentsareafurtherriskas itbecomesincreasinglydifficulttofindprogrammerswithappropriateskillsetstomaintainthem.Thepriorityisalmostnevertorebuildtheseoldersystemsaslongastheycontinuetooperate.AnotherchallengeisthewidevarietyoftechnologiesinuseintheEarthSciencestomeetCIneedsofvariousdomains.Tryingtocoverallbases isnearlyimpossible;tryingtoidentifywhichtechnologieswillemergeasmostusefulisachallengeforall.TheEarthCubeinitiativeisclearlyexposing/highlightingthis.

KeyRisks

Keyrisksarerelatedtothetechnicaldebtdescribedinaprevioussection.Anotherkeyriskislooming retirement of staff members with decades of domain knowledge and in-depthknowledgeofourCIcomponents.Further,thereisstrongcompetitioninourgeographicareaforskilledCIworkers.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Wesendstaffmemberstotraining.Weengageinterns.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Makinguseof thecloud (withappropriate returnon investment).Continuing to trackandidentify trends in technologies and being able to respond nimbly.Managing functionalitydemandsunderresourceconstraints.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 37: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

37|P a g e

Affiliation Name E-mail

IceCube Gonzalo Merino,University of WisconsinMadison

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

1)Datamanagementsoftware,handlingdataarchive,transferfromthesouthpoleandreplicationtolongtermarchives.2)Softwareframeworktomanagedistributedworkloads.UsedtomanageandbookkeepalltheIceCubesimulationproduction.Inbothcases,otherscoulduse,butthisdoesnothappenyet.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

1)SouthPolebroadbandsatellitesSPTR,DSCSandSkynet.ProvidedbyNASA,throughUSAP.ThisistheonlyavailableservicefordailybulkdatatransferfromtheSouthPole.~100Gbytes/day.2)Tapestorageforlongtermdataarchive.ProvidedbycollaboratinginstitutionsNERSCandDESY-Zeuthen.Theseinstitutionsalreadyoperatelargescaleautomatedtapefacilitiesforseveralexperiments.Theserviceisofferedasin-kindcontributiontotheCollaboration.3)OpenScienceGrid.ProvidingaccesstomillionsofCPUhoursinopportunisticresources.Also,operatingcoreGridservicesthatprovideusaccesstoIceCubecollaboratingsitesinEuropeandCanada.WehavebeenparticipatinginOSGforseveralyears.Distributedcomputing,andinparticularopportunisticcomputing,representsabigadvantageinourfieldwherealotofthedataprocessingandanalysisispleasantlyparallel.4)XSEDE.PartoftheIceCubesimulationchainreliesonGPUs.WestartedrequestingallocationsinGPU-capableXSEDEresourcesin2016toenlargethecomputingcapacityavailableforIceCubeandincreasetheanalysispotential.5)Globusdatatransferservice(globus.org).Convenientdatatransferserviceusedtoschedule/steerdatatransfersfromUW-Madisontoarchivelocations:NERSCandDESY-Zeuthen.Selectedbecauseitprovidedtheneededfunctionality(integrity,retries,etc)currentlyatnocost.Also,interestedinongoingdevelopmentstointerfacemoreefficientlytheHPSStapesystematNERSCwithGlobus(fileintegrity,performance).

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

Page 38: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

38|P a g e

1)MaindataprocessingclusteratUW-Madison.LargeCPUandGPUclustercoupledtoamulti-petabytefilesystem(Lustre)usedby~300researcherstoanalyzetheIceCubedata.Themostchallengingparttooperateisthestorage,includingmonitoring,accounting,etc.However,operatingourownLustreclusterseemstostillbethemostcosteffectivesolutionforoursize(~6Petabytesofdisk).2)User-friendlyscalable/elasticcomputinginfrastructure:OSGandHTCondorhaveprovidedgreatcapabilitiessofarinthisfront.However,westillseealotofroomforimprovementintheuserexperience:higherefficiency,easeofuse,interfacetocloudresources,etc.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Everytimewehavebeenabletoleverageexisting3rdpartyservicestobuildourinfrastructurearoundthem,wehaveseenbenefits indoingthat.Fromlargearchivestoragefacilities, todatatransferservices,toworkloadmanagementservices,ourlessonlearntisthatitseemsworthforustoinvestonhavingasolidinterfacewithexistingservicesratherthantryingtoreplicatethem,orreinventthewheel.

KeyRisks

Withtheuseofexternalservices,therecomesdependenciesandrisk.Mitigationstrategiesarethereforeanimportanttopic.Inourcase,severaloftheseexternalservicesarecomingfrom the academic ecosystem, so some coordination inside or between agencies couldaddresspartoftherisk.Partofitwouldbeensuringthatthosecommonservicesthatmanyresearchersdependon,aresustainable.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Assistingtovariousworkshopsandconferences inthefield:NSFcyberinfrastructure,OpenScienceGrid,NationalDataService...

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Understanding how to best adapt IceCube analysis code to new emerging computingarchitecturesandsoftwareframeworkssuchasmanycore,GPU,FPGA,machinelearninganddataanalyticsframeworks,etcandengagetheworkforcewiththerequiredskillsthatweneedtomakethishappen.Hiringandretainingthispersonnelisgettingincreasinglydifficultaswecompetehead-onwiththeITprivateindustry.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 39: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

39|P a g e

Affiliation Name E-mail

NSCL Andreas Stolz, MichiganStateUniversity

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

Dataacquisitionandanalysissoftwareframework(NSCLDAQ/SpecTcl/DDAS),availabletoothers.Controlssoftware(EPICS)development,availabletoothers.Businessprocesssoftware;customandcustomizedapplications.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

Dataacquisition(DAQ)andexperimentaldataanalysisonLinuxbasedinfrastructure.CommodityPCs/Servers.StorageusingcommodityhardwareandZFS/Linux.Thisiswidelyused,freelyavailablesoftwareandlowcost.DAQisdevelopedin-house.Analysisapplicationsaretypicalfreelyavailablephysicsapplications(GEANT,ROOT,etc.)Businessprocess:ERP(IFSsoftware),Sharepointworkflowsanddocumentmanagement.Engineeringsoftware?Solidworksetc.Networking/Internet–externalaccessprovidedbyMSU

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

Infrastructure–virtualization:Normalforenterpriseinfrastructure,butdoesrequireexpertiseforsupport.Sharepoint:Usedforbusinessprocesses,collaborationetc.Againrequiringdeveloperandadministratorexpertise.Security:Networkandsystemssecurityincludingtechnicalcontrolsthemselvesandtheworkloadaroundmaintaininganddocumentingsame.Adoptingconfigurationmanagementtoolsandtestingdeploymentprocesses.Systemconfiguration–maintainingstableoperationsalongwithongoingsoftwarechangesandsecurityupdates.

Page 40: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

40|P a g e

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Securityisongoingchallenge.

KeyRisks

Mainrisksaresimilartoanyenterprise:securityanddisasterrecovery.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Participatinginrelevantworkshops.CISecuritytrainingforallusers.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Providingincreaseddataaccesstooutsidevisitorsandexperimentersinfaceofincreasingdatasetsizesandsecurityrestrictions.FutureDAQsystemsforFRIBexperiments.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 41: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

41|P a g e

Affiliation Name E-mail

InternationalOceanDiscoveryProgram(IODP)

Jim Rosser, Texas A&MUniversity

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

SeveralCIcomponentsaredevelopedandmaintainedin-house:instrumenthostdatauploaders,webservices,webscienceapplications,databases,businessapplications(procurement,inventory,crewtracking).Yes,theseareavailabletoothersforreuse,but,inmostcases,wouldrequireextensiveeffort.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

OurapproachistofocusonJRSOcorecompetenciesandleveragecommodityservicesfromotherorganizationswhenpossible.Forexample,TexasA&MUniversityprovidesmanysharedservicesthatweusetosupportJRSOoperations,includingemail;directoryservices;storageservices;webconferencing;videostreaming;softwaretraining;cloudstorage;financial,travelandHRmanagementsystems;cybersecurityassessmenttools;softwareprocurement;projectmanagementassistance,etc.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

1.WAN(includingVSAT)operationsandsupport.SustaininghighlyavailableWANservicesisquitechallengingwhentheresearchvessel(JR)operatesglobally.2.OracleODAs.OracleODAssignificantlyincreasedJRSOdatabaseengineperformance.However,therehasbeenasteeplearningcurveforconfiguringandmaintainingthiscapability.3.Cybersecurity.MinimizingsecurityriskwhilesupportinginternationalcustomerswhobringmanydifferentpersonaldevicesonboardtheJRandexpectassuredaccesstotheship'sportfolioofsciencelabservices(e.g.,LAN,serverstorage,applicationanddatabaseservices).

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Page 42: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

42|P a g e

MinimizingsecurityriskwhilesupportinginternationalcustomerswhobringmanydifferentpersonaldevicesonboardtheJRandexpectassuredaccesstotheship'sportfolioofsciencelabservices(e.g.,LAN,serverstorage,applicationanddatabaseservices).

KeyRisks

Commerciallyavailabletoolsareincreasinglycloud-based(e.g.,AdobeCreativeSuite,macOSapps,etc.).OurmeagercommunicationbandwidthsupportingtheJRrulesthoseout.Yet,manysoftwarepublishersprovidenoalternative.Thisissueisprobablyuniquetofacilitiesoperatinginlowbandwidth,highlatencyenvironments,andprobablyalsoappliestoorganizations,suchasDoD,thatoperateisolatednetworks(SIPRNet,JWICS,etc).Thisisagrowingproblemthatcontinuestochallengeus.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Technologyspecifictrainingforallaspectsofinfrastructure,softwaredevelopmentanddatamanagement.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

BetterWANlinkfortheJR.Adoptionofautomation/configurationmanagementtools,suchasChef,Ansible,Salt,etc.Makingdatamorediscoverable.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 43: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

43|P a g e

Affiliation Name E-mail

CHESS Werner Sun, CornellUniversity

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

Ourhigh-availabilityclustersandComputeFarmweredevelopedusingcommodityhardwareandopen-sourcesoftware,assembledandconfiguredin-housetomeettherequirementsofourfacility.Theseconfigurationscouldbesharedwithotherfacilities.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

WeprovideCHESSuserswithremotedatadownloadcapabilitiesusingGlobus.WeselectedthistoolforitsexcellentperformanceandbecauseofitswidespreadadoptionintheNSFLargeFacilitycommunity.

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

High-availabilityLinuxserverclustersformthebackboneofourCI.Weusethemforourcentralfilesystems,coreinfrastructureservices,webanddatabaseservers,andhardwarecontrolsystems.Incommissioningtheseclusters,wegainedexperiencewithselectingfreeandopen-sourcesoftwareandcommodityhardwaresolutionswithoutsacrificingreliabilityandperformance.TheCHESSdataacquisitionsystemisacentralrepositorythatreceivesrawdatafrommultipleinputstreamsandprovidesaccessforofflineanalysisandprocessing.Wedevelopedbackup,archive,androtationprocedurestoensurediskaccesstotworun-cycles'worthofdataandtaperetrievalforallpreviousdata.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Page 44: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

44|P a g e

Wewouldbeinterestedinlearningaboutmethodsforprovisioningtemporaryaccountsandimplementingfine-grainedauthorizationforCHESSusers.

KeyRisks

Wefacean increasinglychallengingcybersecuritythreat landscape.Wearealwaysseekingwaystobalancesecuringourfacilitycontrolsystemswhilemaintainingusability,access,andproductivity.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Onlinetutorials,managerialandtechnicaltrainings.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

UpgradestothescientificcapabilitiesoftheCHESSfacilitywillresultinincreaseddatathroughputandvolumes,whichwilleventuallyexhaustasinglesystem'sabilitytobothserveasthedatastoreandtheaccesspoint.Wemayneedmultipleingressandseparateanalysissystems.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 45: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

45|P a g e

Affiliation Name E-mail

PSC/CMU

JamesA.Marsteller

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

KeyRisks

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Doyouhaveanyothersuggestionsfortheworkshop?

Page 46: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

46|P a g e

Page 47: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

47|P a g e

Affiliation Name E-mail

NationalRadioAstronomyObservatory(NRAO)

BrianGlendenning,NRAO

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

100%(basedonopensourcesoftware),yes

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

AmazonAWS(modest),NSFXSEDE(experimental);Convenience/capability(AWS),cost(XSEDE)

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

1.TheCASAdatareductionpackageisalarge(2MSLOC)packagebothusedforinternaloperationsuseanddownloadedbyfacilityusers(2kdownloadsperyear).2.Our"pipelines"embedexpertknowledgeinapythonscriptingframeworkforautomatedscienceproduction.3.Ourcomputinginfrastructurehasmultiple"archive"storageclusters,withattachedLustreandcomputationalclustersfordataprocessing.Wehavetotakethelongview-wehaveusabledatafrom40yearsago,oursoftwarepackageslivefordecades.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Keepingsoftwarepackagesreasonablyhigh-performanceoverdecadesisanissueforus.

KeyRisks

Page 48: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

48|P a g e

DurableagreementswithHPCfacilities,IaaSresearchclouds,Internationalcompatibilitywithuserauthenticationmechanismsetc.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Ph.D.student/Post-docengagementwithwritingresearchcodes.Summer/co-opstudents.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

Seefinalbulletpointsinwhitepaper.

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 49: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

49|P a g e

Affiliation Name E-mail

Ocean Networks Canada

Benoit Pirenne

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

The Oceans 2.0 was entirely developed in house, starting in 2005. The code is not in the public domain owing to the decision made by ONC to pursue commercial applications of the system.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

External tools include standard tools such as OS (Linux), Java, Javascript and attendant libraries; Oracle as an RDMS, Cassandra for non-relational data... ERDDAP was integrated to provide standard access to specific data types. Jira for supporting all aspect of the development, including time sheets and billing on a per project basis Confluence for internal and external documentation

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

Until recently, the challenging elements included: - Cassandra: performance issues with the tool and the complexity of the fine-tuning required , Java memory allocation issues, difficulty with profiling complex code to understand where memory and time are actually spent, despite having an advanced test environment

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

Continuously evolving the technology and the services available and getting the continued funding for the required manpower. Providing easy to use data discovery interfaces that will be addressing user needs in the face of growing instrumentation, observing locations and expanding time

KeyRisks

Page 50: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

50|P a g e

Risksinclude-maintainingtheleveloffundingtoenablecontinuousimprovementstothefacility:aCIisneverover!Mitigationrequiresmakingmanagementandfundingagenciesunderstandthat.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

We have had large fractions of the team of 20+ software engineers attend classes in: - the Agile Scrum methodology - usability - Kaisen

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

- As the facility continues to grow, a continuous emphasis on verification of our scalability, and possible adaptation will be necessary. - The support of multiple clients, re-organizing into a multi-project based entity - Need to support critical customers (e..g, Public Safety) with defined SLAs

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 51: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

51|P a g e

Affiliation Name E-mail

Oregon State University, College of Earth, Ocean,

and Atmospheric Sciences, Regional Class Research

Vessel Program

Christopher Romsos

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

The most significant CI component built in-house is our "datapresence" system. In a nutshell, the datapresence system captures and archives data from resident (or visiting) sensors, replicates the information shoreside, and presents the information to both the shipboard and shoreside science parties for use/consumption. The datapresence system includes functionality for data quality assessment, flagging, alert and user notification. Other CI components developed in-house include several databases for project management including a risk-register database application. Yes, these components are available for others to use.

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

There is a high likelihood that the most if not all RCRVs shall be provisioned with satellite service through HiSeasNet at UCSD (https://hiseasnet.ucsd.edu/), though some UNOLS ships are experimenting with going out and negotiating their own contracts for satellite service opting (out of the HighSeasNet program in areas where better deals can be struck such as the Gulf of Mexico). We, the RCRV datapresence developers, are currently formalizing an MOU with Leidos Antarctic Support contractors to share components of our acquisition and visualization code. Part of this process includes choosing an open source license under which to distribute software. Lastly, we've incorporated data and map services (hosted locally aboard the ship) from the Marine Geoscience Datasystem at Lamont-Doherty Earth Observatory (LDEO) into our real-time displays for scientific situational awareness. Specifically, the Global Multi-Resolution Topography Data Synthesis provides our base layer for the map interface http://www.marine-geo.org/portals/gmrt/ Other sources of thematic background information for this interface are provided by NOAA Fisheries, Office of Coast Survey, USGS, and various academic sources.

Page 52: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

52|P a g e

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

1) Ship to shore (and back) data replication over high latency, low bandwitdh satellite networks. This problem, akin to the Long Fat Network problem of high bandwidth-delay product, is the most challenging issue that we are working on. We've had good success in increasing our throughput by optimizing the TCP window and buffer sizes and are now looking at managed WAN optimizatoin solutions to provide this service. 2) Cybersecurity is another challenge for the project. The RCRVs shall be equipped with integrated monitoring control systems to cover everything from bridge to engine room systems. Securing these online systems is a priority and a challenge.

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

At this project phase (construction) we don't yet have lessons learned to share.

KeyRisks

Key risks include security and expertise. As indicated the RCRVs shall present a significant CI advancement from current. To mitigate each of these risks we have an operations plan that includes support and oversight (budget and personnel) from a Class Management Office. However, the level of expertise for the technical support personnel (Marine Technicians) that sail with the ships will have to rise. Evidence to support this expertise risk can be gleaned from organizations that have recently taken operations responsibility for new research vessels.

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

Ah, a perfect follow-up question. A key component of our operations plan during transition to operations and post-delivery under Class Management will be technology transfer and training for new operators. We expect much of this initial ' workforce development' to take the form of hands on work during transition but additional training will be made possible through the Class Management Office during operations. In addition to periodic training we have staff that shall travel to each vessel on a rotating schedule (multiple visits per year) to inspect sensor systems, perform calibrations and maintenance, as well as conduct specific training while on a site visit.

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

BYOD IoT sensors - We must keep abreast of security and integration issues these devices present. On-Prem IaaS and PaaS - These industry trends or options are attractive but difficult to implement under the current model of support and operations (see expertise risk above). Cybersecurity - Particularly as it applies to on-board integrated monitoring and control systems.

Page 53: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

53|P a g e

Doyouhaveanyothersuggestionsfortheworkshop?

Notatthistime

Page 54: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

54|P a g e

Affiliation Name E-mail

Florida International University

Julio Ibarra

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

N/A

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

N/A

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

N/A

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

N/A

KeyRisks

N/A

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

NA/

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

N/A

Page 55: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

55|P a g e

Doyouhaveanyothersuggestionsfortheworkshop?

N/A

Page 56: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

56|P a g e

Affiliation Name E-mail

2-Dimensional Crystal Consortium, Pennsylvania State University

Yuanxi Wang

[email protected]

What percentage of the facility CI was developed in-house versus by reusing existingsolutions?

N/A

WhatexternalCIcapabilitiesandservicesand/orexternallydevelopedtools(ifany)doesthefacilityuseandwhoprovidesthem?Howwerethesetoolsidentifiedandwhatcriteriawasusedtoselectthetools?

N/A

Listupto3ofyourmostandleastfavoriteCIcomponentswitha1sentenceexplanationforeach.Whataspectsabout the facilityCI and itsoperationwouldyou like to shareasbestpractices?

N/A

WhataspectsofthefacilityCIanditsoperationdoyouseeaschallenges/gaps?Arethereanypitfalls/mistakes you would like to share? What aspects would you be interested inoutsourcing?

N/A

KeyRisks

N/A

WhatCI-relatedworkforcedevelopmentactivitiesdoesyourfacilitiesengagein?

N/A

WhatdoyouseeasyourkeynewCIrequirementsandchallengesinthenext5-10years

N/A

Doyouhaveanyothersuggestionsfortheworkshop?

Page 57: Affiliation Name E-mail UCAR/Unidata Mohan Ramamurthy ... · UCAR/Unidata Mohan Ramamurthy mohan@ucar.edu What percentage of the facility CI was developed in-house versus by reusing

57|P a g e

N/A