Upload
others
View
4
Download
0
Embed Size (px)
Citation preview
D. Ciangottini - DOMA Access - INFN cache federation for CMS
INFN regional cache federation for CMS workflows
DOMA Access Meeting4/12/2018
Diego Ciangottinion behalf of CMS Collaboration and INFN-Cache team
1
D. Ciangottini - DOMA Access - INFN cache federation for CMS
Outline
â—Ź Introductionâ—‹ cache on ephemeral storage for opportunistic resourcesâ—‹ INFN geo-distributed cache with unmanaged storage
â—Ź Tests and resultsâ—Ź Ongoing activitiesâ—Ź Plans
2
D. Ciangottini - DOMA Access - INFN cache federation for CMS
In synergy with the activity on smart caches for eXtreme DataCloud
Introduction
â—Ź Integration and functionality scouting of XCache @INFN for CMS workflowsâ—Ź Two main scenarios of application:
â—‹ Automatic procedures for the creation of an XCache cluster on-demandâ– ready-to-use recipe for both bare metal and cloud environment/orchestration
● complete high level infrastructure description → QoS configuration■mitigate latency for dynamic resources (cloud and HPC)
○ Creation of an INFN (regional) cache federation■enabling “storage-less” T2s and efficient scale out over opportunistic resources ■propaedeutic to distributed cache layer for data-lake environment
3
D. Ciangottini - DOMA Access - INFN cache federation for CMS
XCache for dynamic resource provisioning
Computing resources are opportunistically deployed on cloud/HPC resources
â—Ź storage not necessarily availableâ—‹ remote read latencyâ—‹ I/O inefficient
The cache introduction may offer:â—Ź ephemeral storage for hot data near the
computing providerâ—Ź optimized wan access, only for data not
already on the cache
Cache resources
HPC
Scenario 1
4
D. Ciangottini - DOMA Access - INFN cache federation for CMS
INFN regional XCache layer
Few world-wide custodial centers with data replica managed by the experiment
â—Ź Computing Tiers access data directly from closest custodial center
Using cache for a Content Delivery Network approach:
â—Ź geo-distributed network of unmanaged storages
â—Ź common namespace (no data replication)â—Ź request mitigation to custodial sites
Custodial data
Regional cache
HPC
Tier2
Tier3
Scenario 2
5
D. Ciangottini - DOMA Access - INFN cache federation for CMS
Cache for opportunistic resources
Cache Redirector
Disk proxy cache
Disk proxy cache
Sto
rage
fede
ratio
n
xrootd proxy cache
RAM Disk
WN
STORAGE
STORAGE
STORAGE
STORAGE
Opportunistic resources
Remote CMS AAA Federation
In case of computing on opportunistic resources the remote data access pattern can be improved providing:
â—Ź an on-demand cache layer near cpu resources (same cloud provider) STORAGE
Scenario 1
Automated deployment through:â—Ź Ansibleâ—Ź K8s (soon also in helm)â—Ź Mesos/Marathon
6
D. Ciangottini - DOMA Access - INFN cache federation for CMS
Testing with CMS workflows
â—Ź Real CMS analysis workflows on cloud resources (2 volunteer users)
â—‹ 2k jobs @OpenTelekomCloud (OTC)â—‹ ~150k of users jobs completed reading from
standalone cache cluster deployed at OTC
â—Ź DODAS (*) have been used for:â– same configuration for setup on different
cloud providers â– automated deployment through:
â—Ź Ansible for infrastructureâ—Ź K8s or Mesos/Marathon for container
orchestration
Cloud resource provider
Opportunistic CMS startd Service
Opportunistic Cache Service
Redirector
XcacheXcache Xcache
WNWN WN
Opportunistic Storage Service
Ceph/HDFS/IOVolumes/?
WLCG XRootD
Federation
(*) https://dodas-ts.github.io/dodas-doc/Scenario 1
7
D. Ciangottini - DOMA Access - INFN cache federation for CMS 8
Cache performance: Net Out - Net In
2.8Gbps outbound network
2.8Gbps network inbound
8
Cache Network IN
Not negligible gain with significant spike of direct cache disk/memory usage
Cache Network OUT
D. Ciangottini - DOMA Access - INFN cache federation for CMS
Success rates
9
Two categories of tasks by input data latency:
â—Ź 200-300ms latency links
â—Ź <100ms latency links
~100% failed for remote reading at first try (cold cache)~80% succeeded at second CRAB retry (warm cache)
No significant discrepancy w.r.t. results with standard CRAB jobs
D. Ciangottini - DOMA Access - INFN cache federation for CMS
CPU efficiency overhead
10
No cache overhead observed
Local read reference
Cache hit - Avg CPU efficiency
No cache overhead observedâ—Ź local-like performances when a cache hit occursâ—‹ No cache overhead observed
with this workflow samplesâ—‹ slight improvement due to good
performances of the low latency storage flavor provided
D. Ciangottini - DOMA Access - INFN cache federation for CMS
Distributed testbed deployment
â—Ź Deployment a geo-distributed cache:â—‹ Clients contact the cache redirectorâ—‹ Redirector steers client to
â– the cache that actually have file on diskâ– If no cache has the requested file, a round
robin selection of cache server is used
â—Ź Network of unmanaged storages for hot data
XCacheT2_IT_Bari
XCacheCNAF
WLCG XRootD
Federation
Clients
Cache redirector
Scenario 2
11
XCacheT2_IT_Legnaro
D. Ciangottini - DOMA Access - INFN cache federation for CMS 12
Distributed testbed deployment: testbed
Current functional test setup:
â—Ź CNAF XCache redirector federating 3 servers:â—‹ CNAF XCache server (5TB)â—‹ T2 Bari XCache server (10TB)â—‹ T2 Legnaro XCache server (22TB)
â—Ź Redirecting part of the CMS analysis workflows to contact National redirectorâ—‹ based on dataset name requested
â—Ź Tier2 at Pisa is joining the testbed
12
Scenario 2
D. Ciangottini - DOMA Access - INFN cache federation for CMS 1313
INFN XCache: functional checks 1/2
● Test tasks submitted to T2_IT_Bari with empty cache● Comparing jobs running at Bari (pointing to cache) with “Ignore locality” ones on
other sites
Bari → Cache
Pisa → No-Cache
Legnaro → No-Cache
Avg Job CPU Eff.
No penalty in CPU eff in case of empty cache
Performances of jobs reading from empty cache is comparable with remote reading.
Scenario 2 LOW LATENCY LINKS
D. Ciangottini - DOMA Access - INFN cache federation for CMS
INFN XCache: functional checks 2/2
14
Analysis task with XCache @INFN
Input dataset ZeroBias-RunB2017/AODSIM - task with 300 jobsStarting from empty cache vs run with remote read on T2_IT_*
Avg CPU eff
80%60%
35%Same task with remote read
Latency hiding + same file reused in different jobs
Scenario 2 HIGH LATENCY LINKS
D. Ciangottini - DOMA Access - INFN cache federation for CMS
Work in progress: cache simulation
â—Ź Starting from regional CMS job statistics of the latest 2-3 monthsâ—‹ time series of CMS jobs @ItalianTiers2â—‹ LRU cache eviction simulation
â—Ź Regional cache effect evaluation varying:â—‹ cache sizeâ—‹ Datasets to be cached
â—Ź Metrics:â—‹ hit rate vs cache sizeâ—‹ cpu hrs miss vs hit per cache size
Base ground for decision on contributions on the optimization of cache algorithms and cache population (e.g. different cache algos w.r.t. LRU and inter cluster load balancing - round robin so far-)
15
N.B. at a beginning phase: hoping for inputs and coordination to avoid work duplication
D. Ciangottini - DOMA Access - INFN cache federation for CMS
Work in progress: AI investigation
â—Ź starting from data-popularity ML studies already performed (CMS, ATLAS, ALICE)â—Ź tuning to adapt in the context of regional caches
â—‹ access time informationâ—‹ per region studies
● two main algorithms subgroups:○ put data ahead → simulate smart data fetching effect○ optimize what stays on disk → simulate smart eviction effect
Possible evolution to studies on client-cache routing based on real-time metrics (networks, recent failures etc..)
16
N.B. at a beginning phase: hoping for inputs and coordination to avoid work duplication
Activity synergy with a PhD thesis in Perugia
D. Ciangottini - DOMA Access - INFN cache federation for CMS
Conclusions
1717
â—Ź Working on 2 scenarios: â—‹ cache for dynamic resourcesâ—‹ regional cache layer for a future data-lake model
â—Ź Performance evaluation motivates further activitiesâ—‹ on-demand deployment â—‹ latency hiding and local-like performances on hits
â—Ź Work in progress:
Production-like regional deployment
Cache simulation with historical dataAI-based caches management
External decision agents interface
D. Ciangottini - DOMA Access - INFN cache federation for CMS
BACKUP
18