FAX status. Overview: Status of endpoints and redirectors; Monitoring; Failover; Overflow
- Slide 1
- FAX status
- Slide 2
- Overview: Status of endpoints and redirectors; Monitoring; Failover; Overflow
- Slide 3
- Endpoints: status on Sat. 15 Nov. Got one more site: RO-07-NIPNE.
Problems: we are working on CSCS; not working at all: Nikhef;
flip-flopping: FZK-LCG2 and NDGF-T1.
- Slide 4
- Direct access: expired cert; wrong config; test jobs were unable
to get a proxy.
- Slide 5
- Upstream redirection
- Slide 6
- Downstream redirection: redirectors moved to AI machines
- Slide 7
- Moving redirectors: Herve had to move all the EU redirectors to
the Agile Infrastructure, simultaneously upgrading to xrootd 4.0.4.
Started with the DE redirector; had to re-implement access rules.
Continued with two redirectors per day, but the old machines got
re-introduced, which confused everybody. A new set of changes is
being applied right now. The situation is now clear, but sites need
to restart their services as the IPs changed.
- Slide 8
- Monitoring: the machine receiving info from AMQ and feeding it to
the SSB etc. had to move to the Agile Infrastructure. This took much
more time than expected, but it is done now. EU sites were moving to
sending monitoring data to CERN. The current state can be seen here
(thanks to Igor Pelevanyuk):
http://dashb-xrootd-comp.cern.ch/cosmic/ATLASmigrationMonitoring/
A lot of effort is still needed to make summary and detailed
monitoring match:
http://dashb-ai-621.cern.ch/cosmic/DB_ML_Comparator/
Started a deeper analysis of PanDA job info data transported into
Hadoop at CERN. Further improvements in the SSB.
- Slide 9
- Cost matrix
- Slide 10
- Overflow: slowly expanding. BNL is still missing, even though the
reverse proxy hardware is there. ANALY_AGLT2_SL6, ANALY_INFN-T1,
ANALY_CONNECT, ANALY_IN2P3-CC, ANALY_BU_ATLAS, ANALY_MPPMU,
ANALY_MWT2_SL6, ANALY_DESY-HH, ANALY_OU_OCHEP, ANALY_QMUL_SL6,
ANALY_SLAC, ANALY_SFU. Can't use data from the rest of the EU cloud.
- Slide 11
- Snakey overflow plots - success
- Slide 12
- Snakey overflow plots - failures
- Slide 13
- Overflow - workload
- Slide 14
- Overflow workload
- Slide 15
- Overflow job efficiency
- Slide 16
- Slide 17
- Overflow CPU efficiency
- Slide 18
- Reactions: up to now only two sites have noticed the overflows.
TRIUMF: JEDI sent a lot of jobs to almost all US cloud sites, all
reading from TRIUMF, which saturated their proxy (1 Gb/s); they have
since upgraded it to 2 Gb/s. QMUL: Chris Walker noticed 5 Gbps+ at
their NAT gateway, ~10 TB/day. Not a problem for now.
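The rates quoted above can be cross-checked with a quick unit conversion. This is a minimal sketch (not from the slides), assuming decimal units (1 Gb = 10^9 bits, 1 TB = 10^12 bytes) and a 24-hour day; the function names are illustrative.

```python
def gbps_to_tb_per_day(gbps: float) -> float:
    """Daily volume moved by a link kept busy at `gbps` gigabits/s."""
    bits_per_day = gbps * 1e9 * 86400  # 86400 seconds in a day
    return bits_per_day / 8 / 1e12    # bits -> bytes -> terabytes

def tb_per_day_to_gbps(tb: float) -> float:
    """Average rate needed to move `tb` terabytes in 24 hours."""
    return tb * 1e12 * 8 / 86400 / 1e9

# A saturated 1 Gb/s proxy (the TRIUMF case) moves ~10.8 TB/day,
# and QMUL's ~10 TB/day corresponds to an average of ~0.93 Gb/s,
# so the 5 Gbps+ seen at the NAT gateway was a peak, not the mean.
print(gbps_to_tb_per_day(1.0))   # ~10.8
print(tb_per_day_to_gbps(10.0))  # ~0.93
```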
- Slide 19
- Failover: jobs per 4 hours