© 2009 VMware Inc. All rights reserved
VMware for Business Continuity
What’s new with SRM 5
Vadim Shvarts, Sr. Systems Engineer, VMware [email protected]
2 Confidential
Introduction – VMware for Business Continuity
3
43% of companies experiencing disasters never re-open, and 29% close within two years. (McGladrey and Pullen)
93% of businesses that lost their data center for 10 days went bankrupt within one year. (National Archives & Records Administration)
40% of all companies that experience a major disaster will go out of business if they cannot gain access to their data within 24 hours. (Gartner)
Top executives say 10 hours to recovery; IT managers say up to 30 hours. (Harris Interactive)
Disasters Happen. Do You Need Protection?
4
Business-Critical Applications Require Business Continuity
Availability Expectations on vSphere Continue to Increase
RTOs decreasing from >24 hours to <12 hours
% of Application Instances Running on VMware in Customer Base (2010 vs. 2011)
MS Exchange: 38% (2010), 42% (2011)
MS SQL: 43% (2010), 47% (2011)
MS SharePoint: 53% (2010), 67% (2011)
Oracle Middleware: 25% (2010), 34% (2011)
Oracle DB: 25% (2010), 28% (2011)
SAP: 18% (2010), 28% (2011)
Source: VMware customer survey, Jan 2010 and April 2011 interim results. Data: Total number of instances of that workload deployed in your organization and the percentage of those instances that are virtualized
5
Drawbacks Of Traditional Business Continuity Solutions
Middleware / Java
Oracle RAC
Oracle DataGuard
DB Mirroring
MS Clustering
DB Access Groups
CCR / SCR
App Server Cluster
Session State Replication
Backup
Data replication
Application-level availability silos: Complex and expensive
Shared availability services: Longer RTOs and RPOs
Availability requirements
Local Availability
Data Protection
Disaster Recovery
6
Improving Business Continuity At All Levels
Local Availability
vSphere High Availability
vSphere Fault Tolerance
vMotion and Storage vMotion
Data Protection
vSphere Data Recovery
Storage APIs for Data Protection
Local Site Failover Site
Disaster Recovery
vCenter Site Recovery Manager
Includes vSphere Replication
New in 2011
Improved in 2011
Improved in 2011
vSphere vSpherevSphere vSphere vSphere
Improved in 2011
7
Transforming Cost And Complexity Of Business Continuity
Chart axes: RTO / RPO (Continuous, Minutes, Hours, Days) against Cost ($ per app: $100, $1,000, $10,000)
Shared availability services
(traditional backup, replication)
VMware business continuity
(HA, FT, vMotion, SRM, VDR)
App-level availability (Oracle RAC, MSCS, …)
• Much better RTOs than traditional backup and replication
• Similar or lower cost
• Similar RTOs to app-level availability solutions
• Much lower cost / complexity
8
Better Business Continuity Is #1 Objective For Virtualization
Top Five Objectives for Virtualization
Use virtualization to improve Business Continuity and Disaster Recovery (BCDR) 46%
Improve virtual machine performance 33%
Increase the server consolidation ratio 32%
Improve VM environment management 31%
More mission-critical applications 24%
Source: WW VMware customer survey, January 2010
N=1083
9
Simple and Reliable DR with vSphere and SRM
10
Challenges of Traditional Disaster Recovery
Expensive
Complex
Recovery Plans
Unreliable Failovers
Apps
Hosts
Storage
Network
Software
Hosts
Storage
Facilities
>$10K per app
Expensive + Complex + Unreliable Failovers = Failure to meet business requirements
• Long RTOs – days to weeks
• Too much time and resources consumed
11
vSphere Provides The Best Foundation For Disaster Recovery
Flexible Infrastructure
• Eliminate need for identical hardware across sites
• Enable waterfalling of equipment to recovery site
Simple Application Protection
• Entire system – including application, OS, and data – is stored as virtual machine files
• Entire system can be protected with data protection tools
Cost-Efficient Infrastructure
• Reduced hardware requirements at recovery site
• Use recovery hardware to run low-priority apps
Encapsulation
Consolidation
Hardware Independence
vSphere
vSphere vSphere
12
Encapsulation Simplifies Application Protection And Recovery
Simplify recovery
• No operating system re-install or bare-metal recovery
• No time spent reconfiguring hardware
Standardize recovery process
• Consistent process independent of applications, operating systems and hardware
Physical (40+ hrs.): Configure hardware, Install OS, Configure OS, Install backup agent, Start “Single-step automatic recovery”
Virtual (< 4 hrs.): Restore VM, Power on VM
13
vCenter Site Recovery Manager Ensures Simple, Reliable DR
Provide cost-efficient replication of applications to failover site
• Built-in vSphere Replication
• Broad support for storage-based replication
Simplify management of recovery and migration plans
• Replace manual runbooks with centralized recovery plans
• From weeks to minutes to set up new plan
Automate failover and migration processes for reliable recovery
• Enable frequent non-disruptive testing
• Ensure fast, automated failover
• Automate failback processes
Site Recovery Manager complements vSphere to provide the simplest and most reliable disaster protection and site migration for all applications
Site A (Primary): Site Recovery Manager, VMware vCenter Server, VMware vSphere, Servers
Site B (Recovery): Site Recovery Manager, VMware vCenter Server, VMware vSphere, Servers
14
SRM Momentum
Introduced in Q2 2008
125,000+ units sold
5,000+ customers
50% annual growth in 2010
“If your organization is already taking advantage of virtualization, then adding Site Recovery Manager to handle disaster recovery is a no-brainer.”
― Jerry Wilkin, Senior Systems Administrator, Dayton Superior Corp
15
Key Components Of SRM 5
Storage
vCenter Server
Site Recovery Manager
Choice of Replication Options
Required at Both Protected and Recovery Sites
vSphere
Site Recovery Manager
• Manages recovery plans
• Automates failovers and failbacks
• Tightly integrated with vCenter and replication
vSphere Replication
• Bundled with SRM
• Replicates virtual machines between vSphere clusters
Storage-Based Replication (3rd party)
• Provided by replication vendor
• Integrated via replication adapters created, certified and supported by replication vendor
16
Site Recovery Manager Complements vSphere For DR
Traditional DR VMware
Consolidation to reduce costs X
Hardware independence at failover site X
Encapsulation for simple recovery of entire systems X
vSphere Replication X
Simple management of recovery and migration plans X
Automated DR failover and non-disruptive testing X
Streamline planned migrations and automated failback X
SRM Functionality
vSphere Functionality
17
SRM Provides Broad Application Coverage
Continuous
Hours
Days
App-level geo-clustering / load balancing
RTO
RTO: 30 minutes to hours
RPO: Flexible based on storage replication
RPO axis: Synchronous, Hours, Days
Site Recovery Manager
Tier 1
Tier 2
Tier 3
18
SRM Supports Flexible Topologies
Active-Passive Failover
• Most common traditional scenario
• Expensive dedicated resources
Active-Active Failover
• Leverage recovery infrastructure for test, development, training
• Utilize sunk cost of recovery site
Bi-directional Failover
• Production applications at both sites
• Each site acts as the recovery site for the other
Shared Recovery Sites
• Many-to-one failover
• Particularly useful for Remote Office / Branch Office
19
What’s New In Site Recovery Manager 5.0?
vSphere Replication
• Bundled with SRM at no additional cost
• Provides simple, cost-efficient replication between vSphere clusters
Automated failback
• Bi-directional recovery plans
• Automates failback to original site
Planned migration
• New workflow that can be applied to any recovery plan
• Ensures no data-loss, application-consistent migrations of virtual machines
Others
• More granular control over VM startup order
• Protection-side APIs
• IPv6 support
Expand DR coverage to Tier 2 apps and smaller sites
Streamline planned migrations (for disaster avoidance, planned maintenance, …)
20
Cost-Efficient Replication To Expand DR Coverage
21
DR Coverage Often Limited Due To High Protection Costs
Tier 1 Apps - Protected
Tier 2 / 3 Apps – Backup only
Corporate Datacenter
Small Sites – Backup only
Small Business
Remote Office / Branch Office
Need to expand DR protection
• Tier 2 / 3 applications in larger datacenters
• Small and medium businesses
• Remote office / branch offices
22
SRM Provides Broad Choice of Replication Options
vSphere Replication: Simple, cost-efficient replication for Tier 2 applications and smaller sites
Storage-based Replication: High-performance replication for business-critical applications in larger sites
Diagram: Site A (Primary) and Site B (Recovery), each with vCenter Server, Site Recovery Manager, and vSphere, linked by vSphere Replication and storage-based replication
23
vSphere Replication For Cost-Efficient, Simple Replication
Cost-efficient
Reduce storage costs by 2X
• Support for heterogeneous storage across sites, including non-replicating storage
• Use lower-end or older storage at failover site
Eliminate replication software costs
• vSphere Replication included with Site Recovery Manager at no additional cost
Simple
Manage replication directly from vCenter
• Eliminate complex interactions with storage teams
Manage replication at the individual VM level
• Eliminate need for complicated VM-to-LUN mapping
Powerful
15-minute RPOs
• Set RPOs between 15 minutes and 24 hours
Efficient network utilization
• Replicate only changed disk areas
Highly scalable
• 500 virtual machines
Limitations
• No automated failback
• File-level consistency only (except planned migration)
• No FT, templates, linked clones, physical RDMs
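The limits listed on this slide can be read as a simple eligibility check. The sketch below is illustrative only: `VmProfile` and `vsphere_replication_issues` are hypothetical names, not part of any VMware API, and the thresholds are the ones quoted on the slide (RPO between 15 minutes and 24 hours; no FT, templates, linked clones, or physical RDMs).

```python
from dataclasses import dataclass

# Limits quoted on the slide; the data model itself is a hypothetical illustration.
MIN_RPO_MINUTES = 15
MAX_RPO_MINUTES = 24 * 60


@dataclass
class VmProfile:
    """Hypothetical description of a VM being considered for vSphere Replication."""
    name: str
    rpo_minutes: int
    fault_tolerant: bool = False
    is_template: bool = False
    is_linked_clone: bool = False
    has_physical_rdm: bool = False


def vsphere_replication_issues(vm: VmProfile) -> list[str]:
    """Return the reasons a VM falls outside the limits listed on the slide."""
    issues = []
    if not MIN_RPO_MINUTES <= vm.rpo_minutes <= MAX_RPO_MINUTES:
        issues.append("RPO must be between 15 minutes and 24 hours")
    if vm.fault_tolerant:
        issues.append("FT-protected VMs are not supported")
    if vm.is_template:
        issues.append("templates are not supported")
    if vm.is_linked_clone:
        issues.append("linked clones are not supported")
    if vm.has_physical_rdm:
        issues.append("physical RDMs are not supported")
    return issues
```

An empty list means the VM fits the stated limits; a VM that does not fit would be a candidate for storage-based replication instead.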
24
Storage Replication
Expand DR Protection To Tier 2 Apps And Small Sites
Tier 1 Apps
Tier 2 / 3 Apps
Corporate Datacenter
Small Sites
Small Business
Remote Office / Branch Office
vSphere Replication
vSphere Replication
vSphere
Chart: Storage, Replication, and SRM Costs per Protected VM
• Storage Replication (large site): Tier 1 Storage Failover Site, Replication SW, SRM Enterprise
• vSphere Replication (small site): Tier 2 Storage Failover Site, SRM Standard
• Values shown: $1,000, $2,000, $2,000/VM, $600/VM
25
Simplify Replication Management With vSphere Replication
Overview
vSphere Replication provides simple management of replication
• Managed directly from vCenter
• Managed at the individual VM level
Benefits
• Eliminate complex interactions between vSphere and storage teams to set up replication
• Eliminate need to shuffle VMs between datastores to map applications to replicated LUNs
Diagram: Storage-based Replication (vSphere Admin and Storage Admin; Web, SharePoint, SQL, and App VMs on VMFS A and VMFS B datastores, LUN 1 and LUN 2, in a datastore group) compared with vSphere Replication (vSphere Admin only; Web, SharePoint, SQL, and App VMs)
26
vSphere Replication Architecture
Tightly Integrated With SRM, vCenter and ESX
Diagram: Protected Site and Recovery Site, each with Site Recovery Manager, vCenter Server, a vSphere Replication Management Server, ESX/ESXi hosts, and any storage supported by vSphere; a VSR Agent and a vSphere Replication Server carry the replication traffic between the sites
27
Simple Recovery and Migration Plans
28
Simple Setup And Management of Recovery And Migration Plans
From Complex Runbooks…
• Weeks or months to set up
• Error-prone
• Quickly falls out of sync with apps and infrastructure changes
…to Simple Recovery Plans
• Simple recovery plan set up in minutes
• Fewer steps means far less room for errors
• Simple to keep in sync with changes
29
Five Simple Steps To Create Recovery And Migration Plans
Create Recovery Plans in 5 Steps…
Step 1: Map production site resources to recovery site
• Resource pools
• vSwitches
• VM folders
Step 2: Select virtual machine protection groups to include in recovery
Step 3: Specify boot sequence of recovered VMs
Step 4: Customize IP addresses of recovered VMs
Step 5: Select low-priority VMs to suspend at recovery site
Optional: Add messages and custom scripts
…And Eliminate Manual Steps of Traditional Recovery
X Coordinate storage and replication processes for recovery
• Stop replication and make replicated LUNs writable
• Present data to applications
• Present VMs to vSphere
X Reconfigure individual hosts
X Reconfigure physical switching infrastructure
X Recover entire systems including OS and application binaries
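The five plan inputs on this slide map naturally onto a small record. The sketch below is a hypothetical data model for illustration, not the SRM object model or API: the `RecoveryPlan` class, its field names, and the sample VM and resource names are all assumptions.

```python
from dataclasses import dataclass, field


@dataclass
class RecoveryPlan:
    """Hypothetical container for the five plan inputs named on the slide."""
    resource_mappings: dict[str, str]   # step 1: production resource -> recovery resource
    protection_groups: list[str]        # step 2: groups of VMs included in recovery
    boot_priority: dict[str, int]       # step 3: VM name -> priority (1 boots first)
    ip_overrides: dict[str, str] = field(default_factory=dict)    # step 4: VM -> recovery IP
    suspend_at_recovery: list[str] = field(default_factory=list)  # step 5: low-priority VMs
    custom_steps: list[str] = field(default_factory=list)         # optional messages/scripts

    def boot_order(self) -> list[str]:
        """VM names sorted by priority, then by name so ties are stable."""
        return sorted(self.boot_priority, key=lambda vm: (self.boot_priority[vm], vm))


# Made-up example values, for illustration only.
plan = RecoveryPlan(
    resource_mappings={"prod-pool": "dr-pool", "prod-vswitch": "dr-vswitch"},
    protection_groups=["tier2-apps"],
    boot_priority={"web01": 2, "sql01": 1, "app01": 2},
    ip_overrides={"web01": "10.20.0.15"},
    suspend_at_recovery=["test-vm-07"],
)
print(plan.boot_order())  # ['sql01', 'app01', 'web01']
```

Keeping the plan as data rather than as a written runbook is what makes it cheap to re-validate when applications or infrastructure change.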
30
Application Consistent Recovery With SRM
Storage-based replication: application consistency widely available
• Enabled by replication management software
• Typically relies on agents in the VMs to properly quiesce applications
• For both DR failover and planned migrations
vSphere Replication: Application consistency for planned migrations only
• File-system consistency for DR failover via VSS requester in VMware Tools
Application Consistency Enabled by Replication Provider
Replication management: Quiesce application, Replicate app-consistent VM, App-consistent VM presented to SRM
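The consistency rules on this slide depend on two things: the replication option and the workflow being run. A minimal sketch of that decision follows; the enum and function names are hypothetical and only encode what the slide states.

```python
from enum import Enum


class Replication(Enum):
    STORAGE_BASED = "storage-based replication"
    VSPHERE = "vSphere Replication"


class Workflow(Enum):
    DR_FAILOVER = "DR failover"
    PLANNED_MIGRATION = "planned migration"


def expected_consistency(replication: Replication, workflow: Workflow) -> str:
    """Consistency level the slide describes for each combination."""
    if replication is Replication.STORAGE_BASED:
        # Depends on the vendor's replication management software and,
        # typically, on agents inside the VMs that quiesce applications.
        return "application (when enabled by the replication provider)"
    if workflow is Workflow.PLANNED_MIGRATION:
        return "application"
    # vSphere Replication during DR failover: VSS requester in VMware Tools.
    return "file-system"
```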
31
Fully Automated Disaster Failovers and Planned Migrations
32
Beyond DR: Disaster Avoidance And Planned Migrations
3 typical use-cases for SRM
Disaster Failover
Recover from unexpected site failure
• Full or partial site failure
The most critical but least frequent use-case
• Unexpected site failures do not happen often
• When they do, fast recovery is critical to the business
Disaster Avoidance
Anticipate potential datacenter outages
• For example: in case of forecast hurricanes, floods, forced evacuation, etc.
Initiate preventive failover for smooth migration
• Leverage SRM ‘planned migration’ to ensure no data-loss
• ‘Automated failback’ enables easy return to original site
Planned Migration
Most frequent SRM use case
• Planned datacenter maintenance
• Global load balancing
Streamline routine migrations across sites
• Test to minimize risk
• Execute partial failovers
• Leverage SRM ‘planned migration’ to ensure no data-loss
• ‘Automated failback’ enables bi-directional migrations
33
SRM Reduces Recovery Risk With Frequent Testing
During the testing gap, organizations can’t be sure that they can recover the current IT environment
A failover scenario may take days or weeks to complete, leaving the business at extreme risk
SRM provides assurance that DR objectives will be met.
Lack of confidence in DR process
Chart: Recovery risk over time. Traditional Disaster Recovery: a TESTING GAP between DR tests, during which changes to applications and infrastructure configuration accumulate. Site Recovery Manager: frequent DR testing.
34
SRM Enables Frequent Non-Disruptive Testing
Overview
Automate test execution
• Execute recovery plan
• Customizable for testing with extra callouts and breakpoints
• Log results of the test
Isolated test environment
• Snapshot replicated LUNs
• Launch VMs in fenced network
• Reset environment after test
Benefits
• Confidence and documentation that DR requirements are satisfied
• Quickly identify and remediate potential issues
• Reduce cost and resources required for DR testing
• Eliminate traditional ‘DR testing weekends’
Diagram: Non-disruptive Testing (Recovery Site: isolated test environment running from a LUN snapshot on vSphere; Replication)
35
Automate DR Failover Processes
Overview
Automatically detect site failures
• Require user to manually initiate failover
Automate recovery process
• Stop replication and present replicated LUNs to vSphere
• Execute user-defined recovery plan
Benefits
• Ensure fast and predictable failovers and migrations
• Consistently meet business requirements
• Minimize risk of user errors
DR Failover (from Site A to Site B)
1 Raise alert when heartbeat lost
2 User initiates failover
3 Stop replication and present LUNs to vSphere
4 Recover VMs
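The key design point in this sequence is that detection is automatic but the failover itself waits for a person. The sketch below models only that gating logic; `run_dr_failover` is a hypothetical function and the log strings simply echo the slide's four steps.

```python
def run_dr_failover(heartbeat_lost: bool, user_confirmed: bool) -> list[str]:
    """Return the steps that would run, following the slide's four-step flow."""
    log: list[str] = []
    if not heartbeat_lost:
        return log  # nothing to do while the protected site is reachable
    log.append("raise alert: heartbeat lost")
    if not user_confirmed:
        # Failover is never started automatically; a user must initiate it.
        log.append("waiting for user to initiate failover")
        return log
    log.append("user initiates failover")
    log.append("stop replication and present LUNs to vSphere")
    log.append("recover VMs")
    return log
```

Requiring manual initiation guards against failing over on a false alarm, such as a lost network link between otherwise healthy sites.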
36
Testing and Executing Recovery Plans
Screenshot callouts: Steps in recovery plan; Status and time stamps; When to execute; User confirmation message
37
Planned Migrations For App Consistency & No Data Loss
Overview
Two workflows can be applied to recovery plans:
• DR failover
• Planned migration
Planned migration ensures application consistency and no data-loss during migration
• Graceful shutdown of production VMs in application-consistent state
• Data sync to complete replication of VMs
• Recover fully replicated VMs
Benefits
• Better support for planned migrations
• No loss of data during migration process
• Recover ‘application-consistent’ VMs at recovery site
Planned Migration (from Site A to Site B)
1 Shut down production VMs
2 Sync data, stop replication and present LUNs to vSphere
3 Recover app-consistent VMs
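One way to picture the planned-migration workflow is as an ordered list of steps that halts at the first failure, so VMs are never recovered from an incomplete sync. That stop-on-error behavior is an assumption made here to illustrate the "no data loss" goal; the slide does not spell it out. All names below are hypothetical.

```python
from typing import Callable

# A step is a label plus an action that reports success (True) or failure (False).
Step = tuple[str, Callable[[], bool]]


def run_planned_migration(steps: list[Step]) -> list[str]:
    """Run steps in order; stop at the first failure and return completed labels."""
    completed: list[str] = []
    for label, action in steps:
        if not action():
            break  # assumed behavior: do not continue past a failed step
        completed.append(label)
    return completed


# Example with stand-in actions; a real action would call into the platform.
example_steps: list[Step] = [
    ("shut down production VMs", lambda: True),
    ("sync data, stop replication and present LUNs to vSphere", lambda: True),
    ("recover app-consistent VMs", lambda: True),
]
```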
38
Automated Failback To Streamline Bi-Directional Migrations
Overview
Re-protect VMs from Site B to Site A
• Reverse replication
• Apply reverse resource mapping
Automate failover from Site B to Site A
• Reverse original recovery plan
Restrictions
• Does not apply if Site A has undergone major changes / been rebuilt
• Not available with vSphere Replication
Benefits
Simplify failback process
• Automate replication management
• Eliminate need to set up new recovery plan
Streamline frequent bi-directional migrations
Diagram: Automated Failback (reverse replication from Site B to Site A; reverse original recovery plan)
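"Apply reverse resource mapping" amounts to inverting the production-to-recovery mapping used for the original failover. A minimal sketch follows, with a hypothetical `reverse_mapping` helper; it rejects mappings that cannot be inverted unambiguously, which is one plausible reading of why failback does not apply after major changes at Site A.

```python
def reverse_mapping(mapping: dict[str, str]) -> dict[str, str]:
    """Invert a Site A -> Site B resource mapping for failback (B -> A)."""
    reversed_map: dict[str, str] = {}
    for source, target in mapping.items():
        if target in reversed_map:
            # Two Site A resources share one Site B target: no unique way back.
            raise ValueError(f"{target!r} is the target of more than one mapping")
        reversed_map[target] = source
    return reversed_map
```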
39
Next Steps
40
Successful Business Continuity Requires Careful Planning
Business Requirements / Business Impact Analysis (BIA)
• Map service tiers by availability requirements and cost
• For each service, identify availability requirements, Recovery Time Objectives (RTO), and Recovery Point Objectives (RPO)
Application Dependency Mapping
• Identify dependencies between application components
• Weakest link in the chain? (AD, DNS, etc.)
Business Continuity Design
• App-specific solutions / virtualization for HA and DR / backup only
• Budget ahead of time
• Project planning / phasing
Use Professional Services
• VMware PSO
• VMware BCDR Competency partners (300+ highly qualified partners)
41
SRM 5 Editions Lineup
SRM 5 editions: Standard and Enterprise
Price per protected virtual machine (license only): Standard $195, Enterprise $495
Scalability Limits
• Maximum protected VMs: Standard 75 virtual machines (1), Enterprise Unlimited (2)
Features
• Support for storage-based replication
• Centralized recovery plans
• Non-disruptive testing
• Automated DR failover
• vSphere Replication
• Automated failback
• Planned migration
New in SRM 5.0
1. Maximum of 75 VMs per site and per SRM instance
2. Subject to the product’s technical scalability limits
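The per-VM pricing and the Standard edition cap make license cost a one-line calculation. The sketch below uses only the list prices and the 75-VM limit quoted on this slide; the function name is hypothetical, and the result covers licenses only, not storage, replication, or support.

```python
# List prices per protected VM (license only) and the Standard cap, from the slide.
PRICE_PER_VM = {"standard": 195, "enterprise": 495}
STANDARD_MAX_VMS = 75


def license_cost(edition: str, protected_vms: int) -> int:
    """License-only cost in dollars for the given edition and VM count."""
    if edition not in PRICE_PER_VM:
        raise ValueError(f"unknown edition: {edition!r}")
    if protected_vms < 0:
        raise ValueError("protected_vms must not be negative")
    if edition == "standard" and protected_vms > STANDARD_MAX_VMS:
        raise ValueError("Standard is limited to 75 protected VMs per site and SRM instance")
    return PRICE_PER_VM[edition] * protected_vms


print(license_cost("standard", 75))     # 14625
print(license_cost("enterprise", 100))  # 49500
```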
42
VMware BC/DR Service Offerings
VMware vCenter Site Recovery Manager Jumpstart
• The VMware vCenter Site Recovery Manager Jumpstart provides you with a proof-of-concept, on-site installation and configuration of SRM
• 3 days on-site, 5 participants max
Custom BCDR Plan and Design Service
• Comprehensive architectural design for BCDR, covering data protection, local availability, and disaster recovery.
• Address customer-specific requirements
• Flexible engagement model and duration
Questions?