Upload
arrow-ecs-uk
View
891
Download
0
Embed Size (px)
DESCRIPTION
Enterprise Management – Solution Suite Overview A look at each new solution suite and how they can be used in harmony to make strides towards delivering IT as a service. Mark Stockham, Enterprise Management Specialist
Citation preview
© 2011 VMware Inc. All rights reserved
Enterprise Management
Mark Stockham
VMware Strategist: Management, Automation and Cloud
2
Disclaimer
This session may contain product features that are currently under development.
This session/overview of the new technology represents no commitment from VMware to deliver these features in any generally available product.
Features are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind.
Technical feasibility and market demand will affect final delivery.
Pricing and packaging for any new technologies or features discussed or presented have not been determined.
3
When your customers travel towards an issue at 70mph…….
Understands all of the underlying systems
Scales!
Provides a simple, actionable view of what’s going on
Alerts you to problems before users notice or SLA is breached
Provides likely root cause analysis point to the problem area
A single console puts everything into context
We need to take an approach that:
4
vCenter Operations Management Suite 5.0 Packaging
Standard Edition Enterprise Plus Edition
VC Ops Mgr 5.0 – Std. VC Ops Mgr 5.0(incl. CapIQ)
VC Infra Navigator **
VC Configuration Mgr
** Not Available a-la-carte.
Chargeback
Advanced Edition
VC Ops Mgr 5.0(incl. CapIQ)
For cloud and heterogeneous environments
For large vSphere environments
Automated Operations Management
Pricing and Packaging changes effective Q1 2012.
For SMB and small vSphere environments
Enterprise Edition
VC Ops Mgr 5.0 (incl. CapIQ)
VC Infra Navigator **
VCM for vSphere **
Chargeback
For virtual and cloud infrastructure
New SKU New Name
5
vCenter Operations Suite - Feature Comparison
Standard Advanced Enterprise Enterprise Plus
Edition Feature Summary
Performance Management
Performance analytics for vSphere
Performance analytics for vSphere
Performance analytics for vSphere, Hyperic and VCM included
Extensible to 3rd party monitoring tools
Customizable Dashboards
Performance analytics for all VMware + 3rd party monitoring tools included
Customizable Dashboards
VC Ops
Manager 5.0Capacity
Management Alerting and
trending
Alerting, trending and reporting
Resource optimization What-if scenarios
Alerting, trending and reporting
Resource optimization What-if scenarios
Alerting, trending and reporting
Resource optimization What-if scenarios
Configuration & Compliance
vSphere change events
vSphere change events vSphere change events Configuration and
compliance for vSphere hosts
vSphere change events
Configuration and compliance for vSphere hosts, VMs and physical servers
VCM
5.4
Application Mapping
None None Virtualized applications running on vSphere
Virtualized applications running on vSphere
VIN
1.0Cost Metering and Reporting
None None vSphere environments vSphere environments CB 2.0
6
vCenter Operations Management Suite
Business Benefits delivered by vCenter Operations
“We’ll be able to reduce our monitoring tools from
over 300 to about 30.”
Kaiser Permanente
ComprehensiveVisibility
IntelligentAutomation
ProactiveManagement
“Notified the storage team before they were
even aware of an issue.”
Maximus
“Troubleshooting time reduced by 50%”
TUI Infotec
• Higher QoS
• Fewer Incidents
• Tool Consolidation
• Compliance
• Faster MTTR
• Improved Collaboration
• Resource Utilization
• …
7
How does Cloud impact Operations Management?
Scale
Higher VM/Admin Ratios
More Responsibility
More Data
1
Change
Rapid Change
Dynamic Environment
Unplanned Demand
2
Convergence
New Architectures
Heterogeneous Environments
Embedded Management
3
“… have 2 VI admins managing 1,600 VMs.”
Christus Health
“False alerts and data overload -- 25,000 static thresholds and metrics”
Stanford Medical
“Lot of friction between the infrastructure and
operations teams”
Maximus
8
Automated Cloud Operations Workflow
HealthDegradation
RootCause
AnalysisIsolateRemediate
Plan Optimize
Performance Issue
Change Configuration
CapacityShortfall
9
So………
What going wrong right now?
10
vC Ops Badges – Standard vs. Advanced
11
vC Ops Default UI – Major and Minor Badges
• High level Understanding
• Calculated from scores of Minor Badges
Major x 3
Minor x 8• Specifics
• Guidance
12
Operations: Major Badge – Health
“How is this object doing right now?"
• Identifies current problems in the system
• Issues that need to be resolved immediately to avoid problems
High Health is good (100-0)
Heatmap
• Provides quick view of many objects at once
• Shows Health of all parent and child objects
• Go back in time (6 hours) and see the “weather” of the Virt Infrastructure
Health Score is calculated from its Minor Badges
• Workload
• Anomalies
• Faults
13
Operations: Health Minor Badge – Workload
Measures how hard an object is working?
High Workload is bad (0-100 or more!)
• Percentage of Demand divided by effective capacity
• As workload approaches (and exceeds) 100% Performance Problems!
Starving object for resources!
Focused attention
• CPU
• Memory
• Disk I/O
• Network I/O
95
Improved Network and Disk I/O calculations
Eliminates idle networks and storage from showing High Workload
Limit the erroneous 100% Workload scores
14
Operations: Health Minor Badge – Anomalies
Measures how normal is this object behaving?
• Is what the vC Ops 1.x Health score was, but now inversed
Derived from the number of metrics that are outside of their “Normal” trended ranges
• Learns dynamic ranges of “Normal” for each metric
• Identifies metric abnormalities
Low Anomalies is good (0-100)
• Zero meaning the object is performing exactly the way vC Ops expects it to for that time of the day, that day of the week
• A high number of anomalies are usually an indication of a problem
Anomalies Chart
• Current number of Abnormal Metrics
• Problem/Noise Threshold
Crossing problem threshold will increase the Anomalies Score
Does not generate an alert in this vSphere UI
15
Operations: Health Minor Badge – Faults
Measures the degree of faults or problems the object is experiencing
• Pulled from active vCenter events
VMware specific knowledge of which vCenter Events affect Availability and Performance (examples):
• Loss of redundancy in NICs or HBAs
• Memory checksum errors
• HA failover problems
Low Faults is good (0-100)• Each fault has a default score (e.g. 25,
50, 75, 100)• Highest individual Fault Score drives the
Fault object Score
Best Practices:
• Do not change the Faults Threshold
• Use Alerts View to manage Faults
Faults shown in Widget
16
Operations: Details
Workload Badge Focus : Datastore Example
Space Available
Throughput
IOPS
Latency
17
Operations: Details
Anomalies Badge FocusSubset of the
Anomalies for an object
Help with any troubleshooting
efforts
Visualize magnitude and
impact
18
Operations: Details
Fault Badge Focus
Details of vCenter Faults
19
That’s great, but what’s next, where are we going?
20
Past Data – Past deployment trends and VM sizes considered
At-a-glance charts and graphs
• Datacenter to individual VM
• Managed objects and compute resources
• Deployed and remaining capacity perspectives
21
The Present - Capacity Optimization
Identify and Reclaim Inefficient, Unused Capacity
• Right-size or decommission unused or inefficient VMs
• Use the capacity modeling to predict the savings
• Eliminate waste and reduce costs in your organization
22
The Future – “drop dead” date and “what if” scenario modelling
Confidential
23
Capacity Planning: Major Badge – Risk
Are there future risks to my systems and VI?
Identifies potential problems that could eventually hurt the performance
Low Risk is good (0-100)
Risk Score is calculated from its Minor Badges
• Time Remaining
• Capacity Remaining
• Stress
Risk Chart
• Shows Risk score over the last 7 days
24
Capacity Planning: Risk Minor Badge – Time Remaining
Measures time remaining before each resource type reaches its capacity
• CPU
• Memory
• Disk
• Network I/O
Early warning of upcoming provisioning needs
• Avoid future performance issues
High Time Remaining is good (100-0)
Graph shows resource utilization trends
25
Capacity Planning: Risk Minor Badge – Capacity Remaining
Measures how many more VMs can be placed on the object
Percentage of Total VM “Slots” Remaining
• Based on the average size of the VM on the object (e.g. VM profile)
• Each object has its OWN VM profile size: Host, Cluster, Datacenter, Etc.
High Capacity Remaining is good (100-0)
• Zero mean no room left for more VMs
333 More VMs correlates to 77% Capacity Remaining for this object
26
Capacity Planning: Risk Minor Badge – Stress
Stress measures long-term or chronic workload
• Workload shows an instantaneous value
• Stress looks over a longer period of time
Quickly find and resolve
• Undersized objects
• Population contention
Low Stress is good (0-100)
Stress score encompasses a six (6) week period
• Workloads > 70% = “Stressed”
• Threshold Configurable
Chart shows weeks break down of Stress for each day/hour averaged over the last six (6) Weeks
27
Capacity Planning: Major Badge – Efficiency
Are there optimization opportunities in my systems?
Shows you how to run a leaner datacenter
Save $$$ by better utilizing resources
High Efficiency is good (100-0)
Efficiency Score is calculated from its Minor Badges
• Reclaimable Waste
• Density
Graph Depicts VMs by Percent
• Optimal – Optimally Provisioned VMs
• Waste – Over Provisioned VMs
• Stress – Under Provisioned VMs
Three Resources Considered• CPU• Memory• Disk Space
Note: VMs can appear in Stress and Waste
28
Capacity Planning: Efficiency Minor Badge – Reclaimable Waste
Measures the over-provisioning for an object
It identifies the amount of reclaimable resources
• CPU
• Memory
• Disk
Low Reclaimable Waste is good (0-100)
Reclaimable Waste = Reclaimable Capacity / Deployed Capacity
• Score depicts the MAX of the CPU, Memory and Disk calculation
• Disk calculation can also include old snapshots and templates
Graph shows breakdown of the Waste section of the Efficiency Badge pie chart
• % Idle VMs (based on configured settings)
• % Powered Off VMs
• % Oversized VMs
29
Efficiency Configuration – Powered-Off & Idle VMs
Access via Configuration Widget
Powered-Off Threshold
• Based on % time
Idle VM Detection
• Based on % time
- AND -
• All or One of the following thresholds
• CPU
• Disk I/O
• Network I/O
Listed as Powered-Off if the total powered-off time > given % Time Powered-Off Threshold in a given
time interval
Listed as Idle if the total time during which all or any of the resource usage is below the specified thresholds in a given time interval
30
Efficiency Configuration – Oversized VMs
Access via Configuration Widget
Oversized Detection
• CPU and/or Memory Workload
Oversized Threshold
• What percentage of Oversized is acceptable
• When should an object be reported
An Object is oversized if its degree oversized is greater than
the % Oversized threshold
For the given time interval, CapacityIQ first calculates if a physical resource (viz. CPU,
Memory) is over-sized based on the configurable Utilization Less Than
threshold.
31
Capacity Planning: Efficiency Minor Badge – Density
Contrasts Actual vs. Ideal Density
Identify Optimal Resource Deployment Before Contention Occurs
Greater Consolidation $$$
High Density is good (100-0)
Measures consolidation ratios:
• VMs/Host Ratios
• vCPU/Physical CPU Ratios
• vMem/Physical Memory Ratios
32
Health Degradation – Smart Alerts (All Editions)
Overview Proactive alerts that provide early
warning on building issues
Identify upcoming health, performance and capacity issues
Automatic root cause analysis of offending metrics across all layers
Benefits Advance notification of abnormal
behavior help avoid incidents
Immediately focus on the root cause rather than symptoms or false alerts
Root cause and potential impact
Early warningSmart Alert
33
What does my environment look like?
From a configuration perspective?
34
VCM
Compliance Checking of vSphere Hosts (New in Enterprise Edition)
vSphere
Compliance
Policy
Reference Host
vSphere
vSphere
vSphere
vSphere
vCenter
vSphere
vSphere
vSphere
vSphere
vCenter
Reference Host
Overview Leverage out-of-the-box vSphere
hardening guidelines from CP&C
Assess vSphere compliance from within vCenter
Push compliance to the entire environment through Host Profiles
Launch into vCM to visualize information across the infrastructure
Benefits Maintain a compliant infrastructure
35
What does my environment look like?
From a relationship perspective?
36
Navigator – Value Proposition Summary
vCenter Operations: Common language and context with App owners for problem resolution
• App context needed for triage, change impact, proactive resource planning
• Triage for actual & potential issues for applications and services
• Proactive planning to avoid VM starvation as cause of application issues
• Manage application / change impact of planned and unplanned VI changes
•
Site Recovery Manager : Create & update protection groups/plans based on app visibility
• Planning: Use the app context to protect the right VMs, know external dependencies.
• Change: Look for changes in app topology that require removing/adding VMs to protection plans.
• Test: Use up-to-date app dependency info when running DR tests & remediating plans
• Visualize protection readiness with application / VM overlays
HA / DRS Use app context to build HA clusters, affinity rules & placement constraints
• Know what app components/VMs of a specific app are more important to prioritize
• identify Apps (with versions) and ensure all components of app reside on same clusters to set up HA
• Set DRS rules to keep certain app components of app together or apart on different hosts
• vApps: Provide the Application dependency context as basis for creating and defining vApps
37
Complete dependency views of related application components
Visualize App dependencies
With VC related information (SRM, vApp, etc)
Map and Tabular
views
38
vCenter Infrastructure Navigator Capabilities
Automated discovery and dependency
mapping
Rapid and accurate discovery and dependency mapping of application services across virtual infrastructure
Rapid updates that keep mapping information up-to-date
Seamlessly present dependency Maps, summaries in vCenter UI
Search / Filter VMs based on application components
Integrates App dependencies in
day to day operational
interfaceOverlay relevant VC solution info (SRM, Datastore, Host, vApp ) for making decisions
OOTB knowledge base to identify application workloads
39
What does my environment look like?
From a cost perspective?
40
Resource Optimization based on Cost (New)
Opportunities to optimize resources
Overview vCenter Chargeback Manager 2.0
now included in some vCenter Operations Editions
Integration of cost metering and capacity analytics
Reporting of financial value of committed and reclaimable capacity
Benefits Make infrastructure optimization
decisions at a business-levelRecommended
configuration sizeCost
Savings
41
Key Capabilities
Track usage and configuration data for Cloud/virtual resources
Resource Metering
Define cost metrics related to virtual infrastructure resources
Price Modeling
Provide IT cost transparency with simplified reporting
Cost Reporting
Associate VMs and costs to different departments, business units, and groups
Hierarchy Management
42
What does my environment look like?
If I use have View in my environment?
43
Tech Preview: “vCenter Operations for View”
Overview End-to-end monitoring of infrastructure
Included PCoIP performance monitoring
Desktop, Pool and User Contexts
Self-Learning performance analytics
Automated alerts
Remediation guidance
Benefits Get to root cause quickly; Reduce MTTI
Respond proactively before support calls
Remediate quickly and accurately
Improve resource utilization by identifying over-provisioned hardware and track down bottlenecks
44
Summary
45
What’s New in vC Ops Mgr 5.0 – Scalability
Scales to – 6k powered on and 8k total VMs
Small – Supports up to 1500 Powered ON VMs
• 4 vCPUs
• 16 GB memory
Medium – Between 1500 and 3000 Powered ON VMs
• 8 vCPUs
• 24 GB memory
Large – 3000 to 6000 Powered ON VMs
• 16 vCPUs
• 32 GB memory
These are TOTALS for the vApp
• These resources may not be split EQUALLY between the 2 VMs
46
vCenter Operations Suite – Target Customer Profile
Standard Advanced Enterprise Enterprise Plus
Target Buyer
• VI Admins • VI Admins • Infrastructure Teams (VP/Director Infrastructure)
• Infrastructure + Operations Teams (VP/Director Operations)
Target Customer Profile
• vSphere centric• SMB’s with <100
VMs with less dynamic environments
• Requires performance analytics and capacity trending but not capacity planning or optimization
• vSphere centric• >100 VMs • Requires
performance analytics, capacity trending , capacity planning and optimization
• Large vSphere centric • Everything for the vSphere
environment including host configuration management
• Require performance analytics and capacity optimization and planning
• Require application mapping for vSphere
• Showback and possible chargeback needs
• Do not require management guest configuration data
• Can be extended to all of Enterprise Plus functionality
• Organizations of any size. that require vSphere and in guest functionality
• Can be heterogeneous environments.
• Require configuration changes and capacity optimization and planning for vSphere
• Require application mapping for vSphere
• Require configuration and compliance at the guest level
Scope of Solution
• vSphere• No PS
• vSphere• No PS
• vSphere• Limited PS• Includes VMware adapters
for vCenter Server, Configuration Manager and Hyperic
• Adapters for 3rd party monitoring tools optional
vSphere + heterogeneous environments
PS required Includes VMware
adapters for vCenter Server, Configuration Manager and Hyperic
Adapters for 3rd party monitoring tools included
47
Demo
48
Thank you