Upload
others
View
4
Download
0
Embed Size (px)
Citation preview
PG&E’s Emerging Technologies Program ET11PGE1051
Data Center Infrastructure Management (DCIM)
of IT Systems
ET Project Number: ET11PGE1051
Project Manager: Ilyssa Lu Pacific Gas and Electric Company Prepared By: Jameson Morrell CH2M HILL 150 Spear Street, Suite 750 San Francisco, CA 94105
Issued: April 25, 2012
Copyright, 2012, Pacific Gas and Electric Company. All rights reserved.
i
PG&E’s Emerging Technologies Program ET11PGE1051
ACKNOWLEDGEMENTS
Pacific Gas and Electric Company’s (PG&E’s) Emerging Technologies Program is responsible for this project. It was developed as part of PG&E’s Emerging Technology – DCIM program under internal project number ET11PGE1051. CH2M HILL team members Jameson Morrell, Chad Dikos, David Gray, and Andrew Solberg conducted this technology evaluation for PG&E with guidance and management from Ilyssa Lu. For more information on this project, contact [email protected].
LEGAL NOTICE
This report was prepared for PG&E for use by its employees and agents. Neither PG&E nor any of its employees and agents:
(1) Makes any written or oral warranty, expressed or implied, including but not limited to those concerning merchantability or fitness for a particular purpose;
(2) Assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, process, method, or policy contained herein; or
(3) Represents that its use would not infringe any privately owned rights, including but not limited
to patents, trademarks, or copyrights.
ii
PG&E’s Emerging Technologies Program ET11PGE1051
ABBREVIATIONS AND ACRONYMS
BMS building management system
CMBD configuration management database
ECM energy conservation measure
DCIM data center infrastructure management
IT information technology
kW Kilowatt
kWh kilowatt-hour
PG&E Pacific Gas and Electric Company
PUE power utilization effectiveness
ROI return on investment
SLA service level agreement
UPS uninterruptable power supply
VM virtual machine
iii
PG&E’s Emerging Technologies Program ET11PGE1051
CONTENTS
EXECUTIVE SUMMARY ____________________________________________________ 1
INTRODUCTION _________________________________________________________ 2
BACKGROUND __________________________________________________________ 2
EMERGING TECHNOLOGY _________________________________________________ 4
ASSESSMENT OBJECTIVES __________________________________________________ 6
Goals of Sentilla Management .................................................. 6
TECHNOLOGY EVALUATION ________________________________________________ 7
APPROACH ____________________________________________________________ 8
RESULTS _______________________________________________________________ 9
Installation and Mapping ......................................................... 9
Outputs and Analysis ............................................................ 10
Output Validation ................................................................. 10
EVALUATIONS _________________________________________________________ 11
Software Implementation and Outputs .................................... 11
OBSERVED SAVINGS AND BENEFITS _________________________________________ 12
Qualitative .......................................................................... 12
Quantitative ........................................................................ 12
Alignment with Initial expectations ......................................... 12
Sentilla post-installation review 13 AT&T post-instillation review 13
RECOMMENDATIONS ____________________________________________________ 14
“Dense” data centers with high asset utilization ....................... 14
Market Adoption ................................................................... 14
PG&E Incentive .................................................................... 15
APPENDICES __________________________________________________________ 16
Appendix A – Business Case and Incentive Calculator ............... 16
Business case simulation (Sheet 1) 16 Incentive Calculation (Sheet 2) 17
Appendix B – DCIM Vendors .................................................. 18
DCIM vendors identified: 18 Viridity: (http://www.viridity.com) 18
iv
PG&E’s Emerging Technologies Program ET11PGE1051
Nlyte: (http://www.nlyte.com) 19
Synapsense: (http://www.synapsense.com) 19
Sentilla: (http://www.sentilla.com) 20
REFERENCES ___________________________________________________________ 21
1
PG&E’s Emerging Technologies Program ET11PGE1051
EXECUTIVE SUMMARY
PROJECT GOAL
While there has long been building management systems software to aid management with
decisions about energy efficiency, the software to measure the energy efficiency of IT
systems in data centers has only recently been developed. This new type of software is the
focus of this emerging technology assessment.
PROJECT DESCRIPTION
PG&E arranged for CH2M HILL to observe and validate a proof of concept for the
installation, data management, analysis, outputs, and recommendations of a software
provider, Sentilla Software, for a large data center operator, AT&T. Sentilla’s software fills a
new niche not previously monitored or visible to data center managers. The Sentilla product
loads on top of existing systems and “manages the managers” who collect information.
CH2M HILL functionally assessed the main software, which included functions for monitoring
and calculating energy usage and server utilization. The Sentilla software ultimately
provides a visual map of the energy usage and percentage of utilization across a defined
boundary in the data center. Energy and usage charts combined with engineering and data
center experience help determine which servers could be consolidated or shut down (for
example, identifying servers with constant power draws over period of time as an indicator
that there is little to no server activity).
PROJECT RESULTS
Overall, there were positive sentiments regarding the fact that Sentilla’s software met the
goals of identifying equipment that could be consolidated or taken offline and demonstrating
potential savings for AT&T.
Both software supplier and data center operators were pleased with the initial investment,
installation, outputs, and findings from the implementation of Sentilla software. Accepting
the proof of concept and rolling the software out to full production sites was the ultimate
positive outcome of this technology assessment.
PROJECT RECOMMENDATIONS
The Sentilla software was easy to deploy, nonintrusive, and produced data outputs that
converted into easily understood reports regarding idle and underutilized servers. With
other software suppliers entering this marketplace, companies will soon have multiple
options to cost-effectively and quickly monitor server utilization. This new generation of
software will be largely useful in three ways:
1. Information used to build a business case for mobilizing the process and governance
to regularly consolidate or decommission servers
2. The ongoing monitoring of IT equipment for regular preventative maintenance and
maintaining efficiency performance standards
3. Consolidating racks and servers will save space and allow for a greater density of
servers and less data centers that may free up capital for other uses.
2
PG&E’s Emerging Technologies Program ET11PGE1051
INTRODUCTION While the growth in data center electricity demand has slowed, it currently accounts for
2 percent of all electricity used in the United States (Koomey, 2011). Data centers are
generally large facilities, in a single location, with concentrated electricity demands.
Historically, due both to the concentration and to large amounts of energy required by data
centers, innovative research and development has established a focus for energy
conservation. However, efforts to increase the energy efficiency of data centers have
primarily focused on electrical usage in facilities, for example, on cooling, air handling, and
power loss during distribution.
Currently, power utilization effectiveness (PUE) is the metric used to benchmark the energy
efficiency of data centers. The PUE is derived by dividing the total power used by the
amount of power used for information technology (IT) equipment:
PUE = Total data center power use
IT equipment power use
The average PUE for data centers is a ratio between 1.8 and 1.92 (Koomey, 2011), down
from 2.0 estimated in 2008. (Koomey, 2008) Most of the improvement in PUE can be
credited to the improved efficiency of data center design. More efficient facility designs and
retrofits have lowered the non-IT equipment power usage, the numerator in the PUE
equation. However, the denominator IT equipment power use has remained constant (for
example, 20/10 = 2; 18/10 = 1.8 over the last few years.
The focus of a new trend in energy efficiency and this emerging technology assessment is
the PUE denominator: IT technology equipment power use. Added efficiency can
achieved by installing more efficient individual hardware components such as Energy Star-
rated servers and by exercising operational discipline in turning off or consolidating
underutilized IT equipment. Common sense dictates that both the efficiency and usage of
facilities and IT equipment be addressed. While there has long been building management
systems software to aid management with decisions about energy efficiency, the software to
measure the energy efficiency of IT systems in data centers has only recently been
developed. This new type of software is the focus of this emerging technology assessment.
BACKGROUND There is an emerging market for energy efficiency focused exclusively on reducing IT
equipment power use, the denominator of PUE. And while IT technology hardware
equipment manufacturers have recently made progress in energy efficiency, the impacts of
efficient server designs and data storage configurations and the growth in cloud computing
are burgeoning and hard to ascertain (McKinsey, 2010).
Meanwhile, two inefficiencies in data centers stand out: powering and maintaining unused
and underutilized servers. A conservative estimate, based on data center surveys and
studies, places the level of unused (“comatose” or “zombie”) servers in data centers at
3
PG&E’s Emerging Technologies Program ET11PGE1051
about 10 percent (Green Grid, 2010). This means that large data centers with 5,000
servers, for example, house about 500 idle servers. These servers are fully powered but
idle, on hold to accommodate computing demand that never comes. How does this happen?
Root-cause analysis indicates that IT solutions are over-engineered and overbuilt because
business units often procure IT services in “silos” based on initial reliability and sizing
estimates. In other words, a best guess estimate is made without actually knowing the
actual computing needs and this leads to over engineered solutions to account for
overlapping worst-case scenarios. It is standard practice for business units to engage IT
support to design, procure, build, and operate IT solutions. There is generally a 1:1
relationship between IT and business unit with a service level agreement (SLA) approved for
the ongoing operation of each server.
This standard practice has led to the development of many silos of business IT servers
operating in isolation, with excessive reserves of computing power. These silo-ed servers
are idle or underutilized, stacked next to one another on racks, creating “server sprawl” in
data centers. Studies show estimates of server utilization rates can be as low as 7 to 10
percent. For every 100 watts of power consumed by data centers, only 3 watts are
associated with computing (Aggar, 2011). If the denominator in PUE was changed to the
“computing” power of IT equipment, these data centers would score a ratio of 33 instead of
approximately 2.
CH2M HILL estimates that the cost for energy, operations and maintenance, and capital
investment to operate an average server is roughly $4,900 a year. For large data centers,
this may add up to millions of dollars a year spent on idle servers. That estimate accounts
for the cost of zombies alone and does not even include the potential cost savings to be
realized by consolidating underutilized hardware (Simulation tool provided in Appendix A).
The reason IT and business managers have not addressed this issue is visibility: there is a
lack of mapping to identify underutilized IT hardware in data centers.
4
PG&E’s Emerging Technologies Program ET11PGE1051
EMERGING TECHNOLOGY Most IT managers do not have a detailed view of IT equipment energy consumption because
equipment supports operational functions with widely varying demand patterns. Also,
energy use has traditionally not been tracked. The decision to consolidate or remove idle
equipment is made on a case-by-case basis with limited visibility of entire data center
consumption patterns. Further, many business managers are not aware of the lifetime costs
associated with idle servers or underutilized IT equipment. They are, instead, focused
narrowly on annual budgets for IT-related SLAs.
To fill this visibility void, IT equipment manufacturers and software designers are beginning
to develop equipment and software that generate reports and visualizations of actual or
computed utilization metrics across disparate IT equipment in data centers. As a result, data
center managers may now install software that will provide regular views across IT
equipment to spot idle or underutilized outliers. Both IT and business managers may use
this data to develop business cases and processes to turn off and consolidate IT equipment,
thus saving energy, operation and maintenance costs, and capital earmarked for new
data centers.
Ideally, a data center infrastructure management (DCIM) information solution will provide
visibility across all three data center assets and operations, including:
1. Facility systems (cooling and power)
2. Network systems (cabling, area networks)
3. IT systems (servers, storage, applications, & virtual machine - VM)
CH2M HILL conducted a limited review of ten software tools in this market niche. Four of
those tools were reviewed using a more detailed “desktop analysis” based on available
public information (Appendix B). The suppliers of software solutions currently available all
claim to meet the ideal standard, but programs tend to exhibit capabilities focused on one
type of data center system. All suppliers claim that their software will increase the visibility
and quantification of historical and real-time data in order to optimize existing systems and
provide accurate information for future system optimization and planning. The premise is:
more efficient facility and IT systems lead to cost savings.
A complete DCIM system solution addresses all three data center systems with regard to
these subjects:
Asset tracking (all three systems)
Real-time data collection
Historical data collection
Performance visualization
Capacity planning and reporting
Single data repository
Holistic approach
5
PG&E’s Emerging Technologies Program ET11PGE1051
As discussed earlier, data center energy use falls into three general categories: facility
systems, network systems, and IT systems. The figures below show a typical energy
allocation among these systems.
6
PG&E’s Emerging Technologies Program ET11PGE1051
The range of software programs currently available in the market place all seem to fall short
of providing a complete DCIM solution. Current software tools seem to focus expertise on a
specific data center system, but have gaps that do not support a holistic solution. For
example, one solution seems to address IT systems well, but lacks details on facility
systems. Another system presents the opposite problem: the product is robust for
measuring data center temperatures and monitoring facility systems, but weak for
measuring IT systems.
As DCIM solutions mature, the vendors with the most holistic approach will offer the data
center business the greatest value. It is therefore important to identify and understand the
gaps in software products. Existing solutions are generally strong in measuring facility
systems and there is a new emerging market for measuring the efficiency of IT systems. In
the future, these two specialty areas will likely merge into a single, well-integrated holistic
solution where, for example, computing demand triggers a cooling demand: a “just-in-time”
approach to energy use. This development would follow other trends in energy efficiency
that, in essence, automatically turns off energy supplies when demand lowers.
ASSESSMENT OBJECTIVES PG&E arranged for CH2M HILL to observe and validate a proof of concept for the
installation, data management, analysis, outputs, and recommendations of a software
provider, Sentilla Software, for a large data center operator, AT&T.
The software itself does not reduce energy usage; it is merely the means to an end. To
realize the benefits of the visibility provided by the software, a process and governance
must first be implemented. Ultimately, CH2M HILL’s evaluation rests on the assumption that
the software will meet the expectations of both the software provider and data center
operator. To set the baseline for success for the proof of concept, CH2M HILL interviewed
stakeholders from Sentilla Software and AT&T.
GOALS OF SENTILLA MANAGEMENT Conduct a technology “proof of concept” of an IT system-tracking tool that
can improve data center “hardware” efficiency and reduce energy demand
and costs.
Focus tracking specifically on optimizing the IT load of a data center to use
less energy, and not on lighting; heating, ventilation, and air conditioning,
and so forth (Because these are all part of a connected system, a decrease in
IT loads will have an impact on other aspects and reduce energy demand).
Demonstrate the software’s effectiveness to improve energy effectiveness.
Prove the concept of Sentilla software at an AT&T data center.
Identify dormant, broken, and inefficient equipment to consolidate or take it
offline and demonstrate potential for savings for large companies such as
AT&T, who have been running applications in multiple data centers for
decades.
Expect to demonstrate the potential for an approximate 15-percent reduction
in energy costs.
7
PG&E’s Emerging Technologies Program ET11PGE1051
Goals of AT&T MANAGEMENT
AT&T has many large data centers and, therefore, is making a significant
investment to achieve large energy and cost savings. The pilot proof of
concept aims to validate and achieve a fast return on investment (ROI).
Sentilla Software will provide AT&T with intelligence regarding the electrical
distribution load (uninterruptible power supply [UPS] and power distribution
unit [PDU]) and capacity level. When adding servers, the goal is to add them
to a floor that is running at 30-percent capacity and not at 80-percent
capacity. This will provide for maximum efficiency of power distribution (and
identify where to add load and save incremental bits of energy. Note that this
will not be the big energy saver).
Sentilla software can “reach out and touch” each appliance to determine and
monitor server capacity (CPU usage). Visibility will enable AT&T to merge
applications from several servers on one server while continuing to monitor
and document how much power servers use. Combining CPU usage and
power monitoring can help IT engineers make better decisions. This will
initiate a “domino” effect that will allow applications to be re-racked and
stacked within a smaller footprint. This will result in more space, more racks,
and better growth with more efficient use of energy.
The objective of this technology assessment is to observe the deployment of
an IT systems-focused solution with a large commercial operator of data
centers.
CH2M HILL is bringing together expertise in two areas to deliver this assessment:
the design, construction, and operation of data centers and the deployment of
software solutions. Together, these two technical perspectives cover the full range of
opportunities for energy efficiency in facilities and operations.
TECHNOLOGY EVALUATION Sentilla’s software fills a new niche not previously monitored or visible to data center
managers. The Sentilla product is “nonintrusive;” it loads on top of existing systems and
“manages the managers” who collect information. Typical software users include: data
center manager, building managers, and IT services department staff, specifically managers
of various SLAs. The software can generate a report that provides a snapshot of a defined
time that can, in turn, be used to brief business managers who generally do not log on to
the system as end users.
CH2M HILL functionally assessed the main software, which included functions for
monitoring and calculating energy usage directly, where available, or as a function of the
percentage of utilization and manufacturing specifications. The software works in several
ways by treating the data center racks and servers as variables in an equation where the
software calculates the total consumption level. Variables for different racks of servers are
calculated directly by monitoring energy usage and the percentage of utilization. For other
servers, the percentage of utilization telemetry data is known and manufacturing
specifications are used to estimate energy usage. In other cases, energy is monitored at a
8
PG&E’s Emerging Technologies Program ET11PGE1051
higher (rack) level and each server’s energy usage is estimated using a back calculation
based on manufacturing specifications. The Sentilla software ultimately provides a visual
map of the energy usage and percentage of utilization across a defined boundary in the data
center. Sentilla Software also offers other modeling and predictive modules that were not
observed during this assessment.
APPROACH CH2M HILL staff researched the DCIM software market and attended introductory and
training sessions with Sentilla staff while arranging to observe and, in some cases,
participate, in several steps, including:
Onsite installation of Sentilla software
Onsite mapping and data management exercises
Review of analysis and outputs with Sentilla and AT&T representatives
Attendance at meetings between Sentilla and AT&T representatives reviewing
recommendations on idle and underutilized equipment
Meeting with AT&T representatives to validate data outputs and observations
Before and after each step, CH2M HILL staff interviewed Sentilla and AT&T representatives
to assess progress toward goals outlined earlier in this report (see: Assessment Objectives).
The critical process that CH2M HILL staff assessed was the reaction of AT&T to the server
utilization and energy usage made visible via Sentilla reports.
The AT&T data center for this implementation was a development/testing site where a
heterogeneous mix of hardware supports numerous testing efforts for AT&T business units.
The approach rationale was to test Sentilla software in a development environment both to
prove the software capabilities and check for any impacts on the operation of data center
systems. (AT&T production facilities have more “homogenous” hardware and applications.)
Because the data center location is a development environment, a few key observations
need to be noted. There is greater mix of old and new equipment technologies in
development facilities than in production data centers. A higher level of asset changeover in
development environments makes it harder, in some cases, to track and test Sentilla
software capabilities.
The fact that it was a development environment made it difficult to track down the owners
of servers for permission to deploy the Sentilla software. This actually delayed the rollout
and led to the use of a smaller data set of servers. This later proved to be an issue when
attempting to quickly turn around and act on the recommendation reports generated by
the software (discussed in more detail in the results and evaluation sections that follow).
9
PG&E’s Emerging Technologies Program ET11PGE1051
RESULTS
INSTALLATION AND MAPPING The following is overview of the results observed by CH2M HILL personnel who
shadowed Sentilla staff both onsite at the AT&T data center and during multiple
offsite meetings. CH2M HILL staff observed the installation of Sentilla software and
the onsite mapping of and integration of IT equipment with various data sources in
order to calculate energy consumption and utilization percentages.
Initially, Sentilla staff installed the Sentilla software application on one server onsite
at the data center. For security, the system runs onsite on AT&T’s network, ensuring
all data and analysis remains inside AT&T’s network with no outside connections.
A significant effort was made to get security permissions to access every server. This
delayed the project timeline significantly and ultimately led to having to use a
smaller sample data set of servers for the proof of concept trial. This is likely only an
issue at test/development data centers; AT&T representatives indicated that
production data centers have clearer server and account definitions and a single
permission covers a broader swath of hardware.
For software configuration, data centers typically have an asset management
database that defines the server hierarchy. There was some effort involved in
integrating Sentilla software with different systems to get the data. At the
test/development data center, the UPS comes from a building management system
(BMS) system. The “meter level” data comes from a configuration management
database (CMDB) system.
Data Center Impact: There appeared to be minimal impact on data center staff or
systems during implementation for data configuration. A Sentilla programmer can
complete the entire data mapping by herself/himself, provided there is a sufficient
asset management system that defines the assets and a readable hierarchy
is available.
A Sentilla programmer toured the facility and inventoried the floor (all racks and
servers) to verify that what was mapped in the software conformed with the
hardware the Sentilla software was monitoring.
Once mapped, software was configured by Sentilla staff and test reports run. Test
reports consisted of data, exported into Microsoft (MS) Excel, that were converted
into usable graphics and inserted into MS PowerPoint slides, which were used when
briefing AT&T representatives on asset utilization and overall efficiency.
The elapsed time for installation was a matter of months. This was primarily due to
the delays in securing permission from various business owners to map servers
onsite. The time from server installation of Sentilla software to initial data input was
a matter of days. Initially the system was to have been run over a period of 90
days, with outputs reviewed at 30-day intervals to ascertain a behavior of time
trend. However, the delays due to the number of permissions required shortened the
proof of concept phase to a single 30-day period.
10
PG&E’s Emerging Technologies Program ET11PGE1051
OUTPUTS AND ANALYSIS Sentilla representatives met with AT&T representatives to review the outputs and
analysis and to agree on the next steps for following the proof of concept. In
addition, Sentilla and CH2M HILL representatives reviewed both the outputs and
analysis. CH2M HILL secured server identification numbers to validate independently
with AT&T that the servers onsite were the ones that had been monitored and that
the results captured in the Sentilla system reflected the characteristics of those
servers.
Together, CH2M HILL and Sentilla staff reviewed a spreadsheet and calculations,
provided by Sentilla, that reflected an initial 30-day data collection period. A
presentation between AT&T and Sentilla representatives contained the same
information proof of concept results presented included:
149 systems were monitored
16 idle servers were found
56 servers had <10-percent utilization
Total of 72 servers were idle/underutilized
Estimates by Sentilla for taking servers offline:
11,572 estimated kilowatt-hours saved for the June 16 to July 13 period
$25,000 estimated energy savings per year
$18,000 estimated savings in basic server contract costs per year
An additional 60-day monitoring period confirmed that the servers were, indeed,
idle/underutilized and a decommission recommendation report was prepared by
Sentilla staff for AT&T representatives.
OUTPUT VALIDATION CH2M HILL, Sentilla Software, PG&E, and AT&T staff met and reviewed the analysis
and discussed next steps. Discussion focused on the 16 idle servers. AT&T staff
confirmed that they had seen 16 idle servers and had escalated this to the Windows
asset team (the representing staff was non-Windows).The ensuing conversation
outlined the challenges to simply “pulling the plug” on the idle servers, including:
Process and authorization steps do not exist to turn off idle servers
Separation of teams by server type complicated decision-making authority
Servers were leased and a separate lease team needed to be involved, which
further diluted decision-making authority
There was concern that the 30-day testing window was too short to secure a
data sample for a test/development environment (this would not be an issue
for production centers)
AT&T managers agreed that a decommissioning process needed to be defined quickly
and that there were clear cost and energy savings that would interest AT&T
Operations and Green Teams.
11
PG&E’s Emerging Technologies Program ET11PGE1051
Overall, AT&T managers were impressed with the visibility provided by the Sentilla
software, and deemed it a huge step forward from the current process. The current
process involves teams manually roving data centers to search for underutilized
assets. Making underutilized assets more “visible” would be a massive advantage.
AT&T managers reached agreement to deploy Sentilla software at a full production
data center.
Communication efforts to follow up with AT&T on the decommissioning of the 16 idle
servers were not completed satisfactorily because the period for this technology
assessment expired before an action could be reported to CH2M HILL reviewers.
EVALUATIONS
SOFTWARE IMPLEMENTATION AND OUTPUTS Relatively speaking, the Sentilla software appears to be a nonintrusive, easy-to-
install application that quickly delivers visibility across data center IT equipment. The
actual recommendation reporting on underutilized and idle servers appears to be a
semi-manual process. The software, as observed by CH2M HILL staff, did not
produce a report that says “x, y, and z servers are zombies.” The software did
provide the end user with real-time energy and usage charts that could be
downloaded and combined with engineering and data center experience to determine
which servers could be shut down (for example, identifying servers with constant
power draws over period of time as an indicator that there is little to no
server activity).
Aside from server access, the software requires little effort from data center staff to
implement. The significant effort observed to get security/server access is likely due
to the fact the data center is a test/development environment. AT&T staff assumed
that production data centers would have clearer ownership, which would make it
easier to identify business owners and quickly gain permission to deploy.
Ongoing maintenance to update the Sentilla software could take the form of using a
“batch” process wherein utilization reports would periodically aid data center staff to
clean out underutilized servers. Another option would be to have the Sentilla
software continuously synchronize with the CMDB, which would enable the system to
remain current without input from an operator. The Sentilla software automatically
alerts data center staff when a new device appears in the CMDB.
Software updates can be made remotely or onsite, depending on the level of access
available to Sentilla staff. For an AT&T data center, server and data sensitivity might
preclude access over the internet and, in that case, an administrator would most
likely need to be at the data center to view or run reports and updates.
12
PG&E’s Emerging Technologies Program ET11PGE1051
OBSERVED SAVINGS AND BENEFITS
QUALITATIVE AT&T reported that having an “actionable” report of underutilized servers would
provide a map for energy savings. Without this type of report, technicians would
need to hunt and peck for servers to unplug, and even then, would be unsure of
whether a server was critical or in use. That approach would not be helpful in gaining
traction for the effort to unplug unused servers.
The ease with which Sentilla software identifies underutilized servers would create
the momentum for data centers and business mangers to develop processes,
objectives, and metrics to consolidate servers, reduce energy usage, and free capital
earmarked for additional data centers and IT equipment. Actionable reports would be
catalysts for organizational change. Data center managers would find it difficult to
justify why facilities still have idle and underutilized servers in the face of a report
that says, in very basic terms, “unplug these servers.”
QUANTITATIVE Business case simulations (Appendix A, Sheet A) reveal the types of potential
benefits that AT&T might realize by monitoring at the CPU level, using a friendly
reporting interface to identify idle or underutilized equipment:
When removing a server, from an energy standpoint, there is a two-fold
impact: it reduces baseline energy usage by more watts than the server uses
due to PUE and distribution losses.
Each server has associated annual license, maintenance, and software fees
that are accounted for in the SLA to cover “run costs.”
Understanding utilization rates across processors in real-time, 24 hours a day,
encourages consolidation, which increases the density of IT equipment by
using many different consolidation techniques (VMware, cloud, and so forth).
However, the first step is to visualize the opportunity. Ongoing monitoring
encourages moving away from “server sprawl” to creating dense, efficient
data centers.
The ability to compare a “healthy” server with equipment with signs of
breaking down leads to more effective scheduling of maintenance and cost-
saving warranty work.
Dense, efficient data centers free up capital normally reserved to pay for
more and more IT equipment and data centers. What started as an efficiency
project actually has the potential to become a major capital savings initiative
for companies who spend a lot on IT services.
ALIGNMENT WITH INITIAL EXPECTATIONS A traditional technology assessment is generally a head-to-head comparison of some
type of new technology to legacy devices. However, this technology assessment
13
PG&E’s Emerging Technologies Program ET11PGE1051
differs in that the software reviewed provides visibility into the operational
performance of equipment in much the same way as metering devices or other
energy auditing assessment tools, which, in turn, lead to energy conservation
measures (ECMs) recommendations. In that scenario, an evaluation of the actual
performance of the software against the initial expectations of end users is critical.
Did the software perform as expected?
SENTILLA POST-INSTALLATION REVIEW
CH2M HILL team members met and debriefed with Sentilla representatives on the
lessons and observations of the installation at the AT&T data center. Below are key
findings from these meetings.
Getting access to the servers took longer than originally anticipated (delays
from original project plan). The consensus was that it was a little more
difficult than usual, potentially due to the testing and development nature of
the target data center.
Sentilla Software representatives wanted to incorporate more servers into
monitoring, but were nonetheless pleased to find the idle or
underutilized servers.
Based on the Sentilla Software recommendation report to the AT&T
decommissioning team, Sentilla technicians were confident that the server
information and data provided to the decommission team was valid.
AT&T representatives suggested that the proof of concept results may
actually be three times higher in a production environment than in the testing
facility used for this assessment.
Sentilla staff were excited that the software passed the proof of concept and
are beginning work to roll out software to AT&T production data centers.
Overall, there were positive sentiments regarding the fact that Sentilla’s
software met the goals of identifying equipment that could be consolidated or
taken offline and demonstrating potential savings for AT&T.
AT&T POST-INSTILLATION REVIEW
These key points emerged during post-installation meetings between CH2M HILL
team members and AT&T management:
AT&T has faith that Sentilla software deploys in a nonintrusive way and
delivers valuable visibility into idle or underutilized assets.
Sentilla software output motivated AT&T managers to take immediate action
to look at decommissioning or consolidating underutilized assets.
AT&T managers gained insight into internal barriers to discovering idle
servers, but found that internal organization and processes initially
hampered action.
AT&T managers moved immediately to deploy Sentilla software into
production sites beyond the proof of concept and to develop a concrete
business case for wider adoption.
14
PG&E’s Emerging Technologies Program ET11PGE1051
Both software supplier and data center operators were pleased with the initial
investment, installation, outputs, and findings from the implementation of Sentilla
software. Accepting the proof of concept and rolling the software out to full
production sites was the ultimate positive outcome of this technology assessment.
RECOMMENDATIONS
“DENSE” DATA CENTERS WITH HIGH ASSET UTILIZATION The Sentilla software was easy to deploy, nonintrusive, and produced data outputs
that converted into easily understood reports regarding idle and underutilized
servers. With other software suppliers entering this marketplace, companies will
soon have multiple options to cost-effectively and quickly monitor server utilization.
This new generation of software will be largely useful in three ways:
1. It can be used to build a business case for mobilizing the process and
governance to consolidate or decommission servers based on potential cost
savings. Business and IT managers working together, with clean metrics, can
reap solid savings on hardware costs, license fees, and energy costs by using
this software to manage results and document cost and energy savings.
Savings may be realized within months of deployment, depending on the
organizational ability to quickly assess and act on opportunities.
2. The ongoing monitoring of IT equipment performance for preventative
maintenance and efficiency may encourage the development of contractual
agreements with hardware suppliers to guarantee performance. Monitoring IT
components for underperformance on efficiency specifications will enable
quick servicing or replacement of equipment based on contractual service
levels. Over time, this will hold suppliers accountable for efficiency claims and
ultimately conserve additional energy and run costs.
3. Finally, consolidating racks and servers will save space and allow for a greater
density of efficiently utilized and maintained servers. Given the typical
computing needs of today’s companies, the need for less space will translate
into fewer data centers, which will lower costs and free up capital for other
uses, for example, to upgrade servers to more efficient models.
All three of these processes are dependent upon a trend line that demonstrates, over
time, the actual versus expected performance and utilization of IT assets. That
information by itself will not generate cost savings. To realize the cost savings
benefits, organizations must interpret and act on the data by establishing clear
processes and governance. Thus, the software is the means to end; it is critical for
the ongoing measurement and verification of any efficiency program.
MARKET ADOPTION Overall, there is a strong business case (see Appendix A) for data center managers
to adopt software that can identify idle and underutilized equipment in order to
consolidate or decommission it. The payback is quick and tangible. Beyond the
15
PG&E’s Emerging Technologies Program ET11PGE1051
financial benefits, there is currently a market and government focus on the high
amount of energy consumption in data centers. There is call for greater efficiency
and accountability in these facilities. Further, “watch dog” organizations are publicly
pushing for greater transparency regarding the energy and carbon footprints of
corporate data centers and calling for greater investment in renewable energy. Even
if data centers shift to renewable energy, poor server utilization rates would imply
that the energy is, in essence, being wasted. Renewable energy used to power idle
or underutilized IT assets would be drawn from a limited resource pool and would no
longer be available to offset the use of nonrenewable energy, thereby creating
unnecessary energy production. When social, political, and economic forces start to
align, it is only a matter of time before server utilization joins PUE as a key
benchmark. This makes a strong case for further expansion and adoption of DCIM
solutions that monitor IT systems.
PG&E INCENTIVE Even though DCIM-specific software to monitor IT systems is an emerging
technology, it does provide the data required to identify energy conservation
measures in much the same way that a facility energy audit identifies conservation
measures and payback scenarios. An organization still must take measures to
achieve energy savings and there is generally an incentive for embracing the
technology that replaces older, less efficient assets. Incentives entice organizations
to act on new technology and, in some cases, help adjust the ROI to fit payback
periods within capital expenditure guidelines and jump-start adoption.
It is unlikely that there will be an incentive for software that does not deliver energy
conservation directly. A more traditional approach would be to incentivize and speed
the replacement of older, less efficient servers while consolidating and increasing the
density and utilization of servers in data centers. In that case, an incentive could be
awarded for each idle server decommissioned (Appendix A, Sheet B). As a pre-
requisite, software could be deployed to identify idle or underutilized servers within a
given boundary of racks and to monitor and verify that server population utilization
rates increase.
These types of incentives mirror the more typical asset-for-asset replacement-type
incentives normally deployed by PG&E. Additionally the software performs an audit-
like function by defining a boundary and providing auditable “before” and “after”
snapshots. The incentive achieves several objectives by rewarding:
Investment in visibility software
Decommissioning of servers
Reporting and managing the utilization rate as a metric
The recommendation is to publicize and reward active management of idle and
underutilized servers. There is no need for an incentive to improve the business case
to turn off or consolidate servers. Operations and maintenance and capital savings
alone would entice any business manager with visibility data on a large number of
idle or underutilized servers.
16
PG&E’s Emerging Technologies Program ET11PGE1051
APPENDICES
APPENDIX A – BUSINESS CASE AND INCENTIVE CALCULATOR
Server Consolidation Simulation PGE 1-5-12 D.xlsx
BUSINESS CASE SIMULATION (SHEET 1)
Sheet A contains a business case simulator allowing an organization to input
variables to simulate benefits such as energy and operational savings as well net
present value and internal rate of return. Input variables include:
Name of Data Center
Location
Contact
Phone Number
Existing Data Center Conditions
Number of Energized Racks in Data Center
Average Servers per Rack
Average Peak Power Draw per Active Server (watts)
Typical Server Load Factor
Average Idle Server Power Draw as a % of an Active Server
Marginal Power Usage Effectiveness (PUE)
Average Electric Rate ($ per kilowatt-hour [kWh])
Marginal Annual O&M per Server ($)
Capital Cost per Server in Data Center ($)
Cost of Capital
Changes from Implementation of Recommendations
Reduction in Energized Server Requirements (%)
Percent of Surplus Servers Decommissioned
Year that Capital Savings Are Realized
Monetization of Increased Risk, if Any (Annual Cost)
Monetization of Reduced Risk, if Any (Annual Benefit)
Annual Software License Fee ($)
Incentives by PG&E
17
PG&E’s Emerging Technologies Program ET11PGE1051
Payment per Annual kWh Saved
Payment per kilowatt (kW) of Reduced Demand
Implementation Costs ($)
Life of Software License (input 1 to 10 years)
INCENTIVE CALCULATION (SHEET 2)
Incentive calculation can be determined by setting an incentive value in the business
case simulation (Line 28, Column D) for payment per annual kWh saved and/or
payment per kilowatt of reduced demand. The incentive calculator in Sheet 2 will
compute an incentive, depending on the scenario presented in the simulation, to
provide an amount per server.
18
PG&E’s Emerging Technologies Program ET11PGE1051
APPENDIX B – DCIM VENDORS
DCIM VENDORS IDENTIFIED:
1. Nlyte http://www.nlyte.com/
2. Sentilla http://www.sentilla.com/
3. Viridity http://www.viridity.com/
4. Power Assure http://www.powerassure.com/products/data-center-
optimization-products
5. SynapSense http://www.synapsense.com/go/index.cfm
6. 1E http://www.1e.com/
7. Miserware http://www.miserware.com/company/
8. EnergyWare http://www.energyware.com/
9. Modius, http://www.modius.com/
10. OSIsoft
http://www.osisoft.com/value/industry/Datacenter_IT___Telecom.aspx
11. Aperature (Emerson)
http://emersonnetworkpower.com/enUS/Brands/Aperture/Pages/default.aspx
12. Fieldview Solutions http://www.fieldviewsolutions.com/
VIRIDITY: (HTTP://WWW.VIRIDITY.COM)
Products/Solution Brands - Energy Center, Energy Check
Information at the component level (device, rack, row, and at the data center levels)
for a comprehensive understanding of how a device is configured and how much
power the device is using in total.
Unobtrusive software-only approach.
Integrates with VMware‘s vSphere software, enabling administrators to examine the
utilization and power use of virtual machines.
Ability for users to group IT assets by owner or by the application they support. For
example, a user can group and examine all servers that support a transaction-
processing application; this might help the user facilitate an internal chargeback for
IT services to the appropriate line of business.
Datacenter Genome Database (GxDB), the repository of IT equipment power and
utilization information that Viridity uses to estimate IT power draw from measured
utilization values.
Software-only, it can have a lower price point and faster deployment time than
systems that come bundled with sensors.
19
PG&E’s Emerging Technologies Program ET11PGE1051
DCIM solution:
Asset Tracking ………………... No (Facility Systems), Yes (IT Systems)
Real Time Data Collection…… Yes
Historical Data Collection…….. Yes
Performance Visualization…… Yes
Capacity Planning…………….. No
Reporting ………………………. Yes
Single Data Repository……….. No
Holistic Approach……………… No
NLYTE: (HTTP://WWW.NLYTE.COM)
Products/Solution Brands - nlyte 6.0
Nlyte Process: Discover, Visualize, Model effects of change, Control, Report, Predict
Uses sophisticated automation to optimize the physical capacities of power, cooling,
and space while improving IT service delivery
Good visualization tools – dashboard, data center room layout cabinet loading
PUE calculations
Does not appear to have temperature, airflow, and pressure monitoring
DCIM solution:
Asset Tracking ………………... No (Facility Systems), Yes (IT Systems)
Real Time Data Collection…… Yes
Historical Data Collection…….. Yes
Performance Visualization…… Yes
Capacity Planning…………….. Yes (IT systems)
Reporting ………………………. Yes
Single Data Repository ……….. No
Holistic Approach……………… No
SYNAPSENSE: (HTTP://WWW.SYNAPSENSE.COM/GO/INDEX.CFM)
Products/Solution Brands – Environmental Monitoring, Real-Time PUE,
SynapSense Active Control, Assessment & Optimization Consulting Services,
Power Suite, LiveImaging
Wireless real-time and historical monitoring of data center temperature and pressure
Graphical visualization of data, “LiveImaging”
Wireless P3 Smart Plug and SmartLink sensors for measuring average and
peak power
20
PG&E’s Emerging Technologies Program ET11PGE1051
Branch Circuit Monitoring (BCM) supporting up to 42 panel board
Overall, appears to be focused on facility systems and not IT systems or
network systems
Not apparent that their solutions are integrated into a single, holistic approach
DCIM solution:
Asset Tracking ………………... Yes (Facility Systems), No (IT Systems)
Real Time Data Collection…… Yes
Historical Data Collection…….. Yes
Performance Visualization…… Yes
Capacity Planning…………….. Yes, (Facility Systems), No (IT Systems)
Reporting ………………………. Yes
Single Data Repository ………. No
Holistic Approach………………. No
SENTILLA: (HTTP://WWW.SENTILLA.COM)
Products/Solution Brands – Sentilla Integration Software delivered to customers
through both Software-as-a-Services (SaaS) on demand as well as on premise with a
physical or virtual appliance.
Sentilla integrates information from IT equipment and facilities, providing a
comprehensive, single view of the entire data center’s profile. Monitored data types
include system utilization and availability, network traffic, energy consumption,
temperature, and more.
Monitors utilization efficiency of your IT infrastructure. Sentilla tracks IT utilization,
costs, and efficiency of systems and applications in one place.
Overall, seems to be dependent on existing system monitoring systems to be able to
access information. Holistic solution if coupled with complete monitoring of
facility systems.
DCIM solution:
Asset Tracking ………………... Yes (Facility Systems), Yes (IT Systems)
Real Time Data Collection…… Yes (depending on existing
instrumentation)
Historical Data Collection…….. Yes
Performance Visualization…… Yes
Capacity Planning…………….. Yes, (Facility Systems), Yes (IT Systems)
Reporting ………………………. Yes
Single Data Repository ……….. Yes
Holistic Approach……………… Yes
21
PG&E’s Emerging Technologies Program ET11PGE1051
REFERENCES Andy Hawkins (2011), Determining the Implications of Unused Servers and How They
Can Be Addressed. Presentation: The Green Grid Technical Forum 2011.
Michael Chui, Markus Löffler, and Roger Roberts (2010), The Internet of Things.
McKinsey Quarterly 2010, Number 2.
Mark Blackburn (2010), Unused Servers Survey Results Analysis. Rev 2010-0, The
Green Grid.
Mark Aggar (June 2011), The IT Energy Efficiency Imperative.
www.microsoft.com/environment : Microsoft Corporation.
Jacques Bughin, Michael Chui, and James Manyika (August 2010), Clouds, Big Data, and
Smart Assets: Ten Tech-enabled Business Trends to Watch. McKinsey Quarterly.
Jonathan G. Koomey, Ph.D. (August 1, 2011), Growth in Data Center Electricity Use
2005 to 2010. www.analyticspress.com/datacenters.html : Analytics Press.