Copyright © 2010 Platform Computing Corporation. All Rights Reserved. TORONTO 11/22/2011 More Science, Less Computer Science Jeff Yamamoto, Senior Alliance Manager, Asia Pacific Platform Computing SC 11

Page 1: More Science, Less Computer Science

Copyright © 2010 Platform Computing Corporation. All Rights Reserved. TORONTO 11/22/2011

More Science, Less Computer Science

Jeff Yamamoto, Senior Alliance Manager, Asia Pacific
Platform Computing

SC 11

Page 2: More Science, Less Computer Science

Agenda

• Overview – Platform Computing & Fujitsu Partnership
• PCM Fujitsu Edition Overview
  o Basic Package
  o Enterprise Package
• Success Stories

Page 3: More Science, Less Computer Science

Fujitsu/Platform Computing Strengths

Fujitsu’s global HPC expertise with Platform’s 19 years of leadership in cluster, grid and cloud management software

Fujitsu HPC Expertise

Fujitsu HPC Software Stack
・system design and configuration tool kit
・operational monitoring tool kit
・system function/performance test suite tool kit

Platform Computing HPC technology
・parallel execution environment
・MPI
・cluster monitoring tool
・cluster management tool
・GUI tool
・job scheduler

For easy set up and management of high quality x86 cluster

Copyright 2010 FUJITSU LIMITED

Page 4: More Science, Less Computer Science

Fujitsu’s x86 HPC Cluster Eco-system

Fujitsu x86 HPC ECO system
• Consulting
• Operation Support Services: System Management Support Services, Trouble Shooting Support Services
• Application Layer from ISV’s: Oil and Gas, Bio, CAE, Other Apps Areas
• FUJITSU Middleware Stack: Fujitsu Kit, Platform Computing HPC Middleware, Others
• OS
• PRIMERGY HW: RX200, BX900, CX1000

Fully validated x86 HPC Cluster Stack by Fujitsu/Platform Computing

Page 5: More Science, Less Computer Science

Typical Cluster Management Solutions

Require customers to…
• Assemble disparate software components
  o Open source components
  o Multiple commercial components
• Certify each component individually
• Get support from multiple vendors

However, many customers…
• Lack Linux® OS commercial app support
• Experience different quality levels for each component

Page 6: More Science, Less Computer Science

PCM Fujitsu Edition - Basic Package

Cluster Provisioning
• Install certified clusters quickly and easily
• Multiple provisioning modes, repository snapshots

Cluster Management
• Proactive alerting, configurable exception conditions
• Graphing & reporting, network analysis
• Integrating a new workload management solution
• Fresh new GUI (new dashboard/rackview/host monitoring)
• CUDA 4.0 kit
• ICR 3.3

Pre-tested, Certified, Supported
• Qualified with Fujitsu on a range of servers, storage subsystems & interconnect technologies
• Supported by a premier HPC support organization

Page 7: More Science, Less Computer Science

Unified, Web-Based Interface

• Easy
  – Easy to make your cluster production-ready
  – Easy to manage your cluster
  – Easy to submit & manage jobs
• Simplified
  – Simplified software deployment
  – Simplified application integration
• Powerful
  – Powerful, yet easy to use unified interface
  – Powerful security policies

Cluster Provisioning & Management

Job Submission & Management

Page 8: More Science, Less Computer Science

PCM Fujitsu Edition - Enterprise Package

The easiest and most complete HPC cluster solution

Complete Product
• Robust and feature-rich workload management
• Heterogeneous cluster management
• Next generation, integrated web-based interface
• OS Multi boot

Integrated
• Easy to use job submission portal
• Customizable, self-documenting app templates
• High performing MPI libraries

Certified
• Certified with server, storage & interconnect vendors
• Supported by the world leader in HPC management solutions

Page 9: More Science, Less Computer Science

PCM Fujitsu Edition - Enterprise Package – Complete Product

Components shown in the stack diagram:
• Unified Web-Based Interface
• App Integration: Ansys Fluent, MSC NASTRAN, Blast, LS-DYNA, Home-Grown App
• GPU Scheduling
• OS Multi-Boot
• Workload Management
• MPI Library
• Monitoring & Reporting
• Cluster Management
• OS (multiple instances)
• HPC Services

Page 10: More Science, Less Computer Science

Effective Cluster Management

Comprehensive cluster management interface for provisioning your HPC cluster

• Organized, easy to use objects tree
• Object list within the selected group
• Detail of the selected object within the group

Page 11: More Science, Less Computer Science

Effective Cluster Management

Comprehensive cluster management interface for managing your HPC cluster

• Alert summary linked to the alerts page
• Monitoring section allowing selections of multiple metrics
• Enhanced rack view with support of blade centers
• Custom actions can be added

Page 12: More Science, Less Computer Science

Effective Cluster Management

Comprehensive cluster management interface for monitoring your HPC cluster

• External load metrics can be used for alerts
• Alert summary
• Last update time etc.

Page 13: More Science, Less Computer Science

Effective Cluster Management

• Dynamically changes the OS based on workload
  – Linux & Windows operating system multi-boot
  – Based on resource policies to simplify management
  – Dynamic; transparent to end users
  – Higher cluster utilization for lower TCO
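
The idea of switching a node's OS according to workload can be illustrated with a minimal Python sketch. This is not Platform's implementation: the function name `plan_reboots`, the node and job data layout, and the rule of rebooting only idle nodes to cover unmet demand are all assumptions made for illustration.

```python
from collections import Counter

def plan_reboots(nodes, pending_jobs):
    """Return {node_name: target_os} for idle nodes that should switch OS.

    nodes: list of dicts such as {"name": "n01", "os": "linux", "busy": False}
    pending_jobs: list of required OS names, one entry per single-node job
    Hypothetical policy: idle nodes already on the right OS absorb demand
    first; remaining idle nodes are rebooted to cover what is still unmet.
    """
    demand = Counter(pending_jobs)
    idle = [n for n in nodes if not n["busy"]]

    # Idle nodes that already run a needed OS take a job without rebooting.
    spare = []
    for node in idle:
        if demand[node["os"]] > 0:
            demand[node["os"]] -= 1
        else:
            spare.append(node)

    # Leftover idle nodes are switched to an OS that still has pending jobs.
    plan = {}
    for node in spare:
        target = next(
            (name for name, count in sorted(demand.items()) if count > 0),
            None,
        )
        if target is None:
            break  # all pending demand is covered
        plan[node["name"]] = target
        demand[target] -= 1
    return plan
```

Busy nodes are never touched in this sketch, which is one way to keep the switch transparent to users whose jobs are already running.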

Page 14: More Science, Less Computer Science

GPUs: Schedule, Monitor & Manage

Take immediate advantage of the exceptional HPC performance provided by GPUs

• DEPLOY: Quickly deploy workload to GPU resources
  – Submit jobs to resources with GPUs & CPUs within the same cluster
  – Remotely manage & view the status of your jobs
• MANAGE: Easily manage heterogeneous clusters
  – Install CUDA across a cluster in a couple of clicks
  – Deploy & manage both CPU & GPU resources
• MONITOR: Monitor resources with GPUs
  – GPU utilization, temperature & status
  – Detect ECC error accumulation
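
As a rough illustration of what detecting ECC error accumulation could mean in practice, the Python sketch below flags a GPU whose cumulative error counter keeps rising across consecutive samples. The function name `ecc_accumulating`, the sampling scheme, and the default thresholds are assumptions for illustration; the slides do not describe how the product performs this check.

```python
def ecc_accumulating(samples, min_increase=1, window=3):
    """Flag a GPU whose ECC error count rose in each of the last `window` intervals.

    samples: chronological list of cumulative ECC error counts for one GPU,
             as collected by some monitoring agent (assumed, not specified).
    """
    if len(samples) < window + 1:
        return False  # not enough history to judge a trend
    recent = samples[-(window + 1):]
    # Per-interval growth of the cumulative counter.
    deltas = [later - earlier for earlier, later in zip(recent, recent[1:])]
    return all(delta >= min_increase for delta in deltas)
```

A steadily growing counter is treated as a warning sign here, while a single isolated jump is not; the window length trades detection speed against false alarms.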

Page 15: More Science, Less Computer Science

3rd Party Management SW Integration

Easily customize Platform HPC based on your organization’s unique requirements
• Monitor status for non-server devices in the HPC environment
• Set up alerts for 3rd party devices
• Customize to interface with 3rd party management software

Diagram: Platform HPC connected to Fabric Mngr, Storage Mngr, and 3rd Party SW

Page 16: More Science, Less Computer Science

Powerful Application Support

Job submission templates
• Easy to use interfaces
• Self-documenting
• Rapid user productivity
• Minimize job submission errors

Run applications without wrapper scripts

• Available Templates:
  – ANSYS Mechanical
  – ANSYS Fluent
  – ClustalW
  – HMMER
  – LS-DYNA
  – MSC Nastran
  – NCBI Blast
  – NWChem
  – Schlumberger ECLIPSE
  – Simulia Abaqus
  – Generic template for in-house / open source apps

Additional templates coming soon!
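
To make the template idea concrete, here is a minimal Python sketch of how a self-documenting job template might validate form input and produce a command line, so the user never writes a wrapper script. Everything in it is hypothetical: the template layout, the field names, the `my_solver` command, and the `render_job` function are illustrative and do not reflect the product's actual template format.

```python
from string import Template

# Hypothetical template: field names, help texts, and command layout are illustrative only.
GENERIC_TEMPLATE = {
    "fields": {
        "cores": {"type": int, "min": 1, "help": "Number of CPU cores"},
        "input_file": {"type": str, "help": "Path to the solver input deck"},
    },
    "command": Template("my_solver --np $cores --input $input_file"),
}

def render_job(template, values):
    """Validate form values against the template and return the command line."""
    checked = {}
    for name, spec in template["fields"].items():
        if name not in values:
            # The help text doubles as documentation for the missing field.
            raise ValueError(f"missing field: {name} ({spec['help']})")
        value = spec["type"](values[name])
        if "min" in spec and value < spec["min"]:
            raise ValueError(f"{name} must be >= {spec['min']}")
        checked[name] = value
    return template["command"].substitute(checked)
```

For example, `render_job(GENERIC_TEMPLATE, {"cores": "8", "input_file": "case.dat"})` yields a ready-to-run command, while a missing or out-of-range field is rejected before anything is submitted, which is the kind of error a template is meant to prevent.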

Page 17: More Science, Less Computer Science

Product Models                             Basic Package     Enterprise Package
Cluster management                         Yes               Yes
Workload management                        Partial           Complete
Workload, system monitoring & reporting    Yes               Yes
Dynamic OS multi-boot                      No                Yes
MPI library                                Open source       Yes
Commercial app integrations & templates    Partial           Yes
Unified web portal                         Yes               Yes
Max. no. of licenses per cluster           Unlimited         Unlimited
Head node fail over                        Yes               Yes
License type                               Floating          Floating
Licensed for LSF add-on modules            Not applicable    Platform MultiCluster;
                                                             Platform Application Center Enterprise;
                                                             Platform Make;
                                                             Cloud bursting & adaptive extensions

Page 18: More Science, Less Computer Science

PCM Fujitsu Edition

Key Benefits

More science, less computer science

Faster time to system readiness

Reduced user training requirements

Easily integrate applications

Improve utilization & workload throughput

Page 19: More Science, Less Computer Science

World Class Engineering & Support

Supported by Fujitsu & Platform, a premier HPC support organization:
• Pre-tested HPC solutions
• Significant investment in certification & validation
• Support “from the source” – developers on staff
• 7 x 24, around the globe
• Deep Linux & HPC competency

“I really don’t know how we would have done it without Platform and the support that we get.” Dr. Athanasoulis, Harvard Medical School

“This was a great support experience for me. One of the best I have had. Even better considering the fact we are in different parts of the world.” Major Asian Energy Corporation

“Thanks a lot. You saved me. All my budget for Platform Support over the past few years has yielded a return as good as gold.” Prestigious US School of Public Health

Page 20: More Science, Less Computer Science

Success Stories

Page 21: More Science, Less Computer Science

HPC Wales (Fujitsu) Strategic Win

Project Overview:
• £34 Million pan-Wales HPC infrastructure shared among 8 universities. (http://www.hpcwire.com/hpcwire/2011-03-23/hpc_wales_taps_fujitsu_to_build_supercomputing_grid.html)
• 1266 HPC nodes spread out over the following 8 sites:
  • Hub Centres: Cardiff, Swansea
  • Tier 1 Sites: Aberystwyth, Bangor, University of Glamorgan
  • Tier 2A Sites: Swansea Metropolitan U.; U. of Wales, Newport; Glyndwr U., Wrexham
• Platform Computing provides the HPC stack delivering capabilities for cluster management/provisioning, workload management, MPI library and dynamic OS multiboot.

Page 22: More Science, Less Computer Science

Thank You!

Email:[email protected]

http://ts.fujitsu.com/products/standard_servers/high_performance_computing/