30
© 2009 IBM Corporation Architectures for Massive Parallel Data Base Clusters providing Linear Scale-Out and Fault Tolerance on Commodity Hardware for OLTP Workloads Lightning Talk: XLDB Workshop 2013 @CERN, 28.05.2013 Romeo Kienzler, IBM Innovation Center Zurich

Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

Architectures for Massive Parallel Data Base Clusters providing Linear Scale-Out and Fault Tolerance on Commodity Hardware for OLTP Workloads

Lightning Talk: XLDB Workshop 2013 @CERN, 28.05.2013Romeo Kienzler, IBM Innovation Center Zurich

Page 2: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

2

Source: If applicable, describe source origin

Shared Disk vs. Shared Nothing

Centralized Locking Distributed Locking

Compute Node Fault Tolerance Partition Replication

Ad-Hoc Load Balancing Data Partitioning, Data Skew

Resource-Starvation on Disk System Linear Scale-Out for Writes

Write-Limited Write-Limited for Distributed Two Phase Commit

Requires Distributed Buffering Effectiveness of Local Buffer Pools

Inherent Data-Shipping support Performance Impact on Data-Shipping

Page 3: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

3

Source: If applicable, describe source origin

Show-Stopper for Shared-Nothing

Partition-Skew for Random Access Patterns

Page 4: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

4

Source: If applicable, describe source origin

BUT

Large-Scale Shared-Disk Systems introduce Bottlenecks

Page 5: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

5

Source: If applicable, describe source origin

IDEA

Cluster File System

Page 6: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

6

Source: If applicable, describe source origin

GPFS Declustered RAID

Page 7: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

7

Source: If applicable, describe source origin

GPFS Declustered RAID

Page 8: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

8

Source: If applicable, describe source origin

GPFS - Example

Page 9: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

9

Source: If applicable, describe source origin

GPFS - Example

Page 10: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

10

Source: If applicable, describe source origin

IDEA

Compute Nodes without Disks

Page 11: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

11

Source: If applicable, describe source origin

Problem: No Data Locality

200K Disks => 60 ms

Page 12: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

12

Source: If applicable, describe source origin

Problem: No Data Locality

-------------------------------

Page 13: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

13

Source: If applicable, describe source origin

IDEA

Point-To-Point Connections

Page 14: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

14

Source: If applicable, describe source origin

Switching Fabric

Page 15: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

15

Source: If applicable, describe source origin

Network Bottleneck Problem Solved

Page 16: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

16

Source: If applicable, describe source origin

IDEA

Centralized Lock Management

Page 17: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

17

Source: If applicable, describe source origin

Centralized Locking

Infiniband

Low Latency Up to 60 Gbit/s RDMA

Source: http://thetechjournal.com

Source: http://www.mellanox.co.jp

Page 18: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

18

Source: If applicable, describe source origin

Centralized Buffer Pool

Page 19: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

19

Source: If applicable, describe source origin

IDEA

Centralized Lock Management

Switching FabricCompute NodesClients

Cluster File System

Centralized Buffer Pool

Page 20: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

20

Source: If applicable, describe source origin

DB2 pureScale – General Concepts

Based on DB2z Parallel Sysplex concept1¹ Shared disk concept

Multiple DB2 worker nodes Single GPFS file system

Centralized buffer pool and lock management

¹For example, Toronto Dominion Bank (TD Bank) has had 100 percent availability of customer information for 10 consecutive years, including two DB2 for z/OS upgrades during that timeframe.

Page 21: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

21

Source: If applicable, describe source origin

DB2 pureScale – Operation Model

Infiniband, RDMA

Infiniband, 10 GBit Ethernet, 8 Gbit/s SAN

Page 22: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

22

Source: If applicable, describe source origin

DB2 pureScale – Fault Tolerance

Active-active concept Clean pages don't need to be recovered -> GPFS reliability Dirty pages are known to the CF

CF locks dirty pages Recovery DB2 instance flushes dirty pages to GPFS

Page 23: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

23

Source: If applicable, describe source origin

DB2 pureScale – Recovery Performance

Page 24: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

24

Source: If applicable, describe source origin

DB2 pureScale - Scale-Out

0123456789

101112

0 5 10 15

Page 25: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

25

Source: If applicable, describe source origin

Summary

● Linear Scale-Out● Fault Tolerance● Commodity Hardware● Support for OLTP Workloads

Page 26: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

26

Source: If applicable, describe source origin

Summary

● Linear Scale-Out● Fault Tolerance● Commodity Hardware● Support for OLTP Workloads

Page 27: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

27

Source: If applicable, describe source origin

Summary

● Linear Scale-Out● Fault Tolerance● Commodity Hardware● Support for OLTP Workloads

Page 28: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

28

Source: If applicable, describe source origin

Summary

● Linear Scale-Out● Fault Tolerance● Commodity Hardware● Support for OLTP Workloads

Page 29: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

29

Source: If applicable, describe source origin

Summary

● Linear Scale-Out● Fault Tolerance● Commodity Hardware● Support for OLTP Workloads

Page 30: Architecturesfor massive parallel data base clustersproviding linear scale out and fault tolerance on commodityhardware for OLTP workloads - XLDB Conference 13 @CERN

© 2009 IBM Corporation

IBM Presentation Template Full Version

30

Source: If applicable, describe source origin

Summary

● Linear Scale-Out● Fault Tolerance● Commodity Hardware● Support for OLTP Workloads