33
1 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved. Isilon Performance Name

Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

Embed Size (px)

Citation preview

Page 1: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

1 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Isilon Performance Name

Page 2: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

2 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Agenda

• Architecture Overview

• Next Generation Hardware

• Performance Caching

• Performance Streaming Reads

• Performance Tuning

Page 3: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

3 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

OneFS Architecture Overview

Page 4: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

4 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

OneFS Network Architecture

Isilon IQ Storage

Layer

IB for Intracluster

Communication

Servers

Client/Application Layer Standard 1GbE / 10GbE layer

Servers

Servers

Page 5: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

5 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Isilon Scale-Out NAS Product Family perf

orm

ance

capacity

Near-Linear Scaling of Performance and Capacity

NL400

Purpose-built for cost-effective, high capacity storage

X200

A slimmer solution cost-optimized for performance

X400

A flexible solution to accelerate high-concurrent and sequential throughput

workloads

S200

Purpose-built for transactional & IOPS-intensive workloads

Page 6: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

6 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Storage Workloads perf

orm

ance

capacity

NL400

Purpose-built for cost-effective, high capacity storage

X400

A flexible solution to accelerate high-concurrent and sequential throughput

workloads

X200

A slimmer solution cost-optimized for performance

S200

Purpose-built for transactional & IOPS-intensive workloads

Commercial HPC

Scientific HPC

Electronic Design

Automation

Hadoop / Analytics

Media & Entertainment

Medical Imaging

Active Archive

Home Directories

Cold Archive

Page 7: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

7 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Next Generation Hardware

Page 8: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

8 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

S210 – Next Generation S Series

Specifications

• CPU: dual, 6-core Ivy Bridge Processors

• RAM: 32GB to 256GB

• Drives: 24 X 2.5-inch bays

• HDD: 2.5-inch SAS, 300GB-1.2TB each

• SSD: up to 6 SSDs, 200GB-800GB each

• Self-encrypted options available

• Front-end I/O: 2x1GbE + 2x10GbE

• Back-end I/O: QDR Infiniband, 1m-100m

cabling

• Chassis

• Standard 2U enclosure

• Dual redundant, hot swappable PSUs

Page 9: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

9 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Specifications

• CPU: dual, 8-Core Ivy Bridge Processors

• RAM: 32GB to 256GB

• Drives: 36 X 3.5-inch bays

• HDD: 3.5-inch SATA, 1/2/3/4TB each

• SSD: up to 6 SSDs, 400GB/800GB each

• Self-encrypted options available

• Front-end I/O: 2x1GbE + 2x10GbE

• Back-end I/O: QDR Infiniband, 1m-100m

cabling

• Chassis

• Standard 4U enclosure

• Dual redundant, hot swappable PSUs

(high line only)

X410 – Next Generation X Series

Page 10: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

10 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

PERFORMANCE CACHING

Page 11: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

11 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Flash Uses

Performance

Cost

GNA

Global Name Acceleration adds flash to existing

clusters/nodes that otherwise wouldn’t have access

Metadata Read

Stores some metadata on flash for preferred access

Metadata Write

Stores all metadata on flash for faster writes and reads

Data on Flash File Data is stored on flash.

Combined with either Metadata Read or Metadata

Write polices

L3 (new in OneFS 7.1.1)

Intelligently chooses data and meta data to cache on flash.

Page 12: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

12 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Caching

• OneFS uses extra RAM to cache frequently used file data and internal structures

• Blocks are cached at multiple layers – L1 – RAM

– L2 - RAM

– L3 - SSD - New in 7.1.1

The importance of caching on OneFS

Page 13: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

13 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Cache Layout • Most node RAM

dedicated to read cache

• L1 read cache – Cached File Data

– Local to node

– Removes internode latency, ~100 µs

• L2 read cache – Cached Block Data from

local drives

– Accessible by other nodes

– Removes disk latency, ~5-7 ms (SATA)

Node Memory

L1

L2

Services

Protocols

Filesystem State

Page 14: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

14 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Cache Layout with L3 Node Memory

L1

L2

Services

Protocols

Filesystem State

L3

L3 Cache

– SSDs

– Stores active

• Metadata

• Data Blocks

Node SSDs

Page 15: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

15 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

L3 Cache: Functional Overview

L3 cache on SSDs, not in RAM – Think of it as an extension of L2 cache (just not in RAM)

Durable – Unlike L2, no cache loss on reboot

Reduce expensive random read I/O from HDDs

Caches both metadata and user data, unlike other metadata polices

Page 16: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

16 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

L3 Cache: Performance

SpecSFS – Op/s equivalent to metadata read acceleration

– Metadata-write SSD strategy still provides best performance

10-20x latency improvements for repeated random read workflows – EDA/build, VMDK …

Improves many workflows – The primary exception are those where Metadata Write is

required today

Page 17: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

17 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

L3 Benefits

• Greatly simplify sizing

• Allows nodes to be sold with fewer SSDs – 2% rule for SSD capacity is not required when using L3

• Less degradation if your metadata exceeds your SSD capacity

• 7.1.1 will default to L3

Page 18: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

18 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Performance Characteristics

Page 19: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

19 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Streaming Read

• OneFS spreads out a single file across multiple disks – Streaming performance isn’t limited to a single disk

• A single large read (>128KB) can be serviced by multiple disks at once

• Prefetch - OneFS issues read requests to drives ahead of the client requests to hide latency

Page 20: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

20 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Streaming read Visual Example

Key

In L2

In L1 Read

On Disk

Initial Read First Read Second Read

Page 21: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

21 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Streaming Read changes in 7.1.1

• Prefetch can be offloaded to a separate thread

• Up to 64MB of data from disk to L2

• Up to 8MB of data into L1

• Previous versions are less aggressive and less efficient

Prefetch Improvements

Page 22: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

22 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Beginning Of SMB 3.0 - MultiChannel

7.1.1 brings the first SMB 3.0 feature to OneFS

SMB 3.0 client can establish multiple TCP connections to file servers – High performance with bandwidth

aggregation

– Fault tolerant with multiple network paths

3 configurations can benefit: – Single RSS-capable NIC

– Multiple NICs

– Teamed NICs (LACP)

SMB 3.0 Client

Isilon Cluster

Page 23: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

23 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

SMB 3.0 Multi-Channel

• Requirements – Windows 2012/Windows 8 or later

– OneFS 7.1.1 or later

• Limited to a single node

• Interfaces must be the same type

• Still requires the client and application to be smart about requesting and processing data

Page 24: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

24 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Biggest Performance Increase Is Using Multiple NICs w/MultiChannel

Network

NIC

RSS

NIC RSS

NIC

RSS

NIC RSS

Network

NIC

NIC

NIC

NIC

SMB Client 1

SMB Server 1

SMB Client 2

SMB Server 2

NIC

RSS

NIC

RSS

NIC

RSS

NIC

RSS

Network

NIC

NIC

NIC

NIC

SMB Client 1

SMB Server 1

SMB Client 2

SMB Server 2

1 session, without Multichannel • No automatic failover • Can’t use full bandwidth

• Only one NIC engaged • Only one CPU core engaged

1 session, with Multichannel • Automatic NIC failover • Combined NIC bandwidth

• Multiple NICs engaged • Multiple CPU cores engaged

Network

Page 25: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

25 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Peak Aggregate Throughput

0

1000

2000

3000

4000

5000

6000

7000

8000

x400 x410

Total MiB/s

7.0.2.3

7.1.0.1

7.1.1.rc

SMB2 Read, 5-node clusters

Page 26: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

26 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Performance Tuning

Page 27: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

27 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

OneFS Performance Tuning

• Tuning should be done via File Pool Policies

– Easy to configure and verify

– Access to some tuning parameters

• Can be done in the Web Interface File System Explorer

– Doesn’t require SmartPools license

– Not easy to verify configuration

– Access to some tuning parameters

• Can be done in the CLI with the isi set command

– Doesn’t require SmartPools license

– Not easy to verify configuration

– Access to all tuning parameters

Page 28: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

28 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

File Policies Choosing The Correct SSD Strategy

SSD Usage Use case

L3 Cache (Default in 7.1.1)

Use for all except Write centric workflows

Metadata Read Acceleration

Read centric, low SSD:HDD ratio

Metadata Write Acceleration

Write centric, moderate SSD:HDD ratio

Data On SSD Performance critical, small data set, high SSD:HDD ratio

Avoid SSD Archived, cold data

Page 29: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

29 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

File Policies Choosing the Correct Access Policy

Policy Prefetch Use Case

Optimize for Random Access Disabled Virtual Machines

Optimize for concurrent Access Minimum Home directories

Optimize for streaming Access Maximum Large files

SmartCache Use Case

On Random

Off Streaming

Page 30: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

30 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Spec Total Ops/s Home Directory Mix, Per-Node

0

5000

10000

15000

20000

25000

7.0.2.3 7.1.0.1 7.1.1.rc

Total Ops/sec

x400

x410

Page 31: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

31 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Takeaways

Page 32: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream

32 © Copyright 2014 EMC Corporation. All rights reserved. © Copyright 2014 EMC Corporation. All rights reserved.

Get excited about 7.1.1 & Performance

• Customers with lots of namespace operations

– Home directories, EDA, etc

– X400 nodes get a ~10% improvement in SpecSFS tests

– X410 nodes get 30-60% more performance than X400 nodes

• Aggregate read performance is 15-30% better

• Applications that need great single-stream performance with Win8/Windows2012 – up to 1.4GB/s

• This software update will allow customers to unlock performance on nodes they already own!

Where We Really Deliver With This Release

Page 33: Isilon Performance - Dell EMC · –X410 nodes get 30-60% more performance than X400 nodes •Aggregate read performance is 15-30% better •Applications that need great single-stream