25
Functional Assessment of Erasure Coded Storage Archive Blair Crossman Taylor Sanchez Josh Sackos LA-UR-13- 25967 Computer Systems, Cluster, and Networking Summer Institute

Functional Assessment of Erasure Coded Storage Archive

  • Upload
    burian

  • View
    36

  • Download
    0

Embed Size (px)

DESCRIPTION

LA-UR -13-25967. Functional Assessment of Erasure Coded Storage Archive. Computer Systems, Cluster, and Networking Summer Institute. Blair Crossman Taylor Sanchez Josh Sackos. 1. Presentation Overview. Introduction Caringo Testing Scality Testing Conclusions. 2. Storage Mediums. - PowerPoint PPT Presentation

Citation preview

Page 1: Functional Assessment of Erasure Coded Storage Archive

Functional Assessment of Erasure Coded Storage

Archive

Blair Crossman Taylor Sanchez Josh Sackos

LA-UR-13-25967

Computer Systems, Cluster, and Networking Summer Institute

Page 2: Functional Assessment of Erasure Coded Storage Archive

Presentation Overview

• Introduction

• Caringo Testing

• Scality Testing

• Conclusions

1

Page 3: Functional Assessment of Erasure Coded Storage Archive

Storage Mediums

• Tapeo Priced for capacity not bandwidth

• Solid State Driveso Priced for bandwidth not capacity

• Hard Disko Bandwidth scales with more drives

2

Page 4: Functional Assessment of Erasure Coded Storage Archive

Object Storage: Flexible Containers

• Files are stored in data containers

• Meta data outside of file system

• Key-value pairs

• File system scales with machines

• METADATA EXPLOSIONS!!

3

Page 5: Functional Assessment of Erasure Coded Storage Archive

What is the problem?

• RAID, replication, and tape systems were not designed for exascale computing and storage

• Hard disk capacity continues to grow

• Solution to multiple hard disk failures is needed

4

Page 6: Functional Assessment of Erasure Coded Storage Archive

Erasure Coding : Reduce Rebuild Recalculate

Reduce! Rebuild! Recalculate!

5

Page 7: Functional Assessment of Erasure Coded Storage Archive

Project Description

• Erasure coded object storage file system is a potential replacement for LANL’s tape archive system

• Installed and configured two prototype archives o Scalityo Caringo

• Verified the functionality of systems

6

Page 8: Functional Assessment of Erasure Coded Storage Archive

Functionality Not Performance

Caringoo SuperMicro admin nodeo 1GigE interconnecto 10 IBM System x3755

4 x 1TB HDDo Erasure coding:

o n=3o k=3

Scalityo SuperMicro admin nodeo 1GigE interconnecto 6 HP Proliant (DL160 G6)

4 x 1TB HDDo Erasure coding:

o n=3o k=3

7

Page 9: Functional Assessment of Erasure Coded Storage Archive

Project Testing Requirements

• Datao Ingest : Retrieval : Balance : Rebuild

• Metadatao Accessibility : Customization : Query

• POSIX Gatewayo Read : Write : Delete : Performance overhead

8

Page 10: Functional Assessment of Erasure Coded Storage Archive

How We Broke Data

• Pulled out HDDs (Scality, kill daemon)

• Turned off nodes

• Uploaded files, downloaded files

• Used md5sum to compare originals to downloaded copies

9

Page 11: Functional Assessment of Erasure Coded Storage Archive

Caringo: The automated storage system

• Warewulf/Perceus like diskless (RAM) boot

• Reconfigurable, requires reboot

• DHCP PXE boot provisioned

• Little flexibility or customizability

• http://www.caringo.com

10

Page 12: Functional Assessment of Erasure Coded Storage Archive

No Node Specialization

• Nodes "bid" for tasks• Lowest latency wins• Distributes the work

• Each node performs all tasks• Administrator : Compute : Storage

• Automated Power management• Set a sleep timer• Set an interval to check disks

• Limited Administration Options

11

Page 13: Functional Assessment of Erasure Coded Storage Archive

Caringo Rebuilds Data As It Is Written

• Balances data as writteno Primary Access Nodeo Secondary Access Node

• Automatedo New HDD/Node: auto balancedo New drives format automaticallyo Rebuilds Constantlyo If any node goes down rebuild starts immediatelyo Volumes can go "stale”o 14 Day Limit on unused volumes

12

Page 14: Functional Assessment of Erasure Coded Storage Archive

What’s a POSIX Gateway

• Content File Servero Fully Compliant POSIX objecto Performs system administration taskso Parallel writes

• Was not available for testing

13

Page 15: Functional Assessment of Erasure Coded Storage Archive

“Elastic” Metadata

• Accessible

• Query: key valueso By file size, date, etc.

• Indexing requires “Elastic Search” machine to do indexing o Can be the bottleneck in system

14

Page 16: Functional Assessment of Erasure Coded Storage Archive

Minimum Node Requirements

• Needs a full n + k nodes to:• rebuild• write• balance

• Does not need full n +k to:• read• query metadata• administration

15

Page 17: Functional Assessment of Erasure Coded Storage Archive

Static Disk Install

• Requires disk install

• Static IP addresses

• Optimizations require deeper knowledge

• http://www.scality.com

16

Page 18: Functional Assessment of Erasure Coded Storage Archive

Virtual Ring Resilience• Success until less virtual nodes available than

n+k erasure configuration.• Data stored to ‘ring’ via distributed hash table

17

Page 19: Functional Assessment of Erasure Coded Storage Archive

Manual Rebuilds, But Flexible• Rebuilds on less than required nodes

o Lacks full protection• Populates data back to additional node• New Node/HDD: Manually add node• Data is balanced during:

• Writing• Rebuilding

18

Page 20: Functional Assessment of Erasure Coded Storage Archive

Indexer Sold Separately

• Query all erasure coding metadata per server

• Per item metadata

• User Definable

• Did not test Scality’s ‘Mesa’ indexing service• Extra software

19

Page 21: Functional Assessment of Erasure Coded Storage Archive

Fuse gives 50% Overhead, but scalable

20

Page 22: Functional Assessment of Erasure Coded Storage Archive

On the right path

• Scalityo Static installation, flexible erasure codingo Helpfulo Separate indexero 500MB file limit ('Unlimited' update coming)

• Caringoo Variable installation, strict erasure codingo Good documentationo Indexer includedo 4TB file limit (addressing bits limit)

21

Page 23: Functional Assessment of Erasure Coded Storage Archive

Very Viable

• Some early limitations

• Changes needed on both products

• Scality seems more ready to make those changes.

22

Page 24: Functional Assessment of Erasure Coded Storage Archive

Questions?

23

Page 25: Functional Assessment of Erasure Coded Storage Archive

AcknowledgementsSpecial Thanks to :

Dane Gardner - NMC InstructorMatthew Broomfield - NMC Teaching Assistant

HB Chen - HPC-5 - MentorJeff Inman - HPC-1- MentorCarolyn Connor - HPC-5, Deputy Director ISTIAndree Jacobson - Computer & Information Systems Manager NMCJosephine Olivas - Program Administrator ISTI Los Alamos National Labs, New Mexico Consortium, and ISTI

24