24
THE CURRENT AND THE FUTURE OF CEPH HAOMAI WANG 2015.10.30

The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

  • Upload
    dangdan

  • View
    223

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT AND THE FUTURE OF CEPHHAOMAI WANG 2015.10.30

Page 2: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

ABOUT

I’M HAOMAI WANG

▸ Ceph core developer

▸ GSOC 2014, 2015 Ceph mentor

▸ Maintain KeyValueStore, AsyncMessenger, focus on Performance optimization

▸ Involve in database, local filesystem and storage

▸ NetBSD on VirtualBox author

[email protected]

Page 3: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

TEXT

AGENDA

▸ What is Ceph?

▸ The current Ceph and the roadmap

Page 4: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

Page 5: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

CEPH MOTIVATION PRINCIPLES

▸ everything must scale horizontally no single point of failure commodity hardware

▸ self-manage whenever possible

▸ move beyond legacy approaches

▸ client/cluster instead of client/server

▸ avoid ad hoc high-availability

▸ open source

Page 6: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

CEPH MOTIVATION PRINCIPLES

Page 7: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

CEPH ECOSYSTEM

Page 8: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

FEATURES

Page 9: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

REPLICATION/TIERING

Page 10: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

CRUSH▸ Cephs data distribution mechanism

▸ Pseudo-random placement algorithm

▸ Deterministic function of inputs

▸ Clients can compute data location

▸ Rule-based configuration

▸ Desired/required replica count

▸ Affinity/distribution rules

▸ Infrastructure topology

▸ Weighting

▸ Excellent data distribution

▸ De-clustered placement

▸ Excellent data-re-distribution

▸ Migration proportional to change

▸ failure prediction*

Page 11: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

USE CASES

▸ The largest Ceph cluster: CERN

▸ Yahoo Flick

▸ SourceForge

▸ DreamHost

▸ eBay

▸ Deutsche Telekom AG

▸ OpenStack Cloud(~44%)

Page 12: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

WHAT IS CEPH?

VENDOR

▸ Redhat

▸ Intel

▸ Sandisk

▸ Samsung

▸ Fujitsu

▸ Suse

▸ Canonical

Page 13: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

Page 14: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

INTERNAL OVERVIEW

Dispatch Layer

IO Replicated

ObjectStore Layer

File System

Block Device Interface

Sockets

TCP

IP

Ethernet

Virtual Memory

Messenger Layer

Recovery Scrub Tiering

Scheduler

Thread

DRAM

IO Controller

Disk

Network Controller

Port

Memory Library

CPU Interconnect

Queue

FileJournal

FileStore

LibRBDApplication

RadosGW

LibRadosSession

Messenger

Page 15: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

CEPH STORAGE ENGINE

▸ FileStore

▸ NewStore: Replacing FileStore*

▸ KeyValueStore

▸ LevelDB/RocksDB/LMDB

▸ Kinetics API

▸ Samsung uFTL*

▸ Sandisk SSD Library*

▸ MemStore

▸ Memory Management(malloc/free)

▸ NVM(PMBackend, libpmem)*

Page 16: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

THE NEW TIERING

▸ The new storage mountain

▸ The new challenge:

▸ More storage medium

▸ More complexity management way

▸ Data lake

▸ Migrate data with “temperature”

Page 17: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

NETWORK

▸ TCP Messenger

▸ posix socket

▸ DPDK*

▸ SolarFlare*

▸ RDMA

Page 18: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

QOS

▸ Priority based

▸ client priority

▸ message priority

▸ mLock algorithm*

▸ each message with “tag”

▸ exchange window size p2p

Page 19: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

LIBRADOS▸ Object

▸ Name

▸ Attributes

▸ Data

▸ key/value data

▸ random access insertion, deletion, range query/list

▸ Operation

▸ CAS(Compare And Swap)

▸ Group Operation: Atomic, Rollback

▸ Snapshot: Object Granularity

▸ Copy On Write

▸ Rados Classes

▸ code runs directly inside storage server I/O path

▸ Watch/Notify

▸ Multi Object Transactions*

Page 20: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

RADOS CLASSES - COMPUTE IN STORAGE SIDE

▸ write new RADOS “methods”

▸ code runs directly inside storage server I/O path

▸ simple plugin API; admin deploys a .so

▸ read-side methods

▸ process data, return result

▸ write-side methods

▸ process, write; read, modify, write

▸ generate an update transaction that is applied atomically

▸ Use cases:

▸ distributed “grep”

▸ LUA interpreter

Page 21: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

RBD

▸ Thin Provision

▸ Snapshot

▸ Clone

▸ Multi-Client Support

▸ Kernel Client

▸ KVM/XEN

▸ VMWare VVOL*

▸ iSCSI

▸ LIO TCMU + loopback(FUSE)*

▸ Active/Passive*

▸ Active/Active**

Page 22: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

RADOSGW

▸ S3/Swift

▸ Active/Slave

▸ One Writer

▸ Multi Active Sites*

▸ Hadoop/Spark FileSystem Interface*

▸ NFS protocol aware*

Page 23: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THE CURRENT CEPH AND THE ROADMAP

CEPHFS

▸ Dynamic subtree partition

▸ Strict posix compatible

▸ NFS

▸ QEMU VM

▸ virtues

▸ nfs over sock

▸ FSCK

▸ Multi-tenant

Page 24: The current Ceph and the future - cnblogs.comfiles.cnblogs.com/.../The_current_and_the_future_of_Ceph.pdf · ABOUT I’M HAOMAI WANG Ceph core developer GSOC 2014, 2015 Ceph mentor

THANK YOU!

2015.10

END