Cassandra Training: Introduction & Data Modeling

Introduction to Cassandra

• By the end of today you should know:

• How Cassandra organises data

• How to configure replicas

• How to choose between consistency and availability

• How to efficiently model data for both reads and writes

• Why you need to consider Active-Active scenarios

• Who to ask to help you & sign off on your data model

• HINT: Ask Neil directly or email harch@expedia.com.

Agenda – 100ft

• Quick Introduction

• Data Structures

• Efficient Data Modeling

• Data Modeling Examples

Elevator Pitch

Write path optimised

Eventually consistent (ms)

Distributed Hash Table

Highly durable

Tunable consistency

DHT 101

Each physical node is assigned a token

Each node owns the range from the previous node's token (exclusive) up to its own token (inclusive)

Cassandra Write Path

The coordinator sends the update to two replica nodes, starting at the owning node and working clockwise

128-bit hash of the partition key determines its position (token) on the ring

Keys are therefore distributed randomly around the ring
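To make the ring mechanics above concrete, here is a minimal Java sketch of token ownership and clockwise replica selection. It is a toy model, not Cassandra's code: the class `TokenRing`, its methods and the evenly spaced example tokens are all hypothetical, and the MD5 hash is simply read as an unsigned 128-bit integer, which differs in detail from how a real partitioner derives tokens.

```java
import java.math.BigInteger;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.List;
import java.util.TreeMap;

// Toy token ring (hypothetical names): illustrative only.
class TokenRing {
    // token -> node name, sorted so the ring can be walked clockwise
    private final TreeMap<BigInteger, String> ring = new TreeMap<>();

    void addNode(BigInteger token, String node) {
        ring.put(token, node);
    }

    // 128-bit MD5 hash of the partition key, read as a non-negative integer.
    static BigInteger tokenFor(String partitionKey) {
        try {
            MessageDigest md5 = MessageDigest.getInstance("MD5");
            return new BigInteger(1, md5.digest(partitionKey.getBytes(StandardCharsets.UTF_8)));
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    // The owner is the first node whose token is >= the key's token, wrapping
    // to the lowest token at the end of the ring. Further replicas are the
    // next distinct nodes clockwise until rf nodes have been collected.
    List<String> replicasFor(BigInteger keyToken, int rf) {
        List<String> replicas = new ArrayList<>();
        if (ring.isEmpty()) {
            return replicas;
        }
        BigInteger start = ring.ceilingKey(keyToken);
        if (start == null) {
            start = ring.firstKey();
        }
        BigInteger current = start;
        do {
            String node = ring.get(current);
            if (!replicas.contains(node)) {
                replicas.add(node);
            }
            BigInteger next = ring.higherKey(current);
            current = (next != null) ? next : ring.firstKey();
        } while (replicas.size() < rf && !current.equals(start));
        return replicas;
    }

    public static void main(String[] args) {
        TokenRing ring = new TokenRing();
        BigInteger quarter = BigInteger.ONE.shiftLeft(126); // a quarter of the 128-bit space
        for (int i = 1; i <= 4; i++) {
            ring.addNode(quarter.multiply(BigInteger.valueOf(i)), "node" + i);
        }
        // Which two nodes would hold the row keyed "20121004"?
        System.out.println(ring.replicasFor(tokenFor("20121004"), 2));
    }
}
```

Because the hash output is effectively random, adjacent keys such as 20121004 and 20121005 usually land on unrelated parts of the ring, which is why rows are not stored in key order.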

If a replica is unavailable: hinted handoff (the coordinator keeps the write and replays it when the replica returns)

• SSTables are sequential and immutable

• A row's data may be spread across several SSTables

• SSTables are periodically compacted together

Cassandra Read Path

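Data read command sent to the closest replica, as ranked by the snitch

Digest commands sent to as many other replicas as the consistency level (CL) requires

Read repair chance 10%: on roughly one read in ten, digests are requested from all replicas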
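The digest comparison described above can be sketched in Java as follows. This is an assumption-laden illustration rather than Cassandra's implementation: `DigestRead` and its methods are hypothetical names, row data is represented as a plain string, and MD5 stands in for whatever digest the server actually computes over the serialised row.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch of a digest read: one replica returns full data,
// the others return only a hash of what they hold.
class DigestRead {
    static byte[] digest(String rowData) {
        try {
            return MessageDigest.getInstance("MD5")
                    .digest(rowData.getBytes(StandardCharsets.UTF_8));
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    // True when every digest matches the digest of the full data response.
    // A mismatch means at least one replica is stale and needs repairing.
    static boolean replicasAgree(String dataResponse, List<byte[]> digestResponses) {
        byte[] expected = digest(dataResponse);
        for (byte[] d : digestResponses) {
            if (!Arrays.equals(expected, d)) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        List<byte[]> digests = Arrays.asList(digest("it broke"), digest("it broke again"));
        System.out.println(replicasAgree("it broke", digests)); // false: one replica differs
    }
}
```

Sending digests instead of full rows keeps network traffic low in the common case where the replicas already agree.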

Start & Interrogate C*

• vagrant box add dse.box http://htraining.s3.amazonaws.com/dse.box

• mkdir ~/vagrant

• curl http://htraining.s3.amazonaws.com/vagrant-dse.tar.gz > ~/vagrant/dse.tar.gz

• cd ~/vagrant && tar xzvf dse.tar.gz

• cd dse && vagrant up

• vagrant ssh node1

• nodetool ring

Cassandra Read Path

Read Mechanics

Find Candidate SSTables - Bloom Filters

Seek Through SSTables

Memory Mapped Files

Check Memtable

→ Minimise the number of SSTables a read has to touch for best efficiency
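As a rough illustration of how a Bloom filter lets a read skip SSTables that cannot contain a key, here is a minimal Java sketch. The class `SimpleBloomFilter`, its sizing and its hashing scheme are hypothetical choices made for brevity and are not the filter Cassandra ships.

```java
import java.util.BitSet;

// Minimal Bloom filter sketch (hypothetical): one would be kept per SSTable.
class SimpleBloomFilter {
    private final BitSet bits;
    private final int size;
    private final int hashCount;

    SimpleBloomFilter(int size, int hashCount) {
        this.bits = new BitSet(size);
        this.size = size;
        this.hashCount = hashCount;
    }

    // Derive the i-th bit position from two base hashes (double hashing).
    private int position(String key, int i) {
        int h1 = key.hashCode();
        int h2 = (h1 >>> 16) | 1; // forced odd so successive probes differ
        return Math.floorMod(h1 + i * h2, size);
    }

    void add(String key) {
        for (int i = 0; i < hashCount; i++) {
            bits.set(position(key, i));
        }
    }

    // false means the key is definitely absent, so the SSTable can be skipped;
    // true means the key may be present and the SSTable must be checked.
    boolean mightContain(String key) {
        for (int i = 0; i < hashCount; i++) {
            if (!bits.get(position(key, i))) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        SimpleBloomFilter sstableFilter = new SimpleBloomFilter(1024, 3);
        sstableFilter.add("20121004");
        System.out.println(sstableFilter.mightContain("20121004")); // true: added keys are never missed
    }
}
```

False positives are possible but false negatives are not, so the filter can only save work, never hide data.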

Deletion & Tombstones

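Deleted data is marked as removed with a tombstone

Stops zombie data: a replica that missed the delete cannot resurrect the value in a distributed system

Tombstones are collected at compaction after a configurable grace period (gc_grace_seconds, 10 days by default)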
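The interplay between timestamps, tombstones and the grace period can be sketched as below. This is a simplified model under stated assumptions: `TombstoneDemo`, `Cell`, `reconcile` and `purgeable` are hypothetical names, a null value stands for a tombstone, and ties between equal timestamps are resolved in favour of the first argument purely for simplicity.

```java
// Hypothetical sketch of last-write-wins reconciliation with tombstones.
class TombstoneDemo {
    static final class Cell {
        final String value;          // null marks a tombstone
        final long timestampMillis;  // when the write or delete happened

        Cell(String value, long timestampMillis) {
            this.value = value;
            this.timestampMillis = timestampMillis;
        }

        boolean isTombstone() {
            return value == null;
        }
    }

    // The newer cell survives, whether it carries data or a tombstone.
    static Cell reconcile(Cell a, Cell b) {
        return b.timestampMillis > a.timestampMillis ? b : a;
    }

    // A tombstone may only be dropped once it is older than the grace period,
    // giving every replica time to learn about the delete.
    static boolean purgeable(Cell cell, long nowMillis, long gcGraceMillis) {
        return cell.isTombstone() && nowMillis - cell.timestampMillis > gcGraceMillis;
    }

    public static void main(String[] args) {
        long day = 24L * 60 * 60 * 1000;
        Cell written = new Cell("it broke", 1_000);
        Cell deleted = new Cell(null, 2_000);
        Cell winner = reconcile(written, deleted);
        System.out.println(winner.isTombstone());                          // true
        System.out.println(purgeable(winner, 2_000 + 11 * day, 10 * day)); // true
    }
}
```

If the tombstone were dropped before a lagging replica had seen it, that replica's old value would win the next reconciliation and the deleted data would reappear.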

Brewer’s Theorem

Distributed Data

– only 2 at a time –

Consistency

Availability

Partition Tolerance

Brewer’s Theorem

CA - normal operation, no partition, consistency and availability provided

Brewer’s Theorem

AP - partition occurs; maintaining two mutable, disconnected copies of state breaks consistency, but availability is preserved

Brewer’s Theorem

CP - partition occurs, to maintain consistency we need to take one side offline, sacrificing availability

Tuneable Consistency

Cassandra Consistency Level

Specify how many replica nodes must agree on each read/write

Choose consistency or availability:

CL.LOCAL_QUORUM, CL.ONE

Eventual consistency will bring both sides back into agreement once the partition heals
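The trade-off behind choosing a consistency level comes down to simple arithmetic on the replication factor, sketched here in Java. The class and method names are hypothetical; the formulas are the usual ones (a quorum is a majority of replicas, and reads are guaranteed to see the latest acknowledged write when the read and write replica sets must overlap).

```java
// Hypothetical helper illustrating consistency-level arithmetic.
class ConsistencyMath {
    // A quorum is a strict majority of the replicas.
    static int quorum(int replicationFactor) {
        return replicationFactor / 2 + 1;
    }

    // If readReplicas + writeReplicas > RF, every read set shares at least
    // one replica with every write set, so reads see the latest write.
    static boolean isStronglyConsistent(int readReplicas, int writeReplicas, int replicationFactor) {
        return readReplicas + writeReplicas > replicationFactor;
    }

    // An operation can proceed only while enough replicas are reachable.
    static boolean isAvailable(int requiredReplicas, int liveReplicas) {
        return liveReplicas >= requiredReplicas;
    }

    public static void main(String[] args) {
        int rf = 3;
        System.out.println(quorum(rf));                                       // 2
        System.out.println(isStronglyConsistent(quorum(rf), quorum(rf), rf)); // true: 2 + 2 > 3
        System.out.println(isStronglyConsistent(1, 1, rf));                   // false: 1 + 1 <= 3
        System.out.println(isAvailable(quorum(rf), 2));                       // true: survives one node down
    }
}
```

Reading and writing at CL.ONE maximises availability at the cost of possibly stale reads; quorum on both sides gives consistent reads while still tolerating the loss of a minority of replicas.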

Agenda – 100ft

• Data Structures

Data Model

Keyspace

Analogous to Database/Schema

Segregate Applications

Replication configured at this level

Data Model

Column Family

Analogous to Table

Contains many rows

Caches configurable at this level

Data Model

Row

Each row has a partition key, which is hashed to place the row on the ring

Has many columns – up to 2Bn

Columns don’t have to be defined ahead of time

Rows in the same CF can have different columns

Rows are not sorted by key; model ordering with columns inside a row

Data Model

Columns

Sorted by name before being written to SSTable

Name and Value are typed

Values can be type-validated

Column update is timestamped

Can have TTL

Data Model

Counter Columns

Distributed counters

Can get false counts: increments are not idempotent, so a write retried after a timeout may be counted twice

Data Model

Super Columns – Don’t Use

Blob of columns stored inside a single column

Have to read and write whole blob

Memory intensive

Conflicts resolved for whole blob - bad

Secondary Indices

Can define an index on a column

Cassandra will maintain an inverted index

Use sparingly

Low Cardinality Columns Only

Often better to maintain your own view

Thrift vs CQL

Thrift

Original interface, hash style syntax

CQL

SQL-like syntax but highly limited

Sent over Thrift today, with plans for its own native protocol

Scaling Cassandra

Imagine RF=3, Quorum, Nodes=6

Each quorum query waits synchronously on 2 nodes

Each write is sent to all 3 replica nodes, the third asynchronously

To scale writes add more nodes

To scale reads, add more replicas

Agenda – 100ft

• Efficient Data Modeling

Data Modelling - Concepts

Rows in same CF will live on different nodes

High cost of multi-get

De-normalise your data into rows

Don’t Put Consistent Load on Single Row

Will heat up replica nodes

Data Modelling - Concepts

Writes to Single Row Atomic & Isolated

Columns are Ordered

Column Range Slicing Efficient

Mutating data often needs compaction tuning

Wide Rows

Efficient Reads

Store how you want to fetch

Fetch most efficient over few rows

Store what you want to fetch in few rows

Time Series

Use Timestamp for Column Name – ordered

Range slicing efficient

Can limit row length by using date partition key

e.g. 20121004

Composite Columns

Composite Column

e.g. time1:log_class, time1:log_message, time2:log_class, time2:log_message

Time Series

Writing to a single row creates a hotspot

Use Round Robin Over Rows

e.g. 20121004:1, 20121004:2, etc…
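The date-bucketed, round-robin row keys described above could be generated along these lines. This is a minimal sketch: the class `TimeSeriesKeys`, the shard count and the `<yyyyMMdd>:<shard>` key format are assumptions modelled on the example keys, not a prescribed scheme.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical row-key generator for a sharded time series.
class TimeSeriesKeys {
    private static final DateTimeFormatter DAY = DateTimeFormatter.BASIC_ISO_DATE; // yyyyMMdd
    private final int shards;
    private final AtomicLong counter = new AtomicLong();

    TimeSeriesKeys(int shards) {
        this.shards = shards;
    }

    // Row key for the next write: the day bucket plus a shard number chosen
    // round robin, so consecutive writes land on different rows.
    String nextWriteKey(LocalDate day) {
        long shard = counter.getAndIncrement() % shards + 1;
        return DAY.format(day) + ":" + shard;
    }

    // A reader has to fetch every shard of the day and merge the results.
    List<String> readKeys(LocalDate day) {
        List<String> keys = new ArrayList<>();
        for (int shard = 1; shard <= shards; shard++) {
            keys.add(DAY.format(day) + ":" + shard);
        }
        return keys;
    }

    public static void main(String[] args) {
        TimeSeriesKeys keys = new TimeSeriesKeys(2);
        LocalDate day = LocalDate.of(2012, 10, 4);
        System.out.println(keys.nextWriteKey(day)); // 20121004:1
        System.out.println(keys.nextWriteKey(day)); // 20121004:2
        System.out.println(keys.readKeys(day));     // [20121004:1, 20121004:2]
    }
}
```

The cost of spreading writes is paid on the read side: a query for one day becomes a multi-get over all shards, so the shard count should stay small.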

Compound Keys

Compound Key in CQL3

Partition Key is the row key

Compound Key = Partition Key + Composite Key

e.g. partition key = 20121004, composite key = time1

20121004 => time1:name, time1:msg, time2:name, time2:msg
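To show how a compound key flattens into one wide row, here is a small Java model of the storage layout. It is only an illustration: `WideRowLayout` is a hypothetical class, and composite column names are modelled as plain strings joined with ":" and sorted lexically, whereas real composites are typed and compared component by component.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.SortedMap;
import java.util.TreeMap;

// Hypothetical in-memory model of a wide row keyed by a compound key.
class WideRowLayout {
    // partition key -> (composite column name -> value), columns sorted by name
    private final Map<String, TreeMap<String, String>> rows = new HashMap<>();

    // One logical record field becomes one storage column "<clustering>:<field>".
    void put(String partitionKey, String clustering, String field, String value) {
        rows.computeIfAbsent(partitionKey, k -> new TreeMap<>())
            .put(clustering + ":" + field, value);
    }

    // Range slice: every column whose clustering component lies in [from, to).
    SortedMap<String, String> slice(String partitionKey, String from, String to) {
        TreeMap<String, String> row = rows.get(partitionKey);
        if (row == null) {
            return new TreeMap<>();
        }
        return row.subMap(from + ":", to + ":");
    }

    public static void main(String[] args) {
        WideRowLayout cf = new WideRowLayout();
        cf.put("20121004", "time1", "name", "error");
        cf.put("20121004", "time1", "msg", "it broke");
        cf.put("20121004", "time2", "name", "error");
        cf.put("20121004", "time2", "msg", "it broke again");
        // Only the columns for time1 come back, in column-name order.
        System.out.println(cf.slice("20121004", "time1", "time2").keySet()); // [time1:msg, time1:name]
    }
}
```

Because all columns sharing a clustering prefix sit next to each other within the row, fetching one logical record or a time range is a single contiguous slice.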

Agenda – 100ft

• Data Modeling Examples

Working with CQL

• cqlsh -3 192.168.33.21

• CREATE KEYSPACE my_app_data

WITH strategy_class = SimpleStrategy

AND strategy_options:replication_factor = 2;

• DESCRIBE KEYSPACE my_app_data;

Compound Keys

USE my_app_data;

CREATE COLUMNFAMILY logs (

day text, -- partition key

log_id timeuuid, -- clustering column

log_class text,

log_message text,

primary key (day, log_id)

);

DESCRIBE columnfamilies;

Compound Keys

INSERT INTO logs

(day,log_id,log_class,log_message)

VALUES ('20130604', '2013-06-04 10:05:00', 'error', 'it broke')

USING CONSISTENCY ONE;

INSERT INTO logs

(day,log_id,log_class,log_message)

VALUES ('20130604', '2013-06-04 11:05:00', 'error', 'it broke again')

USING CONSISTENCY QUORUM;

Compound Keys

SELECT * FROM logs USING CONSISTENCY ONE

WHERE day='20130604';

SELECT * FROM logs USING CONSISTENCY QUORUM

WHERE day='20130604'

AND log_id > '2013-06-04 11:00:00';

Try again with CL.TWO after suspending a replica: vagrant suspend node2

Setting CL and range querying columns, losing consistency

Compound Keys

cassandra-cli -h 192.168.33.21

use my_app_data;

list logs;

See the raw Cassandra data

Code Example - Clients

Hector

Solid Java Client

In Use in Production

Round Robin

Node Discovery

Code Example - Clients

Astyanax

Netflix Open Source Library

Simpler APIs

Code Example

Example: Storing Payment Methods

https://github.com/neilbeveridge/example-compoundkeys

Code Example

Requirements

Store 1-10 payment methods

Use a single row

Code Example

Non-CQL

Define a composite column class

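// Composite column name: <paymentUuid>:<field>
public static final class Composite {
    private @Component(ordinal = 0) String paymentUuid;
    private @Component(ordinal = 1) String field;

    public Composite() {} // no-arg constructor for the serializer
    public Composite(String paymentUuid, String field) { this.paymentUuid = paymentUuid; this.field = field; }
    public String getPaymentUuid() { return paymentUuid; }
}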

Code Example

Writing Data

// a time-based UUID identifies one payment method within the user's row
UUID paymentUUID = TimeUUIDUtils.getUniqueTimeUUIDinMillis();
String sPaymentUUID = paymentUUID.toString();
// one column per field, all sharing the payment UUID prefix; null = no TTL
batch.withRow(PAYMENTS_CF, userId)
    .putColumn(new Composite(sPaymentUUID, "pvtoken"), paymentInfo.pvToken, null)
    .putColumn(new Composite(sPaymentUUID, "name"), paymentInfo.name, null)
    .putColumn(new Composite(sPaymentUUID, "number"), paymentInfo.number, null);

Code Example

Reading Data

Need some logic to handle record boundaries

//handle the payment info boundary: a new UUID prefix starts a new payment
if (lastSeen != null && !column.getName().getPaymentUuid().equals(lastSeen)) {
    payments.add(payment);
    payment = new PaymentInfo();
}
payment.paymentUUID = UUID.fromString(column.getName().getPaymentUuid());
lastSeen = column.getName().getPaymentUuid();
// once the loop ends, the final payment still has to be added to payments
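The boundary-handling idea above can be tried without a Cassandra client using the self-contained sketch below. It is not the code from the example repository: `PaymentGrouping` is a hypothetical class, each column is modelled as a three-element array of payment UUID, field name and value, and the input is assumed to arrive sorted by payment UUID, as composite columns would.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for regrouping composite columns into records.
class PaymentGrouping {
    // Each column is {paymentUuid, field, value}; columns with the same
    // paymentUuid are adjacent because the row is sorted by column name.
    static List<Map<String, String>> group(List<String[]> columns) {
        List<Map<String, String>> payments = new ArrayList<>();
        Map<String, String> payment = null;
        String lastSeen = null;
        for (String[] column : columns) {
            // A change of UUID prefix marks the start of the next record.
            if (!column[0].equals(lastSeen)) {
                payment = new LinkedHashMap<>();
                payment.put("paymentUuid", column[0]);
                payments.add(payment);
                lastSeen = column[0];
            }
            payment.put(column[1], column[2]);
        }
        return payments;
    }

    public static void main(String[] args) {
        List<String[]> columns = Arrays.asList(
                new String[] {"uuid-1", "name", "Visa"},
                new String[] {"uuid-1", "number", "4111"},
                new String[] {"uuid-2", "name", "Amex"});
        System.out.println(group(columns).size()); // 2
    }
}
```

Adding each record to the result list as soon as its first column is seen avoids having to remember to flush the last record after the loop.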

Code Example

A Bit Messy

Code Example

Need to define a Schema

Cassandra needs it to split up the row for us

Code Example

Schema

create table paymentinfo_cql (

user text,

paymentid timeuuid,

name text,

number text,

pvtoken text,

primary key (user,paymentid)

);

Code Example

Inserting Data

insert into paymentinfo_cql (

user, paymentid, name, number, pvtoken

) values (

'%1$s','%2$s','%3$s','%4$s','%5$s'

);

Code Example

Reading Data

select * from paymentinfo_cql where user='%s';

Multi Datacentre Support

Cassandra RF=2 (availability), Solr RF=1 (offline search)

RFs set per keyspace and per logical datacentre

Multi Datacentre Support

Both DCs participate in same ring

Cassandra walks clockwise as normal to fulfill RFs
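The clockwise walk with a separate replication factor per datacentre can be sketched as follows. This is a deliberately simplified model: `MultiDcPlacement` and `Node` are hypothetical names, the ring is a list already in token order, rack awareness is ignored, and the datacentre names and RFs are taken from the example above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch of per-datacentre replica placement on a shared ring.
class MultiDcPlacement {
    static final class Node {
        final String name;
        final String dc;

        Node(String name, String dc) {
            this.name = name;
            this.dc = dc;
        }
    }

    // Walk the ring clockwise from the owning node, taking a node only while
    // its datacentre still needs replicas.
    static List<String> replicas(List<Node> ring, int ownerIndex, Map<String, Integer> rfPerDc) {
        Map<String, Integer> remaining = new HashMap<>(rfPerDc);
        List<String> chosen = new ArrayList<>();
        for (int i = 0; i < ring.size(); i++) {
            Node node = ring.get((ownerIndex + i) % ring.size());
            int needed = remaining.getOrDefault(node.dc, 0);
            if (needed > 0) {
                chosen.add(node.name);
                remaining.put(node.dc, needed - 1);
            }
        }
        return chosen;
    }

    public static void main(String[] args) {
        List<Node> ring = Arrays.asList(
                new Node("c1", "Cassandra"), new Node("s1", "Solr"),
                new Node("c2", "Cassandra"), new Node("c3", "Cassandra"),
                new Node("s2", "Solr"));
        Map<String, Integer> rf = new HashMap<>();
        rf.put("Cassandra", 2);
        rf.put("Solr", 1);
        System.out.println(replicas(ring, 0, rf)); // [c1, s1, c2]
    }
}
```

Nodes from a datacentre whose quota is already filled are simply skipped, which is how one ring can serve an online datacentre and a search datacentre with different RFs.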

Performance Tuning Levers

Memory Mapped Files

SSTables memory mapped

Visible as high virtual memory consumption

Read fastest when working set fits in free RAM

Row Cache

Saves locating SSTables, seeking, reconciliation

Off-heap: rows are stored serialised, so each hit pays a deserialisation penalty

Whole row in memory

Good for small numbers of hot rows – Gaussian dist.

Key Cache

Saves seeking through SSTables

Beneficial for large SSTables, e.g. under size-tiered compaction

On-heap

Cache hit-rates exposed over JMX

Take care using memory that might be stolen from the read path (VirtMem)

• By the end of today you should know:

• How Cassandra organises data

• How to configure replicas

• How to choose between consistency and availability

• How to efficiently model data for both reads and writes

• Why you need to consider Active-Active scenarios

• Who to ask to help you & sign off on your data model

• HINT: Ask Neil directly or email harch@expedia.com.

Questions

htraining.s3.amazonaws.com/cassandra-training.pptx
