HBase/PHOENIX @ Scale: A study of Salesforce’s use of HBase and Phoenix. Lars Hofhansl, Vice President and Principal Architect at Salesforce; Apache HBase and Apache Phoenix committer and PMC member.

Page 1: HBase/PHOENIX @ Scale

HBase/PHOENIX @ Scale

A study of Salesforce’s use of HBase and Phoenix

Lars Hofhansl

Vice President and Principal Architect at Salesforce

Apache HBase, Apache Phoenix committer and PMC member

Page 2: HBase/PHOENIX @ Scale

Two Years Ago, I showed you this

Page 3: HBase/PHOENIX @ Scale
Page 4: HBase/PHOENIX @ Scale

Zookeeper?

Page 5: HBase/PHOENIX @ Scale

Zookeeper?

HBase?

Page 6: HBase/PHOENIX @ Scale

Zookeeper?

HBase?

HDFS?

Page 7: HBase/PHOENIX @ Scale

Zookeeper?

Commodity Hardware?

HBase?

HDFS?

Page 8: HBase/PHOENIX @ Scale

Zookeeper?

Commodity Hardware?

HBase?

HDFS?

Unstructured Data?

Page 9: HBase/PHOENIX @ Scale

HBase/Phoenix are BIG* at Salesforce

* Numbers are from some time in the past and do not reflect the current scale

Page 10: HBase/PHOENIX @ Scale

Heavy users of relational and semi-structured data

Page 11: HBase/PHOENIX @ Scale

Mix of customer* and internal data

* Through the Salesforce Platform Offering

Page 12: HBase/PHOENIX @ Scale

Typical Use Cases

Samples of Customer Data:

Login data to track anomalies in real-time

Archiving, historical data moved from operational, relational storage to HBase

Denormalized feed views for Salesforce Chatter

Chatter @mention low-latency relevancy queries

Storage of user activity on marketing campaigns for reporting and AI/ML

Samples of Internal usage:

Periodic thread dumps from all AppServers

Machine metrics from all machines

Page 13: HBase/PHOENIX @ Scale

Customer Facing

Page 14: HBase/PHOENIX @ Scale

> 100 clusters of varying size

Page 15: HBase/PHOENIX @ Scale

~4bn write requests / day

Page 16: HBase/PHOENIX @ Scale

~80TB written / day

Page 17: HBase/PHOENIX @ Scale

That’s ~8 Gbit/s, sustained
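
The sustained rate follows from the ~80TB written per day. Here is a minimal sketch of that arithmetic, assuming the load is spread evenly over the 86,400 seconds in a day. The slide does not say whether "TB" means decimal (10^12 bytes) or binary (2^40 bytes) terabytes, so both are shown; the function name is hypothetical, chosen only for illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def sustained_gbit_per_s(bytes_per_day: float) -> float:
    """Average bit rate in Gbit/s for a given daily byte volume."""
    bits_per_day = bytes_per_day * 8
    return bits_per_day / SECONDS_PER_DAY / 1e9


# Assumption: the slide's "80TB" is either decimal TB or binary TiB.
decimal_tb = sustained_gbit_per_s(80 * 10**12)  # 80 TB of 10^12 bytes
binary_tib = sustained_gbit_per_s(80 * 2**40)   # 80 TiB of 2^40 bytes

print(f"{decimal_tb:.1f} Gbit/s (TB), {binary_tib:.1f} Gbit/s (TiB)")
```

Both readings land between 7 and 8.5 Gbit/s, consistent with the rounded figure on the slide.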

Page 18: HBase/PHOENIX @ Scale

~600m read requests / day

Page 19: HBase/PHOENIX @ Scale

~500GB read / day
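
The four daily figures above can be combined into rough per-request averages. This is only a back-of-the-envelope sketch: it assumes decimal units, and it assumes that the byte volumes and the request counts describe the same workload, which the slides do not state explicitly. The function name is hypothetical.

```python
def avg_bytes_per_request(bytes_per_day: float, requests_per_day: float) -> float:
    """Mean payload size per request, in bytes."""
    return bytes_per_day / requests_per_day


# Figures from the slides (approximate, decimal units assumed).
WRITE_BYTES, WRITE_REQUESTS = 80e12, 4e9   # ~80TB, ~4bn requests per day
READ_BYTES, READ_REQUESTS = 500e9, 600e6   # ~500GB, ~600m requests per day

write_avg = avg_bytes_per_request(WRITE_BYTES, WRITE_REQUESTS)
read_avg = avg_bytes_per_request(READ_BYTES, READ_REQUESTS)
write_read_ratio = WRITE_REQUESTS / READ_REQUESTS

print(f"avg write: {write_avg / 1e3:.0f} kB, avg read: {read_avg:.0f} B")
print(f"write requests per read request: {write_read_ratio:.1f}")
```

Under these assumptions, a write request averages about 20 kB, a read request under 1 kB, and write requests outnumber read requests by roughly 7 to 1, which points to a write-heavy workload.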

Page 20: HBase/PHOENIX @ Scale

Central Metrics Database

Page 21: HBase/PHOENIX @ Scale

Central Metrics Database

Page 22: HBase/PHOENIX @ Scale

Collecting data from > 80,000 machines

Page 23: HBase/PHOENIX @ Scale

11.4 trillion metrics stored and growing

Page 24: HBase/PHOENIX @ Scale

2.8 tn metrics in 6 months and growing

Page 25: HBase/PHOENIX @ Scale

210 bn reads in 6 months
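
To put the six-month totals in per-second terms, here is a small sketch. It assumes "6 months" means about 182.5 days and that the load is uniform over that period; neither assumption comes from the slides, and the names are hypothetical.

```python
SECONDS_PER_DAY = 86_400
DAYS_IN_SIX_MONTHS = 182.5  # assumption: half of a 365-day year


def per_second(total: float, days: float = DAYS_IN_SIX_MONTHS) -> float:
    """Average events per second for a total accumulated over `days`."""
    return total / (days * SECONDS_PER_DAY)


metrics_written = 2.8e12  # 2.8 tn metrics in 6 months
metrics_reads = 210e9     # 210 bn reads in 6 months

writes_per_s = per_second(metrics_written)
reads_per_s = per_second(metrics_reads)
writes_per_read = metrics_written / metrics_reads

print(f"~{writes_per_s:,.0f} metrics written/s, ~{reads_per_s:,.0f} reads/s")
print(f"~{writes_per_read:.1f} metrics written per read")
```

With these assumptions the metrics database ingests roughly 178,000 metrics per second and serves roughly 13,000 reads per second, or about 13 metrics written for every read.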