CS 744: Big Data Systems Shivaram Venkataraman Fall 2018

CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

Download PDF Report

Upload
others
View
6
Download
0

Embed Size (px)

Citation preview

CS 744: Big Data Systems

Shivaram Venkataraman Fall 2018

ADMINISTRIVIA

-  Assignment 1 -  Projects -  Piazza

MOTIVATION

Storing large amounts of semi-structured data - Traditionally done using database systems

Varied processing needs

- low latency to bulk processing - data size - schema

Page 4: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

BIGTABLE: HIGHLIGHTS

1. Scalability: Petabytes of data, thousands of machines

2. Wide applicability: Handles > 60 applications 3. Fault tolerant: High availability 4. High Performance

Page 5: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

OUTLINE

-  Data Model and API -  Architecture -  Master, Tabletserver functionality -  Optimizations

Page 6: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

DATA MODEL

Rows Column Families Versions “Timestamps”

Page 7: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

WRITE API

Single row at a time! Set a number of columns or delete some Apply is atomic Support for read-modify-write transactions

Page 8: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

SCAN API

Fetch any number of columns, column families Filter rows by regex Iterator pattern, rows arriving in sorted order

Page 9: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

TaBLETS

Page 10: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

SYSTEM ARCHITECHTURE

GFS: Store tablets, replicate

Chubby: Leader election, store metadata

BigTable TabletServer BigTable TabletServer BigTable TabletServer

BigTable Master: metadata ops, rebalancing

Serve data from tablets

Page 11: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

CHUBBY: A LOCK SERVICE

Leader election: Classic problem in distributed systems Approach: Build a separate service to handle leader election Properties:

- Uses Paxos algorithm - Low write throughput - Store small amounts of data

Page 12: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

TABLET LOCATION

- Hierarchical metadata -  Root of metadata in

Chubby

-  Client library caches tablet locations

Page 13: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

MASTER FUNCTIONALITIES

Tablet assignment - Master tracks tablet à tablet server mapping - METADATAhas the complete list of tablets - Each tabletserver has list of tablets that are being served

- Uses heartbeat + Chubby to detect tablet server failures - On master failure, scan METADATAand list tablet servers

Page 14: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

WORKER FUNCTIONALITY

Tablets stored in GFS Writes

- Commit log - Insert memtable

Read

- Merge SSTable and memtable

Page 15: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

WORKER FUNCTIONALITY

Challenge: Memtable keeps growing over time Minor Compaction

- Freeze memtable, write it as SSTable to disk - But now need to merge more SSTables

Major Compaction

- Read memtable + all SSTables for this tablet - Write out new SSTable. Handles garbage collection

Page 16: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

NOTABLE OPTIMIZATIONS

Caching - Scan Cache: key-value pairs returned by the SSTable - Block Cache: SSTables blocks that were read from GFS.

Bloom filter

- Probabilistic data structure: Definitely not or maybe in it - Use this to eliminate SSTables that need to be read

Page 17: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

OTHER OPTIMIZATIONS

-  Single commit log per tabletserver -  Sort commit log entries during recovery -  Tablet Splitting

-  Tablet server records changes in METADATA table -  Child tablets share SSTables with parent

Page 18: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

LADIS (2009)

Page 19: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

BIGTABLE: DISCUSSION

Generality vs. Specificity Simplicity, Layering Scalability User overheads

Page 20: CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-bigtable.pdf · MOTIVATION Storing large amounts of semi-structured data - Traditionally done using database

QUESTIONS / DISCUSSION ?

CommunicationTheory of Secrecy Systemspages.cs.wisc.edu/~rist/642-spring-2014/shannon-secrecy.pdfCommunicationTheory of Secrecy Systems By C. E. SHANNON 1 INTRODUCTION AND SUMMARY

Documents

BigTable: A Distributed Storage System for Structured Datacourse/DBMS/class/bigtable/bigtable.pdf · • BigTable is a working system catering the needs for a wide variety of applications

Documents

PRETZEL:Openingthe BlackBoxof Machine Learning Prediction ...pages.cs.wisc.edu/~shivaram/cs744-slides/cs744-pretzel-qinyuan.pdf · Machine Learning Prediction Serving 1.Models are

Documents

Replicated Data Managemen t in Distributed Systemspages.cs.wisc.edu/~remzi/Classes/739/Papers/basic... · 2016-10-31 · Replicated Data Managemen t in Distributed Systems Mustaque

Documents

TensorFlow: A system for large-scale machine learningpages.cs.wisc.edu/~shivaram/cs744-readings/TensorFlow.pdf · Josh Levenberg, Rajat Monga, Sherry Moore, Derek G. Murray, Benoit

Documents

Partial Results in Database Systemspages.cs.wisc.edu/~wlang/mod347-langAemb.pdf · 2014-05-02 · Partial Results in Database Systems Willis Lang, Rimma V. Nehme, Eric Robinson, Jeffrey

Documents

Rethinking SIMD Vectorization for In-Memory Databasespages.cs.wisc.edu/~shivaram/cs744-slides/cs744-harshal-simd.pdf · Sri Harshal Parimi . Motivation ´ Need for fast analytical

Documents

Jayashankar - University of Wisconsin–Madisonpages.cs.wisc.edu/.../cs744-jayashankar-mesos.pdf · Mesos Architecture . Resource Offer ... Master is designed to be soft state, new

Documents

CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-trill.pdf · - Real-time streaming, temporal queries on logs, progressive queries etc. Language Integration

Documents

DISTRIBUTED SYSTEMSpages.cs.wisc.edu/~shivaram/cs537-sp19-notes/dist...– client/server: web server and web client – cluster: page rank computation. Why Go Distributed? More computing

Documents

LFS, DISTRIBUTED SYSTEMSpages.cs.wisc.edu/.../dist/cs537-lfs-dist-notes.pdf · LFS reclaims segments (not individual inodes and data blocks) - Want future overwites to be to sequential

Documents

Energy Attribution and Accounting for Multi-core Systemspages.cs.wisc.edu/~ramak/rawork/RTDEMS.pdf · 2012. 1. 24. · Accurate Energy Attribution and Accounting for Multi-core Systems

Documents

- Varun Batra - University of Wisconsin–Madisonpages.cs.wisc.edu/.../cs744-slides/cs744-pipedream-varun.pdf · 2018-10-16 · §Mixing of Forward and Backward passes with different

Documents

Ray: A Distributed Framework for Emerging AI Applicationspages.cs.wisc.edu/~shivaram/cs744-readings/ray-arxiv.pdf · neous and evolves dynamically. A simulation can take from a few

Documents

Integrated Modeling for Optimization of Energy Systemspages.cs.wisc.edu/~ferris/talks/orsnz-dec.pdfIntegrated Modeling for Optimization of Energy Systems Michael C. Ferris University

Documents

MULTIMEDIA COMMUNICATION SYSTEMSpages.cs.wisc.edu/~kzhao32/projects/ece434p2motionAnalysis.pdf · ECE 434 Multimedia Systems Project 02 Motion Analysis 2013 November 10 Compare the

Documents

An example of CP system - uniroma1.itaniello/ds2016/DS2016-BigTable.pdf · An example of CP system: Google’s BigTable ... CP systems give up availability to get strong consistency

Documents

On Debugging Non-Answers in Keyword Search Systemspages.cs.wisc.edu/~wentaowu/papers/edbt15-non-answer.pdf · On Debugging Non-Answers in Keyword Search Systems Akanksha Baid Wentao

Documents

Introduction to Operating Systemspages.cs.wisc.edu/~remzi/Classes/537/Spring2012/Book/intro.pdf2.2 Virtualizing Memory Now let’s consider memory. The model of physical memory pre-sented

Documents

ASAP: Fast, Approximate Graph Pattern Mining at Scalepages.cs.wisc.edu/~shivaram/cs744-slides/cs744-yunang-asap.pdfosdi18_slides_iyer.pdf Iyer, Anand Padmanabha, et al. "ASAP: fast,

Documents

Devirtualizing memory in heterogeneous systemspages.cs.wisc.edu/~swapnilh/resources/asplos18_dvm_final.pdfheads, particularly for accelerators. Supporting conventional VM in accelerators

Documents

Managing Server Load in Global Memory Systemspages.cs.wisc.edu/~vernon/papers/gms.97sigm.pdfManaging Server Load in Global Memory Systems Geoffrey M. Voelker, Herve´ A. Jamrozik,

Documents

Zero-Interaction Authentication April 15, 2003 Mark D.Corner, Brian D. Noble Presented by Seong Oun Hwang CS744 Special Topics in System Architecture:

Documents

CS 744: Big Data Systemspages.cs.wisc.edu/~shivaram/cs744-slides/cs744-spark-streaming-v2.pdfCONTINUOUS OPERATOR MODEL Long-lived operators Distributed Checkpoints High overhead for

Documents

GraphX : Graph Processing in a Distributed Dataflow Frameworkpages.cs.wisc.edu/~shivaram/cs744-slides/cs744-graphx-bidyut.pdf · Google Knowledge graph :570MVertices, 18B Edges (

Documents

Magellan: Toward Building Entity Matching Management Systemspages.cs.wisc.edu/~anhai/papers/magellan-vldb16.pdf · The Magellan Solution: To address these limitations, we describe

Documents

BigTable - cseweb.ucsd.educseweb.ucsd.edu/classes/fa16/cse291-g/applications/ln/BigTable.pdf · Rows •Arbitrary strings •Reads and writes of rows are atomic (even across columns)

Documents

CS 537: INTRO TO OPERATING SYSTEMSpages.cs.wisc.edu/~shivaram/cs537-sp19-notes/intro/cs537-intro.pdf · CS 537: INTRO TO OPERATING SYSTEMS Shivaram Venkataraman Spring 2019 . WHO

Documents

Protocol- and Situation-Aware Distributed Storage Systemspages.cs.wisc.edu › ~ra › ram-thesis.pdf · Bharath Ramakrishnan reminded me to have fun. I am grateful to my brothers

Documents

Clipper: A Low-Latency Online Prediction Serving Systempages.cs.wisc.edu/~shivaram/cs744-readings/clipper.pdfClipper: A Low-Latency Online Prediction Serving System Daniel Crankshaw*,

Documents