33
Go DataDriven PROUDLY PART OF THE XEBIA GROUP @mids106 [email protected] Building a Big Data Warehouse Joris Bontje Big Data Hacker

Building a Big Data Warehouse

  • Upload
    mids106

  • View
    103

  • Download
    0

Embed Size (px)

DESCRIPTION

Big Data presentation at XebiCon2013 - http://xebicon.nl/

Citation preview

Page 1: Building a Big Data Warehouse

GoDataDrivenPROUDLY PART OF THE XEBIA GROUP

@[email protected]

Building a Big Data Warehouse

Joris BontjeBig Data Hacker

Page 2: Building a Big Data Warehouse

GoDataDriven

About MeBig Data HackerData Driven Solution ArchitectHadoop Trainer

Page 3: Building a Big Data Warehouse
Page 4: Building a Big Data Warehouse

GoDataDriven

About GoDataDriven

Page 5: Building a Big Data Warehouse

Data Warehouse Evolution

Page 6: Building a Big Data Warehouse

http://en.wikipedia.org/wiki/Data_warehouse

In computing, a data warehouse is a database used for reporting and data analysis.

Page 7: Building a Big Data Warehouse

GoDataDriven

Database Architecture (1.0)

Products)Customers)Orders)

Inventory)Sales)DB)

Page 8: Building a Big Data Warehouse

GoDataDriven

Analytical Database (2.0)

Sales&

Inventory&Customers&

Products&Orders&

Page 9: Building a Big Data Warehouse

GoDataDriven

Basic DWH Architecture

TX#DB#

Analy+cal#DB#

BI#ETL

Page 10: Building a Big Data Warehouse

GoDataDriven

Data Marts

TX#DB# DW#

Sales#

Mktg#

Prch#

BI#

Page 11: Building a Big Data Warehouse

GoDataDriven

Multiple Data-Sources

other&

Files&

TX&DB&

DW&

Sales&

Mktg&

Prch&

BI&

Page 12: Building a Big Data Warehouse

GoDataDriven

Operational Data Store

DW#ODS#

other#

Files#

TX#DB# Sales#

Mktg#

Prch#

BI#

Page 13: Building a Big Data Warehouse

Hadoop

Page 14: Building a Big Data Warehouse

GoDataDriven

No Hadoop

DW#ODS#

other#

Files#

TX#DB# Sales#

Mktg#

Prch#

BI#

Page 15: Building a Big Data Warehouse

GoDataDriven

ETL Engine

other&

Files&

TX&DB& Sales&

Mktg&

Prch&

DW BI&

Page 16: Building a Big Data Warehouse

GoDataDriven

Tiered Data Warehouse

other&

Files&

TX&DB& Sales&

Mktg&

Prch&

BI&

Page 17: Building a Big Data Warehouse

GoDataDriven

Analytical Query Engine

other&

Files&

TX&DB&

BI&

Page 18: Building a Big Data Warehouse

Tools

Page 19: Building a Big Data Warehouse

GoDataDriven

Tools

Page 20: Building a Big Data Warehouse

Tools Applied

Page 21: Building a Big Data Warehouse

GoDataDriven

Tools Applied

Page 22: Building a Big Data Warehouse

Considerations

Page 23: Building a Big Data Warehouse

GoDataDriven

ConsiderationsBig Data is dirtyAutomate everythingMonitoring and QA become the same thing

Page 24: Building a Big Data Warehouse

My Past TrendsBig Data Forum 2012

Page 25: Building a Big Data Warehouse

GoDataDriven

My Past Trends

Cloud / On-demand

Page 26: Building a Big Data Warehouse

GoDataDriven

My Past Trends

Hadoop Hardware

Page 27: Building a Big Data Warehouse

GoDataDriven

My Past Trends

Batch → Real-Time

Page 28: Building a Big Data Warehouse

New TrendsXebiCon 2013

Page 29: Building a Big Data Warehouse

GoDataDriven

TrendsImpala

Open Source, Real-time Query enginefor Hadoop

Page 30: Building a Big Data Warehouse

GoDataDriven

Trends

Defacto standard for Hadoop metadata

Page 31: Building a Big Data Warehouse

GoDataDriven

Simple Database Architecture

Products)Customers)Orders)

Inventory)Sales)DB)

Page 32: Building a Big Data Warehouse

GoDataDriven

The future?

Products)Customers)Orders)

Inventory)Sales)

Page 33: Building a Big Data Warehouse

GoDataDriven

We’re hiring / Questions? / Thank you!

@[email protected]

Joris BontjeBig Data Hacker