18
1 Hosted by Barry Thompson, Founder & CTO of Tervela

Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

  • Upload
    tervela

  • View
    1.138

  • Download
    1

Embed Size (px)

DESCRIPTION

Under the umbrella of big data, the nature of data warehousing inside enterprises is undergoing a massive transformation. Originally designed as a clearinghouse for organizing data to discover and analyze historical trends, business units are now putting extreme pressure on their data groups to enhance their services. Their goals: provide better customer service, real-time marketing, and more efficient business operations.In this webcast, Big Data expert Barry Thompson will discuss how will enterprise data warehouses are evolving to meet these challenges. Some of the topics we will cover include:- How Hadoop and other big data technologies are coexisting with traditional data warehouses - Dealing with multiple big data sources – and multiple versions of the truth - Techniques like warehouse replication and parallel data loading that enable platforms with different levels of service for different types of applications

Citation preview

Page 1: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

1

Hosted by Barry Thompson,

Founder & CTO of Tervela

Page 2: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

What We’ll Discuss Today…

• How is the role of the data warehouse changing in the face of big data?

• How are Hadoop and other big data technologies coexisting with traditional data warehouses?

• What happens when we have multiple big data sources (and multiple versions of the truth)?

• How do I use replication, data loading, cloud integration, and other technologies during this transition period?

2

Page 3: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

About the presenter...

• Barry Thompson

• Founder and CTO of Tervela

• Visionary with 20 years of experience

• Background in transformative technologies (robotics, imaging & traditional enterprise)

• Technology leadership for AIG, NatWest and UBS

• X-Prize board of trustees

3

Page 4: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

Data Complexity Exploding

4

X X X

End PointEnd PointProliferationProliferation

End PointEnd PointProliferationProliferationData Data

ExplosionExplosion

Data Data ExplosionExplosion RegulatoryRegulatory

RequirementsRequirements

RegulatoryRegulatoryRequirementsRequirementsGlobalGlobal

DistributionDistribution

GlobalGlobalDistributionDistribution

30 billion30 billionpieces of content shared on Facebook every month

30 million30 millionnetworked sensor nodes with 30% annual growth 5 billion5 billion

mobile phones in use in 2010

40 billion 40 billion Devices connected to the Internet by the end of the decade

78%78%

23%23%of Asia is on the

Internet

58%58%of Europe is

on the Internet

of North America is on the Internet

Dodd-FrankDodd-Frank

Basel IIIBasel III

Consumer ProtectionConsumer Protection

More DataBy More People and Apps

In More PlacesFaster

HIPAAHIPAA

64 exabytes64 exabytesAmount of data moved around the Internet per month by the end of the decade

Page 5: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

It Should Be Easy

5

Operational Data Stores

Traditional Data Warehouses

Hadoop Map-Reduce

Transactional Data

Structured Analysis

Unstructured Analysis

Page 6: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

But It’s Not

6

Operational Data Stores

Traditional Data Warehouses

Hadoop Map-Reduce

Transactional Data

Structured Analysis

Unstructured Analysis

NoSQLNoSQL

Real-Time Decision Real-Time Decision SupportSupport

Real-Time Real-Time OperationsOperations

Real-Time Real-Time AnalyticsAnalytics

ETL ETL ReplacementReplacement

Page 7: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

What’s Driving This Activity?

7

Accessibility of Big Data Streams

Multi-Format, Multi-Type

Inconsistent Ingest Rates

Scaling Across Geographies

Explosion in Real-Time Analytics

Page 8: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

A Question For You…

1) We have an integrated Hadoop - Data Warehouse strategy

2) We aren't sure how Hadoop should fit with our warehouse

3) There's no interaction between Hadoop and our Data Warehouse

4) We aren't running Hadoop

5) I don’t know

8

What is the relationship at your company between Hadoop and your corporate data warehouse?

Page 9: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

Warehousing… The Old Way

9

Operational Data Store (Database)

Data Feeds & Web Services

FlatFiles

ETL

Data Warehouse

Data Mart Business Report

Data Mart Analytic App

Slows down data

availability

Slows down data

availability

Single location, single point of

failure

Single location, single point of

failure

Inflexible data formats

Inflexible data formats

I don’t fit I don’t fit

Page 10: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

The New Warehouse Paradigm

10

Operational Data Store (Database)

Data Feeds & Web Services

FlatFiles

Business User

ETLData Warehouse

Data Mart

Data Mart

Analytic AppETL

Backup Warehouse

Analytic AppHadoop

Real-Time Console

Real-time apps get

immediate access to data

Real-time apps get

immediate access to data

The right format, the right

processing

The right format, the right

processing

DR & Backup for Big Data

DR & Backup for Big Data

Dat

a F

abric

Dat

a F

abric

Page 11: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

What is a Data Fabric?

11

Features•Data Capture•Data Movement•Data Availability•Data Protection•Data Management

Tervela Data FabricSoftware, Hardware Appliances

or Cloud Services

apps & SOA file systems DBs ODS/clusters clouds

clouds warehouses analytics

Data Stores

Data Sources

Requirements•High performance•No loss•Centralized Management & Visibility •Ease of integration•5 9’s of reliability

Page 12: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

High-Performance & Parallel Loading

12

• Guaranteed delivery of data into multiple systems• Buffered and streamed to deal with slow consumers• Efficient multi-casting avoids excessive network traffic

Page 13: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

Real-Time Analytics

13

• Streaming avoids bottlenecks in ETL or warehousing• Delivers the right format for your analytic system• Best way to handle the explosion of analytic apps

Page 14: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

Cloud Integration

14

• Buffering simplifies big data transfer over slow WANs • Stream data between cloud apps without temp storage• Bridge your cloud apps with on-premise systems

Page 15: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

Global Data Synchronization

15

• Backup heterogeneous Big Data over unreliable WANs• Create active-active configuration for DR & scale• Geographic distribution for better local performance

Page 16: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

Big Data Replication

16

• 10-100x faster than existing / native replication over WAN• Multi-cast replication saves bandwidth• Local data improves performance

Page 17: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

For More Information

• @tervela• [email protected]• www.tervela.com

17

Request a trial:

Read some case studies:

http://tervela.com/download

http://tervela.com/customers

Page 18: Hadoop, Big Data, and the Future of the Enterprise Data Warehouse

Thank you!

18