32
Separating Hadoop Myths from Reality Rob Anderson

Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

Embed Size (px)

DESCRIPTION

According to Gartner, Hadoop is near the top of the Hype Cycle. While some customers have questions about the enterprise capabilities of Hadoop, the answers are clear as production deployments continue to expand. This session will use successful customer experiences to highlight the power of Hadoop and separate the myths from reality.

Citation preview

Page 2: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

1  

The  Myths  &  Reali.es  Surrounding  Hadoop    Rob  Anderson  VP  Systems  Engineering  

Page 3: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

2  

Sales  

SCM   CRM  

Public  

Web  Logs  Produc7on  

Data  Sensor    Data  Click  

Streams  Loca7on  

Social  Media  

Billing  

Enterprise  Data  Hub  

Hadoop  Changes  Analy.cs  

“Simple  algorithms  and  lots  of  data  trump  complex  models  ”  

Halevy,    Norvig,  and    Pereira,  Google  IEEE  Intelligent  Systems    

Page 4: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

3  

Page 5: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

4  

Page 6: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

5  

Data  Warehouse  

Volume  

Variety  

Velocity  

Page 7: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

6  

Page 8: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

7  

Big Data is hard to move…because it’s BIG

Page 9: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

8  

What  was  the  genius  of  Hadoop?  

§  Fueling  an  industry  revolu7on  by  providing  infinite  capability  to  store  and  process  big  data  

§  Expanding  analy7cs  across  data  types  

§  Compelling  economics  –   20  to  100X  more  cost  effec7ve  than  alterna7ves  

Page 10: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

9  

Page 11: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

10  

Random  Wri.ng  in  MapR  S1

S2

S3 S5 S4

S1, S2, S4 S1, S3 S1, S4, S5 S2, S4, S5 S3

Client  wri.ng  data  

CLDB  Ask  for  64M  block  

Create  cont.  

Picks  master  and  2  replica  slaves  

Write  next  chunk  to  S2  

S2, S3, S5

aZach  

Page 12: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

11  

Page 13: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

12  

MapR  Spout  

TwiZer  

TwiZer      API  

TwiZerLogger  

Storm        MapR  

Op7onal  MapReduce  

DFS  

Page 14: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

13  hZp://www.flickr.com/photos/onemoreshotrog/8085462024/  

Page 15: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

14  

Hadoop  Distribu.ons  

Page 16: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

Hadoop:  The  Disrup.ve  Technology    at  the  Core  of  Big  Data  

Page 17: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

16  

Page 18: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

17  

The  Reality  is    Architecture  MaHers  

Page 19: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

MapR  Data  System  

Architecture  Comparison  

HBase  

JVM  

HDFS  

JVM  

ext3/ext4  

Disks  

Other  Distribu7ons  

Disks  

MapR  M7  

Page 20: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

Architecture  Results  

Results  with  other  distribu.ons  

Results  with  MapR  M7  

Page 21: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

20  

Page 22: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

Produc.on  Success  with  Hadoop  

Page 23: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

22  

2000+  Nodes  Fortune  100  Retailer  

Page 24: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

23  

1000+  Nodes  Fortune  100  Financial  Services  Company  

Page 25: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

24  

Page 26: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

25  

Produc7on  Hadoop  in    Waste  Management  

Waste  Management  Logis.cs  

Page 27: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

26  

Suntory  whiskey  

Page 28: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

27  

Page 29: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

28  

Unique  Iden.ty  Ini.a.ve,  India    

Page 30: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013
Page 31: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013

30  

 Thank  you  Big  Data  Spain!  

Page 32: Separating Hadoop Myths from Reality by ROB ANDERSON at Big Data Spain 2013