22
Overview SCALE14x 2016

Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Embed Size (px)

DESCRIPTION

What is Bigtop? Setting the standard for testing, packaging and integration of leading big/fast data components

Citation preview

Page 1: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Overview

SCALE14x 2016

Page 2: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Agenda/Schedule-Apache Bigtop Overview-Apache Spark Overview/Getting Started-Lunch Break-Apache Ignite-Workshop, tutorial, open time

http://workshops.bigtop.rocks(click on Agenda button)

Page 3: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

What is Bigtop?

Setting the standard for testing, packaging and integration of leading big/fast data components

Page 4: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

and many other…

Components as Building Blocks

Page 5: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

-------------------------------------------------------------------------

Dependency Hell!!

hdfszookeeperhbasekafkaspark...mapredooziehiveetc ---

------

------

------

------

------

------

------

------

------

-

------

------

------

------

------

------

------

------

------

----

------

------

------

------

------

------

------

------

------

----

------

------

------

------

------

------

------

------

------

----

------

------

------

------

------

------

------

------

------

----

------

------

------

------

------

------

------

------

------

----

Build all the Things!!!

Page 6: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

The BOMBuild of Materials (BOM)

* List of >=1 components* Gradle for build/actions* Produce sets of debs/rpms

Page 7: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Bigtop OriginsYahoo!, 2010

Created, fostered early Hadoop communityWorking on Hadoop 0.20 stack

2011Yahoo!’s to Cloudera, solving early problems of packaging and maintaining first commercial supported Hadoop distro

Page 8: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Early value addProvide a common foundation for proper integration of growing number of Hadoop family components

Foundation provides solid base for validating applications running on top of the stack(s)

Provide neutral packaging and deployment/config

Page 9: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Early Mission AccomplishedFoundation for commercial Hadoop distros/services

Leveraged by app providers…

Page 10: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

What now?

We are done right?1?!?

Page 11: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Industry/Ecosystem Evolution&

New Community Needs/Ideas

Page 12: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Where should we spend our time?,which users should benefit?

Page 13: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Moving beyond oob mapreduce…

Page 14: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Lambda/Stream Architectures

HDFS + Zookeeper +

Page 15: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Get out from the Apache dome

Page 16: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

New focus and target end users

Data engineers vs distro builders

Enhance Operations/Deployment

Reference implementations & tutorials

Page 17: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Laying new foundation with 1.0+Self-starter, non-kitchen sink building -Making gradle tooling smarter -Jenkins job autogen -leveraging containers for parallelization

Page 18: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Data data data…Smarter/Realistic test data -bigpetstore -bigtop-bazaar -weather data gen

Tutorial/Learning Data sets -githubarchive.org -more tbd…

Page 19: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Deployment/MgmtUpdated puppet modules -newest best practices -next level enhanced security options

Wider range of starter deployment topologies

Include some handling of test/tutorial data

Page 20: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

More components…

Page 21: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Sounds interesting, how can I help?

*Join mailing list, ask questions, suggest features, etc

*Contribute (components, tutorials, docs)

*Report bugs

Page 22: Overview SCALE14x 2016. Agenda/Schedule -Apache Bigtop Overview -Apache Spark Overview/Getting Started -Lunch Break -Apache Ignite -Workshop, tutorial,

Thank You, Q&A

Nate D’[email protected]@kaiyzen