43
MongoDB: Operational Big Data Senior Solutions Architect, MongoDB [email protected] Norberto Leite @nleite

Mongo DB: Operational Big Data Database

Embed Size (px)

DESCRIPTION

MongoDB is the leading NoSQL database due to a plenitude of reasons, open source, general purpose, document oriented database supported by a large community and educational platform. It's horizontal scalability features allows this to fit in the operational big data scenarios where the business needs point to realtime analytics and ever-increasing data sets. This talk will focus on the usage of MongoDB for big data operational purposes and why it's ideal to be used in such scenarios. Also integration with other notable big data technology out there like Hadoop and BI tools. Norberto Leite - Senior Solutions Architect, @MongoDB. Mongo DB presentation during the Pentaho & Big Data Ecosystem - Live Seminar 2013

Citation preview

Page 1: Mongo DB: Operational Big Data Database

MongoDB: Operational Big Data

Senior Solutions Architect, MongoDB

[email protected]

Norberto Leite

@nleite

Page 2: Mongo DB: Operational Big Data Database

Agenda

•  MongoDB Intro

•  Big Data

•  MongoDB Operation Big Data(base)

•  Use Cases

•  QA

Page 3: Mongo DB: Operational Big Data Database

Ola!

•  Norberto Leite

•  Solutions Architect –  wingman

•  Barcelona/Brussels

Page 4: Mongo DB: Operational Big Data Database

MongoDB

Page 5: Mongo DB: Operational Big Data Database

MongoDB

The leading NoSQL database

Document Database

Open-Source

General Purpose

Page 6: Mongo DB: Operational Big Data Database

5,000,000+ MongoDB Downloads

100,000+ Online Education Registrants

20,000+ MongoDB User Group Members

20,000+ MongoDB Days Attendees

20,000+ MongoDB Management Service (MMS) Users

Global Community

Page 7: Mongo DB: Operational Big Data Database

MongoDB Overview

300+ employees 600+ customers

Offices in New York, Palo Alto, Washington DC, London, Dublin, Barcelona and Sydney Over $231 million in funding

Page 8: Mongo DB: Operational Big Data Database

MongoDB Overview

Agile Scalable

Page 9: Mongo DB: Operational Big Data Database

MongoDB Vision

To provide the best database for how we build and run apps today

Build –  New and complex data –  Flexible –  New languages –  Faster development

Run –  Big Data scalability –  Real-time –  Commodity hardware –  Cloud

Page 10: Mongo DB: Operational Big Data Database

Operational Database Landscape

Page 11: Mongo DB: Operational Big Data Database

Document Data Model

Relational MongoDB { ! first_name: ‘Paul’,! surname: ‘Miller’,! city: ‘London’,! location: [45.123,47.232],! cars: [ ! { model: ‘Bentley’,! year: 1973,! value: 100000, … },! { model: ‘Rolls Royce’,! year: 1965,! value: 330000, … }! }!}!

Page 12: Mongo DB: Operational Big Data Database

MongoDB is full featured

Rich Queries •  Find Paul’s cars •  Find everybody in London with a car built

between 1970 and 1980

Geospatial •  Find all of the car owners within 5km of Trafalgar Sq.

Text Search •  Find all the cars described as having leather seats

Aggregation •  Calculate the average value of Paul’s car collection

Map Reduce •  What is the ownership pattern of colors by

geography over time? (is purple trending up in China?)

{ ! first_name: ‘Paul’,! surname: ‘Miller’,! city: ‘London’,! location: [45.123,47.232],! cars: [ ! { model: ‘Bentley’,! year: 1973,! value: 100000, … },! { model: ‘Rolls Royce’,! year: 1965,! value: 330000, … }! }!}!

MongoDB

Page 13: Mongo DB: Operational Big Data Database

Developers are more productive

Page 14: Mongo DB: Operational Big Data Database

Big Data

Page 15: Mongo DB: Operational Big Data Database

Best definition so far!

Page 16: Mongo DB: Operational Big Data Database

RDBMS Scale = Bigger Computers

“Clients can also opt to run zEC12 without a raised datacenter floor -- a first for high-end IBM mainframes.”

IBM Press Release 28 Aug, 2012

Page 17: Mongo DB: Operational Big Data Database

Vertical Scalability

Page 18: Mongo DB: Operational Big Data Database
Page 19: Mongo DB: Operational Big Data Database

This Was a Problem for Google

Source: http://googleblog.blogspot.com/2010/06/our-new-search-index-caffeine.html

250

,000

+ M

BP

’s =

= 4.

1 m

iles 2010 Search Index Size:

100,000,000 GB

New data added per day 100,000+ GB

Databases they could use 0

Page 20: Mongo DB: Operational Big Data Database

And for Facebook

2010: 13,000,000 queries per second

Page 21: Mongo DB: Operational Big Data Database

And for Facebook

2010: 13,000,000 queries per second

TPC Top Results

TPC #1 DB: 504,161 tps

Page 22: Mongo DB: Operational Big Data Database

And for Facebook

2010: 13,000,000 queries per second

TPC Top Results

TPC #1 DB: 504,161 tps

Top 10 combined: 1,370,368 tps

Page 23: Mongo DB: Operational Big Data Database

Living in the Post-transactional Future

Order-processing systems largely “done” (RDBMS); primary focus on better search and recommendations or adapting prices on the fly (NoSQL)

Vast majority of its engineering is focused on recommending better movies (NoSQL), not processing monthly bills (RDBMS)

Easy part is processing the credit card (RDBMS). Hard part is making it location aware, so it knows where you are and what you’re buying (NoSQL)

Page 24: Mongo DB: Operational Big Data Database

Shift in What We’re Computing

Page 25: Mongo DB: Operational Big Data Database

How IT/Data Scientists Define Big Data

Source: Silicon Angle, 2012

Page 26: Mongo DB: Operational Big Data Database

MongoDB Operational Big Data(base)

Page 27: Mongo DB: Operational Big Data Database

Consideration – Online vs. Offline

•  Long-running •  High-Latency •  Availability is lower priority

•  Real-time •  Low-latency •  High availability

Online Offline vs.

Page 28: Mongo DB: Operational Big Data Database

Consideration – Online vs. Offline

Online Offline vs.

Page 29: Mongo DB: Operational Big Data Database

MongoDB/NoSQL Is Good for…

360° View of the Customer

Mobile & Social Apps Fraud Detection

User Data Management

Content Management &

Delivery Reference Data

Product Catalogs Machine to Machine Apps Data Hub

Page 30: Mongo DB: Operational Big Data Database

MongoDB and Enterprise IT Stack

EDW Hadoop

Man

agem

ent &

Mon

itorin

g Security &

Auditing

RDBMS

CRM, ERP, Collaboration, Mobile, BI

OS & Virtualization, Compute, Storage, Network

RDBMS

Applications

Infrastructure

Data Management

Online Data Offline Data

Page 31: Mongo DB: Operational Big Data Database

Horizontal Scalability

Page 32: Mongo DB: Operational Big Data Database
Page 33: Mongo DB: Operational Big Data Database

MongoDB Architecture

Page 34: Mongo DB: Operational Big Data Database

Use Cases

Page 35: Mongo DB: Operational Big Data Database

Leading Organizations Rely on MongoDB

Page 36: Mongo DB: Operational Big Data Database

Fortune 500 & Global 500

•  10 of the Top Financial Services Institutions

•  10 of the Top Electronics Companies

•  10 of the Top Media and Entertainment Companies

•  8 of the Top Retailers

•  6 of the Top Telcos

•  5 of the Top Technology Companies

•  4 of the Top Healthcare Companies

Page 37: Mongo DB: Operational Big Data Database

Data Hub User Data Management

Big Data Content Mgmt & Delivery Mobile & Social

MongoDB Solutions

Page 38: Mongo DB: Operational Big Data Database

Customer example: Online Travel

Travel

•  Flights, hotels and cars •  Real-time offers •  User profiles, reviews •  User metadata (previous

purchases, clicks, views)

•  User segmentation •  Offer recommendation engine •  Ad serving engine •  Bundling engine

Algorithms

MongoDB Connector for

Hadoop

Page 39: Mongo DB: Operational Big Data Database

Machine Learning

Ad-Serving

•  Catalogs and products •  User profiles •  Clicks •  Views •  Transactions

•  User segmentation •  Recommendation engine •  Prediction engine

Algorithms

MongoDB Connector for

Hadoop

Page 40: Mongo DB: Operational Big Data Database

Data Hub

Insurance

•  Insurance policies •  Demographic data •  Customer web data •  Call center data •  Real-time churn detection

•  Customer action analysis •  Churn prediction

algorithms

Churn Analysis

MongoDB Connector for

Hadoop

Page 41: Mongo DB: Operational Big Data Database

Obrigado!

Norberto Leite

@nleite

Senior Solutions Architect, MongoDB

[email protected]

Page 42: Mongo DB: Operational Big Data Database

QA ?

Page 43: Mongo DB: Operational Big Data Database