Upload
imc-institute
View
4.450
Download
0
Tags:
Embed Size (px)
Citation preview
Big Data on Public Cloud
Assoc. Prof. Dr. Thanachart NumnondaExecutive DirectorIMC Institute13 March 2015
2
“B ัy 2015, 20% of Global 1000 organizationsWill have established a strategic focus on
information infrastructure ”
Gartner
3
Big Data Landscape
Source: Big Data in the Enterprise. When to Use What?
4
Big Data Landscape
Source : http://www.vitria.com/
5
6
NoSQL
7
A scalable fault-tolerant distributed system for data storage and processing
Completely written in javaOpen source & distributed under Apache license
What is Hadoop?
8
Hadoop Environment
Source: Hadoop in Practice; Alex Holmes
9
Major Hadoop Components
Hadoop Distributed File System(HDFS)
Map/Reduce System
10
Hadoop Distribution
Microsoft Azure
11
Big Data Future Architecture
Sscial Media Images e-mails Crawlers ERP CRM LOB APPs
Unstructured and Structured Data
Parallel Data Warehouse
Hadoop OnCloud
Hadoop OnPrivateServer
Connectors
SSRS
BI Platform
Familiar End User ToolsSpreadsheet Predictive Analytics
Data Market Place
NoSQL
Petabytes of Data(Unstructured)
Hundreds of TB of Data(structured)
12
Issue with Big Data Infrastructure
Large investment
Scalabilty
ROI
Business Cases
13
14Source : http://acloudyplace.com/
15
Big Data on Cloud
Using IaaS to leverage Cloud Vms
Using Big Data as a Services
16
Big Data Services on Cloud
Amazon Elastic Mapreduce
Microsoft Azure Hadoop
17
Big Data as a Service
18
19
Database as a Service
Amazon RDS
IBM SQL Database for Bluemix
Microsoft SQL Database
Google CloudSQL
20
NoSQL as a Service
Amazon DynomoDB
Google Cloud DataStore
Microsoft Azure DocumentDB
Cloudant on IBM Bluemix.
Mongo DB on Heroku
21
Hadoop as a Service
Amazon Elastic Map Reduce
Rackspace Cloud Big Data Platform
Qubole
Google Cloud Platform
IBM Bluemix: Analytic on Hadoop
Microsoft Azure HDInsight
22
23
24
Big Data on Amazon EMR
25
26
27
28
Big Data on Cloud Roadmap
Step 1: Build the business case
Step 2: Assess your Big Data applicationworkloads
Step 3: Develop a technical approach fordeploying and managing Big Data in the cloud
Step 4: Address governance, security, privacy,risk,
Step 5: Deploy, integrate, and operationalizeyour cloud-based Big Data infrastructure
Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS
29
Access your application workloads
Big-data storage
Big-data processing
Big-data development
Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS
30
Sample applications
Enterprise applications already hosted in thecloud
High-volume external data sources thatrequire considerable preprocessing
Tactical applications beyond your on-premises, Big Data capabilities
Elastic provisioning of very large but short-lived analytic sandboxes
Source : Deploying Big Data Analytics Applications to the Cloud: Roadmap for Success: CSCS
31
Demo
32
Amazon DynomoDB
33
Google BigQuery
34
Hadoop on Google
35
Amazon EMR
36
www.facebook.com/imcinstitute