Upload
deepak-ramanathan
View
527
Download
4
Tags:
Embed Size (px)
DESCRIPTION
Big Data Analytics
Citation preview
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
IT STRATEGY FOR SCALABLE ANALYTICS, MODERN DATA ARCHITECTURES
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
MODERN ARCHITECTURES
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
STUNNING FACT
Making the Modern World: Materials and Dematerialization - Vaclav Smil
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
Scarcity
• Technology constrained
• Process-centric
• Focus on cost control
Everything is forbidden unless it is permitted
Abundance
• Focus on value
• Discovery-centric
• Technology empowered
Everything is permitted unless it is forbidden
Shift in Mindset
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
Trends Big Data, Storage, Hadoop & In-memory Technology
Vertica
Teradata
Greenplum
Oracle
Microsoft PDW
Hadoop
$- $20,000 $40,000 $60,000 $80,000 $100,000
Today 2009
Cost of Storage, Memory, Computing • In 2000 a GB of Disk $17 today < $0.07• In 2000 a GB of Ram $1800 today < $1• In 2009 a TB of RDBMS was $70K today < $ 20K
Cost per Terabyte
THE PERFECT STORM: STORAGE TECHNOLOGY COSTS AND CPU SPEED
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
MODERN REALITY
• Commoditization• Architectures• ScaleInfrastructure
• New Complex Streams• Perishable Considerations• Cost Data
• New Category of Business Problems• Analytical Algorithms• OperationalizationAnalytics
8Copyright © 2011, SAS Institute Inc. All rights reserved.
Finding treasures in unstructured datalike social media or survey tools
that could uncover insightsabout consumer sentiment
Mine transaction databases for data of spending patterns that indicate a stolen card..
Leveraging historical data to drive better insight into decision-makingfor the future
Analyze massiveamounts of data inorder to accurately
identify areas likely toproduce the mostprofitable results
FORECASTING
DATA MINING
TEXT ANALYTICS
OPTIMIZATION
STATISTICS
ADVANCED ANALYTICS
INFORMATIONMANAGEMENT
Copyright © 2011, SAS Institute Inc. All rights reserved.
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
CURRENT TRENDS IN ANALYTICS
Complex Business Problems Are Driving Analytics Innovation
Speed Will Be Of Essence
Leverage Analytics To Unlock The Information Contained In Unstructured Data
Operationalizing Analytics
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
CURRENT AND FUTURE ARCHITECTURES
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
WHERE WE ARE TODAY?
SETTING THE SCENE
Operational Data Sources
EDW
Data Mart
Data Mart
Analytic Mart
Analytic Mart
BI and Analytics
Unstructured, Semi-structured and Streaming data (i.e. sensor data) handled often outside the Warehouse flow
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
WHERE DOES HADOOP FIT?
HADOOP AS A “NEW DATA” STORE
Operational Data Sources
EDW
Data Mart
Data Mart
Analytic Mart
Analytic Mart
BI and Analytics
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
WHERE DOES HADOOP FIT?
HADOOP AS AN ADDITIONAL INPUT TO THE EDW
Operational Data Sources
EDW
Data Mart
Data Mart
Analytic Mart
Analytic Mart
Analytic Mart
Data Mart
BI and Analytics
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
WHERE DOES HADOOP FIT?
HADOOP DATA PLATFORM AS A “STAGING LAYER” AS PART OF A “DATA LAKE” – Downstream stores could be Hadoop, data appliances or an RDBMS
Data Mart
Operational Data Sources EDW
Data Mart
Analytic Mart
Analytic Mart
BI and Analytics
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
15
SAS BIG DATA STRATEGY – SAS AREAS
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
Impala
SAS & HADOOP SAS® WITHIN THE HADOOP ECOSYSTEM
Next-GenSAS® User
User Interface
Metadata
Data Access
DataProcessing
FileSystem
SAS® User
MPI Based
SAS® LASR™ AnalyticServer
SAS® High-Performance
Analytic Procedures
HDFS
Base SAS & SAS/ACCESS® to Hadoop™
SAS Metadata
Pig
Map Reduce
In-MemoryData Access
SAS® Visual Analytics
SAS®
Enterprise Miner™
SAS® Data Integration
SAS®
EnterpriseGuide®
Hive
SAS Embedded Process
Accelerators
SAS® In-Memory Statistics for
Haodop
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
IDENTIFY /FORMULATE
PROBLEM
DATAPREPARATION
DATAEXPLORATION
TRANSFORM& SELECT
BUILDMODEL
VALIDATEMODEL
DEPLOYMODEL
EVALUATE /MONITORRESULTS
IN SUMMARY SAS ENABLES THE ENTIRE LIFECYCLE AROUND HADOOP
SAS Visual AnalyticsSAS Visual StatisticsSAS In-Memory Statistics for Hadoop
Done using either the Data Preparation, Data Exploration or Build Model Tools
SAS High Performance Analytics Offerings supported by relevant clients like SAS Enterprise Miner, SAS/STAT etc.
Decision Manager
SAS Scoring Accelerator for HadoopSAS Code Accelerator for Hadoop
SAS Visual AnalyticsDecision Manager
Done using either the Data Preparation, Data Exploration or Build Model Tools
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
SAS® VISUAL ANALYTICSA SINGLE SOLUTION FOR DATA DISCOVERY,
VISUALIZATION, ANALYTICS AND REPORTING
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
SAS® VISUAL ANALYTICS
EXAMPLE: TEXT ANALYSIS GIVES YOU INSIGHT TO CUSTOMER EXPERIENCE AND OPINION
VISUALIZATION POWERED BY SAS ANALYTICS Analytics applied
to text provides real MEANING
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
VISUALIZATION EXAMPLES
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
SAS® VISUAL STATISTICS
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
DATA TO DECISION LIFECYCLE
SAS® Visual StatisticsTEXT
COMPETITIVEADVANTAGE
MANAGE DATA
EX
PL
OR
ED
ATA
DEVELOP MODELS
DE
PL
OY
&
MO
NIT
OR
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
APPLICATION AREAS
Segmentation
Classification
Prediction
Ad-hoc Discovery
Data Preparation
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
SAS IN-MEMORY STATISTICS FOR HADOOP
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.
SAS® IN-MEMORY STATISTICS FOR
HADOOP
WHY IT IS IMPORTANT?
SPEED
Multi-user interactive analytics environment for increased productivity
Proven state-of-the-art statistical algorithms and machine learning techniques
Highly scalable, in-memory environment grows easily as needed
Memory and data efficient for a significant reduction of data latency to rapidly analyze large and complex data in Hadoop
PRECISION
INTERACTIVE
SCALABLE
Copy r ight © 2013, SAS Ins t i tu te Inc . A l l r ights reserved.sas.com