14
Microsoft Azure R Data Science with Microsoft Azure and R Dmitry Petukhov, Microsoft Data Platform MVP, C# MCP, Big Data Enthusiast && Coffee Addicted ( | ) = ( ) ( ) =1 ( ) ( ) Ω, ,ℙ

Microsoft Azure + R

Embed Size (px)

Citation preview

Page 1: Microsoft Azure + R

Microsoft Azure ♥ RData Science with Microsoft Azure

and R

Dmitry Petukhov,Microsoft Data Platform MVP, C# MCP, Big Data Enthusiast && Coffee Addicted

𝑃 (𝐴 𝑗|𝐵 )=𝑃 (𝐴 𝑗 )𝑃 (𝐵∨𝐴 𝑗)

∑𝑖=1

𝑁

𝑃 (𝐴𝑖 )𝑃 (𝐵∨𝐴𝑖)

⟨Ω ,𝔘 ,ℙ ⟩

Page 2: Microsoft Azure + R

Microsoft Azure + R. Prototype to Product Challenge

PrototypingFlexibilityDistributedScalableFault-toleranceReliable

ProductionFlexibilityDistributedScalableFault-toleranceReliable+ Big Data Ready+ LSML

Black Magic!

Migration

Page 3: Microsoft Azure + R

Microsoft Azure + R. Hello R!

Python is a COOL language!But R…

Specialized in statistical analyzeTime-effective => ideal for…

…prototyping, competition, researching, and for fun!

Standalone computing => not bad scalable Open sourceBig bearded community

Page 4: Microsoft Azure + R

Storage

ResourceManagement

ML Framework

Execution Engine

Local OS

Local Disc

Pyth

on R

untim

e

Yet A

noth

er

Runt

ime

scikitlearn

HDFS

YARN

MapReduce

Mahout

HDFS / S3

YARN / Apache Mesos

Spark

MLlib

HDFS / S3

YARN / Apache Mesos

Python / R on Spark

Python/Rtools

Spark

Local PC Hybrid Model Cluster (on-premises/on-demand)

somelibrar

y

Machine Learning in Finance. Infrastructure for Data Scientist

Low HighCost of deployment/ownership

Distributed FS

Dark Magic…

ML as a Service

Python/Rtools

Microsoft Azure + R. Infrastructures for Data Scientists

Page 5: Microsoft Azure + R

Microsoft Azure + R. Microsoft ♥ R

R Server for Azure HDInsightData Science VMAzure Machine Learning

Support R-scripts executionAllow authoring custom R modulesJupyter Notebooks with R kernel support

Azure HDInsightHadoop/Spark-cluster as a Service

SQL Server R ServicesPower BI

Running R Scripts & excellent visualization R Tools for Visual Studio

Microsoft Azure

Page 6: Microsoft Azure + R

References

Reference: http://www.r-bloggers.com/using-microsoft-r-server-to-address-scalability-issues-in-r/

Microsoft Azure + R. Microsoft R Server Platform

Page 7: Microsoft Azure + R

References

Reference: http://www.r-bloggers.com/using-microsoft-r-server-to-address-scalability-issues-in-r/

Microsoft Azure + R. DistributedR: write once, deploy anywhere

Page 8: Microsoft Azure + R

R Server for Azure HDInsight Killer features list:

100% open source R implementation;workload running inside HDInsight (Hadoop/Spark).

Microsoft Azure + R. R Server for Azure HDInsight

Page 9: Microsoft Azure + R

R, Python, SQL, C#

Microsoft Azure + R. Data Science VM

Microsoft R Server Developer Edition, Anaconda Python distribution, Jupyter notebooks for Python and R, Visual Studio Community Edition with Python and R Tools, Power BI desktop, SQL Server Express editionML libs: CNTK, xgboost and Vowpal WabbitAzure SDK

Data Science VM inside:

Page 10: Microsoft Azure + R

R Tools in Azure Machine Learning:Support R-scripts execution;Allow authoring custom R modules;Jupyter Notebooks with R kernel support.

Microsoft Azure + R. Azure Machine Learning

Page 11: Microsoft Azure + R

Microsoft Azure + R. Azure Machine Learning

Team

Data

JupyterNotebook

Azure ML Studio

GitHub/TFS in Azure

Share results

Source codeIdeas

Best practices

Cloud

Contribute

Azure Machine Learning

You

h(θ0, θn)

Azure Storage

Local

Commands flow

Data flow

Publish model

REST API

Test model

Request/response flow

Page 12: Microsoft Azure + R

ReferencesCortana Intelligence and Machine Learning BlogR for Azure Machine Learning. QuickstartMachine Learning Algorithm Cheat SheetMachine Learning Hackathon. How to win?Azure ML Repositories on GitHubMicrosoft Azure for all group on FacebookSoon in Slack (invite form)

Microsoft Azure + R. References

Page 13: Microsoft Azure + R

© 2016 Dmitry Petukhov All rights reserved. Microsoft and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.

Data Science must win!

Page 14: Microsoft Azure + R

Q&ANow or later (send on

[email protected])

Ping meHabr: @codezombie

LinkedIn: @dpetukhovFacebook: @code.zombi

Read my tech code instinct blog ( http://0xCode.in/ )

Microsoft Azure + R. Stay in Touch!