Microsoft Azure ♥ R
Data Science with Microsoft Azure and R
Dmitry Petukhov, Microsoft Data Platform MVP, C# MCP, Big Data Enthusiast && Coffee Addicted
$$P(A_j \mid B) = \frac{P(A_j)\,P(B \mid A_j)}{\sum_{i=1}^{N} P(A_i)\,P(B \mid A_i)}$$
$$\langle \Omega, \mathfrak{U}, \mathbb{P} \rangle$$
Microsoft Azure + R. Prototype to Product Challenge
Prototyping: Flexibility, Distributed, Scalable, Fault-tolerance, Reliable
Production: Flexibility, Distributed, Scalable, Fault-tolerance, Reliable, + Big Data Ready, + LSML
Black Magic!
Migration
Microsoft Azure + R. Hello R!
Python is a COOL language! But R…
Specialized in statistical analysis
Time-effective => ideal for…
…prototyping, competitions, research, and fun!
Standalone computing => does not scale well
Open source
Big bearded community
[Diagram: infrastructure stacks for a data scientist, ordered along an axis from Local PC through Hybrid Model to Cluster (on-premises/on-demand), with cost of deployment/ownership rising from Low to High. Each stack is listed as Storage; Resource Management; Execution Engine; ML Framework.]
Local: Local Disc; Local OS; Python Runtime, R Runtime, Yet Another Runtime; scikit-learn, some library
MapReduce: HDFS; YARN; MapReduce; Mahout
Spark: HDFS / S3; YARN / Apache Mesos; Spark; MLlib
Python / R on Spark: HDFS / S3; YARN / Apache Mesos; Spark; Python/R tools
ML as a Service: Distributed FS; Dark Magic…; Python/R tools
Microsoft Azure + R. Infrastructures for Data Scientists
Microsoft Azure + R. Microsoft ♥ R
R Server for Azure HDInsight
Data Science VM
Azure Machine Learning: supports R script execution; allows authoring custom R modules; Jupyter Notebooks with R kernel support
Azure HDInsight: Hadoop/Spark cluster as a service
SQL Server R Services
Power BI: running R scripts and excellent visualization
R Tools for Visual Studio
Microsoft Azure
References
Reference: http://www.r-bloggers.com/using-microsoft-r-server-to-address-scalability-issues-in-r/
Microsoft Azure + R. Microsoft R Server Platform
Microsoft Azure + R. DistributedR: write once, deploy anywhere
R Server for Azure HDInsight. Killer features list:
100% open source R implementation;
workload running inside HDInsight (Hadoop/Spark).
Microsoft Azure + R. R Server for Azure HDInsight
R, Python, SQL, C#
Microsoft Azure + R. Data Science VM
Data Science VM inside:
Microsoft R Server Developer Edition, Anaconda Python distribution, Jupyter notebooks for Python and R, Visual Studio Community Edition with Python and R Tools, Power BI Desktop, SQL Server Express edition
ML libs: CNTK, xgboost and Vowpal Wabbit
Azure SDK
R Tools in Azure Machine Learning:
Support R-scripts execution;
Allow authoring custom R modules;
Jupyter Notebooks with R kernel support.
Microsoft Azure + R. Azure Machine Learning
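[Diagram: team workflow around Azure Machine Learning. Actors: You, Team. Components: Local, Cloud, Data, Azure Storage, Jupyter Notebook, Azure ML Studio, GitHub/TFS in Azure, Azure Machine Learning, model h(θ0, θn). Team activities: Share results, Source code, Ideas, Best practices, Contribute. Model steps: Publish model, REST API, Test model. Legend: Commands flow, Data flow, Request/response flow.]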
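The "Publish model, REST API, Request/response flow" step can be illustrated with a small client-side sketch. It is a minimal example under stated assumptions, not the service's definitive contract: the endpoint URL, API key, and column names are hypothetical placeholders, and the JSON layout (a named input with column names and row values, plus a bearer-token header) is assumed here and should be checked against the API help page of the published service. The request is only built, never sent.

```python
import json
import urllib.request


def build_scoring_request(endpoint_url, api_key, column_names, rows):
    """Build (but do not send) a POST request for a published scoring service."""
    # Assumed payload layout: one named input with column names and row values.
    payload = {
        "Inputs": {
            "input1": {
                "ColumnNames": list(column_names),
                "Values": [[str(value) for value in row] for row in rows],
            }
        },
        "GlobalParameters": {},
    }
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        endpoint_url,
        data=body,
        headers={
            "Content-Type": "application/json",
            # Assumed authentication scheme: the service key as a bearer token.
            "Authorization": "Bearer " + api_key,
        },
        method="POST",
    )


# Hypothetical endpoint, key, and features, for illustration only.
request = build_scoring_request(
    "https://example.invalid/score",
    "YOUR_API_KEY",
    ["age", "income"],
    [[42, 55000]],
)
```

To test a published model, the request would be sent with `urllib.request.urlopen(request)` and the JSON response decoded with `json.loads`.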
References
Cortana Intelligence and Machine Learning Blog
R for Azure Machine Learning. Quickstart
Machine Learning Algorithm Cheat Sheet
Machine Learning Hackathon. How to win?
Azure ML Repositories on GitHub
Microsoft Azure for all group on Facebook
Soon in Slack (invite form)
Microsoft Azure + R. References
© 2016 Dmitry Petukhov All rights reserved. Microsoft and other product names are or may be registered trademarks and/or trademarks in the U.S. and/or other countries.
Data Science must win!
Q&A
Now or later (send on
Ping me
Habr: @codezombie
LinkedIn: @dpetukhov
Facebook: @code.zombi
Read my tech code instinct blog (http://0xCode.in/)
Microsoft Azure + R. Stay in Touch!