Upload
jorge-millan-cabrera
View
100
Download
0
Embed Size (px)
Citation preview
2017 - Madrid
Overview of Azure Data FactoryCarlos SacristánData & Analytics Solution Architect, Kabel
#GIBMad2017
Who am I?
Carlos SacristánData & Analytics Solution Architect, Kabel
https://twitter.com/sacrisql
+34 649 425 928
https://www.linkedin.com/in/csacristan/
#GIBMad2017
What is Azure Data Factory
ADF is a cloud-based data integration service that
orchestrates and automatesthe movement and transformation of data
Think of it like a manufacturing factory running equipment to take the raw materials and transform them into finished goods
#GIBMad2017
What is ADF
Mmmm… but we already have things like Integration Services or Stream Analytics
#GIBMad2017
Just one thing. Scheduling
Pipeline Active Periods
Activity Schedule
Dataset Availability
#GIBMad2017
Customer Churn
Azure Blob Storage
Game Log Files
Customer Table
On Premises
Data Mart
Game Logs
Customer Table
Azure DB
Customer
Game Usage
Visualize
Data Set(Collection of files, DB table, etc)
Activity: a processing step (Hadoop job, custom code, ML model, etc)
Pipeline: a sequence of activities (logical group)
Data Sources Ingest Transform & Analyze Publish
Customer
TableGeocode
Transform, Combine, etc Analyze Move
#GIBMad2017