Apache Beam @ GCPUG.TW Flink.TW 20161006


Apache Beam in Data Pipeline

Randy Huang 2016/10/06

Who am I

• Data Architect @ VMFive

• Fluentd/Embulk fans

Overview

• Define Data Pipeline

• Architecture

• How to write Beam pipelines

• Demo

Data Pipeline

Input → Algorithm → Output

Why Apache Beam?

The data pipeline world is chaos.

Goal

• Provide an abstraction layer between the data-processing code and the execution runtime.

• Batch processing and streaming jobs in one world.

• The Beam SDK opens the door to write once, run anywhere* (see the sketch below).

* on-premises and non-Google clouds
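A minimal sketch of what that abstraction looks like in the Beam Java SDK: the pipeline code never names a runner, and the runner is chosen from the launch arguments. (Method names here follow the stable Beam Java API; the 2016 incubating releases spelled a few of them differently.)

  import org.apache.beam.sdk.Pipeline;
  import org.apache.beam.sdk.options.PipelineOptions;
  import org.apache.beam.sdk.options.PipelineOptionsFactory;

  public class RunAnywhere {
    public static void main(String[] args) {
      // The runner (Dataflow, Flink, Spark, ...) comes from --runner=...,
      // so nothing in the pipeline code depends on the execution engine.
      PipelineOptions options = PipelineOptionsFactory.fromArgs(args).create();
      Pipeline p = Pipeline.create(options);
      // ... apply the same transforms regardless of runner ...
      p.run();
    }
  }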

Supported Runners

• Google Cloud Dataflow (blocking/non-blocking execution)

• Apache Flink 1.1.2

• Apache Spark 1.6.2 (Hadoop 2.2.0, Kafka 0.8.2.1)
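The same jar can then target any of these runners with a one-flag change. A usage sketch (the jar name and flag values are placeholders, but the --runner option itself is standard Beam):

  java -jar pipeline.jar --runner=DirectRunner
  java -jar pipeline.jar --runner=FlinkRunner --flinkMaster=localhost:6123
  java -jar pipeline.jar --runner=SparkRunner
  java -jar pipeline.jar --runner=DataflowRunner --project=my-gcp-project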

API, model, and engine

Architecture

• Pipelines

• Translators

• Runners

Programming tips / Flink

• Use the Flink DataStream API in Java and Scala

• Use the Beam API directly in Java (and soon Python) with the Flink runner
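For the second option, a minimal Java sketch of pointing Beam at the Flink runner (FlinkRunner and FlinkPipelineOptions come from the Beam Flink runner artifact; the streaming flag is shown only to illustrate that one pipeline covers both modes):

  import org.apache.beam.runners.flink.FlinkPipelineOptions;
  import org.apache.beam.runners.flink.FlinkRunner;
  import org.apache.beam.sdk.Pipeline;
  import org.apache.beam.sdk.options.PipelineOptionsFactory;

  FlinkPipelineOptions options = PipelineOptionsFactory.as(FlinkPipelineOptions.class);
  options.setRunner(FlinkRunner.class);   // execute on Flink instead of Dataflow/Spark
  options.setStreaming(true);             // run as a Flink streaming job
  Pipeline p = Pipeline.create(options);
  // ... apply transforms as usual; the Flink translator does the rest ...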

SDK

• Four parts:

• Pipeline: streaming & batch processing

• PCollection

• Transform

• I/O: sources & sinks
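All four parts in one small word-count sketch (Beam Java SDK; the file paths are placeholders, and method names follow the stable releases):

  import java.util.Arrays;
  import org.apache.beam.sdk.Pipeline;
  import org.apache.beam.sdk.io.TextIO;
  import org.apache.beam.sdk.options.PipelineOptionsFactory;
  import org.apache.beam.sdk.transforms.Count;
  import org.apache.beam.sdk.transforms.FlatMapElements;
  import org.apache.beam.sdk.transforms.MapElements;
  import org.apache.beam.sdk.values.KV;
  import org.apache.beam.sdk.values.PCollection;
  import org.apache.beam.sdk.values.TypeDescriptors;

  // Pipeline: the container for the whole job, batch or streaming.
  Pipeline p = Pipeline.create(PipelineOptionsFactory.create());

  // I/O source -> PCollection: an immutable, distributed dataset.
  PCollection<String> lines = p.apply(TextIO.read().from("input.txt"));

  // Transforms: split lines into words, then count occurrences.
  PCollection<KV<String, Long>> counts = lines
      .apply(FlatMapElements.into(TypeDescriptors.strings())
          .via((String line) -> Arrays.asList(line.split("\\s+"))))
      .apply(Count.perElement());

  // I/O sink: write the results back out.
  counts
      .apply(MapElements.into(TypeDescriptors.strings())
          .via((KV<String, Long> kv) -> kv.getKey() + ": " + kv.getValue()))
      .apply(TextIO.write().to("counts"));

  p.run().waitUntilFinish();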

For Flink users

• We encourage users to use either the Beam or the Flink API to implement their Flink jobs for stream data processing.

• But the native Flink API offers:

• backwards-compatible API

• built-in libraries (e.g., CEP and upcoming SQL)

• key-value state (with the ability to query that state in the future; see the sketch below)

http://data-artisans.com/why-apache-beam/
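As an example of that last point, keyed state in the native Flink DataStream API looks roughly like this (a sketch; the stream type and names are made up):

  import org.apache.flink.api.common.functions.RichFlatMapFunction;
  import org.apache.flink.api.common.state.ValueState;
  import org.apache.flink.api.common.state.ValueStateDescriptor;
  import org.apache.flink.configuration.Configuration;
  import org.apache.flink.util.Collector;

  // Emits a running count per key, kept in Flink-managed key-value state.
  public class CountPerKey extends RichFlatMapFunction<String, Long> {
    private transient ValueState<Long> count;

    @Override
    public void open(Configuration conf) {
      count = getRuntimeContext().getState(
          new ValueStateDescriptor<>("count", Long.class));
    }

    @Override
    public void flatMap(String event, Collector<Long> out) throws Exception {
      Long current = count.value();          // null on the first event for a key
      long next = (current == null ? 0L : current) + 1;
      count.update(next);
      out.collect(next);
    }
  }

  // usage: events.keyBy(e -> e).flatMap(new CountPerKey())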

Demo

• GDELT project

• EventCount by Location

Pipeline
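A hedged sketch of the demo pipeline in the Beam Java SDK (the input path and the column that holds the location are placeholders; GDELT event records are tab-separated):

  import org.apache.beam.sdk.Pipeline;
  import org.apache.beam.sdk.io.TextIO;
  import org.apache.beam.sdk.options.PipelineOptionsFactory;
  import org.apache.beam.sdk.transforms.Count;
  import org.apache.beam.sdk.transforms.MapElements;
  import org.apache.beam.sdk.values.KV;
  import org.apache.beam.sdk.values.TypeDescriptors;

  public class GdeltEventCount {
    // Placeholder: index of the location field in a GDELT event record.
    static final int LOCATION_INDEX = 0;

    public static void main(String[] args) {
      Pipeline p = Pipeline.create(PipelineOptionsFactory.fromArgs(args).create());
      p.apply(TextIO.read().from("gdelt-events.csv"))
       .apply(MapElements.into(TypeDescriptors.strings())
           .via((String row) -> row.split("\\t")[LOCATION_INDEX]))   // extract location
       .apply(Count.perElement())                                    // events per location
       .apply(MapElements.into(TypeDescriptors.strings())
           .via((KV<String, Long> kv) -> kv.getKey() + "," + kv.getValue()))
       .apply(TextIO.write().to("event-counts"));
      p.run().waitUntilFinish();
    }
  }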

Recap

• Write the data pipeline once, then choose your runner.

Next…

• New runners and SDKs (the Python SDK is still in development)

• DSL

Other things

• BigQuery now has DML support! https://goo.gl/lcZQVZ

• Data Studio beta is now available in Taiwan

• Embulk

• Fluentd v0.14.6 - 2016/09/07

• secure forward

• remember to set up nginx
