Confidential Customized for Lorem Ipsum LLC Version 1.0

Spark on Google Cloud
Friends don't let friends build data centers

Data Scientist @Avenue Code

Evandro Caldeira
Data scientist at Avenue Code. Computer Engineering graduate and crazy about coffee.

E-mail: [email protected]

Brazil: Belo Horizonte, São Paulo, Porto Alegre

USA

Canada

TOC

Overview

On premise vs cloud

How to migrate

Demo

Why Spark?

On premise

1 Equipment

2 Management

3 Usage peaks

4 $$$

Migration to GCP: decommissioning a data center in 2018

Spotify

First steps on GCP

What to do

1 Move the data

2 Experiment

What to do

3 Use ephemeral clusters

4 Preemptible workers

5 Delete the cluster when finished
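
Items 3 to 5 can be combined into a single pattern: create a cluster for one job, include preemptible workers, and always delete it afterwards. The sketch below is one possible way to do that from Python by wrapping the `gcloud` CLI in a context manager. It is not taken from the talk; the helper names (`create_cmd`, `delete_cmd`, `ephemeral_cluster`), the default worker counts, and the injectable `run` argument are all assumptions made for illustration.

```python
import subprocess
from contextlib import contextmanager


def create_cmd(name, region, workers=2, preemptible_workers=2):
    """Build the gcloud command that creates a Dataproc cluster."""
    return [
        "gcloud", "dataproc", "clusters", "create", name,
        f"--region={region}",
        f"--num-workers={workers}",
        # Secondary workers are, to my knowledge, preemptible by default.
        f"--num-secondary-workers={preemptible_workers}",
    ]


def delete_cmd(name, region):
    """Build the gcloud command that deletes the cluster without prompting."""
    return [
        "gcloud", "dataproc", "clusters", "delete", name,
        f"--region={region}",
        "--quiet",
    ]


@contextmanager
def ephemeral_cluster(name, region, run=subprocess.check_call, **sizes):
    """Create a cluster, hand it to the caller, and delete it on exit.

    `run` is a hypothetical hook that receives each command as a list;
    it defaults to actually executing the command.
    """
    run(create_cmd(name, region, **sizes))
    try:
        yield name
    finally:
        # Runs even when the job inside the `with` block raises.
        run(delete_cmd(name, region))
```

Used as `with ephemeral_cluster("demo-cluster", "us-central1") as cluster: ...`, the job submission goes inside the block and the delete command is issued even when the job fails. If cluster creation itself fails, no delete is attempted. The cluster name and region here are placeholders.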

Hands on

Free credits!

Installation

1 Google Cloud SDK

2 Spark standalone
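
A quick way to confirm both installations is to check that their command-line entry points are reachable. The following is a minimal sketch, assuming the Google Cloud SDK exposes `gcloud` and the local Spark installation has its `bin` directory (which contains `spark-submit`) on `PATH`; the function names are hypothetical and not part of either tool.

```python
import shutil

# Executables expected after installing the Cloud SDK and a local Spark.
REQUIRED_TOOLS = ("gcloud", "spark-submit")


def find_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Map each tool name to its resolved path, or None when not on PATH."""
    return {tool: which(tool) for tool in tools}


def missing_tools(found):
    """Return the sorted names of tools that could not be located."""
    return sorted(tool for tool, path in found.items() if path is None)


if __name__ == "__main__":
    for tool, path in find_tools().items():
        print(f"{tool}: {path or 'not found on PATH'}")
```

A tool reported as not found is often installed but simply missing from `PATH`, so the result is a hint rather than proof of a failed installation.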

Cluster creation

Job execution
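
The slides for this step did not survive extraction, so the job that was actually run is unknown. As a stand-in, here is a minimal PySpark word count of the kind commonly used for a first run. The file name `word_count.py`, the `tokenize` helper, and the input and output paths are assumptions for illustration, not the contents of the talk's repository.

```python
import re
import sys


def tokenize(line):
    """Lowercase a line and split it into words (letters, digits, underscore)."""
    return re.findall(r"\w+", line.lower())


def run_word_count(input_path, output_path):
    """Count word occurrences in the input text and write (word, count) pairs."""
    # Imported here so tokenize() stays usable where PySpark is not installed.
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("word-count-sketch").getOrCreate()
    try:
        lines = spark.sparkContext.textFile(input_path)
        counts = (
            lines.flatMap(tokenize)
            .map(lambda word: (word, 1))
            .reduceByKey(lambda a, b: a + b)
        )
        counts.saveAsTextFile(output_path)
    finally:
        spark.stop()


# Only runs when called with exactly two arguments: input and output paths.
if __name__ == "__main__" and len(sys.argv) == 3:
    run_word_count(sys.argv[1], sys.argv[2])
```

Locally this could be launched with `spark-submit word_count.py <input> <output>`. On a Dataproc cluster the equivalent would be `gcloud dataproc jobs submit pyspark word_count.py --cluster=<cluster> --region=<region> -- gs://<bucket>/input gs://<bucket>/output`, where the cluster, region, and bucket are placeholders to fill in.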

Source: https://github.com/evandroc/tdc-spark

Thank you.