Accelerate Analytics with a GPU Data Frame
Aaron Williams
October 18, 2017
on-demand.gputechconf.com/gtc-il/2017/presentation/...MAPD...

MapD: Extreme Analytics

100x Faster Queries

MapD Core

The world’s fastest columnar database, powered by GPUs

Visualization at the Speed of Thought

MapD Immerse

A visualization front end that leverages the speed & rendering superiority of GPUs

MapD System Architecture
Accelerating the existing data infrastructure

MAPD DEMO

MapD Benchmarks
Blogger Mark Litwintschik benchmarked MapD on a billion-row taxi data set and found it to be up to orders of magnitude faster than the fastest CPU databases.

MapD Core: Comparative Query Acceleration*

System                                            Q1       Q2       Q3       Q4
BrytlytDB & 2-node p2.16xlarge cluster           36x      47x      25x      12x
ClickHouse, Intel Core i5 4670K                  49x      58x      32x      25x
Redshift, 6-node ds2.8xlarge cluster             74x      24x      14x       6x
BigQuery                                         95x      38x       6x       6x
Presto, 50-node n1-standard-4 cluster           190x      75x      61x      41x
Amazon Athena                                   305x     117x      37x      13x
Elasticsearch (heavily tuned)                   386x     343x      n/a      n/a
Spark 2.1, 11 x m3.xlarge cluster w/ HDFS       485x     153x     119x     169x
Presto, 10-node n1-standard-4 cluster           524x     189x     127x      61x
Vertica, Intel Core i5 4670K                    685x     607x     203x     132x
Elasticsearch (lightly tuned)                 1,642x   1,194x      n/a      n/a
Presto, 5-node m3.xlarge cluster w/ HDFS      1,667x     735x     388x     159x
Presto, 50-node m3.xlarge cluster w/ S3       2,048x     849x     164x      86x
PostgreSQL 9.5 & cstore_fdw                   7,238x   3,302x   1,424x     722x
Spark 1.6, 5-node m3.xlarge cluster w/ S3    12,571x   5,906x   3,758x   1,884x

*All speed comparisons are to the “MapD & 1 Nvidia Pascal DGX-1” benchmark

Source: http://tech.marksblogg.com/benchmarks.html

Query Compilation with LLVM

Traditional DBs can be highly inefficient
• Each operator in SQL is treated as a separate function
• This incurs tremendous overhead and prevents vectorization

MapD compiles queries w/LLVM to create one custom function
• Queries run at speeds approaching hand-written functions
• LLVM enables generic targeting of different architectures (GPUs, x86, ARM, etc.)
• Code can be generated to run a query on CPU and GPU simultaneously
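The contrast between evaluating each operator as a separate function and generating one custom function per query can be sketched in a few lines. This is only an illustration: Python's built-in `compile`/`exec` stands in for LLVM, the query and the names `interpret`, `to_source`, and `compile_query` are hypothetical, and nothing here reflects MapD's actual code generator.

```python
# Illustrative sketch only: Python stands in for LLVM, all names are hypothetical.
import operator

# Toy data for: SELECT SUM(fare * 2 + tip) WHERE distance > 1
rows = [
    {"fare": 10.0, "tip": 2.0, "distance": 0.5},
    {"fare": 20.0, "tip": 3.0, "distance": 2.0},
    {"fare": 5.0, "tip": 1.0, "distance": 4.0},
]

# Expression trees: (op, left, right), a column name, or a literal.
PREDICATE = (">", "distance", 1)
PROJECTION = ("+", ("*", "fare", 2), "tip")

OPS = {"+": operator.add, "*": operator.mul, ">": operator.gt}


def interpret(node, row):
    """Operator-at-a-time evaluation: one function call per operator per row."""
    if isinstance(node, tuple):
        op, left, right = node
        return OPS[op](interpret(left, row), interpret(right, row))
    if isinstance(node, str):
        return row[node]  # column reference
    return node  # literal


def run_interpreted(rows):
    return sum(interpret(PROJECTION, r) for r in rows if interpret(PREDICATE, r))


def to_source(node):
    """Render an expression tree as a Python expression string."""
    if isinstance(node, tuple):
        op, left, right = node
        return f"({to_source(left)} {op} {to_source(right)})"
    if isinstance(node, str):
        return f"row[{node!r}]"
    return repr(node)


def compile_query(predicate, projection):
    """Generate one fused function for the whole query and compile it once."""
    source = (
        "def query(rows):\n"
        "    total = 0.0\n"
        "    for row in rows:\n"
        f"        if {to_source(predicate)}:\n"
        f"            total += {to_source(projection)}\n"
        "    return total\n"
    )
    namespace = {}
    exec(compile(source, "<generated-query>", "exec"), namespace)
    return namespace["query"]


run_compiled = compile_query(PREDICATE, PROJECTION)
interpreted_total = run_interpreted(rows)
compiled_total = run_compiled(rows)
print(interpreted_total, compiled_total)
```

Both paths return the same total; the generated function simply avoids the per-operator dispatch inside the row loop, which is the overhead the slide attributes to traditional databases.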

[Figure: binary machine code, labeled LLVM]

Keeping Data Close to Compute
MapD maximizes performance by optimizing memory use

• GPU RAM (L1): 24GB to 256GB, 1000-6000 GB/sec. Hot data: speedup = 1500x to 5000x over cold data
• CPU RAM (L2): 32GB to 3TB, 70-120 GB/sec. Warm data: speedup = 35x to 120x over cold data
• SSD or NVRAM storage (L3): 250GB to 20TB, 1-2 GB/sec. Cold data

[Diagram labels: Compute Layer; Storage Layer; Data Lake/Data Warehouse/System Of Record; Space Increases]
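One way to read the speedup figures is as ratios of tier bandwidths. The slide does not say how the figures were derived, so that reading is an assumption; the sketch below only checks it against the bandwidth ranges quoted on the slide, and the helper name `speedup_range` is hypothetical.

```python
# Bandwidth ranges in GB/sec, copied from the slide.
TIERS = {
    "GPU RAM (L1)": (1000, 6000),
    "CPU RAM (L2)": (70, 120),
    "SSD or NVRAM (L3)": (1, 2),
}


def speedup_range(fast, slow):
    """Return the (min, max) bandwidth ratio of a fast tier over a slow tier."""
    fast_lo, fast_hi = fast
    slow_lo, slow_hi = slow
    # Worst case: slowest fast tier vs. fastest slow tier, and vice versa.
    return fast_lo / slow_hi, fast_hi / slow_lo


cold = TIERS["SSD or NVRAM (L3)"]
warm_speedup = speedup_range(TIERS["CPU RAM (L2)"], cold)
hot_speedup = speedup_range(TIERS["GPU RAM (L1)"], cold)

print("warm over cold:", warm_speedup)
print("hot over cold:", hot_speedup)
```

Under this assumption the warm-data ratio comes out at 35x to 120x, matching the slide exactly. The hot-data ratio spans 500x to 6000x, a wider range that contains the slide's 1500x to 5000x, so the slide presumably uses narrower bounds there.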

The Status Quo: Memory Bottlenecks

PCIe: 4-16 GB/s

The GPU Open Analytics Initiative Model
Standard in-memory format; zero-copy interchange
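The idea of zero-copy interchange over a shared in-memory format can be illustrated with the Python standard library alone. This is a conceptual stand-in, not the GPU Data Frame API: CPU memory substitutes for GPU memory, and the two "consumers" are hypothetical names for tools that would otherwise each copy the data across PCIe.

```python
from array import array

# A column of fares stored once in a contiguous, typed buffer.
fares = array("d", [10.0, 20.0, 5.0])

# Two hypothetical consumers receive views of the same buffer, not copies.
query_engine_view = memoryview(fares)
ml_library_view = memoryview(fares)

# A copy, for contrast: it owns separate memory.
copied = array("d", fares)

# A write through one view is visible through the other and in the original.
query_engine_view[0] = 12.5
shared = ml_library_view[0] == 12.5 and fares[0] == 12.5
copy_unchanged = copied[0] == 10.0

print("views share memory:", shared)
print("copy is independent:", copy_unchanged)
```

Because both views agree on the layout (contiguous 8-byte floats), neither side needs to serialize or convert the column, which is the property a standard in-memory format is meant to provide.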

Interactive Machine Learning
Empowering the People in the Pipeline

Personas in Analytics Lifecycle (Illustrative)
• Business Analyst
• Data Scientist
• Data Engineer
• IT Systems Admin
• Data Scientist / Business Analyst

Lifecycle stages
• Data Preparation
• Data Discovery & Feature Engineering
• Model & Validate
• Predict
• Operationalize
• Monitoring & Refinement
• Evaluate & Decide

GPUs: MapD, H2O.ai, MapD

GOAI DEMO

Try MapD
It’s free and it’s easy (and @ortelius sez “it’s the new h0t sh1t”)

Play with the live demos: https://www.mapd.com/demos/

Download the Community Edition: https://www.mapd.com/platform/download-community/

Join our forums: https://community.mapd.com/

Review these slides: https://www.slideshare.net/aaronrogerwilliams

Aaron Williams
VP of Global Community

@_arw_ aaron@mapd.com /in/aaronwilliams/ /williamsaaron
