Presto: Distributed sql query engine

PRESTO Kiran Palaka

Problem to solve Huge production of data. As data is growing enormously to the point of peta bytes ,

querying the database has become a big issue. So we should be able to run more interactive queries and get

results faster .

Introduction Presto is a open source distributed sql query engine. For running queries against of all sizes ranging from

gigabytes to petabytes . It supports ANSI SQL ,including complex

queries,aggresgations,joins and window functions . It is implemented in java.

Presto: I can query

Relational databases

Proprietary data stores

Architecture

Architecture Explanation Client sends sql to presto coordinator. Coordinator parses ,analyzes and plans the query execution. The scheduler wires together the execution pipeline ,assigns

work to nodes closest to data and monitors the progress. The client pulls the data from output stage which in turn pulls

data from underlying stages.

Hive/Mapreduce Execution model Hive translates queries into multiple stage of mapreduce

tasks and execute them one after the other. Each task reads input from disk and writes intermediate

output back to disk.

Presto Execution Presto engine does not use Mapreduce. It employs a custom query and execution engine with

operators designed to support sql semantics. Processing is in memory and pipelined across the network

between stages which avoids unnecessary I/O and associated latency overhead.

Pipelined execution model runs multiple stages at once and streams data from one stage to next as it becomes available which reduces end-to-end latency

Note Presto dynamically compiles certain portions of query plan to

byte code which lets JVM optimize and generate native machine code.

Extensibility Presto was designed with a simple storage abstraction that

makes its easy to provide sql query capability against disparate data sources.

Connectors only need to provide interfaces for fetching meta data, getting data locations and accessing data itself.

Limitations Size limitation on the join tables and cardinality of unique

groups. Lacks the ability to write output back to tables. Currently

query results are streamed to client.

Presto developers claim: Presto is 10x better than hive/Mapreduce in terms of cpu

efficiency and latency for most queries. Supports ANSI sql, including joins, left/right outer

joins,subqueries,most of the common aggregate and scalar functions, including approximate distinct counts, approximate percentiles

Presto: Distributed sql query engine

Technology

Presto Training Series, Session 1: Using Advanced SQL ... · Presto Training Series, Session 1: Using Advanced SQL Features In Presto Try Presto: David Phillips and Manfred Moser

My sql query browser

Structured Query Language(SQL)

SQL Query Basic - Aeriesconference.aeries.com/spring2017/docs/PDFs/940 SQL Query... · 2017-03-01 · SQL Query Basic Conference 2017 SQL Query Basic Session 940 - Page 1 . Session

Presto - SLAC Conferences, Workshops and … · 3 What is Presto? •Open source distributed SQL query engine •Designed and written from the ground up for interactive analytical

Structured Query Language (SQL) Query Language (SQL).pdf · Structured Query Language (SQL) Structured query language (SQL) was designed to implement both data definition and data

SQL : Query Language

Module 2: Using Transact-SQL Querying Tools. Overview SQL Query Analyzer Using the Object Browser Tool in SQL Query Analyzer Using Templates in SQL Query

Translating WFS Query to SQL/XML Query

SQL Tutorial - · PDF fileSQL Tutorial Learn SQL ... Classic Query Engine and SQL query engine etc. Classic query engine handles all non-SQL queries but SQL query engine ... DQL -

DB2 Sql Query

Query 2 SQL

Sql Server Query Parameterization

07 Structured Query Language - Worayoot · 2014-01-17 · Structured Query Language (SQL) 1 Structured Query Language (SQL) 2 • SQL ทีใชในระบบฐานข้อมูลแบบ

SQL Query Disassembler

PL/SQL database query

Microsoft SQL Server Query Tuning - Meetupfiles.meetup.com/1381968/Microsoft SQL Server Query...Microsoft PowerPoint - Microsoft SQL Server Query Tuning [Compatibility Mode] Author

Advanced SQL injection to operating system full control · UNION query (inband) SQL injection: par=1 UNION ALL SELECT query--Batched queries SQL injection: par=1 ; SQL query;--

Sql query performance analysis

SQL Query for DB2 - User's Manual...SQL Query for DB2 (Бизнес версия) + 2 года Сопровождения* SQL Query for DB2 (Бизнес версия) + 3 года