Presto Essentials: Data Science
1h 49m
Intermediate
2017-05-12
Authors

Ben Sullins
Data Geek, Tech Consultant
Course details
Netflix and Airbnb both use Presto—an open-source SQL query engine developed by Facebook—for their ever-expanding big data querying needs. In this course, learn how to harness the power of your big data system using the Presto platform, which breaks the false dilemma of having to choose between an expensive commercial solution that offers fast analytics, and a slow, ostensibly free solution that requires excessive hardware.
Data science expert Ben Sullins helps you get up to speed with Presto and leverage it to accomplish a wide range of data science and analytics tasks. He uses different interfaces with Presto, such as R and Tableau, and digs into the expressive SQL language that Presto offers for your analysis. At the end of this course, you'll know the key concepts of Presto and how to use them to take full advantage of your modern big data system.
Learning objectives
What does Presto do?
Running Presto
Connecting from Tableau and R
Connecting to Hive, MySQL, and the local system
Retrieving data
Combining data sources
Basic SQL functions
Advanced SQL functions
Migrating from Hive
Skills covered
Presto, Data Engineering, Data Science, Open Source
Concepts
0. Introduction
- 01 - Welcome
- 02 - What you should know
- 03 - Exercise files
1. Introducing Presto
- 04 - Presto origins
- 05 - Why use Presto
- 06 - What does Presto do
- 07 - How Presto works
2. Running Presto
- 08 - Deployment options
- 09 - Sandbox setup
- 10 - Command-line interface
- 11 - Connect from Tableau
- 12 - Connect from R
- 13 - Presto web UI
3. Connectors
- 14 - Connect to Hive
- 15 - Connect to MySQL
- 16 - Connect to the local system
- 17 - Benchmark with TPCH
- 18 - Additional connectors
4. SQL Syntax
- 19 - Retrieve data
- 20 - Combine data sources
- 21 - Group results
5. Basic SQL functions
- 22 - Conditional expressions
- 23 - Date and time functions
- 24 - Data aggregation
- 25 - Conversion between data types
6. Advanced SQL Functions
- 26 - Window functions
- 27 - Bollinger bands
- 28 - Access data with arrays
- 29 - Access data with maps
- 30 - Parse URL strings
- 31 - Work with JSON data
7. Migrating from Hive
- 32 - Migrate from Hive
Conclusion
- 33 - Next steps