This hands-on self-paced training course targets Data Engineers, Data Scientists and Data Analysts who want to use Databricks Delta for ETL processing on Data Lakes. The course ends with a capstone project building a complete data pipeline using Databricks Delta.
3-6 hours, 75% hands-on
The course is a series of seven self-paced lessons plus a final capstone project in which you build a complete data pipeline using Databricks Delta. Each lesson includes hands-on exercises.
Supported platforms include Azure Databricks, Databricks Community Edition, and non-Azure Databricks.
- If you're planning to use the course on Azure Databricks, select the "Azure Databricks" Platform option.
- If you're planning to use the course on Databricks Community Edition or on a non-Azure version of Databricks, select the "Other Databricks" Platform option.
During this course, you will:
- Use the interactive Databricks notebook environment.
- Use Databricks Delta to create, append and upsert data into a Data Lake.
- Use Databricks Delta to manage a Data Lake and extract actionable insights from it.
- Use Databricks Delta's advanced optimization features to speed up queries.
- Use Databricks Delta to seamlessly ingest streaming and historical data.
- Implement a Databricks Delta data pipeline architecture.
- Prerequisite: completion of the Getting Started with Apache Spark™ SQL, Getting Started with Apache Spark™ DataFrames, or ETL Part 1 course, or equivalent knowledge.
- Introducing Delta
- Capstone Project
- Please be sure to use a supported browser.
This self-paced training course may be used by 1 user for 12 months from the date of purchase. It may not be transferred or shared with any other user.