Introduction to Apache Spark Architecture

Summary

Develop a robust understanding of how Apache Spark executes some of the most common transformations and actions.

Description

In this course, you will explore how Apache Spark executes a series of queries. Examples will include simple, narrow transformations and more complex, wide transformations.

This course will give developers the working understanding they need to eventually write code that leverages the power of Apache Spark for even the simplest of queries.

Learning objectives

  • Explain how Apache Spark applications are divided into jobs, stages, and tasks.

  • Explain the major components of Apache Spark's distributed architecture.

Prerequisites

  • Familiarity with basic information about Apache Spark (what it is, what it is used for)

Learning path

  • This course is part of the SQL analyst, data scientist, and data engineering Databricks Academy learning paths.

Proof of completion

  • Upon 80% completion of this course, you will receive a proof of completion.