CRT020: Databricks Certified Associate Developer for Apache Spark 2.4 & Scala 2.11

Summary

COMING SOON: Databricks Certified Associate Developer for Apache Spark 2.4 & Scala 2.11 validates your knowledge of the core components of the DataFrames API and validates that you have a rudimentary knowledge of the Spark Architecture.

Description

The test is approximately 180 minutes with a series of randomly selected, multiple choice questions and another set of randomly selected coding challenges. Implementation of the coding challenges is completed within the Databricks product. Candidates are advised to become familiar with our online programming environment by signing up for the free version of Databricks, the Community Edition. Memorization of the APIs is not required and access to the Programming and API docs will be made available during the exam.

NOTE: This certification is not affiliated with the Apache Software Foundation.

Candidates are expected to be familiar with the following architectural concepts and their relationship to each other:

  • Driver
  • Executor
  • Core/Slots
  • Jobs
  • Stages
  • Tasks
  • Partitions
  • Shuffling
  • Wide vs Narrow Transformations

Candidates are expected to have a command of the following APIs but, memorization of the API is not required.

  • SparkSession
  • DataFrameReader/DataFrameWriter
  • DataFrame/Dataset
  • Row/Column
  • Spark SQL functions

Prerequisites

Instructor Led Trainings

DB 105 - Apache Spark™ Programming

DB 301 - Apache Spark™ for Machine Learning and Data Science

Self Paced Courses

SP800: Getting Started with Apache Spark™ DataFrames Azure | AWS

SP820: ETL Part 1 – Data Extraction Azure | AWS

SP821: ETL Part 2 – Transformations and Loads AWS | Azure

Recommended Reading

Spark: The Definitive Guide: Big Data Processing Made Simple Book by Bill Chambers and Matei Zaharia

7 Steps for a Developer to Learn Apache® Spark™