DB 105 - Apache Spark™ Programming

Summary

This course covers the fundamentals of Apache Spark including Spark’s architecture and internals, the core APIs for using Spark, SQL and other high-level data access tools, as well as Spark’s streaming capabilities and machine learning APIs. The class is a mixture of lecture and hands-on labs.

Duration

3 Days

Objectives

After taking this class, students will be able to:

  • Use the core Spark APIs to operate on data
  • Articulate and implement typical use cases for Spark
  • Build data pipelines and query large data sets using Spark SQL and DataFrames
  • Analyze Spark jobs using the administration UIs inside Databricks
  • Create Structured Streaming jobs
  • Work with relational data using the GraphFrames APIs
  • Understand how a Machine Learning pipeline works
  • Understand the basics of Spark’s internals

Audience

This course is for data analysts who want to learn the fundamentals of programming with Apache Spark, streamline their big data processing, build production Spark jobs, and understand and debug running Spark applications.

Prerequisites

  • Some familiarity with Apache Spark is helpful but not required.
  • Knowledge of SQL is helpful.
  • Basic programming experience in an object-oriented or functional language is required. The class can be taught concurrently in Python and Scala.

Additional Notes

All participants will need:

  • a laptop with an up-to-date version of Chrome or Firefox (Internet Explorer and Safari are not supported)
  • an internet connection that can support the use of GoToTraining

GoToTraining is the online platform through which the class is delivered. Before the class, each registrant will receive GoToTraining log-in instructions. For more information, and to confirm that your computer can run GoToTraining, please check here: Validation

    Upcoming Classes

    Date    Time                Time zone                     Location        Price
    Jun 24  7:00 AM - 11:00 AM  Pacific Daylight Time         Online          $ 2500.00 USD
    Jul 8   9:00 AM - 1:00 PM   Greenwich Mean Time           Online          $ 2500.00 USD
    Jul 8   9:00 AM - 5:00 PM   Central European Summer Time  France          $ 2500.00 USD
    Jul 16  9:00 AM - 5:00 PM   British Summer Time           United Kingdom  $ 2500.00 USD ($ 2125.00 USD before Jul 09, 2019 9:00 AM BST)
    Jul 29  8:00 AM - 4:00 PM   Eastern Daylight Time         United States   $ 2500.00 USD
    Aug 6   9:00 AM - 5:00 PM   Pacific Daylight Time         Online          $ 2500.00 USD
    Sep 24  9:00 AM - 5:00 PM   Pacific Daylight Time         Online          $ 2500.00 USD
    Sep 30  8:00 AM - 4:00 PM   Mountain Daylight Time        United States   $ 2500.00 USD
    Oct 8   8:00 AM - 4:00 PM   Greenwich Mean Time           Online          $ 2500.00 USD
    Nov 5   9:00 AM - 5:00 PM   Pacific Standard Time         Online          $ 2500.00 USD
    Dec 4   8:00 AM - 4:00 PM   Central European Time         France          $ 2500.00 USD ($ 2125.00 USD before Sep 05, 2019 10:00 AM CEST)

    Onsite Training

    Request Quote

    Public Training

    Virtual Class - US Pacific Time

    Virtual Class - GMT Time

    Paris

    London

    Reston, VA

    Virtual Class - US Eastern Time

    Centennial, CO
