Introduction to Databricks Connect

Summary

Learn about the Databricks Connect (DB Connect) library.

Description

DB Connect is a library that lets a software developer treat their local machine as the driver while executing jobs against a previously configured cluster in the Databricks workspace. With this pattern, developers can step away from notebook-based development and return to more traditional workflows. DB Connect aims to support better software development practices, such as test-driven development and conventional project management for Python, Scala, Java, SparkR, and sparklyr, and it helps close the integration gap between “traditional” applications and “Spark” applications.

In this course, participants are introduced to DB Connect through presentations and demos. We start by contrasting how DB Connect works with other development patterns. We then walk through how DB Connect is installed and configured. We conclude with a live demonstration of an application running on a developer’s local machine while executing its Spark jobs against a cluster in the Databricks workspace.
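
As a rough illustration of the pattern described above, the following Python sketch shows what a locally run script might look like. It assumes the classic DB Connect client, in which the installed package stands in for pyspark and has already been configured to point at a cluster, so that the standard SparkSession builder returns a session backed by that cluster. The function names get_session and count_even_ids are hypothetical and chosen only for this example; the course itself may use a different setup.

```python
def get_session():
    """Return a SparkSession.

    Assumption: Databricks Connect is installed and configured locally, so
    the session returned here sends its Spark jobs to the remote cluster.
    """
    # Imported lazily so this module can be loaded where pyspark is absent,
    # for example when unit-testing count_even_ids with a stand-in session.
    from pyspark.sql import SparkSession

    return SparkSession.builder.getOrCreate()


def count_even_ids(spark, n=100):
    """Count the even ids in a DataFrame of n rows (hypothetical example job).

    The session is passed in as a parameter, so the function can be
    exercised from a local test suite as well as from a script.
    """
    df = spark.range(n)  # DataFrame with a single column named "id"
    return df.filter(df.id % 2 == 0).count()
```

A local script would call count_even_ids(get_session()); the Python process on the developer’s machine acts as the driver while the filtering and counting run on the cluster. Taking the session as an argument is one way to keep such code testable, which fits the test-driven development goal mentioned above.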

Learning objectives

  • Explain how Databricks Connect is used by data practitioners working with Databricks.

  • Install and configure Databricks Connect.

Prerequisites

  • Intermediate experience using the Databricks Workspace

Learning path

  • This course is part of the Data Engineer path.

Proof of completion

  • Upon completing this course, you will receive a proof of completion.
