Databricks Command Line Interface (CLI) Fundamentals

Summary

Learn basic configuration and usage for uploading and downloading notebooks, loading custom libraries, managing clusters, starting jobs, monitoring runs, and configuring workspace secrets using the CLI.

Description

While the Databricks platform web-based graphical user interface provides powerful functionality for data teams, many use cases call for programmatic command line access. The Databricks command line interface (CLI) provides access to a variety of powerful workspace features. This module is not intended as a comprehensive overview of all the CLI can do, but rather an introduction to some of the common features users may desire to leverage in their workloads.

Learning objectives

  • Install and configure the Databricks CLI to securely interact with the Databricks Workspace.

  • Configure workspace secrets using the CLI for more secure sharing and use of string-based credentials in notebooks.

  • Sync notebooks and libraries between the Databricks workspace and other environments using the CLI.

  • Perform a variety of tasks including interacting with clusters, jobs, and runs using the CLI.

Prerequisites

  • Familiarity with Apache Spark concepts

  • Familiarity with the data engineering capabilities of the Databricks Platform

  • Intermediate experience using the Databricks platform for data engineering (creating clusters, loading notebooks, scheduling jobs, etc.)

Learning path

  • This course is part of the data engineering and platform administrator learning paths.

Proof of completion

  • Upon 80% completion of this course, you will receive a proof of completion.