Introduction to Cloning with Delta Lake

Summary

Make no-hassle copies of Delta tables for development and archiving.

Description

Clone in Delta Lake lets data engineers and administrators replicate data stored in the Lakehouse. Organizations can use deep clone to archive versions of their production tables for regulatory compliance, and developers can use shallow clone to create development datasets isolated from production data. In this course, you’ll learn the basics of cloning with Delta Lake and get hands-on experience working with the syntax.
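
As a rough preview of the syntax, the sketch below builds the two kinds of clone statement as plain SQL strings. The helper `clone_statement` and the table names (`prod.sales`, `archive.sales_2024`, `dev.sales`) are hypothetical and chosen only for illustration. The statement shape follows the `CREATE TABLE ... [DEEP | SHALLOW] CLONE` form documented for Databricks, which may differ in other environments.

```python
def clone_statement(target, source, kind="DEEP", version=None):
    """Build a Delta Lake CLONE statement as a SQL string.

    Hypothetical helper for illustration; it only formats text and
    does not connect to Spark or validate that the tables exist.
    """
    kind = kind.upper()
    if kind not in ("DEEP", "SHALLOW"):
        raise ValueError("kind must be 'DEEP' or 'SHALLOW'")
    stmt = f"CREATE TABLE IF NOT EXISTS {target} {kind} CLONE {source}"
    if version is not None:
        # Clone the source as it existed at a specific table version.
        stmt += f" VERSION AS OF {int(version)}"
    return stmt


# Deep clone: copies data files and metadata, e.g. for an archive.
archive_sql = clone_statement("archive.sales_2024", "prod.sales", kind="DEEP", version=12)

# Shallow clone: copies metadata only and references the source data files.
dev_sql = clone_statement("dev.sales", "prod.sales", kind="SHALLOW")

print(archive_sql)
print(dev_sql)
```

In a notebook with an active Spark session, each string could then be passed to `spark.sql(...)`, assuming the source table exists and the session has Delta Lake configured.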

Learning objectives

  • Describe the basic execution of deep and shallow clones with Delta Lake.

  • Use deep clones to create full and incremental backups of tables.

  • Use shallow clones to create development datasets.

  • Describe strengths, limitations, and caveats of each type of clone.

Prerequisites

  • Hands-on experience working with Delta Lake

  • Intermediate experience with Spark and Databricks

Learning path

  • This course is part of the Data Engineering learning path.

Proof of completion

  • Upon 80% completion of this course, you will receive a proof of completion.

 
