Migrating SAS Procedures to Databricks


Learn how to translate SAS statements and functions into code that can be run on Databricks.


This course enables experienced SAS developers to quickly translate familiar SAS statements and functions into code that runs on Databricks. It begins with an introduction to the Databricks environment and the different approaches to coding in Databricks, followed by an overview of how SAS PROC and DATA steps can be performed in Databricks. You will learn how to use Spark SQL, PySpark, and other tools to read .sas7bdat files and perform common operations. Finally, you will see code examples and gain hands-on practice performing some of the most common SAS operations in Databricks.

Learning objectives

  • Read data stored in .sas7bdat files using Spark SQL and PySpark.

  • Explain the conceptual and syntactic relationships between SAS DATA and PROC statements and their counterparts on Databricks.

  • Use Python to augment ANSI SQL and create reusable Spark SQL code.

  • Translate common PROC steps to Databricks.

  • Translate common DATA steps to Databricks.
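
To give a flavor of the objectives above, here is a minimal sketch of how Python might wrap ANSI SQL to produce reusable Spark SQL, using a one-way PROC FREQ table as the example. The helper name proc_freq_sql and the table and column names (cars, origin) are hypothetical and are not taken from the course materials. The sketch only builds the query text. Running it assumes a Databricks notebook, where a SparkSession named spark is already available.

```python
def proc_freq_sql(table: str, column: str) -> str:
    """Build a Spark SQL query approximating a one-way PROC FREQ table.

    Hypothetical helper for illustration only; it assumes `table` and
    `column` come from trusted code, not from user input.
    """
    if not column.isidentifier():
        raise ValueError(f"unexpected column name: {column!r}")
    return (
        f"SELECT `{column}`, "
        "COUNT(*) AS frequency, "
        # Share of all rows that fall in each group, as a percentage.
        "ROUND(100.0 * COUNT(*) / SUM(COUNT(*)) OVER (), 2) AS percent "
        f"FROM {table} "
        f"GROUP BY `{column}` "
        f"ORDER BY `{column}`"
    )


# Roughly the counterpart of:  PROC FREQ DATA=cars; TABLES origin; RUN;
query = proc_freq_sql("cars", "origin")
print(query)

# In a Databricks notebook the query text could then be executed with:
# spark.sql(query).show()
```

Because the function returns plain SQL text, the same helper can be reused for any table and column, which is the kind of reuse the Python objective describes.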


Prerequisites

  • Intermediate to advanced SAS programming experience

  • Beginning knowledge of Python programming

  • Beginning-level experience with SQL

Learning path

  • This course is part of the SQL analyst, Data engineer, Data scientist, and Platform administrator learning paths.

Proof of completion

  • After completing 80% of this course, you will receive a proof of completion.

