Databricks Certified Professional Data Scientist

Summary

The Databricks Certified Professional Data Scientist certification exam assesses a candidate's understanding of machine learning fundamentals, the steps in the machine learning lifecycle, basic machine learning algorithms and techniques, and the basics of machine learning model management.

Description

The Databricks Certified Professional Data Scientist certification exam assesses a candidate's understanding of machine learning fundamentals and the steps in the machine learning lifecycle, including data preparation, feature engineering, model training, model selection, model interpretation, and moving models to production. The exam also assesses understanding of basic machine learning algorithms and techniques, including linear regression, logistic regression, regularization, decision trees, tree-based ensembles, basic clustering algorithms, and matrix factorization techniques. The basics of model management with MLflow, such as logging and model organization, are also assessed.

Prerequisites

The minimally qualified candidate should have:

  • a complete understanding of the basics of machine learning, including:

    • bias-variance tradeoff

    • in-sample vs. out-of-sample data

    • categories of machine learning

    • applied statistics concepts

  • an intermediate understanding of the steps in the machine learning lifecycle, including:

    • data preparation

    • feature engineering

    • model training, selection, and production

    • interpreting models

  • a complete understanding of basic machine learning algorithms and techniques, including:

    • linear, logistic, and regularized regression

    • tree-based models such as decision trees, random forests, and gradient-boosted trees

    • unsupervised techniques such as K-means and PCA

    • specific algorithms like ALS for recommendation and isolation forests for outlier detection

  • a complete understanding of the basics of machine learning model management with MLflow, such as logging and model organization

Preparation

The following Databricks courses should help you prepare for this exam

Learning Path

This certification is part of the Data Scientist learning path.

Exam Details

The exam details are as follows:

  • The exam consists of 60 multiple-choice questions.

  • Candidates will have 120 minutes to complete the exam.

  • The minimum passing score for the exam is 70 percent. This translates to correctly answering a minimum of 42 of the 60 questions.

  • The exam will be conducted via an online proctor.

  • This exam has no code-based questions, and there will be no test aids available while taking the exam.

Other exam details are available via the Certification FAQ.

Technology Requirements

Technical requirements and preparation instructions are available on Kryterion’s Online Proctored Exam Support page.

Registration

To register for this certification, please click the button below and follow the instructions to create a certification account and process payment.

Register