Training FAQ

Training FAQ

Table of Contents

Main FAQ

Why do I need to create a Learner Account?

Databricks has a Learning Management System (LMS) to manage our customers’ trainings and for the best experience we associate all of your past and current Databricks trainings and certifications to a single account.

How do I create my Learner Account?

Click the red Academy Log In button in the upper right-hand corner of this page and scroll down to the “Don’t have an account? Sign up!” section and fill out all the required fields. You will be required to create a new password and verify your email - please make sure you are using your corporate email domain to enjoy all the corporate benefits your organization may have with Databricks

Where do I log in to see my account?

By default, you will be asked to log in when you are ready to purchase courses in the shopping cart or you may log in at any time when accessing the Academy content by clicking on the red box called “My Account” on the top right.

How can I contact Databricks Training Support?

Follow this link to the training support page on help.databricks.com.

What Account Log On Details should I use?

Databricks values your privacy and so offers tokenized security login. Please do not share your details with anyone. We advise that If you already have a Databricks Community Edition license, then please use the same login credentials as we will be shortly merging that account with your training portal log in but you should use your business email as the system will automatically apply any available promotions to that email domain

How do I run the self-paced course notebooks?

The notebooks must be imported into a Databricks workspace.

How do I access a Databricks workspace?

Important: Access to a Databricks workspace is not part of your course purchase price. You are responsible for getting access to Databricks. Currently, there are several available options:

Use your employer's workspace

If your employer is a Databricks customer, and you have access to your employer's Databricks workspace, we recommend using that workspace.

Use a Databricks trial account

If you cannot use an employer's workspace, you can use a Databricks trial.

For a trial of Azure Databricks, follow these steps:

  1. Set up your Azure account
  2. Create an Azure Databricks Workspace
  3. Import the Courseware into Azure Databricks.

For a trial of AWS Databricks, see this page.

Use Databricks Community Edition

You can sign up for Databricks Community Edition here.

NOTE:Every course except ETL 3 will run on the free Databricks Community Edition.

Why can I not find all of my purchased Trainings?

By default, when accessing your training account, you will see the current courses in "My Dashboard".

If you are looking for your completed courses, then change your Filter Results to "Completed"

Your certificates will be found under the Awards section and it is from here you can share your success with your Linked In account

How do I use my Corporate Training Credits?

You must request a coupon code from your company's Authorized Owner, to be applied in the shopping cart and make sure you select the correct payment type. The Authorized owner will be required to approve or reject each request for training before your registration is regarded as successful and you will receive notification in your Learner Portal

How do I see the balance of training credits?

Only the authorized person will have access to see the balance of Training Credits held in their account.

There must be sufficient training credits to enable a course to be purchased, otherwise the authorized person will receive a notification

Certification FAQ

This section contains frequently asked questions about Databricks Certification Exams. Questions in the “General” section apply to all exams. Use the links below to navigate to exam-specific questions about the Databricks Certified Associate Developer for Apache Spark 2.4 exam or the Databricks Certified Associate ML Practitioner w/ Apache Spark 2.4 exam.

General

How do I register for a Databricks certification exam?

Follow these steps:

  • Create an account (or login) to https://academy.databricks.com.
  • Click on the Certifications Tab to see all available certificate exams
  • Click the Register button for the exam you would like to take.
  • Follow on-screen prompts to schedule an exam with our partner proctoring service.

How long do I have to complete each exam?

For each exam, you will have 2 hours (120 minutes) to complete 60 multiple choice questions. Our goal is to assess your skills, not your speed.

What is the minimum passing score for certification exams?

Your unrounded score must be 70.00% or better. This translates to correctly answering a minimum of 42 of the 60 questions.

How are exams graded?

Your exam will be graded immediately. You will receive your unofficial score report upon completion of the exam. The exam and proctor records will be sent to Databricks. If awarded, you will receive instructions to access your certificate within one week.

Do I get a certificate or badge for passing the Databricks certification exams?

A certificate will be made available via our Learning Management System (LMS). To view those resources, please sign in to the LMS at https://academy.databricks.com/.

Are there any practice exams available for Databricks certification exams?

There are no practice exams at this time.

What are the technology requirements for Databricks certification exams?

Please find tech requirements and preparation instructions on Kryterion’s Online Proctored Exam Support page.

Can I use Windows or a Mac for the Databricks certification exams?

Please find tech requirements and preparation instructions on Kryterion’s Online Proctored Exam Support page.

What type of camera, computer, and web browser are required to take the online-proctored Databricks certification exams?

Please find tech requirements and preparation instructions on Kryterion’s Online Proctored Exam Support page.

How can I test my webcam before taking the Databricks certification exams?

Please find tech requirements and preparation instructions on Kryterion’s Online Proctored Exam Support page.

How do I reschedule my certification exam?

If you need to reschedule your exam, and it's more than 24 hours from the start time, please login to your Webassessor account and reschedule. If you need to reschedule your exam, and it's within 24 hours of the start time, please contact Kryterion here.

Can I use my non-English keyboard as is, or do I have to switch to an English keyboard in order to take the Databricks certification exams?

Exams follow a multiple choice format. You may use any keyboard as-is. This exam is written in English.

Are Databricks certification exams available in languages other than English?

At this time, exams are only available in English.

When do awarded certificates associated with Databricks certification exams expire?

The certificate does not expire but it is tied to a particular Spark version. This naturally obsoletes over time as newer versions of Spark and corresponding exams are released.

How much does it cost to take Databricks certification exams?

$200 US. There are no free retakes.

How long will it take to get exam results?

Unofficial results are shared immediately at the conclusion of the exam. For passing scores, Databricks will review the proctor record and notify you when your certificate has been posted to your Databricks Academy account within 1 week.

How do I use a voucher to register for a Databricks certification exam?

Databricks will not issue retake vouchers for this exam. If you already have a voucher, begin the registration process as recommended. On the check out page, you may enter your voucher code. Click Submit. Then, Check Out.

What happens if I don’t pass the exam the first time?

You are welcome to re-register and retake the exam as many times as you would like. Each attempt costs $200. Databricks will not issue free retake vouchers for this exam.

Would it be possible to see the breakdown of sections missed so that I can work on those areas before retaking a Databricks certification exam?

To maintain exam integrity, Databricks cannot release specific information regarding missed questions.

Is there a list of test centers for the Databricks certification exams?

All exams will be administered online with a live proctor through our exam partner, Kryterion Global Testing Solutions.

Can I have a hard-copy of my certificate mailed to me?

No, but you can download and print your certificate from your Databricks Academy account.

Databricks Certified Associate Developer for Apache Spark

Is the Databricks Certified Associate Developer for Apache Spark exam open-book?

The documentation proctor will provide PDF versions of Spark documentation, as well as the associated API docs for Scala or Python.

How many questions are in the Databricks Certified Associate Developer with Apache Spark exam?

The exam consists of 60 multiple-choice questions. There are three main categories:

  • Spark Architecture: Conceptual understanding (~17%)
  • Spark Architecture: Applied understanding (~11%)
  • Spark DataFrame API Applications (~72%)

Does the Databricks Certified Associate Developer for Apache Spark exam require Databricks-specific knowledge?

No. Test-takers will be assessed on their understanding of basic Spark Architecture and knowledge of the Spark DataFrames API, especially as it can be used to accomplish routine Spark DataFrame tasks.

What programming languages can I use to complete the Databricks Certified Associate Developer for Apache Spark exam?

The exam is available in two languages: Scala and Python.

The two exams are nearly identical with the exception that each one expresses the nuances of their particular language - colons vs curly braces, for example. Any concepts unique to a particular language (e.g. Dataset's encoders and decoders or Python's vectorized UDFs) are excluded from this exam.

How do I prepare for the Databricks Certified Associate Developer for Apache Spark exam?

Test-takers should have a basic understanding of Spark architecture and be able to apply the Spark DataFrame API to complete individual data manipulation taste like selecting, renaming and manipulating columns; filtering, dropping, sorting and aggregating rows; joining, reading, and writing partitioning DataFrames; and working with UDFs and Spark SQL functions. Databricks expects that developers who have used the Spark DataFrame API for at least 6 months should be able to pass this certification exam.

You can also access a variety of formal training courses and/or reference materials including, but not limited to, the following:

  • Databricks Instructor-Led Training: Apache Spark™ Programming with Databricks
  • Recommended Reading: Spark: The Definitive Guide and Learning Spark

What is the difference between the Databricks Certified Associate Developer for Apache Spark 2.4 exam and the Databricks Certified Associate Developer for Apache Spark 3.0 exam?

These two exams are very similar in the concepts they assess due to the minimal changes between Spark 2.4 and Spark 3.0 in the scope of this exam.

The major change associated with the Spark 3.0 version of the exam is the inclusion of Adaptive Query Execution. For more information on AQE, please check on this Databricks blog post.

What is the difference between the 2019 Databricks Certified Associate for Apache Spark and the Databricks Certified Associate Developer for Apache Spark exam?

The current exam is meant to measure the same areas of comprehension and applied knowledge as the former. The format has been changed to multiple choice.

Does the Databricks Certified Associate Developer for Apache Spark exam replace the 2019 Databricks Certified Associate Developer exam?

Yes. This exam is meant to measure the same areas of comprehension and application of concepts. Databricks has changed the format to multiple choice.

Can I pass the Databricks Certified Associate Developer for Apache Spark exam with just Spark SQL?

No. This exam will test your knowledge of the Spark DataFrames API.

Who should I contact for a test voucher if I am owed a free or discounted exam attempt for the Databricks Certified Associate Developer for Apache Spark exam?

Please fill out a test voucher inquiry at this link.

Databricks Certified Associate ML Practitioner for Apache Spark

Is the Databricks Certified Associate ML Practitioner for Apache Spark exam open-book?

The proctor will provide PDFs of Spark Python API docs.

How many questions are in the Databricks Certified Associate Developer with Apache Spark exam?

The exam consists of 60 multiple-choice questions. There are three main categories:

  • General ML Concepts (25%)
  • Spark ML Library Structure and Concepts (25%)
  • Spark ML Applications (50%)

Does the Databricks Certified Associate ML Practitioner for Apache Spark require Databricks-specific knowledge?

No. Test-takers will be assessed on their understanding of the basic concepts of machine learning, their understanding of the format and structure of the Spark ML library, and their ability to complete tasks in the machine learning workflow using Spark ML.

What programming languages can I use to complete the Databricks Certified Associate ML Practitioner for Apache Spark exam?

This exam is available in Python only.

How do I prepare for the Databricks Certified Associate ML Practitioner for Apache Spark exam?

Test-takers should be able to apply the Spark ML library to complete individual tasks in the machine learning workflow and understand the structure and format of the Spark ML library. They should also have a basic understanding of basic machine learning concepts, including supervised vs. unsupervised learning, regression vs. classification, clustering, cross-validation, hyperparameter tuning, and model evaluation and selection. Databricks expects that data scientists and machine learning engineers who have used the Spark ML library for at least 6 months should be able to pass this certification exam.

You can access a variety of format training courses and/or reference materials including, but not limited to, the following:

  • Databricks Instructor-Led Training: Scalable Machine Learning with Apache Spark
  • Recommended Reading: Spark: The Definitive Guide

Databricks Certified Professional Data Scientist

Is the Databricks Certified Professional Data Scientist exam open-book?

No. Test-takers will not be allowed any resources while taking this exam.

How many questions are in the Databricks Certified Professional Data Scientist exam?

The exam consists of 60 multiple-choice questions. There are four main categories:

  • Machine Learning Basics (17%)
  • Machine Learning Lifecycle (42%)
  • Machine Learning Algorithms and Techniques (33%)
  • Machine Learning Model Management with MLflow (8%)

Does the Databricks Certified Professional Data Scientist exam require Databricks-specific or Spark-specific knowledge?

No. Test-takers will be assessed on their understanding of the basics of machine learning and data science, how to complete each stage of the machine learning lifecycle, how various machine learning algorithms and techniques are implemented, and how to manage machine learning models with MLflow.

No. Test-takers will not be assessed on any programming tasks. The exam assesses an in-depth conceptual understanding of data science and machine learning.

Does the Databricks Certified Professional Data Scientist exam require an understanding of programming or any specific machine learning or data science libraries?

No. Test-takers will not be assessed on any programming tasks. The exam assesses an in-depth conceptual understanding of data science and machine learning. A conceptual understanding of the MLflow product will be assessed.

What programming languages can I use to complete the Databricks Certified Professional Data Scientist exam?

This exam does not have a programming component so there are no associated programming languages.

How do I prepare for the Databricks Certified Professional Data Scientist exam?

The minimally qualified candidate should have a complete understanding of the basics of machine learning, including the bias-variance tradeoff, in-sample vs. out-of sample data, categories of machine learning, and applied statistics concepts; an intermediate understanding of the steps in the machine learning lifecycle, including data preparation, feature engineering, model training, selection, and production, and interpreting models; a complete understanding of basic machine learning algorithms and techniques, including linear, logistic, and regularized regression, tree-based models like decision trees, random forest and gradient boosted trees, unsupervised techniques like K-means and PCA, and specific algorithms like ALS for recommendation and isolation forests for outlier detection; and a complete understanding of the basics of machine learning model management like logging and model organization with MLflow.

You can access a variety of format training courses and/or reference materials including, but not limited to, the following:

  • Databricks Instructor-Led Training: Scalable Machine Learning with Databricks and Machine Learning in Production: MLflow and Model Deployment
  • Databricks Self-Paced Training: Machine Learning in Production: MLflow and Model Deployment
  • Recommended Reading: Learning Spark, Introduction to Statistical Learning and Elements of Statistical Learning