New Year Special 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: save70

New Release Databricks-Machine-Learning-Professional ML Data Scientist Questions

Databricks Certified Machine Learning Professional Questions and Answers

Question 13

A data scientist wants to remove the star_rating column from the Delta table at the location path. To do this, they need to load in data and drop the star_rating column.

Which of the following code blocks accomplishes this task?

Options:

A.

spark.read.format(“delta”).load(path).drop(“star_rating”)

B.

spark.read.format(“delta”).table(path).drop(“star_rating”)

C.

Delta tables cannot be modified

D.

spark.read.table(path).drop(“star_rating”)

E.

spark.sql(“SELECT * EXCEPT star_rating FROM path”)

Question 14

Which of the following machine learning model deployment paradigms is the most common for machine learning projects?

Options:

A.

On-device

B.

Streaming

C.

Real-time

D.

Batch

E.

None of these deployments

Question 15

A machine learning engineering team wants to build a continuous pipeline for data preparation of a machine learning application. The team would like the data to be fully processed and made ready for inference in a series of equal-sized batches.

Which of the following tools can be used to provide this type of continuous processing?

Options:

A.

Spark UDFs

B.

[Structured Streaming

C.

MLflow

D Delta Lake

D.

AutoML

Question 16

A data scientist has developed and logged a scikit-learn random forest model model, and then they ended their Spark session and terminated their cluster. After starting a new cluster, they want to review the feature_importances_ of the original model object.

Which of the following lines of code can be used to restore the model object so that feature_importances_ is available?

Options:

A.

mlflow.load_model(model_uri)

B.

client.list_artifacts(run_id)["feature-importances.csv"]

C.

mlflow.sklearn.load_model(model_uri)

D.

This can only be viewed in the MLflow Experiments UI

E.

client.pyfunc.load_model(model_uri)