New Year Special 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: save70

Last Attempt Databricks-Certified-Associate-Developer-for-Apache-Spark-3.0 Questions

Databricks Certified Associate Developer for Apache Spark 3.0 Exam Questions and Answers

Question 25

Which of the following code blocks shuffles DataFrame transactionsDf, which has 8 partitions, so that it has 10 partitions?

Options:

A.

transactionsDf.repartition(transactionsDf.getNumPartitions()+2)

B.

transactionsDf.repartition(transactionsDf.rdd.getNumPartitions()+2)

C.

transactionsDf.coalesce(10)

D.

transactionsDf.coalesce(transactionsDf.getNumPartitions()+2)

E.

transactionsDf.repartition(transactionsDf._partitions+2)

Question 26

Which of the following code blocks returns a DataFrame with approximately 1,000 rows from the 10,000-row DataFrame itemsDf, without any duplicates, returning the same rows even if the code

block is run twice?

Options:

A.

itemsDf.sampleBy("row", fractions={0: 0.1}, seed=82371)

B.

itemsDf.sample(fraction=0.1, seed=87238)

C.

itemsDf.sample(fraction=1000, seed=98263)

D.

itemsDf.sample(withReplacement=True, fraction=0.1, seed=23536)

E.

itemsDf.sample(fraction=0.1)

Question 27

Which of the following describes the conversion of a computational query into an execution plan in Spark?

Options:

A.

Spark uses the catalog to resolve the optimized logical plan.

B.

The catalog assigns specific resources to the optimized memory plan.

C.

The executed physical plan depends on a cost optimization from a previous stage.

D.

Depending on whether DataFrame API or SQL API are used, the physical plan may differ.

E.

The catalog assigns specific resources to the physical plan.