Databricks Related Exams
Databricks-Certified-Professional-Data-Engineer Exam
When evaluating the Ganglia Metrics for a given cluster with 3 executor nodes, which indicator would signal proper utilization of the VM's resources?
Which statement describes the correct use of pyspark.sql.functions.broadcast?
A production cluster has 3 executor nodes and uses the same virtual machine type for the driver and executor.
When evaluating the Ganglia Metrics for this cluster, which indicator would signal a bottleneck caused by code executing on the driver?