Which CLI command allows the customized conda environment to be shared with co-workers?
You want to make your model more frugal to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method is appropriate to display the correlation between continuous and categorical features?
You want to create a user group for a team of external data science consultants. The consultants should only have the ability to see Data Science resource details but not have the ability to create, delete, or update Data Science resources. What verb should you write in the policy?
As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?
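One way to picture per-run parameters is a training script that reads its hyperparameters from environment variables, so the same job definition can be run again with different values. The sketch below is plain Python and does not call the OCI SDK; the variable names `LEARNING_RATE` and `MAX_DEPTH` and their defaults are hypothetical.

```python
import os


def read_hyperparameters(env=None):
    """Read hyperparameters from environment variables, with fallbacks.

    The variable names and default values are hypothetical, for illustration only.
    """
    env = os.environ if env is None else env
    return {
        "learning_rate": float(env.get("LEARNING_RATE", "0.1")),
        "max_depth": int(env.get("MAX_DEPTH", "3")),
    }
```

Passing a dictionary instead of relying on `os.environ` keeps the function easy to test locally before it is used in a job run.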
You have been given a collection of digital files required for a business audit. They consist of several different formats that you would like to annotate using Oracle Cloud Infrastructure (OCI) Data Labeling. Which THREE types of files could this tool annotate?
What is feature engineering in machine learning used for?
You are a data scientist leveraging Oracle Cloud Infrastructure (OCI) to create a model and need some additional Python libraries for processing genome sequencing data. Which of the following THREE statements are correct with respect to installing additional Python libraries to process the data?
You want to make your model more parsimonious to reduce the cost of collecting and processing data. You plan to do this by removing features that are highly correlated. You would like to create a heatmap that displays the correlation so that you can identify candidate features to remove. Which Accelerated Data Science (ADS) SDK method would be appropriate to display the correlation between Continuous and Categorical features?
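For intuition about what a continuous-versus-categorical heatmap cell can contain, one standard statistic for this pairing is the correlation ratio (eta). The sketch below is plain Python, not the ADS API; the function name and inputs are assumptions made for illustration.

```python
from collections import defaultdict


def correlation_ratio(categories, values):
    """Return eta in [0, 1]: how much of the spread in `values` the category explains."""
    groups = defaultdict(list)
    for cat, val in zip(categories, values):
        groups[cat].append(val)
    overall_mean = sum(values) / len(values)
    # Spread of the group means around the overall mean, weighted by group size.
    between = sum(
        len(g) * (sum(g) / len(g) - overall_mean) ** 2 for g in groups.values()
    )
    # Total spread of all values around the overall mean.
    total = sum((v - overall_mean) ** 2 for v in values)
    return 0.0 if total == 0 else (between / total) ** 0.5
```

A value near 1 means the categorical feature almost determines the continuous one, which makes one of the pair a candidate for removal; a value near 0 means the two are unrelated by this measure.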
Which type of file system does File Storage use?
The Oracle AutoML pipeline automates hyperparameter tuning by training the model with different parameters in parallel. You have created an instance of Oracle AutoML as oracle_automl and now you want an output with all the different trials performed by Oracle AutoML. Which of the following commands gives you the results of all trials?
Arrange the following in the correct Git repository workflow order:
Install, configure, and authenticate Git.
Configure SSH keys for the Git repository.
Create a local and remote Git repository.
Commit files to the local Git repository.
Push the commit to the remote Git repository.
You are attempting to save a model from a notebook session to the model catalog by using ADS SDK, with resource principal as the authentication signer, and you get a 404 authentication error. Which TWO should you look for to ensure permissions are set up correctly?
You have built a machine learning model to predict whether a bank customer is going to default on a loan. You want to use Local Interpretable Model-Agnostic Explanations (LIME) to understand a specific prediction. What is the key idea behind LIME?
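LIME is usually described as fitting a simple, interpretable model to the black-box model's behavior in the neighborhood of one prediction. The sketch below illustrates that idea for a single numeric input using a proximity-weighted straight line. It is not the `lime` library: the function name, the Gaussian kernel, and the fixed symmetric grid of perturbations (LIME itself samples perturbations randomly) are assumptions made for illustration.

```python
import math


def local_linear_surrogate(predict, x0, width=0.5, steps=21):
    """Fit a proximity-weighted line to `predict` near x0; return (slope, intercept)."""
    # Perturb the input on a symmetric grid around x0.
    offsets = [width * (2 * i / (steps - 1) - 1) for i in range(steps)]
    xs = [x0 + d for d in offsets]
    ys = [predict(x) for x in xs]
    # Gaussian kernel: perturbations closer to x0 get more weight.
    ws = [math.exp(-((d / width) ** 2)) for d in offsets]
    w_sum = sum(ws)
    x_mean = sum(w * x for w, x in zip(ws, xs)) / w_sum
    y_mean = sum(w * y for w, y in zip(ws, ys)) / w_sum
    # Weighted least squares for a single feature.
    cov = sum(w * (x - x_mean) * (y - y_mean) for w, x, y in zip(ws, xs, ys))
    var = sum(w * (x - x_mean) ** 2 for w, x in zip(ws, xs))
    slope = cov / var
    return slope, y_mean - slope * x_mean
```

For a hypothetical black box `lambda x: x * x`, the surrogate fitted at `x0 = 3.0` has a slope close to 6, which is the model's local sensitivity at that point even though the model as a whole is not linear.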
What is a conda environment?
You are a data scientist designing an air traffic control model, and you choose to leverage Oracle AutoML. You understand that the Oracle AutoML pipeline consists of multiple stages and automatically operates in a certain sequence. What is the correct sequence for the Oracle AutoML pipeline?
Which type of firewall is designed to protect against web application attacks, such as SQL injection and cross-site scripting?
Which is NOT a valid OCI Data Science notebook session approach?
Which technique can be used for feature engineering in the machine learning lifecycle?
You’re going to create an Oracle Cloud Infrastructure Anomaly Detection model for multivariate data. Where do you need to store the training data?
Where are OCI secrets stored?
Triggering a PagerDuty notification as part of Monitoring is an example of what in the OCI Console?
You want to build a multistep machine learning workflow by using the Oracle Cloud Infrastructure (OCI) Data Science Pipeline feature. How would you configure the conda environment to run a pipeline step?
Which components are a part of the OCI Identity and Access Management service?
Which function's objective is to represent the difference between the predictive value and the target value?
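A measure of this kind is generally called a loss (or cost) function. Mean squared error is one common example; the sketch below is a minimal plain-Python version, with a function name chosen for illustration.

```python
def mean_squared_error(targets, predictions):
    """Average squared difference between target and predicted values."""
    if len(targets) != len(predictions):
        raise ValueError("targets and predictions must have the same length")
    return sum((t - p) ** 2 for t, p in zip(targets, predictions)) / len(targets)
```

The value is zero when every prediction matches its target and grows as predictions move away from the targets.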
You are attempting to save a model from a notebook session to the model catalog by using the Accelerated Data Science (ADS) SDK, with resource principal as the authentication signer, and you get a 404 authentication error. Which two should you look for to ensure permissions are set up correctly?
Which is NOT a part of Observability and Management Services?
You are running a pipeline in the OCI Data Science service and want to override some of the pipeline's default settings. Which of the following statements about overriding pipeline defaults is true?
You realize that your model deployment is about to reach its utilization limit. What would you do to avoid the issue before requests start to fail? Pick THREE.
Which of these protects customer data at rest and in transit in a way that allows customers to meet their security and compliance requirements for cryptographic algorithms and key management?
Six months ago you created and deployed a model that predicts customer churn for a call center. Initially, it was yielding quality predictions. However, over the last two months, users have been questioning the credibility of the predictions. Which TWO methods would you employ to verify accuracy and lower customer churn?
As a data scientist, you are working on a global health dataset that has data from more than 50 countries. You want to encode three features, 'countries', 'race', and 'body organ', as categories. Which option would you use to encode the categorical features?
Where do calls to stdout and stderr from score.py go in a model deployment?
You are preparing a configuration object necessary to create a Data Flow application. Which THREE parameter values should you provide?
You are a researcher who requires access to large datasets. Which OCI service would you use?
What is the primary difference between a data scientist and a data engineer?
Which statement about Oracle Cloud Infrastructure Multi-Factor Authentication (MFA) is NOT valid?
You have trained three different models on your dataset using Oracle AutoML. You want to visualize the behavior of each of the models, including the baseline model, on the test set. Which class should be used from the Accelerated Data Science (ADS) SDK to visually compare the models?
You realize that your model deployment is about to reach its utilization limit. Which THREE steps would you perform to avoid the issue before requests start to fail?
You want to write a program that performs document analysis tasks such as extracting text and tables from a document. Which Oracle AI service would you use?
Which statement best describes Oracle Cloud Infrastructure Data Science Jobs?
In machine learning, what is the primary difference between supervised and unsupervised learning?
What is the minimum active storage duration before logs used by Logging Analytics can be archived?
While working with Git on Oracle Cloud Infrastructure (OCI) Data Science, you notice that two of the operations are taking more time than the others due to your slow internet speed. Which TWO operations would experience the delay?
You have trained a binary classifier for a loan application and saved this model into the model catalog. A colleague wants to examine the model, and you need to share the model with your colleague. From the model catalog, which model artifacts can be shared?
You are a data scientist using Oracle AutoML to produce a model, and you are evaluating the score metric for the model. Which TWO of the following metrics would you use to evaluate a multiclass classification model?
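Multiclass scores are typically built by computing a per-class metric and then averaging it across classes. As one illustration, the sketch below computes macro-averaged recall in plain Python; it is not the AutoML or ADS API, and the function name is an assumption.

```python
def macro_recall(y_true, y_pred):
    """Mean of per-class recall, giving every class equal weight."""
    classes = sorted(set(y_true))
    recalls = []
    for c in classes:
        # Predictions made for the samples whose true label is c.
        relevant = [p for t, p in zip(y_true, y_pred) if t == c]
        recalls.append(sum(1 for p in relevant if p == c) / len(relevant))
    return sum(recalls) / len(recalls)
```

Because each class contributes equally, a rare class that the model handles badly lowers the score as much as a common one would.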
A bike-sharing platform has collected user commute data for the past 3 years. To increase profitability and draw useful inferences, a machine learning model needs to be built from the accumulated data. Which of the following options has the correct order of the required machine learning tasks for building a model?
Which step is a part of the AutoML pipeline?