In which two ways can you improve data durability in Oracle Cloud Infrastructure Object Storage?
You are a data scientist; you use the Oracle Cloud Infrastructure (OCI) Language service to train custom models. Which types of custom models can be trained?
Which is NOT a valid OCI Data Science notebook session approach?
You have an embarrassingly parallel or distributed batch job with a large amount of data running using Data Science Jobs. What would be the best approach to run the workload?
Which architecture is based on the principle of “never trust, always verify”?
During a job run, you receive an error message that no space is left on your disk device. To solve the problem, you must increase the size of the job storage. What would be the most efficient way to do this with Data Science Jobs?
As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?
Which statement about Oracle Cloud Infrastructure Data Science Jobs is true?
As a data scientist, you are tasked with creating a model training job that is expected to take different hyperparameter values on every run. What is the most efficient way to set those parameters with Oracle Data Science Jobs?
Which OCI service provides a managed Kubernetes service for deploying, scaling, and managing containerized applications?
Which feature of Oracle Cloud Infrastructure Data Science provides an interactive coding environment for building and training machine learning models?
Which CLI command allows the customized conda environment to be shared with co-workers?
Which OCI service enables you to build, train, and deploy machine learning models in the cloud?
You want to write a Python script to create a collection of different projects for your data science team. Which Oracle Cloud Infrastructure (OCI) Data Science interface would you use?
You have received machine learning model training code, without clear information about the optimal shape to run the training. How would you proceed to identify the optimal compute shape for your model training that provides a balanced cost and processing time?
True or false? Bias is a common problem in data science applications.
You have received machine learning model training code, without clear information about the optimal shape to run the training on. How would you proceed to identify the optimal compute shape for your model training that provides a balanced cost and processing time?
Which function's objective is to represent the difference between the predictive value and the target value?
Select two reasons why it is important to rotate encryption keys when using Oracle Cloud Infrastructure (OCI) Vault to store credentials or other secrets.
What is the name of the machine learning library used in Apache Spark?
What is a common maxim about data scientists?
You are a data scientist trying to load data into your notebook session. You understand that Accelerated Data Science (ADS) SDK supports loading various data formats. Which of the following THREE are ADS-supported data formats?
What is feature engineering in machine learning used for?
Arrange the following in the correct Git Repository workflow order:
Install, configure, and authenticate Git.
Configure SSH keys for the Git repository.
Create a local and remote Git repository.
Commit files to the local Git repository.
Push the commit to the remote Git repository.
What is a conda environment?
You want to evaluate the relationship between feature values and target variables. You have a large number of observations having a near uniform distribution and the features are highly correlated. Which model explanation technique should you choose?
Which encryption is used for Oracle Data Science?
Which Security Zone policy is NOT valid?
Which of the following analytical and statistical techniques do data scientists commonly use?
Which statement is true about origin management in Web Application Firewall (WAF)?
Which THREE types of data are used for Data Labeling?
Which model has an open-source, open model format that allows you to run machine learning models on different platforms?
Which statement about Oracle Cloud Infrastructure Anomaly Detection is true?
For your next data science project, you need access to public geospatial images. Which Oracle Cloud service provides free access to those images?
As a data scientist, you require a pipeline to train ML models. When can a pipeline run be initiated?
Which Oracle Cloud Infrastructure (OCI) Data Science policy is invalid?
How are datasets exported in the OCI Data Labeling service?
Which of the following best describes the principal goal of data science?
You are building a model and need input that represents data as morning, afternoon, or evening. However, the data contains a timestamp. What part of the Data Science lifecycle would you be in when creating the new variable?
Which of the following TWO non-open source JupyterLab extensions has Oracle Cloud Infrastructure (OCI) Data Science developed and added to the notebook session experience?
Using Oracle AutoML, you are tuning hyperparameters on a supported model class and have specified a time budget. AutoML terminates computation once the time budget is exhausted. What would you expect AutoML to return in case the time budget is exhausted before hyperparameter tuning is completed?
As a data scientist, you use the Oracle Cloud Infrastructure (OCI) Language service to train custommodels. Which types of custom models can be trained?
You are a researcher who requires access to large datasets. Which OCI service would you use?
You are using Oracle Cloud Infrastructure (OCI) Anomaly Detection to train a model to detect anomalies in pump sensor data. How does the required False Alarm Probability setting affect an anomaly detection model?
You are working as a data scientist for a healthcare company. They decide to analyze the data to find patterns in a large volume of electronic medical records. You are asked to build a PySpark solution to analyze these records in a JupyterLab notebook. What is the order of recommended stepsto develop a PySpark application in Oracle Cloud Infrastructure (OCI) Data Science?
You have been given a collection of digital files required for a business audit. They consist of several different formats that you would like to annotate using Oracle Cloud Infrastructure (OCI) Data Labeling. Which THREE types of files could this tool annotate?
You are preparing a configuration object necessary to create a Data Flow application. Which THREE parameter values should you provide?