Month End Sale Limited Time Flat 70% Discount offer - Ends in 0d 00h 00m 00s - Coupon code: 70spcl

EMC D-DS-FN-23 Dell Data Science Foundations Exam Practice Test

Page: 1 / 6
Total 59 questions

Dell Data Science Foundations Questions and Answers

Question 1

In association rules, given items X and Y, what does lift measure?

Options:

A.

Percentage of transactions that contain an itemset with X

B.

Percentage of transactions with Xthat also contain Y

C.

Difference in the probability ofX and Y appearing together compared with expectations as if they were statistically independent

D.

How many times more often X and Y occur together than expected if they were statistically independent, expressed as a ratio

Question 2

Which Hadoop service responds to requests for compute and memory resources?

Options:

A.

Application Manager

B.

DataNode

C.

Scheduler

D.

Application Master

Question 3

Question # 3

Refer to the exhibit, which shows pairwise counts for items purchased together.

Consider the following association rule: Milk -> Eggs

What is value of the lift?

Options:

A.

1.18

B.

0.264

C.

120

D.

70.81

Question 4

In time series analysis, what function is examined to identify the order of the autoregressive component of an ARIMA model?

Options:

A.

Logistic function

B.

Lognormal distribution function

C.

Partial autocorrelation function

D.

Normal distribution function

Question 5

What is the purpose of applying the naïve Bayes conditional independence assumption?

Options:

A.

To simplify the probability calculations

B.

To calculate the probability of rare events

C.

To minimize rounding errors in probability calculations

D.

To accurately calculate each probability

Question 6

In a user-defined aggregate function, what is FFUNC?

Options:

A.

Optional final calculation function

B.

Window function

C.

State transition function

D.

Segment-level calculation function

Question 7

Which visualization technique should be avoided?

Options:

A.

Using a small number of contrasting colors to draw distinctions

B.

Using tables of numbers to present all of the data visually

C.

Achieving a high data-ink ratio

D.

Using visuals to illustrate key points

Question 8

What data asset is an example of quasi-structured data?

Options:

A.

Excel file

B.

Clickstream data

C.

Relational database table

D.

Comma-separated value file

Question 9

Which R function plots a distribution of a single variable along two different axes?

Options:

A.

table()

B.

summaryQ

C.

density ()

D.

rug()

Question 10

Which SQL set operator returns rows that exist in the first SELECT statement answer set but not in the second SELECT statement?

Options:

A.

EXCEPT

B.

UNION

C.

UNION ALL

D.

INTERSECT

Question 11

What is the similarity between the matrix and array data structures in R?

Options:

A.

Both structures can contain only integers

B.

Both structures can only contain one data type

C.

Both structures can store multiple data types

D.

Both structures must be 2-dimensional

Question 12

Which analytic technique would be appropriate to estimate home sale price in U.S. dollars as a function of square footage, number of bedrooms, and lot size?

Options:

A.

Time series analysis

B.

Linear regression

C.

Naive Bayesian classification

D.

K-means clustering

Question 13

What does “MAD” in MADlib stand for?

Options:

A.

Magnetic Association Design

B.

Magnetic Agile Deep

C.

Multiple Agile Development

D.

Multiple Access Design

Question 14

What is a recommended use case for regular expressions?

Options:

A.

Linear regression

B.

Decision trees

C.

Logistic regression

D.

In-database text analysis

Question 15

What is part of the model output for a linear regression?

Options:

A.

The assignment of each input datum to a cluster

B.

Coefficients indicating relative impact of the input variables on the outcome

C.

The set of all rules X -> Y with minimum support and confidence

D.

Probability score for each possible class label

Question 16

What characterizes the Hadoop Distributed File System?

Options:

A.

Peer to peer system designed to run on custom designed hardware

B.

Peer to peer system designed to run on commodity hardware

C.

Master/ slave system designed to run on custom designed hardware

D.

Master/ slave system designed to run on commodity hardware

Question 17

What are three built-in data types in the R programming language?

Options:

A.

Boolean, integer, and character

B.

Boolean, table, and character

C.

Boolean, table, and integer

D.

List, array, and integer

Page: 1 / 6
Total 59 questions