## NPTEL Data Science For Engineers Week 7 Assignment Answers 2023

**1. Which among the following is not a type of cross-validation technique?**

LOOCV

k-fold cross-validation

Validation set approach

Bias-variance trade-off
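As a rough illustration of what a cross-validation technique does, the sketch below runs k-fold cross-validation by hand in base R. The built-in `mtcars` data, the model `mpg ~ wt`, and the choice `k = 5` are assumptions made only for this example and are not part of the assignment.

```r
# Minimal k-fold cross-validation sketch (assumed data and model, for illustration only)
set.seed(1)                      # make the random fold assignment reproducible
k <- 5                           # number of folds (assumption)
n <- nrow(mtcars)

# Assign each row to one of k folds at random, keeping fold sizes nearly equal
fold_id <- sample(rep(seq_len(k), length.out = n))

fold_mse <- numeric(k)
for (i in seq_len(k)) {
  train <- mtcars[fold_id != i, ]   # fit on the other k - 1 folds
  test  <- mtcars[fold_id == i, ]   # evaluate on the held-out fold
  fit   <- lm(mpg ~ wt, data = train)
  pred  <- predict(fit, newdata = test)
  fold_mse[i] <- mean((test$mpg - pred)^2)
}

cv_mse <- mean(fold_mse)            # cross-validated estimate of the test MSE
```

Setting `k` equal to `n` turns the same loop into LOOCV, while a single fixed train/test split corresponds to the validation set approach.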

**2. Which among the following is a classification problem?**

Predicting the average rainfall in a given month.

Predicting whether a patient is diagnosed with a disease or not.

Predicting the price of a house.

Predicting whether it will rain or not tomorrow.

**3. Find the accuracy of the model.**

0.95

0.55

0.45

0.88

**4. Find the sensitivity of the model.**

0.95

0.55

1

0.88
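The confusion matrix that questions 3 and 4 refer to is not reproduced here, so the sketch below only shows how accuracy and sensitivity are computed from the four cell counts. The counts `tp`, `fn`, `fp`, and `tn` are hypothetical and are not the values from the assignment.

```r
# Hypothetical confusion-matrix counts (NOT the assignment's values)
tp <- 40  # true positives:  actual positive, predicted positive
fn <- 10  # false negatives: actual positive, predicted negative
fp <- 5   # false positives: actual negative, predicted positive
tn <- 45  # true negatives:  actual negative, predicted negative

# Accuracy: share of all predictions that are correct
accuracy <- (tp + tn) / (tp + tn + fp + fn)

# Sensitivity (recall, true positive rate): share of actual positives that were caught
sensitivity <- tp / (tp + fn)

# Specificity (true negative rate), shown for comparison
specificity <- tn / (tn + fp)
```

Substituting the counts from the actual confusion matrix into the same three expressions gives the quantities the questions ask for.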

**5. Under the ‘family’ parameter of the glm() function, which one of the following distributions corresponds to logistic regression for a variable with binary output?**

Binomial

Gaussian

Gamma

Poisson
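For context, here is a minimal sketch of fitting a logistic regression with `glm()`. The built-in `mtcars` data and the formula `am ~ wt` (a 0/1 transmission indicator modelled on car weight) are assumptions chosen for illustration and do not come from the assignment.

```r
# Logistic regression sketch: binary response, so the family is binomial
# (mtcars and the formula am ~ wt are assumed for illustration)
fit <- glm(am ~ wt, data = mtcars, family = binomial)

# Predicted probabilities that am == 1 for each row
prob <- predict(fit, type = "response")

# Turn probabilities into class labels with a 0.5 cut-off (the threshold is a choice)
pred_class <- ifelse(prob > 0.5, 1, 0)

# Cross-tabulate actual against predicted classes
conf_mat <- table(actual = mtcars$am, predicted = pred_class)
```

With `family = binomial`, `glm()` uses the logit link by default, which is what makes the fit a logistic regression.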

**6. What is the dimension of the dataframe?**

(150, 5)

(150, 4)

(50, 5)

None of the above

**7. What can you say about the distribution of the independent variables in the dataframe?**

The variables Sepal Length and Sepal Width are not normally distributed

All the variables are normally distributed

The variable Petal Length alone is normally distributed

None of the above

**8. How many rows in the dataset contain missing values?**

10

5

25

0
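The dataframe that questions 6 to 8 refer to is not shown here. As a sketch of how such checks are usually carried out, the code below uses R's built-in `iris` data as a stand-in, so its results may differ from those for the assignment's own file.

```r
# Stand-in data: R's built-in iris (an assumption; the assignment's dataframe may differ)
data(iris)

# Dimension of the dataframe as c(rows, columns)
dims <- dim(iris)

# Number of rows that contain at least one missing value
n_missing_rows <- sum(!complete.cases(iris))

# One common normality check: Shapiro-Wilk test on each numeric column.
# A small p-value is evidence against normality.
num_cols <- iris[sapply(iris, is.numeric)]
shapiro_p <- sapply(num_cols, function(x) shapiro.test(x)$p.value)
```

Histograms or Q-Q plots (`hist()`, `qqnorm()`) are a common visual complement to the test.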

**9. Which of the following code blocks can be used to summarize the data (finding the mean of the columns PetalLength and PetalWidth), similar to the one given below.**

lapply(irisdata[, 3:4], mean)

sapply(irisdata[, 3:4], 2, mean)

apply(irisdata[, 3:4], 2, mean)

apply(irisdata[, 3:4], 1, mean)
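To see how the options differ, the sketch below runs the apply-family calls on columns 3 and 4 of R's built-in `iris` data. Using `iris` in place of `irisdata` is an assumption, and its columns are named `Petal.Length` and `Petal.Width` rather than `PetalLength` and `PetalWidth`.

```r
# Stand-in for irisdata: built-in iris, columns 3:4 are the petal measurements
data(iris)
petal <- iris[, 3:4]

# lapply: applies mean to each column and returns a list
col_means_list <- lapply(petal, mean)

# sapply: same computation, simplified to a named numeric vector
col_means_vec <- sapply(petal, mean)

# apply with MARGIN = 2: works column-wise, one mean per column
col_means_apply <- apply(petal, 2, mean)

# apply with MARGIN = 1: works row-wise, one mean per row (not a column summary)
row_means <- apply(petal, 1, mean)

# Note: sapply(petal, 2, mean) passes 2 where a function is expected, so it fails
```

The MARGIN argument belongs to `apply()` only. `lapply()` and `sapply()` always iterate over the elements of their input, which for a dataframe are its columns.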

**10. What can be interpreted from the plot shown below?**

Sepal widths of Versicolor flowers are less than 3 cm.

Sepal lengths of Setosa flowers are less than 6 cm.

Sepal lengths of Virginica flowers are greater than 6 cm.

Sepals of Setosa flowers are relatively wider than those of Versicolor flowers.

