본문 바로가기

Coursera/Mathematics for ML and Data Science

Probability & Statistics for Machine Learning & Data Science (14)

728x90

Describing probability distributions and probability distributions with multiple variables

Quiz

728x90

Q1

Consider the four sets of samples above. Which one has the smallest variance?

$$ \begin{array}{c|cccc}\hline \text { Set } & \text { Values } & & & \\ \hline 1 & 1 & 5 & 7 & 9 \\2 & -20 & -10 & 0 & 10 \\3 & 100 & 101 & 102 & 103 \\4 & -10 & -5 & 0 & -5 \\\hline\end{array} $$

  1. 1
  2. 2
  3. 3
  4. 4

Answer

더보기

3

The variance measures how much a sample is spread. We can easily look at all the samples and check that this one has the smallest spread.

Q2

Consider two games, Game A and Game B, each with different probability distributions of winnings and losses. Game A has a probability of 1/3 to win $2 and a probability of 2/3 to lose $1. Game B has a probability of 1/2 to win $0.50, a probability of 1/4 to lose $0.50, a probability of 1/8 to win $5, and a probability of 1/8 to lose $2.

Which of the following statements is true?

  1. Both Game A and Game B have the same kurtosis.
  2. Game A's kurtosis is smaller than Game B's kurtosis.
  3. Game B's kurtosis is smaller than Game A's kurtosis.

Answer

더보기

2

Kurtosis measures the shape and thickness of the rails of a probability distribution. A larger kurtosis indicates thicker tails and more extreme values. In this case, Game Y has a larger kurtosis because it has thicker tails due to the presence of extreme values (winning $5 or losing $2 with small probabilities).

Consider the following independent random variables:

$$ X \sim \text{Normal}(3,1)\\ Y \sim \text{Normal}(2,2) $$

Then $Z=X+Y \sim \text{Normal}(\mu,\sigma)$, where $\mu, \sigma$ are equal to:

  1. $\mu=\sqrt5, \sigma=\sqrt3$
  2. $\mu=5, \sigma=\sqrt5$
  3. $\mu=5, \sigma=\sqrt3$
  4. $\mu=5, \sigma=5$

Hint

더보기

Remember the formula for a sum of independent Gaussians, $\mu Z = \mu X + \mu Y$, and $\sigma Z = \sqrt{\sigma^2X + \sigma^2Y}$

Answer

Q4

Consider the following box plot for the test scores of two classes, A and B:

Which of the following statements is true?

  1. Class B's interquartile range (IQR) is larger than Class A's interquartile range.
  2. Class B's median score is higher than Class A's median score.
  3. Class A's interquartile range (IQR) is larger than Class B's interquartile range.
  4. Class A's median score is higher than Class B's median score.

Answer

더보기

2

Looking at the box plot, we can see that the median of Class A is around 75, while the median of Class B is around 85.

3

The rectangle in A is bigger than B.

Q5

Consider the following QQ plot for a set of data:

Which of the following statements is true?

  1. The data has a lower variance than a normal distribution.
  2. The data is not normally distributed.
  3. The data looks normally distributed.
  4. The data has a higher variance than a normal distribution.

Hint

더보기

The variance of the data cannot be determined from the QQ plot.

Answer

더보기

3

The QQ plot compares the observed data with the theoretical quantiles of a normal distribution. If the points lie close to the diagonal line, then the data is likely normally distributed.

All the information provided is based on the Probability & Statistics for Machine Learning & Data Science | Coursera from DeepLearning.AI

728x90