본문 바로가기

Coursera/Mathematics for ML and Data Science

Probability & Statistics for Machine Learning & Data Science (10)

728x90

Describing probability distributions and probability distributions with multiple variables

This week you will learn about different measures to describe probability distributions and any dataset. These include measures of central tendency (mean, median, and mode), variance, skewness, and kurtosis. The concept of the expected value of a random variable is introduced to help you understand each of these measures. You will also learn about some visual tools to describe data and distributions. In lesson 2, you will learn about the probability distribution of two or more random variables using concepts like joint, marginal, and conditional distributions. You will end the week by learning about covariance: a generalization of variance to two or more random variables.

Describing Distributions

Expected Value

Mean is the sum of all elements divided by the total number of elements.

We can think of the expected value as the same as the mean.

Mean = Expected value

You play a game with a friend. The game is as follows: You throw a fair coin. If it lands heads you win 10 dollars, otherwise you win nothing. What's the maximum amount you should be willing to pay to play this game?

더보기

$5

This is the average amount you'd win if you played the game many times.

01

What's the maximum amount you should be willing to play this new game, where you flip 3 coins and win a dollar for each heads-up get?

더보기

$1.50

This is the average or expected amount of money you'd win if you played the game many times.

012

Depending on the weights given (a probability of each case), the mean changes.

We use integrals to calculate and get the mean of the continuous random variables.

012

With the uniform distribution, the average is the middle of points a and b (two endpoints).

When the distribution is not uniform or symmetrical, the mean would not be in the middle.

Rather, the median will be the middle point.

Other measures of central tendency: median and mode

Mean doesn’t always work as the central point of the distribution due to outliers.

01

Outliers are the points that are way off from all the other points.

To overcome the issue with outliers, we use the median, which is the middle point of all the given data spread out in one line.

01

Mode is the number that shows up the most in the data or the maximum point in the distribution.

If we have multiple modes in the distribution, we call it multimodal distribution.

012
Symmetrical, asymmetrical binomial distributions and a normal distribution

Expected value of a Function

012

To get the expected value of a function, we multiply the probabilities of variables with functions of variables and sum them up.

So here with the function of squaring variables (on the last slide), we square the outputs and divide by 6 to get the expected value (mean).

012
Example of winning double the amount but paying $5 for every game

On the formula, we can group the constant and take out the constant from the expected value like on the last slide.

This becomes like the linear function (ax + b).

728x90

Sum of expectations

Game: You flip a coin. If heads, you win $1, otherwise you win nothing. Then you roll a dice. You win the amount you roll. What are your expected winnings for this game?

What is the expected number of correct assignments when we have 8 billion people and 8 billion unique names in the bag and hand out the names to everyone?

더보기

1

012
This slide explains why the answer is 1 with smaller examples

The sum of expectations is simply adding the average of each expectation and if everything is added, they add up to 1.

All the information provided is based on the Probability & Statistics for Machine Learning & Data Science | Coursera from DeepLearning.AI

728x90