Definition

Is a complete description of the probabilities associated within the sample space .

Ex: Rolling a six-sided die 🎲

For a fair die and , the probability distribution is

Types of Distributions

Marginal (unconditional) probability distributions

, the probability of event , is also sometimes called the unconditional probability, or marginal probability of

Joint Probability Distributions

The joint probability of two events and is the probability that both events occur, which is .

Conditional Probability Distributions

The conditional probability , which is the probability of event given that event has occurred, is given by:

Ex: With a six-sided die 🎲

Probability of rolling at least a :

Probability of an even roll:

The conditional probability that a roll is at least given that the roll is even:

The conditional probability that a roll is even given that it is at least 3 is:

Bernoulli Distribution

A distribution based on the idea of Binary Variables (Bernoulli Variables|Binary/Bernoulli%20Variables). Literally is a way of mapping the two categorical values to a function/number.

Look at Coding categorical variables (indicator variables|Coding%20categorical%20variables).

Multiplication Rule

Comes directly from the conditional probability formula:

and

Ex: Product returns

Suppose a website sells three types of shirts, each with different return rates:

  • Shirt 1: 60% of overall shirt sales, 15% of purchases returned
  • Shirt 2: 30% of overall shirt sales, 25% of purchases returned
  • Shirt 3: 10% of overall shirt sales, 35% of purchases returned

For a randomly selected shirt purchase, what is the probability
that the purchase is shirt 1 and is returned?

Law of Total Probability

If are disjoint events and also exhaustive events, then for any event :

Binomial Distribution

Probability of successes out of trials.

Geometric Distribution

Number of trials to get the first success.

If is the number of trials to get the first success with , then is a Geometric Random Variable with parameter , or

We solve this using the Geometric Series.

Memoryless Property

Basically, that the number of times it took to get a first success remains the same even after a failure.

So, note that:

Normal Distribution

A normal random variable with mean parameter and variance parameter , denoted , has a probability density function (pdf) given by:

and for all .

Special Properties of the Normal Distribution

  • is symmetric around , with population mean
  • The maximum value of the pdf occurs at , with the pdf strictly increasing to the left of , and strictly decreasing to the right of .
  • The two parameters (mean) and (variance) completely describe the normal distribution

Cumulative Density Function (CDF) of the Normal Distribution

The cumulative density function (CDF) of the normal random variable is given by: