Discrete Random Variables
Introduction
A discrete random variable is a function from the sample space to a set of possible values .
Each of those possible values has a probability of occurring depending on the mappings from the sample space.
Probability mass function
This is the function that determines the probability of a discrete random variable having a certain value. The equation used to calculate this value will differ depending on the type of random variable.
This can also be used to determine the distribution of probabilities for each possible value of .
Expected value
In any given situation, we are able to calculate the value of a discrete random variable that is expected. This value is like a weighted average.
It is given by the equation below:
It is important to note that the above will change depending on the type of random variable you are working with. In other words, be careful when expanding .
There is also a case where we can be asked to find the expected value of a function of :
A moment of a discrete random variable is the expected value of a given power of . The moment of a discrete random variable is defined as:
Stirling’s approximation
This is a method of approximating factorials:
Where the symbol means that the ratio of the two quantities tends to 1 as tends to infinity.
Joint distributions
So far we have examined cases with the possible values of one random variable. Now we need to think about what to do when we want to know the probability of two random variables having certain values.
If these two random variables are independent then:
Otherwise, we need to look at the marginal distributions which are essentially the probability mass function of each value of a random variable laid out in a table.
For example, let’s take a case where we have 2 red pens, 1 green pen and 1 blue pen. Two pens are picked at random without replacement. Let be the number of red pens chosen and be the number of green pens chosen. We can calculate the marginal distributions as:
These can then be joined into a joint distribution table:
Some properties of joint distributions include:
-
-
-
Sum of the product of x,y,p for each element in the joint table.
You can also use the following to calculate joint distribution table values:
This works both ways:
Variance & covariance
The variance of a discrete random variable is a number which defines how much the possible values of differ from each other. It can be calculated using:
The covariance, on the other hand, determines how much the possible values of two random variables and differ from each other. It can be calculated using:
As a side note, the standard deviation can be calculated by:
Independent variables
By definition, two random variables are independent if:
An implication of this, using the equivalencies in the section above, is:
However, be aware that this statement only works in one logical direction. Covariance being zero does not implicate anything. You can prove that two random variables are not independent if: