Discrete Random Variables - Joint Probability Distribution
The joint probability distribution of two random variables is a function describing the probability of pairs of values occurring. For instance, consider a random variable \(X\) that represents the number of heads in a single coin flip, and a random variable \(Y\) that represents the number of heads in a different single coin flip. Then the joint distribution of \(X\) and \(Y\) is
\[\text{Pr}(X=x, Y=y) = \begin{cases} \frac{1}{4} & x = 0, y = 0 \\ \frac{1}{4} & x = 0, y = 1 \\ \frac{1}{4} & x = 1, y = 0 \\ \frac{1}{4} & x = 1, y = 1 \end{cases} \]
Contents
Formal definition
The joint distribution of two random variables \(X,Y\) is given by
\[\text{Pr}[X=x \text{ and } Y=y] = \text{Pr}(Y = y | X = x)\text{Pr}(X = x) = \text{Pr}(X = x | Y = y)\text{Pr}(Y = y)\]
where \(P(Y = y | X = x)\) is the probability that \(Y = y\) given that \(X = x\). When \(X,Y\) are independent, this value is equal to \(P(Y = y)\) (as the fact that \(X = x\) is irrelevant), and
\[\text{Pr}[X = x, Y = y] = \text{Pr}(X=x)\text{Pr}(Y=y)\]
This also allows the joint distribution to be used as an independence test: if the above relation holds for all \(x,y\), then \(X\) and \(Y\) are independent. Otherwise, they are not.
Notably, this relation immediately rearranges to Bayes' theorem, which is very important in conditional probability.
Suppose that the probability that a random person has a certain disease is \(0.005.\) A scientist develops a device which tests a person positive for the disease with \(95\)% chance when the person really has the disease. However, the same device tests a person positive for the disease with \(1\)% chance when the person in fact does not have the disease. What is the probability that a person really has the disease when tested positive by the device?
Total probability
If \(X\) takes on the value \(x\), \(Y\) must take on some value, and so
\[\text{Pr}[X = x] = \sum_y \text{Pr}[X = x, Y = y] = \sum_y \text{Pr}[X = x|Y = y]\text{Pr}[Y = y]\]
which is related to the law of iterated expectation.