Standard Deviation
The standard deviation of a probability distribution, just like the variance of a probability distribution, is a measurement of the deviation in that probability distribution. It allows one to quantify how much the outcomes of a probability experiment tend to differ from the expected value.
Standard deviation is often used in the calculation of other statistics such as the -score and the coefficient of variation.
Contents
Properties of Standard Deviation
The standard deviation of a random variable , denoted as or , is defined as the square root of the variance.
The standard deviation of a random variable is
Equivalently, the standard deviation of a random variable is
What is the standard deviation of a fair six-sided die roll?
Let be the random variable that represents the result of a fair six-sided die roll.
Recall from another example that . Then,
By the properties of variance, we have the following properties of standard deviation:
For random variable and any constant , we have
For random variable and any constant , we have
Let be pairwise independent random variables. Then
Three fair six-sided dice are rolled, and the results are summed together.
What is the standard deviation of this result? Round your answer to three decimal places.
Gaussian Distribution and -Scores
The variance and standard deviation of a random variable intuitively measure the amount of spread, or dispersion of the random variable from the mean. This allows us to approach the following question: what is the probability that the value of the random variable is far from its expected value?
In the worked examples below, we will prove bounds on this probability for a general random variable using Chebyshev's inequality. In the special case of a random variable with a normal distribution (or Gaussian distribution) , we have the following:
If we consider the probability density function of , this shows that the area under the curve in the interval is approximately , the area under the curve in the interval is approximately , and the area under the curve in the interval is approximately . These probabilities, derived from the area under the curve of a normal distribution, hold true regardless of the value of or .
For a random variable, the -score, or standardized score, of a value is the number of standard deviations the value is from the mean of the data.
Let be a random variable. If is the expected value of , is the standard deviation of , and is a value that can take, then the -score of is
This gives a measure of how far the value is with respect to the mean value. If a point has value less than the expected value, the -score is negative. If the point has value greater than the expected value, the -score is positive.
Coefficient of Variation
While studying the spread of the random variable, we may want to take into account the magnitude of the expected value. The coefficient of variation is the ratio of the standard deviation to the expected value:
The coefficient of variation is a measurement of the amount of deviation in a probability distribution relative to the expected value.
Prove Markov's inequality: For any nonnegative random variable and positive constant ,
Consider the event . Then
Therefore, , implying
We now consider bounds for a general random variable (not necessarily nonnegative).
Prove Chebyshev's inequality: For a random variable with mean and standard deviation and for any positive constant
Consider the random variable . This is a nonnegative random variable, so we can apply Markov's inequality to obtain
Therefore,
Example Problems
In annual check-up, a standard fasting blood sugar level has a normal range of 75 to 115 mg/dL, claimed to cover approximately 95% of the world's population.
What is the cut-point integer value of high sugar level beyond percentile in an individual? (You don't have to be a doctor to diagnose diabetes.)
The probability generating function for the Poisson distribution is
Find its standard deviation.