The central limit theorem, or CLT for short, is absolutely vital to statistics, so it'll crop up many times throughout our course.
In a nutshell, the CLT says that the sum of a large number of random draws is roughly distributed like a bell curve.
This is a casual version of the true CLT, but it's catchy and suits our needs well enough.
In this quiz, we'll unpack this statement and uncover the intuitive ideas at the heart of the CLT.
Without further ado, let's begin building our CLT intuition with a trip to the casino...
Many fortunes have been lost at the tables of The Gambler's Ruin casino, but Marvin isn't thinking about that: he's too intent on having some fun and, hopefully, winning some cash.
Fortunately for him, his wiser and savvier friend Zhang Wei tags along to make sure Marvin doesn't gamble away all of his savings.
The pair descend onto the main floor of The Gambler's Ruin and immerse themselves in a cacophony of sounds and a galaxy of flashing lights.
Zhang Wei guides his friend quickly past the rows of slot machines and roulette wheels to a game called "Roll of the Dice," where even gullible Marvin may just have a shot at winning...
"The game is simple," explains the croupier. "You pick a number, and then wager that the dice will roll that value."
The croupier looks Marvin up and down, judges him to be a bit of a rube, and decides to go easy on him at first.
"We'll start with a single die, so you can bet on a roll of $1, 2, 3, 4, 5,$ or $6.$ What's your bet?"
Marvin pauses, scratches his head, and gives serious consideration to the options before him.
If the die is fair, what's Marvin's best strategy for winning?
Zhang Wei stands off to the side and watches as Marvin makes his first bet... and loses.
"Tough break," says the croupier with a hint of mock sympathy. "Better luck next time! Want to try a more exciting game?"
Marvin nods and listens as the croupier explains the rules. "This time, I throw two dice and you bet on the total value of the two rolls. So, if you bet on and one die comes up and the other comes up you win; otherwise, you lose your wager. Got it?"
Marvin nods again and thinks over his options. He turns to Zhang Wei and says " and are two of my lucky numbers. What do you think? Should I bet on or "
What should Zhang Wei say? Assume the dice are fair and the rolls are independent.
Hint: A throw of the dice can be represented as a pair of integers, so the sample space is $\{(i, j) : 1 \le i, j \le 6\}.$
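The hint can be checked by brute force. The sketch below is only an illustration: it assumes two fair six-sided dice, and the names `SAMPLE_SPACE`, `outcomes_for`, and `probability` are made up for this example. It lists every ordered pair and counts how many of them add up to each total.

```python
from fractions import Fraction
from itertools import product

# Every ordered pair (first die, second die); 36 outcomes in total.
# Assumption: both dice are fair and six-sided, as in the question.
SAMPLE_SPACE = list(product(range(1, 7), repeat=2))

def outcomes_for(total):
    """Return the ordered pairs whose faces add up to `total`."""
    return [pair for pair in SAMPLE_SPACE if sum(pair) == total]

def probability(total):
    """Probability of `total`, assuming all pairs are equally likely."""
    return Fraction(len(outcomes_for(total)), len(SAMPLE_SPACE))

# Print each possible total with the pairs that produce it.
for total in range(2, 13):
    print(total, outcomes_for(total), probability(total))
```

To compare two candidate bets, compare the lengths of the two lists: the total with more pairs is the better wager.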
Marvin takes Zhang Wei's advice, bets on and ends up winning!
The croupier smiles, gives a few words of encouragement, and then invites Marvin to up the challenge by betting on the total roll of three dice.
He's just about through pondering his choices when he feels a tap on his shoulder. Marvin turns and sees Zhang Wei holding out his phone to him. On the screen is the following interactive plot:
Zhang Wei explains that the height of each bar in this histogram is the number of ways three dice can roll a total equal to the integer below the bar.
For example, there's only one way to roll a namely but there are different ways of rolling an that's why its bar is so much higher than 's.
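A rough text version of such a histogram can be produced by counting outcomes directly. This is a sketch under the assumption of three fair six-sided dice; `sum_counts` is a hypothetical helper name, not something taken from the interactive plot.

```python
from collections import Counter
from itertools import product

def sum_counts(num_dice, faces=6):
    """Count how many ordered rolls of `num_dice` dice give each total."""
    rolls = product(range(1, faces + 1), repeat=num_dice)
    return Counter(sum(roll) for roll in rolls)

counts = sum_counts(3)

# One row per total; the bar length is the number of ways to roll it.
for total in sorted(counts):
    print(f"{total:2d} {'#' * counts[total]}")
```

The longest rows sit in the middle of the range, which is where a bet has the most outcomes in its favor.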
Given Zhang Wei's histogram and the assumption that the dice are all fair and the rolls are independent, how should Marvin bet?
The croupier adds one more die to the roll after every bet to make the game more fun, but she's really just making it harder for Marvin to win by adding more possible outcomes.
Zhang Wei suspected she'd do this.
Fortunately, he came prepared: the plot he shares with Marvin has a slider labeled $n$ for the number of dice used in a roll (see below).
Zhang Wei adjusts the scale by dividing the heights of the histogram bars by $6^n,$ so the plot displays the probability distribution for the sum total of a roll of $n$ fair independent dice. Change the value of $n$ and study the shape of the distributions as you do. What do you notice?
To recap, if $n$ fair dice are rolled independently, there's a uniform probability distribution on the sample space, which consists of ordered lists of length $n$ with entries $1, 2, 3, 4, 5,$ or $6.$
Since there are $6$ choices for each entry in a list, there are $6^n$ lists, so each list has probability $1/6^n.$ The smallest possible roll value is $n,$ corresponding to the single outcome $(1, 1, \ldots, 1);$ the largest possible value is $6n,$ corresponding to the outcome $(6, 6, \ldots, 6).$
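Listing every outcome gets slow as dice are added, so the sketch below builds the distribution of the total one die at a time instead. It assumes fair, independent six-sided dice; the function name `sum_distribution` and its parameters `n` and `faces` are illustrative choices. Exact fractions are used so the probabilities can be checked without rounding.

```python
from fractions import Fraction

def sum_distribution(n, faces=6):
    """Return {total: probability} for the sum of n fair dice."""
    dist = {0: Fraction(1)}  # with zero dice, the total is 0 for certain
    for _ in range(n):
        new = {}
        for total, p in dist.items():
            # Adding one die shifts the total by each face with equal chance.
            for face in range(1, faces + 1):
                new[total + face] = new.get(total + face, Fraction(0)) + p / faces
        dist = new
    return dist

dist = sum_distribution(4)
print(min(dist), max(dist))  # smallest and largest possible totals
print(dist[min(dist)])       # probability of the all-ones roll
```

Calling `sum_distribution` with a few different values of `n` and comparing the results is one way to mimic moving the slider.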
The roll totals near the center of the range can be achieved by far more dice roll outcomes than those at the ends; that's why the distribution is peaked there. As we move inwards away from the extreme ends of the range of possible roll totals, the distribution grows symmetrically about the middle of the range where the peak sits.
In short, for the distribution is distinctly bell-shaped: