The margin of error in political polls

It's spring 2014 and that means it is only a matter of months before people the world over are buried under an avalanche of public polls, purporting to show that some thing, leader, or law is about to be heaved upon them by majority vote. From Slovakia, to the United States, to Indonesia, to Bangladesh, to Sweden, few populations will escape the year unscathed.

Typically, such polls are posed to a small subset (NNpopN \ll N_{pop}) of the population as a binary choice: "Do you support Party X or Party Y?", "Person 1 or Person 2?", "Are you for or against Issue A?". The results are used to infer the preference of the population at large, to within some margin of error, i.e. "Thing X is preferred by 56.3% of the population ±\pm 4.2%".

This raises some questions:

  • How is the population sampled?
  • What is the probability model?
  • Where does the ±\pm come from and how is it calculated?
  • How does the ±\pm depend on the number of people polled?

These issues are not trivial and can be difficult to get right. The Chicago Tribune famously blew the call on the United States Presidential election of 1948, calling the election for Thomas E. Dewey when in fact Harry Truman had won.

To the first question, a polling agency will attempt to sample the population at random by targeting a mixture of people that reflects as nearly as possible the known distribution of income, education, race, religion, etc. in the total population according to a census or some other survey.

Implicit in multiple choice polling questions is the assumption that, despite rich differences between people, each person can be approximated by their choice from a limited set of predetermined options. For simplicity, let's say the question is Q\mathbb{Q} and that people can respond in one of two ways, as above. If they're for one choice we count them in group A\mathbb{A} which has AA people in the full population. If they're for the other choice, they're in group B\mathbb{B} which has BB people.

By asking a total of NN people at random, we're effectively doing the same thing as when we pick random samples (with replacement) from a bag of colored marbles. Each randomly selected person has probability pA=AA+B\displaystyle p_A = \frac{A}{A+B} of having opinion A, and probability pB=BA+B=1pA\displaystyle p_B =\frac{B}{A+B} = 1-p_A of having opinion B. Therefore, the probability model is the binomial distribution.

The third and fourth questions are our objects of focus, which we'll discuss next.

Note by Josh Silverman
5 years, 5 months ago

No vote yet
1 vote

  Easy Math Editor

This discussion board is a place to discuss our Daily Challenges and the math and science related to those challenges. Explanations are more than just a solution — they should explain the steps and thinking strategies that you used to obtain the solution. Comments should further the discussion of math and science.

When posting on Brilliant:

  • Use the emojis to react to an explanation, whether you're congratulating a job well done , or just really confused .
  • Ask specific questions about the challenge or the steps in somebody's explanation. Well-posed questions can add a lot to the discussion, but posting "I don't understand!" doesn't help anyone.
  • Try to contribute something new to the discussion, whether it is an extension, generalization or other idea related to the challenge.
  • Stay on topic — we're all here to learn more about math and science, not to hear about your favorite get-rich-quick scheme or current world events.

MarkdownAppears as
*italics* or _italics_ italics
**bold** or __bold__ bold

- bulleted
- list

  • bulleted
  • list

1. numbered
2. list

  1. numbered
  2. list
Note: you must add a full line of space before and after lists for them to show up correctly
paragraph 1

paragraph 2

paragraph 1

paragraph 2

[example link](https://brilliant.org)example link
> This is a quote
This is a quote
    # I indented these lines
    # 4 spaces, and now they show
    # up as a code block.

    print "hello world"
# I indented these lines
# 4 spaces, and now they show
# up as a code block.

print "hello world"
MathAppears as
Remember to wrap math in \( ... \) or \[ ... \] to ensure proper formatting.
2 \times 3 2×3 2 \times 3
2^{34} 234 2^{34}
a_{i-1} ai1 a_{i-1}
\frac{2}{3} 23 \frac{2}{3}
\sqrt{2} 2 \sqrt{2}
\sum_{i=1}^3 i=13 \sum_{i=1}^3
\sin \theta sinθ \sin \theta
\boxed{123} 123 \boxed{123}

Comments

Sort by:

Top Newest

Hey Josh excuse me to comment here but I didn't find any different way to thank you because of the amazing notes you are doing in mechanics new section. I've started learnings Phisycs from them, because they're awesome

Jordi Bosch - 5 years ago

Log in to reply

You can delete this comment once you haver read it. It doesn't fit with the article

Jordi Bosch - 5 years ago

Log in to reply

Thank you Jordi. That is very encouraging news, and it makes me want to write even more.

Josh Silverman Staff - 5 years ago

Log in to reply

×

Problem Loading...

Note Loading...

Set Loading...