Is Barack Obama up by double digits or is this race a dead heat? Is there an inconsistency between tightening national polls and yesterday’s battleground state polls, or is it no big deal? What exactly is happening with the polls all over the place?
Comparing polls these days is hardly a matter of comparing apples to apples. There are apples, oranges, and even pineapples. To say “no two polls are alike” in methodology is an understatement. The whole host of things that pollsters do to their data — weight it, screen it, use different samples and question forms — all can lead to a world where two polls that you’d expect to be within the margin of error of one another wind up all over the map.
First, there’s what pollsters do in sampling to get the data in the first place. Some use cell phones, some don’t. The methodology for handling cell users is newly emerging, and cell users are typically treated a bit differently than landline users, such as being offered incentives for their time. Then, if you’re looking at a poll that uses “likely voters,” you’re looking at a sample that has been based on a pollster’s assumption. Screening out respondents who are registered to vote but don’t fit a pollster’s definition of “likely voter” introduces a subjective filter to the data.
Since almost no pollsters reveal what their likely voter screener is, it’s hard for an outside observer to know whether to trust that pollster’s assumptions about the behavior or questions that can best identify likely voters. Sure, some voters are more likely to vote than others. But if you’re going to screen out folks who are eligible to vote because you don’t think they’re going to exercise their democratic right, readers should want to know what your rationale is for that screen.
There’s also sample size and dates of fielding. A survey that fields for two days is very different from a survey that fields for four in this political environment where the latest big political event immediately finds its way into households through cable news and the internet. Plus, smaller samples mean bigger margins of error.
SO BEFORE WE EVEN get to the survey itself or the data weighting, you’ve already got a pretty major bit of subjectivity and varying methodology that’s been thrown into the mix. Unless all pollsters are using the exact same sample size, fielding time, and model to determine “likely voter” behavior, you’re already looking at apples and oranges. (To its credit, CNN releases both likely voter and registered voter results. While we still don’t know how CNN screens for likely voters, it’s good that it provides both sets of data.)
Now, consider that pollsters have different ways of asking the ballot test question. In the statewide polls James Antle blogged about yesterday, the pollsters all take different approaches. The CNN polls ask respondents to choose between the tickets, naming both the presidential and vice presidential candidates. If a voter is undecided, he is then pushed to say what candidate he is leaning toward. In the Big Ten Battleground poll, undecided voters are allowed to remain undecided. Quinnipiac pushes the leaners as well and includes them in the totals, but the Quinnipiac question does not name the vice presidential candidates.
See? Three different polls that will all show up as “Obama/McCain Ballot Tests.” And all three questions are structured very differently.
Then there’s data weighting. What pollsters do with the data after they get it is important. When a pollster weights data, they do so with good reason — you are making sure that your sample is representative of the facts you know about the population you’re sampling. Most do this. But what pollsters weight varies. Some weight by party ID (I discuss this further here), while some don’t. Some allow partisan splits that are far, far, far outside the norm to remain because they treat party as a question response as fluid as “Who would you vote for?” Others treat partisanship the same way you treat race or gender, something that is much more stable in the electorate.
So after all that filtering, sampling, surveying, and weighting, it makes sense that you’d wind up with polls that are a bit all over the place. If you’re interested in keeping tabs on where the electorate is going, research the different polls, choose the ones with methodologies you trust, and if you can, try to evaluate them individually. But with the enormous amount of data swirling around out there as the election nears, you’ve got the ability to be a discriminating consumer. Rather than look at it as a curse of confusion, you can pick and choose the best and look to them to inform your pre-election predictions and analysis.