2020 Presidential Election Prediction

As November 3rd is less than two days away, in this blog post I create a final prediction for the 2020 U.S. presidential election based off of state winners.

A sneak peek of what’s ahead:

The Model

My model is based off of Abromowitz’s time for change model and adapted for the state level. Abromowitz’s three explanatory variables are Q2 GDP growth, national net approval ratings for the president, and incumbency. While extremely parsimonious and highly accurate, the model does not show vote choice disaggregated by state.

One of my explanatory variables is state-level poll estimates instead of national presidential net approval ratings. The figure below shows the trends in Trump’s average net approval ratings, calculated by taking the average of the difference between his approval and non-approval ratings, and nation-wide poll estimates for his share of the 2-party vote come November 3rd. As you can see, the troughs and peaks in both charts follow each other quite similarly and indicate that poll estimates and national approval ratings approximate around the same thing, which is vote choice.

Unlike national approval ratings, however, poll estimates can further be disaggregated by state. This data was downloaded from 538. The figure below shows how poll estimates have changed from the end of February until the most recent model date, October 30th. Disaggregating by state is important for two reasons: 1) The nature of the winner-takes-all electoral college system means we should be very tuned in to state partisan leanings rather than overall national popular vote share. Exhibit A: The 2000 and 2016 elections, in which the elected President was not the candidate with the majority popular vote share overall. 2) State partisan leanings have implications for down-ballot races as well; if Texas sees a Trump victory by 2 points, for example, down-ballot races are likely quite competitive as well given that Texas has been reliably red until 2016.

Parameters

Fundamental indicators are a strong predictor of elections. Firstly, voters balance sociotropic and individual-level concerns on the state of society and what that means for their immediate communities. Economic data, at least in national aggregate, is consistent and widely accessible. Voters cast their ballots based off of retrospective assessments of presidential performance, and the economy is good proxy for this retrospection process because its effects are felt immediately in terms of inflation or increased levels of real disposable income, as well as covered extensively by the media and touted by political figures as resume-building credentials. Because of the human difficulty it takes to accurately recall economic performance over a 4-year span, voters overwhelmingly tend to measure the entire 4-year economic performance based off of the last two years, especially the last two quarters of election year (Healy & Lenz, 2013).

Another piece of the puzzle is incumbency. While voters have no underlying preference for incumbents once other factors are controlled for, the reality is that their time in office confers benefits to their election (Brown, 2014). In fact, only 3 sitting presidents- Ford, Carter, and George H.W. Bush- have lost their re-election campaigns in the post-war era. I believe the main mechanism incumbency confers advantages to candidates is through the financial aspect. Indeed, residents reward sitting presidents for spending in their area, presumably because the spending indicates future prosperity under the President (Kriner & Reeves, 2012). I hypothesize that adding an incumbency interaction term to the fundamentals variable will result in a stronger model.

While only 3 incumbent presidents have lost in the 18 post-WWII elections, 11 incumbent parties have lost. This may indicate voters’ preferences for a change of pace, as described by Abromowitz. I also believe that this reflects the voter psyche of retrospective voting, as failures and negative events that accrue over a 4-year term may cause voters to simply get tired of the incumbent party. In examining the impact of shocks on presidential elections, it is found that voters still punish/reward incumbents for politically exogenous shocks that are endogenized through government interaction with the issue, such as natural disaster response (Healy & Malholtra, 2010). Simply by being an incumbent, a candidate is held responsible for factors that may have originated exogenously. Voters may tire of incumbent party platforms for this reason.

Regarding fundamentals, I test two indicators: GDP growth and RDI growth, to determine which can be a better predictor. I avoid looking at stock movements, inflation, or unemployment, as models that regress 2-party vote share against those singular variables find weak R-squared values (respectively, they are 0.08, 0.09, and 0.02) relative to GDP and RDI growth. GDP and RDI growth are better measures than absolute GDP and RDI levels, as the latter do not tell us whether the incumbent simply inherited a booming economy or a failing one. Additionally, I test Q2 and Q3 to determine whether an average of the two periods or Q3 alone is a better indicator based off of Healy & Lenz’s findings that other quarters are weak predictors.

As mentioned previously, poll estimates closely follow presidential approval ratings, which have a strong correlation with the incumbent’s 2-party vote share. Therefore, I use the most recent state-level polling data there is available for 2 reasons. The first is that polls tend to converge towards the actual outcome (Gelman & King, 1993). The second is that, given the United States’s electoral college system, disaggregating by state is of utmost importance because a national popular vote win could very well fail to reflect an electoral college win.

Next, I examine the models’ in- and out-of-sample predictive power and examine the coefficients on my model of choice.

Validation

Seeing the success of Abromowitz’s parsimonious linear regression model, I tested several variants of the time for change model and gathered their in-sample R-squared fit values and out-of-sample cross-validation R-squared values. First, I chose to use Q3 economic indicators rather than indicators averaged over Q2 and Q3 because every single model formula variant performed better when using Q3 data rather than averaged Q2 and Q3 economic indicators. This was a surprising finding, as I previously believed that taking the average could protect our model performance by mitigating unusually poor or good economic performance. The formulas I used can be found below. For mod2, for example, the adjusted R-squared was 0.800 when solely using the Q3 RDI growth indicator (5-fold cross-validation R-squared was .795), whereas the adjusted R-squared was .788 when using averaged Q2 and Q3 RDI growth (5-fold cross-validation R-squared was .786). When performing cross-validation, the mean squared errors’ standard errors were similar. For these reasons, the following models will be tested using only Q3 indicators.

mod1 <- formula(D_pv2p ~ avg_poll + GDP_growth_qt + incumbent)
mod2 <- formula(D_pv2p ~ avg_poll + RDI_growth + incumbent)
mod3 <- formula(D_pv2p ~ poll_2party + RDI_growth + incumbent)
mod4 <- formula(D_pv2p ~ poll_2party + RDI_growth + incumbent + RDI_growth:incumbent)
mod5 <- formula(D_pv2p ~ poll_2party + RDI_growth + incumbent + RDI_growth:incumbent + incumbent_party)

Indeed, when performing 5-fold cross-validation, mod5 was the best performer with a mean squared error of 4.14 and R-squared of .834, compared to the worst performer mod1 with 4.50 MSE and .796 R-squared. The MSE is useful because extreme values are punished more in out-of-sample model evaluations, which is especially pertinent for 2020 since it may indeed yield a more extreme prediction.

Now that I’ve tested my model for in-sample and out-of-sample fit, it is time to examine the coefficients.

Coefficients

2-party Democratic vote share
Poll estimate (scaled for 2 parties)0.902***
(0.019)
Q3 RDI growth-2.665
(21.534)
Incumbency1.807***
(0.612)
Party incumbency-1.532***
(0.513)
Interaction: Q3 RDI growth and incumbency-126.799**
(50.234)
Constant4.654***
(0.974)
Observations510
Adjusted R20.820
Residual Std. Error4.255 (df = 504)
Note:*p<0.1; **p<0.05; ***p<0.01

For every point increase in polls for the Democratic candidate, the predicted 2-party vote share increases on average because of a positive coefficient. My incumbency and party incumbency hypotheses are reflected in the positive coefficients for each of these parameters. However, the negative coefficient with a high standard error for Q3 RDI growth, and the large negative term for the interaction parameter is concerning. I add controls for state by creating dummy variables for each state, which still yields a similarly large standard error for the negative coefficient of RDI growth. I will avoid using this 54-parameter model due to overfitting concerns.

I will proceed with model 5 with an important tweak and a caveat. I acknowledge that the predictive uncertainty will be impacted by the large standard error of the interaction term. Therefore, I will create two separate models: one for core states (solidly partisan states) and one for swing states. My finalized model details are below for the core state model (using mod5) and the swing state model (made as parsimonious as possible with dummy variables for each swing state).

core state model: D_pv2p ~ poll_2party + RDI_growth + incumbent + RDI_growth:incumbent + incumbent_party
swing state model: D_pv2p ~ poll_2party + state + incumbent + RDI_growth
2-party Democratic vote share
Core state vote shareSwing state vote share
Poll estimates0.970***0.446***
(0.018)(0.050)
Florida1.149
(1.895)
Georgia2.379
(1.907)
Iowa3.597*
(1.876)
Michigan4.179**
(1.878)
Minnesota5.709***
(1.865)
Nevada1.974
(1.944)
New Hampshire0.062
(1.949)
North Carolina0.796
(1.820)
Ohio1.965
(1.827)
Pennsylvania3.821**
(1.873)
Texas-3.625*
(1.904)
Wisconsin4.261**
(1.965)
RDI growth21.433-160.748***
(21.948)(36.494)
Incumbency1.617**0.731
(0.625)(0.785)
Party incumbency-1.501***
(0.520)
Interaction of RDI growth and incumbency-132.769**
(51.994)
Constant1.40124.847***
(0.928)(2.653)
Observations375135
R20.8870.633
Adjusted R20.8860.587
Residual Std. Error3.730 (df = 369)4.116 (df = 119)
Note:*p<0.1; **p<0.05; ***p<0.01

Let’s talk core states first. For every point increase in polls for the Democratic candidate, the predicted 2-party vote share increases by 0.97 points on average. As expected, incumbency also provides a boost; all else held equal, an incumbent has a 1.62 point advantage over their opponent on average. Again, confirming my hypothesis, party incumbency is a disadvantage; all else held equal, a candidate whose party was previously in power sees a 1.5 point disadvantage on average. While the RDI growth error is still large, the coefficient makes substantive sense as it is positive; for every 1% increase in Q3 RDI growth, there is on average a 20-point increase in vote share (again caveating with the large standard error). The average residual standard error is 3.73 for this model, which is an improvement from the previous unseparated model.

Looking at the swing state model, it is evident that this model has less predictive power due to its lower R-squared and higher residual standard error. Relative to the unseparated model, however, the average error magnitude is still lower. Note that the constant is 24 versus the constant in the core states model, which is 1. Even though the state dummy variables’ coefficients are positive, it does not necessarily mean that each swing state leans Democratic; rather, we can only draw comparisons about their relative partisan leanings.

Next, I will apply these models to a 2020 prediction.

The Prediction

I obtained updated state-level polling data from 538. I also obtained updated Q3 economic data from the Bureau of Economic Analysis. The final point predictions are shown below in a map; it is highly optimistic for Biden, at 442 electoral votes, versus Trump’s 96.

I think that this is not unlikely, albeit unprecedented. The Trump administration has handled the coronavirus pandemic markedly poorer compared to other countries. The death toll is continuously rising, and his rhetorical style is increasingly divisive even among Republicans (CNN).

That being said, point estimates are not as helpful as viewing the entire predictive interval. We have the standard error of each out-of-sample prediction, so I constructed a 95% confidence interval by taking the point prediction and adding/subtracting 1.96 * stderr. The confidence intervals do not tell us that we can expect Biden or Trump to win with 95% probability; rather, it merely describes that a projected 95% of the time, the Democratic vote share will fall within a certain range. I mapped out the values that fell outside of this 95% confidence interval below. If the lower bound was above 50% vote share, the state is colored blue; if the upper bound was below 50% vote share, the state is colored red. States that cover 50% vote share within the 95% confidence interval are colored purple. Note that D.C. was not included due to dropping from data joins.

Acknowledging the Unknown

There are many reasons that 2020 is unique; a dampened ground game, turnout uncertainty, and differential mobilization across different demographics will all play a role. My model is optimistic for Biden’s win (especially my point estimates, which are extremely and almost improbably optimistic. But that’s why we don’t simply just look at point estimates). Various factors - such as the Trump admin’s handling of coronavirus, foreign relations, “shocks” and major events such as the presidential and vice presidential debates, Trump’s tax returns, and John Bolton’s tell-all - only hurt Trump.

I would like to acknowledge two circumstances that might help Trump’s re-election chances: differences in demographic voting behavior, and differences in voter turnout across partisan lines due to the pandemic and state voting laws. Looking at differences in demographic effects, last week I played around with a hypothetical scenario that Black communitise affected disproportionately by the coronavirus would hurt Trump, because Black voters would punish Trump for this endogenized “exogenous” shock. On the flip side, it could very well be that certain demographics - such as less educated rural white voters or Latino men - will vote at unexpectedly high levels. This is an important consideration, but I choose not to model demographic changes in my model due to its sensitivity to small changes. Demographic voting blocs’ behavior is further obscured by the second consideration, which is voter turnout.

Regarding turnout, one possible scenario that helps Trump would be that the dramatic increase in mail-in ballots and absentee voting might disproportionately affect Democrat voters’ ability to vote. This could be the case, and there are already thorny instances regarding what ballots will get counted. Certainly, a limitation of my model is its inability to speak to this issue. An assumption my model makes is that turnout effects may simply cancel each other out, as certain groups are galvanized to vote more whereas others experience structural barriers to vote. For example, youth are voting at unprecedentedly high rates and early voting exit polls show a Democratic lean; however, strict voting laws may suppress the Democratic vote, as Democrats are more likely to engage in voting by mail than Republicans (Thomspon, 2020).

This may not be the case. I will be back in the coming weeks with the benefit of hindsight to evaluate how my prediction did. See you on the other side of the election!

You can find the replication scripts for graphics included in this week’s blog here. You can find the necessary data here.