Predicting United States policy outcomes with Random Forests

by Shawn McGuire, et al.

Two decades of U.S. government legislative outcomes, as well as the policy preferences of rich people, the general population, and diverse interest groups, were captured in a detailed dataset curated and analyzed by Gilens, Page et al. (2014). They found that the preferences of the rich correlated strongly with policy outcomes, while the preferences of the general population did not, except via a linkage with rich people's preferences. Their analysis applied the tools of classical statistical inference, in particular logistic regression. In this paper we analyze the Gilens dataset using the complementary tools of Random Forest classifiers (RFs), from machine learning. We present two primary findings, concerning prediction and inference respectively: (i) Holdout test sets can be predicted with approximately 70% accuracy by models that consult only the preferences of rich people and a small number of powerful interest groups, as well as policy area labels. These results include retrodiction, where models trained on pre-1997 cases predicted "future" (post-1997) cases. The 20% [...] detailed but noisy dataset, indicates the high importance of a few wealthy players in U.S. policy outcomes, and aligns with a body of research indicating that the U.S. government has significant plutocratic tendencies. (ii) The feature selection methods of RF models identify especially salient subsets of interest groups (economic players). These can be used to further investigate the dynamics of governmental policy making, and also offer an example of the potential value of RF feature selection methods for inference on datasets such as this.
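
The workflow described above (train an RF on earlier cases, predict later ones, then rank features) can be illustrated with a minimal sketch. This is not the authors' code and does not use the Gilens dataset: the data are synthetic, the feature names, year range, and outcome rule are hypothetical, and balanced accuracy is an assumed choice of metric. It uses scikit-learn's `RandomForestClassifier` with its built-in impurity-based `feature_importances_`.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import balanced_accuracy_score

# Synthetic stand-in data (assumption): NOT the Gilens dataset.
rng = np.random.default_rng(0)
n_cases = 1000
# Hypothetical feature names, chosen only for illustration.
feature_names = ["pref_affluent", "pref_median", "group_a", "group_b", "group_c"]
X = rng.uniform(0.0, 1.0, size=(n_cases, len(feature_names)))
# Hypothetical year attached to each policy case.
year = rng.integers(1981, 2003, size=n_cases)

# Assumed toy rule: the outcome depends on two of the five features plus noise.
signal = 2.0 * X[:, 0] + 1.0 * X[:, 2] - 1.5
y = (signal + rng.normal(0.0, 0.5, size=n_cases) > 0).astype(int)

# Retrodiction-style split: fit on pre-1997 cases, evaluate on the rest.
train = year < 1997
test = ~train

model = RandomForestClassifier(n_estimators=200, random_state=0)
model.fit(X[train], y[train])

# Score the held-out "future" cases.
score = balanced_accuracy_score(y[test], model.predict(X[test]))

# Rank features by the forest's impurity-based importances, highest first.
ranking = sorted(
    zip(feature_names, model.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)

print(f"balanced accuracy on held-out cases: {score:.3f}")
for name, importance in ranking:
    print(f"{name}: {importance:.3f}")
```

Impurity-based importances are only one of several ranking options; permutation importance on the held-out cases is a common alternative when features are correlated.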


