Dataset: Election

This dataset contains per-county voting results for the 2012 U.S. presidential election, along with population statistics for each county.

Not all counties are represented in this dataset. Two random, non-overlapping subsets were drawn from the full dataset to produce the training and the testing sets.

Dependent variable:
 

The column VOTES_DIFF is the difference, in percentage points, between the share of votes for Obama and the share of votes for Romney in a given county (Obama minus Romney). For example, if Obama received 25% of the votes and Romney 30%, the value is -5.
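The sign convention can be checked with a short sketch that reproduces the example above. The function name `votes_diff` is hypothetical and only illustrates the arithmetic; it is not part of the dataset.

```python
def votes_diff(obama_pct: float, romney_pct: float) -> float:
    """Return Obama's vote share minus Romney's, in percentage points."""
    return obama_pct - romney_pct


# The example from the text: Obama 25%, Romney 30%.
print(votes_diff(25.0, 30.0))  # -5.0
```

A negative value therefore means Romney led in that county, and a positive value means Obama led.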

Predictor variables:
 

The predictors are population statistics about the county:

POPULATION: Population on a log10 scale (e.g. 4 means 10,000 people)
INCOME: Median annual household income, in thousands of dollars
AGE: Median age
UNEMPLOYMENT: Unemployment rate
BACHELORDEG: Percentage of the population with a bachelor's degree
HOMEOWNERS: Percentage of the population who are homeowners
FEMALE: Percentage of the population that is female
65PLUS: Percentage of the population aged 65 or older
LATINO: Percentage of the population that is Latino
WHITE: Percentage of the population that is white
BLACK: Percentage of the population that is black

File paths:
data/election/training.csv
data/election/testing.csv
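
One way to read these files is sketched below using only the Python standard library. It assumes each file is comma-separated with a header row whose column names match the variable names listed above; that layout is not stated here, so treat it as an assumption. The helper names `load_election` and `population_count` are hypothetical.

```python
import csv


def load_election(path):
    """Read one election CSV file into a list of dicts.

    Assumes a comma-separated file with a header row (an assumption,
    not something the dataset description confirms). Numeric fields
    are converted to float; anything else is kept unchanged.
    """
    rows = []
    with open(path, newline="") as handle:
        for record in csv.DictReader(handle):
            row = {}
            for name, value in record.items():
                try:
                    row[name] = float(value)
                except (TypeError, ValueError):
                    row[name] = value  # non-numeric or missing field
            rows.append(row)
    return rows


def population_count(log10_population):
    """Undo the log10 scaling of POPULATION (4 means 10,000 people)."""
    return 10 ** log10_population
```

With the files in place, `load_election("data/election/training.csv")` would return one dict per county, and `population_count(row["POPULATION"])` would give the approximate head count for a row.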