Dataset: Election ================= This dataset contains voting results per county in the 2012 presidential election, along with various population statistics about the county. Not all counties are represented in this dataset. Two random, non-overlapping subsets were drawn from the full dataset to produce the training and the testing sets. :Dependent variable: The column ``VOTES_DIFF`` is the percentage point difference between votes for Obama compared to votes for Romney in a specific county. If Obama received 25% of votes and Romney 30% of votes, this value will be -5. :Predictor variables: The predictors are population statistics about the county: :``POPULATION``: Population in log10 (i.e. 4 means 10,000 people) :``INCOME``: Median annual income per household in $1000 :``AGE``: Median age :``UNEMPLOYMENT``: Unemployment rate :``BACHELORDEG``: Percentage of population with a bachelor's degree :``HOMEOWNERS``: Percentage of population that is homeowner :``FEMALE``: Percentage of population that is female :``65PLUS``: Percentage of population that is 65 or older :``LATINO``: Percentage of population that is Latino :``WHITE``: Percentage of population that is white :``BLACK``: Percentage of population that is black :File paths: :: data/election/training.csv data/election/testing.csv