Data Wrangling Resources

Resources

– Home Mortgage Disclosure Act example from class

John Canny’s slides on Data Cleaning and Integration

Detaled discussion of Levenshtein Distance

– Useful book: Big Data and Social Science: A Practical Guide to Methods and Tools (Chapman & Hall/CRC Statistics in the Social and Behavioral Sciences) by Foster, Ghani et al. On reserve in Regenstein for the quarter.