Files and links for a series of lessons introducing the Python data science stack.
-
NYC ER ... file was downloaded (and lightly cleaned) from the NYC Health Department's EpiQuery on 4 Jun 2020: https://a816-health.nyc.gov/hdi/epiquery/visualizations?PageType=ps&PopulationSource=Syndromic
-
PandasJumble0.csv, PandasJumble1.csv, and PandasJumble2.csv were added for practice with select, sort, and groupby.
-
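As a rough illustration of the three operations these files are meant for, here is a minimal sketch. The table and its column names (`city`, `year`, `visits`) are made up for the example and are not the contents of the PandasJumble files; swap in `pd.read_csv("PandasJumble0.csv")` and the real column names to practice on the actual data.

```python
import pandas as pd

# Hypothetical stand-in data; the real PandasJumble columns may differ.
df = pd.DataFrame({
    "city": ["Bronx", "Queens", "Bronx", "Queens", "Brooklyn"],
    "year": [2019, 2019, 2020, 2020, 2020],
    "visits": [120, 95, 150, 80, 110],
})

# Select: keep only the 2020 rows and two of the columns.
selected = df.loc[df["year"] == 2020, ["city", "visits"]]

# Sort: order those rows from most to fewest visits.
sorted_df = selected.sort_values("visits", ascending=False)

# Groupby: total visits per city across all years.
totals = df.groupby("city")["visits"].sum()

print(sorted_df)
print(totals)
```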
coldmed.csv ... 5 years of Google Trends data for 'cold medicine', added for comparison with patterns in the NYC ER data.
-
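One possible way to make that comparison is to roll daily ER counts up to weeks, join them to the weekly search-interest series, and look at the correlation. The sketch below uses invented numbers and invented column names (`date`, `visits`, `week`, `interest`); neither the values nor the layout are taken from coldmed.csv or the NYC ER file, and correlation is only one of several reasonable comparisons.

```python
import pandas as pd

# Hypothetical daily ER counts; the real file's columns may differ.
days = pd.date_range("2020-01-05", periods=28, freq="D")  # starts on a Sunday
er = pd.DataFrame({"date": days, "visits": range(100, 128)})

# Hypothetical weekly search-interest scores, dated by the start of each week.
trends = pd.DataFrame({
    "week": pd.date_range("2020-01-05", periods=4, freq="7D"),
    "interest": [40, 45, 55, 60],
})

# Label each day with the first day of its 7-day block, then sum per week.
offset_days = (er["date"] - days[0]).dt.days % 7
er["week"] = er["date"] - pd.to_timedelta(offset_days, unit="D")
weekly_er = er.groupby("week", as_index=False)["visits"].sum()

# Join on the week label and measure how closely the two series move together.
merged = weekly_er.merge(trends, on="week")
corr = merged["visits"].corr(merged["interest"])

print(merged)
print(corr)
```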
resume.csv is Kosuke Imai's abridgement of the data from Marianne Bertrand and Sendhil Mullainathan's "Are Emily and Greg More Employable Than Lakisha and Jamal?", American Economic Review, vol. 94 (2004), pp. 991-1013. Imai's repo is at https://github.com/kosukeimai/qss/tree/master/CAUSALITY
-
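A typical first look at data of this kind is the callback rate per group. The sketch below assumes a `race` column and a 0/1 `call` column, which should be checked against the actual header of resume.csv; the six rows are invented for illustration and are not drawn from the study.

```python
import pandas as pd

# Hypothetical rows shaped like an audit-study table; NOT the real data.
# Assumed columns: 'race' (group label) and 'call' (1 = callback, 0 = none).
resume = pd.DataFrame({
    "race": ["black", "white", "black", "white", "white", "black"],
    "call": [0, 1, 0, 0, 1, 1],
})

# The mean of a 0/1 column is the share of rows equal to 1.
callback_rate = resume.groupby("race")["call"].mean()

# Cross-tabulate the number of callbacks and non-callbacks per group.
counts = pd.crosstab(resume["race"], resume["call"])

print(callback_rate)
print(counts)
```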
resume_city_eoe.csv is a 4-column abridgement of the Bertrand and Mullainathan data, obtained from https://dev.openicpsr.org/openicpsr/project/108486/version/V1/view;jsessionid=D57159F04E26C7F92F83177737DDEDA6