1.7 Data Used in the Course
acs
Data from the 2019 American Community Survey about Education and Earnings
Access: Course Website
Description: Course Website
Airq
Data about air quality in California counties in 1972.
Access: `data(Airq, package=“Ecdat”)
Description:
?Ecdat::Airq
AJR
Data about GDP, institutions, and settler mortality
Access: `data(AJR, package=“hdm”)
Description:
?hdm::AJR
banks
Data about bank closures in Mississippi during the Great Depression
Access: Course Website
Description: Course Website
Birthweight_Smoking
Data about infant birthweights and mother’s smoking from PA 1989
Access: Course Website
Description: Mark Watson’s website
Caschool
School-level test score data from California in 1998-1999
Access:
data(Caschool, package="Ecdat")
Description:
?Ecdat::Caschool
diamond_train
Data about diamond prices. The full version of this data I got from Kaggle, and then I split it into training and testing data.
Access: Course Website
Description: A description of each column in the data is available under the
Description
tab on Kaggle
diamond_test
Out of sample version of
diamond_train
dataAccess: Course Website
Description: A description of each column in the data is available under the
Description
tab on Kaggle
Fair
Individual-level data about affairs in the United States
Access:
data(Fair, package="Ecdat")
Description:
?Ecdat::Fair
Fatalities
State-level panel data about drunk driving laws and traffic fatalities
Access: `data(Fatality, package=“AER”)
Description:
?AER::Fatality
fertilizer_2000
Country-level data about fertilizer and crop yields from the year 2000. See description of
fertilizer_panel
below for more detailsAccess: Course Website
Description: Course Website
fertilizer_panel
Country-level panel data from 1965-2000 (every 5 years) about fertilizer and crop yields for 68 developing countries. This data is a smaller version of the data used in McArthur, John W., and Gordon C. McCord. “Fertilizing growth: Agricultural inputs and their effects in economic development.” Journal of Development Economics 127 (2017): 133-152. url: https://doi.org/10.1016/j.jdeveco.2017.02.007.
Access: Course Website
Description: Course Website
house
U.S. House of Representatives elections data from 1946-1998
Access: Course Website
Description: Course Website
intergenerational_mobility
Intergenerational mobility data from PSID
Access: Course Website
Description: Course Website
Lead_Mortality
Infant mortality and lead pipes in 1900
Access: Access: Course Website
Description: Mark Watson’s website
mlda
Car accident deaths by age group
Access: Course Website
Description: Course Website
mroz
Labor force particpation of married women
Access:
data(mroz, package="wooldridge")
Description:
?wooldridge::mroz
mutual_funds
Mutual fund performance data
Access: Course Website
Description: Course Website
rand_hie
RAND health insurance experiment
Access: Course Website
Description: Course Website
Star
Data from Project STAR that randomly assigned some students to smaller class sizes
Access: `data(Star, package=“Ecdat”)
Description:
?Ecdat::Star
titanic_training
Passenger level data on surviving Titanic. This data is a slightly adapted version of the
titanic
data on KaggleAccess: Course Website
Description: Kaggle
titanic_testing
Out of sample version of
titanic_training
dataAccess: Course Website
Description: Kaggle