1.7 Data Used in the Course

  • acs

  • Airq

    • Data about air quality in California counties in 1972.

    • Access: data(Airq, package="Ecdat")

    • Description: ?Ecdat::Airq

  • AJR

    • Data about GDP, institutions, and settler mortality

    • Access: data(AJR, package="hdm")

    • Description: ?hdm::AJR

  • banks

  • Birthweight_Smoking

  • Caschool

    • School-level test score data from California in 1998-1999

    • Access: data(Caschool, package="Ecdat")

    • Description: ?Ecdat::Caschool

  • diamond_train

    • Data about diamond prices. I obtained the full dataset from Kaggle and split it into training and testing sets.

    • Access: Course Website

    • Description: A description of each column in the data is available under the Description tab on Kaggle

  • diamond_test

    • Out-of-sample version of the diamond_train data

    • Access: Course Website

    • Description: A description of each column in the data is available under the Description tab on Kaggle

  • Fair

    • Individual-level data about affairs in the United States

    • Access: data(Fair, package="Ecdat")

    • Description: ?Ecdat::Fair

  • Fatalities

    • State-level panel data about drunk driving laws and traffic fatalities

    • Access: data(Fatalities, package="AER")

    • Description: ?AER::Fatalities

  • fertilizer_2000

    • Country-level data about fertilizer and crop yields from the year 2000. See the description of fertilizer_panel below for more details

    • Access: Course Website

    • Description: Course Website

  • fertilizer_panel

    • Country-level panel data from 1965-2000 (every 5 years) about fertilizer and crop yields for 68 developing countries. This data is a smaller version of the data used in McArthur, John W., and Gordon C. McCord. “Fertilizing growth: Agricultural inputs and their effects in economic development.” Journal of Development Economics 127 (2017): 133-152. url: https://doi.org/10.1016/j.jdeveco.2017.02.007.

    • Access: Course Website

    • Description: Course Website

  • house

  • intergenerational_mobility

  • Lead_Mortality

  • mlda

  • mroz

    • Labor force participation of married women

    • Access: data(mroz, package="wooldridge")

    • Description: ?wooldridge::mroz

  • mutual_funds

  • rand_hie

  • Star

    • Data from Project STAR that randomly assigned some students to smaller class sizes

    • Access: data(Star, package="Ecdat")

    • Description: ?Ecdat::Star

  • titanic_training

    • Passenger-level data on survival aboard the Titanic. This data is a slightly adapted version of the Titanic data on Kaggle

    • Access: Course Website

    • Description: Kaggle

  • titanic_testing