1.7 Data Used in the Course

The following provides the full list of data that we will use this semester. Some of the links below are to data posted on my website; others point to data hosted externally (if you notice any broken links, please let me know).

  • acs

  • Airq

    • Data about air quality in California counties in 1972.

    • Access: `data(Airq, package=“Ecdat”)

    • Description: ?Ecdat::Airq

  • AJR

    • Data about GDP, institutions, and settler mortality

    • Access: `data(AJR, package=“hdm”)

    • Description: ?hdm::AJR

  • banks

  • Birthweight_Smoking

  • Caschool

    • School-level test score data from California in 1998-1999

    • Access: data(Caschool, package="Ecdat")

    • Description: ?Ecdat::Caschool

  • diamond_train

    • Data about diamond prices. The full version of this data I got from Kaggle, and then I split it into training and testing data.

    • Access: Course Website

    • Description: A description of each column in the data is available under the Description tab on Kaggle

  • diamond_test

    • Out of sample version of diamond_train data

    • Access: Course Website

    • Description: A description of each column in the data is available under the Description tab on Kaggle

  • Fair

    • Individual-level data about affairs in the United States

    • Access: data(Fair, package="Ecdat")

    • Description: ?Ecdat::Fair

  • Fatalities

    • State-level panel data about drunk driving laws and traffic fatalities

    • Access: `data(Fatality, package=“AER”)

    • Description: ?AER::Fatality

  • fertilizer_2000

    • Country-level data about fertilizer and crop yields from the year 2000. See description of fertilizer_panel below for more details

    • Access: Course Website

    • Description: Course Website

  • fertilizer_panel

    • Country-level panel data from 1965-2000 (every 5 years) about fertilizer and crop yields for 68 developing countries. This data is a smaller version of the data used in McArthur, John W., and Gordon C. McCord. “Fertilizing growth: Agricultural inputs and their effects in economic development.” Journal of Development Economics 127 (2017): 133-152. url: https://doi.org/10.1016/j.jdeveco.2017.02.007.

    • Access: Course Website

    • Description: Course Website

  • house

  • intergenerational_mobility

  • Lead_Mortality

  • mlda

  • mroz

    • Labor force particpation of married women

    • Access: data(mroz, package="wooldridge")

    • Description: ?wooldridge::mroz

  • mutual_funds

  • rand_hie

  • Star

    • Data from Project STAR that randomly assigned some students to smaller class sizes

    • Access: `data(Star, package=“Ecdat”)

    • Description: ?Ecdat::Star

  • titanic_training

    • Passenger level data on surviving Titanic. This data is a slightly adapted version of the titanic data on Kaggle

    • Access: Course Website

    • Description: Kaggle

  • titanic_testing

  • us_data