Homework 1

Due: At the start of class on Monday, Feb. 5. Please turn in a hard copy.

Textbook Questions 2.2, 2.5, 2.6, 2.10, 2.11, 2.12, 2.13, 2.14, 2.21. For the TRUE or FALSE questions, make sure to provide an explanation.

Extra Question 1 Download the data fertilizer_2000.RData from the course website to answer this question (there is also a description file on the course website).

  1. How many observations does this data contain?

  2. Data about what country is provided in the 21st row?

  3. What is the average per capita GDP across all countries in the data?

  4. What is the average precipitation among countries that have higher than average per capita GDP?

Extra Question 2 For this question, we will use data from Project Star. To access this data, please install the Ecdat package and run the following code: data(Star, package="Ecdat"). To view a description of the data, you can run the following code: ?Ecdat::Star.

Project Star is a very well-known experiment in Tennessee in the 1980s where students were randomly assigned to be in a small class, a regular class, or in a class with a teaching aid. The study was focused on whether or not reducing class sizes improved student’s test scores.

For this problem, we will be interested in the \(ATT\) of being in a small class relative to a regular class for boys on their math test scores (that is, I’d like for you to use a subset of the data that only includes boys and only includes students who were assigned to a small class or regular class).

  1. Exploiting that the treatment is randomly assigned, calculate an estimate of the \(ATT\) by comparing the mean of tmathssk for boys in small class sizes relative to boys in regular classes.

  2. Now, use a regression to calculate the same \(ATT\) as we were interested in from part (a). [For this part, you can compare your answer to results that use lm, but I’d like for you to use matrix algebra for your main result.] How do the results compare to the ones from part (a)?

  3. Now, run a regression that additionally includes the teacher’s experience (totexpk) and free lunch status (freelunk) as additional regressors. [As before, you can compare your answer to the ones from lm, but please report your final answer using matrix algebra.] How do these results compare to the ones from parts (a) and (b)?