http://masteringmetrics.com/

I’m reading the bad control section again and having trouble connecting the argument with the common practice of including intermediate outcomes as a way to test for a mechanism. Many papers (Nunn and Qian 2014; Chen, Kung and Ma 2017; etc.) use intermediate outcomes as controls: if the coefficient on the treatment variable changes in magnitude or significance when these intermediate outcomes are included, they claim the intermediate outcome is a potential channel for the effect. For example, adding occupation to a regression of earnings on education reduces the education coefficient. This would imply that occupation is a potential explanation of education’s effect on earnings.
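To make the pattern concrete, here is a minimal numpy sketch (a purely hypothetical simulation, not taken from any of the cited papers) in which education raises earnings partly through occupation; adding the mediator as a control attenuates the education coefficient, which is the pattern those papers interpret as evidence of a channel:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# Hypothetical data-generating process: education affects earnings both
# directly and through occupation (the mediator / intermediate outcome).
educ = rng.normal(size=n)
occ = 0.8 * educ + rng.normal(size=n)            # mediator
earn = 0.5 * educ + 0.6 * occ + rng.normal(size=n)

def ols(y, *cols):
    """OLS with an intercept; returns the coefficient vector."""
    X = np.column_stack([np.ones(len(y))] + list(cols))
    return np.linalg.lstsq(X, y, rcond=None)[0]

b_short = ols(earn, educ)[1]        # population value: 0.5 + 0.6*0.8 = 0.98
b_long = ols(earn, educ, occ)[1]    # population value: 0.5 (the "direct" part)
print(b_short, b_long)              # coefficient drops once occupation is added
```

Of course, the drop only has this interpretation under strong assumptions (e.g., no omitted confounder of occupation and earnings), which is precisely what the bad control discussion is about.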

The method makes sense intuitively, but the papers using it do not provide econometric theory to support it. I’m wondering what your thoughts are on this. Thanks!

Jie

-JA

It’s worth noting that you can end up with significant biases when there is mismeasurement of the independent variable as well as of the dependent variable.

In a completely randomized experiment with a binary outcome, if you want to adjust for covariates to improve precision, you can use either logit (with an average marginal effect calculation) or OLS to consistently estimate the average treatment effect, even if your model’s “wrong”. Probit doesn’t enjoy this robustness property.

The first-order conditions for OLS and the logit MLE imply a nice property: if you compute an “untreated” predicted probability for each person, using her actual covariate values but setting the treatment dummy to 0, then the average “untreated” prediction in the control group equals the raw control mean. In large enough samples, this will be very similar to the average “untreated” prediction in the full sample (since the distribution of covariates in the control group will resemble the distribution in the full sample). The latter is a regression-adjusted control mean. So we have an adjusted control mean that enjoys the same consistency properties as the raw control mean.

Similarly, we can compute a “treated” predicted probability for each person, and the resulting adjusted treatment group mean enjoys the same consistency properties as the raw treatment group mean. So the difference between the adjusted treatment and control group means is consistent for ATE. None of this depends on the model being correct.
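The logit version of this property is easy to verify numerically. Below is a self-contained sketch (a hypothetical simulation; the fitted model deliberately omits a quadratic term, so it’s misspecified) that fits the logit by Newton–Raphson and checks that the average “untreated” prediction in the control group reproduces the raw control mean, as the first-order conditions imply:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 5_000

# Randomized binary treatment D, covariate x, binary outcome y.
x = rng.normal(size=n)
D = rng.integers(0, 2, size=n)
p_true = 1 / (1 + np.exp(-(0.3 + 0.8 * D + x + 0.5 * x**2)))
y = (rng.uniform(size=n) < p_true).astype(float)

X = np.column_stack([np.ones(n), D, x])  # fitted model omits x**2: "wrong"

# Logit MLE via Newton-Raphson
beta = np.zeros(3)
for _ in range(30):
    p = 1 / (1 + np.exp(-X @ beta))
    grad = X.T @ (y - p)                          # logit score: simple sums
    hess = X.T @ (X * (p * (1 - p))[:, None])
    beta += np.linalg.solve(hess, grad)

# "Untreated" and "treated" predictions for every observation
X0, X1 = X.copy(), X.copy()
X0[:, 1], X1[:, 1] = 0, 1
p0 = 1 / (1 + np.exp(-X0 @ beta))
p1 = 1 / (1 + np.exp(-X1 @ beta))

# FOC property: adjusted control mean over controls = raw control mean
print(p0[D == 0].mean(), y[D == 0].mean())  # equal up to solver tolerance
print(p1[D == 1].mean(), y[D == 1].mean())  # same on the treated side
ate_hat = p1.mean() - p0.mean()             # regression-adjusted ATE estimate
```

The equality of the adjusted and raw group means holds exactly (up to convergence tolerance), not just asymptotically; it follows from the score equations for the intercept and the treatment dummy.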

The probit MLE first-order conditions don’t imply the same nice property.
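To see why, one can repeat the same exercise with a probit (again a hypothetical, misspecified simulation). The probit score weights each residual by φ(x′β)/[Φ(x′β)(1−Φ(x′β))], so the first-order conditions constrain weighted averages of residuals, not simple averages, and the adjusted control mean need not equal the raw control mean:

```python
import math
import numpy as np

rng = np.random.default_rng(1)
n = 5_000

# Same hypothetical setup: randomized D, covariate x, misspecified fit.
x = rng.normal(size=n)
D = rng.integers(0, 2, size=n)
p_true = 1 / (1 + np.exp(-(0.3 + 0.8 * D + x + 0.5 * x**2)))
y = (rng.uniform(size=n) < p_true).astype(float)
X = np.column_stack([np.ones(n), D, x])

Phi = np.vectorize(lambda z: 0.5 * (1 + math.erf(z / math.sqrt(2))))
phi = lambda z: np.exp(-z**2 / 2) / math.sqrt(2 * math.pi)

# Probit MLE via Fisher scoring
beta = np.zeros(3)
for _ in range(50):
    z = X @ beta
    P, f = Phi(z), phi(z)
    w = f**2 / (P * (1 - P))
    grad = X.T @ ((y - P) * f / (P * (1 - P)))   # weighted residuals
    beta += np.linalg.solve(X.T @ (X * w[:, None]), grad)

X0 = X.copy()
X0[:, 1] = 0
p0 = Phi(X0 @ beta)

# Unlike logit, these two numbers generally differ under misspecification.
print(p0[D == 0].mean(), y[D == 0].mean())
```

As Freedman notes below, the resulting asymptotic bias tends to be small, but it doesn’t vanish.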

David Freedman gave a rigorous proof for logit in “Randomization does not justify logistic regression”. (The negative message is that you can’t just use predictions at the mean covariate values, and the coefficient on treatment doesn’t estimate anything meaningful if the model’s wrong. But diehard MHE fans already know that.)

Freedman also briefly discussed probits:

“On the other hand, with the probit, the plug-in estimators are unlikely to be consistent, since the analogs of the likelihood equations (16–18) below involve weighted averages rather than simple averages. In simulation studies … Numerical calculations also confirm inconsistency of the plug-in estimators [average marginal effect estimates from the probit], although the asymptotic bias is small.”

A couple other references:

D. Firth and K. Bennett (1998). “Robust models in probability sampling.” JRSSB 60: 3-21.

J. Wooldridge (2007). “Inverse probability weighted estimation for general missing data problems.” J. Econometrics 141: 1281-1301. (See section 6.2.)
