SlideShare uma empresa Scribd logo
1 de 49
Baixar para ler offline
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
High-Dimensional Methods: Examples for
Inference on Structural Effects
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M.
Taddy
July 16, 2013
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Example 1: 401(k)’s and Assets
Estimate the effect of 401(k) eligibility on measure of accumulated
assets, (e.g. PVW 1994, 1995, 1996)
yi = di α0 + xi β + ζi
yi = net financial assets or total wealth,
di = eligible for 401(k),
xi = controls for individual characteristics. PVW argue
important to control for income
income (<10k, 10k-20k, 20k-30k, 30k-40k, 40k-50k, 50k-75k,
75k+), age, age2
, family size, education (high school, some
college, college), married, two-earner, defined benefit, ira,
home-owner
n = 9915
Baseline estimates: Net-TFA: 9216.5 (1340.6); TW: 6612.0
(2110.1)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Did we control sufficiently for income?
Previous model provides nice baseline, but we might wonder
Are 7 dummies for income categories sufficient to control for
income?
More complex nonlinearity?
Interactions?
Could we improve efficiency?
Even if eligibilty were randomly assigned, might want to
introduce controls to improve efficiency
“Over-controlling”?
Other important variables?
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Regularization
All treatment effects estimation problems involve
regularization/variable selection.
The only valid inference without dimension reduction is to
conclude that one cannot learn from the data.
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Regularization
Dimension reduction strategies:
Randomized treatment assignment: Reduces dimension of
control vector to 0.
May still want to do variable selection to improve efficiency
(“single-selection” procedures)
Stratified sampling
Intuition
Formal Model Selection
Will fail without good intuition (Needle in a haystack: Want a
big needle or a small haystack)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Intuitive Dimension Reduction
401(k) Example Baseline:
Data Selection
Wave 4 of 1990 SIPP
Ignore panel aspect
Impose criteria (e.g. age selection) to limit sample to
“interesting” population (control)
Variable Selection
655 raw variables (obviously many administrative/technical,
highly unlikely to be related to problem of interest)
Select 9 as controls: income, age, family size, education,
married, two-earner, defined benefit, ira, home-owner
Functional form
no interactions, dummies for income categories, quadratic in
age, dummies for schooling levels
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
“Formal” Dimension Reduction: Model
Model:
yi = di α0 + g(xi ) + ζi ≈ di α0 + zi βg + ζi
di = m(xi ) + ui ≈ zi βm + ui
E[ζi |xi , di ] = E[ui |xi ] = 0
where zi is a function of xi .
“Reduced forms”:
yi = zi β + ζi
di = zi βm + ui
β = βg + α0βm
ζi = ζi + α0ui
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
“Formal” Dimension Reduction: Intuition
Intuition for what we get from each reduced form:
Reduced form for treatment:
Find variables that are strongly related to treatment
Ignoring these potentially leads to omitted-variables-bias
(OVB)
Reduced form for outcome:
Find variables that are strongly related outcome
Improve efficiency
Reduce OVB which would result if associated coefficient in
treatment equation is small but non-zero
Reduced forms are parts of problem data are informative about
(predictive relationships)
Good methods for estimating these parts of the model
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
“Formal” Dimension Reduction: Choices
Choice 1 (Data Set): Same as above
Choice 2 (Baseline Variables): Same as above
Could consider adding additional variables
Each variable makes the haystack bigger
Think carefully. Do we want more variables or more flexibility
in other dimensions?
Choice 3 (Functional Form): Want to try to be very flexible in
income (Details to follow)
Choice 4 (Selection Method/Auxiliary Parameters): LASSO
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Review: Feasible LASSO Allowing Heteroskedasticity
LASSO (heteroskedastic):
βL = arg min
n
i=1
(yi − xi β)2
+ λ
p
j=1
|γj βj |
for a generic outcome (y) and set of regressors (x)
Need to fill in values for λ and γj for j = 1, ..., p.
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Review: Feasible LASSO Allowing Heteroskedasticity
γj :
Want γj ≈ 1
n
n
i=1 x2
ij ε2
i
Estimate iteratively for a given value of λ.
Start with initial guess for β (β∗
) → ei = yi − xi β∗
(initial
guess for {εi }n
i=1)
Estimate new value of β by solving above problem for given
value of λ with γj = 1
n
n
i=1 x2
ij e2
i
Take new value of β to form new residuals and new set of γj
estimates
Iterate to convergence (or max number of iterations)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Review: Feasible LASSO Allowing Heteroskedasticity
λ:
1. Cross-validation
10-fold CV in all examples
Use 1 s.e. rule as discussed by M. Taddy previously
Iterative estimation of penalty loadings inside of CV loop
Less stable than vanilla LASSO CV
1 s.e. rule seems to add some stability
2. Theoretical value:
Simple bound 1: 2.2Φ−1
(1 − q
2p )
Simple bound 2: 2.2 2n log(2p/q)
Theoretically need q → 0, q size of test of hypothesis that
“biggest” coefficient equals 0 when all coeffients equal 0. We
use q = .05 or q = .1/ max{p, n} in our examples. (Bounds
above are bounds on this critical value.)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
“Formal” Dimension Reduction: Variables
Dummies: married, two-earner, defined benefit, ira,
home-owner
Schooling: 5th degree orthogonal polynomial (generated by
Stata orthpoly)
Family Size: 3rd degree orthogonal polynomial (generated by
Stata orthpoly)
Age: Cubic spline with 10 equally spaced knots (30 terms)
Income: Cubic spline with 15 equally spaced knots (45 terms)
Interactions:
1. Dummies with Schooling, Family Size, and Age terms (190
interactions)
2. Income terms interacted with A. Dummies, Schooling,
Family Size, and Age terms and B. interactions in 1. (10,485
interactions)
10,763 total variables
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Dimension Reduction
So, n = 9,915; p = 10,763
Cannot identify effect of 401(k) if we think we actually need all of
these terms to adequately control for income.
Have already intuitively reduced dimensionality by focusing on 9
controls that seem plausibly related to income and assets
Baseline results resolve this identification problem by assuming the
functional form is known.
Rather than assume functional form, try to learn it from data using
variable selection (from a flexible but still very parsimonious
specification).
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Results:
Note: Before constructing splines, income and age were put on
[0,1] interval.
401(k) Eligibility:
Selected variables (CV and Plug-in): income,
max(0, income − .33), two–earner, defined–benefit,
home–owner, education3, max(0, age3 − .4),
max(0, age3 − .5), home–owner ∗ income
Estimate 401(k) effects obtained by combining these variable
with those selected for relevant outcome and running OLS
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Results:
Net Financial Assets:
Selected variables (CV and Plug-in): income, two–earner,
age, two–earner ∗ family–size, ira ∗ age, home–owner ∗ age2,
ira ∗ income, home–owner ∗ income, home–owner ∗ income2
Estimated Effect: 8687.43 (1274.76) [Recall baseline
estimates: 9216.5 (1340.6)]
Total Wealth:
Selected variables (CV and Plug-in): income, two–earner,
ira ∗ age, home–owner ∗ age, home–owner ∗ age2,
ira ∗ income, home–owner ∗ income, age ∗ income
Estimated Effect: 5374.79 (1990.94) [Recall baseline
estimates: 6612.0 (2110.1)]
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
ATE with Heterogeneous Treatment Effects:
We could also estimate the ATE allowing for full heterogeneity of
treatment effects.
Model:
yi = di g1(xi ) + (1 − di )g0(xi ) + ζi
di = m(xi ) + ui (as before)
Following Hahn (1998), can estimate ATE (α) as
α =
1
N
N
i=1
di (yi − g1(xi ))
m(xi )
−
(1 − di )(yi − g0(xi ))
1 − m(xi )
+ g1(xi ) − g0(xi )
where we obtain estimates of the functions g0(·), g1(·), and m(·)
as above using variable selection methods.
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
ATE with Heterogeneous Treatment Effects:
Estimates of m(·) were obtained previously.
Net Financial Assets and Total Wealth:
Use the same set of covariates broken out by values of 401(k)
eligibilty dummy
Use the CV penalty parameter from before scaled for the
number of observations in each category (3682 are eligible,
6233 not-eligible)
I.e. multiply CV penalty parameter by 3682/9915 in eligible
models and by 6233/9915 in not-eligible models
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
ATE Results:
Financial Assets:
Selected Variables (eligible): income, IRA, age,
two–earner ∗ family–size, IRA ∗ age, home–owner ∗ age,
IRA ∗ income, home–owner ∗ income
Selected Variables (non-eligible): IRA, home–owner, age,
IRA ∗ age, home–owner ∗ age
Estimates: 8032.54 (1136.6)
Total Wealth:
Selected Variables (eligible): income2
, two–earner, age,
IRA ∗ age, home–owner ∗ age, IRA ∗ income,
home–owner ∗ income, age ∗ income
Selected Variables (non-eligible): income, IRA, IRA ∗ age,
home–owner ∗ age, home–owner ∗ age2
, IRA ∗ income,
home–owner ∗ income
Estimates: 6180.29 (1828.5)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Example 2: Effect of Abortion on Crime
Goal: Understand causal effect of dit (abortion) on yit (crime).
(Donohue and Levitt 2001)
Problem: Abortion rates are not randomly assigned
Key concern:
states are different for lots of reasons
crime rates in states evolve differently for lots of reasons
factors that are associated to differences in states, state
evolutions, etc. may also be related to differences in abortion
rates, abortion rate evolution, etc.
Most of the favorite stories for confounding obviously fit here.
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Example 2: Baseline Model
Donohue and Levitt (2001) baseline model
yit = ditα0 + xitβg + γt + δi + εit
yit = crime-rate (violent, property, or murder per 1000)
dit = “effective” abortion rate
xit = eight controls: log of lagged prisoners per capita, the log
of lagged police per capita, the unemployment rate, per-capita
income, the poverty rate, AFDC generosity at time t − 15, a
dummy for concealed weapons law, and beer consumption per
capita
γt time effects
δi state effects
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Baseline Results:
Violent Property Murder
Estimator Effect Std. Err. Effect Std. Err. Effect Std. Err.
DL Table 4 -.129 .024 -.091 .018 -.121 .047
First-Diff -.152 .034 -.108 .022 -.204 .068
Use first-differences from now on.
Assumes confounds are time invariant, state invariant, or captured
by small set of variables in xit
State-specific characteristics related to features of abortion only
allowed to be related to level of crime rate
I.e. evolution of abortion rates and crime rates unrelated after
subtracting the mean
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Flexible Trends
“Model” with flexible trends:
yit = ditα0 + g(xi1, xi2, . . . , xiT , t, i) + ζit
dit = m(xi1, xi2, . . . , xiT , t, i) + uit
Want to allow flexible state-specific trends BUT clearly can’t learn
about about effect if trends are allowed to do anything.
I.e. if m(·) and g(·) can vary arbitrarily across i and t clearly can’t
identify α0
Need to reduce the dimension.
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Intuitive Dimension Reduction
Abortion Baseline:
Functional form
g(xi1, xi2, . . . , xiT , t, i) = xitβg + γt + δi
additively separable
(Correlated) Evolution of state crime and abortion rates
captured by macro-economy (γt), constant state-specific level
shifts from aggregate (δi ), and small number of time varying
variables
Variable Selection
Select 8 time varying state-level control variables (of the many
state-level macro series available)
What if there are (correlated) differences in abortion and crime
evolution not captured by aggregate evolution?
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Formal Dimension Reduction
Model (First-Differences):
∆yit = ∆ditα0 + γt + witπ1 + εit
∆dit = κt + witπ2 + uit
Use first-difference to remove state effects
time effects (not-selected over), included in both reduced form
models
With wit = ∆xit first-difference version of Donohue and Levitt
(2001) model. (Results presented above.)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Formal Dimension Reduction: Variables
wit:
1. differences in eight original controls
2. initial conditions of controls, abortion rate, crime rate
3. within state averages of controls, abortion rate
4. t, t2
5. interactions of 1-3 with 4
corresponds to a model for crime and abortion rates with a
cubic trend that may depend on baseline state characteristics
p = 284
n = 576
Variables in 2-5 motivated by a desire to have a flexible, sensible
model of evolution
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Kitchen Sink:
Why not use “everything”?
Violent Property Murder
Estimator Effect Std. Err. Effect Std. Err. Effect Std. Err.
All Controls .006 .755 -.154 .224 2.240 2.804
A flexible cubic trend arguably isn’t going crazy, but everything is
rendered very imprecise.
Probably a lot of things added aren’t really important
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Variable Selection:
Variables in each equation selected using LASSO.
10-Fold CV
Plug-in penalty parameter: 2.2 2n(log(2p/.05))
Penalty loadings estimated using iterative Lasso with 100 max
iterations
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Variable Selection:
Violent crime selection:
Abortion equation (CV and Plug-in): (11) lagged prisoners,
lagged police, lagged unemployment, initial income, initial
income difference × t, initial beer consumption difference × t,
initial income × t, initial prisoners squared × t2, average
income, average income × t, initial abortion rate
Crime equation (CV): No variables
Crime equation (Plug-in): Initial difference in abortion rate ×
t, Initial abortion rate × t
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Variable Selection:
Property crime selection:
Abortion equation (CV): (7) lagged prisoners, lagged income,
initial income, initial income difference, initial income
difference × t, average income, initial abortion rate
Abortion equation (Plug-in): (12) lagged prisoners, lagged
police, lagged income, Initial income difference, initial income,
initial income difference × t, initial beer difference × t, initial
prisoners squared × t, initial prisoners squared × t2, initial
beer squared × t2, average income, initial abortion rate
Crime equation (CV): No variables
Crime equation (Plug-in): (3) Initial income squared × t,
Initial income squared × t2, average AFDC squared
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Variable Selection:
Murder rate selection:
Abortion equation (CV): (5) lagged prisoners, lagged
unemployment, initial income × t, average income × t, initial
abortion rate
Abortion equation (Plug-in): (9) lagged prisoners, lagged
unemployment, initial unemployment difference squared,
initial prisoners × t, initial income times t, initial beer
difference × t2, average income × t, initial abortion rate,
initial abortion rate × t
Crime equation (CV and Plug-in): No variables selected
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Estimated Effects of Abortion on Crime
Violent Property Murder
Estimator Effect Std. Err. Effect Std. Err. Effect Std. Err.
DL Table 4 -.129 .024 -.091 .018 -.121 .047
First-Diff -.152 .034 -.108 .022 -.204 .068
All Controls .006 .755 -.154 .224 2.240 2.804
Post-DS (CV) -.119 .120 -.042 .059 -.122 .131
Post-DS (Plug-in) -.174 .120 -.052 .070 -.123 .148
“Post-DS” Results in-line with critique raised by Foote and Goetz
(2008).
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Example 3: Institutions and Growth (AJR 2001)
Equation of interest:
log(GDP per capitai ) = α(Protection from Expropriationi ) + xi β + εi
Endogeneity/Simultaneity:
better institutions may lead to higher incomes
higher incomes may lead to the development of better
institutions
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Instrument
Instrument = European Settler Mortality:
First Stage: settlers set up better institutions in places they
might stick around in (i.e. where they were less likely to die)
and institutions are highly persistent
Exclusion: GDP, while persistent, is unlikely to be strongly
influenced by the factors that determined the exact
development of institutions 100+ years ago except through
the institutions established
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Controls
There may be other factors that are highly persistent and
related to development of institutions and GDP
leading candidate - geography (Geographic Determinism)
Want to control for geography and use variation in mortality
not captured by geography
Baseline AJR results control linearly for latitude
AJR consider continent dummies, split by continent, first-stage
gets weak with some of these
Baseline estimates find strong positive effect of institutions:
First-stage: -0.5372 (0.1545)
α: 0.9692 (0.2128)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Setting
Want to control flexibly for geography but may lose power to
identify effect of institutions.
IV with one instrument and unknown controls:
yi = αdi + xi β + εi
di = π1zi + xi Π2 + vi
zi = xi γ + ui
Believe zi is a valid instrument after controlling for xi .
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Setting
Same as estimating regression coefficients after conditioning but
now have three reduced form/prediction equations:
yi = xi β + εi
di = xi Π2 + vi
zi = xi γ + ui
Do variable selection on the three equations and use union of
selected variables as controls.
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Variable Selection
Select over a flexible function of geography:
Africa, Asia, North America, South America (dummies)
latitude, latitude2, latitude3, (latitude-.08)+, (latitude-.16)+,
(latitude-.24)+, ((latitude-.08)+)2, ((latitude-.16)+)2,
((latitude-.24)+)2, ((latitude-.08)+)3, ((latitude-.16)+)3,
((latitude-.24)+)3
Using all these variables results in a very weak first-stage:
First-stage: -0.2164 (0.2191)
α: 0.9480 (0.7384) (and unreliable)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Variable Selection:
Variables for each equation selected with Lasso:
Penalty loading calculated with iterative method
10-Fold CV and Plug-in [2.2 2n(log(2p/γ)) with
γ = .1/ log(n)] give same results
GDP - Africa
Expropriation - Africa
Mortality - Africa
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Results:
Latitude All Controls Selection
First Stage -0.5372 -0.2164 -0.5429
(0.1545) (0.2191) (0.1719)
Second Stage 0.9692 0.9480 0.7710
(0.2128) (0.7384) (0.1971)
First Stage - Coefficient on Settler Mortality
Second Stage - Coefficient on Protection from Expropriation
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Example 4: Eminent Domain
Estimate economic consequences of the law of takings or eminent
domain (following Chen and Yeh (2010))
Consider effect on Case-Shiller Price Index.
Eminent domain (or law of takings): when a government actor
physically acquires the property rights of one or more individuals
Laws/judicial decisions may not be exogenous.
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Instruments
Random assignment of judges to three judge federal appellate
panels
⇒
Panel demographics randomly assigned conditional on the
distribution of characteristics of federal circuit court judges in a
given circuit-year
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Why variable selection?
Many characteristics of three judge panels
Any set of characteristics of the three judge panel unrelated to
structural unobservable
Some instruments may be more valuable than others
Could attempt to solve through intuition. Number of judges who
are democrats:
Judges’ political affiliation known to predict decisions for
many outcomes
First Stage: 0.0664 (0.0713)
Second Stage: -0.2583 (0.5251) (Unreliable)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Model
Econometric model:
yct = ac + bt + gct + θ Takings Lawct + Wctd + ct
Takings Lawct = αc + βt + γct + Wctδ + zctΠ + vct
c circuit; t time
yct: log(house price index) or log(GDP)
Takings Lawct: number of pro-plaintiff (overturn gov’t taking)
apellate takings decisions
(ac, αc), (bt, βt), and (gct, γct): circuit-specific effects,
time-specific effects, and circuit-specific time trends.
(d, δ) coefficients on exogenous variables
zct instruments with coefficients Π
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Model
Controls (Wct):
≈ 30 probability controls for panel demographics
a dummy for no cases in that circuit-year
number of takings appellate decisions
θ: effect of an additional decision upholding individual property
rights on an economic outcome
Any set of characteristics of three judge panels is potentially an
instrument.
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Instruments
Do ex ante dimension reduction by intuively selecting
characteristics thought to have strong signal about judge
preferences over government vs. individual property rights:
42 baseline variables
gender, race, religion (jewish, catholic, protestant, evangelical,
none) , political affiliation, bachelor obtained in-state, bachelor
from public university, JD from a public university, has an LLM
or SJD, elevated from a district court
number of panels with 1, 2, or 3 members with each
characteristic
+
cubic in number panels democrat, number with JD from public
university, number elevated from district
first order interactions between all variables
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Additional pre-processing
Remove instruments likely to be irrelevant based on features of
instrument vector alone:
remove any instrument with mean < .05, standard deviation
after partialling out controls < .000001
remove one from each pair of any pair with bivariate
correlation > .99 in absolute value
Note: Selection based on characteristics of z cannot introduce bias
under exclusion restriction.
Leaves 147 instruments (n = 183)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Obtaining the Estimates
Method: Post-LASSO
Penalty parameters and loadings:
Penalty loading calculated through iterative scheme
10-fold Cross-validation to obtain penalty
Plug-in penalty: 2.2
√
nΦ−1(1 − γ/(2p)), γ = .1/ log(n)
(Gives same results as 10-fold CV)
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects
Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain
Results: Case-Shiller Prices
LASSO Selection Results:
1+ JD public squared
First Stage: 0.4495 (0.0511) [Using 1+ democrat: 0.0664
(0.0713)]
Second Stage: 0.0648 (0.0240) [Using 1+ democrat: -0.2583
(0.5251) (Unreliable)]
V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy
High-Dimensional Methods: Examples for Inference on Structural Effects

Mais conteúdo relacionado

Mais procurados

Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
chenhm
 
original
originaloriginal
original
butest
 
Download presentation source
Download presentation sourceDownload presentation source
Download presentation source
butest
 
week9_Machine_Learning.ppt
week9_Machine_Learning.pptweek9_Machine_Learning.ppt
week9_Machine_Learning.ppt
butest
 

Mais procurados (20)

Probability distribution Function & Decision Trees in machine learning
Probability distribution Function  & Decision Trees in machine learningProbability distribution Function  & Decision Trees in machine learning
Probability distribution Function & Decision Trees in machine learning
 
A Maximum Entropy Approach to the Loss Data Aggregation Problem
A Maximum Entropy Approach to the Loss Data Aggregation ProblemA Maximum Entropy Approach to the Loss Data Aggregation Problem
A Maximum Entropy Approach to the Loss Data Aggregation Problem
 
Nber slides11 lecture2
Nber slides11 lecture2Nber slides11 lecture2
Nber slides11 lecture2
 
Quantitative Analysis For Management 11th Edition Render Solutions Manual
Quantitative Analysis For Management 11th Edition Render Solutions ManualQuantitative Analysis For Management 11th Edition Render Solutions Manual
Quantitative Analysis For Management 11th Edition Render Solutions Manual
 
Decision Tree and Bayesian Classification
Decision Tree and Bayesian ClassificationDecision Tree and Bayesian Classification
Decision Tree and Bayesian Classification
 
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
孔令傑 / 給工程師的統計學及資料分析 123 (2016/9/4)
 
Quantitative Analysis For Management 11th Edition Render Test Bank
Quantitative Analysis For Management 11th Edition Render Test BankQuantitative Analysis For Management 11th Edition Render Test Bank
Quantitative Analysis For Management 11th Edition Render Test Bank
 
352735322 rsh-qam11-tif-02-doc
352735322 rsh-qam11-tif-02-doc352735322 rsh-qam11-tif-02-doc
352735322 rsh-qam11-tif-02-doc
 
Data science
Data scienceData science
Data science
 
2018 MUMS Fall Course - Issue Arising in Several Working Groups: Probabilisti...
2018 MUMS Fall Course - Issue Arising in Several Working Groups: Probabilisti...2018 MUMS Fall Course - Issue Arising in Several Working Groups: Probabilisti...
2018 MUMS Fall Course - Issue Arising in Several Working Groups: Probabilisti...
 
Intro to Model Selection
Intro to Model SelectionIntro to Model Selection
Intro to Model Selection
 
2018 MUMS Fall Course - Essentials of Bayesian Hypothesis Testing - Jim Berge...
2018 MUMS Fall Course - Essentials of Bayesian Hypothesis Testing - Jim Berge...2018 MUMS Fall Course - Essentials of Bayesian Hypothesis Testing - Jim Berge...
2018 MUMS Fall Course - Essentials of Bayesian Hypothesis Testing - Jim Berge...
 
Boston talk
Boston talkBoston talk
Boston talk
 
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
A Fuzzy Mean-Variance-Skewness Portfolioselection Problem.
 
original
originaloriginal
original
 
Download presentation source
Download presentation sourceDownload presentation source
Download presentation source
 
week9_Machine_Learning.ppt
week9_Machine_Learning.pptweek9_Machine_Learning.ppt
week9_Machine_Learning.ppt
 
Data mining Concepts and Techniques
Data mining Concepts and Techniques Data mining Concepts and Techniques
Data mining Concepts and Techniques
 
Math 221 Massive Success / snaptutorial.com
Math 221 Massive Success / snaptutorial.comMath 221 Massive Success / snaptutorial.com
Math 221 Massive Success / snaptutorial.com
 
Data Mining: Concepts and Techniques — Chapter 2 —
Data Mining:  Concepts and Techniques — Chapter 2 —Data Mining:  Concepts and Techniques — Chapter 2 —
Data Mining: Concepts and Techniques — Chapter 2 —
 

Semelhante a High-Dimensional Methods: Examples for Inference on Structural Effects

Thesis_NickyGrant_2013
Thesis_NickyGrant_2013Thesis_NickyGrant_2013
Thesis_NickyGrant_2013
Nicky Grant
 
Topic_6
Topic_6Topic_6
Topic_6
butest
 
CHI SQUARE DISTRIBUTIONdjfnbefklwfwpfioaekf.pptx
CHI SQUARE DISTRIBUTIONdjfnbefklwfwpfioaekf.pptxCHI SQUARE DISTRIBUTIONdjfnbefklwfwpfioaekf.pptx
CHI SQUARE DISTRIBUTIONdjfnbefklwfwpfioaekf.pptx
rathorebhagwan07
 
InstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docx
InstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docxInstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docx
InstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docx
dirkrplav
 
PSY520 – Module 7Answer SheetSubmit your answers in the .docx
PSY520 – Module 7Answer SheetSubmit your answers in the .docxPSY520 – Module 7Answer SheetSubmit your answers in the .docx
PSY520 – Module 7Answer SheetSubmit your answers in the .docx
woodruffeloisa
 

Semelhante a High-Dimensional Methods: Examples for Inference on Structural Effects (20)

Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
Frontiers of Computational Journalism week 1 - Introduction and High Dimensio...
 
Econometrics of High-Dimensional Sparse Models
Econometrics of High-Dimensional Sparse ModelsEconometrics of High-Dimensional Sparse Models
Econometrics of High-Dimensional Sparse Models
 
Analyzing experimental research data
Analyzing experimental research dataAnalyzing experimental research data
Analyzing experimental research data
 
Thesis_NickyGrant_2013
Thesis_NickyGrant_2013Thesis_NickyGrant_2013
Thesis_NickyGrant_2013
 
Unit 6: All
Unit 6: AllUnit 6: All
Unit 6: All
 
Basics of Hypothesis Testing
Basics of Hypothesis Testing  Basics of Hypothesis Testing
Basics of Hypothesis Testing
 
4_22865_IS465_2019_1__2_1_02Data-2.ppt
4_22865_IS465_2019_1__2_1_02Data-2.ppt4_22865_IS465_2019_1__2_1_02Data-2.ppt
4_22865_IS465_2019_1__2_1_02Data-2.ppt
 
Topic_6
Topic_6Topic_6
Topic_6
 
Can We Use Rum and Not Get Drunk?
Can We Use Rum and Not Get Drunk?Can We Use Rum and Not Get Drunk?
Can We Use Rum and Not Get Drunk?
 
CHI SQUARE DISTRIBUTIONdjfnbefklwfwpfioaekf.pptx
CHI SQUARE DISTRIBUTIONdjfnbefklwfwpfioaekf.pptxCHI SQUARE DISTRIBUTIONdjfnbefklwfwpfioaekf.pptx
CHI SQUARE DISTRIBUTIONdjfnbefklwfwpfioaekf.pptx
 
Problem set 3 - Statistics and Econometrics - Msc Business Analytics - Imperi...
Problem set 3 - Statistics and Econometrics - Msc Business Analytics - Imperi...Problem set 3 - Statistics and Econometrics - Msc Business Analytics - Imperi...
Problem set 3 - Statistics and Econometrics - Msc Business Analytics - Imperi...
 
Private Distributed Collaborative Filtering
Private Distributed Collaborative FilteringPrivate Distributed Collaborative Filtering
Private Distributed Collaborative Filtering
 
Bayesian/Fiducial/Frequentist Uncertainty Quantification by Artificial Samples
Bayesian/Fiducial/Frequentist Uncertainty Quantification by Artificial SamplesBayesian/Fiducial/Frequentist Uncertainty Quantification by Artificial Samples
Bayesian/Fiducial/Frequentist Uncertainty Quantification by Artificial Samples
 
To Explain, To Predict, or To Describe?
To Explain, To Predict, or To Describe?To Explain, To Predict, or To Describe?
To Explain, To Predict, or To Describe?
 
InstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docx
InstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docxInstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docx
InstructionDue Date 6 pm on October 28 (Wed)Part IProbability a.docx
 
Advanced Econometrics L3-4.pptx
Advanced Econometrics L3-4.pptxAdvanced Econometrics L3-4.pptx
Advanced Econometrics L3-4.pptx
 
nnml.ppt
nnml.pptnnml.ppt
nnml.ppt
 
PSY520 – Module 7Answer SheetSubmit your answers in the .docx
PSY520 – Module 7Answer SheetSubmit your answers in the .docxPSY520 – Module 7Answer SheetSubmit your answers in the .docx
PSY520 – Module 7Answer SheetSubmit your answers in the .docx
 
QNT 561 Week 4 Weekly Learning Assessments
QNT 561 Week 4 Weekly Learning Assessments  QNT 561 Week 4 Weekly Learning Assessments
QNT 561 Week 4 Weekly Learning Assessments
 
Machine Learning and Artificial Neural Networks.ppt
Machine Learning and Artificial Neural Networks.pptMachine Learning and Artificial Neural Networks.ppt
Machine Learning and Artificial Neural Networks.ppt
 

Mais de NBER

FISCAL STIMULUS IN ECONOMIC UNIONS: WHAT ROLE FOR STATES
FISCAL STIMULUS IN ECONOMIC UNIONS: WHAT ROLE FOR STATESFISCAL STIMULUS IN ECONOMIC UNIONS: WHAT ROLE FOR STATES
FISCAL STIMULUS IN ECONOMIC UNIONS: WHAT ROLE FOR STATES
NBER
 
Business in the United States Who Owns it and How Much Tax They Pay
Business in the United States Who Owns it and How Much Tax They PayBusiness in the United States Who Owns it and How Much Tax They Pay
Business in the United States Who Owns it and How Much Tax They Pay
NBER
 
Redistribution through Minimum Wage Regulation: An Analysis of Program Linkag...
Redistribution through Minimum Wage Regulation: An Analysis of Program Linkag...Redistribution through Minimum Wage Regulation: An Analysis of Program Linkag...
Redistribution through Minimum Wage Regulation: An Analysis of Program Linkag...
NBER
 
The Distributional E ffects of U.S. Clean Energy Tax Credits
The Distributional Effects of U.S. Clean Energy Tax CreditsThe Distributional Effects of U.S. Clean Energy Tax Credits
The Distributional E ffects of U.S. Clean Energy Tax Credits
NBER
 
An Experimental Evaluation of Strategies to Increase Property Tax Compliance:...
An Experimental Evaluation of Strategies to Increase Property Tax Compliance:...An Experimental Evaluation of Strategies to Increase Property Tax Compliance:...
An Experimental Evaluation of Strategies to Increase Property Tax Compliance:...
NBER
 
Jackson nber-slides2014 lecture3
Jackson nber-slides2014 lecture3Jackson nber-slides2014 lecture3
Jackson nber-slides2014 lecture3
NBER
 
Acemoglu lecture4
Acemoglu lecture4Acemoglu lecture4
Acemoglu lecture4
NBER
 
Lecture 4: NBERMetrics
Lecture 4: NBERMetricsLecture 4: NBERMetrics
Lecture 4: NBERMetrics
NBER
 
Lecture 2: NBERMetrics
Lecture 2: NBERMetricsLecture 2: NBERMetrics
Lecture 2: NBERMetrics
NBER
 

Mais de NBER (20)

FISCAL STIMULUS IN ECONOMIC UNIONS: WHAT ROLE FOR STATES
FISCAL STIMULUS IN ECONOMIC UNIONS: WHAT ROLE FOR STATESFISCAL STIMULUS IN ECONOMIC UNIONS: WHAT ROLE FOR STATES
FISCAL STIMULUS IN ECONOMIC UNIONS: WHAT ROLE FOR STATES
 
Business in the United States Who Owns it and How Much Tax They Pay
Business in the United States Who Owns it and How Much Tax They PayBusiness in the United States Who Owns it and How Much Tax They Pay
Business in the United States Who Owns it and How Much Tax They Pay
 
Redistribution through Minimum Wage Regulation: An Analysis of Program Linkag...
Redistribution through Minimum Wage Regulation: An Analysis of Program Linkag...Redistribution through Minimum Wage Regulation: An Analysis of Program Linkag...
Redistribution through Minimum Wage Regulation: An Analysis of Program Linkag...
 
The Distributional E ffects of U.S. Clean Energy Tax Credits
The Distributional Effects of U.S. Clean Energy Tax CreditsThe Distributional Effects of U.S. Clean Energy Tax Credits
The Distributional E ffects of U.S. Clean Energy Tax Credits
 
An Experimental Evaluation of Strategies to Increase Property Tax Compliance:...
An Experimental Evaluation of Strategies to Increase Property Tax Compliance:...An Experimental Evaluation of Strategies to Increase Property Tax Compliance:...
An Experimental Evaluation of Strategies to Increase Property Tax Compliance:...
 
Nbe rcausalpredictionv111 lecture2
Nbe rcausalpredictionv111 lecture2Nbe rcausalpredictionv111 lecture2
Nbe rcausalpredictionv111 lecture2
 
Recommenders, Topics, and Text
Recommenders, Topics, and TextRecommenders, Topics, and Text
Recommenders, Topics, and Text
 
Jackson nber-slides2014 lecture3
Jackson nber-slides2014 lecture3Jackson nber-slides2014 lecture3
Jackson nber-slides2014 lecture3
 
Jackson nber-slides2014 lecture1
Jackson nber-slides2014 lecture1Jackson nber-slides2014 lecture1
Jackson nber-slides2014 lecture1
 
Acemoglu lecture4
Acemoglu lecture4Acemoglu lecture4
Acemoglu lecture4
 
The NBER Working Paper Series at 20,000 - Joshua Gans
The NBER Working Paper Series at 20,000 - Joshua GansThe NBER Working Paper Series at 20,000 - Joshua Gans
The NBER Working Paper Series at 20,000 - Joshua Gans
 
The NBER Working Paper Series at 20,000 - Claudia Goldin
The NBER Working Paper Series at 20,000 - Claudia GoldinThe NBER Working Paper Series at 20,000 - Claudia Goldin
The NBER Working Paper Series at 20,000 - Claudia Goldin
 
The NBER Working Paper Series at 20,000 - James Poterba
The NBER Working Paper Series at 20,000 - James PoterbaThe NBER Working Paper Series at 20,000 - James Poterba
The NBER Working Paper Series at 20,000 - James Poterba
 
The NBER Working Paper Series at 20,000 - Scott Stern
The NBER Working Paper Series at 20,000 - Scott SternThe NBER Working Paper Series at 20,000 - Scott Stern
The NBER Working Paper Series at 20,000 - Scott Stern
 
The NBER Working Paper Series at 20,000 - Glenn Ellison
The NBER Working Paper Series at 20,000 - Glenn EllisonThe NBER Working Paper Series at 20,000 - Glenn Ellison
The NBER Working Paper Series at 20,000 - Glenn Ellison
 
L3 1b
L3 1bL3 1b
L3 1b
 
Nuts and bolts
Nuts and boltsNuts and bolts
Nuts and bolts
 
Applications: Prediction
Applications: PredictionApplications: Prediction
Applications: Prediction
 
Lecture 4: NBERMetrics
Lecture 4: NBERMetricsLecture 4: NBERMetrics
Lecture 4: NBERMetrics
 
Lecture 2: NBERMetrics
Lecture 2: NBERMetricsLecture 2: NBERMetrics
Lecture 2: NBERMetrics
 

Último

Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Último (20)

ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 

High-Dimensional Methods: Examples for Inference on Structural Effects

  • 1. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain High-Dimensional Methods: Examples for Inference on Structural Effects V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy July 16, 2013 V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 2. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Example 1: 401(k)’s and Assets Estimate the effect of 401(k) eligibility on measure of accumulated assets, (e.g. PVW 1994, 1995, 1996) yi = di α0 + xi β + ζi yi = net financial assets or total wealth, di = eligible for 401(k), xi = controls for individual characteristics. PVW argue important to control for income income (<10k, 10k-20k, 20k-30k, 30k-40k, 40k-50k, 50k-75k, 75k+), age, age2 , family size, education (high school, some college, college), married, two-earner, defined benefit, ira, home-owner n = 9915 Baseline estimates: Net-TFA: 9216.5 (1340.6); TW: 6612.0 (2110.1) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 3. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Did we control sufficiently for income? Previous model provides nice baseline, but we might wonder Are 7 dummies for income categories sufficient to control for income? More complex nonlinearity? Interactions? Could we improve efficiency? Even if eligibilty were randomly assigned, might want to introduce controls to improve efficiency “Over-controlling”? Other important variables? V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 4. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Regularization All treatment effects estimation problems involve regularization/variable selection. The only valid inference without dimension reduction is to conclude that one cannot learn from the data. V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 5. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Regularization Dimension reduction strategies: Randomized treatment assignment: Reduces dimension of control vector to 0. May still want to do variable selection to improve efficiency (“single-selection” procedures) Stratified sampling Intuition Formal Model Selection Will fail without good intuition (Needle in a haystack: Want a big needle or a small haystack) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 6. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Intuitive Dimension Reduction 401(k) Example Baseline: Data Selection Wave 4 of 1990 SIPP Ignore panel aspect Impose criteria (e.g. age selection) to limit sample to “interesting” population (control) Variable Selection 655 raw variables (obviously many administrative/technical, highly unlikely to be related to problem of interest) Select 9 as controls: income, age, family size, education, married, two-earner, defined benefit, ira, home-owner Functional form no interactions, dummies for income categories, quadratic in age, dummies for schooling levels V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 7. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain “Formal” Dimension Reduction: Model Model: yi = di α0 + g(xi ) + ζi ≈ di α0 + zi βg + ζi di = m(xi ) + ui ≈ zi βm + ui E[ζi |xi , di ] = E[ui |xi ] = 0 where zi is a function of xi . “Reduced forms”: yi = zi β + ζi di = zi βm + ui β = βg + α0βm ζi = ζi + α0ui V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 8. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain “Formal” Dimension Reduction: Intuition Intuition for what we get from each reduced form: Reduced form for treatment: Find variables that are strongly related to treatment Ignoring these potentially leads to omitted-variables-bias (OVB) Reduced form for outcome: Find variables that are strongly related outcome Improve efficiency Reduce OVB which would result if associated coefficient in treatment equation is small but non-zero Reduced forms are parts of problem data are informative about (predictive relationships) Good methods for estimating these parts of the model V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 9. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain “Formal” Dimension Reduction: Choices Choice 1 (Data Set): Same as above Choice 2 (Baseline Variables): Same as above Could consider adding additional variables Each variable makes the haystack bigger Think carefully. Do we want more variables or more flexibility in other dimensions? Choice 3 (Functional Form): Want to try to be very flexible in income (Details to follow) Choice 4 (Selection Method/Auxiliary Parameters): LASSO V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 10. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Review: Feasible LASSO Allowing Heteroskedasticity LASSO (heteroskedastic): βL = arg min n i=1 (yi − xi β)2 + λ p j=1 |γj βj | for a generic outcome (y) and set of regressors (x) Need to fill in values for λ and γj for j = 1, ..., p. V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 11. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Review: Feasible LASSO Allowing Heteroskedasticity γj : Want γj ≈ 1 n n i=1 x2 ij ε2 i Estimate iteratively for a given value of λ. Start with initial guess for β (β∗ ) → ei = yi − xi β∗ (initial guess for {εi }n i=1) Estimate new value of β by solving above problem for given value of λ with γj = 1 n n i=1 x2 ij e2 i Take new value of β to form new residuals and new set of γj estimates Iterate to convergence (or max number of iterations) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 12. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Review: Feasible LASSO Allowing Heteroskedasticity λ: 1. Cross-validation 10-fold CV in all examples Use 1 s.e. rule as discussed by M. Taddy previously Iterative estimation of penalty loadings inside of CV loop Less stable than vanilla LASSO CV 1 s.e. rule seems to add some stability 2. Theoretical value: Simple bound 1: 2.2Φ−1 (1 − q 2p ) Simple bound 2: 2.2 2n log(2p/q) Theoretically need q → 0, q size of test of hypothesis that “biggest” coefficient equals 0 when all coeffients equal 0. We use q = .05 or q = .1/ max{p, n} in our examples. (Bounds above are bounds on this critical value.) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 13. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain “Formal” Dimension Reduction: Variables Dummies: married, two-earner, defined benefit, ira, home-owner Schooling: 5th degree orthogonal polynomial (generated by Stata orthpoly) Family Size: 3rd degree orthogonal polynomial (generated by Stata orthpoly) Age: Cubic spline with 10 equally spaced knots (30 terms) Income: Cubic spline with 15 equally spaced knots (45 terms) Interactions: 1. Dummies with Schooling, Family Size, and Age terms (190 interactions) 2. Income terms interacted with A. Dummies, Schooling, Family Size, and Age terms and B. interactions in 1. (10,485 interactions) 10,763 total variables V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 14. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Dimension Reduction So, n = 9,915; p = 10,763 Cannot identify effect of 401(k) if we think we actually need all of these terms to adequately control for income. Have already intuitively reduced dimensionality by focusing on 9 controls that seem plausibly related to income and assets Baseline results resolve this identification problem by assuming the functional form is known. Rather than assume functional form, try to learn it from data using variable selection (from a flexible but still very parsimonious specification). V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 15. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Results: Note: Before constructing splines, income and age were put on [0,1] interval. 401(k) Eligibility: Selected variables (CV and Plug-in): income, max(0, income − .33), two–earner, defined–benefit, home–owner, education3, max(0, age3 − .4), max(0, age3 − .5), home–owner ∗ income Estimate 401(k) effects obtained by combining these variable with those selected for relevant outcome and running OLS V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 16. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Results: Net Financial Assets: Selected variables (CV and Plug-in): income, two–earner, age, two–earner ∗ family–size, ira ∗ age, home–owner ∗ age2, ira ∗ income, home–owner ∗ income, home–owner ∗ income2 Estimated Effect: 8687.43 (1274.76) [Recall baseline estimates: 9216.5 (1340.6)] Total Wealth: Selected variables (CV and Plug-in): income, two–earner, ira ∗ age, home–owner ∗ age, home–owner ∗ age2, ira ∗ income, home–owner ∗ income, age ∗ income Estimated Effect: 5374.79 (1990.94) [Recall baseline estimates: 6612.0 (2110.1)] V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 17. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain ATE with Heterogeneous Treatment Effects: We could also estimate the ATE allowing for full heterogeneity of treatment effects. Model: yi = di g1(xi ) + (1 − di )g0(xi ) + ζi di = m(xi ) + ui (as before) Following Hahn (1998), can estimate ATE (α) as α = 1 N N i=1 di (yi − g1(xi )) m(xi ) − (1 − di )(yi − g0(xi )) 1 − m(xi ) + g1(xi ) − g0(xi ) where we obtain estimates of the functions g0(·), g1(·), and m(·) as above using variable selection methods. V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 18. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain ATE with Heterogeneous Treatment Effects: Estimates of m(·) were obtained previously. Net Financial Assets and Total Wealth: Use the same set of covariates broken out by values of 401(k) eligibilty dummy Use the CV penalty parameter from before scaled for the number of observations in each category (3682 are eligible, 6233 not-eligible) I.e. multiply CV penalty parameter by 3682/9915 in eligible models and by 6233/9915 in not-eligible models V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 19. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain ATE Results: Financial Assets: Selected Variables (eligible): income, IRA, age, two–earner ∗ family–size, IRA ∗ age, home–owner ∗ age, IRA ∗ income, home–owner ∗ income Selected Variables (non-eligible): IRA, home–owner, age, IRA ∗ age, home–owner ∗ age Estimates: 8032.54 (1136.6) Total Wealth: Selected Variables (eligible): income2 , two–earner, age, IRA ∗ age, home–owner ∗ age, IRA ∗ income, home–owner ∗ income, age ∗ income Selected Variables (non-eligible): income, IRA, IRA ∗ age, home–owner ∗ age, home–owner ∗ age2 , IRA ∗ income, home–owner ∗ income Estimates: 6180.29 (1828.5) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 20. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Example 2: Effect of Abortion on Crime Goal: Understand causal effect of dit (abortion) on yit (crime). (Donohue and Levitt 2001) Problem: Abortion rates are not randomly assigned Key concern: states are different for lots of reasons crime rates in states evolve differently for lots of reasons factors that are associated to differences in states, state evolutions, etc. may also be related to differences in abortion rates, abortion rate evolution, etc. Most of the favorite stories for confounding obviously fit here. V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 21. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Example 2: Baseline Model Donohue and Levitt (2001) baseline model yit = ditα0 + xitβg + γt + δi + εit yit = crime-rate (violent, property, or murder per 1000) dit = “effective” abortion rate xit = eight controls: log of lagged prisoners per capita, the log of lagged police per capita, the unemployment rate, per-capita income, the poverty rate, AFDC generosity at time t − 15, a dummy for concealed weapons law, and beer consumption per capita γt time effects δi state effects V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 22. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Baseline Results: Violent Property Murder Estimator Effect Std. Err. Effect Std. Err. Effect Std. Err. DL Table 4 -.129 .024 -.091 .018 -.121 .047 First-Diff -.152 .034 -.108 .022 -.204 .068 Use first-differences from now on. Assumes confounds are time invariant, state invariant, or captured by small set of variables in xit State-specific characteristics related to features of abortion only allowed to be related to level of crime rate I.e. evolution of abortion rates and crime rates unrelated after subtracting the mean V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 23. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Flexible Trends “Model” with flexible trends: yit = ditα0 + g(xi1, xi2, . . . , xiT , t, i) + ζit dit = m(xi1, xi2, . . . , xiT , t, i) + uit Want to allow flexible state-specific trends BUT clearly can’t learn about about effect if trends are allowed to do anything. I.e. if m(·) and g(·) can vary arbitrarily across i and t clearly can’t identify α0 Need to reduce the dimension. V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 24. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Intuitive Dimension Reduction Abortion Baseline: Functional form g(xi1, xi2, . . . , xiT , t, i) = xitβg + γt + δi additively separable (Correlated) Evolution of state crime and abortion rates captured by macro-economy (γt), constant state-specific level shifts from aggregate (δi ), and small number of time varying variables Variable Selection Select 8 time varying state-level control variables (of the many state-level macro series available) What if there are (correlated) differences in abortion and crime evolution not captured by aggregate evolution? V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 25. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Formal Dimension Reduction Model (First-Differences): ∆yit = ∆ditα0 + γt + witπ1 + εit ∆dit = κt + witπ2 + uit Use first-difference to remove state effects time effects (not-selected over), included in both reduced form models With wit = ∆xit first-difference version of Donohue and Levitt (2001) model. (Results presented above.) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 26. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Formal Dimension Reduction: Variables wit: 1. differences in eight original controls 2. initial conditions of controls, abortion rate, crime rate 3. within state averages of controls, abortion rate 4. t, t2 5. interactions of 1-3 with 4 corresponds to a model for crime and abortion rates with a cubic trend that may depend on baseline state characteristics p = 284 n = 576 Variables in 2-5 motivated by a desire to have a flexible, sensible model of evolution V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 27. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Kitchen Sink: Why not use “everything”? Violent Property Murder Estimator Effect Std. Err. Effect Std. Err. Effect Std. Err. All Controls .006 .755 -.154 .224 2.240 2.804 A flexible cubic trend arguably isn’t going crazy, but everything is rendered very imprecise. Probably a lot of things added aren’t really important V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 28. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Variable Selection: Variables in each equation selected using LASSO. 10-Fold CV Plug-in penalty parameter: 2.2 2n(log(2p/.05)) Penalty loadings estimated using iterative Lasso with 100 max iterations V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 29. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Variable Selection: Violent crime selection: Abortion equation (CV and Plug-in): (11) lagged prisoners, lagged police, lagged unemployment, initial income, initial income difference × t, initial beer consumption difference × t, initial income × t, initial prisoners squared × t2, average income, average income × t, initial abortion rate Crime equation (CV): No variables Crime equation (Plug-in): Initial difference in abortion rate × t, Initial abortion rate × t V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 30. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Variable Selection: Property crime selection: Abortion equation (CV): (7) lagged prisoners, lagged income, initial income, initial income difference, initial income difference × t, average income, initial abortion rate Abortion equation (Plug-in): (12) lagged prisoners, lagged police, lagged income, Initial income difference, initial income, initial income difference × t, initial beer difference × t, initial prisoners squared × t, initial prisoners squared × t2, initial beer squared × t2, average income, initial abortion rate Crime equation (CV): No variables Crime equation (Plug-in): (3) Initial income squared × t, Initial income squared × t2, average AFDC squared V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 31. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Variable Selection: Murder rate selection: Abortion equation (CV): (5) lagged prisoners, lagged unemployment, initial income × t, average income × t, initial abortion rate Abortion equation (Plug-in): (9) lagged prisoners, lagged unemployment, initial unemployment difference squared, initial prisoners × t, initial income times t, initial beer difference × t2, average income × t, initial abortion rate, initial abortion rate × t Crime equation (CV and Plug-in): No variables selected V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 32. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Estimated Effects of Abortion on Crime Violent Property Murder Estimator Effect Std. Err. Effect Std. Err. Effect Std. Err. DL Table 4 -.129 .024 -.091 .018 -.121 .047 First-Diff -.152 .034 -.108 .022 -.204 .068 All Controls .006 .755 -.154 .224 2.240 2.804 Post-DS (CV) -.119 .120 -.042 .059 -.122 .131 Post-DS (Plug-in) -.174 .120 -.052 .070 -.123 .148 “Post-DS” Results in-line with critique raised by Foote and Goetz (2008). V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 33. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Example 3: Institutions and Growth (AJR 2001) Equation of interest: log(GDP per capitai ) = α(Protection from Expropriationi ) + xi β + εi Endogeneity/Simultaneity: better institutions may lead to higher incomes higher incomes may lead to the development of better institutions V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 34. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Instrument Instrument = European Settler Mortality: First Stage: settlers set up better institutions in places they might stick around in (i.e. where they were less likely to die) and institutions are highly persistent Exclusion: GDP, while persistent, is unlikely to be strongly influenced by the factors that determined the exact development of institutions 100+ years ago except through the institutions established V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 35. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Controls There may be other factors that are highly persistent and related to development of institutions and GDP leading candidate - geography (Geographic Determinism) Want to control for geography and use variation in mortality not captured by geography Baseline AJR results control linearly for latitude AJR consider continent dummies, split by continent, first-stage gets weak with some of these Baseline estimates find strong positive effect of institutions: First-stage: -0.5372 (0.1545) α: 0.9692 (0.2128) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 36. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Setting Want to control flexibly for geography but may lose power to identify effect of institutions. IV with one instrument and unknown controls: yi = αdi + xi β + εi di = π1zi + xi Π2 + vi zi = xi γ + ui Believe zi is a valid instrument after controlling for xi . V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 37. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Setting Same as estimating regression coefficients after conditioning but now have three reduced form/prediction equations: yi = xi β + εi di = xi Π2 + vi zi = xi γ + ui Do variable selection on the three equations and use union of selected variables as controls. V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 38. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Variable Selection Select over a flexible function of geography: Africa, Asia, North America, South America (dummies) latitude, latitude2, latitude3, (latitude-.08)+, (latitude-.16)+, (latitude-.24)+, ((latitude-.08)+)2, ((latitude-.16)+)2, ((latitude-.24)+)2, ((latitude-.08)+)3, ((latitude-.16)+)3, ((latitude-.24)+)3 Using all these variables results in a very weak first-stage: First-stage: -0.2164 (0.2191) α: 0.9480 (0.7384) (and unreliable) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 39. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Variable Selection: Variables for each equation selected with Lasso: Penalty loading calculated with iterative method 10-Fold CV and Plug-in [2.2 2n(log(2p/γ)) with γ = .1/ log(n)] give same results GDP - Africa Expropriation - Africa Mortality - Africa V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 40. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Results: Latitude All Controls Selection First Stage -0.5372 -0.2164 -0.5429 (0.1545) (0.2191) (0.1719) Second Stage 0.9692 0.9480 0.7710 (0.2128) (0.7384) (0.1971) First Stage - Coefficient on Settler Mortality Second Stage - Coefficient on Protection from Expropriation V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 41. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Example 4: Eminent Domain Estimate economic consequences of the law of takings or eminent domain (following Chen and Yeh (2010)) Consider effect on Case-Shiller Price Index. Eminent domain (or law of takings): when a government actor physically acquires the property rights of one or more individuals Laws/judicial decisions may not be exogenous. V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 42. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Instruments Random assignment of judges to three judge federal appellate panels ⇒ Panel demographics randomly assigned conditional on the distribution of characteristics of federal circuit court judges in a given circuit-year V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 43. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Why variable selection? Many characteristics of three judge panels Any set of characteristics of the three judge panel unrelated to structural unobservable Some instruments may be more valuable than others Could attempt to solve through intuition. Number of judges who are democrats: Judges’ political affiliation known to predict decisions for many outcomes First Stage: 0.0664 (0.0713) Second Stage: -0.2583 (0.5251) (Unreliable) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 44. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Model Econometric model: yct = ac + bt + gct + θ Takings Lawct + Wctd + ct Takings Lawct = αc + βt + γct + Wctδ + zctΠ + vct c circuit; t time yct: log(house price index) or log(GDP) Takings Lawct: number of pro-plaintiff (overturn gov’t taking) apellate takings decisions (ac, αc), (bt, βt), and (gct, γct): circuit-specific effects, time-specific effects, and circuit-specific time trends. (d, δ) coefficients on exogenous variables zct instruments with coefficients Π V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 45. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Model Controls (Wct): ≈ 30 probability controls for panel demographics a dummy for no cases in that circuit-year number of takings appellate decisions θ: effect of an additional decision upholding individual property rights on an economic outcome Any set of characteristics of three judge panels is potentially an instrument. V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 46. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Instruments Do ex ante dimension reduction by intuively selecting characteristics thought to have strong signal about judge preferences over government vs. individual property rights: 42 baseline variables gender, race, religion (jewish, catholic, protestant, evangelical, none) , political affiliation, bachelor obtained in-state, bachelor from public university, JD from a public university, has an LLM or SJD, elevated from a district court number of panels with 1, 2, or 3 members with each characteristic + cubic in number panels democrat, number with JD from public university, number elevated from district first order interactions between all variables V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 47. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Additional pre-processing Remove instruments likely to be irrelevant based on features of instrument vector alone: remove any instrument with mean < .05, standard deviation after partialling out controls < .000001 remove one from each pair of any pair with bivariate correlation > .99 in absolute value Note: Selection based on characteristics of z cannot introduce bias under exclusion restriction. Leaves 147 instruments (n = 183) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 48. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Obtaining the Estimates Method: Post-LASSO Penalty parameters and loadings: Penalty loading calculated through iterative scheme 10-fold Cross-validation to obtain penalty Plug-in penalty: 2.2 √ nΦ−1(1 − γ/(2p)), γ = .1/ log(n) (Gives same results as 10-fold CV) V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects
  • 49. Example 1: 401(k) Example 2: Crime Example 3: Institutions Example 4: Eminent Domain Results: Case-Shiller Prices LASSO Selection Results: 1+ JD public squared First Stage: 0.4495 (0.0511) [Using 1+ democrat: 0.0664 (0.0713)] Second Stage: 0.0648 (0.0240) [Using 1+ democrat: -0.2583 (0.5251) (Unreliable)] V. Chernozhukov, M. Gentzkow, C. Hansen, J. Shapiro, M. Taddy High-Dimensional Methods: Examples for Inference on Structural Effects