🎳Intro to Econometrics Unit 9 Review

Instrumental variables are crucial in econometrics for addressing endogeneity issues. They help estimate causal effects when explanatory variables are correlated with error terms. This topic explores the key criteria for valid instruments: relevance and exogeneity.

Relevance ensures instruments correlate strongly with endogenous regressors, while exogeneity requires they're uncorrelated with error terms. The notes cover instrument strength, overidentification, weak instrument problems, and trade-offs in instrument selection. Understanding these concepts is vital for proper IV implementation.

Relevance of instruments

Instruments must be relevant to the endogenous regressors in the model to provide consistent estimates of the causal effect of interest
The relevance condition requires that the instruments have a strong correlation with the endogenous explanatory variables

Correlation with endogenous regressors

Instruments should have a non-zero correlation with the endogenous regressors in the model
The correlation between the instruments and endogenous regressors can be assessed using the first-stage regression, where the endogenous variables are regressed on the instruments
A high correlation between the instruments and endogenous regressors indicates that the instruments are relevant and can effectively isolate the exogenous variation in the endogenous variables
Example: In a study of the effect of education on earnings, a valid instrument could be the distance to the nearest college, as it is likely to be correlated with an individual's level of education

Strength of instruments

The strength of an instrument refers to the magnitude of its correlation with the endogenous regressors
Weak instruments, or those with low correlation, can lead to biased and inconsistent estimates in the IV regression
The strength of instruments can be evaluated using the F-statistic from the first-stage regression
- A high F-statistic (typically greater than 10) suggests that the instruments are strong and relevant
Example: A study examining the impact of air pollution on health outcomes may use wind direction as an instrument, as it is strongly correlated with the concentration of pollutants in the air

Exogeneity of instruments

The exogeneity condition requires that the instruments are uncorrelated with the error term in the structural equation
Instruments should only affect the dependent variable through their influence on the endogenous regressors and not through any other channels

Uncorrelated with error term

For an instrument to be valid, it must be uncorrelated with the unobserved factors that affect the dependent variable (i.e., the error term)
If the instrument is correlated with the error term, the IV estimates will be biased and inconsistent
The assumption of instrument exogeneity cannot be directly tested, as the error term is unobservable
Researchers must rely on economic theory and intuition to justify the exogeneity of their chosen instruments

Exclusion restriction

The exclusion restriction states that the instruments should only affect the dependent variable through their impact on the endogenous regressors
In other words, the instruments should have no direct effect on the dependent variable, other than through the endogenous variables
Violating the exclusion restriction leads to biased and inconsistent IV estimates
Example: In a study of the effect of military service on earnings, the draft lottery number may serve as a valid instrument, as it affects earnings only through its impact on the likelihood of military service and not through any other channels

Correlation with endogenous regressors, Linear Quantile Regression and Endogeneity Correction

Overidentifying restrictions

Overidentifying restrictions occur when there are more instruments than endogenous regressors in the model
Having more instruments than necessary allows for the testing of the validity of the instruments

Surplus of instruments

When the number of instruments exceeds the number of endogenous regressors, the model is said to be overidentified
Overidentification provides an opportunity to test the joint validity of the instruments using the Hansen J-statistic or the Sargan test
If the overidentifying restrictions are satisfied, the surplus instruments can help improve the efficiency of the IV estimates

Testing validity with restrictions

The Hansen J-statistic and the Sargan test are used to assess the validity of overidentifying restrictions
These tests evaluate whether the instruments are uncorrelated with the error term in the structural equation
A failure to reject the null hypothesis of these tests suggests that the overidentifying restrictions are satisfied and the instruments are valid
Example: In a study with multiple instruments, such as parental education and siblings' education as instruments for an individual's education, overidentifying restriction tests can be used to assess the validity of these instruments

Weak instruments

Weak instruments are those that have a low correlation with the endogenous regressors in the model
The use of weak instruments can lead to biased and inconsistent IV estimates, even in large samples

Bias in IV estimators

When instruments are weak, the IV estimator can be biased towards the OLS estimator
The bias of the IV estimator increases as the correlation between the instruments and the endogenous regressors decreases
In the presence of weak instruments, the IV estimates may be more biased than the OLS estimates, defeating the purpose of using IV methods

Finite sample properties

Weak instruments can lead to poor finite sample properties of the IV estimator
With weak instruments, the IV estimator may have a large standard error and a non-normal sampling distribution, even in large samples
Confidence intervals based on weak instruments may have incorrect coverage probabilities, leading to invalid inference

Rule of thumb for F-statistic

A commonly used rule of thumb to assess the strength of instruments is the F-statistic from the first-stage regression
An F-statistic greater than 10 is often considered a threshold for strong instruments
However, this rule of thumb should be used with caution, as it may not always be reliable, particularly with multiple endogenous regressors or non-i.i.d. errors
Example: In a study with a single endogenous regressor, an F-statistic of 5 in the first-stage regression would suggest that the instrument is weak and may lead to biased IV estimates

Instrument exogeneity vs relevance

When selecting instruments, researchers face a trade-off between the exogeneity and relevance of the instruments
Instruments that are highly relevant may be more likely to violate the exogeneity condition, while instruments that are strictly exogenous may have weaker relevance

Tradeoffs in instrument selection

Researchers must carefully consider the balance between instrument exogeneity and relevance when choosing instruments
Instruments that are more closely related to the endogenous regressors may have a stronger first-stage relationship but may also be more likely to violate the exclusion restriction
Conversely, instruments that are less related to the endogenous regressors may be more plausibly exogenous but may suffer from weak instrument bias

Consequences of invalid instruments

Using invalid instruments that violate either the exogeneity or relevance condition can lead to biased and inconsistent IV estimates
Instruments that are not exogenous (i.e., correlated with the error term) will produce estimates that are biased and inconsistent, even in large samples
Instruments that are not relevant (i.e., weakly correlated with the endogenous regressors) will lead to estimates that are biased towards the OLS estimator and may have poor finite sample properties
Example: In a study of the effect of health insurance on health outcomes, using an individual's occupation as an instrument may be problematic, as occupation could be correlated with both insurance status and health outcomes, violating the exogeneity condition

🎳Intro to Econometrics Unit 9 Review