Sum of Squared Residuals
$$\mathrm{SSR} = (y - X\hat{\beta})'(y - X\hat{\beta}) = y'y - \hat{\beta}'X'y - y'X\hat{\beta} + \hat{\beta}'X'X\hat{\beta}$$

Since $\hat{\beta}'X'y$ is a scalar, it equals its own transpose $y'X\hat{\beta}$, so the two cross terms combine:

$$\mathrm{SSR} = y'y - 2\hat{\beta}'X'y + \hat{\beta}'X'X\hat{\beta}$$
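A quick numerical check of this expansion, sketched in NumPy (the data and the candidate vector b are randomly generated for illustration; the identity holds for any b, not just the OLS estimate):

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 50, 3
X = rng.normal(size=(n, k))   # illustrative design matrix
y = rng.normal(size=n)        # illustrative response
b = rng.normal(size=k)        # any candidate coefficient vector

ssr_direct = (y - X @ b) @ (y - X @ b)
ssr_expanded = y @ y - 2 * b @ (X.T @ y) + b @ (X.T @ X) @ b
print(np.isclose(ssr_direct, ssr_expanded))  # True
```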
Minimise the SSR

Differentiating the SSR with respect to $\hat{\beta}$ and setting the derivative to zero gives the first-order condition

$$\frac{\partial\,\mathrm{SSR}}{\partial\hat{\beta}} = -2X'y + 2X'X\hat{\beta} = 0$$

From this minimum condition we get the "normal equations":

$$(X'X)\hat{\beta} = X'y$$
Solve for the OLS estimator $\hat{\beta}$ by premultiplying both sides by $(X'X)^{-1}$:

$$(X'X)^{-1}(X'X)\hat{\beta} = (X'X)^{-1}X'y$$

By definition, $(X'X)^{-1}(X'X) = I$, so

$$I\hat{\beta} = (X'X)^{-1}X'y$$

$$\hat{\beta} = (X'X)^{-1}X'y$$
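As a sanity check, a minimal NumPy sketch (with randomly generated data) that computes $\hat{\beta}$ from the normal equations and compares it against NumPy's least-squares routine:

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 100, 3
X = rng.normal(size=(n, k))
y = rng.normal(size=n)

# Textbook formula: beta = (X'X)^{-1} X'y
beta_formula = np.linalg.inv(X.T @ X) @ X.T @ y
# Numerically preferable: solve the normal equations (X'X) beta = X'y
beta_solve = np.linalg.solve(X.T @ X, X.T @ y)
# Library least-squares solver
beta_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)

print(np.allclose(beta_formula, beta_solve))  # True
print(np.allclose(beta_formula, beta_lstsq))  # True
```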
Properties |
The observed values of X are uncorrelated with the residuals. |
$X'e = 0$ implies that for every column $x_k$ of $X$, $x_k'e = 0$.
To see this, substitute $y = X\hat{\beta} + e$ into the normal equations:

$$(X'X)\hat{\beta} = X'(X\hat{\beta} + e)$$

$$(X'X)\hat{\beta} = (X'X)\hat{\beta} + X'e$$

$$X'e = 0$$
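A one-line numerical confirmation (assumed random data; residuals computed from the OLS fit):

```python
import numpy as np

rng = np.random.default_rng(2)
X = rng.normal(size=(100, 3))
y = rng.normal(size=100)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
e = y - X @ beta_hat
print(np.allclose(X.T @ e, 0))  # True: each column of X is orthogonal to e
```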
The sum of the residuals is zero. |
If there is a constant, then the first column of $X$ (i.e. $x_1$) will be a column of ones. The first element of the vector $X'e$ is then $1 \times e_1 + 1 \times e_2 + \dots + 1 \times e_n = \sum_i e_i$, so for it to be zero it must be the case that $\sum_i e_i = 0$.
The sample mean of the residuals is zero. |
$\bar{e} = \sum_i e_i / n = 0$.
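Verifying both facts in NumPy; note the explicit column of ones, since these two properties require a constant term (data again randomly generated):

```python
import numpy as np

rng = np.random.default_rng(3)
n = 100
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])  # constant included
y = rng.normal(size=n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
e = y - X @ beta_hat
print(np.isclose(e.sum(), 0), np.isclose(e.mean(), 0))  # True True
```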
The regression hyperplane passes through the means of the observed values (X and y). |
This follows from the fact that $\bar{e} = 0$. Recall that $e = y - X\hat{\beta}$. Averaging over the $n$ observations gives $\bar{e} = \bar{y} - \bar{x}'\hat{\beta} = 0$, where $\bar{x}$ is the vector of column means of $X$. This implies that $\bar{y} = \bar{x}'\hat{\beta}$, which shows that the regression hyperplane goes through the point of means of the data.
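The same point checked numerically (constant term included, as before):

```python
import numpy as np

rng = np.random.default_rng(4)
n = 100
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
y = rng.normal(size=n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
x_bar = X.mean(axis=0)  # column means of X (its first entry is 1)
print(np.isclose(y.mean(), x_bar @ beta_hat))  # True: hyperplane through the means
```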
The predicted values of y are uncorrelated with the residuals. |
$\hat{y}'e = (X\hat{\beta})'e = \hat{\beta}'X'e = 0$
The mean of the predicted $y$'s for the sample will equal the mean of the observed $y$'s: $\bar{\hat{y}} = \bar{y}$. This follows from $\hat{y} = y - e$ and $\bar{e} = 0$.
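Both of these last two properties checked at once (random data, constant included so that $\bar{e} = 0$ holds):

```python
import numpy as np

rng = np.random.default_rng(5)
n = 100
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
y = rng.normal(size=n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
y_hat = X @ beta_hat
e = y - y_hat
print(np.isclose(y_hat @ e, 0))            # fitted values orthogonal to residuals
print(np.isclose(y_hat.mean(), y.mean()))  # mean of fitted equals mean of observed
```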
The Gauss-Markov Theorem: Proof that $\hat{\beta}$ is an unbiased estimator of $\beta$

The Gauss-Markov theorem states that, under the classical assumptions, $\hat{\beta}$ is the best linear unbiased estimator (BLUE) of $\beta$; the unbiasedness and linearity parts are proved below.
Substitute the model $y = X\beta + \varepsilon$ into the estimator:

$$\hat{\beta} = (X'X)^{-1}X'y = (X'X)^{-1}X'(X\beta + \varepsilon) = \beta + (X'X)^{-1}X'\varepsilon$$

using $(X'X)^{-1}X'X = I$. Taking expectations (treating $X$ as fixed):

$$E[\hat{\beta}] = \beta + (X'X)^{-1}X'E[\varepsilon]$$

Since $E[\varepsilon] = 0$ by assumption,

$$E[\hat{\beta}] = \beta$$
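A Monte Carlo sketch of unbiasedness, under assumed true coefficients and a fixed design: the average of $\hat{\beta}$ across many simulated samples should sit close to $\beta$ (all numbers here are illustrative):

```python
import numpy as np

rng = np.random.default_rng(6)
n, k, reps = 100, 3, 5000
beta = np.array([1.0, -2.0, 0.5])   # assumed true coefficients
X = rng.normal(size=(n, k))         # design held fixed across replications

estimates = np.empty((reps, k))
for r in range(reps):
    eps = rng.normal(size=n)        # errors with E[eps] = 0
    y = X @ beta + eps
    estimates[r] = np.linalg.solve(X.T @ X, X.T @ y)

print(estimates.mean(axis=0))  # close to [1.0, -2.0, 0.5]
```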
Proof that $\hat{\beta}$ is a linear estimator of $\beta$.

From above, $\hat{\beta} = \beta + (X'X)^{-1}X'\varepsilon$. Letting $A = (X'X)^{-1}X'$,

$$\hat{\beta} = \beta + A\varepsilon$$

which is linear in $\varepsilon$; equivalently, $\hat{\beta} = Ay$, a linear function of the observations $y$.
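A short sketch of the linearity claim: the matrix $A$ depends only on $X$, and $\hat{\beta}$ is just $A$ applied to $y$ (random data for illustration):

```python
import numpy as np

rng = np.random.default_rng(7)
X = rng.normal(size=(100, 3))
y = rng.normal(size=100)

A = np.linalg.inv(X.T @ X) @ X.T  # A depends on X only, not on y
beta_hat = A @ y                  # beta-hat is a linear function of y
print(np.allclose(beta_hat, np.linalg.solve(X.T @ X, X.T @ y)))  # True
```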