Beruflich Dokumente
Kultur Dokumente
Introduction to Econometrics
Professor Woodbury
Fall Semester 2015
Section 2: Tu/Th 10:20-11:40, 117 Berkey
Section 3: Tu/Th 12:40-2:00, 117 Berkey
What is econometrics?
The simplest definition is:
The statistical analysis of economic (and related) data
But there are many other definitions
Use of statistical methods to test hypotheses about economic
relationships, estimate actual magnitudes, and use these estimates
to make quantitative predictions of economic events
How we use theory and data from economics, along with the
tools from statistics, to answer how much type questions
2
Questions like these may seem unanswerable, but we will see that
econometrics is about finding ways to obtain credible answers to
these questions
In fact, most of the course deals with difficulties arising from using
observational data to estimate causal effects
confounding effects (omitted factors)
simultaneous causality
correlation does not imply causation
American
Community
Survey
Current
Population Survey
Establishment
Survey
Unit of
observation
Household
Household
Establishment
Establishment
Establishment
Frequency
Annual
Monthly
Monthly
Quarterly
Monthly
Type
Cross section
Cross section
(panel aspects)
Cross section
Cross section
Cross section
Sample size
~3,500,000
per year
60,000 households
145,000
establishments1
10,800
establishments1
16,400
establishments1
Exclusions from
sample
None
None
Agriculture and
self-employed
Agriculture and
self-employed
Agriculture and
self-employed
Payroll records
Mail survey
How gathered
10
12
13
Initial
look
data
Initial
lookatatsome
the data:
(You(You
should
already
know
how
totointerpret
should
already
know
how
interpretthis
this table)
table)
doesnt
us anything
about
therelationship
relationship between
This This
tabletable
doesnt
tell tell
us anything
about
the
betweenand
testthe
scores
and the STR. ratio!
test scores
student-teacher
1/2/3-8
14
Do districts
smaller
classes
have
higher
testscores?
scores?
Do districts
withwith
smaller
classes
have
higher
test
Scatterplot
of test
score
v. student-teacherratio
ratio
Scatterplot
of test
score
v. student-teacher
What
does
figure
show?
What
does
thisthis
figure
show?
1/2/3-9
15
16
(Y )
Standard
deviation (sBY)B
Small
657.4
19.4
238
Large
650.0
17.9
182
Class
Size
Average score
1. Estimation of
2. 2.Test
thatthat
==
00
Testthe
thehypothesis
hypothesis
3. Construct a confidence interval for
3. Construct a confidence interval for
17
1/2/3-11
Estimation
1. Estimation
Ysmall Ylarge =
nsmall
nsmall
i 1
Yi
1
nlarge
nlarge
Yi
i 1
= 657.4 650.0
= 7.4
Is this a large difference in a real-world sense?
Standard
deviationinacross
districtssense?
= 19.1
Is this a large
difference
a real-world
between
60th
and 75th
percentiles of test score
Difference
Standard
deviation
across
districts
= 19.1
distribution
is 667.6
659.4
8.2
Difference
between
60th
P P and
75th
P =P percentiles
of test score
enough difference to be important for school
This is aisbig
distribution
667.6
659.4 = 8.2
reform discussions, for parents, or for a school committee?
18
2. Hypothesis testing
2. Hypothesis testing
Hypothesis testing
Difference-in-means
test: compute the t-statistic,
Difference-in-means test compute the t-statistic:
1/2/3-13
19
sBYB
small
Y
657.4
19.4
238
large
650.0
17.9
182
Size
Ys Yl
ss2
ns
sl2
nl
657.4 650.0
19.42
238
17.92
182
7.4
= 4.05
1.83
|t| > 1.96, so we reject (at the 5% significance level) the null
|t| hypothesis
> 1.96, so that
reject
themeans
5% significance
the(attwo
are the samelevel) the null
20
3. Confidence interval
A 95%Confidence
confidence interval
interval for the difference between the
meansAis,95% confidence interval for the difference between the means
is:
(Ys Yl )
1.96 SE(Ys Yl )
1/2/3-15
21
Stata issues
22
23
sum
6. Sort the data, then produce descriptive statistics for each of the
two groups of school districts (large and small STR):
sort lg
by lg: sum
ttest testscr, by lg
25
26
27