Beruflich Dokumente
Kultur Dokumente
In answering these below, paste the Stata output only when it is asked. When
pasting output, use the copy as picture option. When testing a hypothesis, be sure
to mention the distribution of the test statistic, its degrees of freedom, the level of
significance and the associated critical value. DO NOT USE THE STATA test
COMMAND.
It would be easiest if you inserted your answer between the questions below and
returned this document. Rename the document as `your name.docx’ and upload it
on LMS.
You have to do this exam by yourself. You are allowed to consult the textbook and
your notes. You are NOT allowed to consult anybody whether by speaking, by text
messages or email or any other means. Violations will attract penalties as per
Ashoka policy.
1. (a) Regress log of wages on a constant and the female dummy. Paste
output here.
t= (Betajhat – aj)/se(Betajhat)
Therefore, as we found earlier, Betajhat = -.8553218 while se(Betajhat) =
.0635312
Therefore.
This value is statistically significant at the 10% significance level, but not at the
5% or 1% level. Thus, the null hypothesis that the coefficient on female dummy
is -0.5 can be rejected at the 10% level, but not at the 5% or 1% significance
level.
2. (a) Regress log of wages on a constant, the female dummy, age of the
individual and the square of age. Paste your output here.
(b) Controlling for age and the square of age does not seem to substantially
change the coefficient of the female dummy. Why is that so?
[5+5]
This is due to collinearity between age and square of age with female dummy.
Age and square of age do not have significant collinearity with female dummy,
thus it’s coefficient is not substantially changed. The correlation between female
dummy and age is 0.0269 while the correlation between female dummy and the
square of age is 0.0346. As we can see, they do not have a significant or high
correlation.
3. (a) Regress log of wages on a constant, the female dummy, age of the
individual the square of age and the social group dummies for scheduled caste,
for scheduled tribe and for other backward caste. Note the omitted category is
the general castes (or forward castes). Paste your output here.
(b) Test the null hypothesis that none of the social group dummmies matter, i.e.,
controlling for sex, age and square of age, the average of log wages is the same
for all categories: scheduled castes, scheduled tribes, other backward castes and
the general (forward) castes. Do NOT use the Stata test command.
Therefore,
F= (R^2ur – R^2r)/q
_____________________
(1-R^2ur)/ df ur
R^2ur = 0.2541
R^2r = 0.2236
q= 3
df ur= n-k-1 = 1000-6-1 = 993
Thefore,
F value = 0.000137
(c) Test the null hypothesis that relative to the general (forward) castes,
scheduled castes and other backward castes suffer the same extent of
discrimination. If this requires new regressions, paste the output in your
answer.
[5+15+15]
In this case, the null that forward castes suffer the same extent of discrimination
as scheduled castes and other backward castes is rejected. This is because the p-
value is very small, close to zero. This indicates that the data is statistically
significant, wherein we can reject the null hypothesis.
4. (a) Regress log of wages on a constant, the female dummy, age of the
individual the square of age, the social group dummies for scheduled caste, for
scheduled tribe and for other backward caste, and the education dummies for
illiterate, literate, primary, secondary, and higher secondary. Paste the output
here.
. regress lnwage female age age2 scd std obc illiterate literate primary second
> ary higher_secondary
(b) Compare the above regression with the regression in question 3 (without the
education dummies). Does the inclusion of education dummies alter the
discrimination against women, scheduled castes, scheduled tribes and other
backward castes? Why?
[5+15]
(b) Is either of the land variables individually significant at the 5 or 10% level?
(c) Now drop land owned (LandO) and re-run the regression. Is the included
land variable significant at the 5 or 10% level?
The included land variable is significant both at the 5% and 10% level.
There is a very high correlation between land owned and land possessed. Thus,
when we drop the land owned variable, the land possessed variable is
statistically significant. They have 0.97 correlation approximately.
[4+4+7]