Beruflich Dokumente
Kultur Dokumente
handle it
Arooj Arshad
PhD Scholar
Goals
Discuss ways to evaluate
and understand missing data
Discuss common missing
Carol
Dweck,
data
methods
based
Know
on research
the advantages and
disadvantages
of common
on belief
systems,
and methods
their role in
Treatment
of the missing
motivation
and
data
achievement,
has
ways of missing
akeyEfficient
contribution
data handling
in originating
and
explaining implicit
theories of
intelligence/ability
.
Missing
Data
If any data on any variable
from any participant is not
present, the researcher is
dealing with missing or
incomplete data
Legitimate
Missing
Data
Legitimate missing data is an
absence of data when it is
appropriate for there to be an
absence.
(Cole, 2008)
The
Point to be remembered.
All
Nature of Missingness
Variance
Missing data can
sometimes lead
to wrong
standard errors.
Wrong study
conclusions
about
relationship of
variables to
outcomes.
(Roth, 2001)
10
Deletion Methods
Model-Based Methods
11
Deletion Method
12
analyze cases
with complete data
dropping the missing
variables.
When a researcher is
estimating a model,
such as a linear
regression, most
statistical packages
use listwise deletion
by default.
13
(Cole, 2008)
Ease of implementation.
Comparability across analyses
Disadvantage
14
(Cole, 2008)
Hot-Deck Imputation
Researcher
Disadvantages
16
17
Mean/Mode
substitution
Dummy variable control
Conditional mean substitution
18
Mean/Mode Substitution
Replace
or mode
Run analyses as if complete cases analysis
Advantages
Can
Disadvantages
Reduces
variability(underestimate standard
error).
Weakens covariance and correlation estimates
in the data (because It ignores relationship
between variables)
19
20
Mean
substitution is worth
considering when correlations
between variables in the data are low
and less than 10% of the data are
missing (Donner, 1982).
21
Disadvantage
Results in biased estimates
Not theoretically driven
22
Regression Imputation
Replaces
Disadvantages:
Overestimates model fit and correlation
estimates
Weakens variance
23
24
Likelihood Using EM
algorithm
Multiple imputation
25
Disadvantages
26
we
27
Multiple Imputation
Impute:
28
Multiple Imputation
Advantages:
Disadvantages:
Cumbersome coding
Room for error when specifying models
29
Multiple Imputation
Using this likelihood function the ML
procedure provides parameter estimates
based on all available data, including the
incomplete cases. However, simulation
studies show that ML is an inadequate
estimation technique for some small
sample problems and results in biased
estimates (Little and Rubin, 1989). For
large samples ML is a preferred method for
dealing with missing data (Schafer and
Graham, 2002).
30
31
32
Roth, 1994
33
References
Allison, P. D. (2001). Missing Data. Sage University Papers Series on
Quantitative Applications in the Social Sciences. Thousand
Oaks: Sage.
Cole, J. C. (2008). How to deal with missing data. In J. W. Osborne
(Ed.), Best practices in quantitative methods (214238). Thousand
Oaks, CA: Sage.
Enders, C. (2010). Applied Missing Data Analysis. Guilford Press: New
York.
Little, R. J., & Donald, R. (2002). Statistical Analysis with Missing
Data. John Wiley & Sons, Inc: Hoboken.
Roth, P. (1994). Missing data: A conceptual review for applied
psychologists. Personnel Psychology, 47, 537-560.
Schafer, J. L., John W. G. (2002). Missing Data: Our View of the State
of the Art. Psychological Methods, (7), 147-177.
34