Beruflich Dokumente
Kultur Dokumente
Factor analysis
Cluster analysis
Higher-way contingency table analyses
Agreement (kappa)
Correlation analysis (correlation coefficient)
Accuracy (sensitivity, specificity)
Continuous
Effect of nightly exercise on hrs
of sleep before/after in
insomniacs
Binary (yes/no)
Patient satisfaction before vs.
after color change in hospital
ward
Mastectomy vs. Lumpectomy on
survival in patients matched by
age & family history
1-group
Depression in substance
abusers
2-group
2-group,
pre/post
3-group
Continuous
Predictor
Analytic Methods
Dependent Variable (Outcome) Type
Study Design
Continuous
(multiple regression for
multivariate analysis)
Binary (yes/no)
(logistic regression for
multivariate analysis)
Pre/Post
Paired t-test
McNemars test
Matched pairs
Paired t-test
McNemars test
1-group
One-group t-test
2-group*
Two-group t-test
2-group, pre/post*
Analysis of Covariance or
multiple regression
Repeated measures
logistic regression
3-group*
Analysis of Variance
Chi-square test
Continuous
Predictor
Simple regression
Logistic regression
* Bivariate relationships
Resting
Heart
Rate
No
Exercise
Mild
Exercise
Strenuous
Exercise
Advantage
Order effect
Period effect
Probability of rejecting
null hypothesis,
when true
0.05
0.0975
0.143
0.226
10
0.401
Confounding
Example 1: Sex bias in graduate admissions?
(UC, Berkeley, 1973)
Overall:
Confounding (2)
Male
Female
Number of
Applicants
Percent
Admitted
Number of
Applicants
Percent
Admitted
A
B
825
62%
108
82%
560
63%
25
68%
C
D
325
37%
593
34%
417
33%
375
35%
E
F
191
28%
393
24%
373
6%
341
7%
Total
2691
45%
1835
30%
Major
Weighted Average:
39%
43%
Confounding (3)
Example2 : Is psychiatric hospitalization rate different
in substance users versus non-users?
Hospitalization
Yes
No
User
20
373 5.1%
Non-user
6
316 1.9%
Substance use looks to be associated with higher
psychiatric hospitalization rate.
User
Non-User
Confounding (4)
Example 3: Smoking versus MI
Smoker
Non-Smoker
MI
51
54
No MI
43
67
54%
44.6%
OR = 1.47
Male
Female
MI
37
25
14
29
No MI 24
20
19
47
61% 56%
48% 38%
OR = 1.23
OR = 1.19
Smokers have higher MI rate, but the magnitude of the
relative likelihood of MI (measured as odds ratio (OR)) is
larger in the combined data.
Confounding (5)
Example 4:
1) Regression of Happiness on Smoker Group
Coef
SE
p-value
Intercept
65.05
1.48
0.000
Smoke
4.80
2.03
0.020
2) Regression of Happiness on Age
Coef
SE
Intercept
7.48
2.45
Age
1.85
0.07
p-value
0.003
0.000
Confounding (6)
Relationship between Happiness and Age
20
100
20
40
40
Happiness
60
60
80
80
100
by Smoking Status
20
Not
25
30
Smoke
Y, Smoke == Not
Age
35
40
45
Y, Smoke == Smoke
DevelopingaStatisticalAnalysisPlan
Comparing two groups
Continuous: t-test
Proportion: chi-square test
Comparing multiple groups (continuous): ANOVA
Adjusted for other factors: ANCOVA, or regression
Dichotomous outcome: Logistic regression
Count outcome: Poisson regression
Survival time outcome: Cox regression
Watch for correlated data (repeated measures, clusters
e.g., teeth in the mouth
To Keep in Mind
Typically, multiple appropriate methods are available
to analyze the same data that could yield legitimate
answers.
Try to use at least two different available methods to
confirm your results.
Always look at the raw data and display data
graphically, so learn to choose the right graphical
displays (ex: cross tabs, scatter plots, box plots)
It helps to make sample tables summarizing results
before you start the analysis.