508 Biostatistics for Evidence‐based Practice
Song Ge
BSN, RN, PhD Candidate
Johns Hopkins University School of Nursing
www.nursing.jhu.edu
Learning Objectives
https://infoactive.co/data‐design/ch11.html
[Figure labels: Positively skewed; Normally distributed]

Method        Math operation     Good for:                             Bad for:
Log           ln(x), log10(x)    Right skewed data                     Zero values, negative values
Square root   √x                 Right skewed data                     Negative values
Square        x^2                Left skewed data                      Negative values
Cube root     x^(1/3)            Right skewed data, negative values    Not as effective as log transform
Reciprocal    1/x                Making small values bigger and        Zero values, negative values
                                 big values smaller
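The transformations in the table can be tried out on a small example. The sketch below is illustrative only: the data set `right_skewed` and the helper names are made up for this example, and skewness is computed as the mean cubed z-score (one common definition, assumed here).

```python
import math

def log_transform(values):
    """Natural log; not defined for zero or negative values."""
    if any(v <= 0 for v in values):
        raise ValueError("log transform needs strictly positive values")
    return [math.log(v) for v in values]

def sqrt_transform(values):
    """Square root; not defined for negative values."""
    if any(v < 0 for v in values):
        raise ValueError("square root transform needs non-negative values")
    return [math.sqrt(v) for v in values]

def cube_root_transform(values):
    """Cube root; copysign keeps the sign, so negative values are allowed."""
    return [math.copysign(abs(v) ** (1 / 3), v) for v in values]

def reciprocal_transform(values):
    """1/x; not defined for zero values."""
    if any(v == 0 for v in values):
        raise ValueError("reciprocal transform needs non-zero values")
    return [1 / v for v in values]

def skewness(values):
    """Population skewness: the mean of the cubed z-scores."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return sum(((v - mean) / sd) ** 3 for v in values) / n

# Hypothetical right (positively) skewed sample: a few large values on the right.
right_skewed = [1, 2, 2, 3, 3, 4, 5, 8, 20, 60]

print("raw skewness: ", skewness(right_skewed))
print("sqrt skewness:", skewness(sqrt_transform(right_skewed)))
print("log skewness: ", skewness(log_transform(right_skewed)))
```

For this sample, both the square root and the log pull the skewness closer to zero than the raw data.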
Continued…
5. Samples must be
representative of the
population
6. There is no multicollinearity.
Multicollinearity means the independent
variables are so strongly intercorrelated
that they are indistinguishable
from each other
If VIF is between 1 and 10, multicollinearity is not a concern
If VIF > 10, there is multicollinearity (VIF cannot be less than 1, because VIF = 1/(1 − R²))
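As a rough illustration of the VIF rule of thumb, the sketch below covers the simplest case of two predictors, where the VIF of each predictor is 1/(1 − r²) and r is the Pearson correlation between them. With more predictors, r² is replaced by the R² from regressing one predictor on all the others. All variable names and values are hypothetical.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def vif_two_predictors(x1, x2):
    """VIF when the model has exactly two predictors: 1 / (1 - r^2)."""
    r_squared = pearson_r(x1, x2) ** 2
    return 1.0 / (1.0 - r_squared)

# Hypothetical predictors (not from the slides).
age = [23, 35, 41, 52, 60, 28, 47]
years_employed = [1, 12, 18, 30, 37, 5, 25]   # almost a linear function of age
restaurant_visits = [3, 1, 4, 2, 5, 2, 3]     # only loosely related to age

print("VIF, age vs. years employed:   ", vif_two_predictors(age, years_employed))
print("VIF, age vs. restaurant visits:", vif_two_predictors(age, restaurant_visits))
```

The first pair gives a VIF far above 10 and the second a VIF close to 1, which matches the rule of thumb.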
7. The relationship between x and y must
be linear. When the two variables are plotted
against each other, the points should tend
to form a straight line. If the relationship
is not linear, other methods must be used.
8. For every value of X, the distribution of Y
scores must have approximately equal
variability (homoscedasticity)
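One informal way to look at assumption 8 is to fit a straight line and compare the spread of the residuals at low and high values of X. The sketch below is a crude check and not a formal test. The function names, the split into halves, and both data sets are assumptions made for illustration.

```python
def fit_line(x, y):
    """Ordinary least squares for one predictor; returns (intercept, slope)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

def residual_spread_ratio(x, y):
    """Mean squared residual in the upper half of X divided by the lower half.

    A ratio near 1 is consistent with equal variability; a ratio far from 1
    suggests the spread of Y changes with X.
    """
    intercept, slope = fit_line(x, y)
    pairs = sorted(zip(x, y))  # order the observations by X
    residuals = [yi - (intercept + slope * xi) for xi, yi in pairs]
    half = len(residuals) // 2
    lower, upper = residuals[:half], residuals[-half:]
    ms_lower = sum(r * r for r in lower) / len(lower)
    ms_upper = sum(r * r for r in upper) / len(upper)
    return ms_upper / ms_lower

# Hypothetical data: the same line y = 2x with two different noise patterns.
x = list(range(1, 11))
signs = [1, -1] * 5
y_even_spread = [2 * xi + s for xi, s in zip(x, signs)]          # constant noise
y_growing_spread = [2 * xi + s * xi for xi, s in zip(x, signs)]  # noise grows with x

print("even spread ratio:   ", residual_spread_ratio(x, y_even_spread))
print("growing spread ratio:", residual_spread_ratio(x, y_growing_spread))
```

In practice a plot of residuals against fitted values is the usual first check; the ratio here only summarizes the same idea in a single number.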
Multiple Linear Regression
http://www.aetheling.com/models/cusp/Intro.htm
Multiple Linear Regression Equation
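In commonly used notation (the symbols are an assumption here, not taken from the slide), the multiple linear regression equation with k independent variables can be written as:

```latex
\[
\hat{y} = b_0 + b_1 x_1 + b_2 x_2 + \cdots + b_k x_k
\]
```

Here \(\hat{y}\) is the predicted value of the dependent variable, \(b_0\) is the regression constant (intercept), and each \(b_j\) is the regression coefficient for independent variable \(x_j\), holding the other independent variables constant.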
https://en.wikipedia.org/wiki/Pearson_correlation_coefficient
SPSS output for R square
The individual piece: Correlation coefficient
F-test of a regression coefficient: tests whether the independent variable
associated with that coefficient contributes significantly to the variance
accounted for in the dependent variable
Group exercise
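• Adjusted R² = 0.031
• The four independent variables (gender, age, eating in
restaurants, and race, with race entered as the dummy
variables Black, Asian, and Hispanic) explain 3.1%
of the variance in the dependent variable.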
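Adjusted R² is usually obtained from R² by penalizing for the number of predictors: adjusted R² = 1 − (1 − R²)(n − 1)/(n − k − 1), where n is the sample size and k the number of predictors. A minimal sketch follows; the numbers in the example call are hypothetical, since the sample size for the exercise is not given here.

```python
def adjusted_r_squared(r_squared, n, k):
    """Adjusted R^2 for a model with k predictors fitted to n observations."""
    if n - k - 1 <= 0:
        raise ValueError("need more observations than predictors plus one")
    return 1 - (1 - r_squared) * (n - 1) / (n - k - 1)

# Hypothetical values: R^2 = 0.50 with n = 101 observations and k = 4 predictors.
print(adjusted_r_squared(0.50, n=101, k=4))
```

Adding a predictor never lowers R², but it can lower adjusted R², which is why the adjusted value is the one reported when models with different numbers of predictors are compared.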
Model Summary
a Predictors: (Constant), Hispanic, restaurant_dich, participant gender, age in years, Asian, Black
Analysis Example: ANOVA
• The p-value for the overall model is 0.004.
The amount of variance explained by the
model (independent variables) is statistically
significant
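The overall F statistic in the ANOVA table can be written in terms of R²: F = (R²/k) / ((1 − R²)/(n − k − 1)), with k and n − k − 1 degrees of freedom. The sketch below computes the statistic only. Turning it into a p-value requires the F distribution, which is left to the statistics package. The example values are hypothetical.

```python
def overall_f_statistic(r_squared, n, k):
    """F statistic for the overall regression model.

    Returns (F, df_regression, df_residual) for k predictors and n observations.
    """
    df_regression = k
    df_residual = n - k - 1
    f_value = (r_squared / df_regression) / ((1 - r_squared) / df_residual)
    return f_value, df_regression, df_residual

# Hypothetical values: R^2 = 0.50, n = 101 observations, k = 4 predictors.
f_value, df1, df2 = overall_f_statistic(0.50, n=101, k=4)
print(f"F({df1}, {df2}) = {f_value:.2f}")
```

With a large sample, even a small R² can produce a statistically significant F, so significance of the overall model says little about how much variance it explains.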
Analysis Example: Coefficients
• The coefficients are: beta for gender (−0.015), beta for
age (0.002), beta for eating in restaurants (0.008), beta
for Black (−0.053), beta for Asian (0.0006), and beta for
Hispanic (−0.040). The regression constant is 5.189.
Coefficients (a)
                 Unstandardized Coefficients   Standardized Coefficients                     95% Confidence Interval for B
Model            B         Std. Error          Beta                        t         Sig.    Lower Bound    Upper Bound
1 (Constant)     5.189     0.040                                           129.516   0.000   5.110          5.268
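To see how the coefficients on this slide combine into a prediction, the sketch below plugs them into the regression equation. The variable coding is an assumption: gender, eating in restaurants, and the three race indicators are treated as 0/1 variables and age is in years. The dictionary keys are hypothetical names. The second function approximates the 95% confidence interval for the constant as B ± 1.96 × Std. Error, a large-sample approximation.

```python
# Coefficients as reported on the slide; the key names are hypothetical.
CONSTANT = 5.189
COEFFICIENTS = {
    "gender": -0.015,
    "age": 0.002,
    "restaurant": 0.008,
    "black": -0.053,
    "asian": 0.0006,
    "hispanic": -0.040,
}

def predict(person):
    """Predicted value of the dependent variable for one participant.

    Any variable missing from `person` is treated as 0.
    """
    return CONSTANT + sum(
        coefficient * person.get(name, 0)
        for name, coefficient in COEFFICIENTS.items()
    )

def approximate_ci(b, std_error, z=1.96):
    """Large-sample 95% confidence interval: b plus or minus z * std_error."""
    return b - z * std_error, b + z * std_error

# Hypothetical participant: 40 years old, Black, all other indicators 0.
print(predict({"age": 40, "black": 1}))

# Constant row of the coefficients table: B = 5.189, Std. Error = 0.040.
print(approximate_ci(5.189, 0.040))
```

The approximate interval for the constant agrees with the reported bounds (5.110 to 5.268) to within rounding.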