ANALYSIS
CORRELATION VS REGRESSION
• In correlation, the two variables are treated as equals.
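A minimal sketch of what "treated as equals" means in practice, assuming a small made-up data set (the values and the helper names `pearson_r` and `ls_slope` are illustrative, not from the slides). Swapping X and Y leaves the correlation coefficient unchanged, whereas a least-squares slope depends on which variable is being predicted.

```python
from math import sqrt

def mean(values):
    return sum(values) / len(values)

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def ls_slope(predictor, outcome):
    """Least-squares slope for predicting `outcome` from `predictor`."""
    mp, mo = mean(predictor), mean(outcome)
    spo = sum((p - mp) * (o - mo) for p, o in zip(predictor, outcome))
    spp = sum((p - mp) ** 2 for p in predictor)
    return spo / spp

# Hypothetical data, chosen only for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

r_xy = pearson_r(x, y)
r_yx = pearson_r(y, x)
slope_y_on_x = ls_slope(x, y)   # predict Y from X
slope_x_on_y = ls_slope(y, x)   # predict X from Y

print(f"r(X, Y) = {r_xy:.3f}, r(Y, X) = {r_yx:.3f}")   # same value both ways
print(f"slope Y on X = {slope_y_on_x:.3f}, slope X on Y = {slope_x_on_y:.3f}")
```

For this data the two correlations are identical, while the two slopes differ, which is the asymmetry that separates regression from correlation.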
SCATTER PLOT
[Figure: six scatter plots of Y against X, illustrating r = -1, r = -0.6 and r = 0 (top row) and r = +1, r = +0.3 and r = 0 (bottom row)]
Slide from: Statistics for Managers Using Microsoft® Excel 4th Edition, 2004 Prentice-Hall
REGRESSION LINE
• The regression line is the best straight-line description of the plotted
points; it can be used to describe the association between the variables.
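A short sketch of how such a line can be obtained, assuming "best" means ordinary least squares (the usual meaning, though the slide does not say so explicitly). The data and the function name `fit_line` are hypothetical.

```python
def fit_line(x, y):
    """Return (intercept, slope) of the ordinary least-squares line y = a + b*x."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = my - slope * mx   # the fitted line passes through (mean x, mean y)
    return intercept, slope

# Hypothetical data, chosen only for illustration.
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

a, b = fit_line(x, y)
fitted = [a + b * xi for xi in x]
residuals = [yi - fi for yi, fi in zip(y, fitted)]   # observed minus fitted

print(f"fitted line: Y = {a:.2f} + {b:.2f} X")
print("residuals:", [round(e, 2) for e in residuals])
```

The residuals computed here are the quantities plotted against X in a residual analysis; for a least-squares line with an intercept they always sum to zero.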
[Figure: Y plotted against x, with the corresponding residuals plotted against x, for two cases: Not Linear and Linear]
Residual Analysis for
Homoscedasticity
[Figure: Y plotted against x, with the corresponding residuals plotted against x; surviving panel labels: Not Independent, Independent]
[Figure: residuals plotted against X]
WHAT IS “LINEAR”?
• Remember the equation of a straight line, Y = mX + C, where m is the slope and C is the intercept?
• As predictors are added to the model, each predictor will explain some of the
variance in the dependent variable simply due to chance.
• One could keep adding predictors, and R-square would keep increasing, but
some of that increase would be due only to chance variation in that
particular sample.
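One common way to account for this chance inflation is the adjusted R-square, which penalizes R-square for the number of predictors. The sketch below assumes the standard formula 1 - (1 - R²)(n - 1)/(n - k - 1), with n observations and k predictors; the function name and the numbers are illustrative only.

```python
def adjusted_r_square(r_square, n, k):
    """Adjusted R-square for a model with k predictors fitted to n observations."""
    if n - k - 1 <= 0:
        raise ValueError("need more observations than predictors plus one")
    return 1 - (1 - r_square) * (n - 1) / (n - k - 1)

# Hypothetical case: the same R-square of 0.50 from 20 observations,
# reached with 1 predictor versus 5 predictors.
adj_one = adjusted_r_square(0.50, n=20, k=1)
adj_five = adjusted_r_square(0.50, n=20, k=5)

print(f"k = 1: adjusted R-square = {adj_one:.3f}")
print(f"k = 5: adjusted R-square = {adj_five:.3f}")
```

With the same raw R-square, the model that needed more predictors gets the lower adjusted value, reflecting that part of its fit is likely due to chance.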
• The p-value associated with this F value is very small (reported as 0.000, i.e. less than 0.001). The F value
and its p-value are used to answer the question "Do the independent variables reliably predict the
dependent variable?". (This tests the significance of the regression model as a whole: in multiple
regression, all the X variables jointly against Y.)
•The p-value is compared to your alpha level (typically 0.05) and, if smaller, you can conclude
"Yes, the independent variables reliably predict the dependent variable".
Simple regression:
H0: β = 0 (no linear relationship between X and Y)
H1: β ≠ 0 (there is a linear relationship between X and Y)
Multiple regression:
H0: β1 = β2 = … = βk = 0 (no linear relationship)
H1: at least one β ≠ 0
•If the p-value were greater than 0.05, you would say that the group of independent variables
does not show a statistically significant relationship with the dependent variable, or that the group
of independent variables does not reliably predict the dependent variable.
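The decision rule above can be sketched as follows. The overall F value is computed from R-square using the standard identity F = (R²/k) / ((1 - R²)/(n - k - 1)); the p-value itself is assumed to be read from the software output rather than computed here. All names and numbers are hypothetical.

```python
def f_statistic(r_square, n, k):
    """Overall F value for a regression with k predictors and n observations."""
    return (r_square / k) / ((1 - r_square) / (n - k - 1))

def model_is_significant(p_value, alpha=0.05):
    """Reject H0 (all slopes equal zero) when the p-value is below alpha."""
    return p_value < alpha

# Hypothetical model: R-square = 0.50, 20 observations, 2 predictors.
f_value = f_statistic(0.50, n=20, k=2)
print(f"F = {f_value:.2f} on (2, 17) degrees of freedom")

# p-values as they might be read from a regression output table (assumed values).
print(model_is_significant(0.0004))   # small p-value: reject H0
print(model_is_significant(0.21))     # large p-value: do not reject H0
```

The comparison is a strict "smaller than alpha" test, matching the wording of the slide.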
TABLE 3: COEFFICIENTS
•B - These are the values for the regression equation for predicting
the dependent variable from the independent variable. These are
called unstandardized coefficients because they are measured
in their natural units.
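To illustrate what "measured in their natural units" implies, the sketch below fits the same hypothetical data twice, once with the predictor in metres and once in centimetres. The variable names and values are invented for the example.

```python
def ls_slope(x, y):
    """Unstandardized least-squares slope B for predicting y from x."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx

# Hypothetical data: the same predictor expressed in two different units.
length_m = [1.0, 2.0, 3.0, 4.0]
length_cm = [100.0 * v for v in length_m]
outcome = [3.0, 5.0, 7.0, 9.0]

b_per_m = ls_slope(length_m, outcome)     # change in outcome per metre
b_per_cm = ls_slope(length_cm, outcome)   # change in outcome per centimetre

print(f"B per metre      = {b_per_m:.4f}")
print(f"B per centimetre = {b_per_cm:.4f}")
```

The relationship is identical in both fits, yet B differs by a factor of 100, so the size of an unstandardized coefficient can only be interpreted together with the units of its variable.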
iv1
iv2