Beruflich Dokumente
Kultur Dokumente
Chapter 13
The Simple Linear Regression
Model and Correlation
© 1999 Prentice-Hall, Inc. Chap. 13 - 1
Chapter Topics
• Types of Regression Models
• Determining the Simple Linear Regression
Equation
• Measures of Variation in Regression and
Correlation
• Assumptions of Regression and Correlation
• Estimation of Predicted Values
• Correlation - Measuring the Strength of the
Association
Y
X 1
X 2
X
Y
X2
X1
X
Regression Line
© 1999 Prentice-Hall, Inc. Chap. 13 - 7
Regression Models
Advertising Advertising
Sales Sales
Advertising Advertising
Simple Multiple
Non- Non-
Linear Linear
Linear Linear
Simple Multiple
Non- Non-
Linear Linear
Linear Linear
Y
Y = bX + a
Change
b = S lo p e in Y
C h a n g e in X
a = Y -in te r c e p t
X
Yi 0 1 X i i
Dependent
(Response) Independent
Slope (Explanatory)
Variable
Variable
© 1999 Prentice-Hall, Inc. Chap. 13 - 18
Population
Linear Regression Model
Y Yi 0 1X i i Observed
Value
i = Random Error
0 1X i
YX
(E(Y))
X
Observed Value
© 1999 Prentice-Hall, Inc. Chap. 13 - 19
Sample Linear
Regression Model
Yi b0 b1 X i
Yi = Predicted Value of Y for observation i
Regression Applet
M ean o f Y
X
Y = c o n s ta n t+ (0 )X
= E (Y )
© 1999 Prentice-Hall, Inc. Chap. 13 - 24
What should we
expect?
• If Y and X are
related, then E(Y| Y
X)<>E(Y) - we
should predict a
different Y for M ean of Y
every value of X.
• Therefore, the slope
will not be zero M ean of X X
B <>0
10000
8000 Y 5130
6000
4000
X 2350
2000
0
0 1000 2000 3000 4000 5000 6000
Square Feet
Excel Output
10000
8000
8 7Xi
5130 1.4
6000
5 +
.4 1
4000 636
= 1
2000
Yi
2350
0
0 1000 2000 3000 4000 5000 6000
Square Feet
X
X Xi
© 1999 Prentice-Hall, Inc. Chap. 13 - 35
Measures of Variation:
The Sum of Squares
SST = Total Sum of Squares
•measures_the variation of the Yi values around their
mean Y
SSR = Regression Sum of Squares
•explained variation attributable to the relationship
between X and Y
SSE = Error Sum of Squares
•variation attributable to factors other than the
relationship between X and Y
^=b +b X
Y ^=b +b X
Y
i 0 1 i i 0 1 i
X X
© 1999 Prentice-Hall, Inc. Chap. 13 - 41
R and F connection
2
SSR 2 2
F r * SST n 2 r n 2
(2 1)
* *
SSE 2 2
n 2 (1 r ) * SST 2 1 1 r 1
Inte rce pt
Excel Printout
Coeffic ients
1636.41473
forEProduce
S t andard Stores
rror t S tatP-value
451.4953308
Lower 95%
0.0151488 475.810926
X V a ria bl e 1 1.48663366 0.164999212 0.0002812 1.06249037
The t test for =0 is identical to the F test for r2=0 for
simple regression. The t-statistic will be the square root of
the F statistic (t=1.4866/.1649=9.01) F1,n-2=t2n-2
mean point
SALES
• If B is different this will
give small differences in the
forecast for Y near the
mean, but big differences L o w e r 9 5 % e s tim a t e o f
B (1 .0 6 )
+ b X
1 i
Yi = b0
_ X
X A Given X
© 1999 Prentice-Hall, Inc. Chap. 13 - 54
Example: Produce Stores
Data for 7 Stores:
Annual
Store Square Sales Predict the annual
Feet ($000)
sales for a store with
1 1,726 3,681 2000 square feet.
2 1,542 3,395
3 2,816 6,653 Regression Model Obtained:
4 5,555 9,543
5 1,292 3,318
6 2,208 5,563
Yi = 1636.415 +1.487Xi
7 1,313 3,760
© 1999 Prentice-Hall, Inc. Chap. 13 - 55
Estimation of Predicted
Values: Example
Confidence Interval Estimate for Individual Y
Find the 95% confidence interval for the average annual sales
for stores of 2,000 square feet
Predicted Sales Yi = 1636.415 +1.487Xi = 4610.45 ($000)
X = 2350.29 SYX = 611.75 tn-2 = t5 = 2.5706
1 ( X i X )2
Ŷi t n 2 Syx n = 4610.45 980.97
n ( X X )2
i
i 1 Confidence interval for mean Y
© 1999 Prentice-Hall, Inc. Chap. 13 - 56
Estimation of Predicted
Values: Example
Confidence Interval Estimate for XY
Find the 95% confidence interval for annual sales of one
particular stores of 2,000 square feet
Predicted Sales Yi = 1636.415 +1.487Xi = 4610.45 ($000)
1 ( X i X )2
Ŷi t n 2 Syx 1 n = 4610.45 1853.45
n ( X X )2
i Confidence interval for indivi
i 1
Y
© 1999 Prentice-Hall, Inc. Chap. 13 - 57
Estimation of Predicted
Values: Example
Example SPSS
Example EXCEL
F1,n 2 or
1 r 2
r 2 n 2
t n2
1 r 2
SAMPLE SIZE IS REALLY IMPORTANT FOR
SIGNIFICANCE!
© 1999 Prentice-Hall, Inc. Chap. 13 - 70
Guessing Correlations
“unreliable”
SATADVN 1.0000
SATBENFT .2220 1.0000
SATCLTRE .5136 .3559 1.0000
SATRECGN .5403 .3044 .5579 1.0000
SATPAY .4381 .3015 .3817 .4849 1.0000
SATCONDT .4202 .3172 .5136 .4755 .3879
SATPEOPL .2336 .2715 .3782 .3390 .2488
SATWORK .5353 .1683 .4167 .3766 .3145
SATSTRTG .4905 .3233 .6131 .4636 .3582
SATMNGR .3710 .2009 .3665 .4831 .2907
SATCOMP .4170 .4621 .3861 .4707 .7996
SATEVAL .4727 .2714 .4830 .5549 .4410
SATTRAN .4323 .2470 .3959 .4263 .3304
SATCONDT 1.0000
SATPEOPL .3756 1.0000
SATWORK .3898 .2645 1.0000
SATSTRTG .4869 .2998 .4717 1.0000
SATMNGR .3544 .2711 .3068 .3246 1.0000
SATCOMP .3913 .2627 .3003 .3751 .3197
SATEVAL .4240 .2656 .3497 .4585 .3909
SATTRAN .3543 .2795 .3507 .4058 .2986
SATCOMP 1.0000
SATEVAL .4739 1.0000
SATTRAN .3320 .4713 1.0000
FEELJOB 1.0000
JOBINTER -.7033 1.0000
JOBGROW -.6182 .7152 1.0000
SLVEPROB -.5211 .6084 .6762 1.0000
QSTNVP -.2964 .3465 .3901 .427
Coefficients
t Sig.
Model B Std. Error Beta
(Constant) 3.782 .059 64.536 .000
ENJYCUST 2.3E-02 .014 .037 1.651 .099