Beruflich Dokumente
Kultur Dokumente
Understand how regression analysis can be used to develop an equation that estimates
mathematically how two variables are related.
2.
Understand the differences between the regression model, the regression equation, and the
estimated regression equation.
3.
Know how to fit an estimated regression equation to a set of sample data based upon the leastsquares method.
4.
Be able to determine how good a fit is provided by the estimated regression equation and compute
the sample correlation coefficient from the regression analysis output.
5.
Understand the assumptions necessary for statistical inference and be able to test for a significant
relationship.
6.
Know how to develop confidence interval estimates of y given a specific value of x in both the case
of a mean value of y and an individual value of y.
7.
Learn how to use a residual plot to make a judgement as to the validity of the regression
assumptions, recognize outliers, and identify influential observations.
8.
14 - 1
Chapter 14
Solutions:
a.
16
14
12
10
y
8
6
4
2
0
0
b.
c.
Many different straight lines can be drawn to provide a linear approximation of the
relationship between x and y; in part d we will determine the equation of a straight line
that best represents the relationship according to the least squares criterion.
d.
yi = 40
( xi x )( yi y ) = 26
( xi x )( yi y ) 26
=
= 2.6
( xi x ) 2
10
b0 = y b1 x = 8 (2.6)(3) = 0.2
y = 0.2 + 2.6 x
e.
14 - 2
( xi x ) 2 = 10
a.
35
30
25
20
y
2.
15
10
5
0
0
b.
c.
Many different straight lines can be drawn to provide a linear approximation of the
relationship between x and y; in part d we will determine the equation of a straight line that
best represents the relationship according to the least squares criterion.
d.
yi = 116
( xi x )( yi y ) = 57.8
( xi x )( yi y ) 57.8
=
= 1.8766
( xi x ) 2
30.8
b0 = y b1 x = 23.2 ( 18766
.
)(38
. ) = 30.3311
y = 30.33 188
. x
e.
y = 30.33 188
. (6) = 19.05
14 - 3
( xi x ) 2 = 30.8
Chapter 14
3.
a.
7
6
5
4
3
2
1
0
0
b.
yi = 17
( xi x )( yi y ) = 11.6
( xi x )( yi y ) 11.6
=
= 0.5088
( xi x ) 2
22.8
14 - 4
( xi x ) 2 = 22.8
4. a.
135
130
125
y
120
115
110
105
100
61
62
63
64
65
66
67
68
69
b.
There appears to be a linear relationship between x and y.
c.
Many different straight lines can be drawn to provide a linear approximation of the
relationship between x and y; in part d we will determine the equation of a straight line that
best represents the relationship according to the least squares criterion.
d.
yi = 585
( xi x )( yi y ) = 110
( xi x )( yi y ) 110
=
= 5.5
( xi x ) 2
20
y = 240.5 + 55
. x = 240.5 + 55
. (63) = 106 pounds
14 - 5
( xi x ) 2 = 20
Chapter 14
a.
2500
2000
Price ($)
5.
1500
1000
500
0
1
1.5
2.5
3.5
4.5
Baggage Capacity
b.
c.
Many different straight lines can be drawn to provide a linear approximation of the
relationship between x and y; in part (d) we will determine the equation of a straight line
that best represents the relationship according to the least squares criterion.
d.
yi = 11,110
( xi x )( yi y ) = 2909.8889
( xi x ) 2 = 4.5556
( xi x )( yi y ) 2909.9889
=
= 638.7561
( xi x ) 2
4.5556
A one point increase in the baggage capacity rating will increase the price by approximately $639.
f.
14 - 6
a.
30
25
20
Bonus
6.
15
10
5
0
50.00
70.00
90.00
110.00
130.00
150.00
170.00
Salary
b.
c.
yi = 160
( xi x )( yi y ) = 1114
( xi x ) 2 = 6046
( xi x )( yi y ) 1114
=
= .18425
( xi x ) 2
6046
b0 = y b1 x = 16 (.18425)(142) = 10.16
y = 10.16 + .18 x
d.
14 - 7
190.00
Chapter 14
7.
a.
1550
1500
S&P
1450
1400
1350
1300
9600
9800
10000
10200
10400
10600
10800
11000
11200
DJIA
b.
Let x = DJIA and y = S&P. Summations needed to compute the slope and y-intercept are:
xi = 104,850
b1 =
yi = 14, 233
( xi x )( yi y ) = 268,921
( xi x ) 2 = 1,806,384
( xi x )( yi y )
268,921
=
= 0.14887
( xi x ) 2
1,806,384
8.
a.
14 - 8
1800
1600
1400
Price
1200
1000
800
600
400
200
0
0
Sidetrack Capability
b.
There appears to be a linear relationship between x = sidetrack capability and y = price, with higher
priced models having a higher level of handling.
c.
yi = 10, 621
( xi x )( yi y ) = 4003.2
( xi x )( yi y ) 4003.2
=
= 204.2449
( xi x ) 2
19.6
9.
a.
14 - 9
( xi x ) 2 = 19.6
Chapter 14
150
140
130
120
110
100
90
80
70
60
50
0
b.
10
yi = 1080
( xi x )( yi y ) = 568
( xi x )( yi y ) 568
=
=4
( xi x ) 2
142
b0 = y b1 x = 108 (4)(7) = 80
y = 80 + 4 x
c.
y = 80 + 4 x = 80 + 4(9) = 116
14 - 10
( xi x ) 2 = 142
12
14
10. a.
95
Overall Rating
90
85
80
75
70
65
60
100
120
140
160
180
200
220
240
260
Performance Score
b.
Let x = performance score and y = overall rating. Summations needed to compute the slope and yintercept are:
xi = 2752
b1 =
yi = 1177
( xi x )( yi y ) = 1723.73
( xi x )( yi y )
1723.73
=
= 0.1452
( xi x ) 2
11,867.73
14 - 11
( xi x ) 2 = 11,867.73
Chapter 14
11. a.
35
Departures (% )
30
25
20
15
10
5
0
10
15
20
25
30
35
Arrivals (% )
b.
c.
yi = 265
( xi x )( yi y ) = 101
( xi x ) 2 = 114.4
( xi x )( yi y )
101
=
= .8829
2
( xi x )
114.4
b0 = y b1 x = 21 (.8829)(21.4) = 2.106
y = 2.106 + .883 x
d.
A one percent increase in the percentage of late arrivals will increase the percentage of late arrivals
by .88 or slightly less than one percent.
e.
14 - 12
12. a.
5,000,000
4,500,000
4,000,000
Production
3,500,000
3,000,000
2,500,000
2,000,000
1,500,000
1,000,000
500,000
0
0
200
400
600
800
1,000
1,200
1,400
1,600
1,800
Number of Employees
b.
There appears to be a positive linear relationship between the number of employees and the
production output.
c.
Let x = number of employees and y = production output. Summations needed to compute the slope
and y-intercept are:
( x
b1 =
= 6,640
=16,990,258
x )( y i y ) = 8,166,725,499.8182
( x
x ) 2 = 2,894,161.2727
( x x )( y y ) = 8,166,725,499.8182 = 2,821.7935
2,894,161.2727
( x x )
i
d.
13. a.
14 - 13
Chapter 14
30.0
25.0
20.0
15.0
10.0
5.0
0.0
0.0
20.0
40.0
60.0
80.0
100.0
120.0
b.
The summations needed to compute the slope and the y-intercept are:
xi = 399
b1 =
yi = 97.1
( xi x )( yi y ) = 1233.7
( xi x ) 2 = 7648
( xi x )( yi y ) 1233.7
=
= 0.16131
( xi x ) 2
7648
b0 = y b1 x = 1387143
.
(016131
.
)(57) = 4.67675
y = 4.68 + 016
. x
c.
y = 4.68 + 016
. x = 4.68 + 016
. (52.5) = 13.08 or approximately $13,080.
The agent's request for an audit appears to be justified.
14 - 14
140.0
14. a.
30.0
25.0
20.0
15.0
10.0
5.0
0.0
0
100
200
300
400
500
600
700
800
900
b.
x
b1 =
= 4,697
= 312
( x
x )( y i y ) = 5,862.98
( x
x ) 2 = 592,620
( x x )( y y ) = 5,862.98 = .0099
592,620
( x x )
i
y =8
The sum of squares due to error and the total sum of squares are
SSE = ( yi yi ) 2 = 12.40
SST = ( yi y ) 2 = 80
c.
r = .845 = +.9192
14 - 15
Chapter 14
16. a.
The estimated regression equation and the mean for the dependent variable are:
yi = 30.33 1.88 x
y = 23.2
The sum of squares due to error and the total sum of squares are
SSE = ( yi y i ) 2 = 6.33
SST = ( yi y ) 2 = 114.80
c.
r = .945 = .9721
Note: the sign for r is negative because the slope of the estimated regression equation is negative.
(b1 = -1.88)
17.
The estimated regression equation and the mean for the dependent variable are:
yi = .75 + .51x
y = 3.4
The sum of squares due to error and the total sum of squares are
SSE = ( yi y i ) 2 = 5.3
SST = ( yi y ) 2 = 11.2
The estimated regression equation and the mean for the dependent variable are:
y = 1790.5 + 581.1x
y = 3650
The sum of squares due to error and the total sum of squares are
SSE = ( yi y i ) 2 = 85,135.14
c.
19. a.
r = .746 = +.8637
The estimated regression equation and the mean for the dependent variable are:
14 - 16
y = 137.63 + .1489 x
y = 1423.3
The sum of squares due to error and the total sum of squares are
SSE = ( yi y i ) 2 = 7547.14
SST = ( yi y ) 2 = 47,582.10
c.
20. a.
r = .84 = +.92
Let x = income and y = home price. Summations needed to compute the slope and y-intercept are:
= 14,706
b1 =
= 2,752
( x
x )( y i y ) = 41,242
( x
x ) 2 = 163,074
( x x )( y y ) = 41,242 = .253
163,074
( x x )
i
The sum of squares due to error and the total sum of squares are
SSE = ( y i y i ) 2 = 8,036.2960
SST = ( y i y ) 2 = 18,466.6667
c.
21. a.
yi = 33, 700
( xi x )( yi y ) = 712,500
( xi x )( yi y ) 712,500
=
= 7.6
( xi x ) 2
93, 750
14 - 17
( xi x ) 2 = 93, 750
Chapter 14
b.
$7.60
c.
The sum of squares due to error and the total sum of squares are:
SSE = ( yi y i ) 2 = 233,333.33
SST = ( yi y ) 2 = 5, 648,333.33
yi = 10, 221
( xi x )( yi y ) = 34,359.81
( xi x ) 2 = 291.389
( xi x )( yi y ) 34,359.81
=
= 117.9173
( xi x ) 2
291.389
The sum of squares due to error and the total sum of squares are:
SSE = ( yi y i ) 2 = 1,678, 294
SST = ( yi y ) 2 = 5, 729,911
r = .7071 = +.84
It reflects a linear relationship that is between weak and strong.
23. a.
b.
c.
( xi x ) 2 = 10
sb1 =
s
( xi x )
2.033
10
= 0.643
14 - 18
d.
t=
b1
2.6
=
= 4.04
sb1 .643
Using t table (3 degrees of freedom), area in tail is between .01 and .025
p-value is between .02 and .05
Actual p-value = .0272
Because p-value , we reject H0: 1 = 0
e.
24. a.
Sum of Squares
67.6
12.4
80.0
Mean Square
67.6
4.133
F
16.36
b.
c.
( xi x ) 2 = 30.8
sb1 =
d.
Degrees of Freedom
1
3
4
t=
s
( xi x )
1.453
30.8
= 0.262
b1 188
.
=
= 7.18
sb1
.262
Using t table (3 degrees of freedom), area in tail is less than .005; p-value is less than .01
Actual p-value = .0056
Because p-value , we reject H0: 1 = 0
e.
14 - 19
Chapter 14
Sum of Squares
108.47
6.33
114.80
Degrees of Freedom
1
3
4
Mean Square
108.47
2.11
F
51.41
b.
( xi x ) 2 = 22.8
sb1 =
t=
s
( xi x )
1.33
22.8
= 0.28
b1 .51
=
= 182
.
sb1 .28
Using t table (3 degrees of freedom), area in tail is between .05 and .10
p-value is between .10 and .20
Actual p-value = .1664
Because p-value > , we cannot reject H0: 1 = 0; x and y do not appear to be related.
c.
26. a.
t=
s
( xi x )
145.89
0.74
= 169.59
b1 58108
.
=
= 3.43
sb1 169.59
Using t table (4 degrees of freedom), area in tail is between .01 and .025
14 - 20
c.
Source of Variation
Regression
Error
Total
27. a.
Sum of Squares
249864.86
85135.14
335000
Degrees of Freedom
1
4
5
Mean Square
249864.86
21283.79
F
11.74
yi = 1654
( xi x )( yi y ) = 315.2
( xi x ) 2 = 10.1
( xi x )( yi y ) 315.2
=
= 31.2079
( xi x ) 2
10.1
c.
d.
14 - 21
Chapter 14
28.
29.
Sum of Squares
5,415,000.00
233,333.33
5,648,333.33
Degrees of Freedom
1
4
5
Mean Square
5,415,000
58,333.33
F
92.83
Using F table (1 degree of freedom numerator and 4 denominator), p-value is less than .01
Actual p-value = .0006
Because p-value , we reject H0: 1 = 0. Production volume and total cost are related.
30.
t=
s
( xi x )
458.02
291.389
= 26.832
b1 117.9173
=
= 4.395
sb1
26.832
Using t table (1 degree of freedom numerator and 8 denominator), area in tail is less than .005
p-value is less than .01
Actual p-value = .0024
14 - 22
32. a.
b.
s = 2.033
x =3
( xi x ) 2 = 10
s yp = s
2
1 ( xp x )
1 (4 3) 2
+
=
2.033
+
= 1.11
n ( xi x ) 2
5
10
c.
sind = s 1 +
d.
y p t / 2 sind
2
1 ( xp x )
1 (4 3) 2
+
=
2.033
1
+
+
= 2.32
n ( xi x )2
5
10
s = 1.453
x = 3.8
s yp = s
( xi x ) 2 = 30.8
2
1 ( xp x )
1 (3 3.8) 2
+
=
1.453
+
= .068
n ( xi x ) 2
5
30.8
14 - 23
Chapter 14
y = 30.33 188
. x = 30.33 188
. (3) = 24.69
y p t / 2 s y p
24.69 3.182 (.68) = 24.69 2.16
or 22.53 to 26.85
c.
sind = s 1 +
d.
y p t / 2 sind
2
1 ( xp x )
1 (3 3.8) 2
+
=
1.453
1
+
+
= 1.61
n ( xi x )2
5
30.8
s = 1.33
x = 5.2
( xi x ) 2 = 22.8
2
1 ( xp x )
1 (3 5.2) 2
s yp = s
+
= 1.33 +
= 0.85
2
n ( xi x )
5
22.8
sind = s 1 +
2
1 ( xp x )
1 (3 5.2) 2
+
=
1.33
1
+
+
= 1.58
n ( xi x )2
5
22.8
y p t / 2 sind
2.28 3.182 (1.58) = 2.28 5.03
or -2.27 to 7.31
35. a.
s = 145.89
x = 3.2
( xi x ) 2 = 0.74
14 - 24
s yp = s
2
1 ( xp x )
1 (3 3.2)2
+
=
145.89
+
= 68.54
n ( xi x ) 2
6
0.74
b.
sind = s 1 +
2
1 ( xp x )
1 (3 3.2) 2
+
=
145.89
1
+
+
= 161.19
n ( xi x )2
6
0.74
y p t / 2 sind
3533.8 2.776 (161.19) = 3533.8 447.46
or $3086.34 to $3981.26
36. a.
b.
( xi x ) 2 = 11,867.73
2
1 ( xp x )
1 (200 183.4667) 2
+
=
3.5232
+
= 1.055
n ( xi x ) 2
15
11,867.73
y p t / 2 s y p
80.86 2.160 (1.055) = 80.86 2.279
or 78.58 to 83.14
c.
sind = s 1 +
2
1 ( xp x )
1 (200 183.4667) 2
+
= 3.5232 1 + +
= 3.678
2
n ( xi x )
15
11,867.73
y p t / 2 sind
80.86 2.160 (3.678) = 80.86 7.944
or 72.92 to 88.80
37. a.
x = 57
s2 = 1.88
( xi x ) 2 = 7648
s = 1.37
14 - 25
Chapter 14
s yp = s
2
1 ( xp x )
1 (52.5 57) 2
+
=
1.37
+
= 0.52
n ( xi x ) 2
7
7648
y p t / 2 s y p
13.08 2.571 (.52) = 13.08 1.34
or 11.74 to 14.42 or $11,740 to $14,420
b.
sind = 1.47
13.08 2.571 (1.47) = 13.08 3.78
or 9.30 to 16.86 or $9,300 to $16,860
c.
d.
Any deductions exceeding the $16,860 upper limit could suggest an audit.
38. a.
b.
( xi x ) 2 = 93,750
2
1 ( xp x )
1 (500 575) 2
+
=
241.52
1
+
+
= 267.50
n ( xi x )2
6
93, 750
y p t / 2 sind
5046.67 4.604 (267.50) = 5046.67 1231.57
or $3815.10 to $6278.24
c.
39. a.
Based on one month, $6000 is not out of line since $3815.10 to $6278.24 is the prediction interval.
However, a sequence of five to seven months with consistently high costs should cause concern.
Let x = miles of track and y = weekday ridership in thousands.
Summations needed to compute the slope and y-intercept are:
xi = 203
b1 =
yi = 309
( xi x )( yi y ) = 1471
( xi x )( yi y ) 1471
=
= 1.7554
( xi x ) 2
838
14 - 26
( xi x ) 2 = 838
2
1 ( xp x )
1 (30 29) 2
+
= 14.41 +
= 5.47
2
n ( xi x )
7
838
d.
sind = s 1 +
2
1 ( xp x )
1 (30 29) 2
+
= 14.41 1 + +
= 15.41
2
n ( xi x )
7
838
y p t / 2 sind
45.9 2.571(15.41) = 45.9 39.6
or 6.3 to 85.5
The prediction interval is so wide that it would not be of much value in the planning process. A
larger data set would be beneficial.
40. a.
b.
c.
y = 20.0 + 7.21x
1.3626
d.
e.
41. a.
14 - 27
Chapter 14
b.
t=
b1 B1 .8951 0
=
= 6.01
sb1
.149
Using t table (1 degree of freedom numerator and 8 denominator), area in tail is less than .005
p-value is less than .01
Actual p-value = .0004
Because p-value , we reject H0: B1 = 0
c.
42 a.
b.
30
c.
d.
43. a.
Coef
-247042
2.9370
SE Coef
819079
1.960
R-Sq = 89.0%
T
-0.30
9.85
P
0.768
0.000
R-Sq(adj) = 88.1%
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
12
13
SS
5.61573E+14
6.94878E+13
6.31061E+14
MS
5.61573E+14
5.79065E+12
b.
c.
d.
14 - 28
F
96.98
P
0.000
Coef
41.490
-1.3301
SE Coef
6.333
0.3522
R-Sq = 74.0%
T
6.55
-3.78
P
0.001
0.013
R-Sq(adj) = 68.8%
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
5
6
SS
136.07
47.72
183.79
MS
136.07
9.54
F
14.26
d.
Since the p-value = 0.013 is less than = .05, the relationship is significant.
e.
r 2 = .688. The least squares line does not provide a very good fit.
f.
g.
45. a.
xi = 70
b1 =
yi = 76
( xi x )( yi y ) = 200
( xi x ) 2 = 126
( xi x )( yi y ) 200
=
= 1.5873
( xi x ) 2
126
c.
14 - 29
P
0.013
Chapter 14
Residuals
4
2
0
-2
-4
-6
0
10
15
20
25
s 2 = 23.78
hi =
e.
46. a.
1 ( xi x ) 2
1 ( xi 14) 2
+
=
+
n ( xi x ) 2 5
126
b.
4
3
Residuals
2
1
0
-1
-2
-3
-4
0
10
The assumption that the variance is the same for all values of x is questionable. The variance
appears to increase for larger values of x.
47. a.
14 - 30
y = 29.4 + 1.55 x
b.
c.
10
Residuals
5
0
-5
-10
-15
25
30
35
40
45
50
55
60
65
Predicted Values
d.
The residual plot leads us to question the assumption of a linear relationship between x and y. Even
though the relationship is significant at the .05 level of significance, it would be extremely
dangerous to extrapolate beyond the range of the data.
14 - 31
Chapter 14
48. a.
y = 80 + 4 x
8
6
Residuals
4
2
0
-2
-4
-6
-8
0
10
12
14
b.
49. a.
b.
Standardized Residuals
2
1.5
1
0.5
0
-0.5
-1
-1.5
0
10
20
30
40
50
60
c.
There is an unusual trend in the residuals. The assumptions concerning the error term appear
questionable.
14 - 32
Coef
66.10
0.4023
S = 12.62
SE Coef
32.06
0.2276
R-sq = 38.5%
T
2.06
1.77
p
0.094
0.137
R-sq(adj) = 26.1%
Analysis of Variance
SOURCE
DF
Regression
1
Residual Error 5
Total
6
SS
497.2
795.7
1292.9
Unusual Observations
Obs.
X
Y
1
135
145.00
MS
497.2
159.1
Fit
120.42
SEFit
4.87
F
3.12
Residual
24.58
p
0.137
St.Resid
2.11R
*
*
*
*
*
--+---------+---------+---------+---------+---------+----YHAT
110.0
115.0
120.0
125.0
130.0
135.0
The standardized residual plot indicates that the observation x = 135, y = 145 may be an outlier;
note that this observation has a standardized residual of 2.11.
14 - 33
Chapter 14
c.
Y
51. a.
Coef
13.002
0.4248
S = 3.181
SE Coef
2.396
0.2116
R-sq = 40.2%
T
5.43
2.01
p
0.002
0.091
R-sq(adj) = 30.2%
Analysis of Variance
SOURCE
DF
Regression
1
Residual Error 6
Total
7
Unusual Observations
Obs.
X
Y
7
12.0
24.00
8
22.0
19.00
SS
40.78
60.72
101.50
MS
40.78
10.12
Fit Stdev.Fit
18.10
1.20
22.35
2.78
F
4.03
Residual
5.90
-3.35
p
0.091
St.Resid
2.00R
-2.16RX
14 - 34
24.0+
20.0+
16.0+
12.0+
-
*
*
*
*
*
*
*
+---------+---------+---------+---------+---------+------X
0.0
5.0
10.0
15.0
20.0
25.0
The scatter diagram indicates that the observation x = 22, y = 19 is an influential observation.
52. a.
Coef
4.089
0.19552
SE Coef
2.168
0.03635
R-Sq = 78.3%
T
1.89
5.38
P
0.096
0.001
R-Sq(adj) = 75.6%
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
8
9
Unusual Observations
Obs
MediaExp
Amount
1
120
36.30
SS
735.84
203.51
939.35
Fit
27.55
MS
735.84
25.44
F
28.93
SE Fit
3.30
Residual
8.75
P
0.001
St Resid
2.30R
Minitab identifies observation 1 as having a large standardized residual; thus, we would consider
14 - 35
Chapter 14
observation 1 to be an outlier.
14 - 36
53. a.
Coef
15.193
0.018086
S = 5.29760
SE Coef
2.004
0.002079
R-Sq = 81.7%
T
7.58
8.70
P
0.000
0.000
R-Sq(adj) = 80.6%
Analysis of Variance
Source
Regression
Residual Error
Total
SS
2123.8
477.1
2600.9
MS
2123.8
28.1
F
75.68
P
0.000
(response is Price(NT$))
St andardized Residual
b.
DF
1
17
18
-1
-2
20
25
30
35
40
Fit t ed Value
14 - 37
45
50
55
Chapter 14
54. a.
Coef
707.0
0.0048154
S = 379.8
SE Coef
118.0
0.0008076
R-Sq = 66.4%
T
5.99
5.96
P
0.000
0.000
R-Sq(adj) = 64.5%
Analysis of Variance
Source
Regression
Residual Error
Total
Unusual
Obs
6
17
DF
1
18
19
Observations
MktCap
Salary
507217
3325.0
120967
116.2
SS
5129071
2596647
7725718
MS
5129071
144258
Fit
3149.5
1289.5
SE Fit
338.6
86.4
F
35.55
Residual
175.5
-1173.3
P
0.000
St Resid
1.02 X
-3.17R
b.
Standardized Residuals
3
2
1
0
-1
-2
-3
-4
500
1000
1500
2000
2500
3000
3500
Predicted Values
55.
No. Regression or correlation analysis can never prove that two variables are causally related.
56.
The estimate of a mean value is an estimate of the average of all y values associated with the same
x. The estimate of an individual y value is an estimate of only one of the y values associated with a
particular x.
57.
14 - 38
Coef
9.265
0.7105
S = 1.419
SE Coef
1.099
0.1474
R-Sq = 74.4%
T
8.43
4.82
P
0.000
0.001
R-Sq(adj) = 71.2%
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
8
9
SS
46.784
16.116
62.900
MS
46.784
2.015
F
23.22
P
0.001
b.
Since the p-value corresponding to F = 23.22 = .001 < = .05, the relationship is significant.
c.
r 2 = .744; a good fit. The least squares line explained 74.4% of the variability in Price.
d.
59. a.
Coef
-3.834
0.29567
S = 11.04
SE Coef
5.903
0.02648
R-Sq = 91.9%
T
-0.65
11.17
P
0.529
0.000
R-Sq(adj) = 91.2%
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
11
12
SS
15208
1341
16550
MS
15208
122
F
124.72
P
0.000
b.
y = 3.83 + .296(150) = 40.57 ; approximately 40.6 million shares of options grants outstanding.
c.
r 2 = .919; a very good fit. The least squares line explained 91.9% of the variability in Options.
60. a.
Coef
0.2747
0.9498
S = 2.664
SE Coef
0.9004
0.3569
R-Sq = 47.0%
T
0.31
2.66
P
0.768
0.029
R-Sq(adj) = 40.3%
Analysis of Variance
Source
DF
SS
MS
14 - 39
Chapter 14
Regression
1
Residual Error 8
Total
9
50.255
56.781
107.036
50.255
7.098
7.08
0.029
b.
Since the p-value = 0.029 is less than = .05, the relationship is significant.
c.
r2 = .470. The least squares line does not provide a very good fit.
d.
61. a.
100
High Temperature
90
80
70
60
50
40
35
45
55
65
75
85
Low Temperature
b.
It appears that there is a positive linear relationship between the two variables.
c.
Coef
23.899
0.8980
SE Coef
6.481
0.1121
R-Sq = 78.1%
T
3.69
8.01
P
0.002
0.000
R-Sq(adj) = 76.9%
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
18
19
SS
1792.3
502.7
2294.9
MS
1792.3
27.9
F
64.18
P
0.000
d.
Since the p-value corresponding to F = 64.18 = .000 < = .05, the relationship is significant.
e.
r 2 = .781; a good fit. The least squares line explained 78.1% of the variability in high temperature.
f.
r = .781 = +.88
14 - 40
14 - 41
Chapter 14
The Minitab output is shown below:
62.
Coef
10.528
0.9534
S = 4.250
SE Coef
3.745
0.1382
R-sq = 85.6%
T
2.81
6.90
p
0.023
0.000
R-sq(adj) = 83.8%
Analysis of Variance
SOURCE
DF
Regression
1
Residual Error 8
Total
9
Fit
39.13
Stdev.Fit
1.49
SS
860.05
144.47
1004.53
MS
860.05
18.06
95% C.I.
35.69, 42.57)
F
47.62
p
0.000
95% P.I.
28.74, 49.52)
a.
y = 10.5 + .953 x
b.
Since the p-value corresponding to F = 47.62 = .000 < = .05, we reject H0: 1 = 0.
c.
d.
63. a.
Coef
22.174
-0.14783
S = 1.489
SE Coef
1.653
0.04391
R-Sq = 73.9%
T
13.42
-3.37
P
0.000
0.028
R-Sq(adj) = 67.4%
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
4
5
SS
25.130
8.870
34.000
MS
25.130
2.217
F
11.33
P
0.028
SE Fit
0.896
95.0% CI
12.294, 17.271)
95.0% PI
9.957, 19.608)
b.
Since the p-value corresponding to F = 11.33 = .028 < = .05, the relationship is significant.
c.
r 2 = .739; a good fit. The least squares line explained 73.9% of the variability in the number of
defects.
d.
Using the Minitab output in part (a), the 95% confidence interval is 12.294 to 17.271.
14 - 42
There appears to be a negative linear relationship between distance to work and number of days
absent.
The MINITAB output is shown below:
The regression equation is
Y = 8.10 - 0.344 X
Predictor
Constant
X
Coef
8.0978
-0.34420
S = 1.289
SE Coef
0.8088
0.07761
R-sq = 71.1%
T
10.01
-4.43
p
0.000
0.002
R-sq(adj) = 67.5%
Analysis of Variance
SOURCE
DF
Regression
1
Residual Error 8
Total
9
Fit
6.377
SS
32.699
13.301
46.000
Stdev.Fit
0.512
MS
32.699
1.663
95% C.I.
5.195, 7.559)
F
19.67
p
0.002
95% P.I.
3.176, 9.577)
c.
d.
r2 = .711. The estimated regression equation explained 71.1% of the variability in y; this is a
reasonably good fit.
e.
The 95% confidence interval is 5.195 to 7.559 or approximately 5.2 to 7.6 days.
65. a.
Coef
220.00
131.67
S = 75.50
SE Coef
58.48
17.80
R-sq = 87.3%
T
3.76
7.40
p
0.006
0.000
R-sq(adj) = 85.7%
Analysis of Variance
SOURCE
DF
Regression
1
Residual Error 8
Total
9
Fit
746.7
Stdev.Fit
29.8
SS
312050
45600
357650
MS
312050
5700
95% C.I.
678.0, 815.4)
F
54.75
p
0.000
95% P.I.
559.5, 933.9)
b.
Since the p-value corresponding to F = 54.75 is .000 < = .05, we reject H0: 1 = 0.
c.
d.
14 - 43
Chapter 14
66. a.
Coef
5.847
0.8295
S = 7.523
SE Coef
7.972
0.1095
R-sq = 87.8%
T
0.73
7.58
p
0.484
0.000
R-sq(adj) = 86.2%
Analysis of Variance
SOURCE
DF
Regression
1
Residual Error 8
Total
9
Fit
84.65
SS
3249.7
452.8
3702.5
Stdev.Fit
3.67
MS
3249.7
56.6
95% C.I.
76.19, 93.11)
F
57.42
p
0.000
95% P.I.
65.35, 103.96)
b.
Since the p-value corresponding to F = 57.42 is .000 < = .05, we reject H0: 1 = 0.
c.
84.65 points
d.
67. a.
Coef
-0.4710
0.00003868
S = 0.2088
SE Coef
0.5842
0.00001731
R-Sq = 21.7%
T
-0.81
2.23
P
0.431
0.038
R-Sq(adj) = 17.4%
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
18
19
SS
0.21749
0.78451
1.00200
MS
0.21749
0.04358
F
4.99
P
0.038
Fit
0.8828
SE Fit
0.0523
95.0% CI
( 0.7729, 0.9927)
95.0% PI
( 0.4306, 1.3349)
b.
Since the p-value = 0.038 is less than = .05, the relationship is significant.
c.
r2 = .217. The least squares line does not provide a very good fit.
d.
14 - 44