Beruflich Dokumente
Kultur Dokumente
SEM2 2010-11
CORRELATION &
SIMPLE REGRESSION
250
250
200
200
150
150
100
100
50
50
0
0
50
100
150
200
250
300
50
Positive
100
150
200
250
Negative
180
160
160
140
140
120
120
100
100
80
80
60
60
40
40
20
20
0
0
50
100
150
High
200
250
50
100
150
200
250
Low
180
200
160
180
160
140
140
120
120
100
100
80
80
60
60
40
40
20
20
0
0
0
50
100
High
150
200
250
50
100
150
200
250
Zero
140
120
100
80
60
40
20
0
0
50
100
150
200
Height (cms)
7
160
140
Symptom Index
Weight (kgs)
160
120
100
80
60
40
20
0
0
50
100
150
200
250
180
Symptom Index
160
140
120
100
80
60
40
20
0
0
50
100
150
200
250
160
Symptom Index
140
120
100
Why not?
Residuals
80
60
40
20
0
0
50
100
150
200
250
Correlation examples
11
Regression
IV Internal / Ratio
Requirement :
The independent and dependent variables are normally
distributed in the population
The cases represents a random sample from the population
Simple Regression
How best to summarise the data?
160
180
140
160
140
Symptom Index
Symptom Index
120
100
80
60
120
100
80
60
40
40
20
20
50
100
150
200
250
50
100
150
200
250
200
180
160
140
120
100
80
60
40
20
0
0
50
100
150
200
250
10
Simple Regression
R2 - Goodness of fit
High values show good fit, low values show poor fit
Simple Regression
Low values of R2
DV
300
250
200
150
100
50
0
0
100
200
300
R2 = 0
(0% - randomly scattered
points, no apparent
relationship between X
and Y)
Implies that a best-fit line
will be a very poor
description of data
IV (regressor, predictor)
11
Simple Regression
High values of R2
300
250
DV
200
R2 = 1
150
100
50
0
0
100
200
300
IV
250
DV
200
150
100
50
0
0
50
100
150
200
250
IV
Simple Regression
R2 - Goodness of fit
180
160
160
140
120
120
S ymptom Index
S ymptom Index
140
100
80
60
100
80
60
40
40
20
20
0
0
50
100
150
200
250
50
100
150
200
250
12
6
5
4
3
2
1
0
0
25
180
Symptom Index
160
140
120
100
80
60
40
20
0
0
50
100
150
200
250
26
13
Regression
27
Regression - Types
14
Yi = 0 + 1 X i + i
Constant
Population
Regression Coefficients
Sample
= a + bX
Y
Parameters
l
30
15
If
1 > 0
0 + 1 X
X
31
If
1 < 0
0 + 1 X
X
32
16
If
0 + 1 X
X
Copyright (c) Bani K. Mallick
33
If
H0 :
H0 : 1 = 0
l
17
Y
Sales
1.52
1.68
1.8
2.05
2.36
2.25
2.68
2.9
3.14
3.06
3.24
1.92
3.4
3.28
3.17
2.83
2.58
2.86
2.26
2.14
1.98
20
40
60
80
100
120
Hypothesis Test :
1 Regression Model
2 Slope
18
a=yb X
= a + bX
Example1 :
Data were collected from a randomly
selected sample to determine relationship
between average assignment scores and test
scores in statistics. Distribution for
the data is presented in the table below.
1. Calculate coefficient of determination
and the correlation coefficient
2. Determine the prediction equation.
3. Test hypothesis for the slope at 0.05 level
of significance
Data set:
ID
1
2
3
4
5
6
7
8
9
10
Scores
Assign
8.5
6
9
10
8
7
5
6
7.5
5
Test
88
66
94
98
87
72
45
63
85
77
19
ID
1
2
3
4
5
6
7
8
9
10
= 215.5 = 8.257
26.1
a= y b x
X
8.5
6
9
10
8
7
5
6
7.5
5
Y
88
66
94
98
87
72
45
63
85
77
Summary stat:
= 18.050
Prediction equation:
= 18.05 + 8.257X
10
72
775
544.5
62,441
5,795.5
57
18.05
8.2
20
Example 2:
MARITAL SATISFACTION
Children : Y
Parents : X
1
3
7
9
8
4
5
Mean of X
No of pairs
X
X squared
Standard deviation
XY
3
2
6
7
8
6
3
Mean of Y
Y
X squared
Standard deviation
a= y b x
= 5.00 +.65 (5.29)
= 8.438
Prediction equation:
= 8.44 + 65x
21
0.6
8.43
Descriptive Statistics
Mean
Grade - PMR MATH
TEACHER_FACTOR
Std. Deviation
2.53
1.468
62
3.9643
.91443
62
Correlations
Model Summaryb
Model
ACTOR
1.000
.571
R
.571a
R Square
Adjusted R
Square
.326
.315
di
TEACHER_FACTO
.571
1.000
.000
.000
62
62
R
Sig. (1-tailed)
R
N
si
62
62
22
ANOVAb
Model
1
Sum of
Squares
Regression
Residual
df
42.848
88.588
Total
131.435
1
60
61
Mean
Square
42.848
1.476
F
29.021
Sig.
.000a
Model
Coefficientsa
Standardized
Unstandardized Coefficients
Coefficients
B
Std. Error
Beta
-1.101
.692
(Constant)
TEACHER_FACTOR
.917
.170
.571
t
-1.591
Sig.
.117
5.387
.000
Descriptive Statistics
Mean
Std. Deviation
2.53
1.468
TEACHER_FACTOR
3.9643
Race
Grade - PMR
MATH
TEACHER_FA
CTOR
Race
62
.91443
1.90
62
.593
Correlations
Pearson
Correlation
Grade - TEACHER
PMR MATH
_FACTOR
Race
1.000
.571
-.015
.571
1.000
.019
-.015
.019
1.000
.000
.453
.000
.440
.453
.440
62
62
62
62
62
62
62
62
62
Model
62
Model Summaryb
Adjusted R Std. Error of
R Square
Square
the Estimate
R
.572a
.327
.304
1.225
Sig. (1-tailed)
Grade - PMR
MATH
TEACHER_FA
CTOR
Race
Grade - PMR
MATH
TEACHER_FA
CTOR
Race
23
Model
1
Regression
Residual
Total
ANOVAb
Sum of
Mean
Squares
Square
df
F
Sig.
42.939
2
21.469
14.313
.000a
88.497
59
1.500
131.435
61
Coefficientsa
Model
(Constant)
TEACHER_FACTOR
Race
a. Dependent Variable: Grade - PMR MATH
Unstandardized Coefficients
B
Std. Error
-.980
.853
.917
.172
-.065
.265
Standardized
Coefficients
Beta
.571
-.026
t
-1.150
5.349
-.246
Sig.
.255
.000
.806
24