REGRESSION
Introduction
Regression Analysis
Example of Simple Regression Analysis
Relevance to Statistics
The SLRM or First-Order
(Straight-Line) Model
This simple model approximates the relationship between the dependent
variable y and the independent variable x with a straight line:
y = β0 + β1x + ε
where:
β0 = the y-intercept of the regression line (the expected y-value
when x = 0); has practical meaning only if x = 0 is in the range of
values of x in the sample data
Justifications for the Error Term in the
Model: ε (epsilon)
1. Accounts for other factors, which are not included in the model, that
affect Y aside from X
2. Accounts for measurement error even for those factor/s included in
the model
3. Dependent Variable Y is a random variable due to the error term in
the model.
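The role of the error term can be illustrated with a small simulation. This is only a sketch: the values of BETA0, BETA1, and SIGMA are hypothetical, and the error term is assumed to be normally distributed with mean zero. For a fixed x, repeated draws of Y differ because of ε, while their average stays close to the straight-line part of the model.

```python
import random

# Hypothetical parameter values, chosen only for illustration.
BETA0 = 2.0   # y-intercept
BETA1 = 0.5   # slope
SIGMA = 1.0   # standard deviation of the error term (assumed normal)


def draw_y(x, rng):
    """Return one observation from y = BETA0 + BETA1*x + epsilon."""
    epsilon = rng.gauss(0.0, SIGMA)  # random error term
    return BETA0 + BETA1 * x + epsilon


rng = random.Random(42)  # fixed seed so the run is repeatable
x = 10

# Two observations at the same x give two different y-values.
first, second = draw_y(x, rng), draw_y(x, rng)
print(first, second)

# The average of many draws approaches BETA0 + BETA1*x = 7.0.
draws = [draw_y(x, rng) for _ in range(10_000)]
mean_y = sum(draws) / len(draws)
print(mean_y)
```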
The least-squares method chooses the estimates b0 and b1 of the
parameters β0 and β1, respectively, so that the sum of squared errors
(SSE) is as small as possible. The resulting least-squares line, or
linear regression equation, is denoted by:
ŷ = b0 + b1*x
Sxx = ∑xi² - n(x̄²)
Sxy = ∑xiyi - n(x̄)(ȳ)
Syy = ∑yi² - n(ȳ²)
MSE = SSE/(n-2)
SSE = Syy - b1*Sxy
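The least-squares quantities can be sketched in a few lines of Python. The function name fit_slr and the toy data are hypothetical. The intercept uses the standard relation b0 = ȳ - b1*x̄, and the slope uses b1 = Sxy/Sxx, with Sxx built from the x-values alone and Sxy from the cross-products.

```python
def fit_slr(x, y):
    """Least-squares fit of y-hat = b0 + b1*x from raw data."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n

    # Corrected sums of squares and cross-products.
    sxx = sum(xi * xi for xi in x) - n * x_bar ** 2
    sxy = sum(xi * yi for xi, yi in zip(x, y)) - n * x_bar * y_bar
    syy = sum(yi * yi for yi in y) - n * y_bar ** 2

    b1 = sxy / sxx             # slope estimate
    b0 = y_bar - b1 * x_bar    # intercept estimate
    sse = syy - b1 * sxy       # sum of squared errors
    mse = sse / (n - 2)        # mean squared error, n - 2 degrees of freedom
    return {"b0": b0, "b1": b1, "sxx": sxx, "sxy": sxy,
            "syy": syy, "sse": sse, "mse": mse}


# Hypothetical, perfectly linear toy data (y = 1 + 2x), used as a sanity check.
toy_x = [1, 2, 3, 4, 5]
toy_y = [3, 5, 7, 9, 11]
toy_fit = fit_slr(toy_x, toy_y)
print(toy_fit["b0"], toy_fit["b1"], toy_fit["sse"])
```

Because the toy points lie exactly on a line, the fit recovers the intercept 1 and slope 2 with no residual error, which is a quick way to confirm the formulas are wired correctly.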
Significance Tests
Concerning SLRM
The significance test determines whether to reject or fail to reject
the null hypothesis, depending on whether the test statistic falls
in the rejection region.
Significance Test
Test statistic: t = b1/sqrt(MSE/Sxx)
Rejection region: |t| > t(α/2, n-2)
b1 = Sxy/Sxx
MSE = SSE/(n-2)
Sxx = ∑xi² - n(x̄²)
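A minimal sketch of the slope test follows, assuming a two-tailed test of H0: β1 = 0 with n - 2 degrees of freedom. The function name slope_t_test and the summary values passed to it are hypothetical. The critical value is supplied by the caller from a t-table; 2.228 is the tabled value for α = 0.05 and 10 degrees of freedom.

```python
import math


def slope_t_test(b1, mse, sxx, t_crit):
    """Return (t, reject_h0) for the two-tailed test of H0: beta1 = 0."""
    se_b1 = math.sqrt(mse / sxx)   # estimated standard error of b1
    t = b1 / se_b1                 # test statistic
    reject_h0 = abs(t) > t_crit    # rejection region: |t| > t(alpha/2, n-2)
    return t, reject_h0


# Hypothetical summary values for a sample of n = 12 (10 degrees of freedom).
t_stat, reject = slope_t_test(b1=2.0, mse=4.0, sxx=100.0, t_crit=2.228)
print(t_stat, reject)
```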
Pearson’s r
IN LINEAR CORRELATION
a statistical method of determining the nature and strength of the
linear relationship between two numerical (i.e., interval and ratio)
variables X and Y using a single numerical value known as Pearson’s
product-moment correlation coefficient (or Pearson’s r)
OTHER VARIATIONS
What do we get from knowing r ?
A) R’S SIGN
◉ IF IT IS POSITIVE, IT MEANS THERE IS A DIRECT LINEAR
RELATIONSHIP BETWEEN X AND Y
◉ IF IT IS NEGATIVE, IT MEANS THERE IS AN INVERSE LINEAR
RELATIONSHIP BETWEEN X AND Y
B) MAGNITUDE OF R
LIMITATIONS OF PEARSON’S R
EXAMPLE
SUMMARY STATISTICS:
Sxy = 79.6
Sxx = 65.6
Syy = 113.6
To solve for Pearson’s r:
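With these summary statistics, Pearson's r can be computed using the standard formula r = Sxy / sqrt(Sxx * Syy). The short sketch below only carries out that arithmetic; the variable names are illustrative.

```python
import math

# Summary statistics from the example.
sxy = 79.6
sxx = 65.6
syy = 113.6

# Pearson's r = Sxy / sqrt(Sxx * Syy)
r = sxy / math.sqrt(sxx * syy)
print(round(r, 3))  # 0.922
```

The positive sign indicates a direct linear relationship, and a value this close to 1 indicates a strong one.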
TESTING ITS SIGNIFICANCE
Statements:
H0: ρ = 0 (i.e., there is no significant linear relationship between X and Y) versus
Ha: ρ ≠ 0 (i.e., there is a significant linear relationship between X and Y)
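One common way to carry out this test is the t statistic t = r * sqrt((n - 2) / (1 - r²)) with n - 2 degrees of freedom. The sketch below assumes that statistic and a two-tailed test at α = 0.05. The function name is hypothetical, and the inputs are r = -0.979 and n = 6, the values for the employee age and absence data in the activity. The critical value 2.776 is the tabled t value for 4 degrees of freedom.

```python
import math


def correlation_t_statistic(r, n):
    """t statistic for H0: rho = 0, with n - 2 degrees of freedom."""
    return r * math.sqrt((n - 2) / (1 - r ** 2))


r_value, n_obs = -0.979, 6       # from the activity's age/days data
t_value = correlation_t_statistic(r_value, n_obs)

T_CRIT = 2.776                   # t-table value: alpha/2 = 0.025, 4 df
reject_h0 = abs(t_value) > T_CRIT
print(round(t_value, 2), reject_h0)
```

Here |t| is well beyond the critical value, so H0: ρ = 0 would be rejected at the 5% level under these assumptions.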
Simple Case
FORMULA
SIMPLE CASE
Store   Shelf Space, X (feet)   Weekly Sales, Y (dollars)
1       5                       160
2       5                       220
3       5                       140
4       10                      190
5       10                      240
6       10                      260
7       15                      230
8       15                      270
9       15                      280
10      20                      260
11      20                      290
12      20                      310
Required
◉ Set up the SLRM for this data set and indicate the scope of
regression.
◉ Estimate the expected weekly sales of all the stores with
12 feet of shelf space.
◉ Interpret the scope of regression equation.
◉ Test the significance of shelf space as a predictor for the
mean weekly sales.
◉ Give and interpret the following: Pearson’s r; Coefficient of
determination r-squared
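The required items can be worked through numerically with the sketch below. It assumes the usual least-squares formulas (b1 = Sxy/Sxx, b0 = ȳ - b1*x̄) and a two-tailed slope test at α = 0.05. The significance level is an assumption, since the exercise does not state one. The critical value 2.228 is the tabled t value for 10 degrees of freedom.

```python
import math

# Shelf space X (feet) and weekly sales Y (dollars) for stores 1 to 12.
x = [5, 5, 5, 10, 10, 10, 15, 15, 15, 20, 20, 20]
y = [160, 220, 140, 190, 240, 260, 230, 270, 280, 260, 290, 310]

n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n
sxx = sum(v * v for v in x) - n * x_bar ** 2
sxy = sum(a * b for a, b in zip(x, y)) - n * x_bar * y_bar
syy = sum(v * v for v in y) - n * y_bar ** 2

# Fitted line y-hat = b0 + b1*x.
b1 = sxy / sxx
b0 = y_bar - b1 * x_bar

# Expected weekly sales at 12 feet (inside the observed range of 5 to 20 feet).
y_hat_12 = b0 + b1 * 12

# Test of shelf space as a predictor: H0: beta1 = 0.
sse = syy - b1 * sxy
mse = sse / (n - 2)
t_slope = b1 / math.sqrt(mse / sxx)
T_CRIT = 2.228                       # assumed alpha = 0.05, two-tailed, 10 df
significant = abs(t_slope) > T_CRIT

# Pearson's r and the coefficient of determination.
r = sxy / math.sqrt(sxx * syy)
r_squared = r ** 2

print(f"y-hat = {b0:.1f} + {b1:.1f}x")
print(f"expected sales at 12 ft: {y_hat_12:.1f}")
print(f"t = {t_slope:.2f}, significant: {significant}")
print(f"r = {r:.3f}, r^2 = {r_squared:.3f}")
```

Under these assumptions the fitted line is roughly ŷ = 145 + 7.4x, so each additional foot of shelf space is associated with about 7.4 dollars more in expected weekly sales, and about 68% of the variation in weekly sales is explained by shelf space.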
Activity
QUESTION 1
QUESTION 2
QUESTION 3
The error term (ε) is the reason for the model being called
the ___________ model
a. Probabilistic
b. America’s Next Top
c. Deterministic
d. Victoria’s Secret
QUESTION 4
a. r= -0.62
b. r= 0.005
c. r= 0.81
d. r= -1
QUESTION 5
QUESTION 6
QUESTION 7
A. Regression Analysis
B. Regressive Analysis
C. Registration Analysis
D. Regular Analysis
FOR QUESTIONS 8 - 10
Employee 1 2 3 4 5 6
Age (X) 18 26 39 48 53 58
Days (Y) 16 12 9 5 6 2
8. Set up the equation of the fitted regression line for this data set.
9. Give Pearson’s correlation coefficient, r.
10. Give the sample coefficient of determination, r².
BONUS QUESTION
Who sang the hit song “Beep Beep Beep (Ang Sabi ng Jeep)”?
ANSWERS
1. B
2. B
3. A
4. C
5. A
6. A
7. A
8. ŷ = 21.100 - 0.317x
9. -0.979
10. 95.9%
Bonus: Willie Revillame/Kuya Wil