Beruflich Dokumente
Kultur Dokumente
Chapter 11
Correlation and Regression: Measuring and Predicting Relationships
2/10/2012
Slide 11-2
Scatterplot
A picture to explore the relationship in bivariate data
Correlation r
Measures strength of the relationship (from 1 to 1)
Regression
Predicting one variable from the other
2/10/2012
Slide 11-3
Interpreting Correlation
Y Y
r=1
A perfect straight line tilting up to the right
X Y Y
r=0
No overall tilt No relationship?
X Y
r = 1
A perfect straight line tilting down to the right
2/10/2012
Linear relationship
Increasing relationship
Tilts up and to the right
2/10/2012
Linear relationship
$1,000
$500
Increasing relationship
Tilts up and to the right
2/10/2012
Correlation is r = 0.890
Strong negative association Straight line with scatter
Interest rate
Linear relationship
6.0% 5.5%
Decreasing relationship
0%
1%
2% Loan fee
3%
4%
2/10/2012
Correlation is r = 0.11
Today's change
A weak relationship?
No relationship?
Tilt is neither up nor down
2/10/2012
A nonlinear relationship
Call Price
$100 $75 $50 $25 $0 $450 $500 $550 Strike Price $600 $650
Correlation r = 0.895
A negative relationship: Higher strike price goes with lower call price
2/10/2012
Correlation r = 0.0155
r suggests no relationship
Temperature
2/10/2012
Example: Telecommunications
Correlation r = 0.820
Circuit miles (millions) Log of miles 2,000 1,000 0 0 1,000 2,000 Investment ($millions) 20
r = 0.957
15 15 20 Log of investment
2/10/2012
Correlation r = 0.950
for all bonds
Correlation r = 0.994
Ordinary bonds only
2/10/2012
An outlier is visible
A disaster (a fire at the factory) High cost, but few produced
Cost 5,000 Cost
10,000
r = 0.623
4,000 3,000
0 0 20 40 60 Number produced
20 30 40 50 Number produced
2/10/2012
Slide 11-13
Correlation r = 0.8667
Salary ($thousand) Experience 15 10 20 5 15 5
2/10/2012
Salary 30 35 55 22 40 27
60 50 40 30 20 0 10 20 Experience
Slide 11-14
2/10/2012
Slide 11-15
2/10/2012
Slide 11-16
Salary
30 20 10 0 10 Experience 20
2/10/2012
Slide 11-17
Se ! SY r 2 n 1 1
Slide 11-18
Se (continued)
About 68% of the data are within one standard error of estimate of the least-squares line
(For a bivariate normal distribution)
60 50 Salary 40 30 20 0 10 Experience 20
Interpretation: similar to standard deviation Can move Least-Squares Line up and down by Se
2/10/2012
Slide 11-19
Coefficient of Determination R2
Tells what percent of the variability (variance) of Y is explained by X Example: R2 = 0.86672 = 0.751
Experience explains 75.1% of the variation in salaries
2/10/2012
Slide 11-20
Linear Model
The foundation for statistical inference in regression Observed Y is a straight line, plus randomness
Y = E+ FX + I
Randomness of individuals Population relationship, on average
Y
X
2/10/2012
Slide 11-21
2/10/2012
Slide 11-22
Approximately how far the observed slope b is from the population slope F Example (Salary vs. Experience)
6.52 Sb ! ! 0.48 6.06 6 1
Observed slope, b = 1.673, is about 0.48 away from the unknown slope of the population
2/10/2012
Slide 11-23
Statistical Inference
b s tS b
where t has n 2 degrees of freedom
YES
If 0 is not in the confidence interval
Or if |tstatistic| = |b/Sb| > ttable
NO
Otherwise
2/10/2012
Slide 11-24
Slide 11-25