Sie sind auf Seite 1von 24

min  20

20, 22, 23, 24, 24, 25, 25, 27, 35 Q1  22.5


med  24
Q3  26
 Are there any outliers? max  35
Q1  1.5(3.5) Q3  1.5(3.5)
22.5  5.25
17.25
26  5.25
31.25

Outlier  35
 Draw a skeleton boxplot.

 Draw a modified boxplot.


Chebyshev’s & The
Empirical Rule
Objective
Calculate values
using Chebyshev’s
Theorem and the
Empirical Rule.
Relevance
To be able to calculate
values with
symmetrical and non-
symmetrical
distributions.
Describing Data in terms of the
Standard Deviation.

Test Mean = 80
St. Dev. = 5
Chebyshev’s Rule
The percent of observations that are within
k standard deviations of the mean is at
least
 1
100 1  2  %
 k 
Facts about Chebyshev
 Applicable to any data set – whether it is
symmetric or skewed.

 Many times there are more than 75% - this


is a very conservative estimation.
Using Chebyshev
Use your calculators to fill in the list

2 0.75 75%
3 0.89 89%
4 0.94 94%
4.472 0.95 95%
5 0.96 96%
10 0.99 99%
Interpret using Chebyshev
Test Mean = 80
St. Dev. = 5

80

1. What percent are between 70 and 90? 75%


2. What percent are between 60 and 100? 94%
Collect wrist measurements
(in)
 Create distribution
 Find st. dev & mean.
 What percent is within 1 deviation of mean
Practice Problems
1. Using Chebyshev, solve the following problem for a
distribution with a mean of 80 and a st. dev. of 10.
a. At least what percentage of values will fall between 60
and 100?

k  2  75%
b. At least what percentage of values will fall between 65
and 95?
1
k  1.5  1  2 x100  56%
1.5
Normal Distributions
 These are special density curves.

 They have the same overall shape


 Symmetric
 Single-Peaked
 Bell-Shaped

 They are completely described by giving its


mean () and its standard deviation ().

 We abbreviate it N(,)
Normal Curves….

•Changing the mean without changing the standard


deviation simply moves the curve horizontally.
•The Standard deviation controls the spread of a
Normal Curve.
Standard Deviation
 It’s the natural measure of spread for Normal
distributions.

 It can be located by eye on a Normal curve.


 It’s
the point at which the curve changes from concave
down to concave up.
Why is the Normal Curve Important?
 They are good descriptions for some real data
such as
 Testscores like SAT, IQ
 Repeated careful measurements of the same quantity
 Characteristics of biological populations (height)

 They are good approximations to the results of


many kinds of chance outcomes

 They are used in many statistical inference


procedures.
Empirical Rule
 Can only be used if the data can be
reasonably described by a normal curve.
 Approximately
 68% of the data is within 1 st. dev. of mean
 95% of the data is within 2 st. dev. of mean
 99.7% of data is within 3 st. dev. of mean
Empirical Rule
 What percent do you think……

 www.whfreeman.com/tps4e
Empirical Rule (68-95-99.7 Rule)

 In the Normal distribution with mean ()


and standard deviation ():
 Within 1 of  ≈ 68% of the observations
 Within 2 of  ≈ 95% of the observations
 Within 3 of  ≈ 99.7% of the observations
The distribution of batting average (proportion of hits) for the 432
Major League Baseball players with at least 100 plate appearances
in the 2009 season is normally distributed defined N(0.261, 0.034).

 Sketch a Normal density curve for this distribution of batting


averages. Label the points that are 1, 2, and 3 standard
deviations from the mean.

.159 .193 .227 .261 .295 .329 .363


 What percent of the batting averages are above 0.329?
2.5%
 What percent are between 0.227 and .295?

68%
Scores on the Wechsler adult Intelligence Scale (a
standard IQ test) for the 20 to 34 age group are
approximately Normally distributed. N(110, 25).

 What percent are between 85 and 135?


35, 60, 85, 110, 135, 160, 185

68%
 What percent are below 185?

100  .15  .9985 or 99.85%


 What percent are below 60?

2.5%
2. A sample of the hourly wages of employees who work in
restaurants in a large city has a mean of $5.02 and a st.
dev. of $0.09.
a. Using Chebyshev’s, find the range in which at least
75% of the data will fall.

75%  2 st. dev 4.66  5.38

b. Using the Empirical rule, find the range in which at


least 68% of the data will fall.

68%  1 st. dev 4.93  5.11


The mean of a distribution is 50 and the standard deviation is
6. Using the empirical rule, find the percentage that will fall
between 38 and 62.

2 st. dev  95%


A sample of the labor costs per hour to assemble a certain
product has a mean of $2.60 and a standard deviation of
$0.15, using Chebyshev’s, find the values in which at least
88.89% of the data will lie.

88.89%  3 st. dev.

x  3s  2.15  3.05
Homework
 Worksheet

 Quiz Friday 9/25 on Boxplots & outliers

Das könnte Ihnen auch gefallen