Beruflich Dokumente
Kultur Dokumente
Descriptive Statistics
Part 1: Organizing Data
Learning Objectives
Variable and data types
Distribution tables and histograms
Distribution shapes
Data: Definitions
Variable: A characteristic that varies
from one individual to another. In
statistics, we call these random
variables, because their values occur
by chance.
Example: Hair colour, height, number
of cells in a bacterial culture, etc.
5
Data
Observing several values of a variable
gives a set of data.
Qualitative data comes from several
values of a qualitative variable.
Similar for discrete and continuous
data.
Goal 1 of Descriptive
Statistics
Make the data easy to read by
somehow grouping and graphing it.
The method of grouping and graphing
will depend on what type of data you
have!
Normally use a computer to group and
graph for us.
10
Qualitative Data
This is the easiest to group and graph.
All you can do is count the number of
individual values in each category and
record these is a table.
Most common graphing options:
11
12
Frequency
Relative
Frequency
Nintendo WiiU
Playstation 4
Xbox One
TOTAL
13
Playstation
Xbox
14
Frequency
Chart
10
8
6
4
2
Nintendo
Playstation
Xbox
15
Relative Frequency
Chart
0.6
0.5
0.4
0.3
0.2
Nintendo
Playstation
Xbox
16
Playstation
Xbox
17
Quantitative Data
Graphs and grouping methods are
similar whether data is discrete or
continuous.
19
20
[ x, y )
Means everything from x up to, but not including y.
Example: All values from 30 to 40, including 30
but NOT including 40 Itself, would be:
[30, 40)
24
Partial Solution
Distribution of TV Viewing Times (minutes)
Class
[60, 65)
[65, 70)
[70, 75)
[75, 80)
[80, 85)
[85, 90)
[90, 95)
[95, 100)
[100, 105)
Frequency
Relative
Frequency
Cumulative
Cumulative
Relative
Frequency
Frequency
Fill these in on
the handout!
27
29
31
Example:
Reading Histograms
Going back to the TV viewing times
data, the following histograms (next
slides) were done in Minitab.
Use the appropriate histogram to
answer the questions that follow.
33
38
Distribution Shapes
When you make a histogram, it has a
particular shape. This is called the
distribution shape.
The shape of your data will determine
the type of statistical analysis that is
appropriate.
Therefore, this is information you will
use for the rest of the course and in
EVERY statistical study you conduct.
39
How to Construct a
Distribution Shape
R
E
L
A
T
I
V
E
F
R
E
Q
E
N
C
Y
OBSERVATION CLASSES
41
Distribution Shapes
The following are perfect distribution
shapes.
In real life, you will almost never get a
histogram that looks perfect.
Therefore, these are meant as
guidelines.
42
43
44
45
46
48
Bimodal
49
Multimodal
50
Sample Distributions
What you just saw are perfect
distribution shapes.
In real life, youll likely NEVER see a
perfect distribution shape in your
histograms from your samples.
This is called Sampling Error.
51
Sampling Error
This will cause some minor variations in
a histogram.
Usually it is good enough to say a
histograms shape is close to one of
our distributions.
52
Example
The following histogram came from a
normal distribution:
Example
What do you think is the distribution
shape of this histogram?
55
56
57
58
Accuracy
Why do those histograms have
declining accuracy of the shape?
A histograms shape is most obvious
(and therefore, most representative of
the populations distribution) when the
sample size is ?
59