Data
Which of the following Venn diagrams
shows the relationship between
population data and sample data?
[Figure: four Venn diagrams, options a) to d), each arranging a region P (population) and a region S (sample).]
Levels of Measurement
a). Nominal
b). Ordinal
c). Interval
d). Ratio
Researchers want data on taste of a group of
pineapples. A panel of tasters rates the
pineapples according to the categories
poor, acceptable, and good. Only some
of the pineapples are included in the taste
test. In this case, the _______ is taste. This is
a _________ variable. Because only some of
the pineapples in the field are included in the
study, we have a __________. The proportion
of pineapples in the sample with a taste rating
of good is a __________.
Researchers want data on taste of a group of
pineapples. A panel of tasters rates the
pineapples according to the categories
poor, acceptable, and good. Only some
of the pineapples are included in the taste
test. In this case, the variable is taste. This is
a qualitative variable. Because only some of
the pineapples in the field are included in the
study, we have a sample. The proportion of
pineapples in the sample with a taste rating of
good is a statistic.
Two Branches of Statistics
Stratified sampling
[Figure: a population divided into Subgroups 1 to 4, with a sample drawn from the subgroups.]
Sampling Techniques
Systematic sampling
Number every member of the population.
Select every kth member.
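The two steps above can be sketched in a few lines of Python. This is only an illustration: the function name `systematic_sample`, the population of 100 numbered members, and the choice of a random starting point within the first k members are assumptions, not part of the slides.

```python
import random

def systematic_sample(population, k, seed=None):
    """Return every k-th member, starting at a random position among the first k."""
    rng = random.Random(seed)
    start = rng.randrange(k)      # assumed: random start in positions 0..k-1
    return population[start::k]   # then take every k-th member

# Hypothetical population: members numbered 1 through 100.
members = list(range(1, 101))
sample = systematic_sample(members, k=10, seed=1)
print(sample)
```

With 100 members and k = 10, the sample always holds 10 members spaced exactly 10 apart, whichever start is drawn.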
Cluster sampling
Population is naturally divided into pre-existing
segments.
Make a random selection of clusters, then select
all members of each cluster.
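A minimal sketch of cluster sampling as described above: pick whole clusters at random, then keep every member of each chosen cluster. The cluster names and member labels below are hypothetical, as is the helper name `cluster_sample`.

```python
import random

def cluster_sample(clusters, n_clusters, seed=None):
    """Randomly choose n_clusters clusters and return all of their members."""
    rng = random.Random(seed)
    chosen = rng.sample(sorted(clusters), n_clusters)  # random selection of clusters
    members = [m for name in chosen for m in clusters[name]]  # all members of each
    return chosen, members

# Hypothetical pre-existing segments of a population.
clusters = {
    "block_A": ["a1", "a2", "a3"],
    "block_B": ["b1", "b2"],
    "block_C": ["c1", "c2", "c3", "c4"],
    "block_D": ["d1"],
}
chosen, members = cluster_sample(clusters, n_clusters=2, seed=0)
print(chosen, members)
```

Note the contrast with stratified sampling: here some segments contribute no members at all, while the chosen segments contribute everyone.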
In a census, measurements or
observations are obtained from the
entire population (uncommon and
often impractical).
In a sample, measurements or
observations are obtained from part
of the population (common).
Surveys
Collecting data from respondents by asking them
questions.
Survey Pitfalls
Nonresponse: undercoverage of the population.
Truthfulness: respondents sometimes lie.
Faulty recall by the respondent.
Hidden bias: due to poor question wording.
Vague wording: terms such as sometimes, often, seldom.
Interviewer influence: who is asking the
questions and in what manner.
Voluntary response: relatively interested
individuals are more likely to participate.
Frequency Tables
A frequency table
organizes quantitative data.
partitions data into classes (intervals).
shows how many data values are in each class.

Test Score    Number of Students
61-70         4
71-80         8
81-90         15
91-100        7
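As a small illustration, the table above can be stored as class/frequency pairs and summarized. The relative-frequency column (frequency divided by the total count) is an addition for this sketch and does not appear on the slide.

```python
# Class labels and frequencies copied from the test-score table.
classes = [("61-70", 4), ("71-80", 8), ("81-90", 15), ("91-100", 7)]

total = sum(freq for _, freq in classes)  # total number of students
relative = {label: freq / total for label, freq in classes}

print(f"{'Test Score':<12}{'Students':>9}{'Rel. freq.':>12}")
for label, freq in classes:
    print(f"{label:<12}{freq:>9}{relative[label]:>12.3f}")
print(f"{'Total':<12}{total:>9}")
```

The frequencies sum to 34 students, so the relative frequencies sum to 1.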
Data Classes and Class Frequency
Class: an interval of values.
Example: 61 ≤ x ≤ 70
It has:
A lower limit a and an
upper limit b.
A width.
A lower boundary and
an upper boundary
(integer data).
A midpoint.
Structure of a Data Class
A data class is basically an interval on a number line.
If a = 60 and b = 69
for integer data,
what is the value of
the lower boundary?
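One way to work this out, as a sketch: for integer data, class boundaries are commonly placed half a unit outside the class limits, so neighboring classes meet with no gap. The helper name `class_structure` is hypothetical, and taking the width as the distance between the two boundaries is an assumption that matches the spacing of adjacent lower limits for integer data.

```python
def class_structure(a, b):
    """Boundaries, midpoint, and width of an integer-data class with limits a and b."""
    lower_boundary = a - 0.5   # assumed convention for integer data
    upper_boundary = b + 0.5
    midpoint = (a + b) / 2
    width = upper_boundary - lower_boundary
    return {
        "lower_boundary": lower_boundary,
        "upper_boundary": upper_boundary,
        "midpoint": midpoint,
        "width": width,
    }

info = class_structure(60, 69)
print(info)
```

For a = 60 and b = 69 this gives a lower boundary of 59.5, an upper boundary of 69.5, a midpoint of 64.5, and a width of 10.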
[Figure: two distribution shapes, one skewed left and one skewed right.]
Critical Thinking
A bimodal distribution shape might
indicate that the data are from two
different populations.
Outliers: data values that are very
different from other values in the data set.
Outliers may indicate data recording
errors.
Exploratory Data Analysis
EDA is the process of learning about a
data set by creating graphs.
Example: Sixteen
students are asked
how many college
math classes they
have completed.
{0, 3, 2, 2, 1, 1, 0, 5,
1, 1, 0, 2, 2, 7,
1, 3}
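A quick text dot plot is one way to start exploring this data set. The sketch below counts how often each value occurs and prints one asterisk per student; the dot-plot format is a choice made here for illustration and may differ from the graph used in the original slides.

```python
from collections import Counter

# Number of college math classes completed by each of the sixteen students.
data = [0, 3, 2, 2, 1, 1, 0, 5, 1, 1, 0, 2, 2, 7, 1, 3]

counts = Counter(data)  # value -> number of students reporting it

# One row per possible value, including values nobody reported.
for value in range(min(data), max(data) + 1):
    print(f"{value}: {'*' * counts[value]}")
```

The plot makes the shape visible at a glance: most students have completed between 0 and 3 classes, and the values 5 and 7 sit apart from the rest.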
Median
Finding the median:
1). Order the data from smallest to largest.
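Only the ordering step is listed here. As a sketch, the usual rule after ordering is: take the middle value when the number of data values is odd, and the mean of the two middle values when it is even. The function below follows that rule and applies it to the sixteen class counts from the exploratory data analysis example.

```python
def median(values):
    """Median: middle value of the ordered data, or mean of the two middle values."""
    ordered = sorted(values)            # step 1: order from smallest to largest
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:                      # odd count: single middle value
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2  # even count: average the middle pair

data = [0, 3, 2, 2, 1, 1, 0, 5, 1, 1, 0, 2, 2, 7, 1, 3]
result = median(data)
print(result)
```

With sixteen values, the median is the mean of the 8th and 9th ordered values (1 and 2), which is 1.5.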
$$\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}$$
The Coefficient of Variation
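As a hedged illustration, the coefficient of variation is usually defined as the standard deviation expressed as a percentage of the mean: CV = (σ / μ) · 100% for a population, or (s / x̄) · 100% for a sample. The sketch below treats the sixteen class counts from the exploratory data analysis example as a whole population, which is an assumption made only for this example.

```python
import statistics

# Class counts from the sixteen-student example, treated here as a population.
data = [0, 3, 2, 2, 1, 1, 0, 5, 1, 1, 0, 2, 2, 7, 1, 3]

mu = statistics.mean(data)       # population mean
sigma = statistics.pstdev(data)  # population standard deviation (divides by N)
cv = sigma / mu * 100            # coefficient of variation, in percent

print(f"mean = {mu:.4f}, std dev = {sigma:.4f}, CV = {cv:.1f}%")
```

Because the CV is a unitless ratio, it lets the spread of data sets with different units or very different means be compared. For sample data, `statistics.stdev` (which divides by n - 1) would replace `statistics.pstdev`.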
Which of the following shows a strong
negative correlation?
[Figure: four scatter diagrams, options a) to d).]
Critical Thinking
y 22.35 1.60 x
Critical Thinking: Making Predictions