Beruflich Dokumente
Kultur Dokumente
The level of measurement refers to the relationship among the values that are assigned to the
attributes for a
variable. What
does that mean?
Begin with the
idea of the
variable, in this
example "party
affiliation." That
variable has a
number of
attributes. Let's
assume that in
this particular election context the only relevant attributes are "republican", "democrat", and
"independent". For purposes of analyzing the results of this variable, we arbitrarily assign the
values 1, 2 and 3 to the three attributes. The level of measurement describes the relationship
among these three values. In this case, we simply are using the numbers as shorter
placeholders for the lengthier text terms. We don't assume that higher values mean "more" of
something and lower numbers signify "less". We don't assume the the value of 2 means that
democrats are twice something that republicans are. We don't assume that republicans are in
first place or have the highest priority just because they have the value of 1. In this case, we
only use the values as a shorter name for the attribute. Here, we would describe the level of
measurement as "nominal".
First, knowing the level of measurement helps you decide how to interpret the data from that
variable. When you know that a measure is nominal (like the one just described), then you know
that the numerical values are just short codes for the longer names. Second, knowing the level
of measurement helps you decide what statistical analysis is appropriate on the values that
were assigned. If a measure is nominal, then you know that you would never average the data
values or do a t-test on the data.
There are typically four levels of measurement that are defined:
Nominal
Ordinal
Interval
Ratio
In nominal measurement the numerical values just "name" the attribute uniquely. No ordering
of the cases is implied. For example, jersey numbers in basketball are measures at the nominal
level. A player with number 30 is not more of anything than a player with number 15, and is
certainly not twice whatever number 15 is.
In ordinal measurement the attributes can be rank-ordered. Here, distances between attributes
do not have any meaning. For example, on a survey you might code Educational Attainment as
0=less than high school; 1=some high school.; 2=high school degree; 3=some college;
4=college degree; 5=post college. In this measure, higher numbers mean more education. But
is distance from 0 to 1 same as 3 to 4? Of course not. The interval between values is not
interpretable in an ordinal measure.
It's important to recognize that there is a hierarchy implied in the level of measurement idea. At
lower levels of measurement, assumptions tend to be less restrictive and data analyses tend to
be less sensitive. At each level up the hierarchy, the current level includes all of the qualities of
the one below it and adds something new. In general, it is desirable to have a higher level of
measurement (e.g., interval or ratio) rather than a lower one (nominal or ordinal).
Groups (Strata) 4 Time Zones in the U.S. 26 PSU intercollegiate 11 different elementary
(Eastern,Central, teams schools in the local school
Mountain,Pacific) district
Obtain a Simple 500 people from each of the 5 athletes from each of 20 students from each of the
Random Sample 4 time zones the 26 PSU teams 11 elementary schools
Population All people in U.S. All PSU intercollegiate All elementary students in a
athletes local school district
Groups (Clusters) 4 Time Zones in the U.S. 26 PSU intercollegiate 11 different elementary
(Eastern,Central, teams schools in the local school
Mountain,Pacific.) district
Obtain a Simple 2 time zones from the 4 8 teams from the 26 4 elementary schools from
Random Sample possible time zones possible teams the l1 possible elementary
schools
Sample every person in the 2 every athlete on the 8 every student in the 4
selected time zones selected teams selected elementary schools
Each of the three examples that are found in Tables 3.2 and 3.3 were used to illustrate how both
stratified and cluster sampling could be accomplished. However, there are obviously times when one
sampling method is preferred over the other. The following explanations add some clarification about
when to use which method.
With Example 1: Stratified sampling would be preferred over cluster sampling, particularly if the
questions of interest are affected by time zone. For example the percentage of people watching a live
sporting event on television might be highly affected by the time zone they are in. Cluster sampling
really works best when there are a reasonable number of clusters relative to the entire population. In
this case, selecting 2 clusters from 4 possible clusters really does not provide much advantage over
simple random sampling.
With Example 2: Either stratified sampling or cluster sampling could be used. It would depend on
what questions are being asked. For instance, consider the question "Do you agree or disagree that
you receive adequate attention from the team of doctors at the Sports Medicine Clinic when
injured?" The answer to this question would probably not be team dependent, so cluster sampling
would be fine. In contrast, if the question of interest is "Do you agree or disagree that weather
affects your performance during an athletic event?" The answer to this question would probably be
influenced by whether or not the sport is played outside or inside. Consequently, stratified sampling
would be preferred.
With Example 3: Cluster sampling would probably be better than stratified sampling if each
individual elementary school appropriately represents the entire population as in aschool district
where students from throughout the district can attend any school. Stratified sampling could be used
if the elementary schools had very different locations and served only their local neighborhood (i.e.,
one elementary school is located in a rural setting while another elementary school is located in an
urban setting.) Again, the questions of interest would affect which sampling method should be used.
The most common method of carrying out a poll today is using Random Digit Dialing in which a
machine random dials phone numbers. Some polls go even farther and have a machine conduct the
interview itself rather than just dialing the number! Such "robo call polls" can be very biased
because they have extremely low response rates (most people don't like speaking to a machine) and
because federal law prevents such calls to cell phones. Since the people who have landline phone
service tend to be older than people who have cell phone service only, another potential source of
bias is introduced. National polling organizations that use random digit dialing in conducting
interviewer based polls are very careful to match the number of landline versus cell phones to the
population they are trying to survey.
Non-probability Sampling
The following sampling methods that are listed in your text are types of non-probability sampling
that should be avoided:
1. volunteer samples
2. haphazard (convenience) samples
Since such non-probability sampling methods are based on human choice rather than random
selection, statistical theory cannot explain how they might behave and potential sources of bias are
rampant. In your textbook, the two types of non-probability samples listed above are called
"sampling disasters."
Read the article: "How Polls are Conducted" by the Gallup organization available in Canvas.
The article provides great insight into how major polls are conducted. When you are finished reading
this article you may want to go to the Gallup Poll Web site, http://www.gallup.com, and see the
results from recent Gallup polls. Another excellent source of public opinion polls on a wide variety
of topics using solid sampling methodology is the Pew Reserach Center website
at http://www.pewresearch.org When you read one of the summary reports on the Pew site, there is a
link (in the upper right corner) to the complete report giving more detailed results and a full
description of their methodology as well as a link to the actual questionnaire used in the survey so
you can judge whether their might be bias in the wording of their survey.
It is important to be mindful of margin or error as discussed in this article. We all need to remember
that public opinion on a given topic cannot be appropriately measured with one question that is only
asked on one poll. Such results only provide a snapshot at that moment under certain
conditions. The concept of repeating procedures over different conditions and times leads to more
valuable and durable results. Within this section of the Gallup article, there is also an error: "in 95 out
of those 100 polls, his rating would be between 46% and 54%." This should instead say that in an
expected 95 out of those 100 polls, the true population percent would be within the confidence
interval calculated. In 5 of those surveys, the confidence interval would not contain the population
percent.