Beruflich Dokumente
Kultur Dokumente
A. Distribution
B. Determinants
C. Application
1. ____ Compare food histories between persons with Staphylococcus
food poisoning and those without
2. ____ Compare frequency of brain cancer among anatomists with
frequency in general population
3. ____ Mark on a map the residences of all children born with birth
defects within 2 miles of a hazardous waste site
4. ____ Graph the number of cases of congenital syphilis by year for the
country
5. ____ Recommend that close contacts of a child recently reported
with meningococcal meningitis receive Rifampin
6. ____ Tabulate the frequency of clinical signs, symptoms, and
laboratory findings among children with chickenpox in Cincinnati, Ohio
Exercise 1.1
1. B
2. B
3. A
4. A
5. C
6. A
Self-Assessment Quiz
Now that you have read Lesson 1 and have completed the exercises, you
should be ready to take the self-assessment quiz. This quiz is designed to
help you assess how well you have learned the content of this lesson. You
may refer to the lesson text whenever you are unsure of the answer.
A. Who
B. When
C. Where
D. Why
2. In the definition of epidemiology, "determinants" generally includes:
A. Agents
B. Causes
C. Control measures
D. Risk factors
E. Sources
3. Epidemiology, as defined in this lesson, would include which of the
following activities?
D. Spot map
C. Suspect cases
A. Endemic curve
B. Epidemic curve
C. Seasonal trend
D. Secular trend
11. For the cruise ship scenario described in Question 7, if you
suspected that the norovirus may have been transmitted by ice made
or served aboard ship, how might you display "place"?
A. Age of passenger
B. Detailed food history (what person ate) while aboard ship
D. Symptoms
13. When analyzing surveillance data by age, which of the following age
groups is preferred? (Choose one best answer)
A. Experimental
B. Observational
C. Cohort
D. Case-control
E. Clinical trial
15. The Iowa Women's Health Study, in which researchers enrolled
41,837 women in 1986 and collected exposure and lifestyle information
to assess the relationship between these factors and subsequent
occurrence of cancer, is an example of which type(s) of study?
A. Experimental
B. Observational
C. Cohort
D. Case-control
E. Clinical trial
16. British investigators conducted a study to compare measles-mumps-
rubella (MMR) vaccine history among 1,294 children with pervasive
development disorder (e.g., autism and Asperger's syndrome) and 4,469
children without such disorders. (They found no association.) This is an
example of which type(s) of study?
A. Experimental
B. Observational
C. Cohort
D. Case-control
E. Clinical trial
17. A cohort study differs from a case-control study in that:
3. ____ Exposure
Figure 1.27 Natural History of Disease Timeline
Image Description
Source: Smeeth L, Cook C, Fombonne E, Heavey L, Rodrigues LC, Smith
PG, Hall AJ. MMR vaccination and pervasive developmental disorders.
Lancet 2004;364:963–9.
2. A reservoir of an infectious agent can be:
A. An asymptomatic human
B. A symptomatic human
C. An animal
D. The environment
2. Indirect transmission includes which of the following?
A. Droplet spread
B. Mosquito-borne
C. Foodborne
A. Endemic
B. Outbreak
C. Pandemic
D. Sporadic
1. ____ Disease 1: usually 40–50 cases per week; last week, 48 cases
2. ____ Disease 2: fewer than 10 cases per year; last week, 1 case
3. ____ Disease 3: usually no more than 2–4 cases per week; last week,
13 cases
2. A propagated epidemic is usually the result of what type of
exposure?
A. Point source
D. Person-to-person
Use Table 2.16 for Questions 1 and 2, and for Questions 10–13.
Table 2.16 Admitting Clinical Characteristics of Patients with Severe Acute Respiratory
Syndrome — Singapore, March–May, 2003
Lymphocyte
Date of Age How Temp
ID Sex Symptoms† Count (× Outcome
Diagnosis (Years) Acquired (°C)
10 /L)‡
−9
Data Source: Singh K, Hsu L-Y, Villacian JS, Habib A, Fisher D, Tambyah PA. Severe acute respiratory
A. Categorical
B. Continuous
C. Interval
D. Nominal
E. Ordinal
F. Qualitative
G. Quantitative
H. Ratio
1. ____ Sex
2. ____ Age
Image Description
2. Which of the following terms accurately describe the distribution
shown in Figure 2.12?
A. Negatively skewed
B. Positively skewed
E. Asymmetrical
Figure 2.12
Image Description
C. Has half the observations below it and half above it on a log scale
A. 37.35°C
B. 37.9°C
C. 38.0°C
D. 38.35°C
E. 38.5°C
11. The median of the temperatures listed in Table 2.16 is:
A. 37.35°C
B. 37.9°C
C. 38.0°C
D. 38.35°C
E. 38.5°C
12. The mean of the temperatures listed in Table 2.16 is:
A. 37.35°C
B. 37.9°C
C. 38.0°C
D. 38.35°C
E. 38.5°C
13. The midrange of the temperatures listed in Table 2.16 is:
A. 37.35°C
B. 37.9°C
C. 38.0°C
D. 38.35°C
E. 38.5°C
14. In epidemiology, the measure of central location generally preferred
for summarizing skewed data such as incubation periods is the:
A. Mean
B. Median
C. Midrange
D. Mode
15. The measure of central location generally preferred for additional
statistical analysis is the:
A. Mean
B. Median
C. Midrange
D. Mode
16. Which of the following are considered measures of spread?
A. Interquartile range
B. Percentile
C. Range
D. Standard deviation
17. The measure of spread most affected by one extreme value is the:
A. Interquartile range
B. Range
C. Standard deviation
D. Mean
18. The interquartile range covers what proportion of a distribution?
A. 25%
B. 50%
C. 75%
D. 100%
19. The measure of central location most commonly used with the
interquartile range is the:
A. Arithmetic mean
B. Geometric mean
C. Median
D. Midrange
E. Mode
20. The measure of central location most commonly used with the
standard deviation is the:
A. Arithmetic mean
B. Median
C. Midrange
D. Mode
21. The algebraic relationship between the variance and standard
deviation is that:
A. confidence interval
B. error rate
C. standard deviation
D. variance
Answers to Self-Assessment Quiz
1. Line list or line listing. A line listing is a table in which each row
typically represents one person or case of disease, and each column
represents a variable such as ID, age, sex, etc.
2. Sex A, D, F
Age B, G, H
Lymphocyte count B, G, H
Sex is a nominal variable, meaning that its categories have names but not
numerical value. Nominal variables are qualitative or categorical variables.
Age and lymphocyte count are ratio variables because they are both
numeric variable with true zero points. Ratio variables are continuous and
quantitative variables.
3. A. Because the centers of each distribution line up, they have the
same measure of central location. But because each distribution is spread
differently, they have different measures of spread.
4. B, C, E. Right/left skewness refers to the tail of a distribution.
Because the "hump" of this distribution is on the left and the tail is on the
right, it is said to be skewed positively to the right. A skewed distribution is
not symmetrical.
5. C. For a distribution such as that shown in Figure 2.12, with its hump
to the left, the mode will be smaller than either the median or the mean. The
long tail to the right will pull the mean upward, so that the sequence will be
mode < median < mean.
6. B. The mode is the value that occurs most often.
7. C. The median is the value that has half the observations below it
and half above it.
8. D. The mean is the value that is statistically closest to all of the
values in the distribution
9. D. The geometric mean is the value that is statistically closest to all
of the values in the distribution on a log scale.
10. C, E. The mode is the value that occurs most often. A distribution
can have one mode, more than one mode, or no mode. In this distribution,
both 38.0°C and 38.5°C appear 3 times.
11. D. The median is the value that has half the observations below it
and half above it. For a distribution with an even number of values, the
median falls between 2 observations, in this situation between the 7th and
8th values. The 7th value is 38.2°C and the 8th value is 38.5°C, so the median is
the average of those two values, i.e., 38.35°C.
12. C. The mean is the average of all the values. Given 14 temperatures
that sum to 531.6, the mean is calculated as 531.6 ∕ 14, which equals
37.97°C, which should be rounded to 38.0°C.
13. A. The midrange is halfway between the smallest and largest values.
Since the lowest and highest temperatures are 35.1°C and 39.6°C , the
midrange is calculated as 35.1 + 39.6 ∕ 2, or 37.35°C.
14. B. In epidemiology, the measure of central location generally
preferred for summarizing skewed data such as incubation periods is the
median.
15. A. The measure of central location generally preferred for additional
statistical analysis is the mean, which is the only measure that has good
statistical properties.
16. A, C, D, E. Interquartile range, range, standard deviation, and
variance are all measures of spread. A percentile identifies a particular
place on the distribution, but is not a measure of spread.
17. B. The range is the difference between the extreme values on either
side, so it is most directly affected by those values.
18. B. The interquartile range covers the central 50% of a distribution.
19. C. The interquartile range usually accompanies the median, since
both are based on percentiles. The interquartile range covers from the
25th to the 75thpercentile, while the median marks the 50th percentile.
20. A. The standard deviation usually accompanies the arithmetic mean.
21. A. The standard deviation is the square root of the variance.
22. A, D. Use of the mean and standard deviation are usually restricted
to data that are more-or-less normally distributed. Calculation of the
standard deviation requires squaring differences and then taking the square
root, so you need a calculator that has a square-root function.
23. B. Distributions A, B, and C all range from 1 to 39 and have two
central values of 20. Considering the eight values other than the smallest
and largest, distribution C has values close to 20 (from 15 to 25),
Distribution A has values from 10 to 30, and Distribution B has values from 3
to 37. So Distribution B has the broadest spread among the first 3
distributions. Distribution D has larger values than the first 3 distributions
(41–49 rather than 1–39), but they cluster rather tightly around the central
value of 45.
24. A and E. The area from the 2.5th percentile to the 97.5th percentile
includes 95% of the area below the curve, which corresponds to ± 1.96
standard deviations along the x-axis.
25. A. The primary use of the standard error of the mean is in
calculating a confidence interval.
Exercise 3.1
For each of the fractions shown below, indicate whether it is a ratio, a
proportion, a rate, or none of the three.
1. Ratio
2. Proportion
3. Rate
4. None of the above
1. ____ 1.
2. ____ 2.
number of women in State A who died from heart disease in
2004 estimated number of women living in State A on July
1, 2004
3. ____ 3.
4. ____ 4.
number of women in State A who died from cancer (all types) in 2004
5. ____ 5.