
Name_________________________________________Per______Date_____________________

AP Statistics Data Analysis Graphical Displays Review



1. A university instructor created a website for her
organic chemistry course. The students in her class were
encouraged to use the website as an additional resource
for the course. At the end of the semester, the
instructor asked each student how many times he or
she had visited the website and recorded the counts. Based
on the histogram, describe the distribution of
website use.

The distribution of the number of visits to the
course website by each student for the semester is
skewed to the left, with the number of visits ranging
from 1 to 15. The distribution is centered at about
14 visits, with many students visiting 15 times. There is an outlier in the distribution: two students visited the
site only once. The next-smallest number of visits was 8.

2. A Ph.D. candidate is collecting data about women in math careers. She interviewed 200 female mathematicians
and recorded the following data: number of years attending university, math classes taken in high school (algebra,
geometry, etc.), gender of high school math teacher, and high school GPA.
Tell which variables are qualitative and which variables are quantitative. For each numerical variable, state whether
it is continuous or discrete.

Quantitative: Number of years attending university (discrete), high school GPA (continuous)
Qualitative: Math classes taken in high school, gender of high school math teacher.

3. Office workers were asked how long, in minutes, it took them to travel to work one morning. Provided is a table of
their responses. Sketch a stem-and-leaf plot for the data. Without actually calculating the mean or median, would
you expect the mean to be greater than or less than the median? Justify your answer.
Commute Times in Minutes
20 20 20 22 23
24 24 25 27 28
30 32 35 37 41
42 47 48 49 50
52 58 60 65 73

The mean should be greater than the median, because the data are skewed to the right: the long upper tail pulls
the mean up. Mean > Median
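
As a check on the sketch and the reasoning, here is a minimal Python example that builds the stem-and-leaf rows from the 25 commute times and compares the mean with the median. The function name `stem_and_leaf` is illustrative only and is not part of the worksheet.

```python
from collections import defaultdict
from statistics import mean, median

# Commute times in minutes, copied from the table in question 3.
times = [20, 20, 20, 22, 23, 24, 24, 25, 27, 28,
         30, 32, 35, 37, 41, 42, 47, 48, 49, 50,
         52, 58, 60, 65, 73]

def stem_and_leaf(values):
    """Return stem-and-leaf rows (tens digit | ones digits) as strings."""
    leaves = defaultdict(list)
    for v in sorted(values):
        leaves[v // 10].append(v % 10)
    rows = []
    # Walk every stem from smallest to largest so empty stems still appear.
    for stem in range(min(leaves), max(leaves) + 1):
        row = f"{stem} | " + " ".join(str(d) for d in leaves.get(stem, []))
        rows.append(row.rstrip())
    return rows

plot = stem_and_leaf(times)
print("\n".join(plot))
print("Key: 2 | 0 = 20 minutes")
print("mean =", mean(times), " median =", median(times))
```

The leaves thin out toward the larger stems, which is the right skew that places the mean above the median.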



4. A survey was given to all Texas high schools to determine if students favored starting school closer to Labor
Day. The amount of data received was enormous, so the researcher decided to focus on just 5A high schools, like
DPHS. Now that the researcher has the data, he/she will generalize the results to Texas as a state. Identify the
population of interest, the sample, and the branch of statistics described in the last sentence (in context).

Population of interest: All high schools in the state of Texas.
Sample: 5A high schools that returned the survey.
Collecting and organizing the data is part of descriptive statistics. Once the researcher determines his/her results
and generalizes back to all high schools in Texas, this will be inferential statistics.

5. The Kentucky Derby has been run annually since 1900 at
Churchill Downs, Louisville, Kentucky. The distance is 1 1/4 miles.
Since 1900, all winning times have been over 2 minutes, except for
the record time of 1 minute and 59.2 seconds run by Secretariat in
1973. The following graph shows seconds over 2 minutes for all
winning times. There are 98 data values represented. What
percentage of winning times is between 2 minutes 3.15 seconds and
2 minutes 7.15 seconds?

37%
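
The 37% comes from subtracting two cumulative relative frequencies. A minimal Python sketch of that subtraction follows. It assumes that the values labeled on the graph, (.10) through (1.0), pair with the horizontal-axis boundaries in order starting at 1.15 seconds; that pairing is read from the graph labels and is an assumption, and the names `cum_rel_freq` and `share_between` are illustrative.

```python
# Cumulative relative frequencies read from the graph labels.
# Assumption: each label pairs with the axis boundary in order, starting at 1.15 s.
cum_rel_freq = {
    1.15: 0.10,
    3.15: 0.45,
    5.15: 0.72,
    7.15: 0.82,
    9.15: 0.91,
    11.15: 0.97,
    13.15: 1.00,
}

def share_between(lower, upper, table=cum_rel_freq):
    """Proportion of values between two class boundaries (seconds over 2 minutes)."""
    return table[upper] - table[lower]

share = share_between(3.15, 7.15)  # 0.82 - 0.45
print(f"{share:.0%} of winning times")
```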

6. The data below give the cost per ounce (in cents) for 30
shampoos intended for normal hair and 30 shampoos intended for
fine hair.
Normal
79 63 19 9 37 49 20 16 55 69 23 14 9 7 21
44 13 16 23 20 64 28 18 32 81 5 47 50 8 9
Fine
69 9 23 22 8 12 32 12 18 74 19 63 49 37 55
75 44 8 17 11 23 50 65 51 35 14 20 28 8 27

Both the normal and fine shampoo costs have right-skewed distributions, with a tail toward the higher values.
This means most of the shampoos are at the lower costs.
They both have clusters in the 5 to 28 cent range. The fine
shampoo has a range of 67 cents, whereas the normal shampoo has
a range of 76 cents. Key: 0|5 = 5 cents per ounce
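
A short Python sketch for checking the spread and skew of the two shampoo groups, using the 30 + 30 costs listed in question 6. The helper name `summarize` is illustrative and not part of the worksheet.

```python
from statistics import mean, median

# Cost per ounce in cents, copied from the data in question 6.
normal = [79, 63, 19, 9, 37, 49, 20, 16, 55, 69, 23, 14, 9, 7, 21,
          44, 13, 16, 23, 20, 64, 28, 18, 32, 81, 5, 47, 50, 8, 9]
fine = [69, 9, 23, 22, 8, 12, 32, 12, 18, 74, 19, 63, 49, 37, 55,
        75, 44, 8, 17, 11, 23, 50, 65, 51, 35, 14, 20, 28, 8, 27]

def summarize(costs):
    """Return minimum, maximum, range, median, and mean for one group."""
    return {
        "min": min(costs),
        "max": max(costs),
        "range": max(costs) - min(costs),
        "median": median(costs),
        "mean": round(mean(costs), 1),
    }

for label, costs in (("Normal", normal), ("Fine", fine)):
    print(label, summarize(costs))
```

In both groups the mean exceeds the median, which agrees with a tail toward the higher costs.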







7. Below are the fastest speeds driven by statistics students, as
reported on the student surveys. Construct a histogram for
the data. Then CUSS about the data.

165 110 105 90 85 110 120 192 130

105 120 70 70 90 70 109 60 130

125 130 90 95 130 80 110 120 90

90 100 90


The distribution is skewed to the right, meaning more speeds were
reported around the lower values (70-91). The mean of the distribution
is greater than the median. There is a gap in the range of 133-154.
The range of the drivers' speeds is 122, with a maximum reported speed
of 192 and a minimum reported speed of 70.

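Back-to-back stem-and-leaf plot for question 6 (cost per ounce in cents)
      Normal |   | Fine
 9 9 9 8 7 5 | 0 | 8 8 8 9
 9 8 6 6 4 3 | 1 | 1 2 2 4 7 8 9
 8 3 3 1 0 0 | 2 | 0 2 3 3 7 8
         7 2 | 3 | 2 5 7
       9 7 4 | 4 | 4 9
         5 0 | 5 | 0 1 5
       9 4 3 | 6 | 3 5 9
           9 | 7 | 4 5
           1 | 8 |
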
Frequency table for question 7
Class            Frequency   R.F.
 70 ≤ x <  91       11       .367
 91 ≤ x < 112        8       .267
112 ≤ x < 133        8       .267
133 ≤ x < 154        0       .000
154 ≤ x < 175        2       .067
175 ≤ x < 196        1       .033
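
The class counts for the fastest-speed data in question 7 can be tallied with a few lines of Python. This is a sketch under one loudly stated assumption: the data list shows a value of 60, which lies below every class and below the stated minimum of 70, so the code reads that value as 160, the reading under which the counts match the table. The function name `frequency_table` and its parameters are illustrative.

```python
# Fastest speeds from question 7.
# ASSUMPTION: the listed value 60 is read as 160 so that the tallies agree
# with the frequency table (2 values in 154 <= x < 175, none below 70).
speeds = [165, 110, 105, 90, 85, 110, 120, 192, 130,
          105, 120, 70, 70, 90, 70, 109, 160, 130,
          125, 130, 90, 95, 130, 80, 110, 120, 90,
          90, 100, 90]

def frequency_table(values, start=70, width=21, classes=6):
    """Count values in classes [start + k*width, start + (k+1)*width)."""
    rows = []
    for k in range(classes):
        lo, hi = start + k * width, start + (k + 1) * width
        count = sum(lo <= v < hi for v in values)
        rows.append((lo, hi, count, round(count / len(values), 3)))
    return rows

table = frequency_table(speeds)
for lo, hi, count, rel in table:
    print(f"{lo:3d} <= x < {hi:3d}: {count:2d}  {rel:.3f}")
print("range =", max(speeds) - min(speeds))
```

The empty class from 133 to 154 is the gap, and the three values above it form the right tail.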
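[Graph for question 5: cumulative relative frequency plot of winning times. Horizontal axis: Seconds (over 2 minutes), marked at -.85, 1.15, 3.15, 5.15, 7.15, 9.15, 11.15, 13.15. Vertical axis: Cumulative Relative Frequency, marked at .20, .40, .60, .80, 1.0. Values labeled on the graph: (.10), (.45), (.72), (.82), (.91), (.97), (1.0).]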


Free Response
9. The graph below displays the scores of 32 students on a recent exam. Scores on this exam ranged from 64 to
95 points.

a) Describe the shape of this distribution in context of the problem.

The distribution of exam scores is skewed to the left (or toward the lower scores). This could result from the test being harder.

b) In order to motivate her students, the instructor of the class wants to report that, overall, the class's
performance on the exam was high. Which summary statistic, the mean or the median, should the instructor use
to report that overall exam performance was high? Explain.

Since the distribution is skewed toward the lower values, the mean will be pulled in that direction and will fall
below the median. Thus, the instructor should report the median, the higher of the two, to motivate her students.

c) The midrange is defined as $\text{midrange} = \frac{\text{maximum} + \text{minimum}}{2}$. Compute this value using the data.

$\text{midrange} = \frac{64 + 95}{2} = 79.5$



d) Is the midrange considered a measure of center or a measure of spread? Explain.

The midrange is a measure of center. The maximum provides information about the upper tail, more specifically the
upper extreme value. The minimum provides information about the lower tail, more specifically the lower extreme
value. By averaging these two values and creating the midrange, we are creating a statistic that provides the
halfway point between the two extremes.

















10. Most women who have had a mastectomy (removal of breast tissue for medical reasons) can have breast
reconstruction surgery. The reconstruction surgery can be performed at the same time as the mastectomy, known
as an immediate reconstruction, or after the patient has healed from the mastectomy, commonly referred to as a
second surgery reconstruction. The table below shows the percentages of choices regarding reconstruction for
three age categories. A graphical display has been added to help visualize the distribution.


                                           Age
                                 Under 35   35-50   Over 50
Immediate reconstruction            63%      48%      23%
Second surgery reconstruction       31%      34%      41%
No reconstruction                    6%      18%      36%
Total                              100%     100%     100%

a) Use the data to sketch a graphical display for the
data.







b) From your graphical display and the data, does
there appear to be an association between
reconstruction and age? Justify your response.

Yes. A higher percentage of older women, especially
those over 50, who have had mastectomies choose not to
have reconstruction surgery. Likewise, a higher
percentage of younger patients choose to have
immediate reconstruction surgery. It appears that as the age of women who have mastectomies increases, the
importance of having reconstructive surgery decreases.
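
One way to see the association is to compare the three age groups' conditional distributions side by side. The following Python sketch prints a rough text version of a segmented bar chart from the percentages in the table. The names `column_totals` and `segmented_bar` and the scale of one letter per 2 percent are illustrative choices, not part of the worksheet.

```python
# Column percentages copied from the table: one list per choice,
# ordered by age group (Under 35, 35-50, Over 50).
age_groups = ["Under 35", "35-50", "Over 50"]
percent = {
    "Immediate reconstruction": [63, 48, 23],
    "Second surgery reconstruction": [31, 34, 41],
    "No reconstruction": [6, 18, 36],
}

def column_totals(table):
    """Sum each age group's percentages; each should come to 100."""
    return [sum(col) for col in zip(*table.values())]

def segmented_bar(table, index, scale=2):
    """Build one text bar for an age group, one letter per `scale` percent."""
    return "".join(name[0] * (values[index] // scale)
                   for name, values in table.items())

for i, group in enumerate(age_groups):
    print(f"{group:>8} | {segmented_bar(percent, i)}")
print("I = immediate, S = second surgery, N = no reconstruction")
print("column totals:", column_totals(percent))
```

If there were no association, the three bars would be split in roughly the same proportions; here the I segment shrinks and the N segment grows as age increases.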
