Beruflich Dokumente
Kultur Dokumente
Instructions to Candidates:
Attempt ALL questions.
Each question is of equal mark value.
Start your solution to each question on a new page.
To ensure full marks show all the steps in working out your
solution. Marks may be deducted for failure to show appropriate
calculations or formulae.
Unless otherwise stated, use a significance level of 5%.
Selected statistical tables are attached to the back of the
examination paper.
Page 1 of 12
Page 2 of 12
N
200
200
200
200
Mean SE Mean
0.6150 *drip1*
37.52
*drip2*
5.155
*drip3*
3.000
*drip4*
Q1
0.000
28.25
4.000
2.000
Median
1.0000
38.00
5.000
3.000
StDev
0.4878
15.87
1.894
1.607
Q3
1.0000
47.00
7.000
4.000
Minimum
0.000
1.00
1.000
0.000
Maximum
1.0000
81.00
10.000
9.000
ix. The number which should be present at drip2 is (to 3 decimal places)
a. 2.653
b. 0.188
c. 1.122
d. 0.079
x. Based on the above descriptive statistics, which box in the graph below best
represents the variable age?
Page 3 of 12
Data
100
75
50
25
0
Age 1
a.
b.
c.
d.
Age 2
Age 3
Age 4
Age 1
Age 2
Age 3
Age 4.
The doctor is interested in the differences between her male and female patients.
A boxplot is given below of the ages of the patients, split by gender. Use it to
answer question (xi).
Boxplot of Age vs Gender
90
80
70
Age
60
50
40
30
20
10
0
0
1
Gender
Page 4 of 12
Variable
Age
Gender
0
1
N
77
123
Mean
36.09
38.41
SE Mean
1.98
1.34
Variable
Age
xii.
xiii.
xiv.
xv.
StDev
17.37
14.85
Page 5 of 12
The doctor is interested in the proportion of patients (regardless of gender) who have
visited 4 or fewer times over the year. She finds that in her sample of 200, 117
patients have visited 4 or fewer times. Based on this information, a 98% confidence
interval for the population proportion visiting 4 or fewer times in a year is calculated
p (1 p )
.
to be p c
xviii. In the confidence interval formula above, the value p should be replaced by
which of the following numbers?
117
a.
200
83
b.
200
83
c.
117
d. 0.01 .
xix. In the confidence interval formula above, the value c should be replaced by
which of the following numbers?
a. 0.01
b. 0.02
c. 1.96
d. 2.33
xx. In the confidence interval formula above, the value n should be replaced by
which of the following numbers?
a. 117
b. 83
c. 200
d. Not enough information is available to answer this question.
Page 6 of 12
0.5
0.4
0.3
0.2
0.1
0.0
0
6
X
10
12
Variable
X
Y
N
30
30
Mean
3.798
0.1960
Variable
X
Y
Minimum
0.510
0.000
SE Mean
0.629
0.0416
Median
2.045
0.0800
StDev
3.443
0.2279
Maximum
11.040
0.9100
Covariances: X, Y
X
Y
X 11.855989
Y
0.572271
0.051942
A regression is performed in Minitab, but a minor chemical spill has obscured some
of the output.
Page 7 of 12
Coef
0.01268
0.048269
SE Coef
T
0.04355 *spill2*
0.008559
5.64
P
0.773
0.000
Analysis of Variance
Source
Regression
Residual Error
Total
DF
1
28
29
SS
0.80106
0.70526
1.50632
MS
0.80106
0.02519
F
31.80
P
0.000
Unusual Observations
Obs
6
X
8.2
Y
0.9100
Fit
0.4085
SE Fit
0.0475
Residual
0.5015
St Resid
3.31R
Percent
90
50
10
1
99
-2
0
2
Standardized Residual
3
2
1
0
-1
0.00
Frequency
10.0
7.5
5.0
2.5
0.0
-1
0
1
2
Standardized Residual
0.30
Fitted Value
0.45
0.60
0.15
3
2
1
0
-1
8 10 12 14 16 18 20 22 24 26 28 30
Observation Order
Page 8 of 12
Fit
0.1092
0.2058
0.3023
0.3988
SE Fit
0.0328
0.0290
0.0346
0.0462
95%
(0.0420,
(0.1463,
(0.2315,
(0.3042,
CI
0.1764)
0.2652)
0.3731)
0.4934)
95%
(-0.2228,
(-0.1247,
(-0.0304,
( 0.0602,
PI
0.4412)
0.5362)
0.6350)
0.7374)
X
2.00
4.00
6.00
8.00
Page 9 of 12
Page 10 of 12
Another study is performed into non-violent criminals under 20, this time classifying
them by the number of times they have been successfully prosecuted. Upon
examining all records available, the following probability distribution is found to
apply to the group.
Number of
1
2
3
4
Convictions
Probability
0.33
0.34
0.22
0.11
d. (4 marks) Find the mean number of convictions among non-violent criminals
under 20 years of age. Is this an observable value, i.e. is it possible that there will
be a non-violent criminal in this age group with that exact number of convictions?
Does this indicate a problem with the data? Explain your answer.
e. (4 marks) Find the standard deviation of the number of convictions among this
group of criminals.
f. (1 mark) If a single criminal of this group is sampled at random, what is the
probability that he has been convicted 3 or fewer times?
g. (3 marks) A sample of 50 criminals who fit this profile is taken. What is the
probability that the average number of times they have been convicted is 3 or
fewer?
Page 11 of 12
,
40 x < 55
150
f ( x) =
.
x
60
, 55 x < 60
50
0,
60 x
i. (3 marks) Draw a graph of f(x), clearly marking all axes and points of
interest.
ii. (2 marks) Is f(x) a probability distribution? Explain why or why not.
iii. (2 marks) What proportion of the time will the lecturer take less than 45
minutes for a lecture?
iv. (3 marks) What is the probability that a lecture runs for longer than 50
minutes?
(b)One of the lecturers colleagues, called Professor Good (to preserve anonymity),
with a much better reputation for time management of lectures, claims that the
length of his lectures is best represented by a uniform variable with positive
probability between 47 and 52 minutes.
i. (1 mark) Draw a graph representing the probability distribution of the
length of a lecture given by Professor Good, clearly marking all axes and
points of interest.
ii. (1 mark) Find the expected length of a randomly selected lecture given by
Professor Good.
iii. (1 mark) Find the standard deviation of length of a lecture given by
Professor Good.
iv. (2 marks) Find the probability that a lecture given by Professor Good runs
for longer than 50 minutes.
v. (1 mark) Find the probability that a lecture given by Professor Good takes
exactly 50 minutes.
vi. (4 marks) If a random sample of 50 lectures given by Professor Good is
timed, find the probability that the sample average is over 50 minutes.
Page 12 of 12