Beruflich Dokumente
Kultur Dokumente
Statistics Assignment
Question 1
Figure 1
Figure 2
The frequency distribution table was created by arranging the data into separate
classes and determining their ranges. Then the midpoint was calculated for each
class by averaging the lower and higher band of their range:
Classh+Classl2
=COUNTIF(A3:A102,”>=118.1”)-COUNTIF(A3:A102,”>121”)
b) The data set was inputted onto a spreadsheet in order to produce the
histogram and calculate the following mean.
The mean can be calculated by inputting the following formula into the
spreadsheet cell:
=AVERAGE(A3:A102)
The mean can also be calculated from the histogram graph by summing the
products of each bar’s frequency and midpoint (area) and then divide by the sum
of the frequencies:
101.55 15 1523.25
104.55 11 1150.05
107.55 14 1505.7
110.55 19 2100.45
113.55 12 1362.6
116.55 14 1631.7
119.55 15 1793.25
Total 11067
Figure 3
c) First method to determine the mode of the data using the spreadsheet
functions is by inputting the following into a cell:
=MODE(A3:A102)
This produces the result 118.2. The mode represents the most commonly
occurring number in the data set which can also be calculated manually. We can
also calculate the mode from the histogram by drawing a diagonal line from the
upper corners of the highest block to adjacent block upper corners on opposite
sides shown below:
crude method and not very accurate compared to the calculated method using
the raw data.
Figure 4
A cumulative data plot represents all the resistors in ascending order. From this
we can determine the mode from identifying the longest horizontal section with
no gradient in the trend as follows in figure 5:
Figure 5
=MEDIAN(B3:B102)
This returns the value of 110.95. This represents the middle value of the data set
range once the values have been sorted. There are 100 resistor values in the
range so the median is between the 50th and the 51st value which are 110.9 and
111.0 respectively. To calculate this manually we average the two values as
follows:
110.9+111.02=110.95
To determine the median from the histogram a vertical line is drawn that divides
the total area of the histogram into two even parts. The total area of the
histogram is 3 x 100 = 300 units. Halving this produces 150 which gives us the
unit area required on either side of the median. To achieve this the largest
rectangle must be split so that
This corresponds that the median lies at approximately 111. This calculation is
slightly less accurate than the first method.
Question 2
84 – 88 4
89 – 93 10
94 – 98 24
99 – 103 34
104 – 108 28
109 – 113 12
114 – 118 6
119 – 123 2
Figure 6
To achieve this another table is formed which details midpoint x, xc, frequency f,
xcf and xc2f with a step size of 5.
x xc f xcf xc2f
86 -3 4 -12 36
91 -2 10 -20 40
96 -1 24 -24 24
101 0 34 0 0
106 1 28 28 28
111 2 12 24 48
116 3 6 18 54
121 4 2 8 32
Student Ref: 20177530
Figure 7
From figure 7 we can calculate the mean of xc using the following formula:
c=xcff=22120=0.183
=101+0.183×5=101.917
σc=xc2ff- c2=262120-0.1832=2.15=1.466
Then multiply this by the step increment value of 5 to find the standard
deviation:
σ=1.466×5=7.331
This value of 7.331 represents the average distance that the values in the data
set lay away from the mean which entails that this set has a large dispersion and
variability of values.
The mean , was calculated as 101.917, and the standard deviation to be 7.331
which is the same as the previously calculated values.
Student Ref: 20177530
Question 3
(q+p)n=qn+nqn-1p+n(n-1)2!qn-2p2+nn-1(n-2)3!qn-3p3+…
qn=0.9750=0.218
nqn-1p=50×0.9749×0.03=0.337
qn+nqn-1p=(0.9750)+(50×0.9749×0.03)=0.218+0.337=0.555
1-qn+nqn-1p=1-[(0.9750)+(50×0.9749×0.03)]=1-0.555=0.445
e-λ(1+λ+λ22!+λ33!+…)
λ=np=250×0.01=2.5
e-2.51=0.082
λe-λ=2.5e-2.5=0.205
λ2e-λ2!=2.52×e-2.52×1=0.257
1-(e-λ+λe-λ+λ2e-λ2!)=1-0.082+0.205+0.257=0.456
Binomial:
(q+p)n=qn+nqn-1p+n(n-1)2!qn-2p2+nn-1(n-2)3!qn-3p3+…
n(n-1)2!qn-2p2=10(10-1)2×1×0.910-2×0.12=0.194
Poisson:
λ=np=10×0.1=1
e-λ(1+λ+λ22!+λ33!+…)
λ2e-λ2!=12e-12×1=0.184
Question 4
=1.12+1.15+1.10+1.14+1.15+1.10+1.117=1.1243Ωm-1
σ=(1.12-1.1243)2+(1.15-1.1243)2+(1.10-1.1243)2+…
7=0.002971437=0.0206Ωm-1
With the mean and standard deviation calculated, we must find z, the normal
standard variate.
z=99÷2100=0.495
Using the following table Partial areas under the standardised normal curve in
figure 8, z has been identified by the red square which equates to 2.57.
±zσn
1.124+2.57×0.02067=1.144 Ωm-1
1.124-2.57×0.02067=1.104Ωm-1
Student Ref: 20177530
2
(shaded area = p)
±tcsN-1=1.1243±3.14(0.0206)7-1=1.1243±0.0264
1.1243+0.0264=1.151Ωm-1
1.1243-0.0264=1.098Ωm-1
This indicates that there is a 99% chance that the true specific resistance of the
wire lies between 1.151Ωm-1 and 1.098Ωm-1.
Student Ref: 20177530
Student Ref: 20177530