Beruflich Dokumente
Kultur Dokumente
www.curriculumpress.co.uk
April 2003
Number 122
standard deviation =
means "sum of", so x2 means "square each value then add them up"
chi-squared test
t-test
Step 1. Find out the midpoint of each class, by adding its endpoints
and dividing by two. Add it to the table. Call this column "x"
Step 2. Add another column, and put in it the values of
x number of individuals (f)
Step 3. mean =
mean =
Number of
individuals (f)
6
11
10
8
4
60+143+160+152+88
6+11+10+8+4
xf
x
(9 + 11)
(12 + 14)
(15 + 17)
(18 + 20)
(21 + 23)
2
2
2
2
2
=
=
=
=
=
10
13
16
19
22
x2 =22 + 52 + 62 + 72 + 82 = 178
5 = 35.6
x2 /n = 178
5 = 5.6 Mean2 = 5.62 = 31.36
Mean = (2 + 5 + 6 + 7 + 8)
2
2
x /n mean = 35.6 31.36 = 4.24
standard deviation = 4.24 = 2.0591
x 2 mean2
n
60
143
160
152
88
= 15.46
eg. Find the standard deviation of the following data
Length
Number of
(nearest cm) individuals (f)
9 - 11
6
12 - 14
11
15 - 17
10
18 - 20
8
21 - 23
4
xf
10
13
16
19
22
60
143
160
152
88
x2 f
600
1859
2560
2888
1936
Bio Factsheet
www.curriculumpress.co.uk
The mean, of course, is the average - but that does not mean half the values
are below and half above it, or that it is a common value. For example, the
mean of the values 1, 1, 2, 3, 100 is 21.4; this is nowhere near any of the actual
values, and four out of the five values are below it!
Degrees of freedom: you do not need to know the exact meaning, although
you do need to know how to calculate them (see below). The idea is that
the amount of data you have affects the critical value - this is because you
are much more likely to get unusual results by chance if you only have a few
observations, than if you have a lot of observations.
The mean also does not distinguish betwee these two data sets:A: 48, 49, 50, 51, 52
B: 35, 40, 50, 62, 63
Both sets of data have mean 50, but they are not very similar.
This is where the standard deviation comes in. This measures how spread
out the data are - the bigger the standard deviation, the greater the spread.
For example, for data set A above, the standard deviation is 1.414, and for
set B, it is 11.296.
You then need to relate this back to the original hypotheses; this will be
discussed in more detail for each test.
The remainder of the section is divided between the chi-squared test and the
t-test.
We know that data set 2 is more spread out than data set 1. Let's consider
which would be more likely to have a value in it above 50, say.
For data set 1, 50 is more than 2 standard deviations away from the mean
(45.2 + 2 2.13 = 49.46)
For data set 2, 50 is less than 1 standard deviation away from the mean
(43.7 + 10.03 = 53.73).
This tells us that 50 is a less "extreme" or "uncommon" value for data set
2 than for data set 1. So data set 2 is more likely to have values above 50.
Chi-squared test
There are two types main types of chi-squared test you may have to do:
a) Testing to see if there is a difference
b) Testing to see if the theoretical ratios predicted by genetics apply
The hypotheses for the tests are
a) H0: there is no difference between the different conditions
H1: there is a difference between the different conditions
Statistical tests
In the exam, you will always be told which statistical test to use if you are
being required to do calculations. You will be given any tables you need.
There are various types of questions: understanding statistical terms like degrees of freedom, significance, etc
2 =
(O E- E)
a) To calculate expected values when you are testing for a difference, you
just add up all the values and divide by the number of them.
b) To calculate expected values for genetics, you have to use the genetic
ratio. The procedure is:
i) Add up all the values from the data you are given
ii) Add up all the numbers in the genetic ratio
(eg for 9:3:3:1, do 9 + 3 + 3 + 1 = 16)
This tells you the number of parts you will be dividing your total
from i) into.
iii) Find out how much one part is, by dividing your total from i) by your
total from ii)
iv) Find out the expected frequencies, by multiplying one part by the
numbers in the ratio (eg by 9, 3, 3 and 1)
Test statistic: this is the value calculated from your data. The formula for
it depends on the test you are doing.
Critical value: this is the value you compare the test statistic to, to decide
whether you are going to accept or reject the null hypothesis.
For both t-test and chi-squared test, you reject the null hypothesis if your
test statistic is greater than the critical value.
Critical values come from statistical tables.
Once you have calculated the expected frequencies, you substitute into the
formula above to find the chi-squared value.
Bio Factsheet
www.curriculumpress.co.uk
Using statistical tables
All you have to do is to read down to find the number of degrees of freedom
you have, and across to find the significance level (usually 5% = 0.05).
chi-squared tables
t-table
df
1
2
3
4
0.10
2.71
4.61
6.25
7.78
0.05
3.84
5.99
7.81
9.49
0.025
0.01
5.02
6.63
7.38
9.21
9.35 11.34
11.14 13.23
0.005
7.88
10.60
12.84
14.86
df
7
8
9
10
11
Significance level
0.1
0.05
1.895
2.365
1.860
2.306
1.833
2.262
1.812
2.228
1.796
2.201
0.01
3.499
3.355
3.250
3.169
3.106
t-test
There are two types of t-test, paired and unpaired. The exam will always
make it clear which you should do. You will always be given the relevant
formulae.
Common mistakes
These are some of the commonest errors candidates make:-
Rounding errors, due to rounding too early. If in doubt, use all the
figures.
It is useful to keep figures in your calculator, to avoid having to keep
writing down and re-entering data. Learn how to use your calculator
memory.
Failure to show working - hence throwing away all the marks if there
is even one tiny error in calculation.
The paired t-test first requires you to find the differences between each pair
of values. You then work with these differences only.
x (n -1)
paired t-test: t =
s
t =
Exam questions will get you to do these calculations bit by bit and "follow
through" marks are likely to be awarded - so if you calculate s wrong, for
example, but use your value correctly to calculate the value of t, then you
will get the rest of the marks.
Calculator Tips:To carry out any calculation that is set out as a fraction, you
must put brackets round the top and round the bottom.
It is probably easier to work out the number inside the squareroot first, then take the square root, rather than trying to do it all
in one go.