Beruflich Dokumente
Kultur Dokumente
John T. Ormerod
1. Probability on Sets
Formulae: P(A B) = P(A) + P(B) P(A B) If A and B are independent then P(A B) = P(A) P(B), otherwise A and B are dependent and P(A B) = P(A) P(B). If A and B are mutually exclusive then P(A B) = 0. Bayes rule: P(A|B) = P(A B) P(B) and P(B|A) = P(A B) . P(A)
Know the denitions of medical jargon in terms of probabilities, e.g. Sensitivity P(S+ |D+ ). Specicity P(S |D ). False Negative P(S |D+ ) False Positive P(S+ |D ) PV + P(D+ |S+ ). PV P(D |S ). Prevalence P(D+ ).
3. Calculate PV +
Calculate P(D+ |S+ ): You will be given the formula for P(D+ |S+ ): PV + = P(D+ |S+ ) P(S+ |D+ )P(D+ ) = . P(S+ |D )P(D ) + P(S+ |D+ )P(D+ ) You will need to read the problem description, and from the problem description, identify the constituent parts of the formula. A little algebra might be needed, e.g. know, for example, P(S+ |D ) = 1 P(S |D ) and P(D ) = 1 P(D+ ).
5. Normal Probabilities
Normal Probabilities: X N(, 2 ), X is continuous. E(X) = and Var(X) = 2 . P(X = x) = 0 (But impossible things happen all the time!) Many equivalent statements P(X x) = NORMDIST(x, , , 1) and if Z =
X
in EXCEL
then
T=
i=1
Xi N(n, n 2 )
and X =
1 n i=1
Xi N ,
2 n
7. Approximating Distributions
Poisson Distribution on different time lengths: If the rate of occurrence is Poisson() per unit interval, and if W counts the responses over an interval of length t then W Poisson(t). Approximating a Binomial Distribution by a Poisson Distribution: If X Binomial(n, p), and if n is large and p is small, a good approximate variable is Y Poisson(), where = np, and so P(X r) P(Y r).
8. Approximating Distributions
Approximating the distribution of total or mean by a Normal Distribution, i.e. the central limit theorem: If E(X) = , Var(X) = 2 and n is large then
n n
T=
i=1
Xi N(n, n )
and X =
1 n i=1
Xi N ,
2 n
9. Condence Intervals
s n
where P(|Z| z ) = 1 , P(|tn1 | t ) = 1 and is typically 5%, e.g. P(|Z| 1.96) = 0.95. Note z = ABS(NORMSINV(/2)) and t = ABS(TINV(, n 1)). Also: Interpretation of condence intervals.
10. p-values
Denition: The p-value is the probability of observations at least extreme of unusual as actually observed. Also, the p-value is calculated assuming that H0 is true. Interpretation: Small p-values (< 0.05), for example a p-value of 0.01 means either 1. or 2. is true (but we cant tell which):
1. H0 is true and the observed sample is improbable. 2. H0 is not true.
The smaller the p-value, the stronger the evidence against H0 in favour of HA .
Given a problem description: Select an appropriate null and alternative hypothesis. Select an appropriate test statistic for the problem (and know its distribution), i.e. choose the correct test. State the EXCEL command to calculate the p-value. Given the p-value draw a conclusion (again, interpret the p-value in relation to the hypothesis). Note that test statistics and their distributions are in the formula sheet. Too many tests to go through now. Consult the lecture notes.
Relative risk: Risk of the disease given a risk factor divided by the risk of the disease without the risk factor. Formula: P(D+ |S+ ) a(c + d) RR = = . + |S ) P(D c(a + b) Only makes sense if data from a prospective study or from a sample of completed records. Odds ratio: Many denitions. For the general 2 2 table all come down to: ad OR = . bc Can be calculated regardless of the type of study used.
The presence of a symptom (S+ ) is used to diagnose the presence of a certain disease (D+ ). The probability, P(D |S ) is known as: (a) sensitivity (b) specicity (c) PV (This one is correct) (d) odds ratio (e) relative risk.
If the prevalence of an infection is 0.0003, the probability of at most 2 cases in a random sample of 10000 is, (using the Poisson approximation): (a) P (Y < 3), with Y Poisson(3), (This one is correct) (b) P (Y 3), with Y Poisson(3), (c) =1-POISSON(0,3,1), (in EXCEL) (d) =POISSON(0,3,1)+POISSON(1,3,1)+POISSON(2,3,1), (in EXCEL) (e) none of these.
A p-value of 0.01 means: (a) there is 1% chance H0 true, (b) there is 1% chance H1 true, (c) the data are consistent with H0, (d) there is evidence against H0, (This one is correct) (e) none of these.