Beruflich Dokumente
Kultur Dokumente
Definition
In statistics, a null hypothesis is a statement that one seeks to nullify with
evidence to the contrary. Most commonly it is a statement that the phenomenon
being studied produces no effect or makes no difference. An example of a null
hypothesis is the statement "This diet has no effect on people's weight." Usually,
an experimenter frames a null hypothesis with the intent of rejecting it: that is,
intending to run an experiment which produces data that shows that the
phenomenon under study does make a difference.[2] In some cases there is a
specific alternative hypothesis that is opposed to the null hypothesis, in other
cases the alternative hypothesis is not explicitly stated, or is simply "the null
hypothesis is false" in either event, this is a binary judgment, but the
interpretation differs and is a matter of significant dispute in statistics.
A type I error (or error of the first kind) is the incorrect rejection of a true null
hypothesis. Usually a type I error leads one to conclude that a supposed effect or
relationship exists when in fact it doesn't. Examples of type I errors include a test
that shows a patient to have a disease when in fact the patient does not have the
disease, a fire alarm going on indicating a fire when in fact there is no fire, or an
experiment indicating that a medical treatment should cure a disease when in
fact it does not.
A type II error (or error of the second kind) is the failure to reject a false null
hypothesis. Examples of type II errors would be a blood test failing to detect the
disease it was designed to detect, in a patient who really has the disease; a fire
breaking out and the fire alarm does not ring; or a clinical trial of a medical
treatment failing to show that the treatment works when really it does.[3]
In terms of false positives and false negatives, a positive result corresponds to
rejecting the null hypothesis, while a negative result corresponds to failing to
reject the null hypothesis.
When comparing two means, concluding the means were different when in reality
they were not different would be a Type I error; concluding the means were not
different when in reality they were different would be a Type II error. Various
extensions have been suggested as "Type III errors", though none have wide
use.
All statistical hypothesis tests have a probability of making type I and type II
errors. For example, all blood tests for a disease will falsely detect the disease in
some proportion of people who don't have it, and will fail to detect the disease in
some proportion of people who do have it. A test's probability of making a type I
error is denoted by . A test's probability of making a type II error is denoted by .
These error rates are traded off against each other: for any given sample set, the
effort to reduce one type of error generally results in increasing the other type of
2
error. For a given test, the only way to reduce both error rates is to increase the
sample size, and this may not be feasible.
These terms are also used in a more general way by social scientists and others
to refer to flaws in reasoning.[4] This article is specifically devoted to the statistical
meanings of those terms and the technical issues of the statistical errors that
those terms describe.
The rate of the type II error is denoted by the Greek letter (beta) and related to
the power of a test (which equals 1).
Table of error types
Tabularised relations between truth/falseness of the null hypothesis and
outcomes of the test:[2]
Examples[edit]
Example 1[edit]
Hypothesis: "Adding water to toothpaste protects against cavities."
Null hypothesis: "Adding water to toothpaste has no effect on cavities."
This null hypothesis is tested against experimental data with a view to nullifying it
with evidence to the contrary.
A type I error occurs when detecting an effect (adding water to toothpaste
protects against cavities) that is not present. The null hypothesis is true (i.e., it is
true that adding water to toothpaste has no effect on cavities), but this null
hypothesis is rejected based on bad experimental data.
Example 2
Hypothesis: "Adding fluoride to toothpaste protects against cavities."
Null hypothesis: "Adding fluoride to toothpaste has no effect on cavities."
This null hypothesis is tested against experimental data with a view to nullifying it
with evidence to the contrary.
4
Etymology[edit]
In 1928, Jerzy Neyman (18941981) and Egon Pearson (18951980), both
eminent statisticians, discussed the problems associated with "deciding whether
or not a particular sample may be judged as likely to have been randomly drawn
from a certain population"[8]p. 1: and, as Florence Nightingale David remarked, "it is
necessary to remember the adjective 'random' [in the term 'random sample']
should apply to the method of drawing the sample and not to the sample itself".[9]
They identified "two sources of error", namely:
(a) the error of rejecting a hypothesis that should have been accepted, and
(b) the error of accepting a hypothesis that should have been rejected.[8]p.31
In 1930, they elaborated on these two sources of error, remarking that:
...in testing hypotheses two considerations must be kept in view, (1) we
must be able to reduce the chance of rejecting a true hypothesis to as low
a value as desired; (2) the test must be so devised that it will reject the
hypothesis tested when it is likely to be false.[10]
5