Sie sind auf Seite 1von 2

12/8/2017 GraphPad - FAQ 1383 - Why use n-1 when calculating a standard deviation?

Cart SEARCH
(/ecommerce/)
Sign In (/myaccount)

Home » Support

Frequently Asked Questions


SEARCH FOR FAQ

Search

 Why use n-1 when calculating a standard deviation?


FAQ# 1383 Last Modified 22-March-2009

How ito calculate the standard deviation

1. Compute the square of the difference between each value and the sample mean.

2. Add those values up.

3. Divide the sum by n-1. This is called the variance.

4. Take the square root to obtain the Standard Deviation.

Why n-1?

Why divide by n-1 rather than n in the third step above? In step 1, you compute the difference between each value and the mean
of those values. You don't know the true mean of the population; all you know is the mean of your sample. Except for the rare
cases where the sample mean happens to equal the population mean, the data will be closer to the sample mean than it will be
to the true population mean. So the value you compute in step 2 will probably be a bit smaller (and can't be larger) than what it
would be if you used the true population mean in step 1. To make up for this, divide by n-1 rather than n.v This is called Bessel's
correction (http://en.wikipedia.org/wiki/Bessel's_correction).

But why n-1? If you knew the sample mean, and all but one of the values, you could calculate what that last value must be.
Statisticians say there are n-1 degrees of freedom.

When should the SD be computed with a denominator of n?

Statistics books often show two equations to compute the SD, one using n, and the other using n-1, in the denominator. Some
calculators have two buttons.

The n-1 equation is used in the common situation where you are analyzing a sample of data and wish to make more general
conclusions. The SD computed this way (with n-1 in the denominator) is your best guess for the value of the SD in the overall
population.

https://www.graphpad.com/support/faqid/1383/ 1/2
12/8/2017 GraphPad - FAQ 1383 - Why use n-1 when calculating a standard deviation?

If you simply want to quantify the variation in a particular set of data, and don't plan to extrapolate to make wider conclusions,
then you can compute the SD using n in the denominator. The resulting SD is the SD of those particular values. It makes no
sense to compute the SD this way if you want to estimate the SD of the population from which those points were drawn. It only
makes sense to use n in the denominator when there is no sampling from a population, there is no desire to make general
conclusions.

The goal of science is always to generalize, so the equation with n in the denominator should not be used. The only example I
can think of where it might make sense is in quantifying the variation among exam scores. But much better would be to show a
scatterplot of every score, or a frequency distribution histogram.

GraphPad Prism and InStat always compute the SD with the n-1 denominator.

Need to learn
Prism 7?
These guided examples of common analyses will get you off to a great start! CLICK HERE > (/scientific-software/prism/?
tab=5)

(/data-analysis-resource-center/)

On-site training
LEARN MORE > (/support/on-site-training)

https://www.graphpad.com/support/faqid/1383/ 2/2

Das könnte Ihnen auch gefallen