
BOX PLOT

Definition

A box and whisker plot (sometimes called a box plot) is a graph that presents
information from a five-number summary. It does not show a distribution in as
much detail as a stem and leaf plot or histogram does, but is especially useful
for indicating whether a distribution is skewed and whether there are potential
unusual observations (outliers) in the data set. Box and whisker plots are also
very useful when large numbers of observations are involved and when two or
more data sets are being compared.

Why Use a Box and Whisker Plot?

Box and whisker plots are very effective and easy to read. They
summarize data from multiple sources and display the results in a single
graph. Box and whisker plots allow for comparison of data from different
categories for easier, more effective decision-making.

When to Use a Box and Whisker Plot

Use box and whisker plots when you have multiple data sets from
independent sources that are related to each other in some way.
Examples include test scores across schools or classrooms, data from
before and after a process change, similar features on one part such as
camshaft lobes, or data from duplicate machines manufacturing the same
products.

Parts of a Box Plot

In a box and whisker plot:

• the ends of the box are the upper and lower quartiles, so the box spans
the interquartile range
• the median is marked by a line inside the box (a vertical line when the
box plot is drawn horizontally)
• the whiskers are the two lines outside the box that extend to the highest
and lowest observations that are not outliers.

Steps In Making a Box Plot

1. Find the five-number summary of your data set:

The minimum is the smallest value in the data set, and the maximum is
the largest value in the data set. Use the following steps to find the 25th
percentile (known as Q1), the 50th percentile (the median), and the 75th
percentile (Q3).

1.1 Order all the values in the data set from smallest to largest.

1.2 Multiply k percent by the total number of values in the data set, n
(that is, compute k/100 × n). The result is known as the index.

1.3 If the index obtained in Step 1.2 isn’t a whole number, round it up to
the nearest whole number and go to Step 1.4a. If the index obtained in
Step 1.2 is a whole number, go to Step 1.4b.

1.4 Choose one of the following.

a. Count the values in your data set from left to right (from the smallest
to the largest value) until you reach the number indicated by Step 1.3. The
corresponding value in your data set is the kth percentile.

b. Count the values in your data set from left to right (smallest to largest)
until you reach the number indicated by Step 1.2. The kth percentile is the
average of that corresponding value in your data set and the value that
directly follows it.
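
The counting rule in Steps 1.1 to 1.4 can be sketched in Python. This is a
minimal illustration under stated assumptions, not a standard library routine:
the function name kth_percentile and the sample values are made up for the
example, and k is assumed to be a whole number between 1 and 99. Statistical
software often uses other interpolation rules, so its quartiles may differ
slightly from the ones this rule produces.

```python
def kth_percentile(values, k):
    """Return the kth percentile using the counting rule from Steps 1.1-1.4.

    Assumes k is a whole number with 0 < k < 100 and values is non-empty.
    """
    if not values or not 0 < k < 100:
        raise ValueError("need a non-empty data set and 0 < k < 100")
    ordered = sorted(values)                      # Step 1.1: order the values
    # Step 1.2: index = (k / 100) * n, kept as quotient and remainder
    # so that the whole-number check in Step 1.3 is exact.
    whole, remainder = divmod(k * len(ordered), 100)
    if remainder:
        # Steps 1.3 and 1.4a: round up and take value number whole + 1
        # (list positions start at 0, so that is ordered[whole]).
        return ordered[whole]
    # Step 1.4b: average value number `whole` and the value that follows it.
    return (ordered[whole - 1] + ordered[whole]) / 2


# Hypothetical sample data, chosen only to exercise both branches.
sample_data = [1, 3, 4, 6, 7, 8, 10, 12, 15, 20]
five_number_summary = (
    min(sample_data),
    kth_percentile(sample_data, 25),   # Q1
    kth_percentile(sample_data, 50),   # median
    kth_percentile(sample_data, 75),   # Q3
    max(sample_data),
)
print(five_number_summary)  # (1, 4, 7.5, 12, 20)
```

For Q1 the index is 2.5, which rounds up to 3, so Q1 is the third value (4).
For the median the index is exactly 5, so the fifth and sixth values (7 and 8)
are averaged.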

2. Create a vertical (or horizontal) number line whose scale includes the
values in the five-number summary and is marked off in appropriate units
at equal intervals.

3. Mark the location of each value in the five-number summary just above
the number line (for a horizontal boxplot) or just to the right of the number
line (for a vertical boxplot).

4. Draw a box around the marks for the 25th percentile and the 75th
percentile.

5. Draw a line in the box where the median is located.

6. Determine whether or not outliers are present.

To make this determination, calculate the interquartile range (IQR) by
subtracting Q1 from Q3 (IQR = Q3 – Q1); then multiply the IQR by 1.5. Add
this amount to the value of Q3 and subtract this amount from Q1. This gives
you a wider boundary around the median than the box does. Any data
points that fall outside this boundary are considered outliers.

7. If there are no outliers (according to your results of Step 6), draw lines
from the upper and lower edges of the box out to the minimum and
maximum values in the data set.

8. If there are outliers (according to your results of Step 6), indicate their
location on the boxplot with * signs.

Instead of drawing a line from the edge of the box all the way to the most
extreme outlier, stop the line at the last data value that isn’t an outlier.
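
Steps 6 to 8 can be sketched in Python as follows. This is an illustrative
sketch only: the function name whiskers_and_outliers and the sample values are
hypothetical, and Q1 and Q3 are assumed to have been found already in Step 1.

```python
def whiskers_and_outliers(values, q1, q3):
    """Apply Steps 6-8: flag outliers and find where each whisker ends."""
    iqr = q3 - q1                      # Step 6: interquartile range
    lower_bound = q1 - 1.5 * iqr       # boundary below the box
    upper_bound = q3 + 1.5 * iqr       # boundary above the box
    # Points that fall outside the boundary are treated as outliers.
    outliers = sorted(v for v in values if v < lower_bound or v > upper_bound)
    inside = [v for v in values if lower_bound <= v <= upper_bound]
    # Steps 7-8: each whisker stops at the most extreme value that is not an
    # outlier (the minimum and maximum themselves when there are no outliers).
    return min(inside), max(inside), outliers


# Hypothetical data: nine moderate values plus one extreme value, 30.
# With the percentile rule from Step 1, Q1 = 4 and Q3 = 12 for this set.
sample_data = [1, 3, 4, 6, 7, 8, 10, 12, 15, 30]
low_whisker, high_whisker, outliers = whiskers_and_outliers(
    sample_data, q1=4, q3=12
)
print(low_whisker, high_whisker, outliers)  # 1 15 [30]
```

Here the IQR is 8, so the boundary runs from -8 to 24. The value 30 lies
outside it and would be marked with a * sign, and the upper whisker stops
at 15.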

Advantages of Using a Box Plot

1. Handles Large Data Easily

Because it is built on the five-number summary, a box plot can present a
summary of a large amount of data. A box plot consists of the median,
which is the middle value of the ordered data; the upper and lower
quartiles, which are the values below which three quarters and one quarter
of the data fall; and the minimum and maximum data values. Organizing
data around these five values is an efficient way of dealing with data sets
that are too large for other graphs, such as line plots or stem and leaf
plots.

2. Summarizing

A box plot gives a clear visual summary of one or more sets of data. It is
particularly useful for quickly summarizing and comparing different sets of
results from different experiments. At a glance, a box plot displays the
distribution of results and provides indications of symmetry within the
data.
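
One rough way to read symmetry off a box plot is to compare the two halves of
the box on either side of the median. The sketch below assumes that heuristic;
the function name box_symmetry, the tolerance parameter, and the sample
quartiles are illustrative choices rather than a standard rule, and the
whisker lengths would be worth comparing as well.

```python
def box_symmetry(q1, median, q3, tolerance=0.1):
    """Describe box symmetry by comparing the two halves of the box.

    tolerance is a hypothetical cutoff: the allowed difference between the
    two halves, expressed as a fraction of the interquartile range.
    """
    lower_half = median - q1           # width of the box below the median
    upper_half = q3 - median           # width of the box above the median
    iqr = q3 - q1
    if iqr == 0 or abs(upper_half - lower_half) <= tolerance * iqr:
        return "roughly symmetric"
    if upper_half > lower_half:
        return "skewed toward higher values"
    return "skewed toward lower values"


# Hypothetical quartiles for three different data sets.
print(box_symmetry(4, 8, 12))     # roughly symmetric
print(box_symmetry(4, 7.5, 12))   # skewed toward higher values
print(box_symmetry(4, 10, 12))    # skewed toward lower values
```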

3. Outliers

A box plot is one of very few statistical graph methods that show outliers.
There might be one outlier or multiple outliers within a set of data, and
they can occur below the lower whisker, above the upper whisker, or both.
An outlier is an unusual result that can be detected by extending the
whiskers from the quartiles to a maximum of 1.5 times the interquartile
range. Any data values that fall beyond that reach are considered
outliers, which are easy to see on a box plot.

Disadvantages of Using a Box Plot

1. Exact Values Not Retained

The drawback of condensing such large amounts of data into a box plot is
that the exact values and details of the distribution of results are not
retained. A box plot shows only a simple summary of the distribution of
results, so that it can be quickly viewed and compared with other data. For
a more thorough, detailed analysis of data, a box plot should be used in
combination with another statistical graph method, such as a histogram.

