Beruflich Dokumente
Kultur Dokumente
GRAPHS USED IN
BIOSTATISTICS
NIDHI
MSB/17/154
NIDHI
MSB/17/154
What is a “GRAPH” ?
• A graph commonly consists of two axes called the x-axis (horizontal) and y-axis
(vertical). Each axis corresponds to one variable. The axes are labeled with different
names, such as Price and Quantity.
• The place where the two axes intersect is called the origin. The origin is also
identified as the point (0,0).
Common use Of Graphs.
• Graphs provide a powerful way of summarising data and presenting them in a way that
most people find easy to read.
• Why use graphs to present data?
Because they...
-are quick and direct.
-highlight the most important facts.
-facilitate understanding of the data.
-can convince readers.
-can be easily remembered.
• Statistical graphics are useful tools for exploring data, aiding inference and
communicating results
– Display large data coherently
– Maximize the ability to detect unusual features
– Facilitate communication with: regulators, investigators, collaborators, upper
Management, etc.
…
Cont
Types of Graphs
• The choice of an appropriate graph or chart depends on what information you are
trying to convey.
• There are many different types of graphs used in Biostatistics that can be used to
convey information, including:
• BAR GRAPH,
• HISTOGRAMS,
• PIE CHARTS,
• BOX PLOT,
• SCATTER PLOT,
• CUMULATIVE DISTRIBUTION CURVE,
• DENDOGRAM.
1. BAR GRAPH
• A bar chart consists of a grid and some vertical or horizontal columns (bars).
Each column represents quantitative data.
• Bar Graph is useful when there is a numerical comparison.
• A bar graph is very similar to a Histograms but is used for Categorical data.
• Rectangular bars are used to represent categories of data, and the height of each
bar is proportional to the frequency or relative frequencies.
• All the bars are drawn with same width, and the bars can be presented either
vertically or horizontally.
• Bar graphs enable comparison across several categories, making it easy to
identify frequently and infrequently occurring categories.
2. HISTOGRAMS
• Histograms are yet another graphic way of presenting data to show the distribution of
the observations.
• It is one of the most common forms of graphical presentation of a frequency
distribution.
• A histogram is constructed by representing the measurements or observations that are
grouped on a horizontal scale, the interval frequencies on a vertical scale, and drawing
rectangles whose bases equal the class intervals and whose heights are determined by
the corresponding class frequencies.
• No regular spaces between the bars. Histograms are used to identify the general shape
of distribution of the data.
3. BOX PLOT
• A box plot displays summary statistics for the distribution of values for a variable.
• The outer bounds of the box represent the first and third quartiles.
• The line inside the box represents the median.
• The markers outside the box, referred to as outliers, represent data points that are
outside of the 25th and 75th percentiles.
• Box plots reveal the main features of a batch of data, i.e. how the data are spread out.
• Any boxplot is a graph of the five-number summary:
-MINIMUM
-1ST QUARTILE
-MEDIAN
-3RD QUARTILE
-MAXIMUM
with suspected outliers plotted individually.
The boxplot consists of a rectangular box, which represents the middle half of all
scores (between Q1 and Q3).
• Approximately one-fourth of the values should fall between the minimum and Q1,
and approximately one-fourth should fall between Q3 and the maximum.
• Lines called whiskers extend from the box out to the minimum and maximum
scores that are not possible outliers.
• If an observation falls more than 1.5x IQR outside of the box, it is plotted
individually as an outlier.
Box-and-whisker plots
Whisker Box Whisker
0 10 20 30 40 50
Minimum
value (17) Maximum
Median (36) value (48)
Lower Quartile –
median of all numbers Upper Quartile – median
in the list to the left of of all numbers to the right
the median of the median
(25+30)/2 = 27.5 (40+40)/2 = 40
4. SCATTERPLOT
• A cumulative distribution plot may need a footnote and additional text in the
body of the label describing how to read the graph.
• For example, the following text could accompany the graph shown above:
“Approximately 50% of the patients in each group had a decrease of at least 2
mg/dL at endpoint.”
• This graph shows the percentage of subjects (y-axis) attaining a change from baseline
less than or equal to the value on the x-axis. A curve that shifts to the left indicates a
better response.
6. Dendrogram