The science of data
What is data?
Information, in the form of facts or figures obtained from experiments or surveys, used as a basis for making calculations or drawing conclusions
Encarta dictionary
Statistics in Science
Qualitative
Quantitative
Qualitative Data
• Information that relates to characteristics or description (observable qualities)
• Information is often grouped by descriptive category
• Examples
– Species of plant
– Type of insect
– Shades of color
– Rank of flavor in taste testing
Remember: qualitative data can be “scored” and evaluated numerically
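As a rough illustration of scoring, the sketch below maps descriptive taste-test ratings to numbers and averages them. The rating labels, the 1 to 5 scale, and the sample ratings are all hypothetical; the slides do not specify a scoring scheme.

```python
from statistics import mean

# Hypothetical 1-5 scale for a taste test (labels and scores are assumptions).
SCORES = {"poor": 1, "fair": 2, "good": 3, "very good": 4, "excellent": 5}

def score_ratings(ratings):
    """Convert descriptive ratings into numeric scores."""
    return [SCORES[r] for r in ratings]

# Made-up ratings from five tasters.
ratings = ["good", "excellent", "fair", "good", "very good"]
numeric = score_ratings(ratings)
average = mean(numeric)

print(numeric)   # the scored ratings
print(average)   # their arithmetic mean
```

Once scored this way, the category data can be summarized with the same tools used for quantitative data, keeping in mind that the numbers only reflect the chosen scale.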
Qualitative data, manipulated numerically
[Figure: calculated distance, average value]
What do error bars suggest?
• If the bars show extensive overlap, it is likely that there is not a significant difference between those values
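The overlap check can be sketched in a few lines. This assumes each error bar spans the mean plus or minus one sample standard deviation, which the slides do not state (bars may also show standard error or a confidence interval). The group values are made up, and overlap is only a rough visual guide, not a formal significance test.

```python
from statistics import mean, stdev

def error_bar(values):
    """Return (low, high) for mean +/- 1 sample standard deviation (an assumption)."""
    m, s = mean(values), stdev(values)
    return m - s, m + s

def bars_overlap(a, b):
    """True if the error-bar intervals of the two groups overlap."""
    low_a, high_a = error_bar(a)
    low_b, high_b = error_bar(b)
    return low_a <= high_b and low_b <= high_a

# Hypothetical measurements for three groups.
group_a = [4.8, 5.1, 5.0, 5.3, 4.9]
group_b = [5.0, 5.2, 4.7, 5.4, 5.1]
group_c = [7.9, 8.2, 8.0, 8.3, 8.1]

print(bars_overlap(group_a, group_b))  # similar groups
print(bars_overlap(group_a, group_c))  # clearly separated groups
```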
Quick Review – 3 measures of “Central Tendency”
• mode: value that appears most frequently
• median: When all data are listed from least to greatest, the value at which half of the observations are greater, and half are lesser.
• The most commonly used measure of central tendency is the mean, or arithmetic average (sum of data points divided by the number of points)
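The three measures above can be computed with Python's standard `statistics` module. The leaf lengths below are hypothetical values chosen only for illustration.

```python
from statistics import mean, median, mode

# Hypothetical leaf lengths in cm.
leaf_lengths = [4, 5, 5, 6, 7, 9]

most_common = mode(leaf_lengths)    # value that appears most frequently
middle = median(leaf_lengths)       # midpoint of the sorted data
average = mean(leaf_lengths)        # sum of points / number of points

print(most_common, middle, average)
```

With an even number of observations, as here, `median` returns the average of the two middle values.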
How can leaf lengths be displayed graphically?
Simply measure the length of each leaf and plot how many leaves are of each length
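Counting how many leaves fall at each length is the core of a histogram. A minimal sketch, assuming lengths are rounded to whole centimeters and using made-up measurements:

```python
from collections import Counter

# Hypothetical leaf lengths, rounded to the nearest cm.
leaf_lengths_cm = [5, 6, 6, 7, 7, 7, 8, 8, 9]

# Tally how many leaves have each length.
counts = Counter(leaf_lengths_cm)

# Print a simple text histogram, one row per length.
for length in sorted(counts):
    print(f"{length} cm | {'#' * counts[length]}")
```

For real measurements with decimals, the lengths would first be grouped into bins of equal width before counting.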
If smoothed, the histogram data assumes this shape
This Shape?
• Is a classic bell-shaped curve, AKA Gaussian Distribution Curve, AKA a Normal Distribution curve.
• OR…. In Microsoft Excel, type the following formula into the cell where you want the Standard Deviation result, using the "unbiased," or "n-1" method:
=STDEV(A1:A30)
(substitute the cell name of the first value in your dataset for A1, and the cell name of the last value for A30.)
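For readers working outside Excel, the same "n-1" calculation can be sketched in Python. The function name `sample_stdev` and the data are illustrative assumptions; the standard library's `statistics.stdev` uses the same n-1 divisor, while `statistics.pstdev` divides by n.

```python
from math import sqrt
from statistics import pstdev, stdev

def sample_stdev(values):
    """Standard deviation with the n-1 ("unbiased") divisor."""
    n = len(values)
    m = sum(values) / n
    squared_deviations = sum((x - m) ** 2 for x in values)
    return sqrt(squared_deviations / (n - 1))

# Hypothetical dataset.
data = [2, 4, 4, 4, 5, 5, 7, 9]

print(sample_stdev(data))  # n-1 method, written out by hand
print(stdev(data))         # standard library, also n-1
print(pstdev(data))        # population method, divides by n
```

The n-1 result is always slightly larger than the divide-by-n result, and the gap shrinks as the dataset grows.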
• For today……….