Beruflich Dokumente
Kultur Dokumente
www.BIPM.org
The Big Picture
CHAPTER 3: Methodology
CHAPTER 5: Findings
Lecture #1
Analyse
To examine (something) methodically and in detail,
typically in order to explain and interpret it.
1. Data Collection
2. Data Presentation
3. Data Interpretation
Overview
The better the record that has been made of what has
been done, the easier will be the task of presenting the
work.
9
1. Data Collection
Significant figures
• Significant: “having a particular meaning; indicative of
something.”
o “meaningful”
10
1. Data Collection
Significant figures
• Writing a value as 6.12 is to give it to three significant
figures, and to write it as 6.124 is to give it to four
significant figures.
• Significant figures are the figures that lie between the first
non-zero figure and the last figure inclusive.
11
Check Your Neighbour
How many significant figures appear in the following numbers?
A. 1.654 ?
B. 0.00437 ?
C. 64 000 ?
D. 1.20 ?
E. 0.100 007 38 ?
12
Check Your Neighbour
How many significant figures appear in the following numbers?
Answer:
A. 1.654 Four
B. 0.00437 ?
C. 64 000 ?
D. 1.20 ?
E. 0.100 007 38 ?
13
Check Your Neighbour
How many significant figures appear in the following numbers?
Answer:
A. 1.654 Four
B. 0.00437 Three
C. 64 000 ?
D. 1.20 ?
E. 0.100 007 38 ?
14
Check Your Neighbour
How many significant figures appear in the following numbers?
Answer:
A. 1.654 Four
B. 0.00437 Three
C. 64 000 Two
D. 1.20 ?
E. 0.100 007 38 ?
15
Check Your Neighbour
How many significant figures appear in the following numbers?
Answer:
A. 1.654 Four
B. 0.00437 Three
C. 64 000 Two
D. 1.20 Three
E. 0.100 007 38 ?
16
Check Your Neighbour
How many significant figures appear in the following numbers?
Answer:
A. 1.654 Four
B. 0.00437 Three
C. 64 000 Two
D. 1.20 Three
17
Check Your Neighbour
How many significant figures are implied by the way the
following numbers are written?
A. 3.24 ?
B. 0.0023 ?
C. 83 400 ?
D. 1.010 ?
E. 10.5 ?
18
Check Your Neighbour
How many significant figures are implied by the way the
following numbers are written?
Answer:
A. 3.24 Three
B. 0.0023 ?
C. 83 400 ?
D. 1.010 ?
E. 10.5 ?
19
Check Your Neighbour
How many significant figures are implied by the way the
following numbers are written?
Answer:
A. 3.24 Three
B. 0.0023 Two
C. 83 400 ?
D. 1.010 ?
E. 10.5 ?
20
Check Your Neighbour
How many significant figures are implied by the way the
following numbers are written?
Answer:
A. 3.24 Three
B. 0.0023 Two
C. 83 400 Three
D. 1.010 ?
E. 10.5 ?
21
Check Your Neighbour
How many significant figures are implied by the way the
following numbers are written?
Answer:
A. 3.24 Three
B. 0.0023 Two
C. 83 400 Three
D. 1.010 Four
E. 10.5 ?
22
Check Your Neighbour
How many significant figures are implied by the way the
following numbers are written?
Answer:
A. 3.24 Three
B. 0.0023 Two
C. 83 400 Three
D. 1.010 Four
E. 10.5 Three
23
1. Data Collection
Significant figures & scientific notation
• It is not always clear how many figures in a number are
significant.
26
Check Your Neighbour
Give the following numbers in scientific notation to four
significant figures:
Answer:
# Number Scientific notation
A. 0.005 654 2 5.654 × 10−3
B. 125.04 ?
C. 93 842 773 ?
D. 3 400 042 ?
E. 0.000 000 100 092 ?
27
Check Your Neighbour
Give the following numbers in scientific notation to four
significant figures:
Answer:
# Number Scientific notation
A. 0.005 654 2 5.654 × 10−3
B. 125.04 1.250 × 102
C. 93 842 773 ?
D. 3 400 042 ?
E. 0.000 000 100 092 ?
28
Check Your Neighbour
Give the following numbers in scientific notation to four
significant figures:
Answer:
# Number Scientific notation
A. 0.005 654 2 5.654 × 10−3
B. 125.04 1.250 × 102
C. 93 842 773 9.384 × 107
D. 3 400 042 ?
E. 0.000 000 100 092 ?
29
Check Your Neighbour
Give the following numbers in scientific notation to four
significant figures:
Answer:
# Number Scientific notation
A. 0.005 654 2 5.654 × 10−3
B. 125.04 1.250 × 102
C. 93 842 773 9.384 × 107
D. 3 400 042 3.400 × 106
E. 0.000 000 100 092 ?
30
Check Your Neighbour
Give the following numbers in scientific notation to four
significant figures:
Answer:
# Number Scientific notation
A. 0.005 654 2 5.654 × 10−3
B. 125.04 1.250 × 102
C. 93 842 773 9.384 × 107
D. 3 400 042 3.400 × 106
E. 0.000 000 100 092 1.001 × 10−7
31
Check Your Neighbour
Give the following numbers in scientific notation to two
significant figures:
32
Check Your Neighbour
Give the following numbers in scientific notation to two
significant figures:
Answer:
# Number Scientific notation
A. 0.005 654 2 5.7 × 10−3
B. 125.04 1.3 × 102
C. 93 842 773 9.4 × 107
D. 3 400 042 3.4 × 106
E. 0.000 000 100 092 1.0 × 10−7
33
1. Data Collection
Significant figures & calculations
• If you are required to perform a calculation in which the
uncertainties in the quantities are not known, the following
rules are useful:
34
1. Data Collection
Significant figures & calculations
• Rule 2: When adding or subtracting numbers: round the
result of the calculation to the least number of decimal
places as contained in the quantities involved.
35
Check Your Neighbour
Write down the results of the following calculations to an
appropriate number of significant figures:
# Calculation Answer
A. 1.2 × 8 ?
B. 13.0 × 43.23 ?
C. 0.0104 × 0.023 ?
D. 33 + 435.5 ?
E. 14.1 ÷ 76.3 ?
F. 105.55 – 34.2 ?
36
Check Your Neighbour
Write down the results of the following calculations to an
appropriate number of significant figures:
Answer:
# Calculation Answer
A. 1.2 × 8 10
B. 13.0 × 43.23 562
C. 0.0104 × 0.023 0.000 24
D. 33 + 435.5 469
E. 14.1 ÷ 76.3 0.185
F. 105.55 – 34.2 71.4
37
1. Data Collection
True value, accuracy and precision
• Measuring a quantity is an attempt to find an estimate of the
‘true value’ of that quantity.
39
1. Data Collection
Uncertainties in measurements
• Example: variation of electrical resistance with temperature
of a copper wire.
44
1. Data Collection
Single measurement: calibration uncertainty
• The instruments that you use should have been calibrated
at some time against a standard. For the calibration to
remain valid, the instrument must be checked regularly.
45
1. Data Collection
Repeat measurements
• To be able to get a real feel for the variability in
measurement, more than one measurement should be
made for each quantity.
46
1. Data Collection
The mean
• Example: Times for an object to fall 25 m
Time of fall (s) 0.64 0.61 0.63 0.53 0.59 0.65 0.60 0.61 0.64 0.71
• We could expect the time that it really took for the object to
fall to lie somewhere between two extreme measured
values, namely between 0.53 s and 0.71 s.
σ 𝑥𝑖
𝑥ҧ =
𝑛
range
Uncertainty in mean =
n
49
1. Data Collection
Uncertainty in the mean
• Example: Speed of sound in air at 20°C
Speed (m/s) 341.5 342.4 342.2 345.5 341.1 338.5 340.3 342.7
53
1. Data Collection
Fractional and percentage uncertainty
• In some cases you may be required to state the ratio
uncertainty in quantity
quantity
1. Systematic uncertainties
2. Random uncertainties
1. Offset uncertainty
2. Gain uncertainty
55
1. Data Collection
Offset uncertainty
• Example: Melting point of water using thermocouple
Temp. (°C) -7.5 -7.3 -6.9 -7.4 -7.7 -7.6 -7.6 -7.3 -7.6
60
1. Data Collection
Standard deviation (SD)
• If 𝑥𝑖 represents an 𝑖th data value in a set of 𝑛 repeated
measurements, and 𝑥ҧ the mean of the data values, then the
standard deviation, 𝜎 is given by
σ(𝑥𝑖 − 𝑥)ҧ 2
𝜎=
𝑛
61
1. Data Collection
Standard deviation (SD)
• Example: Time for a body to slide down a plane.
Time (s) 0.64 0.64 0.59 0.58 0.70 0.61 0.68 0.55 0.57 0.63
Vol. (mL) 33 45 43 42 45 42 41 44 40 42
• In summary:
66
1. Data Collection
Population and sample
• From this perspective, the SD is the estimate of the
population SD and is calculated using
σ(𝑥𝑖 − 𝑥)ҧ 2
𝜎=
𝑛−1
68
1. Data Collection
Combining uncertainties
• Example: Consider a function 𝑉 = 𝑉(𝑎, 𝑏), where 𝑎 and 𝑏
have uncertainties ∆𝑎 and ∆𝑏, respectively.
𝜕𝑉 𝜕𝑉
∆𝑉 = ∆𝑎 + ∆𝑏
𝜕𝑎 𝜕𝑏
2 2
𝜕𝑉 𝜕𝑉
𝜎𝑉ഥ = 2
𝜎𝑎ത + 𝜎𝑏ത2
𝜕𝑎 𝜕𝑏
70
2. Data Presentation
Overview
71
2. Data Presentation
x-y graphs
• Pictorial representation of data in a graph is a good way to
summarise many important features of the experiment.
74
2. Data Presentation
Linear x-y graphs
• Linear graphs have an important place in the analysis of
experimental data for the following reasons:
76
2. Data Presentation
Linear x-y graphs
i. No two people draw the same ‘best’ line through a
given data set.
ii. If the uncertainty in each data point is different, how do
we take this into account when drawing the best line?
iii. Drawing the best line is difficult for largely-scattered
data.
iv. Finding the uncertainties in m and c is cumbersome.
77
2. Data Presentation
Least squares method
i. We assume that any random uncertainty in data values is
confined to measurements made of the y-quantity.
78
2. Data Presentation
Least squares method
Observed
experimentally
Calculated:
𝑦𝑖𝑐 = 𝑚𝑥𝑖 + 𝑐
79
2. Data Presentation
Least squares method
• For a particular value of x, labelled 𝑥𝑖 , there are observed
(𝑦𝑖𝑜 ) and calculated (𝑦𝑖𝑐 ) values of y.
• The best position for the line, and therefore the best values
for m and c, is found by minimising the sum of the square of
the residuals.
80
2. Data Presentation
Least squares method
• Writing SS for sum of squares, we say:
= σ𝑛1 (∆𝑦𝑖 )2
𝑆𝑆 = [𝑦𝑖𝑜 − 𝑚𝑥𝑖 + 𝑐 ]2
81
2. Data Presentation
Least squares method
• We seek values of m and c that reduce SS to the smallest
possible value. Those are the best values for gradient and
intercept.
𝑥𝑖 (𝑦𝑖𝑜 −𝑚𝑥𝑖 − 𝑐) = 0
and:
(𝑦𝑖𝑜 −𝑚𝑥𝑖 − 𝑐) = 0
82
2. Data Presentation
Least squares method
• The above equations can be expanded and combined to
give the following equations for m and c:
𝑛 σ 𝑥𝑖 𝑦𝑖 − σ 𝑥𝑖 σ 𝑦𝑖
𝑚= 2
𝑛 σ 𝑥𝑖 2 − (σ 𝑥𝑖 )
and:
σ 𝑥𝑖 2 σ 𝑦𝑖 − σ 𝑥𝑖 σ 𝑥𝑖 𝑦𝑖
𝑐= 2
2
𝑛 σ 𝑥𝑖 − (σ 𝑥𝑖 )
83
2. Data Presentation
Least squares method
• The ‘o’ from the subscript of y has been omitted from the
observed values of y that appear in the equations.
1
𝜎𝑛2
𝜎𝑚 = 1
2 2
𝑛 σ 𝑥𝑖 2 − (σ 𝑥𝑖 )
and:
1
𝜎 σ 𝑥𝑖 2 2
𝜎𝑐 = 1
2 2
𝑛 σ 𝑥𝑖 2 − (σ 𝑥𝑖 )
85
2. Data Presentation
Least squares method
• The 𝜎 is the uncertainty in each y-value of the data point.
1
2
1
𝜎= (𝑦𝑖 −𝑚𝑥𝑖 − 𝑐)2
𝑛−2
86
2. Data Presentation
Weighting the fit
• ‘Weighted’ least squares fitting is used for situations in
which the uncertainties in the y-values vary from point to
point.
87
2. Data Presentation
Weighting the fit
• Let
2 2
1 𝑥𝑖 𝑥𝑖
∆= 2 2 − 2
𝜎𝑖 𝜎𝑖 𝜎𝑖
1 𝑥𝑖 𝑦𝑖 𝑥𝑖 𝑦𝑖
σ 2 σ 2 −σ 2 σ 2
𝜎𝑖 𝜎𝑖 𝜎𝑖 𝜎𝑖
𝑚=
∆
88
2. Data Presentation
Weighting the fit
𝑥2 𝑦𝑖 𝑥𝑖 𝑥𝑖 𝑦𝑖
σ 2 σ 2 −σ 2 σ 2
𝜎𝑖 𝜎𝑖 𝜎𝑖 𝜎𝑖
𝑐=
∆
• The equations for 𝜎𝑚 and 𝜎𝑐 are:
1
1 2
σ 2
𝜎𝑖
𝜎𝑚 =
∆
89
2. Data Presentation
Weighting the fit
1
𝑥𝑖 2 2
σ 2
𝜎𝑖
𝜎𝑐 =
∆
• With the obvious good amount of work required in applying
the foregoing equations, it can be of great assistance to use
a computer spreadsheet, e.g. Microsoft Excel.
90
2. Data Presentation
Histograms
• A histogram (or bar chart) is useful for displaying data
from repeat measurements.
o N = 50
o 𝑁 = 50 = 7.1 = 7
95
2. Data Presentation
Histograms
• Step 4: Divide R by the number calculated in step 3 to give
the width of each interval
96
2. Data Presentation
Histograms
1 0.51 – 0.53 4
2 0.54 – 0.56 5
3 0.57 – 0.59 11
4 0.60 – 0.62 15
5 0.63 – 0.65 9
6 0.66 – 0.68 4
7 0.69 – 0.71 2
2. Data Presentation
Histograms
50 readings of time for body to slide down incline
16
14
12
10
Frequency
2 4
0
1 2 3 4 5 6 7
98
Intervals of time (s)
2. Data Presentation
Histograms
50 readings of time for body to slide down incline
16
14
12
10
Frequency
2 5
4
0
1 2 3 4 5 6 7
99
Intervals of time (s)
2. Data Presentation
Histograms
50 readings of time for body to slide down incline
16
14
12
10
Frequency
6
11
4
2 5
4
0
1 2 3 4 5 6 7
100
Intervals of time (s)
2. Data Presentation
Histograms
50 readings of time for body to slide down incline
16
14
12
10
Frequency
8
15
6
11
4 9
2 5
4 4
2
0
1 2 3 4 5 6 7
101
Intervals of time (s)
2. Data Presentation
Histograms
• The histogram displays the following features:
3. There are few values that lie far from the mean.
DATA
2. Data Presentation
Histograms
• Example of data presented as histogram.
137Cs Spectrum
800
600
Counts
400
200
0
542
576
46
79
112
145
178
211
245
278
311
344
377
410
443
476
509
609
642
675
708
741
774
807
104
Energy (keV)
2. Data Presentation
Distributions
• In most experiments, as one increases the number of
measurements, the histogram takes on some definite,
continuous curve.
8
6
4
2
0
1 2 3 4 5 6 7
Intervals of time (s)
2. Data Presentation
Distributions
• A limiting distribution defines some function f(x).
108
2. Data Presentation
Distributions
• The mean of any number is the sum of all different values,
𝑥𝑖 , each weighted by the fraction of times it is obtained,
𝑥ҧ = 𝑥𝑖 𝐹𝑖
𝑖
109
2. Data Presentation
Distributions
• We can also calculate the standard deviation, 𝜎𝑥 , for the
measurements
∞
2
𝜎𝑥 = න (𝑥 − 𝑥)ҧ 2 𝑓 𝑥 𝑑𝑥
−∞
−(𝑥−𝑋) 2 /2𝜎 2
𝑓 𝑥 = 𝑒
111
2. Data Presentation
The Normal Distribution
• The 𝜎 is called the width parameter.
∞
• To satisfy −∞ 𝑓 𝑥 𝑑𝑥 = 1, this function becomes:
1 −(𝑥−𝑋) 2 /2𝜎 2
𝑓 𝑥 = 𝑒
𝜎 2𝜋
𝑥ҧ = 𝑋 and 𝜎𝑥 2 = 𝜎 2
112
2. Data Presentation
The Normal Distribution
• In other words, if we make a large (but finite) number of
trials, then our average, 𝑥,ҧ will be close to X.
119
2. Data Presentation
Reduced 𝝌𝟐
• The number 𝒅 in a statistical calculation is the number of
observed data minus the number of parameters computed
from the data and used in the calculation.
• Therefore,
𝑑 =𝑛−𝑐
𝜒 2 = 𝜒 2 /𝑑
122
2. Data Presentation
Probabilities for 𝝌𝟐
• We can calculate the probability of obtaining a value of
𝜒 2 as large as, or larger than, our observed value 𝜒𝑜 2 (where
the subscript o stands for “obtained”).
𝑃(𝜒 2 ≥ 𝜒𝑜 2 )
123
2. Data Presentation
Probabilities for 𝝌𝟐
• If the probability is high, then our value 𝜒𝑜 2 is perfectly
acceptable, and there is no reason to reject our expected
distribution.
124
2. Data Presentation
Probabilities for 𝝌𝟐
• With the boundary at 5 percent, we would say that our
observed value 𝜒𝑜 2 indicates a “significant disagreement” if
𝑃 𝜒 2 ≥ 𝜒𝑜 2 < 5%
125
2. Data Presentation
Probabilities for 𝝌𝟐
• The probabilities 𝑃 𝜒 2 ≥ 𝜒𝑜 2 are calculated from the
integral
∞
2 2 2 𝑑−1 −𝑥 2 /2
𝑃𝑑 (𝜒 ≥ 𝜒0 ) = 𝑑/2 න 𝑥 𝑒 𝑑𝑥
2 𝛤(𝑑/2) 𝜒𝑜
126
2. Data Presentation
Probabilities for 𝝌𝟐
The percentage 𝑃 𝜒 2 ≥ 𝜒𝑜 2 of obtaining a value 𝜒 2 greater than or equal to 𝜒𝑜 2 .
𝟎𝟐
𝝌
d
0 0.25 0.5 0.75 1.0 1.25 1.5 1.75 2.0 3.0 4.0 5.0 6.0
40
𝑥𝑖
best estimate of 𝑋 = 𝑥ҧ = = 𝟕𝟑𝟎. 𝟏 𝐜𝐦
40
𝑖=1
σ 𝑥𝑖 − 𝑥ҧ 2
best estimate of 𝜎 = = 𝟒𝟔. 𝟖 𝐜𝐦
39
132
2. Data Presentation
Example
• Suppose we have reason to believe these measurements
are governed by a Gaussian distribution 𝑓𝑋,𝜎 𝑥 .
40
𝑥𝑖
best estimate of 𝑋 = 𝑥ҧ = = 𝟕𝟑𝟎. 𝟏 𝐜𝐦
40
𝑖=1
σ 𝑥𝑖 − 𝑥ҧ 2
best estimate of 𝜎 = = 𝟒𝟔. 𝟖 𝐜𝐦
39
133
2. Data Presentation
Example
• We divide the range of possible 𝑥 values into bins, with bin
boundaries at 𝑋 – 𝜎, 𝑋, and 𝑋 + 𝜎.
Observations
Bin # Interval (cm)
Ok in bin
1 x<X–σ x < 683.3 8
2 X–σ<x<X 683.3 < x < 730.1 10
3 X<x<X+σ 730.1 < x < 776.9 16
4 X+σ<x 776.9 < x 6
Expected Observed
Bin # Interval (cm) Probability Pk
Ek = NPk Ok
1 x<X–σ 16 % 6.4 8
2 X–σ<x<X 34 % 13.6 10
3 X<x<X+σ 34 % 13.6 16
4 X+σ<x 16 % 6.4 6
135
2. Data Presentation
Example
• We now calculate
𝑛
(𝑂 − 𝐸 )2
𝑘 𝑘
𝜒2 =
𝐸𝑘
𝑘=1
𝜒 2 = 𝜒 2 /𝑑 136
2. Data Presentation
Example
• Here there were three constraints and hence only one
degree of freedom,
𝑑 =𝑛−𝑐 =4−3=1
𝜒 2 1.80
2
𝜒 = = = 1.80
𝑑 1
138
2. Data Presentation
Summary: Chi-squared Test
• If we make 𝑛 measurements for which we know, or can
calculate, the expected values and the standard deviations,
then we define 𝜒 2 as
𝑛 2
observed value − expected value
𝜒2 =
standard deviation
1
139
2. Data Presentation
Summary: Chi-squared Test
• The expected number 𝐸𝑘 is determined by the assumed
distribution of 𝑥, and the standard deviation is 𝐸𝑘 .
𝑛
𝑂𝑘 − 𝐸𝑘 2
2
𝜒 =
𝐸𝑘
𝑘=1
140
2. Data Presentation
Summary: Chi-squared Test
• If we were to repeat the whole experiment many times, the
mean value of 𝜒 2 should be equal to 𝑑, the number of
degrees of freedom, defined as
𝑑 =𝑛−𝑐
𝜒 2 = 𝜒 2 /𝑑
141
2. Data Presentation
Summary: Chi-squared Test
• If assumed distribution is correct, 𝜒 2 should be of order 1.
143
3. Data Interpretation
Overview
• The question is: what can be usefully said with the data
that were gathered?
144
3. Data Interpretation
• An experiment is likely to contain many details, both major
and minor.
145
3. Data Interpretation
• Even if the experimental method used could have been
improved, we should be not be too dismissive of data that
were obtained in an experiment.
• The question is: what can be usefully said with the data
that were gathered, despite the shortcomings?
• What was the aim of the experiment, and how far did the
experiments performed go in achieving that aim?
146
3. Data Interpretation
147