Beruflich Dokumente
Kultur Dokumente
TERMINOLOGY
Replication repetition of a basic
experiment without changing any factor
settings, allows the experimenter to estimate
the experimental error (noise) in the system
used to determine whether observed
differences in the data are real or just
noise, allows the experimenter to obtain
more statistical power (ability to identify
small effects)
6
TERMINOLOGY
.Randomization a statistical tool used to minimize
potential uncontrollable biases in the experiment by
randomly assigning material, people, order that
experimental trials are conducted, or any other
factor not under the control of the experimenter.
Results in averaging out the effects of the
extraneous factors that may be present in order to
minimize the risk of these factors affecting the
experimental results.
7
TERMINOLOGY
Blocking technique used to increase the
precision of an experiment by breaking the
experiment into homogeneous segments
(blocks) in order to control any potential
block to block variability (multiple lots of
raw material, several shifts, several
machines, several inspectors). Any effects
on the experimental results as a result of the
blocking factor will be identified and
minimized.
8
TERMINOLOGY
Confounding - A concept that basically means that
multiple effects are tied together into one parent
effect and cannot be separated. For example,
1. Two people flipping two different coins would
result in the effect of the person and the effect of
the coin to be confounded
2. As experiments get large, higher order
interactions (discussed later) are confounded with
lower order interactions or main effect.
TERMINOLOGY
Factors experimental factors or
independent variables (continuous or
discrete) an investigator manipulates to
capture any changes in the output of the
process. Other factors of concern are those
that are uncontrollable and those which are
controllable but held constant during the
experimental runs.
10
TERMINOLOGY
Responses dependent variable measured
to describe the output of the process.
Treatment Combinations (run)
experimental trial where all factors are set
at a specified level.
11
TERMINOLOGY
PLANNING A DOE
Everyone involved in the experiment should
have a clear idea in advance of exactly what
is to be studied, the objectives of the
experiment, the questions one hopes to
answer and the results anticipated
13
PLANNING A DOE
Select a response/dependent variable
(variables) that will provide information
about the problem under study and the
proposed measurement method for this
response variable, including an
understanding of the measurement system
variability
14
PLANNING A DOE
Select the independent variables/factors
(quantitative or qualitative) to be
investigated in the experiment, the number
of levels for each factor, and the levels of
each factor chosen either specifically (fixed
effects model) or randomly (random effects
model).
15
PLANNING A DOE
Choose an appropriate experimental design
(relatively simple design and analysis methods are
almost always best) that will allow your experimental
questions to be answered once the data is collected
and analyzed, keeping in mind tradeoffs between
statistical power and economic efficiency. At this
point in time it is generally useful to simulate the
study by generating and analyzing artificial data to
insure that experimental questions can be answered
as a result of conducting your experiment
16
PLANNING A DOE
Perform the experiment (collect data)
paying particular attention such things as
randomization and measurement system
accuracy, while maintaining as uniform an
experimental environment as possible.
How the data are to be collected is a critical
stage in DOE
17
PLANNING A DOE
Analyze the data using the appropriate
statistical model insuring that attention is
paid to checking the model accuracy by
validating underlying assumptions
associated with the model. Be liberal in the
utilization of all tools, including graphical
techniques, available in the statistical
software package to insure that a maximum
amount of information is generated
18
PLANNING A DOE
Based on the results of the analysis, draw
conclusions/inferences about the results,
interpret the physical meaning of these
results, determine the practical significance
of the findings, and make recommendations
for a course of action including further
experiments
19
23
Power
0.8
0.6
0.4
0.2
0
11.8
11.9
12
12.1
12.2
True Mean
24
25
26
27
28
29
DIFFERENCE IN VARIANCE
HYPOTHESIS TEST
30
DIFFERENCE IN VARIANCE
HYPOTHESIS TEST
Can you detect this difference?
31
DIFFERENCE IN VARIANCE
HYPOTHESIS TEST -POWER CURVE
32
Power
0.8
0.6
0.4
0.2
0
-5
-4
-3
-2
-1
Difference in Means
34
Analysis of Variance
---------------------------------------------------------------------------Source
Sum of Squares
Df Mean Square
F-Ratio
P-Valu
---------------------------------------------------------------------------Between groups
1.10019
6
0.183364
18.66
0.000
Within groups
1.30717
133
0.00982837
---------------------------------------------------------------------------Total (Corr.)
2.40736
139
36
37
VISUAL COMPARISON OF 7
NEEDLES
Box-and-Whisker Plot
N1
N2
Col_2
N3
N4
N5
N6
N7
11.5
11.7
11.9
12.1
12.3
Col_1
38
40
# of runs
4
8
16
32
64
128
256
512
1024
42
Orthogonal Settings
RUN Factor A Factor B RESPONSE
low
low
y1
-1
-1
y1
high
low
y2
+1
-1
y2
low
high
y3
-1
+1
y3
high
high
y4
+1
+1
y4
43
( y2 y4 ) ( y1 y3 )
Factor A Effect
2
2
( y3 y4 ) ( y1 y2 )
Factor B Effect
2
2
44
45
46
47
48
EXAMPLE:
FACTORIAL (2k) DESIGNS (k = 2)
A microbiologist is interested in the effect
of two different culture mediums [medium 1
(low) and medium 2 (high)] and two
different times [10 hours (low) and 20 hours
(high)] on the growth rate of a particular
CFU [Bugs].
49
EXAMPLE:
FACTORIAL (2k) DESIGNS (k = 2)
Since two factors are of interest, k =2, and
we would need the following four runs
resulting in
Generalized Settings
RUN Medium
Time
Growth Rate
1
low
low
17
high
low
15
low
high
38
high
high
39
50
EXAMPLE:
FACTORIAL (2k) DESIGNS (k = 2)
Estimates for the medium and time
effects are
Medium effect = [(15+39)/2] [(17 +
38)/2] = -0.5
Time effect = [(38+39)/2] [(17 + 15)/2] =
22.5
51
EXAMPLE:
FACTORIAL (2k) DESIGNS (k = 2)
52
EXAMPLE:
FACTORIAL (2k) DESIGNS (k = 2)
A statistical analysis using the appropriate
statistical model would result in the
following information. Factor A (medium)
and Factor B (time)
Type III Sums of Squares
-----------------------------------------------------------------------------------Source
Sum of Squares
Df
Mean Square
F-Ratio
P-Value
-----------------------------------------------------------------------------------FACTOR A
0.25
1
0.25
0.11
0.7952
FACTOR B
506.25
1
506.25
225.00
0.0424
Residual
2.25
1
2.25
-----------------------------------------------------------------------------------Total (corrected)
508.75
3
All F-ratios are based on the residual mean square error.
53
EXAMPLE:
CONCLUSIONS
In statistical language, one would conclude
that factor A (medium) is not statistically
significant at a 5% level of significance
since the p-value is greater than 5% (0.05),
but factor B (time) is statistically significant
at a 5 % level of significance since this pvalue is less than 5%.
54
EXAMPLE:
CONCLUSIONS
In layman terms, this means that we have
no evidence that would allow us to
conclude that the medium used has an effect
on the growth rate, although it may well
have an effect (our conclusion was
incorrect).
55
EXAMPLE:
CONCLUSIONS
Additionally, we have evidence that would
allow us to conclude that time does have an
effect on the growth rate, although it may
well not have an effect (our conclusion was
incorrect).
56
EXAMPLE:
CONCLUSIONS
In general we control the likelihood of
reaching these incorrect conclusions by the
selection of the level of significance for the
test and the amount of data collected
(sample size).
57
2k DESIGNS (k > 2)
As the number of factors increase, the
number of runs needed to complete a
complete factorial experiment will increase
dramatically. The following 2k design
layout depict the number of runs needed for
values of k from 2 to 5. For example, when
k = 5, it will take 25 = 32 experimental runs
for the complete factorial experiment.
58
59
b
-1
-1
+1
+1
-1
-1
+1
+1
c
-1
-1
-1
-1
+!
+1
+1
+1
ab
1
-1
-1
1
1
-1
-1
1
ac
1
-1
1
-1
-1
1
-1
1
bc
1
1
-1
-1
-1
-1
1
1
abc
-1
1
1
-1
1
-1
-1
1
60
2k DESIGNS (k > 2)
Once the effect for all factors and
interactions are determined, you are able to
develop a prediction model to estimate the
response for specific values of the factors.
In general, we will do this with statistical
software, but for these designs, you can do
it by hand calculations if you wish.
61
2k DESIGNS (k > 2)
For example, if there are no significant interactions
present, you can estimate a response by the
following formula. (for quantitative factors only)
62
GRADE
85
75
65
55
10
12
14
16
#HRS STUDY
18
20
63
Note that
* Any factor d effect is now confounded with the a*b
a
-1
+1
-1
+1
-1
+1
-1
+1
interaction
* Any factor e effect is now confounded with the a*c
interaction
* etc.
* What
is the d*e
interaction
confounded
with????????
b
c
d = ab
e = ac
f = bc
g = abc
-1
-1
+1
+1
-1
-1
+1
+1
-1
-1
-1
-1
+!
+1
+1
+1
1
-1
-1
1
1
-1
-1
1
1
-1
1
-1
-1
1
-1
1
1
1
-1
-1
-1
-1
1
1
-1
1
1
-1
1
-1
-1
1
66
67
Interaction Plot
RESPONSE
10
FACTOR B
-1
1
9
8
7
6
5
-1
FACTOR A
69
Resolution of a Design
F1
F2
F3
Y100
10
45
60
11825
30
45
40
8781
20
30
40
8413
10
30
50
9216
20
45
50
9288
30
60
50
8261
20
45
50
9329
30
45
60
10855
20
45
50
9205
10
20
60
40
8538
11
10
45
40
9718
12
30
30
50
11308
13
20
60
60
10316
14
10
60
50
12056
15
20
30
60
10378
74
75
60
Factor_B
55
50
45
40
35
30
10
14
18
22
Factor_A
26
30
Var_3
9300.0
9500.0
9700.0
9900.0
10100.0
10300.0
10500.0
10700.0
10900.0
11100.0
11300.0
11500.0
11700.0
76
CLASSROOM EXERCISE
CLASSROOM EXERCISE
Conduct Experiment student holds 3 foot
pin the tail on the donkey stick and
attempts to hit the target. An observer will
assist to mark the hit on the target.
Collect data students take data home for
week and come back with what you would
recommend AND why.
YOU TELL THE CLASS HOW TO PLAY
THE GAME TO WIN.
78
CLASSROOM EXERCISE
79
CLASSROOM EXERCISE
MARKER VERTICAL
1ST OBS 2ND OBS 3RD OBS 4TH OBS
STICK
POLE
MEAN
STANDARD
DEVIATION
-2.750
-4.500
-4.750
-5.000
-4.250
1.021
-12.500
-6.750
-4.625
-4.000
-6.969
3.871
3.000
3.250
3.875
6.250
4.094
1.484
4.625
11.250
12.625
14.000
10.625
4.155
MARKER
STICK
VERTICAL
POLE
80
81