Beruflich Dokumente
Kultur Dokumente
Mario Vianello
vianello.clm@tin.it
PRODUCT
QUALITY DESIGN
01OFHLO
These slides can not be reproduced or distributed without permission in writing from the author. 1
COURSE CONTENTS
These slides can not be reproduced or distributed without permission in writing from the author. 2
PRODUCT QUALITY DESIGN
5
CRITERIA AND METHODS
TO PLAN RELIABILITY
EXPERIMENTAL VERIFICATIONS
These slides can not be reproduced or distributed without permission in writing from the author. 3
CHAPTER CONTENTS
These slides can not be reproduced or distributed without permission in writing from the author. 4
5. Criteria and methods to plan reliability experim. verifications
5.1
GENERAL
CRITERIA FOR DESIGNING
EXPERIMENTAL TESTS
These slides can not be reproduced or distributed without permission in writing from the author. 5
5.1. General criteria for designing experimental tests
5.1.1
Categories and purposes
of experimental tests
These slides can not be reproduced or distributed without permission in writing from the author. 6
5.1.1. Categories and purposes of experimental tests
These slides can not be reproduced or distributed without permission in writing from the author. 7
5.1.1. Categories and purposes of experimental tests
Categories and purposes of Experimental Tests
OPERATING TEST: tests aimed at verifying whether the design
changes achieve the desired performance targets. Small samples
are sufficient.
DURABILITY TEST: tests aimed to avoid durability failures (i.e.
failures for which the repair work is not technically feasible or not
economically convenient), and then:
• tests extended to the wear out area and beyond;
• sample size intermediate between that required by operating tests and
that required by reliability test (but, given the burden of these tests, closer
to the first ones than to the second ones);
• test acceleration can be very high, because these tests are aimed at
kinds of failure which have been substantially identified.
RELIABILITY TEST: tests aimed at detecting any kind of failure,
which has been able to overcome the prevention checks, and
therefore:
• large samples are required, because we have to be able to detect failures
even with very low frequencies;
• acceleration can not be very high, for the need to maintain a full relation-
ship with the customer’s use, because, in this case, we need to be able to
detect any kind of failure.
These slides can not be reproduced or distributed without permission in writing from the author. 8
5.1.1. Categories and purposes of experimental tests
These slides can not be reproduced or distributed without permission in writing from the author. 9
5.1. General criteria for designing experimental tests
5.1.2
Concept of
test “significance”
These slides can not be reproduced or distributed without permission in writing from the author. 10
5.1.2. Concept of test “significance”
How many cars must be experimentally tested ?
ONCE
For models with high production, many cars were
tested, while for low-production models, only a few
cars were tested.
But the low-production models are often the more
expensive ones, whose reliability, in this way, is
less well tested !
NOW
Sometimes one wonders: “How many cars
need to test, so that the test is significant“ ?
Oversimplifying...
in Statistics, the term significance level means
the probability of failing,
due to the fact that deductions are made from a sample
and not from the full population.
These slides can not be reproduced or distributed without permission in writing from the author. 12
5.1.2. Concept of tests “significance”
In general :
the design of an experimental test depends on what its
results will have to express and of what decision they will
help us to take.
if we can not afford an adequate sample size to answer the
question of interest, it is worth wondering whether it is better
to eliminate all experimental test (saving the expenditure!).
Alternatively, we must clearly highlight the questions (less
important) to which a reduced test (compared to the ideal
one) is able to answer, and also highlight the relative risks
after a careful evaluation of costs and benefits.
These slides can not be reproduced or distributed without permission in writing from the author. 13
5.1. General criteria for designing experimental tests
5.1.3
Overview of
all experimental tests
for a new car model
in development
These slides can not be reproduced or distributed without permission in writing from the author. 14
5.1.3. Overview of all experimental tests for a new car model in development
These slides can not be reproduced or distributed without permission in writing from the author. 15
5.1.3. Overview of all experimental tests for a new car model in development
q FIAT
These slides can not be reproduced or distributed without permission in writing from the author. 16
5. Criteria and methods to plan reliability experim. verifications
5.2
COMPONENT BENCH
TESTING
(SUCCESS RUN)
These slides can not be reproduced or distributed without permission in writing from the author. 17
5.2. Component bench testing (and their limits)
5.2.1
Basic Success Run
These slides can not be reproduced or distributed without permission in writing from the author. 18
5.2.1. Component bench testing: basic Success Run
C = 1 - Rn
which expresses the Confidence Level, C, depending
on the reliability, R, and on the sample size, n.
The confidence level expresses here the detect ability of failures. In fact, R is the
Reliability of a single component, i.e. its probability not to fail during the mission in
question, and therefore the probability of not finding faults on n components in the
same conditions is given by the product of the probability of n terms equal to R:
R . R . R ... . R = Rn. Ultimately, Rn is the probability of finding not even a failure
(= zero failures), and C = 1 - Rn is the probability of finding at least one failure
(one or more), namely the detect ability. We can now proceed with a reasoning
similar to that of Statistical Hypothesis, noting that, if the reliability was equal to R
or less, it would be a probability not less than 1 - Rn to have at least one failure,
but, as there has been no failure, we are allowed to believe that the actual reliability
is greater than R: with a confidence level equal to 1 - Rn and risk to be wrong (the
so-called error of Type I) equal to Rn.
These slides can not be reproduced or distributed without permission in writing from the author. 20
5.2.1. Component bench testing: basic Success Run
These slides can not be reproduced or distributed without permission in writing from the author. 21
5.2.1. Component bench testing: basic Success Run
Inverse application
The same expression is commonly used to plan the minimum number of
units to be tested, in order to statistically demonstrate a predetermined
reliability target with an assigned confidence level.
It’s enough to write in the inverse form the preceding expression:
With the values of the preceding example, we obtain (as it was logical to
expect):
n = Log(1- 0.74) / Log 0.997 = (-0.585) / (-0.0013) = 450 units.
Since the market requires more and more demanding reliability targets,
the number of units to be sampled is usually very high.
These slides can not be reproduced or distributed without permission in writing from the author. 22
5.2.1. Component bench testing: basic Success Run
MINIMUM SAMPLE SIZE depending on the MINIMUM VALUE, R, OF RELIABILITY to be demonstrated and on its CONFIDENCE LEVEL, C
MINIMUM
C O N F I D E N C E L E V E L , C
RELIABILITY
R 0,500 0,550 0,600 0,650 0,700 0,750 0,800 0,850 0,900 0,950 0,960 0,970 0,980 0,990 0,995 0,999
0,500 1 2 2 2 2 2 3 3 4 5 5 6 6 7 8 10
0,550 2 2 2 2 3 3 3 4 4 6 6 6 7 8 9 12
0,600 2 2 2 3 3 3 4 4 5 6 7 7 8 10 11 14
0,650 2 2 3 3 3 4 4 5 6 7 8 9 10 11 13 17
0,700 2 3 3 3 4 4 5 6 7 9 10 10 11 13 15 20
0,750 3 3 4 4 5 5 6 7 9 11 12 13 14 17 19 25
0,800 4 4 5 5 6 7 8 9 11 14 15 16 18 21 24 31
0,850 5 5 6 7 8 9 10 12 15 19 20 22 25 29 33 43
0,900 7 8 9 10 12 14 16 19 22 29 31 34 38 44 51 66
0,950 14 16 18 21 24 28 32 37 45 59 63 69 77 90 104 135
0,960 17 20 23 26 30 34 40 47 57 74 79 86 96 113 130 170
0,970 23 27 31 35 40 46 53 63 76 99 106 116 129 152 174 227
0,980 35 40 46 52 60 69 80 94 114 149 160 174 194 228 263 342
0,990 69 80 92 105 120 138 161 189 230 299 321 349 390 459 528 688
0,995 139 160 183 210 241 277 322 379 460 598 643 700 781 919 1.058 1.379
0,999 693 799 916 1.050 1.204 1.386 1.609 1.897 2.302 2.995 3.218 3.505 3.911 4.603 5.296 6.905
These slides can not be reproduced or distributed without permission in writing from the author. 23
5.2.1. Component bench testing: basic Success Run
Which value do we assume for the MINIMUM RELIABILITY ?
BASIC ASSUMPTIONS
Usual target for the failure frequency at the end of the warranty period (2 years) = 0.5 repair/car
Number of car’s subsystems/components influencing reliability ≈ 300
(which may cause about 1,200 different failure modes, as seen in the histogram of Par. 2.3.4.2 speaking of the binomial distribution)
Let’s suppose that each of the 300 subsystems/components has a failure frequency of 0.01 (= 1%)
during the warranty period (2 years): this means that the production process produces 1 defective
every 100 components (not so bad!), which corresponds to a reliability equal to 99%.
Under this hypothesis, the number of repairs/cars during the warranty period (2 years) is given by:
1
------- . 300 = 3 repairs/car
100
This result (3 repairs/car) is 6 times greater than the usual target of 0.5 repair/car !
As a conclusion, in the previous Table, we have to assume a MINIMUM RELIABILITY > 0.990.
And, since a value of 0.999 would require samples of impossible size, we would usually limit to
reliability values around 0.995.
These slides can not be reproduced or distributed without permission in writing from the author. 24
5.2.1. Component bench testing: basic Success Run
“Attainable” and “statistically provable” targets
d
1/3÷ 1/5 d Statistically
provable target
105 rep./100 cars
D = 1/3 ÷ 1/5 d
30 40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200
FREQUENZA DI FAILURE
IN FIELD GUASTO IN RETE [ inconvenienti
FREQUENCY / 100
[repairs/100 vetture ]
cars]
These slides can not be reproduced or distributed without permission in writing from the author. 25
5.2.1. Component bench testing: basic Success Run
With the same reliability value to be demonstrated,
the needed sample size greatly increases with the increasing of the confidence level.
1,00
R [fraction]
0,99
dimostrata [frazione]
0,98
0,97
RELIABILITY,
0,96
0,95
0,94
Affidabilità R
Demonstrated
0,93
C.L. = 50%
0,92 C.L. = 75%
C.L. = 90%
0,91
0,90
0 50 100 150 200 250 300 350 400 450 500
Numeroofditested
Number unità provate con zero
units (with zero guasti
failures)
These slides can not be reproduced or distributed without permission in writing from the author. 26
5.2.1. Component bench testing: basic Success Run
These slides can not be reproduced or distributed without permission in writing from the author. 27
5.2.1. Component bench testing: basic Success Run
5.2.2
Prolonged Success Run
These slides can not be reproduced or distributed without permission in writing from the author. 29
5.2.2. Component bench testing: Prolonged Success Run
These slides can not be reproduced or distributed without permission in writing from the author. 30
5.2.2. Component bench testing: Prolonged Success Run
These slides can not be reproduced or distributed without permission in writing from the author. 31
5.2.2. Component bench testing: Prolonged Success Run
L = Ttest / Testimate
These slides can not be reproduced or distributed without permission in writing from the author. 32
5.2.2. Component bench testing: Prolonged Success Run
Based on the concepts outlined above, if Ttest > Testimate and we are
not dealing with infant mortality failures, to extend the test is "as if" we
test an “equivalent” number of units neq greater than the number n of
units really tested. The relationship which links them is:
neq = n . Lb
where b is the shape parameter (exponent) of Weibull distribution:
b < 1 for infant mortality failures;
b = 1 for random failures (useful life);
b > 1 for wear-out failures.
These slides can not be reproduced or distributed without permission in writing from the author. 34
5.2.2. Component bench testing: Prolonged Success Run
Example of
SEMIAUTOMATIC PLANNING of RELIABILITY TESTS
These slides can not be reproduced or distributed without permission in writing from the author. 35
5.2.2. Component bench testing: Prolonged Success Run
Example of
SEMIAUTOMATIC PLANNING of RELIABILITY TESTS
CONCLUSIONS
Reliability target required by Specifications is 0.987 with a Con-
fidence Level equal to 75%.
The reliability target we can demonstrate by bench test, with a
Confidence Level of 75%, testing 8 units without prolonging
the test, is 0.841.
Reliability target that we can demonstrate, with a confidence
level of 75%, with all units in all tests (8 units in bench tests
and 26 prototype cars), without prolonging the test, is 0.960.
Reliability target that we can demonstrate, with a confidence le-
vel of 75%, with all units in all tests (8 units in bench tests and
26 prototype cars) and prolonging the test is 0.988 (even bet-
ter than Specification requirement).
These slides can not be reproduced or distributed without permission in writing from the author. 36
5.2. Component bench testing (and their limits)
5.2.3
Extended Success Run
These slides can not be reproduced or distributed without permission in writing from the author. 37
5.2.2. Component bench testing: Extended Success Run
These slides can not be reproduced or distributed without permission in writing from the author. 38
5.2.2. Component bench testing: Extended Success Run
1,00
R [fraction]
0,99
R dimostrata [frazione]
0,98
0,97
Affidabilità RELIABILITY,
0,96
0,95
0,94
Demonstrated
0,92 failures
0 guasti
1 guasto
failure
0,91 2 guasti
failures
0,90
0 50 100 150 200 250 300 350 400 450 500
Number ofunità
Numero di tested units
provate
These slides can not be reproduced or distributed without permission in writing from the author. 39
5.2.2. Component bench testing: Extended Success Run
1,00
R [fraction]
0,99
R dimostrata [frazione]
0,98
0,97
Affidabilità RELIABILITY,
0,96
0,95
0,94
Demonstrated
0,92 0 guasti
failures
1 guasto
failure
0,91 2 guasti
failures
0,90
0 50 100 150 200 250 300 350 400 450 500
Numero di
Number ofunità provate
tested units
These slides can not be reproduced or distributed without permission in writing from the author. 40
5.2.2. Component bench testing: Extended Success Run
1,00
R [fraction]
0,99
dimostrata [frazione]
0,98
0,97
RELIABILITY,
0,96
0,95
Affidabilità R
0,94
Demonstrated
0,90
0 50 100 150 200 250 300 350 400 450 500
Number di
Numero ofunità
tested units
provate
These slides can not be reproduced or distributed without permission in writing from the author. 41
5. Criteria and methods to plan reliability experim. verifications
5.3
RELIABILITY TESTS
ON COMPLETE VEHICLES
FOR DESIGN VERIFICATION
(Reliability Growth Testing
for Useful Life)
These slides can not be reproduced or distributed without permission in writing from the author. 42
5.3. Rel. tests on compl. vehicle for design verif. (useful life)
EXAMPLE OF RELIABILITY TEST FOR A WHOLE VEHICLE,
WITH ACCELERATION 2, KNOWN AS “DEMANDING CUSTOMER”
DISTRIBUTION OF AVERAGE ANNUAL CORRESPONDING DISTANCES DISTRIBUTION
DISTANCES IN CUSTOMER USE IN THE ACCELERATED TEST KNOWN AS "DEMANDING CUSTOMER"
Average annual Equivalent distances
true distances in the test
Types of travel Types of travel Reduction
[km] % compared to the [km] %
customer use
The distances in the test (last two columns to the right in the table) obviously change with the mission profile of the
car model (described in the first 3 columns on the left), but remain substantially valid the reduction principles, descri-
bed in the two central yellow columns (the antepenultimate and the previous one). These tests with acceleration 2 are
the basic reference in the Reliability Growth Testing for useful life (which will be discussed immediately hereinaf-
ter). However, in the tests design, no account is taken of their acceleration (that assumes a purely conservative
role in reliability predictions/verifications: hence the name of “demanding customer tests”). The only constraint
that the acceleration implies for these tests is that the tested vehicles may not exceed 60,000 km in order to
avoid the risk of encroaching in the wear-out area (since 2 . 60.000 = 120.000 km).
These slides can not be reproduced or distributed without permission in writing from the author. 43
5.3. Rel. tests on compl. vehicle for design verif. (useful life)
5.3.1
Fundamentals of
Reliability Growth Testing
(Duane learning curve)
These slides can not be reproduced or distributed without permission in writing from the author. 44
5.3.1. Fundamentals of Reliability Growth Testing
These slides can not be reproduced or distributed without permission in writing from the author. 45
5.3.1. Fundamentals of Reliability Growth Testing
Duane’s learning curve
25
FAILURES
0,004
N°inconvenienti
N° of progressively detected
via via rilevati
cumulato
failures (cumulative)
(cumulato)
20
OF DETECTED
SPEED of failure detection
0,003
[ failures / 100 hours ]
INCONVENIENTI
15
0,002
NUMBER
10
Numero di
0,001
CUMULATIVE
5
Velocità di rilevamento
Speed of failure inconvenienti
detection (failure detection rate)
0,000 0
0 5.000 10.000 15.000 20.000 25.000 30.000 35.000 40.000
CUMULATIVE
TEMPO TOTALCUMULATO
DI SPERIMENTAZIONE TESTINGGLOBALE
TIME [working hours]
[ore di funzionamento]
where k and a are appropriate constants. Deriving (1) respect to the time,
we get the instantaneous failure rate li:
li = dN(t)/dt = k . (1 - a) . t -a (2)
this relationship represents a straight line in a log-log graph:
These slides can not be reproduced or distributed without permission in writing from the author. 47
5.3.1. Fundamentals of Reliability Growth Testing
1,8
FREQUENCY DENSITY, li
]
1,6
INTENSITÀ DI FREQUENZA DI GUASTO li
x 100hours)
ore)]
1,4
x 100
1,2
/ (vettura
/ (vehicle
1,0
[inconvenienti
0,8
[ failures
FAILURE
0,6
0,4
Obiettivo
Target
0,2
0,0
0 5.000 10.000 15.000
CUMULATIVE
TEMPO DI PROVATEST TIME
CUMULATO [ore[working hours]
di funzionamento]
These slides can not be reproduced or distributed without permission in writing from the author. 48
5.3.1. Fundamentals of Reliability Growth Testing
In a log-log graph, Duane’s learning curve becomes a straight line
0,010
DI GUASTO li l
[inconvenienti / (vettura x 100 ore)]
FREQUENZADENSITY,
100 hours) ]
FAILUREDIFREQUENCY
(vehicle x
0,001
failures /
Obiettivo
Target
INTENSITÀ
[
0,000
100 1.000 10.000 100.000
These slides can not be reproduced or distributed without permission in writing from the author. 49
5.3.1. Fundamentals of Reliability Growth Testing
Confidence band for the experimental tests
PREDICTIVE
MODELLO MODEL AND
MODELLOPREVISIVO
PREVISIVO CONFIDENCE
EEINTERVALLO
INTERVALLO INTERVAL
DIDIFIDUCIA
FIDUCIA
Duane’s
retta di di
retta straight
Duane
Duane linemedi);
(valori
(valori (mean values);
medi);pend. = slope
0,40 = 0.40
= 0,40
pend.
10,0
Duane’s
curva
curvadi di 75%
Duane
Duane C.L. Upper Limit;
(C.L.=75%);
(C.L.=75%); pend. mean
media
pend. slope
media >>
> 0,400.40
0,40
10,0
Target
target perfor useful
vita utilelife (mean
(valore value)
medio)
target per vita utile (valore medio)
STARTING-POINT 75%
STARTING-POINT = difettosità
75% STARTING-POINT
75% iniziale
= one-sided
= difettosità C.L.
iniziale 75%
upper
C.L. monolaterale
limit
75% with 75% C.L.
monolaterale Target
target per
target for
per useful
vita utile
vita life
utile (75% C.L. Upper Limit)
(C.L.=75%)
(C.L.=75%)
convenzionalmente
conventionally a 100
calculated ore100
at ore hours
DI GUASTO l l ii
convenzionalmente a 100
l
[Inconvenienti / (vettura x 100 ore)] i
100 ore)]
STARTING-POINT
STARTING-POINT = difettosità
= average iniziale media
initial defectiveness
DI FREQUENZADENSITY,
]
Banda di fiducia
Confidence
Banda band
di fiducia
/ (vettura
(75% monolaterale)
(one-sided
(75% limit of 75% C.L.)
monolaterale)
INTENSITÀFREQUENCY
1,0 (pendenza
(slope ofdella
(pendenza retta
Duane’s
della distraight
retta Duane) line)
di Duane)
[Inconvenienti
Spesso
Spesso i dueduetarget
iOften vengono
the two
target targets
vengono confusi
confusi
(per
(per can
l'esiguità be
l'esiguità considered
dell'intervallo equal
di
dell'intervallo fiducia
di fiducia
(for the small size of the confidence interval
dopodopo unun
after aelevato
elevato
large numero
numbernumero di hours)
of test ore)
di ore)
[ failures
FAILURE
Ammontare
Ammontare globale
globale
Total
della test hours
sperimentazione
della sperimentazione
0,1
0,1
100
100 1.000
1.000 10.000
10.000 100.000
100.000
TEMPO
CUMULATIVE
TEMPODIDI
PROVA CUMULATO
PROVA [ore
TEST TIME
CUMULATO didi
funzionamento]
[working
[ore hours]
funzionamento]
These slides can not be reproduced or distributed without permission in writing from the author. 50
5.3.1. Fundamentals of Reliability Growth Testing
Purpose of Reliability Growth Testing for “useful life”
2,5E-5
… and therefore
it is not involved
in the wear-out zone
RATE, h(t)l
2,0E-5
STARTING POINT (failure frequency of the first R.G. batch)
h(t)
GUASTO
FREQUENCY,
Target, l
(slope of ”useful life”)
0,0E+0 of the new car model
0 25.000 50.000 75.000 100.000 125.000 150.000 175.000 200.000
PERCORRENZA,
MILEAGE,km km
km( o (or (o tempo
tempo
hours o cicli)
oorcicli )
cycles)
5.3.2
“New Content “
as a main parameter
for sizing test samples
These slides can not be reproduced or distributed without permission in writing from the author. 52
5.3.2. Reliability Growth for useful life: New Content
The New Content of a new product in development gives the percentage of
failure frequency (theoretically from 0% to 100% of the total failure frequency
corresponding to the whole vehicle) which is attributable to redesigned
elements and/or elements employed in a different operating environment
(e.g. a known gearbox applied to a new engine or a known suspension with adjustments aimed
to a different market with a greater percentage of dirt roads) with regard to products
already marketed and used as reference point (archetype).
In other words, this percentage is given by the failure frequency (e.g. measured
in “failures/100 units") corresponding to the subsystems/components (or portion
thereof) on which changes were made, divided by the total failure frequency of
the product (e.g. the whole vehicle).
Of course, it makes no sense (and is therefore forbidden!) assume values
exactly equal to 0% or 100%, because in any situation, there is always
something changed (e.g. the mission profile or the operating environment in which the
subsystem/component is placed) and something already experienced (e.g.: designers
virtuous traditions, standards, etc.) .
It’s worth to highlight that the failure frequency to be used (first column in the Table of
next slide) is “actual”, since detected in field, for what concerns archetype’s
components, while it is “potential”, since estimated, for what concerns the
“innovative” components of the new product in development.
These slides can not be reproduced or distributed without permission in writing from the author. 53
5.3.2. Reliability Growth for useful life: New Content
Being merely illustrative and with a large simplification of the concept, but useful for
didactic purposes, we could say that New Content is given by the ratio between the
sum of the failure frequencies(1) of the (only) modified components (for better
adequacy to the new vehicle model in development, and therefore with a “specific”(2)
design) and the total failure frequency of the vehicle(3):
Of course, the professional calculation of New Content (as you can guess from the
previous slide), is a bit more complicated than that and the next slide gives a rough
idea about the practice run of calculations.
(1) Instead it would be totally wrong to consider New Content as the ratio between the number of modified
components and the total number of components that make up the whole vehicle.
(2) To be the numerator of the New Content, it is not necessary that components are "innovative" (i.e. sub-
stantially new inventions or, at least, very recent), but it is enough that they are "specific" i.e. that their
de-sign has been modified, for example, in order to better suit the needs of the new car model in deve-
lopment. Clearly, all innovative components belong to the category of specific, but not vice versa (the
most of the specific components are not innovative).
(3) The failure frequency of the whole vehicle is calculated: a) for the components already present in the ar-
chetype, on the basis of their failure frequencies from filed, subjectively corrected taking into account of
differences presumably introduced by the planned modifications; b) for the new components (= not pre-
sent in the archetype), on the basis of the best possible estimates (e.g. by the systems in series and in
parallel).
These slides can not be reproduced or distributed without permission in writing from the author. 54
5.3.2. Reliability Growth for useful life: New Content
SPREADSHEET for NEW CONTENT estimate
NEW CONTENT assessment - Car model: f ictitious example of an old car model (production year around 2000)
( INPUT data in YELLOW cells )
Log t
These slides can not be reproduced or distributed without permission in writing from the author. 56
5.3.2. Reliability Growth for useful life: New Content
Based on historical data, Fiat established a semi empirical relation-
ship to evaluate the Starting Point according to the set reliability
target and to the specific New Content (this relationship is not shown for
reasons of business confidentiality): every Company will evaluate his
own relationship.
Growth slope
Type of Reliability Program
a
Little or no attention to the reliability problems.
- 0.01 to - 0.10 No formal procedure.
The input parameter to which the Reliability Growth Testing shows the maximum sensitivity
is the slope of the Duane straight line. However, if we incorrectly assume an optimistic
value for it, this will not affect the final reliability of the product, but simply will require a test
time longer than planned.
These slides can not be reproduced or distributed without permission in writing from the author. 59
5.3. Rel. tests on compl. vehicle for design verif. (useful life)
5.3.3
Meaningfulness
of tested vehicles
in subsequent batches
These slides can not be reproduced or distributed without permission in writing from the author. 60
5.3.3. R.G. for Useful life.: meaningfulness of vehicles
These slides can not be reproduced or distributed without permission in writing from the author. 61
5.3.3. R.G. for Useful life.: meaningfulness of vehicles
IMPORTANT REMARK
Classical Statistics provides only predictions based on homogeneous
batches (taken from the same population) and therefore does not allow to
put together batches of vehicles with different meaningfulness in reliabi-
lity estimates. So we would be forced to base the estimated only on the
results of the last batch (the closest to the vehicle that will actually be
sold that are the subject of the estimate) and thus the quality of the esti-
mate (expressed by the Confidence Level and the Confidence Interval)
would be determined only by the size of the last batch.
Instead, Reliability Growth Testing uses meaningfulness in order to
empirically increase the sample size of the last batch by adding to the
number of its vehicles, the number of vehicles in each previous batch
multiplied by their specific meaningfulness. In this way, the quality of the
estimate (expressed by the Confidence Level and the Confidence Inter-
val) is affected not only by the size of the last batch, but also by that of all
previous batches, although "weighted" by their meaningfulness, with
evident benefits (see slide 76).
These slides can not be reproduced or distributed without permission in writing from the author. 63
5.3. Rel. tests on compl. vehicle for design verif. (useful life)
5.3.4
Tests designing
These slides can not be reproduced or distributed without permission in writing from the author. 64
5.3.4. R.G. for useful life: tests designing
MAIN INPUTs:
Target to be demonstrated, corresponding to the achievable target
recognized by Engineering;
New Content of the new model, which measures what the car has
never been tested, evaluated as % of failure frequency of reference:
redesigned elements and/or elements employed in a different operating
environment.
Meaningfulness of cars, which measures how much tested cars
are representative of a defined project (weighting the assessment
on the failure frequency of the archetype).
OPTIONAL ASSUMPTIONS: depending on what is available in the com-
pany for the new car model in development:
Speed of Problem-Solving (slope of Duane’s straight line);
Number of phases (batches)
Number of cars needed
for each batch.
TOTAL MILEAGE
(and of each batch or phase)
Test time (of every
These slides can not be reproduced or distributed without permission in writing from the author.
batch and total) 65
5.3.4. R.G. for useful life: tests designing
Phase 2 (and following) re-starting-point
Phase 1 Target
Ending Point
Phase 1
100 hours
These slides can not be reproduced or distributed without permission in writing from the author. 66
5.3.4. R.G. for useful life: tests designing
How it is easy to understand, the more the tested cars are close
to those that will be sold (meaningfulness) and the sooner they
are available to be tested, the smaller will be the size of batches
(samples).
Experimental tests will be divided into several phases (from 2 to
4), taking into account the meaningfulness and the timing with
which the vehicles will be available for reliability tests.
These slides can not be reproduced or distributed without permission in writing from the author. 67
5.3.4. R.G. for useful life: tests designing
Reliability Growth Model (actual case)
Phase 1 Target
Ph1T
Phase 2 Target
Final Target Final Target
FT
End of Phase 1
These slides can not be reproduced or distributed without permission in writing from the author. 68
5.3.4. R.G. for useful life: tests designing
NCtot = 40%
Yellow cells highlight initial assumptions
P H A S E S
GROWTH PHASES Verification
1 2 3 (last) phase
Estimated at the planning of testing 85% 95% 100% 100%
CAR (1)
MEANINGFULNESS (2)
Detected at the end of each phase 85% 95% 98% 100%
Conventionally = 36%
SPECIFIC Formulas NCspec = 0,40 . 0,85 NCspec = 0,40 . (0,95 - 0,85) NCspec = 0,40 . (1,00 - 0,95) of the (total) NC of the
previous (last) phase
NEW CONTENT
OF EACH PHASE Result of the formula 34% 4% 2% 0,72%
Check: 34 + 4 + 2 = 40 = NCtot
(1) Before starting to test the 3rd batch (3rd phase) a full meaningfulness of 100% was expected, but, during the 3 rd phase, some new failure modes have been
detected and then solved by implementing suitable corrective actions. The " New Content " due to these corrective actions (defined but never tested on the cars)
has been estimated around 2%, so that the actual meaningfulness of the 3rd phase becomes equal to 98%. This requires an additional phase to get the actual 100%:
this phase is called " verification phase " . Conventionally, its New Content is assumed equal to 36% of the (total) New Content of the previous (last) phase.
(2) The actual car meaningfulness remains 100% at the end of the "verification phase" (only) if, as usually happens, no corrective actions have been implemented during
the verification phase.
These slides can not be reproduced or distributed without permission in writing from the author. 69
5.3.4. R.G. for useful life: tests designing
CAR MODEL Date
RELEASE FOR MANUFACTURE: 14.000 32
PRODUCTION START-UP: May 1, 2003
COMMERCIAL LAUNCH: June 15, 2003
Reference Information
DATE OF MANUFACTURE OF TESTED CARS
PHASE's STARTING DATE
PHASE's ENDING DATE 10,63
PHASE's LENGTH [months]
OVERLAPPING WITH SUBSEQUENT [months]
MEANINGFULNESS OF TESTED CARS
NUMBER OF TESTED CARS 52
These slides can not be reproduced or distributed without permission in writing from the author. Example of an EXCEL spreadsheet to plan the phases (by trial and error) 70
5.3. Rel. tests on compl. vehicle for design verif. (useful life)
5.3.5
Results monitoring
These slides can not be reproduced or distributed without permission in writing from the author. 71
5.3.5. R.G. for useful life: results monitoring
FAILURE FREQUENCY DENSITY, l [failures / (vehicle x 100 hours)]
R E Q U I R E D T E S T T I M E
These slides can not be reproduced or distributed without permission in writing from the author. 72
5.3.5. R.G. for useful life: results monitoring
Monitoring the progress of testing is particularly simple, since the test results (thick
broken line in Figure) are displayed on the same chart that contains the line of
planning. This facilitates a continuous and immediate comparison of results with the
intermediate targets defined by the growth model.
Each of the two curves (planned and resulting) is the upper limit of a confidence
band, because it identifies, point by point, the upper limit of the specific confidence
interval.
Usually the broken curve is updated weekly. Thus, the 1st point of the broken line is
the number of failures detected at the end of the first week divided by the total accu-
mulated test hours during the first week; the 2nd point is calculated by assuming the
same number of failures, but divided by the sum of hours accumulated during both
the first and second week, the 3rd point, positioned vertically above the 2nd, is calcu-
lated by dividing the total number of failures of the two weeks by the total hours ac-
cumulated in the two weeks. And we continues this way until we get to point A, at
which (for the first time) the broken line has a vertical drop due to the benefits of im-
plemented corrective actions.
Therefore, point A graphically represents the simplest graphical estimate of the Star-
ting Point value, deduced from experimental results.
These slides can not be reproduced or distributed without permission in writing from the author. 73
5.3.5. R.G. for useful life: results monitoring
These slides can not be reproduced or distributed without permission in writing from the author. 74
5.3.5. R.G. for useful life: results monitoring
These slides can not be reproduced or distributed without permission in writing from the author. 75
5.3.5. R.G. for useful life: results monitoring
These slides can not be reproduced or distributed without permission in writing from the author. 76
5.3.5. R.G. for useful life: results monitoring
In assessing the confidence interval of the final estimate of the failure frequency (of
a new car model) after testing the last lot, the Reliability Growth Testing offers a
further advantage, because it takes into account not only the size of the last batch (as
classical Statistics suggests), but also of that of previous batches.
4 BATCHES OR PHASES
GROWTH PHASES Verification phase
1 2 3 (last) actual data virtual sample
Because the cars of the batches preceding the last not have a full meaningfulness,
their size is "weighted" on the meaningfulness of each batch, as shown in Table
above.
In practice, in the above example, the confidence interval is not calculated on the
basis of the only 10 cars of the last lot, but also taking into account the cars of all
previous batches (albeit "weighted" on their meaningfulness). So 94.4 cars are
counted (in a virtual way). Therefore, with a larger number of cars, we obtain a
smaller confidence interval.
These slides can not be reproduced or distributed without permission in writing from the author. 77
5.3.5. R.G. for useful life: results monitoring
These slides can not be reproduced or distributed without permission in writing from the author. 78
5.3. Rel. tests on compl. vehicle for design verif. (useful life)
5.3.6
In field prediction adjustment
These slides can not be reproduced or distributed without permission in writing from the author. 79
5.3.6. R.G. for useful life: In field prediction adjustment
These slides can not be reproduced or distributed without permission in writing from the author. 80
5.3.6. R.G. for useful life: In field prediction adjustment
An example to better capture the concept
Suppose there are 1000 balls in an urn,
with the characteristics shown in Table
Balls Meaning
Number Coulor in the similarity
500 white No failures
These slides can not be reproduced or distributed without permission in writing from the author. 81
5.3.6. R.G. for useful life: In field prediction adjustment
Suppose we extract a sample of 20 balls from the urn.
Without disturbing Statistics ...
… it is reasonable to expect that the white balls represent, more
or less, one half the sample and the other half is made up of co-
lored balls (in various ways);
… it is not said that the balls green/yellow are present in the sam-
ple in a quantity (even approximately) halved compared to the
blue/red: this can be generalized by saying that usually the fre-
quency of the colors of the whole population is not observed
in the sample;
… indeed, in a sample of 20 balls, it is even possible that a color
is completely missing: of course it is easier that green or yellow
is missing, but it is not impossible that are the blue and/or red to
miss;
… in a sample, it is plausible the presence of some metallic ball
(all of different colors): based on the sample, one is led to attribu-
te to each of them a frequency of 1/20, while we know that their
true frequency, in this example, is 1/1000.
These slides can not be reproduced or distributed without permission in writing from the author. 82
5.3.6. R.G. for useful life: In field prediction adjustment
Generalized deductions
These slides can not be reproduced or distributed without permission in writing from the author. 83
5.3.6. R.G. for useful life: In field prediction adjustment
These slides can not be reproduced or distributed without permission in writing from the author. 84
5.3.5. R.G. for useful life: results monitoring 100%
90%
No. failure kinds mildly detectable
86,1%
91,6%
We can use the “detect ability” in the sample in order to adjust the
50,5%
58,0%
50%
41,9%
40%
31,6% 32,1%
30%
25,5%
19,5%
20% 16,6%
10,3%
10%
5,0%
2,8%
0%
Supposing a total
of 100 cars
detect ability
A in a sample (e.g. 64%)
Credible estimate of failure frequency before last corrective actions
Improvement detected
on the sample
prediction
In field
% fail. not detectable
in a sample (e.g. 36%)
These slides can not be reproduced or distributed without permission in writing from the author. 85
5.3.6. R.G. for useful life: In field prediction adjustment
At this point, it is clear that the true value of the in field failure frequency is
between point A and the value of the result (near the target) reached on
the sample.
Taking the value of Point A as a prediction of the in field failure frequen-
cy, would be pessimistic, because it was implemented a number of cor-
rective actions. On the other hand, assume the value provided by the re-
sult on the sample would be optimistic, because it does not take ac-
count of kinds of failures that have not appeared in the sample.
So, much empirically, we split the interval, between the previous two alter-
native estimations, into two parts proportional to the visibility and to the
non-visibility on the sample, as shown in previous slide.
We have already seen in Chapter 2 - Statistics, speaking about the bino-
mial distribution, the meaning of "detect ability" (or, rather, visibility) on
a sample of size n: the non-visibility quantifies, in some way, the inade-
quacy of the sample in relation to achieving a very precise estimate.
In this way, with very large samples (practically unfeasible!), the in field
estimate would be close to the result on the sample, while, with small
samples, the in field estimate slightly differs from the value of Point A.
These slides can not be reproduced or distributed without permission in writing from the author. 86
5.3.6. R.G. for useful life: In field prediction adjustment
These slides can not be reproduced or distributed without permission in writing from the author. 87
5. Criteria and methods to plan reliability experim. verifications
5.4
RELIABILITY TESTS
ON COMPLETE VEHICLES
FOR PROCESS VERIFICATION
(Reliability Growth Testing
for Infant Mortality)
These slides can not be reproduced or distributed without permission in writing from the author. 88
5.4. R.G. for infant mortality: process verification
But how much these distances can be limited ? Even for infant
mortality failures is often required some time to "mature" (i.e. to be
perceived by the driver). So if we reduce too much distance, we risk
of not detecting some kind of failure (probably not very frequent, but
able to be annoying for customers).
These slides can not be reproduced or distributed without permission in writing from the author. 89
5.4. R.G. for infant mortality: process verification
These slides can not be reproduced or distributed without permission in writing from the author. 90
5.4. R.G. for infant mortality: process verification
These slides can not be reproduced or distributed without permission in writing from the author. 91
DI GUASTO ll 5.4. R.G. for infant mortality: process verification
l
100 km)]
DI FREQUENZADENSITY,
a 180 km
x 100x km)
/ (vettura ]
INTENSITÀFREQUENCY
failures / (cars
l a 1.500 km
[[inconvenienti
mileage
media annua
precorrenza
Annual
FAILURE
average
180 km 1.500 km P EI RL CEOA
M RRG EEN Z A
[ k [km]
m]
These slides can not be reproduced or distributed without permission in writing from the author. 92
5.4. R.G. for infant mortality: process verification
These slides can not be reproduced or distributed without permission in writing from the author. 93
5. Criteria and methods to plan reliability experim. verifications
5.5
FINAL SUMMARY ON
EXPERIMENTAL TESTS
These slides can not be reproduced or distributed without permission in writing from the author. 94
5.5. Final summary on experimental tests
Current standards
Different categories of experimental tests to define
sample size
At least 1
failure in the
sample
OPERATING tests (with assigned
probability):
binomial distr.
Success-Run
Reliability Verification
M(t) curve
with (one sided)
confidence
band
RELIABILITY tests
Design
(useful life)
Reliability Growth Testing
Process
(infant mortality)
DURABILITY tests
These slides can not be reproduced or distributed without permission in writing from the author. 95
5.5. Final summary on experimental tests
The predictions
obtained using Reliability Growth Testing methodology,
by combining tests for useful life and those for infant mortality,
proved to be particularly accurate.
The forecast-error
on the failure frequency at 12 months of use,
noticeable only after about a year and a half,
was around 15%,
which can be considered a very good result,
if account is taken of the advance which it is given.
The estimate is generally a little optimistic,
because it can not take into account of possible process drifts.
It can become pessimistic for those models (e.g. niche models),
for which smaller samples are taken:
the smaller sample size widens the confidence interval
of the failure frequency estimate.
These slides can not be reproduced or distributed without permission in writing from the author. 96
5.5. Final summary on experimental tests
EXAMPLES (only qualitative) of comparisons between:
PREDITION at the end of Reliability Growth Testing and
RESULT IN FIELD after one year of use of the cars sold in the 2nd 3rd and 4th month after the Commercial Launch.
Unusual conspicuous
process drifts occured RG PREDICTION
immediately after the
Commercial Launch
(causes have been IN FIELD RESULT
identified)
FAILURES / 100 CARS
CAR MODELS
(first applications of the Reliability Growth Testing methodology)
These slides can not be reproduced or distributed without permission in writing from the author. 97
5.5. Final summary on experimental tests
These slides can not be reproduced or distributed without permission in writing from the author. 98