Beruflich Dokumente
Kultur Dokumente
Michael A. Greenfield
Deputy Associate Administrator
Office of Safety and Mission Assurance
Outline
IDENTIFY
Identify risk issues and concerns
Statements of risk
List of risks
ANALYZE
Evaluate (impact/severity, probability, time
frame), classify, and prioritize risks
Risk evaluation
Risk classification
Risk prioritization
Resources
PLAN
Decide what, if anything, should
be done about risks
Program/project data
(metrics information)
TRACK
Monitor risk metrics and
verify/validate mitigation actions
CONTROL
Replan mitigations, close risks, invoke
contingency plans, or track risks
Risk decisions
DESIGN RULES
MATERIALS SELECTION
LESSONS LEARNED
ANALYSES
TECHNOLOGY
QUALIFICATION
VENDOR QUALIFICATION
PROCESS CONTROLS
MISSION SIMULATION
INSPECTIONS
VERIFICATIONS
ASSEMBLY TESTING
PERFORMANCE TESTING
RELIABILITY ANALYSES
SYSTEM TESTING
MISSION SUCCESS
Risk assessment analyses should use the simplest methods that adequately
characterize the probability and severity of undesired events
Qualitative methods that characterize hazards and failure modes and effects
should be used first
System safety analysis must include early interaction with project engineering,
integration, and operations functions to ensure all hazards are identified
NSTS 22206, instructions for preparation of FMEA and CIL [for Space Shuttle]
System and performance requirements are defined
SSP 30234, instructions for preparation of FMEA and CIL [for Space Station]
FMEA process, requirements, rules, reporting requirements are described
Occurrence
Severity
Detection
Example
For want of a nail, the shoe was lost;
For want of a shoe, the horse was lost;
For want of a horse, the rider was lost;
For want of a rider, the battle was lost;
For want of a battle, the kingdom was lost!
A Good FMEA
Effects
Recommended
Improvement
Improved Rating
Sev* Freq Det RPN
Loss of Freon
96
32
96
Monitor temperature 4
with thermocouples.
16
1. Cumulative
fatigue
Loss of Freon
16
2. Poor
soldering
Loss of Freon
80
Sticky,
1. Dirt or
intermittent
foreign
objects
Loss of control
of temperature
Loss of control
of temperature
112
Electronic redundant 7
valve action and
fault identification
Stuck
closed
1. Component
failed
Loss of control
of temperature
2. Component
expansion
and
contraction
Loss of control
of temperature
Leakage
at the
joint
Valve
Causes
(failure
mechanism)
1. Corrosion
Update process
FMEA
*Severity ratings 8 to 10 request special effort in design improvement regardless of RPN rating
Reliability prediction
Sub FMEAs
Analysis for:
Safety
Maintainability
Logistics support
Mechanic training
FMEA is limited when multiple failure modes occur at the same time
Background
Concept
The tree shows the logical branches from a single failure at the top of
the tree to the root cause(s) at the bottom of the tree
Standard logic symbols connect the branches of the tree. For example,
gates permit or inhibit the passage of fault logic up the tree through
the events.
Fault tree does not necessarily contain all possible failure modes of the
components of the system. Fault tree contains only those failure modes
whose existence contribute to the existence of the top event.
Fault Tree
K in g d o m L o s t
O R
R o y a l F a m ily
A s s a s s in a t e d in
C oup
B u b o n ic P la g u e
S e r f R e b e llio n
O R
A N D
R a ts
R u n n in g
R am pant
M a jo r B a t t le is
Lost
T re a s o n
In fe c te d
F le a s
E nem y has
s u p e r io r w e a p o n s
(c ro s s -b o w s )
P o o r t a c t ic a l
d e c is io n
O R
C r it ic a l in p u t n o t
r e c e iv e d
In c o m p e te n c e
O R
N o h o rs e to c a rry
r id e r
O R
L o s s o f c o re
c o m p e te n c y
N e p o tis m
R id e r
becom es
lo s t
C
O R
O R
H o r s e in ju r e s le g
H o r s e k ille d
b y s n ip e r
H o r s e d ie s
of
m a ln u tr itio n
L o s s o f n a il
O R
H o r s e f a lls
in h o le
O R
H o r s e lo s e s s h o e
S hoe of poor
d e s ig n
N a il n o t
in s ta lle d
P o o r w o r k m a n s h ip
N a il m a te r ia l o u to f-s p e c
P re s s u re ta n k
ru p tu re
H o rn
S w it c h
+
C o n ta c ts
O v e r p r e s s u r e in
ta n k
B a s ic
ta n k
fa ilu r e
Tank
Pow er
S u p p ly
P u m p o p e ra te s to o
lo n g
C u rre n t fe d to
m o to r f o r t o o lo n g
Pum p
T im e r
S w itc h c lo s e d
t o o lo n g
+
Example 11 - FTA
development of an air
pumping system
B a s ic
c o n ta c ts
fa ilu r e
N o c o m m a n d to
o p e n c o n ta c ts
B a s ic
s w itc h
f a ilu r e
N o com m and
t o o p e n s w it c h
O p e r a t o r d id n o t
o p e n t h e s w it c h
+
B a s ic
tim e r
fa ilu r e
T im e r s e t s
t im e t o o
lo n g
B a s ic
s w it c h
fa ilu r e
N o com m and
to o p e ra to r
B a s ic
s w it c h
fa ilu r e
S econda r y a la r m
fa ilu r e
Uses
Supports study of the probability of occurrence for each of the root causes
System analysis via hardware fault tree may be linked to software fault tree
What is PRA?
Methodology
Development of
Initiating Events
Development of Information
Procedures
Test and maintenance
practices
Human reliability
Failure data
Drawings
Specifications
Success criteria
Support information
Human interaction
Sequence (scenario)
Development
System Analysis
Quantification
Dependent failure
analysis
Uncertainly
analysis
Risk calculations
Estimation of
Consequences
Risk
Value
Horse
Rider
Message Decision
Battle
Shoe
Stays On
(.85)
S (.85)
Loss of
Nail
Loss of
Shoe
(.15)
Horse
OK
(.95)
Horse
Lame (.05)
Rider
OK
(.80)
Rider
Lost
(.20)
Message
Received
(1)
Good
Decision
(.75)
Bad
Decision
(.25)
Battle
Won
(.80)
Battle
Lost
(.20)
S (.06)
F (.02)
F (.03)
F (.03)
F (.01)
.1
5% of Area
.2
.5
Probability of Event
.9
5% of Area
Shuttle
Element
Current Reliability **
Element
What ifs ?
Shuttle
SSME
1.0x10-03*
1/1000
5.0x10-4
1/2000
1/560
1/779
SRB
7.6x10-04
1/1312
3.8x10-4
1/2624
1/525
1/657
RSRM
1.8x10-04
1/5682
8.8x10-5
1/11,364
1/455
1/474
ET
1.4x10-04
1/7246
6.5x10-5
1/4,492
1/451
1/466
Orbiter
4.8x10-05
1/20,619
2.4x10-5
1/41,238
1/441
1/447
1/438
Network logic and paths may not be fully understood for the
desired repeatable outcome
References