Beruflich Dokumente
Kultur Dokumente
REPAIRABLE SYSTEMS
MARKOV PROCESSES
Enrique López Droguett
Associate Professor
Mechanical Engineering Department
University of Chile
e-mail: elopezdroguett@ing.uchile.cl
REPAIRABLE SYSTEMS MODELING
Non-Repairable Repairable
X(t)
1
D1 D2 D3
0 T1 T2 T3 t
AVAILABILITY
A(t ) = P [ X (t ) = 1]
• Note that if the system does not go through repair, A(t) = R(t)
U (t ) = 1 - A(t ) = P [ X (t ) = 0]
EXAMPLE – 1
• The data comes from a single system and the repair times are
considered small when compared to the operational times
0 0 0
1 177 177
2 242 65
3 293 51
4 336 43
5 368 32
6 395 27
7 410 15
EXAMPLE – 1
177 65 51 43 32 27 15
0 177 242 293 336 368 395 410
Time
• Sad System L
– Times between events decrease with time
– System is deteriorating
– Failures are more frequent with time
• Happy System J
– Times between events increase with time
– System is improving
– Failures are less frequent with time
EXAMPLE – 1
7
5
Number of Failures N(t)
0
0 50 100 150 200 250 300 350 400 450
Time [days]
MARKOV MODELS IN
RELIABILITY
l0®2
k=0 l0®1
l1®0 l1®2
k=1
l2®1 k=2
STOCHASTIC PROCESSES
ì ü
• Stochastic Process ïí X (t),t ÎT ïý is a collection of random
ïî ïþ
variables:
– For each t ÎT , X (t ) is a random variable
• If the index t is the time, then X (t) is the state of the process at
t
ìï ü
í X (t),t ³ 0ïý
ïî ïþ
STOCHASTIC PROCESSES
1 0
µ
WHAT A MARKOVIAN ANALYSIS IS?
• Example:
– Redundant systems in standby
– Systems with multiple failure modes
– Systems with some degraded states
MARKOV MODELS AS REPRESENTATION OF THE
DYNAMIC BEHAVIOR
• Describe the dynamics of a system:
– By a set of states in which that system can be in a given time
– Through the likelihood and rate of transitions among states
• System states:
Component 1 Component 2
System States
States States
0 Operational Operational
1 Failed Operational
2 Operational Failed
3 Failed Failed
EXAMPLE – 2: TWO COMPONENTS IN PARALLEL
• Markov Diagram
l0®1
0 1
l1®0
l2®3
2 3
l3®2
EXAMPLE – 2: TWO COMPONENTS IN PARALLEL
• Markov Diagram:
l0®1
0 1
l1®0
l0®3
l2®0 l0®2 l3®1 l1®3
l2®3
2 3
l3®2
MAROVIAN MODEL ANALYSIS
• Example:
– At t = 0, the system is in the initial state 0
– Determine the probability that the system is in one of the possible
states 0, 1, 2 or 3 at time t
ì ü
• The event ïí X (t) = j ïý means that the system is in state j at time t
ï
î ï
þ
é ù
Pj (t) = P êê X (t) = j úú
ë û
MARKOV PROCESSES
• Markovian Property:
– Given that the system is in state i at time t, X (t) = i , the future
states X (t +n ) do not depend on the previous states, with X (u) , u < t
é ù é ù
P êê X (t + v) = j | X (t) = i, X (u) = x(u)úú = P êê X (t + v) = j | X (t ) = i úú
ë û ë û
• Therefore:
– Degradation processes are not easily analyzed via Markov models
TRANSITION PROBABILITIES
• Properties:
Pij (t) ³ 0; t > 0
r
å Pij (t) =1; t > 0
j=0
Chapman-Kolmogorov Equations:
r
Pij (t + Δt)= ∑ Pik (t)⋅Pkj (Δt); t,Δt >0
k=0
CHAPMAN-KOLMOGOROV EQUATIONS
⎛ ⎞
Pij (t + Δt)≡ P ⎜⎜ X(t + Δt)= j| X(0)= i⎟⎟
⎝ ⎠
⎛ ⎞
= ∑ P ⎜⎜ X(t + Δt)= j, X(t)= k | X(0)= i⎟⎟
⎝ ⎠
k
⎛ ⎞ ⎛ ⎞
= ∑ P ⎜⎜ X(t + Δt)= j| X(t)= k, X(0)= i⎟⎟ ⋅P ⎜⎜ X(t)= k | X(0)= i⎟⎟
⎝ ⎠ ⎝ ⎠
k
⎛ ⎞ ⎛ ⎞
= ∑ P ⎜⎜ X(t + Δt)= j| X(t)= k ⎟⎟ ⋅P ⎜⎜ X(t)= k | X(0)= i⎟⎟
⎝ ⎠ ⎝ ⎠
k
= ∑ Pkj (Δt)⋅Pik (t)
k
TRANSITION RATES
• Transition rates λij (t) from state i to state j are defined as:
⎡ ⎤
P ⎢⎢ X (t + Δt)= j| X (t)= i⎥⎥
λij (t)= lim ⎣ ⎦
Δt→0 Δt
Pi j (Δt) d
λij = lim = Pij (0)= P!ij ⎛⎜ 0⎞⎟
Δt→0 Δt dt ⎝ ⎠
STATE EQUATIONS
r
P Dt =1- å Pjk çç Dt ÷÷
æ ö æ ö
As ç ÷
jj çè ÷
ø è ø
k =0
k¹ j
Thus,
é ù
ê ú
r ê r ú
P t +Dt = å Pik t × P Dt + P t × 1- å Pjk çç Dt
æ ö æ ö æ ö æ ö ê æ öú
ç ÷ çç ÷÷ ç ÷ ê ÷ú
ij çè ÷
ø è ø kj çè ÷
ø
ij ççè ÷÷
ø ê è
÷ú
ø
k =0 ê k =0 ú
k¹ j ê
êë
k¹ j ú
úû
STATE EQUATIONS
• Simplifying:
r r
P t +Dt - P t =-P t × å P Dt + å Pik æççt ö÷÷× Pkj çç Dt ÷÷
æ ö æ ö æ ö æ ö æ ö
ç ÷ ç ÷
ij çè ÷
ø
ij ççè ÷÷
ø ij ççè ÷÷
ø jk çè ÷
ø è ø è ø
k =0 k =0
k¹ j k¹ j
⎡ ⎤
⎢
⎢
−λ00 ! λr0 ⎥
⎥
⎢ ⎥
T= ⎢
⎢ " # " ⎥
⎥
⎢ ⎥
⎢
⎢
λ0r ! −λrr ⎥
⎥
⎣ ⎦
where
r
l jj = å l jk
k =0
k¹ j
STATE EQUATIONS: MATRIX NOTATION
• As r
l jj = å l jk
k =0
k¹ j
each state equation can be written as:
d P ⎡⎢ X(t)= j ⎤⎥ = λ ⋅P ⎡⎢ X(t)= k ⎤⎥
dt ⎢⎣ ⎥⎦ ∑ kj ⎢⎣ ⎥⎦
k
In matrix notation:
⎡ ⎤
⎡ ⎤ ⎢ ⎡ ⎤ ⎥
⎢ ⎡ ⎤ ⎥ ⎡ ⎤ ⎢ P X (t)=0
⎢ ⎥ ⎥
⎢
⎢
P ⎢⎢ X (t)=0⎥⎥ ⎥
⎥
⎢
⎢
−λ00 " λr0 ⎥
⎥
⎢
⎢
⎢⎣ ⎥⎦ ⎥
⎥
⎣ ⎦ ⎢ ⎥
d ⎢
⎢ !
⎥
= !
⎥ ⎢
⎢ # ! ⋅ ⎥
⎥
⎢
⎢ !
⎥
⎥
dt ⎢ ⎥
⎢ ⎥
⎢ ⎥
⎢
⎢
⎡ ⎤
P ⎢⎢ X (t)= r ⎥⎥ ⎥
⎥ λ0r⎢ " −λrr ⎥
⎢
⎢ ⎡ ⎤ ⎥
⎥
⎢ ⎣ ⎦ ⎥ ⎢
⎣
⎥
⎦
⎢ P ⎢⎢ X (t)= r ⎥ ⎥
⎥⎦ ⎥
⎢⎣ ⎥⎦ ⎢ ⎣
⎢⎣ ⎥⎦
EXAMPLE – 3: SYSTEM WITH TWO COMPONENTS
• System states:
Component 1 Component 2
System States
States States
1 Operational Operational
2 Failed Operational
3 Operational Failed
4 Failed Failed
• Markov Diagram
EXAMPLE – 3: SYSTEM WITH TWO COMPONENTS
• Objective:
– Find the system probability for each state as a function of time
– Pi (t) : probability that the system will be in state i at time t
Rs (t) = P1 (t)
• For state 1:
• For state 2:
• For state 3:
EXAMPLE – 3: SYSTEM WITH TWO COMPONENTS
• State probabilities:
• System reliability:
– Series system:
– Parallel system:
EXAMPLE – 4: STANDBY SYSTEM WITH SWITCHING
FAILURES*
• An active generator has a failure rate of 0.01/day. An older
standby generator has a failure rate of 0.001/day while in
standby and a failure rate of 0.1/day when on-line. There is a
10% chance of a switching failure. Determine:
– System reliability for a planned 30-day use
– System MTTF
– If there is no switching failures, find the system MTTF
P1(t) = e −( λ1 + λ2s )t
λ1 + λ2s − λ2 ⎣ ⎦
P3(t) = e − λ1t
−e
−( λ1 + λ2s )t
(1− p)λ1 ⎡ 1 1 ⎤
MTTF = λ1 + ⎢ − ⎥
1 λ1 + λ2s − λ2 ⎣ λ2 λ1 + λ2s ⎦
EXAMPLE – 4: STANDBY SYSTEM WITH SWITCHING
FAILURES*
Maintenance Types
Source: Rausand, M., Hoyland, A., System Reliability Theory, 2nd ed., Wiley, New York, 2004.
Overview
Physical Asset
Management
It should determine
Maintenance
Strategies
Mathematical
Corrective Preventive Modeling
Proactive
Including
For example
Restrictions
Decision Objectives
Inspection Variables
Scheduling For Example
Implies to For inspections, it means
wait for It should decrease Number
. of
Minimization
failure rates Maximize / Inspections
Intervals between Reach a level of
Failures Inspections
Costs
Impact Availability
Various Mean Times
Source: Rausand, M., Hoyland, A., System Reliability Theory, 2nd ed., Wiley, New York, 2004.
Downtime
• Unplanned downtime:
– Caused by an item failures and internal and external random events
• Human errors
• Sabotage
• Flooding
• Planned downtime:
– Caused by planned preventive maintenance, planned operations (change of
tools), holidays
– Scheduled planned downtime (e.g., PM)
– Unscheduled planned downtime: initiated by CM, incipient failures, etc.
TRANSIENT SOLUTION
r
å Pi (t) =1
i=0
EXAMPLE – 5
1 0
• State equations:
⎡ ⎤ ⎡ ⎤
− µ λ ⋅ P0(t) = P!0(t) ⎥⎥⎥
⎡ ⎤ ⎢
⎢ ⎥ ⎢ ⎥
⎢ ⎥ ⎢
⎢ ⎥ ⎢
⎢ ⎥
⎢
⎢
⎢⎣
µ −λ P1(t)
⎥
⎥
⎥⎦
P!1(t) ⎥⎥
⎢
⎢
⎥
⎥
⎢
⎢
⎣ ⎦ ⎢⎣ ⎥⎦
• Given that
P0(0) = 0 P1(0) =1
• Solving:
-µ P0*(s) + l P1*(s) = sP0*(s)
µ P0*(s) -l P1*(s) = sP1*(s) -1
• Thus
P0*(s) = 1s - P1*(s)
• Then
P1*(s) = 1 + µs × 1
l +µ +s l +µ +s
EXAMPLE – 5
P1*(s) = l × 1 + µ ×1s
l +µ l +µ +s l +µ
P1(t) = l e-(l +µ )t + µ
l +µ l +µ
1.0
0.9
0.8
λ = 0.01/h
0.7
µ = 0.001/h
0.6
0.5
0.4
0.3
0.2
0.1
0.0
0 100 200 300 400 500
Time [h]
EXAMPLE – 5
• Mean Availability:
A = tlim
®¥
A(t)
• Thus,
A = P1 = tlim P (t ) = µ
®¥ 1 l +µ
A= MTTF
MTTF + MTTR
IRREDUCIBLE MARKOV PROCESS
lim ! ⎛⎜ t ⎞⎟ =0
P
t→∞ j ⎝ ⎠
ASYMPTOTIC SOLUTION
• Note that the initial state of the system does not influence the
steady-state probabilities
VISIT FREQUENCY TO A STATE
⎛ ⎞
P ⎜⎜ ( X (t + Δt)= k)∩( X (t)= j)⎟⎟ =
⎝ ⎠
⎛ ⎞
P ⎜⎜ X (t + Δt)= k | X (t)= j⎟⎟ ⋅P( X (t)= j)= Pjk (Δt)⋅Pj (t)
⎝ ⎠
⎛ ⎞
P ⎜⎜ ( X(t + Δt)= k)∩( X(t)= j)⎟⎟ Pjk (Δt)⋅Pj (t)
υ dep (t)= lim ⎝ ⎠
= lim = Pj (t)⋅λ jk
jk Δt→0 Δt Δt→0 Δt
VISIT FREQUENCY TO A STATE
• In the steady-state:
υ dep
jk
lim Pj (t)⋅λ jk = Pj ⋅λ jk
= t→∞
• Total frequency of departures from state j in the steady-state is
r
υ j = Pj ⋅ ∑ λ jk = λ jj ⋅Pj
dep
k=0
k≠ j
• Mean Availability:
A= ∑ Pk
k∈G
• Mean Unavailability:
U =1− A= ∑ Pk
k∈F
FAILURE FREQUENCY AND REPAIR FREQUENCY
• Failure frequency: wF
– Expected number of visits to states in F from states in G per unit
time calculated after a long time
• Repair frequency: wR
– Expected number of visits to states in G from states in F per unit
time calculated after a long time
MEAN DURATION OF A SYSTEM FAILURE
• Mean time from when the system enters a failed state (F) until it
is repaired or restored into an operating state (G)
U = ω F ⋅TF
MTBF AND MTTF OF THE SYSTEM
MTBF = w1
F
∞
f (s) = ∫ e− st f (t) dt
0
• Some properties:
⎢⎣ 0 ⎥⎦ s
LAPLACE TRANSFORMS