Beruflich Dokumente
Kultur Dokumente
+ =
+ =
+ + + +
+ +
q
c
(1)
If both the multi-dimensional dynamic noise process } {
k
c
and dynamic measurement error series } {
k
q possess the
following properties:
(a) 0 } { = =
k k
E E q c
(b) 0 ) , cov( = k k q c
(c) ) , cov( ), , cov( ) ( ) ( k k k k k k R R c c q q c q = =
then the optimal estimator of the state vector
k
X in model
(1) can be expressed by the following recursive relationships:
+ =
+ =
=
+ =
+ + +
+ + + + + + +
+ +
+ + + + +
) ( 1 1 1
1
) 1 ( 1 1 1 1 1 1
) | ( 1 ) | 1 (
) | 1 ( 1 ) | ( 1 ) 1 | 1 (
) (
} {
k k k k k k k
k k k k k k k
k k k k k
k k k k k k k k
R A H K I A
R H H H K
X A X
E K X A X
c
t
q
t t
(2)
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 2, No. 12, 2011
38 | P a g e
www.ijacsa.thesai.org
These recursive relationships can intuitively express that
the best estimator
) 1 | 1 (
+ + k k X
of the state
1 + k
X at time
1 + k
t is
composed of two parts: the best estimator of the state
k
X at
time
k
t and the sampling innovation as follows
) | ( 1 1 1 ) | 1 (
k k k k k k k
X A H Y E
+ + + +
= (3)
Obviously, if the additional samples
1 + k
Y are normal
values, the sampling innovation
) | 1 (
k k E +
will make some correct
modification to predictor
) | 1 (
k k X +
with ratio
1 + k K
to get the next
optimal estimator
) 1 | 1 (
+ + k k X
. On the other hand, if the additional
samples
1 + k
Y are outliers, the resultant innovation
) | 1 (
k k E +
will
be abnormal and the abnormal information will result in an
erroneous modification to prediction
) | 1 (
k k
x
+
by the same ratio
1 + k
K , which lead to the filtering estimators deviating from
normal states.
In section V of this paper, an example is given to
substantiate the negative influence of outliers on the Kalman
filtering estimators of state vectors. Figure 1 plots the
simulation results, which indicates that the Kalman filtering of
state vectors is far from normal states when there are outliers in
sampling data series.
For a practical measurement system or device, outliers are
inevitable in sampling data because of faulty operations or
recording errors. In some cases, there may be complicated
abnormal measurement data existing in sampling processes,
such as step-type jumps or patchy outliers, etc. For example,
when the flight of a spacecraft was tracked by an impulse radar,
1~2 (occasionally as high as 5) of measurement data
display serious deviations from the trend formed by most other
samples. So it is very important to modify the Kalman filtering
algorithms or reduce the negative effects of outliers on Kalman
filtering estimators of state vectors.
III. TACTICS TO DEAL WITH OUTLIERS IN SAMPLE
Considering that it is very difficult to diagnose outliers in a
large quantity of data and that the Kalman filtering is still
widely used, a best effort is made in this paper to improve
fault-tolerance of the linear optimal recursive filtering
algorithm. The improved filters should possess the following
properties:
1) The algorithm should be recursive and easy to use;
2) When there are a few abnormal samples in the measured
data series, the filtering algorithm must have a strong
ability to overcome negative effects from abnormal data,
or must restrict the negative effects within prescribed
bounds;
3) When there are no abnormal samples, the algorithm is
capable of making full use of useful information to
achieve high filtering accuracy.
Considering these three restrictive conditions stated above,
a set of algorithms which are similar to the Kalman filter are
established as follows
=
u + =
+
+ + +
+ + + + + + +
) | 1 (
1
1 ) | 1 ( 1
) | 1 ( 1 1 1 ) | ( 1 ) 1 | 1 (
) (
~ ~
k k k k k k
k k k k k k k k k k
E G E r
E r K X A X
t
(3)
where the function series
)} ( { 1 u + k
are segment-wise smooth
and bounded,
1 + k
K is the gain and
1 + k
G is a weighting
matrix.
When
1 1 = u + k
and I G
k
=
+1
(identity matrix) are selected,
) 1 | 1 (
~
+ + k k
X in equation (3) is reduced to the conventional Kalman
filter. It is found that the main reason for outlier-tolerance of
conventional Kalman filter is that the function sequence {
1 1 = u + k
} potentially treat all of the innovations (normal and
abnormal) equally. This is the root cause why Kalman filter is
unable to deal with outliers. In order to endow a filter anti-
outliers capability, a sensible tactic is to select a suitable
1 + uk
which must decrease or diminish to zero when
1 + k
r increases.
IV. OPTIMAL SELECTION OF
k
u
In order to reduce the negative effects of outliers in state
filtering, some analyses of formulae (3) must be done as
follows:
) | 1 (
2 / 1
1
2
) | 1 ( 1 1
) | ( 1 ) 1 | 1 (
) | 1 ( ) 1 | 1 ( ) 1 | 1 (
) (
~ ~
~ ~ ~
k k k k k k k
k k k k k
k k k k k k
G K
X A X
X X X
+ + + + +
+ + +
+ + + + +
u =
=
A A
e e
(4)
where
) | 1 ( k k +
e =
) | 1 (
1
2
1
k k
k
E G
+
+
is one-step predicted weighted
residual.
For threshold constant series } {
1 + k
c used to control the
difference between the filtered values and the predicted values
of state vectors
1 + k
X
,
1 + uk
should be suitably chosen so as to
make sure that the following inequality holds
1
2 / 1
2
) | 1 ( 1
2
1 ) | 1 ( ) | 1 ( ) 1 | 1 (
)} ( {
~
+
+ + + + + + +
s
u s A
k
k k k k k k k k k k
c
X e e e
t
(5)
where
1 + k
is the maximum eigenvalue of matrix
1 1 1 + + + k k k K G K
t
.
It is easy to see that there are quite many different kinds of
function series } { 1 + uk
satisfying inequality (5).All of the
function series } { 1 + uk
are denoted as a set S:
} ), , 0 [ , ) ( , 1 : } {{
1
1
1 1 1
N k r
c
r r S
k
k
k k k
e + e s u s u u =
+
+
+ + +
(6)
The way to select function series } { 1 + uk
from the set S is
examined below.
Theorem 1For a linear stochastic system, when the noise
series } , , ; , , , {
2 1 1 0 0
A A q q c c x are stochastic sequences that
possess a normal distribution and are mutually independent, if
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 2, No. 12, 2011
39 | P a g e
www.ijacsa.thesai.org
the weighting matrix sequence are
) 1 ( 1 1 1 1 + + + + +
+ E =
k k k k k
R H H G
q
t
,
then the function series
S k e u + } { 1
which lead to the least
errors of filtering estimation are as follows:
>
s
= u
k
k
k
k
k
k
k
C
r
r
C
C
r
r
2
2
,
, 1
) (
(7)
where
k
C
is the suitably chosen threshold constant.
ProofIt follows from equations (2) and (3) that the
filtering error can be expressed as follows:
)
( )
~
( 2
~
~
~
1 ) 1 | 1 ( ) 1 | 1 ( ) 1 | 1 (
2
1 ) 1 | 1 (
2
) 1 | 1 ( ) 1 | 1 (
2
1 ) 1 | 1 ( ) 1 | 1 ( ) 1 | 1 (
2
1 ) 1 | 1 (
+ + + + + + +
+ + + + + + +
+ + + + + + +
+ + +
+
+ =
+ =
k k k k k k k
k k k k k k k
k k k k k k k
k k k
X X X X E
X X E X X E
X X X X E
X X E
t
Using the basic properties on conditional probability
distribution , the following results may be deduced
0
)} , , | (
{ )
~
(
} , , | )
{( )
~
(
} , , | )
( )
~
( {
)
( )
~
(
1 1 1 ) 1 | 1 ( ) 1 | 1 ( ) 1 | 1 (
1 1 1 ) 1 | 1 ( ) 1 | 1 ( ) 1 | 1 (
1 1 1 ) 1 | 1 ( ) 1 | 1 ( ) 1 | 1 (
1 ) 1 | 1 ( ) 1 | 1 ( ) 1 | 1 (
=
=
=
=
+ + + + + + + +
+ + + + + + + +
+ + + + + + + +
+ + + + + + +
k k k k k k k k
k k k k k k k k
k k k k k k k k
k k k k k k k
Y Y X E X X X E
Y Y X X E X X E
Y Y X X X X E E
X X X X E
A
A
A
t
t
t
t
Note that the project property of Kalman filtering has been
used in the last equation of the above expression. So the
following result can be obtained:
2
1 ) 1 | 1 (
2
) 1 | 1 ( ) 1 | 1 (
2
1 ) 1 | 1 (
~
~
+ + + + + + +
+ + +
+ =
k k k k k k k
k k k
X X E X X E
X X E
(8)
In order to investigate how to minimize the error in
equation (8), equations (2) and (3) are used to deduce the
following expression:
}
)] ( 1 {[
~
2
) | 1 ( 1
2
1
1
2
) 1 | 1 ( ) 1 | 1 ( k k k k
k
k k k k E K r E X X E + + +
+
+ + + + u =
(9)
It is obvious that function series S
k
e u } { which were
prescribed in the equation (7) minimize the equation (9).
In the following part of this section, we will discuss how to
calculate weigh matrix sequence } { 1 + k G and threshold
constants
1 + k
C ( A , 2 , 1 = k ).
Hypothesis 1. Suppose that } {
k
c and } {
k
q are two
independent noise sequences and they satisfy following
relationships:
a).
t t
q c
0 0 0 0 0 0 0
, , 0 , 0 x x R x Ex x Ex E E
k k
+ = = = = ;
b). ) ( ), (
) ( ) (
k k R E k k R E
k k k k k k
= = o q q o c c
q
t
c
t
;
c). 0 , 0 , 0 0 0
= = = k k k k
Ex Ex E
t t t
q c q c
Hypothesis 2 Suppose that } {
k
c and } {
k
q are two
independent noise sequences and they satisfy following
distributions:
) , 0 ( ~ ); , 0 ( ~ ); , ( ~
) ( ) ( 0 0 0 i i i i
R N R N R x N x
q c
q c
Lemma 1 If a linear stochastic system defined by
equation (1) satisfies hypothesis 1 and hypothesis 2, then
stochastic sequence { ; , , ,
0 0 k
x c c A } , ,
1 k
q q A obeys the
law of a multivariate normal distribution and their joint
covariance matrix
a
C is as follows:
) ( ) (
) ( 0
0 ) (
2 1
2
1
) (
k C k C
k C
k C
C
a a
a
a
k a
+ =
|
|
.
|
\
|
=
&
(10)
where operator +
&
represents the block-diagonal matrix
formed by the two block matrices on either side of the operator;
) ( 2 k Ca
is the covariance matrix of the measured noise sequence,
which can be expressed as the inverse matrix of the tri-diagonal
matrix A=
k k j i
a
) (
,
which is defined as follows
= > =
=
=
=
+ =
+ =
) ,..., 1 , , 2 ( 0
,
1
) ( , 1
1
) ( 1 ,
1
) 1 ( ,
1
) (
1
) 1 ( ,
1
1
) 1 ( 1
1
0 1 , 1
k j i j i a
A R a
R A a
R a
A R A R a
A R A R a
j i
i i i i
i i i i
k k k
i i i i i i
c
c
t
c
c
t
c
c
t
&
&
(11)
In formula (11) matrix ) (
2
k C
a
is the covariance matrix of
the measured noise sequence } {
k
q
:
} , , { diag ) (
) ( ) 1 ( 2 k a
R R k C
q q
A =
Lemma 2 If the linear stochastic system defined by
equation (1) satisfies hypothesis 1 and hypothesis 2, the joint
distributions of sequence { ; , , ,
1 0 k
x x x A } , ,
1 k
y y A are
normal and their covariance matrix is given by the following
relationships
(
(
(
+
=
1 1 1
1 1
a a a
a a
b
C H HC HC
H C C
C
t
t
M
A A A
M
,
(
(
(
=
k H
H
H
A
M M M
A
0 0
0 0 1
where the superscript t denotes transpose of a matrix.
Analyzing these two lemmas described above, it is found
that the joint distribution of sequence { } , ,
1 k
y y A is also
normal.
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 2, No. 12, 2011
40 | P a g e
www.ijacsa.thesai.org
If the linear stochastic system defined by equation (1)
satisfies those two hypotheses, denoting the joint covariance
matrix of { } , ,
1 k
y y A by
) (k D
and the covariance matrix of
) | 1 (
k k E +
by
1 +
O
k
it is easy to verify the properties of Kalman
filtering algorithms that
) (
1
) ( ) ( 1 1 k d k k d k k
D D D d
+ +
= O
t
(13)
where ) , cov(
1 1 1 + + +
=
k k k
y y d , the auto-variance matrix of
stochastic vector
1 + k
y ; matrix
) (k d
D is the covariance matrix
of { } , ,
1 k
y y A and
1 + k
y .
Weighting matrix
1 + k
G and weighted residual
) | 1 ( k k+
e
can be calculated as follows
O =
= O =
+
+ +
+ + +
) | 1 (
2 / 1
1 ) | 1 (
) (
1
) (
) ( 1 1 1
k k k k k
k d
k
k d k k k
E
D D D d G
e
t
(14)
It can be proven that
) | 1 ( k k + e follows the standard
multivariate normal distribution. Supposing that the measured
information is an m-dimensional vector, the weighted residual
) | 1 ( k k+
e is an m-dimensional standard normal variable, the
norm of which is equal to
1 + k
r :
) ( ~
2
) | 1 (
1
1 ) | 1 ( 1
m E E r
k k k k k k
_
t
+
+ + +
O = (15)
where ) (
2
m _ denotes the _ distribution of degrees-of-
freedom m.
Since the random variable
1 + k
r satisfies the distribution
) (
2
m _ , the threshold constant
) (m c
o
satisfies
o
o
= >
+
)) ( (
1
m c r P
k
Generally parameter o is taken as 0.05 or 0.025. If
) (
1
m c r
k
o
s
+
, it is believed with (1-o )100% confidence that
the additional information from the measured data
1 + k
y is
reasonable; otherwise, when ) (
1
m c r
k
o
>
+
, it is also believed
with o 100% confidence that the additional information
from the measured data
1 + k
y is unreasonable and hence this
additional information must be discarded.
From the above analysis and considering equation (6), it is
known that
1 + k
C can be given by
2 / 1
1 1
)) ( ( m c C
k k
o
+ +
= (16)
where
1 + k
is the maximum eigenvalue of matrix
t
1 1 + + k k
K K .
In specific applications in engineering,
1 + k
C may be chosen
with our experience.
V. NUMERICAL SIMULATION
Supposing that the coefficient matrices and covariance
matrix of errors defined by equation (1) are as follows
t
|
|
|
.
|
\
|
=
(
(
(
=
7 . 0 0
2 . 0 0
0 9 . 0
,
0 . 1 3 . 0 0
0 0 . 1 2 . 0
0 0 0 . 1
k k
H A
and the covariance of the model errors are respectively equal to
=
=
} 1 . 0 , 1 . 0 { diag
} 5 . 0 , 5 . 0 , 5 . 0 { diag
) (
) (
k
k
R
R
q
o
Using Monte Carlo method and selecting the initial states of
the system as the following
=
=
=
3 . 2
5 . 1
3 . 1
) 0 | 0 ( 3
) 0 | 0 ( 2
) 0 | 0 ( 1
x
x
x
(
(
(
=
01 . 0
01 . 0
01 . 0
) 0 | 0 (
.
one hundred pieces of simulation data are generated and
denoted by the set S. Let deviations of the 50
th
and 75
th
pieces
of data be
) 75 , 50 ( 5 ) 1 ( , 100 ) 1 (
1
2 1
= = A = A
+
i y y
i i
The new set with these two outliers is denoted by S*. The
estimates on S* made by equations (1) and (2) are shown in
figures 1 and 2.
(a) Filtering Estimation of
1 X
(b) Filtering Estimation of
2 X
0
400
800
1 9 17 25 33 41 49 57 65 73 81 89 97
0
40
80
1 11 21 31 41 51 61 71 81 91
(IJACSA) International Journal of Advanced Computer Science and Applications,
Vol. 2, No. 12, 2011
41 | P a g e
www.ijacsa.thesai.org
(c) Filtering Estimation of
3 X
Figure 1 Kalman Filters of State Variables
3
R X e
(a) Filtering Estimation of
1
X
(b) Filtering Estimation of
2
X
(c) Filtering Estimation of
3 X
Figure 2 Fault-tolerant Filters of State Variables
3
R X e
It can be seen clearly that the outliers have a very negative
effect on conventional Kalman filtering and the outliers-
tolerant modification method proposed in the paper is capable
of overcoming this negative effect and is reliable.
ACKNOWLEDGEMENTS
The research was supported by the Sweden Institute Grant
(SI 05483/2005-210), the National Natural Science Foundation
of China (No.61074077) and the Tianyuan Fund of National
Natural Science Foundation of China (No.11026224).
The first author would like to thank Dean Ingrid Melinder
of the Computer Science and Communication (CSC) of the
Royal Institute of Technology (KTH) for her friendly help
when he visited the CSC of KTH of Sweden.
REFERENCES
[1] Hu Shaolin, Fan Jingcheng. Bounded Influence Filter of the Dynamic-
Measurement System. Control Theory and Applications1993.10(1):
36~45
[2] Han Congzhao, Wang Yuejuan, Wan Baiwu. Stochastic System Theory.
Press of the Xian Jiaotong University1987
[3] Martin D, Mintz. Robust Filtering and Prediction for Linear System
with Uncertain Dynamics a game theoretic approach.IEEE Trans.
Automatic Control,1983, AC-18
[4] Martin D. Robust Method for Time SeriesAcademic Press Inc
1980
[5] Hu Shaolin,Wang Xiaofeng,Karl Meinke,Huajiang Ouyang.Outlier-
tolerant fitting and online diagnosis of outliers in dynamic process
sampling data series.in Artificial Intelligence andComputational
Intelligence, pp.195-204, LNAI 7004(Deng Miao,Lei Wang,eds)
Springer Press,2011
[6] F Carlos,S Alcala, J Qin. Unified Analysis of Diagnosis Methods for
Process Monitoring. In: Proc of the 7
th
IFAC Symp on Fault Detection,
Supervision and Safety of Technical Processes, Spain,2009,1007-1012
[7] Nihal Y, Bovas A, MacGregor J. A kalman filter in the presence of
outliers Communications in Statistics - Theory and Methods Volume 20,
Issue 5-6, 1991
[8] Durovic, Z.M., Kovacevic, B.D.: Robust estimation with unknown noise
statistics. IEEE Transactions on Automatic Control 44 ,12921296,1999
[9] Chan, S.C., Zhang, Z.G., Tse, K.W.: A new robust Kalman filter
algorithm under outliers and system uncertainties. In: IEEE International
Symposium on Circuits and Systems. IEEE,43174320,2005
[10] Jo-Anne Ting, Evangelos Theodorou, Stefan Schaal. A Kalman Filter for
Robust Outlier Detection. Computational Learning & Motor Control Lab
University of Southern California,IROS,2007
0
3
6
1 11 21 31 41 51 61 71 81 91
0
400
800
1 8 15 22 29 36 43 50 57 64 71 78 85 92 99
0
40
80
1 11 21 31 41 51 61 71 81 91
0
3
6
1 11 21 31 41 51 61 71 81 91