Beruflich Dokumente
Kultur Dokumente
I. I NTRODUCTION
Recently there has been a growing interest in applying
widely linear (WL) filtering to speech enhancement [1], [2].
The benefit of using a WL filter compared to a linear filter
stems from the fact that speech enhancement algorithms often
operate in the frequency domain, which yields complex signals
with non-circular statistics. With a linear filter, due to circularity assumptions that are imposed, any non-circular second
order statistics are neglected which could result in suboptimal
solutions. Therefore in order to fully exploit the non-circularity
of the second order statistics a WL filter should be used.
In WL filtering a complex signal is augmented with its
conjugate and a filter is derived from the corresponding
compound signal. This can sometimes improve performance
in a mean squared error (MSE) sense but by no more than a
factor of 2 [3]. In fact with certain signal models, e.g., double
white, it can be shown that the WL filter offers no benefit to
the linear filter [4], [5].
In [6], [7], [8], [9], [10], WL filtering has been applied
to speech enhancement and echo cancellation in binaural
hearing aids or other stereo systems, where two real signals
are combined to form a single complex signal which is then
used in a WL framework. In this paper we show that while
this formulation presents a novel approach, it cannot result in
This research work was carried out at the ESAT Laboratory of KU Leuven,
in the frame of KU Leuven Research Council CoE EF/05/006 Optimization
in Engineering (OPTEC) and PFV/10/002 (OPTEC), Concerted Research
Action GOA-MaNet, the Belgian Programme on Interuniversity Attraction
Poles initiated by the Belgian Federal Science Policy Office: IUAP P7/19 Dynamical systems, control and optimization (DYSCO) 2012-2017, Research
Project iMinds, Research Project FWO nr. G.0763.12 Wireless Acoustic
Sensor Networks for Extended Auditory Communication. Alexander Bertrand
is supported by a Postdoctoral Fellowship of the Research Foundation Flanders
(FWO). The scientific responsibility is assumed by its authors.
The authors are with the Department of Electrical Engineering ESAT-SCD
/ iMinds - Future Health Department, KU Leuven, Kasteelpark Arenberg 10,
B-3001 Leuven, Belgium (e-mail: joseph.szurley@esat.kuleuven.be; alexander.bertrand@esat.kuleuven.be; marc.moonen@esat.kuleuven.be).
yk,m (t)
..
yk,m =
(3)
, k = 0, 1
.
yk,m (t L + 1)
(5)
(6)
aTk R1
yy ak
where
k =
eTk Rxx ek
.
eTk Rxx R1
yy Rxx ek
d0B
d0MWF
d1MWF
0 d0MWF
x1,0
1 d1MWF
Fig. 1. Graphical representation between the linear MWF and linear MVDR
solutions.
(10)
and using the definition of the response vector (9) the MVDR
filter is shown to be equivalent up to a scaling factor, k , to
the MWF (6), i.e.,
kMVDR = k R1
w
yy Rxx ek
d0A
(7)
R1
yy ak
x0,0
(11)
(12)
(13)
= k dkMWF .
The derivation of the linear MWF (6) and MVDR filter (11)
also holds for complex-valued signals, but then the transpose
operator T should be replaced by a transpose conjugation H in
every equation. However, complex signals allow to also exploit
the non-circularity of the signals if widely linear filtering
techniques are used instead [2], [3], [4].
In [6], [7], [8], [9], a complex signal vector is artificially
constructed from the real signals received at both arrays,
= y0 + jy1
y
(14)
y
=
y
(15)
and v
are
where denotes complex conjugation and where x
defined similarly. This augmented signal vector can easily be
shown to be a transform of the original signal vector in (4),
I jI
=
y
y.
(16)
I jI
This transformation may then be used to show the equivalence
between the estimated desired speech component found with
the WL filters and the estimated desired speech components,
(k = 0, 1), found with the linear filters when applied to real
signals.
A. Widely Linear Multi-Channel Wiener Filter
The WL-MWF of (6) is given as
MWF =
w
Ry1
x
e0
y
Rx
(17)
H }. The estimated
H } and Rx x = E{
xx
yy
where Ry y = E{
desired speech component using (17) is then given as
(18)
Rx x e0
.
T
e0 Rx x e0
(21)
Ry1
y
a
H Ry1
a
y
a
MVDR =
w
where
eT0 Rx x e0
.
eT0 Rx x Ry1
x
e0
y
Rx
I
jI
T I
Rxx
dMWF = e0
I
I jI
0
.
(26)
=
[1|j] 1
1 d1MVDR
The WL-MVDR output then corresponds to the linear MVDR
outputs up to a real-valued scaling with 0 and 1 (or to the
linear MWF up to a joint scaling with
).
It is noted that
eT0 Rxx e0 + eT1 Rxx e1
(27)
= T
1
T
e0 Rxx R1
yy Rxx e0 + e1 Rxx Ryy Rxx e1
and so
can also be computed from quantities available in
the linear filtering approach hence both approaches yield the
exact same information.
The WL-MWF gives the same estimates for x0,0 and x1,0
as the linear MWF. The WL-MVDR is obtained by equally
stretching the linear MWF solutions by .
If the dA
0 component
is stretched into something longer than x0,0 , then the dA
1 is
stretched into something shorter than x1,0 , and vice versa
A
because the two stretched components, dA
0 and d1 , now
(22)
and using the definition of the response vector (21) the WLMVDR filter is shown to be equivalent up to a scaling factor,
Ry1
x
e0
y
Rx
(25)
dMVDR =
eT0 Rx x Ry1
y
y
(28)
(23)
(24)
H
H
1
jI
I jI
jI
I jI
1 I
y
Ryy
jI
I jI
I jI
I jI
(29)
(19)
(30)
1
xy = x VxT Ryy
Ux x
(31)
where
= xy ux1,1 vx1,1
(32)
eT0 Rxx e0
x
=
1
T
xy
e0 Rxx Ryy Rxx e0
(33)
eT0 Rx x e0
x
.
=
1
T
xy
e0 Rx x Ry y Rx x e0
(37)