Beruflich Dokumente
Kultur Dokumente
2011
by Armand M. Makowski
ENEE 627
SPRING 2011
INFORMATION THEORY
CONVEXITY
Convex sets
A subset K of Rd is said to be convex if for any elements x and y of K, and any
in ]0, 1], we have
x + (1 )y K,
[0, 1]
[0, 1].
c
2011
by Armand M. Makowski
x, y K
(0, 1)
(1 x1 + . . . + p xp ) = 1 (x1 ) + . . . + p (xp )
Under the foregoing assumptions we have 0 < I < 1, so that the definition
X i
xI =
xi
I
iI
c
2011
by Armand M. Makowski
!
X j
X i
(xi ) + I c
(xj )
I
I
Ic
iI
j I
/
(3)
= 1 (x1 ) + . . . + p (xp ).
(xI )
and
(5)
(xI c )
X i
(xi )
I
iI
X j
(xj ).
I c
j I
/
However, because of (1) the inequalities leading to (3) must necessarily hold
as equalities, and this implies
(6)
(I xI + I c xI c ) = I (xI ) + I c (xI c ),
(7)
(xI ) =
X i
(xi )
I
iI
and
(8)
(xI c ) =
X j
(xj )
I c
j I
/
c
2011
by Armand M. Makowski
i, j = 1, . . . , p
i 6= j
Kullback-Leibler distance
Consider a set X of finite cardinality. With and pmfs on X , define
X
(x)
D(||) =
(x) log
x
(x)
with the conventions
0
0 log
= 0,
0
p
p log
= if p > 0
0
and
0
0 log
= 0 if q > 0
q
The proof of Theorem 2.6.3 revisited: Thus,
X
(x)
D(||) =
(x) log
x
(x)
X
(x)
=
(x) log
x: (x)>0
(x)
X
(x)
=
(x) log
x: (x)>0
(x)
X
(x)
(9)
log
(x)
x: (x)>0
(x)
X
= log
(x)
x: (x)>0
(10)
log 1 = 0
c
2011
by Armand M. Makowski
x: (x)>0
(x) = c
x: (x)>0
(x) = c
since
X
x: (x)>0
(x) =
(x) = 1.
Consequently, c = 1 and
X
x: (x)=0
(x) = 0,
whence (x) = 0 if and only if (x) = 0. In sum, (x) = (x) for all x in X .