Beruflich Dokumente
Kultur Dokumente
0
0
0
0
Ale Hladnik
r 1
0
0
05_Hladnik_Formulas
1
0
r 1 r 2 n 0
Keywords: principal component analysis, image compression, face recognition, image processing, computer vision
Principal component analysis (PCA) is one of the most widely implemented tools for dimensionality reduction or
data exploration used in a variety of scientific and engineering disciplines. It transforms a number of possibly correlated variables into a smaller number of new variables, known
as principal components. 1Since a digital image
05_Hladnik_Formulas
introduction the paper will focus on two typical image processing/computer vision applications of PCA. First,
choice of the number of extracted PCs affects the image compression ratio and consequently the image quality.
0 0 0in range
and accesscontrol.
01 content
cluding law enforcement and surveillance, personal identification, video
indexing,
0
0
0
2 extraction
0 feature
0
0 fromthe face
0 region
Individual steps involve segmentation detection of faces from ascene,
2
05_Hladnik_Formulas
followed by either recognition or verification. MATLAB implementation
based
available
of PCA
on
a freely
face
05_Hladnik_Formulas
r r 00
0 0
00 00
05_Hladnik_Formulas
05_Hladnik_Formulas
05_Hladnik_Formulas
SS05_Hladnik_Formulas
05_Hladnik_Formulas
0
00 1r r11
00
0 0
mxn
mxm
mxn
VT
S=
nxn
n 1
56
11
nn11
0
0
0
0
0
0
0
0
0
0
r
0
0
0
0
0
0
0
r 1
0
0
n
0
11
nn11
video content
indexing, and entrance security. Individual
steps involve
segmentation detection of faces from a
by either recognition
or verification. The main idea of using PCA for face recognition is to transform face images
into a small set of characteristic feature images, known
as eigenfaces, which are the PCs of the training set face
images. Recognition is performed by projecting a new
image into the subspace spanned by the eigenfaces
(face space) and then classifying the face by comparing its position in face space with the positions of
known individuals (Turk and Pentland, 1991).
ranks.
n n00 r r11r r2 2
values
singular
or
compression
ratio (CR) and reduced file size. CR can be
0
0
0
0
r2 2higher
r
r
r
1
1
n
n
r
r
2
2
n
n
1 12 2
11rr0202
r r00
Before applying SVD to image compression,
original
computed according to the following formula:
1
transimage matrix
A = XT has to be appropriately
11nnr r0202
r r11r r2 2
T
Z = n X
While in lossless (reversible) compression typical CR
nn11
nn11
n 1 1
is approximately 2:1 for natural scenes and somewhat
11
11
1
where we subtracted
the row
higher for document images, this ratio is much higher
average
from each
n
n
1
1
n
n
1
1
n
1
Thus, the
entry to ensure zero mean across the rows.
with lossy compression schemes, such as lossy JPEG.
matrix Z has columns
with
we
con- T 11
zero mean. Next,
1
1
1
following m x m matrix: ZTZ =
XT
XT
struct the
1.3 Face recognition
n 1
nn11
nn11
n 1
nn11
nn11
(1)
05_Hladnik_Formulas
05_Hladnik_Formulas
05_Hladnik_Formulas
05_Hladnik_Formulas
0
0
0
0
000 000
Image compression
1 00 0ui for
0
vectors
1.2
Column
= 1, 2,
n i = 1,
i for
1 0 i
0 , m
0 andv
101n 00
01 01 00 0
0000 00
orthonormal
Due
2, , n form
sets.
S
is
an
m
n
diagonal
0 00
000 000
to the large
00 0 0 002 2
0
0
0
00image
0000r 2020
0000still
video
applications, compression is an
matrix. The diagonal
entries, (i = 1, 2, , 0n)0ofS0are
00 and
i S 2 2
called
matrix
be
almost
inevitable
step
towards the reduction of the
1 00 of
01 01A. 0It0can
achieved
0
0
00 r
00000 000
1values
singular
1
0 0
000 00
that:
storagespace
or transmission time requirements.
00
image
0
0
0
0
0
0
0
0
0
0
0
0
S
00 00 0
r 0r 0 00 00
r
22
22
0 0 0
S
While
lossy compression
standards such as JPEG offer a
0
0 SS r r
0 0 000 r 1 SS
0
ntradeoff
r r11
r0
0
0
0
0
0
0
0
0
0
0
0000
between
1
r
22 0
11
r r11
00 00
0
and
(2)
compression
methods run length encoding,
0rr 000
r0r00 000
00 0
0lossless
S
S
0
rSS1
0
0
00r 2 000
n 00
LZW,
Chain
codes
and
others no information reduction
n
0
0
000
0000
r 1
0n n 00
0000 00 0
0
n n
r 2 n 0 r r11 0 n r0r011
place, but
takes
0
0 0 0 0
0 0
0 00 00
0000 00 0
0000large.
00
of the important
00
One
properties
of SVD
is that the
0
0
0
0
0
0
0
0
0
0
0
0
0
0
r n 0
2
n
case
in image
singular1values.
is
generally
image and are exploited in image compression tech00 it
0
0000 00
00
00Since
0000 00the
compression
applications that the singular
niques: coding-, interpixel- (spatial) and psychovisual
ofr a2 n 0
values
n1
1
2 2
with
r rincreasing
00
rank,
1 1r1it
11rr0202
r r00 redundancy. In practice, both non-redundant and
1
1
matrixdecrease
quickly
is2 pos2
1. Introduction
A = USVT
57
Scene
Canon
EOS 400D
8-bit GS
image
3648 x 2736 px
SSIM
PSNR
Man
Image quality
assessment
CR
Cropping
Hedgehog
Buildings
Lossy
compressed
image
Downsampling
PCA
1024 x 1024
px
05_Hladnik_Tables
Table 1: Comparison of the three 1024 x 1024 test images in terms of the number of PCs, compression
ratio, SSIM index and PSNR
Figure 3: Face96 training (top) and testing (bottom) sets used in the face recognition investigation.
58
No. of
PCs
10
20
30
40
50
100
200
512
SSIM index
PSNR
CR
Hedgehog
Buildings
Man
Hedgehog
Buildings
Man
48.8
25.0
16.8
12.6
10.1
5.1
2.6
1.0
0.34
0.50
0.62
0.70
0.76
0.91
0.98
1.00
0.64
0.76
0.85
0.89
0.91
0.97
0.99
1.00
0.77
0.88
0.92
0.95
0.96
0.99
1.00
1.00
14.6
15.7
16.7
17.6
18.4
21.7
27.0
41.6
21.5
24.4
26.4
27.7
28.7
32.3
37.2
48.8
26.0
30.9
33.6
35.5
37.0
41.2
45.3
53.9
200
512
2.6
1.0
0.98
1.00
0.99
1.00
1.00
1.00
27.0
41.6
37.2
48.8
45.3
53.9
Table 1: Comparison of the three 1024 x 1024 test images in terms of the number of PCs,
compression ratio, SSIM index and PSNR.
Table 2: Recognition rate for the three datasets used in the face recognition study.
Original image
Recognition
Rate (RR)
1024 x 1024 px
Reconstructed /
compressed image
SSIM map
Reconstructed /
compressed image
SSIM map
50 PCs
10 PCs
30 PCs
100 PCs
Dataset
Ratio
Percentage
Faces94
11/12
92
Faces95
2/12
17
Faces96
12/12
100
Table patterned
2: Recognition
rate for of
theFaces96
three datasets
in the face
recognition
study.A Tutorial on Principal Component
Shlens,
J. (2003):
highly
background
images, used
RR was
Analysis; Derivation, Discussion and Singular Value
100%. The results will be presented in more detail in a
Decomposition, URL: http://www.cs.princeton.edu/
forthcoming article.
picasso/mats/PCA-Tutorial-Intuition_jp.pdf (last access:
4. Conclusions
3.5.2013).
Spacek, L. (2008): Collection of Facial Images, URL:
It is a well-known fact that the choice of the best
http://cswww.essex.ac.uk/mv/allfaces (last access:
compression method always depends on the actual ap3.5.2013).
plication. Among the most important factors to consider Turk, M.A./Pentland, A.P. (1991): Face Recognition
are input image characteristics (data type, previous
Using Eigenfaces, Proceedings of the IEEE Computer
processing), desired output (compressed) image quality,
Society Conference on Computer Vision and Pattern
compression scheme computational complexity and
Recognition, 586-591.
presence of artifacts (blocks, blur, edge noise). PCA is
Wang, Z./Bovik, A.C./Sheikh, H.R./Simoncelli, E.P.
one of the many possible approaches used to reduce im(2004): Image quality assessment: From error visibilage file size at the expense of its perceptual quality.
ity to structural similarity, IEEE Transactions on Image
PCA has also been successfully implemented in adProcessing, vol. 13 (4), 600-612.
vanced machine vision/pattern recognition applications
such as face recognition. As our study demonstrated,
(received for the first time: 01-10-12)
image/scene characteristics, e.g. background pattern or
human face scale, can decisively influence the effectiveness of a face recognition system.
5. References
Figure 4: Results of the Hedgehog image compression using 10, 30, 50 and 100 PCs.
60
Ale Hladnik
Dr., Ass. Prof.,
Chair of Information and
Graphic Arts Technology,
Faculty of Natural Sciences
and Engineering,
University of Ljubljana,
Slovenia
ales.hladnik@ntf.uni-lj.si
61