Beruflich Dokumente
Kultur Dokumente
Abstract—Face recognition is one of the most significant work and achievements were gained by an
successful applications of image analysis and understanding appearance based approach. This approach involves
and has gained much attention in recent years. Among machine learning techniques and algorithms. Again
many approaches to the problem of face recognition, there are number of machine learning algorithms like
appearance-based subspace analysis still gives the most
promising results. In this paper we study the three most
back propagation Neural Networks, Naïve Bayes,
popular appearance-based face recognition projection Support Vector Machine and ensemble technique like
methods (PCA, LDA and ICA). All methods are tested in Adaboost etc. Support Vector Machines based face
equal working conditions regarding preprocessing and detection methods have gained a considerable attention
algorithm implementation on the FERET and AT&T data during last few years. Some experiments are done for
set with its standard tests. We also compare the ICA method classification of face and non-face classes based on
with its whitening preprocess and find out that there is no kernel methods. In general we can divide the face
significant difference between them. When we compare recognition techniques into two groups: feature-based
different projection with different metrics we found out that approach and appearance-based approach. The
the LDA+COS combination is the most promising for all
tasks. The L1 metric gives the best results in combination
geometric feature-based approach uses properties of
with PCA and ICA1, and COS is superior to any other facial features such as eyes, nose, mouth, chin and there
metric when used with LDA and ICA2. Our results are relations for face recognition descriptors. Advantages of
compared to other studies and some discrepancies are this approach include economy and efficiency when
pointed out. achieving data reduction and insensitivity to variations
in illumination and viewpoint. However, facial feature
I. INTRODUCTION detection and measurements techniques developed to
date are not reliable enough for geometric feature-based
Face detection has gained significant importance during recognition. Such geometric properties alone are
the last few years. It has very many applications in the inadequate for face recognition because rich
field computer vision. The security organizations, information contained in the facial texture or
personal identification systems, law enforcement appearance is discarded. This problem tries to achieve
agencies, and monitoring applications can use this local appearance-based feature approaches. On the
technology. Human face detection has traditionally other hand, the appearance-based approach, such as
been a challenging task due to a number of factors like PCA, LDA and ICA based methods, has significantly
face size, image size, type and other conditions advanced face recognition techniques. Such an
involved. The still pictures containing faces may have approach generally operates directly on an image-based
different poses and angles. Also pictures and images representation. It extracts features into a subspace
differ with respect to resolution, camera lighting, and derived from training images. In addition those linear
contrast. Images have different properties and digital methods can be extended using nonlinear kernel
structure in case of gray scale and colored. The task techniques to deal with nonlinearity in face recognition.
even become more difficult when faces are occluded by Although the kernel methods may achieve good
other objects like mustaches, beards, glasses and masks performance on the training data, it may not be so for
etc. There are still lot of efforts to be carried out before unseen data owing this to their higher flexibility than
achieving optimal accuracy, robustness and linear methods and a possibility of overfitting therefore.
computational efficiency in detecting faces. Subspace analysis is done by projecting an image
A number of approaches are adopted for face into a lower dimensional space and after that
detection and these can be categorized in different recognition is performed by measuring the distances
ways. There is knowledge, invariant and templates between known images and the image to be recognized.
based methods developed. The face is so non-rigid and The most challenging part of such a system is finding
also when some have poses and gestures then most of an adequate subspace. In the paper three most popular
above said methods cannot work well and failed to appearance-based subspace projection methods will be
provide good enough accuracy. For last few years a presented: Principal Component Analysis (PCA),
Linear Discriminant Analysis (LDA), and Independent reduce this image space to RN = 403 . Reduced image
Component Analysis (ICA). Using PCA [3], a face space is much lower than original image space (m <<
subspace is constructed to represent “optimally” only N), in spite of that we retained 98.54% of original
the face object. Using LDA [4], a discriminant subspace information.
is constructed to distinguish “optimally” faces of
different persons. In comparison with PCA which takes A. Principle Component Analysis (PCA)
into account only second order statistics to find a
subspace, ICA [5] captures both second and higher- The PCA method [3] tends to find such s subspace
whose basis vectors correspond to the maximum
order statistics and projects the input data onto the basis
vectors that are as statistically independent as possible. variance direction in the original image space. New
We made a comparison of those three methods with basis vectors define a subspace of face images called
three different distance metrics: City block (L1), face space. All images of known faces are projected
Euclidean (L2) and Cosine (COS) distance. For onto the face space to find sets of weights that describe
the contribution of each vector. For identification an
consistency with other studies we used the FERET data
set [9], with its standard gallery images and probe sets unknown person, the normalized image of person is
for testing. Even though a lot of studies were done with first projected onto face space to obtain its set of
weights. Than we compare these weights to sets of
some of those methods it is very difficult to compare
the results with each other because of different weights of known people from gallery. If the image
preprocessing, normalization, different metrics and elements are considered as random variables, the PCA
basis vectors are defined as eigenvectors of scatter
even databases. Although the researcher used the same
database they chose different training sets. We also matrix ST:
noticed that the results of other research groups are
often contradictory. In most cases the results are given
only for one or two projection-metric combinations for
a specific projection method, and in some cases
researchers are using nonstandard databases or some B. Linear Discriminant Analysis (LDA)
hybrid test sets derived from standard database. Bartlett LDA method [4] finds the vectors in the underlying
et al. [5] and Liu et al. [10] claim that ICA outperforms space that best discriminate among classes. For all
PCA, while Beak et al. [11] claim that PCA is better. samples of all classes it defined two matrix: between-
Moghaddam [12] states that there is no significant class scatter matrix SB and the within-class scatter
difference. Beveridge et al. [13] claim that in their test matrix SW. SB represents the scatter of features around
LDA performed uniformly worse than PCA, Martinez the overall mean ¹ for all face classes and SW represents
[14] states that LDA is better for some tasks, and the scatter of features around the mean of each face
Navarrete et al. [15] claim that LDA outperforms PCA class:
on all tasks in their tests. The rest of the paper is
organized as follows: Section 2 gives brief description
of the algorithm to be compared, Section 3 includes
details of experimentation done, Section 4 consist the
results and compares results with other research groups
and Section 5 concludes the paper.
C. Training
To train the PCA algorithm we used M=1007 FERET
1. Distance measures images of c=504 classes (different persons). Each class
To measure the distance between unknown probe image contains a different number of persons. These numbers
and gallery images stored in database different distance vary from 1 to 10. Out of 1007 images in training set,
measures will be used. Manhattan (L1), Euclidean (L2) 396 of images are taken from the gallery (39% of all
and Cosine (COS) distance. Generally, for two vectors, training images) and 99 images are take from dup1
x and y distance measures are defined as: probe set (10% of all training images). The remaining
512 are not in any set used for recognition. The training
set and gallery overlap on about 33% and with dup1
probe set on about 14%. PCA derived, in accordance
with theory, M ¡ 1 = 1006 meaningful eigenvectors.
We adopted the FERET recommendation and kept the
top 40% of those, resulting in 403-dimensional PCA
subspace. In such way 98.54% of original information
(energy) was retained in those 403 eigenvectors. This
subspace was used for recognition as PCA face space
and as input to LDA and ICA (PCA was the
preprocessing dimensionality reduction step). For ICA ICA1+L1, but it can be stated that the remaining three
representation we also try to use more eigenvectors but projection-metric combinations (LDA+COS,
the performance was worse. We also confirm the ICA2+COS and PCA+L1) produce similar results and
findings in [8] that recognition performance is not no straightforward conclusion can be drawn regarding
different if we use only preprocessing step of ICA which is the best for specific task. ICA1 performance
method. In our case where the dimensionality of ICA was comparable to LDA and this confirms the
representation is the same as the dimensionality of PCA theoretical property of ICA1 that it is optimal for
the performance is the same for L2 and COS metrics recognizing facial actions. On the fc (the different
and for the L1 metrics the performance is not much illumination task) probe set LDA+COS and ICA2+COS
different. Besides of using time consuming ICA win. ICA1 is the worst choice, which is not surprising
methods we can use only preprocessing whitening step since ICA1 tends to isolate the face parts and is
(PCA I instead of ICA I and PCA II instead of ICA II). therefore not appropriate for recognizing images taken
Although LDA can produce a maximum of c¡1 basis under different illumination conditions. On the dup1
vectors we kept only 403 to make fair comparisons with and dup2 (the temporal change tasks) probe sets, again
PCA and ICA methods. After all the subspaces have LDA+COS wins and ICA1 is the worst, especially for
been derived, all images from data sets were projected the dup2 data set. ICA2+COS also did very good on
onto subspace and recognition using neighbor such difficult tasks. If we compare the metrics the L1
classification with various distance measures was gives the best results in combination with PCA and
conducted. ICA1. It can be concluded that COS is superior to any
other metric when used with LDA and ICA2. We found
IV. RESULTS it surprising that L2 is not the best choice in any of the
combinations, but in the past research it was the most
Results of our experiment can be seen in Tables. We frequently used metric. Fb probe set was found to be the
test all the projection-metric combinations. Since we easiest (highest recognition rates) and dup2 the most
implemented four projection methods (PCA, LDA, demanding (lower recognition rates), which is
ICA1 and ICA2) and three distance measures (L1, L2 consistent with [9], but in contradiction with Beak at al.
and COS) The best performance on each data set for [11] who stated that fc is the most demanding probe set.
each method is bolded. Also consistent with [9] is that LDA+COS outperforms
all others. Both [9] and [6], when comparing PCA and
1. fb probe set ICA, claim that ICA2 outperforms PCA+L2 and this is
PCA LDA ICA1 ICA2 what we
L1 41.2% 37.2% 37.5% 17.5% Face Recognition in Different Subspaces 9 also
L2 34.2% 40.1% 29.2% 27.2% found. As stated in [5], we also found that ICA2 gives
COS 32.5% 57.3% 30.5% 41.9% best result when combined with COS. We also agree
with Navarrete et al. [15] that LDA+COS works better
2. fc probe set than PCA. We agree with Moghaddam et al. [12] and
PCA LDA ICA1 ICA2 with Yang et al. [8] who stated that there is no
L1 78.6% 70.8% 80.6% 62.7% significant difference between PCA and ICA. We also
L2 76.5% 72.6% 78.4% 68.3% confirm the result in [8] that there is no significant
COS 75.68% 88.60 76.9% 82.8% performance difference between ICA and preprocessing
whitening PCA step.
3. dup1 probe set
PCA LDA ICA1 ICA2 V. CONCLUSION
L1 51.4% 48.1% 17.2% 32.6%
L2 11.5% 50.1% 9.7% 47.3% This paper presented comparative study of three most
COS 12.3% 69.8% 12.4% 74.50% popular appearance based face recognition projection
methods (PCA, LDA and LDA) and their accompanied
three distance metrics (City block, Euclidean and
4. dup2 probe set
Cosine) in equal working conditions. From our
PCA LDA ICA1 ICA2 comparative research we can derive that the L2 metric
L1 20.51% 22.3% 12.8% 8.2% is the most promising combination for all tasks.
L2 11.6% 30.4% 9.6% 18.5%
Although ICA1+L1 seems to be promising, except for
COS 11.1% 41.1% 8.3% 25.7% the illumination changes task where LDA+COS and
Tables shows Performance across four projection ICA2+COS outperforms PCA and ICA1. For all probe
methods and three metrics. The best projection metric sets the COS seems to be the best choice of metric for
combinations are in bold.On the fb (the different LDA and ICA2 and L1 for PCA and ICA1. LDA+COS
expression task) probe set the best combination is combination turned out to be the best choice for
temporal changes task. In spite of the fact that L2 [7] Hyv¨arinen, A., Oja, E.: Independent component analysis:
algorithms and aplications. Neural Networks. 13(4-5) (2000)
metric produced lower results it is surprising that it was 411–430
used so often in the past.We also tested only whitened [8] Yang, J., Zhang, D., Yang, J.Y.: Is ICA Significantly Better than
PCA preprocessing step of ICA method and it confirms PCA for Face Recognition? ICCV. (2005) 198–203
that there is no performance difference between ICA [9] Phillips, P.J., Moon, H., Rizvi, S.A., Rauss, P.J.: The FERET
Evaluation Methodology for Face-Recognition Algorithms,
and preprocessing PCA. IEEE Trans. on Pattern Recognition and Machince Intelligence,
22(10), (2000) 1090–1104
REFERENCES [10] Liu, C., Wechsler, H.: Comparative Assessment of Independent
Component Analysis (ICA) for Face Rcognition, Second
[1] Zhao, W., Chellappa, R., Phillips, P.J., Rosenfeld, A.: Face International Conference on Audio- and Video-based Biometric
Recognition: A Literature Survey, ACM Computing Surveys, Person Authentication, (1999) 22–23
(2003) 399–458 [11] Baek, K., Draper, B., Beveridge, J.R., She, K.: PCA vs. ICA: A
[2] Solina, F.,Peer, P., Batagelj, B., Juvan, S., Kovaˇc, J.: Color- Comparison on the FERET Data Set, Proc. of the Fourth
Based Face Detection in the ”15 Seconds of Fame” Art International Conference on Computer Vision, Pattern
Installation, International Conference on Computer Vision / Recognition and Image Processing, (8-14), (2002) 824–827
Computer Graphics Collaboration for Model-based Imaging, [12] Moghaddam, B.: Principal Manifolds and Probabilistic
Rendering, image Analysis and Graphical special Effects Subspaces for Visual Recognition, IEEE Trans. on Pattern
MIRAGE’03, (2003) 38–47 Analysis and Machine Inteligence, 24(6), 2002 780–788
[3] Turk, M., Pentland, A.: Eigenfaces for Recognition, Journal of [13] Beveridge, J.R., She, K., Draper, B., Givens, G.H.: A
Cognitive Neurosicence, 3(1), 1991, 71–86 Nonparametric Statistical Comparison of Principal Component
[4] Zhao, W., Chellappa, R., Krishnaswamy, A.: Discriminant and Linear Discriminant Subspaces for Face Recognition, Proc.
Analysis of Principal Components for Face Recognition, Proc. Of the IEEE Conference on Computer Vision and Pattern
of the 3rd IEEE International Conference on Face and Gesture Recognition, (2001) 535–542
Recognition, FG’98, (1998) 336 [14] Martinez, A., Kak, A.: PCA versus LDA, IEEE Trans. on
[5] Bartlett, M.S., Movellan, J.R., Sejnowski, T.J.: Face Pattern Analysis and Machine Inteligence, 23(2), 2001 228–233
Recognition by Independent Component Analysis, IEEE Trans. [15] Navarrete, P., Ruiz-del-Solar, J.: Analysis and Comparison of
on Neural Networks, 13(6), (2002) 1450–1464 Eigenspace-Based Face Recognition Approaches, International
[6] Draper, B., Baek, K., Bartlett, M.S., Beveridge, J.R.: Journal of Pattern Recognition and Artificial Intelligence, 16(7),
Recognizing Faces with PCA and ICA, Computer Vision and 2002 817–830
Image Understanding (Spacial Issue on Face Recognition),
91(1-2), (2003) 115–137