April 2, 2007 14:42 World Scientific Review Volume - 9in x 6in Main˙WorldSc˙IPR˙SAB
Nontensor-Product-Wavelet-Based Facial Feature Representation 209
as fingerprint recognition, face recognition is much more difficult because
there are usually many individuals(classes), only a few images (samples) per
person, so a face recognition system must recognize faces by extrapolating
from the training samples. Various changes in face images also present a
great challenge, and a face recognition system must be robust with respect
to the many variabilities of face images such as viewpoint, illumination,
and facial expression conditions.
Many novel attempts have been made to face recognition research
since the late 1970s
[
25,26
]
. There are two major approaches for vision
research: geometrical local feature-based (e.g. relative positions of eyes,
nose, and mouth.) schemes and holistic template-based systems and their
variations
[
27
]
. The geometrical-based approach performs successfully in
accurate facial feature detection scheme. However, it remains limited
applications because of its difficult implementation and its unreliability
in some cases. Compared to this approach, template-based approach is
more promising due to its ease of implementation and robustness. In
holistic template-matching systems, attempts are made to capture the
most appropriate representation of face images as a whole and exploit the
statistical regularities of pixel intensity variations. Principal Component
Analysis (PCA)
[
22
]
and Linear discriminant Analysis (LDA)
[
13,22
]
are
the two most classical and popular methods. The PCA is a typical
method, which faces are represented by a linear combination of weighted
eigenvectors, known as eigenfaces
[
1
]
. The LDA obtain features through
eigenvector analysis of scatter matrices with the objective of maximizing
between-class variations and minimizing within-class variations. These two
methods both provides a small set of features that carry the most relevant
information for classification purposes. However, the PCA usually give
high similarities indiscriminately for two images from a single person or
from two different persons and the LDA is also complex as there is a lot of
within-class variation due to differing facial expressions, head orientations,
lighting conditions, etc. Although many improving approaches have been
proposed based on the two methods such as kernel PCA (KPCA)
[
16
]
and
kernel Fisher’s discriminant analysis (KFDA)
[
8,12,24
]
whichusedkernel
skills, the essence problem has not been solved.
As we all known, the main challenge in feature representation is to
represent the input data in a reduced low-dimensional feature space, in
which, the most facial features are revealed or kept. Wavelet transform
has much more advantages on this point. Compared to the PCA and LDA
projections, wavelet subband coefficients can efficiently capture substantial
facial features while keeping computational complexity low. It is well known
to all that wavelet transform has a robust multi-resolution capability which
accords well with human visual system. Moreover, it provides a spatial