Yanushkevich S.N., Wang P.S.P., Gavrilova M.L., Srihari S.N. (eds.) Image Pattern Recognition. Synthesis and Analysis in Biometrics


April 2, 2007 14:42 World Scientiﬁc Review Volume - 9in x 6in Main˙WorldSc˙IPR˙SAB

208 Synthesis and Analysis in Biometrics

three parts: first, by performing a 2-level wavelet transform with a new nontensor product wavelet filter, face images are represented by the lowest-resolution subbands after decomposition. Second, the Principal Component Analysis (PCA) feature selection scheme is adopted to reduce the computational complexity of the feature representation. Finally, to test the robustness of the proposed facial feature representation, Support Vector Machines (SVM) are applied for classification. The experimental results show that our method is superior to other methods in terms of recognition accuracy and efficiency.

Contents

8.1. Introduction

8.2. Construction of Nontensor Product Wavelet Filter Banks

8.2.1. Characteristics of Centrally Symmetric Orthogonal Matrix

8.2.2. Nontensor Product Wavelet Filter

8.2.3. Examples

8.3. Experimental Results

8.4. Conclusions

Bibliography

Glossary

DNWT — Discrete Nontensor product Wavelet Transform

DSWT — Discrete tensor product Wavelet Transform

KFDA — Kernel Fisher’s discriminant Analysis

KPCA — Kernel Principal Component Analysis

LDA — Linear discriminant Analysis

MRA — Multiresolution analysis

PCA — Principal Component Analysis

NFL — Nearest Feature Line algorithm

NN — Nearest Neighbor algorithm

SVM — Support Vector Machines

8.1. Introduction

Face recognition is an active research area with a wide range of applications, such as surveillance and security, telecommunication and digital libraries, human-computer intelligent interaction, and smart environments. Compared to classical pattern recognition problems such


Nontensor-Product-Wavelet-Based Facial Feature Representation 209

as fingerprint recognition, face recognition is much more difficult because there are usually many individuals (classes) but only a few images (samples) per person, so a face recognition system must recognize faces by extrapolating from the training samples. Variations in face images also present a great challenge: a face recognition system must be robust to the many sources of variability in face images, such as viewpoint, illumination, and facial expression.

Many novel approaches to face recognition have been proposed since the late 1970s [25,26]. There are two major approaches in vision research: geometrical local feature-based schemes (e.g., relative positions of eyes, nose, and mouth) and holistic template-based systems and their variations [ ]. The geometrical approach performs well when the facial features can be detected accurately. However, its applications remain limited because it is difficult to implement and unreliable in some cases. By comparison, the template-based approach is more promising due to its ease of implementation and robustness. In holistic template-matching systems, attempts are made to capture the most appropriate representation of face images as a whole and to exploit the statistical regularities of pixel intensity variations. Principal Component Analysis (PCA) [ ] and Linear Discriminant Analysis (LDA) [13,22] are the two most classical and popular methods. In PCA, faces are represented by a linear combination of weighted eigenvectors, known as eigenfaces [ ]. LDA obtains features through eigenvector analysis of scatter matrices with the objective of maximizing between-class variations and minimizing within-class variations. Both methods provide a small set of features that carry the most relevant information for classification purposes. However, PCA usually gives high similarities indiscriminately for two images from a single person or from two different persons, and LDA is also complex because there is a lot of within-class variation due to differing facial expressions, head orientations, lighting conditions, etc. Although many improved approaches based on these two methods have been proposed, such as kernel PCA (KPCA) [ ] and kernel Fisher's discriminant analysis (KFDA) [8,12,24], which use kernel techniques, the essential problem has not been solved.

As is well known, the main challenge in feature representation is to represent the input data in a reduced low-dimensional feature space in which most facial features are revealed or kept. The wavelet transform has clear advantages in this respect. Compared to the PCA and LDA projections, wavelet subband coefficients can efficiently capture substantial facial features while keeping computational complexity low. It is well known that the wavelet transform has a robust multi-resolution capability which accords well with the human visual system. Moreover, it provides a spatial and a frequency decomposition of an image simultaneously. When wavelet transforms are used as multiscale orthogonal representations of face images, different components of the wavelet decomposition capture different visual aspects of a gray-scale image. This multi-resolution analysis is provided in the form of a coefficient matrix.

Each face is described by a subset of band-filtered images containing wavelet coefficients. At each level of decomposition there are four orthogonal subimages corresponding to LL, LH, HL, and HH. By spatial frequency analysis, the image is represented as a weighted combination of basis functions, in which the high frequencies (LH, HL, HH) carry fine detail information and the low frequency (LL) carries coarse, shape-based information. Only a change in face will affect all frequency components. Earlier published research [ ] demonstrated that the effect of different facial expressions can be attenuated by removing the high-frequency components, and that the low-frequency components alone are sufficient for recognition. Consequently, an appropriate wavelet transform can yield representations that are robust with regard to lighting changes and capable of capturing substantial facial features while keeping computational complexity low. The wavelet transform has become increasingly popular in face representation in recent years, and good results have been obtained for race and gender classification [21,27].
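The subband idea described above can be illustrated with a minimal sketch. It uses the orthonormal tensor-product Haar filter, chosen here only because it is the simplest case; it is not the nontensor product filter of this chapter. The function names and the tiny test image are hypothetical, and the LH/HL naming convention differs between authors.

```python
def haar_level(image):
    """One level of an orthonormal 2D Haar transform (illustration only).

    `image` is a list of equal-length rows with even height and width.
    Returns the four half-size subbands (LL, LH, HL, HH).
    """
    ll, lh, hl, hh = [], [], [], []
    for r in range(0, len(image), 2):
        top, bottom = image[r], image[r + 1]
        ll_row, lh_row, hl_row, hh_row = [], [], [], []
        for c in range(0, len(top), 2):
            a, b = top[c], top[c + 1]
            d, e = bottom[c], bottom[c + 1]
            ll_row.append((a + b + d + e) / 2)  # local average: coarse shape
            lh_row.append((a - b + d - e) / 2)  # difference across columns
            hl_row.append((a + b - d - e) / 2)  # difference across rows
            hh_row.append((a - b - d + e) / 2)  # diagonal difference
        ll.append(ll_row)
        lh.append(lh_row)
        hl.append(hl_row)
        hh.append(hh_row)
    return ll, lh, hl, hh


def lowest_subband(image, levels=2):
    """Keep only the LL subband after `levels` successive decompositions."""
    for _ in range(levels):
        image = haar_level(image)[0]
    return image


def energy(band):
    """Sum of squared coefficients of one subband."""
    return sum(v * v for row in band for v in row)


face = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
print(lowest_subband(face, levels=2))  # [[34.0]]
```

Because the transform is orthonormal, the four subbands together carry exactly the energy of the input, while the LL band alone has a quarter of the samples per level. This is the sense in which keeping only the lowest-resolution subband reduces dimensionality.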

Almost all of the literature on wavelet-based face recognition uses two-dimensional tensor product wavelets, that is, tensor products of one-dimensional wavelets. However, the anisotropy of tensor product wavelets makes them less attractive for facial representation [9,4]. A nontensor product wavelet, that is, one whose scaling function and associated wavelet function cannot be written as products of one-dimensional ones, can reveal more features than the commonly used tensor product wavelet transform [4,5,10,11,20]. Therefore, we propose to represent facial features by the discrete nontensor product wavelet transform (DNWT) in this chapter.

Much effort has been spent on constructing nontensor product wavelets. However, up to now, there is no systematic method for constructing two-dimensional nontensor product wavelets [9,4,20]. In this chapter, we present a novel method for constructing nontensor product wavelet filters. New nontensor product bivariate wavelet filter banks with linear phase are constructed from centrally symmetric matrices. Our investigations demonstrate that these filter banks have a matrix factorization and are capable of describing the features of face images. The new nontensor product filters derived from our method are applied to the feature representation of face images. To test the effect of the representation based on the nontensor product wavelet transform, we design a scheme that uses Support Vector Machines (SVM) for classification. The SVM is a powerful and relatively new machine learning approach with remarkable characteristics such as good generalization performance, the absence of local minima, and a sparse representation of the solution. It has become a popular research method in anomaly detection, and good applications have been reported [8,19]. Experiments are carried out on the widely used ORL face database. Our approach yields a significant improvement, in both recognition accuracy and processing time, over the discrete tensor product wavelet transform (DSWT) and the well-known conventional PCA and LDA methods.

This chapter is organized as follows. The construction of the new nontensor product bivariate wavelet filter banks is briefly described in Section 8.2. The feature vector selection algorithm and the experimental results are presented in Section 8.3. Finally, conclusions are drawn in Section 8.4.

8.2. Construction of Nontensor Product Wavelet Filter Banks

Multiresolution analysis (MRA) theory provides a natural framework for understanding wavelets and filter banks. According to MRA, refinable functions (scaling functions) and wavelets are completely determined by a low-pass filter and high-pass filters, respectively. In subband coding schemes, a low-pass filter and high-pass filters are used as analysis filters and synthesis filters, which together form perfect reconstruction filter banks. Daubechies [ ] designed univariate two-channel perfect reconstruction filter banks having finite impulse response (FIR), corresponding to a univariate orthonormal wavelet with compact support and vanishing moments. It is well known that, apart from the Haar wavelet, there does not exist an orthonormal symmetric wavelet with compact support in the univariate dyadic dilation case; that is, two-channel perfect reconstruction FIR filter banks having linear phase are not available in the univariate case.

Our interest here is in multivariate filter banks [ ]. A commonly used method builds multivariate filter banks from tensor products of univariate filters. This construction of filter banks focuses excessively on the coordinate directions.

Therefore, nontensor product approaches for the construction of multivariate filter banks or wavelets are desirable. Much interest has been given to the study of nontensor product wavelets in $L^2(\mathbb{R}^d)$ (see [5,20], and also [ ] for constructions of multivariate wavelets on invariant sets) as well as to multiwavelets and corresponding vector-valued filter banks [6,17,18]. However, it is not easy to design multivariate filter banks.

Editors' remark: The SVM was developed by V. N. Vapnik (V. N. Vapnik, "The Nature of Statistical Learning Theory", Springer, 1995).

At present, no general method is available for designing multivariate filter banks and vector-valued filter banks. There are two fundamental difficulties that one encounters in the design of a low-pass filter and high-pass filters which are used for the construction of refinable functions and wavelets, respectively.

Most current work on multivariate wavelets considers a dilation matrix with determinant two [11,23], since in this case only one high-pass filter needs to be constructed and the matrix extension is the same as in the univariate two-channel case [ ].

Often, one seeks filter banks leading to smooth wavelets. However, in the application of filter banks to texture analysis, experiments show that "smooth" filter banks are not suitable because texture images are not smooth. Here we describe a general construction of bivariate nontensor product wavelet filter banks with linear phase by using centrally symmetric matrices [3,4]. The family of filter banks given in this chapter is suitable in this context, although it is difficult to achieve smoothness. These filter banks have a matrix factorization and can be applied to facial representation.

8.2.1. Characteristics of Centrally Symmetric Orthogonal Matrix

We consider the following $n \times n$ centrally symmetric matrix $B = (b_{j,l})_{j,l=1}^{n}$, where

$$b_{j,l} = b_{n+1-j,\,n+1-l}, \qquad j,l = 1, 2, \ldots, n.$$

Similarly, a matrix $B = (b_{j,l})_{j,l=1}^{n}$ is called centrally anti-symmetric if

$$b_{j,l} = -b_{n+1-j,\,n+1-l}, \qquad j,l = 1, 2, \ldots, n.$$

The centrally symmetric and centrally anti-symmetric matrices of order $n$ are closely related to the special matrix $H_n$, which is defined by

$$H_n = \begin{pmatrix} 0 & 0 & \cdots & 1 \\ 0 & \cdots & 1 & 0 \\ \vdots & & & \vdots \\ 1 & 0 & \cdots & 0 \end{pmatrix}. \tag{8.1}$$

$B$ is a centrally symmetric matrix of order $n$ if and only if it satisfies the matrix equation

$$H_n B H_n = B. \tag{8.2}$$

Similarly, $B$ is a centrally anti-symmetric matrix if and only if the following matrix equation holds:

$$B = -H_n B H_n. \tag{8.3}$$

Note that, for a centrally anti-symmetric matrix $B = (b_{j,l})_{j,l=1}^{n}$ with odd $n$, the $\left(\left[\tfrac{n+1}{2}\right], \left[\tfrac{n+1}{2}\right]\right)$ element satisfies $b_{[\frac{n+1}{2}],[\frac{n+1}{2}]} = 0$. Throughout this note, we use the notation $[x]$ to denote the largest integer not exceeding the real number $x$.
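The characterizations (8.2) and (8.3) can be checked mechanically: multiplying by the anti-diagonal matrix H on both sides reverses the row order and the column order. The sketch below is an illustration only; the helper names and the two sample matrices are hypothetical and do not come from the chapter.

```python
def exchange_matrix(n):
    """H of order n: ones on the anti-diagonal, zeros elsewhere."""
    return [[1 if j + l == n - 1 else 0 for l in range(n)] for j in range(n)]


def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def flip_both(b):
    """Compute H B H, which reverses both the row and the column order of B."""
    h = exchange_matrix(len(b))
    return matmul(matmul(h, b), h)


def is_centrally_symmetric(b):
    """Test the matrix equation H B H = B."""
    return flip_both(b) == b


def is_centrally_antisymmetric(b):
    """Test the matrix equation H B H = -B."""
    return flip_both(b) == [[-v for v in row] for row in b]


symmetric = [[1, 2, 3], [4, 5, 4], [3, 2, 1]]
antisymmetric = [[1, 2, 3], [4, 0, -4], [-3, -2, -1]]
print(is_centrally_symmetric(symmetric))          # True
print(is_centrally_antisymmetric(antisymmetric))  # True
```

In the anti-symmetric sample the middle entry is zero, as the remark on odd order requires: that entry is mapped to itself by the double reversal, so it must equal its own negative.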

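To construct two-channel filter banks suitable for image processing, we need to consider the concrete construction of centrally symmetric orthogonal matrices of order 4, which corresponds to the case $n = 4$. In this case, any centrally symmetric orthogonal matrix $B$ has the general form

$$B = \frac{1}{2}\begin{pmatrix} I_2 & -H_2 \\ H_2 & I_2 \end{pmatrix}\begin{pmatrix} Z_1 & 0 \\ 0 & Z_2 \end{pmatrix}\begin{pmatrix} I_2 & H_2 \\ -H_2 & I_2 \end{pmatrix} \tag{8.4}$$

with $Z_1$ and $Z_2$ being orthogonal matrices of order 2. Let

$$S = \begin{pmatrix} 1 & 0 & 0 & -1 \\ 0 & 1 & -1 & 0 \\ 0 & 1 & 1 & 0 \\ 1 & 0 & 0 & 1 \end{pmatrix};$$

more precisely, we have the following equivalent form

$$B = \frac{1}{2}\, S \begin{pmatrix} a & b & 0 & 0 \\ c & d & 0 & 0 \\ 0 & 0 & e & f \\ 0 & 0 & g & h \end{pmatrix} S^{T} \tag{8.5}$$

with real numbers $a, b, c, d, e, f, g, h$ satisfying

$$a^2 + b^2 = c^2 + d^2 = 1, \qquad ac + bd = 0,$$

$$e^2 + f^2 = g^2 + h^2 = 1, \qquad eg + fh = 0.$$

The parametrized solutions of the above equations are $a = d = \cos\alpha$, $b = -\sin\alpha$, $c = \sin\alpha$, $e = h = \cos\beta$, $f = -\sin\beta$ and $g = \sin\beta$ for any real numbers $\alpha$ and $\beta$. Therefore, any centrally symmetric orthogonal matrix $B$ has the simpler parametrized representation

$$B = \frac{1}{2}\begin{pmatrix} 1 & 0 & 0 & -1 \\ 0 & 1 & -1 & 0 \\ 0 & 1 & 1 & 0 \\ 1 & 0 & 0 & 1 \end{pmatrix}\begin{pmatrix} \cos\alpha & -\sin\alpha & 0 & 0 \\ \sin\alpha & \cos\alpha & 0 & 0 \\ 0 & 0 & \cos\beta & -\sin\beta \\ 0 & 0 & \sin\beta & \cos\beta \end{pmatrix}\begin{pmatrix} 1 & 0 & 0 & 1 \\ 0 & 1 & 1 & 0 \\ 0 & -1 & 1 & 0 \\ -1 & 0 & 0 & 1 \end{pmatrix}. \tag{8.6}$$

Further, we define

$$Q_{(\alpha,\beta)} := B = \frac{1}{2}\begin{pmatrix} \cos\alpha + \cos\beta & -\sin\alpha + \sin\beta & -\sin\alpha - \sin\beta & \cos\alpha - \cos\beta \\ \sin\alpha - \sin\beta & \cos\alpha + \cos\beta & \cos\alpha - \cos\beta & \sin\alpha + \sin\beta \\ \sin\alpha + \sin\beta & \cos\alpha - \cos\beta & \cos\alpha + \cos\beta & \sin\alpha - \sin\beta \\ \cos\alpha - \cos\beta & -\sin\alpha - \sin\beta & -\sin\alpha + \sin\beta & \cos\alpha + \cos\beta \end{pmatrix}. \tag{8.7}$$

Thus we get a constructive characterization of centrally symmetric orthogonal matrices of order 4.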
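As a numerical sanity check of the factorization (8.6) and the closed form (8.7), the following sketch builds the matrix from the printed matrix S and two rotation blocks, then confirms that the result is orthogonal and centrally symmetric. It assumes a leading factor 1/2, which is what S times its transpose being twice the identity requires; the function names and the test angles are hypothetical.

```python
import math

# S as printed in the text; S times its transpose equals 2*I, hence the factor 1/2.
S = [[1, 0, 0, -1],
     [0, 1, -1, 0],
     [0, 1, 1, 0],
     [1, 0, 0, 1]]


def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(m):
    return [list(col) for col in zip(*m)]


def q_matrix(alpha, beta):
    """(1/2) * S * diag(R(alpha), R(beta)) * S^T, following (8.6)."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    cb, sb = math.cos(beta), math.sin(beta)
    rotations = [[ca, -sa, 0, 0],
                 [sa, ca, 0, 0],
                 [0, 0, cb, -sb],
                 [0, 0, sb, cb]]
    product = matmul(matmul(S, rotations), transpose(S))
    return [[0.5 * v for v in row] for row in product]


def closed_form(alpha, beta):
    """Entries of (8.7), typed in directly for comparison."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    cb, sb = math.cos(beta), math.sin(beta)
    rows = [[ca + cb, -sa + sb, -sa - sb, ca - cb],
            [sa - sb, ca + cb, ca - cb, sa + sb],
            [sa + sb, ca - cb, ca + cb, sa - sb],
            [ca - cb, -sa - sb, -sa + sb, ca + cb]]
    return [[0.5 * v for v in row] for row in rows]


def max_abs_diff(a, b):
    """Largest entrywise difference between two matrices of equal shape."""
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


q = q_matrix(0.3, 1.1)
identity = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
print(max_abs_diff(matmul(q, transpose(q)), identity) < 1e-12)   # orthogonality
print(max_abs_diff(q, [row[::-1] for row in q[::-1]]) < 1e-12)   # central symmetry
```

Both checks hold for any pair of angles, since the two 2 x 2 blocks are rotations and conjugation by S maps block-diagonal matrices to centrally symmetric ones.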

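In the following, we offer some examples of the construction of centrally symmetric orthogonal matrices of order 4, which play a crucial role in the design of nontensor product bivariate filter banks with two channels. The first case is to let $\alpha = \beta$. Letting $\alpha = \beta = \frac{\pi}{2}$ and $\alpha = \beta = \frac{\pi}{4}$ respectively, we have the following centrally symmetric orthogonal matrices of order 4:

$$Q_{(\frac{\pi}{2},\frac{\pi}{2})} = \begin{pmatrix} 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & 1 \\ 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \end{pmatrix}, \qquad Q_{(\frac{\pi}{4},\frac{\pi}{4})} = \frac{\sqrt{2}}{2}\begin{pmatrix} 1 & 0 & -1 & 0 \\ 0 & 1 & 0 & 1 \\ 1 & 0 & 1 & 0 \\ 0 & -1 & 0 & 1 \end{pmatrix}.$$

Considering the case $\alpha = 0$ and $\beta = \frac{\pi}{2}$, we get

$$Q_{(0,\frac{\pi}{2})} = \frac{1}{2}\begin{pmatrix} 1 & 1 & -1 & 1 \\ -1 & 1 & 1 & 1 \\ 1 & 1 & 1 & -1 \\ 1 & -1 & 1 & 1 \end{pmatrix}.$$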

8.2.2. Nontensor Product Wavelet Filter

Next, using the above, we develop a general method for constructing nontensor product bivariate wavelet filter banks.

Given a bivariate trigonometric polynomial

$$m_0(\xi,\eta) = \sum_{j\in\mathbb{Z}}\sum_{k\in\mathbb{Z}} a_{j,k}\, e^{-i(j\xi+k\eta)}, \qquad (\xi,\eta) \in \mathbb{R}^2,$$

its polyphase factors are the bivariate trigonometric polynomials $m_{0,l}$ defined for $l = 0, 1, 2, 3$ as

$$\begin{aligned}
m_{0,0}(\xi,\eta) &= \sum_{j\in\mathbb{Z}}\sum_{k\in\mathbb{Z}} a_{2j,2k}\, e^{-i(j\xi+k\eta)}, \\
m_{0,1}(\xi,\eta) &= \sum_{j\in\mathbb{Z}}\sum_{k\in\mathbb{Z}} a_{2j+1,2k}\, e^{-i(j\xi+k\eta)}, \\
m_{0,2}(\xi,\eta) &= \sum_{j\in\mathbb{Z}}\sum_{k\in\mathbb{Z}} a_{2j,2k+1}\, e^{-i(j\xi+k\eta)}, \\
m_{0,3}(\xi,\eta) &= \sum_{j\in\mathbb{Z}}\sum_{k\in\mathbb{Z}} a_{2j+1,2k+1}\, e^{-i(j\xi+k\eta)}.
\end{aligned}$$

Reversing the process, we can construct the bivariate trigonometric polynomial $m_0$ from its polyphase factors $m_{0,l}$, $l = 0, 1, 2, 3$, by the formula

$$m_0(\xi,\eta) = m_{0,0}(2\xi,2\eta) + e^{-i\xi}\, m_{0,1}(2\xi,2\eta) + e^{-i\eta}\, m_{0,2}(2\xi,2\eta) + e^{-i(\xi+\eta)}\, m_{0,3}(2\xi,2\eta),$$

where $(\xi,\eta) \in \mathbb{R}^2$.
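The polyphase split and its inverse can be sketched on a finite set of filter coefficients. The representation (a dictionary from index pairs to coefficients), the function names, and the sample values are all hypothetical; only the even/odd index rule and the recombination formula come from the text.

```python
import cmath


def polyphase_split(coeffs):
    """Split {(j, k): coefficient} into the four polyphase factors.

    Factor 0 takes even/even indices, factor 1 odd/even,
    factor 2 even/odd, and factor 3 odd/odd.
    """
    parts = [{}, {}, {}, {}]
    for (j, k), value in coeffs.items():
        which = (j % 2) + 2 * (k % 2)
        # Floor division keeps j == 2 * (j // 2) + (j % 2) for negative j too.
        parts[which][(j // 2, k // 2)] = value
    return parts


def evaluate(coeffs, xi, eta):
    """Value of the trigonometric polynomial with the given coefficients."""
    return sum(value * cmath.exp(-1j * (j * xi + k * eta))
               for (j, k), value in coeffs.items())


def recombine(parts, xi, eta):
    """Rebuild the polynomial value from its polyphase factors."""
    shifts = [1,
              cmath.exp(-1j * xi),
              cmath.exp(-1j * eta),
              cmath.exp(-1j * (xi + eta))]
    return sum(shift * evaluate(part, 2 * xi, 2 * eta)
               for shift, part in zip(shifts, parts))


coeffs = {(0, 0): 0.5, (1, 0): -0.25, (0, 1): 2.0, (3, 2): 1.5, (-1, 1): 0.75}
parts = polyphase_split(coeffs)
difference = recombine(parts, 0.4, -1.7) - evaluate(coeffs, 0.4, -1.7)
print(abs(difference) < 1e-12)  # True
```

Each coefficient lands in exactly one factor, so the split loses nothing; evaluating the factors at doubled frequencies and applying the four unit shifts restores the original polynomial.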

The construction of multivariate compactly supported orthonormal multiwavelets using MRA is equivalent to the design of orthogonal FIR and QMF filter banks, which leads to the following two questions.

(i) Find a low-pass filter $m_0(\xi,\eta)$ satisfying the orthogonality condition

$$|m_0(\xi,\eta)|^2 + |m_0(\xi+\pi,\eta)|^2 + |m_0(\xi,\eta+\pi)|^2 + |m_0(\xi+\pi,\eta+\pi)|^2 = 1, \qquad (\xi,\eta) \in \mathbb{R}^2;$$

(ii) Find three high-pass filters $m_1, m_2, m_3$ such that the matrix $M = (\alpha_0, \alpha_1, \beta_0, \beta_1)^{T}$ is unitary, where $\alpha_i$ and $\beta_i$ are the row vectors

$$\alpha_i = \big(m_0(\xi+i\pi,\eta), \ldots, m_3(\xi+i\pi,\eta)\big), \qquad \beta_i = \big(m_0(\xi+i\pi,\eta+\pi), \ldots, m_3(\xi+i\pi,\eta+\pi)\big), \qquad i = 0, 1.$$

Given orthogonal filter banks $m_l$, $l = 0, 1, 2, 3$, one can use the pyramid algorithm to decompose and reconstruct an image. Viewing conditions (i) and (ii) from the polyphase point of view helps in understanding them.

Problems (i) and (ii) are both nonlinear: they are essentially systems of quadratic algebraic equations in several variables. No general solution to these problems is currently known. We now offer a class of solutions of (i) and (ii) starting from centrally symmetric matrices.

Let

$$V_0 = (1, 1, 1, 1)^{T}, \quad V_1 = (1, -1, 1, -1)^{T}, \quad V_2 = (1, 1, -1, -1)^{T}, \quad V_3 = (1, -1, -1, 1)^{T} \tag{8.8}$$

and denote by $D(\xi,\eta)$ the matrix of trigonometric polynomials

$$D(\xi,\eta) = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & e^{-i\xi} & 0 & 0 \\ 0 & 0 & e^{-i\eta} & 0 \\ 0 & 0 & 0 & e^{-i(\xi+\eta)} \end{pmatrix}, \qquad (\xi,\eta) \in \mathbb{R}^2.$$

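For any fixed positive integer $N$ and arbitrarily chosen real number pairs $(\alpha_k, \beta_k)$, $k = 1, 2, \ldots, N$ (for $k \neq j$, $(\alpha_k, \beta_k)$ may equal $(\alpha_j, \beta_j)$), the low-pass filter $m_0(\xi,\eta)$ is defined as follows:

$$m_0(\xi,\eta) = \frac{1}{4}\left(1, e^{-i\xi}, e^{-i\eta}, e^{-i(\xi+\eta)}\right)\left[\prod_{k=1}^{N} Q_{(\alpha_k,\beta_k)}\, D(2\xi,2\eta)\, Q^{T}_{(\alpha_k,\beta_k)}\right] V_0, \tag{8.9}$$

where $Q_{(\alpha_k,\beta_k)}$ is the centrally symmetric orthogonal matrix defined previously, and $(\xi,\eta) \in \mathbb{R}^2$. Since $D(0,0)$ is the identity and each $Q_{(\alpha_k,\beta_k)}$ is orthogonal, it is easy to see that $m_0(0,0) = \frac{1}{4}(1,1,1,1)V_0 = 1$, which means that $m_0$ is a low-pass filter.

Correspondingly, three high-pass filters $m_j$, $j = 1, 2, 3$, with respect to the above low-pass filter $m_0(\xi,\eta)$ are defined as follows:

$$\begin{aligned}
m_1(\xi,\eta) &= \frac{1}{4}\left(1, e^{-i\xi}, e^{-i\eta}, e^{-i(\xi+\eta)}\right)\left[\prod_{k=1}^{N} Q_{(\alpha_k,\beta_k)}\, D(2\xi,2\eta)\, Q^{T}_{(\alpha_k,\beta_k)}\right] V_1, \\
m_2(\xi,\eta) &= \frac{1}{4}\left(1, e^{-i\xi}, e^{-i\eta}, e^{-i(\xi+\eta)}\right)\left[\prod_{k=1}^{N} Q_{(\alpha_k,\beta_k)}\, D(2\xi,2\eta)\, Q^{T}_{(\alpha_k,\beta_k)}\right] V_2, \\
m_3(\xi,\eta) &= \frac{1}{4}\left(1, e^{-i\xi}, e^{-i\eta}, e^{-i(\xi+\eta)}\right)\left[\prod_{k=1}^{N} Q_{(\alpha_k,\beta_k)}\, D(2\xi,2\eta)\, Q^{T}_{(\alpha_k,\beta_k)}\right] V_3,
\end{aligned}$$

with $V_j$ defined in (8.8), where $j = 1, 2, 3$ and $(\xi,\eta) \in \mathbb{R}^2$. Because $V_j$ is orthogonal to $(1,1,1,1)^{T}$, it is easy to check that $m_j(0,0) = 0$, $j = 1, 2, 3$. That is to say, $m_j$, $j = 1, 2, 3$, are high-pass filters.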
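The definitions above can be evaluated numerically to check the low-pass and high-pass claims and the orthogonality condition (i). This is a sketch under stated assumptions: it reads (8.9) as one quarter of the row vector, times the product of the factors Q D(2ξ, 2η) Qᵀ taken in the order k = 1, ..., N, times the vector from (8.8), and it uses the closed form (8.7) for Q. All function names and test angles are hypothetical.

```python
import cmath
import math

V = [[1, 1, 1, 1], [1, -1, 1, -1], [1, 1, -1, -1], [1, -1, -1, 1]]


def q_matrix(alpha, beta):
    """Centrally symmetric orthogonal matrix, closed form (8.7)."""
    ca, sa = math.cos(alpha), math.sin(alpha)
    cb, sb = math.cos(beta), math.sin(beta)
    rows = [[ca + cb, -sa + sb, -sa - sb, ca - cb],
            [sa - sb, ca + cb, ca - cb, sa + sb],
            [sa + sb, ca - cb, ca + cb, sa - sb],
            [ca - cb, -sa - sb, -sa + sb, ca + cb]]
    return [[0.5 * v for v in row] for row in rows]


def matvec(m, v):
    return [sum(m[i][k] * v[k] for k in range(4)) for i in range(4)]


def filter_value(index, angles, xi, eta):
    """Value of filter number `index` (0 = low-pass) at frequency (xi, eta)."""
    diag = [1,
            cmath.exp(-2j * xi),
            cmath.exp(-2j * eta),
            cmath.exp(-2j * (xi + eta))]          # diagonal of D(2*xi, 2*eta)
    vec = [complex(v) for v in V[index]]
    for alpha, beta in reversed(angles):           # rightmost factor acts first
        q = q_matrix(alpha, beta)
        vec = matvec([list(col) for col in zip(*q)], vec)   # Q^T
        vec = [d * v for d, v in zip(diag, vec)]            # D
        vec = matvec(q, vec)                                # Q
    row = [1,
           cmath.exp(-1j * xi),
           cmath.exp(-1j * eta),
           cmath.exp(-1j * (xi + eta))]
    return sum(r * v for r, v in zip(row, vec)) / 4


def shifted_energy(angles, xi, eta):
    """Left-hand side of condition (i) for the low-pass filter."""
    shifts = [(0, 0), (math.pi, 0), (0, math.pi), (math.pi, math.pi)]
    return sum(abs(filter_value(0, angles, xi + a, eta + b)) ** 2
               for a, b in shifts)


angles = [(0.3, 1.1), (2.0, -0.5)]
print(abs(filter_value(0, angles, 0.0, 0.0) - 1) < 1e-12)   # low-pass at the origin
print(abs(shifted_energy(angles, 0.7, -1.3) - 1) < 1e-12)   # condition (i)
```

The checks pass for arbitrary angle pairs because every factor in the product is unitary; only the choice of the final vector distinguishes the low-pass filter from the three high-pass filters.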

8.2.3. Examples

We now provide two important concrete examples of filter banks obtained by our approach.

Example 1. Using the previous matrix

$$Q_{(0,\frac{\pi}{2})} = \frac{1}{2}\begin{pmatrix} 1 & 1 & -1 & 1 \\ -1 & 1 & 1 & 1 \\ 1 & 1 & 1 & -1 \\ 1 & -1 & 1 & 1 \end{pmatrix}$$

and setting $N = 1$, we get the following filter bank:

$$\begin{aligned}
m_0(x,y) &= \tfrac{1}{8}\big(-x + y + 1 + x^2 + x^3 - y^2 + xy - x^3y + xy^2 + x^2y^2 + xy^3 + x^3y^3 + x^2y - x^2y^3 + x^3y^2 + y^3\big), \\
m_1(x,y) &= \tfrac{1}{8}\big(-x + y + 1 + x^2 + x^3 + y^2 + xy - x^3y - xy^2 - x^2y^2 - xy^3 - x^3y^3 + x^2y + x^2y^3 - x^3y^2 - y^3\big), \\
m_2(x,y) &= \tfrac{1}{8}\big(x - y + x^2 + x^3 + y^2 - xy - 1 - x^3y - xy^2 + x^2y^2 - xy^3 + x^3y^3 + x^2y - x^2y^3 + x^3y^2 - y^3\big), \\
m_3(x,y) &= \tfrac{1}{8}\big(-x + y + 1 - x^2 - x^3 + y^2 + xy + x^3y - xy^2 + x^2y^2 - xy^3 + x^3y^3 - x^2y - x^2y^3 + x^3y^2 - y^3\big),
\end{aligned}$$

with $x = e^{-i\xi}$, $y = e^{-i\eta}$.
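Example 1 can be expanded mechanically with exact rational arithmetic. The sketch below assumes the same reading of the construction as in the definitions above: one quarter of (1, x, y, xy), times Q, times the diagonal matrix diag(1, x², y², x²y²), times the transpose of Q, times the vector from (8.8), with Q taken for α = 0 and β = π/2. Polynomials are stored as dictionaries from exponent pairs to coefficients; this representation and all function names are hypothetical.

```python
from fractions import Fraction

# Q for alpha = 0, beta = pi/2 as printed in Example 1 (all entries are +1/2 or -1/2).
Q = [[Fraction(v, 2) for v in row]
     for row in [[1, 1, -1, 1], [-1, 1, 1, 1], [1, 1, 1, -1], [1, -1, 1, 1]]]
V = [[1, 1, 1, 1], [1, -1, 1, -1], [1, 1, -1, -1], [1, -1, -1, 1]]
ROW_SHIFTS = [(0, 0), (1, 0), (0, 1), (1, 1)]    # exponents of 1, x, y, xy
DIAG_SHIFTS = [(0, 0), (2, 0), (0, 2), (2, 2)]   # exponents of 1, x^2, y^2, x^2 y^2


def shifted(poly, shift, scale=1):
    """Multiply {(p, q): coeff} by scale * x**shift[0] * y**shift[1]."""
    return {(p + shift[0], q + shift[1]): scale * c for (p, q), c in poly.items()}


def add(poly_a, poly_b):
    """Sum of two polynomials."""
    total = dict(poly_a)
    for key, c in poly_b.items():
        total[key] = total.get(key, 0) + c
    return total


def matvec(matrix, vec):
    """Matrix of numbers times a vector of polynomials."""
    result = []
    for row in matrix:
        acc = {}
        for entry, poly in zip(row, vec):
            acc = add(acc, shifted(poly, (0, 0), entry))
        result.append(acc)
    return result


def example1_filter(index):
    """Coefficients of filter number `index` (0 = low-pass) for N = 1."""
    vec = [{(0, 0): Fraction(v)} for v in V[index]]
    vec = matvec([list(col) for col in zip(*Q)], vec)         # apply Q^T
    vec = [shifted(p, s) for p, s in zip(vec, DIAG_SHIFTS)]   # apply the diagonal
    vec = matvec(Q, vec)                                      # apply Q
    total = {}
    for poly, shift in zip(vec, ROW_SHIFTS):                  # (1, x, y, xy) / 4
        total = add(total, shifted(poly, shift, Fraction(1, 4)))
    return {key: c for key, c in total.items() if c != 0}


low_pass = example1_filter(0)
print(len(low_pass), sum(low_pass.values()))  # 16 1
```

The low-pass filter has sixteen coefficients of magnitude 1/8 that sum to 1, while the coefficients of each high-pass filter sum to 0, in agreement with the values of the filters at the origin.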

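Example 2. Letting $\alpha = -\beta$ and $\alpha = \frac{\pi}{4}$ leads to

$$Q_{(\frac{\pi}{4},-\frac{\pi}{4})} = \frac{\sqrt{2}}{2}\begin{pmatrix} 1 & -1 & 0 & 0 \\ 1 & 1 & 0 & 0 \\ 0 & 0 & 1 & 1 \\ 0 & 0 & -1 & 1 \end{pmatrix}.$$

Further, setting $N = 2$, the matrices $Q_{(0,\frac{\pi}{2})}$ and $Q_{(\frac{\pi}{4},-\frac{\pi}{4})}$ lead to the following filter bank:

$$\begin{aligned}
m_0(x,y) &= \tfrac{1}{4}x^2y^2 + \tfrac{1}{4}x^3y^3 + \tfrac{1}{8}x^2 + \tfrac{1}{8}x^2y - \tfrac{1}{8}x^3y + \tfrac{1}{8}x^3y^4 + \tfrac{1}{8}x^2y^5 + \tfrac{1}{8}x^3 + \tfrac{1}{8}x^3y^5 - \tfrac{1}{8}x^2y^4, \\
m_1(x,y) &= -\tfrac{1}{4}x^3y^3 + \tfrac{1}{4}x^2y^2 + \tfrac{1}{8}x^2 + \tfrac{1}{8}x^2y + \tfrac{1}{8}x^3y + \tfrac{1}{8}x^2y^5 - \tfrac{1}{8}x^3y^4 - \tfrac{1}{8}x^3 - \tfrac{1}{8}x^3y^5 - \tfrac{1}{8}x^2y^4, \\
m_2(x,y) &= -\tfrac{1}{4}x^2y^3 + \tfrac{1}{8}x^2 + \tfrac{1}{8}x^2y + \tfrac{1}{4}x^3y^2 - \tfrac{1}{8}x^3y - \tfrac{1}{8}x^3y^4 - \tfrac{1}{8}x^2y^5 - \tfrac{1}{8}x^3y^5 + \tfrac{1}{8}x^3 + \tfrac{1}{8}x^2y^4, \\
m_3(x,y) &= -\tfrac{1}{4}x^2y^3 + \tfrac{1}{8}x^2 + \tfrac{1}{8}x^2y - \tfrac{1}{4}x^3y^2 + \tfrac{1}{8}x^3y - \tfrac{1}{8}x^2y^5 + \tfrac{1}{8}x^3y^4 + \tfrac{1}{8}x^3y^5 - \tfrac{1}{8}x^3 + \tfrac{1}{8}x^2y^4,
\end{aligned}$$

with $x = e^{-i\xi}$, $y = e^{-i\eta}$ as in Example 1.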

8.3. Experimental Results

Using the proposed facial feature representation, we develop a new nontensor-product-wavelet-based face recognition scheme by combining the techniques of PCA and SVM. To test the robustness of the proposed nontensor-product-wavelet-based facial feature representation, the following experiments are conducted on the ORL face database, which