April 2, 2007 14:42 World Scientific Review Volume - 9in x 6in Main˙WorldSc˙IPR˙SAB
Large-Scale Biometric Identification: Challenges and Solutions 277
should be always 1. However, in real-world systems, the correct identity
occurs in the top K,withsomeK<<M.
11.8. Identification Methods
We describe how a practical identification system operates to see the types
of answers it returns. Typically an identification system will run in one
of three modes of operation, depending on the application for which it is
being used. In each mode, some subject presents a biometric to the system
and that biometric is compared to the biometric samples enrolled in the
database M. In some cases every enrolled sample in the database will be
compared, but in others only some subset of M is compared.
For simplicity we consider the former case, with a database M of M
enrolled subjects, i.e. the 1 : M search problem in its most general case.
The three modes of operation are related to the three primary criteria for
choosing the subset of M:
Threshold-based. This approach is effectively the same as repeating the
operation of 1:1 verification for each person in the database. The
query biometric template B is compared with each of the enrolled
biometrics to obtain a match/non-match decision. This is typically
done by computing the scores s(B; B
m
),m=1,...,M for all enrolled
templates B
m
∈ M and considering as matches all those candidates
with scores exceeding some threshold t
o
. The complete list of all
matching idenfitities is returned. If no candidate matches (e.g. no
score exceeds the threshold), the person is presumed not to be in the
database.
Rank-based. The system always returns some vector of fixed size, K,of
the enrolled identities that best match the presented biometric. This
requires an ability to rank (sort) the items in the database. The ranking
might be based on scores. With K = 1 the system returns a single
candidate corresponding to the person in the database most likely to
be the same as the input query biometric Q. Notethatitisusually
not necessary to rank all the identities in M. Producing a ranked
short-list of the best K items can be accomplished more efficiently
than sorting all M enrollees. However, in the most general (and most
computational complex case) the output vector is just a permutation
or re-ordering or ranking of database vector M. It is a vector whose
meaning depends upon the relation between the input biometric and
the enrolled biometrics.