Likelihood Ratio-Based Biometric Verification

Similar documents
BIOMETRIC verification systems are used to verify the

Biometrics: Introduction and Examples. Raymond Veldhuis

EXTRACTING BIOMETRIC BINARY STRINGS WITH MINIMAL AREA UNDER THE FRR CURVE FOR THE HAMMING DISTANCE CLASSIFIER

On the Individuality of Fingerprints: Models and Methods

4038 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 53, NO. 10, OCTOBER 2005

Biometric Fusion: Does Modeling Correlation Really Matter?

Role of Assembling Invariant Moments and SVM in Fingerprint Recognition

A Minutiae-based Fingerprint Individuality Model

Biometric verification based on grip-pattern recognition

A Contrario Detection of False Matches in Iris Recognition

Fingerprint Individuality

PIN-based cancelable biometrics

Estimation of Relative Operating Characteristics of Text Independent Speaker Verification

Score Normalization in Multimodal Biometric Systems

Predicting large population data cumulative match characteristic performance from small population data

Biometric Security Based on ECG

Likelihood Ratio Based Biometric Score Fusion

Semi-parametric Distributional Models for Biometric Fusion

RAMAKRISHNA LANKA MSEE, UTA ADVISING PROFESSOR: DR. K. R. RAO

When enough is enough: early stopping of biometrics error rate testing

IBM Research Report. The Relation between the ROC Curve and the CMC

If you wish to cite this paper, please use the following reference:

Score calibration for optimal biometric identification

Orientation Map Based Palmprint Recognition

INTERNATIONAL JOURNAL OF ENGINEERING SCIENCES & RESEARCH TECHNOLOGY

STONY BROOK UNIVERSITY. CEAS Technical Report 829

Error Rates. Error vs Threshold. ROC Curve. Biometrics: A Pattern Recognition System. Pattern classification. Biometrics CSE 190 Lecture 3

Research Article Biometric Quantization through Detection Rate Optimized Bit Allocation

Feature selection and classifier performance in computer-aided diagnosis: The effect of finite sample size

EEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1

This is an accepted version of a paper published in Elsevier Information Fusion. If you wish to cite this paper, please use the following reference:

Do We Really Have to Consider Covariance Matrices for Image Feature Points?

Research Article Binary Biometric Representation through Pairwise Adaptive Phase Quantization

I D I A P R E S E A R C H R E P O R T. Samy Bengio a. November submitted for publication

FINGERPRINT ENHANCEMENT Frequency Domain

Relationship between identification metrics: Expected Confusion and Area Under a ROC curve

Citation for published version (APA): Susyanto, N. (2016). Semiparametric copula models for biometric score level fusion

Automated Statistical Recognition of Partial Discharges in Insulation Systems.

WHEN IS A MAXIMAL INVARIANT HYPOTHESIS TEST BETTER THAN THE GLRT? Hyung Soo Kim and Alfred O. Hero

Research Article Multimodal Personal Verification Using Likelihood Ratio for the Match Score Fusion

Automatic Identity Verification Using Face Images

Segmentation of Overlapped Region using Morphological Processing

Detection theory 101 ELEC-E5410 Signal Processing for Communications

University of Cambridge Engineering Part IIB Module 3F3: Signal and Pattern Processing Handout 2:. The Multivariate Gaussian & Decision Boundaries

Dynamic Linear Combination of Two-Class Classifiers

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Two-Layered Face Detection System using Evolutionary Algorithm

On Identification of Cascade Systems 1

Palmprint identification using feature-level fusion

The Hermite Transform: An Alternative Image Representation Model for Iris Recognition

Learning features by contrasting natural images with noise

Multiple Similarities Based Kernel Subspace Learning for Image Classification

Modifying Voice Activity Detection in Low SNR by correction factors

Modeling Multiscale Differential Pixel Statistics

Hyung So0 Kim and Alfred 0. Hero

CFAR TARGET DETECTION IN TREE SCATTERING INTERFERENCE

Announcement. HW4 has been assigned. Finger Print Recognition using Minutiae. Biometrics CSE 190 Lecture 16. CSE190, Winter CSE190, Winter 2011

Palmprint Verification with Moments

Proc. of NCC 2010, Chennai, India

Lecture 16: Small Sample Size Problems (Covariance Estimation) Many thanks to Carlos Thomaz who authored the original version of these slides

Cooperative Spectrum Sensing for Cognitive Radios under Bandwidth Constraints

An Adaptive LQG Combined With the MRAS Based LFFC for Motion Control Systems

Palmprint based Verification System Robust to Occlusion using Low-order Zernike Moments of Sub-images

Feature extraction for one-class classification

Introduction to Statistical Inference

Has this Person Been Encountered Before? : Modeling an Anonymous Identification System

Constrained Optimization and Support Vector Machines

System 1 (last lecture) : limited to rigidly structured shapes. System 2 : recognition of a class of varying shapes. Need to:

Face detection and recognition. Detection Recognition Sally

A Statistical Analysis of Fukunaga Koontz Transform

Analytical Study of Biometrics Normalization and Fusion Techniques For Designing a Multimodal System

Face Recognition Using Eigenfaces

ON BINARY REPRESENTATIONS FOR BIOMETRIC TEMPLATE PROTECTION. Chun Chen

Corners, Blobs & Descriptors. With slides from S. Lazebnik & S. Seitz, D. Lowe, A. Efros

Principal Components Analysis. Sargur Srihari University at Buffalo

Efficient Sensitivity Analysis in Hidden Markov Models

Two-Dimensional Sparse Arrays with Hole-Free Coarray and Reduced Mutual Coupling

Automated Segmentation of Low Light Level Imagery using Poisson MAP- MRF Labelling

Feature Vector Similarity Based on Local Structure

Linear Models for Classification

Designing Information Devices and Systems I Fall 2018 Lecture Notes Note Introduction to Linear Algebra the EECS Way

A Modular NMF Matching Algorithm for Radiation Spectra

Resolution Selection Using Generalized Entropies of Multiresolution Histograms

An Analysis of BioHashing and Its Variants

Object Recognition. Digital Image Processing. Object Recognition. Introduction. Patterns and pattern classes. Pattern vectors (cont.

Template-based Recognition of Static Sitting Postures

A Novel PCA-Based Bayes Classifier and Face Analysis

Real Time Face Detection and Recognition using Haar - Based Cascade Classifier and Principal Component Analysis

Application of hopfield network in improvement of fingerprint recognition process Mahmoud Alborzi 1, Abbas Toloie- Eshlaghy 1 and Dena Bazazian 2

LECTURE NOTE #3 PROF. ALAN YUILLE

Topic 17: Simple Hypotheses

Detection of Artificial Satellites in Images Acquired in Track Rate Mode.

Adaptive Binary Integration CFAR Processing for Secondary Surveillance Radar *

Invariant Feature Extraction from Fingerprint Biometric Using Pseudo Zernike Moments

HST 583 FUNCTIONAL MAGNETIC RESONANCE IMAGING DATA ANALYSIS AND ACQUISITION A REVIEW OF STATISTICS FOR FMRI DATA ANALYSIS

A Modified Incremental Principal Component Analysis for On-line Learning of Feature Space and Classifier

Virtual Array Processing for Active Radar and Sonar Sensing

Growing Window Recursive Quadratic Optimization with Variable Regularization

A Source Cell-phone Identification Scheme Based on Canonical Correlation Analysis of Photo Response Non-uniformity

Transcription:

Likelihood Ratio-Based Biometric Verification Asker M. Bazen and Raymond N.J. Veldhuis University of Twente, Department of Electrical Engineering, Laboratory of Signals and Systems, P.O. box 27, 75 AE Enschede, The Netherlands Phone: +3 53 489 356 Fax: +3 53 489 6 E-mail: a.m.bazen@el.utwente.nl Abstract This paper presents results on optimal similarity measures for biometric verification based on fixed-length feature vectors. First, we show that the verification of a single user is equivalent to the detection problem, which implies that for single-user verification the likelihood ratio is optimal. Second, we show that under some general conditions, decisions based on posterior probabilities and likelihood ratios are equivalent, and result in the same ROC. However, in a multi-user situation, these two methods lead to different average error rates. As a third result, we prove theoretically that, for multi-user verification, the use of the likelihood ratio is optimal in terms of average error rates. The superiority of this method is illustrated by experiments in fingerprint verification. It is shown that error rates of approximately 4 can be achieved when using multiple fingerprints for template construction. Keywords Fingerprint recognition, fixed-length feature vector, posterior probability density, likelihood ratio. I. INTRODUCTION Biometric verification systems are used to verify the claimed identity of a user by measuring specific characteristics of the body, such as fingerprints, hand geometry, irises, or pressure signals. The verification system calculates the similarity between the measured characteristic and a template corresponding to the claimed identity. If the similarity is larger than an acceptance threshold, the user is accepted. Otherwise, the user is rejected. Most fingerprint matching systems use minutiae-based algorithms [], [2], which are in general considered as most reliable. However, comparing two sets of minutiae is not a straightforward task. First, the number of minutiae that are extracted depends on the actual fingerprint. Second, it is not known beforehand which minutia in the template set corresponds to which one in the test set. Third, even if the first two problems are solved, the minutiae locations and associated characteristics cannot be compared directly due to translation, rotation, scaling, etc. of the fingerprints. Instead, comparing two sets of minutiae requires special point-pattern matching algorithms. In contrast, this paper presents a fingerprint matching algorithm that uses fixed-length feature vectors, consisting of a number of measurements that are performed at some specific, fixed, locations in the fingerprint. The advantage of this approach is that, once the features are extracted, the matching is very fast, which enables the search for a matching fingerprint in a large database. Given a test feature vector v that is obtained from a user requesting access to a biometric system, and a class w that represents the users claimed identity represented by a template feature vector), the task of a biometric verification system is to decide whether the offered feature vector can be accepted as a member of the given class or not. For this purpose, the system determines a measure that represents the similarity between the test and the template measurements, and the user is granted access to the system if the similarity measure exceeds a certain threshold. The subject of this paper is the comparison of different similarity measures that can be used. e present results on optimal similarity measures for general biometric verification based on fixed-length feature vectors. Various similarity measures for fixed-length feature vectors have been proposed in the literature. Here, we give an overview of the three most widely used measures, being Euclidean distance, posterior probabilities and likelihood ratios. In [3], FingerCode is used as feature vector for fingerprint verification. This feature vector contains the standard deviation of the responses of Gabor filters in specified locations in the fingerprint. For comparison of these feature vectors, the Euclidean distance is used. For this reason this method treats all elements of the feature vector as equally important and uncorrelated. Although this is not a realistic assumption, the authors present experiments with relatively good recognition performance. In [4], biometric verification systems that are based on hand geometry and face recognition are presented. In that paper, it is claimed that decisions that are based on the posterior probability densities are optimal, where optimality means minimal error rates as defined in Section III. The posterior probability density of class w given observed feature vector v is given by: 8

pw v) = pv w) pw) pv) where pv w) is the probability density of the feature vectors given class w, pw) is the probability density of the class w, and pv) is the prior probability density of the feature vectors. The feature vector v is accepted as member of the template class if its posterior probability density exceeds a threshold t [, t max ]. On the other hand, in detection theory it is known since long that the use of likelihood ratios is asymptotically optimal [5]. In detection, a given feature vector has to be classified as originating from a predefined situation the presence of some object to be detected) or not. Since the detection problem is in some sense equivalent to the verification problem that is considered here, it is to be expected that using likelihood ratios for biometric verification is optimal as well. The likelihood ratio Lv) is given by: Lv) = pv w) pv w) where pv w) is the probability of v, given v is not a member of class w. Since we assume infinitely many classes, exclusion of a single class w does not change the distribution of the feature vector v. Therefore, the distribution of v, given v is not a member of w, equals the prior distribution of v: and the likelihood ratio is given by ) 2) pv w) = pv) 3) Lv) = pv w) pv) In this framework, a test feature vector v is accepted as member of the template class if its likelihood ratio exceeds a threshold t [,. The acceptance region A t,w and rejection region R t,w can be defined in the feature space v: 4) A t,w = {v v Lv) t} 5) R t,w = {v v Lv) < t} 6) The probability density functions in Expressions 4 and are in practice usually modelled by multidimensional Gaussian distributions. More details of that case are given in Appendix A. This paper focusses on likelihood ratio-based biometric verification, and on the differences in verification performance between posterior probability-based and likelihood ratio-based biometric systems. In Section II, the general expressions for biometric system errors are derived. Next, the optimality of likelihood ratio-based decisions is proved in Section III. Then, Section IV presents experimental results on fingerprint verification, confirming that using likelihood ratios instead of posterior probability densities decreases the error rates, even though the feature vectors are the same for both decision methods. II. BIOMETRIC SYSTEM ERRORS In this section, we derive expressions for the system errors, using likelihood ratio-based decisions, as a function of the probability density functions of the feature vectors. A. False rejection rate The false rejection rate FRR) measures the probability that a feature vector is rejected as a member of some class, although it does originate from that class. For a specific class w and a given threshold t, FRRt, w) is given by: FRRt, w) = Pv R t,w v w) = pv w)dv 7) R t,w Since A t,w + R t,w = v, this can also be written as: FRRt, w) = pv w)dv 8) A t,w The average) overall false rejection rate FRRt) is found by integrating over all classes: FRRt) = FRRt, w) pw)dw 9) where is the space of all classes. The summation over all discrete) classes is represented by a continuous) integral to indicate the infinite number of classes. B. False acceptance rate The false acceptance rate FAR) measures the probability that a feature vector is accepted as a member of some class, although it does not originate from that class. For a specific class w and a given a threshold t, FARt, w) is given by: FARt, w) = Pv A t,w v w) = pv)dv ) A t,w where again pv w) = pv) is used. Then, the average) global false acceptance rate FARt) is found by integrating over all classes. FARt) = FARt, w) pw)dw ) 82

C. Receiver operating curve The dependence of both error rates on the threshold can be visualized in a plot of FRR against FAR for varying threshold values, which is called the receiver operating curve ROC). Using a procedure similar to the work in [4] on posterior probabilities, an expression that describes the trade off between FAR and FRR for a likelihood ratiobased verification system can be derived: dfrrt, w) dfart, w) = t 2) The same expression has been derived in [5] using other methods. III. OPTIMALITY OF LIKELIHOOD RATIOS In this section, we prove that, for verification based on fixed-length feature vectors, the use of the likelihood ratio is optimal in terms of average overall error rates. In this context, optimality is defined as the lowest FAR for a given FRR, or alternatively the lowest FRR for a given FAR. First, we consider the less complex case of singleuser verification, where the system has to decide whether or not an input feature vector originates from the only user that is known to the system. A. Single-User Verification For single-user verification, there is one fixed distribution pv w) of feature vectors from the genuine user and one fixed distribution pv) of feature vectors from impostors. This situation is equivalent to the detection problem, which implies that the likelihood ratio is optimal for single-user verification [5]. The relation between the likelihood ratio and the posterior probability, which is derived from Expressions and 4, is given by: pw v) = pw) Lv) 3) Since there is only one user in the system, pw) is a constant, and both methods provide the same error rates if the thresholds t p for posterior probability density and t L for likelihood ratio are set to t p = pw) t L. This means that for single-user verification, both methods provide the same ROC, with a different threshold parameterization along the curve. This is shown in Figure, for instance by observing the leftmost curve, with the associated threshold values for both methods. B. Multi-User Verification In a multi-user situation, the two methods lead to different average error rates. The difference between averag- Fig.. Averaging ROCs of classes with different occurrence probabilities. ing with the two methods is illustrated in Figure, which shows three ROCs for frequently, average and infrequently occurring classes in a synthetic data set. At some specific positions at the ROCs, the threshold values for both methods are indicated. It can be seen that combining individual ROCs with a constant posterior probability threshold t p will take the average of different points on the individual ROCs than combining individual ROCs with a constant likelihood ratio threshold t L. The question that is answered in the rest of this section is which of all possible averaging paths will lead to the minimum overall error rates. It can be explained intuitively that the use of likelihood ratios will lead to better overall matching performance than the use of posterior probabilities. A fixed posterior probability threshold requires equal similarity values for feature vectors from each individual, under the condition that the distribution of the feature vectors from the entire population is much wider than the distribution within one class. This means that a feature vector is relatively close to its associated class center, such that pv) pw) see Expression ). This condition is easily satisfied in practice. The effect is an equal FRR and a lower FAR for less frequently occurring classes. On the other hand, a fixed likelihood ratio threshold requires lower similarity values for less frequently occurring feature vectors see Expression 3). As a consequence, the acceptance region for less frequently occurring classes is larger, which has two effects. The reduction in FAR is smaller, but at the same time, FRR is reduced. The overall recognition performance can be optimized by choosing the right trade-off between these two effects. 83

Next, we prove that using likelihood ratios in multi-user verification is optimal. First, define ϕl w) as the probability density function of the likelihood ratio of an observation vector v that is taken from the true class w. Also, define ϕl w) as the probability density function of the likelihood ratio of an observation vector v that is not taken from the true class w. For these probability density functions, the following well-known relation holds [5]: ϕl w) = L ϕl w) 4) The error rates FRR and FAR for class w, as a function of the threshold t, are given by and FRRt, w) = FARt, w) = t t ϕl w)d L 5) ϕl w)d L 6) Next, we find expressions for the average FRRt.)) and FARt.)) with a class-dependent threshold tw) by integrating over all classes: FRRt.)) = FARt.)) = pw) tw) ϕl w)d L dw 7) pw) ϕl w)d L dw 8) tw) For optimal verification performance, the question is how to choose the threshold t as a function of w, such that the resulting ROC is minimal. This is solved by Lagrange optimization, see for instance [6]. The objective is to minimize FRR, subject to the condition of a constant FAR. The threshold is chosen as t = t opt w) + ε f w), where t opt w) is the optimal threshold, f w) is some function of w, ε is a small constant, and some specific value for FAR is chosen as additional condition. Then: J = + λ [ pw) FARt) topt w)+ε f w) pw) ] t opt w)+ε f w) ϕl w)d L dw ϕl w)d L dw 9) has to be minimized by setting the derivative with respect to ε to zero: λ pw)ϕt opt w) + ε f w) w) f w)dw pw)ϕt opt w) + ε f w) w) f w)dw = 2) By realizing that this expression must hold for any f w), the integrals over all w can be omitted. Furthermore, since t opt is optimal, ε is equal to zero, which further simplifies the expression to: ϕt opt w) w) λϕt opt w) w) = 2) Applying Expression 4 results in: t opt w)ϕt opt w) w) λϕt opt w) w) = 22) which, by dividing both sides by ϕt opt w) w) and rearranging the expression, gives: t opt w) = λ 23) Since λ is a constant, the optimal threshold t opt w) is constant too, independent of w. Therefore, using a constant likelihood ratio threshold when averaging over the classes gives the optimal verification results. IV. EXPERIMENTAL RESULTS In this section, results of fingerprint matching experiments are presented. The proposed similarity measures, being Euclidean distance, posterior probabilities and likelihood ratios, have been evaluated by applying them to Database 2 of FVC2 [7]. The FVC2 database consists of 88 8-bit gray-scale fingerprints, 8 prints of each of different fingers. The images are captured with a capacitive sensor at 5 dpi, resulting in image sizes of 364 by 256 pixels. A. Feature Vectors e use two types of feature vectors that are extracted from the gray scale fingerprint images. The first feature vector is the squared directional field that is defined in [8], which is calculated at a regular grid of by points with spacings of 8 pixels and is centered at the core point see [8]). At each position in the grid, the squared directional field is coded in a vector of two elements. The resulting feature vector of length 242 is reduced to dimension by principal component analysis over the entire population. The directional field of an example fingerprint is shown in Figure 2. 84

entire population. This feature vector is inspired by FingerCode [3], but it can be calculated more efficiently since a rectangular grid is used rather than a circular one see also [], [2]), and it performs slightly better. a) fingerprint b) directional field Fig. 2. Fingerprint image and its directional field. For approximately % of the fingerprints, the automatic core point extraction failed and for those fingerprints, the location of the core point was adjusted manually. The automatic core point extraction errors could be resolved by two related methods. First, feature vectors could be extracted at many regularly spaced locations from the fingerprint. That one feature vector that results in the highest matching score is used. This solution is inspired by the feature space correlation method that is described in [9]. Second, feature vectors could be extracted at each location where a possibly false) core is detected. Again the best matching feature vector is used. This would save a lot of processing time compared to the first method. The second feature vector is the Gabor response of the fingerprint, which is discussed in []. After substraction of the local mean, the fingerprint image is filtered by a set of four complex Gabor filters, which are given by: h Gabor x, y) = exp x 2 + y 2 ) 2σ 2 24) exp j2π f x sin θ + y cos θ)) The orientations θ are set to, π/4, π/2, and 3π/4, the spatial frequency is set to f =.25, which corresponds to a ridge-valley period of 8 pixels, and the width of the filter is set to σ = 3. The absolute values of the output images are taken, which are subsequently filtered by a Gaussian window with σ = 6. The resulting images are shown in Figure 3. Next, samples are taken at a regular grid of by points with spacings of 8 pixels and centered at the core point. The resulting feature vector of length 484 is reduced to dimension 2 by principal component analysis of the B. Matching Experiments To enable calculation of the posterior probability density and likelihood ratio, we assume Gaussian probability density functions with unequal means but equal covariance matrices for the feature vectors from all individual classes. This within-class covariance matrix represents the differences between multiple prints of the same finger, like noise, partial impressions, and elastic deformations. Another Gaussian probability density function with between-class covariance matrix B is assumed for the feature vectors of the entire population, representing the differences between individual fingerprints. For both feature vectors, the within-class and between-class covariance matrices have been determined from the fingerprints in our database. Then the matching scores of 38 genuine attempts and 5995 impostor attempts have been recorded. The use of equal within-class covariance matrices for all users is motivated by the fact that in a biometric system in practice, only one or a few feature vectors are available as template. Therefore, no user dependent covariance matrix can be determined reliably, and the best approximation possible is to use the average covariance matrix for all users. There are a few motivations for using Gaussian distributions for the feature vectors. In general, measurements of natural features tend to a Gaussian distribution. Furthermore, as the dimension of the feature vectors is reduced by principal component analysis, the feature vector elements are weighted sums of measured features, which approximate a Gaussian distribution even better, as dictated by the central limit theorem. Appendix A provides expressions for the verification errors for Gaussian distributed feature vectors. The resulting average ROCs of the matching experiment, using Euclidean distance, posterior probability and likelihood ratios, are shown in Figures 4 and 5. Both the directional field and the Gabor response perform at an equal error rate EER) of approximately 5% when using likelihood ratios. The figures show that the use of posterior probabilities results in a higher EER of 8%, while the Euclidean distance performs at an even worse EER of 5%. Figure 5 shows that posterior probabilities perform better than likelihood ratios for FAR > 2%. For that setting, the acceptance region is very large, and the criterion pv) pw) is no longer satisfied. Combined with possibly unequal intra class covariance matrices, this might 85

a) π b) π/4 c) π/2 d) 3π/4 Fig. 3. Smoothed absolute value of the fingerprint image of Figure 2a) that has been filtered by Gabor filters with different orientations. FRR 2 FRR 2 Results of the directional field based matching algo- Fig. 4. rithm. 3 Euclidean distance Posterior Probability Likelihood Ratio 4 4 3 2 FAR 3 Euclidean distance Posterior Probability Likelihood Ratio 4 4 3 2 FAR Fig. 6. Results of the matching algorithm that uses both the directional field and the Gabor response as features. FRR Results of the Gabor response based matching algo- Fig. 5. rithm. 2 3 Euclidean distance Posterior Probability Likelihood Ratio 4 4 3 2 FAR explain the results. Next, a new feature vector has been constructed by concatenating the directional field and Gabor response into one large feature vector of dimension 3, and new inter class and intra class covariance matrices have been determined. For this combined feature vector, the performance differences are even more significant. Likelihood ratios perform at EER =.5%, posterior probability at EER = 7% and Euclidean distance at EER = 2%, as shown in Figure 6. The EER of.5% is a remarkably good result, equal to the best participant to FVC2. The algorithm takes less than ms, compared to approximately ms for a fast minutiae matching algorithm see for instance []). Using appropriate preprocessing, consecutive matches of a single test fingerprint to a database of templates take less than 25 µs per template, which enables 86

FRR 2 3 Euclidean distance Posterior Probability Likelihood Ratio 4 4 3 2 FAR Fig. 7. Results of the matching algorithm that is based on both features and uses two fingerprints for template construction. TABLE I MATCHING PERFORMANCE USING MULTIPLE FINGERPRINTS FOR TEMPLATE CONSTRUCTION. # templates 2 3 4 5 EER.5%.7%.3%.% % the search through a database that contains 4, entries in second. For a biometric system in practice, the feature vectors that correspond to the true classes centers are not known. Instead, only one example of the feature vector is available as template. This means that all elements of the intra class covariance matrix are twice as large, compared to the situation with known class centers. As a result, the acceptance region has to be 2 d/2 times as large with d the dimension of the feature vector) for a specific FRR. If the inter class feature vector distribution is much wider than the intra class distribution, the impostor distribution can be assumed constant for thresholds that correspond to a relatively high likelihood ratio. Consequently, FAR is also 2 d/2 times as large as it would be with known class centers. This is confirmed in experiments with low dimensional synthetic data sets. The combined feature vector has dimension d = 3. This means that, at least theoretically, FAR can be reduced by a factor of 2 3/2 45 when using class centers as templates. To validate this effect, experiments have been set up where multiple fingerprints have been used for construction of the template. The template is taken as the average of multiple individual feature vectors, while only one of the remaining fingerprint is used as test vector. For this experiment, 66 genuine attempts and 5995 impostor attempts have been carried out. The ROCs for the three similarity measures, using 2 fingerprints to construct the template, are shown in Figure 7, and the equal error rates for likelihood ratios with to 5 fingerprints for template construction are shown in Table I. The table shows that the performance gain in practice is not as large as it is in theory. But still, the matching performance can be increased enormously by using multiple fingerprints for template construction. Two more remarks have to be made on this subject. First, the performance is evaluated in a database of 88 fingerprints, using approximately 6 genuine and 6 impostor attempts. In this evaluation set, error rates smaller than.% cannot be estimated reliably. Therefore, the % in Table I does not mean that we have implemented the perfect biometric system, but only that it made no errors on our database. Second, the practical performance gain of using multiple feature vectors for template construction is smaller than the theoretic gain since the inter class covariance matrix is not much wider than the intra class covariance matrix for most of the elements of the feature vector. Therefore, the assumption of a constant pv) is not true in practice, and the performance gain is smaller than predicted. C. Unknown Versus Known Class Centers V. CONCLUSIONS In this paper, we have shown that the verification of a single user is equivalent to the detection problem, which implies that, for single-user verification, the likelihood ratio is optimal. e have also shown that, in single-user verification, decisions based on posterior probability and likelihood ratio are equivalent, and result in the same ROC. However, in a multi-user situation, the two methods lead to different average error rates. As a third result, we have proven theoretically that, for multi-user verification, the use of the likelihood ratio is optimal in terms of average error rates. The superiority of the likelihood based similarity measure is illustrated by experiments in fingerprint verification. It is shown that error rates of approximately 4 can be achieved when using multiple fingerprints for template construction. Since the algorithm is extremely fast, it can be used to search through large fingerprint databases. For automatic application of the algorithm, improvements have to be made in the automatic extraction of the core point. This could be circumvented by trying all detected cores, but that would slow down a database search. 87

APPENDIX I. GAUSSIAN APPROXIMATION OF ERROR RATES Assume that the different observed feature vectors v that are generated by class w, have a multi-dimensional Gaussian distribution with dimension d, mean c and withinclass covariance matrix, independent of the class. Then, the conditional probability density pv w) is given by pv w) = nc, ) 25) = 2π) d/2 /2 exp ) 2 v c)t v c) 26) Furthermore, assume that the centers c of classes w have a Gaussian distribution with zero mean which can be guaranteed by subtraction of the mean) and between-class covariance matrix B. Then, pc) is given by: pc) = n, B ) 27) = 2π) d/2 B /2 exp ) 2 ct B c 28) The prior probability density function pv) of all patterns is given by: pv) = = v pv w)pw) dw nc, ) n, ) dc 29) which has a total covariance matrix T = + B. Under the condition that the distribution over the entire population is much wider than the distribution within one class, pv) is equal to pc), given by: B 3) pv) = n, B ) 3) = 2π) d/2 B /2 exp ) 2 vt B v 32) Using the acceptance condition Lv) t, and Expression 4 for Lv), an expression for the acceptance region A t,w can be derived: pv w) t pv) 33) By substituting the Gaussian probability density functions 26 and 32, we obtain: exp /2 2π) d/2 t exp 2π) d/2 /2 B or 2 v c)t v c) ) ) 2 vt B v 34) exp 2 v c)t v c) + ) 2 vt B v ) /2 35) t B By taking the natural logarithm and multiplying both sides by -2, we obtain: v c) T v c) vt B v log t 2 ) B and the acceptance region is given by: with 36) A t,w = {v v v c) T v c) β2 t)} 37) β 2 t) = log t 2 ) + v T B B v 38) For threshold values that correspond to a relatively high likelihood ratio, v is relatively close to c, and v can be replaced with c in Expression 38. Then, β is independent of v, and A t,w is an ellipsoid region with a Mahalanobis distance less than β from the class center c. Now, the error rates can be calculated by substituting Expression 37 into 8 and. To transform the multidimensional integrals into one-dimensional integrals, we use the volume Volr, ) of a ellipsoid that is defined by covariance matrix and Mahalanobis distance r, which is given by [4]: Volr, ) = V unit /2 r d 39) with V unit a constant that depends on the dimension d of the space 88

Using π /2 V unit = d/2)! 2 d π d )/2 d )/2 d! d even d odd 4) d Volr, ) = V unit /2 d s d ds 4) and Expression 8, the false rejection rate is given by: βt) FRRt, w) = = βt) pv w)d Volr, ) 2π) d/2 /2 exp s2 /2) V unit /2 ds d ds 42) = V unit d βt) 2π) exp s 2 /2) s d ds d/2 [8] A.M. Bazen and S.H. Gerez, Systematic methods for the computation of the directional field and singular points of fingerprints, IEEE Trans. PAMI, vol. 24, no. 7, pp. 95 99, July 22. [9] A. Ross, J. Reisman, and A.K. Jain, Fingerprint matching using feature space correlation, in Biometric Authentication, Int. ECCV 22 orkshop, M. Tistarelli, J. Bigun, and A.K. Jain, Eds., Copenhagen, Denmark, June 22, vol. 2359 of LNCS, pp. 48 57, Springer-Verlag. [] A.M. Bazen, Fingerprint Identification - Feature Extraction, Matching, and Database Search, Ph.D. thesis, University of Twente, Enschede, The Netherlands, Sept. 22. [] A.K. Jain, A. Ross, and S. Prabhakar, Fingerprint matching using minutiae and texture features, in Proc. Int. Conf. on Image Processing ICIP), Greece, Oct. 2. [2] A. Ross, A.K. Jain, and J. Reisman, A hybrid fingerprint matcher, in Proc. ICPR 22, Quebec City, Canada, Aug. 22. which corresponds to a χ 2 distribution with d degrees of freedom. Using Expression and assuming pv) constant within class w, the false acceptance rate is given by: FARt, w) = pv) V unit /2 β d t) 43) These expressions for the theoretic error rates can be calculated relatively easily. Experiments show that they provide a very accurate estimate of the error rates for low dimensions d < ), while the experimental results deviate from the predicted values for higher dimensions. This can be explained by the fact that pv) cannot be assumed constant within class w anymore. REFERENCES [] A.M. Bazen and S.H. Gerez, Elastic minutiae matching by means of thin-plate spline models, in Proc. ICPR 22, Quebec City, Aug. 22. [2] A.K. Jain, L. Hong, and R. Bolle, On-line fingerprint verification, IEEE Trans. PAMI, vol. 9, no. 4, pp. 32 34, Apr. 997. [3] A.K. Jain, S. Prabhakar, L. Hong, and S. Pankanti, Filterbankbased fingerprint matching, IEEE Trans. Image Processing, vol. 9, no. 5, pp. 846 859, May 2. [4] M. Golfarelli, D. Maio, and D. Maltoni, On the error-reject trade-off in biometric verification systems, IEEE Trans. PAMI, vol. 9, no. 7, pp. 786 796, July 997. [5] H.L. Van Trees, Detection, estimation, and modulation theory, iley, New York, 968. [6] T.K. Moon and.c. Stirling, Mathematical Methods and Algorithms for Signal Processing, Prentice Hall, Upper Saddle River, NJ, 2. [7] D. Maio, D. Maltoni, R. Cappelli, J.L. ayman, and A.K. Jain, FVC2: Fingerprint verification competition, IEEE Trans. PAMI, vol. 24, no. 3, pp. 42 42, Mar. 22. 89