Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features

Similar documents
COMPLEX WAVELET TRANSFORM IN SIGNAL AND IMAGE ANALYSIS

Research Article Cutting Affine Moment Invariants

Invariant Scattering Convolution Networks

SIMPLE GABOR FEATURE SPACE FOR INVARIANT OBJECT RECOGNITION

Object Recognition Using Local Characterisation and Zernike Moments

Enhanced Fourier Shape Descriptor Using Zero-Padding

A Novel Rejection Measurement in Handwritten Numeral Recognition Based on Linear Discriminant Analysis

A Robust Modular Wavelet Network Based Symbol Classifier

Science Insights: An International Journal

Affine Normalization of Symmetric Objects

Original Article Design Approach for Content-Based Image Retrieval Using Gabor-Zernike Features

IN many image processing applications involving wavelets

FARSI CHARACTER RECOGNITION USING NEW HYBRID FEATURE EXTRACTION METHODS

Lecture Notes 5: Multiresolution Analysis

Neural-wavelet Methodology for Load Forecasting

Accurate Orthogonal Circular Moment Invariants of Gray-Level Images

AN INVESTIGATION ON EFFICIENT FEATURE EXTRACTION APPROACHES FOR ARABIC LETTER RECOGNITION

Role of Assembling Invariant Moments and SVM in Fingerprint Recognition

Low-Complexity Image Denoising via Analytical Form of Generalized Gaussian Random Vectors in AWGN

Image Denoising using Uniform Curvelet Transform and Complex Gaussian Scale Mixture

Multiscale Autoconvolution Histograms for Affine Invariant Pattern Recognition

A New Efficient Method for Producing Global Affine Invariants

A LEAST-SQUARES APPROACH TO MATCHING LINES WITH FOURIER DESCRIPTORS

Feature Extraction Using Zernike Moments

Logo Recognition System Using Angular Radial Transform Descriptors

Multiresolution Analysis

Various Shape Descriptors in Image Processing A Review

Estimation of the Optimum Rotational Parameter for the Fractional Fourier Transform Using Domain Decomposition

An Investigation of 3D Dual-Tree Wavelet Transform for Video Coding

Empirical Analysis of Invariance of Transform Coefficients under Rotation

IMAGE INVARIANTS. Andrej Košir, Jurij F. Tasič

Scale-Invariance of Support Vector Machines based on the Triangular Kernel. Abstract

Sparse linear models

SDP APPROXIMATION OF THE HALF DELAY AND THE DESIGN OF HILBERT PAIRS. Bogdan Dumitrescu

Logarithmic quantisation of wavelet coefficients for improved texture classification performance

Multiresolution image processing

Multi-Layer Boosting for Pattern Recognition

Comparing Robustness of Pairwise and Multiclass Neural-Network Systems for Face Recognition

A New Smoothing Model for Analyzing Array CGH Data

Wavelet Footprints: Theory, Algorithms, and Applications

Multiresolution analysis & wavelets (quick tutorial)

Subcellular Localisation of Proteins in Living Cells Using a Genetic Algorithm and an Incremental Neural Network

OVERLAPPING ANIMAL SOUND CLASSIFICATION USING SPARSE REPRESENTATION

A New Affine Invariant Image Transform Based on Ridgelets

Satellite image deconvolution using complex wavelet packets

Image fusion based on bilateral sharpness criterion in DT-CWT domain

Aruna Bhat Research Scholar, Department of Electrical Engineering, IIT Delhi, India

446 SCIENCE IN CHINA (Series F) Vol. 46 introduced in refs. [6, ]. Based on this inequality, we add normalization condition, symmetric conditions and

Hand Written Digit Recognition using Kalman Filter

A Novel Activity Detection Method

IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS PART C: APPLICATIONS AND REVIEWS, VOL. 30, NO. 1, FEBRUARY I = N

Image Recognition Using Modified Zernike Moments

Hu and Zernike Moments for Sign Language Recognition

DYNAMIC TEXTURE RECOGNITION USING ENHANCED LBP FEATURES

A Method for Blur and Similarity Transform Invariant Object Recognition

Which wavelet bases are the best for image denoising?

Modeling Multiscale Differential Pixel Statistics

Multimedia Databases. Wolf-Tilo Balke Philipp Wille Institut für Informationssysteme Technische Universität Braunschweig

Multimedia Databases. Previous Lecture. 4.1 Multiresolution Analysis. 4 Shape-based Features. 4.1 Multiresolution Analysis

A Constraint Relationship for Reflectional Symmetry and Rotational Symmetry

WE begin by briefly reviewing the fundamentals of the dual-tree transform. The transform involves a pair of

arxiv: v1 [cs.cv] 10 Feb 2016

CP DECOMPOSITION AND ITS APPLICATION IN NOISE REDUCTION AND MULTIPLE SOURCES IDENTIFICATION

A Machine Intelligence Approach for Classification of Power Quality Disturbances

Orientation Map Based Palmprint Recognition

A Novel Fast Computing Method for Framelet Coefficients

CONSTRUCTION OF AN ORTHONORMAL COMPLEX MULTIRESOLUTION ANALYSIS. Liying Wei and Thierry Blu

Fourier descriptors and handwritten digit recognition

Classification of handwritten digits using supervised locally linear embedding algorithm and support vector machine

Constraint relationship for reflectional symmetry and rotational symmetry

On New Radon-Based Translation, Rotation, and Scaling Invariant Transform for Face Recognition

Affine Structure From Motion

Gabor wavelet analysis and the fractional Hilbert transform

Myoelectrical signal classification based on S transform and two-directional 2DPCA

YARN TENSION PATTERN RETRIEVAL SYSTEM BASED ON GAUSSIAN MAXIMUM LIKELIHOOD. Received July 2010; revised December 2010

Comparative study of global invariant. descriptors for object recognition

Imago: open-source toolkit for 2D chemical structure image recognition

Face recognition using Zernike moments and Radon transform

Multisets mixture learning-based ellipse detection

Data-Hiding Capacities of Non-Redundant Complex Wavelets

Wavelet Neural Networks for Nonlinear Time Series Analysis

Symmetric Wavelet Tight Frames with Two Generators

Evaluation of OCR Algorithms for Images with Different Spatial Resolutions and Noises

Multimedia Databases. 4 Shape-based Features. 4.1 Multiresolution Analysis. 4.1 Multiresolution Analysis. 4.1 Multiresolution Analysis

Wavelet Analysis of Print Defects

On the Dual-Tree Complex Wavelet Packet and. Abstract. Index Terms I. INTRODUCTION

Introduction to Wavelets and Wavelet Transforms

Analysis of Redundant-Wavelet Multihypothesis for Motion Compensation

LOCALIZED analyses in one dimension(al) (1-D) have

EECS490: Digital Image Processing. Lecture #26

38 1 Vol. 38, No ACTA AUTOMATICA SINICA January, Bag-of-phrases.. Image Representation Using Bag-of-phrases

Deep Learning: Approximation of Functions by Composition

Wavelet-based Salient Points with Scale Information for Classification

Model-Based Diagnosis of Chaotic Vibration Signals

MULTI-SCALE IMAGE DENOISING BASED ON GOODNESS OF FIT (GOF) TESTS

Design of Image Adaptive Wavelets for Denoising Applications

Rational Coefficient Dual-Tree Complex Wavelet Transform: Design and Implementation Adeel Abbas and Trac D. Tran, Member, IEEE

MOMENT functions are used in several computer vision

DUAL TREE COMPLEX WAVELETS

SPEECH ENHANCEMENT USING PCA AND VARIANCE OF THE RECONSTRUCTION ERROR IN DISTRIBUTED SPEECH RECOGNITION

Transcription:

Invariant Pattern Recognition using Dual-tree Complex Wavelets and Fourier Features G. Y. Chen and B. Kégl Department of Computer Science and Operations Research, University of Montreal, CP 6128 succ. Centre-Ville, Montreal, Quebec, Canada H3C 3J7. Email: {chenguan, kegl}@iro.umontreal.ca. Abstract In this paper, we propose an invariant descriptor for pattern recognition by using the dual-tree complex wavelet and the Fourier transform. The approximate shift-invariant property of the dual-tree complex wavelets and the good property of the Fourier transform make our descriptor a very attractive choice for invariant pattern recognition. Experiments show that our proposed descriptor is very robust to Gaussian white noise and it achieves high recognition rates. Keywords: wavelets, dual-tree complex wavelets, Fourier transform, feature extraction, pattern recognition. 1 Introduction Feature extraction is a crucial step in invariant pattern recognition ([1] - [18]). Many invariant descriptors use the Fourier transform and the wavelet transform to extract invariant features. The Fourier transform has been a powerful tool for pattern recognition ([2] - [7]). One important property of the Fourier transform is that a shift in the time domain causes no change in the magnitude spectrum. This can be used to extract invariant features in pattern recognition. Translation invariance of a 2-D pattern can be achieved by taking the magnitude spectrum of the 2-D Fourier transform of the pattern. Rotation invariance can be obtained by performing 1-D Fourier transform in the angular direction in polar coordinates. Scale invariance can be accomplished by taking the logarithm of the polarized image and then conducting a 1-D Fourier transform along the radial direction. Wavelet transforms have also been proved to be very popular and effective in pattern recognition. Here we briefly review some of the papers for pattern recognition using wavelets. Bui et al. [8] proposed an invariant descriptor by using the orthonormal shell and the Fourier transform. The descriptor is invariant to translation, rotation, and scaling. Chen and Bui [9] invented an invariant descriptor by using a combination of Fourier transform and wavelet transform. They polarize the pattern first, and then perform 1- D wavelet transform along the radial direction and 1- D Fourier transform in the angular direction. Lee et al. [1] proposed a scheme for multiresolution recognition of unconstrained handwritten numerals using the wavelet transform and a simple multilayer cluster neural network. The wavelet features of handwritten numeral at two decomposition levels are fed into the multilayer cluster neural network. Wunsch and Laine [11] gave a descriptor by extracting wavelet features from the outer contour of the handwritten characters and feeding the features into neural networks. Their experiments were done for handprinted characters. Chen et al. [12] developed a descriptor by using multiwavelets and neural networks. The multiwavelet features are also extracted from the outer contour of the handwritten numerals and fed into neural networks. This descriptor gives higher recognition rate than the one given in [11] for handwritten numeral recognition. Khalil and Bayoumi [13] investigated to use wavelet modulus maxima for invariant 2-D pattern recognition. Tieng and Boles [14] used wavelet zero-crossing to recognize 2-D patterns. Tao et al. proposed a technique for feature extraction by using wavelet and fractal [15]. Shen and Ip [16] presented a set of wavelet moment invariants for the classification of seemingly similar objects with subtle differences. Tieng and Boles considered wavelet-based affine invariant pattern recognition in [17] and [18].

In this paper, we propose a novel feature extraction technique for pattern recognition.the pattern is first normalized so that it is translation invariant and scale invariant.a transformation from the Cartesian coordinate system to the polar coordinate system is then performed. We conduct a 1-D Fourier transform along the angular direction and take the magnitude, and a 1-D dual-tree complex wavelet transform along the radial direction. The descriptor thus obtained is invariant to translation, rotation, and scaling. Experimental results show that the descriptor achieves higher recognition rates than the descriptor developed in [9] for recognizing noisy patterns. The organization of this paper is as follows. Section 2 reviews the dual-tree complex wavelet and its shift invariant property. Section 3 proposes a novel feature extraction technique for pattern recognition by using the dual-tree complex wavelet and the Fourier transform. Section 4 conducts some experiments for pattern recognition by considering different rotation angles and noise levels. Finally Section 5 gives the conclusions and future work to be done. 2 The Dual-Tree Complex Wavelet It is well known that the ordinary discrete wavelet transform is not shift invariant because of the decimation operation during the transform. A small shift in the input signal can cause very different output wavelet coefficients. This is the main limitation of wavelet in pattern recognition. One way of overcoming this is to do the wavelet transform without decimation. The drawback of this approach is that it is computationally inefficient, especially in multiple dimensions. Kingsbury ([19] - [21]) introduced a new kind of wavelet transform, called the dual-tree complex wavelet transform, that exhibits approximate shift invariant property and improved angular resolution. The success of the transform is because of the use of filters in two trees, a and b. He proposed a simple delay of one sample between the level 1 filters in each tree, and then the use of alternate odd-length and even-length linear-phase filters. As he pointed out that there are some difficulties in the odd/even filter approach. Therefore, he proposed a new Q shif t dual-tree [22] where all the filters beyond level 1 are even length. The filters in the two trees are just the time-reverse of each other, as are the analysis and reconstruction filters. The new filters are shorter than before, and the new transform still satisfies the shift invariant property and good directional selectivity in multiple dimensions. As will be shown later, this dual-tree complex wavelet can be successfully used in invariant feature extraction for pattern recognition. 3 Pattern Recognition by using a Combination of Fourier and Dualtree Complex Wavelet Features A good pattern recognition descriptor should extract features that are invariant to translation, rotation, and scaling. In this paper, we use the available techniques to preprocess the pattern so that it is invariant to translation and scaling. The rotation invariance can be achieved by our newly developed descriptor below. We polarize the pattern the same way as the descriptor developed in [9]. In order to achieve rotational invariance, we apply the 1-D Fourier transform along the axis of the polar angle to obtain the magnitude of its spectrum. Since the spectrum magnitude of the Fourier transform of circularly shifted signals are the same, we obtain features that are rotation invariant. Because the dual-tree complex wavelet coefficients have approximate shift-invariant property and represent pattern features at different resolution levels, we apply the dual-tree complex wavelet transform along the radial axis. We use the features obtained in this way to recognize the pattern at different resolution levels. The steps of the our newly proposed descriptor can be given as follows: 1. Find the centroid of the pattern f(x, y) and move the centroid to the center of the image. 2. Normalize the pattern so that it is scale invariant. 3. Transform f(x, y) into polar coordinate system to obtain g(r, θ). 4. Conduct 1-D Fourier transform on g(r, θ) along the axis of polar angle θ and obtain its spectrum magnitude: G(r, φ) = F T θ (g(r, θ)). 5. Apply 1-D dual-tree complex wavelet transform on G(r, φ) along the radial axis r: W F (r, φ) = DT W T r (G(r, φ)). 6. Use the daul-tree complex wavelet coefficients to query the pattern feature database at different resolution levels.

Fig. 1 shows the steps of the new descriptor. It is easy to see that this descriptor is similar to that developed in [9]. The only difference is that we replaced the 1-D wavelet transform in [9] with the 1- D dual-tree complex wavelet transform. The approximate shift-invariant property of the dual-tree complex wavelet transform makes this newly proposed descriptor a very good choice for invariant pattern recognition. Experimental results show that this descriptor is an excellent choice for pattern recognition, and it is also robust to Gaussian white noise. 4 Experimental Results We tested our proposed descriptor by using a database of 85 printed Chinese characters in our experiments. The database is shown in Fig. 2. Every character is represented by 64 64 pixels. Our major concern in our experiments is the performance of the descriptor on rotation. For each character, we tested nine rotation angles 3 o, o, 9 o, 1 o, 15 o, 18 o, 21 o, 2 o and 27 o. Daubechies-4 wavelet is used in this experiment. We test the performance of our proposed descriptor on noisy data. The noisy images with different orientations are generated by adding Gaussian white noise to the noise-free images. The signal to noise ratio (SNR) is defined as i,j (f i,j avg(f)) 2 SNR = i,j (n i,j avg(n)) 2 where f is the noise-free image, n is the added white noise, and avg(f) is the average value of the image f. We conduct experiments for SNR=, 15, 1, 5, 4, 3, 2, 1, and.5. Fig. 3 shows the noisy patterns with theses SNR s. The experimental results are listed in Table 1. It is clear that our proposed descriptor is very robust to Gaussian white noise. We compare the recognition rates of our proposed descriptor with those of [9] under the noisy environment. Chen and Bui [9] developed an invariant descriptor by using a combination of the Fourier transform and the wavelet transform. They polarize the pattern first, and then perform a 1-D wavelet transform along the radial direction and a 1-D Fourier transform along the angular direction. We give the recognition rates of the descriptor developed in [9] in Table 2. By looking at the recognition rates in Table 1 and Table 2, we can see that our proposed descriptor produces much better recognition results than the descriptor developed in [9] under the noisy environment. In fact, at SNR=.5, it is very difficult to recognize the patterns even by human eyes. However, our proposed descriptor can do an excellent job in this case. Note that we do not perform any denoising procedure on the noisy pattern. Our proposed descriptor extracts invariant features from the noisy pattern directly and it achieves very high recognition rates. This indicates that our proposed descriptor is very robust to Gaussian white noise even when the noise level is high. 5 Conclusions In this paper, we present an invariant descriptor for pattern recognition by combining Fourier spectrum with the dual-tree complex wavelets. The scale variance and translation variance are eliminated by standard normalization techniques. Our descriptor is also rotational invariant. The approximate shiftinvariant property of the dual-tree complex wavelet transform makes our new descriptor outperform the descriptor developed in [9] for recognizing noisy patterns. Experimental results show that our proposed descriptor is a reliable choice for pattern recognition. It is evident that this descriptor is very robust to Gaussian white noise. Future work will be done by introducing SVM or neural networks to the new descriptor for the classification phase. Acknowledgments This work was supported by the postdoctoral fellowship from the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Canadian Space Agency Postdoctoral Fellowship Supplement. References [1] O. D. Trier, A. K. Jain and T. Taxt, Feature extraction methods for character recognition: A survey, Pattern Recognition, vol. 29, no. 4, pp. 641-662, 1996. [2] G. H. Granlund, Fourier processing for handwritten character recognition, IEEE Trans. Comput., vol. C-21, no. 3, pp. 195-1, 1972. [3] C. C. Lin and R. Chellappa, Classification of partial 2-D shapes using Fourier descriptors, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 9, no. 5, pp. 686-69, 1987. [4] S. S. Wang, P. C. Chen and W. G. Lin, Invariant pattern recognition by moment Fourier descriptor, Pattern Recognition vol. 27, no. 12, pp. 1735-1742, 1994.

[5] C. T. Zahn and R. Z. Roskies, Fourier descriptors for plane closed curves, IEEE Transactions on Computers vol. C-21, no. 3, pp. 269-281, 1972. [6] F. P. Kuhl and C. R. Giardina, Elliptic Fourier features of a closed contour, Computer Graphics and Image Processing, vol. 18, no. 3, pp. 236-258, 1982. [7] C. S. Lin and C.-L. Hwang, New forms of shape invariants from elliptic Fourier descriptor, Pattern Recognition, vol,, no. 5, pp. 535-545, 1987. [8] T. D. Bui, G. Y. Chen and L. Feng, An orthonormal-shell-fourier descriptor for rapid matching of patterns in image database, International Journal of Pattern Recognition and Artificial Intelligence, vol. 15, no. 8, pp. 1213-1229, 1. [9] G. Y. Chen and T. D. Bui, Invariant Fourierwavelet descriptor for pattern recognition, Pattern Recognition, vol. 32, no. 7, pp. 183-188, 1999. [1] S.-W. Lee, C.-H. Kim, H. Ma and Y. Y. Tang, Multiresolution recognition of unconstrained handwritten numerals with wavelet transform and multilayer cluster neural network, Pattern Recognition, vol. 29, no. 12, pp. 1953-1961, 1996. [11] P. Wunsch and A. F. Laine, Wavelet descriptors for multiresolution recognition of handprinted characters, Pattern Recognition, vol. 28, no. 8, pp. 1237-1249, 1995. [15] Y. Tao, E. C. M. Lam and Y. Y. Tang, Feature extraction using wavelet and fractal, Pattern Recognition Letters, vol. 22, no. 3-4, pp. 271-287, 1. [16] D. Shen and H. H. S. Ip, Discriminative wavelet shape descriptors for recognition of 2-D patterns, Pattern Recognition, vol. 32, no. 2, pp. 151-165, 1999. [17] Q. M. Tieng and W. W. Boles, An application of wavelet based affine-invariant representation, Pattern Recognition Letters, vol. 16, no. 12, pp. 1287-1296, 1995. [18] Q. M. Tieng and W. W. Boles, Waveletbased affine invariant representation: A tool for recognizing planar objects in 3D space, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 8, pp. 846-857, 1997. [19] N. G. Kingsbury, The dual-tree complex wavelet transform: a new efficient tool for image restoration and enhancement, Proc. EUSIPCO 98, Rhodes, Sept. 1998, pp. 319-322. [] N. G. Kingsbury, Shift invariant properties of the dual-tree complex wavelet transform, Proc. IEEE ICASSP 99, Phoenix, AZ, March 1999. [21] J. Romberg, H. Choi, R. Baraniuk and N. G. Kingsbury, Multiscale classification using complex wavelets, Proc. IEEE ICIP, Vancouver, Sept. 11-13,. [22] N. G. Kingsbury, A dual-tree complex wavelet transform with improved orthogonality and symmetry properties, Proc. IEEE ICIP, Vancouver, Sept. 11-13,. [12] G. Y. Chen, T. D. Bui and A. Krzyżak, Contour-based handwritten numeral recognition using multiwavelets and neural networks, Pattern Recognition, vol. 36, no. 7, pp. 1597-14, 3. [13] M. I. Khalil and M. M. Bayoumi, Invariant 2- D object recognition using the wavelet modulus maxima, Pattern Recognition Letters, vol. 21, no. 9, pp. 863-872,. [14] Q. T. Tieng and W. W. Boles, Recognition of 2-D object contours using wavelet transform zero-crossing representation, IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 19, no. 8, pp. 91-916, 1997.

Original character Polarized character 1 3 8 1 5 1 8 1 1 Magnitudes of Fourier coefficients (1D FFT) Dual tree Complex Wavelet Transform 4 2 6 4 2 Figure 1: The original character, its polarization, its Fourier magnitude spectrum, and the dual-tree complex wavelet coefficients. Figure 2: The Chinese character database used in this paper.

SNR= SNR=15 SNR=1 SNR=5 SNR=4 SNR=3 SNR=2 SNR=1 SNR=.5 Figure 3: The noisy patterns with SNR =, 15, 1, 5, 4, 3, 2, 1, and.5, respectively. Different Rotation SNR 3 o o 9 o 1 o 15 o 18 o 21 o 2 o 27 o 1 1 1 1 1 1 1 1 1 15 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 1 1 96.47 96.47 1 94.12 92.94 96.47 94.12 95.29 98.82.5 74.12 69.41 65.88 55.29 61.18 61.18 61.18 61.18 67.6 Table 1: The recognition rates of our proposed descriptor for different SNR s. Different Rotation SNR 3 o o 9 o 1 o 15 o 18 o 21 o 2 o 27 o 1 1 1 1 1 1 1 1 1 15 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 5 1 1 1 1 1 1 1 1 1 4 1 1 1 1 1 1 1 1 1 3 1 1 1 1 1 1 1 1 1 2 1 1 1 97.65 97.65 98.82 1 98.82 1 1 9.59 89.41 89.41 85.88 84.71 87.6 87.6 84.71 89.41.5. 55.29 56.47 44.71 5.59 48.24 45.88 47.6. Table 2: The recognition rates of [9] for different SNR s.