Pop-Click Noise Detection Using Inter-Frame Correlation for Improved Portable Auditory Sensing

Similar documents
An Improved multiple fractal algorithm

IMAGE DENOISING USING NEW ADAPTIVE BASED MEDIAN FILTER

Multigradient for Neural Networks for Equalizers 1

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

A Novel Fuzzy logic Based Impulse Noise Filtering Technique

CHAPTER 4 SPEECH ENHANCEMENT USING MULTI-BAND WIENER FILTER. In real environmental conditions the speech signal may be

CONTRAST ENHANCEMENT FOR MIMIMUM MEAN BRIGHTNESS ERROR FROM HISTOGRAM PARTITIONING INTRODUCTION

STUDY OF A THREE-AXIS PIEZORESISTIVE ACCELEROMETER WITH UNIFORM AXIAL SENSITIVITIES

Chapter 13: Multiple Regression

Parameter Estimation for Dynamic System using Unscented Kalman filter

Average Decision Threshold of CA CFAR and excision CFAR Detectors in the Presence of Strong Pulse Jamming 1

The Study of Teaching-learning-based Optimization Algorithm

Department of Electrical & Electronic Engineeing Imperial College London. E4.20 Digital IC Design. Median Filter Project Specification

Inductance Calculation for Conductors of Arbitrary Shape

Low Complexity Soft-Input Soft-Output Hamming Decoder

Microwave Diversity Imaging Compression Using Bioinspired

Multiple Sound Source Location in 3D Space with a Synchronized Neural System

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

Impulse Noise Removal Technique Based on Fuzzy Logic

Negative Binomial Regression

Improvement of Histogram Equalization for Minimum Mean Brightness Error

Orientation Model of Elite Education and Mass Education

FFT Based Spectrum Analysis of Three Phase Signals in Park (d-q) Plane

A New Scrambling Evaluation Scheme based on Spatial Distribution Entropy and Centroid Difference of Bit-plane

Regularized Discriminant Analysis for Face Recognition

White Noise Reduction of Audio Signal using Wavelets Transform with Modified Universal Threshold

COMPUTATIONALLY EFFICIENT WAVELET AFFINE INVARIANT FUNCTIONS FOR SHAPE RECOGNITION. Erdem Bala, Dept. of Electrical and Computer Engineering,

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

A Multi-Level Approach for Temporal Video Segmentation based on Adaptive Examples

arxiv:cs.cv/ Jun 2000

Short Term Load Forecasting using an Artificial Neural Network

One-sided finite-difference approximations suitable for use with Richardson extrapolation

A Network Intrusion Detection Method Based on Improved K-means Algorithm

Structure and Drive Paul A. Jensen Copyright July 20, 2003

An Application of Fuzzy Hypotheses Testing in Radar Detection

An Upper Bound on SINR Threshold for Call Admission Control in Multiple-Class CDMA Systems with Imperfect Power-Control

Color Rendering Uncertainty

NON-LINEAR CONVOLUTION: A NEW APPROACH FOR THE AURALIZATION OF DISTORTING SYSTEMS

Polynomial Regression Models

A NEW DISCRETE WAVELET TRANSFORM

Proportionate Sign Subband Adpative Filtering Algorithm for Network Echo Cancellers

De-noising Method Based on Kernel Adaptive Filtering for Telemetry Vibration Signal of the Vehicle Test Kejun ZENG

Analytical Chemistry Calibration Curve Handout

Pulse Coded Modulation

A Robust Method for Calculating the Correlation Coefficient

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

Outage Probability of Macrodiversity Reception in the Presence of Fading and Weibull Co- Channel Interference

Scroll Generation with Inductorless Chua s Circuit and Wien Bridge Oscillator

Boostrapaggregating (Bagging)

DETERMINATION OF TEMPERATURE DISTRIBUTION FOR ANNULAR FINS WITH TEMPERATURE DEPENDENT THERMAL CONDUCTIVITY BY HPM

A Two-Level Detection Algorithm for Optical Fiber Vibration

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for U Charts. Dr. Wayne A. Taylor

Identification of Linear Partial Difference Equations with Constant Coefficients

Lab 2e Thermal System Response and Effective Heat Transfer Coefficient

DPCM Compression for Real-Time Logging While Drilling Data

Research on Modified Root-MUSIC Algorithm of DOA Estimation Based on Covariance Matrix Reconstruction

The Synchronous 8th-Order Differential Attack on 12 Rounds of the Block Cipher HyRAL

Quantitative Discrimination of Effective Porosity Using Digital Image Analysis - Implications for Porosity-Permeability Transforms

A new Approach for Solving Linear Ordinary Differential Equations

AGC Introduction

COMPARING NOISE REMOVAL IN THE WAVELET AND FOURIER DOMAINS

Harmonic Detection Algorithm based on DQ Axis with Fourier Analysis for Hybrid Power Filters

Relationship between Refractive Index and Molar Concentration of Multi-Component Solutions Zhu Xingyu 1, a, Mai Tiancheng 2, b and Zhao Zilong 2, c

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Grover s Algorithm + Quantum Zeno Effect + Vaidman

Statistics II Final Exam 26/6/18

Vector Median Filter with Directional Detector for Color Image Denoising

Uncertainty in measurements of power and energy on power networks

Comparison of Regression Lines

WARPED LINEAR PREDICTION FOR IMPROVED PERCEPTUAL QUALITY IN THE SCELP LOW DELAY AUDIO CODEC (W-SCELP)

ECE559VV Project Report

A New Metric for Quality Assessment of Digital Images Based on Weighted-Mean Square Error 1

Comparative Analysis between Different Linear Filtering Algorithms of Gamma Ray Spectroscopy

The Order Relation and Trace Inequalities for. Hermitian Operators

Suppose that there s a measured wndow of data fff k () ; :::; ff k g of a sze w, measured dscretely wth varable dscretzaton step. It s convenent to pl

Integrating Neural Networks and PCA for Fast Covert Surveillance

Over-Temperature protection for IGBT modules

Power law and dimension of the maximum value for belief distribution with the max Deng entropy

Lecture 6: Introduction to Linear Regression

Modal Strain Energy Decomposition Method for Damage Detection of an Offshore Structure Using Modal Testing Information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Mixed Noise Suppression in Color Images by Signal-Dependent LMS L-Filters

Transfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system

Support Vector Machines. Vibhav Gogate The University of Texas at dallas

THE EFFECT OF TORSIONAL RIGIDITY BETWEEN ELEMENTS ON FREE VIBRATIONS OF A TELESCOPIC HYDRAULIC CYLINDER SUBJECTED TO EULER S LOAD

On the Repeating Group Finding Problem

AN ADAPTIVE WATERMARKING ALGORITHM FOR DEM BASED ON DFT

Research Article Green s Theorem for Sign Data

An Admission Control Algorithm in Cloud Computing Systems

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

A New Adaptive Filter Approach for Acoustic Echo Canceller in Teleconference Systems

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Chapter 5 Multilevel Models

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Novel Pre-Compression Rate-Distortion Optimization Algorithm for JPEG 2000

MACHINE APPLIED MACHINE LEARNING LEARNING. Gaussian Mixture Regression

System in Weibull Distribution

Transcription:

Advanced Scence and Technology Letters, pp.164-168 http://dx.do.org/10.14257/astl.2013 Pop-Clc Nose Detecton Usng Inter-Frame Correlaton for Improved Portable Audtory Sensng Dong Yun Lee, Kwang Myung Jeon, and Hong Koo Km School of Informaton and Communcatons Gwangju Insttute of Scence and Technology (GIST) {ldy, mjeon, hongoo}@gst.ac.r Abstract. In ths paper, a pop-clc nose detecton method s proposed to mprove the qualty of audo sgnals recorded usng portable mcrophones. In order to reduce false alarm and mssng detecton errors, the proposed method utlzes the second-order dfference of an uncorrelated resdual sgnal, followed by adaptve medan thresholdng. It s shown from performance evaluaton that the proposed method acheves hgher detecton accuracy, under varous sgnal-tonose rato condtons, than wth a conventonal method usng the frst-order dfference of the resdual sgnal. Keywords: Pop-clc nose, lnear predcton, dfference of resdual sgnal, adaptve medan thresholdng 1 Introducton When an audo sgnal s recorded by portable audtory sensors such as a condenser mcrophone or a mcro electrcal-mechancal system (MEMS) mcrophone, there are many factors that generate nose. In partcular, a pop-clc nose s generated by varous physcal phenomena n acoustcs, such as touchng screens, clcng buttons, and so on [1]. Interference by such noses whle recordng can be hghly annoyng to most lsteners. In order to detect a pop-clc nose, there have been many nose detecton methods proposed [1 4]. Among them, the technques usng the resdual sgnal [3] and the frst-order dfference of the resdual sgnal [4] were reported to successfully detect pop-clc noses. However, the error rate ncreased extremely n the nterval where the sgnal-to-nose rato (SNR) was hgh [4]. Thus, n order to mprove detecton accuracy under hgh SNR condtons, the proposed pop-clc nose detecton method frst utlzes the second-order dfference of an uncorrelated resdual sgnal. Ths gves more emphass on a pop-clc nose as t has strong energy n the hgh frequency band. Next, adaptve medan threshold [5][6] s appled at those values to determne whether or not there s a pop-clc nose. Followng ths ntroducton, Secton 2 proposes a pop-clc nose detecton method usng the second-order dfference of an uncorrelated resdual sgnal. Next, Secton 3 evaluates the performance of the proposed method and compares t wth that of a ISSN: 2287-1233 ASTL Copyrght 2013 SERSC

Advanced Scence and Technology Letters Fg. 1. Procedure of the proposed pop-clc nose detecton method. conventonal method usng the frst-order dfference of the resdual sgnal. Fnally, Secton 4 concludes the paper. 2 Proposed Pop-Clc Nose Detecton Method Fg. 1 shows an overall procedure of the proposed pop-clc nose detecton method. As shown n the fgure, the resdual sgnal of the -th frame, e, s frst obtaned by the lnear predcton (LP) technque, defned as where p e = s a s ( n ) (1) = 1 a and s are the -th LP coeffcent and the n-th nput sample of the -th frame, respectvely. In addton, p s the lnear predcton order. Smlarly to Eq. (1), the resdual sgnal of the nput sgnal appled to the (-1)-th lnear predcton flter, e M, s obtaned as e M p = 1 1 = s a s ( n ) (2) 1 where a s the -th LP coeffcent obtaned from the (-1)-th frame. The dfference between e and e M corresponds to an uncorrelated resdual sgnal between the -th and (-1)-th frames, whch s obtaned as e D M = e e. (3) Next, the absolute value of the second-order dfference of an uncorrelated resdual sgnal s obtaned as D D D g = e ( n 1) 2 e + e ( n + 1). (4) Copyrght 2013 SERSC 165

Advanced Scence and Technology Letters Table 2. Comparson of F-measure values between two pop-clc nose detecton methods at 15 db SNR. LP order 4 6 8 10 Conventonal method 0.74 0.84 0.81 0.81 Proposed method 0.85 0.95 0.89 0.86 The g n Eq. (4) becomes large durng nose ntervals, thus t s used to decde whether or not the -th frame ncludes a pop-clc nose. The decson s actually performed by applyng adaptve medan thresholdng to g, such that 1, f g > θ med( g ( n m),, g,, g ( n + m)) N( ) = 0, otherwse (5) where med ( ), θ, and m are a medan flter, a scale factor, and the coverage of the medan flter, respectvely. In Eq. (5), N () ndcates the presence of pop-clc nose or not by ts value. In other words, a pop-clc nose s detected, that s, N ( ) = 1, f the value of the second-order dfference of an uncorrelated resdual sgnal s hgher than the value of the medan threshold. Otherwse, N ( ) = 0, whch means there s no pop-clc nose wthn the -th frame. 3 Performance Evaluaton The performance of the proposed method was evaluated by usng F-measure [7], and t was compared wth that of a conventonal method usng the frst-order dfference of the resdual sgnal. For the test, three male and three female voce sgnals, sampled at 48 Hz, were chosen. Each sample was one mnute long wth 100 pop-clc noses n total. For the analyss, each sgnal was segmented nto consecutve frames by applyng a Hannng wndow whose length was 4,096 samples, where each frame was overlapped by half wth the prevous frame. Note here that the parameters n Eq. (5) for medan threshold were set as θ = 50 and m = 30. Frst, n order to select a sutable LP order, the detecton accuracy was measure dependng on the LP order at 15 db SNR. Table 1 compares F-measure values between the conventonal and proposed methods for four dfferent LP orders. It was shown from the table that the proposed method provded hgher F-measure value than the conventonal method for all LP orders. The hghest detecton accuracy was acheved when LP order was sx. Thus, the LP order was set to sx for the next experment. Second, n order to evaluate the performance of the detecton under SNR condtons, pop-clc noses were mxed to have dfferent SNRs from 0 25 db at 5 db ncrements. The hgher SNR means that a pop-clc nose has lower power compared wth that of the orgnal sgnal. Fg. 2 also compares F-measure values between the conventonal and proposed methods under dfferent SNRs. It was shown from the fgure that the proposed method had hgher F-measure values than the conventonal method under all the SNRs. 166 Copyrght 2013 SERSC

Advanced Scence and Technology Letters Fg. 2. F-measure values of the conventonal and proposed methods under dfferent SNRs rangng from 0 25 db. 4 Concluson In ths paper, a pop-clc nose detecton method was proposed for mproved portable audtory sensng. The proposed method used second-order dfference of an uncorrelated resdual sgnal, and t acheved hgher pop-clc nose detecton accuracy by 4.43% than wth a conventonal method usng the frst-order dfference of the resdual sgnal. Acnowledgments. Ths wor was supported n part by the NRF grant funded by the government of Korea (MSIP) (No. 2012-010636), and by the MSIP, Korea, under the ITRC support program supervsed by the NIPA (NIPA-2013-H0301-13-4005). References 1. Sadler, B. M.: Detecton n correlated mpulsve nose usng fourth-order cumulants. IEEE Transactons on Sgnal Processng, 44(11), (1996) pp. 2793-2800. 2. Chandra, C., Moore, M. S., Mtra, S. K.: An effcent method for the removal of mpulse nose from speech and audo sgnals. In: Proceedngs of ISCAS, (1998) pp. 206-208. 3. Kauppnen, I.: Methods for detectng mpulsve nose n speech and audo sgnals. In: Proceedngs of Internatonal Conference on 14th Dgtal Sgnal Processng, (2002) pp. 967-970. 4. Hong, J., Par, J., Han, S., Hahn, M.: Sporadc nose reducton for robust speech recognton n moble devces. In: Proceedngs of IEEE Internatonal Conference on Consumer Electroncs, (2011) pp. 831-832. 5. Chen, T., Wu, H. R.: Adaptve mpulse detecton usng center-weghted medan flters. IEEE Sgnal Processng Letters, 8(1), (2001) pp. 1-3. Copyrght 2013 SERSC 167

Advanced Scence and Technology Letters 6. Esquef, P. A. A., Bscanho, L. W. P., Dnz, P. S. R., Freeland, F. P.: A double-thresholdbased approach to mpulsve nose detecton n audo sgnals. In: Proceedngs of EUSIPCO, (2000) pp. 2041-2044. 7. Powers, D. M. W.: Evaluaton: from precson, recall and F-measure to ROC, nformedness, maredness & correlaton. Journal of Machne Learnng Technologes, 2(1), (2011) pp. 37-63. 168 Copyrght 2013 SERSC