Albenzio Cirillo INFOCOM Dpt. Università degli Studi di Roma, Sapienza
|
|
- Bryce Holmes
- 5 years ago
- Views:
Transcription
1 Albenzio Cirillo INFOCOM Dpt. Università degli Studi di Roma, Sapienza ET2010 XXVI Riunione Annuale dei Ricercatori di Elettrotecnica Napoli, 9-11 giugno 2010
2 Motivation Sound Source Localization (SSL): Time-Delay Estimation (TDE) approach; Steered Response Power (SRP) approach. TDE anomalies and secondary peaks: Smoothed Likelihood Function (SLF); Optimal Line Selection (OLS): parameters and robustness. Simplified Optimal Line Selection: Computational savings. Conclusion 2
3 Ultimate goal of engineering is to create context-aware machines context-awareness depends on the knowledge of basic information useful to describe events: the position of a sound source is a fundamental information to refine further audio signal processing algorithms; Performance of state-of-art SSL estimators are hampered by the presence of reverberation (T60), so there is still need for a robust solution. Effect of reverberation is commonly represented by a FIR filter and the signal acquired from each microphone is corrupted by additive white noise. A typical approach consists of estimating the time-delay between microphone pairs and combining the delays to retrieve the position information. Source s(t) s m1 m2 mn x1(t)=h1(t)*s(t)+n1(t) x2(t)=h2(t)*s(t)+n2(t) xn(t)=hn(t)*s(t)+nn(t) SSL estimator 3
4 Blind Channel Identification (BCI) Method Source s(t) x1[n] x2[n] hˆ2 [ n] hˆ1 [ n] + - e[n] h ˆ [ n] arg max h ˆ [ ] ˆ arg max 2 n 1 n n Pro: Reverberation is considered by modeling the channel as a FIR. Cons: h1[n] and h2[n] possibly have common zeros, so there is not unique solution to the BCI problem. Filter length needs to be increased with a higher reverberation time. Speech signal does not guarantee the convergence of the adaptative algorithm. 4
5 Generalized Cross Correlation (GCC) G x1 x 2 (f ) CrossPower Spectrum Density; g (f ) Generic Pre-filtering Function. Phase Transform (PHAT) PHAT mitigates the effect of reverberation on GCC.. ( g ) 2 ( ) ( ) ( ) j f x1 x 2 g x1 x2 R f G f e df PHAT g ( f ) ˆ arg max 1 G ( f) xx 1 2 PHAT R x ( ) 1x 2 T 60 =0.15 s T 60 =0.9 s 5
6 There is a TDE anomaly whenever PHAT ( PHAT Rx kt) R ( ), with kt T 1x2 x1x 2 TRUE TRUE Probability of Anomaly (PA) can be evaluated considering GCC samples as variables of a normal distribution whose variance depends on noise level [Ianniello,1982] or on the reverberation time [Gustafsson,Trivedi,Rao 2003] P A L1 z0 z 0 zldz l 1 dz L is the number of samples in the feasible range of delays; z 0 z l is the variable associated to the max GCC value; is the variable associated to the generic GCC value. 0 Effects of reverberation depend on the distance between the source and the microphones r i [ m] i 6
7 A different approach to SSL is SRP-PHAT, that does not require TDE. Given the mathematical delay SRP-PHAT function for M microphone pairs SRP M j f s m1i m2i ( s) ( ) x ( ) 1ix2i i1 Sound map of the ISPAC Lab s m1 s m2 T(s,m 1,m2), c speed of c PHAT g 2 (,, ) f G f e df propagation, ŝ arg max This filter-and-delay beamformer is robust to reverberation. It is needed an exhaustive search on a grid of point in the investigated environment. Scaling SRP values, it is possible to obtain an image known as Sound Map of the acoustic event. 7 s F SRP (s)
8 Smoothed Likelihood Function (SLF) Given the first k peaks of GCC-PHAT for each of the P microphone pairs P ( p) ( p) 2 SLF ( s) max k Vk ( ( s, m1p, m2p); k, ) p1 PHAT V R ( ) k x x k 1 2 SLF for a single mic pair with k=3 8
9 SLF is much more a regular function than SRP-PHAT. ISPAC Lab 4.22x5.5x2.93 m; T 60 =0.3s; SLF SRP-PHAT SLF SRP-PHAT 9
10 OLS is an extension of the Linear Intersection (LI) technique for SSL[Brandstein,1995]. Mic quadruple: 2 orthogonal pairs. 1 c cos 2 2 d s.t. cos ( 12) cos ( 34) 1 10
11 LI Mic quadruple Considering k delays for each mic pair, it is possible to bear up to k 2 lines for each quadruple. Points at minimum distance between skew lines 11
12 Select the set of lines that generates the most compact set of points at minimum distance; C L 1 N / ( i, j) L b sij, point at min distance between line i and j;; N, the number of mic pairs bl, mid-point of the L-th set of points s ij L 2 Up to k 2*Q sets of lines, with Q quadruples; It is needed a sub-optimal search. 12
13 Simplified OLS (S-OLS) is a sub-optimal version of the OLS estimator. Considering just 2 quadruples, it is possible to select up to k 4 initial guesses as source position and use them to find the best set of lines. The number of comparisons to select the best set is: (Q-2)k 2 *k 4 OLS requires k 2Q comparisons, so let us try to compare it to S-OLS: ( Q 2) k * k k 2 4 2Q 6 2Q ( Q2) k 1 Example k 3, Q4 2Q OLSev k S-OLSev ( Q 2) k * k 14 S-OLSev 0.2 OLSev 58 13
14 Root Mean Square Error of Localization versus Reverberation Time Room dimension : Lx=10m, Ly=6.6m, Lz=3m OLS and S-OLS performance are similar to SRP-PHAT. 14
15 In case none of the set of lines produces a set of points that is compact enough, there is a missed localization event. K(1:3) K(1:3) Increasing k does make sense! 15
16 Are we really solving the problem of time-delay ambiguities? In most cases, increasing k, time-delays are disambiguated.. Results on 6 different mic pairs K(1:3) 16
17 OLS is a real-time algorithm that allow us to have a good robustness to varying environment condition, but still requires a good microphone displacement. Next goal is to produce a joint audio-video estimator, in order to build up a generic framework for audio-video processing and classification. Frames from the Automatic Cameraman Demo dex.htm Venue: SmartSpaces Lab, CalIt2 Building, University of California, San Diego. Description: the red square is located around the sound source that has been estimated thanks to the OLS method. The system is made of 16 microphones. 17
18 I. Parisi R., Cirillo A., Panella M., Uncini A., SOURCE LOCALIZATION IN REVERBERANT ENVIRONMENTS BY CONSISTENT PEAK SELECTION, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, (ICASSP 2007), Honolulu, Hawaii, USA, April 15-20, II. Musolino A., Raugi M., Turcu F., Parisi R., Uncini A., Cirillo A., LOCALIZATION OF DEFECTS IN CONCRETE STRUCTURES VIA THE CROSS POWER SPECTRUM PHASE, Proc. Of PIERS 2007, Prague, Czech Republic, August 27-30, III. Cirillo A., Parisi R., Uncini A., SOUND MAPPING IN REVERBERANT ROOMS BY A ROBUST DIRECT METHOD, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, (ICASSP 2008), Las Vegas, Nevada, USA, March 30-April 4, IV. Cirillo A., Parisi R., Uncini A., PREFILTERING TECHNIQUES ON CONSISTENT PEAK SELECTION FOR TALKER POSITION ESTIMATION IN REVERBERANT ROOMS, Proc. Of Hands-Free Speech Communication and Microphone Arrays, 2008, Trento, Italy, May 6-8, V. Cirillo A., Parisi R., Uncini A., A NEW CONSISTENCY MEASURE FOR LOCALIZATION OF SOUND SOURCES IN THE PRESENCE OF REVERBERATION, Proceedings of the IEEE Digital Signal Processing conference (DSP 2009), Santorini, Greece, July 5-7, VI. VII. Zannini C. M., Cirillo A., Parisi R., Uncini A., IMPROVED TDOA DISAMBIGUATION TECHNIQUES FOR SOUND SOURCE LOCALIZATION IN REVERBERANT ENVIRONMENTS, Proceedings of IEEE International Symposium on Circuits And Systems, Paris, France, May 28-Jun 2, Cirillo A., Scarpiniti M., Parisi R., Uncini A., SIMPLIFIED OPTIMAL LINE SELECTION FOR ACOUSTIC LOCALIZATION IN THE PRESENCE OF REVERBERATION, EUSIPCO 2010, Aalborg, Denmark, August 23-27,
A comparative study of time-delay estimation techniques for convolutive speech mixtures
A comparative study of time-delay estimation techniques for convolutive speech mixtures COSME LLERENA AGUILAR University of Alcala Signal Theory and Communications 28805 Alcalá de Henares SPAIN cosme.llerena@uah.es
More informationShankar Shivappa University of California, San Diego April 26, CSE 254 Seminar in learning algorithms
Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models. Say Wei Foo, Yong Lian, Liang Dong. IEEE Transactions on Circuits and Systems for Video Technology, May 2004. Shankar
More informationA. Ideal model. where i is an attenuation factor due to propagation effects,
Adaptive eigenvalue decomposition algorithm for passive acoustic source localization Jacob Benesty Bell Laboratories, Lucent Technologies, 700 Mountain Avenue, Murray Hill, New Jersey 07974-0636 Received
More informationLECTURE NOTES IN AUDIO ANALYSIS: PITCH ESTIMATION FOR DUMMIES
LECTURE NOTES IN AUDIO ANALYSIS: PITCH ESTIMATION FOR DUMMIES Abstract March, 3 Mads Græsbøll Christensen Audio Analysis Lab, AD:MT Aalborg University This document contains a brief introduction to pitch
More informationA low intricacy variable step-size partial update adaptive algorithm for Acoustic Echo Cancellation USNRao
ISSN: 77-3754 International Journal of Engineering and Innovative echnology (IJEI Volume 1, Issue, February 1 A low intricacy variable step-size partial update adaptive algorithm for Acoustic Echo Cancellation
More information"Robust Automatic Speech Recognition through on-line Semi Blind Source Extraction"
"Robust Automatic Speech Recognition through on-line Semi Blind Source Extraction" Francesco Nesta, Marco Matassoni {nesta, matassoni}@fbk.eu Fondazione Bruno Kessler-Irst, Trento (ITALY) For contacts:
More informationNOISE ROBUST RELATIVE TRANSFER FUNCTION ESTIMATION. M. Schwab, P. Noll, and T. Sikora. Technical University Berlin, Germany Communication System Group
NOISE ROBUST RELATIVE TRANSFER FUNCTION ESTIMATION M. Schwab, P. Noll, and T. Sikora Technical University Berlin, Germany Communication System Group Einsteinufer 17, 1557 Berlin (Germany) {schwab noll
More informationAcoustic Signal Processing. Algorithms for Reverberant. Environments
Acoustic Signal Processing Algorithms for Reverberant Environments Terence Betlehem B.Sc. B.E.(Hons) ANU November 2005 A thesis submitted for the degree of Doctor of Philosophy of The Australian National
More informationDominant Pole Localization of FxLMS Adaptation Process in Active Noise Control
APSIPA ASC 20 Xi an Dominant Pole Localization of FxLMS Adaptation Process in Active Noise Control Iman Tabatabaei Ardekani, Waleed H. Abdulla The University of Auckland, Private Bag 9209, Auckland, New
More informationTim Habigt. May 30, 2014
Slide 1/26 Sound-Source Localization Tim Habigt May 2014 Sound-Source Localization Tim Habigt Lehrstuhl für Datenverarbeitung May 30, 2014 Slide 2/26 Sound-Source Localization Tim Habigt May 2014 Table
More informationMITIGATING UNCORRELATED PERIODIC DISTURBANCE IN NARROWBAND ACTIVE NOISE CONTROL SYSTEMS
17th European Signal Processing Conference (EUSIPCO 29) Glasgow, Scotland, August 24-28, 29 MITIGATING UNCORRELATED PERIODIC DISTURBANCE IN NARROWBAND ACTIVE NOISE CONTROL SYSTEMS Muhammad Tahir AKHTAR
More informationAcoustic MIMO Signal Processing
Yiteng Huang Jacob Benesty Jingdong Chen Acoustic MIMO Signal Processing With 71 Figures Ö Springer Contents 1 Introduction 1 1.1 Acoustic MIMO Signal Processing 1 1.2 Organization of the Book 4 Part I
More informationAcoustic Source Separation with Microphone Arrays CCNY
Acoustic Source Separation with Microphone Arrays Lucas C. Parra Biomedical Engineering Department City College of New York CCNY Craig Fancourt Clay Spence Chris Alvino Montreal Workshop, Nov 6, 2004 Blind
More informationA new method for a nonlinear acoustic echo cancellation system
A new method for a nonlinear acoustic echo cancellation system Tuan Van Huynh Department of Physics and Computer Science, Faculty of Physics and Engineering Physics, University of Science, Vietnam National
More informationREAL-TIME TIME-FREQUENCY BASED BLIND SOURCE SEPARATION. Scott Rickard, Radu Balan, Justinian Rosca. Siemens Corporate Research Princeton, NJ 08540
REAL-TIME TIME-FREQUENCY BASED BLIND SOURCE SEPARATION Scott Rickard, Radu Balan, Justinian Rosca Siemens Corporate Research Princeton, NJ 84 fscott.rickard,radu.balan,justinian.roscag@scr.siemens.com
More informationA Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement
A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,
More informationA SPARSENESS CONTROLLED PROPORTIONATE ALGORITHM FOR ACOUSTIC ECHO CANCELLATION
6th European Signal Processing Conference (EUSIPCO 28), Lausanne, Switzerland, August 25-29, 28, copyright by EURASIP A SPARSENESS CONTROLLED PROPORTIONATE ALGORITHM FOR ACOUSTIC ECHO CANCELLATION Pradeep
More informationUSING STATISTICAL ROOM ACOUSTICS FOR ANALYSING THE OUTPUT SNR OF THE MWF IN ACOUSTIC SENSOR NETWORKS. Toby Christian Lawin-Ore, Simon Doclo
th European Signal Processing Conference (EUSIPCO 1 Bucharest, Romania, August 7-31, 1 USING STATISTICAL ROOM ACOUSTICS FOR ANALYSING THE OUTPUT SNR OF THE MWF IN ACOUSTIC SENSOR NETWORKS Toby Christian
More informationMusical noise reduction in time-frequency-binary-masking-based blind source separation systems
Musical noise reduction in time-frequency-binary-masing-based blind source separation systems, 3, a J. Čermá, 1 S. Arai, 1. Sawada and 1 S. Maino 1 Communication Science Laboratories, Corporation, Kyoto,
More informationScalable audio separation with light Kernel Additive Modelling
Scalable audio separation with light Kernel Additive Modelling Antoine Liutkus 1, Derry Fitzgerald 2, Zafar Rafii 3 1 Inria, Université de Lorraine, LORIA, UMR 7503, France 2 NIMBUS Centre, Cork Institute
More informationSource localization and separation for binaural hearing aids
Source localization and separation for binaural hearing aids Mehdi Zohourian, Gerald Enzner, Rainer Martin Listen Workshop, July 218 Institute of Communication Acoustics Outline 1 Introduction 2 Binaural
More informationA Local Model of Relative Transfer Functions Involving Sparsity
A Local Model of Relative Transfer Functions Involving Sparsity Zbyněk Koldovský 1, Jakub Janský 1, and Francesco Nesta 2 1 Faculty of Mechatronics, Informatics and Interdisciplinary Studies Technical
More informationROBUSTNESS OF PARAMETRIC SOURCE DEMIXING IN ECHOIC ENVIRONMENTS. Radu Balan, Justinian Rosca, Scott Rickard
ROBUSTNESS OF PARAMETRIC SOURCE DEMIXING IN ECHOIC ENVIRONMENTS Radu Balan, Justinian Rosca, Scott Rickard Siemens Corporate Research Multimedia and Video Technology Princeton, NJ 5 fradu.balan,justinian.rosca,scott.rickardg@scr.siemens.com
More informationAN APPROACH TO PREVENT ADAPTIVE BEAMFORMERS FROM CANCELLING THE DESIRED SIGNAL. Tofigh Naghibi and Beat Pfister
AN APPROACH TO PREVENT ADAPTIVE BEAMFORMERS FROM CANCELLING THE DESIRED SIGNAL Tofigh Naghibi and Beat Pfister Speech Processing Group, Computer Engineering and Networks Lab., ETH Zurich, Switzerland {naghibi,pfister}@tik.ee.ethz.ch
More informationTime Delay Estimation in Room Acoustic Environments: An Overview
Hindawi Publishing Corporation EURASIP Journal on Applied Signal Processing Volume 26, Article ID 2653, Pages 1 19 DOI 1.1155/ASP/26/2653 Time Delay Estimation in Room Acoustic Environments: An Overview
More informationTo Separate Speech! A System for Recognizing Simultaneous Speech
A System for Recognizing Simultaneous Speech John McDonough 1,2,Kenichi Kumatani 2,3,Tobias Gehrig 4, Emilian Stoimenov 4, Uwe Mayer 4, Stefan Schacht 1, Matthias Wölfel 4 and Dietrich Klakow 1 1 Spoken
More informationNear Optimal Adaptive Robust Beamforming
Near Optimal Adaptive Robust Beamforming The performance degradation in traditional adaptive beamformers can be attributed to the imprecise knowledge of the array steering vector and inaccurate estimation
More informationIndependent Component Analysis. Slides from Paris Smaragdis UIUC
Independent Component Analysis Slides from Paris Smaragdis UIUC 1 A simple audio problem 2 Formalizing the problem Each mic will receive a mi of both sounds Sound waves superimpose linearly We ll ignore
More informationTracking and Far-Field Speech Recognition for Multiple Simultaneous Speakers
Tracking and Far-Field Speech Recognition for Multiple Simultaneous Speakers Tobias Gehrig and John McDonough Institut für Theoretische Informatik Universität Karlsruhe Am Fasanengarten 5, 76131 Karlsruhe,
More informationEnergy Based Acoustic Source Localization
Energy Based Acoustic Source Localization Xiaohong Sheng and Yu-Hen Hu University of Wisconsin Madison, Department of Electrical and Computer Engineering Madison WI 5376, USA, sheng@ece.wisc.edu, hu@engr.wisc.edu,
More informationSpatial Smoothing and Broadband Beamforming. Bhaskar D Rao University of California, San Diego
Spatial Smoothing and Broadband Beamforming Bhaskar D Rao University of California, San Diego Email: brao@ucsd.edu Reference Books and Papers 1. Optimum Array Processing, H. L. Van Trees 2. Stoica, P.,
More informationESTIMATION OF RELATIVE TRANSFER FUNCTION IN THE PRESENCE OF STATIONARY NOISE BASED ON SEGMENTAL POWER SPECTRAL DENSITY MATRIX SUBTRACTION
ESTIMATION OF RELATIVE TRANSFER FUNCTION IN THE PRESENCE OF STATIONARY NOISE BASED ON SEGMENTAL POWER SPECTRAL DENSITY MATRIX SUBTRACTION Xiaofei Li 1, Laurent Girin 1,, Radu Horaud 1 1 INRIA Grenoble
More informationIMPROVED MULTI-MICROPHONE NOISE REDUCTION PRESERVING BINAURAL CUES
IMPROVED MULTI-MICROPHONE NOISE REDUCTION PRESERVING BINAURAL CUES Andreas I. Koutrouvelis Richard C. Hendriks Jesper Jensen Richard Heusdens Circuits and Systems (CAS) Group, Delft University of Technology,
More informationProbabilistic Structure from Sound and Probabilistic Sound Source Localization
Probabilistic Structure from Sound and Probabilistic Sound Source Localization Chi-Hao Lin and Chieh-Chih Wang Department of Computer Science and Information Engineering Graduate Institute of Networking
More informationA FINITE DIFFERENCE METHOD FOR THE EXCITATION OF A DIGITAL WAVEGUIDE STRING MODEL
A FINITE DIFFERENCE METHOD FOR THE EXCITATION OF A DIGITAL WAVEGUIDE STRING MODEL Leonardo Gabrielli 1, Luca Remaggi 1, Stefano Squartini 1 and Vesa Välimäki 2 1 Universitá Politecnica delle Marche, Ancona,
More informationSource counting in real-time sound source localization using a circular microphone array
Source counting in real-time sound source localization using a circular microphone array Despoina Pavlidi, Anthony Griffin, Matthieu Puigt, Athanasios Mouchtaris To cite this version: Despoina Pavlidi,
More informationEfficient Target Activity Detection Based on Recurrent Neural Networks
Efficient Target Activity Detection Based on Recurrent Neural Networks D. Gerber, S. Meier, and W. Kellermann Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU) Motivation Target φ tar oise 1 / 15
More informationTutorial on Blind Source Separation and Independent Component Analysis
Tutorial on Blind Source Separation and Independent Component Analysis Lucas Parra Adaptive Image & Signal Processing Group Sarnoff Corporation February 09, 2002 Linear Mixtures... problem statement...
More informationSingle Channel Signal Separation Using MAP-based Subspace Decomposition
Single Channel Signal Separation Using MAP-based Subspace Decomposition Gil-Jin Jang, Te-Won Lee, and Yung-Hwan Oh 1 Spoken Language Laboratory, Department of Computer Science, KAIST 373-1 Gusong-dong,
More information19th European Signal Processing Conference (EUSIPCO 2011) Barcelona, Spain, August 29 - September 2, 2011
19th European Signal Processing Conference (EUSIPCO 211) Barcelona, Spain, August 29 - September 2, 211 DEVELOPMENT OF PROTOTYPE SOUND DIRECTION CONTROL SYSTEM USING A TWO-DIMENSIONAL LOUDSPEAKER ARRAY
More informationINTRODUCTION Noise is present in many situations of daily life for ex: Microphones will record noise and speech. Goal: Reconstruct original signal Wie
WIENER FILTERING Presented by N.Srikanth(Y8104060), M.Manikanta PhaniKumar(Y8104031). INDIAN INSTITUTE OF TECHNOLOGY KANPUR Electrical Engineering dept. INTRODUCTION Noise is present in many situations
More informationNon-Negative Matrix Factorization And Its Application to Audio. Tuomas Virtanen Tampere University of Technology
Non-Negative Matrix Factorization And Its Application to Audio Tuomas Virtanen Tampere University of Technology tuomas.virtanen@tut.fi 2 Contents Introduction to audio signals Spectrogram representation
More informationLast time: small acoustics
Last time: small acoustics Voice, many instruments, modeled by tubes Traveling waves in both directions yield standing waves Standing waves correspond to resonances Variations from the idealization give
More informationA SPARSENESS CONTROLLED PROPORTIONATE ALGORITHM FOR ACOUSTIC ECHO CANCELLATION
A SPARSENESS CONTROLLED PROPORTIONATE ALGORITHM FOR ACOUSTIC ECHO CANCELLATION Pradeep Loganathan, Andy W.H. Khong, Patrick A. Naylor pradeep.loganathan@ic.ac.uk, andykhong@ntu.edu.sg, p.naylorg@ic.ac.uk
More informationEfficient Use Of Sparse Adaptive Filters
Efficient Use Of Sparse Adaptive Filters Andy W.H. Khong and Patrick A. Naylor Department of Electrical and Electronic Engineering, Imperial College ondon Email: {andy.khong, p.naylor}@imperial.ac.uk Abstract
More informationReduced-cost combination of adaptive filters for acoustic echo cancellation
Reduced-cost combination of adaptive filters for acoustic echo cancellation Luis A. Azpicueta-Ruiz and Jerónimo Arenas-García Dept. Signal Theory and Communications, Universidad Carlos III de Madrid Leganés,
More informationBinaural Beamforming Using Pre-Determined Relative Acoustic Transfer Functions
Binaural Beamforming Using Pre-Determined Relative Acoustic Transfer Functions Andreas I. Koutrouvelis, Richard C. Hendriks, Richard Heusdens, Jesper Jensen and Meng Guo e-mails: {a.koutrouvelis, r.c.hendriks,
More informationAPPLICATION OF MVDR BEAMFORMING TO SPHERICAL ARRAYS
AMBISONICS SYMPOSIUM 29 June 2-27, Graz APPLICATION OF MVDR BEAMFORMING TO SPHERICAL ARRAYS Anton Schlesinger 1, Marinus M. Boone 2 1 University of Technology Delft, The Netherlands (a.schlesinger@tudelft.nl)
More informationWhen Dictionary Learning Meets Classification
When Dictionary Learning Meets Classification Bufford, Teresa 1 Chen, Yuxin 2 Horning, Mitchell 3 Shee, Liberty 1 Mentor: Professor Yohann Tendero 1 UCLA 2 Dalhousie University 3 Harvey Mudd College August
More informationMultiple Sound Source Counting and Localization Based on Spatial Principal Eigenvector
INTERSPEECH 27 August 2 24, 27, Stockholm, Sweden Multiple Sound Source Counting and Localization Based on Spatial Principal Eigenvector Bing Yang, Hong Liu, Cheng Pang Key Laboratory of Machine Perception,
More informationJOINT DEREVERBERATION AND NOISE REDUCTION BASED ON ACOUSTIC MULTICHANNEL EQUALIZATION. Ina Kodrasi, Simon Doclo
JOINT DEREVERBERATION AND NOISE REDUCTION BASED ON ACOUSTIC MULTICHANNEL EQUALIZATION Ina Kodrasi, Simon Doclo University of Oldenburg, Department of Medical Physics and Acoustics, and Cluster of Excellence
More informationBenchmarking Functional Link Expansions for Audio Classification Tasks
25th Italian Workshop on Neural Networks (Vietri sul Mare) Benchmarking Functional Link Expansions for Audio Classification Tasks Scardapane S., Comminiello D., Scarpiniti M., Parisi R. and Uncini A. Overview
More informationACOUSTIC SOURCE LOCALIZATION BY FUSING DISTRIBUTED MICROPHONE ARRAYS MEASUREMENTS
6th European ignal Processing Conference (EUIPCO 8), Lausanne, witzerland, August 5-9, 8, copyright by EURAIP ACOUTIC OURCE LOCALIZATION BY FUING DITRIBUTED MICROPHONE ARRAY MEAUREMENT G. Prandi, G. Valenzise,
More informationQuadrature Prefilters for the Discrete Wavelet Transform. Bruce R. Johnson. James L. Kinsey. Abstract
Quadrature Prefilters for the Discrete Wavelet Transform Bruce R. Johnson James L. Kinsey Abstract Discrepancies between the Discrete Wavelet Transform and the coefficients of the Wavelet Series are known
More informationA Low-Distortion Noise Canceller and Its Learning Algorithm in Presence of Crosstalk
414 IEICE TRANS. FUNDAMENTALS, VOL.E84 A, NO.2 FEBRUARY 2001 PAPER Special Section on Noise Cancellation Reduction Techniques A Low-Distortion Noise Canceller Its Learning Algorithm in Presence of Crosstalk
More informationKernel-Based Formulations of Spatio-Spectral Transform and Three Related Transforms on the Sphere
Kernel-Based Formulations of Spatio-Spectral Transform and Three Related Transforms on the Sphere Rod Kennedy 1 rodney.kennedy@anu.edu.au 1 Australian National University Azores Antipode Tuesday 15 July
More informationReal-Time Spectrogram Inversion Using Phase Gradient Heap Integration
Real-Time Spectrogram Inversion Using Phase Gradient Heap Integration Zdeněk Průša 1 and Peter L. Søndergaard 2 1,2 Acoustics Research Institute, Austrian Academy of Sciences, Vienna, Austria 2 Oticon
More informationChapter 2 Fundamentals of Adaptive Filter Theory
Chapter 2 Fundamentals of Adaptive Filter Theory In this chapter we will treat some fundamentals of the adaptive filtering theory highlighting the system identification problem We will introduce a signal
More informationNONLINEAR ECHO CANCELLATION FOR HANDS-FREE SPEAKERPHONES. Bryan S. Nollett and Douglas L. Jones
NONLINEAR ECHO CANCELLATION FOR HANDS-FREE SPEAKERPHONES Bryan S. Nollett and Douglas L. Jones Coordinated Science Laboratory University of Illinois at Urbana-Champaign 1308 W. Main St. Urbana, IL 61801
More informationThe mathematician Ramanujan, and digital signal processing
The mathematician Ramanujan, and digital signal processing P. P. Vaidyanathan California Institute of Technology, Pasadena, CA SAM-2016, Rio, Brazil 1887-1920 Self-educated mathematician from India Grew
More informationEIGENFILTERS FOR SIGNAL CANCELLATION. Sunil Bharitkar and Chris Kyriakakis
EIGENFILTERS FOR SIGNAL CANCELLATION Sunil Bharitkar and Chris Kyriakakis Immersive Audio Laboratory University of Southern California Los Angeles. CA 9. USA Phone:+1-13-7- Fax:+1-13-7-51, Email:ckyriak@imsc.edu.edu,bharitka@sipi.usc.edu
More informationA Probability Model for Interaural Phase Difference
A Probability Model for Interaural Phase Difference Michael I. Mandel, Daniel P.W. Ellis Department of Electrical Engineering Columbia University, New York, New York {mim,dpwe}@ee.columbia.edu Abstract
More informationGAUSSIANIZATION METHOD FOR IDENTIFICATION OF MEMORYLESS NONLINEAR AUDIO SYSTEMS
GAUSSIANIATION METHOD FOR IDENTIFICATION OF MEMORYLESS NONLINEAR AUDIO SYSTEMS I. Marrakchi-Mezghani (1),G. Mahé (2), M. Jaïdane-Saïdane (1), S. Djaziri-Larbi (1), M. Turki-Hadj Alouane (1) (1) Unité Signaux
More informationSimultaneous Localization of a Mobile Robot and Multiple Sound Sources Using a Microphone Array
Advanced Robotics 25 (2011) 135 152 brill.nl/ar Full paper Simultaneous Localization of a Mobile Robot and Multiple Sound Sources Using a Microphone Array Jwu-Sheng Hu a, Chen-Yu Chan a, Cheng-Kang Wang
More informationSpeaker Tracking and Beamforming
Speaker Tracking and Beamforming Dr. John McDonough Spoken Language Systems Saarland University January 13, 2010 Introduction Many problems in science and engineering can be formulated in terms of estimating
More informationConvolutive Transfer Function Generalized Sidelobe Canceler
IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, VOL. XX, NO. Y, MONTH 8 Convolutive Transfer Function Generalized Sidelobe Canceler Ronen Talmon, Israel Cohen, Senior Member, IEEE, and Sharon
More informationConvex Combination of MIMO Filters for Multichannel Acoustic Echo Cancellation
Convex Combination of MIMO Filters for Multichannel Acoustic Echo Cancellation Danilo Comminiello, Simone Scardapane, Michele Scarpiniti, Raffaele Parisi, and Aurelio Uncini Department of Information Engineering,
More informationx 1 (t) Spectrogram t s
A METHOD OF ICA IN TIME-FREQUENCY DOMAIN Shiro Ikeda PRESTO, JST Hirosawa 2-, Wako, 35-98, Japan Shiro.Ikeda@brain.riken.go.jp Noboru Murata RIKEN BSI Hirosawa 2-, Wako, 35-98, Japan Noboru.Murata@brain.riken.go.jp
More informationEstimating Correlation Coefficient Between Two Complex Signals Without Phase Observation
Estimating Correlation Coefficient Between Two Complex Signals Without Phase Observation Shigeki Miyabe 1B, Notubaka Ono 2, and Shoji Makino 1 1 University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki
More informationHARDWARE IMPLEMENTATION OF FIR/IIR DIGITAL FILTERS USING INTEGRAL STOCHASTIC COMPUTATION. Arash Ardakani, François Leduc-Primeau and Warren J.
HARWARE IMPLEMENTATION OF FIR/IIR IGITAL FILTERS USING INTEGRAL STOCHASTIC COMPUTATION Arash Ardakani, François Leduc-Primeau and Warren J. Gross epartment of Electrical and Computer Engineering McGill
More informationEstimation of Reflections from Impulse Responses
Proceedings of the International Symposium on Room Acoustics, ISRA 2 29 3 August 2, Melbourne, Australia Estimation of Reflections from Impulse Responses PACS: 43.55 Sakari Tervo, Teemu Korhonen 2, and
More informationSpatial Diffuseness Features for DNN-Based Speech Recognition in Noisy and Reverberant Environments
Spatial Diffuseness Features for DNN-Based Speech Recognition in Noisy and Reverberant Environments Andreas Schwarz, Christian Huemmer, Roland Maas, Walter Kellermann Lehrstuhl für Multimediakommunikation
More informationDirection of Arrival Estimation: Subspace Methods. Bhaskar D Rao University of California, San Diego
Direction of Arrival Estimation: Subspace Methods Bhaskar D Rao University of California, San Diego Email: brao@ucsdedu Reference Books and Papers 1 Optimum Array Processing, H L Van Trees 2 Stoica, P,
More informationLab 9a. Linear Predictive Coding for Speech Processing
EE275Lab October 27, 2007 Lab 9a. Linear Predictive Coding for Speech Processing Pitch Period Impulse Train Generator Voiced/Unvoiced Speech Switch Vocal Tract Parameters Time-Varying Digital Filter H(z)
More informationBayesian Regularization and Nonnegative Deconvolution for Time Delay Estimation
University of Pennsylvania ScholarlyCommons Departmental Papers (ESE) Department of Electrical & Systems Engineering December 24 Bayesian Regularization and Nonnegative Deconvolution for Time Delay Estimation
More informationPerformance Analysis for Strong Interference Remove of Fast Moving Target in Linear Array Antenna
Performance Analysis for Strong Interference Remove of Fast Moving Target in Linear Array Antenna Kwan Hyeong Lee Dept. Electriacal Electronic & Communicaton, Daejin University, 1007 Ho Guk ro, Pochen,Gyeonggi,
More informationDIRECTION ESTIMATION BASED ON SOUND INTENSITY VECTORS. Sakari Tervo
7th European Signal Processing Conference (EUSIPCO 9) Glasgow, Scotland, August 4-8, 9 DIRECTION ESTIMATION BASED ON SOUND INTENSITY VECTORS Sakari Tervo Helsinki University of Technology Department of
More informationFREQUENCY-DOMAIN IMPLEMENTATION OF BLOCK ADAPTIVE FILTERS FOR ICA-BASED MULTICHANNEL BLIND DECONVOLUTION
FREQUENCY-DOMAIN IMPLEMENTATION OF BLOCK ADAPTIVE FILTERS FOR ICA-BASED MULTICHANNEL BLIND DECONVOLUTION Kyungmin Na, Sang-Chul Kang, Kyung Jin Lee, and Soo-Ik Chae School of Electrical Engineering Seoul
More informationLTI Systems, Additive Noise, and Order Estimation
LTI Systems, Additive oise, and Order Estimation Soosan Beheshti, Munther A. Dahleh Laboratory for Information and Decision Systems Department of Electrical Engineering and Computer Science Massachusetts
More informationProbabilistic 3D Sound Source Mapping using Moving Microphone Array. Sasaki et al., IROS 2016 Inkyu An
Probabilistic 3D Sound Source Mapping using Moving Microphone Array Sasaki et al., IROS 2016 Inkyu An Content 1. Background - What is the Sound Source Localization? 2. Motivation 3. Approach 4. Result
More informationBLIND DECONVOLUTION ALGORITHMS FOR MIMO-FIR SYSTEMS DRIVEN BY FOURTH-ORDER COLORED SIGNALS
BLIND DECONVOLUTION ALGORITHMS FOR MIMO-FIR SYSTEMS DRIVEN BY FOURTH-ORDER COLORED SIGNALS M. Kawamoto 1,2, Y. Inouye 1, A. Mansour 2, and R.-W. Liu 3 1. Department of Electronic and Control Systems Engineering,
More informationA New Interval-Based Method to Characterize Estimability
[Version: 2002/11/11 v1.00] A New Interval-Based Method to Characterize Estimability O. Reynet 1,, L. Jaulin 2 1 E 3 I 2 laboratory, ENSIETA 2 DTN laboratory, ENSIETA SUMMARY Estimability is a property
More informationInvestigation of passive acoustic anemometer by use of wind noise correlation
Investigation of passive acoustic anemometer by use of wind noise correlation Tetsuro MURAKAMI 1, Yoshinori YOKO, Yuya SATO 3, Hirofumi NAKAJIMA 4, Kazuhiro NAKADAI 5 1 4 Kogakuin University, Japan 5 Honda
More informationAn Observer for Phased Microphone Array Signal Processing with Nonlinear Output
2010 Asia-Pacific International Symposium on Aerospace Technology An Observer for Phased Microphone Array Signal Processing with Nonlinear Output Bai Long 1,*, Huang Xun 2 1 Department of Mechanics and
More informationDue dates are as mentioned above. Checkoff interviews for PS4 and PS5 will happen together between October 24 and 28, 2012.
Problem Set 5 Your answers will be graded by actual human beings (at least that's what we believe!), so don't limit your answers to machine-gradable responses. Some of the questions specifically ask for
More informationLoudspeaker Choice and Placement. D. G. Meyer School of Electrical & Computer Engineering
Loudspeaker Choice and Placement D. G. Meyer School of Electrical & Computer Engineering Outline Sound System Design Goals Review Acoustic Environment Outdoors Acoustic Environment Indoors Loudspeaker
More informationSound Engineering Test and Analysis
Sound Engineering Test and Analysis Product sound and sound quality are key aspects of product perception. How a product sounds plays a critical role in conveying the right message about its functionality,
More informationA NEW INFORMATION THEORETIC APPROACH TO ORDER ESTIMATION PROBLEM. Massachusetts Institute of Technology, Cambridge, MA 02139, U.S.A.
A EW IFORMATIO THEORETIC APPROACH TO ORDER ESTIMATIO PROBLEM Soosan Beheshti Munther A. Dahleh Massachusetts Institute of Technology, Cambridge, MA 0239, U.S.A. Abstract: We introduce a new method of model
More informationTIME DOMAIN ACOUSTIC CONTRAST CONTROL IMPLEMENTATION OF SOUND ZONES FOR LOW-FREQUENCY INPUT SIGNALS
TIME DOMAIN ACOUSTIC CONTRAST CONTROL IMPLEMENTATION OF SOUND ZONES FOR LOW-FREQUENCY INPUT SIGNALS Daan H. M. Schellekens 12, Martin B. Møller 13, and Martin Olsen 4 1 Bang & Olufsen A/S, Struer, Denmark
More informationAdaptive beamforming for uniform linear arrays with unknown mutual coupling. IEEE Antennas and Wireless Propagation Letters.
Title Adaptive beamforming for uniform linear arrays with unknown mutual coupling Author(s) Liao, B; Chan, SC Citation IEEE Antennas And Wireless Propagation Letters, 2012, v. 11, p. 464-467 Issued Date
More informationGeorge Mason University ECE 201: Introduction to Signal Analysis Spring 2017
Assigned: March 20, 2017 Due Date: Week of April 03, 2017 George Mason University ECE 201: Introduction to Signal Analysis Spring 2017 Laboratory Project #6 Due Date Your lab report must be submitted on
More informationSparse Phase Retrieval Using Partial Nested Fourier Samplers
Sparse Phase Retrieval Using Partial Nested Fourier Samplers Heng Qiao Piya Pal University of Maryland qiaoheng@umd.edu November 14, 2015 Heng Qiao, Piya Pal (UMD) Phase Retrieval with PNFS November 14,
More informationSinger Identification using MFCC and LPC and its comparison for ANN and Naïve Bayes Classifiers
Singer Identification using MFCC and LPC and its comparison for ANN and Naïve Bayes Classifiers Kumari Rambha Ranjan, Kartik Mahto, Dipti Kumari,S.S.Solanki Dept. of Electronics and Communication Birla
More informationAdaptive Channel Modeling for MIMO Wireless Communications
Adaptive Channel Modeling for MIMO Wireless Communications Chengjin Zhang Department of Electrical and Computer Engineering University of California, San Diego San Diego, CA 99- Email: zhangc@ucsdedu Robert
More informationAn Improved Cumulant Based Method for Independent Component Analysis
An Improved Cumulant Based Method for Independent Component Analysis Tobias Blaschke and Laurenz Wiskott Institute for Theoretical Biology Humboldt University Berlin Invalidenstraße 43 D - 0 5 Berlin Germany
More informationAn Adaptive Sensor Array Using an Affine Combination of Two Filters
An Adaptive Sensor Array Using an Affine Combination of Two Filters Tõnu Trump Tallinn University of Technology Department of Radio and Telecommunication Engineering Ehitajate tee 5, 19086 Tallinn Estonia
More informationLECTURE :ICA. Rita Osadchy. Based on Lecture Notes by A. Ng
LECURE :ICA Rita Osadchy Based on Lecture Notes by A. Ng Cocktail Party Person 1 2 s 1 Mike 2 s 3 Person 3 1 Mike 1 s 2 Person 2 3 Mike 3 microphone signals are mied speech signals 1 2 3 ( t) ( t) ( t)
More informationEfficient Semi-Blind Channel Estimation and Equalization Based on a Parametric Channel Representation
Efficient Semi-Blind Channel Estimation and Equalization Based on a Parametric Channel Representation Presenter: Kostas Berberidis University of Patras Computer Engineering & Informatics Department Signal
More informationResearch Letter An Algorithm to Generate Representations of System Identification Errors
Research Letters in Signal Processing Volume 008, Article ID 5991, 4 pages doi:10.1155/008/5991 Research Letter An Algorithm to Generate Representations of System Identification Errors Wancheng Zhang and
More informationA Flexible ICA-Based Method for AEC Without Requiring Double-Talk Detection
APSIPA ASC 2011 Xi an A Flexible ICA-Based Method for AEC Without Requiring Double-Talk Detection Marko Kanadi, Muhammad Tahir Akhtar, Wataru Mitsuhashi Department of Information and Communication Engineering,
More information