Outline. Why speech processing? Speech signal processing. Advanced Multimedia Signal Processing #5:Speech Signal Processing 2 -Processing-

Similar documents
Outline. Image processing includes. Edge detection. Advanced Multimedia Signal Processing #8:Image Processing 2 processing

Lecture 2: Discrete-Time Signals & Systems. Reza Mohammadkhani, Digital Signal Processing, 2015 University of Kurdistan eng.uok.ac.

ANALYSIS IN THE FREQUENCY DOMAIN

Communication Technologies

Learning Spherical Convolution for Fast Features from 360 Imagery

Numbering Systems Basic Building Blocks Scaling and Round-off Noise. Number Representation. Floating vs. Fixed point. DSP Design.

Radiation Physics Laboratory - Complementary Exercise Set MeBiom 2016/2017

Data Assimilation 1. Alan O Neill National Centre for Earth Observation UK

Impact of the Sampling Period on the Design of Digital PID Controllers

SER/BER in a Fading Channel

EXST Regression Techniques Page 1

Image Filtering: Noise Removal, Sharpening, Deblurring. Yao Wang Polytechnic University, Brooklyn, NY11201

A Novel Algorithm to Improve Resolution for Very Few Samples

Rotor Stationary Control Analysis Based on Coupling KdV Equation Finite Steady Analysis Liu Dalong1,a, Xu Lijuan2,a

Procdings of IC-IDC0 ( and (, ( ( and (, and (f ( and (, rspctivly. If two input signals ar compltly qual, phas spctra of two signals ar qual. That is

Hydrogen Atom and One Electron Ions

2F1120 Spektrala transformer för Media Solutions to Steiglitz, Chapter 1

Solution of Assignment #2

High Energy Physics. Lecture 5 The Passage of Particles through Matter

Radar Signal Demixing via Convex Optimization

Slide 1. Slide 2. Slide 3 DIGITAL SIGNAL PROCESSING CLASSIFICATION OF SIGNALS

2. Laser physics - basics

Recursive Estimation of Dynamic Time-Varying Demand Models

Time Delay Estimation by Bispectrum Interpolation

4.2 Design of Sections for Flexure

Linear Non-Gaussian Structural Equation Models

Sliding Mode Flow Rate Observer Design

Gamma-ray burst spectral evolution in the internal shock model

ARIMA Methods of Detecting Outliers in Time Series Periodic Processes

Robust surface-consistent residual statics and phase correction part 2

Addition of angular momentum

Problem Set #2 Due: Friday April 20, 2018 at 5 PM.

WINDOW OPTIMIZATION ISSUES IN RECURSIVE LEAST-SQUARES ADAPTIVE FILTERING AND TRACKING. Tayeb Sadiki,Mahdi Triki, Dirk T.M. Slock

22/ Breakdown of the Born-Oppenheimer approximation. Selection rules for rotational-vibrational transitions. P, R branches.

Random Access Techniques: ALOHA (cont.)

Least Favorable Distributions to Facilitate the Design of Detection Systems with Sensors at Deterministic Locations

Electron energy in crystal potential

Models for IC Buffers: A Top-down Approach Taking the nonlinear Thévenin-like topology beyond the proof of concept

Capturing. Fig. 1: Transform. transform. of two time. series. series of the. Fig. 2:

Problem Statement. Definitions, Equations and Helpful Hints BEAUTIFUL HOMEWORK 6 ENGR 323 PROBLEM 3-79 WOOLSEY

Mathematics. Complex Number rectangular form. Quadratic equation. Quadratic equation. Complex number Functions: sinusoids. Differentiation Integration

Sara Godoy del Olmo Calculation of contaminated soil volumes : Geostatistics applied to a hydrocarbons spill Lac Megantic Case

Chemical Physics II. More Stat. Thermo Kinetics Protein Folding...

RELATIONS BETWEEN GABOR TRANSFORMS AND FRACTIONAL FOURIER TRANSFORMS AND THEIR APPLICATIONS FOR SIGNAL PROCESSING

Title: Vibrational structure of electronic transition

ECE 344 Microwave Fundamentals

Davisson Germer experiment Announcements:

Zero Vectors Combinatorial Code Family for Spectral Amplitude Coding (SAC-OCDMA)

A Low-Cost and High Performance Solution to Frequency Estimation for GSM/EDGE

Collisions between electrons and ions

Sec 2.3 Modeling with First Order Equations

EE140 Introduction to Communication Systems Lecture 2

Appendix. Kalman Filter

0WAVE PROPAGATION IN MATERIAL SPACE

Inference Methods for Stochastic Volatility Models

Random Process Part 1

A STUDY ON THE NONLINEALITY OF RUNOFF PHENOMENA AND ESTIMATION OF EFFECTIVE RAINFALL

The general linear model for fmri

Propositional Logic. Combinatorial Problem Solving (CPS) Albert Oliveras Enric Rodríguez-Carbonell. May 17, 2018

Classical Magnetic Dipole

Middle East Technical University Department of Mechanical Engineering ME 413 Introduction to Finite Element Analysis

Engineering 323 Beautiful HW #13 Page 1 of 6 Brown Problem 5-12

ECE 650 1/8. Homework Set 4 - Solutions

Total Wave Function. e i. Wave function above sample is a plane wave: //incident beam

That is, we start with a general matrix: And end with a simpler matrix:

The pn junction: 2 Current vs Voltage (IV) characteristics

Search sequence databases 3 10/25/2016

Molecular Orbitals in Inorganic Chemistry

Determination of Vibrational and Electronic Parameters From an Electronic Spectrum of I 2 and a Birge-Sponer Plot

ECE Department Univ. of Maryland, College Park

VII. Quantum Entanglement

Sinusoidal Response Notes

Chapter 6: Polarization and Crystal Optics

PHASE-ONLY CORRELATION IN FINGERPRINT DATABASE REGISTRATION AND MATCHING

Supplementary Materials

day month year documentname/initials 1

Analysis of Key Factors Affecting Electronic Transformer Calibration System Error

MA 262, Spring 2018, Final exam Version 01 (Green)

Applied Statistics II - Categorical Data Analysis Data analysis using Genstat - Exercise 2 Logistic regression

Quaternion Fourier Transform for Colour Images

As the matrix of operator B is Hermitian so its eigenvalues must be real. It only remains to diagonalize the minor M 11 of matrix B.

Differential Equations

Probability and Stochastic Processes: A Friendly Introduction for Electrical and Computer Engineers Roy D. Yates and David J.

AS 5850 Finite Element Analysis

Deepak Rajput

ECE602 Exam 1 April 5, You must show ALL of your work for full credit.

Introduction to the Fourier transform. Computer Vision & Digital Image Processing. The Fourier transform (continued) The Fourier transform (continued)

Detection of Energetic Particles by a Network of HF Propagation Paths in Alaska

1 Isoparametric Concept

EEO 401 Digital Signal Processing Prof. Mark Fowler

ME 321 Kinematics and Dynamics of Machines S. Lambert Winter 2002

The graph of y = x (or y = ) consists of two branches, As x 0, y + ; as x 0, y +. x = 0 is the

Interactive spicpms data treatment using Nanocount

Phase Rotation for the 80 MHz ac Mixed Mode Packet

Estimation of the two-photon QED background in Belle II

Research and Analysis Laser Target Optics Characteristics and Signal Recognition Processing in Detection Screen System

Definition1: The ratio of the radiation intensity in a given direction from the antenna to the radiation intensity averaged over all directions.

Chapter 6. The Discrete Fourier Transform and The Fast Fourier Transform

Lecture 28 Title: Diatomic Molecule : Vibrational and Rotational spectra

Need to understand interaction of macroscopic measures

Transcription:

Outlin Advancd Multimdia Signal Procssing #5:Spch Signal Procssing -Procssing- Intllignt Elctronic Systms Group Dpt. of Elctronic Enginring, UEC Basis of Spch Procssing Nois Rmoval Spctral Subtraction Microphon Array DOA Bamformr Spch Sparation Frquncy Assignmnt Multimodal Spch Sparation Takayuki Nagai Why spch procssing? Spch signal procssing 3 Improv th spch rcognition rat Improv th spch quality spch communications (cll phon tc. nois rmoval drvrbration blind sourc sparation comprssion (coding How many microphons? singl channl charactristics harmonic structur, smoothnss dnoising, spch sparation, drvrbration visual information multi-channlmicrophon array spatial information sourc localization, dnoising, spch sparation, drvrbration Location of th sound sourc and snsors nar fild sphrical wav far fild plan wav Nois charactristics many mthods assum th nois is uncorrlatd with spch signals 4 whit nois E [ s( t n( t ] = 0

Th first stp in spch procssing Th first stp in SP (cont d fram dcomposition spch signals ar nonstationary, but can b stationary within a short priod of tim procssing fram basis ida of tim-frquncy rprsntation windowing window function is utilizd for th dcomposition fram lngth530ms shift normally, half of th fram lngth window function fram 5 x sg ( = x( w( 6 Th first stp in SP (cont d Spctral Subtraction (SS w 7 w window function Rctangular window Hamming window hamming Hanning window hanning ( n = ( n = 0.54 0.46 cos 0 w rc ( n = : 0 n L 0 : othrwis π n : 0 n L L : othrwis π n cos : 0 n L L : othrwis 0 8 wll-known dnoising mthod asy to implmnt and low computational load stimat th nois magnitud spctrum and subtract it from th input signal in ach fram! S.F.Boll 979 obsrvation x ( = s( w( additiv nois modl clan spchdsird nois th magnitud of nois spctrum W( is stimatd in any way Sˆ ( = ( X( W( (

Spctral Subtraction (cont d Spctral Subtraction (cont d simpl but works prtty good! vry fast (asy X ( = S( W( x ( = s( w( X ( = S( W( Th paramtr α controls th amount of nois subtractd from th noisy signal Sˆ ( = ( X( α W( ( ˆ = if X( α W( < 0 thn S( 0 can b X( W( < 0 9 0 Spctral Subtraction (cont d flow graph Spctral Subtraction (cont d maning of SS W ( ω sˆ ( x( Sˆ ( ω X ( ω α W ( ω X ( ω Sˆ( = ( X( α W( = X( ( α W( α W( = FFT[ x( ] X( = H( X( ( ( SS is nothing but filtring! tim varying (dpnds on input and nois spctrum

An xampl of SS Magnitud & Phas problm Combination of magnitud and phas info. from two diffrnt spch signals MATLAB sourc cod can b downloadd (MMSP wb sit 4.47 [db] 7.7dB 3 Aftr subtraction thr rmain paks in th nois spctrum. Th narrowr paks ar prcivd as tim varying tons which w rfr to as musical nois. 4 Multi-channl spch procssing two-channl spch mixing modl Multi-usr nvironmnt 5 Man-machin intrfac Human-Robot communication Multi-channl spch sparation Cllular phon 6 tim-frquncy domain Y Y H = H H H S S

thr xist many mthods Microphon array Microphon array (multi channl 7 spatial information Indpndnt Componnt Analysis:ICA (multi channl statistical information W-disjoint orthogonality (singl or multi channl tim-frquncy sparsity (mask Multi-channl SAFIA, DUET, Frquncy Assignmnt Singl-channl Factorial HMM 8 Dirction of Arrival (DOA important cus ITD(intraural tim diffrnc IID(intraural intnsity diffrnc HRTF(Had-Rlatd Transfr Functio algorithms Dlay and sum Phas Transform (PHAT Crosspowr Spctrum Phas (CSP High-rsolution mthod Minimum varianc mthod MUSIC (MUltipl SIgnal Classificatio Bam forming Dlay and sum Adaptiv microphon array CSP mthod Dlay and sum bamformr Tim dlay stimation using crosspowr spctrum phas 9 cross corrlation τ x ( x ( t t θ d CSP p=argmax(csp τ = p T θ θ = sin CSP = IFFT cτ d * X ( X( X( X( X( = S( idal cas X( = S( jωk jωm 0 x ( x ( t x 3( t x M (t t D D D3 y(t DM x ( t x t x t x t D D ( D 3( D3 M ( M Di = ( Li Lmi / c y( t = M i= x ( t i D i

Adaptiv array Adaptiv array (cont'd targt sourc τ x ( x ( t t if w know th DOA of th nois, "Null" can b strd to that dirction! θ d - y (t θ D x ( t D compnsation (filtr yˆ ( t nois H compnsation (filtr D compnsation (filtr H adaptiv controllr H H H3 HM adaptiv controllr - D D D3 H H H3 y(t - - - DM griffith-jim array minimiz th powr of output Charactristics of spch signals Exampls Assumption (W-d orthogonality signals from diffrnt sourcs do not shar th sam frquncy at th sam tim mask channl or channl? channl or channl? Mixd input spch mask 3 frquncy 4

Frquncy Assignmnt Frquncy Assignmnt (cont d Input signal wight = Output signal Input signal Frquncy Assignmnt wight = 0 Output signal wight = wight = 0 5 6 Frquncy Assignmnt (cont d Frquncy Assignmnt (cont d τ i : tim dlay of arrival from sourc to th microphon i Transfr function H = phas diffrnc τ : s s tim dlay ij ij ωτ θ θ ωτ θ θ jθ ω s tim dlay τ : s τ ω should b an idal phas diff. at ω ij Z = Y Y Z whr = α( l, ( = Y Y = α( l, ( α( l, = H β = H S S jωτ β ( jω ( τ τ jωτ β ( jω ( τ τ 7 8

Word rcognition rsult multimodal(mdia information word rcognition rat [%] original mixd spch procssd spch 9 location of th othr spakr 30 Intgration (spakr modl FA Visual information probabilistic intgration log Pr( ω S = log Pr( ω S y, y, λ i y, λ i, λ, λ i i log Pr( y y, ω S spakr modl frquncy assign mnt Us of visual infromation fac dtction rcognition tracking (color-basd τ i Estimation of (spakr localizatio CSP mthod (half sampl accuracy find multipl paks according to th fac locations Phas rror i.i.d. Gaussian PDF log Pr( y y, ω S = ( θ ( ωτ Σ 3 FA const 3

Multimodal spch Exprimntal sttings locations 33 34 Rsults Rsults Exampl 35 36