Audio Features. Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform

Size: px
Start display at page:

Download "Audio Features. Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform. Short Time Fourier Transform"

Transcription

1 Advanced Course Computer Science Music Processing Summer Term 2009 Meinard Müller Saarland University and MPI Informatik Audio Features Fourier Transform Tells which notes (frequencies) are played, but does not tell when the notes are played Frequency information is averaged over the entire time interval Time information is hidden in the phase Windowed Fourier Transform (WFT) (STFT) (Dennis Gabor, 1946) 2 Idea: To recover time information, only a small section of the signal is used for the spectral analysis This section is determined by a window function (, ) Definition: STFT w.r.t. g of a signal Interpretation: represents a musical note of frequency which oscillates within the translated window given by Inner product measures the correlation between the signal f and the musical note. with 3 4 Box window: discontinuities at window boundaries cause artefacts in the frequency domain Triangle window 5 6

2 Hann window Chirp signal and STFT with box window of length Time-Frequency Localization Chirp signal and STFT with hann window of length 0.05 Size of window constitutes a compromise between time resolution and frequency resolution: Large window : poor time resolution good frequency resolution Small window : good time resolution poor frequency resolution Heisenberg Uncertainty Principle: there is no window function that localizes in time and frequency with arbitrary position Signal and STFT with hann window of length 0.02 Signal and STFT with hann window of length

3 Heisenberg Uncertainty Principle Information Cells Window function with Center Width IC( ) IC( ) MATLAB MATLAB function SPECTROGRAM N = window length (in samples) M = overlap (usually ) Compute DFT N for every windowed section Keep lower Fourier coefficients Example Let x be a DT Signal Sampling rate: Window length: Overlap: Hopsize: Let Sequence of spectral vectors (for each window a vector of dimension ) 15 corresponds to window 16 Example Time resolution: Frequency resolution: Model assumption: Equal tempered scale MIDI pitches: Piano notes: Concert pitch: Center frequency: Logarithmic frequency distribution Octave: doubling of frequency 17 18

4 Idea: Binning of Fourier coefficients Divide up the fequency axis into logarithmically spaced pitch regions and combine spectral coefficients of each region to a single pitch coefficient. Time-frequency representation Windowing in the time domain Windowing in the frequency domain Note MIDI pitch Center [Hz] frequency Left [Hz] boundary Right [Hz] boundary A A# B C C# D D# E F F# G G# A Width [Hz] 21 Details: Let be a spectral vector obtained from a spectrogram w.r.t. a sampling rate and a window length N. The spectral coefficient corresponds to the frequency Let be the set of coefficients assigned to a pitch Then the pitch coefficient is defined as 22 Example: A4, p = 69 Center frequency: Lower bound: Upper bound: STFT with, Example: A4, p = 69 Center frequency: Lower bound: Upper bound: STFT with, S(p = 69) 23 24

5 Note: For some pitches, S(p) may be empty. This particularly holds for low notes corresponding to narrow frequency bands. Audio Representation Example: Op. 100, No. 2 by Friedrich Burgmüller Linear frequency sampling is problematic! Solution: Multi-resolution spectrograms or multirate filterbanks Example: Op. 100, No. 2 by Friedrich Burgmüller Example: Op. 100, No. 2 by Friedrich Burgmüller Frequency in Hz Intensity MIDI pitch Intensity (db) E4 C4 A3 Time in seconds 27 Time in samples 28 Example: Chromatic Scale Example: Chromatic Scale Frequency in Hz Intensity MIDI pitch Intensity (db) Time in seconds 29 Time in samples 30

6 Human perception of pitch is periodic in the sense that two pitches are perceived as similar in color if they differ by an octave. Seperate pitch into two components: tone height (octave number) and chroma. Chroma : 12 traditional pitch classes of the equaltempered scale. For example Chroma C Computation: pitch features chroma features Add up all pitches belonging to the same class Result: 12-dimensional chroma vector. Chromatic circle Shepard s helix of pitch perception Bartsch/Wakefield, IEEE Trans. Multimedia, Sequence of chroma vectors correlates to the harmonic progression Normalization makes features invariant to changes in dynamics Example: C-Major Scale Further quantization and smoothing: CENS features Taking logarithm before adding up pitch coefficients accounts for logarithmic sensation of intensity Example: Burgmüller Op. 100, No. 2 Normalization Chroma Intensity (db) Chroma Intensity Time in samples 35 Time in samples 36

7 Example: Bach Toccata Example: Bach Toccata Koopman Ruebsam Koopman Ruebsam 37 Feature resolution: 10 Hz 38 Example: Bach Toccata Example: Bach Toccata Koopman Ruebsam Koopman Ruebsam Feature resolution: 1 Hz 39 Feature resolution: 0.33 Hz 40 WAV Chroma CENS (10 Hz) (1 Hz) WAV Chroma CENS (10 Hz) (1 Hz) Beethoven s Fifth (Bernstein) 41 42

8 WAV Chroma CENS (10 Hz) (1 Hz) WAV Chroma CENS (10 Hz) (1 Hz) Beethoven s Fifth (Bernstein) Beethoven s Fifth (Bernstein) Beethoven s Fifth (Piano/Sherbakov) Beethoven s Fifth (Piano/Sherbakov) Brahms Hungarian Dance No Example: Zager & Evans In The Year 2525 Example: Zager & Evans In The Year 2525 How to deal with transpositions? 45 Original: 46 Example: Zager & Evans In The Year 2525 Original: Shifted: 47

Audio Features. Fourier Transform. Fourier Transform. Fourier Transform. Short Time Fourier Transform. Fourier Transform.

Audio Features. Fourier Transform. Fourier Transform. Fourier Transform. Short Time Fourier Transform. Fourier Transform. Advanced Course Computer Science Music Processing Summer Term 2010 Fourier Transform Meinard Müller Saarland University and MPI Informatik meinard@mpi-inf.mpg.de Audio Features Fourier Transform Fourier

More information

Introduction Basic Audio Feature Extraction

Introduction Basic Audio Feature Extraction Introduction Basic Audio Feature Extraction Vincent Koops (with slides by Meinhard Müller) Sound and Music Technology, December 6th, 2016 1 28 November 2017 Today g Main modules A. Sound and music for

More information

Short-Time Fourier Transform and Chroma Features

Short-Time Fourier Transform and Chroma Features Friedrich-Alexander-Universität Erlangen-Nürnberg Lab Course Short-Time Fourier Transform and Chroma Features International Audio Laboratories Erlangen Prof. Dr. Meinard Müller Friedrich-Alexander Universität

More information

Short-Time Fourier Transform and Chroma Features

Short-Time Fourier Transform and Chroma Features Friedrich-Alexander-Universität Erlangen-Nürnberg Lab Course Short-Time Fourier Transform and Chroma Features International Audio Laboratories Erlangen Prof. Dr. Meinard Müller Friedrich-Alexander Universität

More information

University of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015

University of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015 University of Colorado at Boulder ECEN 4/5532 Lab 2 Lab report due on February 16, 2015 This is a MATLAB only lab, and therefore each student needs to turn in her/his own lab report and own programs. 1

More information

Topic 7. Convolution, Filters, Correlation, Representation. Bryan Pardo, 2008, Northwestern University EECS 352: Machine Perception of Music and Audio

Topic 7. Convolution, Filters, Correlation, Representation. Bryan Pardo, 2008, Northwestern University EECS 352: Machine Perception of Music and Audio Topic 7 Convolution, Filters, Correlation, Representation Short time Fourier Transform Break signal into windows Calculate DFT of each window The Spectrogram spectrogram(y,1024,512,1024,fs,'yaxis'); A

More information

Fourier Analysis of Signals

Fourier Analysis of Signals Chapter 2 Fourier Analysis of Signals As we have seen in the last chapter, music signals are generally complex sound mixtures that consist of a multitude of different sound components. Because of this

More information

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION 13 th International Conference on Digital Audio Effects Romain Hennequin, Roland Badeau and Bertrand David Telecom

More information

Time-Frequency Analysis

Time-Frequency Analysis Time-Frequency Analysis Basics of Fourier Series Philippe B. aval KSU Fall 015 Philippe B. aval (KSU) Fourier Series Fall 015 1 / 0 Introduction We first review how to derive the Fourier series of a function.

More information

Fourier Analysis of Signals

Fourier Analysis of Signals Chapter 2 Fourier Analysis of Signals As we have seen in the last chapter, music signals are generally complex sound mixtures that consist of a multitude of different sound components. Because of this

More information

y = log b Exponential and Logarithmic Functions LESSON THREE - Logarithmic Functions Lesson Notes Example 1 Graphing Logarithms

y = log b Exponential and Logarithmic Functions LESSON THREE - Logarithmic Functions Lesson Notes Example 1  Graphing Logarithms y = log b Eponential and Logarithmic Functions LESSON THREE - Logarithmic Functions Eample 1 Logarithmic Functions Graphing Logarithms a) Draw the graph of f() = 2 b) Draw the inverse of f(). c) Show algebraically

More information

EE123 Digital Signal Processing

EE123 Digital Signal Processing EE123 Digital Signal Processing Lecture 1 Time-Dependent FT Announcements! Midterm: 2/22/216 Open everything... but cheat sheet recommended instead 1am-12pm How s the lab going? Frequency Analysis with

More information

Scattering.m Documentation

Scattering.m Documentation Scattering.m Documentation Release 0.3 Vincent Lostanlen Nov 04, 2018 Contents 1 Introduction 3 2 Filter bank specifications 5 3 Wavelets 7 Bibliography 9 i ii Scattering.m Documentation, Release 0.3

More information

Elec4621 Advanced Digital Signal Processing Chapter 11: Time-Frequency Analysis

Elec4621 Advanced Digital Signal Processing Chapter 11: Time-Frequency Analysis Elec461 Advanced Digital Signal Processing Chapter 11: Time-Frequency Analysis Dr. D. S. Taubman May 3, 011 In this last chapter of your notes, we are interested in the problem of nding the instantaneous

More information

Musical Modulation by Symmetries. By Rob Burnham

Musical Modulation by Symmetries. By Rob Burnham Musical Modulation by Symmetries By Rob Burnham Intro Traditionally intervals have been conceived as ratios of frequencies. This comes from Pythagoras and his question about why consonance intervals like

More information

AN EFFICIENT MULTI-RESOLUTION SPECTRAL TRANSFORM FOR MUSIC ANALYSIS

AN EFFICIENT MULTI-RESOLUTION SPECTRAL TRANSFORM FOR MUSIC ANALYSIS 10th International Society for Music Information Retrieval Conference (ISMIR 2009) AN EFFICIENT MULTI-RESOLUTION SPECTRAL TRANSFORM FOR MUSIC ANALYSIS Pablo Cancela Martín Rocamora Ernesto López Universidad

More information

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION

TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION TIME-DEPENDENT PARAMETRIC AND HARMONIC TEMPLATES IN NON-NEGATIVE MATRIX FACTORIZATION Romain Hennequin, Roland Badeau and Bertrand David, Institut Telecom; Telecom ParisTech; CNRS LTCI Paris France romainhennequin@telecom-paristechfr

More information

Frequency Domain Speech Analysis

Frequency Domain Speech Analysis Frequency Domain Speech Analysis Short Time Fourier Analysis Cepstral Analysis Windowed (short time) Fourier Transform Spectrogram of speech signals Filter bank implementation* (Real) cepstrum and complex

More information

L6: Short-time Fourier analysis and synthesis

L6: Short-time Fourier analysis and synthesis L6: Short-time Fourier analysis and synthesis Overview Analysis: Fourier-transform view Analysis: filtering view Synthesis: filter bank summation (FBS) method Synthesis: overlap-add (OLA) method STFT magnitude

More information

Response-Field Dynamics in the Auditory Pathway

Response-Field Dynamics in the Auditory Pathway Response-Field Dynamics in the Auditory Pathway Didier Depireux Powen Ru Shihab Shamma Jonathan Simon Work supported by grants from the Office of Naval Research, a training grant from the National Institute

More information

Lecture Notes 5: Multiresolution Analysis

Lecture Notes 5: Multiresolution Analysis Optimization-based data analysis Fall 2017 Lecture Notes 5: Multiresolution Analysis 1 Frames A frame is a generalization of an orthonormal basis. The inner products between the vectors in a frame and

More information

Introduction to time-frequency analysis. From linear to energy-based representations

Introduction to time-frequency analysis. From linear to energy-based representations Introduction to time-frequency analysis. From linear to energy-based representations Rosario Ceravolo Politecnico di Torino Dep. Structural Engineering UNIVERSITA DI TRENTO Course on «Identification and

More information

Lecture 7: Pitch and Chord (2) HMM, pitch detection functions. Li Su 2016/03/31

Lecture 7: Pitch and Chord (2) HMM, pitch detection functions. Li Su 2016/03/31 Lecture 7: Pitch and Chord (2) HMM, pitch detection functions Li Su 2016/03/31 Chord progressions Chord progressions are not arbitrary Example 1: I-IV-I-V-I (C-F-C-G-C) Example 2: I-V-VI-III-IV-I-II-V

More information

SNR Calculation and Spectral Estimation [S&T Appendix A]

SNR Calculation and Spectral Estimation [S&T Appendix A] SR Calculation and Spectral Estimation [S&T Appendix A] or, How not to make a mess of an FFT Make sure the input is located in an FFT bin 1 Window the data! A Hann window works well. Compute the FFT 3

More information

Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation

Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Mikkel N. Schmidt and Morten Mørup Technical University of Denmark Informatics and Mathematical Modelling Richard

More information

BME 50500: Image and Signal Processing in Biomedicine. Lecture 5: Correlation and Power-Spectrum CCNY

BME 50500: Image and Signal Processing in Biomedicine. Lecture 5: Correlation and Power-Spectrum CCNY 1 BME 50500: Image and Signal Processing in Biomedicine Lecture 5: Correlation and Power-Spectrum Lucas C. Parra Biomedical Engineering Department CCNY http://bme.ccny.cuny.edu/faculty/parra/teaching/signal-and-image/

More information

TIME-FREQUENCY ANALYSIS: TUTORIAL. Werner Kozek & Götz Pfander

TIME-FREQUENCY ANALYSIS: TUTORIAL. Werner Kozek & Götz Pfander TIME-FREQUENCY ANALYSIS: TUTORIAL Werner Kozek & Götz Pfander Overview TF-Analysis: Spectral Visualization of nonstationary signals (speech, audio,...) Spectrogram (time-varying spectrum estimation) TF-methods

More information

The mathematician Ramanujan, and digital signal processing

The mathematician Ramanujan, and digital signal processing The mathematician Ramanujan, and digital signal processing P. P. Vaidyanathan California Institute of Technology, Pasadena, CA SAM-2016, Rio, Brazil 1887-1920 Self-educated mathematician from India Grew

More information

CONSTRUCTING AN INVERTIBLE CONSTANT-Q TRANSFORM WITH NONSTATIONARY GABOR FRAMES

CONSTRUCTING AN INVERTIBLE CONSTANT-Q TRANSFORM WITH NONSTATIONARY GABOR FRAMES CONSTRUCTING AN INVERTIBLE CONSTANT-Q TRANSFORM WITH NONSTATIONARY GABOR FRAMES MONIKA DÖRFLER, NICKI HOLIGHAUS, THOMAS GRILL, AND GINO ANGELO VELASCO Abstract. An efficient and perfectly invertible signal

More information

Multiresolution Analysis

Multiresolution Analysis Multiresolution Analysis DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/obda_fall17/index.html Carlos Fernandez-Granda Frames Short-time Fourier transform

More information

1. Calculation of the DFT

1. Calculation of the DFT ELE E4810: Digital Signal Processing Topic 10: The Fast Fourier Transform 1. Calculation of the DFT. The Fast Fourier Transform algorithm 3. Short-Time Fourier Transform 1 1. Calculation of the DFT! Filter

More information

Ch. 15 Wavelet-Based Compression

Ch. 15 Wavelet-Based Compression Ch. 15 Wavelet-Based Compression 1 Origins and Applications The Wavelet Transform (WT) is a signal processing tool that is replacing the Fourier Transform (FT) in many (but not all!) applications. WT theory

More information

Modeling Measurement Uncertainty in Room Acoustics P. Dietrich

Modeling Measurement Uncertainty in Room Acoustics P. Dietrich Modeling Measurement Uncertainty in Room Acoustics P. Dietrich This paper investigates a way of determining and modeling uncertainty contributions in measurements of room acoustic parameters, which are

More information

E : Lecture 1 Introduction

E : Lecture 1 Introduction E85.2607: Lecture 1 Introduction 1 Administrivia 2 DSP review 3 Fun with Matlab E85.2607: Lecture 1 Introduction 2010-01-21 1 / 24 Course overview Advanced Digital Signal Theory Design, analysis, and implementation

More information

Continuous Fourier transform of a Gaussian Function

Continuous Fourier transform of a Gaussian Function Continuous Fourier transform of a Gaussian Function Gaussian function: e t2 /(2σ 2 ) The CFT of a Gaussian function is also a Gaussian function (i.e., time domain is Gaussian, then the frequency domain

More information

Basics about Fourier analysis

Basics about Fourier analysis Jérôme Gilles UCLA PART ONE Fourier analysis On the menu... Introduction - some history... Notations. Fourier series. Continuous Fourier transform. Discrete Fourier transform. Properties. 2D extension.

More information

BASIC COMPRESSION TECHNIQUES

BASIC COMPRESSION TECHNIQUES BASIC COMPRESSION TECHNIQUES N. C. State University CSC557 Multimedia Computing and Networking Fall 2001 Lectures # 05 Questions / Problems / Announcements? 2 Matlab demo of DFT Low-pass windowed-sinc

More information

Introduction to Biomedical Engineering

Introduction to Biomedical Engineering Introduction to Biomedical Engineering Biosignal processing Kung-Bin Sung 6/11/2007 1 Outline Chapter 10: Biosignal processing Characteristics of biosignals Frequency domain representation and analysis

More information

HHT: the theory, implementation and application. Yetmen Wang AnCAD, Inc. 2008/5/24

HHT: the theory, implementation and application. Yetmen Wang AnCAD, Inc. 2008/5/24 HHT: the theory, implementation and application Yetmen Wang AnCAD, Inc. 2008/5/24 What is frequency? Frequency definition Fourier glass Instantaneous frequency Signal composition: trend, periodical, stochastic,

More information

Introduction to Time-Frequency Distributions

Introduction to Time-Frequency Distributions Introduction to Time-Frequency Distributions Selin Aviyente Department of Electrical and Computer Engineering Michigan State University January 19, 2010 Motivation for time-frequency analysis When you

More information

! Spectral Analysis with DFT. ! Windowing. ! Effect of zero-padding. ! Time-dependent Fourier transform. " Aka short-time Fourier transform

! Spectral Analysis with DFT. ! Windowing. ! Effect of zero-padding. ! Time-dependent Fourier transform.  Aka short-time Fourier transform Lecture Outline ESE 531: Digital Signal Processing Spectral Analysis with DFT Windowing Lec 24: April 18, 2019 Spectral Analysis Effect of zero-padding Time-dependent Fourier transform " Aka short-time

More information

Time-Frequency Analysis of Radar Signals

Time-Frequency Analysis of Radar Signals G. Boultadakis, K. Skrapas and P. Frangos Division of Information Transmission Systems and Materials Technology School of Electrical and Computer Engineering National Technical University of Athens 9 Iroon

More information

Digital Speech Processing Lecture 10. Short-Time Fourier Analysis Methods - Filter Bank Design

Digital Speech Processing Lecture 10. Short-Time Fourier Analysis Methods - Filter Bank Design Digital Speech Processing Lecture Short-Time Fourier Analysis Methods - Filter Bank Design Review of STFT j j ˆ m ˆ. X e x[ mw ] [ nˆ m] e nˆ function of nˆ looks like a time sequence function of ˆ looks

More information

Wavelet Transform. Figure 1: Non stationary signal f(t) = sin(100 t 2 ).

Wavelet Transform. Figure 1: Non stationary signal f(t) = sin(100 t 2 ). Wavelet Transform Andreas Wichert Department of Informatics INESC-ID / IST - University of Lisboa Portugal andreas.wichert@tecnico.ulisboa.pt September 3, 0 Short Term Fourier Transform Signals whose frequency

More information

Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation

Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation CENTER FOR COMPUTER RESEARCH IN MUSIC AND ACOUSTICS DEPARTMENT OF MUSIC, STANFORD UNIVERSITY REPORT NO. STAN-M-4 Design Criteria for the Quadratically Interpolated FFT Method (I): Bias due to Interpolation

More information

Chapters 11 and 12. Sound and Standing Waves

Chapters 11 and 12. Sound and Standing Waves Chapters 11 and 12 Sound and Standing Waves The Nature of Sound Waves LONGITUDINAL SOUND WAVES Speaker making sound waves in a tube The Nature of Sound Waves The distance between adjacent condensations

More information

Sound Recognition in Mixtures

Sound Recognition in Mixtures Sound Recognition in Mixtures Juhan Nam, Gautham J. Mysore 2, and Paris Smaragdis 2,3 Center for Computer Research in Music and Acoustics, Stanford University, 2 Advanced Technology Labs, Adobe Systems

More information

ALGEBRA PROBLEMS FOR MAA MINI-COURSE ON GEOMETRY AND ALGEBRA IN MUSIC THEORY JOINT MATHEMATICS MEETING IN NEW ORLEANS, JANUARY 9, 2011

ALGEBRA PROBLEMS FOR MAA MINI-COURSE ON GEOMETRY AND ALGEBRA IN MUSIC THEORY JOINT MATHEMATICS MEETING IN NEW ORLEANS, JANUARY 9, 2011 ALGEBRA PROBLEMS FOR MAA MINI-COURSE ON GEOMETRY AND ALGEBRA IN MUSIC THEORY JOINT MATHEMATICS MEETING IN NEW ORLEANS, JANUARY 9, 2011 THOMAS M. FIORE 1. Pitches and Pitch Classes (1.1) (Pitch Classes

More information

Chirp Transform for FFT

Chirp Transform for FFT Chirp Transform for FFT Since the FFT is an implementation of the DFT, it provides a frequency resolution of 2π/N, where N is the length of the input sequence. If this resolution is not sufficient in a

More information

Automatic Speech Recognition (CS753)

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 12: Acoustic Feature Extraction for ASR Instructor: Preethi Jyothi Feb 13, 2017 Speech Signal Analysis Generate discrete samples A frame Need to focus on short

More information

OLA and FBS Duality Review

OLA and FBS Duality Review MUS421/EE367B Lecture 10A Review of OverLap-Add (OLA) and Filter-Bank Summation (FBS) Interpretations of Short-Time Fourier Analysis, Modification, and Resynthesis Julius O. Smith III (jos@ccrma.stanford.edu)

More information

Lecture 22: Reconstruction and Admissibility

Lecture 22: Reconstruction and Admissibility WAVELETS AND MULTIRATE DIGITAL SIGNAL PROCESSING Lecture 22: Reconstruction and Admissibility Prof.V.M.Gadre, EE, IIT Bombay Tutorials Q 1. Construct the STFT, CWT of the signal x(t) using Matlab and discuss

More information

Digital Image Processing Lectures 13 & 14

Digital Image Processing Lectures 13 & 14 Lectures 13 & 14, Professor Department of Electrical and Computer Engineering Colorado State University Spring 2013 Properties of KL Transform The KL transform has many desirable properties which makes

More information

Convolutive Non-Negative Matrix Factorization for CQT Transform using Itakura-Saito Divergence

Convolutive Non-Negative Matrix Factorization for CQT Transform using Itakura-Saito Divergence Convolutive Non-Negative Matrix Factorization for CQT Transform using Itakura-Saito Divergence Fabio Louvatti do Carmo; Evandro Ottoni Teatini Salles Abstract This paper proposes a modification of the

More information

MAT120 Supplementary Lecture Notes

MAT120 Supplementary Lecture Notes MAT0 Supplementary Lecture Notes Overview of Numbers Integers: {..., 3,,, 0,,, 3,...} Rational Numbers:, 3, etc., quotients of integers a b, b 0; finite or repeating decimal expansions. Irrational Numbers:,,

More information

AUDIO ANALYSIS, VISUALIZATION, AND TRANSFORMATION WITH THE MATCHING PURSUIT ALGORITHM. Garry Kling and Curtis Roads

AUDIO ANALYSIS, VISUALIZATION, AND TRANSFORMATION WITH THE MATCHING PURSUIT ALGORITHM. Garry Kling and Curtis Roads 04 DAFx AUDIO ANALYSIS, VISUALIZATION, AND TRANSFORMATION WITH THE MATCHING PURSUIT ALGORITHM Garry Kling and Curtis Roads Media Arts and Technology 3431 South Hall University of California, Santa Barbara,

More information

Wavelets in Pattern Recognition

Wavelets in Pattern Recognition Wavelets in Pattern Recognition Lecture Notes in Pattern Recognition by W.Dzwinel Uncertainty principle 1 Uncertainty principle Tiling 2 Windowed FT vs. WT Idea of mother wavelet 3 Scale and resolution

More information

Time-frequency frames and applications - Lecture 1: Theory and Concepts

Time-frequency frames and applications - Lecture 1: Theory and Concepts Time-frequency frames and applications - Lecture 1: Theory and Concepts N. Holighaus Acoustics Research Institute, Austrian Academy of Sciences 21 October 2014 N. Holighaus Time-frequency - Lecture I 21

More information

arxiv: v1 [physics.pop-ph] 10 Aug 2015

arxiv: v1 [physics.pop-ph] 10 Aug 2015 Revising the Musical Equal Temperament Haye Hinrichsen arxiv:508.02292v [physics.pop-ph] 0 Aug 205 Universität Würzburg, Fakultät für Physik und Astronomie Campus Süd, Am Hubland, 97074 Würzburg, Germany

More information

6.003 Signal Processing

6.003 Signal Processing 6.003 Signal Processing Week 6, Lecture A: The Discrete Fourier Transform (DFT) Adam Hartz hz@mit.edu What is 6.003? What is a signal? Abstractly, a signal is a function that conveys information Signal

More information

10ème Congrès Français d Acoustique

10ème Congrès Français d Acoustique 1ème Congrès Français d Acoustique Lyon, 1-16 Avril 1 Spectral similarity measure invariant to pitch shifting and amplitude scaling Romain Hennequin 1, Roland Badeau 1, Bertrand David 1 1 Institut TELECOM,

More information

Order Tracking Analysis

Order Tracking Analysis 1. Introduction Order Tracking Analysis Jaafar Alsalaet College of Engineering-University of Basrah Mostly, dynamic forces excited in a machine are related to the rotation speed; hence, it is often preferred

More information

On crystal growth in harmonic space James Tenney ( )

On crystal growth in harmonic space James Tenney ( ) On crystal growth in harmonic space James Tenney (1993-98) It seems clear, intuitively, that a concern for harmonic coherence would lead to the use of relatively compact, connected sets of points in harmonic

More information

Discrete Fourier transform (DFT)

Discrete Fourier transform (DFT) Discrete Fourier transform (DFT) Alejandro Ribeiro January 19, 2018 Let x : [0, N 1] C be a discrete signal of duration N and having elements x(n) for n [0, N 1]. The discrete Fourier transform (DFT) of

More information

Gaussian Processes for Audio Feature Extraction

Gaussian Processes for Audio Feature Extraction Gaussian Processes for Audio Feature Extraction Dr. Richard E. Turner (ret26@cam.ac.uk) Computational and Biological Learning Lab Department of Engineering University of Cambridge Machine hearing pipeline

More information

Timbre Similarity. Perception and Computation. Prof. Michael Casey. Dartmouth College. Thursday 7th February, 2008

Timbre Similarity. Perception and Computation. Prof. Michael Casey. Dartmouth College. Thursday 7th February, 2008 Timbre Similarity Perception and Computation Prof. Michael Casey Dartmouth College Thursday 7th February, 2008 Prof. Michael Casey (Dartmouth College) Timbre Similarity Thursday 7th February, 2008 1 /

More information

MIPS A Formal Language for the Mathematical Investigation of Pitch Systems. David Meredith

MIPS A Formal Language for the Mathematical Investigation of Pitch Systems. David Meredith MIPS A Formal Language for the Mathematical Investigation of Pitch Sstems David Meredith September 10, 2001 Contents 1 Introduction to MIPS and the genus representation of octave equivalence 4 1.1 Introduction..............................................

More information

Optimal Speech Enhancement Under Signal Presence Uncertainty Using Log-Spectral Amplitude Estimator

Optimal Speech Enhancement Under Signal Presence Uncertainty Using Log-Spectral Amplitude Estimator 1 Optimal Speech Enhancement Under Signal Presence Uncertainty Using Log-Spectral Amplitude Estimator Israel Cohen Lamar Signal Processing Ltd. P.O.Box 573, Yokneam Ilit 20692, Israel E-mail: icohen@lamar.co.il

More information

arxiv: v3 [physics.pop-ph] 14 Nov 2015

arxiv: v3 [physics.pop-ph] 14 Nov 2015 Revising the Musical Equal Temperament Haye Hinrichsen arxiv:508.02292v3 [physics.pop-ph] 4 Nov 205 Universität Würzburg, Fakultät für Physik und Astronomie Campus Süd, Am Hubland, 97074 Würzburg, Germany

More information

L29: Fourier analysis

L29: Fourier analysis L29: Fourier analysis Introduction The discrete Fourier Transform (DFT) The DFT matrix The Fast Fourier Transform (FFT) The Short-time Fourier Transform (STFT) Fourier Descriptors CSCE 666 Pattern Analysis

More information

2D Spectrogram Filter for Single Channel Speech Enhancement

2D Spectrogram Filter for Single Channel Speech Enhancement Proceedings of the 7th WSEAS International Conference on Signal, Speech and Image Processing, Beijing, China, September 15-17, 007 89 D Spectrogram Filter for Single Channel Speech Enhancement HUIJUN DING,

More information

Sound 2: frequency analysis

Sound 2: frequency analysis COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2 Wave equation Pressure

More information

Speech Signal Representations

Speech Signal Representations Speech Signal Representations Berlin Chen 2003 References: 1. X. Huang et. al., Spoken Language Processing, Chapters 5, 6 2. J. R. Deller et. al., Discrete-Time Processing of Speech Signals, Chapters 4-6

More information

Fourier Analysis of Signals Using the DFT

Fourier Analysis of Signals Using the DFT Fourier Analysis of Signals Using the DFT ECE 535 Lecture April 29, 23 Overview: Motivation Many applications require analyzing the frequency content of signals Speech processing study resonances of vocal

More information

CRYSTALLIZATION SONIFICATION OF HIGH-DIMENSIONAL DATASETS

CRYSTALLIZATION SONIFICATION OF HIGH-DIMENSIONAL DATASETS Proceedings of the 22 International Conference on Auditory Display, Kyoto, Japan, July 2 5, 22 CRYSTALLIZATION SONIFICATION OF HIGH-DIMENSIONAL DATASETS T. Hermann Faculty of Technology Bielefeld University,

More information

Numerical Aspects of Gabor Analysis

Numerical Aspects of Gabor Analysis Numerical Harmonic Analysis Group hans.feichtinger@univie.ac.at www.nuhag.eu DOWNLOADS: http://www.nuhag.eu/bibtex Graz, April 12th, 2013 9-th Austrian Numerical Analysis Day hans.feichtinger@univie.ac.at

More information

MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION. Ikuo Degawa, Kei Sato, Masaaki Ikehara

MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION. Ikuo Degawa, Kei Sato, Masaaki Ikehara MULTIPITCH ESTIMATION AND INSTRUMENT RECOGNITION BY EXEMPLAR-BASED SPARSE REPRESENTATION Ikuo Degawa, Kei Sato, Masaaki Ikehara EEE Dept. Keio University Yokohama, Kanagawa 223-8522 Japan E-mail:{degawa,

More information

Time-Frequency Analysis: Fourier Transforms and Wavelets

Time-Frequency Analysis: Fourier Transforms and Wavelets Chapter 4 Time-Frequenc Analsis: Fourier Transforms and Wavelets 4. Basics of Fourier Series 4.. Introduction Joseph Fourier (768-83) who gave his name to Fourier series, was not the first to use Fourier

More information

I. Signals & Sinusoids

I. Signals & Sinusoids I. Signals & Sinusoids [p. 3] Signal definition Sinusoidal signal Plotting a sinusoid [p. 12] Signal operations Time shifting Time scaling Time reversal Combining time shifting & scaling [p. 17] Trigonometric

More information

Time-Frequency Analysis: Fourier Transforms and Wavelets

Time-Frequency Analysis: Fourier Transforms and Wavelets Chapter 4 Time-Frequenc Analsis: Fourier Transforms and Wavelets 4. Basics of Fourier Series 4.. Introduction Joseph Fourier (768-83) who gave his name to Fourier series, was not the first to use Fourier

More information

A POSTERIORI SPEECH PRESENCE PROBABILITY ESTIMATION BASED ON AVERAGED OBSERVATIONS AND A SUPER-GAUSSIAN SPEECH MODEL

A POSTERIORI SPEECH PRESENCE PROBABILITY ESTIMATION BASED ON AVERAGED OBSERVATIONS AND A SUPER-GAUSSIAN SPEECH MODEL A POSTERIORI SPEECH PRESENCE PROBABILITY ESTIMATION BASED ON AVERAGED OBSERVATIONS AND A SUPER-GAUSSIAN SPEECH MODEL Balázs Fodor Institute for Communications Technology Technische Universität Braunschweig

More information

An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier Transforms

An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier Transforms An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier Transforms Sylvain Marchand To cite this version: Sylvain Marchand. An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier

More information

Analysis of polyphonic audio using source-filter model and non-negative matrix factorization

Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Analysis of polyphonic audio using source-filter model and non-negative matrix factorization Tuomas Virtanen and Anssi Klapuri Tampere University of Technology, Institute of Signal Processing Korkeakoulunkatu

More information

Timbral, Scale, Pitch modifications

Timbral, Scale, Pitch modifications Introduction Timbral, Scale, Pitch modifications M2 Mathématiques / Vision / Apprentissage Audio signal analysis, indexing and transformation Page 1 / 40 Page 2 / 40 Modification of playback speed Modifications

More information

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan

SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS. Emad M. Grais and Hakan Erdogan SINGLE CHANNEL SPEECH MUSIC SEPARATION USING NONNEGATIVE MATRIX FACTORIZATION AND SPECTRAL MASKS Emad M. Grais and Hakan Erdogan Faculty of Engineering and Natural Sciences, Sabanci University, Orhanli

More information

Signal Processing COS 323

Signal Processing COS 323 Signal Processing COS 323 Digital Signals D: functions of space or time e.g., sound 2D: often functions of 2 spatial dimensions e.g. images 3D: functions of 3 spatial dimensions CAT, MRI scans or 2 space,

More information

Analytic discrete cosine harmonic wavelet transform(adchwt) and its application to signal/image denoising

Analytic discrete cosine harmonic wavelet transform(adchwt) and its application to signal/image denoising Analytic discrete cosine harmonic wavelet transform(adchwt) and its application to signal/image denoising M. Shivamurti and S. V. Narasimhan Digital signal processing and Systems Group Aerospace Electronic

More information

LAB 2: DTFT, DFT, and DFT Spectral Analysis Summer 2011

LAB 2: DTFT, DFT, and DFT Spectral Analysis Summer 2011 University of Illinois at Urbana-Champaign Department of Electrical and Computer Engineering ECE 311: Digital Signal Processing Lab Chandra Radhakrishnan Peter Kairouz LAB 2: DTFT, DFT, and DFT Spectral

More information

Notes on Wavelets- Sandra Chapman (MPAGS: Time series analysis) # $ ( ) = G f. y t

Notes on Wavelets- Sandra Chapman (MPAGS: Time series analysis) # $ ( ) = G f. y t Wavelets Recall: we can choose! t ) as basis on which we expand, ie: ) = y t ) = G! t ) y t! may be orthogonal chosen or appropriate properties. This is equivalent to the transorm: ) = G y t )!,t )d 2

More information

Physical Acoustics. Hearing is the result of a complex interaction of physics, physiology, perception and cognition.

Physical Acoustics. Hearing is the result of a complex interaction of physics, physiology, perception and cognition. Physical Acoustics Hearing, auditory perception, or audition is the ability to perceive sound by detecting vibrations, changes in the pressure of the surrounding medium through time, through an organ such

More information

Multiscale Image Transforms

Multiscale Image Transforms Multiscale Image Transforms Goal: Develop filter-based representations to decompose images into component parts, to extract features/structures of interest, and to attenuate noise. Motivation: extract

More information

Jean Morlet and the Continuous Wavelet Transform (CWT)

Jean Morlet and the Continuous Wavelet Transform (CWT) Jean Morlet and the Continuous Wavelet Transform (CWT) Brian Russell 1 and Jiajun Han 1 CREWES Adjunct Professor CGG GeoSoftware Calgary Alberta. www.crewes.org Introduction In 198 Jean Morlet a geophysicist

More information

Time-domain representations

Time-domain representations Time-domain representations Speech Processing Tom Bäckström Aalto University Fall 2016 Basics of Signal Processing in the Time-domain Time-domain signals Before we can describe speech signals or modelling

More information

ROBUST REALTIME POLYPHONIC PITCH DETECTION

ROBUST REALTIME POLYPHONIC PITCH DETECTION ROBUST REALTIME POLYPHONIC PITCH DETECTION by John M. Thomas A Thesis Submitted to the Graduate Faculty of George Mason University In Partial fulfillment of The Requirements for the Degree of Master of

More information

Identification and separation of noises with spectro-temporal patterns

Identification and separation of noises with spectro-temporal patterns PROCEEDINGS of the 22 nd International Congress on Acoustics Soundscape, Psychoacoustics and Urban Environment: Paper ICA2016-532 Identification and separation of noises with spectro-temporal patterns

More information

PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals

PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals PreFEst: A Predominant-F0 Estimation Method for Polyphonic Musical Audio Signals Masataka Goto National Institute of Advanced Industrial Science and Technology (AIST). IT, AIST, 1-1-1 Umezono, Tsukuba,

More information

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka

MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION. Hirokazu Kameoka MULTI-RESOLUTION SIGNAL DECOMPOSITION WITH TIME-DOMAIN SPECTROGRAM FACTORIZATION Hiroazu Kameoa The University of Toyo / Nippon Telegraph and Telephone Corporation ABSTRACT This paper proposes a novel

More information

Sound. Speed of Sound

Sound. Speed of Sound Sound TUNING FORK CREATING SOUND WAVES GUITAR STRING CREATING SOUND WAVES Speed of Sound Sound travels at a speed that depends on the medium through which it propagates. The speed of sound depends: - directly

More information

Lecture 14 1/38 Phys 220. Final Exam. Wednesday, August 6 th 10:30 am 12:30 pm Phys multiple choice problems (15 points each 300 total)

Lecture 14 1/38 Phys 220. Final Exam. Wednesday, August 6 th 10:30 am 12:30 pm Phys multiple choice problems (15 points each 300 total) Lecture 14 1/38 Phys 220 Final Exam Wednesday, August 6 th 10:30 am 12:30 pm Phys 114 20 multiple choice problems (15 points each 300 total) 75% will be from Chapters 10-16 25% from Chapters 1-9 Students

More information

Bayesian Estimation of Time-Frequency Coefficients for Audio Signal Enhancement

Bayesian Estimation of Time-Frequency Coefficients for Audio Signal Enhancement Bayesian Estimation of Time-Frequency Coefficients for Audio Signal Enhancement Patrick J. Wolfe Department of Engineering University of Cambridge Cambridge CB2 1PZ, UK pjw47@eng.cam.ac.uk Simon J. Godsill

More information