Audio Features: Fourier Transform and Short Time Fourier Transform
Transcription
Advanced Course Computer Science: Music Processing, Summer Term 2009
Meinard Müller, Saarland University and MPI Informatik

Audio Features

Fourier Transform
Tells which notes (frequencies) are played, but does not tell when the notes are played.
Frequency information is averaged over the entire time interval.
Time information is hidden in the phase.

Windowed Fourier Transform (WFT), also known as Short Time Fourier Transform (STFT) (Dennis Gabor, 1946)
Idea: To recover time information, only a small section of the signal is used for the spectral analysis. This section is determined by a window function g.

Definition: STFT w.r.t. g of a signal f:
$$\tilde{f}_g(t,\omega) := \int_{\mathbb{R}} f(u)\,\overline{g(u-t)}\,e^{-2\pi i \omega u}\,du = \langle f \mid g_{t,\omega} \rangle \quad\text{with}\quad g_{t,\omega}(u) := e^{2\pi i \omega u}\,g(u-t)$$

Interpretation: g_{t,ω} represents a musical note of frequency ω which oscillates within the translated window given by u ↦ g(u − t). The inner product ⟨f | g_{t,ω}⟩ measures the correlation between the signal f and the musical note g_{t,ω}.

Window functions
Box window: discontinuities at window boundaries cause artefacts in the frequency domain.
Triangle window
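A minimal discrete sketch of the windowed analysis described above, using only the Python standard library. The function names `hann` and `stft`, the toy signal, and the parameters (window of 64 samples, hop of 32 samples) are illustrative assumptions, not taken from the slides. Each windowed section is correlated with sampled complex exponentials through a direct DFT, which is slow but mirrors the inner-product interpretation.

```python
import cmath
import math


def hann(n):
    """Hann window with n samples (periodic form)."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def stft(x, window, hop):
    """Return, per windowed section, the DFT coefficients k = 0 .. N/2."""
    n = len(window)
    frames = []
    for start in range(0, len(x) - n + 1, hop):
        # Cut out a short section of the signal and weight it by the window.
        section = [x[start + i] * window[i] for i in range(n)]
        # Correlate the section with complex exponentials of increasing frequency.
        coeffs = [
            sum(section[i] * cmath.exp(-2j * math.pi * k * i / n) for i in range(n))
            for k in range(n // 2 + 1)
        ]
        frames.append(coeffs)
    return frames


# Toy signal: a "note" with 4 cycles per window, followed by one with 12 cycles.
N = 64
x = [math.sin(2 * math.pi * 4 * i / N) for i in range(4 * N)]
x += [math.sin(2 * math.pi * 12 * i / N) for i in range(4 * N)]

frames = stft(x, hann(N), hop=N // 2)
# Index of the strongest coefficient in each frame.
peak_bins = [max(range(len(f)), key=lambda k: abs(f[k])) for f in frames]
print(peak_bins)
```

The strongest coefficient sits at index 4 for the early frames and at index 12 for the late frames, so the analysis reports not only which frequencies occur but also when they occur.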
Hann window

[Figure: chirp signal and its STFT with a box window]
[Figure: chirp signal and its STFT with a Hann window of length 0.05]

Time-Frequency Localization
The size of the window constitutes a compromise between time resolution and frequency resolution:
Large window: poor time resolution, good frequency resolution
Small window: good time resolution, poor frequency resolution

Heisenberg Uncertainty Principle: there is no window function that localizes in time and frequency with arbitrary precision.

[Figure: signal and its STFT with a Hann window of length 0.02]
[Figure: signal and its STFT with a Hann window of length [...]]
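The trade-off can be made concrete for a sampled signal. The sketch below assumes a hypothetical sampling rate of 22050 Hz and a few hypothetical window lengths; the helper name `resolutions` is also an assumption. It uses two standard relations: a window of N samples covers N / f_s seconds, and the DFT of that window has bins spaced f_s / N Hz apart.

```python
def resolutions(sampling_rate_hz, window_length):
    """Return (window duration in seconds, DFT bin spacing in Hz)."""
    duration_s = window_length / sampling_rate_hz
    bin_spacing_hz = sampling_rate_hz / window_length
    return duration_s, bin_spacing_hz


fs = 22050  # hypothetical sampling rate in Hz
table = {n: resolutions(fs, n) for n in (256, 1024, 4096)}
for n, (duration_s, bin_spacing_hz) in table.items():
    print(f"N = {n:5d}: window spans {duration_s * 1000:7.1f} ms, "
          f"bins are {bin_spacing_hz:6.2f} Hz apart")
```

The product of the two quantities is always 1, so a finer frequency grid can only be bought with a longer window, and vice versa.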
Heisenberg Uncertainty Principle
Information Cells: window function with center [...] and width [...]; information cells IC([...]).

MATLAB
MATLAB function SPECTROGRAM
N = window length (in samples)
M = overlap (usually [...])
Compute the DFT of length N for every windowed section
Keep the lower Fourier coefficients

Example
Let x be a DT signal.
Sampling rate: [...]
Window length: [...]
Overlap: [...]
Hopsize: [...]
Let [...] be the sequence of spectral vectors (for each window a vector of dimension [...]); [...] corresponds to window [...].
Time resolution: [...]
Frequency resolution: [...]

Model assumption: Equal tempered scale
MIDI pitches: [...]
Piano notes: [...]
Concert pitch: A4 (p = 69)
Center frequency: [...]
Logarithmic frequency distribution
Octave: doubling of frequency
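The center-frequency formula itself is not preserved in the transcription. The sketch below uses the usual equal-tempered convention, which is an assumption here: A4 (p = 69) is tuned to 440 Hz and each semitone multiplies the frequency by 2^(1/12), so that 12 semitones double it. The function name `pitch_to_frequency` is illustrative.

```python
def pitch_to_frequency(p, reference_pitch=69, reference_hz=440.0):
    """Center frequency in Hz of MIDI pitch p on the equal-tempered scale."""
    return reference_hz * 2.0 ** ((p - reference_pitch) / 12.0)


# One octave corresponds to 12 semitones and to a doubling of frequency.
for p in (57, 69, 81):
    print(p, round(pitch_to_frequency(p), 2))
```

Because the mapping is exponential in p, equal steps in pitch correspond to ever larger steps in Hz, which is the logarithmic frequency distribution mentioned above.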
Idea: Binning of Fourier coefficients
Divide up the frequency axis into logarithmically spaced pitch regions and combine the spectral coefficients of each region into a single pitch coefficient.

Time-frequency representation
Windowing in the time domain
Windowing in the frequency domain

[Table, values not preserved: columns Note, MIDI pitch, Center frequency [Hz], Left boundary [Hz], Right boundary [Hz], Width [Hz]; rows A, A#, B, C, C#, D, D#, E, F, F#, G, G#, A]

Details:
Let [...] be a spectral vector obtained from a spectrogram w.r.t. a sampling rate [...] and a window length N.
The spectral coefficient [...] corresponds to the frequency [...].
Let S(p) be the set of coefficients assigned to a pitch p.
Then the pitch coefficient is defined as [...].

Example: A4, p = 69
Center frequency: [...]
Lower bound: [...]
Upper bound: [...]
STFT with [...]: S(p = 69) = [...]
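The binning step can be sketched as follows. Several choices are assumptions because the formulas are missing from the transcription: coefficient k is mapped to the frequency k · f_s / N, the band of pitch p runs from the frequency of p − 0.5 to that of p + 0.5, the pitch coefficient is the sum of squared magnitudes over S(p), and the example uses a hypothetical sampling rate of 22050 Hz with N = 4096. All function names are illustrative.

```python
def pitch_to_frequency(p):
    """Equal-tempered center frequency in Hz, with A4 (p = 69) at 440 Hz."""
    return 440.0 * 2.0 ** ((p - 69) / 12.0)


def coefficient_set(p, sampling_rate_hz, window_length):
    """Indices k whose frequency k * fs / N falls into the band of pitch p."""
    lower = pitch_to_frequency(p - 0.5)
    upper = pitch_to_frequency(p + 0.5)
    return [
        k
        for k in range(window_length // 2 + 1)
        if lower <= k * sampling_rate_hz / window_length < upper
    ]


def pitch_coefficient(spectral_vector, p, sampling_rate_hz, window_length):
    """Combine all coefficients of the band into one value (sum of energies)."""
    indices = coefficient_set(p, sampling_rate_hz, window_length)
    return sum(abs(spectral_vector[k]) ** 2 for k in indices)


fs, N = 22050, 4096  # hypothetical analysis parameters
band = (pitch_to_frequency(68.5), pitch_to_frequency(69.5))
s_69 = coefficient_set(69, fs, N)
print("band of A4 in Hz:", [round(b, 2) for b in band])
print("S(69):", s_69)
```

Under these assumptions the band of A4 is roughly 427.5 Hz to 452.9 Hz, and with bins about 5.38 Hz apart it collects the five coefficients k = 80, ..., 84.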
Note: For some pitches, S(p) may be empty. This particularly holds for low notes corresponding to narrow frequency bands.
Linear frequency sampling is problematic!
Solution: Multi-resolution spectrograms or multirate filterbanks

Audio Representation
Example: Op. 100, No. 2 by Friedrich Burgmüller
[Figure: spectrogram (time in seconds, frequency in Hz, intensity) and pitch representation (time in samples, MIDI pitch, intensity in dB), with the pitches E4, C4, and A3 marked]

Example: Chromatic Scale
[Figure: spectrogram (time in seconds, frequency in Hz, intensity) and pitch representation (time in samples, MIDI pitch, intensity in dB)]
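The problem with low notes can be checked directly. The sketch below assumes the same conventions as a typical binning setup (band of pitch p from p − 0.5 to p + 0.5 on the equal-tempered scale, coefficient k at k · f_s / N) and hypothetical parameters: a sampling rate of 22050 Hz and the piano range p = 21 to 108. The function name `empty_pitches` is illustrative.

```python
def pitch_to_frequency(p):
    """Equal-tempered center frequency in Hz, with A4 (p = 69) at 440 Hz."""
    return 440.0 * 2.0 ** ((p - 69) / 12.0)


def empty_pitches(sampling_rate_hz, window_length, pitches=range(21, 109)):
    """Pitches whose frequency band contains no DFT coefficient at all."""
    spacing = sampling_rate_hz / window_length
    empty = []
    for p in pitches:
        lower = pitch_to_frequency(p - 0.5)
        upper = pitch_to_frequency(p + 0.5)
        hits = [
            k
            for k in range(window_length // 2 + 1)
            if lower <= k * spacing < upper
        ]
        if not hits:
            empty.append(p)
    return empty


fs = 22050  # hypothetical sampling rate in Hz
short_window = empty_pitches(fs, 4096)
long_window = empty_pitches(fs, 32768)
print("N = 4096: ", short_window)
print("N = 32768:", long_window)
```

With N = 4096 the linearly spaced bins miss some of the narrow low-pitch bands, while the much longer window fills every band at the cost of time resolution. This is one way to see why the slides point to multi-resolution spectrograms or multirate filterbanks.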
Human perception of pitch is periodic in the sense that two pitches are perceived as similar in color if they differ by an octave.
Separate pitch into two components: tone height (octave number) and chroma.
Chroma: the 12 traditional pitch classes of the equal-tempered scale. For example: chroma C [...]
Computation: pitch features → chroma features. Add up all pitches belonging to the same class.
Result: 12-dimensional chroma vector.

[Figures: chromatic circle; Shepard's helix of pitch perception (Bartsch/Wakefield, IEEE Trans. Multimedia)]

Sequence of chroma vectors correlates to the harmonic progression.
Normalization makes features invariant to changes in dynamics.
Taking the logarithm before adding up pitch coefficients accounts for the logarithmic sensation of intensity.
Further quantization and smoothing: CENS features

Example: C-Major Scale
Example: Burgmüller Op. 100, No. 2
[Figure: chroma representation before and after normalization (time in samples, chroma, intensity in dB and intensity)]
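The computation from pitch features to chroma features can be sketched in a few lines. The input layout (a list indexed by MIDI pitch 0 to 127), the index convention (0 = C, since MIDI pitch 60 is C4), and the use of the Euclidean norm for normalization are assumptions; the slides only state that pitches of the same class are added up and that the result is normalized.

```python
import math


def chroma_vector(pitch_coefficients):
    """Add up all pitch coefficients that belong to the same pitch class."""
    chroma = [0.0] * 12
    for p, value in enumerate(pitch_coefficients):
        chroma[p % 12] += value  # pitches an octave apart share one class
    return chroma


def normalize(chroma):
    """Scale to unit Euclidean length; an all-zero vector is returned unchanged."""
    norm = math.sqrt(sum(v * v for v in chroma))
    if norm == 0.0:
        return list(chroma)
    return [v / norm for v in chroma]


# Hypothetical pitch features: C4, E4, G4 and C5 sound with equal energy.
pitch_features = [0.0] * 128
for p in (60, 64, 67, 72):
    pitch_features[p] = 1.0

soft = normalize(chroma_vector(pitch_features))
loud = normalize(chroma_vector([10.0 * v for v in pitch_features]))
print([round(v, 3) for v in soft])
```

C4 and C5 fall into the same class, so the chroma C receives twice the energy of E and G. The soft and the loud version yield the same normalized vector, which illustrates the invariance to changes in dynamics.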
Example: Bach Toccata (Koopman, Ruebsam)
[Figures: features of both recordings at feature resolutions of 10 Hz, 1 Hz, and 0.33 Hz]

Example: Beethoven's Fifth (Bernstein)
[Figure: WAV, Chroma (10 Hz), CENS (1 Hz)]
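Lower feature resolutions such as 1 Hz or 0.33 Hz can be obtained from a 10 Hz chroma sequence by smoothing over time and keeping only every d-th frame. The sketch below shows only this temporal step with a plain moving average; it is not the CENS procedure itself, which according to the slides also involves quantization, and the window length of 41 frames and the factor of 10 are hypothetical choices.

```python
def smooth_and_downsample(chroma_sequence, window_length, factor):
    """Average each chroma band over neighbouring frames, then keep every factor-th frame."""
    half = window_length // 2
    total = len(chroma_sequence)
    smoothed = []
    for n in range(total):
        # Frames inside the averaging window, clipped at the sequence borders.
        lo = max(0, n - half)
        hi = min(total, n + half + 1)
        frame = [
            sum(chroma_sequence[m][c] for m in range(lo, hi)) / (hi - lo)
            for c in range(12)
        ]
        smoothed.append(frame)
    return smoothed[::factor]


# Hypothetical input: 100 frames at 10 Hz (10 seconds), chroma C first, then chroma G.
c_frame = [1.0] + [0.0] * 11
g_frame = [0.0] * 7 + [1.0] + [0.0] * 4
sequence_10hz = [c_frame] * 50 + [g_frame] * 50

sequence_1hz = smooth_and_downsample(sequence_10hz, window_length=41, factor=10)
print(len(sequence_1hz), "frames at 1 Hz")
```

A factor of 30 instead of 10 would give roughly 0.33 Hz. Coarser features blur local differences in articulation and tempo, which is presumably why they are shown for comparing different interpretations of the same piece.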
Example: Beethoven's Fifth (Bernstein) and Beethoven's Fifth (Piano/Sherbakov)
[Figure: WAV, Chroma (10 Hz), CENS (1 Hz)]

Example: Brahms Hungarian Dance No. [...]

Example: Zager & Evans, In The Year 2525
How to deal with transpositions?
[Figure: chroma representation, original and shifted]
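A transposition by a number of semitones moves every pitch class to another one, so on the chroma level it acts as a cyclic shift of the 12-dimensional vector. The sketch below illustrates this idea; the function names, the index convention (0 = C), and the inner product as similarity measure are assumptions, since the slides only show an original and a shifted representation.

```python
def shift_chroma(chroma, semitones):
    """Cyclically shift a 12-dimensional chroma vector upwards by some semitones."""
    s = semitones % 12
    if s == 0:
        return list(chroma)
    return chroma[-s:] + chroma[:-s]


def best_shift(chroma_a, chroma_b):
    """Shift (0..11) of chroma_a that correlates best with chroma_b."""
    def similarity(s):
        shifted = shift_chroma(chroma_a, s)
        return sum(x * y for x, y in zip(shifted, chroma_b))
    return max(range(12), key=similarity)


# C major triad (C, E, G) and the same chord transposed up by two semitones (D, F#, A).
c_major = [1.0, 0, 0, 0, 1.0, 0, 0, 1.0, 0, 0, 0, 0]
d_major = shift_chroma(c_major, 2)
print(d_major)
print("estimated transposition:", best_shift(c_major, d_major), "semitones")
```

Trying all twelve shifts and keeping the best match is one simple way to compare two recordings of the same piece that are played in different keys.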