David Weenink. First semester 2007
|
|
- Amberly Stevenson
- 6 years ago
- Views:
Transcription
1 Institute of Phonetic Sciences University of Amsterdam First semester 2007
2
3 Definition (ANSI: In Psycho-acoustics) is that auditory attribute of sound according to which sounds can be ordered on a scale from low to high. This ordering is unique only for sine tones! Complex tones in general can have several pitches. Perception of pitch is a compex phenomenon: Definition of Praat s Meter The Object From to Sound of pure tone depends on: Intensity Spectral composition Type and intensity of superimposed additional sound Which ear is stimulated
4 Definition (ANSI: In Psycho-acoustics) is that auditory attribute of sound according to which sounds can be ordered on a scale from low to high. This ordering is unique only for sine tones! Complex tones in general can have several pitches. Perception of pitch is a compex phenomenon: Definition of Praat s Meter The Object From to Sound of pure tone depends on: Intensity Spectral composition Type and intensity of superimposed additional sound Which ear is stimulated
5 Definition (ANSI: In Psycho-acoustics) is that auditory attribute of sound according to which sounds can be ordered on a scale from low to high. This ordering is unique only for sine tones! Complex tones in general can have several pitches. Perception of pitch is a compex phenomenon: Definition of Praat s Meter The Object From to Sound of pure tone depends on: Intensity Spectral composition Type and intensity of superimposed additional sound Which ear is stimulated
6 Definition (ANSI: In Psycho-acoustics) is that auditory attribute of sound according to which sounds can be ordered on a scale from low to high. This ordering is unique only for sine tones! Complex tones in general can have several pitches. Perception of pitch is a compex phenomenon: Definition of Praat s Meter The Object From to Sound of pure tone depends on: Intensity Spectral composition Type and intensity of superimposed additional sound Which ear is stimulated
7 Intensity Stevens, S.S. (1935), The relation of pitch to intensity, J. Acoust. Soc. Am. 6, Measurements were made of the amount by which the pitch of tones ranging from 150 to 12,000 Hz is changed by an increase in intensity. Observers were presented alternately with two tones of different frequency and required to make them sound equal in pitch by varying the intensity of one of the tones. The results, when plotted as equal pitch contours, show (1) the pitch of high tones increases with intensity, (2) the pitch of low tones decreases with intensity, (3) the point at which the effect reverses varies with intensity level. The fact that the points of reversal correspond quite closely with the points of greatest sensitivity of the ear, as shown by contours of equal loudness, suggests that the change in pitch with intensity is due to the resonant characteristics of the ear. Back to Definition of Praat s Meter The Object From to Sound
8 Superimposed Additional Sound Allanson, J.T., Schenkel, K.D. (1965). The effect of band-limited noise on the pitch of pure tones. J. Sound Vib. 2, An investigation has been made of the effect of a band of noise, one-third of an octave wide, on the perceived pitch of a pure tone. In general, the pitch was found to move away from the interfering noise. However, in contrast with the results of erlier workers, the shifts in pitch were found to be quite small and it is suggested that this may be due to the difference in experimental procedures. Back to Definition of Praat s Meter The Object From to Sound
9 Spectral Composition Terhardt, E. (1971). Die Tonhöhe Harmonischer Klänge und das Oktavintervall. Acustica 24, The frequencies of a sinusoidal tone and of a complex tone with the same pitch are slightly different. The investigations show that usually the pitch of a complex tone is lower than the pitch of sinusoidal tone of the same (fundamental) frequency. The frequency ratio corresponding to the subjectively correct pitch interval of a musical octave usually differs slightly from the value 2. This phenomenon was investigated with low pure tones and with complex tones. The results for complex tones are explained by the octave intervals that were found with simple tones and the pitch differences between simple and complex tones. Back to Definition of Praat s Meter The Object From to Sound
10 Interaural Difference Observation When one and the same tone is alternately presented to the right and left ear alone, there is a good chance that a slight but, for a fixed frequency systematic, difference between the pitch from the right ear and that from the left can be noticed. Definition of Praat s Meter The Object From to Sound It escapes notice because the pitches of the two ears merge in conscious perception. of sine tones is basically created by each ear individually! Ref: persons/ter/top/diplacus.html Back to
11 Relations between frequency is signal property is a subjective property (Du: toonhoogte) Definition of Praat s Meter The Object From to Sound Monotone relation between pitch and F 0
12 Relations between frequency is signal property is a subjective property (Du: toonhoogte) Definition of Praat s Meter The Object From to Sound Monotone relation between pitch and F 0
13 Relations between frequency is signal property is a subjective property (Du: toonhoogte) Definition of Praat s Meter The Object From to Sound Monotone relation between pitch and F 0
14 Time-domain: inverse filtering, autocorrelation... -domain: via harmonic structure in the spectrum, cepstrum, harmonic sieve... Auditory modeling: auditive filterbank neural transduction spike generation interval detection candidates. Definition of Praat s Meter The Object From to Sound
15 Time-domain: inverse filtering, autocorrelation... -domain: via harmonic structure in the spectrum, cepstrum, harmonic sieve... Auditory modeling: auditive filterbank neural transduction spike generation interval detection candidates. Definition of Praat s Meter The Object From to Sound
16 Time-domain: inverse filtering, autocorrelation... -domain: via harmonic structure in the spectrum, cepstrum, harmonic sieve... Auditory modeling: auditive filterbank neural transduction spike generation interval detection candidates. Definition of Praat s Meter The Object From to Sound
17 Time-domain: inverse filtering, autocorrelation... -domain: via harmonic structure in the spectrum, cepstrum, harmonic sieve... Auditory modeling: auditive filterbank neural transduction spike generation interval detection candidates. Definition of Praat s Meter The Object From to Sound A good pitch meter gives several candidates for pitch
18 praat s Meter Table: Accuracy of the algorithm with Hanning window. Periods per determination error window F /F sine wave pulse train > 3 < < > 6 < < > 12 < < > 3 < 10 6 (Gaussian window) Definition of Praat s Meter The Object From to Sound
19 praat s Algorithm Two steps: 1 Find the pitch candidates and their strengths 2 Determine optimal path in candidate space on the basis of costs and merits Definition of Praat s Meter The Object From to Sound
20 praat s Algorithm Two steps: 1 Find the pitch candidates and their strengths 2 Determine optimal path in candidate space on the basis of costs and merits Definition of Praat s Meter The Object From to Sound
21 Finding the Candidates (1) autocorrelation 1-r r r 0 Definition of Praat s Meter The Object From to Sound τ max Lag τ (s) r(τ) = x(t)x(t + τ)dt (autocorrelation) r (τ) = r(τ)/r(0) (normalized autocorrelation) Every peak in autocorrelation function is pitch candidate HNR = 10 log r 1 r (harmonics-to-noise ratio in db) Position and amplitude of peak by sin x/x interpolation.
22 Finding the Candidates (1) autocorrelation 1-r r r 0 Definition of Praat s Meter The Object From to Sound τ max Lag τ (s) r(τ) = x(t)x(t + τ)dt (autocorrelation) r (τ) = r(τ)/r(0) (normalized autocorrelation) Every peak in autocorrelation function is pitch candidate HNR = 10 log r 1 r (harmonics-to-noise ratio in db) Position and amplitude of peak by sin x/x interpolation.
23 Finding the Candidates (1) autocorrelation 1-r r r 0 Definition of Praat s Meter The Object From to Sound τ max Lag τ (s) r(τ) = x(t)x(t + τ)dt (autocorrelation) r (τ) = r(τ)/r(0) (normalized autocorrelation) Every peak in autocorrelation function is pitch candidate HNR = 10 log r 1 r (harmonics-to-noise ratio in db) Position and amplitude of peak by sin x/x interpolation.
24 Finding the Candidates (1) autocorrelation 1-r r r 0 Definition of Praat s Meter The Object From to Sound τ max Lag τ (s) r(τ) = x(t)x(t + τ)dt (autocorrelation) r (τ) = r(τ)/r(0) (normalized autocorrelation) Every peak in autocorrelation function is pitch candidate HNR = 10 log r 1 r (harmonics-to-noise ratio in db) Position and amplitude of peak by sin x/x interpolation.
25 Finding the Candidates (2): Autocorrelation and Window Correction 1 (1) x(t) -1 0 Time (ms) 24 1 (2) w(t) 0 0 Time (ms) 24 1 (3) a(t)=w(t)x(t) -1 0 Time (ms) 24 Definition of Praat s Meter The Object From to Sound 1 (4) r a (τ) 1 (5) r w (τ) 1 (6) r x (τ) Lag (ms) Lag (ms) Lag (ms) 24 x(t) = ( sin 2π140t) sin 2π280t x(t) multiplied by w(t) gives a(t) r a (τ) divided by r w (τ) gives r x (τ)
26 Finding the Optimal Path (1): The Problem The Problem N analysis frames M candidates per frame Find the best path } O(M N )paths Definition of Praat s Meter The Object From to Sound N = 100 and M = 10 gives possible paths. This will take year (10 9 paths /s) We need something better: dynamic programming O(N M)! A reduction of computing time from to
27 Finding the Optimal Path (2): The Solution Introduce positive and negative costs and find the path with maximum costs with a dynamic programming algorithm. There are two kinds of costs involved 1 Costs within each frame Definition of Praat s Meter The Object From to Sound 2 Costs for transitions between frames
28 Finding the Optimal Path (2): The Solution Introduce positive and negative costs and find the path with maximum costs with a dynamic programming algorithm. There are two kinds of costs involved 1 Costs within each frame Definition of Praat s Meter The Object From to Sound 2 Costs for transitions between frames
29 Finding the Optimal Path (3): Costs Within On the + side: r (τ max ) On the - side: OctaveCost 2 log(minimum τ max ) R = r (τ max ) OctaveCost 2 log(minimum τ max ) Best local candidate has highest R OctaveCost favors higher frequencies: Perfect periodic signal For x(t) = (1 + d mod sin 2πFt)sin2π2Ft, perceived pitch is 2F when d mod 0.3 Criterion: OctaveCost = d 2 mod Default is 0.01, i.e. d mod = 0.1 Definition of Praat s Meter The Object From to Sound
30 Finding the Optimal Path (3): Costs Within On the + side: r (τ max ) On the - side: OctaveCost 2 log(minimum τ max ) R = r (τ max ) OctaveCost 2 log(minimum τ max ) Best local candidate has highest R OctaveCost favors higher frequencies: Perfect periodic signal For x(t) = (1 + d mod sin 2πFt)sin2π2Ft, perceived pitch is 2F when d mod 0.3 Criterion: OctaveCost = d 2 mod Default is 0.01, i.e. d mod = 0.1 Definition of Praat s Meter The Object From to Sound
31 Finding the Optimal Path (3): Costs Within On the + side: r (τ max ) On the - side: OctaveCost 2 log(minimum τ max ) R = r (τ max ) OctaveCost 2 log(minimum τ max ) Best local candidate has highest R OctaveCost favors higher frequencies: Perfect periodic signal For x(t) = (1 + d mod sin 2πFt)sin2π2Ft, perceived pitch is 2F when d mod 0.3 Criterion: OctaveCost = d 2 mod Default is 0.01, i.e. d mod = 0.1 Definition of Praat s Meter The Object From to Sound Optimal path: connect all the best local candidates
32 Finding the Optimal Path (4): Costs Between On the transition from frame i to i + 1 we can have octave jumps or voiced-unvoiced jumps - costs: { VoicedUnvoicedCost Fi = 0 xor F i+1 = 0 OctaveJumpCost 2 log F i F i+1 F i 0 and F i+1 0 Considerations Increasing VoicedUnvoicedCost decreases the number of voiced-unvoiced transitions Increasing OctaveJumpCost decreases the number of octave jumps OctaveJumpCost=0 and VoicedUnvoicedCost=0: no path finder, local best candidate selected. Definition of Praat s Meter The Object From to Sound
33 Finding the Optimal Path (4): Costs Between On the transition from frame i to i + 1 we can have octave jumps or voiced-unvoiced jumps - costs: { VoicedUnvoicedCost Fi = 0 xor F i+1 = 0 OctaveJumpCost 2 log F i F i+1 F i 0 and F i+1 0 Considerations Increasing VoicedUnvoicedCost decreases the number of voiced-unvoiced transitions Increasing OctaveJumpCost decreases the number of octave jumps OctaveJumpCost=0 and VoicedUnvoicedCost=0: no path finder, local best candidate selected. Definition of Praat s Meter The Object From to Sound
34 Finding the Optimal Path (4): Costs Between On the transition from frame i to i + 1 we can have octave jumps or voiced-unvoiced jumps - costs: { VoicedUnvoicedCost Fi = 0 xor F i+1 = 0 OctaveJumpCost 2 log F i F i+1 F i 0 and F i+1 0 Considerations Increasing VoicedUnvoicedCost decreases the number of voiced-unvoiced transitions Increasing OctaveJumpCost decreases the number of octave jumps OctaveJumpCost=0 and VoicedUnvoicedCost=0: no path finder, local best candidate selected. Definition of Praat s Meter The Object From to Sound
35 Extra Correction in the PathFinder Extra correction for timestep values different from 0.01: timestepcorrection = 0.01 / timestep Definition of Praat s Meter The Object From to Sound OctaveJumpCost *= timestepcorrection VoicedUnvoicedCost *= timestepcorrection
36 The Object Attributes (Sampled) xmin, xmax Time domain nx Number of analysis frames dx Timestep x1 Centre of first frame ceiling Maximum frequency, if above then voiceless. maxncandidates Maximum number of candidates frame Row (1..nx) of Frame s Definition of Praat s Meter The Object From to Sound
37 The Object Attributes (Sampled) xmin, xmax Time domain nx Number of analysis frames dx Timestep x1 Centre of first frame ceiling Maximum frequency, if above then voiceless. maxncandidates Maximum number of candidates frame Row (1..nx) of Frame s Definition of Praat s Meter The Object From to Sound
38 The Object Attributes (Sampled) xmin, xmax Time domain nx Number of analysis frames dx Timestep x1 Centre of first frame ceiling Maximum frequency, if above then voiceless. maxncandidates Maximum number of candidates frame Row (1..nx) of Frame s Definition of Praat s Meter The Object From to Sound
39 The Object Attributes (Sampled) xmin, xmax Time domain nx Number of analysis frames dx Timestep x1 Centre of first frame ceiling Maximum frequency, if above then voiceless. maxncandidates Maximum number of candidates frame Row (1..nx) of Frame s Definition of Praat s Meter The Object From to Sound
40 The Object Attributes (Sampled) xmin, xmax Time domain nx Number of analysis frames dx Timestep x1 Centre of first frame ceiling Maximum frequency, if above then voiceless. maxncandidates Maximum number of candidates frame Row (1..nx) of Frame s Definition of Praat s Meter The Object From to Sound
41 The Object Attributes (Sampled) xmin, xmax Time domain nx Number of analysis frames dx Timestep x1 Centre of first frame ceiling Maximum frequency, if above then voiceless. maxncandidates Maximum number of candidates frame Row (1..nx) of Frame s Definition of Praat s Meter The Object From to Sound
42 The Object Attributes (Sampled) xmin, xmax Time domain nx Number of analysis frames dx Timestep x1 Centre of first frame ceiling Maximum frequency, if above then voiceless. maxncandidates Maximum number of candidates frame Row (1..nx) of Frame s Definition of Praat s Meter The Object From to Sound
43 Frame Frame Attributes intensity relative intensity (0: silence) ncandidates Number of candidates in frame candidate Row (1..nCandidates) of Candidate Definition of Praat s Meter The Object From to Sound Candidate Attributes frequency in Hz, 0 for voiceless. strength A number in [0, 1]
44 Frame Frame Attributes intensity relative intensity (0: silence) ncandidates Number of candidates in frame candidate Row (1..nCandidates) of Candidate Definition of Praat s Meter The Object From to Sound Candidate Attributes frequency in Hz, 0 for voiceless. strength A number in [0, 1]
45 Frame Frame Attributes intensity relative intensity (0: silence) ncandidates Number of candidates in frame candidate Row (1..nCandidates) of Candidate Definition of Praat s Meter The Object From to Sound Candidate Attributes frequency in Hz, 0 for voiceless. strength A number in [0, 1]
46 Frame Frame Attributes intensity relative intensity (0: silence) ncandidates Number of candidates in frame candidate Row (1..nCandidates) of Candidate Definition of Praat s Meter The Object From to Sound Candidate Attributes frequency in Hz, 0 for voiceless. strength A number in [0, 1]
47 Frame Frame Attributes intensity relative intensity (0: silence) ncandidates Number of candidates in frame candidate Row (1..nCandidates) of Candidate Definition of Praat s Meter The Object From to Sound Candidate Attributes frequency in Hz, 0 for voiceless. strength A number in [0, 1]
48 From to Sound (1) To make a sound with a prescribed F 0 contour: Start with a Tier object, i.e. a series of (time,pitch) points without voiced-unvoiced info. (Hz) Time (s) Definition of Praat s Meter The Object From to Sound With two points specified, the pitch follows the blue line.
49 From to Sound (2) Next: To Sound (pulse train)... We want to generate a signal s(t) specified as s(t) = n i=1 p iδ(t t i ), where the t i are the positions of the pulses. Not band-limited signal: times t i are not necessarily at multiples of T How do we make a band-limited signal from s(t)? Filtering with a low-pass filter! We do not know the spectrum of s(t) so we make the convolution with the impulse response of the low-pass filter. Definition of Praat s Meter The Object From to Sound
50 Intermezzo: Pulses and Low-Pass Convolution of Pulse Series with Impulse Response of a Low-Pass Impulse response of a square low-pass [ f c, f c ] is: h(t) = + H(f )e2πift df = +f c 1 f c 2f c e 2πift df Now the convolution: = sin 2πfct 2πf ct Definition of Praat s Meter The Object From to Sound g(t) = h(t) s(t) = h(τ)s(t τ)dτ = = sin 2πf cτ n i=1 p iδ(t t i τ)dτ 2πf cτ = n i=1 sin 2πfcτ 2πf cτ We have used f (t)δ(t t 0 )dt = f (t 0 ). This is like the resample formula p i δ(t t i τ)dτ = n i=1 p i sin 2πfc(t t i ) 2πf c(t t i )
51 Check Create Sound from formula... sinxx Mono sin(2*pi*1000*(x-0.5))/(2*pi*1000*(x-0.5)) To Spectrum... n Edit Definition of Praat s Meter The Object From to Sound
52 Boersma, P. (1993), Accurate short-time analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound, IFA Proceedings 17, Hermes, D.J. (1993), analysis, in Visual Representations of s, Cooke, M., Beet, S. & Crawford, M. (eds.), John Wiley & Sons Ltd. Terhardt, E., Akustische Kommunikation - Grundlagen mit Hörbeispielen. Springer, Berlin/Heidelberg, Definition of Praat s Meter The Object From to Sound
Signal representations: Cepstrum
Signal representations: Cepstrum Source-filter separation for sound production For speech, source corresponds to excitation by a pulse train for voiced phonemes and to turbulence (noise) for unvoiced phonemes,
More informationFeature extraction 1
Centre for Vision Speech & Signal Processing University of Surrey, Guildford GU2 7XH. Feature extraction 1 Dr Philip Jackson Cepstral analysis - Real & complex cepstra - Homomorphic decomposition Filter
More informationTimbral, Scale, Pitch modifications
Introduction Timbral, Scale, Pitch modifications M2 Mathématiques / Vision / Apprentissage Audio signal analysis, indexing and transformation Page 1 / 40 Page 2 / 40 Modification of playback speed Modifications
More informationPhysical Acoustics. Hearing is the result of a complex interaction of physics, physiology, perception and cognition.
Physical Acoustics Hearing, auditory perception, or audition is the ability to perceive sound by detecting vibrations, changes in the pressure of the surrounding medium through time, through an organ such
More informationTime-Frequency Analysis
Time-Frequency Analysis Basics of Fourier Series Philippe B. aval KSU Fall 015 Philippe B. aval (KSU) Fourier Series Fall 015 1 / 0 Introduction We first review how to derive the Fourier series of a function.
More informationSound 2: frequency analysis
COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2 Wave equation Pressure
More informationLinear Prediction 1 / 41
Linear Prediction 1 / 41 A map of speech signal processing Natural signals Models Artificial signals Inference Speech synthesis Hidden Markov Inference Homomorphic processing Dereverberation, Deconvolution
More informationSEISMIC WAVE PROPAGATION. Lecture 2: Fourier Analysis
SEISMIC WAVE PROPAGATION Lecture 2: Fourier Analysis Fourier Series & Fourier Transforms Fourier Series Review of trigonometric identities Analysing the square wave Fourier Transform Transforms of some
More informationUniversity of Colorado at Boulder ECEN 4/5532. Lab 2 Lab report due on February 16, 2015
University of Colorado at Boulder ECEN 4/5532 Lab 2 Lab report due on February 16, 2015 This is a MATLAB only lab, and therefore each student needs to turn in her/his own lab report and own programs. 1
More informationTopic 6. Timbre Representations
Topic 6 Timbre Representations We often say that singer s voice is magnetic the violin sounds bright this French horn sounds solid that drum sounds dull What aspect(s) of sound are these words describing?
More informationSPEECH ANALYSIS AND SYNTHESIS
16 Chapter 2 SPEECH ANALYSIS AND SYNTHESIS 2.1 INTRODUCTION: Speech signal analysis is used to characterize the spectral information of an input speech signal. Speech signal analysis [52-53] techniques
More informationCOMP 546, Winter 2018 lecture 19 - sound 2
Sound waves Last lecture we considered sound to be a pressure function I(X, Y, Z, t). However, sound is not just any function of those four variables. Rather, sound obeys the wave equation: 2 I(X, Y, Z,
More informationFourier Analysis Linear transformations and lters. 3. Fourier Analysis. Alex Sheremet. April 11, 2007
Stochastic processes review 3. Data Analysis Techniques in Oceanography OCP668 April, 27 Stochastic processes review Denition Fixed ζ = ζ : Function X (t) = X (t, ζ). Fixed t = t: Random Variable X (ζ)
More informationTime-Series Analysis for Ear-Related and Psychoacoustic Metrics
Time-Series Analysis for Ear-Related and Psychoacoustic Metrics V. Mellert, H. Remmers, R. Weber, B. Schulte-Fortkamp how to analyse p(t) to obtain an earrelated parameter? general remarks on acoustical
More informationLINEAR SYSTEMS. J. Elder PSYC 6256 Principles of Neural Coding
LINEAR SYSTEMS Linear Systems 2 Neural coding and cognitive neuroscience in general concerns input-output relationships. Inputs Light intensity Pre-synaptic action potentials Number of items in display
More informationFrequency Domain Speech Analysis
Frequency Domain Speech Analysis Short Time Fourier Analysis Cepstral Analysis Windowed (short time) Fourier Transform Spectrogram of speech signals Filter bank implementation* (Real) cepstrum and complex
More information06/12/ rws/jMc- modif SuFY10 (MPF) - Textbook Section IX 1
IV. Continuous-Time Signals & LTI Systems [p. 3] Analog signal definition [p. 4] Periodic signal [p. 5] One-sided signal [p. 6] Finite length signal [p. 7] Impulse function [p. 9] Sampling property [p.11]
More informationSolutions to Problems in Chapter 4
Solutions to Problems in Chapter 4 Problems with Solutions Problem 4. Fourier Series of the Output Voltage of an Ideal Full-Wave Diode Bridge Rectifier he nonlinear circuit in Figure 4. is a full-wave
More informationLecture 7: Pitch and Chord (2) HMM, pitch detection functions. Li Su 2016/03/31
Lecture 7: Pitch and Chord (2) HMM, pitch detection functions Li Su 2016/03/31 Chord progressions Chord progressions are not arbitrary Example 1: I-IV-I-V-I (C-F-C-G-C) Example 2: I-V-VI-III-IV-I-II-V
More informationNeural Encoding: Firing Rates and Spike Statistics
Neural Encoding: Firing Rates and Spike Statistics Dayan and Abbott (21) Chapter 1 Instructor: Yoonsuck Choe; CPSC 644 Cortical Networks Background: Dirac δ Function Dirac δ function has the following
More informationChapter 9. Linear Predictive Analysis of Speech Signals 语音信号的线性预测分析
Chapter 9 Linear Predictive Analysis of Speech Signals 语音信号的线性预测分析 1 LPC Methods LPC methods are the most widely used in speech coding, speech synthesis, speech recognition, speaker recognition and verification
More informationSignal Modeling Techniques in Speech Recognition. Hassan A. Kingravi
Signal Modeling Techniques in Speech Recognition Hassan A. Kingravi Outline Introduction Spectral Shaping Spectral Analysis Parameter Transforms Statistical Modeling Discussion Conclusions 1: Introduction
More informationAutomatic Speech Recognition (CS753)
Automatic Speech Recognition (CS753) Lecture 12: Acoustic Feature Extraction for ASR Instructor: Preethi Jyothi Feb 13, 2017 Speech Signal Analysis Generate discrete samples A frame Need to focus on short
More informationL6: Short-time Fourier analysis and synthesis
L6: Short-time Fourier analysis and synthesis Overview Analysis: Fourier-transform view Analysis: filtering view Synthesis: filter bank summation (FBS) method Synthesis: overlap-add (OLA) method STFT magnitude
More informationReview: Continuous Fourier Transform
Review: Continuous Fourier Transform Review: convolution x t h t = x τ h(t τ)dτ Convolution in time domain Derivation Convolution Property Interchange the order of integrals Let Convolution Property By
More informationCorrelation, discrete Fourier transforms and the power spectral density
Correlation, discrete Fourier transforms and the power spectral density visuals to accompany lectures, notes and m-files by Tak Igusa tigusa@jhu.edu Department of Civil Engineering Johns Hopkins University
More informationReference Text: The evolution of Applied harmonics analysis by Elena Prestini
Notes for July 14. Filtering in Frequency domain. Reference Text: The evolution of Applied harmonics analysis by Elena Prestini It all started with: Jean Baptist Joseph Fourier (1768-1830) Mathematician,
More informationL29: Fourier analysis
L29: Fourier analysis Introduction The discrete Fourier Transform (DFT) The DFT matrix The Fast Fourier Transform (FFT) The Short-time Fourier Transform (STFT) Fourier Descriptors CSCE 666 Pattern Analysis
More informationPHY 103: Standing Waves and Harmonics. Segev BenZvi Department of Physics and Astronomy University of Rochester
PHY 103: Standing Waves and Harmonics Segev BenZvi Department of Physics and Astronomy University of Rochester Sounds of the Universe NASA/JPL, September 2016 2 Properties of Waves Wavelength: λ, length
More informationThe Spectral Differences in Sounds and Harmonics. Through chemical and biological processes, we, as humans, can detect the movement and
John Chambers Semester Project 5/2/16 The Spectral Differences in Sounds and Harmonics Through chemical and biological processes, we, as humans, can detect the movement and vibrations of molecules through
More informationA523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 2011
A523 Signal Modeling, Statistical Inference and Data Mining in Astrophysics Spring 2011 Lecture 6 PDFs for Lecture 1-5 are on the web page Problem set 2 is on the web page Article on web page A Guided
More informationSignal Processing COS 323
Signal Processing COS 323 Digital Signals D: functions of space or time e.g., sound 2D: often functions of 2 spatial dimensions e.g. images 3D: functions of 3 spatial dimensions CAT, MRI scans or 2 space,
More information13.42 READING 6: SPECTRUM OF A RANDOM PROCESS 1. STATIONARY AND ERGODIC RANDOM PROCESSES
13.42 READING 6: SPECTRUM OF A RANDOM PROCESS SPRING 24 c A. H. TECHET & M.S. TRIANTAFYLLOU 1. STATIONARY AND ERGODIC RANDOM PROCESSES Given the random process y(ζ, t) we assume that the expected value
More informationrepresentation of speech
Digital Speech Processing Lectures 7-8 Time Domain Methods in Speech Processing 1 General Synthesis Model voiced sound amplitude Log Areas, Reflection Coefficients, Formants, Vocal Tract Polynomial, l
More informationSpeech Signal Representations
Speech Signal Representations Berlin Chen 2003 References: 1. X. Huang et. al., Spoken Language Processing, Chapters 5, 6 2. J. R. Deller et. al., Discrete-Time Processing of Speech Signals, Chapters 4-6
More informationIV. Covariance Analysis
IV. Covariance Analysis Autocovariance Remember that when a stochastic process has time values that are interdependent, then we can characterize that interdependency by computing the autocovariance function.
More informationAN INVERTIBLE DISCRETE AUDITORY TRANSFORM
COMM. MATH. SCI. Vol. 3, No. 1, pp. 47 56 c 25 International Press AN INVERTIBLE DISCRETE AUDITORY TRANSFORM JACK XIN AND YINGYONG QI Abstract. A discrete auditory transform (DAT) from sound signal to
More informationResponse-Field Dynamics in the Auditory Pathway
Response-Field Dynamics in the Auditory Pathway Didier Depireux Powen Ru Shihab Shamma Jonathan Simon Work supported by grants from the Office of Naval Research, a training grant from the National Institute
More information2A1H Time-Frequency Analysis II
2AH Time-Frequency Analysis II Bugs/queries to david.murray@eng.ox.ac.uk HT 209 For any corrections see the course page DW Murray at www.robots.ox.ac.uk/ dwm/courses/2tf. (a) A signal g(t) with period
More informationNicholas J. Giordano. Chapter 13 Sound
Nicholas J. Giordano www.cengage.com/physics/giordano Chapter 13 Sound Sound Sounds waves are an important example of wave motion Sound is central to hearing, speech, music and many other daily activities
More informationNearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis. Thomas Ewender
Nearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis Thomas Ewender Outline Motivation Detection algorithm of continuous F 0 contour Frame classification algorithm
More informationCorrelator I. Basics. Chapter Introduction. 8.2 Digitization Sampling. D. Anish Roshi
Chapter 8 Correlator I. Basics D. Anish Roshi 8.1 Introduction A radio interferometer measures the mutual coherence function of the electric field due to a given source brightness distribution in the sky.
More informationESS Dirac Comb and Flavors of Fourier Transforms
6. Dirac Comb and Flavors of Fourier ransforms Consider a periodic function that comprises pulses of amplitude A and duration τ spaced a time apart. We can define it over one period as y(t) = A, τ / 2
More informationMATHEMATICAL TOOLS FOR DIGITAL TRANSMISSION ANALYSIS
ch03.qxd 1/9/03 09:14 AM Page 35 CHAPTER 3 MATHEMATICAL TOOLS FOR DIGITAL TRANSMISSION ANALYSIS 3.1 INTRODUCTION The study of digital wireless transmission is in large measure the study of (a) the conversion
More informationComputational Auditory Scene Analysis Based on Loudness/Pitch/Timbre Decomposition
Computational Auditory Scene Analysis Based on Loudness/Pitch/Timbre Decomposition Mototsugu Abe and Shigeru Ando Department of Mathematical Engineering and Information Physics Faculty of Engineering,
More information2A1H Time-Frequency Analysis II Bugs/queries to HT 2011 For hints and answers visit dwm/courses/2tf
Time-Frequency Analysis II (HT 20) 2AH 2AH Time-Frequency Analysis II Bugs/queries to david.murray@eng.ox.ac.uk HT 20 For hints and answers visit www.robots.ox.ac.uk/ dwm/courses/2tf David Murray. A periodic
More informationFOURIER ANALYSIS. (a) Fourier Series
(a) Fourier Series FOURIER ANAYSIS (b) Fourier Transforms Useful books: 1. Advanced Mathematics for Engineers and Scientists, Schaum s Outline Series, M. R. Spiegel - The course text. We follow their notation
More informationIB Paper 6: Signal and Data Analysis
IB Paper 6: Signal and Data Analysis Handout 5: Sampling Theory S Godsill Signal Processing and Communications Group, Engineering Department, Cambridge, UK Lent 2015 1 / 85 Sampling and Aliasing All of
More informationA Spectral-Flatness Measure for Studying the Autocorrelation Method of Linear Prediction of Speech Analysis
A Spectral-Flatness Measure for Studying the Autocorrelation Method of Linear Prediction of Speech Analysis Authors: Augustine H. Gray and John D. Markel By Kaviraj, Komaljit, Vaibhav Spectral Flatness
More informationFig 1: Stationary and Non Stationary Time Series
Module 23 Independence and Stationarity Objective: To introduce the concepts of Statistical Independence, Stationarity and its types w.r.to random processes. This module also presents the concept of Ergodicity.
More informationNonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation
Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation Mikkel N. Schmidt and Morten Mørup Technical University of Denmark Informatics and Mathematical Modelling Richard
More informationCarrier frequency estimation. ELEC-E5410 Signal processing for communications
Carrier frequency estimation ELEC-E54 Signal processing for communications Contents. Basic system assumptions. Data-aided DA: Maximum-lielihood ML estimation of carrier frequency 3. Data-aided: Practical
More informationEA2.3 - Electronics 2 1
In the previous lecture, I talked about the idea of complex frequency s, where s = σ + jω. Using such concept of complex frequency allows us to analyse signals and systems with better generality. In this
More informationHomework 9 Solutions
8-290 Signals and Systems Profs. Byron Yu and Pulkit Grover Fall 207 Homework 9 Solutions Part One. (6 points) Compute the convolution of the following continuous-time aperiodic signals. (Hint: Use the
More informationNORWEGIAN UNIVERSITY OF SCIENCE AND TECHNOLOGY DEPARTMENT OF ELECTRONICS AND TELECOMMUNICATIONS
page 1 of 5 (+ appendix) NORWEGIAN UNIVERSITY OF SCIENCE AND TECHNOLOGY DEPARTMENT OF ELECTRONICS AND TELECOMMUNICATIONS Contact during examination: Name: Magne H. Johnsen Tel.: 73 59 26 78/930 25 534
More informationMusic Synthesis. synthesis. 1. NCTU/CSIE/ DSP Copyright 1996 C.M. LIU
Music Synthesis synthesis. 1 pintroduction pmodeling, Synthesis, and Overview padditive Synthesis psubtractive Synthesis pnonlinear Synthesis pwavetable Synthesis psummary and Conclusions 1. Introduction
More informationL7: Linear prediction of speech
L7: Linear prediction of speech Introduction Linear prediction Finding the linear prediction coefficients Alternative representations This lecture is based on [Dutoit and Marques, 2009, ch1; Taylor, 2009,
More informationTopic 7. Convolution, Filters, Correlation, Representation. Bryan Pardo, 2008, Northwestern University EECS 352: Machine Perception of Music and Audio
Topic 7 Convolution, Filters, Correlation, Representation Short time Fourier Transform Break signal into windows Calculate DFT of each window The Spectrogram spectrogram(y,1024,512,1024,fs,'yaxis'); A
More informationFourier Series and Transforms. Revision Lecture
E. (5-6) : / 3 Periodic signals can be written as a sum of sine and cosine waves: u(t) u(t) = a + n= (a ncosπnft+b n sinπnft) T = + T/3 T/ T +.65sin(πFt) -.6sin(πFt) +.6sin(πFt) + -.3cos(πFt) + T/ Fundamental
More informationQ. 1 Q. 25 carry one mark each.
GATE 5 SET- ELECTRONICS AND COMMUNICATION ENGINEERING - EC Q. Q. 5 carry one mark each. Q. The bilateral Laplace transform of a function is if a t b f() t = otherwise (A) a b s (B) s e ( a b) s (C) e as
More informationAcceleration, Velocity and Displacement Spectra Omega Arithmetic
Acceleration, Velocity and Displacement Spectra Omega Arithmetic 1 Acceleration, Velocity and Displacement Spectra Omega Arithmetic By Dr Colin Mercer, Technical Director, Prosig A ccelerometers are robust,
More informationLECTURE NOTES IN AUDIO ANALYSIS: PITCH ESTIMATION FOR DUMMIES
LECTURE NOTES IN AUDIO ANALYSIS: PITCH ESTIMATION FOR DUMMIES Abstract March, 3 Mads Græsbøll Christensen Audio Analysis Lab, AD:MT Aalborg University This document contains a brief introduction to pitch
More informationE&CE 358, Winter 2016: Solution #2. Prof. X. Shen
E&CE 358, Winter 16: Solution # Prof. X. Shen Email: xshen@bbcr.uwaterloo.ca Prof. X. Shen E&CE 358, Winter 16 ( 1:3-:5 PM: Solution # Problem 1 Problem 1 The signal g(t = e t, t T is corrupted by additive
More informationSinusoidal Modeling. Yannis Stylianou SPCC University of Crete, Computer Science Dept., Greece,
Sinusoidal Modeling Yannis Stylianou University of Crete, Computer Science Dept., Greece, yannis@csd.uoc.gr SPCC 2016 1 Speech Production 2 Modulators 3 Sinusoidal Modeling Sinusoidal Models Voiced Speech
More informationL8: Source estimation
L8: Source estimation Glottal and lip radiation models Closed-phase residual analysis Voicing/unvoicing detection Pitch detection Epoch detection This lecture is based on [Taylor, 2009, ch. 11-12] Introduction
More informationChapter 16 Waves in One Dimension
Chapter 16 Waves in One Dimension Slide 16-1 Reading Quiz 16.05 f = c Slide 16-2 Reading Quiz 16.06 Slide 16-3 Reading Quiz 16.07 Heavier portion looks like a fixed end, pulse is inverted on reflection.
More informationCombination Tones As a Phenomenon of the Central Level in Auditory System
P a g e 1 Combination Tones As a Phenomenon of the Central Level in Auditory System Tadeusz ZIĘBAKOWSKI West Pomeranian University of Technology Institute of Manufacturing Engineering Al. Piastów 19, 70-310
More informationSystem Identification & Parameter Estimation
System Identification & Parameter Estimation Wb3: SIPE lecture Correlation functions in time & frequency domain Alfred C. Schouten, Dept. of Biomechanical Engineering (BMechE), Fac. 3mE // Delft University
More informationThis examination consists of 10 pages. Please check that you have a complete copy. Time: 2.5 hrs INSTRUCTIONS
THE UNIVERSITY OF BRITISH COLUMBIA Department of Electrical and Computer Engineering EECE 564 Detection and Estimation of Signals in Noise Final Examination 08 December 2009 This examination consists of
More informationAn Efficient Pitch-Tracking Algorithm Using a Combination of Fourier Transforms
An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier Transforms Sylvain Marchand To cite this version: Sylvain Marchand. An Efficient Pitch-Tracking Algorithm Using a Combination of Fourier
More informationCOMP 546. Lecture 21. Cochlea to brain, Source Localization. Tues. April 3, 2018
COMP 546 Lecture 21 Cochlea to brain, Source Localization Tues. April 3, 2018 1 Ear pinna auditory canal cochlea outer middle inner 2 Eye Ear Lens? Retina? Photoreceptors (light -> chemical) Ganglion cells
More informationMathematical Foundations of Signal Processing
Mathematical Foundations of Signal Processing Module 4: Continuous-Time Systems and Signals Benjamín Béjar Haro Mihailo Kolundžija Reza Parhizkar Adam Scholefield October 24, 2016 Continuous Time Signals
More informationChapter 2: The Fourier Transform
EEE, EEE Part A : Digital Signal Processing Chapter Chapter : he Fourier ransform he Fourier ransform. Introduction he sampled Fourier transform of a periodic, discrete-time signal is nown as the discrete
More informationChapter 3 Convolution Representation
Chapter 3 Convolution Representation DT Unit-Impulse Response Consider the DT SISO system: xn [ ] System yn [ ] xn [ ] = δ[ n] If the input signal is and the system has no energy at n = 0, the output yn
More informationZeros of z-transform(zzt) representation and chirp group delay processing for analysis of source and filter characteristics of speech signals
Zeros of z-transformzzt representation and chirp group delay processing for analysis of source and filter characteristics of speech signals Baris Bozkurt 1 Collaboration with LIMSI-CNRS, France 07/03/2017
More informationDirect-Sequence Spread-Spectrum
Chapter 3 Direct-Sequence Spread-Spectrum In this chapter we consider direct-sequence spread-spectrum systems. Unlike frequency-hopping, a direct-sequence signal occupies the entire bandwidth continuously.
More informationFourier Analysis and Power Spectral Density
Chapter 4 Fourier Analysis and Power Spectral Density 4. Fourier Series and ransforms Recall Fourier series for periodic functions for x(t + ) = x(t), where x(t) = 2 a + a = 2 a n = 2 b n = 2 n= a n cos
More informationx(t+ δt) - x(t) = slope δt t+δt
Techniques of Physics Worksheet 2 Classical Vibrations and Waves Introduction You will have encountered many different examples of wave phenomena in your courses and should be familiar with most of the
More informationE2.5 Signals & Linear Systems. Tutorial Sheet 1 Introduction to Signals & Systems (Lectures 1 & 2)
E.5 Signals & Linear Systems Tutorial Sheet 1 Introduction to Signals & Systems (Lectures 1 & ) 1. Sketch each of the following continuous-time signals, specify if the signal is periodic/non-periodic,
More informationA Family of Nyquist Filters Based on Generalized Raised-Cosine Spectra
Proc. Biennial Symp. Commun. (Kingston, Ont.), pp. 3-35, June 99 A Family of Nyquist Filters Based on Generalized Raised-Cosine Spectra Nader Sheikholeslami Peter Kabal Department of Electrical Engineering
More informationA COMPUTATIONAL SOFTWARE FOR NOISE MEASUREMENT AND TOWARD ITS IDENTIFICATION
A COMPUTATIONAL SOFTWARE FOR NOISE MEASUREMENT AND TOWARD ITS IDENTIFICATION M. SAKURAI Yoshimasa Electronic Inc., Daiichi-Nishiwaki Bldg., -58-0 Yoyogi, Shibuya, Tokyo, 5-0053 Japan E-mail: saku@ymec.co.jp
More information2.1 Basic Concepts Basic operations on signals Classication of signals
Haberle³me Sistemlerine Giri³ (ELE 361) 9 Eylül 2017 TOBB Ekonomi ve Teknoloji Üniversitesi, Güz 2017-18 Dr. A. Melda Yüksel Turgut & Tolga Girici Lecture Notes Chapter 2 Signals and Linear Systems 2.1
More informationCochlear modeling and its role in human speech recognition
Allen/IPAM February 1, 2005 p. 1/3 Cochlear modeling and its role in human speech recognition Miller Nicely confusions and the articulation index Jont Allen Univ. of IL, Beckman Inst., Urbana IL Allen/IPAM
More informationDOING PHYSICS WITH MATLAB FOURIER ANALYSIS FOURIER TRANSFORMS
DOING PHYSICS WITH MATLAB FOURIER ANALYSIS FOURIER TRANSFORMS Ian Cooper School of Physics, University of Sydney ian.cooper@sydney.edu.au DOWNLOAD DIRECTORY FOR MATLAB SCRIPTS maths_ft_01.m mscript used
More informationRepresenting a Signal
The Fourier Series Representing a Signal The convolution method for finding the response of a system to an excitation takes advantage of the linearity and timeinvariance of the system and represents the
More informationAcoustic Quantities. LMS Test.Lab. Rev 12A
Acoustic Quantities LMS Test.Lab Rev 12A Copyright LMS International 2012 Table of Contents Chapter 1 Acoustic quantities... 5 Section 1.1 Sound power (P)... 5 Section 1.2 Sound pressure... 5 Section
More informationRadar Systems Engineering Lecture 3 Review of Signals, Systems and Digital Signal Processing
Radar Systems Engineering Lecture Review of Signals, Systems and Digital Signal Processing Dr. Robert M. O Donnell Guest Lecturer Radar Systems Course Review Signals, Systems & DSP // Block Diagram of
More informationCHAPTER 12 PROBLEMS AND EXERCISES
CHAPTER 1 PROBLEMS AND EXERCISES Problem 1: Consider a PWAS transducer of length l 7 mm, width b 1.65 mm, thickness t 0. mm, and material properties as given in Table 1.. The PWAS is bonded at one end
More informationTopic 3: Fourier Series (FS)
ELEC264: Signals And Systems Topic 3: Fourier Series (FS) o o o o Introduction to frequency analysis of signals CT FS Fourier series of CT periodic signals Signal Symmetry and CT Fourier Series Properties
More informationFunctions not limited in time
Functions not limited in time We can segment an arbitrary L 2 function into segments of width T. The mth segment is u m (t) = u(t)rect(t/t m). We then have m 0 u(t) =l.i.m. m0 u m (t) m= m 0 This wors
More informationPeriodicity and pitch perception
Periodicity and pitch perception John R. Pierce Center for Computer Research in Music and 4coustics, Department of Music, Stanford University, Stanford, California 94505 (Received 11 December 1990; revised
More informationCMPT 889: Lecture 2 Sinusoids, Complex Exponentials, Spectrum Representation
CMPT 889: Lecture 2 Sinusoids, Complex Exponentials, Spectrum Representation Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University September 26, 2005 1 Sinusoids Sinusoids
More information2 Frequency-Domain Analysis
2 requency-domain Analysis Electrical engineers live in the two worlds, so to speak, of time and frequency. requency-domain analysis is an extremely valuable tool to the communications engineer, more so
More informationLab 4: Quantization, Oversampling, and Noise Shaping
Lab 4: Quantization, Oversampling, and Noise Shaping Due Friday 04/21/17 Overview: This assignment should be completed with your assigned lab partner(s). Each group must turn in a report composed using
More informationDigital Communications
Digital Communications Chapter 9 Digital Communications Through Band-Limited Channels Po-Ning Chen, Professor Institute of Communications Engineering National Chiao-Tung University, Taiwan Digital Communications:
More informationEnvironmental Effects and Control of Noise Lecture 2
Noise, Vibration, Harshness Sound Quality Research Group NVH-SQ Group University of Windsor 92-455 Environmental Effects and Control of Noise Copyright 2015 Colin Novak Copyright 2015 by Colin Novak. All
More information200Pa 10million. Overview. Acoustics of Speech and Hearing. Loudness. Terms to describe sound. Matching Pressure to Loudness. Loudness vs.
Overview Acoustics of Speech and Hearing Lecture 1-2 How is sound pressure and loudness related? How can we measure the size (quantity) of a sound? The scale Logarithmic scales in general Decibel scales
More informationWEAK SIGNAL DETECTION BASED ON TWO DIMENSIONAL STOCHASTIC RESONANCE. Leonardo Barbini, Matthew O. T. Cole, Andrew J. Hillis, Jonathan L.
WEAK SIGNAL DETECTION BASED ON TWO DIMENSIONAL STOCHASTIC RESONANCE Leonardo Barbini, Matthew O. T. Cole, Andrew J. Hillis, Jonathan L. du Bois University of Bath Department of Mechanical Engineering Claverton
More informationCommunication Theory Summary of Important Definitions and Results
Signal and system theory Convolution of signals x(t) h(t) = y(t): Fourier Transform: Communication Theory Summary of Important Definitions and Results X(ω) = X(ω) = y(t) = X(ω) = j x(t) e jωt dt, 0 Properties
More informationSuperposition and Standing Waves
Physics 1051 Lecture 9 Superposition and Standing Waves Lecture 09 - Contents 14.5 Standing Waves in Air Columns 14.6 Beats: Interference in Time 14.7 Non-sinusoidal Waves Trivia Questions 1 How many wavelengths
More information