Lecture 5: Jan 19, 2005

Size: px
Start display at page:

Download "Lecture 5: Jan 19, 2005"

Transcription

1 EE516 Computer Speech Processing Winter 2005 Lecture 5: Jan 19, 2005 Lecturer: Prof: J. Bilmes University of Washington Dept. of Electrical Engineering Scribe: Kevin Duh 5.1 Overview In the last lecture, we studied the anatomy of the hearing organ (outer, middle, and inner ear) and two views of frequency encoding in the hair cells (Place Theory and Timing Theory). In this lecture, we look at two broad topics: 1. How sound and speech are perceived 2. How speech is produced In the first topic, we shall discover that sound/speech perception is not as simple as bunch of dancing hair cells. Instead, pyscho-acoustical experiments have shown complex phenomena in sound perception, such as hearing thresholds, non-linear relationship between intensity and loudness, and temporal/simultaneous masking. We also briefly discuss the difficulty of speech perception research and overview some notable experiments regarding intelligibility tests on spectrally-filtered speech and gaussian-scaled speech. In the second topic, we will develop a simple mathematical model for speech production based on the source-filter model. Specifically, we will model the vocal tract as an uniform lossless tube and derive the dynamics of acoustic pressure waves within it. As we shall see, this simple model elegantly explains the formant structure of the vowel schwa, and with some extensions, we can also model the vocal tract for other vowels. Finally, we also derive the 1-D wave equations that serve as the basis for the above lossless tube derivations. 5.2 Perception of Sound and Speech Sound Perception Thresholds of Hearing and Feeling The threshold of hearing is the minimum intensity at which one can perceive a sound; the threshold of feeling is the point where any increase in sound intensity begins to cause physical pain. As shown in Figure 1, these thresholds vary across frequencies. The intensity of sound is measured in terms of sound pressure level (SPL) in units of decibels (db). Note that the threshold of hearing decreases sharply as we progress from 20Hz to 2000Hz, but increases rapidly thereafter. This bandpass phenomenon is due to the filtering of the outer and middle ear, and the sensitivity of hair cells to different frequencies. From this curve, we can note the interesting fact that it is easier to hear female speakers at low volumes, since females have a pitch range corresponding to a lower threshold of hearing. (Recall that the F0 s of males and females are Hz and Hz, respectively.) 5-1

2 Lecture 5: Jan 19, The plot also shows the frequencies of speech, which are relatively constant with respect to the threshold of hearing. The threshold of hearing is higher for 7000Hz, but only fricatives contain significant amounts of these frequencies, which has little effect on speech intelligibility and naturalness. Some believe that evolution has adapted speech production to fit this region of auditory perception; others believe that the auditory system, which basically functions only as a hearing organ, is under less selective pressure than the vocal system, which simultaneously works as speech, eating, and smelling organs. Figure 5.1: Hearing thresholds across different frequencies and frequency range of speech. [O87] Loudness Intensity The perception of loudness is not the same as the raw intensity of sound pressure waves. Fletcher and Munson developed a plot (Figure 2) which shows that our perception of loudness varies across frequencies. Sound pressure waves at different intensities may be perceived similarly in terms of loudness, depending on the waves frequencies. This forms the equal loudness curve. Since intensity of sound pressure waves does not correspond to the perception of loudness, a phon is a defined as an unit of loudness. As shown in Figure 3, the equal loudness curves define the different levels of phons. Speech typically lies between 20 to 80 phons. Figure 5.2: Fletcher and Munson curves [O87]

3 Lecture 5: Jan 19, Figure 5.3: Phons and Equal Loudness Curves [O87] Masking Masking occurs when the perception of one sound is obscured by the presence of another sound. This occurs when one sound raises the threshold of hearing for the other sound. Two types of masking are: Simultaneous Masking two sounds occurring at once, and one masks the other. The lower frequency sound masks the higher frequency sound, and this typically occurs when both sounds fall under the same neural critical band. Temporal Masking two sounds occur in sequence in time, and one masks the other. If the earlier sound masks the later sound, this is known as forward temporal masking. The opposite case is called backward temporal masking. Forward masking can be explained by neural fatigue, where the neuron needs some time to recharge after firing at the first sound. Backward masking very likely involves a blocking phenomenon where the processing of the earlier sound at a higher auditory level is interrupted by a later, louder sound. Masking is widely exploited in auditory coding. Sounds that are masked do not need to be encoded because the listener will not be able to perceive them anyways. (However, a select few people with golden ears may be able to hear the additive noise created by such a lossy encoder.) Neural Tuning Curves A neural tuning curve shows the threshold of hearing for a single neuron. These curves are helpful in determining the critical band, the frequencies where a neuron is active. They can be determined by probing a single neuron and measuring the response in terms of spiking rate for a given stimulus. The neural tuning curve for probing an anesthetized cat is shown in Figure 4. These curves show the threshold where neurons begin to respond above spontaneous firing. From the figure, we observe that (1) neural tuning curves exhibit roughly constant Q, and (2) different neurons have different tuning curves and critical frequencies. This is one of the evidence that supports von Bekesey s Place Theory of Frequency Encoding. The pyschophysical tuning curve for humans can be obtained by using masking (Figure 5). One such procedure uses a fixed-level tone in narrowband noise (masker). The tone is fixed at a low level ( 10dB SPL) to ensure that only one auditory filter responds. Then, the masker frequency and intensity is adjusted to the points where the listener loses perception of the tone (due to the masking effect). These points form the upside-down critical frequency curve of that neuron.

4 Lecture 5: Jan 19, Figure 5.4: Neural tuning curves taken from anesthetized cats. Figure 5.5: Psychophysical tuning curves of humans Speech Perception Challenges in Speech Perception Research Speech perception occurs at a much higher level in the auditory cortex or brain. This is an active research area as not much is yet understood. The goal is to find invariant acoustic cues for different speech sounds. For example, what discriminates between phonemes, phones, syllables, words, phrases, or sentences? These are difficult to figure out since the acoustic cues for a sound changes depending on its context. For example, Figure 6 shows the formants of [di] and [du]. Although the phone [d] is the same in both cases, the contextual vowel following it changes the F2 drastically. Therefore, humans must be using more information than formants when perceiving speech.

5 Lecture 5: Jan 19, Figure 5.6: Formant transition is different depending on context. Example of [di], [du]. [LB88] Spectral Regions of Speech Perception The frequencies from 200Hz to 5500Hz are the most important for speech perception. Experiments in speech perception are usually performed by filtering out various spectral regions in a speech signal, then measuring intelligibility from test subjects. For example, after filtering out frequencies less than 1000Hz, the discriminability of voicing and manner of articulation decreases. (/b/ vs. /p/ vs. /v/). After filtering out frequencies above 1200Hz, place of articulation discriminability drops (/p/ vs. /t/). Note that in telephones, the bandwidth is Hz, which is good enough for speech intelligibility. Although the E-set (/p/, /d/, /e/, /g/, /c/) and fricatives like /f/ and /s/ may be difficult to discriminate, intelligibility is usually not a problem when words are spoken in context. Experiments in Gaussian scaled speech tests intelligibility by zeroing out windows of speech at different locations in time. The surprising finding is that if the Gaussian windows are short enough in duration, no particular zeroed location in the time/frequency axis contains the critical information that makes the speech incomprehensible. Apparently we can infer what is missing from the other sound segments. The bottom line is that humans are remarkably good at perceiving speech. Even in the presence of noise or missing sounds, we automatically use a variety of cues and context to determine the spoken speech. 5.3 Speech Production: A Mathematical Model of the Vocal Tract A mathematical model of the speech production process is important because it sheds light on the characteristics of speech and serves as the basis for speech analysis and synthesis. With a good model, we should be able to build good speech synthesizers. Further, if we can produce an inverse model that goes from the acoustic signal to model parameters, we should be able to do better speech recognition, as well. To begin our derivation of a mathematical model, we start with a joke: JOKE: A group of wealthy investors wanted to predict the outcome of horse races so they can become even richer. Therefore, they hired a group of eminent physicists around the world to research the issue. After a year, the physicists returned with the promise of a solution. They said they have developed a solution that would accurately predict the outcome of any race without fail! The investers were very eager in listening to this great discovery. So the head physicist reported, Well, first we need to simplify the problem. Assume the horse is a perfect sphere... What we will do in the following derivation is to assume that the vocal tract is an uniform lossless tube excited by

6 Lecture 5: Jan 19, a source. Comparing Figure 7 to Figure 8, we see that we have vastly simplified the situation. In the next lecture, we will extend the uniform tube derived here into a concatenation of tubes of varying sizes. It turns out that despite the simplifying assumptions, the models do shed light on the speech production process and in some cases perform adequatly in practice. Figure 5.7: Schematic of Human Vocal Tract [O87] Figure 5.8: Uniform Lossless Tube Vocal Tract Model

7 Lecture 5: Jan 19, Simplified Production Model: Overview Figure 5.9: Speech production as Source-Filter model. The source e(t) is the glottal pulse; The filter impulse response v(t) defines the resonances. The simplified production model we will adopt is a time-varying system (vocal tract) that is excited by a period source (glottis), as shown in Figure 9. The glottal source generates a sawtooth wave, which is superior to square waves and triangle waves in that it contains all the harmonics. The vocal tract is a time-varying filter that shapes the excitation signal with different formants. In other words, we have: S(jΩ) = V (jω)i(jω)g(jω) where V (jω) is the vocal tract response, G(jΩ) is the glottal pulse spectral shape, I(jΩ) is a periodic pulse train, and S(jΩ) is the output speech spectrum. Each of these models can be approximated separately. However, doing so makes an independence assumption between glottal excitement and vocal tract coloring, which is mostly but not entirely true. For example, in Lombard speech, the presence of noise affects speaker effort in an non-linear fashion. Also, in the extremely fast speech by Steve Woodmore (637 words/minute), there is little pitch variation, which suggests a coupling of the source and filter Glottal Excitation Model The glottal excitation e(t) is the convolution of a glottal pulse g(t), which has a specific spectral shape, with a periodic impulse train i(t). As seen in Figure 10, the glottal pulse looks like a half-wave rectified sine-wave, where closing phase occurs more rapid than the opening phase. This is because the Bernoulli force closes the glottis rapidly while the air pressure from the lungs opens the glottis gradually. As a result, we often model the glottal pulse as a sawtooth wave. Figure 5.10: Glottal pulse and mouth sound pressure [O87] The spectrum of a glottal pulse G(jΩ) is a low-pass signal with a cut-off frequency around 500Hz. The fall-off is about 12dB/Octave.

8 Lecture 5: Jan 19, Uniform Lossless Tube Vocal Tract Model After modeling the glottal source, we will now model the vocal tract filter. In the uniform lossless tube model (Figure 11) of the vocal tract, we assume the following: The vocal tract is a single tube (and therefore time-variation of the vocal tract shape is not captured.) The tube is not curved (which actually does not have a large effect.) Losses due to heat conduction, viscous friction at vocal tract walls, softness of vocal tract walls, etc. are ignored. Radiation of sound occurs only at the lips. There is no nasal coupling. Using the laws of conservation of mass, energy, and momentum, it can be shown that the acoustic waves in an uniform lossless tube satisfies the following 1-D wave equations: 2 p 2 = 1 ( 2 ) p c 2 t 2 (5.1) 2 u 2 = 1 c 2 ( 2 ) u t 2 (5.2) p = p(x, t): pressure as a function of position and time u = u(x, t): volume velocity of particles in the tube t: time x: particle position along the tube of length l c: speed of sound 340m/s Note: The wave equations are one-dimensional because we can assume planar wave propagation along the length of the tube when the wavelengths are sufficiently long compared to the diameter of the tube. (This is true in typical human vocal tracts for frequencies less than 4000Hz.) The general solution to these partial differential equations has the form: p(t, x) = p + (t x/c) + p (t + x/c) (5.3) u(t, x) = u + (t x/c) + u (t + x/c) (5.4) where p + and u+ are the forward-propagating waves, p and u are the backward-propagating waves, and both are arbitrary functions with constant first and second derivatives. One can imagine waves as propagating along the length of the tube through time without changing shape. Now we will find the vocal tract transfer function V (jω) to get a frequency domain understanding of the system. We begin by assuming the boundary condition at x = 0 is an excitation by a complex exponential: u(0, t) = u G (t) = U G (Ω)e jωt

9 Lecture 5: Jan 19, where U G (Ω) is the complex amplitude. As a result, the solutions to the forward and backward waves have the form: u + (t x/c) = K + e jω(t x/c) (5.5) u (t + x/c) = K e jω(t+x/c) (5.6) Substituting these equations in Eqs. 6.3 and 6.4 and applying the boundary condition where the pressue at the lips is zero (lips is open). p(l, t) = 0, we solve for the unknown constants K +, K, and arrive at the steady-state solutions: where Z 0 = ρc/a is the characteristic impedance of tube. sin(ω(l x)/c) p(x, t) = jz 0 U G (Ω) e jωt (5.7) cos(ωl/c) cos(ω(l x)/c) u(x, t) = U a (Ω) e jωt (5.8) cos(ωl/c) The volume velocity at the lips, which occurs at x = l, is therefore: u(l, t) = U(l, Ω)e jωt = U G(Ω)e jωt cos(ωl/c) (5.9) Finally, we can get the vocal tract transfer function: V (Ω) = U(l, Ω) U G (Ω) = 1 cos(ωl/c) (5.10) Figure 5.11: Resonant frequencies of an uniform tube Figure 11 shows a plot of V (Ω). This is an all-pole model, and the peaks occur when Ωl/c = (2n + 1)π/2. Thus, for an uniform tube model, the resonance frequencies are at evenly spaced frequencies f n = (2n+1)πc 2l, n = 0, ±1, ±2,... The spacing decreases as vocal tract length l increases. The resonant frequencies are analagous to the formats we see in speech, and the single tube model actually models the vowel schwa relatively well. As we shall see in the next lecture, differently-spaced resonant frequencies corresponding to different speech sounds can be modeled by concatenating uniform tubes of various sizes to approximate a varying vocal tract.

10 Lecture 5: Jan 19, Derivation of 1-D Wave Equations The 1-D wave equations (Eqs. 6.1, 6.2) we used for the single lossless tube model is not difficult to derive. The outline of our derivation is as follows: 1. Approximate particles in the tube as infinitesimally small boxes and use Newton s 2nd Law to model its motions. The result is p = ρ 0 v t. 2. Use the Adiabatic Ideal Gas Law to model the change of volume of the box. The result is 2 p = 1 2 p 2 c 2 t Do the same to get 2 u 2 = 1 c 2 2 u t 2. In the vocal tract, particles move when pressure is applied in some direction. To model this, we will consider a infinitesimally small cube of volume V = x y z as shown in Figure 12. The glottis produces air pressure from the left end of the tube, so there is a force that pushes the box to the right. This force can be described by Newton s Second Law of Motion, F = ma. This model works well for sounds in the audible range (e.g. Not more than 110dB SPL). Figure 5.12: Box in tube model. Pressure comes from the x=0 end. Since pressure is defined as the force over the area (p = f/a), we have the force causing acceleration in the negative x direction on the cube as f l = p A = ( p x) ( y z). The force causing a positive x acceleration is therefore the negative: Using Newton s Second Law, the force per unit volume is: f = ( p x) ( y z). (5.11) f V = Ma V = M V u t = ρ 0 u t (5.12) where M is the mass of the box, u is the volume velocity of the gas, and ρ 0 = M/V is the density of the box. From Eq, 6.11, we have p the derivation: = f x y z = f V. Combining this with Eq. 6.12, we get our desired result for step 1 of

11 Lecture 5: Jan 19, p v = ρ t (5.13) Next we proceed to step 2, where we apply the Ideal Gas Law to derive the second derivative equation for pressure. Recall that the Charles-Boyle Ideal Gas Law states that P V = nrt, where P : pressure V : volume n: number of moles in the volume R: universal gas constant T : temperature in Kelvins In general, this law describes the relationship between pressure and volume, depending on how temperature changes. There are two types of thermal conditions: Isothermal: The gas remains at constant temperature, which occurs if the change in pressure/volume is slow (quasi-static). The result is P V = constant. Adiabatic: No heat flows in or out of the system. This occurs if 1) the system is well insulated, and 2) the pressure/volume change happens so fast that heat has no time to flow in or out. This effect is dependent on the wavelenth of a typical sound. The wavelength of an example speech sound at 1000Hz is λ = c f = 340m/s 1000s 1 = 0.34m, so the length of a half-cycle is 0.17m. Heat diffusion occurs approximately at the rate of 0.5m/s. A half-cycle at 1000Hz lasts 0.5ms, so heat travels only 2.5x10 4 m. This is much smaller than the half-cycle of the speech sound, so the process in the vocal tract is essentially adiabatic. The Adiabatic Gas law is defined as where the heat capacity γ = P V γ = constant (5.14) specific heat capacity at constant pressure specific heat capacity at constant volume, which is 1.4 in air. We want the differential form of the adiabatic gas law, so we will first take the logarithm to get: ln(p V γ ) = constant ln P + γ ln V = constant Then we take take the derivative and rearrange to get the differential form: dp P = γ dv V which can be approximated as

12 Lecture 5: Jan 19, p P 0 = γ V V 0 (5.15) Here, P 0 is the undisturbed pressure and p is the incremental pressure from the wave; V 0 is the undisturbed volume and V is the incremental volume from the wave. Changing V to τ for notational simplicity and taking the derivative of time, we arrive at: 1 p P 0 t = γ τ V 0 t (5.16) Figure 5.13: Left: before deformation; Right: after deformation. Note the incremental volume. Now we look at the incremental volume of the box in the tube. We assume that the mass in the box remains constant (conservation of mass) while the side of the box extends due to pressure change. As shown in Figure 13, the incremental volume is related to ξ x, the displacement/extension of the box at one side. Subtracting the volumes of boxes in Figure 13 ( ( x + ξx x) y z and x y z ), the incremental volume is simply Taking the derivative with respect to time, we have τ = ξ x x y z = V 0 ξ x τ t = V 0 ξ x ξ x = V 0 t ξ x = V 0 t u = V 0 (5.17) Note the last step in equation (Eq. 6.17) follows because u, the instantaneous particle velocity, must be equivalent to the instantaneous velocity of the box s edge ( ξx t ) due to conservation of mass. Finally, we combine the equations we derived previously ( Eqs. 6.13, 6.16, 6.17 ) for the final result. Substituting Eq in Eq and taking derivative, we have:

13 Lecture 5: Jan 19, p t = γp u 0 2 p t 2 = γp 2 u 0 t (5.18) Taking the derivative (with respect of x) of Eq. 6.13, then substituting in Eq. 6.18, we get: 2 p 2 u 2 = ρ 0 t = ρ 0 2 p γp 0 t 2 Finally, observing that γp0 ρ 0 = c 2, where c is the speed of sound in the gas, we have our 1-D wave equation for pressure: 2 p 2 = 1 c 2 ( 2 ) p t 2 To find the wave equation for volume velocity, we take derivative of Eq with respect to x and proceed similarly. In the end, we have two 1-D wave equations describing the dynamics of sound pressure and volume velocity inside an uniform lossless tube: 2 p 2 = 1 c 2 2 u 2 = 1 c 2 ( 2 ) p t 2 ) ( 2 u t 2 References [DHP99] J. R. DELLER and J. H. L. Hansen and J. G. Proakis, Discrete-Time Processing of Speech Signals, Wiley-IEEE Press, [HAH01] X. HUANG, A. ACERO and H. HON, Spoken Language Processing, Prentice Hall PTR, 2001 [LB88] P. LIEBERMAN abd S. BLUMSTEIN Speech Physiology, Speech Perception, and Acoustic Phonetics Cambridge University Press, 1988 [O87] D. O SHAUGHNESSY, Speech Communications: Human and Machine, Addison-Wesley, 1987 [RS78] L. R. RABINER and R. W. SCHAFER, Digital Processing of Speech Signals, Prentice Hall, New Jersey, 1978.

SPEECH COMMUNICATION 6.541J J-HST710J Spring 2004

SPEECH COMMUNICATION 6.541J J-HST710J Spring 2004 6.541J PS3 02/19/04 1 SPEECH COMMUNICATION 6.541J-24.968J-HST710J Spring 2004 Problem Set 3 Assigned: 02/19/04 Due: 02/26/04 Read Chapter 6. Problem 1 In this problem we examine the acoustic and perceptual

More information

COMP 546, Winter 2018 lecture 19 - sound 2

COMP 546, Winter 2018 lecture 19 - sound 2 Sound waves Last lecture we considered sound to be a pressure function I(X, Y, Z, t). However, sound is not just any function of those four variables. Rather, sound obeys the wave equation: 2 I(X, Y, Z,

More information

Sound 2: frequency analysis

Sound 2: frequency analysis COMP 546 Lecture 19 Sound 2: frequency analysis Tues. March 27, 2018 1 Speed of Sound Sound travels at about 340 m/s, or 34 cm/ ms. (This depends on temperature and other factors) 2 Wave equation Pressure

More information

L7: Linear prediction of speech

L7: Linear prediction of speech L7: Linear prediction of speech Introduction Linear prediction Finding the linear prediction coefficients Alternative representations This lecture is based on [Dutoit and Marques, 2009, ch1; Taylor, 2009,

More information

Lecture 3: Acoustics

Lecture 3: Acoustics CSC 83060: Speech & Audio Understanding Lecture 3: Acoustics Michael Mandel mim@sci.brooklyn.cuny.edu CUNY Graduate Center, Computer Science Program http://mr-pc.org/t/csc83060 With much content from Dan

More information

Linear Prediction 1 / 41

Linear Prediction 1 / 41 Linear Prediction 1 / 41 A map of speech signal processing Natural signals Models Artificial signals Inference Speech synthesis Hidden Markov Inference Homomorphic processing Dereverberation, Deconvolution

More information

Speech Signal Representations

Speech Signal Representations Speech Signal Representations Berlin Chen 2003 References: 1. X. Huang et. al., Spoken Language Processing, Chapters 5, 6 2. J. R. Deller et. al., Discrete-Time Processing of Speech Signals, Chapters 4-6

More information

L8: Source estimation

L8: Source estimation L8: Source estimation Glottal and lip radiation models Closed-phase residual analysis Voicing/unvoicing detection Pitch detection Epoch detection This lecture is based on [Taylor, 2009, ch. 11-12] Introduction

More information

The effect of speaking rate and vowel context on the perception of consonants. in babble noise

The effect of speaking rate and vowel context on the perception of consonants. in babble noise The effect of speaking rate and vowel context on the perception of consonants in babble noise Anirudh Raju Department of Electrical Engineering, University of California, Los Angeles, California, USA anirudh90@ucla.edu

More information

Physical Acoustics. Hearing is the result of a complex interaction of physics, physiology, perception and cognition.

Physical Acoustics. Hearing is the result of a complex interaction of physics, physiology, perception and cognition. Physical Acoustics Hearing, auditory perception, or audition is the ability to perceive sound by detecting vibrations, changes in the pressure of the surrounding medium through time, through an organ such

More information

Introduction to Acoustics Exercises

Introduction to Acoustics Exercises . 361-1-3291 Introduction to Acoustics Exercises 1 Fundamentals of acoustics 1. Show the effect of temperature on acoustic pressure. Hint: use the equation of state and the equation of state at equilibrium.

More information

Signal Modeling Techniques in Speech Recognition. Hassan A. Kingravi

Signal Modeling Techniques in Speech Recognition. Hassan A. Kingravi Signal Modeling Techniques in Speech Recognition Hassan A. Kingravi Outline Introduction Spectral Shaping Spectral Analysis Parameter Transforms Statistical Modeling Discussion Conclusions 1: Introduction

More information

4.2 Acoustics of Speech Production

4.2 Acoustics of Speech Production 4.2 Acoustics of Speech Production Acoustic phonetics is a field that studies the acoustic properties of speech and how these are related to the human speech production system. The topic is vast, exceeding

More information

Source/Filter Model. Markus Flohberger. Acoustic Tube Models Linear Prediction Formant Synthesizer.

Source/Filter Model. Markus Flohberger. Acoustic Tube Models Linear Prediction Formant Synthesizer. Source/Filter Model Acoustic Tube Models Linear Prediction Formant Synthesizer Markus Flohberger maxiko@sbox.tugraz.at Graz, 19.11.2003 2 ACOUSTIC TUBE MODELS 1 Introduction Speech synthesis methods that

More information

Feature extraction 1

Feature extraction 1 Centre for Vision Speech & Signal Processing University of Surrey, Guildford GU2 7XH. Feature extraction 1 Dr Philip Jackson Cepstral analysis - Real & complex cepstra - Homomorphic decomposition Filter

More information

Lecture 2: Acoustics. Acoustics & sound

Lecture 2: Acoustics. Acoustics & sound EE E680: Speech & Audio Processing & Recognition Lecture : Acoustics 1 3 4 The wave equation Acoustic tubes: reflections & resonance Oscillations & musical acoustics Spherical waves & room acoustics Dan

More information

Signal representations: Cepstrum

Signal representations: Cepstrum Signal representations: Cepstrum Source-filter separation for sound production For speech, source corresponds to excitation by a pulse train for voiced phonemes and to turbulence (noise) for unvoiced phonemes,

More information

Last time: small acoustics

Last time: small acoustics Last time: small acoustics Voice, many instruments, modeled by tubes Traveling waves in both directions yield standing waves Standing waves correspond to resonances Variations from the idealization give

More information

Fundamentals of Acoustics

Fundamentals of Acoustics Fundamentals of Acoustics Introductory Course on Multiphysics Modelling TOMASZ G. ZIELIŃSKI bluebox.ippt.pan.pl/ tzielins/ Institute of Fundamental Technological Research of the Polish Academy of Sciences

More information

Resonances and mode shapes of the human vocal tract during vowel production

Resonances and mode shapes of the human vocal tract during vowel production Resonances and mode shapes of the human vocal tract during vowel production Atle Kivelä, Juha Kuortti, Jarmo Malinen Aalto University, School of Science, Department of Mathematics and Systems Analysis

More information

SPEECH ANALYSIS AND SYNTHESIS

SPEECH ANALYSIS AND SYNTHESIS 16 Chapter 2 SPEECH ANALYSIS AND SYNTHESIS 2.1 INTRODUCTION: Speech signal analysis is used to characterize the spectral information of an input speech signal. Speech signal analysis [52-53] techniques

More information

Cochlear modeling and its role in human speech recognition

Cochlear modeling and its role in human speech recognition Allen/IPAM February 1, 2005 p. 1/3 Cochlear modeling and its role in human speech recognition Miller Nicely confusions and the articulation index Jont Allen Univ. of IL, Beckman Inst., Urbana IL Allen/IPAM

More information

3 THE P HYSICS PHYSICS OF SOUND

3 THE P HYSICS PHYSICS OF SOUND Chapter 3 THE PHYSICS OF SOUND Contents What is sound? What is sound wave?» Wave motion» Source & Medium How do we analyze sound?» Classifications» Fourier Analysis How do we measure sound?» Amplitude,

More information

L6: Short-time Fourier analysis and synthesis

L6: Short-time Fourier analysis and synthesis L6: Short-time Fourier analysis and synthesis Overview Analysis: Fourier-transform view Analysis: filtering view Synthesis: filter bank summation (FBS) method Synthesis: overlap-add (OLA) method STFT magnitude

More information

Automatic Speech Recognition (CS753)

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 12: Acoustic Feature Extraction for ASR Instructor: Preethi Jyothi Feb 13, 2017 Speech Signal Analysis Generate discrete samples A frame Need to focus on short

More information

Chapter 9. Linear Predictive Analysis of Speech Signals 语音信号的线性预测分析

Chapter 9. Linear Predictive Analysis of Speech Signals 语音信号的线性预测分析 Chapter 9 Linear Predictive Analysis of Speech Signals 语音信号的线性预测分析 1 LPC Methods LPC methods are the most widely used in speech coding, speech synthesis, speech recognition, speaker recognition and verification

More information

COMP 546. Lecture 21. Cochlea to brain, Source Localization. Tues. April 3, 2018

COMP 546. Lecture 21. Cochlea to brain, Source Localization. Tues. April 3, 2018 COMP 546 Lecture 21 Cochlea to brain, Source Localization Tues. April 3, 2018 1 Ear pinna auditory canal cochlea outer middle inner 2 Eye Ear Lens? Retina? Photoreceptors (light -> chemical) Ganglion cells

More information

Frequency Domain Speech Analysis

Frequency Domain Speech Analysis Frequency Domain Speech Analysis Short Time Fourier Analysis Cepstral Analysis Windowed (short time) Fourier Transform Spectrogram of speech signals Filter bank implementation* (Real) cepstrum and complex

More information

Measures and sources of sound. Professor Phil Joseph

Measures and sources of sound. Professor Phil Joseph Measures and sources of sound Professor Phil Joseph WHAT IS NOISE? Sound may be defined as a time-varying fluctuation of the density of a fluid from its ambient (mean) state, which is accompanied by a

More information

Introduction to Acoustics. Phil Joseph

Introduction to Acoustics. Phil Joseph Introduction to Acoustics Phil Joseph INTRODUCTION TO ACOUSTICS Sound and Noise Sound waves Frequency, wavelength and wavespeed Point sources Sound power and intensity Wave reflection Standing waves Measures

More information

1 Wind Turbine Acoustics. Wind turbines generate sound by both mechanical and aerodynamic

1 Wind Turbine Acoustics. Wind turbines generate sound by both mechanical and aerodynamic Wind Turbine Acoustics 1 1 Wind Turbine Acoustics Wind turbines generate sound by both mechanical and aerodynamic sources. Sound remains an important criterion used in the siting of wind farms. Sound emission

More information

PHYSICS 220. Lecture 21. Textbook Sections Lecture 21 Purdue University, Physics 220 1

PHYSICS 220. Lecture 21. Textbook Sections Lecture 21 Purdue University, Physics 220 1 PHYSICS 220 Lecture 21 Sound Textbook Sections 13.1 13.7 Lecture 21 Purdue University, Physics 220 1 Overview Last Lecture Interference and Diffraction Constructive, destructive Diffraction: bending of

More information

200Pa 10million. Overview. Acoustics of Speech and Hearing. Loudness. Terms to describe sound. Matching Pressure to Loudness. Loudness vs.

200Pa 10million. Overview. Acoustics of Speech and Hearing. Loudness. Terms to describe sound. Matching Pressure to Loudness. Loudness vs. Overview Acoustics of Speech and Hearing Lecture 1-2 How is sound pressure and loudness related? How can we measure the size (quantity) of a sound? The scale Logarithmic scales in general Decibel scales

More information

Sound, acoustics Slides based on: Rossing, The science of sound, 1990, and Pulkki, Karjalainen, Communication acoutics, 2015

Sound, acoustics Slides based on: Rossing, The science of sound, 1990, and Pulkki, Karjalainen, Communication acoutics, 2015 Acoustics 1 Sound, acoustics Slides based on: Rossing, The science of sound, 1990, and Pulkki, Karjalainen, Communication acoutics, 2015 Contents: 1. Introduction 2. Vibrating systems 3. Waves 4. Resonance

More information

Robust Speaker Identification

Robust Speaker Identification Robust Speaker Identification by Smarajit Bose Interdisciplinary Statistical Research Unit Indian Statistical Institute, Kolkata Joint work with Amita Pal and Ayanendranath Basu Overview } } } } } } }

More information

Lab 9a. Linear Predictive Coding for Speech Processing

Lab 9a. Linear Predictive Coding for Speech Processing EE275Lab October 27, 2007 Lab 9a. Linear Predictive Coding for Speech Processing Pitch Period Impulse Train Generator Voiced/Unvoiced Speech Switch Vocal Tract Parameters Time-Varying Digital Filter H(z)

More information

CMPT 889: Lecture 9 Wind Instrument Modelling

CMPT 889: Lecture 9 Wind Instrument Modelling CMPT 889: Lecture 9 Wind Instrument Modelling Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University November 20, 2006 1 Scattering Scattering is a phenomenon in which the

More information

Linear Prediction Coding. Nimrod Peleg Update: Aug. 2007

Linear Prediction Coding. Nimrod Peleg Update: Aug. 2007 Linear Prediction Coding Nimrod Peleg Update: Aug. 2007 1 Linear Prediction and Speech Coding The earliest papers on applying LPC to speech: Atal 1968, 1970, 1971 Markel 1971, 1972 Makhoul 1975 This is

More information

Lecture 18. Waves and Sound

Lecture 18. Waves and Sound Lecture 18 Waves and Sound Today s Topics: Nature o Waves Periodic Waves Wave Speed The Nature o Sound Speed o Sound Sound ntensity The Doppler Eect Disturbance Wave Motion DEMO: Rope A wave is a traveling

More information

Numerical Simulation of Acoustic Characteristics of Vocal-tract Model with 3-D Radiation and Wall Impedance

Numerical Simulation of Acoustic Characteristics of Vocal-tract Model with 3-D Radiation and Wall Impedance APSIPA AS 2011 Xi an Numerical Simulation of Acoustic haracteristics of Vocal-tract Model with 3-D Radiation and Wall Impedance Hiroki Matsuzaki and Kunitoshi Motoki Hokkaido Institute of Technology, Sapporo,

More information

Feature extraction 2

Feature extraction 2 Centre for Vision Speech & Signal Processing University of Surrey, Guildford GU2 7XH. Feature extraction 2 Dr Philip Jackson Linear prediction Perceptual linear prediction Comparison of feature methods

More information

Proc. of NCC 2010, Chennai, India

Proc. of NCC 2010, Chennai, India Proc. of NCC 2010, Chennai, India Trajectory and surface modeling of LSF for low rate speech coding M. Deepak and Preeti Rao Department of Electrical Engineering Indian Institute of Technology, Bombay

More information

Nicholas J. Giordano. Chapter 13 Sound

Nicholas J. Giordano.  Chapter 13 Sound Nicholas J. Giordano www.cengage.com/physics/giordano Chapter 13 Sound Sound Sounds waves are an important example of wave motion Sound is central to hearing, speech, music and many other daily activities

More information

Thermodynamics continued

Thermodynamics continued Chapter 15 Thermodynamics continued 15 Work The area under a pressure-volume graph is the work for any kind of process. B Pressure A W AB W AB is positive here volume increases Volume Clicker Question

More information

Time-domain representations

Time-domain representations Time-domain representations Speech Processing Tom Bäckström Aalto University Fall 2016 Basics of Signal Processing in the Time-domain Time-domain signals Before we can describe speech signals or modelling

More information

Sound impulse. r = (X X 0 ) 2 + (Y Y 0 ) 2 + (Z Z 0 ) 2. and

Sound impulse. r = (X X 0 ) 2 + (Y Y 0 ) 2 + (Z Z 0 ) 2. and Sound impulse Consider an isolated perturbation of air pressure at 3D point (X o, Y o, Z o ) and at time t = t 0, for example, due to some impact. Or, you can imagine a digital sound generation system

More information

AN INVERTIBLE DISCRETE AUDITORY TRANSFORM

AN INVERTIBLE DISCRETE AUDITORY TRANSFORM COMM. MATH. SCI. Vol. 3, No. 1, pp. 47 56 c 25 International Press AN INVERTIBLE DISCRETE AUDITORY TRANSFORM JACK XIN AND YINGYONG QI Abstract. A discrete auditory transform (DAT) from sound signal to

More information

CMPT 889: Lecture 8 Digital Waveguides

CMPT 889: Lecture 8 Digital Waveguides CMPT 889: Lecture 8 Digital Waveguides Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser University February 10, 2012 1 Motion for a Wave For the string, we are interested in the

More information

Chapters 11 and 12. Sound and Standing Waves

Chapters 11 and 12. Sound and Standing Waves Chapters 11 and 12 Sound and Standing Waves The Nature of Sound Waves LONGITUDINAL SOUND WAVES Speaker making sound waves in a tube The Nature of Sound Waves The distance between adjacent condensations

More information

HW Set 2 Solutions. Problem 1

HW Set 2 Solutions. Problem 1 HW Set 2 Solutions. Note: problems 1 and 6 are quite long, and therefore cannot come up on the Exam as such. But the many skills exercised in these problems will be needed for dealing with some related,

More information

Determining the Shape of a Human Vocal Tract From Pressure Measurements at the Lips

Determining the Shape of a Human Vocal Tract From Pressure Measurements at the Lips Determining the Shape of a Human Vocal Tract From Pressure Measurements at the Lips Tuncay Aktosun Technical Report 2007-01 http://www.uta.edu/math/preprint/ Determining the shape of a human vocal tract

More information

Lecture 14 1/38 Phys 220. Final Exam. Wednesday, August 6 th 10:30 am 12:30 pm Phys multiple choice problems (15 points each 300 total)

Lecture 14 1/38 Phys 220. Final Exam. Wednesday, August 6 th 10:30 am 12:30 pm Phys multiple choice problems (15 points each 300 total) Lecture 14 1/38 Phys 220 Final Exam Wednesday, August 6 th 10:30 am 12:30 pm Phys 114 20 multiple choice problems (15 points each 300 total) 75% will be from Chapters 10-16 25% from Chapters 1-9 Students

More information

Timbral, Scale, Pitch modifications

Timbral, Scale, Pitch modifications Introduction Timbral, Scale, Pitch modifications M2 Mathématiques / Vision / Apprentissage Audio signal analysis, indexing and transformation Page 1 / 40 Page 2 / 40 Modification of playback speed Modifications

More information

Sinusoidal Modeling. Yannis Stylianou SPCC University of Crete, Computer Science Dept., Greece,

Sinusoidal Modeling. Yannis Stylianou SPCC University of Crete, Computer Science Dept., Greece, Sinusoidal Modeling Yannis Stylianou University of Crete, Computer Science Dept., Greece, yannis@csd.uoc.gr SPCC 2016 1 Speech Production 2 Modulators 3 Sinusoidal Modeling Sinusoidal Models Voiced Speech

More information

logarithmic a ratio gain or loss. not an absolute value

logarithmic a ratio gain or loss. not an absolute value Class #2 DECIBELS The Decibel The db is a logarithmic unit used to describe a ratio (between two sound pressure levels, power levels, voltages, currents, and many others). We use the terms gain or loss.

More information

Final Exam Notes 8am WednesdayDecember 16, 2015 Physics 1320 Music & Physics Prof. Tunks & Olness

Final Exam Notes 8am WednesdayDecember 16, 2015 Physics 1320 Music & Physics Prof. Tunks & Olness Final Exam Notes 8am WednesdayDecember 16, 2015 Physics 1320 Music & Physics Prof. Tunks & Olness 1)This contains review material for the Final exam. It consists of review sheets for exams 1 and 2, as

More information

OPAC102. The Acoustic Wave Equation

OPAC102. The Acoustic Wave Equation OPAC102 The Acoustic Wave Equation Acoustic waves in fluid Acoustic waves constitute one kind of pressure fluctuation that can exist in a compressible fluid. The restoring forces responsible for propagating

More information

Zeros of z-transform(zzt) representation and chirp group delay processing for analysis of source and filter characteristics of speech signals

Zeros of z-transform(zzt) representation and chirp group delay processing for analysis of source and filter characteristics of speech signals Zeros of z-transformzzt representation and chirp group delay processing for analysis of source and filter characteristics of speech signals Baris Bozkurt 1 Collaboration with LIMSI-CNRS, France 07/03/2017

More information

Scattering. N-port Parallel Scattering Junctions. Physical Variables at the Junction. CMPT 889: Lecture 9 Wind Instrument Modelling

Scattering. N-port Parallel Scattering Junctions. Physical Variables at the Junction. CMPT 889: Lecture 9 Wind Instrument Modelling Scattering CMPT 889: Lecture 9 Wind Instrument Modelling Tamara Smyth, tamaras@cs.sfu.ca School of Computing Science, Simon Fraser niversity November 2, 26 Scattering is a phenomenon in which the wave

More information

Quarterly Progress and Status Report. On the synthesis and perception of voiceless fricatives

Quarterly Progress and Status Report. On the synthesis and perception of voiceless fricatives Dept. for Speech, Music and Hearing Quarterly Progress and Status Report On the synthesis and perception of voiceless fricatives Mártony, J. journal: STLQPSR volume: 3 number: 1 year: 1962 pages: 017022

More information

representation of speech

representation of speech Digital Speech Processing Lectures 7-8 Time Domain Methods in Speech Processing 1 General Synthesis Model voiced sound amplitude Log Areas, Reflection Coefficients, Formants, Vocal Tract Polynomial, l

More information

LECTURE NOTES IN AUDIO ANALYSIS: PITCH ESTIMATION FOR DUMMIES

LECTURE NOTES IN AUDIO ANALYSIS: PITCH ESTIMATION FOR DUMMIES LECTURE NOTES IN AUDIO ANALYSIS: PITCH ESTIMATION FOR DUMMIES Abstract March, 3 Mads Græsbøll Christensen Audio Analysis Lab, AD:MT Aalborg University This document contains a brief introduction to pitch

More information

Chapter 1 Fundamentals of Sound Waves -1

Chapter 1 Fundamentals of Sound Waves -1 Chapter 1 Fundamentals of Sound Waves -1 Sound Sections What is Sound? Producing a Sound Wave Characteristics of Sound Waves The Speed of Sound Intensity of Sound Waves What is Sound? Sound is such a common

More information

Computational Perception. Sound Localization 1

Computational Perception. Sound Localization 1 Computational Perception 15-485/785 January 17, 2008 Sound Localization 1 Orienting sound localization visual pop-out eye/body movements attentional shift 2 The Problem of Sound Localization What are the

More information

Lecture 5: GMM Acoustic Modeling and Feature Extraction

Lecture 5: GMM Acoustic Modeling and Feature Extraction CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 5: GMM Acoustic Modeling and Feature Extraction Original slides by Dan Jurafsky Outline for Today Acoustic

More information

S=z y. p(x, t) v. = ρ x (x,t)

S=z y. p(x, t) v. = ρ x (x,t) LTUR 2: One-imensional Traveling Waves Main Points - xponential and sine-wave solutions to the one-dimensional wave equation. - The distributed compressibility and mass in acoustic plane waves are analogous

More information

Physical and Biological Properties of Agricultural Products Acoustic, Electrical and Optical Properties and Biochemical Property

Physical and Biological Properties of Agricultural Products Acoustic, Electrical and Optical Properties and Biochemical Property Physical and Biological Properties of Agricultural Products Acoustic, Electrical and Optical Properties and Biochemical Property 1. Acoustic and Vibrational Properties 1.1 Acoustics and Vibration Engineering

More information

Fall 2004 Physics 3 Tu-Th Section

Fall 2004 Physics 3 Tu-Th Section Fall 2004 Physics 3 Tu-Th Section Claudio Campagnari Lecture 3: 30 Sep. 2004 Web page: http://hep.ucsb.edu/people/claudio/ph3-04/ 1 Sound Sound = longitudinal wave in a medium. The medium can be anything:

More information

Chapter 17: Waves II. Sound waves are one example of Longitudinal Waves. Sound waves are pressure waves: Oscillations in air pressure and air density

Chapter 17: Waves II. Sound waves are one example of Longitudinal Waves. Sound waves are pressure waves: Oscillations in air pressure and air density Sound waves are one example of Longitudinal Waves Sound waves are pressure waves: Oscillations in air pressure and air density Before we can understand pressure waves in detail, we need to understand what

More information

Sound Propagation through Media. Nachiketa Tiwari Indian Institute of Technology Kanpur

Sound Propagation through Media. Nachiketa Tiwari Indian Institute of Technology Kanpur Sound Propagation through Media Nachiketa Tiwari Indian Institute of Technology Kanpur LECTURE-17 HARMONIC PLANE WAVES Introduction In this lecture, we discuss propagation of 1-D planar waves as they travel

More information

PHY 103 Impedance. Segev BenZvi Department of Physics and Astronomy University of Rochester

PHY 103 Impedance. Segev BenZvi Department of Physics and Astronomy University of Rochester PHY 103 Impedance Segev BenZvi Department of Physics and Astronomy University of Rochester Midterm Exam Proposed date: in class, Thursday October 20 Will be largely conceptual with some basic arithmetic

More information

THE SYRINX: NATURE S HYBRID WIND INSTRUMENT

THE SYRINX: NATURE S HYBRID WIND INSTRUMENT THE SYRINX: NATURE S HYBRID WIND INSTRUMENT Tamara Smyth, Julius O. Smith Center for Computer Research in Music and Acoustics Department of Music, Stanford University Stanford, California 94305-8180 USA

More information

APPENDIX B. Noise Primer

APPENDIX B. Noise Primer APPENDIX B Noise Primer NOISE PRIMER TABLE OF CONTENTS 1. INTRODUCTION...1 2. BASIC SOUND PRINCIPLES...1 2.1. SOUND AS A WAVE...1 2.2. SOUND PRESSURE LEVEL...2 2.2.1. Decibel Math...4 2.2.2. Descriptive

More information

Sound. Speed of Sound

Sound. Speed of Sound Sound TUNING FORK CREATING SOUND WAVES GUITAR STRING CREATING SOUND WAVES Speed of Sound Sound travels at a speed that depends on the medium through which it propagates. The speed of sound depends: - directly

More information

ENTROPY RATE-BASED STATIONARY / NON-STATIONARY SEGMENTATION OF SPEECH

ENTROPY RATE-BASED STATIONARY / NON-STATIONARY SEGMENTATION OF SPEECH ENTROPY RATE-BASED STATIONARY / NON-STATIONARY SEGMENTATION OF SPEECH Wolfgang Wokurek Institute of Natural Language Processing, University of Stuttgart, Germany wokurek@ims.uni-stuttgart.de, http://www.ims-stuttgart.de/~wokurek

More information

Speech Spectra and Spectrograms

Speech Spectra and Spectrograms ACOUSTICS TOPICS ACOUSTICS SOFTWARE SPH301 SLP801 RESOURCE INDEX HELP PAGES Back to Main "Speech Spectra and Spectrograms" Page Speech Spectra and Spectrograms Robert Mannell 6. Some consonant spectra

More information

PHY 103 Impedance. Segev BenZvi Department of Physics and Astronomy University of Rochester

PHY 103 Impedance. Segev BenZvi Department of Physics and Astronomy University of Rochester PHY 103 Impedance Segev BenZvi Department of Physics and Astronomy University of Rochester Reading Reading for this week: Hopkin, Chapter 1 Heller, Chapter 1 2 Waves in an Air Column Recall the standing

More information

Improved Method for Epoch Extraction in High Pass Filtered Speech

Improved Method for Epoch Extraction in High Pass Filtered Speech Improved Method for Epoch Extraction in High Pass Filtered Speech D. Govind Center for Computational Engineering & Networking Amrita Vishwa Vidyapeetham (University) Coimbatore, Tamilnadu 642 Email: d

More information

David Weenink. First semester 2007

David Weenink. First semester 2007 Institute of Phonetic Sciences University of Amsterdam First semester 2007 Definition (ANSI: In Psycho-acoustics) is that auditory attribute of sound according to which sounds can be ordered on a scale

More information

COMP Signals and Systems. Dr Chris Bleakley. UCD School of Computer Science and Informatics.

COMP Signals and Systems. Dr Chris Bleakley. UCD School of Computer Science and Informatics. COMP 40420 2. Signals and Systems Dr Chris Bleakley UCD School of Computer Science and Informatics. Scoil na Ríomheolaíochta agus an Faisnéisíochta UCD. Introduction 1. Signals 2. Systems 3. System response

More information

What does the speed of a wave depend on?

What does the speed of a wave depend on? Today s experiment Goal answer the question What does the speed of a wave depend on? Materials: Wave on a String PHeT Simulation (link in schedule) and Wave Machine Write a CER in pairs. Think about the

More information

Mel-Generalized Cepstral Representation of Speech A Unified Approach to Speech Spectral Estimation. Keiichi Tokuda

Mel-Generalized Cepstral Representation of Speech A Unified Approach to Speech Spectral Estimation. Keiichi Tokuda Mel-Generalized Cepstral Representation of Speech A Unified Approach to Speech Spectral Estimation Keiichi Tokuda Nagoya Institute of Technology Carnegie Mellon University Tamkang University March 13,

More information

Ljud i byggnad och samhälle (VTAF01) Sound propagation outdoors MATHIAS BARBAGALLO DIVISION OF ENGINEERING ACOUSTICS, LUND UNIVERSITY

Ljud i byggnad och samhälle (VTAF01) Sound propagation outdoors MATHIAS BARBAGALLO DIVISION OF ENGINEERING ACOUSTICS, LUND UNIVERSITY Ljud i byggnad och samhälle (VTAF01) Sound propagation outdoors MATHIAS BARBAGALLO DIVISION OF ENGINEERING ACOUSTICS, LUND UNIVERSITY recap from last lectures Pressure waves For a sound to be perceived

More information

Signals, Instruments, and Systems W5. Introduction to Signal Processing Sampling, Reconstruction, and Filters

Signals, Instruments, and Systems W5. Introduction to Signal Processing Sampling, Reconstruction, and Filters Signals, Instruments, and Systems W5 Introduction to Signal Processing Sampling, Reconstruction, and Filters Acknowledgments Recapitulation of Key Concepts from the Last Lecture Dirac delta function (

More information

Acoustic tubes with maximal and minimal resonance frequencies

Acoustic tubes with maximal and minimal resonance frequencies Acoustic tubes with imal and minimal resonance frequencies B. De Boer Spuistraat 1, 11VT Amsterdam, Netherlands b.g.deboer@uva.nl 565 This paper presents a theoretical derivation of acoustic tract shapes

More information

Physics General Physics. Lecture 25 Waves. Fall 2016 Semester Prof. Matthew Jones

Physics General Physics. Lecture 25 Waves. Fall 2016 Semester Prof. Matthew Jones Physics 22000 General Physics Lecture 25 Waves Fall 2016 Semester Prof. Matthew Jones 1 Final Exam 2 3 Mechanical Waves Waves and wave fronts: 4 Wave Motion 5 Two Kinds of Waves 6 Reflection of Waves When

More information

Echo cancellation by deforming sound waves through inverse convolution R. Ay 1 ward DeywrfmzMf o/ D/g 0001, Gauteng, South Africa

Echo cancellation by deforming sound waves through inverse convolution R. Ay 1 ward DeywrfmzMf o/ D/g 0001, Gauteng, South Africa Echo cancellation by deforming sound waves through inverse convolution R. Ay 1 ward DeywrfmzMf o/ D/g 0001, Gauteng, South Africa Abstract This study concerns the mathematical modelling of speech related

More information

Lecture 5 Notes: 07 / 05. Energy and intensity of sound waves

Lecture 5 Notes: 07 / 05. Energy and intensity of sound waves Lecture 5 Notes: 07 / 05 Energy and intensity of sound waves Sound waves carry energy, just like waves on a string do. This energy comes in several types: potential energy due to the compression of the

More information

Environmental Effects and Control of Noise Lecture 2

Environmental Effects and Control of Noise Lecture 2 Noise, Vibration, Harshness Sound Quality Research Group NVH-SQ Group University of Windsor 92-455 Environmental Effects and Control of Noise Copyright 2015 Colin Novak Copyright 2015 by Colin Novak. All

More information

Lecture 21: Decay of Resonances

Lecture 21: Decay of Resonances Lecture 21: Decay of Resonances Recall that a resonance involves energy bouncing back and forth between two means of storage. In air resonances in cavities, energy is going back and forth between being

More information

Constriction Degree and Sound Sources

Constriction Degree and Sound Sources Constriction Degree and Sound Sources 1 Contrasting Oral Constriction Gestures Conditions for gestures to be informationally contrastive from one another? shared across members of the community (parity)

More information

CEPSTRAL ANALYSIS SYNTHESIS ON THE MEL FREQUENCY SCALE, AND AN ADAPTATIVE ALGORITHM FOR IT.

CEPSTRAL ANALYSIS SYNTHESIS ON THE MEL FREQUENCY SCALE, AND AN ADAPTATIVE ALGORITHM FOR IT. CEPSTRAL ANALYSIS SYNTHESIS ON THE EL FREQUENCY SCALE, AND AN ADAPTATIVE ALGORITH FOR IT. Summarized overview of the IEEE-publicated papers Cepstral analysis synthesis on the mel frequency scale by Satochi

More information

Design of a CELP coder and analysis of various quantization techniques

Design of a CELP coder and analysis of various quantization techniques EECS 65 Project Report Design of a CELP coder and analysis of various quantization techniques Prof. David L. Neuhoff By: Awais M. Kamboh Krispian C. Lawrence Aditya M. Thomas Philip I. Tsai Winter 005

More information

Physics 101: Lecture 22 Sound

Physics 101: Lecture 22 Sound EXAM III Physics 101: Lecture 22 Sound Today s lecture will cover Textbook Chapter 12 Physics 101: Lecture 22, Pg 1 Standing Waves Fixed Endpoints Fundamental n=1 (2 nodes) l n = 2L/n f n = n v / (2L)

More information

Quiz on Chapters 13-15

Quiz on Chapters 13-15 Quiz on Chapters 13-15 Final Exam, Thursday May 3, 8:00 10:00PM ANH 1281 (Anthony Hall). Seat assignments TBD RCPD students: Thursday May 3, 5:00 9:00PM, BPS 3239. Email will be sent. Alternate Final Exam,

More information

Lecture III : Jan 30, 2017

Lecture III : Jan 30, 2017 Lecture III : Jan 30, 2017 Prequantum Era Particles & Waves Reading Assignment: Chapter 2 and 3 from Quantum Physics for Poets. Summarize your thoughts with some questions/comments. ( One page writeup

More information

TYPES OF WAVES. 4. Waves and Sound 1

TYPES OF WAVES. 4. Waves and Sound 1 TYPES OF WAVES Consider a set of playground swings attached by a rope from seat to seat If you sit in the first swing and begin oscillating, this disturbs the equilibrium The connecting ropes cause the

More information

Chapter 18 Solutions

Chapter 18 Solutions Chapter 18 Solutions 18.1 he resultant wave function has the form y A 0 cos φ sin kx ω t + φ (a) A A 0 cos φ (5.00) cos (π /4) 9.4 m f ω π 100π π 600 Hz *18. We write the second wave function as hen y

More information

Section 2: Lecture 1 Integral Form of the Conservation Equations for Compressible Flow

Section 2: Lecture 1 Integral Form of the Conservation Equations for Compressible Flow Section 2: Lecture 1 Integral Form of the Conservation Equations for Compressible Flow Anderson: Chapter 2 pp. 41-54 1 Equation of State: Section 1 Review p = R g T " > R g = R u M w - R u = 8314.4126

More information

UNIVERSITY OF SASKATCHEWAN Department of Physics and Engineering Physics

UNIVERSITY OF SASKATCHEWAN Department of Physics and Engineering Physics UNIVERSITY OF SASKATCHEWAN Department of Physics and Engineering Physics Physics 117.3 MIDTERM TEST February 13, 2014 Time: 90 minutes NAME: (Last) Please Print (Given) STUDENT NO.: LECTURE SECTION (please

More information