Isolated Word Recogniser

Similar documents
Hidden Markov Model Parameters

Statistical Inference Procedures

SOLUTION: The 95% confidence interval for the population mean µ is x ± t 0.025; 49

Comments on Discussion Sheet 18 and Worksheet 18 ( ) An Introduction to Hypothesis Testing

x z Increasing the size of the sample increases the power (reduces the probability of a Type II error) when the significance level remains fixed.

REVIEW OF SIMPLE LINEAR REGRESSION SIMPLE LINEAR REGRESSION

Chapter 9. Key Ideas Hypothesis Test (Two Populations)

u t u 0 ( 7) Intuitively, the maximum principles can be explained by the following observation. Recall

Fig. 1: Streamline coordinates

Assignment 1 - Solutions. ECSE 420 Parallel Computing Fall November 2, 2014

State space systems analysis

10-716: Advanced Machine Learning Spring Lecture 13: March 5

Lecture 30: Frequency Response of Second-Order Systems

STUDENT S t-distribution AND CONFIDENCE INTERVALS OF THE MEAN ( )

NUMERICAL METHODS FOR SOLVING EQUATIONS

Société de Calcul Mathématique, S. A. Algorithmes et Optimisation

Confidence Intervals: Three Views Class 23, Jeremy Orloff and Jonathan Bloom

Math 213b (Spring 2005) Yum-Tong Siu 1. Explicit Formula for Logarithmic Derivative of Riemann Zeta Function

18.05 Problem Set 9, Spring 2014 Solutions

ON THE SCALE PARAMETER OF EXPONENTIAL DISTRIBUTION

CHAPTER 6. Confidence Intervals. 6.1 (a) y = 1269; s = 145; n = 8. The standard error of the mean is = s n = = 51.3 ng/gm.

IntroEcono. Discrete RV. Continuous RV s

Chapter 8.2. Interval Estimation

Heat Equation: Maximum Principles

20. CONFIDENCE INTERVALS FOR THE MEAN, UNKNOWN VARIANCE

13.4 Scalar Kalman Filter

M227 Chapter 9 Section 1 Testing Two Parameters: Means, Variances, Proportions

COMPARISONS INVOLVING TWO SAMPLE MEANS. Two-tail tests have these types of hypotheses: H A : 1 2

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 16 11/04/2013. Ito integral. Properties

Erick L. Oberstar Fall 2001 Project: Sidelobe Canceller & GSC 1. Advanced Digital Signal Processing Sidelobe Canceller (Beam Former)

STA 4032 Final Exam Formula Sheet

TESTS OF SIGNIFICANCE

Tools Hypothesis Tests

STRONG DEVIATION THEOREMS FOR THE SEQUENCE OF CONTINUOUS RANDOM VARIABLES AND THE APPROACH OF LAPLACE TRANSFORM

Difference tests (1): parametric

SEQUENCE AND SERIES NCERT

Confidence Intervals. Confidence Intervals

8.6 Order-Recursive LS s[n]

11 Hidden Markov Models

Tables and Formulas for Sullivan, Fundamentals of Statistics, 2e Pearson Education, Inc.

Brief Review of Linear System Theory

We will look for series solutions to (1) around (at most) regular singular points, which without

1. (a) If u (I : R J), there exists c 0 in R such that for all q 0, cu q (I : R J) [q] I [q] : R J [q]. Hence, if j J, for all q 0, j q (cu q ) =

The z-transform. 7.1 Introduction. 7.2 The z-transform Derivation of the z-transform: x[n] = z n LTI system, h[n] z = re j

TI-83/84 Calculator Instructions for Math Elementary Statistics

Chapter 9: Hypothesis Testing

VIII. Interval Estimation A. A Few Important Definitions (Including Some Reminders)

Kinetics of Complex Reactions

CS284A: Representations and Algorithms in Molecular Biology

Performance-Based Plastic Design (PBPD) Procedure

LECTURE 13 SIMULTANEOUS EQUATIONS

Infinite Sequences and Series

Queueing Theory (Part 3)

IP Reference guide for integer programming formulations.

Linear Regression Demystified

On the 2-Domination Number of Complete Grid Graphs

System Control. Lesson #19a. BME 333 Biomedical Signals and Systems - J.Schesser

The Performance of Feedback Control Systems

Statistical Equations

11/19/ Chapter 10 Overview. Chapter 10: Two-Sample Inference. + The Big Picture : Inference for Mean Difference Dependent Samples

3. Z Transform. Recall that the Fourier transform (FT) of a DT signal xn [ ] is ( ) [ ] = In order for the FT to exist in the finite magnitude sense,

Sums, products and sequences

CHAPTER 10 INFINITE SEQUENCES AND SERIES

Last time: Ground rules for filtering and control system design

Math 475, Problem Set #12: Answers

MTH 212 Formulas page 1 out of 7. Sample variance: s = Sample standard deviation: s = s

Definitions and Theorems. where x are the decision variables. c, b, and a are constant coefficients.

Frequency Domain Filtering

Sequences, Mathematical Induction, and Recursion. CSE 2353 Discrete Computational Structures Spring 2018

(b) What is the probability that a particle reaches the upper boundary n before the lower boundary m?

S T A T R a c h e l L. W e b b, P o r t l a n d S t a t e U n i v e r s i t y P a g e 1. = Population Variance

Math 61CM - Solutions to homework 3

( ) = p and P( i = b) = q.

Math 155 (Lecture 3)

Lecture 5: April 17, 2013

Statistics 511 Additional Materials

Lecture 19: Convergence

ECEN 655: Advanced Channel Coding Spring Lecture 7 02/04/14. Belief propagation is exact on tree-structured factor graphs.

Statistics and Chemical Measurements: Quantifying Uncertainty. Normal or Gaussian Distribution The Bell Curve

Chapter 1 Econometrics

( ) (( ) ) ANSWERS TO EXERCISES IN APPENDIX B. Section B.1 VECTORS AND SETS. Exercise B.1-1: Convex sets. are convex, , hence. and. (a) Let.

multiplies all measures of center and the standard deviation and range by k, while the variance is multiplied by k 2.

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

THE CONCEPT OF THE ROOT LOCUS. H(s) THE CONCEPT OF THE ROOT LOCUS

Math 113 Exam 3 Practice

Introduction to Computational Biology Homework 2 Solution

The picture in figure 1.1 helps us to see that the area represents the distance traveled. Figure 1: Area represents distance travelled

Problem Cosider the curve give parametrically as x = si t ad y = + cos t for» t» ß: (a) Describe the path this traverses: Where does it start (whe t =

ECE 6980 An Algorithmic and Information-Theoretic Toolbox for Massive Data

The Structure of Z p when p is Prime

The Expectation-Maximization (EM) Algorithm

1. By using truth tables prove that, for all statements P and Q, the statement

Lecture 4 February 16, 2016

Further Investigation of alternative Formulation of RP Model with Response Error. Ed Stanek

Generalized Likelihood Functions and Random Measures

Introduction to Computational Molecular Biology. Gibbs Sampling

L = n i, i=1. dp p n 1

ELEC 372 LECTURE NOTES, WEEK 4 Dr. Amir G. Aghdam Concordia University

The multiplicative structure of finite field and a construction of LRC

x c the remainder is Pc ().

Transcription:

Lecture 5 Iolated Word Recogitio Hidde Markov Model of peech State traitio ad aligmet probabilitie Searchig all poible aligmet Dyamic Programmig Viterbi Aligmet Iolated Word Recogitio 8. Iolated Word Recogier Hypothei Geerator cat Laguage Model pr(w) w Preproceor Acoutic Model pd( w) Preproceor Idetifie tart/ed of each word Covert peech ito a equece of feature vector at iterval of aroud 0 m. Hypothei Geerator Try each poible word i tur Laguage Model Etimate the probability of the word Ca deped o the precedig word Acoutic Model Calculate the probability deity that a oberved equece of feature vector correpod to the choe word 8. core

8.5 Speech Productio Model c c c a a a t t t c a t Each phoeme i a word correpod to a umber of model tate. Each model tate repreet a ditict oud with it ow acoutic pectrum. For each tate, we tore the mea value ad variace for each of F feature. Whe ayig a word, the peaker tay i each tate for oe or more frame ad the goe o to the ext tate. The time i each tate will vary accordig to how fat he/he i peakig. Some peech oud lat loger tha other 8.6 State Traitio Aume that the probability of proceedig oto the ext tate i p ad the probability of tayig i the ame tate i p. The probability of beig i ay tate for the ext frame deped oly o the curret tate ad ot o ay previou hitory thi i a Markov proce. 0.6 0.8 0.9 More realitic aumptio icreae computatio without improvig performace. 0. 0. 0. a a t 0. 0.8 0.6 0. 0. 0. 0.08 0.06 0.0 0.0 5 6 7 8 9 0 5 State Duratio (frame) 0 806 06 0.0.0. 0.0.50.60.7 Probability The legth of time, D, pet i ay tate follow a egative expoetial ditributio with a average duratio of /p frame. pr( D ) p ( p ) Sice by differetiatig E( D ) 0 x x p ( p ) 0 x p ( x )

8.7 Aligmet Probabilitie We ca calculate the probability of havig: frame i the firt tate, 0.6 0.8 0.9 0. 0. a a t 0. i the ecod tate, ad 6 i the third tate A complete pecificatio of which frame are i each tate i a aligmet. A before we ca ue log probabilitie ad add them itead of multiplyig Avoid dyamic rage problem 0.6 0.6 0. 0.8 0. 0. 0.0006 8.8 Hidde Markov Model c a t c c c a a a t t t To calculate the probability deity (pd) that a obervatio matche a particular word with a give aligmet, we multiply together: the probability of the aligmet the output pd of each frame Try thi for every poible aligmet of every poible word equece ad chooe the oe with highet probability. Hidde Markov Model the correct aligmet i hidde: we ca t oberve it directly. We talk of the probability deity of the model geeratig the oberved frame equece.

8.9 Hidde Markov Model Parameter c c c a a a t t t c a t A Hidde Markov Model for a word mut pecify the followig parameter for tate : The mea ad variace for each of the F elemet of the parameter vector: µ ad σ. Thee allow u to calculate d (x): the output probability deity of iput frame x i tate. The traitio probabilitie a,j to every poible ucceor tate. a,j i ofte zero for all j except j ad j+ it i the called a left-to-right, o kip model. For a Hidde Markov Model with S tate we therefore have aroud (F+)S parameter. A typical word might have S5 ad F9 givig 00 parameter i all. 8.0 Miimum Cot Path Suppoe we wat to fid the cheapet path through a toll road ytem: Start Fiih I each circle we will eter the lowet cot of a jourey from Start Start Fiih 5 Begi by puttig 0 i the Start circle Now put,, i the d colum circle ad mark all three egmet i bold. Thi how the lowet cot to each of thee circle ad the route by which you go. Put 6,, 6 i the rd colum circle ad, i each cae, mark the bet egmet from the previou colum i bold. Put 5, ad 6 i the th colum. Put 7 i the Fiih circle. We ca trace bold egmet backward from Fiih to fid the bet overall path.

8. Dyamic Programmig Thi techique for fidig the miimum cot path through a graph i kow a dyamic programmig. Three coditio mut be true: All path through the graph mut go from left to right. The cot of each egmet of a path mut be fixed i advace: it mut ot deped o which of the other egmet are icluded i the route. The total cot of a path mut jut be the um of each of it egmet cot. Dyamic programmig i guarateed to fid the path with miimum cot. We ca alo fid the maximum cot path i the ame way: i thi cae the cot are uually called utilitie itead. We ca ue Dyamic Programmig to fid the bet aligmet of a equece of feature vector with a word model. bet mea the aligmet with the highet productio probability deity. 8. Aligmet Graph Start Fiih We ca draw a aligmet graph (or lattice): The colum correpod to peech frame The row correpod to model tate Each poible path from Start to Fiih correpod to a aligmet of the peech frame with the model. All valid path pa through each colum i tur. I goig to the ext colum, a path i retricted to the tate traitio allowed by the tate diagram I the above example a path mut either remai i the ame tate or ele go o to the ext tate.

8. Segmet Utilitie x 0 Start Fiih Segmet prob deity a, d (x 0 ) The probability deity of a path egmet goig from tate i to tate j i the product of: The probability of the traitio: a i,j The output probability deity of the correpodig iput frame i tate j: d j (x) The probability deity of the etire aligmet i the product of it cotituet egmet pd. Thi equal the pd that the model geerate the oberved equece of feature vector with thi particular aligmet. 8. Dyamic Programmig Step x 0 Start B(,9) B(,9) B(,9) B(,0) Defie B(,t) to be the probability deity of the bet partial aligmet begiig at Start ad edig with frame t i tate. Ay aligmet goig through (,0) mut go through either (,9), (,9) or (,9). Hece:,, ( ) B(,0) max B(,9) a d ( x ) 0 I geeral, we ca calculate B(*,t) from B(*,t ): ( ) Bkt (, ) max Bt (, ) ak dk( x t) Iitialie thi recurio by ettig B(,)d ()

8.5 Viterbi Aligmet Start a S Fiih Thi procedure i called Viterbi Aligmet: B(,) d (x ); B(,) 0 for > for t:t for k:s z(k,t)jargmax(b(,t-) a k ) for all with a k 0 B(k,t)B(j,t-) a jk d k (x t ) ed ed The bet path ha prob deity B(S,T) a S a S i the exit probability from the fial tate The z(k,t) array tore the iformatio eeded to trace back the bet path. 8.6 Iolated Word Recogitio Require the peaker to iert a gap betwee each word Ued for budget ytem with little CPU power Recogitio: Extract a word-log egmet of peech,, from the iput igal. Covert it ito a equece of frame. Calculate pr(w ) pr(w) pr( w) for each poible word, w, i the recogitio vocabulary. pr(w) i the prior probability of the word: get thi from word frequecie or word-pair frequecie (e.g. miiter ofte follow prime ). pr( w) i obtaied by uig the Viterbi aligmet algorithm to fid the log probability deity of the bet aligmet of with the model for w. Chooe the word with the highet probability Need to create a eparate Hidde Markov model for each word i the vocabulary. Log