The Electron ic PSC Testing System

Size: px
Start display at page:

Download "The Electron ic PSC Testing System"

Transcription

1 20 6 JO URAL O F CH IESE IFO RM AT IO PROCESS IG Vol120 o16 : (2006) ,,, (, ) : 100,,, 500,, (2144)(2130) :; ;; ; ; : TP391: A The Electron ic PSC Testing System W E I Si, L IU Q ing2sheng, HU Yu, WAG Ren2hua ( Man Machine Voice Communication Laboratory, University of Science&Technology of China, Hefei, Anhui , China) Abstract: This paper develop s an automatic PSC testing system aim ing at efficiently evaluating the spoken Chinese. On the basis of 100 hoursstandard Chinese database, this paper uses the characteristic of Chinese and linguist s expert knowledge to op tim ize the traditional speech evaluation algorithm. A t the same time, a corpus2adap tive method is p ro2 pose to enhance the robustness and performance of the algorithm. Experiments on 500 personspsc testing database p rove that the new algorithm is much better than the original algorithm. After linear mapp ing, the error between the machine score and the human score is almost equal to the error between humans, that is The result indicates that the automatic PSC testing system can rep lace the human to evaluating spoken Chinese under text2dependent con2 dition. Key word: computer app lication; Chinese information p rocessing; Putonghua shuip ing ceshi; p ronunciation evalua2 tion; PSC testing database, automatic testing system. 1,,,,,, 100,,,, : : : ( ZD I105 - B02) : (1982),,,. 89

2 ( SR I) V ILT [ 1, 2 ], SR I SC ILL [ 3, 4 ], V ICK [ 5, 6 ],,, [ 9, 10 ],,,,,,,,,,,,(2144) (2130),,, 2,, 1 1 1,,,, 90

3 3 311,,,,,, 16K, 16bit / 4500/ 60/( 400) 3 /, 100,,,, 312,,,,, 16K, 16bit % 71% 23%,,, 3, 313 ( ),,264,236 3 A, B { S i, i = 1, 2,, n},(1) : [ (S A i - S A ) (S B i - S B ) ] (S A i - S A ) 2 [ (S B i - S B ) 2 (1) 91

4 , S A i A i, S B i B i, S A A, S B B 4 ( )/ (110, 010) / (110, 010) (0191, 1188) / ( 0190, 1197) (0188, 2154) / ( 0189, 2147) (0191, 1188) / ( 0190, 1197) (110, 010) / (110, 010) (0191, 2119) / ( 0189, 2147) (0190, 2120) / (0189, 2130) (0188, 2154) / ( 0189, 2147) (0191, 2119) / ( 0189, 2147) (110, 010) / (110, 010) 4,018,3 4,, 411 HMM, 25m s, 10m s MFCC,39 HMM, TO P O T, P O T O HMM T,HMM [ 13, 5 ] O TP T O,, P P ( T O ) T O (2) [ 3 ] = ( log ( P ( T i O ( T i ) ) ) / F ( T i ) ) / = ( log ( P (O ( T i ) T i ) P ( T i ) qq P (O ( T i ) q) p ( q) ) / F ( T i ) ) / (2) P (O ( T i ) T i ) ( log ( max qq P (O ( T i ) q) ) / F ( T i ) ) /, Q, qt i, F ( T i )T i,, P (O ( T i ) T i ) T i O ( T i ), : 0158, (2),,

5 ,,,,,, (3) : P ( T O ) = ( log ( P ( T i O ( T i ) ) ) / F ( T i ) ) / = ( log ( P (O ( T i ) T i ) P ( T i ) T qq i error P (O ( T i ) q) p ( q) ) / F ( T i ) ) / (3) P (O ( T i ) T i ) ( log ( max T qq i P (O ( T i ) q) ) / F ( T i ) ) / error (3) (2), ( 2), (3) [ 7 ], 41212, [ 8 ],(4) G sen t = G i / G i = G i in itia l + G i fina l G sent, G i i G i in itia l i G i fina l i,,, (5) G sen t = i / G G i = G i in itia l (1 + D u ri f ina l D u r i in itia l CO EF) + G i fina l G sen t, G i i, D ur i fina l i, D ur i in itia l i CO EF,, CO EF,, 41213,,,, MLLR (Maximum L ikelihood L ine2 ar Regression) [ 11 ], MLLR,,,,,,, : (4) (5) 93

6 ,, T i HTKHMM,,, (6), T i THR ESH i T i < THR ESH i THR ESH,,, MLLR, 5, 511 (3),,,, HMM,, 5 5 / 0165 / / (6) 5,,,,,, [ 8 ],, 6, HMM, /, / /0177 6,,[ 8 ] ,7 7, / HMM, 0177 / / /

7 7,, 514,,, 8: HMM,,, 8 8 (/) + + VS 0165 / /0181 VS 0190 /0189 8,, 6,,,,,,,, :,, (7) S core m ach ine S core m ach ine S core m ach ine = 3 = 3 = 3 1i P ( o i ) + S core 4 C = 2i P ( o i ) + S core 4 C = 3i P ( o i ) + S core 4 C, P ( o i )i,, 1 i, Score 4, C, Score m ach ine 9 9 ( )/ + + VS (0183, - ) / (0181, - ) (0195, 1128) / (0184, 2144) VS (0190, 2120) / (0189, 2130) 9,,,, (7) 95

8 ,, (2144)(2130) 7,,,,,,,, 0165 /0161 (/, )0183 /0181,,, 0195 /0184,1128 /2144, 0190 / /2130,,,,,,, : [ 1 ] H. L. Franco, L. eumeyer, Y. Kim, O. Ronen. Automatic p ronunciation scoring for language instruction[a ]. ICASSP[ C ], 1997, [ 2 ] L. eumeyer, H. Franco, V. D igalakis, M. W eintraub. Automatic scoring of p ronunciation quality. Speech Communication 30 [ J ], 2000, [ 3 ] S. M. W itt, S. J. Young. Phone2level p ronunciation scoring and assessment for interactive language learning [A ]. In: Speech Communication 30, 2000, [ 4 ] S. M. W itt, U se of speech recognition in computer2assisted language learning, Doctor s D issertation of Cam2 bridge[d ], [ 5 ] C. Cucchiarini, F. D. W et, H. Strik, L. Boves. A ssessment of Dutch p ronunciation by means of automatic speech recognition technology[a ]. ICSLP, Vol. 5 [ C ], 1998, [ 6 ] C. Cucchiarini, H. Strik, L. Boves. Automatic evaluation of dutch p ronunciation by using speech recognition technology[a ]. Proceedings of the IEEE workshop ASRU [ C ], Santa Barbara. 1997, [ 7 ] A ijun L i, Xia W ang, A Contrastive Investigation of Standard Mandarin and Accented [A ]. EuroSpeech [ C ], 2003, [ 8 ],,,. [A ]. [ C ], 2005, [ 9 ],. [A ]. [ J ], 1998, [ 10 ],,. [A ]. [ C ], 2005, [ 11 ] C. J. Leggetter, P. C. Woodland, Maximum L ikelihood L inear Regression for Speaker Adap tation of Contin2 uous Density H idden M arkov Models, Computer Speech and Language[ J ], 1995,

Presented By: Omer Shmueli and Sivan Niv

Presented By: Omer Shmueli and Sivan Niv Deep Speaker: an End-to-End Neural Speaker Embedding System Chao Li, Xiaokong Ma, Bing Jiang, Xiangang Li, Xuewei Zhang, Xiao Liu, Ying Cao, Ajay Kannan, Zhenyao Zhu Presented By: Omer Shmueli and Sivan

More information

Chinese Journal of Scientific Instrument. High frequency we ighted M FCC extraction for noise robust speaker ver if ication

Chinese Journal of Scientific Instrument. High frequency we ighted M FCC extraction for noise robust speaker ver if ication 29 3 20083 Chinese Journal of Scientific Instrument Vol129 No13 Mar. 2008 M FCC 1, 1, 2 (1 400044; 2 400044) : MFCC Mel,,,,, MFCC,,, : ; ; ; ; MFCC : TP192. 3 : A: 520. 2040 High frequency we ighted M

More information

Hidden Markov Model and Speech Recognition

Hidden Markov Model and Speech Recognition 1 Dec,2006 Outline Introduction 1 Introduction 2 3 4 5 Introduction What is Speech Recognition? Understanding what is being said Mapping speech data to textual information Speech Recognition is indeed

More information

A discussion on methodologies for research into complex system s

A discussion on methodologies for research into complex system s 4 1 Vol. 4. 1 2009 2 CAA I Transactions on Intelligent System s Feb. 2009 1, 2, 3, 2, 4 (1., 610074; 2., 610500; 3., 610500; 4., 618000) : 20 80,,,, 6 : ; ; Agent ; Swarm; StarLogo : N94, TP273 : A : 167324785

More information

Journal of Beijing University of Aeronautics and A stronautics PCNN, PCNN. Nove l adap tive deno ising m e thod fo r extrem e no ise ba sed on PCNN

Journal of Beijing University of Aeronautics and A stronautics PCNN, PCNN. Nove l adap tive deno ising m e thod fo r extrem e no ise ba sed on PCNN 2009 1 35 1 Journal of Beijing University of Aeronautics and A stronautics January 2009 Vol. 35 No11 PCNN (, 100191) : PCNN ( Pulse Coup led Neural Network) ADEN (Adapative Denosing method for Extreme

More information

Study on disturbance torques compensation in high precise servo turn table control system

Study on disturbance torques compensation in high precise servo turn table control system 13 4 20097 EL ECTR ICMACH IN ESANDCON TROL Vol113 No14 July 2009,, (, 150001) :, PD,, Lyapunov, ;,, : ; ; ; Lyapunov; : TP 273 : A : 1007-449X (2009) 04-0615- 05 Study on disturbance torques compensation

More information

Deep Learning for Speech Recognition. Hung-yi Lee

Deep Learning for Speech Recognition. Hung-yi Lee Deep Learning for Speech Recognition Hung-yi Lee Outline Conventional Speech Recognition How to use Deep Learning in acoustic modeling? Why Deep Learning? Speaker Adaptation Multi-task Deep Learning New

More information

N-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition

N-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition 2010 11 5 N-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition 1 48-106413 Abstract Large-Vocabulary Continuous Speech Recognition(LVCSR) system has rapidly been growing today.

More information

Experiments with a Gaussian Merging-Splitting Algorithm for HMM Training for Speech Recognition

Experiments with a Gaussian Merging-Splitting Algorithm for HMM Training for Speech Recognition Experiments with a Gaussian Merging-Splitting Algorithm for HMM Training for Speech Recognition ABSTRACT It is well known that the expectation-maximization (EM) algorithm, commonly used to estimate hidden

More information

Geoffrey Zweig May 7, 2009

Geoffrey Zweig May 7, 2009 Geoffrey Zweig May 7, 2009 Taxonomy of LID Techniques LID Acoustic Scores Derived LM Vector space model GMM GMM Tokenization Parallel Phone Rec + LM Vectors of phone LM stats [Carrasquillo et. al. 02],

More information

Automatic Speech Recognition (CS753)

Automatic Speech Recognition (CS753) Automatic Speech Recognition (CS753) Lecture 21: Speaker Adaptation Instructor: Preethi Jyothi Oct 23, 2017 Speaker variations Major cause of variability in speech is the differences between speakers Speaking

More information

Feature-Space Structural MAPLR with Regression Tree-based Multiple Transformation Matrices for DNN

Feature-Space Structural MAPLR with Regression Tree-based Multiple Transformation Matrices for DNN MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Feature-Space Structural MAPLR with Regression Tree-based Multiple Transformation Matrices for DNN Kanagawa, H.; Tachioka, Y.; Watanabe, S.;

More information

Robust Sound Event Detection in Continuous Audio Environments

Robust Sound Event Detection in Continuous Audio Environments Robust Sound Event Detection in Continuous Audio Environments Haomin Zhang 1, Ian McLoughlin 2,1, Yan Song 1 1 National Engineering Laboratory of Speech and Language Information Processing The University

More information

JOURNAL OF NATURAL RESOURCES Mar., 2008 : X24 : A : (2008) : ; : : ( )

JOURNAL OF NATURAL RESOURCES Mar., 2008 : X24 : A : (2008) : ; : : ( ) 23 2 Vol123 No12 2008 3 JOURNAL OF NATURAL RESOURCES Mar., 2008 : (, 100101) :,,,,,,,(),,, : ; ; ; : X24 : A: 1000-3037 (2008) 02-0177 - 08 1,,,,,,,,,,,,,,, 2,, [ 1, 2 ] : 2007-06- 09; : 2007-12- 26 :

More information

MAP adaptation with SphinxTrain

MAP adaptation with SphinxTrain MAP adaptation with SphinxTrain David Huggins-Daines dhuggins@cs.cmu.edu Language Technologies Institute Carnegie Mellon University MAP adaptation with SphinxTrain p.1/12 Theory of MAP adaptation Standard

More information

Vol112, No11 Feb1, 2010 JOURNAL OF GEO2INFORMATION SC IENCE , CBERS IRS - P5, ;, : ; : E2mail: lreis1ac1cn [ 6-13 ]

Vol112, No11 Feb1, 2010 JOURNAL OF GEO2INFORMATION SC IENCE , CBERS IRS - P5, ;, : ; : E2mail: lreis1ac1cn [ 6-13 ] 12 1 2010 2 JOURNAL OF GEO2INFORMATION SC IENCE Vol112, No11 Feb1, 2010, 3, (, 100101) :,,,,,, CBERS IRS - P5, ;,, 20m 1km ;,, CBERS, 200m, 76% ; P5, 100m, 84% : ; ; ; 1 [ 4-5 ], GIS, [ 6-13 ], ( 1995

More information

Heeyoul (Henry) Choi. Dept. of Computer Science Texas A&M University

Heeyoul (Henry) Choi. Dept. of Computer Science Texas A&M University Heeyoul (Henry) Choi Dept. of Computer Science Texas A&M University hchoi@cs.tamu.edu Introduction Speaker Adaptation Eigenvoice Comparison with others MAP, MLLR, EMAP, RMP, CAT, RSW Experiments Future

More information

ENHANCEMENTS OF MAXIMUM LIKELIHOOD EIGEN-DECOMPOSITION USING FUZZY LOGIC CONTROL FOR EIGENVOICE-BASED SPEAKER ADAPTATION.

ENHANCEMENTS OF MAXIMUM LIKELIHOOD EIGEN-DECOMPOSITION USING FUZZY LOGIC CONTROL FOR EIGENVOICE-BASED SPEAKER ADAPTATION. International Journal of Innovative Computing, Information and Control ICIC International c 2011 ISSN 1349-4198 Volume 7, Number 7(B), July 2011 pp. 4207 4222 ENHANCEMENTS OF MAXIMUM LIKELIHOOD EIGEN-DECOMPOSITION

More information

Segmental Recurrent Neural Networks for End-to-end Speech Recognition

Segmental Recurrent Neural Networks for End-to-end Speech Recognition Segmental Recurrent Neural Networks for End-to-end Speech Recognition Liang Lu, Lingpeng Kong, Chris Dyer, Noah Smith and Steve Renals TTI-Chicago, UoE, CMU and UW 9 September 2016 Background A new wave

More information

Use precise language and domain-specific vocabulary to inform about or explain the topic. CCSS.ELA-LITERACY.WHST D

Use precise language and domain-specific vocabulary to inform about or explain the topic. CCSS.ELA-LITERACY.WHST D Lesson eight What are characteristics of chemical reactions? Science Constructing Explanations, Engaging in Argument and Obtaining, Evaluating, and Communicating Information ENGLISH LANGUAGE ARTS Reading

More information

Mixtures of Gaussians with Sparse Structure

Mixtures of Gaussians with Sparse Structure Mixtures of Gaussians with Sparse Structure Costas Boulis 1 Abstract When fitting a mixture of Gaussians to training data there are usually two choices for the type of Gaussians used. Either diagonal or

More information

A TWO-LAYER NON-NEGATIVE MATRIX FACTORIZATION MODEL FOR VOCABULARY DISCOVERY. MengSun,HugoVanhamme

A TWO-LAYER NON-NEGATIVE MATRIX FACTORIZATION MODEL FOR VOCABULARY DISCOVERY. MengSun,HugoVanhamme A TWO-LAYER NON-NEGATIVE MATRIX FACTORIZATION MODEL FOR VOCABULARY DISCOVERY MengSun,HugoVanhamme Department of Electrical Engineering-ESAT, Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, Bus

More information

Full-covariance model compensation for

Full-covariance model compensation for compensation transms Presentation Toshiba, 12 Mar 2008 Outline compensation transms compensation transms Outline compensation transms compensation transms Noise model x clean speech; n additive ; h convolutional

More information

M odeling and sim ula ting the forag ing system in multi2source groups w ith random d isturbances

M odeling and sim ula ting the forag ing system in multi2source groups w ith random d isturbances 3 4 Vol. 3. 4 20088 CAA I Transactions on Intelligent System s Aug. 2008,, (,150001) :..... 2,,,. Starlogo,.,. : ;; ; Starlogo : TP18 : A: 167324785 (2008) 0420342207 M odeling and sim ula ting the forag

More information

A Low-Cost Robust Front-end for Embedded ASR System

A Low-Cost Robust Front-end for Embedded ASR System A Low-Cost Robust Front-end for Embedded ASR System Lihui Guo 1, Xin He 2, Yue Lu 1, and Yaxin Zhang 2 1 Department of Computer Science and Technology, East China Normal University, Shanghai 200062 2 Motorola

More information

Comparing linear and non-linear transformation of speech

Comparing linear and non-linear transformation of speech Comparing linear and non-linear transformation of speech Larbi Mesbahi, Vincent Barreaud and Olivier Boeffard IRISA / ENSSAT - University of Rennes 1 6, rue de Kerampont, Lannion, France {lmesbahi, vincent.barreaud,

More information

Dominant Feature Vectors Based Audio Similarity Measure

Dominant Feature Vectors Based Audio Similarity Measure Dominant Feature Vectors Based Audio Similarity Measure Jing Gu 1, Lie Lu 2, Rui Cai 3, Hong-Jiang Zhang 2, and Jian Yang 1 1 Dept. of Electronic Engineering, Tsinghua Univ., Beijing, 100084, China 2 Microsoft

More information

Maximum Likelihood and Maximum A Posteriori Adaptation for Distributed Speaker Recognition Systems

Maximum Likelihood and Maximum A Posteriori Adaptation for Distributed Speaker Recognition Systems Maximum Likelihood and Maximum A Posteriori Adaptation for Distributed Speaker Recognition Systems Chin-Hung Sit 1, Man-Wai Mak 1, and Sun-Yuan Kung 2 1 Center for Multimedia Signal Processing Dept. of

More information

Results as of 30 September 2018

Results as of 30 September 2018 rt Results as of 30 September 2018 F r e e t r a n s l a t ion f r o m t h e o r ig ina l in S p a n is h. I n t h e e v e n t o f d i s c r e p a n c y, t h e Sp a n i s h - la n g u a g e v e r s ion

More information

Spacec raft au tom a tic te st and spacecraft te st language

Spacec raft au tom a tic te st and spacecraft te st language 2009 11 35 11 Journal of Beijing University of Aeronautics and A stronautics November2009 Vol. 35No111 (, 100191), ;,,,, 4.. ; ; ; ; TP 273 +. 5 A : 100125965 (2009) 1121375204 Spacec raft au tom a tic

More information

Why DNN Works for Acoustic Modeling in Speech Recognition?

Why DNN Works for Acoustic Modeling in Speech Recognition? Why DNN Works for Acoustic Modeling in Speech Recognition? Prof. Hui Jiang Department of Computer Science and Engineering York University, Toronto, Ont. M3J 1P3, CANADA Joint work with Y. Bao, J. Pan,

More information

Nearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis. Thomas Ewender

Nearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis. Thomas Ewender Nearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis Thomas Ewender Outline Motivation Detection algorithm of continuous F 0 contour Frame classification algorithm

More information

w h e r e e v e r t h e y live. It is an i n d u s t r i a l i z e d form of t e a c h i n g and

w h e r e e v e r t h e y live. It is an i n d u s t r i a l i z e d form of t e a c h i n g and 3 b D i s t a n c e E d u c a t i o n - In India S o c i o - L e g a l A n a l y s i s A- D i s t a n c e E d u c a t i o n - C o n c e p t D i s t a n c e T e a c h i n g or E d u c a t i o n is a m e

More information

Multi-level Gaussian selection for accurate low-resource ASR systems

Multi-level Gaussian selection for accurate low-resource ASR systems Multi-level Gaussian selection for accurate low-resource ASR systems Leïla Zouari, Gérard Chollet GET-ENST/CNRS-LTCI 46 rue Barrault, 75634 Paris cedex 13, France Abstract For Automatic Speech Recognition

More information

Hidden Markov Modelling

Hidden Markov Modelling Hidden Markov Modelling Introduction Problem formulation Forward-Backward algorithm Viterbi search Baum-Welch parameter estimation Other considerations Multiple observation sequences Phone-based models

More information

ON SCALABLE CODING OF HIDDEN MARKOV SOURCES. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose

ON SCALABLE CODING OF HIDDEN MARKOV SOURCES. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose ON SCALABLE CODING OF HIDDEN MARKOV SOURCES Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose Department of Electrical and Computer Engineering University of California, Santa Barbara, CA, 93106

More information

Robust Speaker Identification

Robust Speaker Identification Robust Speaker Identification by Smarajit Bose Interdisciplinary Statistical Research Unit Indian Statistical Institute, Kolkata Joint work with Amita Pal and Ayanendranath Basu Overview } } } } } } }

More information

Pattern Recognition Applied to Music Signals

Pattern Recognition Applied to Music Signals JHU CLSP Summer School Pattern Recognition Applied to Music Signals 2 3 4 5 Music Content Analysis Classification and Features Statistical Pattern Recognition Gaussian Mixtures and Neural Nets Singing

More information

O verv iew on Con trol Stra teg ies of Brushless D oubly - Fed M ach ines. L IU Hang - hang, HAN L i

O verv iew on Con trol Stra teg ies of Brushless D oubly - Fed M ach ines. L IU Hang - hang, HAN L i 2010 6 echnica l review, (, 400044) :,,, : ; ; : M 343:A :1004-7018( 2010) 06-0069 - 05 O verv iew on Con trol Stra teg ies of Brushless D oubly - Fed M ach ines L IU Hang - hang, HAN L i ( Chongqing University,

More information

Recent Developments in Statistical Dialogue Systems

Recent Developments in Statistical Dialogue Systems Recent Developments in Statistical Dialogue Systems Steve Young Machine Intelligence Laboratory Information Engineering Division Cambridge University Engineering Department Cambridge, UK Contents Review

More information

Lecture 5: GMM Acoustic Modeling and Feature Extraction

Lecture 5: GMM Acoustic Modeling and Feature Extraction CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 5: GMM Acoustic Modeling and Feature Extraction Original slides by Dan Jurafsky Outline for Today Acoustic

More information

PHONEME CLASSIFICATION OVER THE RECONSTRUCTED PHASE SPACE USING PRINCIPAL COMPONENT ANALYSIS

PHONEME CLASSIFICATION OVER THE RECONSTRUCTED PHASE SPACE USING PRINCIPAL COMPONENT ANALYSIS PHONEME CLASSIFICATION OVER THE RECONSTRUCTED PHASE SPACE USING PRINCIPAL COMPONENT ANALYSIS Jinjin Ye jinjin.ye@mu.edu Michael T. Johnson mike.johnson@mu.edu Richard J. Povinelli richard.povinelli@mu.edu

More information

China Academic Journal Electronic Publishing House. All rights reserved JOURNAL OF NATURAL RESOURCES Aug, 2009

China Academic Journal Electronic Publishing House. All rights reserved JOURNAL OF NATURAL RESOURCES Aug, 2009 24 8 Vol124 No18 20098 JOURNAL OF NATURAL RESOURCES Aug, 2009, (, 710062) : 19962006,,, 15,:,,,,,,, : ; ; ; : F29111: A : 1000-3037 (2009) 08-1378 - 08 1 1978 1719%20064319%, 0193 [ 1 ] 20 90,, 19962002

More information

Boundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks

Boundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks INTERSPEECH 2014 Boundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks Ryu Takeda, Naoyuki Kanda, and Nobuo Nukaga Central Research Laboratory, Hitachi Ltd., 1-280, Kokubunji-shi,

More information

Rasch , 40 (9) : ,,, ,,,, B A cta Psychologica S in ica DO I: /SP. J

Rasch , 40 (9) : ,,, ,,,, B A cta Psychologica S in ica DO I: /SP. J 2008, 40 (9) : 1030 1040 A cta Psychologica S in ica DO I: 10. 3724 /SP. J. 1041. 2008. 01030 Rasch 3 1 2 ( 1,,100875) ( 2 Kennedy School of Government, Harvard University, MA 02138, USA) Rasch, 66,,,,

More information

End-to-end Automatic Speech Recognition

End-to-end Automatic Speech Recognition End-to-end Automatic Speech Recognition Markus Nussbaum-Thom IBM Thomas J. Watson Research Center Yorktown Heights, NY 10598, USA Markus Nussbaum-Thom. February 22, 2017 Nussbaum-Thom: IBM Thomas J. Watson

More information

Model-Based Margin Estimation for Hidden Markov Model Learning and Generalization

Model-Based Margin Estimation for Hidden Markov Model Learning and Generalization 1 2 3 4 5 6 7 8 Model-Based Margin Estimation for Hidden Markov Model Learning and Generalization Sabato Marco Siniscalchi a,, Jinyu Li b, Chin-Hui Lee c a Faculty of Engineering and Architecture, Kore

More information

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement

A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,

More information

Table of C on t en t s Global Campus 21 in N umbe r s R e g ional Capac it y D e v e lopme nt in E-L e ar ning Structure a n d C o m p o n en ts R ea

Table of C on t en t s Global Campus 21 in N umbe r s R e g ional Capac it y D e v e lopme nt in E-L e ar ning Structure a n d C o m p o n en ts R ea G Blended L ea r ni ng P r o g r a m R eg i o na l C a p a c i t y D ev elo p m ent i n E -L ea r ni ng H R K C r o s s o r d e r u c a t i o n a n d v e l o p m e n t C o p e r a t i o n 3 0 6 0 7 0 5

More information

shhgs@wgqqh.com chinapub 2002 7 Bruc Eckl 1000 7 Bruc Eckl 1000 Th gnsis of th computr rvolution was in a machin. Th gnsis of our programming languags thus tnds to look lik that Bruc machin. 10 7 www.wgqqh.com/shhgs/tij.html

More information

= (, ) V λ (1) λ λ ( + + ) P = [ ( ), (1)] ( ) ( ) = ( ) ( ) ( 0 ) ( 0 ) = ( 0 ) ( 0 ) 0 ( 0 ) ( ( 0 )) ( ( 0 )) = ( ( 0 )) ( ( 0 )) ( + ( 0 )) ( + ( 0 )) = ( + ( 0 )) ( ( 0 )) P V V V V V P V P V V V

More information

( Stationary wavelet transform, SW T) [ 5 ]

( Stationary wavelet transform, SW T) [ 5 ] 123 20106 JOURAL OF GEO2IFORATIO SC IECE Vol112, o13 Jun1, 2010, 3 (, 350108;, 350108) :,, : allat (DW T) trous ( SW T) (SCT),IHS PCA IKOOS,,,,, DW T SW T SCT ; DW T SW T SCT IHS PCA,, IHS PCA,, SCT PCA,

More information

Dynamic Time-Alignment Kernel in Support Vector Machine

Dynamic Time-Alignment Kernel in Support Vector Machine Dynamic Time-Alignment Kernel in Support Vector Machine Hiroshi Shimodaira School of Information Science, Japan Advanced Institute of Science and Technology sim@jaist.ac.jp Mitsuru Nakai School of Information

More information

Mixtures of Gaussians with Sparse Regression Matrices. Constantinos Boulis, Jeffrey Bilmes

Mixtures of Gaussians with Sparse Regression Matrices. Constantinos Boulis, Jeffrey Bilmes Mixtures of Gaussians with Sparse Regression Matrices Constantinos Boulis, Jeffrey Bilmes {boulis,bilmes}@ee.washington.edu Dept of EE, University of Washington Seattle WA, 98195-2500 UW Electrical Engineering

More information

Lecture 10. Discriminative Training, ROVER, and Consensus. Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen

Lecture 10. Discriminative Training, ROVER, and Consensus. Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen Lecture 10 Discriminative Training, ROVER, and Consensus Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen IBM T.J. Watson Research Center Yorktown Heights, New York, USA {picheny,bhuvana,stanchen}@us.ibm.com

More information

Model-Based Approaches to Robust Speech Recognition

Model-Based Approaches to Robust Speech Recognition Model-Based Approaches to Robust Speech Recognition Mark Gales with Hank Liao, Rogier van Dalen, Chris Longworth (work partly funded by Toshiba Research Europe Ltd) 11 June 2008 King s College London Seminar

More information

M itchelson R L , (Wolfson index) ( Tsui - W ang index) : ; : : ( ) :,, E - mail: edu.

M itchelson R L , (Wolfson index) ( Tsui - W ang index) : ; : : ( ) :,, E - mail: edu. 2 8 6 28 No. 6 Vol. 2 0 0 8 1 2 SC IENTIA GEOGRAPH ICA SIN ICA Dec., 2 0 0 8 1, 2, 3 (1., 100101; 2., 100049; 3.,, 130024) : ; 1990, ;, - ;, - : ; ; ; ; : F119. 9 : A : 1000-0690 (2008) 06-0722 - 07, M

More information

Global SNR Estimation of Speech Signals using Entropy and Uncertainty Estimates from Dropout Networks

Global SNR Estimation of Speech Signals using Entropy and Uncertainty Estimates from Dropout Networks Interspeech 2018 2-6 September 2018, Hyderabad Global SNR Estimation of Speech Signals using Entropy and Uncertainty Estimates from Dropout Networks Rohith Aralikatti, Dilip Kumar Margam, Tanay Sharma,

More information

1., X37 A (2009)

1., X37 A (2009) , Journal of Anhui Agri. Sci. 2009, 37 (35) : 17649-17652, 17691 1, 2, 1, 3, 4, 1, 3 (1.,100101; 2.,100101; 3.,100049; 4.,100875),, CO 2 10,4, ;; CENTURY; RothC; DNDC X37 A 0517-6611 (2009) 35-17649 -

More information

c. What is the average rate of change of f on the interval [, ]? Answer: d. What is a local minimum value of f? Answer: 5 e. On what interval(s) is f

c. What is the average rate of change of f on the interval [, ]? Answer: d. What is a local minimum value of f? Answer: 5 e. On what interval(s) is f Essential Skills Chapter f ( x + h) f ( x ). Simplifying the difference quotient Section. h f ( x + h) f ( x ) Example: For f ( x) = 4x 4 x, find and simplify completely. h Answer: 4 8x 4 h. Finding the

More information

Compound rotor position self2sen sing method of PM SM

Compound rotor position self2sen sing method of PM SM 2 5 289 EL ECTR ICMACH IN ESANDCON TROL Vol2 No5 Sep. 28, 2,,,, (., 4; 2., 255) :,,,,, : ; ; ; ; : TM34 : A : 7 449X (28) 5 498 6 Compound rotor position self2sen sing method of PM SM HOU L i2m in, 2,

More information

Upper Bound Kullback-Leibler Divergence for Hidden Markov Models with Application as Discrimination Measure for Speech Recognition

Upper Bound Kullback-Leibler Divergence for Hidden Markov Models with Application as Discrimination Measure for Speech Recognition Upper Bound Kullback-Leibler Divergence for Hidden Markov Models with Application as Discrimination Measure for Speech Recognition Jorge Silva and Shrikanth Narayanan Speech Analysis and Interpretation

More information

An Evolutionary Programming Based Algorithm for HMM training

An Evolutionary Programming Based Algorithm for HMM training An Evolutionary Programming Based Algorithm for HMM training Ewa Figielska,Wlodzimierz Kasprzak Institute of Control and Computation Engineering, Warsaw University of Technology ul. Nowowiejska 15/19,

More information

Discriminative training of GMM-HMM acoustic model by RPCL type Bayesian Ying-Yang harmony learning

Discriminative training of GMM-HMM acoustic model by RPCL type Bayesian Ying-Yang harmony learning Discriminative training of GMM-HMM acoustic model by RPCL type Bayesian Ying-Yang harmony learning Zaihu Pang 1, Xihong Wu 1, and Lei Xu 1,2 1 Speech and Hearing Research Center, Key Laboratory of Machine

More information

ACS Introduction to NLP Lecture 2: Part of Speech (POS) Tagging

ACS Introduction to NLP Lecture 2: Part of Speech (POS) Tagging ACS Introduction to NLP Lecture 2: Part of Speech (POS) Tagging Stephen Clark Natural Language and Information Processing (NLIP) Group sc609@cam.ac.uk The POS Tagging Problem 2 England NNP s POS fencers

More information

I zm ir I nstiute of Technology CS Lecture Notes are based on the CS 101 notes at the University of I llinois at Urbana-Cham paign

I zm ir I nstiute of Technology CS Lecture Notes are based on the CS 101 notes at the University of I llinois at Urbana-Cham paign I zm ir I nstiute of Technology CS - 1 0 2 Lecture 1 Lecture Notes are based on the CS 101 notes at the University of I llinois at Urbana-Cham paign I zm ir I nstiute of Technology W hat w ill I learn

More information

GMM-Based Speech Transformation Systems under Data Reduction

GMM-Based Speech Transformation Systems under Data Reduction GMM-Based Speech Transformation Systems under Data Reduction Larbi Mesbahi, Vincent Barreaud, Olivier Boeffard IRISA / University of Rennes 1 - ENSSAT 6 rue de Kerampont, B.P. 80518, F-22305 Lannion Cedex

More information

Proc. of NCC 2010, Chennai, India

Proc. of NCC 2010, Chennai, India Proc. of NCC 2010, Chennai, India Trajectory and surface modeling of LSF for low rate speech coding M. Deepak and Preeti Rao Department of Electrical Engineering Indian Institute of Technology, Bombay

More information

A L A BA M A L A W R E V IE W

A L A BA M A L A W R E V IE W A L A BA M A L A W R E V IE W Volume 52 Fall 2000 Number 1 B E F O R E D I S A B I L I T Y C I V I L R I G HT S : C I V I L W A R P E N S I O N S A N D TH E P O L I T I C S O F D I S A B I L I T Y I N

More information

M odeling and simulation of power assembly for single2axle para llel hybr id electr ic veh icles

M odeling and simulation of power assembly for single2axle para llel hybr id electr ic veh icles 13 1 2009 11 EL ECTR IC MACH IN ES AND CON TROL Vol113 Supp l. 1 Nov. 2009,, (, 150040) :, Insight,,, ADV ISOR ( advanced vehicle simulator),,, :, : ; ; ; ADV ISOR : U 469. 72 : A : 1007-449X (2009) 1-0036-

More information

Usually the estimation of the partition function is intractable and it becomes exponentially hard when the complexity of the model increases. However,

Usually the estimation of the partition function is intractable and it becomes exponentially hard when the complexity of the model increases. However, Odyssey 2012 The Speaker and Language Recognition Workshop 25-28 June 2012, Singapore First attempt of Boltzmann Machines for Speaker Verification Mohammed Senoussaoui 1,2, Najim Dehak 3, Patrick Kenny

More information

Monaural speech separation using source-adapted models

Monaural speech separation using source-adapted models Monaural speech separation using source-adapted models Ron Weiss, Dan Ellis {ronw,dpwe}@ee.columbia.edu LabROSA Department of Electrical Enginering Columbia University 007 IEEE Workshop on Applications

More information

On the Influence of the Delta Coefficients in a HMM-based Speech Recognition System

On the Influence of the Delta Coefficients in a HMM-based Speech Recognition System On the Influence of the Delta Coefficients in a HMM-based Speech Recognition System Fabrice Lefèvre, Claude Montacié and Marie-José Caraty Laboratoire d'informatique de Paris VI 4, place Jussieu 755 PARIS

More information

[ 4 ], [ 13 ], [ 3 ] [ 5 ] [ 7 ] China Academic Journal Electronic Publishing House. All rights reserved.

[ 4 ], [ 13 ], [ 3 ] [ 5 ] [ 7 ] China Academic Journal Electronic Publishing House. All rights reserved. 9 JOURAL OF V IBRATIO AD SHOCK Vol. 9 o. 010,, (,, 3007 :,,,,,, : ; ; : O3; TB535: A,,,,,,,,,,, [ 1-6 ], [ 9, 10 ],, [ 11, 1 ], 1,,[ 1 ], [ ] [ 3 ] [ ],, [ 4 ], [ 5 ] [ 4 ], [ 6 ], [ 7 ],, [ 8 ],[ 4 ]

More information

4A (Automatized A t2 mospheric Absorp tion A tlas) , 4A, NOVELTIS Laboratoire de. MetOp 4A /OP 3 IASI, AR ID LAND GEOGRAPHY Jan.

4A (Automatized A t2 mospheric Absorp tion A tlas) , 4A, NOVELTIS Laboratoire de. MetOp 4A /OP 3 IASI, AR ID LAND GEOGRAPHY Jan. 33 1 2010 1 33 No. 1 Vol. AR ID LAND GEOGRAPHY Jan. 2010 1, 2, 1, 3 (1, 100190; 2, 100049; 3, 100101) : (RBF),,, 9 m 10 m 12 m, 4A 100,, : ; ; : TP732. 2 : A : 1000-6060 (2010) 01-0099 - 07 (99 105),,,,

More information

A Direct Criterion Minimization based fmllr via Gradient Descend

A Direct Criterion Minimization based fmllr via Gradient Descend A Direct Criterion Minimization based fmllr via Gradient Descend Jan Vaněk and Zbyněk Zajíc University of West Bohemia in Pilsen, Univerzitní 22, 306 14 Pilsen Faculty of Applied Sciences, Department of

More information

QUATERNARY SC IENCES

QUATERNARY SC IENCES 28 4 20087 QUATERNARYSC IENCES Vol. 28, No. 4 July, 2008 1001-7410 (2008) 04-535 - 09 3 (, 100101;,, 730000),, 100,,,,,, ( < 200m) (200500m) (5001000m ) (10002500m)( > 2500m)7,, 1 500000 1 1000000 (DTM

More information

INFRARED TARGET EXTRACTION ALGORITHM BY USING PARTICLE SWARM OPTIM IZATION PARTICLE FILTER

INFRARED TARGET EXTRACTION ALGORITHM BY USING PARTICLE SWARM OPTIM IZATION PARTICLE FILTER 29 1 20102 J. Infrared M illim. W aves Vol. 29, o. 1 February, 2010 : 1001-9014 (2010) 01-0063 - 06 1, 2 (1., 200240; 2., 200233) : ( PSOPF),.,,,,.,.,. : ;; ; : TP391. 4: A IFRARD TART XTRACTIO ALORITHM

More information

FACTORIAL HMMS FOR ACOUSTIC MODELING. Beth Logan and Pedro Moreno

FACTORIAL HMMS FOR ACOUSTIC MODELING. Beth Logan and Pedro Moreno ACTORIAL HMMS OR ACOUSTIC MODELING Beth Logan and Pedro Moreno Cambridge Research Laboratories Digital Equipment Corporation One Kendall Square, Building 700, 2nd loor Cambridge, Massachusetts 02139 United

More information

FEATURE SELECTION USING FISHER S RATIO TECHNIQUE FOR AUTOMATIC SPEECH RECOGNITION

FEATURE SELECTION USING FISHER S RATIO TECHNIQUE FOR AUTOMATIC SPEECH RECOGNITION FEATURE SELECTION USING FISHER S RATIO TECHNIQUE FOR AUTOMATIC SPEECH RECOGNITION Sarika Hegde 1, K. K. Achary 2 and Surendra Shetty 3 1 Department of Computer Applications, NMAM.I.T., Nitte, Karkala Taluk,

More information

Hierarchical Multi-Stream Posterior Based Speech Recognition System

Hierarchical Multi-Stream Posterior Based Speech Recognition System Hierarchical Multi-Stream Posterior Based Speech Recognition System Hamed Ketabdar 1,2, Hervé Bourlard 1,2 and Samy Bengio 1 1 IDIAP Research Institute, Martigny, Switzerland 2 Ecole Polytechnique Fédérale

More information

, kw, kw 3176%,, JOURNAL OF NATURAL RESOURCES Aug., , : F42612 : A : (2009)

, kw, kw 3176%,, JOURNAL OF NATURAL RESOURCES Aug., , : F42612 : A : (2009) 24 8 Vol124 No18 2009 8 JOURNAL OF NATURAL RESOURCES Aug., 2009 1, 2, 13 (11, 100101; 21, 100049) : 1997 2006 10,,,, 2006 GDP 527111 1 195120 5 745182, 3,,,,, 4 : : ; ; ; : F42612 : A: 1000-3037 (2009)

More information

Generalized Cyclic Transformations in Speaker-Independent Speech Recognition

Generalized Cyclic Transformations in Speaker-Independent Speech Recognition Generalized Cyclic Transformations in Speaker-Independent Speech Recognition Florian Müller 1, Eugene Belilovsky, and Alfred Mertins Institute for Signal Processing, University of Lübeck Ratzeburger Allee

More information

A Comparative Study of Histogram Equalization (HEQ) for Robust Speech Recognition

A Comparative Study of Histogram Equalization (HEQ) for Robust Speech Recognition Computational Linguistics and Chinese Language Processing Vol. 12, No. 2, June 2007, pp. 217-238 217 The Association for Computational Linguistics and Chinese Language Processing A Comparative Study of

More information

A NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY

A NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY A NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY Amir Hossein Harati Nead Torbati and Joseph Picone College of Engineering, Temple University Philadelphia, Pennsylvania, USA

More information

WaveNet: A Generative Model for Raw Audio

WaveNet: A Generative Model for Raw Audio WaveNet: A Generative Model for Raw Audio Ido Guy & Daniel Brodeski Deep Learning Seminar 2017 TAU Outline Introduction WaveNet Experiments Introduction WaveNet is a deep generative model of raw audio

More information

Shankar Shivappa University of California, San Diego April 26, CSE 254 Seminar in learning algorithms

Shankar Shivappa University of California, San Diego April 26, CSE 254 Seminar in learning algorithms Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models. Say Wei Foo, Yong Lian, Liang Dong. IEEE Transactions on Circuits and Systems for Video Technology, May 2004. Shankar

More information

Dept. of Linguistics, Indiana University Fall 2009

Dept. of Linguistics, Indiana University Fall 2009 1 / 14 Markov L645 Dept. of Linguistics, Indiana University Fall 2009 2 / 14 Markov (1) (review) Markov A Markov Model consists of: a finite set of statesω={s 1,...,s n }; an signal alphabetσ={σ 1,...,σ

More information

( name, ), 1 ( a), (p lay2scrip t) ( act) ( b),

( name, ), 1 ( a), (p lay2scrip t) ( act) ( b), 79 3 ( name, ),,, :,,,,, ;,,,, : 1 ( a), :, :,,, ( b), :, :,,, 1 ( a) ( b), :, ( ),, (p lay2scrip t),,,,,, ( act), ( scene),,, 1 ( a) ( b), 1 ( a) ( b),,, ( ) 3 ( 07BZX047) 80 2010 1,,,,,,,,, : 1.,,,,,

More information

Pattern Classification

Pattern Classification Pattern Classification Introduction Parametric classifiers Semi-parametric classifiers Dimensionality reduction Significance testing 6345 Automatic Speech Recognition Semi-Parametric Classifiers 1 Semi-Parametric

More information

COMPILATION OF AUTOMATA FROM MORPHOLOGICAL TWO-LEVEL RULES

COMPILATION OF AUTOMATA FROM MORPHOLOGICAL TWO-LEVEL RULES Kimmo Koskenniemi Re se ar ch Unit for Co mp ut at io na l Li ng ui st ic s University of Helsinki, Hallituskatu 11 SF-00100 Helsinki, Finland COMPILATION OF AUTOMATA FROM MORPHOLOGICAL TWO-LEVEL RULES

More information

Hidden Markov Models. Dr. Naomi Harte

Hidden Markov Models. Dr. Naomi Harte Hidden Markov Models Dr. Naomi Harte The Talk Hidden Markov Models What are they? Why are they useful? The maths part Probability calculations Training optimising parameters Viterbi unseen sequences Real

More information

Double closed2control of active filter using repetitive algorithm

Double closed2control of active filter using repetitive algorithm 13 1 200911 EL ECTR ICMACH IN ESANDCON TROL Vol113 Supp l. 1 Nov. 2009,, (, 410076) :,,,,,,,6118% 516%, 01770198 : ; ; : TP 273 : A : 1007-449X (2009)1-0067- 05 Double closed2control of active filter using

More information

Symmetric Distortion Measure for Speaker Recognition

Symmetric Distortion Measure for Speaker Recognition ISCA Archive http://www.isca-speech.org/archive SPECOM 2004: 9 th Conference Speech and Computer St. Petersburg, Russia September 20-22, 2004 Symmetric Distortion Measure for Speaker Recognition Evgeny

More information

Detection-Based Speech Recognition with Sparse Point Process Models

Detection-Based Speech Recognition with Sparse Point Process Models Detection-Based Speech Recognition with Sparse Point Process Models Aren Jansen Partha Niyogi Human Language Technology Center of Excellence Departments of Computer Science and Statistics ICASSP 2010 Dallas,

More information

OH BOY! Story. N a r r a t iv e a n d o bj e c t s th ea t e r Fo r a l l a g e s, fr o m th e a ge of 9

OH BOY! Story. N a r r a t iv e a n d o bj e c t s th ea t e r Fo r a l l a g e s, fr o m th e a ge of 9 OH BOY! O h Boy!, was or igin a lly cr eat ed in F r en ch an d was a m a jor s u cc ess on t h e Fr en ch st a ge f or young au di enc es. It h a s b een s een by ap pr ox i ma t ely 175,000 sp ect at

More information

BLACK BOX OPTIMIZATION FOR AUTOMATIC SPEECH RECOGNITION. Shinji Watanabe and Jonathan Le Roux

BLACK BOX OPTIMIZATION FOR AUTOMATIC SPEECH RECOGNITION. Shinji Watanabe and Jonathan Le Roux 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) BLACK BOX OPTIMIZATION FOR AUTOMATIC SPEECH RECOGNITION Shinji Watanabe and Jonathan Le Roux Mitsubishi Electric Research

More information

ASPEAKER independent speech recognition system has to

ASPEAKER independent speech recognition system has to 930 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 13, NO. 5, SEPTEMBER 2005 Vocal Tract Normalization Equals Linear Transformation in Cepstral Space Michael Pitz and Hermann Ney, Member, IEEE

More information

Electron ic pole changing techn ique of multi2phase induction motor

Electron ic pole changing techn ique of multi2phase induction motor 3 3 95 EL ECTR ICMACH IN ESANDCON TROL Vol3 No3 May 9,,,,,, (., 37;., 33) :,,,,,, 939,, : ; ; ; : TM3 : A : 7-449X (9) 3-3- 5 Electron ic pole changing techn ique of multiphase induction motor YANG J iaqiang,

More information

Lecture 3: ASR: HMMs, Forward, Viterbi

Lecture 3: ASR: HMMs, Forward, Viterbi Original slides by Dan Jurafsky CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 3: ASR: HMMs, Forward, Viterbi Fun informative read on phonetics The

More information