The Electron ic PSC Testing System
|
|
- Stanley Hopkins
- 5 years ago
- Views:
Transcription
1 20 6 JO URAL O F CH IESE IFO RM AT IO PROCESS IG Vol120 o16 : (2006) ,,, (, ) : 100,,, 500,, (2144)(2130) :; ;; ; ; : TP391: A The Electron ic PSC Testing System W E I Si, L IU Q ing2sheng, HU Yu, WAG Ren2hua ( Man Machine Voice Communication Laboratory, University of Science&Technology of China, Hefei, Anhui , China) Abstract: This paper develop s an automatic PSC testing system aim ing at efficiently evaluating the spoken Chinese. On the basis of 100 hoursstandard Chinese database, this paper uses the characteristic of Chinese and linguist s expert knowledge to op tim ize the traditional speech evaluation algorithm. A t the same time, a corpus2adap tive method is p ro2 pose to enhance the robustness and performance of the algorithm. Experiments on 500 personspsc testing database p rove that the new algorithm is much better than the original algorithm. After linear mapp ing, the error between the machine score and the human score is almost equal to the error between humans, that is The result indicates that the automatic PSC testing system can rep lace the human to evaluating spoken Chinese under text2dependent con2 dition. Key word: computer app lication; Chinese information p rocessing; Putonghua shuip ing ceshi; p ronunciation evalua2 tion; PSC testing database, automatic testing system. 1,,,,,, 100,,,, : : : ( ZD I105 - B02) : (1982),,,. 89
2 ( SR I) V ILT [ 1, 2 ], SR I SC ILL [ 3, 4 ], V ICK [ 5, 6 ],,, [ 9, 10 ],,,,,,,,,,,,(2144) (2130),,, 2,, 1 1 1,,,, 90
3 3 311,,,,,, 16K, 16bit / 4500/ 60/( 400) 3 /, 100,,,, 312,,,,, 16K, 16bit % 71% 23%,,, 3, 313 ( ),,264,236 3 A, B { S i, i = 1, 2,, n},(1) : [ (S A i - S A ) (S B i - S B ) ] (S A i - S A ) 2 [ (S B i - S B ) 2 (1) 91
4 , S A i A i, S B i B i, S A A, S B B 4 ( )/ (110, 010) / (110, 010) (0191, 1188) / ( 0190, 1197) (0188, 2154) / ( 0189, 2147) (0191, 1188) / ( 0190, 1197) (110, 010) / (110, 010) (0191, 2119) / ( 0189, 2147) (0190, 2120) / (0189, 2130) (0188, 2154) / ( 0189, 2147) (0191, 2119) / ( 0189, 2147) (110, 010) / (110, 010) 4,018,3 4,, 411 HMM, 25m s, 10m s MFCC,39 HMM, TO P O T, P O T O HMM T,HMM [ 13, 5 ] O TP T O,, P P ( T O ) T O (2) [ 3 ] = ( log ( P ( T i O ( T i ) ) ) / F ( T i ) ) / = ( log ( P (O ( T i ) T i ) P ( T i ) qq P (O ( T i ) q) p ( q) ) / F ( T i ) ) / (2) P (O ( T i ) T i ) ( log ( max qq P (O ( T i ) q) ) / F ( T i ) ) /, Q, qt i, F ( T i )T i,, P (O ( T i ) T i ) T i O ( T i ), : 0158, (2),,
5 ,,,,,, (3) : P ( T O ) = ( log ( P ( T i O ( T i ) ) ) / F ( T i ) ) / = ( log ( P (O ( T i ) T i ) P ( T i ) T qq i error P (O ( T i ) q) p ( q) ) / F ( T i ) ) / (3) P (O ( T i ) T i ) ( log ( max T qq i P (O ( T i ) q) ) / F ( T i ) ) / error (3) (2), ( 2), (3) [ 7 ], 41212, [ 8 ],(4) G sen t = G i / G i = G i in itia l + G i fina l G sent, G i i G i in itia l i G i fina l i,,, (5) G sen t = i / G G i = G i in itia l (1 + D u ri f ina l D u r i in itia l CO EF) + G i fina l G sen t, G i i, D ur i fina l i, D ur i in itia l i CO EF,, CO EF,, 41213,,,, MLLR (Maximum L ikelihood L ine2 ar Regression) [ 11 ], MLLR,,,,,,, : (4) (5) 93
6 ,, T i HTKHMM,,, (6), T i THR ESH i T i < THR ESH i THR ESH,,, MLLR, 5, 511 (3),,,, HMM,, 5 5 / 0165 / / (6) 5,,,,,, [ 8 ],, 6, HMM, /, / /0177 6,,[ 8 ] ,7 7, / HMM, 0177 / / /
7 7,, 514,,, 8: HMM,,, 8 8 (/) + + VS 0165 / /0181 VS 0190 /0189 8,, 6,,,,,,,, :,, (7) S core m ach ine S core m ach ine S core m ach ine = 3 = 3 = 3 1i P ( o i ) + S core 4 C = 2i P ( o i ) + S core 4 C = 3i P ( o i ) + S core 4 C, P ( o i )i,, 1 i, Score 4, C, Score m ach ine 9 9 ( )/ + + VS (0183, - ) / (0181, - ) (0195, 1128) / (0184, 2144) VS (0190, 2120) / (0189, 2130) 9,,,, (7) 95
8 ,, (2144)(2130) 7,,,,,,,, 0165 /0161 (/, )0183 /0181,,, 0195 /0184,1128 /2144, 0190 / /2130,,,,,,, : [ 1 ] H. L. Franco, L. eumeyer, Y. Kim, O. Ronen. Automatic p ronunciation scoring for language instruction[a ]. ICASSP[ C ], 1997, [ 2 ] L. eumeyer, H. Franco, V. D igalakis, M. W eintraub. Automatic scoring of p ronunciation quality. Speech Communication 30 [ J ], 2000, [ 3 ] S. M. W itt, S. J. Young. Phone2level p ronunciation scoring and assessment for interactive language learning [A ]. In: Speech Communication 30, 2000, [ 4 ] S. M. W itt, U se of speech recognition in computer2assisted language learning, Doctor s D issertation of Cam2 bridge[d ], [ 5 ] C. Cucchiarini, F. D. W et, H. Strik, L. Boves. A ssessment of Dutch p ronunciation by means of automatic speech recognition technology[a ]. ICSLP, Vol. 5 [ C ], 1998, [ 6 ] C. Cucchiarini, H. Strik, L. Boves. Automatic evaluation of dutch p ronunciation by using speech recognition technology[a ]. Proceedings of the IEEE workshop ASRU [ C ], Santa Barbara. 1997, [ 7 ] A ijun L i, Xia W ang, A Contrastive Investigation of Standard Mandarin and Accented [A ]. EuroSpeech [ C ], 2003, [ 8 ],,,. [A ]. [ C ], 2005, [ 9 ],. [A ]. [ J ], 1998, [ 10 ],,. [A ]. [ C ], 2005, [ 11 ] C. J. Leggetter, P. C. Woodland, Maximum L ikelihood L inear Regression for Speaker Adap tation of Contin2 uous Density H idden M arkov Models, Computer Speech and Language[ J ], 1995,
Presented By: Omer Shmueli and Sivan Niv
Deep Speaker: an End-to-End Neural Speaker Embedding System Chao Li, Xiaokong Ma, Bing Jiang, Xiangang Li, Xuewei Zhang, Xiao Liu, Ying Cao, Ajay Kannan, Zhenyao Zhu Presented By: Omer Shmueli and Sivan
More informationChinese Journal of Scientific Instrument. High frequency we ighted M FCC extraction for noise robust speaker ver if ication
29 3 20083 Chinese Journal of Scientific Instrument Vol129 No13 Mar. 2008 M FCC 1, 1, 2 (1 400044; 2 400044) : MFCC Mel,,,,, MFCC,,, : ; ; ; ; MFCC : TP192. 3 : A: 520. 2040 High frequency we ighted M
More informationHidden Markov Model and Speech Recognition
1 Dec,2006 Outline Introduction 1 Introduction 2 3 4 5 Introduction What is Speech Recognition? Understanding what is being said Mapping speech data to textual information Speech Recognition is indeed
More informationA discussion on methodologies for research into complex system s
4 1 Vol. 4. 1 2009 2 CAA I Transactions on Intelligent System s Feb. 2009 1, 2, 3, 2, 4 (1., 610074; 2., 610500; 3., 610500; 4., 618000) : 20 80,,,, 6 : ; ; Agent ; Swarm; StarLogo : N94, TP273 : A : 167324785
More informationJournal of Beijing University of Aeronautics and A stronautics PCNN, PCNN. Nove l adap tive deno ising m e thod fo r extrem e no ise ba sed on PCNN
2009 1 35 1 Journal of Beijing University of Aeronautics and A stronautics January 2009 Vol. 35 No11 PCNN (, 100191) : PCNN ( Pulse Coup led Neural Network) ADEN (Adapative Denosing method for Extreme
More informationStudy on disturbance torques compensation in high precise servo turn table control system
13 4 20097 EL ECTR ICMACH IN ESANDCON TROL Vol113 No14 July 2009,, (, 150001) :, PD,, Lyapunov, ;,, : ; ; ; Lyapunov; : TP 273 : A : 1007-449X (2009) 04-0615- 05 Study on disturbance torques compensation
More informationDeep Learning for Speech Recognition. Hung-yi Lee
Deep Learning for Speech Recognition Hung-yi Lee Outline Conventional Speech Recognition How to use Deep Learning in acoustic modeling? Why Deep Learning? Speaker Adaptation Multi-task Deep Learning New
More informationN-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition
2010 11 5 N-gram N-gram Language Model for Large-Vocabulary Continuous Speech Recognition 1 48-106413 Abstract Large-Vocabulary Continuous Speech Recognition(LVCSR) system has rapidly been growing today.
More informationExperiments with a Gaussian Merging-Splitting Algorithm for HMM Training for Speech Recognition
Experiments with a Gaussian Merging-Splitting Algorithm for HMM Training for Speech Recognition ABSTRACT It is well known that the expectation-maximization (EM) algorithm, commonly used to estimate hidden
More informationGeoffrey Zweig May 7, 2009
Geoffrey Zweig May 7, 2009 Taxonomy of LID Techniques LID Acoustic Scores Derived LM Vector space model GMM GMM Tokenization Parallel Phone Rec + LM Vectors of phone LM stats [Carrasquillo et. al. 02],
More informationAutomatic Speech Recognition (CS753)
Automatic Speech Recognition (CS753) Lecture 21: Speaker Adaptation Instructor: Preethi Jyothi Oct 23, 2017 Speaker variations Major cause of variability in speech is the differences between speakers Speaking
More informationFeature-Space Structural MAPLR with Regression Tree-based Multiple Transformation Matrices for DNN
MITSUBISHI ELECTRIC RESEARCH LABORATORIES http://www.merl.com Feature-Space Structural MAPLR with Regression Tree-based Multiple Transformation Matrices for DNN Kanagawa, H.; Tachioka, Y.; Watanabe, S.;
More informationRobust Sound Event Detection in Continuous Audio Environments
Robust Sound Event Detection in Continuous Audio Environments Haomin Zhang 1, Ian McLoughlin 2,1, Yan Song 1 1 National Engineering Laboratory of Speech and Language Information Processing The University
More informationJOURNAL OF NATURAL RESOURCES Mar., 2008 : X24 : A : (2008) : ; : : ( )
23 2 Vol123 No12 2008 3 JOURNAL OF NATURAL RESOURCES Mar., 2008 : (, 100101) :,,,,,,,(),,, : ; ; ; : X24 : A: 1000-3037 (2008) 02-0177 - 08 1,,,,,,,,,,,,,,, 2,, [ 1, 2 ] : 2007-06- 09; : 2007-12- 26 :
More informationMAP adaptation with SphinxTrain
MAP adaptation with SphinxTrain David Huggins-Daines dhuggins@cs.cmu.edu Language Technologies Institute Carnegie Mellon University MAP adaptation with SphinxTrain p.1/12 Theory of MAP adaptation Standard
More informationVol112, No11 Feb1, 2010 JOURNAL OF GEO2INFORMATION SC IENCE , CBERS IRS - P5, ;, : ; : E2mail: lreis1ac1cn [ 6-13 ]
12 1 2010 2 JOURNAL OF GEO2INFORMATION SC IENCE Vol112, No11 Feb1, 2010, 3, (, 100101) :,,,,,, CBERS IRS - P5, ;,, 20m 1km ;,, CBERS, 200m, 76% ; P5, 100m, 84% : ; ; ; 1 [ 4-5 ], GIS, [ 6-13 ], ( 1995
More informationHeeyoul (Henry) Choi. Dept. of Computer Science Texas A&M University
Heeyoul (Henry) Choi Dept. of Computer Science Texas A&M University hchoi@cs.tamu.edu Introduction Speaker Adaptation Eigenvoice Comparison with others MAP, MLLR, EMAP, RMP, CAT, RSW Experiments Future
More informationENHANCEMENTS OF MAXIMUM LIKELIHOOD EIGEN-DECOMPOSITION USING FUZZY LOGIC CONTROL FOR EIGENVOICE-BASED SPEAKER ADAPTATION.
International Journal of Innovative Computing, Information and Control ICIC International c 2011 ISSN 1349-4198 Volume 7, Number 7(B), July 2011 pp. 4207 4222 ENHANCEMENTS OF MAXIMUM LIKELIHOOD EIGEN-DECOMPOSITION
More informationSegmental Recurrent Neural Networks for End-to-end Speech Recognition
Segmental Recurrent Neural Networks for End-to-end Speech Recognition Liang Lu, Lingpeng Kong, Chris Dyer, Noah Smith and Steve Renals TTI-Chicago, UoE, CMU and UW 9 September 2016 Background A new wave
More informationUse precise language and domain-specific vocabulary to inform about or explain the topic. CCSS.ELA-LITERACY.WHST D
Lesson eight What are characteristics of chemical reactions? Science Constructing Explanations, Engaging in Argument and Obtaining, Evaluating, and Communicating Information ENGLISH LANGUAGE ARTS Reading
More informationMixtures of Gaussians with Sparse Structure
Mixtures of Gaussians with Sparse Structure Costas Boulis 1 Abstract When fitting a mixture of Gaussians to training data there are usually two choices for the type of Gaussians used. Either diagonal or
More informationA TWO-LAYER NON-NEGATIVE MATRIX FACTORIZATION MODEL FOR VOCABULARY DISCOVERY. MengSun,HugoVanhamme
A TWO-LAYER NON-NEGATIVE MATRIX FACTORIZATION MODEL FOR VOCABULARY DISCOVERY MengSun,HugoVanhamme Department of Electrical Engineering-ESAT, Katholieke Universiteit Leuven, Kasteelpark Arenberg 10, Bus
More informationFull-covariance model compensation for
compensation transms Presentation Toshiba, 12 Mar 2008 Outline compensation transms compensation transms Outline compensation transms compensation transms Noise model x clean speech; n additive ; h convolutional
More informationM odeling and sim ula ting the forag ing system in multi2source groups w ith random d isturbances
3 4 Vol. 3. 4 20088 CAA I Transactions on Intelligent System s Aug. 2008,, (,150001) :..... 2,,,. Starlogo,.,. : ;; ; Starlogo : TP18 : A: 167324785 (2008) 0420342207 M odeling and sim ula ting the forag
More informationA Low-Cost Robust Front-end for Embedded ASR System
A Low-Cost Robust Front-end for Embedded ASR System Lihui Guo 1, Xin He 2, Yue Lu 1, and Yaxin Zhang 2 1 Department of Computer Science and Technology, East China Normal University, Shanghai 200062 2 Motorola
More informationComparing linear and non-linear transformation of speech
Comparing linear and non-linear transformation of speech Larbi Mesbahi, Vincent Barreaud and Olivier Boeffard IRISA / ENSSAT - University of Rennes 1 6, rue de Kerampont, Lannion, France {lmesbahi, vincent.barreaud,
More informationDominant Feature Vectors Based Audio Similarity Measure
Dominant Feature Vectors Based Audio Similarity Measure Jing Gu 1, Lie Lu 2, Rui Cai 3, Hong-Jiang Zhang 2, and Jian Yang 1 1 Dept. of Electronic Engineering, Tsinghua Univ., Beijing, 100084, China 2 Microsoft
More informationMaximum Likelihood and Maximum A Posteriori Adaptation for Distributed Speaker Recognition Systems
Maximum Likelihood and Maximum A Posteriori Adaptation for Distributed Speaker Recognition Systems Chin-Hung Sit 1, Man-Wai Mak 1, and Sun-Yuan Kung 2 1 Center for Multimedia Signal Processing Dept. of
More informationResults as of 30 September 2018
rt Results as of 30 September 2018 F r e e t r a n s l a t ion f r o m t h e o r ig ina l in S p a n is h. I n t h e e v e n t o f d i s c r e p a n c y, t h e Sp a n i s h - la n g u a g e v e r s ion
More informationSpacec raft au tom a tic te st and spacecraft te st language
2009 11 35 11 Journal of Beijing University of Aeronautics and A stronautics November2009 Vol. 35No111 (, 100191), ;,,,, 4.. ; ; ; ; TP 273 +. 5 A : 100125965 (2009) 1121375204 Spacec raft au tom a tic
More informationWhy DNN Works for Acoustic Modeling in Speech Recognition?
Why DNN Works for Acoustic Modeling in Speech Recognition? Prof. Hui Jiang Department of Computer Science and Engineering York University, Toronto, Ont. M3J 1P3, CANADA Joint work with Y. Bao, J. Pan,
More informationNearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis. Thomas Ewender
Nearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis Thomas Ewender Outline Motivation Detection algorithm of continuous F 0 contour Frame classification algorithm
More informationw h e r e e v e r t h e y live. It is an i n d u s t r i a l i z e d form of t e a c h i n g and
3 b D i s t a n c e E d u c a t i o n - In India S o c i o - L e g a l A n a l y s i s A- D i s t a n c e E d u c a t i o n - C o n c e p t D i s t a n c e T e a c h i n g or E d u c a t i o n is a m e
More informationMulti-level Gaussian selection for accurate low-resource ASR systems
Multi-level Gaussian selection for accurate low-resource ASR systems Leïla Zouari, Gérard Chollet GET-ENST/CNRS-LTCI 46 rue Barrault, 75634 Paris cedex 13, France Abstract For Automatic Speech Recognition
More informationHidden Markov Modelling
Hidden Markov Modelling Introduction Problem formulation Forward-Backward algorithm Viterbi search Baum-Welch parameter estimation Other considerations Multiple observation sequences Phone-based models
More informationON SCALABLE CODING OF HIDDEN MARKOV SOURCES. Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose
ON SCALABLE CODING OF HIDDEN MARKOV SOURCES Mehdi Salehifar, Tejaswi Nanjundaswamy, and Kenneth Rose Department of Electrical and Computer Engineering University of California, Santa Barbara, CA, 93106
More informationRobust Speaker Identification
Robust Speaker Identification by Smarajit Bose Interdisciplinary Statistical Research Unit Indian Statistical Institute, Kolkata Joint work with Amita Pal and Ayanendranath Basu Overview } } } } } } }
More informationPattern Recognition Applied to Music Signals
JHU CLSP Summer School Pattern Recognition Applied to Music Signals 2 3 4 5 Music Content Analysis Classification and Features Statistical Pattern Recognition Gaussian Mixtures and Neural Nets Singing
More informationO verv iew on Con trol Stra teg ies of Brushless D oubly - Fed M ach ines. L IU Hang - hang, HAN L i
2010 6 echnica l review, (, 400044) :,,, : ; ; : M 343:A :1004-7018( 2010) 06-0069 - 05 O verv iew on Con trol Stra teg ies of Brushless D oubly - Fed M ach ines L IU Hang - hang, HAN L i ( Chongqing University,
More informationRecent Developments in Statistical Dialogue Systems
Recent Developments in Statistical Dialogue Systems Steve Young Machine Intelligence Laboratory Information Engineering Division Cambridge University Engineering Department Cambridge, UK Contents Review
More informationLecture 5: GMM Acoustic Modeling and Feature Extraction
CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 5: GMM Acoustic Modeling and Feature Extraction Original slides by Dan Jurafsky Outline for Today Acoustic
More informationPHONEME CLASSIFICATION OVER THE RECONSTRUCTED PHASE SPACE USING PRINCIPAL COMPONENT ANALYSIS
PHONEME CLASSIFICATION OVER THE RECONSTRUCTED PHASE SPACE USING PRINCIPAL COMPONENT ANALYSIS Jinjin Ye jinjin.ye@mu.edu Michael T. Johnson mike.johnson@mu.edu Richard J. Povinelli richard.povinelli@mu.edu
More informationChina Academic Journal Electronic Publishing House. All rights reserved JOURNAL OF NATURAL RESOURCES Aug, 2009
24 8 Vol124 No18 20098 JOURNAL OF NATURAL RESOURCES Aug, 2009, (, 710062) : 19962006,,, 15,:,,,,,,, : ; ; ; : F29111: A : 1000-3037 (2009) 08-1378 - 08 1 1978 1719%20064319%, 0193 [ 1 ] 20 90,, 19962002
More informationBoundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks
INTERSPEECH 2014 Boundary Contraction Training for Acoustic Models based on Discrete Deep Neural Networks Ryu Takeda, Naoyuki Kanda, and Nobuo Nukaga Central Research Laboratory, Hitachi Ltd., 1-280, Kokubunji-shi,
More informationRasch , 40 (9) : ,,, ,,,, B A cta Psychologica S in ica DO I: /SP. J
2008, 40 (9) : 1030 1040 A cta Psychologica S in ica DO I: 10. 3724 /SP. J. 1041. 2008. 01030 Rasch 3 1 2 ( 1,,100875) ( 2 Kennedy School of Government, Harvard University, MA 02138, USA) Rasch, 66,,,,
More informationEnd-to-end Automatic Speech Recognition
End-to-end Automatic Speech Recognition Markus Nussbaum-Thom IBM Thomas J. Watson Research Center Yorktown Heights, NY 10598, USA Markus Nussbaum-Thom. February 22, 2017 Nussbaum-Thom: IBM Thomas J. Watson
More informationModel-Based Margin Estimation for Hidden Markov Model Learning and Generalization
1 2 3 4 5 6 7 8 Model-Based Margin Estimation for Hidden Markov Model Learning and Generalization Sabato Marco Siniscalchi a,, Jinyu Li b, Chin-Hui Lee c a Faculty of Engineering and Architecture, Kore
More informationA Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement
A Variance Modeling Framework Based on Variational Autoencoders for Speech Enhancement Simon Leglaive 1 Laurent Girin 1,2 Radu Horaud 1 1: Inria Grenoble Rhône-Alpes 2: Univ. Grenoble Alpes, Grenoble INP,
More informationTable of C on t en t s Global Campus 21 in N umbe r s R e g ional Capac it y D e v e lopme nt in E-L e ar ning Structure a n d C o m p o n en ts R ea
G Blended L ea r ni ng P r o g r a m R eg i o na l C a p a c i t y D ev elo p m ent i n E -L ea r ni ng H R K C r o s s o r d e r u c a t i o n a n d v e l o p m e n t C o p e r a t i o n 3 0 6 0 7 0 5
More informationshhgs@wgqqh.com chinapub 2002 7 Bruc Eckl 1000 7 Bruc Eckl 1000 Th gnsis of th computr rvolution was in a machin. Th gnsis of our programming languags thus tnds to look lik that Bruc machin. 10 7 www.wgqqh.com/shhgs/tij.html
More information= (, ) V λ (1) λ λ ( + + ) P = [ ( ), (1)] ( ) ( ) = ( ) ( ) ( 0 ) ( 0 ) = ( 0 ) ( 0 ) 0 ( 0 ) ( ( 0 )) ( ( 0 )) = ( ( 0 )) ( ( 0 )) ( + ( 0 )) ( + ( 0 )) = ( + ( 0 )) ( ( 0 )) P V V V V V P V P V V V
More information( Stationary wavelet transform, SW T) [ 5 ]
123 20106 JOURAL OF GEO2IFORATIO SC IECE Vol112, o13 Jun1, 2010, 3 (, 350108;, 350108) :,, : allat (DW T) trous ( SW T) (SCT),IHS PCA IKOOS,,,,, DW T SW T SCT ; DW T SW T SCT IHS PCA,, IHS PCA,, SCT PCA,
More informationDynamic Time-Alignment Kernel in Support Vector Machine
Dynamic Time-Alignment Kernel in Support Vector Machine Hiroshi Shimodaira School of Information Science, Japan Advanced Institute of Science and Technology sim@jaist.ac.jp Mitsuru Nakai School of Information
More informationMixtures of Gaussians with Sparse Regression Matrices. Constantinos Boulis, Jeffrey Bilmes
Mixtures of Gaussians with Sparse Regression Matrices Constantinos Boulis, Jeffrey Bilmes {boulis,bilmes}@ee.washington.edu Dept of EE, University of Washington Seattle WA, 98195-2500 UW Electrical Engineering
More informationLecture 10. Discriminative Training, ROVER, and Consensus. Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen
Lecture 10 Discriminative Training, ROVER, and Consensus Michael Picheny, Bhuvana Ramabhadran, Stanley F. Chen IBM T.J. Watson Research Center Yorktown Heights, New York, USA {picheny,bhuvana,stanchen}@us.ibm.com
More informationModel-Based Approaches to Robust Speech Recognition
Model-Based Approaches to Robust Speech Recognition Mark Gales with Hank Liao, Rogier van Dalen, Chris Longworth (work partly funded by Toshiba Research Europe Ltd) 11 June 2008 King s College London Seminar
More informationM itchelson R L , (Wolfson index) ( Tsui - W ang index) : ; : : ( ) :,, E - mail: edu.
2 8 6 28 No. 6 Vol. 2 0 0 8 1 2 SC IENTIA GEOGRAPH ICA SIN ICA Dec., 2 0 0 8 1, 2, 3 (1., 100101; 2., 100049; 3.,, 130024) : ; 1990, ;, - ;, - : ; ; ; ; : F119. 9 : A : 1000-0690 (2008) 06-0722 - 07, M
More informationGlobal SNR Estimation of Speech Signals using Entropy and Uncertainty Estimates from Dropout Networks
Interspeech 2018 2-6 September 2018, Hyderabad Global SNR Estimation of Speech Signals using Entropy and Uncertainty Estimates from Dropout Networks Rohith Aralikatti, Dilip Kumar Margam, Tanay Sharma,
More information1., X37 A (2009)
, Journal of Anhui Agri. Sci. 2009, 37 (35) : 17649-17652, 17691 1, 2, 1, 3, 4, 1, 3 (1.,100101; 2.,100101; 3.,100049; 4.,100875),, CO 2 10,4, ;; CENTURY; RothC; DNDC X37 A 0517-6611 (2009) 35-17649 -
More informationc. What is the average rate of change of f on the interval [, ]? Answer: d. What is a local minimum value of f? Answer: 5 e. On what interval(s) is f
Essential Skills Chapter f ( x + h) f ( x ). Simplifying the difference quotient Section. h f ( x + h) f ( x ) Example: For f ( x) = 4x 4 x, find and simplify completely. h Answer: 4 8x 4 h. Finding the
More informationCompound rotor position self2sen sing method of PM SM
2 5 289 EL ECTR ICMACH IN ESANDCON TROL Vol2 No5 Sep. 28, 2,,,, (., 4; 2., 255) :,,,,, : ; ; ; ; : TM34 : A : 7 449X (28) 5 498 6 Compound rotor position self2sen sing method of PM SM HOU L i2m in, 2,
More informationUpper Bound Kullback-Leibler Divergence for Hidden Markov Models with Application as Discrimination Measure for Speech Recognition
Upper Bound Kullback-Leibler Divergence for Hidden Markov Models with Application as Discrimination Measure for Speech Recognition Jorge Silva and Shrikanth Narayanan Speech Analysis and Interpretation
More informationAn Evolutionary Programming Based Algorithm for HMM training
An Evolutionary Programming Based Algorithm for HMM training Ewa Figielska,Wlodzimierz Kasprzak Institute of Control and Computation Engineering, Warsaw University of Technology ul. Nowowiejska 15/19,
More informationDiscriminative training of GMM-HMM acoustic model by RPCL type Bayesian Ying-Yang harmony learning
Discriminative training of GMM-HMM acoustic model by RPCL type Bayesian Ying-Yang harmony learning Zaihu Pang 1, Xihong Wu 1, and Lei Xu 1,2 1 Speech and Hearing Research Center, Key Laboratory of Machine
More informationACS Introduction to NLP Lecture 2: Part of Speech (POS) Tagging
ACS Introduction to NLP Lecture 2: Part of Speech (POS) Tagging Stephen Clark Natural Language and Information Processing (NLIP) Group sc609@cam.ac.uk The POS Tagging Problem 2 England NNP s POS fencers
More informationI zm ir I nstiute of Technology CS Lecture Notes are based on the CS 101 notes at the University of I llinois at Urbana-Cham paign
I zm ir I nstiute of Technology CS - 1 0 2 Lecture 1 Lecture Notes are based on the CS 101 notes at the University of I llinois at Urbana-Cham paign I zm ir I nstiute of Technology W hat w ill I learn
More informationGMM-Based Speech Transformation Systems under Data Reduction
GMM-Based Speech Transformation Systems under Data Reduction Larbi Mesbahi, Vincent Barreaud, Olivier Boeffard IRISA / University of Rennes 1 - ENSSAT 6 rue de Kerampont, B.P. 80518, F-22305 Lannion Cedex
More informationProc. of NCC 2010, Chennai, India
Proc. of NCC 2010, Chennai, India Trajectory and surface modeling of LSF for low rate speech coding M. Deepak and Preeti Rao Department of Electrical Engineering Indian Institute of Technology, Bombay
More informationA L A BA M A L A W R E V IE W
A L A BA M A L A W R E V IE W Volume 52 Fall 2000 Number 1 B E F O R E D I S A B I L I T Y C I V I L R I G HT S : C I V I L W A R P E N S I O N S A N D TH E P O L I T I C S O F D I S A B I L I T Y I N
More informationM odeling and simulation of power assembly for single2axle para llel hybr id electr ic veh icles
13 1 2009 11 EL ECTR IC MACH IN ES AND CON TROL Vol113 Supp l. 1 Nov. 2009,, (, 150040) :, Insight,,, ADV ISOR ( advanced vehicle simulator),,, :, : ; ; ; ADV ISOR : U 469. 72 : A : 1007-449X (2009) 1-0036-
More informationUsually the estimation of the partition function is intractable and it becomes exponentially hard when the complexity of the model increases. However,
Odyssey 2012 The Speaker and Language Recognition Workshop 25-28 June 2012, Singapore First attempt of Boltzmann Machines for Speaker Verification Mohammed Senoussaoui 1,2, Najim Dehak 3, Patrick Kenny
More informationMonaural speech separation using source-adapted models
Monaural speech separation using source-adapted models Ron Weiss, Dan Ellis {ronw,dpwe}@ee.columbia.edu LabROSA Department of Electrical Enginering Columbia University 007 IEEE Workshop on Applications
More informationOn the Influence of the Delta Coefficients in a HMM-based Speech Recognition System
On the Influence of the Delta Coefficients in a HMM-based Speech Recognition System Fabrice Lefèvre, Claude Montacié and Marie-José Caraty Laboratoire d'informatique de Paris VI 4, place Jussieu 755 PARIS
More information[ 4 ], [ 13 ], [ 3 ] [ 5 ] [ 7 ] China Academic Journal Electronic Publishing House. All rights reserved.
9 JOURAL OF V IBRATIO AD SHOCK Vol. 9 o. 010,, (,, 3007 :,,,,,, : ; ; : O3; TB535: A,,,,,,,,,,, [ 1-6 ], [ 9, 10 ],, [ 11, 1 ], 1,,[ 1 ], [ ] [ 3 ] [ ],, [ 4 ], [ 5 ] [ 4 ], [ 6 ], [ 7 ],, [ 8 ],[ 4 ]
More information4A (Automatized A t2 mospheric Absorp tion A tlas) , 4A, NOVELTIS Laboratoire de. MetOp 4A /OP 3 IASI, AR ID LAND GEOGRAPHY Jan.
33 1 2010 1 33 No. 1 Vol. AR ID LAND GEOGRAPHY Jan. 2010 1, 2, 1, 3 (1, 100190; 2, 100049; 3, 100101) : (RBF),,, 9 m 10 m 12 m, 4A 100,, : ; ; : TP732. 2 : A : 1000-6060 (2010) 01-0099 - 07 (99 105),,,,
More informationA Direct Criterion Minimization based fmllr via Gradient Descend
A Direct Criterion Minimization based fmllr via Gradient Descend Jan Vaněk and Zbyněk Zajíc University of West Bohemia in Pilsen, Univerzitní 22, 306 14 Pilsen Faculty of Applied Sciences, Department of
More informationQUATERNARY SC IENCES
28 4 20087 QUATERNARYSC IENCES Vol. 28, No. 4 July, 2008 1001-7410 (2008) 04-535 - 09 3 (, 100101;,, 730000),, 100,,,,,, ( < 200m) (200500m) (5001000m ) (10002500m)( > 2500m)7,, 1 500000 1 1000000 (DTM
More informationINFRARED TARGET EXTRACTION ALGORITHM BY USING PARTICLE SWARM OPTIM IZATION PARTICLE FILTER
29 1 20102 J. Infrared M illim. W aves Vol. 29, o. 1 February, 2010 : 1001-9014 (2010) 01-0063 - 06 1, 2 (1., 200240; 2., 200233) : ( PSOPF),.,,,,.,.,. : ;; ; : TP391. 4: A IFRARD TART XTRACTIO ALORITHM
More informationFACTORIAL HMMS FOR ACOUSTIC MODELING. Beth Logan and Pedro Moreno
ACTORIAL HMMS OR ACOUSTIC MODELING Beth Logan and Pedro Moreno Cambridge Research Laboratories Digital Equipment Corporation One Kendall Square, Building 700, 2nd loor Cambridge, Massachusetts 02139 United
More informationFEATURE SELECTION USING FISHER S RATIO TECHNIQUE FOR AUTOMATIC SPEECH RECOGNITION
FEATURE SELECTION USING FISHER S RATIO TECHNIQUE FOR AUTOMATIC SPEECH RECOGNITION Sarika Hegde 1, K. K. Achary 2 and Surendra Shetty 3 1 Department of Computer Applications, NMAM.I.T., Nitte, Karkala Taluk,
More informationHierarchical Multi-Stream Posterior Based Speech Recognition System
Hierarchical Multi-Stream Posterior Based Speech Recognition System Hamed Ketabdar 1,2, Hervé Bourlard 1,2 and Samy Bengio 1 1 IDIAP Research Institute, Martigny, Switzerland 2 Ecole Polytechnique Fédérale
More information, kw, kw 3176%,, JOURNAL OF NATURAL RESOURCES Aug., , : F42612 : A : (2009)
24 8 Vol124 No18 2009 8 JOURNAL OF NATURAL RESOURCES Aug., 2009 1, 2, 13 (11, 100101; 21, 100049) : 1997 2006 10,,,, 2006 GDP 527111 1 195120 5 745182, 3,,,,, 4 : : ; ; ; : F42612 : A: 1000-3037 (2009)
More informationGeneralized Cyclic Transformations in Speaker-Independent Speech Recognition
Generalized Cyclic Transformations in Speaker-Independent Speech Recognition Florian Müller 1, Eugene Belilovsky, and Alfred Mertins Institute for Signal Processing, University of Lübeck Ratzeburger Allee
More informationA Comparative Study of Histogram Equalization (HEQ) for Robust Speech Recognition
Computational Linguistics and Chinese Language Processing Vol. 12, No. 2, June 2007, pp. 217-238 217 The Association for Computational Linguistics and Chinese Language Processing A Comparative Study of
More informationA NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY
A NONPARAMETRIC BAYESIAN APPROACH FOR SPOKEN TERM DETECTION BY EXAMPLE QUERY Amir Hossein Harati Nead Torbati and Joseph Picone College of Engineering, Temple University Philadelphia, Pennsylvania, USA
More informationWaveNet: A Generative Model for Raw Audio
WaveNet: A Generative Model for Raw Audio Ido Guy & Daniel Brodeski Deep Learning Seminar 2017 TAU Outline Introduction WaveNet Experiments Introduction WaveNet is a deep generative model of raw audio
More informationShankar Shivappa University of California, San Diego April 26, CSE 254 Seminar in learning algorithms
Recognition of Visual Speech Elements Using Adaptively Boosted Hidden Markov Models. Say Wei Foo, Yong Lian, Liang Dong. IEEE Transactions on Circuits and Systems for Video Technology, May 2004. Shankar
More informationDept. of Linguistics, Indiana University Fall 2009
1 / 14 Markov L645 Dept. of Linguistics, Indiana University Fall 2009 2 / 14 Markov (1) (review) Markov A Markov Model consists of: a finite set of statesω={s 1,...,s n }; an signal alphabetσ={σ 1,...,σ
More information( name, ), 1 ( a), (p lay2scrip t) ( act) ( b),
79 3 ( name, ),,, :,,,,, ;,,,, : 1 ( a), :, :,,, ( b), :, :,,, 1 ( a) ( b), :, ( ),, (p lay2scrip t),,,,,, ( act), ( scene),,, 1 ( a) ( b), 1 ( a) ( b),,, ( ) 3 ( 07BZX047) 80 2010 1,,,,,,,,, : 1.,,,,,
More informationPattern Classification
Pattern Classification Introduction Parametric classifiers Semi-parametric classifiers Dimensionality reduction Significance testing 6345 Automatic Speech Recognition Semi-Parametric Classifiers 1 Semi-Parametric
More informationCOMPILATION OF AUTOMATA FROM MORPHOLOGICAL TWO-LEVEL RULES
Kimmo Koskenniemi Re se ar ch Unit for Co mp ut at io na l Li ng ui st ic s University of Helsinki, Hallituskatu 11 SF-00100 Helsinki, Finland COMPILATION OF AUTOMATA FROM MORPHOLOGICAL TWO-LEVEL RULES
More informationHidden Markov Models. Dr. Naomi Harte
Hidden Markov Models Dr. Naomi Harte The Talk Hidden Markov Models What are they? Why are they useful? The maths part Probability calculations Training optimising parameters Viterbi unseen sequences Real
More informationDouble closed2control of active filter using repetitive algorithm
13 1 200911 EL ECTR ICMACH IN ESANDCON TROL Vol113 Supp l. 1 Nov. 2009,, (, 410076) :,,,,,,,6118% 516%, 01770198 : ; ; : TP 273 : A : 1007-449X (2009)1-0067- 05 Double closed2control of active filter using
More informationSymmetric Distortion Measure for Speaker Recognition
ISCA Archive http://www.isca-speech.org/archive SPECOM 2004: 9 th Conference Speech and Computer St. Petersburg, Russia September 20-22, 2004 Symmetric Distortion Measure for Speaker Recognition Evgeny
More informationDetection-Based Speech Recognition with Sparse Point Process Models
Detection-Based Speech Recognition with Sparse Point Process Models Aren Jansen Partha Niyogi Human Language Technology Center of Excellence Departments of Computer Science and Statistics ICASSP 2010 Dallas,
More informationOH BOY! Story. N a r r a t iv e a n d o bj e c t s th ea t e r Fo r a l l a g e s, fr o m th e a ge of 9
OH BOY! O h Boy!, was or igin a lly cr eat ed in F r en ch an d was a m a jor s u cc ess on t h e Fr en ch st a ge f or young au di enc es. It h a s b een s een by ap pr ox i ma t ely 175,000 sp ect at
More informationBLACK BOX OPTIMIZATION FOR AUTOMATIC SPEECH RECOGNITION. Shinji Watanabe and Jonathan Le Roux
2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP) BLACK BOX OPTIMIZATION FOR AUTOMATIC SPEECH RECOGNITION Shinji Watanabe and Jonathan Le Roux Mitsubishi Electric Research
More informationASPEAKER independent speech recognition system has to
930 IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, VOL. 13, NO. 5, SEPTEMBER 2005 Vocal Tract Normalization Equals Linear Transformation in Cepstral Space Michael Pitz and Hermann Ney, Member, IEEE
More informationElectron ic pole changing techn ique of multi2phase induction motor
3 3 95 EL ECTR ICMACH IN ESANDCON TROL Vol3 No3 May 9,,,,,, (., 37;., 33) :,,,,,, 939,, : ; ; ; : TM3 : A : 7-449X (9) 3-3- 5 Electron ic pole changing techn ique of multiphase induction motor YANG J iaqiang,
More informationLecture 3: ASR: HMMs, Forward, Viterbi
Original slides by Dan Jurafsky CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 3: ASR: HMMs, Forward, Viterbi Fun informative read on phonetics The
More information