DIRECTLY MODELING VOICED AND UNVOICED COMPONENTS IN SPEECH WAVEFORMS BY NEURAL NETWORKS. Keiichi Tokuda Heiga Zen
|
|
- Gerard White
- 6 years ago
- Views:
Transcription
1 DIRECTLY MODELING VOICED AND UNVOICED COMPONENTS IN SPEECH WAVEFORMS BY NEURAL NETWORKS Keiichi Tokda Heiga Zen Google Nagoya Institte of Technology, Japan ABSTRACT This paper proposes a noel acostic model based on neral networks for statistical parametric speech synthesis The neral network otpts parameters of a non-zero mean Gassian process, which defines a probability density fnction of a speech waeform gien lingistic featres The mean and coariance fnctions of the Gassian process represent deterministic (oiced) and stochastic (noiced) components of a speech waeform, as the preios approach considered the noiced component only Experimental reslts show that the proposed approach can generate speech waeforms approximating natral speech waeforms Index Terms Statistical parametric speech synthesis; neral network; waefom 1 INTRODUCTION Typical statistical parametric speech synthesis systems first extract a set of parametric representation of speech (eg, cepstra 1], line spectrm pairs 2], fndamental freqency, and aperiodicity 3]) then model relationships between the extracted acostic parameters and lingistic featres associated with the speech waeform sing an acostic model 6] (eg, hidden Marko models 4], neral networks 5]) There hae been a cople of attempts to integrate acostic featre extraction into acostic modeling, sch as the log spectral distortion-ersion of minimm generation error training 7], statistical ocoder 8], waeform-leel statistical model 9], and melcepstral analysis-integrated hidden Marko models 1] Tokda and Zen recently proposed a neral network-based approach to integrate acostic featre extraction into acostic modeling 11] Here, a neral network otpts parameters of a zero mean Gassian process, which defines a probability density fnction of a speech waeform gien lingistic featres The coariance fnction of the Gassian process is parameterized by minimm-phase cepstrm The network weights are optimized so as to maximize the log likelihood of the Gassian process gien corresponding pairs of speech waeforms (target) and lingistic featre seqences (inpt) This approach can oercome the limitations of the preios approaches, sch as two-step optimization (acostic featre extraction acostic modeling), se of spectra rather than waeforms, and se of oerlapping and shifting frames as nit Howeer, the speech signal model sed in this approach has only a stochastic (noiced) component, as hman speech has both stochastic and deterministic (oiced) components This paper extends the preios approach 11] to hae both the oiced and noiced components in its speech signal model A neral network otpts parameters of a non-zero mean Gassian process, which defines a probability density fnction of a speech waeform gien lingistic featres The deterministic (oiced) component of a speech waeform is modeled by the mean fnction of the Gassian process, which is parameterized by mixed-phase complex cepstrm Its training algorithm which can rn seqentially in a sample-by-sample or segment-by-segment manner is also deried The rest of the paper is organized as follows Section 2 defines the signal model and the waeform-leel probability density fnction Section 3 deries the training algorithm Preliminary experimental reslts are presented in Section 4 Conclding remarks are gien in the final section 2 WAVEFORM-LEVEL DEFINITION OF PROBABILITY DENSITY FUNCTION OF SPEECH 21 Signal model This paper adopts the signal model shown in Fig 1 Ths, the probability density fnction of a discrete-time speech signal x = x(), x(1),, x(t 1)] corresponding to an tterance or whole speech database is assmed to be a non-zero mean stationary Gassian process, which can be written as p(x λ) = N (x; H c p, Σ c ), (1) λ is the model parameter set, p = p(), p(1),, p(t 1)] is a plse seqence haing ale 1 at pitch mark positions otherwise : p =,,, 1,,,, 1,,,, 1,,, ], (2) and H c is a deterministic component matrix 1 gien as H c = h() h( 1) h( T +1) h(1) h() h( 1) h(t 1) h(1) h(), (3) whose elements are gien by the implse response of the mixed phase system fnction H (z): h(n) = 1 2π π π H (e jω ) e jωn dω (4) In this paper, we assme that the system fnction H (z) generating oiced component (t) is parameterized by complex cepstrm c as H (e jω ) = exp c (m) e jωm, (5) c = c ( M),, c (),, c (M)] The system H (z) shold not model delay since it cases an nder-determined problem 1 Althogh we assme that x and p are infinite seqences, they are described as finite seqences for notation simplicity When they are finite seqences, H shold be a circlant matrix rather than a Toeplitz matrix
2 Plse train p(t) e(t) N(, 1) White noise Mixed phase H (z) H (z) Minimm phase Voiced component (t) (t) Unoiced component Fig 1 Speech signal model Speech x(t) when we estimate H (z) and plse positions of p(t) simltaneosly The complex cepstral representation can aoid the problem becase it intrinsically does not represent delay of the system The coariance matrix Σ c is gien as Σ c = r() r(1) r(t 1) r(1) r() r(1) r(t 1) r(1) r() π (6) r(k) = 1 π H (e jω ) 2 e jωk dω, (7) 2π and H (e jω ) 2 is the power spectrm of the noiced component (t) This paper assmes that the corresponding minimm-phase system fnction H (z) is parameterized by minimm cepstrm c as H (e jω ) = exp c (m) e jωm, (8) m= c = c (), c (1), c (2),, c (M)] The inerse of the coariance matrix Σ c can be written as the same form in 11] as A c = Σ 1 c = A c A c, (9) a() a(1) a() a(t 1) a(1) a() (1) and a(n) is the implse response of the inerse system gien as a(n) = 1 π H 1 (e jω ) e jωn dω (11) 2π π From the aboe definition, the logarithm of the probability density fnction can be written as log p(x λ) = T 2 log 2π log A c A c 1 2 (x H c p) A c A c (x H c p) (12) the model parameter set is gien as λ = {c, c, p} We assme that plse positions in p are extracted by sing an external pitch marker and therefore p is fixed in the following discssion 22 Non-stationarity modeling Eqation (12) can be rewritten as log p(x λ) = T 2 log 2π log A c A c G = A c H c is gien as G = 1 2 (A c x Gp) (A c x Gp), (13) g() g( 1) g( T +1) g(1) g() g( 1) g(t 1) g(1) g() (14) and g(n) is the implse response of the system fnction G(z) = H (z)h 1 (z): G(e jω ) = exp {c (m) c (m)} e jωm, (c (m) =, m < ), (15) that is, g(n) = 1 π G(e jω ) e jωn dω (16) 2π π To model the non-stationary natre of the speech signal, x is assmed to be segment-by-segment piecewise-stationary: A c in Eq (1) and G in Eq (14) are redefined as a (i 1) () a (i) (1) a (i) () a (i) (1) a (i) () A c = L a (i) (1) a (i) () a (i+1) (1) a (i+1) () (17) and g (i 1) () g (i 1) ( 1) g (i) (1) g (i) () g (i) ( 1) g (i) (1) g (i) () g (i) ( 1) G = g (i) (1) g (i) () g (i) ( 1) g (i+1) (1) g (i+1) () (18) i is the segment index, L is the size of each segment, a (i) (n) is the implse response of the inerse system of H (i) (z) represented by cepstrm = (), (1),, (M)], (19) as in Eq (8) for the i-th segment, and g (i) (n) is the implse response of the system G (i) (z) represented by cepstrm and = ( M),, (),, (M)] (2) L,
3 as in Eq (15) for the i-th segment Here the model parameter set of the probability density fnction p(x λ) can be written { } as { λ = c = {c, c }, } c = c (), c (1),, c (I 1), c = c (), c (1),, c (I 1), and I is the nmber of segments in x corresponding to an tterance or whole speech database, and ths T = L I Note that p is omitted from λ since it is assmed to be fixed in this paper 3 TRAINING ALGORITHM 31 Deriatie of the log likelihood With some elaboration, 2 the partial deriatie of Eq (13) wrt can be deried as, (),, (M)] = log p(x c)/ = ( M),, Plse train p(t) x(t) Speech Cepstrm Forward propagation f(t) G(z) H 1 (z) s(t) e(t) z z 1 d (i) Deriaties c (i) Back propagation L 1 (m) = e (i) (Li + k) f (i) (Li + k m), (21) k= and f (i) (t) is the otpt of G (i) (z), whose inpt is p(t), ie f (i) (t) = The signal e (i) (t) is gien as g (i) (n) p(t n), (22) n= e (i) (t) = s (i) (t) f (i) (t), (23) s (i) (t) is the otpt of the inerse of H (i) (z), whose inpt is x(t), ie s (i) (t) = a (i) (n) x(t n) (24) n= The partial deriatie of Eq (13) wrt can also be deried as, = log p(x c)/ = (), d (i) (1),, (M)] L 1 (m) = e (i) (Li + k) e (i) (Li + k m) δ(m)l, (25) k= and δ(m) is the nit implse fnction 32 Seqential algorithm For calclating the implse response a (i) (n) or g (i) (n) sing a recrsie formla 13], O(MN) operations are reqired at each segment i, een if it is trncated with a sfficiently large nmber of N Frthermore, for calclating s (i) (t) in Eq (24) or f (i) (t) in Eq (22), O(N(M + L)) operations are reqired for each segment i First, to redce the comptational brden in calclating s (i) (t) in Eq (24), the following two approximations are applied; 1 By assming s (i) (t) e (i 1) (t), t = Li M,, Li 1 (26) 2 Similar deriation can be fond in Eqs (14) and (16) in 12] Lingistic featres l (i) Lingistic featre extraction Text analysis TEXT Fig 2 Block diagram of the proposed waeform-based framework (M = 2, L = 1, ie i = t) The element z can be realized in the training phase becase it is in an offline processing mode For notation simplicity, here acostic model is illstrated as a feed-forward neral network rather than long short-term memory recrrent neral network (LSTM-RNN) s (i) (t) can be calclated as the otpt of the inerse system whose parameters change segment by segment as follows: s (i) (t) = s(t) = a t (n) x(t n), (27) n= a t (n) = a (i) (n), t = Li,, Li + L 1 (28) 2 As an approximation, inerse filtering in Eq (27) can be efficiently calclated by the log magnitde approximation (LMA) filter 3 12] whose coefficients are gien by c t =, t = Li,, Li + L 1 (29) The same approximation can be applied to calclation of s (i) (t) in Eq (24), except that the system fnction G(z) corresponding to Eq (22) is decomposed into minimm- and maximm-phase components as G(z) = G + (z)g (z), G + (e jω ) = exp G (e jω ) = exp {c (m) c (m)} e jωm, (3) m= 1 c (m) e jωm (31) 3 The LMA filter is a special type of digital filter which can approximate the system fnction of Eq (8)
4 Each of them are implemented in the LMA filter strctre Howeer, G (z) is an anticasal system while G + (z) is a casal system, and ths G (z) shold rn in a time-reersal manner With the aboe approximations, a simple strctre for training the neral network acostic model, which represents a mapping from lingistic featres to speech signals, can be deried It can rn in a seqential manner as shown in Fig 2 This neral network otpts cepstrm c gien } lingistic featre ector seqence 4 l = {l (), l (1),, l (I 1), which in trn gies a probability density fnction of speech signals x corresponding to an tterance or whole speech database conditioned on l as p(x l, M) = N ( x; H c(l) p, Σ c(l) ), (32) M denotes a set of network weights, c(l) is gien by actiations at the otpt layer of the network gien inpt lingistic featres, and the RHS is gien by Eq (12) By back-propagating the deriatie of the log likelihood fnction throgh the network, the network weights can be pdated to maximize the log likelihood It shold be noted that the proposed approach optimizes the acostic featre extraction part and acostic modeling part simltaneosly As a reslt, better modeling accracy can be expected 33 Synthesis strctre The speech waeform can be generated by sampling x from the probability density fnction p(x l, M) It can be done by sing the signal model strctre shown in Fig 1 By decomposing H (z) into minimm- and maximm-phase components as H (z) = H + (z)h (z), the system fnctions H + (z), H (z) and H (z) can be implemented by sing the LMA filter strctre, H + (z) rns in a time-reersal manner It shold be noted that we need an external F predictor for generating the plse train p(t) 4 EXPERIMENTS Speech data in US English from a female professional speaker was sed for the experiments The training and deelopment data sets consisted of 35,497 and 1 tterances, respectiely A speakerdependent nidirectional LSTM-RNN 14] was trained The lingistic featres deried from speech data, transcriptions, and alignments, inclded 56 lingistic contexts, 1 nmerical featres for coarse-coded position of the crrent frame within the crrent phoneme, and one nmerical featre for dration of the crrent phoneme The speech data was downsampled from 48 khz to 16 khz Then 39 mel-cepstrm, 5 band aperiodicity, 1 log F, and 1 oiced/noiced binary flag were extracted at each frame Glottal closre instants (GCI) locations were also extracted sing REAPER 15] Both the inpt and otpt featres were normalized to hae zero-mean nit-ariance The architectre of the LSTM- RNN had 1 feed-forward hidden layer with 256 nits and rectified linear (ReLU) actiation 16] followed by 3 forward-directed LSTMP 17] hidden layers with 512 memory blocks and 256 projection nits, 1 feed-forward hidden layer with 256 nits and ReLU actiation, and an otpt layer with 47 nits 5 and linear actiation 4 The definition of the lingistic featre ector sed in this paper can be fond in 5] and 14] 5 It inclded 39 mel-cepstrm, 5 band aperiodicity, 1 log F, and 1 oiced/noiced flag 2591 Natral Predicted GCI Fig 3 A segment of the generated speech waeform for a sentence Two elect only two not inclded in the training data To redce the training time and the impact of haing many silences, 8% of silence regions were remoed After setting the network weights randomly, they were first pdated to minimize the mean sqared error between the extracted and predicted acostic featres Then the last layer was replaced by a randomly initialized otpt layer with 119 nits 6 and linear actiation After pdating the weights associated with the otpt layer, all weights in the network were pdated by the proposed seqential algorithm so as to maximize the log likelihood of Eq (12) They were first pdated by non-distribted Adam 18] then distribted AdaGrad 19] The mini-batch back propagation throgh time (BPTT) 2] algorithm was sed 17] in both cases Dropot 21] stochastic reglarization (5%) was sed throghot to preent oerfitting Fig 3 shows a synthesized speech waeform generated from the trained neral network It can be seen from the figre that a speech waeform approximating the natral speech waeform is generated 5 CONCLUSIONS An acostic modeling approach based on neral networks to statistical parametric speech synthesis was proposed The network otpts parameters of a non-zero mean Gassian process, which defines a probability density fnction of a speech waeform gien lingistic featres The stochastic (noiced) component of a speech waeform is modeled by the coariance fnction of the Gassian process, parameterized by minimm-phase cepstrm, as the deterministic (oiced) component is modeled by the mean fnction of the Gassian process, parameterized by mixed-phase complex cepstrm Its training algorithm which can rn seqentially on speech waeform in a sample-by-sample or segment-by-segment manner was deried Ftre work incldes simltaneosly estimating pitch marks inclding fractional pitch search in the model training One of the limitations of this approach is that both acostic modeling (l c) and waeform modeling (c x) error goes to the noiced component Introdction of a coariance strctre for the oiced component can alleiate this problem Performance ealation in practical conditions as a text-to-speech synthesis application is also inclded in ftre work 6 It inclded 39 minimm-phase noiced cepstrm, 39 minimmphase oiced cepstrm, and 1 39 maximm-phase oiced cepstrm
5 6 REFERENCES 1] S Imai and C Frichi, Unbiased estimation of log spectrm, in Proc EURASIP, 1988, pp pp ] F Itakra, Line spectrm representation of linear predictor coefficients of speech signals, The Jornal of the Acost Society of America, ol 57, no S1, pp S35 S35, ] H Kawahara, J Estill, and O Fjimra, Aperiodicity extraction and control sing mixed mode excitation and grop delay maniplation for a high qality speech analysis, modification and synthesis system straight, in Proc MAVEBA, 21, pp ] T Yoshimra, K Tokda, T Masko, T Kobayashi, and T Kitamra, Simltaneos modeling of spectrm, pitch and dration in HMM-based speech synthesis, in Proc Erospeech, 1999, pp ] H Zen, A Senior, and M Schster, Statistical parametric speech synthesis sing deep neral networks, in Proc ICASSP, 213, pp ] H Zen, K Tokda, and A Black, Statistical parametric speech synthesis, Speech Commn, ol 51, no 11, pp , 29 7] Y-J W and K Tokda, Minimm generation error training with direct log spectral distortion on LSPs for HMM-based speech synthesis, in Proc Interspeech, 28, pp ] T Toda and K Tokda, Statistical approach to ocal tract transfer fnction estimation based on factor analyzed trajectory hmm, in Proc ICASSP, 28, pp ] R Maia, H Zen, and M Gales, Statistical parametric speech synthesis with joint estimation of acostic and excitation model parameters, in Proc ISCA SSW7, 21, pp ] K Nakamra, K Hashimoto, Y Nankak, and K Tokda, Integration of spectral featre extraction and modeling for HMM-based speech synthesis, IEICE Trans Inf Syst, ol 97, no 6, pp , ] K Tokda and H Zen, Directly modeling speech waeforms by neral networks for statistical parametric speech synthesis, in Proc ICASSP, 215, pp ] K Tokda, T Kobayashi, and S Imai, Adaptie cepstral analysis of speech, IEEE Trans Speech Adio Process, ol 3, no 6, pp , ] AV Oppenhem and RW Schafer, Descrete-Time Signal Processing, Prentice Hall, ] H Zen and H Sak, Unidirectional long short-term memory recrrent neral network with recrrent otpt layer for lowlatency speech synthesis, in Proc ICASSP, 215, pp ] REAPER: Robst Epoch And Pitch EstimatoR, githbcom/google/reaper, ] M Zeiler, M Ranzato, R Monga, M Mao, K Yang, Q-V Le, P Ngyen, A Senior, V Vanhocke, J Dean, and G Hinton, On rectified linear nits for speech processing, in Proc ICASSP, 213, pp ] H Sak, A Senior, and F Beafays, Long short-term memory recrrent neral network architectres for large scale acostic modeling, in Proc Interspeech, ] D Kingma and J Ba, Adam: A method for stochastic optimization, arxi preprint arxi: , ] J Dchi, E Hazan, and Y Singer, Adaptie sbgradient methods for online learning and stochastic optimization, The Jornal of Machine Learning Research, pp , 211 2] R Williams and J Peng, An efficient gradient-based algorithm for on-line training of recrrent network trajectories, Neral Compt, ol 2, no 4, pp 49 51, ] G Hinton, N Sriastaa, A Krizhesky, I Stskeer, and R Salakhtdino, Improing neral networks by preenting co-adaptation of featre detectors, arxi preprent arxi:12758, 212
Path-SGD: Path-Normalized Optimization in Deep Neural Networks
Path-SGD: Path-Normalized Optimization in Deep Neral Networks Behnam Neyshabr, Rslan Salakhtdinov and Nathan Srebro Toyota Technological Institte at Chicago Department of Compter Science, University of
More informationWe automate the bivariate change-of-variables technique for bivariate continuous random variables with
INFORMS Jornal on Compting Vol. 4, No., Winter 0, pp. 9 ISSN 09-9856 (print) ISSN 56-558 (online) http://dx.doi.org/0.87/ijoc.046 0 INFORMS Atomating Biariate Transformations Jeff X. Yang, John H. Drew,
More informationVisualisations of Gussian and Mean Curvatures by Using Mathematica and webmathematica
Visalisations of Gssian and Mean Cratres by Using Mathematica and webmathematica Vladimir Benić, B. sc., (benic@grad.hr), Sonja Gorjanc, Ph. D., (sgorjanc@grad.hr) Faclty of Ciil Engineering, Kačićea 6,
More informationChapter 4 Supervised learning:
Chapter 4 Spervised learning: Mltilayer Networks II Madaline Other Feedforward Networks Mltiple adalines of a sort as hidden nodes Weight change follows minimm distrbance principle Adaptive mlti-layer
More informationSimulation investigation of the Z-source NPC inverter
octoral school of energy- and geo-technology Janary 5 20, 2007. Kressaare, Estonia Simlation investigation of the Z-sorce NPC inverter Ryszard Strzelecki, Natalia Strzelecka Gdynia Maritime University,
More informationDigital Image Processing. Lecture 8 (Enhancement in the Frequency domain) Bu-Ali Sina University Computer Engineering Dep.
Digital Image Processing Lectre 8 Enhancement in the Freqenc domain B-Ali Sina Uniersit Compter Engineering Dep. Fall 009 Image Enhancement In The Freqenc Domain Otline Jean Baptiste Joseph Forier The
More informationJoint Transfer of Energy and Information in a Two-hop Relay Channel
Joint Transfer of Energy and Information in a Two-hop Relay Channel Ali H. Abdollahi Bafghi, Mahtab Mirmohseni, and Mohammad Reza Aref Information Systems and Secrity Lab (ISSL Department of Electrical
More informationThe Real Stabilizability Radius of the Multi-Link Inverted Pendulum
Proceedings of the 26 American Control Conference Minneapolis, Minnesota, USA, Jne 14-16, 26 WeC123 The Real Stabilizability Radis of the Mlti-Link Inerted Pendlm Simon Lam and Edward J Daison Abstract
More informationON TRANSIENT DYNAMICS, OFF-EQUILIBRIUM BEHAVIOUR AND IDENTIFICATION IN BLENDED MULTIPLE MODEL STRUCTURES
ON TRANSIENT DYNAMICS, OFF-EQUILIBRIUM BEHAVIOUR AND IDENTIFICATION IN BLENDED MULTIPLE MODEL STRUCTURES Roderick Mrray-Smith Dept. of Compting Science, Glasgow Uniersity, Glasgow, Scotland. rod@dcs.gla.ac.k
More informationPrandl established a universal velocity profile for flow parallel to the bed given by
EM 0--00 (Part VI) (g) The nderlayers shold be at least three thicknesses of the W 50 stone, bt never less than 0.3 m (Ahrens 98b). The thickness can be calclated sing Eqation VI-5-9 with a coefficient
More informationVectors in Rn un. This definition of norm is an extension of the Pythagorean Theorem. Consider the vector u = (5, 8) in R 2
MATH 307 Vectors in Rn Dr. Neal, WKU Matrices of dimension 1 n can be thoght of as coordinates, or ectors, in n- dimensional space R n. We can perform special calclations on these ectors. In particlar,
More informationElements of Coordinate System Transformations
B Elements of Coordinate System Transformations Coordinate system transformation is a powerfl tool for solving many geometrical and kinematic problems that pertain to the design of gear ctting tools and
More informationExercise 4. An optional time which is not a stopping time
M5MF6, EXERCICE SET 1 We shall here consider a gien filtered probability space Ω, F, P, spporting a standard rownian motion W t t, with natral filtration F t t. Exercise 1 Proe Proposition 1.1.3, Theorem
More informationReduction of over-determined systems of differential equations
Redction of oer-determined systems of differential eqations Maim Zaytse 1) 1, ) and Vyachesla Akkerman 1) Nclear Safety Institte, Rssian Academy of Sciences, Moscow, 115191 Rssia ) Department of Mechanical
More informationVocoding approaches for statistical parametric speech synthesis
Vocoding approaches for statistical parametric speech synthesis Ranniery Maia Toshiba Research Europe Limited Cambridge Research Laboratory Speech Synthesis Seminar Series CUED, University of Cambridge,
More informationMinimizing Intra-Edge Crossings in Wiring Diagrams and Public Transportation Maps
Minimizing Intra-Edge Crossings in Wiring Diagrams and Pblic Transportation Maps Marc Benkert 1, Martin Nöllenbrg 1, Takeaki Uno 2, and Alexander Wolff 1 1 Department of Compter Science, Karlsrhe Uniersity,
More informationStudy on the Mathematic Model of Product Modular System Orienting the Modular Design
Natre and Science, 2(, 2004, Zhong, et al, Stdy on the Mathematic Model Stdy on the Mathematic Model of Prodct Modlar Orienting the Modlar Design Shisheng Zhong 1, Jiang Li 1, Jin Li 2, Lin Lin 1 (1. College
More informationDevelopment of Second Order Plus Time Delay (SOPTD) Model from Orthonormal Basis Filter (OBF) Model
Development of Second Order Pls Time Delay (SOPTD) Model from Orthonormal Basis Filter (OBF) Model Lemma D. Tfa*, M. Ramasamy*, Sachin C. Patwardhan **, M. Shhaimi* *Chemical Engineering Department, Universiti
More informationStudy on the impulsive pressure of tank oscillating by force towards multiple degrees of freedom
EPJ Web of Conferences 80, 0034 (08) EFM 07 Stdy on the implsive pressre of tank oscillating by force towards mltiple degrees of freedom Shigeyki Hibi,* The ational Defense Academy, Department of Mechanical
More informationData-Efficient Control Policy Search using Residual Dynamics Learning
Data-Efficient Control Policy Search sing Residal Dynamics Learning Matteo Saveriano 1, Ychao Yin 1, Pietro Falco 1 and Donghei Lee 1,2 Abstract In this work, we propose a model-based and data efficient
More informationFOUNTAIN codes [3], [4] provide an efficient solution
Inactivation Decoding of LT and Raptor Codes: Analysis and Code Design Francisco Lázaro, Stdent Member, IEEE, Gianligi Liva, Senior Member, IEEE, Gerhard Bach, Fellow, IEEE arxiv:176.5814v1 [cs.it 19 Jn
More informationON THE PERFORMANCE OF LOW
Monografías Matemáticas García de Galdeano, 77 86 (6) ON THE PERFORMANCE OF LOW STORAGE ADDITIVE RUNGE-KUTTA METHODS Inmaclada Higeras and Teo Roldán Abstract. Gien a differential system that inoles terms
More informationPHASE STEERING AND FOCUSING BEHAVIOR OF ULTRASOUND IN CEMENTITIOUS MATERIALS
PHAS STRING AND FOCUSING BHAVIOR OF ULTRASOUND IN CMNTITIOUS MATRIALS Shi-Chang Wooh and Lawrence Azar Department of Civil and nvironmental ngineering Massachsetts Institte of Technology Cambridge, MA
More informationRelativity II. The laws of physics are identical in all inertial frames of reference. equivalently
Relatiity II I. Henri Poincare's Relatiity Principle In the late 1800's, Henri Poincare proposed that the principle of Galilean relatiity be expanded to inclde all physical phenomena and not jst mechanics.
More informationAn extremum seeking approach to parameterised loop-shaping control design
Preprints of the 19th World Congress The International Federation of Atomatic Control An extremm seeking approach to parameterised loop-shaping control design Chih Feng Lee Sei Zhen Khong Erik Frisk Mattias
More informationGenerative Model-Based Text-to-Speech Synthesis. Heiga Zen (Google London) February
Generative Model-Based Text-to-Speech Synthesis Heiga Zen (Google London) February rd, @MIT Outline Generative TTS Generative acoustic models for parametric TTS Hidden Markov models (HMMs) Neural networks
More informationAn Excitation Model for HMM-Based Speech Synthesis Based on Residual Modeling
An Model for HMM-Based Speech Synthesis Based on Residual Modeling Ranniery Maia,, Tomoki Toda,, Heiga Zen, Yoshihiko Nankaku, Keiichi Tokuda, National Inst. of Inform. and Comm. Technology (NiCT), Japan
More informationBLIND IMAGE WATERMARKING BASED ON SAMPLE ROTATION WITH OPTIMAL DETECTOR
7th Eropean Signal Processing Conference (EUSIPCO 009 Glasgow, Scotland, Agst -8, 009 BLIND IMAGE WATERMARKING BASED ON SAMPLE ROTATION WITH OPTIMAL DETECTOR S.M.E. Sahraeian, M.A. Akhaee, and F. Marvasti
More informationLPV control of an active vibration isolation system
29 American Control Conference Hyatt Regency Rierfront, St. Lois, MO, USA Jne 1-12, 29 ThC15.1 LPV control of an actie ibration isolation system W.H.T.M. Aangenent, C.H.A. Criens, M.J.G. an de Molengraft,
More informationBridging the Gap Between Multigrid, Hierarchical, and Receding-Horizon Control
Bridging the Gap Between Mltigrid, Hierarchical, and Receding-Horizon Control Victor M. Zavala Mathematics and Compter Science Division Argonne National Laboratory Argonne, IL 60439 USA (e-mail: vzavala@mcs.anl.gov).
More informationControl of a Power Assisted Lifting Device
Proceedings of the RAAD 212 21th International Workshop on Robotics in Alpe-Adria-Danbe Region eptember 1-13, 212, Napoli, Italy Control of a Power Assisted Lifting Device Dimeas otios a, Kostompardis
More informationChapter 6 Momentum Transfer in an External Laminar Boundary Layer
6. Similarit Soltions Chapter 6 Momentm Transfer in an Eternal Laminar Bondar Laer Consider a laminar incompressible bondar laer with constant properties. Assme the flow is stead and two-dimensional aligned
More informationThe Dual of the Maximum Likelihood Method
Department of Agricltral and Resorce Economics University of California, Davis The Dal of the Maximm Likelihood Method by Qirino Paris Working Paper No. 12-002 2012 Copyright @ 2012 by Qirino Paris All
More informationDiscussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli
1 Introdction Discssion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen Department of Economics, University of Copenhagen and CREATES,
More informationTechnical Note. ODiSI-B Sensor Strain Gage Factor Uncertainty
Technical Note EN-FY160 Revision November 30, 016 ODiSI-B Sensor Strain Gage Factor Uncertainty Abstract Lna has pdated or strain sensor calibration tool to spport NIST-traceable measrements, to compte
More informationChange Detection Using Directional Statistics
Change Detection Using Directional Statistics Tsyoshi Idé Dzng T. Phan Jayant Kalagnanam IBM Research, T. J. Watson Research Center 1101 Kitchawan Rd., Yorktown Heights, NY 10598, USA tide,phand,jayant@s.ibm.com
More informationA New Approach to Direct Sequential Simulation that Accounts for the Proportional Effect: Direct Lognormal Simulation
A ew Approach to Direct eqential imlation that Acconts for the Proportional ffect: Direct ognormal imlation John Manchk, Oy eangthong and Clayton Detsch Department of Civil & nvironmental ngineering University
More informationStep-Size Bounds Analysis of the Generalized Multidelay Adaptive Filter
WCE 007 Jly - 4 007 London UK Step-Size onds Analysis of the Generalized Mltidelay Adaptive Filter Jnghsi Lee and Hs Chang Hang Abstract In this paper we analyze the bonds of the fixed common step-size
More informationDiscontinuous Fluctuation Distribution for Time-Dependent Problems
Discontinos Flctation Distribtion for Time-Dependent Problems Matthew Hbbard School of Compting, University of Leeds, Leeds, LS2 9JT, UK meh@comp.leeds.ac.k Introdction For some years now, the flctation
More informationAn Autoregressive Recurrent Mixture Density Network for Parametric Speech Synthesis
ICASSP 07 New Orleans, USA An Autoregressive Recurrent Mixture Density Network for Parametric Speech Synthesis Xin WANG, Shinji TAKAKI, Junichi YAMAGISHI National Institute of Informatics, Japan 07-03-07
More informationNearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis. Thomas Ewender
Nearly Perfect Detection of Continuous F 0 Contour and Frame Classification for TTS Synthesis Thomas Ewender Outline Motivation Detection algorithm of continuous F 0 contour Frame classification algorithm
More informationChange Detection Using Directional Statistics
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI-16) Change Detection Using Directional Statistics Tsyoshi Idé, Dzng T. Phan, Jayant Kalagnanam IBM Research,
More informationPhase-merging enhanced harmonic generation free-electron laser
OPEN ACCESS Phase-merging enhanced harmonic generation free-electron laser To cite this article: Chao Feng et al 014 New J. Phys. 16 04301 View the article online for pdates and enhancements. Related content
More informationCreating a Sliding Mode in a Motion Control System by Adopting a Dynamic Defuzzification Strategy in an Adaptive Neuro Fuzzy Inference System
Creating a Sliding Mode in a Motion Control System by Adopting a Dynamic Defzzification Strategy in an Adaptive Nero Fzzy Inference System M. Onder Efe Bogazici University, Electrical and Electronic Engineering
More informationStability of Model Predictive Control using Markov Chain Monte Carlo Optimisation
Stability of Model Predictive Control sing Markov Chain Monte Carlo Optimisation Elilini Siva, Pal Golart, Jan Maciejowski and Nikolas Kantas Abstract We apply stochastic Lyapnov theory to perform stability
More informationReflections on a mismatched transmission line Reflections.doc (4/1/00) Introduction The transmission line equations are given by
Reflections on a mismatched transmission line Reflections.doc (4/1/00) Introdction The transmission line eqations are given by, I z, t V z t l z t I z, t V z, t c z t (1) (2) Where, c is the per-nit-length
More informationA Model-Free Adaptive Control of Pulsed GTAW
A Model-Free Adaptive Control of Plsed GTAW F.L. Lv 1, S.B. Chen 1, and S.W. Dai 1 Institte of Welding Technology, Shanghai Jiao Tong University, Shanghai 00030, P.R. China Department of Atomatic Control,
More informationAsymptotic Gauss Jacobi quadrature error estimation for Schwarz Christoffel integrals
Jornal of Approximation Theory 146 2007) 157 173 www.elseier.com/locate/jat Asymptotic Gass Jacobi qadratre error estimation for Schwarz Christoffel integrals Daid M. Hogh EC-Maths, Coentry Uniersity,
More informationLinear and Nonlinear Model Predictive Control of Quadruple Tank Process
Linear and Nonlinear Model Predictive Control of Qadrple Tank Process P.Srinivasarao Research scholar Dr.M.G.R.University Chennai, India P.Sbbaiah, PhD. Prof of Dhanalaxmi college of Engineering Thambaram
More informationModel (In-)Validation from a H and µ perspective
Model (In-)Validation from a H and µ perspectie Wolfgang Reinelt Department of Electrical Engineering Linköping Uniersity, S-581 83 Linköping, Seden WWW: http://.control.isy.li.se/~olle/ Email: olle@isy.li.se
More informationEstimations of the Influence of the Non-Linearity of the Aerodynamic Coefficients on the Skewness of the Loading. Vincent Denoël *, 1), Hervé Degée 1)
Estimations of the Inflence of the Non-Linearity of the Aerodynamic oefficients on the Skewness of the Loading Vincent enoël *, 1), Heré egée 1) 1) epartment of Material mechanics and Strctres, Uniersity
More informationTheoretical and Experimental Implementation of DC Motor Nonlinear Controllers
Theoretical and Experimental Implementation of DC Motor Nonlinear Controllers D.R. Espinoza-Trejo and D.U. Campos-Delgado Facltad de Ingeniería, CIEP, UASLP, espinoza trejo dr@aslp.mx Facltad de Ciencias,
More informationDirect linearization method for nonlinear PDE s and the related kernel RBFs
Direct linearization method for nonlinear PDE s and the related kernel BFs W. Chen Department of Informatics, Uniersity of Oslo, P.O.Box 1080, Blindern, 0316 Oslo, Norway Email: wenc@ifi.io.no Abstract
More informationSareban: Evaluation of Three Common Algorithms for Structure Active Control
Engineering, Technology & Applied Science Research Vol. 7, No. 3, 2017, 1638-1646 1638 Evalation of Three Common Algorithms for Strctre Active Control Mohammad Sareban Department of Civil Engineering Shahrood
More information1 The space of linear transformations from R n to R m :
Math 540 Spring 20 Notes #4 Higher deriaties, Taylor s theorem The space of linear transformations from R n to R m We hae discssed linear transformations mapping R n to R m We can add sch linear transformations
More informationSources of Non Stationarity in the Semivariogram
Sorces of Non Stationarity in the Semivariogram Migel A. Cba and Oy Leangthong Traditional ncertainty characterization techniqes sch as Simple Kriging or Seqential Gassian Simlation rely on stationary
More informationArtificial Noise Revisited: When Eve Has more Antennas than Alice
Artificial Noise Reisited: When e Has more Antennas than Alice Shiyin Li Yi Hong and manele Viterbo CS Department Monash Uniersity Melborne VIC 3800 Astralia mail: shiyin.li yi.hong emanele.iterbo@monash.ed
More informationCONTINUOUS PLSI AND SMOOTHING TECHNIQUES FOR HYBRID MUSIC RECOMMENDATION
10th International Society for Msic Information Retrieal Conference (ISMIR 2009) CONTINUOUS PLSI AND SMOOTHING TECHNIQUES FOR HYBRID MUSIC RECOMMENDATION Kayoshi Yoshii Masataka Goto National Institte
More informationModel Discrimination of Polynomial Systems via Stochastic Inputs
Model Discrimination of Polynomial Systems via Stochastic Inpts D. Georgiev and E. Klavins Abstract Systems biologists are often faced with competing models for a given experimental system. Unfortnately,
More informationNonparametric Identification and Robust H Controller Synthesis for a Rotational/Translational Actuator
Proceedings of the 6 IEEE International Conference on Control Applications Mnich, Germany, October 4-6, 6 WeB16 Nonparametric Identification and Robst H Controller Synthesis for a Rotational/Translational
More informationFrequency Estimation, Multiple Stationary Nonsinusoidal Resonances With Trend 1
Freqency Estimation, Mltiple Stationary Nonsinsoidal Resonances With Trend 1 G. Larry Bretthorst Department of Chemistry, Washington University, St. Lois MO 6313 Abstract. In this paper, we address the
More informationLecture Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2
BIJU PATNAIK UNIVERSITY OF TECHNOLOGY, ODISHA Lectre Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2 Prepared by, Dr. Sbhend Kmar Rath, BPUT, Odisha. Tring Machine- Miscellany UNIT 2 TURING MACHINE
More informationSimplified Identification Scheme for Structures on a Flexible Base
Simplified Identification Scheme for Strctres on a Flexible Base L.M. Star California State University, Long Beach G. Mylonais University of Patras, Greece J.P. Stewart University of California, Los Angeles
More informationNumerical Model for Studying Cloud Formation Processes in the Tropics
Astralian Jornal of Basic and Applied Sciences, 5(2): 189-193, 211 ISSN 1991-8178 Nmerical Model for Stdying Clod Formation Processes in the Tropics Chantawan Noisri, Dsadee Skawat Department of Mathematics
More informationNonlinear parametric optimization using cylindrical algebraic decomposition
Proceedings of the 44th IEEE Conference on Decision and Control, and the Eropean Control Conference 2005 Seville, Spain, December 12-15, 2005 TC08.5 Nonlinear parametric optimization sing cylindrical algebraic
More informationReformulating the HMM as a trajectory model by imposing explicit relationship between static and dynamic features
Reformulating the HMM as a trajectory model by imposing explicit relationship between static and dynamic features Heiga ZEN (Byung Ha CHUN) Nagoya Inst. of Tech., Japan Overview. Research backgrounds 2.
More informationSelf-induced stochastic resonance in excitable systems
Self-indced stochastic resonance in excitable systems Cyrill B. Mrato Department of Mathematical Sciences, New Jersey Institte of Technology, Newark, NJ 7 Eric Vanden-Eijnden Corant Institte of Mathematical
More informationFLUCTUATING WIND VELOCITY CHARACTERISTICS OF THE WAKE OF A CONICAL HILL THAT CAUSE LARGE HORIZONTAL RESPONSE OF A CANTILEVER MODEL
BBAA VI International Colloqim on: Blff Bodies Aerodynamics & Applications Milano, Italy, Jly, 2-24 28 FLUCTUATING WIND VELOCITY CHARACTERISTICS OF THE WAKE OF A CONICAL HILL THAT CAUSE LARGE HORIZONTAL
More informationarxiv: v2 [stat.ap] 22 Dec 2016
Localizing the Ensemble Kalman Particle Filter Sylain Robert and Hans R. Künsch Seminar für Statistik, ETH Zürich, Switzerland December 23, 2016 arxi:1605.054762 [stat.ap] 22 Dec 2016 Abstract Ensemble
More informationDynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining
Dynamic Optimization of First-Order Systems via Static Parametric Programming: Application to Electrical Discharge Machining P. Hgenin*, B. Srinivasan*, F. Altpeter**, R. Longchamp* * Laboratoire d Atomatiqe,
More informationImage and Multidimensional Signal Processing
Image and Mltidimensional Signal Processing Professor William Hoff Dept of Electrical Engineering &Compter Science http://inside.mines.ed/~whoff/ Forier Transform Part : D discrete transforms 2 Overview
More informationACTUATION AND SIMULATION OF A MINISYSTEM WITH FLEXURE HINGES
The 4th International Conference Comptational Mechanics and Virtal Engineering COMEC 2011 20-22 OCTOBER 2011, Brasov, Romania ACTUATION AND SIMULATION OF A MINISYSTEM WITH FLEXURE HINGES D. NOVEANU 1,
More informationDesign and Analyses of some 1-D Chaotic Generators for Secure Data
SETIT 2009 5 th International Conference: Sciences of Electronic, Tenologies of Information and Telecommnications Mar 22-26, 2009 TUISIA Design and Analyses of some 1-D Chaotic Generators for Secre Data
More informationPulses on a Struck String
8.03 at ESG Spplemental Notes Plses on a Strck String These notes investigate specific eamples of transverse motion on a stretched string in cases where the string is at some time ndisplaced, bt with a
More informationMATH2715: Statistical Methods
MATH275: Statistical Methods Exercises III (based on lectres 5-6, work week 4, hand in lectre Mon 23 Oct) ALL qestions cont towards the continos assessment for this modle. Q. If X has a niform distribtion
More informationPROBABILISTIC APPROACHES TO STABILITY AND DEFORMATION PROBLEMS IN BRACED EXCAVATION
Clemson Uniersity TigerPrints All Dissertations Dissertations 12-2011 PROBABILISTIC APPROACHES TO STABILITY AND DEFORMATION PROBLEMS IN BRACED EXCAVATION Zhe Lo Clemson Uniersity, jerry8256@gmail.com Follow
More informationNotes on Linear Minimum Mean Square Error Estimators
Notes on Linear Minimum Mean Square Error Estimators Ça gatay Candan January, 0 Abstract Some connections between linear minimum mean square error estimators, maximum output SNR filters and the least square
More informationGen Hebb Learn PPaplinski Generalized Hebbian Learning and its pplication in Dimensionality Redction Illstrative Example ndrew P Paplinski Department
Faclty of Compting and Information Technology Department of Digital Systems Technical Report 9-2 Generalized Hebbian Learning and its pplication in Dimensionality Redction Illstrative Example ndrew P Paplinski
More informationPrediction of Transmission Distortion for Wireless Video Communication: Analysis
Prediction of Transmission Distortion for Wireless Video Commnication: Analysis Zhifeng Chen and Dapeng W Department of Electrical and Compter Engineering, University of Florida, Gainesville, Florida 326
More informationEXPT. 5 DETERMINATION OF pk a OF AN INDICATOR USING SPECTROPHOTOMETRY
EXPT. 5 DETERMITIO OF pk a OF IDICTOR USIG SPECTROPHOTOMETRY Strctre 5.1 Introdction Objectives 5.2 Principle 5.3 Spectrophotometric Determination of pka Vale of Indicator 5.4 Reqirements 5.5 Soltions
More informationText-to-speech synthesizer based on combination of composite wavelet and hidden Markov models
8th ISCA Speech Synthesis Workshop August 31 September 2, 2013 Barcelona, Spain Text-to-speech synthesizer based on combination of composite wavelet and hidden Markov models Nobukatsu Hojo 1, Kota Yoshizato
More informationComparing Recent Waveform Generation and Acoustic Modeling Methods for Neural-network-based Speech Synthesis
ICASSP 2018 Calgary, Canada Comparing Recent Waveform Generation and Acoustic Modeling Methods for Neural-network-based Speech Synthesis Xin WANG, Jaime Lorenzo-Trueba, Shinji TAKAKI, Lauri Juvela, Junichi
More informationChapter 3 MATHEMATICAL MODELING OF DYNAMIC SYSTEMS
Chapter 3 MATHEMATICAL MODELING OF DYNAMIC SYSTEMS 3. System Modeling Mathematical Modeling In designing control systems we mst be able to model engineered system dynamics. The model of a dynamic system
More informationPredicting Popularity of Twitter Accounts through the Discovery of Link-Propagating Early Adopters
Predicting Poplarity of Titter Acconts throgh the Discoery of Link-Propagating Early Adopters Daichi Imamori Gradate School of Informatics, Kyoto Uniersity Sakyo, Kyoto 606-850 Japan imamori@dl.soc.i.kyoto-.ac.jp
More informationIII. Demonstration of a seismometer response with amplitude and phase responses at:
GG5330, Spring semester 006 Assignment #1, Seismometry and Grond Motions De 30 Janary 006. 1. Calibration Of A Seismometer Using Java: A really nifty se of Java is now available for demonstrating the seismic
More informationApproximate Solution for the System of Non-linear Volterra Integral Equations of the Second Kind by using Block-by-block Method
Astralian Jornal of Basic and Applied Sciences, (1): 114-14, 008 ISSN 1991-8178 Approximate Soltion for the System of Non-linear Volterra Integral Eqations of the Second Kind by sing Block-by-block Method
More informationLinear Strain Triangle and other types of 2D elements. By S. Ziaei Rad
Linear Strain Triangle and other tpes o D elements B S. Ziaei Rad Linear Strain Triangle (LST or T6 This element is also called qadratic trianglar element. Qadratic Trianglar Element Linear Strain Triangle
More informationApplication of Compressed Sensing to DRM Channel Estimation
Application of Compressed Sensing to DRM Channel Estimation Chenhao Qi and Lenan W School of Information Science and Engineering Sotheast University, Nanjing 10096, China Email: {qch,wln}@se.ed.cn Abstract
More informationA Characterization of the Domain of Beta-Divergence and Its Connection to Bregman Variational Model
entropy Article A Characterization of the Domain of Beta-Divergence and Its Connection to Bregman Variational Model Hyenkyn Woo School of Liberal Arts, Korea University of Technology and Edcation, Cheonan
More informationMAT389 Fall 2016, Problem Set 6
MAT389 Fall 016, Problem Set 6 Trigonometric and hperbolic fnctions 6.1 Show that e iz = cos z + i sin z for eer comple nmber z. Hint: start from the right-hand side and work or wa towards the left-hand
More informationDiscussion Papers Department of Economics University of Copenhagen
Discssion Papers Department of Economics University of Copenhagen No. 10-06 Discssion of The Forward Search: Theory and Data Analysis, by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen,
More informationOptimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance
Optimal Control of a Heterogeneos Two Server System with Consideration for Power and Performance by Jiazheng Li A thesis presented to the University of Waterloo in flfilment of the thesis reqirement for
More informationApplying Fuzzy Set Approach into Achieving Quality Improvement for Qualitative Quality Response
Proceedings of the 007 WSES International Conference on Compter Engineering and pplications, Gold Coast, stralia, Janary 17-19, 007 5 pplying Fzzy Set pproach into chieving Qality Improvement for Qalitative
More informationEE2 Mathematics : Functions of Multiple Variables
EE2 Mathematics : Fnctions of Mltiple Variables http://www2.imperial.ac.k/ nsjones These notes are not identical word-for-word with m lectres which will be gien on the blackboard. Some of these notes ma
More informationApplication of Fractional Fourier Transform in Non-stationary Signal Time Frequency Analysis
International Core Jornal of Engineering Vol. No.6 5 ISSN: 44-895 Application of Fractional Forier Transform in Non-stationary Signal Time Freqency Analysis Danyan Bai Changchn University of Science and
More informationThe Brauer Manin obstruction
The Braer Manin obstrction Martin Bright 17 April 2008 1 Definitions Let X be a smooth, geometrically irredcible ariety oer a field k. Recall that the defining property of an Azmaya algebra A is that,
More informationThe Linear Quadratic Regulator
10 The Linear Qadratic Reglator 10.1 Problem formlation This chapter concerns optimal control of dynamical systems. Most of this development concerns linear models with a particlarly simple notion of optimality.
More informationRESGen: Renewable Energy Scenario Generation Platform
1 RESGen: Renewable Energy Scenario Generation Platform Emil B. Iversen, Pierre Pinson, Senior Member, IEEE, and Igor Ardin Abstract Space-time scenarios of renewable power generation are increasingly
More informationMultivariable Ripple-Free Deadbeat Control
Scholarly Jornal of Mathematics and Compter Science Vol. (), pp. 9-9, October Available online at http:// www.scholarly-jornals.com/sjmcs ISS 76-8947 Scholarly-Jornals Fll Length Research aper Mltivariable
More information1. Solve Problem 1.3-3(c) 2. Solve Problem 2.2-2(b)
. Sole Problem.-(c). Sole Problem.-(b). A two dimensional trss shown in the figre is made of alminm with Yong s modls E = 8 GPa and failre stress Y = 5 MPa. Determine the minimm cross-sectional area of
More information