arxiv: v1 [quant-ph] 4 Nov 2008

Training a Binary Classifier with the Quantum Adiabatic Algorithm

Hartmut Neven
Google, neven@google.com

Vasil S. Denchev
Purdue University, vdenchev@purdue.edu

Geordie Rose and William G. Macready
D-Wave Systems, rose,wgm@dwavesys.com

November 4, 2008

Abstract

This paper describes how to make the problem of binary classification amenable to quantum computing. A formulation is employed in which the binary classifier is constructed as a thresholded linear superposition of a set of weak classifiers. The weights in the superposition are optimized in a learning process that strives to minimize the training error as well as the number of weak classifiers used. No efficient solution to this problem is known. To bring it into a format that allows the application of adiabatic quantum computing (AQC), we first show that the bit-precision with which the weights need to be represented only grows logarithmically with the ratio of the number of training examples to the number of weak classifiers. This allows us to effectively formulate the training process as a binary optimization problem. Solving it with heuristic solvers such as tabu search, we find that the resulting classifier outperforms a widely used state-of-the-art method, AdaBoost, on a variety of benchmark problems. Moreover, we discovered the interesting fact that bit-constrained learning machines often exhibit lower generalization error rates. Changing the loss function that measures the training error from 0-1 loss to least squares maps the training to quadratic unconstrained binary optimization. This corresponds to the format required by D-Wave's implementation of AQC. Simulations with heuristic solvers again yield results better than those obtained with boosting approaches. Since the resulting quadratic binary program is NP-hard, additional gains can be expected from applying the actual quantum processor.

1 Introduction

Many problems in machine learning map onto optimization problems that are formally NP-hard (see footnote 1).
Consequently, large areas of the field are concerned with simplifications and relaxations that make the resulting optimization problems computationally tractable. This has resulted in heuristic tools that are useful in practice but whose quality is inferior compared to results obtained by solving the original problem. Moreover, the wealth of heuristic methods requires the practitioner to select the most suitable approach on a case-by-case basis.

Footnote 1: The formal NP-hardness of a class of problems does not imply that problem instances encountered in practice are computationally difficult. However, experience tells us that this is usually the case for the learning problems at hand.

Adiabatic quantum computing is a new method that draws on quantum mechanical processes and promises to solve hard discrete optimization problems better than is possible with classical algorithms [FGGS00][FGG+01]. Thus it offers an opportunity to tackle hard machine learning problems head-on. This paper investigates how this novel method can be applied to a basic problem in machine learning: constructing a binary classifier from a dictionary of feature detectors.

2 Training a binary classifier

We study a classifier of the form

$$ y = H(x) = \mathrm{sign}\left( \sum_{i=1}^{N} w_i h_i(x) \right), \tag{1} $$

where $x \in \mathbb{R}^M$ are the input patterns to be classified, $y \in \{-1, 1\}$ is the output of the classifier, the $h_i : x \mapsto \{-1, 1\}$ are so-called weak classifiers or feature detectors, and the $w_i \in [0, 1]$ are a set of weights to be optimized. $H(x)$ is known as a strong classifier.

Training, i.e. the process of choosing the weights $w_i$, proceeds by simultaneously minimizing two terms. One term measures the error over a set of $S$ training examples $\{(x_s, y_s) \mid s = 1, \ldots, S\}$. A natural choice is 0-1 error, which counts the number of misclassifications over the training set:

$$ L(w) = \sum_{s=1}^{S} H\left( -y_s \sum_{i=1}^{N} w_i h_i(x_s) \right), \tag{2} $$

where $H$ is the Heaviside step function. $L(w)$ is referred to as the loss function. The second term is known as regularization, $R(w)$, and it ensures that the classifier does not become too complex. Classifiers with high complexity tend to classify the examples in the training set with low error but do not do well on independent test sets. The phenomenon of a classifier achieving a small training error but yielding a large generalization error is known as overlearning.
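As a minimal sketch of eqns. (1) and (2), the following plain-Python functions evaluate the weighted vote, the strong classifier, and the 0-1 loss on a toy data set. The function names, the toy responses, and the convention that a vote of exactly zero counts as +1 (and as no error) are assumptions made for illustration; the paper does not specify tie handling.

```python
def weighted_vote(weights, weak_outputs):
    """Sum_i w_i * h_i(x) for one sample; weak_outputs holds the +/-1 responses h_i(x)."""
    return sum(w * h for w, h in zip(weights, weak_outputs))


def strong_classifier(weights, weak_outputs):
    """Eq. (1): sign of the weighted vote (ties mapped to +1, an assumed convention)."""
    return 1 if weighted_vote(weights, weak_outputs) >= 0 else -1


def zero_one_loss(weights, samples, labels):
    """Eq. (2): number of samples whose margin y_s * vote is negative."""
    return sum(
        1 for h, y in zip(samples, labels) if y * weighted_vote(weights, h) < 0
    )


# Toy responses of N = 3 weak classifiers on S = 4 samples (hypothetical data).
toy_samples = [[1, 1, -1], [1, -1, -1], [-1, 1, 1], [-1, -1, 1]]
toy_labels = [1, 1, -1, -1]

good_weights = [1.0, 0.0, 0.0]   # uses only the classifier that matches the labels
poor_weights = [0.0, 1.0, 0.0]   # uses only an uninformative classifier
```

With these toy values, `good_weights` classifies every sample correctly, while `poor_weights` misclassifies two of the four samples.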
A simple choice for the regularization term is based on the 0-norm, $\|w\|_0$, which gives the number of non-zero weights:

$$ R(w) = \lambda \|w\|_0 = \lambda \sum_{i=1}^{N} w_i^0. \tag{3} $$

Therefore, training amounts to solving the following minimization problem:

$$ w^{\mathrm{opt}} = \arg\min_w \left( L(w) + R(w) \right) = \arg\min_w \left( \sum_{s=1}^{S} H\left( -y_s \sum_{i=1}^{N} w_i h_i(x_s) \right) + \lambda \sum_{i=1}^{N} w_i^0 \right), \tag{4} $$

where $\lambda$ controls the relative importance of the regularization. Due to the non-convexity of the loss function, the resulting optimization problem is suspected to be NP-hard. Even if we were to choose a convex loss function, (4) is likely to remain an NP-hard problem due to the choice of the 0-norm for the regularization term [Zha08]. The choice of the 0-norm is attractive since it explicitly enforces sparsity, i.e. it drives many of the $w_i$ to zero. This is not only associated with good generalization but also with fast execution during the performance phase.

Each contribution to the overall loss, i.e. the per-sample loss $H\left( -y_s \sum_{i=1}^{N} w_i h_i(x_s) \right)$, enforces an inequality constraint:

$$ y_s \sum_{i=1}^{N} w_i h_i(x_s) \ge 0 \quad \text{for } s = 1, \ldots, S. \tag{5} $$

Thus, each training sample brings about an inequality, which demands choosing weights that lie on one side of a diagonal hyperplane in $N$-dimensional space. The hyperplane is defined by a set of coefficients that are $\pm 1$ depending on the responses of the weak classifiers. Fig. 1 illustrates the situation for $N = 3$. The number of regions created by $S$ hyperplanes can be calculated using their characteristic polynomial [OT92]:

$$ N_{\mathrm{regions}} = (-1)^N \sum_{S_k} (-1)^k (-1)^{\dim(\cap S_k)}, \tag{6} $$

where $S_k$ designates the $k$-element subsets of the $S$ hyperplanes and $\dim(\cap S_k)$ is the dimension of the intersection of $S_k$ (see footnote 2). Due to linear dependencies among the hyperplanes, which occur for $N \ge 4$, we were not able to find a closed form expression for $\dim(\cap S_k)$ and instead have to resort to an upper bound for $N_{\mathrm{regions}}$, which is known [OT92][Sau72] to be

$$ N_{\mathrm{regions}} \le \sum_{k=0}^{N} \binom{S}{k}. \tag{7} $$

It is possible that two different training samples generate identical inequality constraints for the $w_i$. In this sense, (7) is a conservative estimate as the actual number of solution spaces is often lower.

3 Modifications to allow the application of the quantum adiabatic algorithm

To bring (4) to a form that is amenable to AQC as implemented by the D-Wave hardware (see footnote 3), we need to effect several modifications. First, we need to transition from continuous weights $w_i \in [0, 1]$ to binary variables. Formally this can always be achieved by a binary expansion of the weights. The question that naturally arises is how many bits in the expansion are needed.
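The upper bound (7) and the idea of a binary expansion can be tried out numerically. The sketch below evaluates the bound with Python's `math.comb` and maps a list of bits to a weight in $[0, 1]$. The particular encoding in `weight_from_bits` (little-endian bits divided by the largest representable integer) is an assumption chosen so that a single bit yields the weights 0 and 1; the paper does not spell out its expansion.

```python
from math import comb


def region_upper_bound(num_samples, num_weak):
    """Upper bound (7): sum_{k=0}^{N} C(S, k) for S hyperplanes in N dimensions."""
    return sum(comb(num_samples, k) for k in range(num_weak + 1))


def weight_from_bits(bits):
    """Map little-endian bits to a weight in [0, 1] (illustrative encoding, assumed)."""
    levels = 2 ** len(bits) - 1                      # largest representable integer
    value = sum(b << k for k, b in enumerate(bits))  # integer encoded by the bits
    return value / levels


print(region_upper_bound(4, 3))  # 15, compared with the 14 regions reported for N = 3
print(weight_from_bits([1]))     # 1.0
```

For $N = 3$ with all four possible hyperplanes the bound gives 15, slightly above the 14 regions quoted in the caption of Fig. 1, which is consistent with (7) being an upper bound.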
Footnote 2: In this calculation we ignored the fact that a hyperplane or parts of it can become a solution space itself. This can occur when there are two training samples for which the sets of $\{h_i(x_s)\}$ differ by a global sign but have the same label $y_s$. Since this case is exceedingly unlikely, the probability being of the order $O(S/2^{2N})$, we can afford not to consider this situation.

Footnote 3: The D-Wave hardware minimizes an Ising function via a physical annealing of thermal and quantum fluctuations.

Figure 1: Arrangement of the diagonal hyperplanes that define the solution spaces for selecting $w^{\mathrm{opt}}$. Depicted is the situation for $N = 3$, which yields 14 regions. The number of solution chambers grows rapidly: $N = 4$ leads to 104 and $N = 5$ to 1882 regions. Here all possible hyperplanes are shown. However, in practice $S$ training samples will only invoke a small fraction of the $2^{N-1}$ possible hyperplanes. The blue dots are the vertices of a cube placed in the positive quadrant with one vertex coinciding with the origin. They correspond to weight configurations that can be represented with one bit. Multiple bits would give rise to a cube-shaped lattice.

Since each binary variable is associated with a qubit, it is important that we only use the minimal number necessary. Discrete weight configurations represented by a finite number of bits lie on a hypercubic lattice with edges that have $2^{\mathrm{bits}}$ vertices. If each solution region contains a lattice vertex then all classifiers $H_w(x)$ that can be attained with real valued weights can also be realized by the discrete weight configurations. Thus one obtains a rough estimate for the required bit depth by demanding that the number of vertices on the $(2^{\mathrm{bits}})^N$ lattice is at least as large as the number of solution regions created by the hyperplanes, $N_{\mathrm{regions}}$. The weak classifiers are typically constructed in a way that only positive weights are needed. Hence, we only need to hit solution regions in the positive quadrant, of which there are approximately $N_{\mathrm{regions}} / 2^N$.

$$ \frac{\text{Vertices on Lattice}}{\text{Regions in Positive Quadrant}} = \frac{(2^{\mathrm{bits}})^N}{N_{\mathrm{regions}} / 2^N} = \frac{2^{(\mathrm{bits}+1)N}}{N_{\mathrm{regions}}} \tag{8} $$

$$ \frac{2^{(\mathrm{bits}+1)N}}{N_{\mathrm{regions}}} \ge \frac{2^{(\mathrm{bits}+1)N}}{\sum_{k=0}^{N} \binom{S}{k}} \ge \frac{2^{(\mathrm{bits}+1)N}}{\left( \frac{eS}{N} \right)^N} = \frac{2^{(\mathrm{bits}+1)N} N^N}{(eS)^N} \overset{!}{\ge} 1 \tag{9} $$

$$ \left( \frac{2^{\mathrm{bits}+1} N}{eS} \right)^N = \left( \frac{2^{\mathrm{bits}+1} N}{efN} \right)^N = \left( \frac{2^{\mathrm{bits}+1}}{ef} \right)^N \overset{!}{\ge} 1 \tag{10} $$

$$ \mathrm{bits} \ge \log_2(f) + \log_2(e) - 1, \tag{11} $$

where $e$ is Euler's number and $f = S/N$. On the left side of eqn. (9) we used a standard result regarding binomial coefficients: $\sum_{k=0}^{N} \binom{S}{k} \le \left( \frac{eS}{N} \right)^N$. This holds in the case that $S \ge N$. Smaller numbers of training examples lead to an even lower bound than (11) for the required bit precision. Equation (11) is an important result as it shows that the bit precision needed for the weights only grows logarithmically with the ratio of the number of training examples to the number of weak classifiers. Thus for many problems that arise in practice we get away with very few bits, and often we will need only a single bit.

The second modification is not imposed by our desire to apply AQC per se but rather by the limitations of the D-Wave hardware, which calls for a Hamiltonian that has at most quadratic terms. To this end we effect a change in the loss function, now using the quadratic loss, such that finding $w^{\mathrm{opt}}$ in (4) amounts to solving a quadratic optimization program:

$$
\begin{aligned}
w^{\mathrm{opt}} &= \arg\min_w \left( \sum_{s=1}^{S} \left| \sum_{i=1}^{N} w_i h_i(x_s) - y_s \right|^2 + \lambda \|w\|_0 \right) \\
&= \arg\min_w \left( \sum_{s=1}^{S} \left[ \left( \sum_{i=1}^{N} w_i h_i(x_s) \right)^2 - 2 \sum_{i=1}^{N} w_i h_i(x_s) y_s + y_s^2 \right] + \lambda \sum_{i=1}^{N} w_i^0 \right) \\
&= \arg\min_w \left( \sum_{i=1}^{N} \sum_{j=1}^{N} w_i w_j \underbrace{\sum_{s=1}^{S} h_i(x_s) h_j(x_s)}_{\mathrm{Corr}(h_i, h_j)} + \sum_{i=1}^{N} w_i \Big( \lambda - 2 \underbrace{\sum_{s=1}^{S} h_i(x_s) y_s}_{\mathrm{Corr}(h_i, y)} \Big) \right)
\end{aligned}
\tag{12}
$$

In the third line we dropped $\sum_{s=1}^{S} y_s^2$ because it represents a constant offset. In order for the square loss to be compatible with the binary decision enforced by the sign in eqn. (1) we scale the $h_i(x)$ such that $h_i : x \mapsto \{-\frac{1}{N}, \frac{1}{N}\}$. Eqn. (12) corresponds to a quadratic unconstrained binary optimization (QUBO) problem. Note that the transition from the second to the third line only holds for weights comprised of a single bit. If we use an arbitrary number of bits, we have to introduce an auxiliary bit $w_{i,\mathrm{aux}}$ for each weight to enforce a 0-norm regularization within the framework of quadratic optimization. $R(w)$ of (4) then becomes

$$ R(w) = \sum_{i=1}^{N} \left[ \kappa w_i (1 - w_{i,\mathrm{aux}}) + \lambda w_{i,\mathrm{aux}} \right]. \tag{13} $$

Minimizing $R(w)$ causes the $w_{i,\mathrm{aux}}$ to act as indicator bits that are 1 when $w_i > 0$ and 0 otherwise. For this to work $\kappa$ has to be chosen sufficiently large.
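Both the bit-depth bound (11) and the auxiliary-bit regularizer (13) lend themselves to a quick numerical check. The sketch below rounds the right-hand side of (11) up to whole bits and, for a single weight, picks the auxiliary bit that minimizes one summand of (13). The function names and the rounding to at least one bit are illustrative assumptions, not part of the paper.

```python
import math


def min_bits(num_samples, num_weak):
    """Right-hand side of (11) with f = S/N, rounded up to whole bits (at least 1)."""
    f = num_samples / num_weak                      # assumes S >= N, as required for (9)
    bound = math.log2(f) + math.log2(math.e) - 1
    return max(1, math.ceil(bound))


def regularizer_term(w_i, w_aux, lam, kappa):
    """One summand of (13): kappa * w_i * (1 - w_aux) + lam * w_aux."""
    return kappa * w_i * (1 - w_aux) + lam * w_aux


def best_aux_bit(w_i, lam, kappa):
    """Auxiliary bit (0 or 1) that minimizes the summand for a fixed weight w_i."""
    return min((0, 1), key=lambda a: regularizer_term(w_i, a, lam, kappa))
```

For example, $f = 1$ gives a bound of about 0.44 and hence a single bit, while $f = 8$ gives about 3.44 and hence four bits. For the regularizer, a weight of 0.25 with $\lambda = 0.1$ switches its indicator bit on when $\kappa = 1$ but not when $\kappa = 0.2$, which illustrates why $\kappa$ must exceed $\lambda$ divided by the smallest non-zero weight value.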
There is an intuitive way to look at (12). The weak classifiers $h_i$ whose output is well correlated with the labels $y$ cause the bias term to be lowered, thus causing an increase in the probability that $w_i = 1$. The coupling terms are proportional to the correlation among the weak classifiers. Weak classifiers that are strongly correlated with each other cause the coupling energy to go up, thereby increasing the probability for one of the correlated classifiers to be switched off, i.e. that either $w_i$ or $w_j$ becomes 0.

The matrix $\mathrm{Corr}(h_i, h_j)$ figuring in the quadratic term is positive semi-definite, thereby seemingly making the resulting optimization problem efficiently solvable with classical optimization techniques. However, it has been confirmed that the quadratic unconstrained program with binary weights, an integer programming problem, is NP-hard, which validates the motivation for applying quantum algorithms to find $w^{\mathrm{opt}}$ in the above formulation [KN01][HR98]. Moreover, the matrix figuring in the quadratic term ceases to be of Gram type when the $w_i$ are represented by more than one bit and the modified regularization term (13) leads to additional entries in the matrix.

4 Implementation details

We implemented the training formulations given by (4) and (12) in Matlab. The dictionaries of weak classifiers that we employed consist of decision stumps of the form:

$$ h_l^{1+}(x) = \mathrm{sign}(x_l - \Theta_l^{+}) \quad \text{for } l = 1, \ldots, M \tag{14} $$

$$ h_l^{1-}(x) = \mathrm{sign}(-x_l - \Theta_l^{-}) \quad \text{for } l = 1, \ldots, M \tag{15} $$

$$ h_l^{2+}(x) = \mathrm{sign}(x_i x_j - \Theta_{i,j}^{+}) \quad \text{for } l = 1, \ldots, \binom{M}{2}; \; i, j = 1, \ldots, M; \; i < j \tag{16} $$

$$ h_l^{2-}(x) = \mathrm{sign}(-x_i x_j - \Theta_{i,j}^{-}) \quad \text{for } l = 1, \ldots, \binom{M}{2}; \; i, j = 1, \ldots, M; \; i < j \tag{17} $$

Here $h_l^{1+}$, $h_l^{1-}$, $h_l^{2+}$ and $h_l^{2-}$ are positive and negative weak classifiers of orders 1 and 2; $M$ is the dimensionality of the input vector $x$; $x_l$, $x_i$, $x_j$ are the elements of the input vector; and $\Theta_l^{+}$, $\Theta_l^{-}$, $\Theta_{i,j}^{+}$ and $\Theta_{i,j}^{-}$ are optimal thresholds of the positive and negative weak classifiers of orders 1 and 2, respectively. The input vectors are normalized using the 2-norm, i.e. we have $\|x\|_2 = 1$. Using the training data, an approximately optimal threshold $\Theta$ is computed for each classifier. The goal is to obtain an operating point that results in the minimum number of errors due to that weak classifier alone when the weak classifier is evaluated on the entire training set.

To minimize (4) for the purpose of determining the optimal weights, we employ simulated annealing. An exponential cooling schedule is used, and the schedule is tuned to the dataset for improved performance. The QUBO from (12) can be rewritten as $w^{\mathrm{opt}} = \arg\min_w \left( \sum_{i,j} Q_{i,j} w_i w_j \right)$, where the coefficient matrix has elements $Q_{i,j} = \mathrm{Corr}(h_i, h_j)$ and $Q_{i,i} = \frac{S}{N^2} + \lambda - 2\,\mathrm{Corr}(h_i, y)$. The resultant problem is solved with a multi-start tabu solver tuned to QUBO problems [Pal04].
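To make the QUBO form concrete, the sketch below builds the coefficient matrix $Q$ from weak-classifier responses that are already scaled to $\pm 1/N$ and minimizes $\sum_{i,j} Q_{i,j} w_i w_j$ by exhaustive enumeration. Enumeration is only a stand-in for the multi-start tabu solver used in the paper and is feasible for tiny $N$ only; the function names and the toy data are hypothetical.

```python
from itertools import product


def build_qubo(weak_outputs, labels, lam):
    """Build Q for single-bit weights; weak_outputs[s][i] = h_i(x_s), scaled to +/- 1/N."""
    num_weak = len(weak_outputs[0])
    q = [[0.0] * num_weak for _ in range(num_weak)]
    for i in range(num_weak):
        for j in range(num_weak):
            # Corr(h_i, h_j); for i == j this equals S / N^2 after scaling.
            q[i][j] = sum(row[i] * row[j] for row in weak_outputs)
        corr_with_labels = sum(row[i] * y for row, y in zip(weak_outputs, labels))
        # The diagonal additionally carries the linear term lambda - 2 * Corr(h_i, y).
        q[i][i] += lam - 2.0 * corr_with_labels
    return q


def qubo_energy(q, w):
    """Objective sum_{i,j} Q_ij * w_i * w_j for a binary weight vector w."""
    n = len(w)
    return sum(q[i][j] * w[i] * w[j] for i in range(n) for j in range(n))


def brute_force_minimum(q):
    """Enumerate all 2^N binary vectors (a stand-in for tabu search, tiny N only)."""
    return min(product((0, 1), repeat=len(q)), key=lambda w: qubo_energy(q, w))


# Toy problem: N = 3 weak classifiers, S = 4 samples (hypothetical data).
raw = [[1, 1, -1], [1, -1, -1], [-1, 1, 1], [-1, -1, 1]]
labels = [1, 1, -1, -1]
scaled = [[h / 3 for h in row] for row in raw]   # scale responses to +/- 1/N
q = build_qubo(scaled, labels, lam=0.1)
best = brute_force_minimum(q)                    # selects only the first weak classifier
```

In this toy problem the first weak classifier reproduces the labels, the second is uncorrelated with them, and the third is anti-correlated, so the minimizer switches on the first weight only. The energy of any binary $w$ equals the squared loss plus $\lambda$ times the number of active weights, minus the dropped constant $\sum_s y_s^2 = S$.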
We noticed that we could achieve enhanced results by adding a post-processing step. The $w^{\mathrm{opt}}$ returned by tabu search is used to compute an optimal threshold for the final strong classifier: $T = \frac{1}{S} \sum_{s=1}^{S} \sum_{i=1}^{N} w_i^{\mathrm{opt}} h_i(x_s)$, where the $x$ vectors are taken from a validation data set. $T$ represents the average of all computed responses of the strong classifier immediately before the categorical decision is made. We modify eqn. (1) by inserting $T$. Thus the final classifier becomes

$$ y = \mathrm{sign}\left( \sum_{i=1}^{N} w_i^{\mathrm{opt}} h_i(x) - T \right). \tag{18} $$

After that, the set of test examples is evaluated using the strong classifier configured in this way and the test errors are counted. The output consists of the number of test errors and the number of weak classifiers with non-zero weights that make up the final strong classifier. Since the optimal regularization strength cannot be known a priori for different data, we use 3-fold cross validation in order to find a regularization strength $\lambda$ that results in the best generalization on a validation set. We only consider values of $\lambda$ for which the total number of weak classifiers does not exceed $N/2$.

Figure 2: Visualization of the synthetic test data employed in the benchmark tests (axes: $x_{s1}$, $x_{s2}$). The data consists of two isotropic Gaussian distributions of 30-dimensional inputs with different variances. Shown are the first two dimensions. The two clouds represent the positive and negative data sets which are translated relative to each other. The position of their means is controlled by an overlap parameter chosen such that the clouds are maximally segregated when this overlap is 0 (left) and maximally overlap when the overlap is 1 (right). Thus, the classification task is hardest when the overlap parameter is 1.

5 Performance measurements on benchmark problems

To assess the performance of binary classifiers of the form (18) trained by solving the optimization problems (4) and (12) respectively, we measured their test errors on 30-dimensional synthetic and natural data sets. Synthetic test data was generated by sampling from

$$ P(x, y) = \frac{1}{2} \delta(y - 1)\, \mathcal{N}(x \mid \mu_{+}, I) + \frac{1}{2} \delta(y + 1)\, \mathcal{N}(x \mid \mu_{-}, I), $$

where $\mathcal{N}(x \mid \mu, \Sigma)$ is a spherical Gaussian having mean $\mu$ and covariance $\Sigma$. An overlap coefficient determines the separation of the two Gaussians. The synthetic data is illustrated in Fig. 2. The natural data consists of vectors of Gabor wavelet amplitudes extracted at eye locations in images showing faces.

For comparison we ran the same tests using different implementations of boosting. First we implemented AdaBoost as formulated by Freund and Schapire in [FS99]. Here we used the same weak classifiers as the ones defined in (14)-(17). Additionally, we compared the performance against the GML AdaBoost Matlab Toolbox [Vez06]. The GML toolbox contains implementations of three different flavors of AdaBoost. We ran the tests on all three of them but only display the test results for the best out of the three.
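The synthetic benchmark can be imitated with a few lines of standard-library Python. The sketch below draws labels with probability 1/2 each and samples each input from a spherical Gaussian around a class-dependent mean. How the overlap coefficient maps to the means is not stated in the text, so the linear mapping through `max_shift` is purely an assumption, as are the separate `sigma_pos` and `sigma_neg` parameters (the sampling formula uses identity covariance, while the caption of Fig. 2 mentions different variances).

```python
import random


def sample_synthetic(num_samples, dim=30, overlap=0.5, max_shift=4.0,
                     sigma_pos=1.0, sigma_neg=1.0, seed=0):
    """Draw (x, y) pairs from a two-Gaussian mixture with equal class priors."""
    rng = random.Random(seed)
    # Assumed mapping: overlap 0 -> means 'max_shift' apart per coordinate, overlap 1 -> equal means.
    shift = 0.5 * max_shift * (1.0 - overlap)
    data = []
    for _ in range(num_samples):
        y = 1 if rng.random() < 0.5 else -1
        sigma = sigma_pos if y == 1 else sigma_neg
        # Spherical Gaussian: independent coordinates with common standard deviation.
        x = [rng.gauss(y * shift, sigma) for _ in range(dim)]
        data.append((x, y))
    return data
```

Calling `sample_synthetic(200, overlap=0.0)` produces two well separated clouds, while `overlap=1.0` makes the class means coincide, which corresponds to the hardest setting described for Fig. 2.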
The GML toolbox is limited to the equivalent of our order 1 weak classifiers: classification and regression trees with branching factor of 1.

In the tests we varied the dictionary of weak classifiers, the bit precision used to represent the weights, as well as $f$, the ratio of training samples to weak classifiers. We used two dictionaries. The first, called the order 1 dictionary, consists of the set of decision stumps with linear arguments $h^{1+}$, $h^{1-}$ as per eqns. (14) and (15). The second one, called the order 2 dictionary, uses the set of weak classifiers of the order 1 dictionary but adds the order 2 decision stumps $h^{2+}$ and $h^{2-}$ described in eqns. (16) and (17) as well. The order 1 dictionary has 60 weak classifiers while the order 2 dictionary employs 930.

Figure 3: Test errors for the synthetic data set with f = 1 and f = 8 for the order 1 dictionary. Note that due to the different variances of the Gaussian distributions for the positive and negative training samples the generalization error does not necessarily approach 0.5 as the overlap becomes maximal. (Both panels plot error rate against overlap coefficient. Legend: GML; AB1; QP1 1 bit; QP1 3 bits; 0-1 Loss, order 1, 1 bit; 0-1 Loss, order 1, 3 bits; 0-1 Loss, order 1, 64 bits.)

Figure 4: Test errors for the synthetic data set with f = 1 and f = 8 for the order 2 dictionary. (Both panels plot error rate against overlap coefficient. Legend: AB2; QP2 1 bit; QP2 3 bits; 0-1 Loss, order 2, 1 bit; 0-1 Loss, order 2, 3 bits; 0-1 Loss, order 2, 64 bits.)

Figs. 3 and 4 show the test errors on the synthetic data set we obtained for different configurations. QP1 and QP2 denote the classifiers trained with the quadratic program (12) while using the dictionaries of order 1 and 2, respectively. Similarly, 0-1 Loss 1 and 0-1 Loss 2 stand for the classifiers using dictionaries 1 and 2 trained by solving the optimization problem (4). AB1 and AB2 denote classifiers trained for the same dictionaries but with the AdaBoost algorithm. Finally, GML represents the best result obtained with the GML AdaBoost Matlab toolbox. For GML, only a dictionary equivalent to our order 1 dictionary is available. The figures show test errors obtained on data sets that were not used during the training but were drawn i.i.d. from the same distributions. Test error is plotted against overlap coefficient for the range 0.7 to 1, corresponding to an increasingly harder classification problem. Accordingly, we observe an increasing error rate.

A number of observations can be made. First, the classifiers trained with global optimization outperform those obtained by the greedy feature selection methods employed in boosting. Second, with the exception of QP1, the global optimizations that used fewer bits to represent the weights did better than those that employed more. Classifiers using the richer order 2 dictionary achieved a lower test error, which, given the structure and size of the dictionaries relative to the input dimensionality, is not surprising. We do not draw conclusions from the fact that 0-1 loss fared worse than quadratic loss. This could be an artifact of using simulated annealing to solve (4), while tabu search was employed to optimize (12).

Besides synthetic data, we also did testing with natural data. The table in Fig. 5 shows the results obtained from a test set consisting of vectors of Gabor wavelet amplitudes extracted at eye locations in images showing faces. The data consisted of 2, input vectors, which we divided evenly into a training set, a validation set to fix the parameter $\lambda$, and a test set. For QP the bit-constrained learners always performed better in terms of accuracy and classifier compactness. In the case of 0-1 Loss the performance was similar for the different bit depths with small trade-offs between accuracy and compactness. The global optimization approach using the quadratic objective function (12) yields the best results. The accuracy is only increased by less than 1% relative to AdaBoost, but this is accomplished with a reduction of more than 5% of the switched-on weak classifiers.

Figure 5: Test results obtained for a natural data set, which consisted of vectors of Gabor wavelet amplitudes extracted at eye locations in images showing faces. Each data cell in the tables contains two numbers: the first represents the respective error rate, and the second gives the number of weak classifiers with a non-zero weight. The values are averages obtained through cross-validation of 1 runs. The shaded cells indicate the most accurate results.

6 Discussion

We have seen an impressive performance of global optimization approaches that minimize a regularized measure of training error to find an optimal combination of weights for constructing a binary classifier. Global optimization competes successfully with greedy methods such as the state-of-the-art method AdaBoost. Further, we discovered that bit-constrained learning machines often exhibit a generalization error that is lower than the one obtained when the weights are represented with higher precision. To the best of our knowledge, this has not been studied before. Bit constraining can be regarded as an intrinsic regularization that contributes to keeping the model complexity low. The finding that the bit-precision needed to realize the optimal training error only grows logarithmically with the ratio of the number of training examples to weak classifiers supplies insight into why few-bit learning machines work.

The competitive performance of bit-constrained classifiers suggests that training benefits from being treated as an integer program. This has a twofold implication. First, this is good news for hardware-constrained implementations such as cell phones, sensor networks, or early quantum chips with small numbers of qubits. Second, this renders the training problem manifestly NP-hard, thus further motivating the application of quantum algorithms that may generate better approximate solutions than classically available.

Our next steps will be to investigate the advantages that global optimization with AQC hardware offers for our problem instances. We plan to use the next generation of D-Wave chips with 128 qubits.
This will involve adjusting our implementation to additional engineering constraints of the existing AQC hardware such as a sparse connectivity graph among the qubits. Employing AQC during the training phase has the significant benefit that once the optimal set of weights has been computed, those can be taken advantage of by an entirely classical processor.

In this work we only considered fixed dictionaries of weak classifiers. An important generalization that remains to be studied is to apply this framework to adaptive dictionaries.

We want to conclude with the remark that our finding that bit-constrained learning has good generalization properties may have implications when studying plasticity in the nervous system, where it is still an unresolved problem how a synapse can store information reliably over a long period of time [KSJ00].

Acknowledgments

We would like to thank Hartwig Adam, Jiayong Zhang and Xiaowei Li for their help with preparing the natural test data; Alessandro Bissacco for his assistance with Matlab and reviewing the boosting code; Edward Farhi, Yoram Singer, Ulrich Buddemeier and Vint Cerf for commenting on earlier versions of the paper.

References

[FGG+01] Edward Farhi, Jeffrey Goldstone, Sam Gutmann, Joshua Lapan, Andrew Lundgren, and Daniel Preda. A quantum adiabatic evolution algorithm applied to random instances of an NP-complete problem. Science, 292:472, 2001.

[FGGS00] Edward Farhi, Jeffrey Goldstone, Sam Gutmann, and Michael Sipser. Quantum computation by adiabatic evolution. 2000. Preprint quant-ph/0001106v1.

[FS99] Yoav Freund and Robert E. Schapire. A short introduction to boosting. Journal of Japanese Society for Artificial Intelligence, 14(5):771-780, 1999.

[HR98] Christoph Helmberg and Franz Rendl. Solving quadratic (0,1)-problems by semidefinite programs and cutting planes. Math. Program., 82(3), 1998.

[KN01] Kengo Katayama and Hiroyuki Narihisa. Performance of simulated annealing-based heuristic for the unconstrained binary quadratic programming problem. European Journal of Operational Research, 134(1):103-119, 2001.

[KSJ00] Eric R. Kandel, James H. Schwartz, and Thomas M. Jessell. Principles of Neural Science. McGraw-Hill, 2000.

[OT92] Peter Orlik and Hiroaki Terao. Arrangements of hyperplanes. Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer-Verlag, Berlin, Germany, 1992.

[Pal04] Gintaras Palubeckis. Multistart tabu search strategies for the unconstrained binary quadratic optimization problem. Ann. Oper. Res., 131, 2004.

[Sau72] Norbert Sauer. On the density of families of sets. Journal of Combinatorial Theory, 13, 1972.

[Vez06] Alexander Vezhnevets. GML AdaBoost Matlab toolbox 0.3. MSU Graphics & Media Lab, Computer Vision Group, Department of Computer Science, Moscow State University, 2006.

[Zha08] Tong Zhang. Forward-backward greedy algorithm for learning sparse representations. Rutgers Statistics Department Technical Report, 2008.


More information

Minimizing Total Weighted Completion Time on Uniform Machines with Unbounded Batch

Minimizing Total Weighted Completion Time on Uniform Machines with Unbounded Batch The Eighth Internationa Symposium on Operations Research and Its Appications (ISORA 09) Zhangiaie, China, September 20 22, 2009 Copyright 2009 ORSC & APORC, pp. 402 408 Minimizing Tota Weighted Competion

More information

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents

MARKOV CHAINS AND MARKOV DECISION THEORY. Contents MARKOV CHAINS AND MARKOV DECISION THEORY ARINDRIMA DATTA Abstract. In this paper, we begin with a forma introduction to probabiity and expain the concept of random variabes and stochastic processes. After

More information

Lecture Note 3: Stationary Iterative Methods

Lecture Note 3: Stationary Iterative Methods MATH 5330: Computationa Methods of Linear Agebra Lecture Note 3: Stationary Iterative Methods Xianyi Zeng Department of Mathematica Sciences, UTEP Stationary Iterative Methods The Gaussian eimination (or

More information

8 Digifl'.11 Cth:uits and devices

8 Digifl'.11 Cth:uits and devices 8 Digif'. Cth:uits and devices 8. Introduction In anaog eectronics, votage is a continuous variabe. This is usefu because most physica quantities we encounter are continuous: sound eves, ight intensity,

More information

Power Control and Transmission Scheduling for Network Utility Maximization in Wireless Networks

Power Control and Transmission Scheduling for Network Utility Maximization in Wireless Networks ower Contro and Transmission Scheduing for Network Utiity Maximization in Wireess Networks Min Cao, Vivek Raghunathan, Stephen Hany, Vinod Sharma and. R. Kumar Abstract We consider a joint power contro

More information

Uniprocessor Feasibility of Sporadic Tasks with Constrained Deadlines is Strongly conp-complete

Uniprocessor Feasibility of Sporadic Tasks with Constrained Deadlines is Strongly conp-complete Uniprocessor Feasibiity of Sporadic Tasks with Constrained Deadines is Strongy conp-compete Pontus Ekberg and Wang Yi Uppsaa University, Sweden Emai: {pontus.ekberg yi}@it.uu.se Abstract Deciding the feasibiity

More information

Statistical Learning Theory: a Primer

Statistical Learning Theory: a Primer ??,??, 1 6 (??) c?? Kuwer Academic Pubishers, Boston. Manufactured in The Netherands. Statistica Learning Theory: a Primer THEODOROS EVGENIOU AND MASSIMILIANO PONTIL Center for Bioogica and Computationa

More information

Cryptanalysis of PKP: A New Approach

Cryptanalysis of PKP: A New Approach Cryptanaysis of PKP: A New Approach Éiane Jaumes and Antoine Joux DCSSI 18, rue du Dr. Zamenhoff F-92131 Issy-es-Mx Cedex France eiane.jaumes@wanadoo.fr Antoine.Joux@ens.fr Abstract. Quite recenty, in

More information

Statistics for Applications. Chapter 7: Regression 1/43

Statistics for Applications. Chapter 7: Regression 1/43 Statistics for Appications Chapter 7: Regression 1/43 Heuristics of the inear regression (1) Consider a coud of i.i.d. random points (X i,y i ),i =1,...,n : 2/43 Heuristics of the inear regression (2)

More information

Moreau-Yosida Regularization for Grouped Tree Structure Learning

Moreau-Yosida Regularization for Grouped Tree Structure Learning Moreau-Yosida Reguarization for Grouped Tree Structure Learning Jun Liu Computer Science and Engineering Arizona State University J.Liu@asu.edu Jieping Ye Computer Science and Engineering Arizona State

More information

Chemical Kinetics Part 2

Chemical Kinetics Part 2 Integrated Rate Laws Chemica Kinetics Part 2 The rate aw we have discussed thus far is the differentia rate aw. Let us consider the very simpe reaction: a A à products The differentia rate reates the rate

More information

Formulas for Angular-Momentum Barrier Factors Version II

Formulas for Angular-Momentum Barrier Factors Version II BNL PREPRINT BNL-QGS-06-101 brfactor1.tex Formuas for Anguar-Momentum Barrier Factors Version II S. U. Chung Physics Department, Brookhaven Nationa Laboratory, Upton, NY 11973 March 19, 2015 abstract A

More information

Appendix for Stochastic Gradient Monomial Gamma Sampler

Appendix for Stochastic Gradient Monomial Gamma Sampler 3 4 5 6 7 8 9 3 4 5 6 7 8 9 3 4 5 6 7 8 9 3 3 3 33 34 35 36 37 38 39 4 4 4 43 44 45 46 47 48 49 5 5 5 53 54 Appendix for Stochastic Gradient Monomia Gamma Samper A The Main Theorem We provide the foowing

More information

ASummaryofGaussianProcesses Coryn A.L. Bailer-Jones

ASummaryofGaussianProcesses Coryn A.L. Bailer-Jones ASummaryofGaussianProcesses Coryn A.L. Baier-Jones Cavendish Laboratory University of Cambridge caj@mrao.cam.ac.uk Introduction A genera prediction probem can be posed as foows. We consider that the variabe

More information

(This is a sample cover image for this issue. The actual cover is not yet available at this time.)

(This is a sample cover image for this issue. The actual cover is not yet available at this time.) (This is a sampe cover image for this issue The actua cover is not yet avaiabe at this time) This artice appeared in a journa pubished by Esevier The attached copy is furnished to the author for interna

More information

Chemical Kinetics Part 2. Chapter 16

Chemical Kinetics Part 2. Chapter 16 Chemica Kinetics Part 2 Chapter 16 Integrated Rate Laws The rate aw we have discussed thus far is the differentia rate aw. Let us consider the very simpe reaction: a A à products The differentia rate reates

More information

Physics 127c: Statistical Mechanics. Fermi Liquid Theory: Collective Modes. Boltzmann Equation. The quasiparticle energy including interactions

Physics 127c: Statistical Mechanics. Fermi Liquid Theory: Collective Modes. Boltzmann Equation. The quasiparticle energy including interactions Physics 27c: Statistica Mechanics Fermi Liquid Theory: Coective Modes Botzmann Equation The quasipartice energy incuding interactions ε p,σ = ε p + f(p, p ; σ, σ )δn p,σ, () p,σ with ε p ε F + v F (p p

More information

Nonlinear Gaussian Filtering via Radial Basis Function Approximation

Nonlinear Gaussian Filtering via Radial Basis Function Approximation 51st IEEE Conference on Decision and Contro December 10-13 01 Maui Hawaii USA Noninear Gaussian Fitering via Radia Basis Function Approximation Huazhen Fang Jia Wang and Raymond A de Caafon Abstract This

More information

Learning Fully Observed Undirected Graphical Models

Learning Fully Observed Undirected Graphical Models Learning Fuy Observed Undirected Graphica Modes Sides Credit: Matt Gormey (2016) Kayhan Batmangheich 1 Machine Learning The data inspires the structures we want to predict Inference finds {best structure,

More information

Target Location Estimation in Wireless Sensor Networks Using Binary Data

Target Location Estimation in Wireless Sensor Networks Using Binary Data Target Location stimation in Wireess Sensor Networks Using Binary Data Ruixin Niu and Pramod K. Varshney Department of ectrica ngineering and Computer Science Link Ha Syracuse University Syracuse, NY 344

More information

Data Mining Technology for Failure Prognostic of Avionics

Data Mining Technology for Failure Prognostic of Avionics IEEE Transactions on Aerospace and Eectronic Systems. Voume 38, #, pp.388-403, 00. Data Mining Technoogy for Faiure Prognostic of Avionics V.A. Skormin, Binghamton University, Binghamton, NY, 1390, USA

More information

12.2. Maxima and Minima. Introduction. Prerequisites. Learning Outcomes

12.2. Maxima and Minima. Introduction. Prerequisites. Learning Outcomes Maima and Minima 1. Introduction In this Section we anayse curves in the oca neighbourhood of a stationary point and, from this anaysis, deduce necessary conditions satisfied by oca maima and oca minima.

More information

FORECASTING TELECOMMUNICATIONS DATA WITH AUTOREGRESSIVE INTEGRATED MOVING AVERAGE MODELS

FORECASTING TELECOMMUNICATIONS DATA WITH AUTOREGRESSIVE INTEGRATED MOVING AVERAGE MODELS FORECASTING TEECOMMUNICATIONS DATA WITH AUTOREGRESSIVE INTEGRATED MOVING AVERAGE MODES Niesh Subhash naawade a, Mrs. Meenakshi Pawar b a SVERI's Coege of Engineering, Pandharpur. nieshsubhash15@gmai.com

More information

Appendix for Stochastic Gradient Monomial Gamma Sampler

Appendix for Stochastic Gradient Monomial Gamma Sampler Appendix for Stochastic Gradient Monomia Gamma Samper A The Main Theorem We provide the foowing theorem to characterize the stationary distribution of the stochastic process with SDEs in (3) Theorem 3

More information

Combining reaction kinetics to the multi-phase Gibbs energy calculation

Combining reaction kinetics to the multi-phase Gibbs energy calculation 7 th European Symposium on Computer Aided Process Engineering ESCAPE7 V. Pesu and P.S. Agachi (Editors) 2007 Esevier B.V. A rights reserved. Combining reaction inetics to the muti-phase Gibbs energy cacuation

More information

HYDROGEN ATOM SELECTION RULES TRANSITION RATES

HYDROGEN ATOM SELECTION RULES TRANSITION RATES DOING PHYSICS WITH MATLAB QUANTUM PHYSICS Ian Cooper Schoo of Physics, University of Sydney ian.cooper@sydney.edu.au HYDROGEN ATOM SELECTION RULES TRANSITION RATES DOWNLOAD DIRECTORY FOR MATLAB SCRIPTS

More information

Gauss Law. 2. Gauss s Law: connects charge and field 3. Applications of Gauss s Law

Gauss Law. 2. Gauss s Law: connects charge and field 3. Applications of Gauss s Law Gauss Law 1. Review on 1) Couomb s Law (charge and force) 2) Eectric Fied (fied and force) 2. Gauss s Law: connects charge and fied 3. Appications of Gauss s Law Couomb s Law and Eectric Fied Couomb s

More information

Asymptotic Properties of a Generalized Cross Entropy Optimization Algorithm

Asymptotic Properties of a Generalized Cross Entropy Optimization Algorithm 1 Asymptotic Properties of a Generaized Cross Entropy Optimization Agorithm Zijun Wu, Michae Koonko, Institute for Appied Stochastics and Operations Research, Caustha Technica University Abstract The discrete

More information

Substructuring Preconditioners for the Bidomain Extracellular Potential Problem

Substructuring Preconditioners for the Bidomain Extracellular Potential Problem Substructuring Preconditioners for the Bidomain Extraceuar Potentia Probem Mico Pennacchio 1 and Vaeria Simoncini 2,1 1 IMATI - CNR, via Ferrata, 1, 27100 Pavia, Itay mico@imaticnrit 2 Dipartimento di

More information

(f) is called a nearly holomorphic modular form of weight k + 2r as in [5].

(f) is called a nearly holomorphic modular form of weight k + 2r as in [5]. PRODUCTS OF NEARLY HOLOMORPHIC EIGENFORMS JEFFREY BEYERL, KEVIN JAMES, CATHERINE TRENTACOSTE, AND HUI XUE Abstract. We prove that the product of two neary hoomorphic Hece eigenforms is again a Hece eigenform

More information

Quantum Mechanical Models of Vibration and Rotation of Molecules Chapter 18

Quantum Mechanical Models of Vibration and Rotation of Molecules Chapter 18 Quantum Mechanica Modes of Vibration and Rotation of Moecues Chapter 18 Moecuar Energy Transationa Vibrationa Rotationa Eectronic Moecuar Motions Vibrations of Moecues: Mode approximates moecues to atoms

More information

arxiv: v1 [math.ca] 6 Mar 2017

arxiv: v1 [math.ca] 6 Mar 2017 Indefinite Integras of Spherica Besse Functions MIT-CTP/487 arxiv:703.0648v [math.ca] 6 Mar 07 Joyon K. Boomfied,, Stephen H. P. Face,, and Zander Moss, Center for Theoretica Physics, Laboratory for Nucear

More information

Scalable Spectrum Allocation for Large Networks Based on Sparse Optimization

Scalable Spectrum Allocation for Large Networks Based on Sparse Optimization Scaabe Spectrum ocation for Large Networks ased on Sparse Optimization innan Zhuang Modem R&D Lab Samsung Semiconductor, Inc. San Diego, C Dongning Guo, Ermin Wei, and Michae L. Honig Department of Eectrica

More information

A Simple and Efficient Algorithm of 3-D Single-Source Localization with Uniform Cross Array Bing Xue 1 2 a) * Guangyou Fang 1 2 b and Yicai Ji 1 2 c)

A Simple and Efficient Algorithm of 3-D Single-Source Localization with Uniform Cross Array Bing Xue 1 2 a) * Guangyou Fang 1 2 b and Yicai Ji 1 2 c) A Simpe Efficient Agorithm of 3-D Singe-Source Locaization with Uniform Cross Array Bing Xue a * Guangyou Fang b Yicai Ji c Key Laboratory of Eectromagnetic Radiation Sensing Technoogy, Institute of Eectronics,

More information

High Spectral Resolution Infrared Radiance Modeling Using Optimal Spectral Sampling (OSS) Method

High Spectral Resolution Infrared Radiance Modeling Using Optimal Spectral Sampling (OSS) Method High Spectra Resoution Infrared Radiance Modeing Using Optima Spectra Samping (OSS) Method J.-L. Moncet and G. Uymin Background Optima Spectra Samping (OSS) method is a fast and accurate monochromatic

More information

Multiple Beam Interference

Multiple Beam Interference MutipeBeamInterference.nb James C. Wyant 1 Mutipe Beam Interference 1. Airy's Formua We wi first derive Airy's formua for the case of no absorption. ü 1.1 Basic refectance and transmittance Refected ight

More information

Sequential Decoding of Polar Codes with Arbitrary Binary Kernel

Sequential Decoding of Polar Codes with Arbitrary Binary Kernel Sequentia Decoding of Poar Codes with Arbitrary Binary Kerne Vera Miosavskaya, Peter Trifonov Saint-Petersburg State Poytechnic University Emai: veram,petert}@dcn.icc.spbstu.ru Abstract The probem of efficient

More information

Do Schools Matter for High Math Achievement? Evidence from the American Mathematics Competitions Glenn Ellison and Ashley Swanson Online Appendix

Do Schools Matter for High Math Achievement? Evidence from the American Mathematics Competitions Glenn Ellison and Ashley Swanson Online Appendix VOL. NO. DO SCHOOLS MATTER FOR HIGH MATH ACHIEVEMENT? 43 Do Schoos Matter for High Math Achievement? Evidence from the American Mathematics Competitions Genn Eison and Ashey Swanson Onine Appendix Appendix

More information

Efficiently Generating Random Bits from Finite State Markov Chains

Efficiently Generating Random Bits from Finite State Markov Chains 1 Efficienty Generating Random Bits from Finite State Markov Chains Hongchao Zhou and Jehoshua Bruck, Feow, IEEE Abstract The probem of random number generation from an uncorreated random source (of unknown

More information

https://doi.org/ /epjconf/

https://doi.org/ /epjconf/ HOW TO APPLY THE OPTIMAL ESTIMATION METHOD TO YOUR LIDAR MEASUREMENTS FOR IMPROVED RETRIEVALS OF TEMPERATURE AND COMPOSITION R. J. Sica 1,2,*, A. Haefee 2,1, A. Jaai 1, S. Gamage 1 and G. Farhani 1 1 Department

More information

arxiv: v1 [cs.db] 1 Aug 2012

arxiv: v1 [cs.db] 1 Aug 2012 Functiona Mechanism: Regression Anaysis under Differentia Privacy arxiv:208.029v [cs.db] Aug 202 Jun Zhang Zhenjie Zhang 2 Xiaokui Xiao Yin Yang 2 Marianne Winsett 2,3 ABSTRACT Schoo of Computer Engineering

More information

The EM Algorithm applied to determining new limit points of Mahler measures

The EM Algorithm applied to determining new limit points of Mahler measures Contro and Cybernetics vo. 39 (2010) No. 4 The EM Agorithm appied to determining new imit points of Maher measures by Souad E Otmani, Georges Rhin and Jean-Marc Sac-Épée Université Pau Veraine-Metz, LMAM,

More information

A Novel Learning Method for Elman Neural Network Using Local Search

A Novel Learning Method for Elman Neural Network Using Local Search Neura Information Processing Letters and Reviews Vo. 11, No. 8, August 2007 LETTER A Nove Learning Method for Eman Neura Networ Using Loca Search Facuty of Engineering, Toyama University, Gofuu 3190 Toyama

More information

A Statistical Framework for Real-time Event Detection in Power Systems

A Statistical Framework for Real-time Event Detection in Power Systems 1 A Statistica Framework for Rea-time Event Detection in Power Systems Noan Uhrich, Tim Christman, Phiip Swisher, and Xichen Jiang Abstract A quickest change detection (QCD) agorithm is appied to the probem

More information

MONTE CARLO SIMULATIONS

MONTE CARLO SIMULATIONS MONTE CARLO SIMULATIONS Current physics research 1) Theoretica 2) Experimenta 3) Computationa Monte Caro (MC) Method (1953) used to study 1) Discrete spin systems 2) Fuids 3) Poymers, membranes, soft matter

More information

Approximated MLC shape matrix decomposition with interleaf collision constraint

Approximated MLC shape matrix decomposition with interleaf collision constraint Approximated MLC shape matrix decomposition with intereaf coision constraint Thomas Kainowski Antje Kiese Abstract Shape matrix decomposition is a subprobem in radiation therapy panning. A given fuence

More information

Determining The Degree of Generalization Using An Incremental Learning Algorithm

Determining The Degree of Generalization Using An Incremental Learning Algorithm Determining The Degree of Generaization Using An Incrementa Learning Agorithm Pabo Zegers Facutad de Ingeniería, Universidad de os Andes San Caros de Apoquindo 22, Las Condes, Santiago, Chie pzegers@uandes.c

More information

MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES

MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES MATH 172: MOTIVATION FOR FOURIER SERIES: SEPARATION OF VARIABLES Separation of variabes is a method to sove certain PDEs which have a warped product structure. First, on R n, a inear PDE of order m is

More information

Multicategory Classification by Support Vector Machines

Multicategory Classification by Support Vector Machines Muticategory Cassification by Support Vector Machines Erin J Bredensteiner Department of Mathematics University of Evansvie 800 Lincon Avenue Evansvie, Indiana 47722 eb6@evansvieedu Kristin P Bennett Department

More information

On the Goal Value of a Boolean Function

On the Goal Value of a Boolean Function On the Goa Vaue of a Booean Function Eric Bach Dept. of CS University of Wisconsin 1210 W. Dayton St. Madison, WI 53706 Lisa Heerstein Dept of CSE NYU Schoo of Engineering 2 Metrotech Center, 10th Foor

More information

Ant Colony Algorithms for Constructing Bayesian Multi-net Classifiers

Ant Colony Algorithms for Constructing Bayesian Multi-net Classifiers Ant Coony Agorithms for Constructing Bayesian Muti-net Cassifiers Khaid M. Saama and Aex A. Freitas Schoo of Computing, University of Kent, Canterbury, UK. {kms39,a.a.freitas}@kent.ac.uk December 5, 2013

More information

BP neural network-based sports performance prediction model applied research

BP neural network-based sports performance prediction model applied research Avaiabe onine www.jocpr.com Journa of Chemica and Pharmaceutica Research, 204, 6(7:93-936 Research Artice ISSN : 0975-7384 CODEN(USA : JCPRC5 BP neura networ-based sports performance prediction mode appied

More information

Discrete Applied Mathematics

Discrete Applied Mathematics Discrete Appied Mathematics 159 (2011) 812 825 Contents ists avaiabe at ScienceDirect Discrete Appied Mathematics journa homepage: www.esevier.com/ocate/dam A direct barter mode for course add/drop process

More information

VI.G Exact free energy of the Square Lattice Ising model

VI.G Exact free energy of the Square Lattice Ising model VI.G Exact free energy of the Square Lattice Ising mode As indicated in eq.(vi.35), the Ising partition function is reated to a sum S, over coections of paths on the attice. The aowed graphs for a square

More information

6.434J/16.391J Statistics for Engineers and Scientists May 4 MIT, Spring 2006 Handout #17. Solution 7

6.434J/16.391J Statistics for Engineers and Scientists May 4 MIT, Spring 2006 Handout #17. Solution 7 6.434J/16.391J Statistics for Engineers and Scientists May 4 MIT, Spring 2006 Handout #17 Soution 7 Probem 1: Generating Random Variabes Each part of this probem requires impementation in MATLAB. For the

More information

An approximate method for solving the inverse scattering problem with fixed-energy data

An approximate method for solving the inverse scattering problem with fixed-energy data J. Inv. I-Posed Probems, Vo. 7, No. 6, pp. 561 571 (1999) c VSP 1999 An approximate method for soving the inverse scattering probem with fixed-energy data A. G. Ramm and W. Scheid Received May 12, 1999

More information

Active Learning & Experimental Design

Active Learning & Experimental Design Active Learning & Experimenta Design Danie Ting Heaviy modified, of course, by Lye Ungar Origina Sides by Barbara Engehardt and Aex Shyr Lye Ungar, University of Pennsyvania Motivation u Data coection

More information

SydU STAT3014 (2015) Second semester Dr. J. Chan 18

SydU STAT3014 (2015) Second semester Dr. J. Chan 18 STAT3014/3914 Appied Stat.-Samping C-Stratified rand. sampe Stratified Random Samping.1 Introduction Description The popuation of size N is divided into mutuay excusive and exhaustive subpopuations caed

More information

Problem set 6 The Perron Frobenius theorem.

Problem set 6 The Perron Frobenius theorem. Probem set 6 The Perron Frobenius theorem. Math 22a4 Oct 2 204, Due Oct.28 In a future probem set I want to discuss some criteria which aow us to concude that that the ground state of a sef-adjoint operator

More information

<C 2 2. λ 2 l. λ 1 l 1 < C 1

<C 2 2. λ 2 l. λ 1 l 1 < C 1 Teecommunication Network Contro and Management (EE E694) Prof. A. A. Lazar Notes for the ecture of 7/Feb/95 by Huayan Wang (this document was ast LaT E X-ed on May 9,995) Queueing Primer for Muticass Optima

More information

A unified framework for Regularization Networks and Support Vector Machines. Theodoros Evgeniou, Massimiliano Pontil, Tomaso Poggio

A unified framework for Regularization Networks and Support Vector Machines. Theodoros Evgeniou, Massimiliano Pontil, Tomaso Poggio MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES A.I. Memo No. 1654 March23, 1999

More information

Melodic contour estimation with B-spline models using a MDL criterion

Melodic contour estimation with B-spline models using a MDL criterion Meodic contour estimation with B-spine modes using a MDL criterion Damien Loive, Ney Barbot, Oivier Boeffard IRISA / University of Rennes 1 - ENSSAT 6 rue de Kerampont, B.P. 80518, F-305 Lannion Cedex

More information

Asynchronous Control for Coupled Markov Decision Systems

Asynchronous Control for Coupled Markov Decision Systems INFORMATION THEORY WORKSHOP (ITW) 22 Asynchronous Contro for Couped Marov Decision Systems Michae J. Neey University of Southern Caifornia Abstract This paper considers optima contro for a coection of

More information

Introduction. Figure 1 W8LC Line Array, box and horn element. Highlighted section modelled.

Introduction. Figure 1 W8LC Line Array, box and horn element. Highlighted section modelled. imuation of the acoustic fied produced by cavities using the Boundary Eement Rayeigh Integra Method () and its appication to a horn oudspeaer. tephen Kirup East Lancashire Institute, Due treet, Bacburn,

More information

How the backpropagation algorithm works Srikumar Ramalingam School of Computing University of Utah

How the backpropagation algorithm works Srikumar Ramalingam School of Computing University of Utah How the backpropagation agorithm works Srikumar Ramaingam Schoo of Computing University of Utah Reference Most of the sides are taken from the second chapter of the onine book by Michae Nieson: neuranetworksanddeepearning.com

More information

Tight Approximation Algorithms for Maximum Separable Assignment Problems

Tight Approximation Algorithms for Maximum Separable Assignment Problems MATHEMATICS OF OPERATIONS RESEARCH Vo. 36, No. 3, August 011, pp. 416 431 issn 0364-765X eissn 156-5471 11 3603 0416 10.187/moor.1110.0499 011 INFORMS Tight Approximation Agorithms for Maximum Separabe

More information

Smoothness equivalence properties of univariate subdivision schemes and their projection analogues

Smoothness equivalence properties of univariate subdivision schemes and their projection analogues Numerische Mathematik manuscript No. (wi be inserted by the editor) Smoothness equivaence properties of univariate subdivision schemes and their projection anaogues Phiipp Grohs TU Graz Institute of Geometry

More information

A proposed nonparametric mixture density estimation using B-spline functions

A proposed nonparametric mixture density estimation using B-spline functions A proposed nonparametric mixture density estimation using B-spine functions Atizez Hadrich a,b, Mourad Zribi a, Afif Masmoudi b a Laboratoire d Informatique Signa et Image de a Côte d Opae (LISIC-EA 4491),

More information

Statistical Inference, Econometric Analysis and Matrix Algebra

Statistical Inference, Econometric Analysis and Matrix Algebra Statistica Inference, Econometric Anaysis and Matrix Agebra Bernhard Schipp Water Krämer Editors Statistica Inference, Econometric Anaysis and Matrix Agebra Festschrift in Honour of Götz Trenker Physica-Verag

More information

Pairwise RNA Edit Distance

Pairwise RNA Edit Distance Pairwise RNA Edit Distance In the foowing: Sequences S 1 and S 2 associated structures P 1 and P 2 scoring of aignment: different edit operations arc atering arc removing 1) ACGUUGACUGACAACAC..(((...)))...

More information

Paragraph Topic Classification

Paragraph Topic Classification Paragraph Topic Cassification Eugene Nho Graduate Schoo of Business Stanford University Stanford, CA 94305 enho@stanford.edu Edward Ng Department of Eectrica Engineering Stanford University Stanford, CA

More information

Stochastic Complement Analysis of Multi-Server Threshold Queues. with Hysteresis. Abstract

Stochastic Complement Analysis of Multi-Server Threshold Queues. with Hysteresis. Abstract Stochastic Compement Anaysis of Muti-Server Threshod Queues with Hysteresis John C.S. Lui The Dept. of Computer Science & Engineering The Chinese University of Hong Kong Leana Goubchik Dept. of Computer

More information

Nonlinear Analysis of Spatial Trusses

Nonlinear Analysis of Spatial Trusses Noninear Anaysis of Spatia Trusses João Barrigó October 14 Abstract The present work addresses the noninear behavior of space trusses A formuation for geometrica noninear anaysis is presented, which incudes

More information

Probabilistic Graphical Models

Probabilistic Graphical Models Schoo of Computer Science Probabiistic Graphica Modes Gaussian graphica modes and Ising modes: modeing networks Eric Xing Lecture 0, February 0, 07 Reading: See cass website Eric Xing @ CMU, 005-07 Network

More information

arxiv: v1 [cs.lg] 31 Oct 2017

arxiv: v1 [cs.lg] 31 Oct 2017 ACCELERATED SPARSE SUBSPACE CLUSTERING Abofaz Hashemi and Haris Vikao Department of Eectrica and Computer Engineering, University of Texas at Austin, Austin, TX, USA arxiv:7.26v [cs.lg] 3 Oct 27 ABSTRACT

More information