arxiv: v2 [stat.ml] 25 Aug 2015

Size: px
Start display at page:

Download "arxiv: v2 [stat.ml] 25 Aug 2015"

Transcription

1 A statistical esective on andomized sketching fo odinay least-squaes Gavesh Raskutti Michael Mahoney,3 Deatment of Statistics & Deatment of Comute Science, Univesity of Wisconsin Madison Intenational Comute Science Institute 3 Deatment of Statistics, Univesity of Califonia Bekeley axiv: v [stat.ml] 5 Aug 05 Abstact We conside statistical as well as algoithmic asects of solving lage-scale least-squaes (LS) oblems using andomized sketching algoithms. Fo a LS oblem with inut data (X, Y ) R n R n, sketching algoithms use a sketching matix, S R n, whee n. Then, athe than solving the LS oblem using the full data (X, Y ), sketching algoithms solve the LS oblem using only the sketched data (SX, SY ). Pio wok has tyically adoted an algoithmic esective, in that it has made no statistical assumtions on the inut X and Y, and instead it has been assumed that the data (X, Y ) ae fixed and wost-case (WC). Pio esults show that, when using sketching matices such as andom ojections and leveage-scoe samling algoithms, with n, the WC eo is the same as solving the oiginal oblem, u to a small constant. Fom a statistical esective, we tyically conside the mean-squaed eo efomance of andomized sketching algoithms, when data (X, Y ) ae geneated accoding to a statistical linea model Y = Xβ + ɛ, whee ɛ is a noise ocess. In this ae, we ovide a igoous comaison of both esectives leading to insights on how they diffe. To do this, we fist develo a famewok fo assessing, in a unified manne, algoithmic and statistical asects of andomized sketching methods. We then conside the statistical ediction efficiency (PE) and the statistical esidual efficiency (RE) of the sketched LS estimato; and we use ou famewok to ovide ue bounds fo seveal tyes of andom ojection and andom samling sketching algoithms. Among othe esults, we show that the RE can be ue bounded when n while the PE tyically equies the samle size to be substantially lage. Lowe bounds develoed in subsequent esults show that ou ue bounds on PE can not be imoved. Intoduction Recent wok in lage-scale data analysis has focused on develoing so-called sketching algoithms: given a data set and an objective function of inteest, constuct a small sketch of the full data set, e.g., by using andom samling o andom ojection methods, and use that sketch as a suogate to efom comutations of inteest fo the full data set (see [] fo a eview). Most effot in this aea has adoted an algoithmic esective, wheeby one shows that, when the sketches ae constucted aoiately, one can obtain answes that ae aoximately as good as the exact answe fo the inut data at hand, in less time than would be equied to comute an exact answe fo the data at hand. In statistics, howeve, one is often moe inteested in how well a ocedue efoms elative to an hyothesized model than how well it efoms on the aticula data set at hand. Thus an imotant to question to conside is whethe the insights fom the algoithmic esective of sketching cay ove to the statistical setting. Thus, in this ae, we develo a unified aoach that consides both the statistical esective as well as algoithmic esective on ecently-develoed andomized sketching algoithms, and we ovide bounds on two statistical objectives fo seveal tyes of andom ojection and andom samling sketching algoithms.

2 . Oveview of the oblem The oblem we conside in this ae is the odinay least-squaes (LS o OLS) oblem: given as inut a matix X R n of obseved featues o covaiates and a vecto Y R n of obseved esonses, etun as outut a vecto β OLS that solves the following otimization oblem: β OLS = ag min β R Y Xβ. () We will assume that n and ae both vey lage, with n, and fo simlicity we will assume ank(x) =, e.g., to ensue a unique full-dimensional solution. The OLS solution, β OLS = (X T X) X T Y, has a numbe of well-known desiable statistical oeties [4]; and it is also well-known that the unning time o comutational comlexity fo this oblem is O(n ) [0]. Fo many moden alications, howeve, n may be on the ode of and may be on the ode of , and thus comuting the exact LS solution with taditional O(n ) methods can be comutationally challenging. This, couled with the obsevation that aoximate answes often suffice fo downsteam alications, has led to a lage body of wok on develoing fast aoximation algoithms to the LS oblem []. One vey oula aoach to educing comutation is to efom LS on a caefully-constucted sketch of the full data set. That is, athe than comuting a LS estimato fom Poblem () fom the full data (X, Y ), geneate sketched data (SX, SY ) whee S R n, with n, is a sketching matix, and then comute a LS estimato fom the following sketched oblem: β S ag min β R SY SXβ. () Once the sketching oeation has been efomed, the additional comutational comlexity of β S is O( ), i.e., simly call a taditional LS solve on the sketched oblem. Thus, when using a sketching algoithm, two citeia ae imotant: fist, ensue the accuacy of the sketched LS estimato is comaable to, e.g., not much wose, than the efomance of the oiginal LS estimato; and second, ensue that comuting and alying the sketching matix S is not too comutationally intensive, e.g., that is faste than solving the oiginal oblem exactly.. Pio esults Random samling and andom ojections ovide two aoaches to constuct sketching matices S that satisfy both of these citeia and that have eceived attention ecently in the comute science community. Vey loosely seaking, a andom ojection matix S is a dense matix, whee each enty is a meanzeo bounded-vaiance Gaussian o Rademache andom vaiable, although othe constuctions based on andomized Hadamad tansfomations ae also of inteest; and a andom samling matix S is a vey sase matix that has exactly non-zeo enty (which tyically equals one multilied by a escaling facto) in each ow, whee that one non-zeo can be chosen unifomly, non-unifomly based on hyotheses about the data, o non-unifomly based on emiical statistics of the data such as the leveage scoes of the matix X. In aticula, note that a sketch constucted fom an n andom ojection matix S consists of linea combinations of most o all of the ows of (X, Y ), and a sketch constucted fom a andom samling matix S consists of tyically-escaled ows of (X, Y ). Random ojection algoithms have eceived a geat deal That is, O(n ) time suffices to comute the LS solution fom Poblem () fo abitay o wost-case inut, with, e.g., the Cholesky Decomosition on the nomal equations, with a QR decomosition, o with the Singula Value Decomosition [0]. The eade should, howeve, be awae of ecently-develoed inut-sasity time andom ojection methods [5, 3, 6].

3 of attention moe geneally, lagely due to thei connections with the Johnson-Lindenstauss lemma [5] and its extensions; and andom samling algoithms have eceived a geat deal of attention, lagely due to thei alications in lage-scale data analysis alications []. A detailed oveview of andom ojection and andom samling algoithms fo matix oblems may be found in the ecent monogah of []. Hee, we biefly summaize the most elevant asects of the theoy. In tems of unning time guaantees, the unning time bottleneck fo andom ojection algoithms fo the LS oblem is the alication of the ojection to the inut data, i.e., actually efoming the matixmatix multilication to imlement the ojection and comute the sketch. By using fast Hadamad-based andom ojections, howeve, [8] develoed a andom ojection algoithm that uns on abitay o wostcase inut in o(n ) time. (See [8] fo a ecise statement of the unning time.) As fo andom samling, it is tivial to imlement unifom andom samling, but it is vey easy to show examles of inut data on which unifom samling efoms vey ooly. On the othe hand, [9, 7] have shown that if the andom samling is efomed with esect to nonunifom imotance samling obabilities that deend on the emiical statistical leveage scoes of the inut matix X, i.e., the diagonal enties of the hat matix H = X(X T X) X T, then one obtains a andom samling algoithm that achieves much bette esults fo abitay o wost-case inut. Leveage scoes have a long histoy in obust statistics and exeimental design. In the obust statistics community, samles with high leveage scoes ae tyically flagged as otential outlies (see, e.g., [3, 4,, 3, 4]). In the exeimental design community, samles with high leveage have been shown to imove oveall efficiency, ovided that the undelying statistical model is accuate (see, e.g., [9, 3]). This should be contasted with thei use in theoetical comute science. Fom the algoithmic esective of wost-case analysis, that was adoted by [8] and [7], samles with high leveage tend to contain the most imotant infomation fo subsamling/sketching, and thus it is beneficial fo wost-case analysis to bias the andom samle to include samles with lage statistical leveage scoes o to otate to a andom basis whee the leveage scoes ae aoximately unifomized. The unning-time bottleneck fo this leveage-based andom samling algoithm is the comutation of the leveage scoes of the inut data; and the obvious well-known algoithm fo this involves O(n ) time to efom a QR decomosition to comute an othogonal basis fo X [0]. By using fast Hadamadbased andom ojections, howeve, [7] showed that one can comute aoximate QR decomositions and thus aoximate leveage scoes in o(n ) time, and (based on evious wok [9]) this immediately imlies a leveage-based andom samling algoithm that uns on abitay o wost-case inut in o(n ) time [7]. Reades inteested in the actical efomance of these andomized algoithms should consult BENDENPIK [] o LSRN [4]. In tems of accuacy guaantees, both [8] and [7] ove that thei esective andom ojection and leveage-based andom samling LS sketching algoithms each achieve the following wost-case (WC) eo guaantee: fo any abitay (X, Y ), Y Xβ S ( + κ) Y Xβ OLS, (3) with high obability fo some e-secified eo aamete κ (0, ). 3 This +κ elative-eo guaantee 4 is extemely stong, and it is alicable to abitay o wost-case inut. That is, wheeas in statistics one tyically assumes a model, e.g., a standad linea model on Y, Y = Xβ + ɛ, (4) 3 The quantity β S β OLS is also bounded by [8] and [7]. 4 The nonstandad aamete κ is used hee fo the eo aamete since ɛ is used below to efe to the noise o eo ocess. 3

4 whee β R is the tue aamete and ɛ R n is a standadized noise vecto, with E[ɛ] = 0 and E[ɛɛ T ] = I n n, in [8] and [7] no statistical model is assumed on X and Y, and thus the unning time and quality-ofaoximation bounds aly to any abitay (X, Y ) inut data..3 Ou aoach and main esults In this ae, we adot a statistical esective on these andomized sketching algoithms, and we addess the following fundamental questions. Fist, unde a standad linea model, e.g., as given in Eqn. (4), what oeties of a sketching matix S ae sufficient to ensue low statistical eo, e.g., mean-squaed, eo? Second, how do existing andom ojection algoithms and leveage-based andom samling algoithms efom by this statistical measue? Thid, how does this elate to the oeties of a sketching matix S that ae sufficient to ensue low wost-case eo, e.g., of the fom of Eqn. (3), as has been established eviously in [8, 7, ]? We addess these elated questions in a numbe of stes. In Section, we will esent a famewok fo evaluating the algoithmic and statistical oeties of andomized sketching methods in a unified manne; and we will show that oviding wost-case eo bounds of the fom of Eqn. (3) and oviding bounds on two elated statistical objectives boil down to contolling diffeent stuctual oeties of how the sketching matix S inteacts with the left singula subsace of the design matix. In aticula, we will conside the oblique ojection matix, Π U S = U(SU) S, whee ( ) denotes the Mooe-Penose seudo-invese of a matix and U is the left singula matix of X. This famewok will allow us to daw a comaison between the wost-case eo and two elated statistical efficiency citeia, the statistical ediction efficiency (PE) (which is based on the ediction eo E[ X( β β) ] and which is given in Eqn. (7) below) and the statistical esidual efficiency (RE) (which is based on esidual eo E[ Y X β ] and which is given in Eqn. (8) below); and it will allow us to ovide sufficient conditions that any sketching matix S must satisfy in ode to achieve efomance guaantees fo these two statistical objectives. In Section 3, we will esent ou main theoetical esults, which consist of bounds fo these two statistical quantities fo vaiants of andom samling and andom ojection sketching algoithms. In aticula, we ovide ue bounds on the PE and RE (as well as the wost-case WC) fo fou sketching schemes: () an aoximate leveage-based andom samling algoithm, as is analyzed by [7]; () a vaiant of leveagebased andom samling, whee the andom samles ae not e-scaled io to thei inclusion in the sketch, as is consideed by [9, 0]; (3) a vanilla andom ojection algoithm, whee S is a andom matix containing i.i.d. Gaussian o Rademache andom vaiables, as is oula in statistics and scientific comuting; and (4) a andom ojection algoithm, whee S is a andom Hadamad-based andom ojection, as analyzed in []. Fo sketching schemes (), (3), and (4), ou ue bounds fo each of the two measues of statistical efficiency ae identical u to constants; and they show that the RE scales as +, while the PE scales as n. In aticula, this means that it is ossible to obtain good bounds fo the RE when n (in a manne simila to the samling comlexity of the WC bounds); but in ode to obtain even nea-constant bounds fo PE, must be at least of constant ode comaed to n. We then esent a lowe bound develoed in subsequent wok by [7] which shows that unde geneal conditions on S, ou ue bound of n fo PE can not be imoved. Fo the sketching scheme (), we show, on the othe hand, that unde the stong assumtion that thee ae k lage leveage scoes and the emaining n k ae small, then the WC scales as +, the RE scales as + k n, and the PE scales as k. That is, shae bounds ae ossible fo leveage-scoe samling without e-scaling in the statistical setting, but much stonge assumtions ae needed on the inut data. In Section 4, we will sulement ou theoetical esults by esenting ou main emiical esults, which consist of an evaluation of the comlementay oeties of andom samling vesus andom ojection 4

5 methods. Ou emiical esults suot ou theoetical esults, and they also show that fo lage than but much close to than n, ojection-based methods tend to out-efom samling-based methods, while fo significantly lage than, ou leveage-based samling methods efom slightly bette. In Section 5, we will ovide a bief discussion and conclusion and we ovide oofs of ou main esults in the Aendix..4 Additional elated wok Vey ecently [9] consideed statistical asects of leveage-based samling algoithms (called algoithmic leveaging in [9]). Assuming a standad linea model on Y of the fom of Eqn. (4), the authos develoed fist-ode Taylo aoximations to the statistical elative efficiency of diffeent estimatos comuted with leveage-based samling algoithms, and they veified the quality of those aoximations with comutations on eal and synthetic data. Taken as a whole, thei esults suggest that, if one is inteested in the statistical efomance of these andomized sketching algoithms, then thee ae nontivial tade-offs that ae not taken into account by standad wost-case analysis. Thei aoach, howeve, does not immediately aly to andom ojections o othe moe geneal sketching matices. Futhe, the ealm of alicability of the fist-ode Taylo aoximation was not ecisely quantified, and they left oen the question of stuctual chaacteizations of andom sketching matices that wee sufficient to ensue good statistical oeties on the sketched data. We addess these issues in this ae. Afte the aeaance of the oiginal technical eot vesion of this ae [8], we wee made awae of subsequent wok by [7], who also conside a statistical esective on sketching. Amongst othe esults, they develo a lowe bound which confims that using a single andomized sketching matix S can not achieve a PE bette than n. This lowe bound comlements ou ue bounds develoed in this ae. Thei main focus is to use this insight to develo an iteative sketching scheme which yields bounds on the PE when an n sketch is alied eeatedly. Geneal famewok and stuctual esults In this section, we develo a famewok that allows us to view the algoithmic and statistical esectives on LS oblems fom a common esective. We then use this famewok to show that existing wost-case bounds as well as ou novel statistical bounds fo the mean-squaed eos can be exessed in tems of diffeent stuctual conditions on how the sketching matix S inteacts with the data (X, Y ).. A statistical-algoithmic famewok Recall that we ae given as inut a data set, (X, Y ) R n R n, and the objective function of inteest is the standad LS objective, as given in Eqn. (). Since we ae assuming, without loss of geneality, that ank(x) =, we have that β OLS = X Y = (X T X) X T Y, (5) whee ( ) denotes the Mooe-Penose seudo-invese of a matix, and whee the second equality follows since ank(x) = To esent ou famewok and objectives, let S R n denote an abitay sketching matix. That is, although we will be most inteested in sketches constucted fom andom samling o andom ojection oeations, fo now we let S be any n matix. Then, we ae inteested in analyzing the efomance of objectives chaacteizing the quality of a sketched LS objective, as given in Eqn (), whee again we ae 5

6 inteested in solutions of the fom β S = (SX) SY. (6) (We emhasize that this does not in geneal equal ((SX) T SX) (SX) T SY, since the invese will not exist if the sketching ocess does not eseve ank.) Ou goal hee is to comae the efomance of β S to β OLS. We will do so by consideing thee elated efomance citeia, two of a statistical flavo, and one of a moe algoithmic o wost-case flavo. Fom a statistical esective, it is common to assume a standad linea model on Y, Y = Xβ + ɛ, whee we emind the eade that β R is the tue aamete and ɛ R n is a standadized noise vecto, with E[ɛ] = 0 and E[ɛɛ T ] = I n n. Fom this statistical esective, we will conside the following two citeia. The fist statistical citeion we conside is the ediction efficiency (PE), defined as follows: whee the exectation E[ ] is taken ove the andom noise ɛ. C P E (S) = E[ X(β β S) ] E[ X(β β OLS ) (7) ], The second statistical citeion we conside is the esidual efficiency (RE), defined as follows: whee, again, the exectation E[ ] is taken ove the andom noise ɛ. C RE (S) = E[ Y Xβ S ] E[ Y Xβ OLS (8) ], Recall that the standad elative statistical efficiency fo two estimatos β and β is defined as eff(β, β ) = Va(β ), whee Va( ) denotes the vaiance of the estimato (see e.g., [6]). Fo the PE, we have elaced Va(β ) the vaiance of each estimato by the mean-squaed ediction eo. Fo the RE, we use the tem esidual since fo any estimato β, Y X β ae the esiduals fo estimating Y. Fom an algoithmic esective, thee is no noise ocess ɛ. Instead, X and Y ae abitay, and β is simly comuted fom Eqn (5). To daw a aallel with the usual statistical geneative ocess, howeve, and to undestand bette the elationshi between vaious objectives, conside defining Y in tems of X by the following linea model : Y = Xβ + ɛ, whee β R and ɛ R n. Imotantly, β and ɛ hee eesent diffeent quantities than in the usual statistical setting. Rathe than ɛ eesenting a noise ocess and β eesenting a tue aamete that is obseved though a noisy Y, hee in the algoithmic setting, we will take advantage of the ank-nullity theoem in linea algeba to elate X and Y. 5 To define a wost case model Y = Xβ + ɛ fo the algoithmic setting, one can view the noise ocess ɛ to consist of any vecto that lies in the null-sace of X T. Then, since the choice of β R is abitay, one can constuct any abitay o wost-case inut data Y. Fom this algoithmic case, we will conside the following citeion. 5 The ank-nullity theoem assets that given any matix X R n and vecto Y R n, thee exists a unique decomosition Y = Xβ + ɛ, whee β is the ojection of Y on to the ange sace of X T and ɛ = Y Xβ lies in the null-sace of X T [5]. 6

7 The algoithmic citeion we conside is the wost-case (WC) eo, defined as follows: C W C (S) = su Y Y Xβ S Y Xβ OLS. (9) This citeion is wost-case since we take a suemum Y, and it is the efomance citeion that is analyzed in [8] and [7], as bounded in Eqn. (3). Witing Y as Xβ + ɛ, whee X T ɛ = 0, the WC eo can be e-exessed as: C W C (S) = su Y =Xβ+ɛ, X T ɛ=0 Y Xβ S Y Xβ OLS. Hence, in the wost-case algoithmic setu, we take a suemum ove ɛ, whee X T ɛ = 0, wheeas in the statistical setu, we take an exectation ove ɛ whee E[ɛ] = 0. Befoe oceeding, seveal othe comments about this algoithmic-statistical famewok and ou objectives ae woth mentioning. Fom the esective of ou two linea models, we have that β OLS = β + (X T X) X T ɛ. In the statistical setting, since E[ɛɛ T ] = I n n, it follows that β OLS is a andom vaiable with E[β OLS ] = β and E[(β β OLS )(β β OLS ) T ] = (X T X). In the algoithmic setting, on the othe hand, since X T ɛ = 0, it follows that β OLS = β. C RE (S) is a statistical analogue of the wost-case algoithmic objective C W C (S), since both conside Y Xβ the atio of the metics S. The diffeence is that a su ove Y in the algoithmic setting is Y Xβ OLS elaced by an exectation ove noise ɛ in the statistical setting. A natual question is whethe thee is an algoithmic analogue of C P E (S). Such a efomance metic would be: su Y X(β β S ) X(β β OLS ), (0) whee β is the ojection of Y on to the ange sace of X T. Howeve, since β OLS = β+(x T X) X T ɛ and since X T ɛ = 0, β OLS = β in the algoithmic setting, the denominato of Eqn. (0) equals zeo, and thus the objective in Eqn. (0) is not well-defined. The difficulty of comuting o aoximating this objective aallels ou esults below that show that aoximating C P E (S) is much moe challenging (in tems of the numbe of samles needed) than aoximating C RE (S). In the algoithmic setting, the sketching matix S and the objective C W C (S) can deend on X and Y in any abitay way, but in the following we conside only sketching matices that ae eithe indeendent of both X and Y o deend only on X (e.g., via the statistical leveage scoes of X). In the statistical setting, S is allowed to deend on X, but not on Y, as any deendence of S on Y might intoduce coelation between the sketching matix and the noise vaiable ɛ. Removing this estiction is of inteest, esecially since one can obtain WC bounds of the fom Eqn. (3) by constucting S by andomly samling accoding to an imotance samling distibution that deends on the influence scoes essentially the leveage scoes of the matix X augmented with Y as an additional column of the (X, Y ) ai. Both C P E (S) and C RE (S) ae qualitatively elated to quantities analyzed by [9, 0]. In addition, C W C (S) is qualitatively simila to Cov( β Y ) in [9, 0], since in the algoithmic setting Y is teated 7

8 as fixed; and C RE (S) is qualitatively simila to Cov( β) in [9, 0], since in the statistical setting Y is teated as andom and coming fom a linea model. That being said, the metics and esults we esent in this ae ae not diectly comaable to those of [9, 0] since, e.g., they had a slightly diffeent setu than we have hee, and since they used a fist-ode Taylo aoximation while we do not.. Stuctual esults on sketching matices We ae now eady to develo stuctual conditions chaacteizing how the sketching matix S inteacts with the data matix X that will allow us to ovide ue bounds fo the quantities C W C (S), C P E (S), and C RE (S). To do so, ecall that given the data matix X, we can exess the singula value decomosition of X as X = UΣV T, whee U R n is an othogonal matix, i.e., U T U = I. In addition, we can define the oblique ojection matix Π U S := U(SU) S. () Note that if ank(sx) =, then Π U S can be exessed as ΠU S = U(U T S T SU) U T S T S, since U T S T SU is invetible. Imotantly howeve, deending on the oeties of X and how S is constucted, it can easily haen that ank(sx) <, even if ank(x) =. Given this setu, we can now state the following lemma, the oof of which may be found in Section A.. This lemma chaacteizes how C W C (S), C P E (S), and C RE (S) deend on diffeent stuctual oeties of Π U S and SU. Lemma. Fo the algoithmic setting, Fo the statistical setting, C W C (S) = + su δ R,U T ɛ=0 [ (I (SU) (SU))δ ɛ ] + ΠU S ɛ ɛ. C P E (S) = (I (SU) SU)ΣV T β + ΠU S F, and C RE (S) = + (I (SU) SU)ΣV T β n Seveal oints ae woth making about Lemma. + ΠU S F n = + C SP E(S). n/ Fo all 3 citeia, the tem which involves (SU) SU is a bias tem that is non-zeo in the case that ank(su) <. Fo C P E (S) and C RE (S), the tem coesonds exactly to the statistical bias; and if ank(su) =, meaning that S is a ank-eseving sketching matix, then the bias tem equals 0, since (SU) SU = I. In actice, if is chosen smalle than o lage than but vey close to, it may haen that ank(su) <, in which case this bias is incued. The final equality C RE (S) = + C P E(S) n/ shows that in geneal it is much moe difficult (in tems of the numbe of samles needed) to obtain bounds on C P E (S) than C RE (S) since C RE (S) e-scales C P E (S) by /n, which is much less than. This will be eflected in the main esults below, whee the scaling of C RE (S) will be a facto of /n smalle than C P E (S). In geneal, it is significantly moe difficult to bound C P E (S), since X(β β OLS ) is, wheeas Y Xβ OLS is n, and so thee is much less magin fo eo in aoximating C P E (S). 8

9 Π In the algoithmic o wost-case setting, su U ɛ R n /{0},Π U S ɛ ɛ=0 is the elevant quantity, wheeas ɛ in the statistical setting Π U S F is the elevant quantity. The Fobenius nom entes in the statistical setting because we ae taking an aveage ove homoscedastic noise, and so the l nom of the eigenvalues of Π U S need to be contolled. On the othe hand, in the algoithmic o wost-case setting, the wost diection in the null-sace of U T needs to be contolled, and thus the sectal nom entes. 3 Main theoetical esults In this section, we ovide ue bounds fo C W C (S), C P E (S), and C RE (S), whee S coesond to andom samling and andom ojection matices. In aticula, we ovide ue bounds fo 4 sketching matices: () a vanilla leveage-based andom samling algoithm fom [7]; () a vaiant of leveage-based andom samling, whee the andom samles ae not e-scaled io to thei inclusion in the sketch; (3) a vanilla andom ojection algoithm, whee S is a andom matix containing i.i.d. sub-gaussian andom vaiables; and (4) a andom ojection algoithm, whee S is a andom Hadamad-based andom ojection, as analyzed in []. 3. Random samling methods Hee, we conside andom samling algoithms. To do so, fist define a andom samling matix S R n as follows: S ij {0, } fo all (i, j) and n S j= ij =, whee each ow has an indeendent multinomial distibution with obabilities ( i ) n. The matix of coss-leveage scoes is defined as L = UUT R n n, and l i = L ii denotes the leveage scoe coesonding to the i th samle. Note that the leveage scoes satisfy n l i = tace(l) = and 0 l i. The samling obability distibution we conside ( i ) n is of the fom i = ( θ) l i + θq i, whee {q i } n satisfies 0 q i and n q i = is an abitay obability distibution, and 0 θ <. In othe wods, it is a convex combination of a leveage-based distibution and anothe abitay distibution. Note that fo θ = 0, the obabilities ae ootional to the leveage scoes, wheeas fo θ =, the obabilities follow {q i } n. We conside two samling matices, one whee the andom samling matix is e-scaled, as in [8], and one in which no e-scaling takes lace. In aticula, let S NR = S denote the andom samling matix (whee the subscit N R denotes the fact that no e-scaling takes lace). The e-scaled samling matix is S R R n = SW, whee W R n n is a diagonal e-scaling matix, whee [W ] jj = j and W ji = 0 fo j i. The quantity j is the e-scaling facto. In this case, we have the following esult, the oof of which may be found in Section B.. Theoem. Fo S = S R, with C ( θ) log ( C ( θ)), then with obability at least 0.7, it holds that ank(s R U) = and that: C W C (S R ) + C P E (S R ) 44 n C RE (S R ) Seveal things ae woth noting about this esult. Fist, note that both C W C (S R ) and C RE (S R ) scale as ; thus, it is ossible to obtain high-quality efomance guaantees fo odinay least squaes, as 9

10 long as 0, e.g., if is only slightly lage than. On the othe hand, C P E(S R ) scales as n, meaning needs to be close to n to ovide simila efomance guaantees. Next, note that all of the ue bounds aly to any data matix X, without assuming any additional stuctue on X. Finally, note that when θ =, which coesonds to samling the ows based on {q i } n, all the ue bounds ae. Ou simulations also eveal that unifom samling geneally efoms moe ooly than leveage-scoe based aoaches unde the linea models we conside. An imotant actical oint is the following: the distibution {q i } n does not ente the esults. This allows us to conside diffeent distibutions. An obvious choice is unifom, i.e., q i = n (see e.g., [9, 0]). Anothe imotant examle is that of aoximate leveage-scoe samling, as develoed in [7]. (The unning time of the main algoithm of [7] is o(n ), and thus this educes comutation comaed with the use of exact leveage scoes, which take O(n ) time to comute). Let ( l i ) n denote the aoximate leveage scoes develoed by the ocedue in [7]. Based on Theoem in [7], l i l i ɛ whee 0 < ɛ < fo aoiately chosen. Now, using i = l i, i can be e-exessed as i = ( ɛ) l i + ɛq i whee (q i ) n is a distibution (unknown since we only have a bound on the aoximate leveage scoes). Hence, the efomance bounds achieved by aoximate leveaging ae analogous to those achieved by adding ɛ multilied by a unifom o othe abitay distibution. Next, we conside the leveage-scoe estimato without e-scaling S NR. In ode to develo nontivial bounds on C W C (S NR ), C P E (S NR ), and C RE (S NR ), we need to make a stong assumtion on the leveagescoe distibution on X. To do so, we define the following. Definition (k-heavy hitte leveage distibution). A sequence of leveage scoes (l i ) n is a k-heavy hitte leveage scoe distibution if thee exist constants c, C > 0 such that fo i k, c k l i C k and fo the emaining n k leveage scoes, i=k+ l i 3 4. The inteetation of a k-heavy hitte leveage distibution is one in which only k samles in X contain the majoity of the leveage scoe mass. In the simulations below, we ovide examles of synthetic matices X whee the majoity of the mass is in the lagest leveage scoes. The aamete k acts as a measue of non-unifomity, in that the smalle the k, the moe non-unifom ae the leveage scoes. The k-heavy hitte leveage distibution allows us to model highly non-unifom leveage scoes. In this case, we have the following esult, the oof of which may be found in Section B.. Theoem. Fo S = S NR, with θ = 0 and assuming a k-heavy hitte leveage distibution and c log ( c ), then with obability at least 0.6, it holds that ank(s NR ) = and that: C W C (S NR ) + 44C c C P E (S NR ) 44C4 c k C RE (S NR ) + 44C4 c Notice that when k n, bounds in Theoem on C P E (S NR ) and C RE (S NR ) ae significantly shae than bounds in Theoem on C P E (S R ) and C RE (S R ). Hence not e-scaling has the otential to ovide shae bound in the statistical setting. Howeve a stonge assumtion on X is needed fo this esult. 3. Random ojection methods Hee, we conside two andom ojection algoithms, one based on a sub-gaussian ojection matix and the othe based on a Hadamad ojection matix. To do so, define [S SGP ] ij = X ij, whee (X ij ) i, j n k n. 0

11 ae i.i.d. sub-gaussian andom vaiables with E[X ij ] = 0, vaiance E[X ij ] = σ and sub-gaussian aamate. In this case, we have the following esult, the oof of which may be found in Section B.3. Theoem 3. Fo any matix X, thee exists a constant c such that if c log n, then with obability geate than 0.7, it holds that ank(s SGP ) = and that: C W C (S SGP ) + C P E (S SGP ) 44( + n ) C RE (S SGP ) Notice that the bounds in Theoem 3 fo S SGP ae equivalent to the bounds in Theoem fo S R, excet that is equied only to be lage than O(log n) athe than O( log ). Hence fo smalle values of, andom sub-gaussian ojections ae moe stable than leveage-scoe samling based aoaches. This eflects the fact that to a fist-ode aoximation, leveage-scoe samling efoms as well as efoming a smooth ojection. Next, we conside the andomized Hadamad ojection matix. In aticula, S Had = S Unif HD, whee H R n n is the standad Hadamad matix (see e.g., []), S Unif R n is an n unifom samling matix, and D R n n is a diagonal matix with andom equiobable ± enties. In this case, we have the following esult, the oof of which may be found in Section B.4. Theoem 4. Fo any matix X, thee exists a constant c such that if c log n(log + log log n), then with obability geate than 0.8, it holds that ank(s Had ) = and that: C W C (S Had ) + 40 log(n) C RE (S Had ) 40 log(n)( + n ) C P E (S Had ) + 40 log(n)( + ). Notice that the bounds in Theoem 4 fo S Had ae equivalent to the bounds in Theoem fo S R, u to a constant and log(n) facto. As discussed in [8], the Hadamad tansfomation makes the leveage scoes of X aoximately unifom (u to a log(n) facto), which is why the efomance is simila to the sub- Gaussian ojection (which also tends to make the leveage scoes of X aoximately unifom). We susect that the additional log(n) facto is an atifact of the analysis since we use an enty-wise concentation bound; using moe sohisticated techniques, we believe that the log(n) can be emoved. 3.3 Lowe Bounds Subsequent to the dissemination of the oiginal vesion of this ae [8], [7] amongst othe esults develo lowe bounds on the numeato in C P E (S). This oves that ou ue bounds on C P E (S) can not be imoved. We e-state Theoem (Examle ) in [7] in a way that makes it most comaable to ou esults. Theoem 5 (Theoem in [7]). Fo any sketching matix satisfying E[S T (SS T ) S] o η n, any estimato based on (SX, SY ) satisfies the lowe bound with obability geate than /: C P E (S) n 8η.

12 Gaussian and Hadamad ojections, as well as e-weighted aoximate leveage-scoe samling, all satisfy the condition E[S T (SS T ) S] o η n. On the othe hand, un-weighted leveage-scoe samling does not satisfy this condition and hence does not satisfy the lowe bound. This is why we ae able to ove a tighte ue bound when the matix X has highly non-unifom leveage scoes. Imotantly, this oves that C P E (S) is a quantity that is moe challenging to contol than C RE (S) and C W C (S) when only a single sketch is used. Using this insight, [7] show that by using a aticula iteative Hessian sketch, C P E (S) can be contolled u to constant. In addition to oviding a lowe bound on the PE using a sketching matix just once, [7] also develo a new iteative sketching scheme whee sketching matices ae used eeatedly can educe the PE significantly. Finally, in io wok, [8, 7] show that the ate + may be achieved fo the PE using the estimato β = ((SX) T (SX)) X T Y. This estimato is elated to the idge egession estimato since sketches o andom ojections ae alied only in the comutation of the X T X matix and not X T Y. Since both X T Y and (SX) T (SX) have small dimension, this estimato has significant comutational benefits. Howeve this estimato does not violate the lowe bound in [7] since it not based on the sketches (SX, SY ) but instead uses (SX, X T Y ). 4 Emiical esults In this section, we esent the esults of an emiical evaluation, illustating the esults of ou theoy. We will comae the following 6 sketching matices. () S = S R - andom leveage-scoe samling with e-scaling. () S = S NR - andom leveage-scoe samling without e-scaling. (3) S = S Unif - andom unifom samling (each samle dawn indeendently with obability /n). (4) S = S Sh - andom leveage-scoe samling with e-scaling and with θ = 0.. (5) S = S GP - Gaussian ojection matices. (6) S = S Had - Hadamad ojections. To comae the methods and see how they efom on inuts with diffeent leveage scoes, we geneate test matices using a method outlined in [9, 0]. Set n = 04 (to ensue, fo simlicity, an intege owe of fo the Hadamad tansfom) and = 50, and let the numbe of samles dawn with elacement,, be vaied. X is then geneated based on a t-distibution with diffeent choices of ν to eflect diffeent unifomity of leveage scoes. Each ow of X is selected indeendently with distibution X i t ν (Σ), whee Σ coesonds to an auto-egessive model with ν the degees of feedom. The 3 values of ν esented hee ae ν = (highly non-unifom), ν = (modeately non-unifom), and ν = 0 (vey unifom). See Figue fo a lot to see how ν influences the unifomity of the leveage scoes. Fo each setting, the simulation is eeated 00 times in ode to aveage ove both the andomness in the samling, and in the statistical setting, the andomness ove y. Note that a natual comaison can be dawn between the aamete ν and the aamete k in the k- heavy hitte definition. If we want to find the value k such that 90% of the leveage mass is catued, fo ν =, k 00, fo ν =, k 700 and fo ν = 0, k 900, accoding to Figue (b). Hence the smalle ν, the smalle k since the leveage-scoes ae moe non-unifom.

13 Leveage scoe ν = 0 ν= ν= Unifom Cumulative leveage mass ν=0 ν= ν= Unifom Index Index Figue : Odeed leveage scoes fo diffeent values of ν (a) and cumulative sum of odeed leveage scoes fo diffeent values of ν (b). We fist comae the sketching methods in the statistical setting by comaing C P E (S). In Figue, we lot the aveage C P E (S) fo the 6 subsamling aoaches outlined above, aveaged ove 00 samles fo lage values of between 300 and 000. In addition, in Figue 3, we include a table fo esults on smalle values of between 80 and 00, to get a sense of the efomance when is close to. Obseve that in the lage setting, S NR is clealy the best aoach, out-efoming S R, esecially fo ν =. Fo small, ojection-based methods wok bette, esecially fo ν =, since they tend to unifomize the leveage scoes. In addition, S Sh is sueio comaed to S NR, when is small, esecially when ν =. We do not lot C RE (S) as it is simly a e-scaled C P E (S) to Lemma S=S R 8 S = S R 8 S = S R 4.5 S = S NR S =S Unif 7 S = S NR S = S Unif 6 4 S = S NR S = S Sh C RPE (S) S = S Sh S = S GP S = S Had C RPE (S) S = S Sh S = S GP S = S Had C RPE (S) 0 8 S = S GP S = S Had (a) ν = 0 (b) ν = (c) ν = Figue : Relative ediction efficiency C P E (S) fo lage. Oveall, S NR, S R, and S Sh comae vey favoably to S Unif, which is consistent with Theoem, since samles with highe leveage scoes tend to educe the mean-squaed eo. Futhemoe, S R (which ecall involves e-scaling) only inceases the mean-squaed eo, which is again consistent with the theoetical esults. The effects ae moe aaent as the leveage scoe distiibution is moe non-unifom (i.e., fo ν = ). 3

14 S R S NR S Unif S Sh S GP S Had (a) ν = 0 S R S NR S Unif S Sh S GP S Had (b) ν = S R S NR S Unif S Sh S GP S Had (c) ν = Figue 3: Relative ediction efficiency C P E (S) fo small. The theoetical ue bound in Theoems - 4 suggests that C P E (S) is of the ode n, indeendent of the leveage scoes of X, fo S = S R as well as S = S Had and S GP. On the othe hand, the simulations suggest that fo highly non-unifom leveage scoes, C P E (S R ) is highe than when the leveage scoes ae unifom, wheeas fo S = S Had and S GP, the non-unifomity of the leveage scoes does not significantly affect the bounds. The eason that S Had and S GP ae not significantly affected by the leveage-scoe distibution is that the Hadamad and Gaussian ojection has the effect of making the leveage scoes of any matix unifom [8]. The eason fo the aaent disaity when S = S R is that the theoetical bounds use Makov s inequality which is a cude concentation bound. We susect that a moe efined analysis involving the bounded diffeence inequality would eflect that non-unifom leveage scoes esult in a lage C P E (S R ) S = S R S = S NR S = S R S = S NR S = S Sh C WCR (S).5. S = S Sh S = S GP S = S Had S = S Unif C WCR (S).5..5 C WCR (S) 3.5 S = S GP S = S Had (a) ν = 0 (b) ν = (c) ν = Figue 4: Wost-case elative eo C W C (S) fo lage. 4

15 S R S NR S Unif S Sh S GP S Had (a) ν = 0 S R S NR S Unif S Sh S GP S Had (b) ν = S R S NR S Unif S Sh S GP S Had (c) ν = Figue 5: Wost-case elative eo C W C (S) fo lage. Finally, Figues 4 and 5 ovide a comaison of the wost-case elative eo C W CE (S) fo lage and small ( > 00 and 00, esectively) values of. Obseve that, in geneal, C W C (S) ae much close to than C P E (S) fo all choices of S. This eflects the scaling of n diffeence between the bounds. Inteestingly, Figues 4 and 5 indicates that S NR still tends to out-efom S R in geneal, howeve the diffeence is not as significant as in the statistical setting. 5 Discussion and conclusion In this ae, we develoed a famewok fo analyzing algoithmic and statistical citeia fo geneal sketching matices S R n alied to the least-squaes objective. As ou analysis makes clea, ou famewok eveals that the algoithmic and statistical citeia deend on diffeent oeties of the oblique ojection matix Π U S = U(SU) U, whee U is the left singula matix fo X. In aticula, the algoithmic citeia Π (WC) deends on the quantity su U U T S ɛ ɛ=0 ɛ, since in that case the data may be abitay and wost-case, wheeas the two statistical citeia (RE and PE) deends on Π U S F, since in that case the data follow a linea model with homogenous noise vaiance. Using ou famewok, we develo ue bounds fo 3 efomance citeia alied to 4 sketching schemes. Ou ue bounds eveal that in the egime whee < n, ou sketching schemes achieve otimal efomance u to constants, in tems of WC and RE. On the othe hand, the PE scales as n meaning needs to be close to (o geate than) n fo good efomance. Subsequent lowe bounds in [7] show that this ue bound can not be imoved, but subsequent wok by [7] as well as [8, 7] ovide altenate moe sohisticated sketching aoaches to deal with these challenges. Ou simulation esults eveal that fo when is vey close to, ojection-based aoaches tend to out-efom samling-based aoaches 5

16 since ojection-based aoaches tend to be moe stable in that egime. Thee ae numeous ways in which the famewok and esults fom this ae can be extended. Fistly, thee is a lage liteatue that esents a numbe of diffeent aoaches to sketching. Since ou famewok ovides geneal conditions to assess the statistical and algoithmic efomance fo sketching matices, a natual and staightfowad extension would be to use ou famewok to comae othe sketching matices. Anothe natual extension is to detemine whethe asects of the famewok can be adated to othe statistical models and oblems of inteest (e.g., genealized linea models, covaiance estimation, PCA, etc.). Finally, anothe imotant diection is to comae the stability and obustness oeties of diffeent sketching matices. Ou cuent analysis assumes a known linea model, and it is unclea how the sketching matices behave unde model mis-secification. Acknowledgement. We would like to thank the Statistical and Alied Mathematical Sciences Institute and the membes of its vaious woking gous fo helful discussions. A Auxiliay Results In this section, we ovide oofs of Lemma and an intemediate esult we will late use to ove the main theoems. A. Poof of Lemma Recall that X = UΣV T, whee U R n, Σ R and V R denote the left singula matix, diagonal singula value matix and ight singula matix esectively. Fist we show that Y Xβ OLS = ɛ. To do so, obseve that Y Xβ OLS = Y UΣV T β OLS, and set δ OLS = ΣV T β OLS. It follows that δ OLS = U T Y. Hence Y Xβ OLS = Y Π U Y, whee Π U = UU T. Fo evey Y R n, thee exists a unique δ R and ɛ R n such that U T ɛ = 0 and Y = Uδ + ɛ. Hence Y Xβ OLS = (I n n Π U )ɛ = ɛ, whee the final equality holds since Π U ɛ = 0. Now we analyze Y Xβ S. Obseve that Y Xβ S = Y Π S UY, whee Π U S = U(SU) S. Since Y = Uδ + ɛ, it follows that Y Xβ S = U(I (SU) SU)δ + (I n n Π S U)ɛ = (I (SU) SU)δ + (I n n Π S U)ɛ = (I (SU) SU)δ + ɛ + Π S Uɛ. 6

17 Theefoe fo all Y : C W C (S) = Y Xβ S Y Xβ OLS = + (I (SU) SU)δ + ΠU S ɛ ɛ, whee U T ɛ = 0. Taking a suemum ove Y and consequently ove ɛ and δ comletes the oof fo C W C (S). Now we tun to the oof fo C P E (S). Fist note that Unde the linea model Y = UΣV T β + ɛ, Since E[ɛɛ T ] = I n n, it follows that Fo β S, we have that E[ X(β OLS β) ] = E[ UU T Y UΣV T β ]. E[ X(β OLS β) ] = E[ Π U ɛ ]. E[ X(β OLS β) ] = E[ Π U ɛ ] = Π U F =. E[ X(β S β) ] = E[ Π U S Y UΣV T β ] = E[ (U(I (SU) SU)ΣV T β + Π U S ɛ ] Hence C P E (S) = /( (I (SU) SU)ΣV T β + ΠU S F ) as stated. Fo C RE (S), the mean-sqaued eo fo δ OLS and δ S ae and Hence, E[ Y Xβ OLS ] = E[ (I Π U )ɛ ] = (I (SU) SU)ΣV T β + E[ Π U S ɛ ] = (I (SU) SU)ΣV T β + Π U S F. = I Π U F = n, E[ Y Xβ S ] = (I (SU) SU)ΣV T β + E[ (I Π U S )ɛ ] = (I (SU) SU)ΣV T β + tace((i Π S ) T (I Π S )) = (I (SU) SU)ΣV T β + tace(i) tace(π S ) + Π S F = (I (SU) SU)ΣV T β + n + Π S F = (I (SU) SU)ΣV T β + n + Π S F. C RE (S) = n + (I (SU) SU)ΣV T β + Π S F n = + (I (SU) SU)ΣV T β + Π S F n = + C P E(S). n/ 7

18 A. Intemediate Result In ode to ovide a convenient way to aameteize ou ue bounds fo C W C (S), C P E (S), and C RE (S), we intoduce the following thee stuctual conditions on S. Let σ min (A) denote the minimum non-zeo singula value of a matix A. The fist condition is that thee exists an α(s) > 0 such that σ min (SU) α(s). () The second condition is that thee exists a β(s) such that U T S T Sɛ su β(s). (3) ɛ, U T ɛ=0 ɛ The thid condition is that thee exists a γ(s) such that U T S T S F γ(s). (4) Note that the stuctual conditions defined by α(s) and β(s) have been defined eviously as Eqn. (8) and Eqn. (9) in [8]. Given these quantities, we can state the following lemma, the oof of which may be found in Section A.. This lemma ovides ue bounds fo C W C (S), C P E (S), and C RE (S) in tems of the aametes α(s), β(s), and γ(s). Lemma. Fo α(s) and β(s), as defined in Eqn. () and (3), C W C (S) + su δ R,U T ɛ=0 (I (SU) (SU))δ ɛ + β (S) α 4 (S). Fo α(s) and γ(s), as defined in Eqn. () and (4), C P E (S) (I (SU) SU)ΣV T β + γ (S) α 4 (S). Futhemoe, C RE (S) + n [ (I (SU) SU)ΣV T β ] + γ (S) α 4. (S) Again, the tems involving (SU) SU ae a bias that equal zeo fo ank-eseving sketching matices. In addition, we emhasize that the esults of Lemma and Lemma hold fo abitay sketching matices S. In Aendix B, we bound α(s), β(s) and γ(s) fo seveal diffeent andomized sketching matices, and this will emit us to obtain bounds on C W C (S), C P E (S), and C RE (S). Fo the sketching matices we analyze, we ove that the bias tem is 0 with high obability. 8

19 A.3 Poof of Lemma Note that Π U S = U(SU) S. Let ank(su) = k <, and the singula value decomosition is SU = Ũ ΣṼ T, whee Σ R k k is a diagonal matix with non-zeo singula values of SU. Then, Π U S ɛ ɛ = U(SU) Sɛ ɛ = (SU) Sɛ ɛ = Ṽ Σ Ũ T Sɛ ɛ = Ṽ Σ Ṽ T Ṽ ΣŨ T Sɛ ɛ, whee we have ignoed the bias tem which emains unchanged. Note that Ṽ Σ Ṽ T α (S)I and Ṽ ΣŨ T = (SU) T = U T S T. Hence, Π U S ɛ ɛ = Ṽ Σ Ṽ T Ṽ ΣŨ T Sɛ ɛ U T S T Sɛ α 4 (S) ɛ β (S) = α 4 (S) ɛ, and the ue bound on C W C (S) follows. Similaly, Π U S F = U(SU) S F = (SU) S F = Ṽ Σ Ṽ T Ṽ ΣŨ T S F α 4 (S) U T S T S F and the ue bound on C P E (S) follows. B Poof of Main Theoems The oof techniques fo all of fou theoems ae simila, in that we use the intemediate esult Lemma and bound the exectations of α(s), β(s), and γ(s) fo each S, then aly Makov s inequality to develo high obability bounds. B. Poof of Theoem Fist we bound α (S R ) by using existing esults in [8]. In aticula alying Theoem 4 in [8] with β = θ, A = U T, ɛ = and δ = 0. ovides the desied lowe bound on α(s R ) and ensues that the bias tem in Lemma is 0 since ank(s R U) =. To ue bound β(s R ), we fist ue bound its exectation, then aly Makov s inequality. Using the esult of Table (second ow) of [6] with β = θ: E[ U T S T RS R ɛ ] ( θ) U T F ɛ = 9 ( θ) ɛ.

20 Alying Makov s inequality, with obability at least 0.9. Finally we bound γ(s R ): U T S T RS R ɛ ( θ) ɛ, [ U T SRS T R F ] = [tace(u T (SRS T R ) U)] = U ij U kj [(S T RS R ) ] ki = = j= k= j= Uij[S RS T R ] ii l i [SRS T R ] ii, whee the second last equality follows since [SR T S R] ki = 0 fo k i and the final equality follows since l i = j= U ij. Fist we ue bound E[γ(S R)] and then aly Makov s inequality. Recall that [S R ] ki = i σ ki whee P(σ ki = +) = i. Then, l i E([S T RS R ] ii]) = = = = l i i m= l= l i i m= l= l i i = + E[σmiσ li ] E[σ mi σ li ] [( ) i + i ] (l i ( ) + l i i )] l i i. Substituting i = ( θ) l i + θq i comletes the ue bound on E[γ(S R )]: + l i i = + + l i ( θ) l i + θq i θ + n ( θ) n + ( θ). 0

21 Using Makov s inequality, P ( γ(s R ) E[γ(S R )] 0E[γ(S R )] ) 0., and consequently γ(s R ) ( + n ( θ) ) with obability geate than 0.9. The final obability of 0.7 aises since we simultaneously equie all thee bounds to hold which hold with obability > 0.7. Alying Lemma in combination with ou high obability bounds fo α(s R ), β(s R ) and γ(s R ) comletes the oof fo Theoem. B. Poof of Theoem k Define S = S NR whee S NR is the samling matix without e-scaling. Recall the k-heavy hitte leveage-scoe assumtion. Since n i=k+ l i 0, n i=k+ i 0 (ecall i = l i ). Hence the obability that a samle only contains the k samles with high leveage scoe is: ( 0 ) 0 = 0.9. Fo the emainde of the oof, we condition on the event A that only the ows with the k lagest leveage scoes ae selected. Let Ũ Rk be the sub-matix of U coesonding to the to k leveage scoes. Let W = E[S T S] R k k. Since c k i C k fo all i k, ci k k W CI k k. Futhemoe since n i=k+ l i 0, 0.9I Ũ T Ũ I. Fist we lowe bound α (S NR ). Alying Theoem 4 in [8] with β = C, A = Ũ T W /, ɛ = c and δ = 0. ensues that as long as c log() fo sufficiently lage c, Ũ T W Ũ Ũ T S T SŨ o with obability at least 0.9. Since Ũ T W Ũ 3c 4, Ũ T S T SŨ c 4. Theefoe with obability at least 0.9, α (S NR ) c 4Ck. k Next we bound β(s NR ). Since S = S NR, if we condition on A, only the leading k leveage scoes ae selected and let Ũ Rk be the sub-matix of U coesonding to the to k leveage scoes. Using the esult of Table (second ow) of [6] with β = : c C, E[ U T S T NRS NR ɛ ] = k E[ U T S T Sɛ ] = k E[ Ũ T S T Sɛ ] k Ũ T ɛ k ɛ. Alying Makov s inequality, U T S T NRS NR ɛ k ɛ, with obability at least 0.9 which comletes the ue bound fo β(s NR ).

A Statistical Perspective on Randomized Sketching for Ordinary Least-Squares

A Statistical Perspective on Randomized Sketching for Ordinary Least-Squares Jounal of Machine Leaning Reseach 7 (206) -3 Submitted 8/5; Published /6 A Statistical Pesective on Randomized Sketching fo Odinay Least-Squaes Gavesh Raskutti Deatment of Statistics Univesity of Wisconsin

More information

Analysis of Arithmetic. Analysis of Arithmetic. Analysis of Arithmetic Round-Off Errors. Analysis of Arithmetic. Analysis of Arithmetic

Analysis of Arithmetic. Analysis of Arithmetic. Analysis of Arithmetic Round-Off Errors. Analysis of Arithmetic. Analysis of Arithmetic In the fixed-oint imlementation of a digital filte only the esult of the multilication oeation is quantied The eesentation of a actical multilie with the quantie at its outut is shown below u v Q ^v The

More information

Online-routing on the butterfly network: probabilistic analysis

Online-routing on the butterfly network: probabilistic analysis Online-outing on the buttefly netwok: obabilistic analysis Andey Gubichev 19.09.008 Contents 1 Intoduction: definitions 1 Aveage case behavio of the geedy algoithm 3.1 Bounds on congestion................................

More information

Numerical solution of the first order linear fuzzy differential equations using He0s variational iteration method

Numerical solution of the first order linear fuzzy differential equations using He0s variational iteration method Malaya Jounal of Matematik, Vol. 6, No. 1, 80-84, 2018 htts://doi.og/16637/mjm0601/0012 Numeical solution of the fist ode linea fuzzy diffeential equations using He0s vaiational iteation method M. Ramachandan1

More information

Kepler s problem gravitational attraction

Kepler s problem gravitational attraction Kele s oblem gavitational attaction Summay of fomulas deived fo two-body motion Let the two masses be m and m. The total mass is M = m + m, the educed mass is µ = m m /(m + m ). The gavitational otential

More information

556: MATHEMATICAL STATISTICS I

556: MATHEMATICAL STATISTICS I 556: MATHEMATICAL STATISTICS I CHAPTER 5: STOCHASTIC CONVERGENCE The following efinitions ae state in tems of scala anom vaiables, but exten natually to vecto anom vaiables efine on the same obability

More information

Lot-sizing for inventory systems with product recovery

Lot-sizing for inventory systems with product recovery Lot-sizing fo inventoy systems with oduct ecovey Ruud Teunte August 29, 2003 Econometic Institute Reot EI2003-28 Abstact We study inventoy systems with oduct ecovey. Recoveed items ae as-good-as-new and

More information

c( 1) c(0) c(1) Note z 1 represents a unit interval delay Figure 85 3 Transmit equalizer functional model

c( 1) c(0) c(1) Note z 1 represents a unit interval delay Figure 85 3 Transmit equalizer functional model Relace 85.8.3.2 with the following: 85.8.3.2 Tansmitted outut wavefom The 40GBASE-CR4 and 100GBASE-CR10 tansmit function includes ogammable equalization to comensate fo the fequency-deendent loss of the

More information

Is the general form of Renyi s entropy a contrast for source separation?

Is the general form of Renyi s entropy a contrast for source separation? Is the geneal fom of Renyi s entoy a contast fo souce seaation? Fédéic Vins 1, Dinh-Tuan Pham 2, and Michel Veleysen 1 1 Machine Leaning Gou Univesité catholique de Louvain Louvain-la-Neuve, Belgium {vins,veleysen}@dice.ucl.ac.be

More information

H5 Gas meter calibration

H5 Gas meter calibration H5 Gas mete calibation Calibation: detemination of the elation between the hysical aamete to be detemined and the signal of a measuement device. Duing the calibation ocess the measuement equiment is comaed

More information

Analysis of Finite Word-Length Effects

Analysis of Finite Word-Length Effects T-6.46 Digital Signal Pocessing and Filteing 8.9.4 Intoduction Analysis of Finite Wod-Length Effects Finite wodlength effects ae caused by: Quantization of the filte coefficients ounding / tuncation of

More information

CMSC 425: Lecture 5 More on Geometry and Geometric Programming

CMSC 425: Lecture 5 More on Geometry and Geometric Programming CMSC 425: Lectue 5 Moe on Geomety and Geometic Pogamming Moe Geometic Pogamming: In this lectue we continue the discussion of basic geometic ogamming fom the eious lectue. We will discuss coodinate systems

More information

Political Science 552

Political Science 552 Political Science 55 Facto and Pincial Comonents Path : Wight s Rules 4 v 4 4 4u R u R v 4. Path may ass though any vaiable only once on a single tavese. Path may go backwads, but not afte going fowad.

More information

6 Matrix Concentration Bounds

6 Matrix Concentration Bounds 6 Matix Concentation Bounds Concentation bounds ae inequalities that bound pobabilities of deviations by a andom vaiable fom some value, often its mean. Infomally, they show the pobability that a andom

More information

Stanford University CS259Q: Quantum Computing Handout 8 Luca Trevisan October 18, 2012

Stanford University CS259Q: Quantum Computing Handout 8 Luca Trevisan October 18, 2012 Stanfod Univesity CS59Q: Quantum Computing Handout 8 Luca Tevisan Octobe 8, 0 Lectue 8 In which we use the quantum Fouie tansfom to solve the peiod-finding poblem. The Peiod Finding Poblem Let f : {0,...,

More information

Approximating the minimum independent dominating set in perturbed graphs

Approximating the minimum independent dominating set in perturbed graphs Aoximating the minimum indeendent dominating set in etubed gahs Weitian Tong, Randy Goebel, Guohui Lin, Novembe 3, 013 Abstact We investigate the minimum indeendent dominating set in etubed gahs gg, )

More information

The Path Effect in Ground-Motion Variability: An Application of the Variance-Components Technique

The Path Effect in Ground-Motion Variability: An Application of the Variance-Components Technique Bulletin of the Seismological Society of Ameica, Vol. 96, No. 3,. 1170 1176, June 006, doi: 10.1785/010050155 The Path Effect in Gound-Motion Vaiability: An Alication of the Vaiance-Comonents Technique

More information

Dorin Andrica Faculty of Mathematics and Computer Science, Babeş-Bolyai University, Cluj-Napoca, Romania

Dorin Andrica Faculty of Mathematics and Computer Science, Babeş-Bolyai University, Cluj-Napoca, Romania #A INTEGERS 5A (05) THE SIGNUM EQUATION FOR ERDŐS-SURÁNYI SEQUENCES Doin Andica Faculty of Mathematics and Comute Science, Babeş-Bolyai Univesity, Cluj-Naoca, Romania dandica@math.ubbcluj.o Eugen J. Ionascu

More information

EM Boundary Value Problems

EM Boundary Value Problems EM Bounday Value Poblems 10/ 9 11/ By Ilekta chistidi & Lee, Seung-Hyun A. Geneal Desciption : Maxwell Equations & Loentz Foce We want to find the equations of motion of chaged paticles. The way to do

More information

New problems in universal algebraic geometry illustrated by boolean equations

New problems in universal algebraic geometry illustrated by boolean equations New poblems in univesal algebaic geomety illustated by boolean equations axiv:1611.00152v2 [math.ra] 25 Nov 2016 Atem N. Shevlyakov Novembe 28, 2016 Abstact We discuss new poblems in univesal algebaic

More information

Errata for Edition 1 of Coding the Matrix, January 13, 2017

Errata for Edition 1 of Coding the Matrix, January 13, 2017 Eata fo Edition of Coding the Matix, Januay 3, 07 You coy might not contain some of these eos. Most do not occu in the coies cuently being sold as Ail 05. Section 0.3:... the inut is a e-image of the inut...

More information

Probabilistic number theory : A report on work done. What is the probability that a randomly chosen integer has no square factors?

Probabilistic number theory : A report on work done. What is the probability that a randomly chosen integer has no square factors? Pobabilistic numbe theoy : A eot on wo done What is the obability that a andomly chosen intege has no squae factos? We can constuct an initial fomula to give us this value as follows: If a numbe is to

More information

Pearson s Chi-Square Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted Histograms

Pearson s Chi-Square Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted Histograms Peason s Chi-Squae Test Modifications fo Compaison of Unweighted and Weighted Histogams and Two Weighted Histogams Univesity of Akueyi, Bogi, v/noduslód, IS-6 Akueyi, Iceland E-mail: nikolai@unak.is Two

More information

C/CS/Phys C191 Shor s order (period) finding algorithm and factoring 11/12/14 Fall 2014 Lecture 22

C/CS/Phys C191 Shor s order (period) finding algorithm and factoring 11/12/14 Fall 2014 Lecture 22 C/CS/Phys C9 Sho s ode (peiod) finding algoithm and factoing /2/4 Fall 204 Lectue 22 With a fast algoithm fo the uantum Fouie Tansfom in hand, it is clea that many useful applications should be possible.

More information

Model Averaging by Stacking

Model Averaging by Stacking Oen Jounal of Statistics 05 5 797-807 ublished Online Decembe 05 in SciRes htt://wwwsciog/ounal/os htt://ddoiog/0436/os0557079 Model Aveaging by Stacking Claudio Moana Deatment of Economics Management

More information

Q. Obtain the Hamiltonian for a one electron atom in the presence of an external magnetic field.

Q. Obtain the Hamiltonian for a one electron atom in the presence of an external magnetic field. Syed Ashad Hussain Lectue Deatment of Physics Tiua Univesity www.sahussaintu.wodess.com Q. Obtain the Hamiltonian fo a one electon atom in the esence of an extenal magnetic field. To have an idea about

More information

An Estimate of Incomplete Mixed Character Sums 1 2. Mei-Chu Chang 3. Dedicated to Endre Szemerédi for his 70th birthday.

An Estimate of Incomplete Mixed Character Sums 1 2. Mei-Chu Chang 3. Dedicated to Endre Szemerédi for his 70th birthday. An Estimate of Incomlete Mixed Chaacte Sums 2 Mei-Chu Chang 3 Dedicated to Ende Szemeédi fo his 70th bithday. 4 In this note we conside incomlete mixed chaacte sums ove a finite field F n of the fom x

More information

Problem Set #10 Math 471 Real Analysis Assignment: Chapter 8 #2, 3, 6, 8

Problem Set #10 Math 471 Real Analysis Assignment: Chapter 8 #2, 3, 6, 8 Poblem Set #0 Math 47 Real Analysis Assignment: Chate 8 #2, 3, 6, 8 Clayton J. Lungstum Decembe, 202 xecise 8.2 Pove the convese of Hölde s inequality fo = and =. Show also that fo eal-valued f / L ),

More information

Psychometric Methods: Theory into Practice Larry R. Price

Psychometric Methods: Theory into Practice Larry R. Price ERRATA Psychometic Methods: Theoy into Pactice Lay R. Pice Eos wee made in Equations 3.5a and 3.5b, Figue 3., equations and text on pages 76 80, and Table 9.1. Vesions of the elevant pages that include

More information

Cross section dependence on ski pole sti ness

Cross section dependence on ski pole sti ness Coss section deendence on ski ole sti ness Johan Bystöm and Leonid Kuzmin Abstact Ski equiment oduce SWIX has ecently esented a new ai of ski oles, called SWIX Tiac, which di es fom conventional (ound)

More information

Conventional Interrater Reliability definitions, formulae, and worked examples in SPSS and STATISTICA

Conventional Interrater Reliability definitions, formulae, and worked examples in SPSS and STATISTICA Conventional Inteate Reliability definitions, fomulae, and woked examles in SPSS and STATISTICA Mach, 001 htt://www.baett.net/techaes/i_conventional.df f of 8 Assessing the Reliability of Rating Data Ratings

More information

QIP Course 10: Quantum Factorization Algorithm (Part 3)

QIP Course 10: Quantum Factorization Algorithm (Part 3) QIP Couse 10: Quantum Factoization Algoithm (Pat 3 Ryutaoh Matsumoto Nagoya Univesity, Japan Send you comments to yutaoh.matsumoto@nagoya-u.jp Septembe 2018 @ Tokyo Tech. Matsumoto (Nagoya U. QIP Couse

More information

Lecture 8 - Gauss s Law

Lecture 8 - Gauss s Law Lectue 8 - Gauss s Law A Puzzle... Example Calculate the potential enegy, pe ion, fo an infinite 1D ionic cystal with sepaation a; that is, a ow of equally spaced chages of magnitude e and altenating sign.

More information

Sequentially Deciding Between Two Experiments for Estimating a Common Success Probability

Sequentially Deciding Between Two Experiments for Estimating a Common Success Probability In Jounal Ameican Statistical Association 93 (998),. 502 5. Sequentially Deciding Between Two xeiments fo stimating a ommon Success Pobability Janis Hadwick Univesity of Michigan onnie Page Michigan State

More information

CSCE 478/878 Lecture 4: Experimental Design and Analysis. Stephen Scott. 3 Building a tree on the training set Introduction. Outline.

CSCE 478/878 Lecture 4: Experimental Design and Analysis. Stephen Scott. 3 Building a tree on the training set Introduction. Outline. In Homewok, you ae (supposedly) Choosing a data set 2 Extacting a test set of size > 3 3 Building a tee on the taining set 4 Testing on the test set 5 Repoting the accuacy (Adapted fom Ethem Alpaydin and

More information

ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION. 1. Introduction. 1 r r. r k for every set E A, E \ {0},

ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION. 1. Introduction. 1 r r. r k for every set E A, E \ {0}, ON INDEPENDENT SETS IN PURELY ATOMIC PROBABILITY SPACES WITH GEOMETRIC DISTRIBUTION E. J. IONASCU and A. A. STANCU Abstact. We ae inteested in constucting concete independent events in puely atomic pobability

More information

Assessing the Reliability of Rating Data

Assessing the Reliability of Rating Data Paul Baett age 1 Assessing the Reliability of Rating Data Ratings ae any kind of coding (qualitative o quantitative) made concening attitudes, behavious, o cognitions. Hee, I am concened with those kinds

More information

Research Design - - Topic 17 Multiple Regression & Multiple Correlation: Two Predictors 2009 R.C. Gardner, Ph.D.

Research Design - - Topic 17 Multiple Regression & Multiple Correlation: Two Predictors 2009 R.C. Gardner, Ph.D. Reseach Design - - Topic 7 Multiple Regession & Multiple Coelation: Two Pedictos 009 R.C. Gadne, Ph.D. Geneal Rationale and Basic Aithmetic fo two pedictos Patial and semipatial coelation Regession coefficients

More information

3.1 Random variables

3.1 Random variables 3 Chapte III Random Vaiables 3 Random vaiables A sample space S may be difficult to descibe if the elements of S ae not numbes discuss how we can use a ule by which an element s of S may be associated

More information

Encapsulation theory: the transformation equations of absolute information hiding.

Encapsulation theory: the transformation equations of absolute information hiding. 1 Encapsulation theoy: the tansfomation equations of absolute infomation hiding. Edmund Kiwan * www.edmundkiwan.com Abstact This pape descibes how the potential coupling of a set vaies as the set is tansfomed,

More information

Weighted least-squares estimators of parametric functions of the regression coefficients under a general linear model

Weighted least-squares estimators of parametric functions of the regression coefficients under a general linear model Ann Inst Stat Math (2010) 62:929 941 DOI 10.1007/s10463-008-0199-8 Weighted least-squaes estimatos of paametic functions of the egession coefficients unde a geneal linea model Yongge Tian Received: 9 Januay

More information

Recent developments in Code Aster to compute FRF and modes of VEM with frequency dependent properties

Recent developments in Code Aster to compute FRF and modes of VEM with frequency dependent properties Recent develoments in Code Aste to comute FRF and modes of VEM with fequency deendent oeties. Melette a, E. Pagnacco b and A. Ladie b a TAGET DELTA, 1 ue Adolhe Robet, 5800 Cosne su Loie, Fance b Laboatoie

More information

Fractional Zero Forcing via Three-color Forcing Games

Fractional Zero Forcing via Three-color Forcing Games Factional Zeo Focing via Thee-colo Focing Games Leslie Hogben Kevin F. Palmowski David E. Robeson Michael Young May 13, 2015 Abstact An -fold analogue of the positive semidefinite zeo focing pocess that

More information

DESIGN OF BEAMS FOR MOMENTS

DESIGN OF BEAMS FOR MOMENTS CHAPTER Stuctual Steel Design RFD ethod Thid Edition DESIGN OF BEAS FOR OENTS A. J. Clak School of Engineeing Deatment of Civil and Envionmental Engineeing Pat II Stuctual Steel Design and Analysis 9 FA

More information

Multiple Criteria Secretary Problem: A New Approach

Multiple Criteria Secretary Problem: A New Approach J. Stat. Appl. Po. 3, o., 9-38 (04 9 Jounal of Statistics Applications & Pobability An Intenational Jounal http://dx.doi.og/0.785/jsap/0303 Multiple Citeia Secetay Poblem: A ew Appoach Alaka Padhye, and

More information

II. Non-paper: Zeta functions in scattering problems evaluated on the real wave number axis. Andreas Wirzba. Institut fur Kernphysik, TH Darmstadt

II. Non-paper: Zeta functions in scattering problems evaluated on the real wave number axis. Andreas Wirzba. Institut fur Kernphysik, TH Darmstadt Damstadt, Octobe, 995 II. Non-ae: Zeta functions in scatteing oblems evaluated on the eal wave numbe axis Andeas Wizba Institut fu Kenhysik, TH Damstadt Schlogatenst. 9, D-6489 Damstadt, Gemany email:

More information

Chapter 5 Linear Equations: Basic Theory and Practice

Chapter 5 Linear Equations: Basic Theory and Practice Chapte 5 inea Equations: Basic Theoy and actice In this chapte and the next, we ae inteested in the linea algebaic equation AX = b, (5-1) whee A is an m n matix, X is an n 1 vecto to be solved fo, and

More information

The Substring Search Problem

The Substring Search Problem The Substing Seach Poblem One algoithm which is used in a vaiety of applications is the family of substing seach algoithms. These algoithms allow a use to detemine if, given two chaacte stings, one is

More information

49 th European Organization for Quality Congress Topic: Quality improvement. SPC in Short Production Runs with Autocorrelated Data

49 th European Organization for Quality Congress Topic: Quality improvement. SPC in Short Production Runs with Autocorrelated Data 49 th Euoean Oganization fo Quality Congess Toic: Quality imovement SPC in Shot Poduction Runs with Autocoelated Data José F. Gomes Requeio, Ana Vilela atos and Zulema oes Peeia Deatment of echanical and

More information

Computing Electromagnetic Fields in Inhomogeneous Media Using Lattice Gas Automata. I. Introduction

Computing Electromagnetic Fields in Inhomogeneous Media Using Lattice Gas Automata. I. Introduction Comuting Electomagnetic Fields in Inhomogeneous Media Using Lattice Gas Automata M.Zhang, D. Cule, L. Shafai, G. Bidges and N.Simons Deatment of Electical and Comute Engineeing Univesity of Manitoba Winnieg,

More information

Information Retrieval Advanced IR models. Luca Bondi

Information Retrieval Advanced IR models. Luca Bondi Advanced IR models Luca Bondi Advanced IR models 2 (LSI) Pobabilistic Latent Semantic Analysis (plsa) Vecto Space Model 3 Stating point: Vecto Space Model Documents and queies epesented as vectos in the

More information

Sincere Voting and Information Aggregation with Voting Costs

Sincere Voting and Information Aggregation with Voting Costs Sincee Voting and Infomation Aggegation with Voting Costs Vijay Kishna y and John Mogan z August 007 Abstact We study the oeties of euilibium voting in two-altenative elections unde the majoity ule. Votes

More information

6 PROBABILITY GENERATING FUNCTIONS

6 PROBABILITY GENERATING FUNCTIONS 6 PROBABILITY GENERATING FUNCTIONS Cetain deivations pesented in this couse have been somewhat heavy on algeba. Fo example, detemining the expectation of the Binomial distibution (page 5.1 tuned out to

More information

A NEW VARIABLE STIFFNESS SPRING USING A PRESTRESSED MECHANISM

A NEW VARIABLE STIFFNESS SPRING USING A PRESTRESSED MECHANISM Poceedings of the ASME 2010 Intenational Design Engineeing Technical Confeences & Computes and Infomation in Engineeing Confeence IDETC/CIE 2010 August 15-18, 2010, Monteal, Quebec, Canada DETC2010-28496

More information

APPLICATION OF MAC IN THE FREQUENCY DOMAIN

APPLICATION OF MAC IN THE FREQUENCY DOMAIN PPLICION OF MC IN HE FREQUENCY DOMIN D. Fotsch and D. J. Ewins Dynamics Section, Mechanical Engineeing Depatment Impeial College of Science, echnology and Medicine London SW7 2B, United Kingdom BSRC he

More information

A Converse to Low-Rank Matrix Completion

A Converse to Low-Rank Matrix Completion A Convese to Low-Rank Matix Completion Daniel L. Pimentel-Alacón, Robet D. Nowak Univesity of Wisconsin-Madison Abstact In many pactical applications, one is given a subset Ω of the enties in a d N data

More information

Central Coverage Bayes Prediction Intervals for the Generalized Pareto Distribution

Central Coverage Bayes Prediction Intervals for the Generalized Pareto Distribution Statistics Reseach Lettes Vol. Iss., Novembe Cental Coveage Bayes Pediction Intevals fo the Genealized Paeto Distibution Gyan Pakash Depatment of Community Medicine S. N. Medical College, Aga, U. P., India

More information

ASTR415: Problem Set #6

ASTR415: Problem Set #6 ASTR45: Poblem Set #6 Cuan D. Muhlbege Univesity of Mayland (Dated: May 7, 27) Using existing implementations of the leapfog and Runge-Kutta methods fo solving coupled odinay diffeential equations, seveal

More information

arxiv: v1 [physics.pop-ph] 3 Jun 2013

arxiv: v1 [physics.pop-ph] 3 Jun 2013 A note on the electostatic enegy of two point chages axiv:1306.0401v1 [physics.pop-ph] 3 Jun 013 A C Tot Instituto de Física Univesidade Fedeal do io de Janeio Caixa Postal 68.58; CEP 1941-97 io de Janeio,

More information

MATH 415, WEEK 3: Parameter-Dependence and Bifurcations

MATH 415, WEEK 3: Parameter-Dependence and Bifurcations MATH 415, WEEK 3: Paamete-Dependence and Bifucations 1 A Note on Paamete Dependence We should pause to make a bief note about the ole played in the study of dynamical systems by the system s paametes.

More information

4/18/2005. Statistical Learning Theory

4/18/2005. Statistical Learning Theory Statistical Leaning Theoy Statistical Leaning Theoy A model of supevised leaning consists of: a Envionment - Supplying a vecto x with a fixed but unknown pdf F x (x b Teache. It povides a desied esponse

More information

Unobserved Correlation in Ascending Auctions: Example And Extensions

Unobserved Correlation in Ascending Auctions: Example And Extensions Unobseved Coelation in Ascending Auctions: Example And Extensions Daniel Quint Univesity of Wisconsin Novembe 2009 Intoduction In pivate-value ascending auctions, the winning bidde s willingness to pay

More information

Scaling of hydraulic functions in heterogeneous soil using nonlinear least squares minimization method

Scaling of hydraulic functions in heterogeneous soil using nonlinear least squares minimization method Scaling of hydaulic functions in heteogeneous soil using nonlinea least squaes minimization method Akam Ahmed* Division of Fam Machiney and Post Havest Technology, Indian Gassland and Fodde Reseach Institute,

More information

An Exact Solution of Navier Stokes Equation

An Exact Solution of Navier Stokes Equation An Exact Solution of Navie Stokes Equation A. Salih Depatment of Aeospace Engineeing Indian Institute of Space Science and Technology, Thiuvananthapuam, Keala, India. July 20 The pincipal difficulty in

More information

Solution to HW 3, Ma 1a Fall 2016

Solution to HW 3, Ma 1a Fall 2016 Solution to HW 3, Ma a Fall 206 Section 2. Execise 2: Let C be a subset of the eal numbes consisting of those eal numbes x having the popety that evey digit in the decimal expansion of x is, 3, 5, o 7.

More information

Math 124B February 02, 2012

Math 124B February 02, 2012 Math 24B Febuay 02, 202 Vikto Gigoyan 8 Laplace s equation: popeties We have aleady encounteed Laplace s equation in the context of stationay heat conduction and wave phenomena. Recall that in two spatial

More information

A Crash Course in (2 2) Matrices

A Crash Course in (2 2) Matrices A Cash Couse in ( ) Matices Seveal weeks woth of matix algeba in an hou (Relax, we will only stuy the simplest case, that of matices) Review topics: What is a matix (pl matices)? A matix is a ectangula

More information

Journal of Inequalities in Pure and Applied Mathematics

Journal of Inequalities in Pure and Applied Mathematics Jounal of Inequalities in Pue and Applied Mathematics COEFFICIENT INEQUALITY FOR A FUNCTION WHOSE DERIVATIVE HAS A POSITIVE REAL PART S. ABRAMOVICH, M. KLARIČIĆ BAKULA AND S. BANIĆ Depatment of Mathematics

More information

Max-Planck-Institut fu Mathematik in den Natuwissenschaften Leizig Absence of stationay, sheically symmetic black hole solutions fo Einstein-Diac-Yang/Mills equations with angula momentum by Felix Finste,

More information

Maximal Inequalities for the Ornstein-Uhlenbeck Process

Maximal Inequalities for the Ornstein-Uhlenbeck Process Poc. Ame. Math. Soc. Vol. 28, No., 2, (335-34) Reseach Reot No. 393, 998, Det. Theoet. Statist. Aahus Maimal Ineualities fo the Onstein-Uhlenbeck Pocess S.. GRAVRSN 3 and G. PSKIR 3 Let V = (V t ) t be

More information

2. The Munich chain ladder method

2. The Munich chain ladder method ntoduction ootstapping has become vey popula in stochastic claims eseving because of the simplicity and flexibility of the appoach One of the main easons fo this is the ease with which it can be implemented

More information

10/04/18. P [P(x)] 1 negl(n).

10/04/18. P [P(x)] 1 negl(n). Mastemath, Sping 208 Into to Lattice lgs & Cypto Lectue 0 0/04/8 Lectues: D. Dadush, L. Ducas Scibe: K. de Boe Intoduction In this lectue, we will teat two main pats. Duing the fist pat we continue the

More information

B. Spherical Wave Propagation

B. Spherical Wave Propagation 11/8/007 Spheical Wave Popagation notes 1/1 B. Spheical Wave Popagation Evey antenna launches a spheical wave, thus its powe density educes as a function of 1, whee is the distance fom the antenna. We

More information

CHAPTER 3 NUMERICAL AND EXPERIMENTAL INVESTIGATIONS OF SOLIDIFICATION IN A CYLINDRICAL PCM STORAGE UNIT

CHAPTER 3 NUMERICAL AND EXPERIMENTAL INVESTIGATIONS OF SOLIDIFICATION IN A CYLINDRICAL PCM STORAGE UNIT 46 CHAPER 3 NUMERICAL AND EXPERIMENAL INVESIGAIONS OF SOLIDIFICAION IN A CYLINDRICAL PCM SORAGE UNI he design of a PCM based stoage system along with the flow of heat tansfe fluids (HF) involves solidification

More information

THE CONE THEOREM JOEL A. TROPP. Abstract. We prove a fixed point theorem for functions which are positive with respect to a cone in a Banach space.

THE CONE THEOREM JOEL A. TROPP. Abstract. We prove a fixed point theorem for functions which are positive with respect to a cone in a Banach space. THE ONE THEOEM JOEL A. TOPP Abstact. We pove a fixed point theoem fo functions which ae positive with espect to a cone in a Banach space. 1. Definitions Definition 1. Let X be a eal Banach space. A subset

More information

Math 301: The Erdős-Stone-Simonovitz Theorem and Extremal Numbers for Bipartite Graphs

Math 301: The Erdős-Stone-Simonovitz Theorem and Extremal Numbers for Bipartite Graphs Math 30: The Edős-Stone-Simonovitz Theoem and Extemal Numbes fo Bipatite Gaphs May Radcliffe The Edős-Stone-Simonovitz Theoem Recall, in class we poved Tuán s Gaph Theoem, namely Theoem Tuán s Theoem Let

More information

Quasi-Randomness and the Distribution of Copies of a Fixed Graph

Quasi-Randomness and the Distribution of Copies of a Fixed Graph Quasi-Randomness and the Distibution of Copies of a Fixed Gaph Asaf Shapia Abstact We show that if a gaph G has the popety that all subsets of vetices of size n/4 contain the coect numbe of tiangles one

More information

Regularization. Stephen Scott and Vinod Variyam. Introduction. Outline. Machine. Learning. Problems. Measuring. Performance.

Regularization. Stephen Scott and Vinod Variyam. Introduction. Outline. Machine. Learning. Problems. Measuring. Performance. leaning can geneally be distilled to an optimization poblem Choose a classifie (function, hypothesis) fom a set of functions that minimizes an objective function Clealy we want pat of this function to

More information

Geometry of the homogeneous and isotropic spaces

Geometry of the homogeneous and isotropic spaces Geomety of the homogeneous and isotopic spaces H. Sonoda Septembe 2000; last evised Octobe 2009 Abstact We summaize the aspects of the geomety of the homogeneous and isotopic spaces which ae most elevant

More information

Physics 121 Hour Exam #5 Solution

Physics 121 Hour Exam #5 Solution Physics 2 Hou xam # Solution This exam consists of a five poblems on five pages. Point values ae given with each poblem. They add up to 99 points; you will get fee point to make a total of. In any given

More information

Applying Model Reference Adaptive Controller for Lateral Motion of Aircraft

Applying Model Reference Adaptive Controller for Lateral Motion of Aircraft Ameican Jounal of Alied Sciences 7 (): 35-40, 010 ISSN 1546-939 010 Science Publications Alying Model Refeence Adative Contolle fo Lateal Motion of Aicaft 1 Abdelkade Maddi, 1 Abdeezak Guesssoum and Daoued

More information

LET a random variable x follows the two - parameter

LET a random variable x follows the two - parameter INTERNATIONAL JOURNAL OF MATHEMATICS AND SCIENTIFIC COMPUTING ISSN: 2231-5330, VOL. 5, NO. 1, 2015 19 Shinkage Bayesian Appoach in Item - Failue Gamma Data In Pesence of Pio Point Guess Value Gyan Pakash

More information

Many Electron Atoms. Electrons can be put into approximate orbitals and the properties of the many electron systems can be catalogued

Many Electron Atoms. Electrons can be put into approximate orbitals and the properties of the many electron systems can be catalogued Many Electon Atoms The many body poblem cannot be solved analytically. We content ouselves with developing appoximate methods that can yield quite accuate esults (but usually equie a compute). The electons

More information

Section 8.2 Polar Coordinates

Section 8.2 Polar Coordinates Section 8. Pola Coodinates 467 Section 8. Pola Coodinates The coodinate system we ae most familia with is called the Catesian coodinate system, a ectangula plane divided into fou quadants by the hoizontal

More information

Temporal-Difference Learning

Temporal-Difference Learning .997 Decision-Making in Lage-Scale Systems Mach 17 MIT, Sping 004 Handout #17 Lectue Note 13 1 Tempoal-Diffeence Leaning We now conside the poblem of computing an appopiate paamete, so that, given an appoximation

More information

ONE-POINT CODES USING PLACES OF HIGHER DEGREE

ONE-POINT CODES USING PLACES OF HIGHER DEGREE ONE-POINT CODES USING PLACES OF HIGHER DEGREE GRETCHEN L. MATTHEWS AND TODD W. MICHEL DEPARTMENT OF MATHEMATICAL SCIENCES CLEMSON UNIVERSITY CLEMSON, SC 29634-0975 U.S.A. E-MAIL: GMATTHE@CLEMSON.EDU, TMICHEL@CLEMSON.EDU

More information

BEST CONSTANTS FOR UNCENTERED MAXIMAL FUNCTIONS. Loukas Grafakos and Stephen Montgomery-Smith University of Missouri, Columbia

BEST CONSTANTS FOR UNCENTERED MAXIMAL FUNCTIONS. Loukas Grafakos and Stephen Montgomery-Smith University of Missouri, Columbia BEST CONSTANTS FOR UNCENTERED MAXIMAL FUNCTIONS Loukas Gafakos and Stehen Montgomey-Smith Univesity of Missoui, Columbia Abstact. We ecisely evaluate the oeato nom of the uncenteed Hady-Littlewood maximal

More information

Chem 453/544 Fall /08/03. Exam #1 Solutions

Chem 453/544 Fall /08/03. Exam #1 Solutions Chem 453/544 Fall 3 /8/3 Exam # Solutions. ( points) Use the genealized compessibility diagam povided on the last page to estimate ove what ange of pessues A at oom tempeatue confoms to the ideal gas law

More information

Edge Cover Time for Regular Graphs

Edge Cover Time for Regular Graphs 1 2 3 47 6 23 11 Jounal of Intege Sequences, Vol. 11 (28, Aticle 8.4.4 Edge Cove Time fo Regula Gahs Robeto Tauaso Diatimento di Matematica Univesità di Roma To Vegata via della Riceca Scientifica 133

More information

Surveillance Points in High Dimensional Spaces

Surveillance Points in High Dimensional Spaces Société de Calcul Mathématique SA Tools fo decision help since 995 Suveillance Points in High Dimensional Spaces by Benad Beauzamy Januay 06 Abstact Let us conside any compute softwae, elying upon a lage

More information

Outline. Lecture 12: Multi-view geometry / Stereo III. Review: the essential matrix. Review: disparity/depth maps

Outline. Lecture 12: Multi-view geometry / Stereo III. Review: the essential matrix. Review: disparity/depth maps Outline Lectue : ulti-vie geomet / Steeo III Tuesda, Oct CS 78/95T of. Kisten Gauman Last lectue: steeo econstuction ith calibated cameas non-geometic coesondence constaints Homogeneous coodinates, ojection

More information

An Application of Fuzzy Linear System of Equations in Economic Sciences

An Application of Fuzzy Linear System of Equations in Economic Sciences Austalian Jounal of Basic and Applied Sciences, 5(7): 7-14, 2011 ISSN 1991-8178 An Application of Fuzzy Linea System of Equations in Economic Sciences 1 S.H. Nassei, 2 M. Abdi and 3 B. Khabii 1 Depatment

More information

Voltage ( = Electric Potential )

Voltage ( = Electric Potential ) V-1 of 10 Voltage ( = lectic Potential ) An electic chage altes the space aound it. Thoughout the space aound evey chage is a vecto thing called the electic field. Also filling the space aound evey chage

More information

n 1 Cov(X,Y)= ( X i- X )( Y i-y ). N-1 i=1 * If variable X and variable Y tend to increase together, then c(x,y) > 0

n 1 Cov(X,Y)= ( X i- X )( Y i-y ). N-1 i=1 * If variable X and variable Y tend to increase together, then c(x,y) > 0 Covaiance and Peason Coelation Vatanian, SW 540 Both covaiance and coelation indicate the elationship between two (o moe) vaiables. Neithe the covaiance o coelation give the slope between the X and Y vaiable,

More information

Defining a Pseudo-Metric Topology on Linear Dynamic Systems

Defining a Pseudo-Metric Topology on Linear Dynamic Systems Peints of the 9th Wold Congess The Intenational Fedeation of Automatic Contol Defining a Pseudo-Metic Toology on Linea Dynamic Systems Vahid Hassani Andew Ross Nowegian Maine Technology Reseach Institute

More information

Classical Worm algorithms (WA)

Classical Worm algorithms (WA) Classical Wom algoithms (WA) WA was oiginally intoduced fo quantum statistical models by Pokof ev, Svistunov and Tupitsyn (997), and late genealized to classical models by Pokof ev and Svistunov (200).

More information

working pages for Paul Richards class notes; do not copy or circulate without permission from PGR 2004/11/3 10:50

working pages for Paul Richards class notes; do not copy or circulate without permission from PGR 2004/11/3 10:50 woking pages fo Paul Richads class notes; do not copy o ciculate without pemission fom PGR 2004/11/3 10:50 CHAPTER7 Solid angle, 3D integals, Gauss s Theoem, and a Delta Function We define the solid angle,

More information

On the Quasi-inverse of a Non-square Matrix: An Infinite Solution

On the Quasi-inverse of a Non-square Matrix: An Infinite Solution Applied Mathematical Sciences, Vol 11, 2017, no 27, 1337-1351 HIKARI Ltd, wwwm-hikaicom https://doiog/1012988/ams20177273 On the Quasi-invese of a Non-squae Matix: An Infinite Solution Ruben D Codeo J

More information

18.06 Problem Set 4 Solution

18.06 Problem Set 4 Solution 8.6 Poblem Set 4 Solution Total: points Section 3.5. Poblem 2: (Recommended) Find the lagest possible numbe of independent vectos among ) ) ) v = v 4 = v 5 = v 6 = v 2 = v 3 =. Solution (4 points): Since

More information

7.2. Coulomb s Law. The Electric Force

7.2. Coulomb s Law. The Electric Force Coulomb s aw Recall that chaged objects attact some objects and epel othes at a distance, without making any contact with those objects Electic foce,, o the foce acting between two chaged objects, is somewhat

More information