ESTIMATION BIAS IN SPATIAL MODELS WITH STRONGLY CONNECTED WEIGHT MATRICES

Size: px
Start display at page:

Download "ESTIMATION BIAS IN SPATIAL MODELS WITH STRONGLY CONNECTED WEIGHT MATRICES"

Transcription

1 ESTIMATION BIAS IN SPATIAL MODELS ITH STRONGLY CONNECTED EIGHT MATRICES Toy E. Smith Departmet of Systems ad Electrical Egieerig Uiversity of Pesylvaia Jauary, 008 (Revised Jue 8, 008) Abstract I this paper it is show that for both spatial lag ad spatial autoregressive models with strogly coected weight matrices, maximum likelihood estimates of the spatial depedece parameter are ecessarily biased dowward. I additio, it is show that same bias is preset i geeral Mora tests of spatial depedecy, so that positive depedecies will ofte fail to be detected whe weight matrices are strogly coected. A simulated umerical example is preseted to illustrate some of the practical cosequeces of these biases. Key ords: bias, spatial lag models, spatial autoregressive models, Mora test

2 1. Itroductio I a recet simulatio study, Mizruchi ad Neuma (008) have show that for spatial lag models with strogly coected (high desity) weight matrices, there is ofte a severe dowward bias i maximum-likelihood estimates of the spatial depedecy parameter. 1 This same bias is also reported by Farber, Páez ad Volz (008) i their recet simulatio study of the ifluece of etwork topology o tests of spatial depedecies. Hece the cetral purpose of this paper is to clarify the ature of this bias from a aalytical perspective. I additio, it is show that same bias is preset i both spatial autoregressive models ad i the more geeral Mora test of spatial depedecy. I all cases this bias implies that sigificatly positive spatial depedecies will ofte fail to be detected whe weight matrices are strogly coected. To establish these results, the aalytical strategy will be to cosider the extreme case of maximally coected weight matrices, ad to obtai exact results for this case. The rest will the follow from simple cotiuity cosideratios. To avoid repetitio, the aalytical developmet of spatial regressio models will focus o spatial lag models. Parallel results for spatial autoregressive models will simply be sketched. Hece to fix the ideas, we begi with the followig a stadard spatial lag model (SL) for spatial uits: (1) y = y Xβ ε ε N σ I, ~ (0, ) ( k 1) where y R is some variable of iterest ad X = [1, x1,.., xk] R represets a relevat set of k explaatory variables, with 1 = (1,..,1) deotig the uit -vector (correspodig to the itercept term i this liear model). [Throughout the followig 1 aalysis it will always be assumed that X has full colum rak, k 1, so that ( X X ) exists.] The ukow parameters of the model iclude the vector, β = ( β0, β1,.., β k ) of beta coefficiets, the variace, σ, of each residual i ε, ad the spatial depedece parameter,, which is of primary iterest i the preset aalysis. Also of major iterest is the structure of the spatial weight matrix,. For purposes of the preset aalysis, it is coveiet to begi by characterizig these matrices i the followig way. First we choose a fixed positive scalar, b, to serve as a upper boud o weight values. ith respect to this boud, a -square matrix, = ( wij : i, j = 1,.. ), is desigated as a weight matrix iff (i) w ii = 0 ad (ii) 0 wij b for all i, j = 1,..,. As usual, coditio (i) specifies that depedecies are defied oly betwee distict spatial uits. Coditio (ii) ca be thought of as a ormalizatio coditio the allows each weight, w, to be iterpreted as the degree of coectivity betwee i ad j, where wij ij = b implies a maximal degree of coectivity. This is particularly appropriate for 1 I am idebted to a referee for poitig out that similar observatios were made by Bao ad Ullah (007) with respect to the secod order bias of these estimates i the cotext of a pure spatial lag model with circular weight matrices of varyig degrees of coectivity.

3 applicatios of model (1) to social etworks amog agets. For the preset, the boud b oly serves as a coveiet coceptual device, ad ca be set equal to oe without loss of geerality. However, the questio of appropriate matrix ormalizatios for the estimatio of is of some importace, ad will be addressed below. If the class of all -square weight matrices is deoted by R (where the fixed scale parameter b is take to be implicit), the the relevat geometry of this set ca be depicted for the = case as follows. Observe that each matrix is of the form () 0 w 1 = w 1 0 ad thus is fully characterized by the -vector ( w1, w 1). Hece the etire class is see to be equivalet to the poits i the square, [0, b ], show below: Figure 1 here Here the lower left had corer correspods to the miimally coected weight matrix, with all zero compoets, ad the upper right had corer correspods to the maximally coected weight matrix,, with all off-diagoal elemets equal to b. This depictio for the x case makes it clear that ad are the two atural extreme weight matrices i for all. 3 Sice correspods to complete statistical idepedece i model (1), attetio has aturally focused o those weight matrices,, that are sufficietly close to to iherit all of its desirable large-sample properties (such as cosistecy ad asymptotic ormality of parameter estimates). Thus most of the literature has focused o those matrices i the lower left eighborhood show i Figure 1. I this cotext, the distiguishig feature of the preset aalysis is that it focuses rather o the upper right eighborhood i Figure 1, which for the momet we loosely desigate as strogly coected weight matrices. 4 Our cetral objective is to show ot oly that such weight matrices fail to share the desirable properties of the idepedece case, but also to determie the exact ature of this failure. Of particular iterest will be the severe dowward bias i maximum likelihood estimates of the spatial depedecy parameter,. This termiology is ot to be cofused with the graph-theoretical otio of totally coected which refers oly to the presece of a ozero liks betwee all distict ode pairs. 3 This ca also be expressed i terms of the (cell-wise) matrix iequalities for all 4 Here it should be oted that maximally coected spatial weight matrices have bee previously studied i a somewhat differet cotext by Kelejia ad Prucha (00) who described them simply as models with equal spatial weights [see also Kelejia, Prucha ad Yuzefovich (006) ad Baltagi (006)]. 3

4 To establish this result i a self-cotaied maer, it is coveiet to begi with a detailed developmet of the maximum-likelihood estimatio problem for the spatial lag model i Sectio below. This is followed i Sectio 3 by a aalysis of the maximally coected case,, i the upper right corer. The results for this case are exteded by cotiuity i Sectio 4 to all matrices sufficietly close to i a appropriate sese, ad some umerical illustratios are give. I Sectio 5 it is show that these results are essetially the same for spatial autoregressive models. Fially it is show i Sectio 6 that strog coectivity also has cosequeces for Mora diagostic tests of spatial idepedece.. Maximum Likelihood Estimatio for SL Models Model (1) implies that y is multiormally distributed, ad i particular that for ay give data, ( yx, ) the log likelihood fuctio for parameters ( β, σ, ) takes the form: 5 (3) L y X cost I ( βσ,,, ) = l( σ ) l det( ) 1 (( I ) y X ) (( I ) y X ) σ β β where I is the -square idetity matrix, ad where all terms ot ivolvig the parameters are subsumed i cost. As with all geeralized liear models, oe proceeds by first fixig the covariace parameters (i this case, ) ad maximizig the likelihood fuctio i β ad σ to produce the well-kow closed form coditioal estimates: (4) ˆ ( ) ( ) 1 sl = ( ) β X X X I y (5) ( ˆ ˆ σ ( ) (1/ ) ( ) ( )) (( ) ˆ sl = I y Xβsl I y Xβsl ( ) ) where the subscript sl deotes the SL model. These are the substituted ito (3) to yield a reduced fuctio desigated as the cocetrated likelihood fuctio, L sl, for. After some simple cacellig of terms, this fuctio takes the form: (6) L y X = cost I ˆ σ sl (, ) l det( ) ( /)l[ ( )] Oe the maximizes this fuctio to obtai the maximum likelihood estimate, ˆ, of ad the substitutes this value ito (4) ad (5) to obtai correspodig maximum likelihood estimates, ˆ ˆ β ( ˆ = βsl ) ad ˆ σ ˆ ( ˆ = σsl ), of β ad σ respectively. However, our primary iterest here is i ˆ itself. 5 Most of the followig developmet is quite stadard, ad ca be foud i may refereces icludig Aseli (1988) ad Aseli ad Bera (1998, sectio III.B). 4

5 To aalyze the fuctio, L sl, oe ca make further reductios as follows [see also Aseli (1988, Sectio 1.1.1)]. First let (7) 1 M = I X( XX) X deote the orthogoal projectio oto the complemet of the spa of X, so that by costructio, M M =, (8) (9) 1 MX = X X( XX ) XX = X X = 0, ad MM = ( I X( XX ) X )( I X( XX ) X ) = I X( XX ) X = M The substitutio of (4) ad (7) ito (5) yields the more compact form of the coditioal variace estimate, 1 1 (10) ˆ σsl( ) = (1/ ) ([ I X( XX ) X ]( I ) y ) ([ I X( XX ) X ]( I ) y) = (1/ ) M( I ) y M( I ) y ( ) ( ) ( ) = (1/ ) y( I ) M( I ) y This i tur allows the cocetrated likelihood i (6) to be writte as (11) L ( y, X ) = cost l det( I ) ( / )l[ y ( I ) M ( I ) y] sl where the term ( /)l(1/ ) has ow bee absorbed ito the costat. Further reductio is possible by observig that if the eigevalues of are deoted by λ( ) = { λi : i = 1,.., }, the the correspodig eigevalues of ( I ) are well kow to be give by λ( I ) = {1 λi : i = 1,.., }. To avoid complicatios i the aalysis to follow, it is coveiet to restrict our attetio to weight matrices,, with real eigevalues (which, most importatly, icludes all which are either symmetric or are row ormalizatios of symmetric matrices). I additio, it will also be assumed that the maximum eigevalue, λ ( ), of is max positive.6 (I particular this icludes all ozero symmetric weight matrices.) Hece we ow restrict our attetio to the subset: (1) = { : λ( ) is real, ad λ ( ) > 0} max 6 This maximum eigevalue is always oegative [Hor ad Johso (1985, Th.8.1.3)], but eed ot be positive eve whe has positive elemets. Eve for = the matrix, = [0 1;0 0], has λ ( ) = {0,0}. 5

6 Give this subset, together with the fact that the determiat of ay matrix is the product of its eigevalues [Hor ad Johso (1985,Th.1..1)], it the follows that (13) det( I ) = (1 λ ) l det( I ) = l 1 λ i i i i as log as each term, 1 λi, o the right had side is ozero. This of course requires further restrictios o. To specify these coditios, we first ote that sice the trace of every matrix is the sum of its eigevalues [Hor ad Johso (1985,Th.1..1)], it follows that i i i (14) λ = tr( ) = w = 0 ii for all. But sice λ ( ) > 0 for all max, this i tur implies that λ ( ), mi must be egative. These observatios together imply that for ay, all terms, 1 λi, i (13) will be positive if the admissible values of are restricted to the ope iterval (15) [ ] = 1 1 ( λ, ( ) λmax ( ) ) mi Hece we ow restrict to the iterval, [ ]. Uder this restrictio, (13) allows (11) to be reduced to the explicit form, (16) L ( y, X ) = cost l 1 λ ( / )l[ y ( I ) M ( I ) y] sl i i which is more readily aalyzed (ad computed). At this poit oe typically proceeds by observig that sice l det( I ) = o the boudaries of [ ], it is reasoable to assume that L sl has a well-defied differetiable maximum i the ope iterval [ ]. This will be true as log as the secod term i (16) is bouded above. To esure this, it must of course be assumed that (17) M ( I ) y 0 for all [ ] To iterpret this coditio, observe that model (1) ca be equivaletly writte as ( I ) y = Xβ ε, where ( I ) y represets the value of y after spatial lag effects have bee accouted for. If this variable is desigated as the effective value of y, (18) y ( ) = ( I ) y i model (1), the as a parallel to classical regressio, it is here assumed that for the give data vector, y, oe of its effective values, { y ( ) : [ ]} is perfectly fitted by X (i.e., lies i the spa of X ). e desigate data sets ( yx, ) satisfyig (17) as -regular. 6

7 Notice that for = 0 this implies the usual regularity coditio that My 0. Data ( y, X ) satisfyig oly this (classical regressio) coditio is simply said to be regular. 3. Biased Estimatio for the Maximally Coected Case i SL Models Give the simple form of the cocetrated likelihood fuctio, L sl, i (16), oe ca proceed to search for a maximum, ˆ, i the iterval [ ] (typically by stadard lie search procedures). However, it turs out that for the maximally coected case,, this maximizatio procedure is doomed to fail. Ideed, the mai result of this sectio will be to show that eve for regular data sets, L sl is always ubouded o [ ]. To establish this result, we begi by aalyzig the properties of. First observe that sice the -square uit matrix is costructible as the outer product, 1 1, the maximally coected weight matrix,, ca be writte as: (19) 0 b b b 0 (1 1 ) = = b b b 0 b I ith this explicit form, the followig result shows that the eigevalues of are computable i closed form: Lemma 1. For all b > 0 the eigevalues of i (19) are give by (0) ( ) { b,.., b, b( 1)} λ = where the eigevalue, b, has multiplicity 1. Proof: It follows from Searle (198, Sectio 1.3.d) that the eigevalues of ay matrix of the form A= ai c 11 are give by (1) λ ( A) = { a,..., a,( a c)} where a has multiplicity 1. Hece the eigevalues of () = b (11 I) = ( b) I ( b)11 are immediately see to be those i (0). The secod (ad most importat) property of maximally coected weight matrices is the followig idetity: 7

8 Lemma. If M is the orthogoal projectio matrix i (7) associated with ay data matrix, X = [1, x1,.., xk], for model (1) the, (3) (4) M = b M = M Proof: Simply observe from (19) ad (7) that M = ( I X( XX ) X ) b (11 I ) 1 = b I X XX X X XX X 1 1 [11 ( ) 11 ( ) ] 1 But sice [ X ( XX ) X ] X = X ad sice 1 is the first colum of X, it follows 1 i particular that [ X( XX ) X ]1 = 1. Hece we see that (5) M = b [1 1 I 1 1 X( XX ) X ] = b [ X( XX ) X I ] = b M Next observe that sice [ X ( XX ) X ]1 = 1 1 = 1 [ X( XX ) X ], it also follows that (6) M = b (1 1 I ) ( I X( XX ) X ) 1 = b X XX X I X XX X 1 1 [11 11 ( ) ( ) ] = b I X XX X 1 [11 11 ( ) ] = b I X XX X = b M. 1 [ ( ) ] Oe useful cosequece of this result is the followig: Lemma 3. Every regular data set, ( y, X ), is -regular. Proof: First observe from Lemma, together with the symmetry of ad M, that for ay [ ] ad data set ( yx, ), (7) y I M I y = y I M M y ( ) ( ) ( )( ) = y ( M M M M ) y = y ( M bm bm b M) y = (1 b b ) ymy 8

9 = (1 b) ymy But [ ] the implies that > 1/ b ad hece that 1 b > 0. Thus - regularity of ( y, X ) will follow if it ca be show that ymy > 0. But sice M is a orthogoal project matrix ad hece is positive semidefiite, it follows that ymy 0 for all y, ad moreover that ymy = 0 My= 0 [Hor ad Johso (1985, p.400)]. Fially sice the regularity of ( y, X ) implies that My 0, it must the be true that ymy > 0, ad thus that -regularity holds. ith these properties, we are ow ready to establish our mai result, amely that L sl is ubouded o [ ]. I particular, we show that L sl icreases without boud as approaches the lower boudary of [ ]. To so, observe from (15) ad (0) that this lower boudary poit,, is give by (8) = 1/ λmi( ) = 1/ b ith this defiitio we ow have: Propositio 1. If = i model (1), the for all regular data sets ( y, X ) ad all decreasig sequeces ( m) i [ ], with lim m m =, (9) lim L ( y, X) m sl m = Proof: The strategy will be to use Lemmas 1 ad to show that the cocetrated likelihood fuctio i (16) is reducible to a simple aalytical form for which the result is obvious. To do so, we first observe from Lemma 1 ad the positivity of mi {1 λ ( )} o [ ] that for ay [ ] we must have 7 i i i i i (30) l 1 λ = l(1 λ ) = ( 1) l[1 ( b)] l[1 b( 1)] i = ( 1)l(1 b) l[1 b( 1)] Moreover, we see from (7) above that (31) = ( /)l[ y( I ) M( I ) y] ( /)l[(1 b) y' My] = { l(1 b) ( / )l( y' My)} 7 For the case of b = 1/( 1) this result appears i sectio.5 of Kelejia ad Prucha (00). 9

10 Notice also from Lemma 3 that this log expressio is well defied for all [ ]. Hece by substitutig (30) ad (31) ito (16) we obtai the followig simple expressio for the cocetrated likelihood fuctio, (3) L ( y, X ) = cost {( 1)l(1 b) l[1 b( 1)]} sl { l(1 b) ( / )l( y' My)} = cost l(1 b) l[1 b( 1)] where the term, ( / )l( ymy ), ot cotaiig has agai bee absorbed i cost. From here we eed oly observe that sice = 1/b, it follows that for ay decreasig sequece ( m) i [ ], with lim m m =, we must have (33) lim L ( y, X ) = cost lim l(1 b) lim l[1 b( 1)] m sl m m m m m ad the result is established. 8 = cost l(1 b) l[1 b( 1)] = cost l(0) l( ) = Hece from a formal viewpoit, it may be cocluded that o maximum likelihood estimator of exists for model (1) whe =. 9 This somewhat surprisig i view of the fact that existece of maximum likelihood estimators for model (1) is geerally assumed to hold as log as ad [ ]. Moreover, it is iterestig to ote that from a practical viewpoit, such a failure would most likely ot eve be detected by stadard software. Ideed, oe would typically observe that the lie-search algorithm has coverged to some value of very close to. To gai further isights here, it is useful to illustrate this fidig with a umerical example, as show i Figure below. This is take from the umerical simulatio example preseted i Sectio 4 below (for a sample size of = 50 ). The First Term ad Secod Term show i Figure correspod, respectively, to the log-determiat expressio (30) ad the log-quadratic expressio (31) i Propositio 1 above. Notice that the log-determiat term is always well behaved, sice it is a sum of simple cocave fuctios, l(1 λ i ), o [ ]. Hece the culprit here is the log-quadratic term, which 8 Note that L is also ubouded at the upper boudary of [ ], amely = 1/ λ ( ) = 1/[ b( 1)]. sl max But sice L ( y, X) =, this is of little iterest for maximum likelihood estimatio. sl 9 This failure of existece is a istace of the more geeral result of Arold (1979, Th.3) regardig the o-existece of maximum-likelihood estimators for covariace parameters i liear models with exchageably distributed errors. I am idebted to Federico Martellosio for poitig this out to me. 10

11 i the preset case ot oly diverges to at, but does so at a faster rate tha the correspodig divergece of the log-determiat to. Figure Here Before examiig the practical cosequeces of this result for strogly coected weight matrices, we give a alterative statemet of Propositio 1 that will also prove useful for applicatios. Recall that our basic regularity assumptio o data ( yx, ) was desiged to avoid cases where some effective y -value, y ( ) was perfectly fitted by the data, X, i model (1). e ow show that Propositio 1 results from the fact that for every data set ( yx, ) i model (1), if =, the X must yield a perfect fit to the effective y - value, y ( ), o the lower boudary of [ ]. This fact depeds critically o the presece of a itercept term i model (1) [as should already be apparet from the proof of Lemma ]. Hece it is ow coveiet to make this itercept term explicit by rewritig model (1) as (34) y = y β X β ε ε N σ 1, ~ (0, ) 0 where X = [ x1,.., x k ] ad β = ( β1,.., β k ). For the particular case of it the follows that for ay choice of X, (35) = β β ε y y 1 0 X I y = β X β ε ( ) 01 y ( ) = β 1 X β ε 0 ith this otatio, recall from Lemma 3 ad (17) that for ay regular data set ( y, X) = ( y,[1, X ]) there exists o [ ] such that the effective value y ( ) is a perfect fit to X, i.e., such that (36) y ( ) = β0( )1 X β( ) for some choice of [ β0( ), β( )]. But eve whe this is true, it turs out that coditio (36) always fails at the lower boudary value,, of [ ] as we ow show: The followig result is essetially cotaied i Theorem 1 of Kelejia, Prucha ad Yuzefovich (006), where it is employed to study the cosistecy properties of SLS estimatio i the case of equal spatial weights. 11

12 Propositio. If = i model (1), the for ay data set (, ) y X, (37) y ( ) = β0 ( ) 1 X β( ) where β0 ( ) = 1 y ad β( ) = 0. Proof: First recall from (8) that for ay positive boud b, (38) = = < 1/ λmi( ) 1/ b 0 Hece it follows that, (39) y ( ) = ( I ) y = { I ( 1/ b)[ b (1 1 I )]} y = [ I (1 1 I )] y = 1 1 y = (1 y) 1 X 0 = β ( ) 1 X β( ) 0 ad the result is established. The advatage of viewig this result i terms of perfect fits is that it provides iformatio about the bias of other parameter estimates. For whe ˆ, expressio (37) suggests that oe should have ˆ β 0 1 y ad ˆ β j 0 for all j = 1,.., k. Moreover, sice a perfect fit ecessarily implies zero variace of residuals, this i tur suggests that ˆ σ e shall explore the practical cosequeces of these fidigs i the ext sectio. 4. Cosequeces for Strogly Coected eight Matrices i SL Models The above results show that for the extreme case of maximally coected matrices, we ca obtai a exact aalytical formulatio of the bias iheret i maximum likelihood estimatio for spatial lag models. This i tur suggests that such bias should be iherited by matrices, that are close to i some appropriate sese. To do so, it is coveiet to edow with a matrix orm that will allow some explicit measure of closeess. Here there are may choices. For example, the 1-orm of ay matrix A= ( aij ) R is A = a 1 ij, ad the -orm (Euclidea orm) of A is A ( a ) 1/ ij ij ij =. 1 However, for our preset purposes, the followig scaled versio of the 1-orm is useful for weight matrices,, 11 These results illustrate the more geeral fidig of Arold (1979, p.196) regardig the icosistecy of stadard parameter estimates for liear models with exchageably distributed errors. 1 May other choices are illustrated i Hor ad Johso (1985, Sectio 5.6). 1

13 1 (40) = = 1 1 ( 1) ij ( 1) ( ij / ) rc b w = ij w b ij 1. which we desigate as the relative coectivity orm. 13 If ( wij / b ) deotes the relative coectivity betwee uits (agets) i ad j, the this is simply the average of these relative coectivities over all distict ( i, j ) pairs. I the case of biary matrices,, this is easily see to reduce to the graph-theoretic otio of average lik desity. Give this orm (or ay other matrix orm), the iduced distace betwee ad is the give by (41) = w b = [1 ( w / b )] 1 1 rc b ( 1) ij ij ( 1) ij ij Next, we observe that up to this poit the actual magitude of has ot bee cosidered. All that has bee asserted is that for ay give weight matrix,, these values must lie i the ope iterval [ ] of expressio (15), ad that this iterval cotais zero (so that both positive ad egative values of are always possible.) But to gai further isight, it is useful to evaluate this iterval i specific cases. I the umerical illustratio below we shall use a sample size, = 50. Hece, by settig the boud at b = 1, it follows from Lemma 1 that for the maximally coected matrix, 50, we have λ ( ) = 1 mi ad λ ( ) 49 max =. The correspodig bouds o for this case are thus see to be (4) ( b b 1 ) ( ) 1/,1/ ( 1) = 1, ( 1,.0) 49 which, from a practical viewpoit, is see to offer little room for positive spatial depedecies at all. Sice positive depedecies are by far the most iterestig for practical applicatios, it is clear that a better choice of b should be cosidered. Here the most atural choice is to set b= 1/( 1), so that uder this ormalizatio we obtai (43) λ = b = = max ( ) ( 1) ( 1)/( 1) 1 This will esure that the iterval, [0,1), of oegative -values used for most applicatios actually lies i [ ]. For = 50 we the have (44) [ ] = ( 1/ b,1) = ( ( 1),1) = ( 49,1) 13 Sice every positive scalig of a orm yields aother orm, the first equality shows that this is ideed a matrix orm. 13

14 Oe additioal feature of this ormalizatio that is particularly useful for the preset aalysis is that 14 (45) [0,1) [ ] for all So this same iterval of -values is available for every choice of. 15 Give this ormalizatio, the objective of this sectio is to exted the bias results for maximally coected weight matrices i Propositio 1 to all weight matrices,, that are strogly coected i the sese that they are sufficietly close to i the relative coectivity orm. To do so, it is coveiet to itroduce the followig additioal covetios. First, for ay give ad data set ( y, X ) for model (1), we shall write the maximum likelihood estimator for as ˆ ( y, X ). As poited out above, this estimator ca fail to exist eve whe ( yx, ) is -regular. But for weight matrices close to (i relative coectivity), it should be clear that if ( yx, ) is - regular, the a differetiable maximum, ˆ ( y, X ), fails to exist oly whe L sl is ubouded at the lower boudary of [ ]. I such cases, we simply set ˆ ( y, X ) equal to this lower boudary, so that ˆ ( y, X ) ca be treated as a well-defied value for each. Next, to quatify the possible bias of these estimates, it is coveiet to focus oly o the most importat case of positive depedecies i model (1), i.e., > 0, ad to quatify various degrees of uderestimatio by iequalities of the form, (46) ˆ ( yx, ) < /(1 α ) where parameter α > 0 ca be iterpreted as a bias factor. For example, a bias factor of α = 1 would imply that ˆ ( y, X ) is less tha half the true value of. More geerally, higher bias factors correspod to more severe uderestimatio of. ith these covetios, we ow have the followig cosequece of Propositio 1: Propositio 3. For ay regular data set ( y, X ) with 3 ad ay give value, 0 (0,1), of the spatial depedece parameter for model (1), there exists for each choice of bias factor, α (0,1), a sufficietly small ε = εα (, 0, yx, ) > 0 such that for all, (47) < ε ˆ (, ) < /(1 α) rc y X 0 14 Expressio (45) follows from the fact that 0 λ ( ) λ ( ) = 1 1/ λ ( ) 1 [see max max max Hor ad Johso (1985, Corollary )]. 15 A alterative ormalizatio that also shares this property is to set b equal to the reciprocal of the smallest row or colum sum, as proposed by Kelejia ad Prucha (008, Lemma ). Though less stadard tha the preset covetio, this ormalizatio has the advatage of beig much easier to compute for large weight matrices. 14

15 Proof Sketch: The proof of this result is rather techical, ad is deferred to the Appedix. But the basic idea is simple. Observe from Figure that ot oly does the cocetrated likelihood fuctio, L sl, diverge to at, but i fact its derivative is everywhere egative i [ ]. Hece, if we ow write the cocetrated likelihood fuctio as, Lsl ( y, X, ), to emphasize its depedece o [as well as data ( y, X ) ], the the strategy of the proof is to show that the correspodig derivative, L sl ( y, X, ), with respect to is cotiuous i at the poit. Usig this cotiuity property, it is the possible to show that for ay choice of bias factor, α, whe is sufficietly close to [i.e., whe ε i (47) is sufficietly small], oe ca guaratee that L sl ( y, X, ) will be egative for all [ ] with /(1 α) 0, ad thus that Lsl ( y, X, ) ca oly achieve a maximum o [, /(1 α)). I other words, for ay degree of bias, α > 0, there is some threshold level of strog coectivity, < ε, which is sufficiet to esure this degree of bias. The proof sketched above also shows (from the persistece of egative slopes) that uder coditios of o spatial depedece (i.e, = 0 ) this ull hypothesis will ted to be falsely rejected i favor of egative depedecies ( < 0) for strogly coected weight matrices. Moreover, i cases where such depedecies are ideed egative, the stregth of these depedecies will ted to be overestimated. But as with all such cotiuity results, Propositio 3 still leaves ope the questio of how strog this coectivity must be i order to see a substatial effect. hile such questios ca oly be aswered defiitively by extesive simulatios, it is oetheless possible to illustrate the potetial sigificace of these results by meas of a typical example. 16 Here we set = 50, k =, ad costruct x -data ( x1, x ) by simulatig two uiformly distributed radom vectors, so that X = [150, x1, x]. Model (1) was the parameterized with β = ( β0, β1, β ) = (1,,3) ad stadard deviatio σ = 1. Agai for sake of illustratio the sigle value, =.5, was chose to represet (substatial) positive spatial depedecy i model (1). To aalyze the effects of strog coectivity, oly symmetric biary weight matrices were used i order to allow a average lik desity iterpretatio of the matrix orm i (40). A umber of matrices,, with differet average lik desities, d = (0,1) were radomly sampled. I particular, the values rc d {.30,.50,.80,.90,.95,.99} were chose for study. For each d a matrix, ( ), d 16 This example is oly meat to illustrate the practical cosequeces of the aalytical results above. As metioed i the itroductio, more extesive ad systematic simulatios ca be foud i both Mizruchi ad Neuma (008) ad Farber, S., A. Páez ad E. Volz (008). 15

16 was radomly sampled from the distributio idepedetly assigig w ij = 1 with probability d ad w ij = 0 otherwise. 17 I order to make the results at differet desity levels more readily comparable, each matrix d was ormalized i the same maer as, by dividig d by its maximum eigevalue. This rescalig esures that the positive values of i each simulated model are exactly the same, amely (0,1). 18 For each of these matrices, 1000 y -vectors were the simulated for model (1), ad correspodig maximum-likelihood estimates { ˆ ( s d ) : s= 1,..,1000} were computed.19 Perhaps the simplest way to summarize these results is to compare the sample mea values of ˆd for each of these desities with the true value, =.50, as i colum of Table 1 below. Table 1 Here As expected, oe sees uderestimatio i all cases, with steadily icreasig severity for higher desities. For compariso, the maximally coected case, d = 1, has bee added to show this extreme case is vastly worse tha all others. But oetheless, oe ca see the cotiuity properties i Propositio 3 at work. Uderestimatio becomes quite severe as coectivity desity icreases. Note also that i Table 1 the correspodig -itervals, [ d ], i (15) above are give i colum 4 (colum 3 will be discussed below). To provide a fuller compariso, selected histograms of { ˆ ( s d ) : s = 1,..,1000} are show for the cases d =.50,.80,.90,.99 i Figure 3 below. 0 Here the true value, =.50, is Figure 3 Here idicated by a bold arrow i each case to facilitate the visual compariso of these estimates. So at average lik-desity levels of at least 80% ( d.80 ) there is a substatial dowward bias i estimates. Aother way to see this is to ask what fractio of these estimates are reported as sigificatly differet from zero i the stadard two-sided tests usig asymptotic z-values. 1 Here, for a true value of =.50, oly the upper 4% of sample estimates at d =.80 are sigificatly differet from zero. he the desity is 17 Note that desity values, d, ca oly be approximated by this samplig procedure. However, repeated samples at each desity level yielded variatios that were too small to warrat reportig. I all cases the matrix, chose had a average lik desity well withi.01 of d. ( d ) 18 The ormalizatio, b = 1/( 1) = 1/49, used above has the theoretical advatage of preservig all relative coectivity relatioships. But the preset scalig to uit maximum eigevalues is a more typical ormalizatio used i practice. For compariso, calculatios were also doe for the 1/49 scalig, ad produced eve more dramatic uderestimatio results tha those preseted here. 19 The estimatio was doe i Matlab usig a modified versio of the LeSage (1999) suite of programs. 0 Cases d =.30 ad. d =.95 are, respectively, very similar to d =.50 ad d =.90, ad are omitted. 16

17 icreased to d =.90 this drops to less tha 15%. Further ivestigatios of such sigificace questios will be take up i Sectio 5 below. Fially, it is of iterest to recall from the discussio followig Propositio above that this uderestimatio of has cosequeces for the bias of other parameter estimates that are at least qualitatively predictable. hile it is difficult to place magitudes o the degree of these biases, they ca at least be illustrated for the simulatios of model (1) above. The mea estimates for all parameters are show i Table below (where the meas for ˆ have bee repeated from Table 1): Table Here Recall from Propositio that for the perfect fit case i the last row of Table, oe would predict a itercept coefficiet, ˆ β 0 1 y. I the preset case, the mea value of 1 y was about 351, which is i clear agreemet with Table. Hece for strogly coected weight matrices this is see to result i extreme overestimatio of β 0 i the preset case. It is also iterestig to ote that while the limitig estimates of β = ( β1, β ) ad σ i Table also agree with the zero values predicted by Propositio, these biases seem to disappear much more rapidly as lik desity decreases. However, it is worth otig that eve a slight dowward bias i ˆ σ (ad hece ˆ σ ) ca have potetially serious cosequeces for testig, where it ca lead to erroeous sigificace of beta parameters. 5. Extesio to Spatial Autoregressive Models The results above demostrate that strog coectivity of weight matrices ca lead to severe bias i the estimatio of spatial depedecies i spatial lag models. Hece it is atural to ask whether similar behavior is exhibited by spatial autoregressive models. Our mai result here is to show that with respect to spatial depedece parameters, the results for these two models are essetially idetical. To establish this, we begi by formulatig this model ad sketchig the parallel maximum likelihood estimatio problem for this case. As a parallel to model (1), the stadard spatial autoregressive model (SAR) for spatial uits: (48) y = Xβ u u = u ε ε N σ I,, ~ (0, ) 1 Note that for tests of positive it is theoretically more appropriate to cosider a oe-sided test ( > 0 ). But such results are ot reported i stadard spatial regressio software. This is also kow as the spatial errors model, to emphasize the spatial depedece amog errors. 17

18 where ow the spatial depedece parameter,, ad spatial weight matrix,, characterize possible spatial depedecies amog the residuals rather tha the depedet variable, y. 3 If oe solves for u ad writes this model i reduced form as (49) y = Xβ I ε ε N σ I 1 ( ), ~ (0, ) the it becomes clear that ad directly ifluece the covariace structure of the residuals, ε. Agai y is multiormally distributed, where the log likelihood fuctio for parameters ( β, σ, ) i (3) ow takes the form: (50) ( βσ,,, ) = l( σ ) l det( ) L y X cost I 1 σ ( y Xβ) ( I )( I )( y Xβ) The parallel betwee (3) ad (50) is eve more clear whe oe solves for the coditioal estimates of β ad σ give, (51) (5) ˆ 1 sar ( ) = [ ( )( ) ] ( )( ) β X I I X X I I y ˆ σ ( ) = (1/ )( y X ˆ β ( )) ( I )( I )( y X ˆ β ( )) sar sar sar ad substitutes ito (50) to obtai the cocetrated likelihood fuctio, L sar, for. Agai, after cacellig of terms, this fuctio reduces to (53) L y X = cost I ˆ σ sar (, ) l det( ) ( /)l[ ( )] which is see to be idetical i form to (6). 4 Hece these cocetrated likelihood fuctios differ oly with respect to their correspodig coditioal variace estimates i (5) ad (5). However, for the special case of maximally coected weight matrices,, it turs out that these coditioal variace estimates are idetical, as we ow show. To do so, we begi with the followig prelimiary result o a certai class of orthogoal 1 projectios, which are exemplified by the key projectio, X ( XX ) X, embodied i k expressio (7) for M. If for ay matrix, A R, of full colum rak k we ow let 1 k P = A( AA ) A deote the orthogoal projectio of R ito the spa of A (so that by A defiitio, PA A = A) the we have the followig useful coditio for equality betwee such projectios: 3 Although the spatial depedece parameter i this model acts o residuals rather tha y, we choose to keep the same otatio,, i order to emphasize the parallels betwee these two models. 4 I particular the costat terms, cost, are also easily show to be idetical. 18

19 k Lemma 4. For ay matrices, AB, R, of full colum rak, (54) PA = PB PAB= B Proof: Sice PB B = B, it follows at oce that PA = PB PAB= PBB= B. So we eed oly show the coverse. To do so, observe first that (55) P B= B P B BB B = B BB B P P = P 1 1 A A [( ) ] [( ) ] A B B Moreover, it also follows that (56) B = PB= A AA AB = A AA AB A 1 1 ( ) ( ) ( ) B B= BA AA AB BB = BA AA AB 1 1 ( ) ( ) AB = BB AA > 0 ad hece that AB is osigular. Thus by the first lie of (56) we have (57) PB= B PAAA AB = AAA AB PA= A 1 1 B B ( ) ( ) ( ) ( ) B where the last implicatio follows by post-multiplicatio of both sides by the 1 iverse of the osigular matrix ( AA ) ( AB ). Hece by the argumet i (55) (58) PA B = A PP B A = PA PP A B = PA where here the last implicatio follows by takig trasposes of both sides ad usig the symmetry of P A ad P B. Hece it be cocluded from (55) ad (58) that (59) PA = PAPB = PB ad the result is established. ith this result, we ow have the followig key idetity betwee SL models (1) ad SAR models (49) for the case of maximally coected weight matrices. Propositio 4. If = i models (1) ad (49), the the cocetrated likelihood fuctios L sl ad L sar are idetical for all [ ]. Proof: To establish this result, it is clear from (6) ad (53) that it suffices to show that the coditioal variace estimates i (10) ad (5) are idetical o [ ]. But if for otatioal coveiece we ow let (60) B = I = I b(1 1 I ) 19

20 [where b= 1/( 1) is oe possibility] the by the first lie of (10), it follows that for the SL model (1), 1 1 (61) ˆ σsl ( ) = (1/ ) ([ I X( XX ) X ] B y ) ([ I X( XX ) X ] By) = I X XX X B y 1 (1/ ) [ ( ) ] = (1/ ) ( I P ) B y X To compare this with the SAR model (49), observe from (51) that (6) ˆ 1 1 ( ) = ( ) = [( )( )] ( ) βsar X BBX XBBy BX BX BX By (63) ad hece from (5) that ˆ σ ( ) (1/ )( ˆ ( )) ( ˆ sar = y Xβsar B B y Xβsar ( )) = (1/ ) B ( ˆ y Xβsar ( )) = B y X BX BX BX By 1 (1/ ) ( [( ) ( )] ( ) ) = I BX BX BX BX By 1 (1/ ) { ( )[( ) ( )] ( ) } ) = (1/ ) ( I P ) B y B X I this form it is clear that the result will follow if it ca be show that (64) PX = B X for all P [ ] But sice X = [1, X ] ad PX X = X together imply that P 1 = 1, we must have X (65) PX( B X) = bpx(1 1 I) X = b( PX1 )1 X bpxx = b(1 1 ) X bx = b(1 1 I ) X = B X ad may coclude from Lemma 4 that (64) holds for all [ ]. Hece it follows at oce from Propositio 4 that for maximally coected weight matrices,, it will always be true that the maximum likelihood estimates, ˆ, of i correspodig SL ad SAR models are idetical. This i tur implies that Propositio 1 must hold i tact if the SL model i (1) is replaced by SAR model i (49). Hece the 0

21 same type of cotiuity argumet i Propositio ca be used to show that the spatial depedece parameter,, i SAR models will be uderestimated for strogly coected weight matrices. Rather tha repeat such argumets here, we simply report the correspodig estimatio results for the SAR model based o the same data X, parameters ( β, σ, ), ad weight matrices, d, d {.30,.50,.80,.90,.95,.99,1.00} used i Sectio 3 above. The results for are displayed i colum 3 of Table 1 i that sectio ad show that, as predicted by Propositio 4, these estimates coverge to the same extreme value as d approaches uity. However it is also clear that (at least i this particular example) the uderestimatio of is eve more severe tha for the SL model above. Table 3 Here The results for other parameter estimates are show i Table 3 above. Notice first that all mea beta estimates appear to be remarkably accurate eve i the maximally coected case. This is explaied by the well kow fact that for the SAR model, ˆβ is always a ubiased estimator of β for a correctly specified model, sice (66) E ˆ β X X I I X X I I E y X 1 ( ) = [ ( )( ) ] ( )( ) ( ) = X I I X X I I Xβ = β 1 [ ( )( ) ] ( )( ) Notice also that there is some slight uderestimatio of residual variace, as i the case of SL models. This does ot appear to be too severe (i the preset example). But agai, eve slight uderestimatio of variace ca lead to erroeous sigificat of beta parameters. Moreover, i the extreme case of maximal coectivity these estimates are i fact completely ustable, as ca be see by the depedecy of ˆβ o ˆ i the coditioal beta estimator of (6) above. If we set (67) ˆ = 1/ λmi ( ) = 1/ b i this extreme case, the (68) Bˆ = I ( 1/ b)[ b(11 I )] = 11 together with 1 1 = implies that (69) ˆ β ( ˆ ) = ( X B B X) XB B y = ( X 11 X) X 11 y 1 1 sar ˆ ˆ ˆ ˆ Hece if there is at least oe explaatory variable other tha the itercept (i.e., if k 1) the the matrix, X 1 1 X, is sigular ad the iverse i (69) will ot exist. I practice 1

22 however, what typically happes is that estimatio algorithms coverge to values close to 1/b which will yield well-defied aswers. I the case illustrated above, where 1/(1/49) = 49, eve values of cotiue to produce reasoable lookig estimates o average. 6. Cosequeces for Mora Tests of Spatial Autocorrelatio Aside from the above cosequeces for spatial regressio models such as SL ad SAR models, strog coectivity of weight matrices has broader implicatios for diagostic aalyses of spatial autocorrelatio. This is most evidet with respect to the sigle most widely used test for spatial autocorrelatio, amely the Mora Test. I particular, suppose that oe cosiders the ull hypothesis of idepedece ( = 0 ), uder which both SL ad SAR models reduce to the stadard liear model: (70) y = Xβ ε, ε N(0, σ I ) If oe costructs the stadard maximum-likelihood (OLS) estimates of β uder this hypothesis, (71) ˆ β = ( X X) 1 X y ad forms the correspodig vector of residual estimates: (7) ˆ ε = y yˆ = y X ˆ β the for ay give cadidate choice of a spatial weight matrix,, the associated Mora statistic, I, is defied by [see for example Aseli (1988, Sectio 8.1.1)]: (73) I = α ˆ ε ˆ ε ˆˆ ε ε where the positive costat, α = / = / w 1 ij ij, plays o substative role i the aalysis to follow. This ca be expressed i a more coveiet form (agai followig Aseli) by otig from (71) ad (7) that (74) 1 1 ˆ ( ) [ ( ) ] ε = y X XX Xy= I X XX X y= My ad hece from (9) that I ca be equivaletly writte as (75) I = α y MMy y My Uder the hypothesis of idepedece i (70), the mea ad variace of I are well kow to be [Aseli (1988, Sectio 8.1.1)]:

23 (76) (77) αtr( M ) EI ( ) = ( k 1) ( α ) { tr( MM ) tr( MM ) [ tr( M )] } var( I ) = [ E( I )] [ ( k 1)] [ ( k 1)] I this settig, our mai result is to show that for maximally coected weight matrices,, this Mora statistic is degeerate. 5 I particular it is completely cocetrated at the mea, EI ( ), ad hece ca ever detect spatial autocorrelatio. To establish this result, we first ote from (75) that this statistic is oly meaigful for data sets ( y, X ) with ymy 0. But sice, ymy = 0 My= 0 (as show i the proof of Lemma 3 above), this is i tur equivalet to the coditio that My 0. Hece, for purposes of this sectio we agai assume regularity of ( y, X ). I additio, we employ the ormalizatio covetio, b= 1/( 1), for so that λ ( ) 1 max =. Fially, for each regular data set, ( y, X ), we let I ( yx, ) deote the correspodig sample value of I i (75). ith these covetios, we have the followig result: Propositio 5. For all regular data sets ( y, X ), (78) I ( yx, ) = EI ( ) Proof: First observe that sice follows that b= = =, it 1/( 1) ( 1)[1/( 1)] 1 (79) α = = / = 1 1 ad hece that the Mora statistic for this case reduces to (80) I ( y, X) = y My y M y Thus we see from Lemma 3 that y ( bm) y 1 ymy 1 (81) I ( y, X) = = = y My 1 y My 1 5 This degeeracy is also a istace of the more geeral result i Arold (1979, Th.5) for the class of ivariat test statistics for liear models with exchageably distributed errors. A more explicit versio relatig to the preset case is give i Martellosio (008, Props. 3.4 ad 3.6). 3

24 ad may coclude that I is ideed cocetrated at a sigle value. To show that this value is precisely the mea, E( I ), uder idepedece, we first ote that sice the trace of orthogoal projectio (symmetric idempotet) matrix, M, is equal to the dimesio of its image space [Searle (198, Sectio 1.)], ad sice the dimesio of the complemet of the spa of X is ( k 1), it follows that (8) tr( M ) = ( k 1) This i tur implies from Lemma 3 that (83) 1 ( k 1) tr( M ) = tr( bm ) = tr( M ) = 1 1 ad hece from (76) ad (79) that tr( M ) 1 (84) EI ( ) = = ( k 1) 1 Thus the result follows from (81) ad (84). As a cosequece of Propositio 5, it follows that (with probability oe) 6 the realized value of I is precisely its expected value uder idepedece. Hece o evidece for spatial depedece ca ever be detected i this extreme case. More geerally, the same type of cotiuity argumet used i Propositio 3 above shows that for weight matrices,, that are sufficietly close to [say i terms of the relative coectivity orm] it must be true that the possible values of I are cocetrated close to the mea EI ( ). So agai this statistic should have little ability to detect spatial depedece. To make these ideas more cocrete, we choose to focus o the stadard z-test for Mora statistics foud i most software. If the stadard deviatio of I uder idepedece is 1/ deoted by σ ( I) = var( I), the it is well kow [Cliff ad Ord(1981, Sectio 8.5.1)] that the stadardized z-value (85) Z = I EI ( ) σ ( I ) is approximately distributed N(0,1) for large. Hece oe ca use this distributio theory to test the hypothesis of spatial idepedece with respect to weight matrix. 7 6 It is a simple matter to show for ay X, the set of y with My = 0 has probability measure zero. 7 It is worth otig here that the exact distributio of I uder idepedece has bee obtaied by Tiefelsdorf ad Boots (1995). However, most statistical packages rely o the asymptotic approximatio above. 4

25 To study the behavior of this test for strogly coected weight matrices, we shall focus oly o the simulatio results i Sectio 3 above based o the SL model. Here it was assumed that =.5 ad hece that a substatial degree of positive spatial depedece is preset. To determie whether this depedece ca be detected by the Mora statistic for a give weight matrix,, it suffices to compute I ( yx, ) for simulated data sets from model (1), ad the examie the frequecy distributio of z-values, Z ( yx, ), geerated by this data. For a oe-sided test of > 0 at the α =.05, oe eed oly cout the fractio of z-values above z α = 1.65 to determie the power of this test to detect positive spatial depedece, give the true value =.5. For the 1000 simulated values at each lik desity level i Sectio 3 above, the resultig estimated power levels are show i Table 4 below. Table 4 Here Here it is clear that at lik desities above.80 the distributio is so cocetrated aroud the ull mea, EI ( ), that eve a depedecy level of =.5 is detectable less tha 10% of the time. 8 It is also of iterest to ote that eve though the distributio of I cocetrates at the ull mea as lik desity approaches 1, the power levels do ot appear to fall to zero i Table 4. The reaso for this is that cocetratio of I values drives the variace i (76) to zero (as ca easily be verified by the same calculatios as for the mea i the proof of Propositio 5 9 ). Hece whe I is highly cocetrated, the stadardized value, Z, becomes ustable (as it approaches the limitig idetermiate values 0/0 for ). 7. Cocludig Remarks I this paper it has bee show that presece of strogly coected spatial weight matrices ca itroduce serious biases ito both the estimatio ad testig of spatial autocorrelatio. Hece oe is led to ask whether there is ay simple ituitive explaatio for this. Oe possibility relates to the otio of effective sample size. It has log bee observed that the presece of statistical depedecies essetially reduce the amout of iformatio gaied from each idividual observatio. For example, the observatio of a sequece of perfectly correlated coi tosses will offer o more iformatio tha the observatio of oly the first toss, o matter how log the sequece is. Hece it ca be argued that i so far as strog spatial coectivity reflects strog depedecies amog 8 It is also of iterest to ote that the.054 value for desity.99 is cosistet with a limitig value of α =.05 for the maximally coected case, as implied by results of Martellosio (008, Prop.3.5). 9 Note i particular from Lemma that for = btr( M) = [ ( k 1)]/( 1)., ( ) = [( )( )] = ( ) tr M M tr bm bm b tr MM 5

26 uits (or agets), there should be less statistical iformatio available for estimatio or tests of hypotheses. But while this argumet has ituitive appeal, ad is o doubt true to some extet, it fails to explai, for example, why maximum-likelihood methods should systematically uderestimate the parameter i SL ad SAR models. I this paper it has bee show that much ca be leared by studyig the extreme case of maximally coected weight matrices,. I particular, both cocetrated likelihood fuctios ad Mora statistics reduce to particularly simple forms i this case, ad ca be studied i detail. But eve i this extreme case, the subtlety of the uderestimatio questio above is uderscored by the fact that quite differet argumets were used to boud the values for each term i the cocetrated log likelihood fuctio. I particular, both the eigevalue structure of 1 ad the relatio of to the regressio projectio operator, I X( XX ) X, were ivolved. So i some respects, these results serve to raise as may theoretical questios as they aswer. Eve more importat are questios relatig to the practical cosequeces of these results. hile the sigle simulatio example preseted here is very suggestive, it ca provide o defiitive guidelies for applicatios. Hece the actual severity of these biases ca oly be determied by more extesive ad systematic simulatio studies, as already begu by Mizruchi ad Neuma (008) ad Farber, Páez ad Volz (008). Refereces. Aseli, L. (1988) Spatial Ecoometrics: Methods ad Models, Kluwer: Netherlads. Aseli, L. ad A. Bera (1998), Spatial Depedece i Liear Regressio Models with a Itroductio to Spatial Ecoometrics. I A. Ullah ad D. Giles (eds.), Hadbook of Applied Ecoomic Statistics, New York: Marcel Dekker, pp Arold, S.F. (1979) Liear models with exchageably distributed errors, Joural of the America Statistical Associatio, 74: Baltagi, B.H., (006) Radom effects ad spatial autocorrelatio with equal weights, Ecoometric Theory, : Bao, Y. ad A. Ullah (007) Fiite sample properties of maximum likelihood estimator i spatial models, Joural of Ecoometrics, 137: Farber, S., A. Páez ad E. Volz (008) Topology ad depedecy tests i spatial ad etwork autoregressive models, forthcomig i Geographical Aalysis (curretly available at 6

27 Hor, R.A. ad C.R. Johso, (1985) Matrix Aalysis, Cambridge Uiversity Press: Cambridge. Kelejia, H.H. ad I.R. Prucha (1998) Geeralized spatial two-stage least squares procedure for estimatig a spatial autoregressive model with autoregressive disturbaces, The Joural of Real Estate Fiace ad Ecoomics, 17: Kelejia, H.H. ad I.R. Prucha (00) SLS ad OLS i a spatial autoregressive model with equal spatial weights, Regioal Sciece ad Urba Ecoomics, 3: Kelejia, H.H., I.R. Prucha ad Y. Yuzefovich, (006) Estimatio problems i models with spatial weightig matrices which have blocks of equal elemets, Joural of Regioal Sciece, 46: Kelejia, H.H. ad I.R. Prucha (008) Specificatio ad estimatio of spatial autoregressive models with autoregressive ad heteroskedastic disturbaces, forthcomig i the Joural of Ecoometrics. LeSage, J. (1999) Spatial Ecoometrics Toolbox, Martellosio, F. (008) Testig for spatial autocorrelatio: the regressors that make the power disappear, orkig Paper, Departmet of Ecoomics, Uiversity of Readig, Readig RG6 6A, UK. Mizruchi, M.S. ad E.J Neuma (008) The effect of desity o the levels of bias i the etwork autocorrelatio model, forthcomig i Social Networks (curretly available at Ord, K. (1975) Estimatio methods for models of spatial iteractio, Joural of the America Statistical Associatio, 70: Searle, S.R., (198) Matrix Algebra Useful for Statistics, iley: New York. Tiefelsdorf, M. ad B.N. Boots (1995) The exact distributio of Mora s I, Eviromet ad Plaig A, 7: Ackowledgemets: The author is grateful to Mark Mizruchi, Eric Neuma, Oleg Smirov, Harry Kelejia ad Federico Martellosio for may helpful commets o a earlier draft of this paper. 7

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator Ecoomics 24B Relatio to Method of Momets ad Maximum Likelihood OLSE as a Maximum Likelihood Estimator Uder Assumptio 5 we have speci ed the distributio of the error, so we ca estimate the model parameters

More information

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence Chapter 3 Strog covergece As poited out i the Chapter 2, there are multiple ways to defie the otio of covergece of a sequece of radom variables. That chapter defied covergece i probability, covergece i

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit Theorems Throughout this sectio we will assume a probability space (, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Asymptotic Results for the Linear Regression Model

Asymptotic Results for the Linear Regression Model Asymptotic Results for the Liear Regressio Model C. Fli November 29, 2000 1. Asymptotic Results uder Classical Assumptios The followig results apply to the liear regressio model y = Xβ + ε, where X is

More information

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Convergence of random variables. (telegram style notes) P.J.C. Spreij Covergece of radom variables (telegram style otes).j.c. Spreij this versio: September 6, 2005 Itroductio As we kow, radom variables are by defiitio measurable fuctios o some uderlyig measurable space

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Chapter 6 Infinite Series

Chapter 6 Infinite Series Chapter 6 Ifiite Series I the previous chapter we cosidered itegrals which were improper i the sese that the iterval of itegratio was ubouded. I this chapter we are goig to discuss a topic which is somewhat

More information

Linear Regression Demystified

Linear Regression Demystified Liear Regressio Demystified Liear regressio is a importat subject i statistics. I elemetary statistics courses, formulae related to liear regressio are ofte stated without derivatio. This ote iteds to

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors ECONOMETRIC THEORY MODULE XIII Lecture - 34 Asymptotic Theory ad Stochastic Regressors Dr. Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Asymptotic theory The asymptotic

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS MASSACHUSTTS INSTITUT OF TCHNOLOGY 6.436J/5.085J Fall 2008 Lecture 9 /7/2008 LAWS OF LARG NUMBRS II Cotets. The strog law of large umbers 2. The Cheroff boud TH STRONG LAW OF LARG NUMBRS While the weak

More information

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator Slide Set 13 Liear Model with Edogeous Regressors ad the GMM estimator Pietro Coretto pcoretto@uisa.it Ecoometrics Master i Ecoomics ad Fiace (MEF) Uiversità degli Studi di Napoli Federico II Versio: Friday

More information

Algebra of Least Squares

Algebra of Least Squares October 19, 2018 Algebra of Least Squares Geometry of Least Squares Recall that out data is like a table [Y X] where Y collects observatios o the depedet variable Y ad X collects observatios o the k-dimesioal

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Singular Continuous Measures by Michael Pejic 5/14/10

Singular Continuous Measures by Michael Pejic 5/14/10 Sigular Cotiuous Measures by Michael Peic 5/4/0 Prelimiaries Give a set X, a σ-algebra o X is a collectio of subsets of X that cotais X ad ad is closed uder complemetatio ad coutable uios hece, coutable

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 9 Multicolliearity Dr Shalabh Departmet of Mathematics ad Statistics Idia Istitute of Techology Kapur Multicolliearity diagostics A importat questio that

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A.

Random Walks on Discrete and Continuous Circles. by Jeffrey S. Rosenthal School of Mathematics, University of Minnesota, Minneapolis, MN, U.S.A. Radom Walks o Discrete ad Cotiuous Circles by Jeffrey S. Rosethal School of Mathematics, Uiversity of Miesota, Mieapolis, MN, U.S.A. 55455 (Appeared i Joural of Applied Probability 30 (1993), 780 789.)

More information

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT OCTOBER 7, 2016 LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT Geometry of LS We ca thik of y ad the colums of X as members of the -dimesioal Euclidea space R Oe ca

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4. 4. BASES I BAACH SPACES 39 4. BASES I BAACH SPACES Sice a Baach space X is a vector space, it must possess a Hamel, or vector space, basis, i.e., a subset {x γ } γ Γ whose fiite liear spa is all of X ad

More information

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3 MATH 337 Sequeces Dr. Neal, WKU Let X be a metric space with distace fuctio d. We shall defie the geeral cocept of sequece ad limit i a metric space, the apply the results i particular to some special

More information

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014. Product measures, Toelli s ad Fubii s theorems For use i MAT3400/4400, autum 2014 Nadia S. Larse Versio of 13 October 2014. 1. Costructio of the product measure The purpose of these otes is to preset the

More information

11 THE GMM ESTIMATION

11 THE GMM ESTIMATION Cotets THE GMM ESTIMATION 2. Cosistecy ad Asymptotic Normality..................... 3.2 Regularity Coditios ad Idetificatio..................... 4.3 The GMM Iterpretatio of the OLS Estimatio.................

More information

7.1 Convergence of sequences of random variables

7.1 Convergence of sequences of random variables Chapter 7 Limit theorems Throughout this sectio we will assume a probability space (Ω, F, P), i which is defied a ifiite sequece of radom variables (X ) ad a radom variable X. The fact that for every ifiite

More information

Advanced Stochastic Processes.

Advanced Stochastic Processes. Advaced Stochastic Processes. David Gamarik LECTURE 2 Radom variables ad measurable fuctios. Strog Law of Large Numbers (SLLN). Scary stuff cotiued... Outlie of Lecture Radom variables ad measurable fuctios.

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Principle Of Superposition

Principle Of Superposition ecture 5: PREIMINRY CONCEP O RUCUR NYI Priciple Of uperpositio Mathematically, the priciple of superpositio is stated as ( a ) G( a ) G( ) G a a or for a liear structural system, the respose at a give

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Lecture 6 Simple alternatives and the Neyman-Pearson lemma STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull

More information

Lecture 3 The Lebesgue Integral

Lecture 3 The Lebesgue Integral Lecture 3: The Lebesgue Itegral 1 of 14 Course: Theory of Probability I Term: Fall 2013 Istructor: Gorda Zitkovic Lecture 3 The Lebesgue Itegral The costructio of the itegral Uless expressly specified

More information

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function. MATH 532 Measurable Fuctios Dr. Neal, WKU Throughout, let ( X, F, µ) be a measure space ad let (!, F, P ) deote the special case of a probability space. We shall ow begi to study real-valued fuctios defied

More information

6 Integers Modulo n. integer k can be written as k = qn + r, with q,r, 0 r b. So any integer.

6 Integers Modulo n. integer k can be written as k = qn + r, with q,r, 0 r b. So any integer. 6 Itegers Modulo I Example 2.3(e), we have defied the cogruece of two itegers a,b with respect to a modulus. Let us recall that a b (mod ) meas a b. We have proved that cogruece is a equivalece relatio

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates

Investigating the Significance of a Correlation Coefficient using Jackknife Estimates Iteratioal Joural of Scieces: Basic ad Applied Research (IJSBAR) ISSN 2307-4531 (Prit & Olie) http://gssrr.org/idex.php?joural=jouralofbasicadapplied ---------------------------------------------------------------------------------------------------------------------------

More information

ECON 3150/4150, Spring term Lecture 3

ECON 3150/4150, Spring term Lecture 3 Itroductio Fidig the best fit by regressio Residuals ad R-sq Regressio ad causality Summary ad ext step ECON 3150/4150, Sprig term 2014. Lecture 3 Ragar Nymoe Uiversity of Oslo 21 Jauary 2014 1 / 30 Itroductio

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Commutativity in Permutation Groups

Commutativity in Permutation Groups Commutativity i Permutatio Groups Richard Wito, PhD Abstract I the group Sym(S) of permutatios o a oempty set S, fixed poits ad trasiet poits are defied Prelimiary results o fixed ad trasiet poits are

More information

The Borel hierarchy classifies subsets of the reals by their topological complexity. Another approach is to classify them by size.

The Borel hierarchy classifies subsets of the reals by their topological complexity. Another approach is to classify them by size. Lecture 7: Measure ad Category The Borel hierarchy classifies subsets of the reals by their topological complexity. Aother approach is to classify them by size. Filters ad Ideals The most commo measure

More information

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology

Advanced Analysis. Min Yan Department of Mathematics Hong Kong University of Science and Technology Advaced Aalysis Mi Ya Departmet of Mathematics Hog Kog Uiversity of Sciece ad Techology September 3, 009 Cotets Limit ad Cotiuity 7 Limit of Sequece 8 Defiitio 8 Property 3 3 Ifiity ad Ifiitesimal 8 4

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Lecture Notes for Analysis Class

Lecture Notes for Analysis Class Lecture Notes for Aalysis Class Topological Spaces A topology for a set X is a collectio T of subsets of X such that: (a) X ad the empty set are i T (b) Uios of elemets of T are i T (c) Fiite itersectios

More information

Basics of Probability Theory (for Theory of Computation courses)

Basics of Probability Theory (for Theory of Computation courses) Basics of Probability Theory (for Theory of Computatio courses) Oded Goldreich Departmet of Computer Sciece Weizma Istitute of Sciece Rehovot, Israel. oded.goldreich@weizma.ac.il November 24, 2008 Preface.

More information

CLRM estimation Pietro Coretto Econometrics

CLRM estimation Pietro Coretto Econometrics Slide Set 4 CLRM estimatio Pietro Coretto pcoretto@uisa.it Ecoometrics Master i Ecoomics ad Fiace (MEF) Uiversità degli Studi di Napoli Federico II Versio: Thursday 24 th Jauary, 2019 (h08:41) P. Coretto

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Solution. 1 Solutions of Homework 1. Sangchul Lee. October 27, Problem 1.1

Solution. 1 Solutions of Homework 1. Sangchul Lee. October 27, Problem 1.1 Solutio Sagchul Lee October 7, 017 1 Solutios of Homework 1 Problem 1.1 Let Ω,F,P) be a probability space. Show that if {A : N} F such that A := lim A exists, the PA) = lim PA ). Proof. Usig the cotiuity

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

4 Multidimensional quantitative data

4 Multidimensional quantitative data Chapter 4 Multidimesioal quatitative data 4 Multidimesioal statistics Basic statistics are ow part of the curriculum of most ecologists However, statistical techiques based o such simple distributios as

More information

Distribution of Random Samples & Limit theorems

Distribution of Random Samples & Limit theorems STAT/MATH 395 A - PROBABILITY II UW Witer Quarter 2017 Néhémy Lim Distributio of Radom Samples & Limit theorems 1 Distributio of i.i.d. Samples Motivatig example. Assume that the goal of a study is to

More information

Statistical Inference Based on Extremum Estimators

Statistical Inference Based on Extremum Estimators T. Rotheberg Fall, 2007 Statistical Iferece Based o Extremum Estimators Itroductio Suppose 0, the true value of a p-dimesioal parameter, is kow to lie i some subset S R p : Ofte we choose to estimate 0

More information

Riesz-Fischer Sequences and Lower Frame Bounds

Riesz-Fischer Sequences and Lower Frame Bounds Zeitschrift für Aalysis ud ihre Aweduge Joural for Aalysis ad its Applicatios Volume 1 (00), No., 305 314 Riesz-Fischer Sequeces ad Lower Frame Bouds P. Casazza, O. Christese, S. Li ad A. Lider Abstract.

More information

Optimally Sparse SVMs

Optimally Sparse SVMs A. Proof of Lemma 3. We here prove a lower boud o the umber of support vectors to achieve geeralizatio bouds of the form which we cosider. Importatly, this result holds ot oly for liear classifiers, but

More information

Probability, Expectation Value and Uncertainty

Probability, Expectation Value and Uncertainty Chapter 1 Probability, Expectatio Value ad Ucertaity We have see that the physically observable properties of a quatum system are represeted by Hermitea operators (also referred to as observables ) such

More information

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if LECTURE 14 NOTES 1. Asymptotic power of tests. Defiitio 1.1. A sequece of -level tests {ϕ x)} is cosistet if β θ) := E θ [ ϕ x) ] 1 as, for ay θ Θ 1. Just like cosistecy of a sequece of estimators, Defiitio

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula Joural of Multivariate Aalysis 102 (2011) 1315 1319 Cotets lists available at ScieceDirect Joural of Multivariate Aalysis joural homepage: www.elsevier.com/locate/jmva Superefficiet estimatio of the margials

More information

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero?

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero? 2 Lebesgue Measure I Chapter 1 we defied the cocept of a set of measure zero, ad we have observed that every coutable set is of measure zero. Here are some atural questios: If a subset E of R cotais a

More information

A Hadamard-type lower bound for symmetric diagonally dominant positive matrices

A Hadamard-type lower bound for symmetric diagonally dominant positive matrices A Hadamard-type lower boud for symmetric diagoally domiat positive matrices Christopher J. Hillar, Adre Wibisoo Uiversity of Califoria, Berkeley Jauary 7, 205 Abstract We prove a ew lower-boud form of

More information

Lecture 3: August 31

Lecture 3: August 31 36-705: Itermediate Statistics Fall 018 Lecturer: Siva Balakrisha Lecture 3: August 31 This lecture will be mostly a summary of other useful expoetial tail bouds We will ot prove ay of these i lecture,

More information

Optimization Methods MIT 2.098/6.255/ Final exam

Optimization Methods MIT 2.098/6.255/ Final exam Optimizatio Methods MIT 2.098/6.255/15.093 Fial exam Date Give: December 19th, 2006 P1. [30 pts] Classify the followig statemets as true or false. All aswers must be well-justified, either through a short

More information

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d

Linear regression. Daniel Hsu (COMS 4771) (y i x T i β)2 2πσ. 2 2σ 2. 1 n. (x T i β y i ) 2. 1 ˆβ arg min. β R n d Liear regressio Daiel Hsu (COMS 477) Maximum likelihood estimatio Oe of the simplest liear regressio models is the followig: (X, Y ),..., (X, Y ), (X, Y ) are iid radom pairs takig values i R d R, ad Y

More information

On Random Line Segments in the Unit Square

On Random Line Segments in the Unit Square O Radom Lie Segmets i the Uit Square Thomas A. Courtade Departmet of Electrical Egieerig Uiversity of Califoria Los Ageles, Califoria 90095 Email: tacourta@ee.ucla.edu I. INTRODUCTION Let Q = [0, 1] [0,

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Brief Review of Functions of Several Variables

Brief Review of Functions of Several Variables Brief Review of Fuctios of Several Variables Differetiatio Differetiatio Recall, a fuctio f : R R is differetiable at x R if ( ) ( ) lim f x f x 0 exists df ( x) Whe this limit exists we call it or f(

More information

Sequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence

Sequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece 1, 1, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet

More information

1 Introduction to reducing variance in Monte Carlo simulations

1 Introduction to reducing variance in Monte Carlo simulations Copyright c 010 by Karl Sigma 1 Itroductio to reducig variace i Mote Carlo simulatios 11 Review of cofidece itervals for estimatig a mea I statistics, we estimate a ukow mea µ = E(X) of a distributio by

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

Sequences. Notation. Convergence of a Sequence

Sequences. Notation. Convergence of a Sequence Sequeces A sequece is essetially just a list. Defiitio (Sequece of Real Numbers). A sequece of real umbers is a fuctio Z (, ) R for some real umber. Do t let the descriptio of the domai cofuse you; it

More information

Sequences and Series of Functions

Sequences and Series of Functions Chapter 6 Sequeces ad Series of Fuctios 6.1. Covergece of a Sequece of Fuctios Poitwise Covergece. Defiitio 6.1. Let, for each N, fuctio f : A R be defied. If, for each x A, the sequece (f (x)) coverges

More information

TEACHER CERTIFICATION STUDY GUIDE

TEACHER CERTIFICATION STUDY GUIDE COMPETENCY 1. ALGEBRA SKILL 1.1 1.1a. ALGEBRAIC STRUCTURES Kow why the real ad complex umbers are each a field, ad that particular rigs are ot fields (e.g., itegers, polyomial rigs, matrix rigs) Algebra

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

6. Kalman filter implementation for linear algebraic equations. Karhunen-Loeve decomposition

6. Kalman filter implementation for linear algebraic equations. Karhunen-Loeve decomposition 6. Kalma filter implemetatio for liear algebraic equatios. Karhue-Loeve decompositio 6.1. Solvable liear algebraic systems. Probabilistic iterpretatio. Let A be a quadratic matrix (ot obligatory osigular.

More information

Lecture 19: Convergence

Lecture 19: Convergence Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may

More information

A Risk Comparison of Ordinary Least Squares vs Ridge Regression

A Risk Comparison of Ordinary Least Squares vs Ridge Regression Joural of Machie Learig Research 14 (2013) 1505-1511 Submitted 5/12; Revised 3/13; Published 6/13 A Risk Compariso of Ordiary Least Squares vs Ridge Regressio Paramveer S. Dhillo Departmet of Computer

More information

Math Solutions to homework 6

Math Solutions to homework 6 Math 175 - Solutios to homework 6 Cédric De Groote November 16, 2017 Problem 1 (8.11 i the book): Let K be a compact Hermitia operator o a Hilbert space H ad let the kerel of K be {0}. Show that there

More information

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So, 0 2. OLS Part II The OLS residuals are orthogoal to the regressors. If the model icludes a itercept, the orthogoality of the residuals ad regressors gives rise to three results, which have limited practical

More information

GUIDELINES ON REPRESENTATIVE SAMPLING

GUIDELINES ON REPRESENTATIVE SAMPLING DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue

More information

Rank tests and regression rank scores tests in measurement error models

Rank tests and regression rank scores tests in measurement error models Rak tests ad regressio rak scores tests i measuremet error models J. Jurečková ad A.K.Md.E. Saleh Charles Uiversity i Prague ad Carleto Uiversity i Ottawa Abstract The rak ad regressio rak score tests

More information

10-701/ Machine Learning Mid-term Exam Solution

10-701/ Machine Learning Mid-term Exam Solution 0-70/5-78 Machie Learig Mid-term Exam Solutio Your Name: Your Adrew ID: True or False (Give oe setece explaatio) (20%). (F) For a cotiuous radom variable x ad its probability distributio fuctio p(x), it

More information

1.3 Convergence Theorems of Fourier Series. k k k k. N N k 1. With this in mind, we state (without proof) the convergence of Fourier series.

1.3 Convergence Theorems of Fourier Series. k k k k. N N k 1. With this in mind, we state (without proof) the convergence of Fourier series. .3 Covergece Theorems of Fourier Series I this sectio, we preset the covergece of Fourier series. A ifiite sum is, by defiitio, a limit of partial sums, that is, a cos( kx) b si( kx) lim a cos( kx) b si(

More information

Measure and Measurable Functions

Measure and Measurable Functions 3 Measure ad Measurable Fuctios 3.1 Measure o a Arbitrary σ-algebra Recall from Chapter 2 that the set M of all Lebesgue measurable sets has the followig properties: R M, E M implies E c M, E M for N implies

More information

Math 61CM - Solutions to homework 3

Math 61CM - Solutions to homework 3 Math 6CM - Solutios to homework 3 Cédric De Groote October 2 th, 208 Problem : Let F be a field, m 0 a fixed oegative iteger ad let V = {a 0 + a x + + a m x m a 0,, a m F} be the vector space cosistig

More information

CSE 527, Additional notes on MLE & EM

CSE 527, Additional notes on MLE & EM CSE 57 Lecture Notes: MLE & EM CSE 57, Additioal otes o MLE & EM Based o earlier otes by C. Grat & M. Narasimha Itroductio Last lecture we bega a examiatio of model based clusterig. This lecture will be

More information

1 General linear Model Continued..

1 General linear Model Continued.. Geeral liear Model Cotiued.. We have We kow y = X + u X o radom u v N(0; I ) b = (X 0 X) X 0 y E( b ) = V ar( b ) = (X 0 X) We saw that b = (X 0 X) X 0 u so b is a liear fuctio of a ormally distributed

More information

MEASURES OF DISPERSION (VARIABILITY)

MEASURES OF DISPERSION (VARIABILITY) POLI 300 Hadout #7 N. R. Miller MEASURES OF DISPERSION (VARIABILITY) While measures of cetral tedecy idicate what value of a variable is (i oe sese or other, e.g., mode, media, mea), average or cetral

More information

CS284A: Representations and Algorithms in Molecular Biology

CS284A: Representations and Algorithms in Molecular Biology CS284A: Represetatios ad Algorithms i Molecular Biology Scribe Notes o Lectures 3 & 4: Motif Discovery via Eumeratio & Motif Represetatio Usig Positio Weight Matrix Joshua Gervi Based o presetatios by

More information

Lecture 12: September 27

Lecture 12: September 27 36-705: Itermediate Statistics Fall 207 Lecturer: Siva Balakrisha Lecture 2: September 27 Today we will discuss sufficiecy i more detail ad the begi to discuss some geeral strategies for costructig estimators.

More information

MAT1026 Calculus II Basic Convergence Tests for Series

MAT1026 Calculus II Basic Convergence Tests for Series MAT026 Calculus II Basic Covergece Tests for Series Egi MERMUT 202.03.08 Dokuz Eylül Uiversity Faculty of Sciece Departmet of Mathematics İzmir/TURKEY Cotets Mootoe Covergece Theorem 2 2 Series of Real

More information

7 Sequences of real numbers

7 Sequences of real numbers 40 7 Sequeces of real umbers 7. Defiitios ad examples Defiitio 7... A sequece of real umbers is a real fuctio whose domai is the set N of atural umbers. Let s : N R be a sequece. The the values of s are

More information