arxiv: v2 [stat.me] 3 Nov 2014

Size: px
Start display at page:

Download "arxiv: v2 [stat.me] 3 Nov 2014"

Transcription

1 onarametric Stein-tye Shrinkage Covariance Matrix Estimators in High-Dimensional Settings Anestis Touloumis Cancer Research UK Cambridge Institute University of Cambridge Cambridge CB2 0RE, U.K. arxiv: v2 [stat.me] 3 ov 2014 Abstract Estimating a covariance matrix is an imortant task in alications where the number of variables is larger than the number of observations. In the literature, shrinkage aroaches for estimating a high-dimensional covariance matrix are emloyed to circumvent the limitations of the samle covariance matrix. A new family of nonarametric Stein-tye shrinkage covariance estimators is roosed whose members are written as a convex linear combination of the samle covariance matrix and of a redefined invertible target matrix. Under the Frobenius norm criterion, the otimal shrinkage intensity that defines the best convex linear combination deends on the unobserved covariance matrix and it must be estimated from the data. A simle but effective estimation rocess that roduces nonarametric and consistent estimators of the otimal shrinkage intensity for three oular target matrices is introduced. In simulations, the roosed Stein-tye shrinkage covariance matrix estimator based on a scaled identity matrix aeared to be u to 80% more efficient than existing ones in extreme high-dimensional settings. A colon cancer dataset was analyzed to demonstrate the utility of the roosed estimators. A rule of thumb for adhoc selection among the three commonly used target matrices is recommended. Keywords Covariance matrix, High-dimensional settings, onarametric estimation, Shrinkage estimation 1 Introduction The roblem of estimating large covariance matrices arises frequently in modern alications, such as in genomics, cancer research, clinical trials, signal rocessing, financial mathematics, attern recognition and comutational convex geometry. Formally, the goal is to estimate the covariance matrix Σ based on a samle of indeendent and identically distributed i.i.d) -variate random vectors X 1,..., X with mean vector µ in the small, large aradigm, that is when is a lot smaller comared to. It is a well-known fact that the samle covariance matrix S = 1 1 X i X)X i X) T, i=1 where X = i=1 X i/ is the samle mean vector, is not erforming satisfactory in high-dimensional settings. For examle, S is singular even when Σ is a strictly ositive definite matrix. Recent research in estimating high-dimensional covariance matrices includes banding, taering, enalization and shrinkage methods. We focus on the Steinian shrinkage method Stein, 1956) as adoted by Ledoit and Wolf 2004) because it leads to covariance matrix estimators that are: i) non-singular ii) well-conditioned, iii) invariant to ermutations of the order of the variables, iv) consistent to deartures from a multivariate normal model, v) not necessarily sarse, vi) exressed in closed form and vii) comutationally chea regardless of. Ledoit and Wolf 2004) roosed a Stein-tye covariance matrix estimator for Σ based on S = 1 λ)s + λνi, 1) 1

2 where I is the identity matrix, and where λ and ν minimize the risk function E [ S Σ 2 ] F, that is λ = E [ S Σ 2 ] F E [ ] S νi 2 F and ν = trσ). The otimal shrinkage intensity arameter λ in 1) suggests how much we must shrink the eigenvalues of the samle covariance matrix S towards the eigenvalues of the target matrix νi. For examle, λ = 0 imlies no contribution of νi to S, while λ = 1 imlies no contribution of S to S. Intermediate values for λ reveal the simultaneous contribution of S and νi to S. Desite the attractive interretation, S is not a covariance matrix estimator because ν and λ deend on the unobservable covariance matrix Σ. For this reason, Ledoit and Wolf 2004) roosed to lug-in nonarametric -consistent estimators for ν and λ in 1) and use the resulting matrix as a shrinkage covariance matrix estimator for Σ. Although ν seems to be adequately estimated by ˆν = trs)/, we noticed via simulations that the estimator of λ roosed by Ledoit and Wolf 2004) was biased in extreme high-dimensional settings and when Σ = I. This is counter-intuitive because λ = 1 and the lug-in estimator of S is exected to be as close as ossible to the target matrix νi. In addition, this observation underlines the imortance of choosing a target matrix that aroximates well the true underlying deendence structure. To this direction, Fisher and Sun 2011) roosed Stein-tye shrinkage covariance matrix estimators for alternative target matrices. However, they are no longer nonarametric as their construction was based on a multivariate normal model assumtion. Motivated by the above, we imrove estimation of the otimal shrinkage intensity by roviding a consistent estimator of λ in high-dimensional settings. To construct the estimator of λ we follow three simle stes: i) exand the exectations in the numerator and denominator of λ assuming a multivariate normal model, ii) rove that this ratio, say λ, is asymtotically equivalent to λ, and iii) relace each unknown arameter in λ with unbiased and consistent estimators constructed using U-statistics. The last ste is essential in our roosal so as to ensure consistent and nonarametric estimation of λ. Further, we relax the normality assumtion in Fisher and Sun 2011) for target matrices other than νi in 1) and we illustrate how to estimate consistently the corresonding otimal shrinkage intensities in high-dimensional settings. In other words, we roose a new nonarametric family of Stein-tye shrinkage estimators suitable for high-dimensional settings that reserve the attractive roerties mentioned in the first aragrah and can accommodate arbitrary target matrices. The rest of this aer is organized as follows. In Section 2, we resent the working framework that allows us to manage the high-dimensional setting. Section 3 contains the main results where we derive consistent and nonarametric estimators for the otimal shrinkage intensity of three different target matrices. We evaluate the erformance of the roosed covariance matrix estimators via simulations in Section 4. In Section 5, we illustrate the use of the roosed estimators in a colon cancer study and we recommend a rule of thumb for selecting the target matrix. In Section 6, we summarize our findings and discuss future research. The technical details can be found in the aendix. Throughout the aer, we use A 2 F = trat A)/ to denote the scaled Frobenius norm of A, tra) to denote the trace of the matrix A, D A to denote the diagonal matrix with elements the diagonal elements of A, and A B to denote the Hadamard roduct of the matrices A and B, i.e., the matrix whose a, b)-th element is the roduct of the corresonding elements of A and B. In the above, it is imlicit that A and B are matrices. 2 Framework for High-Dimensional Settings Let X 1,..., X be a samle of i.i.d. -variate random vectors from the nonarametric model X i = Σ 1/2 Z i + µ, 2) where µ = E[X i ] is the -variate mean vector, Σ = cov[x i ] = Σ 1/2 Σ 1/2 is the covariance matrix, and Z 1,..., Z is a collection of i.i.d. -variate random vectors. Instead of distributional assumtions, 2

3 moments restrictions are imosed on the random variables in Z i. In articular, let Z ia be the a-th random variable in Z i and suose that E[Z ia ] = 0, E[Zia 2 ] = 1, E[Z4 ia ] = 3 + B with 2 B < and for any nonnegative integers l 1,..., l 4 such that 4 ν=1 l ν 4 E[Z l 1 ia1 Z l 2 ia2 Z l 3 ia3 Z l 4 ia4 ] = E[Z l 1 ia1 ]E[Z l 2 ia2 ]E[Z l 2 ia3 ]E[Z l 4 ia4 ], 3) where the indexes a 1,..., a 4 are distinct. The nonarametric model 2) includes the -variate normal distribution µ, Σ) as a secial case obtained if Z ia are i.i.d. 0, 1) random variables. Since B = 0 under a multivariate normal model, B can be interreted as a measure of dearture of the fourth moment of Z ia to that of a 0, 1) random variable. The assumtion of common fourth moments is made for notational ease and the results of this aer remain valid even if E[Z 4 ia ] = 3 + B a for finite B a a = 1,..., ). Model 2) assumes that X i is a linear combination of Z i, a random vector that contains standardized white noise variables. Unlike the usual definition of white noise, model 2) makes no distributional assumtions for Z i and it allows deendence atterns given that the seudoindeendence condition 3) holds. Therefore, the working framework can cover situations in which the white noise mechanism does not roduce indeendent and/or identically distributed random variables. To handle the high-dimensional setting, we restrict the dimension of Σ by assuming that as, = ), trσ4 ) tr 2 Σ 2 ) t 1 and trσ2 ) tr 2 Σ) t 2, 4) with 0 t 2 t 1 1. This flexible assumtion does not secify the limiting behavior of with resect to, thus including the case where / is bounded Ledoit and Wolf, 2004). At the same time, it does not seriously restrict the class of covariance matrices for Σ that satisfy assumtion 4). For examle, members of this class are covariance matrices whose eigenvalues are bounded away from 0 and Chen et al., 2010), banded first-order auto-regressive covariance matrices such that Σ = {σ a σ b ρ a b I a b k)} 1 a,b with 1 ρ 1, σ a and σ b bounded ositive constants and 1 k Chen et al., 2010), covariance matrices that have a few divergent eigenvalues as long as they diverge slowly Chen and Qin, 2010), and covariance matrices with a comound symmetry correlation attern, that is Σ = {σ a σ b ρ} 1 a,b. Model 2) and assumtion 4) constitute an attractive working framework to handle the small, large aradigm. A similar but stricter working framework was considered by Chen et al. 2010) in the context of hyothesis testing for Σ. By contrast, we avoid making assumtions about moments of fifth or higher order and we allow more otions for Σ. 3 Main Results Under the nonarametric model 2), the exectations in the numerator and denominator of the otimal shrinkage intensity λ in 1) can be exlicitly calculated to obtain that λ = E [ S Σ 2 ] F E [ trσ 2 ) + tr ] 2 Σ) + BtrD 2 Σ S νi 2 = ) F trσ 2 ) + +1 tr 2 Σ) + BtrD 2 Σ ). Since trd 2 A ) = tra A) tra2 ) tr 2 A) for any ositive definite matrix A, the contribution of BtrD 2 Σ ) to λ under assumtion 4) is negligible when comared to that of trσ2 ) or tr 2 Σ). Ignore BtrD 2 Σ ) in λ and define λ trσ 2 ) + tr 2 Σ) = trσ 2 ) + +1 tr 2 Σ). 3

4 This is the otimal shrinkage intensity of the multivariate normal model or, more generally, λ = λ when B = 0 in the nonarametric model 2). Under assumtion 4), it follows that ) 1) trσ 2 ) 1 λ λ tr2 Σ) = ) trσ 2 ) 1 tr2 Σ) + +1 tr2 Σ) trσ 2 ) 1 tr2 Σ) trσ 2 ) tr 2 Σ) 1 B trd2 Σ ) B trd ) 2 Σ ) tr 2 Σ) + +1 tr2 Σ) + BtrD 2 Σ ) ) B trd2 Σ ) tr 2 Σ) where k denotes the absolute value of the real number k. Therefore, the otimal shrinkage intensity obtained under normality is asymtotically equivalent to the otimal shrinkage intensity with resect to model 2) as long as the trace ratios restrictions in 4) hold. This means that it suffices to construct a nonarametric and consistent estimator of λ to estimate λ. To accomlish this, relace the unknown arameters trσ) and trσ 2 ) in λ with the unbiased estimators 0, Y 1 = U 1 U 4 = 1 i=1 X T i X i 1 P 2 X T j X i i j and Y 2 = U 2 2U 5 + U 6 = 1 P2 X T i X j ) P3 i j i j k X T i X j X T i X k + 1 P 4 i j k l X i X T j X k X T l resectively, where Pt s = s!/s t)! and denotes summation over mutually distinct indices. ote that U 1 and U 2 are the unbiased estimators of trσ) and trσ 2 ) resectively, when the data are centered around the mean vector i.e., µ = 0), while the remaining terms U 4,U 5 and U 6 ) are U- statistics of second, third and fourth order that ensure the unbiasedness of Y 1 and Y 2 when µ 0. In B, we argue that Y 1 and Y 2 are ratio-consistent estimators to trσ) and trσ 2 ) resectively. Here, it should be noted a statistic ˆθ is called a ratio-consistent estimator to θ if ˆθ/θ converges in robability to one. Therefore, it follows from the continuous maing theorem that ˆλ = Y 2 + Y 2 1 Y Y 2 1 is a consistent estimator of λ. The roosed Stein-tye shrinkage estimator for Σ, Ŝ = 1 ˆλ)S + ˆλˆνI, is obtained by lugging-in ˆν = Y 1 / and ˆλ in 1). 3.1 Alternative target matrices ext, we consider target matrices other than νi in 1). This extension is motivated by situations where λ 1 and νi is not a good aroximation of Σ. In this case, Ŝ remains a well-defined and non-singular covariance matrix estimator but it fails to reflect the underlying deendence structure. Let T be a well-conditioned and non-singular target matrix, and define the matrix S T = 1 λ T )S + λ T T. 5) 4

5 Simle algebraic maniulation shows that the otimal shrinkage intensity λ T = E [ S Σ 2 ] F + E [tr{s Σ)Σ T)}] / E [ ] S T 2 6) F minimizes the exected risk function E[ S T Σ 2 F ]. A closed form solution for λ T can be derived by calculating the exectations in 6) with resect to model 2). The key idea of our roosal is to simlify the estimation rocess for λ T by identifying terms in the numerator and denominator of λ T that can be safely ignored under assumtion 4). One aroach is to examine whether the otimal shrinkage intensity under the multivariate normal model assumtion is asymtotically equivalent to λ T. Whenever this is the case, we can set B = 0 in λ T and relace the remaining arameters with unbiased and ratio-consistent estimators to obtain ˆλ T. The roosed Stein-tye shrinkage covariance matrix estimator is Ŝ T = 1 ˆλ T )S + ˆλ T T. ote that if we set T = νi then λ T = λ and thus S T = S. We rovide the estimator of λ T for two target matrices: i) the identity matrix T = I, and ii) the diagonal matrix T = D S whose diagonal elements are the samle variances. To guarantee the consistency of λ T, we suose that t 1 = t 2 = 0 in 4). This assumtion does not heavily affects the class of Σ under consideration. In fact, all the deendence atterns mentioned in Section 2 satisfy this stronger version of assumtion 4) excet the comound symmetry correlation matrix. We believe that this is a small rice to ay if we are willing to increase the range of the target matrices in 1). First, when T = I in 5) the otimal shrinkage intensity in 6) becomes λ I = trσ 2 ) + tr 2 Σ) + BtrD 2 Σ ) trσ 2 ) + tr 2 Σ) + 1)tr [Σ I ) 2 ] + BtrD 2 Σ ). As before, we can ignore the terms BtrD 2 Σ ) in λ I and rove that ˆλ I = Y 2 + Y 2 1 Y 2 + Y 2 1 1)2Y 1 ) is a consistent estimator of λ I. When T = D S, we can use the results in A and rove that the otimal shrinkage intensity in 6) is λ D = E [ S Σ 2 ] F + E [tr{s Σ)Σ DS )}] / E [ ] S D S 2 F E[trS 2 )] trσ 2 ) ) + E[trΣD S )] E[trSD S )]) = where Σ 1/2 ab = E[trS 2 )] E[trD 2 S )] [ trσ 2 ) + tr 2 Σ) 2trD 2 Σ ) + B trd 2 Σ ) a=1 b=1 trσ 2 ) + tr 2 Σ) + 1)trD 2 Σ ) + B [ trd 2 Σ ) a=1 b=1 Σ 1/2 ab ) 4 ] Σ 1/2 ab ) 4 ], [ denotes the a, b)-th element of Σ 1/2. It can be shown that B trd 2 Σ ) a=1 b=1 has negligible contribution to λ D and that Σ 1/2 ab ) 4 ] Y 3 = U 3 2U 7 + U 8 = 1 P2 trx i X T i X j X T j ) 2 1 P3 i j i j k trx i X T i X j X T k ) + 1 P 4 i j k l trx i X T j X k X T l ) is an unbiased and ratio-consistent estimator to trd 2 Σ ). The construction of Y 3 is closely related to that of Y 1 and Y 2. To see this, note that Y 3 is a linear combination of three U-statistics, U 3, 5

6 the unbiased estimator of trd 2 Σ ) when µ = 0, and U 7 and U 8, which make the bias of Y 3 zero when µ 0. Hence, a consistent estimator of λ D is 3.2 Remarks ˆλ D = Y 2 + Y1 2 2Y 3 Y 2 + Y )Y. 3 There is no need to account for the mean vector when the random vectors are centered. In this case, the roosed shrinkage covariance matrix estimators can be obtained by relacing 1 with in the formula for ˆλ, ˆλ I or ˆλ D, the samle covariance matrix S with i=1 X ix T i / and the statistics Y 1, Y 2 and Y 3 with U 1, U 2 and U 3 resectively. The last modification is due to the fact that U 1, U 2 and U 3 are unbiased and ratio-consistent estimators to the targeted arameters when µ = 0. Fisher and Sun 2011) derived Stein-tye shrinkage covariance matrix estimators for the three target matrices considered herein under a multivariate normal model. We emhasize that our estimators differ in three imortant asects. First, the consistency of the roosed estimators for the otimal shrinkage intensities λ, λ I or λ D is not sensitive to deartures of the normality assumtion. Second, trσ 2 ) and trd Σ ) are estimated using U-statistics and are not based on the samle covariance matrix S as in Fisher and Sun 2011). Consequently, we avoid terms such as X T i X i) 2 or X 4 ia, which allows us to control their asymtotic variance. Third, the class of covariance matrices under consideration in Fisher and Sun 2011) is different. They require the first four arithmetic means of the eigenvalues of Σ to converge while we lace trace ratios restrictions on Σ. 3.3 Software imlementation The R R Core Team, 2014) language ackage Shrinkcovmat imlements the roosed Stein-tye shrinkage covariance estimators and it is available at htt://cran.r-roject.org/web/ackages/shrinkcovmat. The core functions shinkcovmat.equal, shinkcovmat.identity and shinkcovmat.unequal rovide the roosed shrinkage covariance matrix estimators when T = νi, T = I and T = D S resectively. The statistics Y 1, Y 2 and Y 3 are calculated using the comutationally efficient formulas given in C. To modify the shrinkage estimators when µ = 0, one should set the argument centered=true in the core functions. 4 Simulations We carried out a simulation study to investigate the erformance of the roosed Stein-tye covariance matrix estimators. The -variate random vectors X 1,...,X were generated according to model 2), where we emloyed the following three distributional scenarios regarding Z 1,..., Z : 1. A normality scenario, in which Z ia i.i.d 0, 1). 2. A gamma scenario, in which Z ia = Z ia 8)/4, Z ia i.i.d Gamma4, 0.5) and thus B = A mixture of Scenarios 1 and 2, in which the first /2 elements of Z i are distributed according to Scenario 1 and the remaining elements according to Scenario 2. Scenarios 2 and 3 were used to emirically verify the nonarametric nature of the roosed methodology. To mimic high-dimensional settings, we let range from 10 to 100 with increments of 10 and we let = 100, 500, 1000, 1500 and The roosed family of covariance estimators was evaluated using Ŝ, the roosed shrinkage covariance matrix estimator when T = νi, which was then comared to Ŝ LW and Ŝ F S, the corresonding shrinkage covariance matrix estimator roosed by Ledoit and Wolf 2004) and Fisher and Sun 2011) resectively. As in Ledoit and Wolf 2004), we assumed that µ = 0 and we made the necessary adjustments to Ŝ and Ŝ F S. Since Ŝ F S was constructed under a multivariate normal model assumtion, its erformance was evaluated only in samling schemes that involved Scenario 1. We excluded the shrinkage estimator of Schäfer and Strimmer 2005) in 6

7 Table 1: Estimation of the otimal shrinkage intensity λ = 1 under Scenario 1 when Σ = I. ˆλ ˆλLW ˆλF S Mean S.E. Mean S.E. Mean S.E our simulation studies because they do not guarantee consistent estimation of λ in high-dimensional settings. Given Σ,, and the distributional scenario, we draw 1000 relicates based on which we calculated the simulated ercentage relative imrovement in average loss SPRIAL) of Ŝ and of Ŝ F S for estimating Σ. Formally, the SPRIAL criterion of ˆΣ was defined as SPRIAL ˆΣ) = 1000 b=1 Ŝ LW,b Σ 2 F 1000 b=1 ˆΣ b Σ 2 F 1000 b=1 Ŝ LW,b Σ 2 F 100%, where Ŝ LW,b denotes the estimator of Ledoit and Wolf 2004) at relicate b b = 1,..., 1000) and ˆΣ b denotes the corresonding estimator of the cometing estimation rocess that generates the covariance matrix estimator ˆΣ. By definition, SPRIALΣ) = 100% and SPRIALŜ LW ) = 0%. Therefore, ositive negative) values of SPRIAL ˆΣ) imly that ˆΣ is a more less) efficient covariance matrix estimator than Ŝ LW while values around zero imly that the two estimators were equally efficient. ote that we treated Ŝ LW as the baseline estimator because Ledoit and Wolf 2004) have already established its efficiency over the samle covariance matrix S. To exlore the situation in which the target matrix equals the true covariance matrix, we set Σ = I. In this case, accurate estimation of λ is crucial due to the fact that λ = 1 regardless of and. Table 1 contains the simulation results under Scenario 1 for the mean of the estimated otimal shrinkage intensities based on the roosed method ˆλ), the aroach of Ledoit and Wolf 2004) ˆλ LW ) and the aroach of Fisher and Sun 2011) ˆλ F S ) when = 10, 50, 100 and = 100, 1000, As required, ˆλ, ˆλLW and ˆλ F S were all restricted to lie on the unit interval. Comared to ˆλ LW, ˆλ aeared to be more accurate in estimating λ as it was substantially less biased and with slightly smaller standard error for all and. Although the bias of ˆλ LW decreased as increased, ˆλ LW seemed to be biased downwards even when = 100. These trends also occurred for Scenarios 2 and 3, and for this reason the results are omitted. As exected, no significant difference was noticed between ˆλ and ˆλ F S under Scenario 1. Figure 1 dislays SPRIALŜ ) under Scenario 2 - similar atterns were observed for the other two distributional scenarios. Clearly, Ŝ was more effective than Ŝ LW for % 97.81%) and for = 100 and % 61.16%). We should mention that Ŝ and Ŝ F S were equally efficient, meaning that SPRIALŜ ) and SPRIALŜ F S ) took similar values. To investigate the erformance of Ŝ when Σ deviates slightly from the target matrix, we emloyed a tridiagonal correlation matrix for Σ in which the non-zero off-diagonal elements were all equal to 0.1. Figure 2 suggests that Ŝ outerformed Ŝ LW in extreme high-dimensional settings 50) under Scenario 3. The efficiency gains in using Ŝ instead of Ŝ LW decreased at a much faster rate than before as increased. ote that similar trends were observed under Scenarios 1 and 2 results not shown). ext, we considered deendence structures such that νi is a rather oor aroximation of Σ. First, we defined Σ as a first order autoregressive correlation matrix with a, b)-th element Σ ab = 0.5 a b. Although the results in Table 2 imly that ˆλ is a more accurate estimator of λ than ˆλ LW, 7

8 SPRIAL Figure 1: SPRIALŜ ) under Scenario 2 when Σ = I and = 100 symbols), symbols), 1000 symbols), 1500 x symbols) or 2500 symbols). SPRIAL Figure 2: SPRIALŜ ) under Scenario 3 when Σ is a tridiagonal correlation matrix with a, b)-th element Σ ab = 0.1I a b ) and = 100 symbols), symbols), 1000 symbols), 1500 x symbols) or 2500 symbols). Figure 3 suggests that Ŝ and Ŝ LW were almost equally efficient as soon as = 30 and regardless of the dimensionality. We reached the same conclusions when we let Σ to be either a ositive definite matrix whose eigenvalues were drawn from the uniform distribution U0.5, 10) or a block diagonal covariance matrix such that the dimension of each block matrix is /4 /4 and the eigenvalues of these 4 block matrices were drawn from U0.5, 5), U5, 10), U10, 20) and U0.5, 100) resectively. In articular, ˆλ seemed to outerformed ˆλ LW for all configurations of,, Σ) while SPRIALŜ ) was close to zero unless 30. Hence, if the target matrix νi fails to describe adequately the deendence structure, we exect Ŝ to be more efficient than Ŝ LW only for small samle sizes. Further, we adoted a comound symmetry correlation form for Σ with correlation arameter ρ = 0.5. This is the only configuration of Σ where t 1 0 and t 2 0 in assumtion 4). The estimator ˆλ LW aeared to be less biased but with larger standard error than ˆλ, while the SPRIALŜ ) was always close to zero. The erformance of Ŝ and Ŝ LW was comarable across the related samling schemes. We also evaluated the erformance of the inverse of the cometing Stein-tye shrinkage covariance matrix estimators using a modified version of the SPRIAL criterion, that is we relaced Σ, Σ and Ŝ LW with their corresonding inverses in SPRIAL ˆΣ). The inverse of Ŝ was more efficient that of Ŝ LW when Σ was equal to the identity matrix or to the tridiagonal correlation matrix, and quite surrisingly when Σ satisfied the comound symmetry correlation attern and 50, as shown in Figure 4. In all other samling schemes, the inverses of Ŝ and Ŝ LW were comarable. 8

9 Table 2: Estimation of λ under Scenario 1 when Σ satisfies a first-order auto-regressive correlation form with correlation arameter ρ = 0.5. ˆλ ˆλLW ˆλF S λ Mean S.E. Mean S.E. Mean S.E SPRIAL Figure 3: SPRIALŜ ) under Scenario 2 when Σ satisfies a first-order autoregressive form with correlation arameter ρ = 0.5 and = 100 symbols) or 2500 symbols). Based on our simulations, SPRIALŜ ) was an increasing function of and a decreasing function of while keeing the remaining arameters fixed. This indicates that, comared to Ŝ LW, Ŝ is an imroved estimator of Σ in extreme high-dimensional settings. The nonarametric nature of Ŝ was emirically verified because both the SPRIAL criterion and the bias of ˆλ seemed to remain constant across the three distributional scenarios. In addition, we find reassuring that the simulation results for ˆλ and ˆλ F S and for Ŝ and Ŝ F S were almost identical under a multivariate normal distribution. To this end, we also believe that the above trends together with the fact that we have not encountered a situation in which Ŝ was erforming significantly worse than Ŝ LW are favoring the consistency of the roosed covariance matrix estimators. 5 Emirical Study: Colon Cancer Study Alon et al. 1999) described a colon cancer study where exression levels for 2000 genes were measured on 40 normal and on 22 colon tumor tissues. The dataset is available at htt://genomicsubs.rinceton.edu/oncology/affydata. As in Fisher and Sun 2011), we aly a logarithmic base 10) transformation to the exression levels and sort the genes based on the between-grou to within-grou sum of squares BW) selection criterion Dudoit et al., 2002). In the literature, it has been suggested to estimate the covariance matrix of the genes using a subset of the to genes see, e.g., Fisher and Sun, 2011). With this in mind, we lan to estimate the covariance matrix of the normal and of the colon cancer grou for subsets of the to genes, where = 250, 500, 750, 1000, 1250, 1500, 1750 and The Quantile-Quantile lots in Figure 5 raise concerns regarding the assumtion that the 4 to 9

10 SPRIAL Figure 4: SPRIAL criterion for the inverse of Ŝ under Scenario 3 when Σ satisfies a comound symmetry correlation matrix and = 100 symbols) or 2500 symbols) Figure 5: Quantile-Quantile lots for the exression levels of the 4 to genes according to the BW selection criterion. The to anel corresonds to the normal grou and the bottom anel to the colon cancer grou. genes are marginally normally distributed. Since similar atterns occur for the remaining genes, we conclude that a multivariate normal model is not likely to hold and nonarametric covariance matrix estimation is required. For this urose, we calculate Ŝ, Ŝ I and Ŝ D for both grous and for all values of. The otimal shrinkage intensity λ T in 5) is informative for the suitability of the selected target matrix T because it reveals its contribution to S T. If ˆλ, ˆλI and ˆλ D differ significantly, then it is meaningful to choose the target matrix with the largest estimated otimal shrinkage intensity. Otherwise, the selection of the target matrix can be based on ˆν and r, the range of the samle variances. In articular, we suggest to emloy I when ˆν is close to 1.00, D S when r is large say more than one unit so as to account for the samling variability), and νi when neither of these seems lausible. Table 3 dislays the estimated otimal shrinkage intensities for the roosed family of shrinkage covariance estimators, ˆν and r in both grous. The estimates of λ and λ D are very similar in both grous for all subsets of genes and larger than those of λ I. This suggests that D S and νi are better otions than I as a target matrix. Since r aears to be relatively small and constant across the to genes in both grous, it seems sensible to set T = νi for all. Therefore, we recommend using Ŝ to estimate the covariance matrix of the genes in the normal and in the colon cancer grou and regardless of. 10

11 Table 3: The estimates of λ, λ I, λ D, ν and the range of the samle variances r) in the colon cancer dataset. Grou Statistic ormal ˆλ ˆν ˆλ D r ˆλ I Colon ˆλ ˆν ˆλ D r ˆλ I Discussion We roosed a new family of nonarametric Stein-tye shrinkage estimators for a high-dimensional covariance matrix. This family is based on imroving the nonarametric estimation of the otimal shrinkage intensity for three commonly used target matrices at the exense of imosing mild restrictions for the covariance matrix Σ. In our simulations, the roosed shrinkage covariance matrix estimator Ŝ was more recise than that of Ledoit and Wolf 2004) for estimating Σ, esecially when the number of variables is extremely large comared to the samle size and/or νi is a good aroximation of Σ. Unsurrisingly, the behavior of our estimators and that of Fisher and Sun 2011) were similar as long as an underlying multivariate normal model holds. However, we emhasize that our estimators are more flexible since they are robust to deartures from normality. In addition, we recommended a simle data-driven strategy for selecting the target matrix in 5). The main idea is to comare ˆλ, ˆλ I and ˆλ D and choose the target matrix with the largest estimated otimal shrinkage intensity. If these estimates are similar and ˆν is close to one, then it is sensible to use I as target matrix. Otherwise, νi should be selected when the samle variances are very close and D S when these vary. The R ackage ShrinkCovMat imlements the roosed estimators. Since Ŝ, Ŝ I and Ŝ D are all biased estimators of Σ, their risk functions might be unbounded esecially when none of the corresonding target matrices describes adequately the underlying deendence structure. In these situations, a more suitable target matrix T in 5) must be considered. The corresonding λ T should be estimated by following our guidelines in Section 3.1. If such a target matrix cannot be identified, one should choose among the alternative covariance matrix estimation methods mentioned in the Introduction. In future research, we aim to investigate formal rocedures for selecting the target matrix in 5) and to extend the roosed Steinian shrinkage aroach for estimating correlation matrices. A Useful Results We list six results with resect to model 2) that allows us to derive the formulas for the λ, λ I and λ D : 1. E[trS)] = trσ). 2. E[trS 2 )] = 1 trσ2 ) tr2 Σ) + B 1 trd2 Σ ). 3. E[tr 2 S)] = tr 2 Σ) trσ2 ) + B 1 trd2 Σ ). 4. E[trD S )] = trd Σ ). 11

12 5. E[trD 2 S )] = E[trSD S)] = +1 1 trd2 Σ ) + a, b)-th element of Σ 1/2. B 1 a=1 b=1 Σ 1/2 ab ) 4, where Σ 1/2 ab denotes the 6. E[trΣD S )] = trd 2 Σ ). B Consistency of Y 1, Y 2 and Y 3 ote that E[Y 1 ] = E[trS)] = trσ). Under assumtion 4), derivations in Chen et al. 2010) imly that [ ] Var[Y 1 ] 2 + max{0, B} 2 trσ 2 ) 3 + max{0, B} tr 2 + Σ) 1) tr 2 0. Σ) Thus Y 1 is a ratio-consistent estimator to trσ). Similarly, it can be shown that Y 2 and Y 3 are unbiased estimators to trσ 2 ) and trd 2 Σ ) resectively, and that Var[Y 2]/tr 2 Σ 2 ) 0 and Var[Y 3 ]/tr 2 D 2 Σ ) 0 as. Hence, Y 2 and Y 3 are ratio-consistent estimators of trσ 2 ) and trd 2 Σ ). C Alternative Formulas for Y 1, Y 2 and Y 3 ote that Y 1 = U 1 U 4 = 1 Himeno and Yamada 2014) showed that i=1 X T i X i 1 P 2 X T j X i = trs). i j Y 2 = U 2 2U 5 + U 6 = 1 P2 X T i X j ) P3 + 1 P 4 = i j i j k l X i X T j X k X T l 1 2) 3) i j k X T i X j X T i X k [ 1) 2)trS 2 ) + tr 2 S) Q ] where Q = 1 1 [ Xi X) T X i X) ] 2. i=1 12

13 Also it can be shown that Y 3 = U 3 2U 7 + U 8 = 1 P2 trx i X T i X j X T j ) 2 1 P3 + 1 P 4 = 1 P P P 4 i j i j k l a=1 i j trx i X T j X k X T l ) X 2 iax 2 ja a=1 2 a=1 i=1 1 Xia) 1 2 i=1 j=i+1 i=1 j=i+1 X ia X ja 2 i j k X ia X ja trx i X T i X j X T k ) a=1 i j a=1 i j X 2 iax 2 ja X 3 iax ja Together these results reduce the comutational cost for Y 1, Y 2 and Y 3 from O 4 ) to O 2 ). References U. Alon,. Barkai, D. A. otterman, K. Gish, S. Ybarra, D. Mack, and A. J. Levine. Broad atterns of gene exression revealed by clustering analysis of tumor and normal colon tissues robed by oligonucleotide arrays. Proceedings of the ational Academy of Sciences of the United States of America, 96: , S.X. Chen and Y.L. Qin. A two-samle test for high-dimensional data with alications to gene-set testing. The Annals of Statistics, 38: , S.X. Chen, L.X. Zhang, and P.S. Zhong. Tests for high-dimensional covariance matrices. Journal of the American Statistical Association, 105: , S. Dudoit, J. Fridlyand, and T. P. Seed. Comarison of discrimination methods for the classification of tumors using gene exression data. Journal of the American Statistical Association, 97:77 87, T.J. Fisher and X. Sun. Imroved Stein-tye shrinkage estimators for the high-dimensional multivariate normal covariance matrix. Comutational Statistics and Data Analysis, 55: , T. Himeno and T. Yamada. Estimations for some functions of covariance matrix in high dimension under non-normality. Journal of Multivariate Analysis, 130:27 44, O. Ledoit and M. Wolf. A well-conditioned estimator for large-dimensional covariance matrices. Journal of Multivariate Analysis, 88: , R Core Team. R: A Language and Environment for Statistical Comuting. R Foundation for Statistical Comuting, Vienna, Austria, URL htt:// J. Schäfer and K. Strimmer. A shrinkage aroach to large-scale covariance matrix estimation and imlications for functional genomics. Statistical Alications in Genetics and Molecular Biology, 4: 32, C. Stein. Inadmissibility of the usual estimator for the mean of a multivariate normal distribution. Proceedings of the Third Berkeley Symosium on Mathematical Statistics and Probability, 1: ,

Estimation of the large covariance matrix with two-step monotone missing data

Estimation of the large covariance matrix with two-step monotone missing data Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo

More information

A Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression

A Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Journal of Modern Alied Statistical Methods Volume Issue Article 7 --03 A Comarison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Ghadban Khalaf King Khalid University, Saudi

More information

4. Score normalization technical details We now discuss the technical details of the score normalization method.

4. Score normalization technical details We now discuss the technical details of the score normalization method. SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules

More information

Estimating function analysis for a class of Tweedie regression models

Estimating function analysis for a class of Tweedie regression models Title Estimating function analysis for a class of Tweedie regression models Author Wagner Hugo Bonat Deartamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal

More information

On split sample and randomized confidence intervals for binomial proportions

On split sample and randomized confidence intervals for binomial proportions On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules

CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules. Introduction: The is widely used in industry to monitor the number of fraction nonconforming units. A nonconforming unit is

More information

Package ShrinkCovMat

Package ShrinkCovMat Type Package Package ShrinkCovMat Title Shrinkage Covariance Matrix Estimators Version 1.2.0 Author July 11, 2017 Maintainer Provides nonparametric Steinian shrinkage estimators

More information

General Linear Model Introduction, Classes of Linear models and Estimation

General Linear Model Introduction, Classes of Linear models and Estimation Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)

More information

Tests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)

Tests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test) Chater 225 Tests for Two Proortions in a Stratified Design (Cochran/Mantel-Haenszel Test) Introduction In a stratified design, the subects are selected from two or more strata which are formed from imortant

More information

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)

More information

Notes on Instrumental Variables Methods

Notes on Instrumental Variables Methods Notes on Instrumental Variables Methods Michele Pellizzari IGIER-Bocconi, IZA and frdb 1 The Instrumental Variable Estimator Instrumental variable estimation is the classical solution to the roblem of

More information

Covariance Matrix Estimation for Reinforcement Learning

Covariance Matrix Estimation for Reinforcement Learning Covariance Matrix Estimation for Reinforcement Learning Tomer Lancewicki Deartment of Electrical Engineering and Comuter Science University of Tennessee Knoxville, TN 37996 tlancewi@utk.edu Itamar Arel

More information

A New Asymmetric Interaction Ridge (AIR) Regression Method

A New Asymmetric Interaction Ridge (AIR) Regression Method A New Asymmetric Interaction Ridge (AIR) Regression Method by Kristofer Månsson, Ghazi Shukur, and Pär Sölander The Swedish Retail Institute, HUI Research, Stockholm, Sweden. Deartment of Economics and

More information

16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes

16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes Infinite Series 6.2 Introduction We extend the concet of a finite series, met in Section 6., to the situation in which the number of terms increase without bound. We define what is meant by an infinite

More information

Hotelling s Two- Sample T 2

Hotelling s Two- Sample T 2 Chater 600 Hotelling s Two- Samle T Introduction This module calculates ower for the Hotelling s two-grou, T-squared (T) test statistic. Hotelling s T is an extension of the univariate two-samle t-test

More information

Statics and dynamics: some elementary concepts

Statics and dynamics: some elementary concepts 1 Statics and dynamics: some elementary concets Dynamics is the study of the movement through time of variables such as heartbeat, temerature, secies oulation, voltage, roduction, emloyment, rices and

More information

Lower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data

Lower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data Quality Technology & Quantitative Management Vol. 1, No.,. 51-65, 15 QTQM IAQM 15 Lower onfidence Bound for Process-Yield Index with Autocorrelated Process Data Fu-Kwun Wang * and Yeneneh Tamirat Deartment

More information

substantial literature on emirical likelihood indicating that it is widely viewed as a desirable and natural aroach to statistical inference in a vari

substantial literature on emirical likelihood indicating that it is widely viewed as a desirable and natural aroach to statistical inference in a vari Condence tubes for multile quantile lots via emirical likelihood John H.J. Einmahl Eindhoven University of Technology Ian W. McKeague Florida State University May 7, 998 Abstract The nonarametric emirical

More information

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III AI*IA 23 Fusion of Multile Pattern Classifiers PART III AI*IA 23 Tutorial on Fusion of Multile Pattern Classifiers by F. Roli 49 Methods for fusing multile classifiers Methods for fusing multile classifiers

More information

arxiv: v1 [physics.data-an] 26 Oct 2012

arxiv: v1 [physics.data-an] 26 Oct 2012 Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch

More information

State Estimation with ARMarkov Models

State Estimation with ARMarkov Models Deartment of Mechanical and Aerosace Engineering Technical Reort No. 3046, October 1998. Princeton University, Princeton, NJ. State Estimation with ARMarkov Models Ryoung K. Lim 1 Columbia University,

More information

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process P. Mantalos a1, K. Mattheou b, A. Karagrigoriou b a.deartment of Statistics University of Lund

More information

A multiple testing approach to the regularisation of large sample correlation matrices

A multiple testing approach to the regularisation of large sample correlation matrices A multile testing aroach to the regularisation of large samle correlation matrices Natalia Bailey Queen Mary, University of London M. Hashem Pesaran University of Southern California, USA, and rinity College,

More information

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO) Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment

More information

An Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling

An Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling Journal of Modern Alied Statistical Methods Volume 15 Issue Article 14 11-1-016 An Imroved Generalized Estimation Procedure of Current Poulation Mean in Two-Occasion Successive Samling G. N. Singh Indian

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

Probability Estimates for Multi-class Classification by Pairwise Coupling

Probability Estimates for Multi-class Classification by Pairwise Coupling Probability Estimates for Multi-class Classification by Pairwise Couling Ting-Fan Wu Chih-Jen Lin Deartment of Comuter Science National Taiwan University Taiei 06, Taiwan Ruby C. Weng Deartment of Statistics

More information

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment

More information

Preconditioning techniques for Newton s method for the incompressible Navier Stokes equations

Preconditioning techniques for Newton s method for the incompressible Navier Stokes equations Preconditioning techniques for Newton s method for the incomressible Navier Stokes equations H. C. ELMAN 1, D. LOGHIN 2 and A. J. WATHEN 3 1 Deartment of Comuter Science, University of Maryland, College

More information

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management

More information

Supplementary Materials for Robust Estimation of the False Discovery Rate

Supplementary Materials for Robust Estimation of the False Discovery Rate Sulementary Materials for Robust Estimation of the False Discovery Rate Stan Pounds and Cheng Cheng This sulemental contains roofs regarding theoretical roerties of the roosed method (Section S1), rovides

More information

16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes

16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes Infinite Series 6. Introduction We extend the concet of a finite series, met in section, to the situation in which the number of terms increase without bound. We define what is meant by an infinite series

More information

HENSEL S LEMMA KEITH CONRAD

HENSEL S LEMMA KEITH CONRAD HENSEL S LEMMA KEITH CONRAD 1. Introduction In the -adic integers, congruences are aroximations: for a and b in Z, a b mod n is the same as a b 1/ n. Turning information modulo one ower of into similar

More information

CONVOLVED SUBSAMPLING ESTIMATION WITH APPLICATIONS TO BLOCK BOOTSTRAP

CONVOLVED SUBSAMPLING ESTIMATION WITH APPLICATIONS TO BLOCK BOOTSTRAP Submitted to the Annals of Statistics arxiv: arxiv:1706.07237 CONVOLVED SUBSAMPLING ESTIMATION WITH APPLICATIONS TO BLOCK BOOTSTRAP By Johannes Tewes, Dimitris N. Politis and Daniel J. Nordman Ruhr-Universität

More information

1. INTRODUCTION. Fn 2 = F j F j+1 (1.1)

1. INTRODUCTION. Fn 2 = F j F j+1 (1.1) CERTAIN CLASSES OF FINITE SUMS THAT INVOLVE GENERALIZED FIBONACCI AND LUCAS NUMBERS The beautiful identity R.S. Melham Deartment of Mathematical Sciences, University of Technology, Sydney PO Box 23, Broadway,

More information

ASYMPTOTIC RESULTS OF A HIGH DIMENSIONAL MANOVA TEST AND POWER COMPARISON WHEN THE DIMENSION IS LARGE COMPARED TO THE SAMPLE SIZE

ASYMPTOTIC RESULTS OF A HIGH DIMENSIONAL MANOVA TEST AND POWER COMPARISON WHEN THE DIMENSION IS LARGE COMPARED TO THE SAMPLE SIZE J Jaan Statist Soc Vol 34 No 2004 9 26 ASYMPTOTIC RESULTS OF A HIGH DIMENSIONAL MANOVA TEST AND POWER COMPARISON WHEN THE DIMENSION IS LARGE COMPARED TO THE SAMPLE SIZE Yasunori Fujikoshi*, Tetsuto Himeno

More information

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning TNN-2009-P-1186.R2 1 Uncorrelated Multilinear Princial Comonent Analysis for Unsuervised Multilinear Subsace Learning Haiing Lu, K. N. Plataniotis and A. N. Venetsanooulos The Edward S. Rogers Sr. Deartment

More information

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split A Bound on the Error of Cross Validation Using the Aroximation and Estimation Rates, with Consequences for the Training-Test Slit Michael Kearns AT&T Bell Laboratories Murray Hill, NJ 7974 mkearns@research.att.com

More information

COMMUNICATION BETWEEN SHAREHOLDERS 1

COMMUNICATION BETWEEN SHAREHOLDERS 1 COMMUNICATION BTWN SHARHOLDRS 1 A B. O A : A D Lemma B.1. U to µ Z r 2 σ2 Z + σ2 X 2r ω 2 an additive constant that does not deend on a or θ, the agents ayoffs can be written as: 2r rθa ω2 + θ µ Y rcov

More information

#A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS

#A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS #A64 INTEGERS 18 (2018) APPLYING MODULAR ARITHMETIC TO DIOPHANTINE EQUATIONS Ramy F. Taki ElDin Physics and Engineering Mathematics Deartment, Faculty of Engineering, Ain Shams University, Cairo, Egyt

More information

Positive decomposition of transfer functions with multiple poles

Positive decomposition of transfer functions with multiple poles Positive decomosition of transfer functions with multile oles Béla Nagy 1, Máté Matolcsi 2, and Márta Szilvási 1 Deartment of Analysis, Technical University of Budaest (BME), H-1111, Budaest, Egry J. u.

More information

Estimating Time-Series Models

Estimating Time-Series Models Estimating ime-series Models he Box-Jenkins methodology for tting a model to a scalar time series fx t g consists of ve stes:. Decide on the order of di erencing d that is needed to roduce a stationary

More information

Research Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI **

Research Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI ** Iranian Journal of Science & Technology, Transaction A, Vol 3, No A3 Printed in The Islamic Reublic of Iran, 26 Shiraz University Research Note REGRESSION ANALYSIS IN MARKOV HAIN * A Y ALAMUTI AND M R

More information

Universal Finite Memory Coding of Binary Sequences

Universal Finite Memory Coding of Binary Sequences Deartment of Electrical Engineering Systems Universal Finite Memory Coding of Binary Sequences Thesis submitted towards the degree of Master of Science in Electrical and Electronic Engineering in Tel-Aviv

More information

Approximating min-max k-clustering

Approximating min-max k-clustering Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost

More information

1 Extremum Estimators

1 Extremum Estimators FINC 9311-21 Financial Econometrics Handout Jialin Yu 1 Extremum Estimators Let θ 0 be a vector of k 1 unknown arameters. Extremum estimators: estimators obtained by maximizing or minimizing some objective

More information

arxiv: v2 [stat.me] 27 Apr 2018

arxiv: v2 [stat.me] 27 Apr 2018 Nonasymtotic estimation and suort recovery for high dimensional sarse covariance matrices arxiv:17050679v [statme] 7 Ar 018 1 Introduction Adam B Kashlak and Linglong Kong Deartment of Mathematical and

More information

High-dimensional Ordinary Least-squares Projection for Screening Variables

High-dimensional Ordinary Least-squares Projection for Screening Variables High-dimensional Ordinary Least-squares Projection for Screening Variables Xiangyu Wang and Chenlei Leng arxiv:1506.01782v1 [stat.me] 5 Jun 2015 Abstract Variable selection is a challenging issue in statistical

More information

Estimation of Separable Representations in Psychophysical Experiments

Estimation of Separable Representations in Psychophysical Experiments Estimation of Searable Reresentations in Psychohysical Exeriments Michele Bernasconi (mbernasconi@eco.uninsubria.it) Christine Choirat (cchoirat@eco.uninsubria.it) Raffaello Seri (rseri@eco.uninsubria.it)

More information

Topic 7: Using identity types

Topic 7: Using identity types Toic 7: Using identity tyes June 10, 2014 Now we would like to learn how to use identity tyes and how to do some actual mathematics with them. By now we have essentially introduced all inference rules

More information

Chapter 3. GMM: Selected Topics

Chapter 3. GMM: Selected Topics Chater 3. GMM: Selected oics Contents Otimal Instruments. he issue of interest..............................2 Otimal Instruments under the i:i:d: assumtion..............2. he basic result............................2.2

More information

Yixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA

Yixi Shi. Jose Blanchet. IEOR Department Columbia University New York, NY 10027, USA. IEOR Department Columbia University New York, NY 10027, USA Proceedings of the 2011 Winter Simulation Conference S. Jain, R. R. Creasey, J. Himmelsach, K. P. White, and M. Fu, eds. EFFICIENT RARE EVENT SIMULATION FOR HEAVY-TAILED SYSTEMS VIA CROSS ENTROPY Jose

More information

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests 009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract

More information

Asymptotic F Test in a GMM Framework with Cross Sectional Dependence

Asymptotic F Test in a GMM Framework with Cross Sectional Dependence Asymtotic F Test in a GMM Framework with Cross Sectional Deendence Yixiao Sun Deartment of Economics University of California, San Diego Min Seong Kim y Deartment of Economics Ryerson University First

More information

LECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)]

LECTURE 7 NOTES. x n. d x if. E [g(x n )] E [g(x)] LECTURE 7 NOTES 1. Convergence of random variables. Before delving into the large samle roerties of the MLE, we review some concets from large samle theory. 1. Convergence in robability: x n x if, for

More information

2x2x2 Heckscher-Ohlin-Samuelson (H-O-S) model with factor substitution

2x2x2 Heckscher-Ohlin-Samuelson (H-O-S) model with factor substitution 2x2x2 Heckscher-Ohlin-amuelson (H-O- model with factor substitution The HAT ALGEBRA of the Heckscher-Ohlin model with factor substitution o far we were dealing with the easiest ossible version of the H-O-

More information

Bayesian Spatially Varying Coefficient Models in the Presence of Collinearity

Bayesian Spatially Varying Coefficient Models in the Presence of Collinearity Bayesian Satially Varying Coefficient Models in the Presence of Collinearity David C. Wheeler 1, Catherine A. Calder 1 he Ohio State University 1 Abstract he belief that relationshis between exlanatory

More information

Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Application

Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Application BULGARIA ACADEMY OF SCIECES CYBEREICS AD IFORMAIO ECHOLOGIES Volume 9 o 3 Sofia 009 Positive Definite Uncertain Homogeneous Matrix Polynomials: Analysis and Alication Svetoslav Savov Institute of Information

More information

Asymptotically Optimal Simulation Allocation under Dependent Sampling

Asymptotically Optimal Simulation Allocation under Dependent Sampling Asymtotically Otimal Simulation Allocation under Deendent Samling Xiaoing Xiong The Robert H. Smith School of Business, University of Maryland, College Park, MD 20742-1815, USA, xiaoingx@yahoo.com Sandee

More information

Distributed Rule-Based Inference in the Presence of Redundant Information

Distributed Rule-Based Inference in the Presence of Redundant Information istribution Statement : roved for ublic release; distribution is unlimited. istributed Rule-ased Inference in the Presence of Redundant Information June 8, 004 William J. Farrell III Lockheed Martin dvanced

More information

Asymptotic Properties of the Markov Chain Model method of finding Markov chains Generators of..

Asymptotic Properties of the Markov Chain Model method of finding Markov chains Generators of.. IOSR Journal of Mathematics (IOSR-JM) e-issn: 78-578, -ISSN: 319-765X. Volume 1, Issue 4 Ver. III (Jul. - Aug.016), PP 53-60 www.iosrournals.org Asymtotic Proerties of the Markov Chain Model method of

More information

Use of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek

Use of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek Use of Transformations and the Reeated Statement in PROC GLM in SAS Ed Stanek Introduction We describe how the Reeated Statement in PROC GLM in SAS transforms the data to rovide tests of hyotheses of interest.

More information

Partial Identification in Triangular Systems of Equations with Binary Dependent Variables

Partial Identification in Triangular Systems of Equations with Binary Dependent Variables Partial Identification in Triangular Systems of Equations with Binary Deendent Variables Azeem M. Shaikh Deartment of Economics University of Chicago amshaikh@uchicago.edu Edward J. Vytlacil Deartment

More information

Improved Bounds on Bell Numbers and on Moments of Sums of Random Variables

Improved Bounds on Bell Numbers and on Moments of Sums of Random Variables Imroved Bounds on Bell Numbers and on Moments of Sums of Random Variables Daniel Berend Tamir Tassa Abstract We rovide bounds for moments of sums of sequences of indeendent random variables. Concentrating

More information

The non-stochastic multi-armed bandit problem

The non-stochastic multi-armed bandit problem Submitted for journal ublication. The non-stochastic multi-armed bandit roblem Peter Auer Institute for Theoretical Comuter Science Graz University of Technology A-8010 Graz (Austria) auer@igi.tu-graz.ac.at

More information

Hidden Predictors: A Factor Analysis Primer

Hidden Predictors: A Factor Analysis Primer Hidden Predictors: A Factor Analysis Primer Ryan C Sanchez Western Washington University Factor Analysis is a owerful statistical method in the modern research sychologist s toolbag When used roerly, factor

More information

AN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES

AN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES AN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES Emmanuel Duclos, Maurice Pillet To cite this version: Emmanuel Duclos, Maurice Pillet. AN OPTIMAL CONTROL CHART FOR NON-NORMAL PRO- CESSES. st IFAC Worsho

More information

DETC2003/DAC AN EFFICIENT ALGORITHM FOR CONSTRUCTING OPTIMAL DESIGN OF COMPUTER EXPERIMENTS

DETC2003/DAC AN EFFICIENT ALGORITHM FOR CONSTRUCTING OPTIMAL DESIGN OF COMPUTER EXPERIMENTS Proceedings of DETC 03 ASME 003 Design Engineering Technical Conferences and Comuters and Information in Engineering Conference Chicago, Illinois USA, Setember -6, 003 DETC003/DAC-48760 AN EFFICIENT ALGORITHM

More information

Adaptive Estimation of the Regression Discontinuity Model

Adaptive Estimation of the Regression Discontinuity Model Adative Estimation of the Regression Discontinuity Model Yixiao Sun Deartment of Economics Univeristy of California, San Diego La Jolla, CA 9293-58 Feburary 25 Email: yisun@ucsd.edu; Tel: 858-534-4692

More information

A note on the random greedy triangle-packing algorithm

A note on the random greedy triangle-packing algorithm A note on the random greedy triangle-acking algorithm Tom Bohman Alan Frieze Eyal Lubetzky Abstract The random greedy algorithm for constructing a large artial Steiner-Trile-System is defined as follows.

More information

CHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit

CHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit Chater 5 Statistical Inference 69 CHAPTER 5 STATISTICAL INFERENCE.0 Hyothesis Testing.0 Decision Errors 3.0 How a Hyothesis is Tested 4.0 Test for Goodness of Fit 5.0 Inferences about Two Means It ain't

More information

arxiv: v3 [physics.data-an] 23 May 2011

arxiv: v3 [physics.data-an] 23 May 2011 Date: October, 8 arxiv:.7v [hysics.data-an] May -values for Model Evaluation F. Beaujean, A. Caldwell, D. Kollár, K. Kröninger Max-Planck-Institut für Physik, München, Germany CERN, Geneva, Switzerland

More information

Feedback-error control

Feedback-error control Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller

More information

MATH 2710: NOTES FOR ANALYSIS

MATH 2710: NOTES FOR ANALYSIS MATH 270: NOTES FOR ANALYSIS The main ideas we will learn from analysis center around the idea of a limit. Limits occurs in several settings. We will start with finite limits of sequences, then cover infinite

More information

Introduction to Probability and Statistics

Introduction to Probability and Statistics Introduction to Probability and Statistics Chater 8 Ammar M. Sarhan, asarhan@mathstat.dal.ca Deartment of Mathematics and Statistics, Dalhousie University Fall Semester 28 Chater 8 Tests of Hyotheses Based

More information

s v 0 q 0 v 1 q 1 v 2 (q 2) v 3 q 3 v 4

s v 0 q 0 v 1 q 1 v 2 (q 2) v 3 q 3 v 4 Discrete Adative Transmission for Fading Channels Lang Lin Λ, Roy D. Yates, Predrag Sasojevic WINLAB, Rutgers University 7 Brett Rd., NJ- fllin, ryates, sasojevg@winlab.rutgers.edu Abstract In this work

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell October 25, 2009 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

Finite Mixture EFA in Mplus

Finite Mixture EFA in Mplus Finite Mixture EFA in Mlus November 16, 2007 In this document we describe the Mixture EFA model estimated in Mlus. Four tyes of deendent variables are ossible in this model: normally distributed, ordered

More information

ute measures of uncertainty called standard errors for these b j estimates and the resulting forecasts if certain conditions are satis- ed. Note the e

ute measures of uncertainty called standard errors for these b j estimates and the resulting forecasts if certain conditions are satis- ed. Note the e Regression with Time Series Errors David A. Dickey, North Carolina State University Abstract: The basic assumtions of regression are reviewed. Grahical and statistical methods for checking the assumtions

More information

Adaptive estimation with change detection for streaming data

Adaptive estimation with change detection for streaming data Adative estimation with change detection for streaming data A thesis resented for the degree of Doctor of Philosohy of the University of London and the Diloma of Imerial College by Dean Adam Bodenham Deartment

More information

A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST BASED ON THE WEIBULL DISTRIBUTION

A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST BASED ON THE WEIBULL DISTRIBUTION O P E R A T I O N S R E S E A R C H A N D D E C I S I O N S No. 27 DOI:.5277/ord73 Nasrullah KHAN Muhammad ASLAM 2 Kyung-Jun KIM 3 Chi-Hyuck JUN 4 A MIXED CONTROL CHART ADAPTED TO THE TRUNCATED LIFE TEST

More information

STK4900/ Lecture 7. Program

STK4900/ Lecture 7. Program STK4900/9900 - Lecture 7 Program 1. Logistic regression with one redictor 2. Maximum likelihood estimation 3. Logistic regression with several redictors 4. Deviance and likelihood ratio tests 5. A comment

More information

MULTIVARIATE STATISTICAL PROCESS OF HOTELLING S T CONTROL CHARTS PROCEDURES WITH INDUSTRIAL APPLICATION

MULTIVARIATE STATISTICAL PROCESS OF HOTELLING S T CONTROL CHARTS PROCEDURES WITH INDUSTRIAL APPLICATION Journal of Statistics: Advances in heory and Alications Volume 8, Number, 07, Pages -44 Available at htt://scientificadvances.co.in DOI: htt://dx.doi.org/0.864/jsata_700868 MULIVARIAE SAISICAL PROCESS

More information

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation Paer C Exact Volume Balance Versus Exact Mass Balance in Comositional Reservoir Simulation Submitted to Comutational Geosciences, December 2005. Exact Volume Balance Versus Exact Mass Balance in Comositional

More information

An Investigation on the Numerical Ill-conditioning of Hybrid State Estimators

An Investigation on the Numerical Ill-conditioning of Hybrid State Estimators An Investigation on the Numerical Ill-conditioning of Hybrid State Estimators S. K. Mallik, Student Member, IEEE, S. Chakrabarti, Senior Member, IEEE, S. N. Singh, Senior Member, IEEE Deartment of Electrical

More information

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model Shadow Comuting: An Energy-Aware Fault Tolerant Comuting Model Bryan Mills, Taieb Znati, Rami Melhem Deartment of Comuter Science University of Pittsburgh (bmills, znati, melhem)@cs.itt.edu Index Terms

More information

The inverse Goldbach problem

The inverse Goldbach problem 1 The inverse Goldbach roblem by Christian Elsholtz Submission Setember 7, 2000 (this version includes galley corrections). Aeared in Mathematika 2001. Abstract We imrove the uer and lower bounds of the

More information

Applied Mathematics and Computation

Applied Mathematics and Computation Alied Mathematics and Comutation 217 (2010) 1887 1895 Contents lists available at ScienceDirect Alied Mathematics and Comutation journal homeage: www.elsevier.com/locate/amc Derivative free two-oint methods

More information

Spectral Analysis by Stationary Time Series Modeling

Spectral Analysis by Stationary Time Series Modeling Chater 6 Sectral Analysis by Stationary Time Series Modeling Choosing a arametric model among all the existing models is by itself a difficult roblem. Generally, this is a riori information about the signal

More information

Asymptotic non-null distributions of test statistics for redundancy in high-dimensional canonical correlation analysis

Asymptotic non-null distributions of test statistics for redundancy in high-dimensional canonical correlation analysis Asymtotic non-null distributions of test statistics for redundancy in high-dimensional canonical correlation analysis Ryoya Oda, Hirokazu Yanagihara, and Yasunori Fujikoshi, Deartment of Mathematics, Graduate

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell February 10, 2010 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

Lecture 6. 2 Recurrence/transience, harmonic functions and martingales

Lecture 6. 2 Recurrence/transience, harmonic functions and martingales Lecture 6 Classification of states We have shown that all states of an irreducible countable state Markov chain must of the same tye. This gives rise to the following classification. Definition. [Classification

More information

Performance of lag length selection criteria in three different situations

Performance of lag length selection criteria in three different situations MPRA Munich Personal RePEc Archive Performance of lag length selection criteria in three different situations Zahid Asghar and Irum Abid Quaid-i-Azam University, Islamabad Aril 2007 Online at htts://mra.ub.uni-muenchen.de/40042/

More information

Maximum Likelihood Asymptotic Theory. Eduardo Rossi University of Pavia

Maximum Likelihood Asymptotic Theory. Eduardo Rossi University of Pavia Maximum Likelihood Asymtotic Theory Eduardo Rossi University of Pavia Slutsky s Theorem, Cramer s Theorem Slutsky s Theorem Let {X N } be a random sequence converging in robability to a constant a, and

More information

The Poisson Regression Model

The Poisson Regression Model The Poisson Regression Model The Poisson regression model aims at modeling a counting variable Y, counting the number of times that a certain event occurs during a given time eriod. We observe a samle

More information

2. Sample representativeness. That means some type of probability/random sampling.

2. Sample representativeness. That means some type of probability/random sampling. 1 Neuendorf Cluster Analysis Assumes: 1. Actually, any level of measurement (nominal, ordinal, interval/ratio) is accetable for certain tyes of clustering. The tyical methods, though, require metric (I/R)

More information

1 Gambler s Ruin Problem

1 Gambler s Ruin Problem Coyright c 2017 by Karl Sigman 1 Gambler s Ruin Problem Let N 2 be an integer and let 1 i N 1. Consider a gambler who starts with an initial fortune of $i and then on each successive gamble either wins

More information

c Copyright by Helen J. Elwood December, 2011

c Copyright by Helen J. Elwood December, 2011 c Coyright by Helen J. Elwood December, 2011 CONSTRUCTING COMPLEX EQUIANGULAR PARSEVAL FRAMES A Dissertation Presented to the Faculty of the Deartment of Mathematics University of Houston In Partial Fulfillment

More information

Uniformly best wavenumber approximations by spatial central difference operators: An initial investigation

Uniformly best wavenumber approximations by spatial central difference operators: An initial investigation Uniformly best wavenumber aroximations by satial central difference oerators: An initial investigation Vitor Linders and Jan Nordström Abstract A characterisation theorem for best uniform wavenumber aroximations

More information