American Society for Quality

Size: px
Start display at page:

Download "American Society for Quality"

Transcription

1 American Society for Quality Ridge Regression: Biased Estimation for Nonorthogonal Problems Author(s): Arthur E. Hoerl and Robert W. Kennard Source: Technometrics, Vol. 42, No. 1, Secial 40th Anniversary Issue (Feb., 2000), Published by: American Statistical Association and American Society for Quality Stable URL: htt:// Accessed: 24/10/ :25 Your use of the JSTOR archive indicates your accetance of JSTOR's Terms and Conditions of Use, available at htt:// JSTOR's Terms and Conditions of Use rovides, in art, that unless you have obtained rior ermission, you may not download an entire issue of a journal or multile coies of articles, and you may use content in the JSTOR archive only for your ersonal, non-commercial use. Please contact the ublisher regarding any further use of this work. Publisher contact information may be obtained at htt:// Each coy of any art of a JSTOR transmission must contain the same coyright notice that aears on the screen or rinted age of such transmission. JSTOR is a not-for-rofit organization founded in 1995 to build trusted digital archives for scholarshi. We work with the scholarly community to reserve their work and the materials they rely uon, and to build a common research latform that romotes the discovery and use of these resources. For more information about JSTOR, lease contact suort@jstor.org. American Statistical Association and American Society for Quality are collaborating with JSTOR to digitize, reserve and extend access to Technometrics. htt://

2 Ridge Regression: Biased Estimation for Nonorthogonal Problems Arthur E. HOERL* and Robert W. KENNARD* University of Delaware and E. I. du Pont de Nemours & Co. (Deceased 1994) (2632 Horseshoe Court, Cocoa, FL 32926) In multile regression it is shown that arameter estimates based on minimum residual sum of squares have a high robability of being unsatisfactory, if not incorrect, if the rediction vectors are not orthogonal. Proosed is an estimation rocedure based on adding small ositive quantities to the diagonal of X'X. Introduced is the ridge trace, a method for showing in two dimensions the effects of nonorthogonality. It is then shown how to augment X'X to obtain biased estimates with smaller mean square error. 0. INTRODUCTION Consider the standard model for multile linear regression, Y = X/3 + E, where it is assumed that X is (n x ) and of rank,/3 is ( x 1) and unknown, E[e] = 0, and E[se'] = u2in. If an observation on the factors is denoted by x, = {xi,,x22,...,zv}, the general form X/3 is { ip_1 3iOi(x^)} where the Oi are functions free of unknown arameters. The usual estimation rocedure for the unknown 3 is Gauss-Markov-linear functions of Y = {y,} that are unbiased and have minimum variance. This estimation rocedure is a good one if X'X, when in the form of a correlation matrix, is nearly a unit matrix. However, if X'X is not nearly a unit matrix, the least squares estimates are sensitive to a number of "errors." The results of these errors are critical when the secification is that X/3 is a true model. Then the least squares estimates often do not make sense when ut into the context of the hysics, chemistry, and engineering of the rocess which is generating the data. In such cases, one is forced to treat the estimated redicting function as a black box or to dro factors to destroy the correlation bonds among the Xi used to form X'X. Both these alternatives are unsatisfactory if the original intent was to use the estimated redictor for control and otimization. If one treats the result as a black box, he must caution the user of the model not to take artial derivatives (a useless caution in ractice), and in the other case, he is left with a set of dangling controllables or observables. Estimation based on the matrix [X'X + ki], k > 0 rather than on X'X has been found to be a rocedure that can be used to hel circumvent many of the difficulties associated with the usual least squares estimates. In articular, the rocedure can be used to ortray the sensitivity of the estimates to the articular set of data being used, and it can be used to obtain a oint estimate with a smaller mean square error. 1. PROPERTIES OF BEST LINEAR UNBIASED ESTIMATION Using unbiased linear estimation with minimum variance or maximum likelihood estimation when the random vector, e, is normal gives 3 -(X'X) 1X'Y (1.1) as an estimate of /3 and this gives the minimum sum of squares of the residuals: 03) =- (Y- X/)'(Y - X/). (1.2) The roerties of P are well known (Scott 1966). Here the concern is rimarily with cases for which X'X is not nearly a unit matrix (unless secified otherwise, the model is formulated to give an X'X in correlation form). To demonstrate the effects of this condition on the estimation of,3, consider two roerties of 13-its variance-covariance matrix and its distance from its exected value. (i) VAR(3) = c2(x'x)-1 (1.3) (ii) L1 = Distance from 3 to 3. or equivalently L =- (O - )'(P - 3) (1.4) E[L2] = r2trace(x'x)-1 E[/'P] = /3' + a2trace(x'x)-l When the error e is normally distributed, then VAR[L2] = 2cr4Trace(X'X)-2. (1.5) (1.5a) (1.6) These related roerties show the uncertainty in / when X'X moves from a unit matrix to an ill-conditioned one. If the eigenvalues of X'X are denoted by Amax = 1 > 2 > *' > \ = Amin > 0, (1.7) then the average value of the squared distance from / to /3 is given by E[LI2] = a2 (1/Ai) i=l (1.8)? 1970 American Statistical Association and the American Society for Quality 80

3 and the variance when the error is normal is given by VAR[L2] = 2-4 (1/Ai)2. Lower bounds for the average and variance are a2/amin and 2o4/A i,, resectively. Hence, if the shae of the factor sace is such that reasonable data collection results in an X'X with one or more small eigenvalues, the distance from /3 to,3 will tend to be large. Estimated coefficients, f3i, that are large in absolute value have been observed by all who have tackled live nonorthogonal data roblems. The least squares estimate (1.1) suffers from the deficiency of mathematical otimization techniques that give oint estimates; the estimation rocedure does not have built into it a method for ortraying the sensitivity of the solution (1.1) to the otimization criterion (1.2). The rocedures to be discussed in the sections to follow ortray the sensitivity of the solutions and utilize nonsensitivity as an aid to analysis. 2. RIDGE REGRESSION A. E. Hoerl first suggested in 1962 (Hoerl 1962; Hoerl and Kennard 1968) that to control the inflation and general instability associated with the least squares estimates, one can use 3* = [X'X + kj]-x'y; k > (2.1) = WX'Y. (2.2) The family of estimates given by k > 0 has many mathematical similarities with the ortrayal of quadratic resonse functions (Hoerl 1964). For this reason, estimation and analysis built around (2.1) has been labeled "ridge regression." The relationshi of a ridge estimate to an ordinary estimate is given by the alternative form 3* [I + k(x'x)-1]-1 (2.3) Z3. (2.4) This relationshi will be exlored further in subsequent sections. Some roerties of 3*, W, and Z that will be used are: (i) Let (i(w) and (i(z) be the eigenvalues of W and Z, resectively. Then (i(w) = 1/(A, + k) (2.5) i(z) = Ai/(A, + k) (2.6) where Ai are the eigenvalues of X'X. These results follow directly from the definitions of W and Z in (2.2) and (2.4) and the solution of the characteristic equations IW--I[ = 0 and IZ - II = 0. (ii) Z = I- k(x'x + ki)-1 = I- kw (2.7) The relationshi is readily verified by writing Z in the alternative form Z = (X'X + ki)- X'X = WX'X and multilying both sides of (2.7) on the left by W-1. (iii) /* for k y 0 is shorter than /3, i.e., RIDGE REGRESSION 81 By definition /* = Z/. From its definition and the assumtions on X'X, Z is clearly symmetric ositive definite. Then (1.9) the following relation holds (Sheff6 1960): (d*)'(3*) _< ax(z)i'2. (2.9) But ~max(z) = A1/(A1 + k) where A1 is the largest eigenvalue of X'X and (2.8) is established. From (2.6) and (2.7) it is seen that Z(0) = I and that Z aroaches 0 as k -+ oo. For an estimate /* the residual sum of squares is b*(k) = (Y - X3*)'(Y - X/*) (2.10) which can be written in the form 0*(k) = Y'Y - (d*)'x'y - k(3*)'(3*). (2.11) The exression shows that * (k) is the total sum of squares less the "regression" sum of squares for /3* with a modification deending uon the squared length of 3*. 3. THE RIDGE TRACE a. Definition of the Ridge Trace When X'X deviates considerably from a unit matrix, that is, when it has small eigenvalues, (1.5) and (1.6) show that the robability can be small that 3 will be close to /3. In any excet the smallest roblems, it is difficult to untangle the relationshis among the factors if one is confined to an insection of the simle correlations that are the elements of X'X. That such untangling is a roblem is reflected in the "automatic" rocedures that have been ut forward to reduce the dimensionality of the factor sace or to select some "best" subset of the redictors. These automatic rocedures include regression using the factors obtained from a coordinate transformation using the rincial comonents of X'X, stewise regression, comutation of all 2P regressions, and some subset of all regressions using fractional factorials or a branch and bound technique (Beale, Kendall, and Mann 1967; Efroyson 1960; Garside 1965; Gorman and Toman 1966; Hocking and Leslie 1967; Jeffers 1967; Scott 1966). However, with the occasional excetion of rincial comonents, these methods don't really give an insight into the structure of the factor sace and the sensitivity of the results to the articular set of data at hand. But by comut- ing /*(k) and O*(k) for a set of values of k, such insight can be obtained. A detailed study of two nonorthogonal roblems and the conclusions that can be drawn from their ridge traces is given in James and Stein (1961). b. Characterization of the Ridge Trace Let B be any estimate of the vector /. Then the residual sums of squares can be written as =(Y - XB)'(Y- XB) =(Y - X3)'(Y - X3) + (B -/)'X'X(B - 3) = >min +?(B) (3.1).on-to o.a I a Contours oft constant 0 are the surfaces of hyerellisoids (2.8) centered at /, the ordinary least squares estimate of /3. The

4 82 ARTHUR E. HOERL AND ROBERT W. KENNARD value of 0 is the minimum value,?mi, lus the value of the quadratic form in (B - /). There is a continuum of values of Bo that will satisfy the relationshi q = Omin + -o where 00 > 0 is a fixed increment. However, the relationshis in Section 2 show that on the average the distance from to 3 will tend to be large if there is a small eigenvalue of X'X. In articular, the worse the conditioning of X'X, the more 3 can be exected to be too long. On the other hand, the worse the conditioning, the further one can move from,3 without an areciable increase in the residual sums of squares. In view of (1.5a) it seems reasonable that if one moves away from the minimum sum of squares oint, the movement should be in a direction which will shorten the length of the regression vector. The ridge trace can be shown to be following a ath through the sums of squares surface so that for a fixed X a single value of B is chosen and that is the one with minimum length. This can be stated recisely as follows: Minimize B'B subject to (B - /)'X'X(B - 0) = 0. (3.2) As a Lagrangian roblem this is Minimize F = B'B + (1/k)[(B - 3)'X'X(B - )' - 0o] where (1/k) is the multilier. Then OF This reduces to (3.3) = 2B + (1/k)[2(X'X)B - 2(X'X)3] = 0 (3.4) B = /* = [X'X + ki]-x'y (3.5) where k is chosen to satisfy the restraint (3.2). This is the ridge estimator. Of course, in ractice it is easier to choose a k > 0 and then comute o0. In terms of 3* the residual sum of squares becomes 0*(k) = (Y- X/*)'(Y - XP*) =- min + k2 3*/(XIX)-1l*. (3.6) A comletely equivalent statement of the ath is this: If the squared length of the regression vector B is fixed at R2, then /3* is the value of B that gives a minimum sum of squares. That is, /3* is the value of B that minimizes the function F1 = (Y - XB)'(Y - XB) + (1/k)(B'B - R2). (3.7) With (3.1) in 3b, this shows that an increase in the residual sum of squares is equivalent to a decrease in the value of the likelihood function. So the contours of equal likelihood also lie on the surface of hyerellisoids centered at 3. The ridge trace can thereby be interreted as a ath through the likelihood sace, and the question arises as why this articular ath can be of secial interest. The reasoning is the same as for the sum of squares. Although long vectors give the same likelihood values as shorter vectors, they will not always have equal hysical meaning. Imlied is a restraint on the ossible values of /3 that is not made exlicit in the formulation of the general linear model given in the Introduction. This imlication is discussed further in the sections that follow. 4. MEAN SQUARE ERROR PROPERTIES OF RIDGE REGRESSION a. Variance and Bias of a Ridge Estimator To look at 3* from the oint of view of mean square error it is necessary to obtain an exression for E[L2(k)]. Straightforward alication of the exectation oerator and (2.3) gives the following: E[L 2(k)] =E[(* - -)'(* - )] = E[(/ - 3)'Z'Z(/ -/3)] + (Z/3-3)'(Z/3 -/3) (4.2) = -2 Trace(X'X)-lZ'Z + /(Z - I)'(Z - I)P/ (4.3) = 02[Trace(X'X + ki)-1 - k Trace(X'X + ki)-2] = + k2/3'(x'x + ki)-23 f2 Z Ai/(Ai + k)2 + k2f3(x'x + ki)-2/ 1 = 1 (k) + 2(k) (4.4) (4.5) (4.6) The meanings of the two elements of the decomosition, 7yi(k) and 72(k), are readily established. The second element, 72(k), is the squared distance from Z/( to 3. It will be zero when k = 0, since Z is then equal to I. Thus, 72 (k) can be considered the square of a bias introduced when /* is used rather than /. The first term, 71 (k), can be shown to be the sum of the variances (total variance) of the arameter estimates. In terms of the random variable Y, c. Likelihood Characterization of the Ridge Trace Using the assumtion that the error vector is Normal (0, a2in) the likelihood function is (27ru2)-n/2 ex{-(1/2cr2)(y - X/)'(Y - X/3)}. (3.8) The kernel of this function is the quadratic form in the exonential which can be written in the form (Y - X3)'(Y - X3) = (Y - X/)'(Y - X3) + (/ - f/)'x'x(3 - /3). (3.9) Then /* = Z/3 = Z(X'X) 1X'Y. (4.7) VAR[3*] = Z(X'X)-1X'VAR[Y]X(X'X)-1Z' = r2z(x'x)-lz. (4.8) The sum of the variances of all the /* is the sum of the diagonal elements of (4.8).

5 RIDGE REGRESSION 83 Figure 1 shows in qualitative form the relationshi between the variances, the squared bias, and the arameter k. The total variance decreases as k increases, while the squared bias increases with k. As is indicated by the dot- ted line, which is the sum of 71 (k) and Y2 (k) and thus is E[L2(k)], the ossibility exists that there are values of k (admissible values) for which the mean square error is less for 3* than it is for the usual solution 3. This ossibility is suorted by the mathematical roerties of 71(k) and 72(k). [See Section 4b.] The function 71 (k) is a monotonic decreasing function of k, while 7Y2(k) is monotonic increasing. However, the most significant feature is the value of the derivative of each function in the neighborhood of the origin. These derivatives are: Lim(dTy/dk) - -2a2E(1/A2) (4.9) k-o+ b. Theorems on the Mean Square Function Theorem 4.1. The total variance y71(k) is a continuous, monotonically decreasing function of k. Corollary The first derivative with resect to k of the total variance 7 (k), aroaches -oc as k and A -X0. Both the theorem and the corollary are readily roved by use of lyi(k) and its derivative exressed in terms of Ai. Theorem 4.2. The squared bias 72(k) is a continuous, monotonically increasing function of k. Proof From (4.5) Y2(k) = k2/'(x'x + ki)-2/3. Corollary The first derivative of the total variance, y1 (k), aroaches -oo as k and the matrix X'X becomes singular. Both the theorem and the corollary are readily roved by use of 7 (k) and its derivative exressed in terms of Ai. Theorem 4.2. The squared bias 72(k) monotonically increasing function of k. is a continuous, Lim (d72/dk) - 0. (4.10) k-o+ Proof: From (4.5) Y22(k) = k2,'(x'x + ki)-2/3. If A is the matrix of eigenvalues of X'X and P the orthogonal transformation such that X'X = P'AP, then Thus, -yi(k) has a negative derivative which aroaches -2r2 as k for an orthogonal X'X and aroaches -oo as X'X becomes ill-conditioned and A -- 2 (k) = k2 Ca2/(Xi + k)2 (4.11) 0. On the 1 other hand, as k -X 0+, (4.10) shows that 72(k) is flat and zero at the origin. These roerties lead to the conclusion that it is ossible to move to k where a > 0, take a little P/3. (4.12) bias, and substantially reduce the variance, thereby imroving Since Ai > 0 for all i and k > 0, each element (Ai + k) is the mean square error of estimation and rediction. An ex- ositive and there are no singularities in the sum. Clearly, istence theorem to validate this conclusion is given in Sec- 72(0) = 0. Then 72(k) is a continuous function for k > 0. tion 4b. For k > 0 (4.11) can be written as "Y2(k)- E oz2/[1 + (Ai/k) ]2. (4.13) Since Ai > 0 for all i, the functions Ai/k are clearly monotone decreasing for increasing k and each term of 'y2(k) is monotone increasing. So 72(k) is monotone increasing. q.e.d. f1 Corollary The squared bias 72(k) aroaches /3't as an uer limit. Proof: From (4.13) limk-ooy72(k) /3'P'P/3 = '/3 q.e.d. 1Zai - = a-a I- i t n Corollary The derivative 72 (k) aroaches zero as k -X 0+. Proof From (4.11) it is readily established that d'y2(k)/dk = 2k Aia2/(Ai + k)3. (4.14) k Figure 1. Each term in the sum 2kAia2/(Ai + k)3 is a continuous function. And the limit of each term as k -* 0+ is zero. q.e.d. Theorem 4.3. (Existence Theorem) There always exists a k > 0 such that E[L2(k)] < E[L2(0)] = C2 P (1/Ai).

6 84 ARTHUR E. HOERL AND ROBERT W. KENNARD Proof. From (4.5), (4.11), and (4.14) de[li(k)]/dk - dy(k)/dk + d2(k)/dk -2a2 Ai/(A, + k)3 + 2k A Ai?a2/(Ai + k)3. (4.15) First note that - 3y1(0) u2e(1/ai) and 72(0) = 0. In Theorems 4.1 and 4.2 it was established that -y1(k) and Y2 (k) are monotonically decreasing and increasing, resectively. Their first derivatives are always non-ositive and non-negative, resectively. Thus, to rove the theorem, it is only necessary to show that there always exists a k > 0 such that de[l2(k)]/dk < 0. The condition for this is shown by (4.15) to be: k < O /amax q.e.d. (4.16) c. Some Comments On The Mean Square Error Function The roerties of E[L2(k)] -=' (k) + Ty2(k) show that it will go through a minimum. And since 72(k) aroaches 3'/3 as a limit as k -+ oo, this minimum will move toward k = 0 as the magnitude of /3' increases. Since 3'/3 is the squared length of the unknown regression vector, it would aear to be imossible to choose a value of k 0 and thus achieve a smaller mean square error without being able to assign an uer bound to /3'3. On the other hand, it is clear that /3' does not become infinite in ractice, and one should be able to find a value or values for k that will ut /3* closer to /3 than is /. In other words, unboundedness, in the strict mathematical sense, and ractical unboundedness are two different things. In Section 7 some recommendations for choosing a k > 0 are given, and the imlicit assumtions of boundedness are exlored further. 5. A GENERAL FORM OF RIDGE REGRESSION It is always ossible to reduce the general linear regression roblem as defined in the Introduction to a canonical form in which the X'X matrix is diagonal. In articular there exists an orthogonal transformation P such that X'X = P'AP where A = (6ijAi) is the matrix of eigenvalues of X'X. Let and where X= X*P (5.1) Y = X*t + e (5.2) ac = PO, (X*)/(X*) = A, and a'a = '3. (5.3) Then the general ridge estimation rocedure is defined from where c* = [(X*)'(X*) + K] -(X*)'Y (5.4) K = (ijki), ki > 0. All the basic results given in Section 4 can be shown to hold for this more general formulation. Most imortant is that there is an equivalent to the existence theorem, Theorem 4.3. In the general form; one seeks a ki for each canonical variate defined by X*. By defining (L)2 = (&* - )'(* - a) it can be shown that the otimal values for the ki will be ki = a2/a2. There is no grahical equivalent to the RIDGE TRACE but an iterative rocedure initiated at ki = a 2/a2 can be used. (See Section 7) 6. RELATION TO OTHER WORK IN REGRESSION Ridge regression has oints of contact with other aroaches to regression analysis and to work with the same objective. Three should be mentioned. * In a series of aers, Stein (1960, 1962) and James and Stein (1961) investigated the imrovement in mean square error by a transformation on /3 of the form C3, 0 < C < 1, which is a shortening of the vector 3. They show that such a C > 0 can always be found and indicate how it might be comuted. * A Bayesian aroach to regression can be found in Jeffreys (1961) and Raiffa and Schlaifer (1961). Viewed in this context, each ridge estimate can be considered as the osterior mean based on giving the regression coefficients, /, a rior normal distribution with mean zero and variance-covariance matrix E = (6ij62/k). For those that do not like the hilosohical imlications of assuming P to be a random variable, all this is equivalent to constrained estimation by a nonuniform weighting on the values of 3. * Constrained estimation in a context related to regression can be found in Balakrishnan (1963). For the model in the resent aer, let /3 be constrained to be in a closed, bounded convex set C, and, in articular, let C be a hyershere of radius R. Let the estimation criterion be minimum residual sum of squares? = (Y-XB)'(Y-XB) where B is the value giving the minimum. Under the constraint, if /3'3 < R2, than B is chosen to be /3; otherwise B is chosen to be 3* where k is chosen so that (/3*)'(*) R2. 7. SELECTING A BETTER ESTIMATE OF / In Section 2 and in the examle of Section 3, it has been demonstrated that the ordinary least squares estimate of the regression vector 3 suffers from a number of deficiencies when X'X does not have a uniform eigenvalue sectrum. A class of biased estimators /*, obtained by augmenting the diagonal of X'X with small ositive quantities, has been introduced both to ortray the sensitivity of the solution to X'X and to form the basis for obtaining an estimate of /3 with a smaller mean square error. In examining the roerties of /3*, it can be shown that its use is equivalent to making certain boundedness assumtions regarding either the individual coordinates of /3 or its squared length, /3'3. As Barnard (1963) has recently ointed out, an alternative to unbiasedness in the logic of the least squares estimator /3 is the rior assurance of bounded mean square error with no boundedness assumtion on /3. If it is ossible to make secific mathematical assumtions about /3, then it is ossible to constrain the estimation rocedure to reflect these assumtions.

7 RIDGE REGRESSION 85 The inherent boundedness assumtions in using 3* make estimation can be continued until there is a stability it clear that it will not be ossible to construct a clear-cut, achieved in (a*)'(a*) and kio -= 2/(&o)2. automatic estimation rocedure to roduce a oint estimate (a single value of k or a secific value for each ki) as can be 8. CONCLUSIONS constructed to roduce 3. However, this is no drawback to It has been shown that when X'X is such that it its use has because with any given set of data it is not difficult a nonuniform to select a 3* that is better than /. In fact, ut in eigenvalue sectrum, the estimates of 3 in context, Y = X3 + e, based on the criterion of any set of data which minimum is a candidate residfor analysis using lin- ual sum of ear regression has imlicit in it restrictions on the squares, can have a high robability of being ossible far removed from 3. values This of the estimates that can be consistent with known unsatisfactory condition manifests itself in estimates that roerties of the data are too generator. Yet it is difficult to be exlarge in absolute value and some licit about these restrictions; it is esecially difficult to be may even have the wrong sign. By adding a small mathematically exlicit. In a recent ositive quantity to each aer (Clutton-Brock diagonal element the system 1965) it has been shown that for the roblem of [X'X + K]/* = X'Y acts more like an orthogonal system. estimating When K = the ki and mean /u of all a distribution, a solutions set of in data the has in it interval 0 < k < 1 imlicit are restrictions on the values of a that can be obtained, it is logical contenders ossible to obtain a two-dimensional charas acterization of the generators. Of course, in linear regression the roblem is system and a ortrayal of the kinds of much more difficulties caused difficult; the number of ossibilities is so by the intercorrelations among the relarge. dictors. First, there A is the number of arameters involved. To have study of the roerties of the estimator /3* shows ten to that it can be used to twenty regression coefficients is not uncommon. And imrove the mean square error of estheir signs have to be considered. Then there is X'X timation, and the and the magnitude of this imrovement increases with (2) different an factor increase correlations in and the ways in which sread of the they eigenvalue sectrum. An can be related. estimate Yet in based the on final analysis these many differ- /3* is biased and the use of a biased esent influences can timator be integrated to make an assessment as imlies some rior bound on the regression vector to whether the estimated values are consistent with the data /3. However, the data in any articular roblem has inforand the roerties of mation in the it data that can show the class generator. Guiding one of generators 3 that are along the way, of reasonable. course, is The the objective of the study. In Hoerl urose of the ridge trace is to ortray this and Kennard (1970) it is information shown for two roblems how such exlicitly and, hence, guide the user to a better estimate /*. an assessment can be made. Based on exerience, the best method for achieving a better estimate /3* is to use ki = k for all i and use the Ridge Trace to select a single value of k and a unique 3*. These kinds of things can be used to guide one to a choice. * At a certain value of k the system will stabilize and have the general characteristics of an orthogonal system. * Coefficients will not have unreasonable absolute values with resect to the factors for which they reresent rates of change. * Coefficients with aarently incorrect signs at k = 0 will have changed to have the roer sign. * The residual sum of squares will not have been inflated to an unreasonable value. It will not be large relative to the minimum residual sum of squares or large relative to what would be a reasonable variance for the rocess generating the data. Another aroach is to use estimates of the otimum values of ki develoed in Section 5. A tyical aroach here would be as follows: * Reduce the system to canonical by the transformations X = X*P and ca = P/. * Determine estimates of the otimum ki's using kio = &2/&2. Use the kio to obtain /*. * The ki0 will tend to be too small because of the tendency to overestimate a'a. Since use of the kio will shorten the length of the estimated regression vector, kio can be re-estimated using the 6&. This re- NOMENCLATURE 3= (X'X)- X'Y 3* = f*(k) = [X'X + ki]-1x'y; k > 0 W = W(k) = [X'X + ki]-1 = I - kw Z = Z(k)= [I + k(x'x)-1]- Ai = Eigenvalue of X'X; A1 > A2 >. > A > 0 A = (SijAi) = the matrix of eigenvalues P = An orthogonal matrix such that P'AP = X'X L (k) = E[(3)* - )'( *-)] = (k) + 72(k) 71 (k) = Variance of the estimate * 72(k) = Squared bias of the estimate 3* K = (&iki); ki > \- t - xj / I - negative constants. = PP X* = XP' A* = [(X*)'(X*) + K] -(X*)'Y W* = [(X*)'(x*) + K]-1 Z* = {I + [(X*)'(X*)]-1K]} = I- KW 0 A diagonal matrix of nnn- vt _-...L...1,.t1 t./1 111~./11 [Received August Revised June 1969.] REFERENCES Balakrishnan, A. V. (1963), "An Oerator Theoretic Formulation of a Class of Control Problems and a Steeest Descent Method of Solution," Journal on Control, 1, Barnard, G. A. (1963), "The Logic of Least Squares," Journal of the Royal Statistical Society, Ser. B, 65, Beale, E. M. L., Kendall, M. G., and Mann, D. W. (1967), "The Discarding of Variables in Multivariate Analysis," Biometrika, 54,

8 86 ARTHUR E. HOERL AND ROBERT W. KENNARD Clutton-Brock, M. (1965), "Using the Observations to Estimate Prior Distribution," Journal of the Royal Statistical Society, Ser. B, 27, Efroymson, M. A. (1960), "Multile Regression Analysis," in Mathematical Methodsfor Digital Comuters, eds. A. Ralston and H. S. Wilf, New York: Wiley, cha. 17. Garside, M. J. (1965), "The Best Subset in Multile Regression Analysis," Alied Statistics, 14. Gorman, J. W., and Toman, R. J. (1966), "Selection of Variables for Fitting Equations to Data," Technometrics, 8, Hocking, R. R., and Leslie, R. N. (1967), "Selection of the Best Subset in Regression Analysis," Technometrics, 9, Hoerl, A. E. (1962), "Alication of Ridge Analysis to Regression Problems," Chemical Engineering Progress, 58, (1965), "Ridge Analysis," Chemical Engineering Progress Symosium Series, 60, Hoerl, A. E., and Kennard, R. W. (1968), "On Regression Analysis and Biased Estimation," Technometrics, 10, (1970), "Ridge Regression. Alications to Non-orthogonal Problems," Technometrics, 12. James, W., and Stein, C. M. (1961), "Estimation With Quadratic Loss," Proceedings of the 4th Berkeley Symosium, 1, Jeffers, J. N. R. (1967), "Two Case Studies in the Alication of Princial Comonent Analysis," Alied Statistics, 16, Jeffreys, H. (1961), Theory of Probability (3rd ed.), London: Oxford University Press. Raiffa, H., and Schlaifer, R. (1961), Alied Statistical Decision Theory, Boston: Harvard University Press. Riley, J. D. (1955), "Solving Systems of Linear Equations With a Positive Definite, Symmetric, but Possibly Ill-conditioned Matrix," Mathematics of Comutation, 9, Scheffe, H. (1960), The Analysis of Variance, New York: Wiley. Scott, J. T., Jr. (1966), "Factor Analysis and Regression," Econometrica, 34, Stein, C. M. (1960), "Multile Regression," in Essays in Honor of Harold Hotelling, Stanford, CA: Stanford University Press, cha. 37. (1962), "Confidence Sets for the Mean of a Multivariate Normal Distribution," Journal of the Royal Statistical Society, Ser. B, 24,

Ridge Regression: Biased Estimation for Nonorthogonal Problems

Ridge Regression: Biased Estimation for Nonorthogonal Problems TECHNOMETRICS VOL. 12, No. 1 FEBRUARY 1970 Ridge Regression: Biased Estimation for Nonorthogonal Problems ARTHUR E. HOERL AND ROBERT W. KENNARD University of Delaware and E. 1. du Pont de Nemours & Co.

More information

A Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression

A Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Journal of Modern Alied Statistical Methods Volume Issue Article 7 --03 A Comarison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Ghadban Khalaf King Khalid University, Saudi

More information

4. Score normalization technical details We now discuss the technical details of the score normalization method.

4. Score normalization technical details We now discuss the technical details of the score normalization method. SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules

More information

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK

Towards understanding the Lorenz curve using the Uniform distribution. Chris J. Stephens. Newcastle City Council, Newcastle upon Tyne, UK Towards understanding the Lorenz curve using the Uniform distribution Chris J. Stehens Newcastle City Council, Newcastle uon Tyne, UK (For the Gini-Lorenz Conference, University of Siena, Italy, May 2005)

More information

Estimation of the large covariance matrix with two-step monotone missing data

Estimation of the large covariance matrix with two-step monotone missing data Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo

More information

State Estimation with ARMarkov Models

State Estimation with ARMarkov Models Deartment of Mechanical and Aerosace Engineering Technical Reort No. 3046, October 1998. Princeton University, Princeton, NJ. State Estimation with ARMarkov Models Ryoung K. Lim 1 Columbia University,

More information

Use of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek

Use of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek Use of Transformations and the Reeated Statement in PROC GLM in SAS Ed Stanek Introduction We describe how the Reeated Statement in PROC GLM in SAS transforms the data to rovide tests of hyotheses of interest.

More information

General Linear Model Introduction, Classes of Linear models and Estimation

General Linear Model Introduction, Classes of Linear models and Estimation Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)

More information

Numerical Linear Algebra

Numerical Linear Algebra Numerical Linear Algebra Numerous alications in statistics, articularly in the fitting of linear models. Notation and conventions: Elements of a matrix A are denoted by a ij, where i indexes the rows and

More information

Feedback-error control

Feedback-error control Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller

More information

Positive decomposition of transfer functions with multiple poles

Positive decomposition of transfer functions with multiple poles Positive decomosition of transfer functions with multile oles Béla Nagy 1, Máté Matolcsi 2, and Márta Szilvási 1 Deartment of Analysis, Technical University of Budaest (BME), H-1111, Budaest, Egry J. u.

More information

A Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition

A Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition A Qualitative Event-based Aroach to Multile Fault Diagnosis in Continuous Systems using Structural Model Decomosition Matthew J. Daigle a,,, Anibal Bregon b,, Xenofon Koutsoukos c, Gautam Biswas c, Belarmino

More information

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests

System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests 009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract

More information

MATH 2710: NOTES FOR ANALYSIS

MATH 2710: NOTES FOR ANALYSIS MATH 270: NOTES FOR ANALYSIS The main ideas we will learn from analysis center around the idea of a limit. Limits occurs in several settings. We will start with finite limits of sequences, then cover infinite

More information

Bayesian Spatially Varying Coefficient Models in the Presence of Collinearity

Bayesian Spatially Varying Coefficient Models in the Presence of Collinearity Bayesian Satially Varying Coefficient Models in the Presence of Collinearity David C. Wheeler 1, Catherine A. Calder 1 he Ohio State University 1 Abstract he belief that relationshis between exlanatory

More information

Convex Optimization methods for Computing Channel Capacity

Convex Optimization methods for Computing Channel Capacity Convex Otimization methods for Comuting Channel Caacity Abhishek Sinha Laboratory for Information and Decision Systems (LIDS), MIT sinhaa@mit.edu May 15, 2014 We consider a classical comutational roblem

More information

Plotting the Wilson distribution

Plotting the Wilson distribution , Survey of English Usage, University College London Setember 018 1 1. Introduction We have discussed the Wilson score interval at length elsewhere (Wallis 013a, b). Given an observed Binomial roortion

More information

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK

MATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment

More information

arxiv: v1 [physics.data-an] 26 Oct 2012

arxiv: v1 [physics.data-an] 26 Oct 2012 Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch

More information

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL

MODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management

More information

#A47 INTEGERS 15 (2015) QUADRATIC DIOPHANTINE EQUATIONS WITH INFINITELY MANY SOLUTIONS IN POSITIVE INTEGERS

#A47 INTEGERS 15 (2015) QUADRATIC DIOPHANTINE EQUATIONS WITH INFINITELY MANY SOLUTIONS IN POSITIVE INTEGERS #A47 INTEGERS 15 (015) QUADRATIC DIOPHANTINE EQUATIONS WITH INFINITELY MANY SOLUTIONS IN POSITIVE INTEGERS Mihai Ciu Simion Stoilow Institute of Mathematics of the Romanian Academy, Research Unit No. 5,

More information

A New Asymmetric Interaction Ridge (AIR) Regression Method

A New Asymmetric Interaction Ridge (AIR) Regression Method A New Asymmetric Interaction Ridge (AIR) Regression Method by Kristofer Månsson, Ghazi Shukur, and Pär Sölander The Swedish Retail Institute, HUI Research, Stockholm, Sweden. Deartment of Economics and

More information

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)

Combining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO) Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment

More information

An Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling

An Improved Generalized Estimation Procedure of Current Population Mean in Two-Occasion Successive Sampling Journal of Modern Alied Statistical Methods Volume 15 Issue Article 14 11-1-016 An Imroved Generalized Estimation Procedure of Current Poulation Mean in Two-Occasion Successive Samling G. N. Singh Indian

More information

Distributed Rule-Based Inference in the Presence of Redundant Information

Distributed Rule-Based Inference in the Presence of Redundant Information istribution Statement : roved for ublic release; distribution is unlimited. istributed Rule-ased Inference in the Presence of Redundant Information June 8, 004 William J. Farrell III Lockheed Martin dvanced

More information

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. Use of the Singular Value Decomosition in Regression Analysis Author(s): John Mandel Source: The American Statistician, Vol. 36, No. 1 (Feb., 1982),. 15-24 Published by: American Statistical Association

More information

Chapter 13 Variable Selection and Model Building

Chapter 13 Variable Selection and Model Building Chater 3 Variable Selection and Model Building The comlete regsion analysis deends on the exlanatory variables ent in the model. It is understood in the regsion analysis that only correct and imortant

More information

Session 5: Review of Classical Astrodynamics

Session 5: Review of Classical Astrodynamics Session 5: Review of Classical Astrodynamics In revious lectures we described in detail the rocess to find the otimal secific imulse for a articular situation. Among the mission requirements that serve

More information

Modeling and Estimation of Full-Chip Leakage Current Considering Within-Die Correlation

Modeling and Estimation of Full-Chip Leakage Current Considering Within-Die Correlation 6.3 Modeling and Estimation of Full-Chi Leaage Current Considering Within-Die Correlation Khaled R. eloue, Navid Azizi, Farid N. Najm Deartment of ECE, University of Toronto,Toronto, Ontario, Canada {haled,nazizi,najm}@eecg.utoronto.ca

More information

Hidden Predictors: A Factor Analysis Primer

Hidden Predictors: A Factor Analysis Primer Hidden Predictors: A Factor Analysis Primer Ryan C Sanchez Western Washington University Factor Analysis is a owerful statistical method in the modern research sychologist s toolbag When used roerly, factor

More information

Radial Basis Function Networks: Algorithms

Radial Basis Function Networks: Algorithms Radial Basis Function Networks: Algorithms Introduction to Neural Networks : Lecture 13 John A. Bullinaria, 2004 1. The RBF Maing 2. The RBF Network Architecture 3. Comutational Power of RBF Networks 4.

More information

On Doob s Maximal Inequality for Brownian Motion

On Doob s Maximal Inequality for Brownian Motion Stochastic Process. Al. Vol. 69, No., 997, (-5) Research Reort No. 337, 995, Det. Theoret. Statist. Aarhus On Doob s Maximal Inequality for Brownian Motion S. E. GRAVERSEN and G. PESKIR If B = (B t ) t

More information

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process P. Mantalos a1, K. Mattheou b, A. Karagrigoriou b a.deartment of Statistics University of Lund

More information

The Binomial Approach for Probability of Detection

The Binomial Approach for Probability of Detection Vol. No. (Mar 5) - The e-journal of Nondestructive Testing - ISSN 45-494 www.ndt.net/?id=7498 The Binomial Aroach for of Detection Carlos Correia Gruo Endalloy C.A. - Caracas - Venezuela www.endalloy.net

More information

An Estimate For Heilbronn s Exponential Sum

An Estimate For Heilbronn s Exponential Sum An Estimate For Heilbronn s Exonential Sum D.R. Heath-Brown Magdalen College, Oxford For Heini Halberstam, on his retirement Let be a rime, and set e(x) = ex(2πix). Heilbronn s exonential sum is defined

More information

Finite Mixture EFA in Mplus

Finite Mixture EFA in Mplus Finite Mixture EFA in Mlus November 16, 2007 In this document we describe the Mixture EFA model estimated in Mlus. Four tyes of deendent variables are ossible in this model: normally distributed, ordered

More information

On a Markov Game with Incomplete Information

On a Markov Game with Incomplete Information On a Markov Game with Incomlete Information Johannes Hörner, Dinah Rosenberg y, Eilon Solan z and Nicolas Vieille x{ January 24, 26 Abstract We consider an examle of a Markov game with lack of information

More information

Variable Selection and Model Building

Variable Selection and Model Building LINEAR REGRESSION ANALYSIS MODULE XIII Lecture - 38 Variable Selection and Model Building Dr. Shalabh Deartment of Mathematics and Statistics Indian Institute of Technology Kanur Evaluation of subset regression

More information

Linear diophantine equations for discrete tomography

Linear diophantine equations for discrete tomography Journal of X-Ray Science and Technology 10 001 59 66 59 IOS Press Linear diohantine euations for discrete tomograhy Yangbo Ye a,gewang b and Jiehua Zhu a a Deartment of Mathematics, The University of Iowa,

More information

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III

AI*IA 2003 Fusion of Multiple Pattern Classifiers PART III AI*IA 23 Fusion of Multile Pattern Classifiers PART III AI*IA 23 Tutorial on Fusion of Multile Pattern Classifiers by F. Roli 49 Methods for fusing multile classifiers Methods for fusing multile classifiers

More information

The non-stochastic multi-armed bandit problem

The non-stochastic multi-armed bandit problem Submitted for journal ublication. The non-stochastic multi-armed bandit roblem Peter Auer Institute for Theoretical Comuter Science Graz University of Technology A-8010 Graz (Austria) auer@igi.tu-graz.ac.at

More information

Chapter 10. Supplemental Text Material

Chapter 10. Supplemental Text Material Chater 1. Sulemental Tet Material S1-1. The Covariance Matri of the Regression Coefficients In Section 1-3 of the tetbook, we show that the least squares estimator of β in the linear regression model y=

More information

On split sample and randomized confidence intervals for binomial proportions

On split sample and randomized confidence intervals for binomial proportions On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have

More information

Version 1.0. klm. General Certificate of Education June Mathematics. Statistics 4. Mark Scheme

Version 1.0. klm. General Certificate of Education June Mathematics. Statistics 4. Mark Scheme Version.0 klm General Certificate of Education June 00 Mathematics MS04 Statistics 4 Mark Scheme Mark schemes are reared by the Princial Examiner and considered, together with the relevant questions, by

More information

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model

Shadow Computing: An Energy-Aware Fault Tolerant Computing Model Shadow Comuting: An Energy-Aware Fault Tolerant Comuting Model Bryan Mills, Taieb Znati, Rami Melhem Deartment of Comuter Science University of Pittsburgh (bmills, znati, melhem)@cs.itt.edu Index Terms

More information

Elementary Analysis in Q p

Elementary Analysis in Q p Elementary Analysis in Q Hannah Hutter, May Szedlák, Phili Wirth November 17, 2011 This reort follows very closely the book of Svetlana Katok 1. 1 Sequences and Series In this section we will see some

More information

Morten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014

Morten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014 Morten Frydenberg Section for Biostatistics Version :Friday, 05 Setember 204 All models are aroximations! The best model does not exist! Comlicated models needs a lot of data. lower your ambitions or get

More information

An Ant Colony Optimization Approach to the Probabilistic Traveling Salesman Problem

An Ant Colony Optimization Approach to the Probabilistic Traveling Salesman Problem An Ant Colony Otimization Aroach to the Probabilistic Traveling Salesman Problem Leonora Bianchi 1, Luca Maria Gambardella 1, and Marco Dorigo 2 1 IDSIA, Strada Cantonale Galleria 2, CH-6928 Manno, Switzerland

More information

On parameter estimation in deformable models

On parameter estimation in deformable models Downloaded from orbitdtudk on: Dec 7, 07 On arameter estimation in deformable models Fisker, Rune; Carstensen, Jens Michael Published in: Proceedings of the 4th International Conference on Pattern Recognition

More information

MULTIVARIATE STATISTICAL PROCESS OF HOTELLING S T CONTROL CHARTS PROCEDURES WITH INDUSTRIAL APPLICATION

MULTIVARIATE STATISTICAL PROCESS OF HOTELLING S T CONTROL CHARTS PROCEDURES WITH INDUSTRIAL APPLICATION Journal of Statistics: Advances in heory and Alications Volume 8, Number, 07, Pages -44 Available at htt://scientificadvances.co.in DOI: htt://dx.doi.org/0.864/jsata_700868 MULIVARIAE SAISICAL PROCESS

More information

Developing A Deterioration Probabilistic Model for Rail Wear

Developing A Deterioration Probabilistic Model for Rail Wear International Journal of Traffic and Transortation Engineering 2012, 1(2): 13-18 DOI: 10.5923/j.ijtte.20120102.02 Develoing A Deterioration Probabilistic Model for Rail Wear Jabbar-Ali Zakeri *, Shahrbanoo

More information

Keywords: pile, liquefaction, lateral spreading, analysis ABSTRACT

Keywords: pile, liquefaction, lateral spreading, analysis ABSTRACT Key arameters in seudo-static analysis of iles in liquefying sand Misko Cubrinovski Deartment of Civil Engineering, University of Canterbury, Christchurch 814, New Zealand Keywords: ile, liquefaction,

More information

STA 250: Statistics. Notes 7. Bayesian Approach to Statistics. Book chapters: 7.2

STA 250: Statistics. Notes 7. Bayesian Approach to Statistics. Book chapters: 7.2 STA 25: Statistics Notes 7. Bayesian Aroach to Statistics Book chaters: 7.2 1 From calibrating a rocedure to quantifying uncertainty We saw that the central idea of classical testing is to rovide a rigorous

More information

216 S. Chandrasearan and I.C.F. Isen Our results dier from those of Sun [14] in two asects: we assume that comuted eigenvalues or singular values are

216 S. Chandrasearan and I.C.F. Isen Our results dier from those of Sun [14] in two asects: we assume that comuted eigenvalues or singular values are Numer. Math. 68: 215{223 (1994) Numerische Mathemati c Sringer-Verlag 1994 Electronic Edition Bacward errors for eigenvalue and singular value decomositions? S. Chandrasearan??, I.C.F. Isen??? Deartment

More information

Chapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population

Chapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population Chater 7 and s Selecting a Samle Point Estimation Introduction to s of Proerties of Point Estimators Other Methods Introduction An element is the entity on which data are collected. A oulation is a collection

More information

Deriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V.

Deriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deriving ndicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deutsch Centre for Comutational Geostatistics Deartment of Civil &

More information

Hotelling s Two- Sample T 2

Hotelling s Two- Sample T 2 Chater 600 Hotelling s Two- Samle T Introduction This module calculates ower for the Hotelling s two-grou, T-squared (T) test statistic. Hotelling s T is an extension of the univariate two-samle t-test

More information

#A37 INTEGERS 15 (2015) NOTE ON A RESULT OF CHUNG ON WEIL TYPE SUMS

#A37 INTEGERS 15 (2015) NOTE ON A RESULT OF CHUNG ON WEIL TYPE SUMS #A37 INTEGERS 15 (2015) NOTE ON A RESULT OF CHUNG ON WEIL TYPE SUMS Norbert Hegyvári ELTE TTK, Eötvös University, Institute of Mathematics, Budaest, Hungary hegyvari@elte.hu François Hennecart Université

More information

ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION

ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION STATISTICA, anno LXVIII, n., 008 ESTIMATION OF THE RECIPROCAL OF THE MEAN OF THE INVERSE GAUSSIAN DISTRIBUTION WITH PRIOR INFORMATION 1. INTRODUCTION The Inverse Gaussian distribution was first introduced

More information

Research of power plant parameter based on the Principal Component Analysis method

Research of power plant parameter based on the Principal Component Analysis method Research of ower lant arameter based on the Princial Comonent Analysis method Yang Yang *a, Di Zhang b a b School of Engineering, Bohai University, Liaoning Jinzhou, 3; Liaoning Datang international Jinzhou

More information

HENSEL S LEMMA KEITH CONRAD

HENSEL S LEMMA KEITH CONRAD HENSEL S LEMMA KEITH CONRAD 1. Introduction In the -adic integers, congruences are aroximations: for a and b in Z, a b mod n is the same as a b 1/ n. Turning information modulo one ower of into similar

More information

Notes on Instrumental Variables Methods

Notes on Instrumental Variables Methods Notes on Instrumental Variables Methods Michele Pellizzari IGIER-Bocconi, IZA and frdb 1 The Instrumental Variable Estimator Instrumental variable estimation is the classical solution to the roblem of

More information

Approximating min-max k-clustering

Approximating min-max k-clustering Aroximating min-max k-clustering Asaf Levin July 24, 2007 Abstract We consider the roblems of set artitioning into k clusters with minimum total cost and minimum of the maximum cost of a cluster. The cost

More information

SAS for Bayesian Mediation Analysis

SAS for Bayesian Mediation Analysis Paer 1569-2014 SAS for Bayesian Mediation Analysis Miočević Milica, Arizona State University; David P. MacKinnon, Arizona State University ABSTRACT Recent statistical mediation analysis research focuses

More information

DETC2003/DAC AN EFFICIENT ALGORITHM FOR CONSTRUCTING OPTIMAL DESIGN OF COMPUTER EXPERIMENTS

DETC2003/DAC AN EFFICIENT ALGORITHM FOR CONSTRUCTING OPTIMAL DESIGN OF COMPUTER EXPERIMENTS Proceedings of DETC 03 ASME 003 Design Engineering Technical Conferences and Comuters and Information in Engineering Conference Chicago, Illinois USA, Setember -6, 003 DETC003/DAC-48760 AN EFFICIENT ALGORITHM

More information

On the Toppling of a Sand Pile

On the Toppling of a Sand Pile Discrete Mathematics and Theoretical Comuter Science Proceedings AA (DM-CCG), 2001, 275 286 On the Toling of a Sand Pile Jean-Christohe Novelli 1 and Dominique Rossin 2 1 CNRS, LIFL, Bâtiment M3, Université

More information

Unsupervised Hyperspectral Image Analysis Using Independent Component Analysis (ICA)

Unsupervised Hyperspectral Image Analysis Using Independent Component Analysis (ICA) Unsuervised Hyersectral Image Analysis Using Indeendent Comonent Analysis (ICA) Shao-Shan Chiang Chein-I Chang Irving W. Ginsberg Remote Sensing Signal and Image Processing Laboratory Deartment of Comuter

More information

Analysis of some entrance probabilities for killed birth-death processes

Analysis of some entrance probabilities for killed birth-death processes Analysis of some entrance robabilities for killed birth-death rocesses Master s Thesis O.J.G. van der Velde Suervisor: Dr. F.M. Sieksma July 5, 207 Mathematical Institute, Leiden University Contents Introduction

More information

arxiv: v3 [physics.data-an] 23 May 2011

arxiv: v3 [physics.data-an] 23 May 2011 Date: October, 8 arxiv:.7v [hysics.data-an] May -values for Model Evaluation F. Beaujean, A. Caldwell, D. Kollár, K. Kröninger Max-Planck-Institut für Physik, München, Germany CERN, Geneva, Switzerland

More information

AN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES

AN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES AN OPTIMAL CONTROL CHART FOR NON-NORMAL PROCESSES Emmanuel Duclos, Maurice Pillet To cite this version: Emmanuel Duclos, Maurice Pillet. AN OPTIMAL CONTROL CHART FOR NON-NORMAL PRO- CESSES. st IFAC Worsho

More information

Named Entity Recognition using Maximum Entropy Model SEEM5680

Named Entity Recognition using Maximum Entropy Model SEEM5680 Named Entity Recognition using Maximum Entroy Model SEEM5680 Named Entity Recognition System Named Entity Recognition (NER): Identifying certain hrases/word sequences in a free text. Generally it involves

More information

Spectral Analysis by Stationary Time Series Modeling

Spectral Analysis by Stationary Time Series Modeling Chater 6 Sectral Analysis by Stationary Time Series Modeling Choosing a arametric model among all the existing models is by itself a difficult roblem. Generally, this is a riori information about the signal

More information

arxiv:cond-mat/ v2 25 Sep 2002

arxiv:cond-mat/ v2 25 Sep 2002 Energy fluctuations at the multicritical oint in two-dimensional sin glasses arxiv:cond-mat/0207694 v2 25 Se 2002 1. Introduction Hidetoshi Nishimori, Cyril Falvo and Yukiyasu Ozeki Deartment of Physics,

More information

Desingularization Explains Order-Degree Curves for Ore Operators

Desingularization Explains Order-Degree Curves for Ore Operators Desingularization Exlains Order-Degree Curves for Ore Oerators Shaoshi Chen Det. of Mathematics / NCSU Raleigh, NC 27695, USA schen2@ncsu.edu Maximilian Jaroschek RISC / Joh. Keler University 4040 Linz,

More information

Probability Estimates for Multi-class Classification by Pairwise Coupling

Probability Estimates for Multi-class Classification by Pairwise Coupling Probability Estimates for Multi-class Classification by Pairwise Couling Ting-Fan Wu Chih-Jen Lin Deartment of Comuter Science National Taiwan University Taiei 06, Taiwan Ruby C. Weng Deartment of Statistics

More information

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation

Paper C Exact Volume Balance Versus Exact Mass Balance in Compositional Reservoir Simulation Paer C Exact Volume Balance Versus Exact Mass Balance in Comositional Reservoir Simulation Submitted to Comutational Geosciences, December 2005. Exact Volume Balance Versus Exact Mass Balance in Comositional

More information

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning

Uncorrelated Multilinear Principal Component Analysis for Unsupervised Multilinear Subspace Learning TNN-2009-P-1186.R2 1 Uncorrelated Multilinear Princial Comonent Analysis for Unsuervised Multilinear Subsace Learning Haiing Lu, K. N. Plataniotis and A. N. Venetsanooulos The Edward S. Rogers Sr. Deartment

More information

ON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS

ON THE LEAST SIGNIFICANT p ADIC DIGITS OF CERTAIN LUCAS NUMBERS #A13 INTEGERS 14 (014) ON THE LEAST SIGNIFICANT ADIC DIGITS OF CERTAIN LUCAS NUMBERS Tamás Lengyel Deartment of Mathematics, Occidental College, Los Angeles, California lengyel@oxy.edu Received: 6/13/13,

More information

GOOD MODELS FOR CUBIC SURFACES. 1. Introduction

GOOD MODELS FOR CUBIC SURFACES. 1. Introduction GOOD MODELS FOR CUBIC SURFACES ANDREAS-STEPHAN ELSENHANS Abstract. This article describes an algorithm for finding a model of a hyersurface with small coefficients. It is shown that the aroach works in

More information

Statistics II Logistic Regression. So far... Two-way repeated measures ANOVA: an example. RM-ANOVA example: the data after log transform

Statistics II Logistic Regression. So far... Two-way repeated measures ANOVA: an example. RM-ANOVA example: the data after log transform Statistics II Logistic Regression Çağrı Çöltekin Exam date & time: June 21, 10:00 13:00 (The same day/time lanned at the beginning of the semester) University of Groningen, Det of Information Science May

More information

Correspondence Between Fractal-Wavelet. Transforms and Iterated Function Systems. With Grey Level Maps. F. Mendivil and E.R.

Correspondence Between Fractal-Wavelet. Transforms and Iterated Function Systems. With Grey Level Maps. F. Mendivil and E.R. 1 Corresondence Between Fractal-Wavelet Transforms and Iterated Function Systems With Grey Level Mas F. Mendivil and E.R. Vrscay Deartment of Alied Mathematics Faculty of Mathematics University of Waterloo

More information

Lower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data

Lower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data Quality Technology & Quantitative Management Vol. 1, No.,. 51-65, 15 QTQM IAQM 15 Lower onfidence Bound for Process-Yield Index with Autocorrelated Process Data Fu-Kwun Wang * and Yeneneh Tamirat Deartment

More information

On Wald-Type Optimal Stopping for Brownian Motion

On Wald-Type Optimal Stopping for Brownian Motion J Al Probab Vol 34, No 1, 1997, (66-73) Prerint Ser No 1, 1994, Math Inst Aarhus On Wald-Tye Otimal Stoing for Brownian Motion S RAVRSN and PSKIR The solution is resented to all otimal stoing roblems of

More information

An Inverse Problem for Two Spectra of Complex Finite Jacobi Matrices

An Inverse Problem for Two Spectra of Complex Finite Jacobi Matrices Coyright 202 Tech Science Press CMES, vol.86, no.4,.30-39, 202 An Inverse Problem for Two Sectra of Comlex Finite Jacobi Matrices Gusein Sh. Guseinov Abstract: This aer deals with the inverse sectral roblem

More information

Pell's Equation and Fundamental Units Pell's equation was first introduced to me in the number theory class at Caltech that I never comleted. It was r

Pell's Equation and Fundamental Units Pell's equation was first introduced to me in the number theory class at Caltech that I never comleted. It was r Pell's Equation and Fundamental Units Kaisa Taiale University of Minnesota Summer 000 1 Pell's Equation and Fundamental Units Pell's equation was first introduced to me in the number theory class at Caltech

More information

SIMULATED ANNEALING AND JOINT MANUFACTURING BATCH-SIZING. Ruhul SARKER. Xin YAO

SIMULATED ANNEALING AND JOINT MANUFACTURING BATCH-SIZING. Ruhul SARKER. Xin YAO Yugoslav Journal of Oerations Research 13 (003), Number, 45-59 SIMULATED ANNEALING AND JOINT MANUFACTURING BATCH-SIZING Ruhul SARKER School of Comuter Science, The University of New South Wales, ADFA,

More information

2. Sample representativeness. That means some type of probability/random sampling.

2. Sample representativeness. That means some type of probability/random sampling. 1 Neuendorf Cluster Analysis Assumes: 1. Actually, any level of measurement (nominal, ordinal, interval/ratio) is accetable for certain tyes of clustering. The tyical methods, though, require metric (I/R)

More information

CHAPTER 5 TANGENT VECTORS

CHAPTER 5 TANGENT VECTORS CHAPTER 5 TANGENT VECTORS In R n tangent vectors can be viewed from two ersectives (1) they cature the infinitesimal movement along a ath, the direction, and () they oerate on functions by directional

More information

Using Factor Analysis to Study the Effecting Factor on Traffic Accidents

Using Factor Analysis to Study the Effecting Factor on Traffic Accidents Using Factor Analysis to Study the Effecting Factor on Traffic Accidents Abstract Layla A. Ahmed Deartment of Mathematics, College of Education, University of Garmian, Kurdistan Region Iraq This aer is

More information

VIBRATION ANALYSIS OF BEAMS WITH MULTIPLE CONSTRAINED LAYER DAMPING PATCHES

VIBRATION ANALYSIS OF BEAMS WITH MULTIPLE CONSTRAINED LAYER DAMPING PATCHES Journal of Sound and Vibration (998) 22(5), 78 85 VIBRATION ANALYSIS OF BEAMS WITH MULTIPLE CONSTRAINED LAYER DAMPING PATCHES Acoustics and Dynamics Laboratory, Deartment of Mechanical Engineering, The

More information

A SIMPLE PLASTICITY MODEL FOR PREDICTING TRANSVERSE COMPOSITE RESPONSE AND FAILURE

A SIMPLE PLASTICITY MODEL FOR PREDICTING TRANSVERSE COMPOSITE RESPONSE AND FAILURE THE 19 TH INTERNATIONAL CONFERENCE ON COMPOSITE MATERIALS A SIMPLE PLASTICITY MODEL FOR PREDICTING TRANSVERSE COMPOSITE RESPONSE AND FAILURE K.W. Gan*, M.R. Wisnom, S.R. Hallett, G. Allegri Advanced Comosites

More information

FE FORMULATIONS FOR PLASTICITY

FE FORMULATIONS FOR PLASTICITY G These slides are designed based on the book: Finite Elements in Plasticity Theory and Practice, D.R.J. Owen and E. Hinton, 1970, Pineridge Press Ltd., Swansea, UK. 1 Course Content: A INTRODUCTION AND

More information

Principal Components Analysis and Unsupervised Hebbian Learning

Principal Components Analysis and Unsupervised Hebbian Learning Princial Comonents Analysis and Unsuervised Hebbian Learning Robert Jacobs Deartment of Brain & Cognitive Sciences University of Rochester Rochester, NY 1467, USA August 8, 008 Reference: Much of the material

More information

INTRODUCTION. Please write to us at if you have any comments or ideas. We love to hear from you.

INTRODUCTION. Please write to us at if you have any comments or ideas. We love to hear from you. Casio FX-570ES One-Page Wonder INTRODUCTION Welcome to the world of Casio s Natural Dislay scientific calculators. Our exeriences of working with eole have us understand more about obstacles eole face

More information

Estimating function analysis for a class of Tweedie regression models

Estimating function analysis for a class of Tweedie regression models Title Estimating function analysis for a class of Tweedie regression models Author Wagner Hugo Bonat Deartamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal

More information

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley

Elements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the

More information

Machine Learning: Homework 4

Machine Learning: Homework 4 10-601 Machine Learning: Homework 4 Due 5.m. Monday, February 16, 2015 Instructions Late homework olicy: Homework is worth full credit if submitted before the due date, half credit during the next 48 hours,

More information

Asymptotically Optimal Simulation Allocation under Dependent Sampling

Asymptotically Optimal Simulation Allocation under Dependent Sampling Asymtotically Otimal Simulation Allocation under Deendent Samling Xiaoing Xiong The Robert H. Smith School of Business, University of Maryland, College Park, MD 20742-1815, USA, xiaoingx@yahoo.com Sandee

More information

Information collection on a graph

Information collection on a graph Information collection on a grah Ilya O. Ryzhov Warren Powell February 10, 2010 Abstract We derive a knowledge gradient olicy for an otimal learning roblem on a grah, in which we use sequential measurements

More information

How to Estimate Expected Shortfall When Probabilities Are Known with Interval or Fuzzy Uncertainty

How to Estimate Expected Shortfall When Probabilities Are Known with Interval or Fuzzy Uncertainty How to Estimate Exected Shortfall When Probabilities Are Known with Interval or Fuzzy Uncertainty Christian Servin Information Technology Deartment El Paso Community College El Paso, TX 7995, USA cservin@gmail.com

More information