arxiv: v1 [stat.me] 3 Nov 2015
|
|
- Loren Hampton
- 6 years ago
- Views:
Transcription
1 The Unbiasedness Approach to Linear Regression Models arxiv:5.0096v [stat.me] 3 Nov 205 P. Vellaisamy Department of Mathematics, Indian Institute of Technology Bombay, Powai, Mumbai , India. Abstract The linear regression models are widely used statistical techniques in numerous practical applications. The standard regression model requires several assumptions about the regressors and the error term. The regression parameters are estimated using the least-squares method. In this paper, we consider the regression model with arbitrary regressors and without the error term. An explicit expression for the regression parameters vector is obtained. The unbiasedness approach is used to estimate the regression parameters and its various properties are investigated. It is shown that the resulting unbiased estimator equals the least-squares estimator for the fixed design model. The analysis of residuals and the regression sum of squares can be carried out in a natural way. The unbiased estimator of the dispersion matrix of the unbiased estimator is also obtained. Applications to AR(p) model and numerical examples are also discussed. Keywords. Linear regression, regression coefficients, unbiased estimator, least-squares estimator, autoregressive model. Introduction The linear regression model is a commonly used statistical technique in practical applications (Quenonuille (950)), because of its simplicity and its realistic nature for modeling several phenomena. For an extensive treatment of this topic, one may refer to Draper and Smith (2003) and Chatterji and Hadi (2003). In the standard multiple linear regression model Y β X +...,β p X p +ǫ, it is generally assumed that the predictor variables X,...,X p are constant (non-stochastic) and also the error term ǫ is independent of predictor variables (see, for example, Yan and Su (2009)), especially for the estimation of regression coefficients. This means that the values of X (X,...,X p ) are controlled by an experimenter and when the experiment is repeated the values of X does not change, but Y changes as ǫ changes. These assumptions are not realistic in practice and in most experimental setups and especially in econometric situations where the changes in Y occurs mainly due to the changes in X. Even
2 2 under the stochastic regressors models, it is commonly assumed that X i s are independent of the ǫ. It is known that when one of the X i is correlated with ǫ i, the ordinary least-squares (OLS) estimator becomes biased and inconsistent as well. To overcome such difficulties, an instrumental variable (IV) Z, which is highly correlated with X i and is independent of ǫ i, is discussedintheliterature. But, findingasuitableivisalsoanissue. Also, OLSestimator has smaller variance with IV estimator, though it may have bias larger that that of IV estimator. In this paper, we consider the linear regression model with little assumptions about the predictors and treat them as random. We do not explicitly introduce the error term and thus avoid not only the associated probabilistic assumptions, but also the related issues mentioned above. Besides, we do not use the least-squares method for estimating the regression coefficients which requires vector differentiation and related minimization problems. In Section 2, we consider the linear regression model with stochastic regressors and obtain first an explicit relationship between the regression coefficients and the characteristics of the distribution of (Y,X,...,X p ). In Section 3, we obtain the unbiased estimators, using the relationship between regression coefficients. Our approach is semi-parametric in nature and does not assume any specific distribution, such as multivariate normal, which is commonly used in literature. Note that the unbiased estimator coincides with OLS estimator, as they are conditionally unbiased. Several known properties of the unbiased estimators are established using our approach. The dispersion matrix of the estimator is derived and also its unbiased estimator is also observed. The mean regression sum of squares and also the mean residual sum of squares are derived in a natural way. Using these results, a predictor of E(Y) based on a new observation is also pointed out. The method is then extended to autoregressive AR(p) model also. Finally, we discuss two examples, one based on simulated data and other one based on a real-life data, to show that that our approach yields the same least-squares estimates for the AR(3) model. 2 Multiple Linear Regression Model Let Y be the response or dependent variable, and X,X 2,...,X p be p arbitrary explanatory (predictor variables) which are not necessarily independent. Consider the class C of linear regression model defined by E(Y X,...,X p ) b 0 +b X +...+b p X p b 0 +X (p) b (p),(say), (2.) where X (p) (X,...,X p ) is the row-vector of predictor variables, and b (p) (b,...,b p ) t is the column-vector of regression coefficients. Note that model (2.) is rather general and
3 3 includes the class C ǫ of the usual linear regression models, defined by Y b 0 +b X +...+b p X p +ǫ, (2.2) where ǫ is the error term independent of the X i s. For example, let Z (X,X 2 ), E(Z) 0 and consider the model Y b X +b 2 X 2 +b 3 X X 2 Z. (2.3) Then E(Y X,X 2 ) b X + b 2 X 2 which belongs to C, but does not belong to C ǫ. Thus, C ǫ C. As another example, consider the autoregressive AR(p) process defined by X t φ 0 +φ X t +φ 2 X t φ p X t p +ǫ, (2.4) where the error term is independent of the X i, (t p) i t. Then E(X t X,...,X t ) φ 0 +φ X t +φ 2 X t φ p X t p which is again of the from given in (2.). Note here the X i are not independent. Thus, AR(p) process belongs to C ǫ and hence also to C. Our aim here is to consider the linear regression model defined in (2.) and the estimation of regression coefficients in a natural and simple way, with hardly a few assumptions on the model. For that purpose, we require the relationship among regression coefficients. 2. Relationships Among Regression Coefficients Let Y 0 (Y b 0 ) so that from (2.) E(Y 0 X (p) ) X (p) b (p) ; E(Y 0 ) E(X (p) )b (p), (2.5) where E(X (p) ) (EX,...,EX p ) is the row vector of the means of X i s. First our aim is to obtain an explicit representation for the regression coefficients. From (2.5), we get E(Y 0 X X (p) ) X (p) X b (p) ; E(Y 0 X ) E(X (p) X )b (p). (2.6) Hence, from (2.5) and (2.6) Cov(Y 0,X ) E(X (p) X )b (p) E(X (p) )E(X )b (p) Cov(X (p),x )b (p), (2.7) where Cov(X (p),x ) ( Cov(X,X ),Cov(X 2,X ),...,Cov(X (p),x ) ) is the row-vector of covariances. Similarly, Cov(Y 0,X i ) Cov(X (p),x i )b (p), 2 i p.
4 4 Since Cov(Y 0,X i ) Cov(Y,X i ), we can represent the above observations in a matrix form as, Cov(Y,X ) C,C 2,...,C p b Cov(Y,X 2 ) C 2,C 22,...,C p2 b 2, (2.8)... Cov(Y,X p ) C p,c 2p,...,C pp where C ij Cov(X i,x j ), for i,j p. Let now Cyx t (C y,c y2,...,c yp ) (Cov(Y,X ),...,Cov(Y,X p ); C xx (C ij ). (2.9) Then equation (2.8) can be written compactly as C yx C xx b (p), (2.0) which gives the relationship between regression coefficients and the associated covariances. Also from (2.5), we obtain b 0 E(Y) E(X (p) )b (p). (2.) Thus, we have proved the following result. Theorem 2. Let E(Y X,...X p ) b 0 + X (p) b (p) be the multiple linear regression model defined as in (2.). Let Cyx t (C y,c y2,...,c yp ), with C yj Cov(Y,X j ), and C xx (C ij ) with C ij Cov(X i,x j ), i,j p. Then the regression coefficients b 0 and b (p) satisfy C yx C xx b (p), and (2.2) b p where µ y E(Y) and µ (p) (EX,,EX p ) (µ,...,µ p ). When C xx is non-singular, we have a unique representation as b 0 µ y µ (p) b (p), (2.3) b (p) C xx C yx. (2.4) If C xx is singular, a g-inverse C xx may be used to obtain a representation for b (p). That is, using Jordan decomposition, let C xx PDP t, where P is an orthogonal matrix, and D diag[λ,...,λ p ] is a diagonal matrix. Then a g-inverse of C xx is C xx PDP, where D diag[λ,...,λ p ], and λ i λ i if λ i > 0, and is zero if λ i 0.
5 5 Remarks 2. Observe that when p and E(Y X ) b 0 +b X, the regression coefficient b is given by a known result. b C y C Cov(Y,X ) Var(X ), Similarly, from (2.4), the regression coefficients for the case p 2 are given by b C 22C y C 2 C y2 C C 22 C2 2 ; b 2 C C y2 C 2 C y C C 22 C2 2 (2.5) and the intercept b 0 can be evaluated using (2.3) and (2.5). Remarks 2.2 The following properties of the model in (2.) follow now easily. (i) It is known that f(x) E(Y X) minimizes E(Y f(x)) 2. This shows that E(Y X) X (p) b (p) minimizes the mean squared error whenx (p) b (p) is treated as an approximation to Y. (ii) Also, when C xx > 0 (positive definite) Cov(Y X (p) b (p),x (p) ) Cov(Y,X (p) ) Cov(X (p) b (p),x (p) ) C t yx bt (p) Cov(X (p),x (p) ) C t yx bt (p) C xx C t yx C t yxc xxc xx 0. (2.6) Indeed, we have from (2.6), Cov(Y X (p) b (p),x j ) 0, i j p Cov(Y X (p) b (p),x (p) d (q) ) 0 (2.7) for any vector d (q) (d,...,d q ). In particular, we have Cov(Y X (p) b (p),x (p) b (p) ) 0. (2.8) Also, Var(Y) Var(Y X (p) b (p) +X (p) b (p) ) Var(Y X (p) b (p) )+Var(X (p) b (p) ), because of (2.8).
6 6 3 Estimation of Regression Coefficients We start with a simpleknown result. Let (Y,X) beabivariate randomvector with with finite second moments and σ yx Cov(Y,X). Suppose (Y,X ),...,(Y n,x n ) is a random sample on the bivariate vector (Y,X). Then it is well known that is an unbiased estimator of σ yx. S yx n (Y i Y)(X i X i ) (3.) i Consider now the estimation of regression coefficients b 0 and b (p). Supposewe have a random sample (Y j,x j,...,x pj ), i n, of size n on (Y,X,X 2...,X p ). Let Y Y n. Y Y j ; X. X (p). X (p)j. X (p)n X X 2...X i...x p. X j X 2j...,X ij...x pj, (3.2). X n X 2n...X in,...x pn where X (p)j (X j,x 2j,...,X pj ) denotes the row-vector of j-th observation on predictor variables and X ij denotes the j-th observation on X i, i p, j n. Then, from (2.), E(Y j X)) E(Y j X (p)j ) b 0 +X (p)j b (p), j n. (3.3) Let Y n j Y j/n, and X i n j X ij/n, i p be the sample means. Also, let S yi n (Y j Y)(X ij X i ) n j (X ij X i )Y j (3.4) j be the sample covariance between Y and X i and S lm n (X lj X l )(X mj X m ) (3.5) j be the sample covariances between X l and X m, l,m p. Then by (3.), we have E(S yi ) C yi ; E(S lm ) C lm, (3.6) where C yi Cov(Y,X i ) and C lm Cov(X l,x m ), l,m n. Let S t yx (S y,s y2,...,s yp ) be the row-vector and S xx (S ij ) be the (p p) matrix of sample covariances of explanatory variables X,X 2,...,X p. Then from (3.6), E(S yx ) C yx ; E(S xx ) C xx, (3.7)
7 7 where C xx (C ij ) defined in (2.8). Using Theorem 2., an estimator ˆb (p) of the regression coefficient vector b (p) can be obtained by solving S xxˆb(p) S yx, (3.8) which is obtained by replacing the unbiased estimator of C yx and C xx in (2.2). When S xx is nonsingular, a.e, an explicit and unique estimator of b (p) is Similarly, an estimator of b 0 can be obtained from (2.3) as, where X (p) (X,X 2,...,X p ). ˆb(p) S xx S yx. (3.9) ˆb0 ˆµ y µ (p) b(p) Y X (p) b(p), (3.0) For completeness, we next summarize the results discussed above. Theorem 3. Let E(Y X (p) ) b 0 +X (p) b (p) and (Y j,x j,...,x pj ), j n, be a random sample of size n on (Y,X,...,X p ). Let S yx (S y,s y2,...,s yp ) t be the column vector of sample covariances between Y and X i s, and S xx (S ij ) be the sample covariance matrix of X i s. Let S xx be non-singular a.e. Then the natural estimators of b (p) and b 0 are ˆb(p) S xx S yx, and (3.) b0 Y X (p) b(p), (3.2) where Y and X (p) (X,...,X p ) denote the sample means. Remark 3. (i) When p and b b, we have b S yx /S xx, and b 0 Y X b, which coincide with the classical least-squares estimators for the simple linear regression model. (ii) From (3.8), b p (p) satisfies in general S xx b(p) S yx. It can be shown that S yx Cyx, as p p n, and S xx Cxx, as a special case. Thus, by Slutsky s theorem, b (p) b(p), a solution of the defining equation (2.2). Hence, b (p) is a consistent estimator for b (p). (iii) Note the natural estimators b (p) and b 0 are obtained by substituting the unbiased estimator of C yx,c xx and µ y and µ (p) in the defining equations (2.2) and (2.3). The next result shows that the natural estimators, given in Theorem 3., are indeed unbiased. Let henceforth X X (n,p) (X(n) t,...,xt p(n) ) (X ij) be the matrix of observations (see (3.2)), where X i(n) (X i,...,x in ) denotes the row-vector of n observations on X i. Theorem 3.2 Let S xx be non-singular a.e. Then the estimators b (p) and b 0 are both conditionally unbiased and hence are unbiased.
8 8 Proof. First note that, for i p, E(S yi X) E ( (X ij X i )Y j X ) (n ) (n ) (n ) j (X ij X i )E(Y j X) j ( (X ij X i ) b 0 + j ) p X rj b r r (from (3.3)) ( p ) (X ij X i ) X rj b r (n ) j r p (X ij X i )X rj b r (interchanging the order) (n ) r p (S ir )b r r S i(p) b (p), where S i(p) (S i,...,s ip ). Hence, j E(S yx X) S xx b (p) E(S xxs yx X) E(ˆb (p) X) b (p), (3.3) showing that ˆb (p) is conditionally unbiased. Similarly, E( b 0 X) E(Y X) X (p) E(ˆb (p) X) (b 0 +b X +...+b p X p X) X (p) b (p) (using (3.3)) b 0 +X (p) b (p) X (p) b (p) and so b 0 is also conditionally unbiased. b 0, (3.4) Hence, it follows, from (3.3) and (3.4), that E(ˆb (p) ) b (p) ; E(ˆb 0 ) b 0, proving the result. Remark 3.2 (i) Let now S xx 0 a.e. (positive semidefinite) and p >. Then, the estimator b(p) satisfies S xx b(p) S yx, for all (Y,X) E(S xx b(p) X) E(S yx X), for all X ( ) S xx E( b (p) X) b (p) 0, for all X,
9 9 using (3.3). Since S xx 0 a.e, we have E( b (p) X) b (p) a.e and so E( b (p) ) b (p), showing that b (p) is unbiased in this case also. It is known that if E( θ i ) θ i,i,2, then E( θ 2 / θ ) (θ 2 /θ ) may not hold in general. However, it holds in the linear regression case, in view of the following general result on a property of unbiased estimators. Let L(X) denote the distribution of X. Lemma 3. Let θ be a characteristic of L(X), θ 2 be a characteristic of L(Y,X) and θ (θ 2 /θ ). Let θ θ (X) and θ 2 θ 2 (Y,X) be respectively estimators of θ and θ 2 such that θ > 0 a.e and E( θ 2 X) θ θ a.e. Then θ ( θ 2 / θ ) is also unbiased for θ. Proof. Since E(ˆθ) E(E(ˆθ X)) E ( E )) ( ) ( ) ( θ2 X E E( θ 2 X) E θ θ θ, (3.5) θ θ θ the result follows. Remark 3.3 (i) Note in Lemma 3., θ is not necessarily unbiased. If in addition it is unbiased, then we have E ) ( θ2 θ θ 2 θ E( θ 2 ) E( θ ), an interesting result. This is indeed the case for the regression coefficient b, as seen next. (ii) Let p and n 2 so that E(Y X) b 0 +b X. Then, as seen earlier, b C yx C xx θ 2 θ (say); ˆb S yx S xx, where E(S yx ) C yx,e(s xx ) C xx. Also, by Theorem 3.2 E(ˆb ) E( S yx S xx ) C yx C xx b. The above result holds because the conditions S xx > 0 a.e. and from (3.3) of Lemma 3.2 are satisfied. E( θ 2 X) E(S yx X) S xx b θ b Let b t (b 0,b (p) ) denote the row-vector of regression coefficients and n denote the n- dimensional column vector with all its entries equal to. The least-squares estimator of b, when X is fixed (called fixed design) is bl (X t X ) X t Y, (3.6) where X ( n.x), Y t (Y,...,Y n ) and X is defined in (3.2). We now have following result.
10 0 Theorem 3.3 Let p and S xx > 0a.e. Then the unbiased estimator b t u ( b 0, b (p) ) t, defined in Theorem 3., coincides with the least squares estimator b l, defined in (3.6), for the fixed design. Proof. Note first that ( ) ( ) X t t X n n t nx n nx(p) X t n X t X X t X nx t (p) Using the formula (see Seber(984), p. 59) for the inverse of a partitioned matrix, namely, ( ) ( ) A B A +FG F t FG, B t D G F t G where F A B and G D B t F, we obtain (X t X ) n + (n ) X (p)sxxx t (p) (n ) S xxx t (p) (n ) X (p)sxx (n ) S xx (3.7) Note for example that the (i,j)- th element of the matrix G (X t X) nx t (p)x (p) is G ij X ir X jr nx i X j (n )S ir. r Hence, G (n )S xx. The other elements of the matrix in the rhs of (3.7) can easily be computed. Therefore, (X t X ) X t Y n + (n ) X (p)sxx Xt (p) (n ) S xx Xt (p) (n ) X (p)sxx (n ) S xx Y + (n ) X (p)sxx(nx t (p)y X t Y) (n ) S xx Xt Y n (n ) S xx Xt (p) Y ( Y X(p) Sxx S ) yx, SxxS yx using the fact (X t Y nx (p) Y) (n )S yx. The result now follows. We next find the covariance matrix D( b u ) Cov( b u, b u ) of b u ( b 0, b (p) ) t. ( ) n Y X t Y Theorem 3.4 Assume, in the model (2.), Var(Y X (p) ) σy x 2, which does not depend on X (p), and Sxx exists. Then the covariance matrix of b u is D( b u ) σy x 2 E(Xt X ) σy x 2 E n + (n ) X (p)sxx Xt (p) (n ) S xx Xt (p) (n ) X (p)sxx (n ) S xx (3.8)
11 Proof. Since E( b u X) b, we have D( b u ) E(D( b u X)) ( ) Var( b0 X) Cov( b 0, b (p) X) E (Cov( b 0, b (p) X)) t D( b (p) X) (3.9) First we obtain Observe now, Cov(S yr,s ys X) Cov D( b (p) X) D(S xxs yx X) (n ) S xx D(S yx X)S xx. (3.20) (X rj X r )Y j, (n ) j (X sk X s )Y k X (n ) 2 (X rj X r )(X sk X s )Cov((Y j,y k ) X). (3.2) j,k k Since Var(Y X (p) ) σ 2 y x, we have Cov((Y j,y k ) X) Hence, for r,s {,2,...,p}, { σ 2 y x, if k j 0, otherwise. and Cov((S yr,s ys ) X) Hence, we obtain from (3.20) σy x 2 (n ) 2 j (X rj X r )(X sj X s ) σ2 y x (n ) S rs D(S yx X) σ2 y x (n ) S xx. (3.22) D( b (p) X) σ2 y x (n ) S xx. (3.23) We obtain next Var( b 0 X). Note Var( b 0 X) Var(Y X (p) b(p) X) Var(Y X)+Var(X (p) b(p) X) 2Cov(Y,X (p) b(p) X). (3.24) By assumption, Var(Y X) σ2 y x n (3.25)
12 2 and using (3.23) Var(X (p) b(p) X) XD( b (p) X)X t (p) σ2 y x (n ) X (p)s xxx t (p). (3.26) We next show that Cov(Y,X (p) b(p) X) 0. Since we can write S yk S yx (n ) (n ) (X kj X k )Y j, j (X (p)j X (p) ) t Y j, where X (p)j (X j,x 2j,...,X pj ), and X (p) (X,X 2,...,X p ), as before. As Y j s are iid, we have Cov(Y,X (p) b(p) X) Cov( n j j Y j, (n ) X (p)sxx (X (p)j X (p) ) t Y j X) j Cov( n Y j, (n ) X (p)sxx(x (p)j X (p) ) t X) j n(n ) X (p)s xx (X (p)j X (p) ) t Cov(Y j,y j X) σy x 2 n(n ) X (p)sxx (X (p)j X (p) ) t j 0. (3.27) Therefore, from (3.24) (3.26), we have ( ) Var( b 0 X) σy x 2 n + (n ) X (p)sxxx t (p). (3.28) Finally, we compute Cov( b 0, b (p) X) ( ) Cov( b 0,e t b (p) X),...,Cov( b 0,e t p b p X), (3.29) where e t j is the ( p) row vector whose j-th entry is and all other entries are zeros. Note, for r p, Cov( b 0,e t r b (p) X) Cov(Y,e t r b (p) X) Cov(X (p) b(p),e t r b (p) X). (3.30) It can be shown, as in (3.27), that Cov(Y,e t r b (p) X) 0. (3.3)
13 3 Also, for r p, Hence, Cov(X (p) b(p),e t j b (p) X) X (p) D( b (p) X)e r σ2 y x (n ) X (p)s xxe r. (3.32) Cov( b 0, b (p) X) σ2 y x (n ) X (p)s xx (e,e 2,...,e p ) σ2 y x (n ) X (p)s xx. (3.33) Substituting (3.28),(3.33) and (3.23) in (3.9) and using (3.7), the result follows. 3. Analysis of Residuals and Regression Sum of Squares Consider the model (3.3) defined by E(Y j X (p)j ) b 0 +X (p)j b (p), Var(Y j X (p)j ) σy x 2, j n. (3.34) Assume n > (p+), and let b t (b 0,b (p) ), as before. Then the above model can be written as E(Y X ) X b; D(Y X ) σ 2 y x I n, (3.35) where Y t (Y,...,Y n ), X ( n.x) and I n is the identity matrix of order n. Then, in view of Theorem 3.3, bu (X t X ) X t Y. Let e (Y Ŷ), where Ŷ X b u, denote the residual vector. Then e [ I n X (X t X ) X t ] Y MY (say), where M M(X) [ I n X (X t X ) X t ] is symmetric and idempotent. It is also well known (see Draper and Smith (2002)) that tr(m) (n p ) and MX 0 a.s. Note also that e t e Y t MY and hence the the mean residual sum of squares is MSS E E(e t e) E { E(Y t MY X ) } E { (E(Y X )) t M(E(Y X ))+tr(md(y X )) } { } E b t X t MX b+tr(m)σy x 2 I n σ 2 y x tr(m) Thus, an unbiased estimator of σ 2 y x is (n p )σ 2 y x. (3.36) σ 2 y x SS E (n p ) e t e (n p ). (3.37)
14 4 Indeed, we have (see (3.36)) shown that so that it is also conditionally unbiased. E( σ 2 y x X ) σ 2 y x (3.38) Theorem 3.5 Let D( b u ), given in (3.8), be the dispersion matrix unbiased estimator b u. The its unbiased estimator is given by D( b u ) σ 2 y x(x t X ), (3.39) where σ 2 y x is defined in (3.37). Proof. Using (3.38), we get E( D( b u )) E(E( σ 2 y x(x t X ) X )) E(E( σ 2 y x X )(X t X ) ) E(σy x 2 (Xt X ) ) σy x 2 E((Xt X ) ) D( b u ) and hence the result follows. 3.2 Mean Regression Sum of Squares Note first that X t e Xt MY (MX ) t Y 0 Ŷt e 0 a.s. In particular, we have t ne 0 Y Ŷa.s. In addition, Cov( b u,e X ) Cov((X t X ) X t Y,Y X ) Cov( b u,x b u X ) (X t X ) X t D(Y X ) D( b u X )X t σy x 2 (Xt X ) X t σy x 2 (Xt X ) X t 0. (3.40) Hence, Cov( b u,e) E ( ) ( Cov( b u,e X +Cov E( bu X ),E(e X ) ) 0, using (3.40) and E(e X ) X b X E( b u X ) 0 (see (3.3) and (3.6)). Note (3.40) also implies Cov(Ŷ,e X ) 0. Also, Y t Y (e+ŷ)t(e+ŷ) Ŷt Ŷ +e t e,
15 5 which implies E(Y t Y nȳ 2 ) E(Ŷt Ŷ nȳ 2 )+E(e t e). (3.4) That is, MSS T MSS R +MSS E, where MSS T and MSS R respectively denote the mean total sum of squares and mean regression sum of squares. The mean regression sum of squares MSS R is given by { } E(Ŷt Ŷ) E E( b t u X t X bu X ) { } E (E( b u X )) t X t X E( b u X )+tr(x t X )D( b u X ) { } E b t X t X b+σy x 2 tr(i p+) { } E b t X t X b+(p+)σy x 2. (3.42) Thus, we obtain E (Ŷt ) Ŷ p+ (p+) E(bt (X t X )b)+σ 2 y x. (3.43) Form (3.4), (3.36) and (3.42), we obtain the mean total sum of squares as leading to E(Y t Y) E(b t (X t X )b)+nσ 2 y x E( Yt Y n ) n E(bt (X t X )b)+σy x 2. Finally, we define the mean coefficient of determination as ( n R 2 m E(R0) 2 E (Ŷi Y) 2 n ), (Y i Y) 2 so that R 2 0 is an unbiased estimator of R2 m. Note R m denote the mean multiple correlation coefficient. Finally, consider the problem of prediction of E(Y 0 ) or that of Y 0, a future observation corresponding to X (p)0, the new observed vector of predictors can easily be considered. That is, Ŷ0 X (p)0 b(p) may be considered as a predictor of E(Y 0 ) or that of Y 0. In that case, we have Var(Ŷ0) X (p)0 D( b (p) )X t (p)0 σ2 y x (n ) X (p)0e(s xx)x t (p)0. Also, an estimated value of Var(Ŷ0) is Var(Ŷ0) σ2 y x (n ) X (p)0s xxx t (p)0, which could be used to obtain prediction intervals for Y 0 or that of E(Y 0 ).
16 6 4 Application to AR(p) Process In this section, we show that the unbiasedness approach works for AR(p) time series models also. Consider AR(p) process so that Y t φ 0 +φ Y t +...+φ p Y t p +ε t, (4.) where ε t s are i.i.d with E(ε t ) 0 and V(ε t ) σ 2. Note AR(p) process satisfies E(Y t Y t,...,y t p ) φ 0 +φ Y t +...+φ p Y t p, (4.2) which is of the form (2.). Let E(Y t ) η, and from model (4.2), Cov(Y t,y t ) γ() Cov(Y t,y t 2 ) γ(2) C yx,.. Cov(Y t,y t p ) γ(p) and C xx (Cov(Y t i,y t j )) (γ (j i) ), for i j p. Then from Theorem 2., we have γ(0)... γ(p ) φ γ() γ()... γ(p 2) φ 2 γ(2)... γ(p )... γ(0) γ(p) φ p (4.3) or equivalently dividing by γ(0) Var(Y t ), ρ()... ρ(p ) ρ()... ρ(p 2). ρ(p ) ρ(p 2)... and φ φ 2 φ 0 η (η,...,η) η(. φ p φ ρ() φ 2 ρ(2).. ρ(p) φ p (4.4) p φ j ). (4.5) j
17 7 Note also that (4.4) is nothing but Yule-Walker equations. Assume we have a sample (Y,Y 2,...,Y n ), n p+, that satisfies (4.). Then, Y p+ φ o +φ Y p +...+φ p Y +ε p+ Y p+2 φ o +φ p+ Y p φ p Y 2 +ε p+2. (4.6) Y n φ o +φ Y n +...+φ p Y n p +ε n. Alternatively, the equations in (4.2) can also be written as, for j n p, E(Y p+j Y p+j,...,yj ) φ o +φ Y p+j +...+φ p Y j. (4.7) Let now Y Y p+ Y p+2. Y n φ φ 2, φ. φ p, Y p Y p... Y Y p+ Y p... Y 2 ; X. Y n Y n 2... Y n p Let us introduce the following notations. Define for 0 i p, Y p i+ n p Y p i+k, (n p) k the mean of values in i-th column of X. Also, let for i,j p, S i,j n p (Y p i+k Y p i+ )(Y p j+k Y p j+ ) (n p ) be the sample covariance between the i-th and j-th columns of X. Similarly, for j p, let S p+,j k n p (Y p+l Y p+ )(Y p j+l Y p j+ ) (n p ) be the sample covariance between Y and j-th column of X. Then S p+, S, S,2... S,p S p+,2 S 2, S 2,2... S 2,p S y,x ; S. xx. S p+,p S p, S p,2... S p,p Using our formulas in Theorem 3. for estimation of φ, we get, l. S xx φ Syx. (4.8)
18 8 That is, ˆφ j, j p, satisfies S, S,2... S,p S 2, S 2,2... S 2,p. S p, S p,2... S p,p ˆφ ˆφ 2. ˆφ p S p+, S p+,2. S p+,p (4.9) Let now Y (p) (Y p,y p,...,y ), so that we obtain from (4.5), ˆφ ˆφ 0 Y p+ Y ˆφ 2 (p) Y. p+ ˆφ p, Then the fitted AR(p) model is p (Y j ˆφp+ j ). (4.0) Y t ˆφ 0 + ˆφ Y t + + ˆφ p Y t p, (4.) where ˆφ j,0 j p, satisfy (4.9) and (4.0). Remark 4. Using our method, we have given a rigorous proof for the unbiased estimators of the parameters of the AR(p) model. This is another advantage of our approach. j 4. Numerical examples Finally, we discuss in this section two numerical examples, one based on simulated data and the other based on real-life data, for estimating the parameters of the AR(3) model. We show our results yield the estimates that coincide with classical estimates. Example 4. We simulated n00 data from AR(3) model with the parameters (0.4, 0., 0.3), using the R package. Based on the simulated data, we fitted the AR(3) model and estimated regression coefficients, including the intercept term, using our results. The results are given below: Parameters Least-squares Yule-Walker Unbiased estimate estimate estimate φ φ φ φ
19 9 Example 4.2 The data in feet on Level of lake Huron data (Brockwell and Davis (2006), p. 555)) is fitted for AR(3) model. The results are given below. Parameters Least-squares Yule-Walker Unbiased estimate estimate estimate φ φ φ φ From both the examples above for the AR(3) model, we see that the unbiased estimates obtained using our results coincide with the least-squares and the Yule-Walker estimates, as expected. Acknowledgments. The author is thankful to Saumya Bansal, Neha Rani Gupta and Swati Verma for some useful discussions and the computational work. References Brockwell, P. J. and Davis, R. A. (2006). Time Series: Theory and Methods. Second Edition, Springer, New York. Chatterji, S. and Hadi, A. S. (2006). Regression Analysis by Examples. John Wiley & Sons, New Jersey. Draper, N and Smith, H. (2003). Applied Regression Analysis. Third Edition, Wiley, New York. Quenonuille, M. H. (950). An application of least squares to family diet surveys. Econometrica, 8, Yan, X. and Su, X. G. (2009). Linear Regression Analysis: Theory and Computing. World Scientific Publishing Co, Pte Ltd, Singapore.
STAT 100C: Linear models
STAT 100C: Linear models Arash A. Amini June 9, 2018 1 / 56 Table of Contents Multiple linear regression Linear model setup Estimation of β Geometric interpretation Estimation of σ 2 Hat matrix Gram matrix
More informationNext tool is Partial ACF; mathematical tools first. The Multivariate Normal Distribution. e z2 /2. f Z (z) = 1 2π. e z2 i /2
Next tool is Partial ACF; mathematical tools first. The Multivariate Normal Distribution Defn: Z R 1 N(0,1) iff f Z (z) = 1 2π e z2 /2 Defn: Z R p MV N p (0, I) if and only if Z = (Z 1,..., Z p ) (a column
More informationInverse of a Square Matrix. For an N N square matrix A, the inverse of A, 1
Inverse of a Square Matrix For an N N square matrix A, the inverse of A, 1 A, exists if and only if A is of full rank, i.e., if and only if no column of A is a linear combination 1 of the others. A is
More informationRegression and Statistical Inference
Regression and Statistical Inference Walid Mnif wmnif@uwo.ca Department of Applied Mathematics The University of Western Ontario, London, Canada 1 Elements of Probability 2 Elements of Probability CDF&PDF
More informationIntroduction to Estimation Methods for Time Series models. Lecture 1
Introduction to Estimation Methods for Time Series models Lecture 1 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 1 SNS Pisa 1 / 19 Estimation
More informationEconometrics I. Professor William Greene Stern School of Business Department of Economics 3-1/29. Part 3: Least Squares Algebra
Econometrics I Professor William Greene Stern School of Business Department of Economics 3-1/29 Econometrics I Part 3 Least Squares Algebra 3-2/29 Vocabulary Some terms to be used in the discussion. Population
More informationSTAT 135 Lab 13 (Review) Linear Regression, Multivariate Random Variables, Prediction, Logistic Regression and the δ-method.
STAT 135 Lab 13 (Review) Linear Regression, Multivariate Random Variables, Prediction, Logistic Regression and the δ-method. Rebecca Barter May 5, 2015 Linear Regression Review Linear Regression Review
More informationLECTURE 2 LINEAR REGRESSION MODEL AND OLS
SEPTEMBER 29, 2014 LECTURE 2 LINEAR REGRESSION MODEL AND OLS Definitions A common question in econometrics is to study the effect of one group of variables X i, usually called the regressors, on another
More informationSimple Linear Regression Analysis
LINEAR REGRESSION ANALYSIS MODULE II Lecture - 6 Simple Linear Regression Analysis Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Prediction of values of study
More informationPANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1
PANEL DATA RANDOM AND FIXED EFFECTS MODEL Professor Menelaos Karanasos December 2011 PANEL DATA Notation y it is the value of the dependent variable for cross-section unit i at time t where i = 1,...,
More informationLinear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,
Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,
More informationPeter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8
Contents 1 Linear model 1 2 GLS for multivariate regression 5 3 Covariance estimation for the GLM 8 4 Testing the GLH 11 A reference for some of this material can be found somewhere. 1 Linear model Recall
More informationECON 3150/4150, Spring term Lecture 7
ECON 3150/4150, Spring term 2014. Lecture 7 The multivariate regression model (I) Ragnar Nymoen University of Oslo 4 February 2014 1 / 23 References to Lecture 7 and 8 SW Ch. 6 BN Kap 7.1-7.8 2 / 23 Omitted
More informationMATH 38061/MATH48061/MATH68061: MULTIVARIATE STATISTICS Solutions to Problems on Random Vectors and Random Sampling. 1+ x2 +y 2 ) (n+2)/2
MATH 3806/MATH4806/MATH6806: MULTIVARIATE STATISTICS Solutions to Problems on Rom Vectors Rom Sampling Let X Y have the joint pdf: fx,y) + x +y ) n+)/ π n for < x < < y < this is particular case of the
More informationSTAT 100C: Linear models
STAT 100C: Linear models Arash A. Amini April 27, 2018 1 / 1 Table of Contents 2 / 1 Linear Algebra Review Read 3.1 and 3.2 from text. 1. Fundamental subspace (rank-nullity, etc.) Im(X ) = ker(x T ) R
More informationTAMS39 Lecture 2 Multivariate normal distribution
TAMS39 Lecture 2 Multivariate normal distribution Martin Singull Department of Mathematics Mathematical Statistics Linköping University, Sweden Content Lecture Random vectors Multivariate normal distribution
More informationBasic Distributional Assumptions of the Linear Model: 1. The errors are unbiased: E[ε] = The errors are uncorrelated with common variance:
8. PROPERTIES OF LEAST SQUARES ESTIMATES 1 Basic Distributional Assumptions of the Linear Model: 1. The errors are unbiased: E[ε] = 0. 2. The errors are uncorrelated with common variance: These assumptions
More informationLinear Model Under General Variance
Linear Model Under General Variance We have a sample of T random variables y 1, y 2,, y T, satisfying the linear model Y = X β + e, where Y = (y 1,, y T )' is a (T 1) vector of random variables, X = (T
More informationSimple Linear Regression
Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.
More informationAdvanced Econometrics I
Lecture Notes Autumn 2010 Dr. Getinet Haile, University of Mannheim 1. Introduction Introduction & CLRM, Autumn Term 2010 1 What is econometrics? Econometrics = economic statistics economic theory mathematics
More informationESTIMATION THEORY. Chapter Estimation of Random Variables
Chapter ESTIMATION THEORY. Estimation of Random Variables Suppose X,Y,Y 2,...,Y n are random variables defined on the same probability space (Ω, S,P). We consider Y,...,Y n to be the observed random variables
More informationStatement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.
MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss
More informationCh 2: Simple Linear Regression
Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component
More informationMA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7
MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 1 Random Vectors Let a 0 and y be n 1 vectors, and let A be an n n matrix. Here, a 0 and A are non-random, whereas y is
More informationCombining data from two independent surveys: model-assisted approach
Combining data from two independent surveys: model-assisted approach Jae Kwang Kim 1 Iowa State University January 20, 2012 1 Joint work with J.N.K. Rao, Carleton University Reference Kim, J.K. and Rao,
More informationidentity matrix, shortened I the jth column of I; the jth standard basis vector matrix A with its elements a ij
Notation R R n m R n m r R n s real numbers set of n m real matrices subset of R n m consisting of matrices with rank r subset of R n n consisting of symmetric matrices NND n subset of R n s consisting
More informationNotes on Random Vectors and Multivariate Normal
MATH 590 Spring 06 Notes on Random Vectors and Multivariate Normal Properties of Random Vectors If X,, X n are random variables, then X = X,, X n ) is a random vector, with the cumulative distribution
More informationMultivariate Regression Analysis
Matrices and vectors The model from the sample is: Y = Xβ +u with n individuals, l response variable, k regressors Y is a n 1 vector or a n l matrix with the notation Y T = (y 1,y 2,...,y n ) 1 x 11 x
More informationMA 575 Linear Models: Cedric E. Ginestet, Boston University Mixed Effects Estimation, Residuals Diagnostics Week 11, Lecture 1
MA 575 Linear Models: Cedric E Ginestet, Boston University Mixed Effects Estimation, Residuals Diagnostics Week 11, Lecture 1 1 Within-group Correlation Let us recall the simple two-level hierarchical
More informationTopic 6: Non-Spherical Disturbances
Topic 6: Non-Spherical Disturbances Our basic linear regression model is y = Xβ + ε ; ε ~ N[0, σ 2 I n ] Now we ll generalize the specification of the error term in the model: E[ε] = 0 ; E[εε ] = Σ = σ
More informationMultivariate Regression
Multivariate Regression The so-called supervised learning problem is the following: we want to approximate the random variable Y with an appropriate function of the random variables X 1,..., X p with the
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationLinear Regression. Junhui Qian. October 27, 2014
Linear Regression Junhui Qian October 27, 2014 Outline The Model Estimation Ordinary Least Square Method of Moments Maximum Likelihood Estimation Properties of OLS Estimator Unbiasedness Consistency Efficiency
More informationLecture 14 Simple Linear Regression
Lecture 4 Simple Linear Regression Ordinary Least Squares (OLS) Consider the following simple linear regression model where, for each unit i, Y i is the dependent variable (response). X i is the independent
More informationLinear Algebra Review
Linear Algebra Review Yang Feng http://www.stat.columbia.edu/~yangfeng Yang Feng (Columbia University) Linear Algebra Review 1 / 45 Definition of Matrix Rectangular array of elements arranged in rows and
More informationAdvanced Quantitative Methods: ordinary least squares
Advanced Quantitative Methods: Ordinary Least Squares University College Dublin 31 January 2012 1 2 3 4 5 Terminology y is the dependent variable referred to also (by Greene) as a regressand X are the
More informationTopics in Probability and Statistics
Topics in Probability and tatistics A Fundamental Construction uppose {, P } is a sample space (with probability P), and suppose X : R is a random variable. The distribution of X is the probability P X
More informationSTAT5044: Regression and Anova. Inyoung Kim
STAT5044: Regression and Anova Inyoung Kim 2 / 51 Outline 1 Matrix Expression 2 Linear and quadratic forms 3 Properties of quadratic form 4 Properties of estimates 5 Distributional properties 3 / 51 Matrix
More informationIntermediate Econometrics
Intermediate Econometrics Markus Haas LMU München Summer term 2011 15. Mai 2011 The Simple Linear Regression Model Considering variables x and y in a specific population (e.g., years of education and wage
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationNonresponse weighting adjustment using estimated response probability
Nonresponse weighting adjustment using estimated response probability Jae-kwang Kim Yonsei University, Seoul, Korea December 26, 2006 Introduction Nonresponse Unit nonresponse Item nonresponse Basic strategy
More informationFormulas for probability theory and linear models SF2941
Formulas for probability theory and linear models SF2941 These pages + Appendix 2 of Gut) are permitted as assistance at the exam. 11 maj 2008 Selected formulae of probability Bivariate probability Transforms
More informationVariable Selection for Highly Correlated Predictors
Variable Selection for Highly Correlated Predictors Fei Xue and Annie Qu arxiv:1709.04840v1 [stat.me] 14 Sep 2017 Abstract Penalty-based variable selection methods are powerful in selecting relevant covariates
More informationLecture 15. Hypothesis testing in the linear model
14. Lecture 15. Hypothesis testing in the linear model Lecture 15. Hypothesis testing in the linear model 1 (1 1) Preliminary lemma 15. Hypothesis testing in the linear model 15.1. Preliminary lemma Lemma
More informationLet X and Y denote two random variables. The joint distribution of these random
EE385 Class Notes 9/7/0 John Stensby Chapter 3: Multiple Random Variables Let X and Y denote two random variables. The joint distribution of these random variables is defined as F XY(x,y) = [X x,y y] P.
More informationTMA4255 Applied Statistics V2016 (5)
TMA4255 Applied Statistics V2016 (5) Part 2: Regression Simple linear regression [11.1-11.4] Sum of squares [11.5] Anna Marie Holand To be lectured: January 26, 2016 wiki.math.ntnu.no/tma4255/2016v/start
More information4. Distributions of Functions of Random Variables
4. Distributions of Functions of Random Variables Setup: Consider as given the joint distribution of X 1,..., X n (i.e. consider as given f X1,...,X n and F X1,...,X n ) Consider k functions g 1 : R n
More informationREGRESSION ANALYSIS AND INDICATOR VARIABLES
REGRESSION ANALYSIS AND INDICATOR VARIABLES Thesis Submitted in partial fulfillment of the requirements for the award of degree of Masters of Science in Mathematics and Computing Submitted by Sweety Arora
More informationEcon 620. Matrix Differentiation. Let a and x are (k 1) vectors and A is an (k k) matrix. ) x. (a x) = a. x = a (x Ax) =(A + A (x Ax) x x =(A + A )
Econ 60 Matrix Differentiation Let a and x are k vectors and A is an k k matrix. a x a x = a = a x Ax =A + A x Ax x =A + A x Ax = xx A We don t want to prove the claim rigorously. But a x = k a i x i i=
More informationWLS and BLUE (prelude to BLUP) Prediction
WLS and BLUE (prelude to BLUP) Prediction Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark April 21, 2018 Suppose that Y has mean X β and known covariance matrix V (but Y need
More informationAn Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data
An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)
More informationXβ is a linear combination of the columns of X: Copyright c 2010 Dan Nettleton (Iowa State University) Statistics / 25 X =
The Gauss-Markov Linear Model y Xβ + ɛ y is an n random vector of responses X is an n p matrix of constants with columns corresponding to explanatory variables X is sometimes referred to as the design
More informationEC3062 ECONOMETRICS. THE MULTIPLE REGRESSION MODEL Consider T realisations of the regression equation. (1) y = β 0 + β 1 x β k x k + ε,
THE MULTIPLE REGRESSION MODEL Consider T realisations of the regression equation (1) y = β 0 + β 1 x 1 + + β k x k + ε, which can be written in the following form: (2) y 1 y 2.. y T = 1 x 11... x 1k 1
More informationMAS113 Introduction to Probability and Statistics. Proofs of theorems
MAS113 Introduction to Probability and Statistics Proofs of theorems Theorem 1 De Morgan s Laws) See MAS110 Theorem 2 M1 By definition, B and A \ B are disjoint, and their union is A So, because m is a
More informationEXTENDING PARTIAL LEAST SQUARES REGRESSION
EXTENDING PARTIAL LEAST SQUARES REGRESSION ATHANASSIOS KONDYLIS UNIVERSITY OF NEUCHÂTEL 1 Outline Multivariate Calibration in Chemometrics PLS regression (PLSR) and the PLS1 algorithm PLS1 from a statistical
More informationANALYSIS OF VARIANCE AND QUADRATIC FORMS
4 ANALYSIS OF VARIANCE AND QUADRATIC FORMS The previous chapter developed the regression results involving linear functions of the dependent variable, β, Ŷ, and e. All were shown to be normally distributed
More informationSolutions for Econometrics I Homework No.1
Solutions for Econometrics I Homework No.1 due 2006-02-20 Feldkircher, Forstner, Ghoddusi, Grafenhofer, Pichler, Reiss, Yan, Zeugner Exercise 1.1 Structural form of the problem: 1. q d t = α 0 + α 1 p
More informationLinear models. Linear models are computationally convenient and remain widely used in. applied econometric research
Linear models Linear models are computationally convenient and remain widely used in applied econometric research Our main focus in these lectures will be on single equation linear models of the form y
More informationSummary: A Random Walks View of Spectral Segmentation, by Marina Meila (University of Washington) and Jianbo Shi (Carnegie Mellon University)
Summary: A Random Walks View of Spectral Segmentation, by Marina Meila (University of Washington) and Jianbo Shi (Carnegie Mellon University) The authors explain how the NCut algorithm for graph bisection
More informationVariance reduction. Michel Bierlaire. Transport and Mobility Laboratory. Variance reduction p. 1/18
Variance reduction p. 1/18 Variance reduction Michel Bierlaire michel.bierlaire@epfl.ch Transport and Mobility Laboratory Variance reduction p. 2/18 Example Use simulation to compute I = 1 0 e x dx We
More informationChapter 5 Matrix Approach to Simple Linear Regression
STAT 525 SPRING 2018 Chapter 5 Matrix Approach to Simple Linear Regression Professor Min Zhang Matrix Collection of elements arranged in rows and columns Elements will be numbers or symbols For example:
More information3 Multiple Linear Regression
3 Multiple Linear Regression 3.1 The Model Essentially, all models are wrong, but some are useful. Quote by George E.P. Box. Models are supposed to be exact descriptions of the population, but that is
More informationCh4. Distribution of Quadratic Forms in y
ST4233, Linear Models, Semester 1 2008-2009 Ch4. Distribution of Quadratic Forms in y 1 Definition Definition 1.1 If A is a symmetric matrix and y is a vector, the product y Ay = i a ii y 2 i + i j a ij
More informationInstrumental Variables and Two-Stage Least Squares
Instrumental Variables and Two-Stage Least Squares Generalised Least Squares Professor Menelaos Karanasos December 2011 Generalised Least Squares: Assume that the postulated model is y = Xb + e, (1) where
More informationBusiness Statistics. Tommaso Proietti. Linear Regression. DEF - Università di Roma 'Tor Vergata'
Business Statistics Tommaso Proietti DEF - Università di Roma 'Tor Vergata' Linear Regression Specication Let Y be a univariate quantitative response variable. We model Y as follows: Y = f(x) + ε where
More informationA Short Course in Basic Statistics
A Short Course in Basic Statistics Ian Schindler November 5, 2017 Creative commons license share and share alike BY: C 1 Descriptive Statistics 1.1 Presenting statistical data Definition 1 A statistical
More informationECE 636: Systems identification
ECE 636: Systems identification Lectures 3 4 Random variables/signals (continued) Random/stochastic vectors Random signals and linear systems Random signals in the frequency domain υ ε x S z + y Experimental
More informationConsequences of measurement error. Psychology 588: Covariance structure and factor models
Consequences of measurement error Psychology 588: Covariance structure and factor models Scaling indeterminacy of latent variables Scale of a latent variable is arbitrary and determined by a convention
More information11 Hypothesis Testing
28 11 Hypothesis Testing 111 Introduction Suppose we want to test the hypothesis: H : A q p β p 1 q 1 In terms of the rows of A this can be written as a 1 a q β, ie a i β for each row of A (here a i denotes
More informationIn the bivariate regression model, the original parameterization is. Y i = β 1 + β 2 X2 + β 2 X2. + β 2 (X 2i X 2 ) + ε i (2)
RNy, econ460 autumn 04 Lecture note Orthogonalization and re-parameterization 5..3 and 7.. in HN Orthogonalization of variables, for example X i and X means that variables that are correlated are made
More informationWISE International Masters
WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are
More informationMultivariate Random Variable
Multivariate Random Variable Author: Author: Andrés Hincapié and Linyi Cao This Version: August 7, 2016 Multivariate Random Variable 3 Now we consider models with more than one r.v. These are called multivariate
More informationReference: Davidson and MacKinnon Ch 2. In particular page
RNy, econ460 autumn 03 Lecture note Reference: Davidson and MacKinnon Ch. In particular page 57-8. Projection matrices The matrix M I X(X X) X () is often called the residual maker. That nickname is easy
More informationReview of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley
Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate
More informationThe Algebra of the Kronecker Product. Consider the matrix equation Y = AXB where
21 : CHAPTER Seemingly-Unrelated Regressions The Algebra of the Kronecker Product Consider the matrix equation Y = AXB where Y =[y kl ]; k =1,,r,l =1,,s, (1) X =[x ij ]; i =1,,m,j =1,,n, A=[a ki ]; k =1,,r,i=1,,m,
More informationMA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2
MA 575 Linear Models: Cedric E Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 1 Revision: Probability Theory 11 Random Variables A real-valued random variable is
More informationEcon 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines
Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines Maximilian Kasy Department of Economics, Harvard University 1 / 37 Agenda 6 equivalent representations of the
More informationHOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA?
HOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA? ERIK BIØRN Department of Economics University of Oslo P.O. Box 1095 Blindern 0317 Oslo Norway E-mail:
More informationOn Expected Gaussian Random Determinants
On Expected Gaussian Random Determinants Moo K. Chung 1 Department of Statistics University of Wisconsin-Madison 1210 West Dayton St. Madison, WI 53706 Abstract The expectation of random determinants whose
More informationECON 3150/4150, Spring term Lecture 6
ECON 3150/4150, Spring term 2013. Lecture 6 Review of theoretical statistics for econometric modelling (II) Ragnar Nymoen University of Oslo 31 January 2013 1 / 25 References to Lecture 3 and 6 Lecture
More informationChapter 3: Maximum Likelihood Theory
Chapter 3: Maximum Likelihood Theory Florian Pelgrin HEC September-December, 2010 Florian Pelgrin (HEC) Maximum Likelihood Theory September-December, 2010 1 / 40 1 Introduction Example 2 Maximum likelihood
More informationTesting a Normal Covariance Matrix for Small Samples with Monotone Missing Data
Applied Mathematical Sciences, Vol 3, 009, no 54, 695-70 Testing a Normal Covariance Matrix for Small Samples with Monotone Missing Data Evelina Veleva Rousse University A Kanchev Department of Numerical
More information5 Operations on Multiple Random Variables
EE360 Random Signal analysis Chapter 5: Operations on Multiple Random Variables 5 Operations on Multiple Random Variables Expected value of a function of r.v. s Two r.v. s: ḡ = E[g(X, Y )] = g(x, y)f X,Y
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationVariable Selection and Model Building
LINEAR REGRESSION ANALYSIS MODULE XIII Lecture - 37 Variable Selection and Model Building Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur The complete regression
More informationDiscrete time processes
Discrete time processes Predictions are difficult. Especially about the future Mark Twain. Florian Herzog 2013 Modeling observed data When we model observed (realized) data, we encounter usually the following
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationThe Linear Regression Model
The Linear Regression Model Carlo Favero Favero () The Linear Regression Model 1 / 67 OLS To illustrate how estimation can be performed to derive conditional expectations, consider the following general
More informationMultiple Regression Model: I
Multiple Regression Model: I Suppose the data are generated according to y i 1 x i1 2 x i2 K x ik u i i 1...n Define y 1 x 11 x 1K 1 u 1 y y n X x n1 x nk K u u n So y n, X nxk, K, u n Rks: In many applications,
More informationPrediction. Prediction MIT Dr. Kempthorne. Spring MIT Prediction
MIT 18.655 Dr. Kempthorne Spring 2016 1 Problems Targets of Change in value of portfolio over fixed holding period. Long-term interest rate in 3 months Survival time of patients being treated for cancer
More informationAsymptotic Statistics-III. Changliang Zou
Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (
More informationStatistics STAT:5100 (22S:193), Fall Sample Final Exam B
Statistics STAT:5 (22S:93), Fall 25 Sample Final Exam B Please write your answers in the exam books provided.. Let X, Y, and Y 2 be independent random variables with X N(µ X, σ 2 X ) and Y i N(µ Y, σ 2
More informationInstrumental Variables
Università di Pavia 2010 Instrumental Variables Eduardo Rossi Exogeneity Exogeneity Assumption: the explanatory variables which form the columns of X are exogenous. It implies that any randomness in the
More informationMEI Exam Review. June 7, 2002
MEI Exam Review June 7, 2002 1 Final Exam Revision Notes 1.1 Random Rules and Formulas Linear transformations of random variables. f y (Y ) = f x (X) dx. dg Inverse Proof. (AB)(AB) 1 = I. (B 1 A 1 )(AB)(AB)
More informationSimple Linear Regression for the MPG Data
Simple Linear Regression for the MPG Data 2000 2500 3000 3500 15 20 25 30 35 40 45 Wgt MPG What do we do with the data? y i = MPG of i th car x i = Weight of i th car i =1,...,n n = Sample Size Exploratory
More informationCovariance Stationary Time Series. Example: Independent White Noise (IWN(0,σ 2 )) Y t = ε t, ε t iid N(0,σ 2 )
Covariance Stationary Time Series Stochastic Process: sequence of rv s ordered by time {Y t } {...,Y 1,Y 0,Y 1,...} Defn: {Y t } is covariance stationary if E[Y t ]μ for all t cov(y t,y t j )E[(Y t μ)(y
More informationWe begin by thinking about population relationships.
Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where
More information. a m1 a mn. a 1 a 2 a = a n
Biostat 140655, 2008: Matrix Algebra Review 1 Definition: An m n matrix, A m n, is a rectangular array of real numbers with m rows and n columns Element in the i th row and the j th column is denoted by
More informationFor a stochastic process {Y t : t = 0, ±1, ±2, ±3, }, the mean function is defined by (2.2.1) ± 2..., γ t,
CHAPTER 2 FUNDAMENTAL CONCEPTS This chapter describes the fundamental concepts in the theory of time series models. In particular, we introduce the concepts of stochastic processes, mean and covariance
More informationStatistics 910, #5 1. Regression Methods
Statistics 910, #5 1 Overview Regression Methods 1. Idea: effects of dependence 2. Examples of estimation (in R) 3. Review of regression 4. Comparisons and relative efficiencies Idea Decomposition Well-known
More information