Estimation Methods for Multicollinearity Proplem Combined with High Leverage Data Points

Size: px
Start display at page:

Download "Estimation Methods for Multicollinearity Proplem Combined with High Leverage Data Points"

Transcription

1 Journal of Mathematcs and Statstcs 7 (): , 011 ISSN Scence Publcatons Estmaton Methods for Multcollnearty Proplem Combned wth Hgh Leverage Data Ponts Moawad El-Fallah and Abd El-Sallam Department of Statstcs and Mathematcs and Insurance, Faculty of Commerce, Zagazg Unversty, Egypt Abstract: Problem statement: Least Squares (LS) method has been the most popular method for estmatng the parameters of a model due to ts optmal propertes and ease of computaton. LS estmated regresson may be serously affected by multcollnearty whch s a near lnear dependency between two or more explanatory varables n the regresson models. Although LS estmates are unbased n the presence of multcollnearty, they wll be mprecse wth nflated standard errors of the estmated regresson coeffcents. Approach: In ths study, we wll study some alternatve regresson methods for estmatng the regresson parameters n the presence of multple hgh leverage ponts whch cause multcollnearty problem. These methods are manly depend on a one step reweghted least square, where the ntal weght functons were determned by the Dagnostc-Robust Generalzed Potentals (DRGP). The proposed alternatve methods n ths study are called GM-DRGP-L 1, GM- DRGP-LTS, M-DRGP, MM-DRGP and DRGP-MM. Results: The emprcal results of ths study ndcated that, the DRGP-MM and the GM-DRGP-LTS offers a substantal mprovement over other methods for correctng the problems of hgh leverage ponts enhancng multcollnearty. Concluson: The study had establshed that the DRGP-MM and the GM-DRGP-LTS methods were recommended to solve the multcollnearty problem wth hgh leverage data ponts. Key words: Multcollnearty problem, multple hgh, leverage ponts, alternatve robust estmatons robust generalzed, explanatory varables, dagnostc tool, regresson analyss., hgh leverage, substantal mprovement, leverage ponts, bounded nfluence INTRODUCTION Least squares estmaton s one of the most mportant regresson technques used for estmatng the parameters of a model. Two of the assumptons that make least squares so attractve n terms of general model hypothess and parameter sgnfcance testng, are normalty of error dstrbuton and ndependency of explanatory varables. The normalty assumpton can be volated n the presence of one or more suffcently outlyng observatons n the data set resultng n less relable estmates of the model parameters. The second s multcollnearty, whch s a near-lnear dependency among the explanatory varables (X-drecton). Multcollnearty can cause large varablty n the estmaton of parameters. Sometmes t causes the parameters estmaton to be dfferent from the true values by orders of magntude or ncorrect sgn. It may also nflate the varance of the estmatons. Hgh leverage ponts, the ponts far from the rest of the data n the X-drecton, have hgh potental for nfluencng most of the regresson results such as egenstructure and condton ndex of X. Had (199) noted that collnearty-nfluental ponts are usually the ponts wth hgh-leverage whch tends to pull the model ft to ther drecton and ntroduced these ponts as a new source of multcollnearty problems. Thus, dagnosng the multple hgh leverage ponts and recognzng estmatons, methods whch are resstant to these ponts may mprove regresson estmatons. In ths respect, alternatve robust regresson methods are desgned to be less senstve than least squares to outlers mostly n Y- drecton, resultng n mproved fts to the non-outlyng observatons. In order to acheve ths stablty, alternatve robust regresson methods lmt the nfluence of outlers. Three most mportant propertes of any alternatve robust regresson method are effcency, breakdown pont and bounded nfluence (Andersen, 008). The man objectve of ths study s to propose some alternatve estmators that are able to perform well where multple hgh leverage ponts cause multcollnearty problem n regresson analyss. Nonetheless, the development of such estmators has not been publshed extensvely n the lterature. To Correspondng Author: Moawad El-Fallah, Department of Statstcs and Mathematcs and Insurance, Faculty of Commerce, Zagazg Unversty, Egypt 19

2 acheve ths objectve, dfferent types of related alternatve robust methods have been nvestgated and ther propertes are compared. Among the dfferent types of robust technques, we wll consder the bounded nfluence or Generalzed M-estmators (Marronna and Yoha, 000; Ghaz et al., 010; Ramz. and Vvane, 010) whch attempt to assgn less weght to the hgh nfluence observatons and large resdual ponts. To enhance the GM-estmators, these estmators may be defned as mult-stage estmators where n dfferent stages, dfferent alternatve robust propertes of each technque are appled to combne the desrable propertes of each technque (Smpson et al., 199). MATERIALS AND METHODS Robust regresson methods: Let us consder the followng lnear regresson model as Eq. 1: Y = Xβ+ (1) Where: Y = The n 1 vector of response X = The n P (P = k+1) matrx = The n 1 vector whch has standardzed normal dstrbuton When the Least Squares (LS) method s employed, estmaton of the regresson parameters can be obtaned from Eq. : estmators, where the LMS and the LTS has low and medum effcency value, respectvely (Smpson et al. (199). Huber (1973) proposed a robust M-estmators where ˆβ are obtaned by solvng Eq. 3: y x βˆ n ψ = 1 s (3) It s mportant to pont out that, there are two types of ψ-functons, that s the monotonc ψ-functons (e.g. Huber s ψ-functons) and the redescendng ψ - functons (e.g., bweght ψ functon s, Beaton and Tukey (1974). M-estmators are the smplest hgh effcency robust procedures, both computatonally and theoretcally havng desrable asymptotc propertes. However, the M-estmator s not robust n the X- drecton and has a low breakdown pont, that s equal to (1/n) (Smpson et al., 199; Marronna et al., 006) ntroduced a class of methods whch s called the Generalzed M-estmators (GM-estmators) wth a major am of downweghtng those hgh leverage ponts whch have large resduals. Marronna et al. (006) also, reported that these estmators have hgh effcency and bounded nfluence propertes whch acheve a moderate breakdown pont equal to (1/P). The GM-estmator s the soluton of the normal equaton (4): y x βˆ n ϕ ψ x1 = 0 = 1 sϕ= (4) ˆ 1 (x / x) x / Y β= () Robust regresson procedures are manly am to provde resstant (stable) results n the presence of outlers. The Least Absolute Values (LAV) s one of the frst robust methods that was ntroduced by Armstrong and Kung (1987) wth a hgher effcency than LS by mnmzng the sum of the absolute resduals. The use of ths crteron, rather than ordnary least squares, provdes robustness aganst outlers and s partcularly useful when the dsturbances are generated by fattaled dstrbutons. Rousseuw (1984). Rousseuw et al. (003) ntroduced two robust methods namely the Least Medan of Squares (LMS) and the Least Trmmed Squares (LTS). The LMS attempts to mnmze the medan of e whle the LTS mnmze h e() where e (1) = 1 where, φ are defned to down weght hgh leverage ponts, wth hgh resduals and s s a robust scale estmate. Iteratvely Reweghted Least Squares (IRLS) may be used to solve (4). At convergence, the GMestmator may be wrtten as Eq. 5: β ˆ = ( XWX ) 1 XWY (5) e (),, e (n) are the ordered squared resduals, =1,,n and h s the number of resduals ncluded n the calculaton. Both estmators have hgh breakdown, that 50%. However, they are unbounded nfluence been publshed extensvely n the lterature. Snce hgh 130 GM where, n ths case, the dagonal elements of W are the weghts w defned as Eq. 6: y x ˆ β GM ψ ϕs W = (y x ˆ ) ϕ s GM (6) The man objectve of ths study s to study some alternatve estmators that are able to perform well where multple hgh leverage ponts are the cause of the multcollnearty problem n regresson analyss. In partcular, the development of such estmators has not

3 leverage ponts may be collnearty-enhancng observatons, we attempt to reduce ts nfluence by employng robust estmator whch s known to be resstant to hgh leverage ponts. In ths connecton, we wll consder the bounded nfluence or Generalzed M- estmators wth a major am of down weghtng the hgh leverage ponts whch have large resduals. Hence, n ths study, we propose manly alternatve mult-stage GM-estmators and weghted MM-estmators to remedy the problem of collnearty-enhancng observatons on the parameter estmates of the multple lnear regresson model. Unfortunately, the MM-estmators are also senstve to outlers n X-varables. As a soluton to ths drawback of MM-estmators, an alternatve robust method s developed n secton four. To confrm the advantage of our alternatve proposed methods, these methods compared wth reweghted least square based on LMS (RLS-LMS) defned by Rousseuw and Leroy (003). They computed scale estmator as: 0 5 s = * 1 med(r ) n p 1 where, r s the resdual of LS. The followng hard r rejecton functon for standardzed resduals s S utlzed to compute the followng ntal weghts Eq. 7: w r 1 f.5 0 = S 0 otherwse (7) The robust verson of (8) s Eq. 9: x medan(x) T = (9) Max(x) where, Mad (x) s the normalzed medan absolute devaton about the Medan (x) (Mad= 1.486(medan r -medan (r ) ). When the dstrbuton of the data s normal, T and T are approxmately equal. Any observaton whch has absolute value of T or T greater than 3, s consdered as outler (Marronna et al., 006). Ths method can be used n unvarate regresson models as a dagnostcs rule to detect hgh leverage ponts. Snce n most of the regresson analyss, more than one collnear explanatory varable exsts n the model, nvestgatng some useful methods n these cases seems to be necessary. One of the handest methods can be defned as hat matrx. Hat matrx whch s tradtonally used as a measure of leverage ponts n regresson analyss s defned as W=X (X T X) -1 X T. The most wdely used cutoff ponts of hat matrx s twcethe- mean-rule (k/n) by Hoagln and Welsch (1978). Had (1988) ponted out that the hat matrx may fal to dentfy the hgh leverage ponts due to the effect of hgh leverage ponts n leverage structure. He ntroduced another dagnostc tool as follows Eq. 10: p w = (10) 1 w T T where, w = ( ) x x x x s the dagonal element of W The fnal weghts for RLS-LMS are dentfed by the usage of a hard rejecton for standardzng the LMS w r resduals by new scale as scale = the n w 1 p 1 = weakness of ths method s usng LMS whch s a low effcency estmator. Dagnostc robust generalzed potental statstcs: A tradtonal measure of the outlyngness of an observaton x wth respect to the sample s three-sgma edt rule whch s defned as follows Eq. 8: x x T = (8) S Where: x = The mean s = The standard devaton of collnear explanatory varables 131 and the -th, dagonal potental p can be defned as p = T T 1 x ( x () x ()) x, where X () s the data matrx X wthout the -th row. He proposed a cut off pont for potental values p as Medan (p ) + c Mad (p ) (MADcutoff pont) and c can be taken as constant values of or 3. Ths method also s unable to detect all of the hgh leverage ponts. So, Had (1988) ntroduced another dagnostc tool as generalzed potentals for the whole data set whch s defned as Eq. 11: w p = 1 w for R = w for D ( D) ( D) ( ) (11) Where: D = The deleted set whch corresponds to the suspected outlers R = The remanng set from observatons after deletng d < (n-k) and t contans (n-d) cases

4 Snce there sn t any fnte upper bound for p * s and the theoretcal dstrbuton of them are not easy to derved, he ntroduced a MAD-cutoff pont for the generalzed potental as well. Recently, Habshah et al. (009) developed Dagnostc Robust Generalzed Potental (DRGP) to determne outlyng ponts n multvarate data set by utlzng the Robust Mahalanobs Dstance (RMD) based on Mnmum Volume Estmator (MVE) (RMD-MVE) (defned by Rousseuw (1985) as Eq. 1: 1 RMD = (x T R(x)) C R(x) ( T R(x)) for = 1,...,n (1) where, T R ( ) and C R ( ) are robust locaton and shape estmate such as MCD or MVE. The RMD- MVE has been used to detect the suspected group (D group) n generalzed potental method n (11). The mert of ths method s swampng less good leverage as hgh leverage ponts comparng wth the RMD -MVE. In the next secton we propose robust methods based on DRGP. Alternatve proposed robust methods: In ths secton, some proposed form of φ-weghts n (4) are generated and dscussed. It s mportant to pont out that n the proposed methods, P s the DRGP statstcs wth MADcutoff of ths statstc. In these methods, we wll employ the Tukey s bweght redescendng φ-functon whch s defned as Eq. 13: the calculaton of the φ. The frst two proposed estmators are mult-stage GM-estmators, whle the others are defned based on the M-estmator and MMestmators. The proposed methods wll be computed n three steps and summarzed as follow. GM-DRGP-L 1 : Step 1: Employ L 1 as ntal estmate and then obtan the standardzed resduals of L 1 estmator. Compute MAD = (med r -med (r ). accordng to Marronna and Yoha (000). It s mportant to menton that f MAD s computed from all the resduals of L 1 estmators, the scale estmates wll become too small due to defnng some zero resdual. Thus, non-null resduals have been used to compute the scale estmate. Step : Defnng ( ( )) (( )) ϕ = mn 1, MAD cutoff p / p n (4) and usng functon (13) to assgn fnal weghts to the observatons. Step 3: Compute a one step reweghted least squares as a convergence approach. GM-DRGP-LTS: Step 1: Consder the LTS as ntal estmate and compute the standardzed resduals and scale estmate based on LTS. f t t ψ (t) = t 1 c c f t > c (13) Step : Defne ϕ = mn 1, ( MAD cutoff ( p )) / (( p )) n (4) and usng functon (13) to assgn fnal weghts to the observatons. The Tukey s bweght wth the tunng constant c = wll result a 95% effcency under normal error dstrbuton. Assgnng lower weghts (even zero f the Step3: Compute a one step reweghted least squares as resdual s too large) to large outlers, a redescendng φ- convergence approach. functon s better compared to monotonc functons such as Huber s functon. In ths respect, a M-DRGP: redescendng φ-functons lmts the nfluence of outlers Step 1: Compute the resduals of M-estmates of scale more effectvely than a monotone φ-functon. Gven the by assgnng the ntal weght of W, (DRGP (MVE) fact that teratve technques are computatonally expensve and there s no guaranty to result n better mn 1, ( MAD cutoff ( p )) / (( p )) where P s DRGP estmatons (Smpson, 1995) a one step reweghted LS s (MVE) statstcs. used for most of the proposed methods except DRGP- MM estmator. Smpson et al. (199) enumerated that Step : Defne new weghts as w = r (Mestmator)/scale (M-estmator) and usng a Tukey s the GM and MM-estmators surpass other robust method. In ths connecton, most of the alternatve bweght to assgn fnal weght to the observatons. proposed methods are smlar to that of GM and MM estmators wth alght modfcaton n whch the DRGP proposed by Habshah et al. (009) s ncorporated n Step 3: Compute a one step reweghted least squares. 13

5 MM-DRGP: Ths method s smlar to that of M- DRGP, where on the second and thrd steps the M s replaced wth the MM-estmators. DRGP-MM: Step 1: Compute the ntal weght W, (DRGP(MVE)) whch s defned n the frst step of M-DRGP and usng functon (13) to assgn fnal weghts to the observatons. Step : Compute the weghted MM-estmators by these fnal weghts. weghted multcollnearty dagnostcs. These dagnostcs can be defned as a measure to evaluate whch method s more robust aganst the hgh leverage ponts that are responsble for the multcollnearty. It s mportant to pont out that all hgh leverage ponts are not collnearty-nfluental and vce versa (Had, 199) The weghted correlaton matrx can be computed through the correlaton matrx of X w. The weghted VIF s defnd as follows Eq. 15: w = ( ( )) 1 (15) VIF () 1 R w Weghted multcollnearty dagnostcs: Weghted multcollnearty dagnostcs are defned as practcal tools to nvestgate the source of multcollnearty whch may be the hgh leverage ponts n the data set. Indeed, robust estmators to deal wth multcolnearty problems are largely gnored ssues. Walker (1985) noted that sometmes the weghtng process n robust methods can mprove the multcollnearty of X matrx. An effectve measure of robust methods whch reduce multcollnearty problems due to the presence of multple hgh leverage ponts can be defned as weghted multcollnearty dagnostcs. The two most classcal and practcal multcollnearty dagnostcs are Correlaton X matrx and Varance Inflaton Factors (VIF). In bvarate regresson analyss, when correlaton coeffcent exceeds 0.9, multcollnearty can be detected. However, n the case of more than two explanatory varables model, multcollnearty may occur n less than 0.9 correlaton coeffcents (Rosen, 1999). Snce, ths multcollnearty dagnostcs s smple and easy to compute, t s more preferred (Belsley, 1991, Belsley et al., 1990). Another practcal approach to detect multcollnearty s by usng varance nflaton factors (VIF). VIF s defned as V1F () ( 1 R ( )) 1 w where R s the coeffcent determnaton of regressng each x on the other explanatory varables, whch produced a valuable ndces to detect nflated varances of regresson parameter estmatons (Marquardt, 1970). A cutoff pont of (11) s recommended as a rule of thumb for VIF to detect severe multcollnearty. The weghted lnear regresson can be expressed as a transformed model Eq. 14: Y w = X wβ + Є w (14) where, Y w = W 1/ Y, X w = W 1/ X and Є w = W 1/ Є (Neter et al. (004). The fnal weght of the proposed estmator, whch s expected to be robust aganst hgh leverage ponts, can be used n the computaton of 133 where, R (W) s the coeffcent of determnaton of regressng each X w on the other weghted explanatory varables. It s worth to menton that f the hgh leverage ponts are the source of multcollnearty n the data set, the weghted multcollnearty dagnostcs wll not detect multcollnearty due to these ponts otherwse multcollearty wll be detected easly. RESULTS Numercal example: In ths secton we consder a real data set to evaluate the performance of our proposed robust methods. Chld mortalty data set: Gujarat (00) ntroduced ths data set wth 64 observatons whch ncludes chld mortalty as dependent varable and Gross Natonal Producton (GNP) per capta and Female Lteracy Rate (FLR) as ndependent varables. Table 1 presents the classcal multcollnearty dagnostcs methods such as the correlaton matrx and VIF. The classcal dagnostcs measures of the orgnal data clearly ndcates that the data set doesn t have collnear explanatory varables. The T and F-tests confrm that there exsts relatonshp between the explanatory and response varable. Ths data set has two hgh leverage ponts based on the hat matrx by twcemean-rule cutoff pont, whle DRGP (MVE) can detect 11 observatons as hgh leverage ponts. Table 1: Multco llnearty dagnostcs and least square coeffcents of chld mortalty data set. Cor VIF b 1 b F p- S (e) (x 1,x ) (t p- (t p- value value) value) Orgnal data (0.007) (0.000) Modfed data (0.51) (0.938)

6 Table : Hgh leverage dagnostcs for chld mortalty data set Orgnal data Modfed data ndex Hat(x) (0.09) DRGP(x)(0.11) T 1 (3) T (3) ndex modfed X 1 Hat(x) (0.09) DRGP(x)( , Table 3: Multcollnearty dagnostcs and least square coeffcents of dfferent methods for modfed chld mortalty data set Method Cor (x 1,x ) VIF b 1 (t p-value) b (t p-value) F p-value s (e) Ls (0.51) (0.938) GM-DRGP-L (0.0) (0.34) GM-DRGP- LTS (0.00) (0.00) DRGP- MM (0.00) (0.00) - - MM-DRGP (0.0) (0.0) (0.33) M-DRGP (0.0) (0.33) RLS-LMS (0.00) (0.00) The hgh leverage ponts aren t collnearty-enhancng observatons evdent by the small value of correlaton matrx and VIF (Table 1). It s mportant to note here that the hgh leverage ponts wll be the prme source of multcollnearty when they are n the same observatons wth at least two explanatory varables. The robust three-sgma edt rule (Eq. 9) s shown n Table. The results of Table sgnfy that all the T exceeds the cutoff pont of 3 whch can be consdered as hgh leverage ponts, except observatons 1,5,38,and 54. In order to obtan a large magntude of hgh leverage ponts n x 1 as n x, a modfcaton n x n the ponts 4, 7, 30, 33, 53, 58-6 has been consdered based on the followng formula: ( ) ( ) Modfed x = T mad (x + medan (x ) The modfed x 1 s also dsplayed n Table. It s nterestng to pont out that after the modfcaton, the hat matrx can't detect all of these modfed observatons as hgh leverage ponts, whle the DRGP (MVE) statstcs dentfed them as hgh leverage defned estmators. 134 ponts. The results of Table 1 suggests that there s a strong multcollnearty n the modfed data set. Moreover, the non-sgnfcant of the t-statstcs and the sgnfcant of the F-statstcs of two coeffcent estmatons confrmed the presence of multcollnearty n the modfed data. The presence of multcollnearty has produced larger standard devaton of the errors for the modfed data. Table 3 presents the multcollnearty dagnostcs and least squares coeffcents of the modfed chld mortalty data set for proposed robust methods and the exstng robust methods whch were ntroduced prevously. The results of Table 3 pont out that the F-statstcs can t be obtaned for DRGP- MM estmator because t s not a one step reweghted estmator. It can be shown also from Table 3 that, among the proposed robust methods, only three estmators, that s the DRGP-MM, GM-DRGP-LTS and RLS-LMS can solve the multcollnearty problems. It s nterestng to note that the DRGP- MM has the least standard devaton error, followed by the GM-DRGP- LTS and RLS-LMS.Thus the new proposed estmators namely the DRGP-MM and the GM-DRGP-LTS, outperforms all other

7 DISCUSSION CONCLUSION Let us frst focus our attenton to the result of modfed chld mortalty data set whch s dsplayed n Table 1. The classcal dagnostcs measures of the orgnal data clearly ndcate that the data set does not have collnear explanatory varables. The T and F- tests confrm that there exsts relatonshp between the explanatory and response varable. Ths data set has two multple hgh leverage ponts based on the hat matrx by twce the mean-rule cutoff pont, whle DRGP (MVE) can detect 11 observatons as multple hgh leverage ponts. The hgh leverage ponts are not collnearty-enhansng observatons evdent by the small value of correlaton matrx and VIF (Table 1). The results of Table sgnfy that all the T of these multple hgh leverage ponts for the orgnal data exceeds the cutoff pont of 3 whch can be consdered as hgh leverage ponts n x, except for observaton 1,5,38and 54. It s nterestng to pont out that after the modfcaton (values for varable x 1 are modfed to become hgh leverage collnearty-enhancng observatons ), the hat matrx can not detect all of these modfed observatons as multple hgh leverage ponts, whle the DRGP (MVE) statstcs dentfed them as hgh leverage ponts. The result of Table 1 suggests that there s a strong multcollnearty n the modfed data set. Moreover, the non-sgnfcant of the t- statstcs and the sgnfcant of the F-statstcs of the two coeffcent estmatons confrmed the presence of multcollnearty n the modfed data. The presence of multcollnearty has produced larger standard devaton of the errors for the modfed data as well. It s mportant to pont out that the F-statstcs for the DRGP-MM estmator as shown n Table 3 can not be obtaned because t s not a one step reweghted estmator. It can be observed from Table 3 that, among the proposed robust methods, only three estmators, that s the DRGP-MM, GM-DRGP-LTS and RLT-LMS can solve the multcollnearty problems. Ths result also suggests that the other methods can hardly rectfy the multcollnearty problem evdent by the larger p values and hgher VIF values. It s nterestng to note that the DRGP-MM has the least standard devaton error, followed by the GM-DRGP-LTS and RLS-LMS. We have not pursued the analyss of ths example to the fnal concluson, but a reasonable nterpretaton up to ths stage s that the proposed Mult-stage GMncorporated the DRGP are able to solve the problem of multcollnearty whch s caused by hgh leverage ponts. 135 Outlers n the X-drecton whch are refer as multple hgh leverage ponts can render least squares estmaton meanngless and cause multcollnearty problems. Many robust methods have been developed to reduce the effect of outlers n the X-drecton. Nonetheless, the development of robust methods that deal wth the multcollnearty problems whch are manly due to multple hgh leverage ponts has not been publshed extensvely n the lterature. The man focus of ths study s to develop a relable method for correctng the problem of hgh leverage ponts enhancng multcollnearty. In ths study, we ncorporate the DRGP (MVE), one of the latest multple hgh leverage dagnostcs method wth dfferent types of robust estmators. The emprcal study ndcates that the DRGP-MM emerge to be more effcent and more relable than other methods, followed by the GM-DRGP-LTS as they are able to reduce the most effect of multcollnearty. The results seem to suggest that the DRGP-MM offers a substantal mprovement over other methods for correctng the problems of hgh leverage ponts enhancng multcollnearty. REFERENCES Andersen, R., 008. Modern Methods for Robust Regresson. 1st Edn., Stage Publcaton, The Unted States of Amerca, ISBN: , pp: 107. Armstrong, R.D. and M.T. Kung, Least absolute values estmates for a smple lnear regresson problem. J. R. Sc. Soc., 7: Beaton, A.E. and J.W. Tukey, The fttng of power seres, meanng polynomals, llustrated on band-spectroscopc data. Technometrcs, 16: Belsley, D.A., 1991.Condtonng Dagnostcs: Collnearty and Weak DAT n Regresson. 1st Edn., Wley, New York, ISBN: 10: , pp: 396. Belsley, D.A., E. Kuh and R.E. Welsch, Regresson Dagnostcs: Identng Influental Data and Sources of Collnearty. 1st Edn., Wley. New York, ISBN: , pp: 9. Ghaz, F.M., A. Majed, Alsharayr and M.D. Mwafaq, 010. Impact of frm s characterstcs on determnng the fnancal structure on the nsurance sector frms n jordan. J. Soc. Sc., 6: DOI: /jssp

8 Gujarat, D.N., 00. Basc Econometrcs. 4th Edn., Macgraw-Hll, New York, ISBN: 10: , pp: 100. Habshah, M., M.R. Norazan and A.H.M.R. Imon, 009. The performance of dagnostc-robust generalzed potentals for the dentfcaton of multple hgh leverage ponts n lnear regresson. J. Appled Stat., 36: DOI: / Had, A.S., Dagnosng collnearty-nfluental observatons. Comput. Stats. Data Anal., 7: DOI: / (88) Had, A.S.1, 199. A new measure of overall potental nfluence n lnear regresson. Comput. Stats. Data Analyss, 14: 1-7. DOI: / (9)90078-T Hoagln, D.C. and R.E. Welsch, The hat matrx n regresson and ANOVA. Am. Stat. Assoc., 3: 17-. Huber, P.J., Robust Regresson: Asymptotc, conjectures and monte carlo. Annals o Stats., 1: Marquardt, D.W., Generalzed nverses rdge regresson, based lnear estmaton and nonlnear estmaton. Technmetrcs, 1: Marronna, R.A. and V.J. Yoha, 000. Robust regresson wth both contnuous and categorcal predctors. J. Stat. Plan. Inferen., 89: DOI: /S (99) Marronna, R.A., R.D. Martn and V.J. Yoha, 006. Robust Statstcs: Theory and Methods. 1st Edn., John Wlly, New York, ISBN: 10: , pp: 403. Neter, J., M.H. Kutner, W. Wasserman and C.J. Nachtshem, 004. Appled Lnear Regresson Models. 3rd Edn., Macgraw-Hll, New York, ISBN: 10: X, pp: 70. Ramz, N.N. and N. Vvane, 010. Usng regresson to establsh weghts for a set of composte equatons through a numercal analyss approach: A case of admsson crtera to a college. J. Math. Stat., 6: DOI: /jmssp Rosen, D.H., The dagnoss of collnearty: A Monte Carlo smulaton study. Ph.D. Dssertaton, Department of Epdemology, School of Emory Unversty, pp: d = pqdweb? Dd = and sd = and Fmt = and clentld = 3665 and RQT=309 and VName = PQD Rousseuw, P.J. and A.M. Leroy, 003. Robust Regresson and Outler Detecton. 1st Edn., John Wlly, New York, ISBN: , pp: 39. Rousseuw, P.J., Least medan of squares regresson. J. Am. Stat. Assoc., 79: Rousseuw, P.J., Multvarate estmaton wth hgh breakdown pont. Math. Stat. Appled, B: Smpson, J.R., New methods and comparatve evaluatons for robust and based-robust regresson estmaton. Ph. Dssertaton, Arzona State Unversty. Smpson, D.G., D. Ruppert and R.J. Carroll, 199. On One-step GM estmates and stablty of nfluences n lnear regresson. J. Am. Stat. Assoc., 87: Walker, E., Influence, collnearty and robust estmaton n regresson. Ph.D. Dssertaton, Vrgnna Polytechnc Insttute and State Unversty, Blacksburg, Vergna. &sd=1&fmt=&clentid=3665&rot=309&vn ame=pqd 136

Robust Estimations as a Remedy for Multicollinearity Caused by Multiple High Leverage Points

Robust Estimations as a Remedy for Multicollinearity Caused by Multiple High Leverage Points Journal of Mathematcs and Statstcs 5 (4): 311-31, 009 ISSN 1549-3644 009 Scence Publcatons Robust Estmatons as a Remedy for Multcollnearty Caused by Multple Hgh Leverage Ponts Arezoo Bagher and Habshah

More information

Research Article On the Performance of the Measure for Diagnosing Multiple High Leverage Collinearity-Reducing Observations

Research Article On the Performance of the Measure for Diagnosing Multiple High Leverage Collinearity-Reducing Observations Hndaw Publshng Corporaton Mathematcal Problems n Engneerng Volume 212, Artcle ID 53167, 16 pages do:1.1155/212/53167 Research Artcle On the Performance of the Measure for Dagnosng Multple Hgh Leverage

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

Journal of Modern Applied Statistical Methods

Journal of Modern Applied Statistical Methods Journal of Modern Appled Statstcal Methods Volume 17 Issue 1 Artcle 17 6-29-2018 Robust Heteroscedastcty Consstent Covarance Matrx Estmator based on Robust Mahalanobs Dstance and Dagnostc Robust Generalzed

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Some Robust Ridge Regression for handling Multicollinearity and Outlier

Some Robust Ridge Regression for handling Multicollinearity and Outlier Internatonal Journal of Scences: Basc and Appled Research (IJSBAR) ISSN 307-4531 (Prnt & Onlne) http://gssrr.org/ndex.php?journal=journalofbascandappled ----------------------------------------------------------------------------------------------------------------

More information

Robust Logistic Ridge Regression Estimator in the Presence of High Leverage Multicollinear Observations

Robust Logistic Ridge Regression Estimator in the Presence of High Leverage Multicollinear Observations Mathematcal and Computatonal Methods n Scence and Engneerng Robust Logstc Rdge Regresson Estmator n the Presence of Hgh Leverage Multcollnear Observatons SYAIBA BALQISH ARIFFIN 1 AND HABSHAH MIDI 1, Faculty

More information

Chapter 4 High Breakdown Regression Procedures

Chapter 4 High Breakdown Regression Procedures Chapter 4 Hgh Breakdown Regresson Procedures Introducton Whle M and BI estmators provde an mprovement over OLS f the data has an outler or hgh nfluence pont, they cannot provde protecton aganst data wth

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10) I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

REGRESSION ANALYSIS II- MULTICOLLINEARITY

REGRESSION ANALYSIS II- MULTICOLLINEARITY REGRESSION ANALYSIS II- MULTICOLLINEARITY QUESTION 1 Departments of Open Unversty of Cyprus A and B consst of na = 35 and nb = 30 students respectvely. The students of department A acheved an average test

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Chap 10: Diagnostics, p384

Chap 10: Diagnostics, p384 Chap 10: Dagnostcs, p384 Multcollnearty 10.5 p406 Defnton Multcollnearty exsts when two or more ndependent varables used n regresson are moderately or hghly correlated. - when multcollnearty exsts, regresson

More information

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Chapter 3. Two-Variable Regression Model: The Problem of Estimation Chapter 3. Two-Varable Regresson Model: The Problem of Estmaton Ordnary Least Squares Method (OLS) Recall that, PRF: Y = β 1 + β X + u Thus, snce PRF s not drectly observable, t s estmated by SRF; that

More information

a. (All your answers should be in the letter!

a. (All your answers should be in the letter! Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Outlier Detection in Logistic Regression: A Quest for Reliable Knowledge from Predictive Modeling and Classification

Outlier Detection in Logistic Regression: A Quest for Reliable Knowledge from Predictive Modeling and Classification Outler Detecton n Logstc egresson: A Quest for elable Knowledge from Predctve Modelng and Classfcaton Abdul Nurunnab, Geoff West Department of Spatal Scences, Curtn Unversty, Perth, Australa CC for Spatal

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

On the Influential Points in the Functional Circular Relationship Models

On the Influential Points in the Functional Circular Relationship Models On the Influental Ponts n the Functonal Crcular Relatonshp Models Department of Mathematcs, Faculty of Scence Al-Azhar Unversty-Gaza, Gaza, Palestne alzad33@yahoo.com Abstract If the nterest s to calbrate

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

Methods of Detecting Outliers in A Regression Analysis Model.

Methods of Detecting Outliers in A Regression Analysis Model. Methods of Detectng Outlers n A Regresson Analyss Model. Ogu, A. I. *, Inyama, S. C+, Achugamonu, P. C++ *Department of Statstcs, Imo State Unversty,Owerr +Department of Mathematcs, Federal Unversty of

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity ECON 48 / WH Hong Heteroskedastcty. Consequences of Heteroskedastcty for OLS Assumpton MLR. 5: Homoskedastcty var ( u x ) = σ Now we relax ths assumpton and allow that the error varance depends on the

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

Lecture 2: Prelude to the big shrink

Lecture 2: Prelude to the big shrink Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors Multple Lnear and Polynomal Regresson wth Statstcal Analyss Gven a set of data of measured (or observed) values of a dependent varable: y versus n ndependent varables x 1, x, x n, multple lnear regresson

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

Statistical Evaluation of WATFLOOD

Statistical Evaluation of WATFLOOD tatstcal Evaluaton of WATFLD By: Angela MacLean, Dept. of Cvl & Envronmental Engneerng, Unversty of Waterloo, n. ctober, 005 The statstcs program assocated wth WATFLD uses spl.csv fle that s produced wth

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Testing for outliers in nonlinear longitudinal data models based on M-estimation

Testing for outliers in nonlinear longitudinal data models based on M-estimation ISS 1746-7659, England, UK Journal of Informaton and Computng Scence Vol 1, o, 017, pp107-11 estng for outlers n nonlnear longtudnal data models based on M-estmaton Huhu Sun 1 1 School of Mathematcs and

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

A Hybrid Variational Iteration Method for Blasius Equation

A Hybrid Variational Iteration Method for Blasius Equation Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 1932-9466 Vol. 10, Issue 1 (June 2015), pp. 223-229 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) A Hybrd Varatonal Iteraton Method

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

On the detection of influential outliers in linear regression analysis

On the detection of influential outliers in linear regression analysis Amercan Journal of Theoretcal and Appled Statstcs 04; 3(4): 00-06 Publshed onlne July 30, 04 (http://www.scencepublshnggroup.com/j/ajtas) do: 0.648/j.ajtas.040304.4 ISSN: 36-8999 (Prnt); ISSN: 36-9006

More information

Linear Correlation. Many research issues are pursued with nonexperimental studies that seek to establish relationships among 2 or more variables

Linear Correlation. Many research issues are pursued with nonexperimental studies that seek to establish relationships among 2 or more variables Lnear Correlaton Many research ssues are pursued wth nonexpermental studes that seek to establsh relatonshps among or more varables E.g., correlates of ntellgence; relaton between SAT and GPA; relaton

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

The Two-scale Finite Element Errors Analysis for One Class of Thermoelastic Problem in Periodic Composites

The Two-scale Finite Element Errors Analysis for One Class of Thermoelastic Problem in Periodic Composites 7 Asa-Pacfc Engneerng Technology Conference (APETC 7) ISBN: 978--6595-443- The Two-scale Fnte Element Errors Analyss for One Class of Thermoelastc Problem n Perodc Compostes Xaoun Deng Mngxang Deng ABSTRACT

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Statistical Hypothesis Testing for Returns to Scale Using Data Envelopment Analysis

Statistical Hypothesis Testing for Returns to Scale Using Data Envelopment Analysis Statstcal Hypothess Testng for Returns to Scale Usng Data nvelopment nalyss M. ukushge a and I. Myara b a Graduate School of conomcs, Osaka Unversty, Osaka 560-0043, apan (mfuku@econ.osaka-u.ac.p) b Graduate

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

The Study of Teaching-learning-based Optimization Algorithm

The Study of Teaching-learning-based Optimization Algorithm Advanced Scence and Technology Letters Vol. (AST 06), pp.05- http://dx.do.org/0.57/astl.06. The Study of Teachng-learnng-based Optmzaton Algorthm u Sun, Yan fu, Lele Kong, Haolang Q,, Helongang Insttute

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

Exponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute

Exponential Type Product Estimator for Finite Population Mean with Information on Auxiliary Attribute Avalable at http://pvamu.edu/aam Appl. Appl. Math. ISSN: 193-9466 Vol. 10, Issue 1 (June 015), pp. 106-113 Applcatons and Appled Mathematcs: An Internatonal Journal (AAM) Exponental Tpe Product Estmator

More information