Using the Bootstrap for Estimating the Sample Size in Statistical Experiments

Size: px
Start display at page:

Download "Using the Bootstrap for Estimating the Sample Size in Statistical Experiments"

Transcription

1 Journal of Modern Appled Statstcal Methods Volume 1 Issue 1 Artcle Usng the Bootstrap for Estmatng the Sample Sze n Statstcal Experments Maher Qumsyeh Unversty of Dayton Follow ths and addtonal works at: Part of the Appled Statstcs Commons, Socal and Behavoral Scences Commons, and the Statstcal Theory Commons Recommended Ctaton Qumsyeh, Maher (013) "Usng the Bootstrap for Estmatng the Sample Sze n Statstcal Experments," Journal of Modern Appled Statstcal Methods: Vol. 1 : Iss. 1, Artcle 9. DOI: 10.37/jmasm/ Avalable at: Ths Regular Artcle s brought to you for free and open access by the Open Access Journals at DgtalCommons@WayneState. It has been accepted for ncluson n Journal of Modern Appled Statstcal Methods by an authorzed edtor of DgtalCommons@WayneState.

2 Usng the Bootstrap for Estmatng the Sample Sze n Statstcal Experments Cover Page Footnote Ths research s supported by Unversty of Dayton Seed Grant. Ths regular artcle s avalable n Journal of Modern Appled Statstcal Methods: ss1/9

3 Journal of Modern Appled Statstcal Methods Copyrght 013 JMASM, Inc. May 013, Vol. 1, No. 1, /13/$95.00 Usng the Bootstrap for Estmatng the Sample Sze n Statstcal Experments Maher Qumsyeh Unversty of Dayton, Dayton OH Efron s (1979) Bootstrap has been shown to be an effectve method for statstcal estmaton and testng. It provdes better estmates than normal approxmatons for studentzed means, least square estmates and many other statstcs of nterest. It can be used to select the actve factors - factors that have an effect on the response - n expermental desgns. Ths artcle shows that the bootstrap can be used to determne sample sze or the number of runs requred to acheve a certan confdence level n statstcal experments. Key words: Efron s bootstrap, expermental factors, statstcal estmaton, confdence level. Introducton Tradtonal methods of fndng sample szes depend on knowng the underlyng dstrbuton. For example, to determne a sample sze that wll result n (1-α) 100% confdence that the sample mean s wthn E unts from the populaton mean the followng s used: zα/σ n = (1) E assumng normalty, or usng the central lmt theorem, and determnng an approxmate value for σ. For a multple regresson model y = β0 + β1x1+ βx + + βkxk + ε () a (1 α) 100% confdence nterval for β j s gven by β ± t j 1 R. (1 R )( N p) / j (3) Maher Qumsyeh s an Assstant Professor n the Department of Mathematcs. Pror to jonng the Unversty of Dayton he was a Professor at Bethlehem Unversty for over 5 years, wth 10 years as Department Char. Emal hm at: qumsyeh@udayton.edu. where, R s the multple coeffcent of determnaton and R j s the same when x j s predcted from the remanng k 1 regressors. Usng equaton (3), the sample sze to predct the standardzed coeffcent wthn E unts of the true value (replacng t wth normal) s gven by z / (1 R ) n = + p. (4) E ( 1 ) R j However wth ths n there s approxmately a 50% chance that the nterval wll be longer than E (Kelly & Maxwell, 003). Hahn and Meker (1991) provde a value for N for whch the confdence s (1-δ) 100% that the nterval obtaned s of a length less than or equal E. The value of such N s ( n ) z / ) 1 (1 R χ δ N = + p. E ( 1 ) R n p j (5) where n s the value found n equaton (4). An alternatve s to use the bootstrap method to determne f the sample sze calculated usng equatons (1) and (5) s necessary or f t s larger than what s needed to acheve a certan confdence. The bootstrap has been shown to provde better than normal estmates of dstrbuton functons of studentzed 45

4 BOOTSTRAP USE FOR ESTIMATING SAMPLE SIZE IN STATISTICAL EXPERIMENTS statstcs (Sngh, 1981; Bckle & Freedman, 1980; Babu and Sngh, 1983, 1984). Qumsyeh (1994) showed that bootstrap approxmaton for the dstrbuton of the studentzed least square estmate s asymptotcally better, not only than the normal approxmaton, but also than the twoterm Edgeworth expanson. Lahr (199) showed the superorty of the bootstrap for approxmatng the dstrbuton of M-estmators. Bhattacharya and Qumsyeh (1989) preseneted an L p -comparson between the bootstrap and Edgeworth expansons. Fnally, Qumsyeh and Shaughnessy (008, 010) showed that the bootstrap can be used to determne the actve factors n two level desgns and how to estmate mssng responses n those desgns. In ths study the bootstrap was appled to three data sets; SAS and the SQL procedure n SAS were used to perform calculatons and resamplng. Data Set 1 Data set 1 s comprsed of 1,000 randomly selected samples of sze 61 each from a normal dstrbuton wth mean 0 and standard devaton (61 s the number n obtaned usng Equaton (1) wth E = 0.5 and α = 0.05). An example of one such sample of sze 61 s: The mean for ths sample was and the standard devaton was Data Set Data set s real data that correlates the GPA y (out of 4) of 194 students from Bethlehem Unversty, wth ther hgh school Math (x 1 ) and Englsh (x ) scores (out of 100 ponts). The frst few observatons are shown n Table 1. The model for data set two s: y = β + β x + β x + ε Table 1: Data Set Example y x 1 x Data Set 3 Data set 3 s an example provded by Bsgaard and Fuller (1995). It s a 4 full factoral experment to determne f blade sze (A), centerng (B), levelng (C) and speed (D) had an effect on the occurrence of undesrable marks on a steel sample. The desgn matrx s shown n Table, where Y represents the number of defectve (undesrable marks) among 0 samples at each settng and P s the proporton of defects at each settng. The Bootstrap The bootstrap was used to analyze the three data sets. SAS programmng and the SQL procedure n SAS were used to perform the analyses. Resamplng wth replacement was conducted 1,000 tmes based on Efron and Tbshran (1993) fndng that 1,000-,000 works best. The SAS program used for data set 1 46

5 MAHER QUMSIYEH Table : Data Set 3 Desgn Matrx Run A B C D Y P s provded n Appendx A; due to the length of the programs for data sets two and three they are not provded. A dfferent procedure was used for each data set. Data Set 1 For the frst example the sample sze was 61, ths s the sample sze necessary for 95% confdence that the sample mean s wthn 0.5 unts from the populaton mean usng the zα/σ equaton: n =. Usng resamplng and E takng a random sample of sze 61 from a N(0, ) dstrbuton, t s resampled 1,000 tmes wth replacement, the mean of each of the 1,000 samples s calculated and half the dfference between the.5 and 97.5 percentles of the 1,000 means s found. Ths should be the value of E. One sample of sze 61 from a N(0, ) dstrbuton to another the value of such E wll vary to a great degree, thus, ths procedure s repeated several tmes (500 n ths case) and an nterval for the values of such E s s lsted. The sample sze contnued to decrease and the values of E contnued to be recorded. Results are shown n Table 3. Table 3: Data Set 1 Results n Table 3 shows that a sample of sze 61 was not necessary; 48 would have been suffcent. The bootstrap was repeated 500 E

6 BOOTSTRAP USE FOR ESTIMATING SAMPLE SIZE IN STATISTICAL EXPERIMENTS tmes, each resamplng 1,000 tmes usng a dfferent 61 randomly selected data ponts wth replacement from a N(0, ) dstrbuton and the values of all 500 replcatons were n the nterval gven whch s for n = 61. (See Appendx A for the SAS program used for data set 1 wth n = 48.) Data Set The second data set, whch correlates the unversty GPA to hgh school Englsh and math scores, the model s: y= β 0 + β 1 x 1 + β x +ε. The ntal results usng SAS for the whole data set are shown n Table 4. Frst usng equaton (4), z / (1 R ) n = + p, E ( 1 ) R j to fnd an ntal n usng α as 0.05 and E as 0. for the standardzed betas (the value of E depends on the type of data at hand). Usng all 194 data ponts and the values of R and R j from the data set (R and R j ) and equaton (4), the value of s 60 (approxmatng to the next nteger), however, wth ths n there s approxmately a 50% chance that the nterval wll be longer than E. (Kelly & Maxwell, 003). By contrast, usng equaton (5), ( n ) z / ) 1 (1 R χ δ N = + p, E ( 1 ) R n p j δ = 0.05 and j = the value of N s determned to be 78; ths results n 95% confdence that the nterval obtaned s of a length less than or equal E (Hahn & Meker, 1991). Note that the value of E used s for the standardzed betas; thevalue of E for the non-standardzed betas (EN) wll be approxmately S y EN E = = S x Next, the half-length of the confdence nterval for β s determned usng the bootstrap method and a random sample of sze 78 from the 194 orgnal observatons. The procedure s as follows: 1. Select 78 ponts usng random samplng wthout replacement from the 194; name ths as subset and perform a regular regresson procedure obtanng (X 1,1, X,1, Y 1,E 1 ), (X 1,, X,, Y,E ),..., (X 1,78, X,78, Y 78,E 78 ). Here, E = Y Y, s the resdual for the th observaton.. Select 1,000 samples wth replacement from the subset, ths s the bootstrap sample. Each sample has 78 ponts and samples are desgnated as {sample 1 }, {sample },..., {sample 1000 }. 3. Examne {sample 1 }; n=78 ponts taken wth replacement from the subset. We have sets of ponts (X 1,1 *, X,1 *, Y 1 *, E 1 *), (X 1, *, X, *, Y *, E *),..., (X 1,78 *, X,78 *, Y 78 *,E 78 *). Each (X 1,j *, X,j *, Y j *, E j *) can be any of the (X 1,1, X,1, Y 1,E 1 ), (X 1,, X,, Y, E ),..., (X 1,78, X,78, Y 78, E 78 ) wth probablty 1/ Fnd the average of the E * s, and name ths ME 1. Due to the fact that the mean of the errors s assumed to be 0, standardze the errors by subtractng ME 1 from each of them. 5. Stll usng {sample 1 }, the new Y s are obtaned by the addng the respectve standardzed error term to the predcted values and these are termed as new Y j s. 6. Contnue to examne {sample 1 }; usng the least-square method, fnd the slope and the ntercept, (slope 1, slope, ntercept 1 ), based on (X 1,1 *, X,1 *,, newy 1 * ), (X 1, *, X, *, newy * ),..., (X 1,78 *,X,78 *, newy 78 *). 7. Repeat steps 3-6 for the other 999 samples to obtan 1,000 estmates for the ntercept β 0 and the slopes β 1 and β. Interest s n β. 8. Estmate the value of β by averagng the 1,000 estmates of β and calculate a 95% CI for β by fndng the.5 and 97.5 percentle 48

7 MAHER QUMSIYEH of those 1,000 values. Half the length of ths nterval, E *, wll be compared wth the EN = prevously obtaned. Based on ths procedure, how s t known that there s 95% confdence that half the length of the nterval wll not exceed EN? The answer s that by repeatng steps 1-8, 1,000 tmes to obtan 1,000 EN s and then fndng the top 95 percentle, t should not exceed EN. The estmate for β from one random subset of 78 ponts was = and a 95% CI for β was (0.0089, 0.04). Ths assumes that all condtons, such as normal resduals and constant varances, hold; n addton, half the length of the nterval s , whch s smaller than expected (0.0097). The n = 78 guarantees that 95% of the cases wll result n smaller half lengths. Usng the bootstrap method dscussed results n a mean half-length of 1,000 runs of the bootstrap method of and a 95% confdence nterval of ( , 0.010): ths s wthout any assumptons on the model. An estmate for β was calculated as an average of the 1,000 bootstrap sample estmate for β t was = for one run wth a 95% CI of ( , ). The bootstrap was repeated 1,000 tmes and the average value for the estmated values of β was wth a 95% CI of ( ). Wthout any assumpton on the model, the bootstrap produced an estmate for β that was close to that produced assumng the regular model assumptons hold, n addton, the length of the 95% CI was a lttle shorter than expected usng a sample of sze 78 ( vs ). It s mportant to note that the calculatons were carred out wthout assumng the error terms to be normal, however, t s valuable to understand what wll happen f the error terms n the example are exactly normal. Table 4: SAS Results for Data Set Model: Model1 Dependent Varable: y Number of Observatons Read: 194 Number of Observatons Used: 194 Analyss of Varance Source DF Sum of Mean Squares Square F Value Pr > F Model < Error Corrected Total Root MSE R-Square Dependent Mean Adj R-Sq Coeff Var Parameter Estmates Varable DF Parameter Standard Estmate Error t value Pr > t 95% Confdence Lmts Intercept x < x <

8 BOOTSTRAP USE FOR ESTIMATING SAMPLE SIZE IN STATISTICAL EXPERIMENTS nd Data Set (Normal Errors) For a random sample of sze 78, the predcted values of the y s, y were calculated and a new varable w was defned as: w = y + cε where ε s randomly chosen from a N(0,1) dstrbuton. The new w s wth the orgnal x s wll have normal errors wth constant varance. Note that the varance of the w s must be the same as the orgnal y s to be able to compare the length of the confdence nterval for the new β wth the prevous one. The solver functon n Mcrosoft Excel was used to provde a value for c to acheve ths; ths value of c was calculated to be Ths mproved the prevous results and half the length of the 95% CI for β usng the bootstrap method was much smaller ( vs ). Ths shows that usng the bootstrap requres a smaller sample sze than the prevous estmate of n = 78. Data Set 3 Bsgaard and Fuller (1995) provded a table that gves estmated sample szes (n) for the number of runs at each settng for two level full factoral experments usng proportons as a response. Ther estmate for n whch represents the number of runs needed to detect an error of sze Δ n the untransformed scale s gven by n ( z z β ) Nδ / = (6) where N s the total number of basc runs n a k factoral experment (4, 8, 16, ), α and β are the probabltes of type I and type II errors, 0.05 and 0.1 respectvely, and δ s the expected value of the effect (Bsgaard & Fuller, 1995). Bsgaard and Fuller s table presents values of Δ that vary from 10% to 90% of the proporton of defectve (p 0 ) and shows that sample sze depends on the average defectve level. If the average defectve level s low, for example 5%, a larger sample sze s needed to ndcate that a change has truly occurred. For the 3 rd data set the current level of defectve (p 0 ) was not known, t was approxmated wth the average proporton of defectve n the sample at each settng whch s Because n s gven n ths experment as 0, the method descrbed n Bsgaard and Fuller (1995) or the table they provde can be used to determne the mnmum sze of detectable error. For α to be at most 0.05, the mnmum sze s Δ >0.185; calculatng the effect sze of each factor, t was found that factors A, B and the AB nteracton have effect szes larger than ths (0.984, 0.08, 0.47 respectvely). Ths agrees wth the half normal plot (Danel, 1959) whch states that factors B and the AB nteracton appear to be slghtly actve (not very clear) and that factor A s a defntely actve factor (see Fgure 1). In the calculatons descrbed t s not certan that a 95% confdence nterval for the effect sze wll have ts lower bound less than for those factors (A, B and AB). Qumsyeh and Shaughnessy (008, 010) showed that the bootstrap can be used (under no assumptons) to determne actve factors n factoral experments, to estmate the sze of the effect and to determne a confdence nterval for the effect sze. The method can be descrbed wth the followng steps usng factor A for llustraton purposes: 1. Sample N/ responses wth replacement from data at the +1 level of the gven factor A.. Sample N/ responses wth replacement from data at the 1 level of the gven factor A. 3. Estmate the effect of that factor usng the dfference between +1 level and 1 level. 4. Repeat the samplng procedure a large number of tmes (1,000 n ths example). 5. Fnd the average of the 1,000 values; ths s an estmate of the effect sze of factor A. Determne the upper (1 α/) and lower α/ percentle ponts of the resampled effect values found n step 4. Use these values to construct the 50

9 MAHER QUMSIYEH Fgure 1: Half-Normal Plot for the Effects n Data Set 3.5 A CD ACD ABD AC BD C BCD D BC ABCD ABC AD B AB effect sze. If the confdence nterval doesn t contan 0 then ths factor s an actve factor a factor that has an effect on the response. Usng the procedure descrbed prevously for data set 3, the followng were determned: All confdence ntervals for the effect sze for all factors except factor A contaned 0, therefore they must be assumed as nactve factors. For factors A, B and the AB whch appear to be effectve usng the normal plot and are reported as actve factors by Bsgaard and Fuller (1995), usng the proporton of defectves, the results were as follows: Factor A: The mean effect sze for the 1,000 runs was and a 95% confdence nterval for the effect sze was (0.6188, ). Factor B: The mean effect sze for the 1,000 runs was and a 95% confdence nterval for the effect sze was ( 0.313, ). Factor AB (The AB nteracton): The mean effect sze for the 1,000 runs was and a 95% confdence nterval for the effect sze was ( 0.406, ). Results show that only factor A can be consdered actve. If ths s the case, the confdence nterval for effect A has a lower bound of whch leads to a much hgher value than the least expected of Ths ndcates that a sample sze smaller than 0 would have been suffcent. 51

10 BOOTSTRAP USE FOR ESTIMATING SAMPLE SIZE IN STATISTICAL EXPERIMENTS Concluson The bootstrap method can be used to determne sample szes n statstcal experments and to check whether a certan sample sze used s more than s needed by examnng the length of the confdence nterval resultng from usng the bootstrap method. The bootstrap s also good for selectng actve factors and n constructng confdence ntervals for effect sze. The avalablty of computers and statstcal software make usng re-samplng (bootstrap) easy and fast and provdes good predctons. Acknowledgment Ths research s supported by Unversty of Dayton Seed Grant. References Babu, G., & Sngh, K. (1983). Inference on means usng the bootstrap. The Annals of Statstcs, 11, Babu, G. J., & Sngh, K. (1984). On one term Edgeworth correcton by Efrons bootstrap. Sankhya, 46, Seres A, Bhattacharya, R. N., & Qumsyeh, M. (1989). Second order and L p - comparson between the bootstrap and emprcal Edgeworth expanson methodologes. Annals of Statstcs, 17, Bckel, P. J., & Freedman, D. A. (1980). On Edgeworth expansons for the bootstrap. Unpublshed. Bsgaard, S., & Fuller, H. (1995). Sample sze estmate for k-p desgns wth bnary responses. Qualty Engneerng, 7(4), Danel, C. (1959). Usng of half normal plots n nterpretng factoral two-level experments. Technometrcs, 1, Efron, B. (1979). Bootstrap methods: Another look at jacknfe. The Annals of Statstcs, 7, 1-6. Efron, B., & Tbshran, R. (1993). An ntroducton to the bootstrap. New York, NY: Chapman and Hall. Hahn, G. J., & Meeker, W. Q. (1991). Statstcal ntervals: A gude for practtoners. New York, NY: Wley. Kelley, K., & Maxwell, E. (003). Sample sze for multple regresson: Obtanng regresson coeffcents that are accurate, not smply sgnfcant. Psychologcal Methods, 8(3), Lahr, S. (199). Bootstrappng M- estmators of a multple lnear regresson parameter. The Annals of Statstcs, 0(3), Qumsyeh, M. (1994). Bootstrappng and emprcal Edgeworth expansons n multple lnear regresson models. Communcatons n Statstcal Theory and Methods, 3(11), Qumsyeh, M., & Shaughnessy, G. (008). Usng the bootstrap to select actve factors n unreplcated factoral experment. In JSM Proceedngs, Statstcal Computng Secton. Alexandra, VA: Amercan Statstcal Assocaton. Qumsyeh, M., & Shaughnessy, G. (010). Bootstrappng Un-replcated two-level desgns wth mssng responses. Journal of Statstcs: Advances n Theory and Applcatons, 4, Sngh, K. (1981). On the asymptotc accuracy of Efron s bootstrap. The Annals of Statstcs, 9, Appendx A: SAS Program Used for Data Set 1 %macro rep; %do rep=1 %to 500; data ma (drop=); *%let num=48; *do =1 to &num; ; do =1 to 48; x=*rannorm(56367)+0; output; end; %macro numberng(n); data numberng; do =1 to &N; output; end; qut; data ma1; set numberng; set ma; %mend; %numberng(48); %macro repeat; 5

11 MAHER QUMSIYEH Appendx A (contnued): SAS Program Used for Data Set 1 %do repeat=1 %to 1000; %macro dstnct(nthrow, N); data table; do j = 1 to &nthrow; pt = nt(nt(ranun(0) * &N) + 1.5); set ma1 pont=pt; output; end; stop; * requred for pont= ; %mend; %dstnct(48,48); proc means data=table n mean noprnt; var x; output out=table3 n=n mean=mx; qut; proc sql; create table tablef as select a1.mx as mx1 from table3 as a1; proc append data=tablef base=summary force; %end; %mend; %repeat; proc unvarate data=summary noprnt; var mx1; output out=z1 mean=tm pctlpts =.5, 97.5 pctlname=p5 p975 pctlpre = mx1; data E; set z1; E=(mx1p975-mx1p5)/; proc append data=e base=e1 force; proc sql; drop table E ; drop table Ma ; drop table Ma1 ; drop table Numberng ; drop table Summary ; drop table Table ; drop table Table3 ; drop table Tablef ; drop table z1 ; qut; %end; %mend; %rep; proc unvarate data=e1 noprnt; var E; output out=length mean=tm pctlpts =.5, 97.5 pctlname=p5 p975 pctlpre = E; qut; 53

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T.

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T. Assgnment 5 Smulaton for Logstcs Mont, N.E. Yunta, T. November 26, 2007 1. Smulaton Desgn The frst objectve of ths assgnment s to derve a 90% two-sded Confdence Interval (CI) for the average watng tme

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Systematic Error Illustration of Bias. Sources of Systematic Errors. Effects of Systematic Errors 9/23/2009. Instrument Errors Method Errors Personal

Systematic Error Illustration of Bias. Sources of Systematic Errors. Effects of Systematic Errors 9/23/2009. Instrument Errors Method Errors Personal 9/3/009 Sstematc Error Illustraton of Bas Sources of Sstematc Errors Instrument Errors Method Errors Personal Prejudce Preconceved noton of true value umber bas Prefer 0/5 Small over large Even over odd

More information

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors ECONOMICS 5* -- Introducton to Dummy Varable Regressors ECON 5* -- Introducton to NOTE Introducton to Dummy Varable Regressors. An Example of Dummy Varable Regressors A model of North Amercan car prces

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Lecture 6 More on Complete Randomized Block Design (RBD)

Lecture 6 More on Complete Randomized Block Design (RBD) Lecture 6 More on Complete Randomzed Block Desgn (RBD) Multple test Multple test The multple comparsons or multple testng problem occurs when one consders a set of statstcal nferences smultaneously. For

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

17 - LINEAR REGRESSION II

17 - LINEAR REGRESSION II Topc 7 Lnear Regresson II 7- Topc 7 - LINEAR REGRESSION II Testng and Estmaton Inferences about β Recall that we estmate Yˆ ˆ β + ˆ βx. 0 μ Y X x β0 + βx usng To estmate σ σ squared error Y X x ε s ε we

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

On Distribution Function Estimation Using Double Ranked Set Samples With Application

On Distribution Function Estimation Using Double Ranked Set Samples With Application Journal of Modern Appled Statstcal Methods Volume Issue Artcle -- On Dstrbuton Functon Estmaton Usng Double Raned Set Samples Wth Applcaton Wald A. Abu-Dayyeh Yarmou Unversty, Irbd Jordan, abudayyehw@yahoo.com

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Biostatistics 360 F&t Tests and Intervals in Regression 1

Biostatistics 360 F&t Tests and Intervals in Regression 1 Bostatstcs 360 F&t Tests and Intervals n Regresson ORIGIN Model: Y = X + Corrected Sums of Squares: X X bar where: s the y ntercept of the regresson lne (translaton) s the slope of the regresson lne (scalng

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

Regression Analysis. Regression Analysis

Regression Analysis. Regression Analysis Regresson Analyss Smple Regresson Multvarate Regresson Stepwse Regresson Replcaton and Predcton Error 1 Regresson Analyss In general, we "ft" a model by mnmzng a metrc that represents the error. n mn (y

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity ECON 48 / WH Hong Heteroskedastcty. Consequences of Heteroskedastcty for OLS Assumpton MLR. 5: Homoskedastcty var ( u x ) = σ Now we relax ths assumpton and allow that the error varance depends on the

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Cathy Walker March 5, 2010

Cathy Walker March 5, 2010 Cathy Walker March 5, 010 Part : Problem Set 1. What s the level of measurement for the followng varables? a) SAT scores b) Number of tests or quzzes n statstcal course c) Acres of land devoted to corn

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek

Discussion of Extensions of the Gauss-Markov Theorem to the Case of Stochastic Regression Coefficients Ed Stanek Dscusson of Extensons of the Gauss-arkov Theorem to the Case of Stochastc Regresson Coeffcents Ed Stanek Introducton Pfeffermann (984 dscusses extensons to the Gauss-arkov Theorem n settngs where regresson

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors Multple Lnear and Polynomal Regresson wth Statstcal Analyss Gven a set of data of measured (or observed) values of a dependent varable: y versus n ndependent varables x 1, x, x n, multple lnear regresson

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

Chapter 6. Supplemental Text Material. Run, i X i1 X i2 X i1 X i2 Response total (1) a b ab

Chapter 6. Supplemental Text Material. Run, i X i1 X i2 X i1 X i2 Response total (1) a b ab Chapter 6. Supplemental Text Materal 6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Comparison of Re-sampling Methods to Generalized Linear Models and Transformations in Factorial and Fractional Factorial Designs

Comparison of Re-sampling Methods to Generalized Linear Models and Transformations in Factorial and Fractional Factorial Designs Journal of Modern Applied Statistical Methods Volume 11 Issue 1 Article 8 5-1-2012 Comparison of Re-sampling Methods to Generalized Linear Models and Transformations in Factorial and Fractional Factorial

More information

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for P Charts. Dr. Wayne A. Taylor Taylor Enterprses, Inc. Control Lmts for P Charts Copyrght 2017 by Taylor Enterprses, Inc., All Rghts Reserved. Control Lmts for P Charts Dr. Wayne A. Taylor Abstract: P charts are used for count data

More information

The SAS program I used to obtain the analyses for my answers is given below.

The SAS program I used to obtain the analyses for my answers is given below. Homework 1 Answer sheet Page 1 The SAS program I used to obtan the analyses for my answers s gven below. dm'log;clear;output;clear'; *************************************************************; *** EXST7034

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA Sngle classfcaton analyss of varance (ANOVA) When to use ANOVA ANOVA models and parttonng sums of squares ANOVA: hypothess testng ANOVA: assumptons A non-parametrc alternatve: Kruskal-Walls ANOVA Power

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University PHYS 45 Sprng semester 7 Lecture : Dealng wth Expermental Uncertantes Ron Refenberger Brck anotechnology Center Purdue Unversty Lecture Introductory Comments Expermental errors (really expermental uncertantes)

More information

Stat 642, Lecture notes for 01/27/ d i = 1 t. n i t nj. n j

Stat 642, Lecture notes for 01/27/ d i = 1 t. n i t nj. n j Stat 642, Lecture notes for 01/27/05 18 Rate Standardzaton Contnued: Note that f T n t where T s the cumulatve follow-up tme and n s the number of subjects at rsk at the mdpont or nterval, and d s the

More information

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for U Charts. Dr. Wayne A. Taylor

Copyright 2017 by Taylor Enterprises, Inc., All Rights Reserved. Adjusted Control Limits for U Charts. Dr. Wayne A. Taylor Taylor Enterprses, Inc. Adjusted Control Lmts for U Charts Copyrght 207 by Taylor Enterprses, Inc., All Rghts Reserved. Adjusted Control Lmts for U Charts Dr. Wayne A. Taylor Abstract: U charts are used

More information

A Method for Analyzing Unreplicated Experiments Using Information on the Intraclass Correlation Coefficient

A Method for Analyzing Unreplicated Experiments Using Information on the Intraclass Correlation Coefficient Journal of Modern Appled Statstcal Methods Volume 5 Issue Artcle 7 --5 A Method for Analyzng Unreplcated Experments Usng Informaton on the Intraclass Correlaton Coeffcent Jams J. Perrett Unversty of Northern

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5). (out of 15 ponts) STAT 3340 Assgnment 1 solutons (10) (10) 1. Fnd the equaton of the lne whch passes through the ponts (1,1) and (4,5). β 1 = (5 1)/(4 1) = 4/3 equaton for the lne s y y 0 = β 1 (x x 0

More information