EXST7015 : Statistical Techniques II ANOVA Design Identification Page 1

Size: px
Start display at page:

Download "EXST7015 : Statistical Techniques II ANOVA Design Identification Page 1"

Transcription

1 NOV Desgn Identfcaton Page 1 Expermental Desgn Identfcaton To correctly desgn an experment, or to analyze a desgned experment, you must be able to look at a desgn stuaton and correctly assess the salent aspects of the desgn. I wll ask you to dentfy the desgn, the treatment varable, dependent varable, degrees of freedom error, expermental unt, samplng unt (f any), and f the treatment s fxed or random. To begn wth, determne what the nvestgators are tryng to do and what they plan to measure. What s the Objectve of the study? pecfcally, what hypotheses are to be tested? What s the varable of nterest? What unt s the treatment appled to? What, exactly, s the unt measured? uppose an nvestgator wants to compare the oxygen levels n seven predefned "habtats" n the Lousana marsh. He wll randomly select and sample 4 stes n each habtat. One oxygen measurement s made at each ste. What varable s beng measured? Ths varable wll produce a seres of measurements or quanttes? Oxygen levels (usually n ppm) What are the treatments? What s the nvestgator nterested n comparng or testng for dfferences? Habtats (t=7) re there any blocks (.e. sources of varaton that should be recognzed, but whch are not mportant to the nvestgator). For example, dd he replcate the experment n several dfferent rvers or several locatons along the coast? re the 4 replcates just multple observatons or are they taken n 4 separate places? pparently no blocks. What are the expermental unts for the experment? What unt was the treatment appled to or what was sampled for each treatment (habtat)? ste (s=4) re there separate samplng unts at each ste, or s only one measurement taken n each expermental unt? In ths case t s a water sample on whch oxygen s measured. nce there s only a sngle sample at each ste we can consder each sample to represent the ste. lso the stes If there were multple samples taken at each ste these would be the samplng unts. These n turn can be splt nto sub-samplng unts. pparently ths was not done. Is that all? There are other ssues, not all of whch we have covered. re the treatments fxed or random? Is the desgn balanced? re there any partcular hypothess tests of nterest (contrasts)? re the treatment levels quanttatve? ny other specal post NOV applcatons? The topc of Desgn wll be dscussed n the second half of the course. For the moment our objectve s only to learn to dentfy the components of an experment. Therefore, I wll put a desgn descrpton on the Internet and durng each class perod I expect you to have looked at t and to be prepared to answer the followng questons.

2 NOV Desgn Identfcaton Page Questons: 1) What s the treatment arrangement for ths experment? (a) sngle factor (b) factoral (c) nested ) What s the expermental desgn for ths experment? (a) RD (b) RD (c) LD (e) plt-plot (d) Repeated Measures 3) Does t seem more lkely that the treatments are fxed or random? (a) fxed (b) random 4) What s the expermental unt for ths experment? (a) pens (b) dets (c) lve weght (d) egg yolk weght (e) ndvdual chckens 5) What s the samplng unt for ths experment? (a) pens (b) dets (c) lve weght (d) egg yolk weght (e) ndvdual chckens 6) What s the dependent varable for ths experment? (a) pens (b) dets (c) lve weght (d) egg yolk weght (e) ndvdual chckens 7) What s the treatment varable for ths experment? (a) pens (b) dets (c) lve weght (d) egg yolk weght (e) ndvdual chckens 8) If the desgn s RD, what are the blocks? (a) pens (b) dets (c) lve weght (d) egg yolk weght (e) ndvdual chckens (f) N 9 & 10) How many degrees of freedom are avalable for testng the treatment (combnatons)? Enter the correct value here: numerator =, denomnator = For the lttle experment dscussed the source table s: ource d.f. Treatment 6 Error 1 Total 7 fnal note on the Daly Desgns. These wll start early n the semester wth the smpler desgns and progress to more complcated desgns. Our only objectve wth these desgns s that you learn to dentfy the mportant aspects and components of the desgns. Durng the second half of the course we wll dscuss the desgns n detal. We wll see why these components exst, how they are analyzed and how they are nterpreted. t the start of each class I wll roll a dce. If a value of 6 s obtaned we wll have a quz. If any other value s rolled I wll smply gve you the answers to the quz. If a value other than 6 s rolled, then that number s not to be counted agan untl after a 6 has occurred. For example, f I roll a 3 then any values of 3 on subsequent days would not count untl a 6 occurred. Once a 6 s rolled all numbers are back n contenton.

3 NOV Desgn Identfcaton Page 3 Identfyng desgns The smplest type of desgn s the ompletely Randomzed Desgn (RD) Ths wll consst of: 1) a treatment (at least one, possbly more), and ) an error term (at least one, possbly more) Example 1: researcher s studyng the sze of tomatoes from plants grown under three waterng regmes, () daly waterng, () waterng at day ntervals and () waterng at 3 day ntervals. Eghteen plants are planted n large pots and 6 are randomly selected for each waterng regme. Plant productvty s measured as the total weght of the frst 10 tomatoes (n cm) produced by each plant or pot (these are synonymous for ths study snce there s only one plant per pot). Dagnoss: What are the treatments and expermental unts? The objectve s to compare the total weght of tomatoes. The varable of nterest (dependent varable) s the total weght of the tomatoes. The treatment s the varable whose levels are to be compared. In ths case the treatment has 3 levels, (.e. the 3 waterng regmes). The expermental unt s the entty that s assgned a treatment. In ths case the treatments are randomly assgned to pots / plants (synonymous n ths case). Note that the weght values we analyze are NOT for ndvdual tomatoes, but are the mean for each plant. nce we are nterested n the mean for each plant we wll have a dataset wth only one value for each plant or pot, so we consder pot/plant to be the samplng unt. In ths partcular experment the pots or plants are both the samplng unt and the expermental unt. In other experments the tomatoes could be measured ndvdually and the samplng unt would be ndvdual tomatoes. However, when the ndvdual unts are summed or averaged to gve just one measurement for each plant, the plant would be consdered the samplng unt. nce the desgn does not have ndvdual tomatoes we have the followng. Descrpton: The model s Y j = + + j Ths analyss s a RD wth a sngle factor treatment and an expermental error. Each treatment was replcated 6 tmes. The source table: ource d.f. EM Waterng Regme t 1 = σ + Q τ Plant (Regme) t(n 1) = 15 σ Total tn 1 = 17

4 NOV Desgn Identfcaton Page 4 Random versus Fxed treatment effects: There are a few other aspects of a desgn that wll be mportant to dentfyng desgns and achevng the correct analyss. One of these s to determne f treatment levels are randomly chosen from a large (theoretcally nfnte) number of choces, or f the treatment levels represent all possble levels, or at least all levels of nterest. Treatment levels that are randomly chosen from a large number of possble levels estmate the varablty among the levels, and are actually estmates of varance components. These are represented as, and represent the varablty among all the levels of the treatment. When the levels of the treatment represent all possble levels, or f they are chosen by the nvestgator as the only levels of nterest they are called fxed effects, and statstcal nference s lmted to the treatment levels ncluded n the experment. Fxed effect treatments do not estmate varances, but rather the sum of squared treatment effects (devatons of each treatment level from the overall mean,.e. n These are summed ( τ Y Y. 1 ) and are represented as Q τ τ Y Y ).. There are obvously many possble waterng regmes, but the 3 of nterest above do not appear to be randomly chosen. They would be fxed. For most of the smple analyses that we wll examne the analytcal dfferences between fxed effects and random effects wll not be obvous. However, for larger expermental desgns the dfferences become very mportant, so we wll contnue to pay attenton to ths aspect of each analyss. Example : uppose we modfed the experment above slghtly by mantanng ndvdual measurements for each tomato. Now, nstead of 18 measurements (one mean per plant) we would have 180 measurements, one weght for each of 10 tomatoes on each of the 18 plants. How would ths change the desgn? Dagnoss: What are the treatments and expermental unts? The objectve s to compare the weght of tomatoes, but n ths case the varable of nterest (dependent varable) s the weght of ndvdual tomatoes. The treatment s stll the 3 waterng regmes. The expermental unt (the entty that s assgned a treatment) s stll the pots or plants. The samplng unt s now the ndvdual tomatoes, snce we now measure each tomato ndvdually and record a separate measurement for each tomato frut.

5 NOV Desgn Identfcaton Page 5 Descrpton: The model s Y jk = + + j + jk Ths analyss s a RD wth a sngle factor treatment and wth both an expermental error for the expermental unts and a samplng error for the samplng unts. Each treatment was replcated 6 tmes and has 10 samplng unts per expermental unt. The source table: ource d.f. EM Regme t 1 = σ + nσ γ + Q τ Plant (Regme) t(p 1) = 15 σ + nσ γ Tomato (Plant x Regme) tp(n 1) = 16 σ Total tpn 1 = 179 Note that the σ estmates the component of varablty among tomatoes and σ γ estmates the component of varablty among the pots or plants. These two sources of varablty (among tomatoes and among pots) may or may not dffer (.e. σ γ may equal zero). lso note that the approprate error term for the waterng regme s the expermental error term. Example 3: uppose the room where we were to culture the plants has hghly varable condtons. In partcular there s a strong east to west varaton n temperature and lght condtons. If we place the treatments completely at random, we may get too many of one treatment on the east, where condtons are better, makng that treatment look better than t really s. lso, the dfferng condtons wll cause extra varablty among the plants producton and, therefore, the tomato weghts. Ths extra varablty wll be added to the error term, reducng the power of the experment (tests are the most powerful when the error term s small and the treatment dfferences are large). If we gnore the east-west varaton t wll become part of the error term. o we decde to place our treatments n 6 groups. Each group contans only 1 replcate of the each treatment, but the 6 groups themselves consttute our replcaton. Note that the 6 groups could be any type of groupng that accounts for varaton. For example, f we only had 3 sutable pots or chambers for the experment we could replcate the experment n 6 tme perods. s before, n the frst experment, we wll take the total weght of 10 tomatoes per plant for comparng the plants. Group 1 Group Group 3 Group 4 Group 5 Group 6

6 NOV Desgn Identfcaton Page 6 Dagnoss: What are the treatments and expermental unts? The objectve s to compare tomato total weghts. The varable of nterest s the total weght of 10 tomatoes, as before. The treatment s stll the 3 waterng regmes. The expermental unt s stll the pots or plants, and these also the samplng unts whch agan provde one measurement for each plant. The new wrnkle s the 6 dfferent groups on whch we conduct our experment. Due to the potental dfferences (east-west), we wll want to remove ths varaton from the error term. We do ths by blockng on group and actually puttng a separate varable n the model to account for block dfferences. If we don t, ths addtonal varaton among groups (east-west) goes nto the error term and reduces our power. Descrpton: The model s Y j = + + j + j Ths analyss s a Randomzed lock Desgn (RD) wth a sngle factor treatment and an expermental error. The source table: ource d.f. EM lock b 1 = 5 σ + tσ β Regme t 1 = σ + Q τ lock x Regme (t 1)(b 1) = 10 σ Total tb 1 = 17 Example 4: uppose we now combne the second and thrd desgn. We conduct the experment wth only 3 pots n each of the 6 groups, but we measure and record 10 ndvdual tomato weghts from each plant. Dagnoss: What are the treatments and expermental unts? The objectve s to compare tomato weghts, and we wll agan do ths wth ndvdual tomatoes. The treatment s stll the 3 waterng regmes. The expermental unt s stll the pots or plants, and the samplng unts (ndvdual tomatoes) agan provde one measurement for each plant. gan we are blockng on the groupngs. Descrpton: The model s Y jk = + + j + j + jk Ths analyss s a Randomzed lock Desgn (RD) wth a sngle factor treatment and wth both an expermental error and a samplng error. The source table: ource d.f. EM lock b 1 = 5 σ + nσ βτ + ntσ β Regme t 1 = σ + nσ βτ + Q τ lock x Regme (t 1)(b 1) = 10 σ + nσ Tomato (lock x Regme) tb(n 1) = 16 Total tbn 1 = 179

7 NOV Desgn Identfcaton Page 7 Example 5: One last example. uppose we go back to our frst experment, the RD where we take the total weght of 10 tomatoes per plant. However, suppose we have two treatments of nterest nstead of the sngle factor (regme). The two treatments of nterest are (a) the 3 waterng regmes and we are also nterested n comparng between plants that receve a low level of fertlzer and those that receve a hgh level. We wll examne all 6 combnatons of treatments, and 3 of the 18 potted plants wll be allocated to each treatment combnaton. Notce that the treatments are cross classfed. Each level of Waterng occurs wth each level of Fertlzer such that all possble combnatons exst. Ths s characterstc of a factoral analyss of varance, also called a two-way NOV. Fertlzer treatment ( levels) Waterng regme treatment (3 levels) daly waterng () -day ntervals () 3-day ntervals () Low (1) 3 replcate pots 3 replcate pots 3 replcate pots Hgh () 3 replcate pots 3 replcate pots 3 replcate pots Dagnoss: What are the treatments and expermental unts? The objectve s to compare tomato weghts, and we wll have one mean per plant. There are now two treatments, one wth levels and one wth 3 levels. Ths s called a x3 factoral treatment arrangement. Note that low fertlzer and hgh fertlzer would appear to cover all possble levels of ths treatment, so ths treatment s also fxed. oth the expermental unts and samplng unts are stll the pots / plants. There s no blockng n ths experment, so ths analyss s a RD wth a x3 factoral treatment arrangement. Descrpton: The model s Y jk = + + j + j + jk The source table: ource d.f. EM Fertlzer t 1 1 = 1 σ + Q Regme t 1 = σ + Q Fertlzer x Regme (t 1 1)(t 1) = σ + Q Pot (Fertlzer x Regme) t 1 t (n 1) = 1 σ Total t 1 t n 1 = 17 τ1 τ τ1τ

8 NOV Desgn Identfcaton Page 8 lternatvely: let treatments represent all t=6 combnatons of t 1 by t ource d.f. Treatments t 1 = 5 Expermental Error t(n 1) = 1 Total tn 1 = 17 ome fnal notes on the dentfcaton of desgned experments. There are other types of desgns that we wll dscuss ths semester. We wll dscuss the Latn quare Desgn (LD), whch has a pecular structure wth two sources of blockng (genercally referred to as rows and columns ). The treatments are arranged n such a way that each treatment occurs once n each row and once n each column. nother major class of desgns wll be dscussng under the ttle of plt plots and Repeated measures. Ths class of desgns has an ntal structure that can be RD, RD or LD wth treatment arrangements that may be ether a sngle factor or factoral. Then each expermental unt s ether splt nto two or more unts and a new treatment appled to the sub-unts of the expermental unt, or each expermental unt s sampled over tme. In the latter case tme becomes a source of varaton of nterest, and s ncluded n the source table. For example, n our tomato plant example above we mght splt our expermental unt (the plant) nto two levels, the lower half of the plant and the upper half of the plant. We mght take 10 tomatoes from the lower half and 10 tomatoes from the upper half. The new varable level would be ncluded n the experment. Ths would be an example of a splt plot. nother possblty s that we are nterested n the changng sze of the tomatoes produced by each plant over tme. We mght take the frst 10 tomatoes (tme 1), and the second 10 tomatoes (tme ), etc. Ths would gve a new varable called tme that would show f the sze of tomatoes was the same over tme or not, and f the dfferent waterng regmes were smlar of dfferent over tme. There s also another treatment arrangement called the nested treatment arrangement. Ths arrangement has a herarchcal arrangement of treatments wth one level nested wthn the hgher level. Do not confuse ths treatment arrangement wth the nested error terms dscussed above where the samplng error s also nested wthn the expermental error n a herarchcal fashon. lso note that t s possble to have many levels of nestng of both treatments and error terms. Random effects versus fxed effects. When the effects are fxed we are usually nterested n the ndvdual treatment levels. Frequently, we wll calculate a mean for each treatment level and do comparsons and tests among those means. Random effects, however, represent a random selecton from a very large number of possble levels, and we are not usually nterested n each of the ndvdual levels selected. For random effects we are most lkely nterested n the overall mean and the varablty about that mean, so we would lkely want to place a confdence nterval on that mean.

9 NOV Desgn Identfcaton Page 9 Introducton Major topcs (a comprehensve outlne s provded elsewhere) Regresson : LR, Multple, urvlnear & Logstc Expermental Desgn : RD, RD, LD, plt-plot & Repeated Measures Treatment arrangements : ngle factor, Factoral, Nested ourse Objectves The objectves of the ntroductory course were to develop an understandng of elemental statstcs, the ablty to understand and apply basc statstcal procedures. We wll develop those concepts further, applyng the termnology and notaton from the basc methods courses to advanced technques for makng statstcal nferences. We wll cover the major methodologes of parametrc statstcs used for predcton and hypotheses testng (prmarly regresson and expermental desgn). Our emphass wll be on REOGNIZING analytcal problems and on beng able to do the statstcal analyss wth software. We wll see programs and output for vrtually all analyses covered ths semester. Daly Desgn I wll be placng a desgn descrpton on the Internet for each class. You should plan on examnng ths desgn before class. t the begnnng of each class I wll randomly determne whether we have a quz on that desgn or not. I do not ntend to spend much tme on ths daly actvty. If there s a quz, I wll allow 5 mnutes for you to answer and turn n quz. If not, I wll gve you the answers. quz wll consst of specfyng the dependent varable, expermental and samplng unts, treatments, blocks, random effects, etc. We wll address most of these n the desgn secton of the course (followng regresson). However, some wll be covered n the daly desgn. Notes on Exams I usually schedule a revew sesson late on Tuesday for the Thursday exams and unday for Tuesday exams. Revew sesson s entrely voluntary, and you may leave anytme. I wll have not plan on coverng materal. I plan only to answer questons. There wll be no revew for the fnal exam. On the exam you wll be allowed to brng a calculator I do not expect to have many calculatons on the exam, but there may be some. For example, calculatng a t-test for a slope for an hypotheszed value other than zero (thought ths may be n the output, always check frst). lso confdence ntervals on slopes and treatment means. You may also brng an 8.5 by 11 nch sheet of paper wth equatons or whatever else you wsh to nclude. You may wrte on both sdes of that pece of paper. I wll provde you wth these on an exam. You wll need to understand MY t-tables. ee nterment for copes of these tables. ll exams, ncludng the fnal, wll be n our regular classroom(s).

10 NOV Desgn Identfcaton Page 10 mple Lnear Regresson (revew?) The objectve: Gven ponts plotted on two coordnates, Y and X, fnd the best lne to ft the data. Y - the dependent varable X - the ndependent varable The concept: Data conssts of pared observatons wth a presumed potental for the exstence of some underlyng relatonshp. We wsh to determne the nature of the relatonshp and quantfy t f t exsts. Note that we cannot prove that the relatonshp exsts by usng regresson (.e. we cannot prove cause and effect). Regresson can only show f a correlaton exsts, and provde an equaton for the relatonshp. Gven a data set consstng of pared, quanttatve varables, and recognzng that there s varaton n the data set, we wll defne, POPULTION MODEL (LR): Y 0 1X Ths s the model we wll ft. It s the equaton descrbng straght lne for a populaton and we want to estmate the parameters n the equaton. The populaton parameters to be estmated are for the underlyng model, yx. 0 1X, are: Termnology yx. = the true populaton mean of Y at each value of X 0 = the true value of the Y ntercept 1 = the true value of the slope, the change n Y per unt of X Dependent varable: varable to be predcted Y = dependent varable (all varaton occurs n Y) Independent varable: predctor or regressor varable X = ndependent varable (X s measured wthout error) Intercept: value of Y when X = 0, pont where the regresson lne passes through the Y axs. The unts on the ntercept are the same as the Y unts lope: the value of the change n Y for each unt ncrease n X. The unts on the slope are Y unts per X unt Devaton: dstance from an observed pont to the regresson lne, also called a resdual.

11 NOV Desgn Identfcaton Page 11 Least squares regresson lne: the lne that mnmzes the squared dstances from the lne to the ndvdual observatons. Regresson lne Y - the dependent varable Intercept Devatons X - the ndependent varable The regresson lne tself represents the mean of Y at each value of X ( yx. ). Regresson calculatons ll calculatons for smple lnear regresson start wth the same values. These are, n n n n n,,,,, X X Y Y X Y n alculatons for smple lnear regresson are frst adjusted for the mean. These are called corrected values. They are corrected for the MEN by subtractng a correcton factor. s a result, all smple lnear regressons are adjusted for the mean of X and Y and pass through the pont Y, X. Y Y, X X The orgnal sums and sums of squares of Y are dstances and squared dstances from zero. These are referred to as uncorrected meanng unadjusted for the mean. Y 0 X

12 NOV Desgn Identfcaton Page 1 The corrected devatons sum to zero (half negatve and half postve) and the sums of the squares are squared dstances from the mean of Y. Y Y X Once the means, obtaned, the calculatons for the parameter estmates are: lope = b XY 1 X Y and corrected sums of squares and cross products,, Intercept = b0 Y b1x We have ftted the sample equaton are YY XY Y b0 b1x e, whch estmates the populaton parameters of the model, Y 0 1X Varance estmates for regresson fter the regresson lne s ftted, varance calculatons are based on the devatons from the regresson. From the regresson model Y b0 b1x e we derve the formula for the devatons e Y b b X or e = Y-Y ˆ. 0 1 Y X s wth other calculatons of varance, we calculate a sum of squares (corrected for the mean). Ths s smplfed by the fact that the devatons, or resduals, already have a mean of zero, n e 1 Resduals = = Error. The degrees of freedom (d.f.) for the varance calculaton s n, snce two parameters are estmated pror to the varance ( 0 and 1 ). The varance estmate s called the ME (Mean square error). It s the Error dvded by the d.f., ME E n. The varances for the two parameter estmates and the predcted values are all dfferent, but all are based on the ME, and all have n d.f. (t-tests) or n d.f. for the denomnator (F tests). Varance of the slope = ME

13 NOV Desgn Identfcaton Page 13 Varance of the ntercept = 1 X ME n Varance of a predcted value at X = ME n X X 1 ny of these varances can be used for a t-test of an estmate aganst an hypotheszed value for the approprate parameter (.e. slope, ntercept or predcted value respectvely). NOV table for regresson common representaton of regresson results s an NOV table. Gven the Error (sum of squared devatons from the regresson), and the ntal total sum of squares ( YY ), the sum of squares of Y adjusted for the mean, we can construct an NOV table mple Lnear Regresson NOV table d.f. um of quares Mean quare F Regresson 1 Regresson MReg Error n Error MError Total n 1 YY = Total MReg / MError In the NOV table The Regresson and Error sum to the Total, so gven the total ( YY ) and one of the two terms, we can get the other. The easest to calculate frst s usually the Regresson snce we usually already have the necessary ntermedate values. Regresson = XY The Regresson s a measure of the mprovement n the ft due to the regresson lne. The devatons start at YY and are reduced to Error. The dfference s the mprovement, and s equal to the Regresson. Ths gves another statstc called the R. What porton of the Total ( YY ) s accounted for by the regresson? R = Regresson / Total The degrees of freedom n the NOV table are, n 1 for the total, one lost for the correcton for the mean (whch also fts the ntercept) n for the error, snce two parameters are estmated to get the regresson lne. 1 d.f. for the regresson, whch s the d.f. for the slope. tatstcs quote: He uses statstcs as a drunken man uses lampposts -- for support rather than for llumnaton. ndrew Lang ( ), cottsh poet, folklorst, bographer, translator, novelst, and scholar

14 NOV Desgn Identfcaton Page 14 The F test s constructed by calculatng the MRegresson / MError. Ths F test has 1 n the numerator and (n ) d.f. n the denomnator. Ths s exactly the same test as the t-test of the slope aganst zero. To test the slope aganst an hypotheszed value (say zero) usng the t-test wth n d.f., calculate b1 b1 Hypotheszed b1 0 t b ME 1 ssumptons for the Regresson We wll recognze 4 assumptons 1) Normalty We take the devatons from regresson and pool them all together nto one estmate of varance. ome of the tests we use requre the assumpton of normalty, so these devatons should be normally dstrbuted. Y X For each value of X there s a populaton of values for the varable Y (normally dstrbuted). ) Homogenety of varance When we pool these devatons (varances) we also assume that the varances are the same at each value of X. In some cases ths s not true, partcularly when the varance ncreases as X ncreases. 3) X s measured wthout error! nce varances are measured only vertcally, all varance s n Y, no provsons are made for varance n X. 4) Independence. Ths enters n several places. Frst, the observatons should be ndependent of each other (.e. the value of e should be ndependent of e j, for j). lso, n the equaton for the lne Y b0 b1x e we assume that the term e s ndependent of the rest of the model. We wll talk more of ths when we get to multple regresson. o the four assumptons are: Normalty Homogenety of varance Independence X measured wthout error These are explct assumptons, and we wll examne or test these assumptons when possble. There are also some other assumptons that I consder mplct. We wll not state these, but n some cases they can be tested. For example, There s order n the Unverse. Otherwse, what are you nvestgatng? haos? The underlyng fundamental relatonshp that I just ftted a straght lne to really s a straght lne. ometmes ths one can be examned statstcally.

15 NOV Desgn Identfcaton Page 15 haracterstcs of a Regresson Lne The lne wll pass through the pont Y, X (also the pont b 0, 0) The sum of devatons wll be zero ( e 0 ) 0 1 of the ponts from the regresson lne wll be a mnmum. Values on the lne can be descrbed by the equatonyˆ b0 b1x. The lne has some desrable propertes (f the assumptons are met) E b The sum of squared devatons (measured vertcally, e Y b bx o o E b E Y X Y X o. Therefore, the parameter estmates and predcted values are unbased estmates. Note that lnear regresson s consdered statstcally robust. That s, the tests of hypothess tend to gve good results f the assumptons are not volated to a great extent. rossproducts and correlaton rossproducts are used n a number of related calculatons (can be + or ). a crossproduct = YX um of crossproducts = YX orrected sum of crossproducts = XY ovarance = lope = XY XY Regresson = orrelaton = XY R =r = XY YY n 1 XY mple Lnear Regresson ummary YY = Regresson / Total ee mple lnear regresson notes from EXT7005 for addtonal nformaton, ncludng the dervaton of the equatons for the slope and ntercept. You are not responsble for these dervatons. Know the termnology, characterstcs and propertes of a regresson lne, the assumptons, and the components to the NOV table. You wll not be fttng regressons by hand, but I wll expect you to understand where the values on output come from and what they mean. Partcular emphass wll be placed on workng wth, and nterpretng, numercal regresson analyses. nalyses wll mostly be done wth.

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Topic- 11 The Analysis of Variance

Topic- 11 The Analysis of Variance Topc- 11 The Analyss of Varance Expermental Desgn The samplng plan or expermental desgn determnes the way that a sample s selected. In an observatonal study, the expermenter observes data that already

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

III. Econometric Methodology Regression Analysis

III. Econometric Methodology Regression Analysis Page Econ07 Appled Econometrcs Topc : An Overvew of Regresson Analyss (Studenmund, Chapter ) I. The Nature and Scope of Econometrcs. Lot s of defntons of econometrcs. Nobel Prze Commttee Paul Samuelson,

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION 014-015 MTH35/MH3510 Regresson Analyss December 014 TIME ALLOWED: HOURS INSTRUCTIONS TO CANDIDATES 1. Ths examnaton paper contans FOUR (4) questons

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting. The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University PHYS 45 Sprng semester 7 Lecture : Dealng wth Expermental Uncertantes Ron Refenberger Brck anotechnology Center Purdue Unversty Lecture Introductory Comments Expermental errors (really expermental uncertantes)

More information

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA Sngle classfcaton analyss of varance (ANOVA) When to use ANOVA ANOVA models and parttonng sums of squares ANOVA: hypothess testng ANOVA: assumptons A non-parametrc alternatve: Kruskal-Walls ANOVA Power

More information

Activity #13: Simple Linear Regression. actgpa.sav; beer.sav;

Activity #13: Simple Linear Regression. actgpa.sav; beer.sav; ctvty #3: Smple Lnear Regresson Resources: actgpa.sav; beer.sav; http://mathworld.wolfram.com/leastfttng.html In the last actvty, we learned how to quantfy the strength of the lnear relatonshp between

More information

The SAS program I used to obtain the analyses for my answers is given below.

The SAS program I used to obtain the analyses for my answers is given below. Homework 1 Answer sheet Page 1 The SAS program I used to obtan the analyses for my answers s gven below. dm'log;clear;output;clear'; *************************************************************; *** EXST7034

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov,

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Chapter 11 Analysis of Variance - ANOVA. Instructor: Ivo Dinov, UCLA STAT 3 ntroducton to Statstcal Methods for the Lfe and Health Scences nstructor: vo Dnov, Asst. Prof. of Statstcs and Neurology Chapter Analyss of Varance - ANOVA Teachng Assstants: Fred Phoa, Anwer

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

Correlation and Regression

Correlation and Regression Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Midterm Examination. Regression and Forecasting Models

Midterm Examination. Regression and Forecasting Models IOMS Department Regresson and Forecastng Models Professor Wllam Greene Phone: 22.998.0876 Offce: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Emal: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/regresson/outlne.htm

More information

Chapter 4: Regression With One Regressor

Chapter 4: Regression With One Regressor Chapter 4: Regresson Wth One Regressor Copyrght 2011 Pearson Addson-Wesley. All rghts reserved. 1-1 Outlne 1. Fttng a lne to data 2. The ordnary least squares (OLS) lne/regresson 3. Measures of ft 4. Populaton

More information

Gravitational Acceleration: A case of constant acceleration (approx. 2 hr.) (6/7/11)

Gravitational Acceleration: A case of constant acceleration (approx. 2 hr.) (6/7/11) Gravtatonal Acceleraton: A case of constant acceleraton (approx. hr.) (6/7/11) Introducton The gravtatonal force s one of the fundamental forces of nature. Under the nfluence of ths force all objects havng

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

experimenteel en correlationeel onderzoek

experimenteel en correlationeel onderzoek expermenteel en correlatoneel onderzoek lecture 6: one-way analyss of varance Leary. Introducton to Behavoral Research Methods. pages 246 271 (chapters 10 and 11): conceptual statstcs Moore, McCabe, and

More information

F statistic = s2 1 s 2 ( F for Fisher )

F statistic = s2 1 s 2 ( F for Fisher ) Stat 4 ANOVA Analyss of Varance /6/04 Comparng Two varances: F dstrbuton Typcal Data Sets One way analyss of varance : example Notaton for one way ANOVA Comparng Two varances: F dstrbuton We saw that the

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Cathy Walker March 5, 2010

Cathy Walker March 5, 2010 Cathy Walker March 5, 010 Part : Problem Set 1. What s the level of measurement for the followng varables? a) SAT scores b) Number of tests or quzzes n statstcal course c) Acres of land devoted to corn

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

Linear Feature Engineering 11

Linear Feature Engineering 11 Lnear Feature Engneerng 11 2 Least-Squares 2.1 Smple least-squares Consder the followng dataset. We have a bunch of nputs x and correspondng outputs y. The partcular values n ths dataset are x y 0.23 0.19

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law:

Introduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law: CE304, Sprng 2004 Lecture 4 Introducton to Vapor/Lqud Equlbrum, part 2 Raoult s Law: The smplest model that allows us do VLE calculatons s obtaned when we assume that the vapor phase s an deal gas, and

More information

A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS. Dr. Derald E. Wentzien, Wesley College, (302) ,

A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS. Dr. Derald E. Wentzien, Wesley College, (302) , A LINEAR PROGRAM TO COMPARE MULTIPLE GROSS CREDIT LOSS FORECASTS Dr. Derald E. Wentzen, Wesley College, (302) 736-2574, wentzde@wesley.edu ABSTRACT A lnear programmng model s developed and used to compare

More information

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH

Turbulence classification of load data by the frequency and severity of wind gusts. Oscar Moñux, DEWI GmbH Kevin Bleibler, DEWI GmbH Turbulence classfcaton of load data by the frequency and severty of wnd gusts Introducton Oscar Moñux, DEWI GmbH Kevn Blebler, DEWI GmbH Durng the wnd turbne developng process, one of the most mportant

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

Analytical Chemistry Calibration Curve Handout

Analytical Chemistry Calibration Curve Handout I. Quck-and Drty Excel Tutoral Analytcal Chemstry Calbraton Curve Handout For those of you wth lttle experence wth Excel, I ve provded some key technques that should help you use the program both for problem

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Statistical tables are provided Two Hours UNIVERSITY OF MANCHESTER. Date: Wednesday 4 th June 2008 Time: 1400 to 1600

Statistical tables are provided Two Hours UNIVERSITY OF MANCHESTER. Date: Wednesday 4 th June 2008 Time: 1400 to 1600 Statstcal tables are provded Two Hours UNIVERSITY OF MNCHESTER Medcal Statstcs Date: Wednesday 4 th June 008 Tme: 1400 to 1600 MT3807 Electronc calculators may be used provded that they conform to Unversty

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Laboratory 3: Method of Least Squares

Laboratory 3: Method of Least Squares Laboratory 3: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly they are correlated wth

More information

FREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced,

FREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced, FREQUENCY DISTRIBUTIONS Page 1 of 6 I. Introducton 1. The dea of a frequency dstrbuton for sets of observatons wll be ntroduced, together wth some of the mechancs for constructng dstrbutons of data. Then

More information