Definitions of terms and examples. Experimental Design. Sampling versus experiments. For each experimental unit, measures of the variables of

Size: px
Start display at page:

Download "Definitions of terms and examples. Experimental Design. Sampling versus experiments. For each experimental unit, measures of the variables of"

Transcription

1 Experimetal Desig Samplig versus experimets similar to samplig ad ivetor desig i that iformatio about forest variables is gathered ad aalzed experimets presuppose itervetio through applig a treatmet (a actio or absece of a actio) to a uit, called the experimetal uit. The experimetal uit is a item o which the treatmet is applied. Defiitios of terms ad examples For each experimetal uit, measures of the variables of iterest (i.e., respose or depedet variables) are used to idicate treatmet impacts. Treatmets are radoml assiged to the experimetal uits. Replicatio is the observatio of two or more experimetal uits uder idetical experimetal coditios. A factor is a groupig of related treatmets. The goal is to obtai results that idicate cause ad effect.

2 Examples:.,000 seedligs i a field. Half of the seedligs get a tea bag of utriets, others do ot, radoml assiged. Experimetal uit: the seedlig. Treatmets are: o tea bag, ad tea bag. Factor: ol oe fertilizer (oe, tea bag) Replicatios: 500 seedligs get each treatmet. 300 plat pots i a greehouse: Each pot gets either ) stadard geetic stock; ) geetic stock from aother locatio; 3) improved geetic stock. Treatmets: the three tpes of geetic stock Experimetal Uit: The pot Factor(s): Geetic Stock (oe factor ol) Replicatios: 300 pots /3 treatmets 00 pots /treatmet 3. The umber of tailed frogs i differet forest tpes is of iterest. There are six areas. Three are cut ad the other three are ot cut. Treatmets: cut, ucut Experimetal Uit: each of the six areas Factor(s): ol oe, cuttig with two levels Replicatios: six areas/ two cuttig levels 3 replicates per treatmet. 4. Two forest tpes are idetified, Coastal wester hemlock ad iterior Douglas fir. For each, a umber of samples are located, ad the growth of each tree i each sample is measured. Treatmets: NOT AN EXPERIMENT!! Experimetal Uit: Factor(s): Replicatios: 3 4

3 What does it mea that treatmets are radoml assiged to experimetal uits? Haphazard vs. radom allocatio Practical problems ad implicatios Other terms: The ull hpothesis is that there are o differeces amog the treatmet meas. For more tha oe factor, there is more tha oe hpothesis The sum of squared differeces (termed, sum of squares) betwee the average for the respose variable b treatmet versus the average over all experimetal uits represets the variatio attributed to a factor. The degrees of freedom, associated with a factor, are the umber of treatmet levels withi the factor mius oe. Example of hpotheses: Factor A, fertilizer: oe, medium, heav (3 levels) Factor B, species: spruce, pie ( levels) Number of possible treatmets: 6 e..g, spruce, oe is oe treatmet. Experimetal Uit: 0.00 ha plots Replicates plaed: per treatmet (cost costrait). How ma experimetal uits do we eed? Variable of iterest: Average 5-ear height growth for trees i the plot Null hpotheses: There is o differet betwee the 6 treatmets. This ca be broke ito: ) There is o iteractio betwee species ad fertilizer. ) There is o differece betwee species. 3) There is o differece betwee fertilizers. 5 6

4 Experimetal error is the measure of variace due to chace causes, amog experimetal uits that received the same treatmet. The degrees of freedom for the experimetal error relate to the umber of experimetal uits ad the umber of treatmet levels. The impacts of treatmets o the respose variables will be detectable ol if the impacts are measurabl larger tha the variace due to chace causes. To reduce the variabilit due to causes other tha those Radom allocatio of a treatmet to a experimetal uit helps isure that the measured results are due to the treatmet, ad ot to aother cause. Example: if we have applied the o fertilizer treatmet to experimetal uits o orth facig sites, whereas moderate ad heav fertilizer treatmets are applied ol to south facig sites, we would ot kow if differeces i average height growth were due to the applicatio of fertilizatio, the orietatio of the sites, or both. The results would be cofouded ad ver difficult to iterpret. maipulated b the experimeter, relativel homogeous experimetal uits are carefull selected. 7 8

5 Variatios i experimetal desig Itroductio of More Tha Oe Factor: Iterested i the iteractio amog factors, ad the effect of each factor. A treatmet represets a particular combiatio of levels from each of the factors. Whe all factor levels of oe factor are give for all levels of each of the other factors, this is a crossed experimet. Example: two species ad three fertilizatio levels six treatmets usig a crossed experimet. Fixed, Radom, or Mixed Effects: Fixed factors: the experimeter would like to kow the chage that is due to the particular treatmets applied; ol iterested i the treatmet levels that are i the experimet (e.g., differece i growth betwee two particular geetic stocks) [fixed effects] Radom factors: the variace due to the factor is of iterest, ot particular levels (e.g., variace due to differet geetic stocks radoml select differet stock to use as the treatmet) [radom effects] Mixture of factor tpes: Commol, experimets i forestr iclude a mixture of factors, some radom ad some fixed [mixed effect]. 9 0

6 Restricted Radomizatio Through Blockig: Radomized Block (RCB), Lati Square, ad Icomplete Blocks Desigs: Radomize treatmets with blocks of experimetal uits Reduces the variace b takig awa variace due to the item used i blockig (e.g., high, medium ad low site productivit Results i more homogeeous experimetal uits withi each block. Restricted Radomizatio Through Splittig Experimetal Uits: Called split plot A experimetal uit is split. Aother factor is radoml applied to the split. Example: The factor fertilizer is applied to 0.00 ha plots. Each of the 0.00 ha plot is the split ito two, ad two differet species are plated i each. Fertilizer is applied to the whole plot, ad species is applied to the split plot. Species is therefore radoml assiged to the split plot, ot to the whole experimetal uit.

7 Nestig of Factors Treatmet levels for oe factor ma be particular to the level of aother factor, resultig i estig of treatmets. Example, for the first level of fertilizer, we might use medium ad heav thiig, whereas, for the secod level of fertilizer, we might use o thiig ad light thiig. Hierarchical Desigs ad Sub-Samplig: Commol i forestr experimets, the experimetal uit represets a group of items that we measure. E.g. several pots i a greehouse, each with several plats germiatig from seeds. Treatmets are radoml assiged to the larger uit (e.g, to each plot ot to each seedlig). The experimetal uit is the larger sized uit. Ma wat variace due to the experimetal uit (pots i the example) ad to uits withi (plats i the example). These are ) ested i the treatmet; ) radom effects; ad 3) hierarchical A commo variatio o hierarchical desigs is measurig a sample of items, istead of measurig all items i a experimetal uit. 3 4

8 Itroductio of Covariates The iitial coditios for a experimet ma ot be the same for all experimetal uits, eve if blockig is used to group the uits. Site measures such as soil moisture ad temperature, ad startig coditios for idividuals such as startig height, are the measured (called covariates) alog with the respose variable These covariates are used to reduce the experimetal error. Covariates are usuall iterval or ratio scale (cotiuous). Desigs i use The most simple desig is oe fixed-effects factor, with radom allocatio of treatmets to each experimetal uit, with o ) blockig; ) sub-samplig; 4) splits; or 5) covariates Most desigs use combiatios of the differet variatios. For example, oe fixed-effects factor, oe mixed-effects factor, blocked ito three sites, with trees measured withi plots withi experimetal uits (sub-samplig/hierarchical), ad measures take at the begiig of the experimet are used as covariates (e.g., iitial heights of trees. 5 6

9 Wh? Wat to look at iteractios amog factors ad/or is cheaper to use more tha oe factor i oe experimet tha do two experimets. Experimets ad measuremets are expesive use samplig withi experimetal uits to reduce costs Fidig homogeeous uits is quite difficult: blockig is eeded Mai questios i experimets Do the treatmets affect the variable of iterest? For fixed effects: Is there a differet betwee the treatmet meas of the variable of iterest? Which meas differ? What are the meas b treatmet ad cofidece itervals o these meas? For radom effects: Do the treatmets accout for some of the variace of the variables of iterest? How much? BUT ca ed up with problems: some elemets are ot measured, radom allocatio is ot possible, or measures are correlated i time ad/or space. I this course, start with the simple desigs ad add complexit. 7 8

10 Completel Radomized Desig (CRD) Homogeeous experimetal uits are located Treatmets are radoml assiged to experimetal uits No blockig is used We measure a variable of iterest for each experimetal uit CRD: Oe Factor Experimet, Fixed Effects Mai questios of iterest Are the treatmet meas differet? Which meas are differet? What are the estimated meas ad cofidece itervals for these Notatio: Populatio: i μ + τ + ε i OR i μ + ε i i respose variable measured o experimetal uit i ad treatmet to J treatmets μ the grad or overall mea regardless of treatmet μ the mea of all measures possible for treatmet τ the differece betwee the overall mea of all measures possible from all treatmets ad the mea of all possible measures for treatmet, called the treatmet effect ε i the differece betwee a particular measure for a experimetal uit i, ad the mea for the treatmet that was applied to it ε i i μ estimates? 9 0

11 For the experimet: i + τˆ + ei OR i + ei the grad or overall mea of all measures from the experimet regardless of treatmet; uder the assumptios for the error terms, this will be a ubiased estimate of μ the mea of all measures for treatmet ; uder the assumptios for the error terms, this will be a ubiased estimate of μ τˆ the differece betwee the mea of experimet measures for treatmet ad the overall mea of measures from all treatmets; uder the error term assumptios, will be a ubiased estimate of τ e i the differece betwee a particular measure for a experimetal uit i, ad the mea for the treatmet that was applied to it e i i the umber of experimetal uits measured i treatmet Example: Fertilizatio Trial A forester would like to test whether differet site preparatio methods result i differece i heights. Twet five areas each 0.0 ha i size are laid our over a fairl homogeeous area. Five site preparatio treatmets are radoml applied to 5 plots. Oe hudred trees are plated (same geetic stock ad same age) i each area. At the ed of 5 ears, the heights of seedligs i each plot were measured, ad averaged for the plot. i a particular 0.0 ha area i treatmet, from to 5. Respose variable i : 5-ear height growth (oe average for each experimetal uit) Number of treatmets: J5 site preparatio methods T the umber of experimetal uits measured over all 5 treatmets experimetal uits measured each treatmet T the umber of experimetal uits measured over all treatmets J

12 Schematic of Laout: Data Orgaizatio ad Prelimiar Calculatios For eas calculatios b had, the data could be orgaized i a spreadsheet as: Obs: Treatmet, to J i to 3 J 3 J 3 J J 3 J Sum J.. Averages 3 J J i i i i i T NO Example: J 5 site preparatio treatmets radoml applied to 5 plots. Respose Variable: Plot average seedlig height after 5 ears Plot Average Heights (m) Treatmets Overall Observatio SUMS Meas Example Calculatios: 5 5 i ( ) / 5 i 4. i 5 5 i ( ) / 5 96./ TE: ma ot be the same umber of observatios for each treatmet. 3 4

13 We the calculate: ) Sum of squared differeces betwee the observed values ad the overall mea (SS): SS J J ( i ) df i Also called, sum of squares total (same as i regressio) Alterative formulae for the sums of squares that ma be easier to calculate are: SS SS TR J J i i SSE SS SS TR T T ) Sum of squared differeces betwee the treatmet meas, ad the grad mea, weighted b the umber of experimetal uits i each treatmet (SS TR ) SS TR J J ( ) ( ) df J i 3) Sum of squared differeces betwee the observed values for each experimetal uit ad the treatmet meas (SSE) SSE J ( i ) i df T SS SS + TR SSE J 5 6

14 For the example, differeces from treatmet meas (m): Treatmets Overall Obs SUMS Sum of Squares Error s Differeces from grad mea (m) Treatmets Overall Obs SUMS Sum of Squares Total Example Calculatios: SSE for treatmet s 5 i ( i ) (4.6 4.) + (4.3 4.) + (3.7 4.) + (4.0 4.) + (4.0 4.) SSE for treatmet SS J ( i ) i SS for treatmet + SS for treatmet SS 6.6 for treatmet 5 SSE J ( i ) i SSE for treatmet + SSE for treatmet SSE for treatmet

15 Differece betwee treatmet meas ad grad mea (m) Treatmets Overall Mea Differece Sum of Squares Treatmet Example Calculatios: SS TR + J ( ) ( 5 ( ) ) + ( 5 ( ) ) ( 5 ( ) ) + ( 5 ( ) ) + ( 5 ( ) ) 3.86 Test for differeces amog treatmet meas The first mai questio is: Are the treatmet meas differet? H 0 : μ μ μ J H : ot all the same OR: H 0 : τ τ L τ J 0 H : ot all equal to 0 OR: H 0 : (φ TR+ σ ε) /σ ε H : (φ TR+ σ ε)/σ ε > Where σ ε is the variace of the error terms; φ TR is the effect of the fixed treatmets (see page 34 for more details o what this is). If the treatmet does ot accout for a of the variace i the respose variable, the treatmet effects are likel all 0, ad all the treatmet meas are likel all the same. 9 30

16 Usig a aalsis of variace table: Source df SS MS F p-value Treatmet J- SS TR MS TR F Prob F> SS TR /(J-) MS TR /MSE F (J-),( T -J), Error T -J SSE MSE SSE/( T -J) Total T - SS (- α) For example, if we have 4 treatmets, ad experimetal uits, ad we wat α0.05: F (3,8; 0.95)4.07 Reectio Regio SS F SSE / /( J ) SSTR /( J ) J SSE /( T J ) ( ) TR MSTR MSE Uder H 0, ad the assumptios of aalsis of variace, this follows a F-distributio. If F > F( J, T J, α ) We reect H 0 ad coclude that there is a differece betwee the treatmet meas. Notice that this is a oe-sided test, usig -α If the calculated F is larger tha 4.07, we reect H 0 : The treatmets meas are likel differet, uless a 5% error has occurred. OR: We take our calculated F value from our experimet ad plot it o this F curve. The, fid the area to the right of this value (p-value). We reect a hpothesis if the probabilit value (p-value) for the test is less tha the specified sigificace level. This is because we are testig if the ratio of variaces is >. 3 3

17 For the example: If assumptios of ANOVA are met the iterpret the F-value. H 0 : μ μ μ 3 μ 4 μ 5 H : ot all equal Aalsis of Variace (ANOVA) Table: Source df SS MS F p-value Treatmet Error Total If assumptios of ANOVA are met the iterpret the F-value. NOTE: Fcritical for alpha0.05, df treatmet4 ad df error0 is.87. Sice the p-value is ver smaller (smaller tha alpha0.05), we reect H0 ad coclude that there is a differece i the treatmet Assumptios regardig the error term For the estimated meas for this experimet to be ubiased estimates of the meas i the populatio, ad the MSE to be a ubiased estimate of the variace withi each experimetal uit, the followig assumptios must be met:. Observatios are idepedet ot related i time or i space [idepedet data]. There is ormal distributio of the -values [or the error terms] aroud each treatmet mea [ormall distributed] 3. The variaces of the s aroud each treatmet mea [or the error terms] are the same (homogeeous) for all treatmet meas [equal variace] meas. BUT this is ol a good test if the assumptios of aalsis of variace have bee met. Need to check these first (as with regressio aalsis)

18 Similar to regressio: a ormal probabilit plot for the error terms ca be used to check the assumptio of ormalit, ad a residual plot ca be used to visuall check the assumptio of equal variace. OR, these ca be tested usig () ormalit tests (as with regressio); () Bartlett s test for equal variaces (for more tha oe factor or for other desigs with blockig, etc. this becomes difficult). Trasformatios to meet assumptios Similar to regressio: logarithmic trasformatios ca be used to equalize variaces arcsie trasformatio ca be used to trasform proportios ito ormall distributed variables rak trasformatio ca be used whe data are ot ormall distributed ad other trasformatios do ot work [oparametric aalsis of variace usig raks] Ulike regressio ou must trasform the -variable Process: do our aalsis with the measured respose variable if assumptios of the error term are ot met, trasform the -variable do the aalsis agai ad check the assumptios; if ot me, tr aother trasformatio ma have to switch to aother method: geeralized liear models, etc

19 Expected values: Uder the assumptios of aalsis of variace, MSE is a ubiased estimate of σ ε ad MS TR is a ubiased estimate of For the example, before iterpretig the ANOVA table, we must check assumptios of ANOVA: Is there equal variace across treatmets? (estimated b MSE as 0.49 o our ANOVA table). Usig a residual plot ad EXCEL: Residual Plot φ TR+ σ ε. Therefore, this F-test will give the correct probabilities uder the assumptios. This is the same as saig that the expected value of MSE is Residuals (Error i m) Pred. Values (Treat. Meas i m) σ ε, ad the expected value of MS TR is φ TR+ σ ε. The F-test is the a measure of how much larger the value is whe the treatmet meas are accouted for

20 Are residuals ormall distributed? Agai usig EXCEL: Cumulative probabilit Residuals vs. ormal z(0,) z-values Where stadardized residuals are calculated b: e (stadardized) i ei 0 MSE Stad. Res z(0,) Compare these to z-values for a stadard ormal distributio with a mea of zero ad a variace of (z(0,)) Differeces amog particular treatmet meas If there are differeces amog meas detected, which meas differ? Ca use: Orthogoal cotrasts see textbook Multiple comparisos Multiple comparisos (or cotrasts): Ma differet tpes, e.g. o T-test for ever pair of meas; must adust the alpha level used b dividig b the umber of pairs. o Scheffe s multiple comparisos o Boferoi s adustmets Tr to preserve the alpha level used to test all the meas together (the F-test) For the example, give that there is a differece amog treatmet meas, which pairs of meas differ? t-test for pairs of meas: determie the umber of pairs possible 5 5! 3!! 0 possible pairs of meas 39 40

21 Comparig Treatmets (largest estimated mea) versus 4 (smallest estimated mea): H : μ μ 0 OR H : μ μ 0 H : μ μ Cofidece limits for treatmet meas Uder the assumptios, cofidece itervals for each treatmet mea ca be obtaied b: t ( 4 ) 0 MSE + 4 ± t( T J ), α MSE Sice MSE estimates the variace that is assumed to be equal, t Uder H 0 : This follows: t α /, T J ( ) Usig alpha0.005 (0.05/00.005), for 5 treatmets ad 5 observatios, the t-value is Result? Aother wa to assess this is to obtai the p-value for t3.686, with 0 degrees of freedom (5-5). This is Sice this is less tha 0.005, we reect H 0 ad coclude that these two meas differ. ad the observatios are ormall distributio ad idepedet. For the example: ± t ( T ), α / MSE All For treatmet : MSE are all the same sice (3.74,4.46) 0.73 t 0, are all equal 4.± ±

22 Usig SAS ad R: For etr ito statistical programs like SAS ad R, the data should be orgaized as: Treatmet Obs: Respose to J i to 3 3 () 3 3 Y () J J J J J 3 3J J J (J) 3 For the example, we ca put the data ito a EXCEL file: Treatmet Observatio AveHt

23 Power of the Test: A Tpe I error rate (α, sigificace level), the chace of reectig a ull hpothesis whe it is true (ou reect whe the If the differece betwee populatio meas (real treatmet meas) is ver large, tha a small umber of experimetal uits will result i reectio of the ull hpothesis. meas are actuall the same) must be selected. Give: a particular umber of experimetal uits sizes of the differeces betwee true populatio meas, ad If the umber of experimetal uits is ver large, the eve a small differece betwee populatio meas will be detected. variatio withi the experimetal uits this will set the Tpe II error rate (β), the chace of acceptig a ull hpothesis whe it is false (ou fail to reect whe the meas are actuall differet) The power of the test is - β, the probabilit ou will reect If the variatio withi experimetal uits is ver small, the the differece will be detected, eve with a small differece betwee populatio meas, ad eve with ol a few treatmet uits. the ull hpothesis ad coclude that there is a differece i meas, whe there IS a differece betwee populatio meas

24 Statistical Sigificace is ot the same as differeces of Practical importace! UNLESS ou: have some idea of withi experimetal uit variatio from a previous stud with the same coditios (e.g., MSE from a previous stud) kow the size of the differece that ou wish to detect have selected the α level Methods based o maximum likelihood rather tha least squares ML methods ca be used whe: Treatmets are radom rather tha fixed (more o this later) Trasformatios do ot result i assumptios beig met Your depedet variable is a cout, or it is a biar variable (e.g., es or o; dead or alive; preset or abset) The: You ca calculate the umber of experimetal uits per treatmet that will result i reectio of H 0 : whe the differeces are that large or greater. Alterativel: You ca calculate the power of the test for a experimet ou have alread completed. [see examples i course materials for FRST 430/533] 47 48

25 CRD: Two Factor Factorial Experimet, Fixed Effects Itroductio Treatmets ca be combiatios of more tha oe factor For -factor experimet, have several levels of Factor A ad of Factor B All levels of Factor A occur for Factor B ad vice versa (called a Factorial Experimet, or crossed treatmets) Example: Schematic ad Measured Respose for the Example: AB0 A3B5 A3B435 AB3 AB4 AB34 AB44 AB AB5 AB48 A3B33 A3B5 A3B7 AB43 A3B39 A3B6 AB37 AB A3B435 AB3 AB4 AB AB34 A3B330 AB39 AB8 AB430 A3B33 AB33 AB4 A3B AB49 A3B3 AB8 AB5 A3B3 AB5 A3B437 AB9 A3B4 A3B436 AB48 AB37 AB8 AB0 AB8 AB36 AB38 Factor A, (three levels of fertilizatio: A, A, ad A3) Factor B (four species: B, B, B3 ad B4) Crossed: treatmets Four replicatios per treatmet for a total of 48 experimetal uits AB0 idicates that the respose variable was 0 for this experimetal uit that received Factor A, level ad Factor B, level. Treatmets radoml assiged to the 48 experimetal uits. Measured Resposes: height growth i mm 49 50

26 Orgaizatio of data for aalsis usig a statistics package: A B result Mai questios. Is there a iteractio betwee Factor A ad Factor B (fertilizer ad species i the example)? Or do the meas b Factor A remai the same regardless of Factor B ad vice versa?. If there is o iteractio, is there a differece a. Betwee Factor A meas? b. Betwee Factor B meas? 3. If there are differeces: a. If there is a iteractios, which treatmet meas differ? b. If there is o iteractio, the which levels of Factor A meas differ? Factor B meas? 5 5

27 Notatio, Assumptios, ad Trasformatios Models Populatio: ik μ + τ + τ + τ + ε A Bk AB k ik respose variable measured o experimetal uit i ad factor A level, factor B level k to J levels for Factor A; k to K levels for Factor B μ the grad or overall mea regardless of treatmet τ A the treatmet effect for Factor A, level τ Bk the treatmet effect for Factor B, level k τ ABk ε ik the iteractio for Factor A, level ad Factor B, level k the differece betwee a particular measure for a experimetal uit i, ad the mea for a treatmet: ε ik ik μ + τ A + τ Bk + τ AB ) ik ( i For the experimet: + ˆ τ + ˆ τ + ˆ + e ik A Bk τ AB k the grad or overall mea of all measures from the experimet regardless of treatmet; uder the assumptios for the error terms, this will be a ubiased estimate of μ k the mea of all measures from the experimet for a particular treatmet k the mea of all measures from the experimet for a particular level of Factor A (icludes all data for all levels of Factor B) k the mea of all measures from the experimet for a particular level k of Factor B (icludes all data for all levels of Factor A) ˆ τ A, ˆ τ ˆ Bk, τ ABk uder the error term assumptios, will be ubiased estimates of correspodig treatmet effects for the populatio e ik the differece betwee a particular measure for a experimetal uit i, ad the mea for the treatmet k that was applied to it e ik ik k k the umber of experimetal uits measured i treatmet k T the umber of experimetal uits measured over all K J treatmets k k ik 53 54

28 Meas for the example: Factor A: 6 observatios per level A6.5, A3.38, A38.75 Factor B: observatios per level B7.08, B0.83, B34.7, B49.08 Treatmets (A X B): 4 observatios per treatmet Sums of Squares: SS SSTR + SSE as with CRD: Oe Factor. BUT SSTR is ow divided ito: SS TR SSA + SSB + SSAB SS: The sum of squared differeces betwee the observatios ad the grad mea: SS K J k ( ik ) df T k i SSA: Sum of squared differeces betwee the level meas for factor A ad the grad mea, weighted b the umber of experimetal uits for each treatmet: SSA K J k k ( ) df J 55 56

29 SSB: Sum of squared differeces betwee the level meas for factor B ad the grad mea, weighted b the umber of experimetal uits for each treatmet: SSB K J k k ( ) df k K SSAB: Sum of squared differeces betwee treatmet meas for SSE: Sum of squared differeces betwee the observed values for each experimetal uit ad the treatmet meas: SSE K J k ( ik k ) k i Alterative computatioal formulae: df T JK k ad the grad mea, mius the factor level differeces, all weighted b the umber of experimetal uits for each treatmet: SSAB K J k k (( )) Si k ) ( k ) ( ce some of the estimated grad meas cacel out we obtai: SS SS TR K k i K J J k TR k SSAB SS k ik k T T SSA SSB SSA SSB [See Excel Spreadsheet for the Example] K k K J J k k k SSE SS SS k TR T T SSAB K J k k ( k k + ) df ( J )( K ) 57 58

30 Assumptios ad Trasformatios: Assumptios regardig the error term Must meet assumptios to obtai ubiased estimates of populatio meas, ad a ubiased estimate of the variace of the error term (same as CRD: Oe Factor) o idepedet observatios (ot time or space related) o ormalit of the errors, o equal variace for each treatmet. Use residual plot ad a plot of the stadardized errors agaist the expected errors for a ormal distributio to check these assumptios. Trasformatios: As with CRD: Oe Factor, ou must trasform the -variable Test for Iteractios ad Mai Effects The first mai questio is: Is there a iteractio betwee the two factors? H 0 : No iteractio H : Iteractio OR: H 0 : (φ AB+ σ ε) /σ ε H : (φ AB+ σ ε)/σ ε > Where σ ε is the variace of the error terms; φ AB is the iteractio effect of the fixed treatmets. Process: do our aalsis with the measured respose variable if assumptios of the error term are ot met, trasform the -variable do the aalsis agai ad check the assumptios; if ot me, tr aother trasformatio ma have to switch to aother method: geeralized liear models, etc

31 Usig a aalsis of variace table: Source df SS MS F p-value A J- SSA MSA SSA/(J-) F MSA/MSE Prob F> F (J-),(dfE), - α B K- SSB MSB SSB/(K-) A X B (J-)(K-) SSAB MSAB SSAB/ (J-)(K-) Error T -JK SSE MSE SSE/( T -J) Total T - SS F MSB/MSE F MSAB/MSE Source df MS E[MS] A J- MSA σ ε + φ A B K- MSB A X B (J-)(K-) MSAB Error T -JK MSE σ Total T - σ ε + φ B σ ε + φ AB ε Prob F> F (K-),(dfE),- α Prob F> F dfab,dfe,,- α See Neter et al., page 86, Table 9.8 for details o expected mea squares; φ is used here to represet fixed effects. For the iteractios: SSAB /( J )( K ) F SSE /( JK) T MSAB MSE Uder H 0, this follows F df,df, - α where df is from the umerator (J-)(K-), ad df is from the deomiator ( T - JK) If the F calculated is greater tha the tabular F, or if the p- value for F calculated is less tha α, reect H 0. o The meas of Factor A are iflueced b the levels of Factor B ad the two factors caot be iterpreted separatel. o Graph the meas of all treatmets o Coduct multiple comparisos all treatmets (rather the o meas of each Factor, separatel o Not as much power (reect H 0 whe it is false), if this occurs. 6 6

32 If there are o iteractios betwee the factors, we ca look at each factor separatel fewer meas, less complicated. Factor A: H 0 : μ μ μ J OR: H 0 : (φ A+ σ ε))/σ ε H : (φ A+ σ ε)/σ ε > Where σ ε is the variace of the error terms; φ A is fixed effect for Factor A. From the ANOVA table: SSA/( J ) F SSE /( JK) T MSA MSE Uder H 0, this follows F df,df, - α where df is from the umerator (J-) ad df is from the deomiator ( T -JK) If the F calculated is greater tha the tabular F, or if the p- value for F calculated is less tha α, reect H 0. o The meas of Factor A i the populatio are likel ot all the same o Graph the meas of Factor A levels o Coduct multiple comparisos betwee meas for the J levels of Factor A, separatel The aalsis ad coclusios would follow the same patter for Factor B

33 Aalsis of Variace Table Results for the Example Source Degrees of Freedom Sum of Squares Mea Squares F p A <0.000 B <0.000 A X B Error Total If assumptios met, (residuals are idepedet, are ormall distributed, ad have equal variaces amog treatmets), we ca iterpret the results. Iterpretatio usig α 0.05: No sigificat iteractio (p0.0539); we ca examie species ad fertilizer effects separatel. Are sigificat differeces betwee the three fertilizer levels of Factor A (p<0.000), ad betwee the four species of Factor B (p<0.000). The mea values based o these data are: A6.5, A3.38, A38.75 B7.08, B0.83, B34.7, B49.08 Did ot have to calculate these for each of the treatmets sice there is o iteractio

34 Further aalses, for each Factor separatel: Scheffé s test for multiple comparisos, could the be used to compare ad cotrast Factor level meas. o The umber of observatios i each factor level are: 6 for Factor A, ad for Factor B o Use the MSE for both Factor A ad for Factor B (deomiator of their F-tests) Factor A: t-tests for pairs of meas Determie the umber of pairs possible 3 3!!! 3 possible pairs of meas Use a sigificace level of 0.05/3 pairs0.07 for each t-test Comparig Factor Levels ad : A vs. A H0 : μ μ 0 H : μ μ 0 t-tests for each pair of meas could be used istead. o Agai, use MSE, ad 6 observatios for Factor A versus for Factor B t MSE ( K k k ) 0 + K k 4 k o Must split alpha level used i the F-tests b the umber of pairs t ( )

35 Critical t value from a probabilit table for: df(error) 36 based o ( T JK), ad 0.07 sigificace level (For α 0.05 use 0.05/3 pairs for each t-test), -sided test Usig a EXCEL fuctio: tiv(0.07,36), returs the value of.50 (this assumes a -sided test). Sice the absolute value of the calculated t is greater tha.50 we reect H0. OR eter our t-value, df (error), ad (for -sided) ito the EXCEL fuctio tdist(8.58,36,) Returs a p-value of < (NOTE that ou must eter the positive value, ad the p-value is for the two eds (area greater tha 8.58 plus area less tha -8.58) Sice p<0.07, we reect H0 A Differet Iterpretatio usig α 0.0: There is a sigificat iteractio (p0.0539) usig α 0.0; caot iterpret mai effects (A ad B) separatel. The mea values based o these data are: [Excel] AB0.5 AB4.5 AB AB4.75 AB8.00 AB.50 AB3 4.5 AB48.75 A3B 3.00 A3B5.75 A3B A3B mea values as there is a sigificat iteractio The mea of treatmet A differs from the mea of A. For Factor B Recalculate the umber of possible pairs for 4 factor levels (will be 6 pairs; divide alpha b this for each test ) The observatios per factor level is, rather tha 6 Df(error) ad MSE are the same as for Factor A

36 Further aalses: Cofidece limits for factor level ad treatmet meas Scheffé s test for multiple comparisos (or others) could the be used to compare ad cotrast treatmet meas (pairs or other groupigs of meas). The umber of observatios i Treatmet meas: k ± t( JK ), α MSE k each treatmet are 4 [lower power tha if there was o iteractio], ad use the MSE. Usig t-tests for pairs of meas, the umber of observatios are 4 for each k treatmet, use the MSE, ad recalculate the umber of possible pairs out of treatmets (will be 66 pairs! Retaiig α 0.0, we would use 0.0/ for each t-test ) Factor A meas: Factor B meas: t ± ( JK ), α K t MSE k k ± ( JK ), α J k MSE k [see course materials for FRST 430/533 for other desigs] 7 7

Definitions of terms and examples. Experimental Design. Sampling versus experiments. For each experimental unit, measures of the variables of

Definitions of terms and examples. Experimental Design. Sampling versus experiments. For each experimental unit, measures of the variables of Experimental Design Sampling versus experiments similar to sampling and inventor design in that information about forest variables is gathered and analzed experiments presuppose intervention through appling

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D. ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is: PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,

More information

Linear Regression Models

Linear Regression Models Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

Regression, Inference, and Model Building

Regression, Inference, and Model Building Regressio, Iferece, ad Model Buildig Scatter Plots ad Correlatio Correlatio coefficiet, r -1 r 1 If r is positive, the the scatter plot has a positive slope ad variables are said to have a positive relatioship

More information

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators. IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

y ij = µ + α i + ɛ ij,

y ij = µ + α i + ɛ ij, STAT 4 ANOVA -Cotrasts ad Multiple Comparisos /3/04 Plaed comparisos vs uplaed comparisos Cotrasts Cofidece Itervals Multiple Comparisos: HSD Remark Alterate form of Model I y ij = µ + α i + ɛ ij, a i

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Describing the Relation between Two Variables

Describing the Relation between Two Variables Copyright 010 Pearso Educatio, Ic. Tables ad Formulas for Sulliva, Statistics: Iformed Decisios Usig Data 010 Pearso Educatio, Ic Chapter Orgaizig ad Summarizig Data Relative frequecy = frequecy sum of

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday Aoucemets MidtermII Review Sta 101 - Fall 2016 Duke Uiversity, Departmet of Statistical Sciece Office Hours Wedesday 12:30-2:30pm Watch liear regressio videos before lab o Thursday Dr. Abrahamse Slides

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes Admiistrative Notes s - Lecture 7 Fial review Fial Exam is Tuesday, May 0th (3-5pm Covers Chapters -8 ad 0 i textbook Brig ID cards to fial! Allowed: Calculators, double-sided 8.5 x cheat sheet Exam Rooms:

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

University of California, Los Angeles Department of Statistics. Hypothesis testing

University of California, Los Angeles Department of Statistics. Hypothesis testing Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Elemets of a hypothesis test: Hypothesis testig Istructor: Nicolas Christou 1. Null hypothesis, H 0 (claim about µ, p, σ 2, µ

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible

More information

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples. Chapter 9 & : Comparig Two Treatmets: This chapter focuses o two eperimetal desigs that are crucial to comparative studies: () idepedet samples ad () matched pair samples Idepedet Radom amples from Two

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1 Chapter 0 Comparig Two Proportios BPS - 5th Ed. Chapter 0 Case Study Machie Reliability A study is performed to test of the reliability of products produced by two machies. Machie A produced 8 defective

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

Statistical Hypothesis Testing. STAT 536: Genetic Statistics. Statistical Hypothesis Testing - Terminology. Hardy-Weinberg Disequilibrium

Statistical Hypothesis Testing. STAT 536: Genetic Statistics. Statistical Hypothesis Testing - Terminology. Hardy-Weinberg Disequilibrium Statistical Hypothesis Testig STAT 536: Geetic Statistics Kari S. Dorma Departmet of Statistics Iowa State Uiversity September 7, 006 Idetify a hypothesis, a idea you wat to test for its applicability

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics 8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion 1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

1 Models for Matched Pairs

1 Models for Matched Pairs 1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

Lecture 7: Non-parametric Comparison of Location. GENOME 560 Doug Fowler, GS

Lecture 7: Non-parametric Comparison of Location. GENOME 560 Doug Fowler, GS Lecture 7: No-parametric Compariso of Locatio GENOME 560 Doug Fowler, GS (dfowler@uw.edu) 1 Review How ca we set a cofidece iterval o a proportio? 2 What do we mea by oparametric? 3 Types of Data A Review

More information

UCLA STAT 110B Applied Statistics for Engineering and the Sciences

UCLA STAT 110B Applied Statistics for Engineering and the Sciences UCLA STAT 110B Applied Statistics for Egieerig ad the Scieces Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistats: Bria Ng, UCLA Statistics Uiversity of Califoria, Los Ageles,

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

MA238 Assignment 4 Solutions (part a)

MA238 Assignment 4 Solutions (part a) (i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative

More information

Statistics 20: Final Exam Solutions Summer Session 2007

Statistics 20: Final Exam Solutions Summer Session 2007 1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

STA6938-Logistic Regression Model

STA6938-Logistic Regression Model Dr. Yig Zhag STA6938-Logistic Regressio Model Topic -Simple (Uivariate) Logistic Regressio Model Outlies:. Itroductio. A Example-Does the liear regressio model always work? 3. Maximum Likelihood Curve

More information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N. 3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i :

Error & Uncertainty. Error. More on errors. Uncertainty. Page # The error is the difference between a TRUE value, x, and a MEASURED value, x i : Error Error & Ucertaity The error is the differece betwee a TRUE value,, ad a MEASURED value, i : E = i There is o error-free measuremet. The sigificace of a measuremet caot be judged uless the associate

More information

Comparing your lab results with the others by one-way ANOVA

Comparing your lab results with the others by one-way ANOVA Comparig your lab results with the others by oe-way ANOVA You may have developed a ew test method ad i your method validatio process you would like to check the method s ruggedess by coductig a simple

More information

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 7: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review How ca we set a cofidece iterval o a proportio? 2 Review How ca we set a cofidece iterval

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43 PAPER NO.: 444, 445 PAGE NO.: Page 1 of 1 INSTRUCTIONS I. You have bee provided with: a) the examiatio paper i two parts (PART A ad PART B), b) a multiple choice aswer sheet (for PART A), c) selected formulae

More information

Additional Notes and Computational Formulas CHAPTER 3

Additional Notes and Computational Formulas CHAPTER 3 Additioal Notes ad Computatioal Formulas APPENDIX CHAPTER 3 1 The Greek capital sigma is the mathematical sig for summatio If we have a sample of observatios say y 1 y 2 y 3 y their sum is y 1 + y 2 +

More information

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 017 MODULE 4 : Liear models Time allowed: Oe ad a half hours Cadidates should aswer THREE questios. Each questio carries

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

Statistical Fundamentals and Control Charts

Statistical Fundamentals and Control Charts Statistical Fudametals ad Cotrol Charts 1. Statistical Process Cotrol Basics Chace causes of variatio uavoidable causes of variatios Assigable causes of variatio large variatios related to machies, materials,

More information

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences UCLA STAT 13 Itroductio to Statistical Methods for the Life ad Health Scieces Istructor: Ivo Diov, Asst. Prof. of Statistics ad Neurolog Sample Size Calculatios & Cofidece Itervals for Proportios Teachig

More information

Biostatistics for Med Students. Lecture 2

Biostatistics for Med Students. Lecture 2 Biostatistics for Med Studets Lecture 2 Joh J. Che, Ph.D. Professor & Director of Biostatistics Core UH JABSOM JABSOM MD7 February 22, 2017 Lecture Objectives To uderstad basic research desig priciples

More information

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008 Chapter 6 Part 5 Cofidece Itervals t distributio chi square distributio October 23, 2008 The will be o help sessio o Moday, October 27. Goal: To clearly uderstad the lik betwee probability ad cofidece

More information

Grant MacEwan University STAT 252 Dr. Karen Buro Formula Sheet

Grant MacEwan University STAT 252 Dr. Karen Buro Formula Sheet Grat MacEwa Uiversity STAT 5 Dr. Kare Buro Formula Sheet Descriptive Statistics Sample Mea: x = x i i= Sample Variace: s = i= (x i x) = Σ i=x i (Σ i= x i) Sample Stadard Deviatio: s = Sample Variace =

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So,

First, note that the LS residuals are orthogonal to the regressors. X Xb X y = 0 ( normal equations ; (k 1) ) So, 0 2. OLS Part II The OLS residuals are orthogoal to the regressors. If the model icludes a itercept, the orthogoality of the residuals ad regressors gives rise to three results, which have limited practical

More information

Formulas and Tables for Gerstman

Formulas and Tables for Gerstman Formulas ad Tables for Gerstma Measuremet ad Study Desig Biostatistics is more tha a compilatio of computatioal techiques! Measuremet scales: quatitative, ordial, categorical Iformatio quality is primary

More information

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters? CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter

More information

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),

More information