One-way ANOVA Inference for one-way ANOVA
|
|
- Roderick Webster
- 5 years ago
- Views:
Transcription
1 One-way ANOVA Inference for one-way ANOVA IPS Chater W.H. Freeman and Comany
2 Objectives (IPS Chater 12.1) Inference for one-way ANOVA Comaring means The two-samle t statistic An overview of ANOVA The ANOVA model Testing hyotheses in one-way ANOVA The F-test The ANOVA table
3 The idea of ANOVA Reminders: A factor is a variable that can take one of several levels used to differentiate one grou from another. An exeriment has a one-way, or comletely randomized, design if several levels of one factor are being studied and the individuals are randomly assigned to its levels. Examle: Four levels of nematode quantity in seedling growth exeriment. Two seed secies and four levels of nematodes would be a two-way design. Analysis of variance (ANOVA) is the technique used to determine if there are any differences among the means of the treatment grous. One-way ANOVA is used for comletely randomized, one-way designs.
4 Comaring means We want to know if the observed differences in samle means are likely to have occurred by chance just because of random samling. This will likely deend on both the difference between the samle means and how much variability there is within each samle.
5 Reminder: Two-samle t statistic A two samle t-test assuming equal variance or an ANOVA comaring only two grous will give you the exact same -value (for a two-sided hyothesis). H 0 : µ 1 = µ 2 H a : µ 1 µ 2 One-way ANOVA F-statistic H 0 : µ 1 = µ 2 H a : µ 1 µ 2 t-test assuming equal variance t-statistic F = t 2 and both -values are the same. But the t-test is more flexible: You may choose a one-sided alternative instead, or you may want to run a t-test assuming unequal variance if you are not sure that your two oulations have the same standard deviation σ.
6 An Overview of ANOVA We first examine the multile oulations or multile treatments to test for overall statistical significance as evidence of any difference among the arameters we want to comare using the ANOVA F-test If that overall test shows statistical significance, then a detailed follow-u analysis is legitimate. If we lanned our exeriment with secific alternative hyotheses in mind (before gathering the data), we can test them using contrasts. If we do not have secific alternatives, we can examine all air-wise arameter comarisons to define which arameters differ from which, using multile comarisons rocedures.
7 Nematodes and lant growth Do nematodes affect lant growth? A botanist reares 16 identical lanting ots and adds different numbers of nematodes into the ots. Seedling growth (in mm) is recorded two weeks later. Hyotheses: All µ i are the same (H 0 ) versus not All µ i are the same (H a )
8 The ANOVA model Random samling always roduces chance variations. Any factor effect would thus show u in our data as the factor-driven differences lus chance variations ( error ): Data = fit ( grou mean ) + residual ( error ) The one-way ANOVA model analyzes situations where chance variations are normally distributed N(0,σ) so that:
9 The ANOVA table Source of variation Sum of squares SS DF Mean square MS F P value Grous I -1 SSG/DFG MSG/MSE Tail area SSG = n i (x i x) 2 above F Error N - I SSE/DFE 2 SSE = (n i 1)s i Total SST=SSG+SSE N 1 = (x ij x) 2 R 2 = SSG/SST MSE = s Coefficient of determination Pooled standard deviation The sum of squares reresents variation in the data: SST = SSG + SSE. The degrees of freedom likewise reflect the ANOVA model: DFT = DFG + DFE.
10 Testing hyotheses in one-way ANOVA We have I indeendent SRSs, from I oulations or treatments. The i th oulation has a normal distribution with unknown mean µ i. All I oulations have the same standard deviation σ, unknown. The ANOVA F statistic tests: F SSG ( I = SSE ( N 1) I) H 0 : µ 1 = µ 2 = = µ I H a : not all the µ i are equal. When H 0 is true, F has the F distribution with I 1 (numerator) and N I (denominator) degrees of freedom.
11 The ANOVA F-test The ANOVA F-statistic comares variation due to secific sources (levels of the factor) with variation among individuals who should be similar (individuals in the same samle). F = variation among samlemeans variation among individuals in samesamle Difference in means small relative to overall variability Difference in means large relative to overall variability è F tends to be small è F tends to be large Larger F-values tyically yield more significant results. How large deends on the degrees of freedom (I 1 and N I).
12 Checking our assumtions Each of the I oulations must be normally distributed (histograms or normal quantile lots). But the test is robust to normality deviations for large enough samle sizes, thanks to the central limit theorem. The ANOVA F-test requires that all oulations have the same standard deviation σ. Since σ is unknown, this can be hard to check. Practically: The results of the ANOVA F-test are aroximately correct when the largest samle standard deviation is no more than twice as large as the smallest samle standard deviation. (Equal samle sizes also make ANOVA more robust to deviations from the equal σ rule)
13 Do nematodes affect lant growth? Seedling growth x i s i 0 nematode nematodes nematodes nematodes Conditions required: equal variances: checking that largest s i no more than twice smallest s i Largest s i = 2.053; smallest s i = Indeendent SRSs Four grous obviously indeendent Distributions roughly normal It is hard to assess normality with only four oints er condition. But the ots in each grou are identical, and there is no reason to susect skewed distributions.
14 Excel outut for the one-way ANOVA Menu/Tools/DataAnalysis/AnovaSingleFactor Anova: Single Factor SUMMARY Grous Count Sum Average Variance 0 nematode nematodes nematodes nematodes numerator denominator ANOVA Source of Variation SS df MS F P-value F crit Between Grous Within Grous Total Here, the calculated F-value (12.08) is larger than F critical (3.49) for α=0.05. Thus, the test is significant at α=5% è Not all mean seedling lengths are the same; the number of nematodes is an influential factor.
15 SPSS outut for the one-way ANOVA ANOVA SeedlingLength Between Grous Within Grous Total Sum of Squares df Mean Square F Sig The ANOVA found that the amount of nematodes in ots significantly imacts seedling growth. The grah suggests that nematode amounts above 1,000 er ot are detrimental to seedling growth.
16 Using Table E The F distribution is asymmetrical and has two distinct degrees of freedom. This was discovered by Fisher, hence the label F. Once again, what we do is calculate the value of F for our samle data and then look u the corresonding area under the curve in Table E.
17 Table E df num = I 1 For df: 5,4 F df den = N I
18 ANOVA Source of Variation SS df MS F P-value F crit Between Grous Within Grous Total F critical for α 5% is 3.49 F = > Thus < 0.001
19 Yogurt rearation and taste Yogurt can be made using three distinct commercial rearation methods: traditional, ultra filtration, and reverse osmosis. To study the effect of these methods on taste, an exeriment was designed where three batches of yogurt were reared for each of the three methods. A trained exert tasted each of the nine samles, resented in random order, and judged them on a scale of 1 to 10. Variables, hyotheses, assumtions, calculations? ANOVA table Source of variation SS df MS F P-value F crit Between grous 17.3 I-1= Within grous 4.6 N-I= Total 21.9
20 df num = I 1 df den = N I F
21 Comutation details F = MSG MSE = SSG ( I SSE ( N 1) I) MSG, the mean square for grous, measures how different the individual means are from the overall mean (weighted average of square distances of samle averages to the overall mean). SSG is the sum of squares for grous. MSE, the mean square for error is the ooled samle variance s 2 and estimates the common variance σ 2 of the I oulations (weighted average of the variances from each of the I samles). SSE is the sum of squares for error.
22 One-way ANOVA Comaring the means IPS Chater W.H. Freeman and Comany
23 Objectives (IPS Chater 12.2) Comaring the means Contrasts Multile comarisons Power of the one-way ANOVA test
24 You have calculated a -value for your ANOVA test. Now what? If you found a significant result, you still need to determine which treatments were different from which. You can gain insight by looking back at your lots (boxlot, mean ± s). There are several tests of statistical significance designed secifically for multile tests. You can choose a riori contrasts, or a osteriori multile comarisons. You can find the confidence interval for each mean µ i shown to be significantly different from the others.
25 Contrasts can be used only when there are clear exectations BEFORE starting an exeriment, and these are reflected in the exerimental design. Contrasts are lanned comarisons. Patients are given either drug A, drug B, or a lacebo. The lacebo is meant to rovide a baseline against which the other drugs can be comared. Multile comarisons should be used when there are no secific lanned comarisons. Those are a osteriori, air-wise tests of significance. We comare gas mileage for eight brands of SUVs. We have no rior knowledge to exect one brand to erform differently from the rest. Pairwise comarisons should be erformed here, but only if an ANOVA test on all eight brands reached statistical significance first. It is NOT aroriate to use a contrast test when suggested comarisons aear only after the data is collected.
26 Contrasts: lanned comarisons When an exeriment is designed to test one or more secific hyotheses that some treatments are different from other treatments, we can use contrasts to test for significant differences between these secific treatments, EVEN if the overall ANOVA F-test is not significant. Contrasts are more owerful than multile comarisons because they are more secific. They are more able to ick u a significant difference. You can use a t-test on the contrasts or calculate a t-confidence interval. The results are valid regardless of the results of your multile samle ANOVA test (you are still testing a valid hyothesis).
27 A contrast is a combination of oulation means of the form : ψ = a µ Where the coefficients a i have sum 0. The corresonding samle contrast is : c = a i x i The standard error of c is : SE c = s i 2 ai = n i i MSE a n 2 i i To test the null hyothesis H 0 : ψ = 0 use the t-statistic: t = c SE c With degrees of freedom DFE that is associated with s. The alternative hyothesis can be one- or two-sided. A level C confidence interval for the contrast ψ is : c ± t * SE c Where t* is the critical value defining the middle C% of the t distribution with DFE degrees of freedom.
28 Contrasts are not always readily available in statistical software ackages (when they are, you need to assign the coefficients a i ), or may be limited to comaring each samle to a control. If your software doesn t rovide an otion for contrasts, you can test your contrast hyothesis with a regular t-test using the formulas we just highlighted. Remember to use the ooled variance and degrees of freedom as they reflect the better estimate of the oulation variance.
29 Nematodes and lant growth Do nematodes affect lant growth? A botanist reares 16 identical lanting ots and adds different numbers of nematodes into the ots. Seedling growth (in mm) is recorded two weeks later. Nematodes Seedling growth , , , overall mean 8.03 x i One grou contains no nematodes at all. If the botanist lanned this grou as a baseline/control, then a contrast of all the nematode grous against the control would be valid.
30 Nematodes: lanned comarison Contrast of all the nematode grous against the control: Combined contrast hyotheses: H 0 : µ 1 = 1/3 (µ 2 + µ 3 + µ 4 ) H a : µ 1 > 1/3 (µ 2 + µ 3 + µ 4 ) x i s i G1: 0 nematode G2: 1,000 nematodes G3: 5,000 nematodes G4: 1,0000 nematodes Contrast coefficients: (+1 1/3 1/3 1/3) or ( ) c = a i xi = 3 * = SE c = s a n 2 i i = 2.78 * ( 1) + 3* t = c SE = df : N-I = 12 c In Excel: TDIST(3.6,12,1) = tdist(t, df, tails) (-value). Nematodes result in significantly shorter seedlings (alha 1%).
31 ANOVA vs. contrasts in SPSS ANOVA: SeedlingLength Between Grous Within Grous Total H 0 : all µ i are equal vs. H a : not all µ i are equal ANOVA Sum of Squares df Mean Square F Sig è not all µ i are equal Planned comarison: H 0 : µ 1 = 1/3 (µ 2 + µ 3 + µ 4 ) vs. H a : µ 1 > 1/3 (µ 2 + µ 3 + µ 4 ) à one tailed Contrast coefficients: ( ) Contrast 1 Contrast Coefficients NematodesLevel SeedlingLength Assume equal variances Does not assume equal variances Contrast 1 1 Contrast Tests Value of Contrast Std. Error t df Sig. (2-tailed) Nematodes result in significantly shorter seedlings (alha 1%).
32 Multile comarisons Multile comarison tests are variants on the two-samle t-test. They use the ooled standard deviation s = MSE, the ooled degrees of freedom DFE, and they comensate for the multile comarisons. We comute the t-statistic for all airs of means: A given test is significant (µ i and µ j significantly different), when t ij t** (df = DFE). The value of t** deends on which rocedure you choose to use.
33 The Bonferroni rocedure The Bonferroni rocedure erforms a number of airwise comarisons with t-tests and then multilies each -value by the number of comarisons made. This ensures that the robability of making one or more false rejections among all comarisons made is no greater than the chosen significance level α. As a consequence, the higher the number of air-wise comarisons you make, the more difficult it will be to show statistical significance for each test. But the chance of committing a tye I error also increases with the number of tests made. The Bonferroni rocedure lowers the working significance level of each test to comensate for the increased chance of tye I errors among all tests erformed.
34 Simultaneous confidence intervals We can also calculate simultaneous level C confidence intervals for all air-wise differences (µ i µ j ) between oulation means: CI 1 : ( xi x j ) ± t ** s + n i 1 n j q s is the ooled variance, MSE. q t** is the t critical with degrees of freedom DFE = N I, adjusted for multile, simultaneous comarisons (e.g., Bonferroni rocedure).
35 SYSTAT File contains variables: GROWTH NEMATODES$ Categorical values encountered during rocessing are: NEMATODES$ (four levels): 10K, 1K, 5K, none De Var: GROWTH N: 16 Multile R: Squared multile R: Analysis of Variance Source Sum-of-Squares df Mean-Square F-ratio P NEMATODES$ Error Post Hoc test of GROWTH Using model MSE of with 12 df. Matrix of airwise mean differences: Bonferroni Adjustment Matrix of airwise comarison robabilities:
36 SigmaStat One-Way Analysis of Variance Normality Test: Passed (P > 0.050) Equal Variance Test: Passed (P = 0.807) Grou Name N Missing Mean Std dev SEM None K K K Source of variation DF SS MS F P Between grous <0.001 Residual Total Power of erformed test with alha = 0.050: All Pairwise Multile Comarison Procedures (Bonferroni t-test): Comarisons for factor: Nematodes Comarison Diff of means t P P<0.050 None vs. 10K Yes None vs. 5K Yes None vs. 1K No 1K vs. 10K Yes 1K vs. 5K Yes 5K vs. 10K No
37 SPSS SeedlingLength Between Grous Within Grous Total ANOVA Sum of Squares df Mean Square F Sig Multile Comarisons Deendent Variable: SeedlingLength Bonferroni (I) NematodesLevel (J) NematodesLevel *. The mean difference is significant at the.05 level. Mean Difference 95% Confidence Interval (I-J) Std. Error Sig. Lower Bound Uer Bound * * * * * * * *
38 Teaching methods A study comares the reading comrehension ( COMP, a test score) of children randomly assigned to one of three teaching methods: basal, DRTA, and strategies. We test: H 0 : µ Basal = µ DRTA = µ Strat vs. H a : H 0 not true The ANOVA test is significant (α 5%): we have found evidence that the three methods do not all yield the same oulation mean reading comrehension score.
39 What do you conclude? The three methods do not yield the same results: We found evidence of a significant difference between DRTA and basal methods (DRTA gave better results on average), but the data gathered does not suort the claim of a difference between the other methods (DRTA vs. strategies or basal vs. strategies).
40 Power The ower, or sensitivity, of a one-way ANOVA is the robability that the test will be able to detect a difference among the grous (i.e. reach statistical significance) when there really is a difference. Estimate the ower of your test while designing your exeriment to select samle sizes aroriate to detect an amount of difference between means that you deem imortant. Too small a samle is a waste of exeriment, but too large a samle is also a waste of resources. A ower of at least 80% is often suggested.
41 Power comutations ANOVA ower is affected by The significance level α The samle sizes and number of grous being comared The differences between grou means µ i The guessed oulation standard deviation You need to decide what alternative H a you would consider imortant, detect statistically for the means µ i, and to guess the common standard deviation σ (from similar studies or reliminary work). The ower comutations then require calculating a non-centrality aramenter λ, which follows the F distribution with DFG and DFE degrees of freedom to arrive at the ower of the test.
42 Systat: Power analysis Alha = 0.05 Model = Oneway If we anticiated a gradual decrease of Number of grous = 4 Within cell S.D. = 1.5 guessed seedling length for increasing amounts Mean(01) = 7.0 of nematodes in the ots and would Mean(02) = 8.0 consider gradual changes of 1 mm on Mean(03) = 9.0 Mean(04) = 10.0 average to be imortant enough to be Effect Size = reorted in a scientific journal Noncentrality arameter = * samle size SAMPLE SIZE POWER (er cell) then we would reach a ower of 80% or more when using six ots or more for each condition (Four ots er condition would only bring a ower of 55%.)
43 Systat: Power analysis If we anticiated that only large amounts of nematodes in the ots would lead to substantially shorter seedling lengths and would consider ste changes of 4 mm on average an imortant effect then we would reach a ower of 80% or more when using four ots or more in each grou (three ots er condition might be close enough though). Alha = 0.05 Power = 0.80 Model = One-way Number of grous = 4 Within cell S.D. = 1.7 Mean(01) = 6.0 Mean(02) = 6.0 Mean(03) = 10.0 Mean(04) = 10.0 Effect size = Noncentrality arameter = SAMPLE SIZE POWER (er cell) Total samle size = 16 guessed * samle size
Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.
The idea of ANOVA Reminders: A factor is a variable that can take one of several levels used to differentiate one group from another. An experiment has a one-way, or completely randomized, design if several
More informationHotelling s Two- Sample T 2
Chater 600 Hotelling s Two- Samle T Introduction This module calculates ower for the Hotelling s two-grou, T-squared (T) test statistic. Hotelling s T is an extension of the univariate two-samle t-test
More informationCHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit
Chater 5 Statistical Inference 69 CHAPTER 5 STATISTICAL INFERENCE.0 Hyothesis Testing.0 Decision Errors 3.0 How a Hyothesis is Tested 4.0 Test for Goodness of Fit 5.0 Inferences about Two Means It ain't
More informationTests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)
Chater 225 Tests for Two Proortions in a Stratified Design (Cochran/Mantel-Haenszel Test) Introduction In a stratified design, the subects are selected from two or more strata which are formed from imortant
More information1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative
More informationANOVA: Analysis of Variation
ANOVA: Analysis of Variation The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative variables depend on which group (given by categorical
More information7.2 Inference for comparing means of two populations where the samples are independent
Objectives 7.2 Inference for comaring means of two oulations where the samles are indeendent Two-samle t significance test (we give three examles) Two-samle t confidence interval htt://onlinestatbook.com/2/tests_of_means/difference_means.ht
More informationStatistiek II. John Nerbonne using reworkings by Hartmut Fitz and Wilbert Heeringa. February 13, Dept of Information Science
Statistiek II John Nerbonne using reworkings by Hartmut Fitz and Wilbert Heeringa Dept of Information Science j.nerbonne@rug.nl February 13, 2014 Course outline 1 One-way ANOVA. 2 Factorial ANOVA. 3 Repeated
More informationIntroduction to Probability and Statistics
Introduction to Probability and Statistics Chater 8 Ammar M. Sarhan, asarhan@mathstat.dal.ca Deartment of Mathematics and Statistics, Dalhousie University Fall Semester 28 Chater 8 Tests of Hyotheses Based
More informationEcon 3790: Business and Economics Statistics. Instructor: Yogesh Uppal
Econ 379: Business and Economics Statistics Instructor: Yogesh Ual Email: yual@ysu.edu Chater 9, Part A: Hyothesis Tests Develoing Null and Alternative Hyotheses Tye I and Tye II Errors Poulation Mean:
More informationInference for Regression Simple Linear Regression
Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating
More informationSlides Prepared by JOHN S. LOUCKS St. Edward s s University Thomson/South-Western. Slide
s Preared by JOHN S. LOUCKS St. Edward s s University 1 Chater 11 Comarisons Involving Proortions and a Test of Indeendence Inferences About the Difference Between Two Poulation Proortions Hyothesis Test
More informationEcon 3790: Business and Economics Statistics. Instructor: Yogesh Uppal
Econ 379: Business and Economics Statistics Instructor: Yogesh Ual Email: yual@ysu.edu Chater 9, Part A: Hyothesis Tests Develoing Null and Alternative Hyotheses Tye I and Tye II Errors Poulation Mean:
More informationChapter 4 Randomized Blocks, Latin Squares, and Related Designs Solutions
Solutions from Montgomery, D. C. (008) Design and Analysis of Exeriments, Wiley, NY Chater 4 Randomized Blocks, Latin Squares, and Related Designs Solutions 4.. The ANOVA from a randomized comlete block
More informationInference for Regression Inference about the Regression Model and Using the Regression Line
Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about
More informationMonte Carlo Studies. Monte Carlo Studies. Sampling Distribution
Monte Carlo Studies Do not let yourself be intimidated by the material in this lecture This lecture involves more theory but is meant to imrove your understanding of: Samling distributions and tests of
More informationcompare time needed for lexical recognition in healthy adults, patients with Wernicke s aphasia, patients with Broca s aphasia
Analysis of Variance ANOVA ANalysis Of VAriance generalized t-test compares means of more than two groups fairly robust based on F distribution, compares variance two versions single ANOVA compare groups
More informationSTK4900/ Lecture 7. Program
STK4900/9900 - Lecture 7 Program 1. Logistic regression with one redictor 2. Maximum likelihood estimation 3. Logistic regression with several redictors 4. Deviance and likelihood ratio tests 5. A comment
More informationCompletely Randomized Design
CHAPTER 4 Comletely Randomized Design 4.1 Descrition of the Design Chaters 1 to 3 introduced some basic concets and statistical tools that are used in exerimental design. In this and the following chaters,
More informationThe one-sample t test for a population mean
Objectives Constructing and assessing hyotheses The t-statistic and the P-value Statistical significance The one-samle t test for a oulation mean One-sided versus two-sided tests Further reading: OS3,
More informationStatistics II Logistic Regression. So far... Two-way repeated measures ANOVA: an example. RM-ANOVA example: the data after log transform
Statistics II Logistic Regression Çağrı Çöltekin Exam date & time: June 21, 10:00 13:00 (The same day/time lanned at the beginning of the semester) University of Groningen, Det of Information Science May
More information2. Sample representativeness. That means some type of probability/random sampling.
1 Neuendorf Cluster Analysis Assumes: 1. Actually, any level of measurement (nominal, ordinal, interval/ratio) is accetable for certain tyes of clustering. The tyical methods, though, require metric (I/R)
More informationMeasuring center and spread for density curves. Calculating probabilities using the standard Normal Table (CIS Chapter 8, p 105 mainly p114)
Objectives 1.3 Density curves and Normal distributions Density curves Measuring center and sread for density curves Normal distributions The 68-95-99.7 (Emirical) rule Standardizing observations Calculating
More information4. Score normalization technical details We now discuss the technical details of the score normalization method.
SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules
More informationMeasuring center and spread for density curves. Calculating probabilities using the standard Normal Table (CIS Chapter 8, p 105 mainly p114)
Objectives Density curves Measuring center and sread for density curves Normal distributions The 68-95-99.7 (Emirical) rule Standardizing observations Calculating robabilities using the standard Normal
More informationGeneral Linear Model Introduction, Classes of Linear models and Estimation
Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)
More informationAnalysis of Variance Inf. Stats. ANOVA ANalysis Of VAriance. Analysis of Variance Inf. Stats
Analysis of Variance ANOVA ANalysis Of VAriance generalized t-test compares means of more than two groups fairly robust based on F distribution, compares variance two versions single ANOVA compare groups
More informationObjectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)
Objectives 6.1, 7.1 Estimating with confidence (CIS: Chater 10) Statistical confidence (CIS gives a good exlanation of a 95% CI) Confidence intervals. Further reading htt://onlinestatbook.com/2/estimation/confidence.html
More informationFactorial Studies. Factor A I levels Factor B J levels o I*J Treatment groups o n ij values for each treatment combination
Factorial tudies Factor A I levels Factor B J levels o I*J Treatment grous o n ij values for each treatment combination B1 B.. BJ A1 n 11 n 1 n 1J A n 1 n n 1J AI n I1 n I n IJ Equal relication, n relicates,
More informationIn ANOVA the response variable is numerical and the explanatory variables are categorical.
1 ANOVA ANOVA means ANalysis Of VAriance. The ANOVA is a tool for studying the influence of one or more qualitative variables on the mean of a numerical variable in a population. In ANOVA the response
More informationAI*IA 2003 Fusion of Multiple Pattern Classifiers PART III
AI*IA 23 Fusion of Multile Pattern Classifiers PART III AI*IA 23 Tutorial on Fusion of Multile Pattern Classifiers by F. Roli 49 Methods for fusing multile classifiers Methods for fusing multile classifiers
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationHypothesis Test-Confidence Interval connection
Hyothesis Test-Confidence Interval connection Hyothesis tests for mean Tell whether observed data are consistent with μ = μ. More secifically An hyothesis test with significance level α will reject the
More informationMultiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company
Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple
More informationBattery Life. Factory
Statistics 354 (Fall 2018) Analysis of Variance: Comparing Several Means Remark. These notes are from an elementary statistics class and introduce the Analysis of Variance technique for comparing several
More informationUse of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek
Use of Transformations and the Reeated Statement in PROC GLM in SAS Ed Stanek Introduction We describe how the Reeated Statement in PROC GLM in SAS transforms the data to rovide tests of hyotheses of interest.
More informationCHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules
CHAPTER-II Control Charts for Fraction Nonconforming using m-of-m Runs Rules. Introduction: The is widely used in industry to monitor the number of fraction nonconforming units. A nonconforming unit is
More informationObjectives. Estimating with confidence Confidence intervals.
Objectives Estimating with confidence Confidence intervals. Sections 6.1 and 7.1 in IPS. Page 174-180 OS3. Choosing the samle size t distributions. Further reading htt://onlinestatbook.com/2/estimation/t_distribution.html
More informationCombining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)
Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment
More informationChapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population
Chater 7 and s Selecting a Samle Point Estimation Introduction to s of Proerties of Point Estimators Other Methods Introduction An element is the entity on which data are collected. A oulation is a collection
More informationSupplementary Materials for Robust Estimation of the False Discovery Rate
Sulementary Materials for Robust Estimation of the False Discovery Rate Stan Pounds and Cheng Cheng This sulemental contains roofs regarding theoretical roerties of the roosed method (Section S1), rovides
More informationInference for Distributions Inference for the Mean of a Population
Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Exploring Data: Distributions Look for overall pattern (shape, center, spread) and deviations (outliers). Mean (use a calculator): x = x 1 + x
More informationSolved Problems. (a) (b) (c) Figure P4.1 Simple Classification Problems First we draw a line between each set of dark and light data points.
Solved Problems Solved Problems P Solve the three simle classification roblems shown in Figure P by drawing a decision boundary Find weight and bias values that result in single-neuron ercetrons with the
More informationSoci Data Analysis in Sociological Research. Homework 4 Computer Handout. Chapter 19 Confidence Intervals for Proportions
University of North Carolina Chael Hill Soci252-002 Data Analysis in Sociological Research Sring 2013 Professor François Nielsen Homework 4 Comuter Handout Readings This handout covers comuter issues related
More informationAnalysis of variance
Analysis of variance 1 Method If the null hypothesis is true, then the populations are the same: they are normal, and they have the same mean and the same variance. We will estimate the numerical value
More information18. Two-sample problems for population means (σ unknown)
8. Two-samle roblems for oulatio meas (σ ukow) The Practice of Statistics i the Life Scieces Third Editio 04 W.H. Freema ad Comay Objectives (PSLS Chater 8) Comarig two meas (σ ukow) Two-samle situatios
More informationNotes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1
Notes for Wee 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Exam 3 is on Friday May 1. A part of one of the exam problems is on Predictiontervals : When randomly sampling from a normal population
More information¼ ¼ 6:0. sum of all sample means in ð8þ 25
1. Samling Distribution of means. A oulation consists of the five numbers 2, 3, 6, 8, and 11. Consider all ossible samles of size 2 that can be drawn with relacement from this oulation. Find the mean of
More informationDetermining Momentum and Energy Corrections for g1c Using Kinematic Fitting
CLAS-NOTE 4-17 Determining Momentum and Energy Corrections for g1c Using Kinematic Fitting Mike Williams, Doug Alegate and Curtis A. Meyer Carnegie Mellon University June 7, 24 Abstract We have used the
More informationSAS for Bayesian Mediation Analysis
Paer 1569-2014 SAS for Bayesian Mediation Analysis Miočević Milica, Arizona State University; David P. MacKinnon, Arizona State University ABSTRACT Recent statistical mediation analysis research focuses
More informationANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College An example ANOVA situation Example (Treating Blisters) Subjects: 25 patients with blisters Treatments: Treatment A, Treatment
More informationChapter 13 Variable Selection and Model Building
Chater 3 Variable Selection and Model Building The comlete regsion analysis deends on the exlanatory variables ent in the model. It is understood in the regsion analysis that only correct and imortant
More informationSTA 250: Statistics. Notes 7. Bayesian Approach to Statistics. Book chapters: 7.2
STA 25: Statistics Notes 7. Bayesian Aroach to Statistics Book chaters: 7.2 1 From calibrating a rocedure to quantifying uncertainty We saw that the central idea of classical testing is to rovide a rigorous
More informationOn split sample and randomized confidence intervals for binomial proportions
On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have
More informationChapter 10. Supplemental Text Material
Chater 1. Sulemental Tet Material S1-1. The Covariance Matri of the Regression Coefficients In Section 1-3 of the tetbook, we show that the least squares estimator of β in the linear regression model y=
More informationA Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression
Journal of Modern Alied Statistical Methods Volume Issue Article 7 --03 A Comarison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Ghadban Khalaf King Khalid University, Saudi
More informationRobustness of multiple comparisons against variance heterogeneity Dijkstra, J.B.
Robustness of multile comarisons against variance heterogeneity Dijkstra, J.B. Published: 01/01/1983 Document Version Publisher s PDF, also known as Version of Record (includes final age, issue and volume
More informationLecture 3: Inference in SLR
Lecture 3: Inference in SLR STAT 51 Spring 011 Background Reading KNNL:.1.6 3-1 Topic Overview This topic will cover: Review of hypothesis testing Inference about 1 Inference about 0 Confidence Intervals
More informationClassical gas (molecules) Phonon gas Number fixed Population depends on frequency of mode and temperature: 1. For each particle. For an N-particle gas
Lecture 14: Thermal conductivity Review: honons as articles In chater 5, we have been considering quantized waves in solids to be articles and this becomes very imortant when we discuss thermal conductivity.
More informationute measures of uncertainty called standard errors for these b j estimates and the resulting forecasts if certain conditions are satis- ed. Note the e
Regression with Time Series Errors David A. Dickey, North Carolina State University Abstract: The basic assumtions of regression are reviewed. Grahical and statistical methods for checking the assumtions
More informationThe Poisson Regression Model
The Poisson Regression Model The Poisson regression model aims at modeling a counting variable Y, counting the number of times that a certain event occurs during a given time eriod. We observe a samle
More informationHypothesis testing: Steps
Review for Exam 2 Hypothesis testing: Steps Exam 2 Review 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region 3. Compute
More informationUsing the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process
Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process P. Mantalos a1, K. Mattheou b, A. Karagrigoriou b a.deartment of Statistics University of Lund
More informationANOVA: Comparing More Than Two Means
1 ANOVA: Comparing More Than Two Means 10.1 ANOVA: The Completely Randomized Design Elements of a Designed Experiment Before we begin any calculations, we need to discuss some terminology. To make this
More informationAnnouncements. Unit 4: Inference for numerical variables Lecture 4: ANOVA. Data. Statistics 104
Announcements Announcements Unit 4: Inference for numerical variables Lecture 4: Statistics 104 Go to Sakai s to pick a time for a one-on-one meeting. Mine Çetinkaya-Rundel June 6, 2013 Statistics 104
More informationHypothesis testing: Steps
Review for Exam 2 Hypothesis testing: Steps Repeated-Measures ANOVA 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region
More informationarxiv: v1 [physics.data-an] 26 Oct 2012
Constraints on Yield Parameters in Extended Maximum Likelihood Fits Till Moritz Karbach a, Maximilian Schlu b a TU Dortmund, Germany, moritz.karbach@cern.ch b TU Dortmund, Germany, maximilian.schlu@cern.ch
More informationWeek 14 Comparing k(> 2) Populations
Week 14 Comparing k(> 2) Populations Week 14 Objectives Methods associated with testing for the equality of k(> 2) means or proportions are presented. Post-testing concepts and analysis are introduced.
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationThe legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization.
1 Chapter 1: Research Design Principles The legacy of Sir Ronald A. Fisher. Fisher s three fundamental principles: local control, replication, and randomization. 2 Chapter 2: Completely Randomized Design
More informationSystem Reliability Estimation and Confidence Regions from Subsystem and Full System Tests
009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract
More informationAnalysis of variance
Analysis of variance Tron Anders Moger 3.0.007 Comparing more than two groups Up to now we have studied situations with One observation per subject One group Two groups Two or more observations per subject
More informationi) the probability of type I error; ii) the 95% con dence interval; iii) the p value; iv) the probability of type II error; v) the power of a test.
Problem Set 5. Questions:. Exlain what is: i) the robability of tye I error; ii) the 95% con dence interval; iii) the value; iv) the robability of tye II error; v) the ower of a test.. Solve exercise 3.
More information1. The (dependent variable) is the variable of interest to be measured in the experiment.
Chapter 10 Analysis of variance (ANOVA) 10.1 Elements of a designed experiment 1. The (dependent variable) is the variable of interest to be measured in the experiment. 2. are those variables whose effect
More informationSTA2601. Tutorial letter 203/2/2017. Applied Statistics II. Semester 2. Department of Statistics STA2601/203/2/2017. Solutions to Assignment 03
STA60/03//07 Tutorial letter 03//07 Applied Statistics II STA60 Semester Department of Statistics Solutions to Assignment 03 Define tomorrow. university of south africa QUESTION (a) (i) The normal quantile
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationPlotting the Wilson distribution
, Survey of English Usage, University College London Setember 018 1 1. Introduction We have discussed the Wilson score interval at length elsewhere (Wallis 013a, b). Given an observed Binomial roortion
More informationHiggs Modeling using EXPER and Weak Fusion. by Woody Stanford (c) 2016 Stanford Systems.
iggs Modeling using EXPER and Weak Fusion by Woody Stanford (c) 2016 Stanford Systems. Introduction The EXPER roject, even though its original findings were inconclusive has lead to various ideas as to
More informationEstimation of the large covariance matrix with two-step monotone missing data
Estimation of the large covariance matrix with two-ste monotone missing data Masashi Hyodo, Nobumichi Shutoh 2, Takashi Seo, and Tatjana Pavlenko 3 Deartment of Mathematical Information Science, Tokyo
More informationLower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data
Quality Technology & Quantitative Management Vol. 1, No.,. 51-65, 15 QTQM IAQM 15 Lower onfidence Bound for Process-Yield Index with Autocorrelated Process Data Fu-Kwun Wang * and Yeneneh Tamirat Deartment
More informationOperations Management
Universidade Nova de Lisboa Faculdade de Economia Oerations Management Winter Semester 009/010 First Round Exam January, 8, 009, 8.30am Duration: h30 RULES 1. Do not searate any sheet. Write your name
More informationAdaptive estimation with change detection for streaming data
Adative estimation with change detection for streaming data A thesis resented for the degree of Doctor of Philosohy of the University of London and the Diloma of Imerial College by Dean Adam Bodenham Deartment
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More informationVariable Selection and Model Building
LINEAR REGRESSION ANALYSIS MODULE XIII Lecture - 38 Variable Selection and Model Building Dr. Shalabh Deartment of Mathematics and Statistics Indian Institute of Technology Kanur Evaluation of subset regression
More informationDeriving Indicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V.
Deriving ndicator Direct and Cross Variograms from a Normal Scores Variogram Model (bigaus-full) David F. Machuca Mory and Clayton V. Deutsch Centre for Comutational Geostatistics Deartment of Civil &
More informationThe One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)
The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures
More informationA New Asymmetric Interaction Ridge (AIR) Regression Method
A New Asymmetric Interaction Ridge (AIR) Regression Method by Kristofer Månsson, Ghazi Shukur, and Pär Sölander The Swedish Retail Institute, HUI Research, Stockholm, Sweden. Deartment of Economics and
More informationSensitivity and Robustness of Quantum Spin-½ Rings to Parameter Uncertainty
Sensitivity and Robustness of Quantum Sin-½ Rings to Parameter Uncertainty Sean O Neil, Member, IEEE, Edmond Jonckheere, Life Fellow, IEEE, Sohie Schirmer, Member, IEEE, and Frank Langbein, Member, IEEE
More informationUnit 27 One-Way Analysis of Variance
Unit 27 One-Way Analysis of Variance Objectives: To perform the hypothesis test in a one-way analysis of variance for comparing more than two population means Recall that a two sample t test is applied
More informationOHSU OGI Class ECE-580-DOE :Design of Experiments Steve Brainerd
Why We Use Analysis of Variance to Compare Group Means and How it Works The question of how to compare the population means of more than two groups is an important one to researchers. Let us suppose that
More informationSIMPLE REGRESSION ANALYSIS. Business Statistics
SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients
More informationEstimation of Separable Representations in Psychophysical Experiments
Estimation of Searable Reresentations in Psychohysical Exeriments Michele Bernasconi (mbernasconi@eco.uninsubria.it) Christine Choirat (cchoirat@eco.uninsubria.it) Raffaello Seri (rseri@eco.uninsubria.it)
More informationMultiple Comparisons
Multiple Comparisons Error Rates, A Priori Tests, and Post-Hoc Tests Multiple Comparisons: A Rationale Multiple comparison tests function to tease apart differences between the groups within our IV when
More information1 Introduction to One-way ANOVA
Review Source: Chapter 10 - Analysis of Variance (ANOVA). Example Data Source: Example problem 10.1 (dataset: exp10-1.mtw) Link to Data: http://www.auburn.edu/~carpedm/courses/stat3610/textbookdata/minitab/
More informationCOMPARING SEVERAL MEANS: ANOVA
LAST UPDATED: November 15, 2012 COMPARING SEVERAL MEANS: ANOVA Objectives 2 Basic principles of ANOVA Equations underlying one-way ANOVA Doing a one-way ANOVA in R Following up an ANOVA: Planned contrasts/comparisons
More informationarxiv: v3 [physics.data-an] 23 May 2011
Date: October, 8 arxiv:.7v [hysics.data-an] May -values for Model Evaluation F. Beaujean, A. Caldwell, D. Kollár, K. Kröninger Max-Planck-Institut für Physik, München, Germany CERN, Geneva, Switzerland
More informationANOVA: Analysis of Variance
ANOVA: Analysis of Variance Marc H. Mehlman marcmehlman@yahoo.com University of New Haven The analysis of variance is (not a mathematical theorem but) a simple method of arranging arithmetical facts so
More information22s:152 Applied Linear Regression. Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA)
22s:152 Applied Linear Regression Chapter 8: 1-Way Analysis of Variance (ANOVA) 2-Way Analysis of Variance (ANOVA) We now consider an analysis with only categorical predictors (i.e. all predictors are
More informationWolf River. Lecture 19 - ANOVA. Exploratory analysis. Wolf River - Data. Sta 111. June 11, 2014
Aldrin in the Wolf River Wolf River Lecture 19 - Sta 111 Colin Rundel June 11, 2014 The Wolf River in Tennessee flows past an abandoned site once used by the pesticide industry for dumping wastes, including
More information