Confidence Intervals with σ unknown
|
|
- Georgia McLaughlin
- 6 years ago
- Views:
Transcription
1 STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence Intervals with σ unknown Last Time: Confidence Interval when σ is known: A level C, or 100(1 α) % confidence interval for µ is [ X z α/2 σ n, X + z α/2 σ n ] But to return to reality, we don t know σ. Thus we must estimate the standard deviation of X with: SE X = s n But s is just a function of our X i s and thus is a random variable too it has a sampling distribution too. Before we could say if we knew σ which after algebra gave the confidence interval. P ( z α/2 < X µ σ/ n < z α/2) = 1 α [Remember for any s, z s is defined as where 1 2s of the area falls in ( z s, z s ). So z s = qnorm(1 s) = qnorm(s) = 1 s quantile. i.e. z s is the positive side.] Now we want a similar setup, so that: P (?? < X µ SE X <??) = α We need know the probability distribution of T = X µ. T has the Student s t-distribution with n 1 SE X degrees of freedom. We write this as T t n 1. The degrees of freedom=ν is the only parameter of this distribution. [book uses t s for T ] 1
2 t dist w/ df=1 t dist w/ df= t dist, df=1 N(0,1) t dist, df=5 N(0,1) t dist w/ df=10 t dist w/ df= t dist, df=10 N(0,1) t dist, df=100 N(0,1) RCode: > par(mfrow=c(2,2)) #tdist1.pdf > plot(seq(-6,6,length=10000),dnorm(seq(-6,6,length=10000)), type="l",lty=3,ylab="",xlab="",main="t-dist w/ df=1") > lines(seq(-6,6,length=10000),dt(seq(-6,6,length=10000),df=1), type="l",ylab="",xlab="") > legend(x=2,y=.4,lty=c(1,3),legend=c("t-dist, df=1","n(0,1)"))... Thus t-distribution approaches normal as ν increases, but for small n gives wider intervals. Why degrees of freedom?? 2
3 Let y i = x i x We have s 2 = 1 n 1 n 1 y 2 i and yi = 0( ) Now (*) < > 1 constraint on n numbers, hence the phrase n-1 degrees of freedom Now that we know the distribution, we know we can find the?? from above these are just the α/2 and 1 α/2 quantiles of the t-distribution. Let t n 1,s be defined similarly as z s and is equal to qt(1 s, df = n 1) = qt(s, df = n 1). We then have: P ( t n 1,α/2 < X µ SE X < t n 1,α/2 ) = 1 α This gives us a confidence interval like before, only we use the quantiles of the t-distribution rather than the normal distribution. Example. Taken from the original paper on t-test by W.S. Gossett, [Gossett was employed by Guiness Breweries, Dublin. A chemist, turned statistician, Guiness, fearing the results to be of commerical importance, forbade Gossett to publish under his own name. Chose pseudonym Student out of modesty] Two drugs to induce sleep: A- dextro, B= laevo. Each of ten patients receives both drugs (presumably in random order). Issue: Is drug B better than drug A? Student s sleep data:
4 data(sleep) extra group extra1=sleep[sleep[,2]==1,] extra2=sleep[sleep[,2]==2,] extradiff=extra2[,1]-extra1[,1] >extradiff > mean(extradiff) [1] 1.58 > sqrt(var(extradiff)) [1] > sqrt(var(extradiff)/10) [1] > 1.58/ [1] > qt(.975,9) [1] > qt(.995,9) [1] > qnorm(0.975) [1] > qnorm(0.995) [1] A level C conf. interval with σ unknown: exact if X Normal otherwise approx correct for large n Margin of error M in E ± M is t n 1, α 2 s = t n 1, α n 2 SE X Remark: Large value, 4.6 possible outlier, so some doubt about normal assumptions here. What s different?? Since we don t know σ, pay a penalty with a (slightly) wider interval: ( e.g t=2.262 vs. z=1.96 for 5% level confidence ) For large sample sizes we can just use the normal distribution quantiles z α/2, since the t-distribution quickly looks like the normal distribution. Proportions We saw last time that ˆp is approximately distributed as N(p, p(1 p) ). If we want a confidence interval for n ˆp we can use this normality to get an approximate confidence interval.
5 M = z α/2 SE p = z α/2 p(1 p) n The book offers a correction to this using p = y+0.5z2 α/2 n+z 2 α/2 and SE p = p(1 p) n+z 2 α/2. Two-samples One of the most common statistical procedures. Is there a difference? Is it real?? However, because of the preparatory work with one-sample problems, this should seem rather familiar, a case of dejà-vu., but with slightly more complex formulas. What do we mean by two-samples? Two groups Distinct populations [treatment/control,..., male/female... ] Grouping variable: categorical variable with 2 levels. Data is independent between groups Example: (Dalgaard p 87) Energy expenditure: Two groups of women, lean and obese. Twenty four hour energy expenditure in MJ. data(energy) lean_energy[energy$stature== lean,1] obese_energy[energy$stature== obese,1] obese [1] lean [1] plot(expend~stature,data=energy) Beware: Some data sets that may look like two sample problems are really better treated as paired data. Example: Sleep drugs data from above: 10 patients, Drugs A and B. But since each patient received both A and B, the samples are not really independent (common component of variation due to patient) better to look at differences. Becomes a one-sample problem. (Will discuss more about pairing/blocking later). Notation: Population Variable Mean SD SRS from Each Population Sample Size Sample Mean Sample SD Populatio X 1 µ 1 σ 1 Populatio X 2 µ 2 σ 2 Sample 1 X1 s 1 Sample 2 X2 s 2 5
6 Distribution of X 1 X 2 Sample mean difference: X 1 X 2 All depends on the variability and distribution of this difference!! Recall in general that if E(V ) = µ and E(W ) = ν then and if V and W are independent then So if X 1 (µ 1, σ2 1 ), X2 (µ 2, σ2 2 ), we will have E(V W ) = µ ν var(v W ) = var(v ) + var(w ) and for independent rvs X 1 and X 2 : µ X1 X 2 = E( X 1 X 2 ) = µ 1 µ 2 We need estimates for µ 1 µ 2 and σ 2 X1 X 2. σ 2 X1 X 2 = σ 2 X1 + σ 2 X2 = σ2 1 + σ2 2 Clearly X 1 X 2 is estimate for µ 1 µ 2. Once we have an estimate for the σ X1 X 2 then we can use similar method as for a 1-sample case to get a confidence interval. 1. Unequal variances: σ 2 1 σ 2 2 then use SE 2 X1 X 2 = s2 1 + s Equal Variances: If σ 2 1 = σ 2 2 = σ 2 is unknown but assumed to be equal, can use a pooled estimate of variance σ: s 2 pooled = ( 1)s ( 1)s i.e. average with weights equal to the respective degrees of freedom. Then our estimate of σ X1 + X 2 SE 2 pooled = s 2 pooled( ) Good method if the two SDs are close, but if also are moderate to large, there won t be much difference from the unequal variances method (below) If the two SDs are different, better to use unequal variances method. will use this pooled estimate again when we study Analysis of Variance As above, we need the distribution of: If X 1 N(µ 1, σ 2 1) and X 2 N(µ 2, σ 2 2) then: T = X 1 X 2 µ X1 X 2 SE of X 1 X 2 6
7 Equal Variances: If we have equal variances in the two populations, then SE of X 1 X 2 = SE pooled and T t ν with ν = + 2 Unequal Variances: Then SE of X 1 X 2 = SE X1 X 2 and T is approximately distributed as t ν. We use one of two values for ν 1. ν = min( 1, 1) 2. ν = s s ( s2 1 ) ( s2 2 ) 2 This is known as Welsh s formula which gives fractional degrees of freedom. More accurate formula (generally used by packages, and only on computers!): Can use either approximation, but say which! Note that one can generally not go too far wrong, since can show by algebra that min( 1, 1) ν + 2 Summary: Two sample confidence intervals for µ 1 µ 2 at the 100(1 α)% level E ± M, E = X 1 X 2, M = (z α/2 or t α/2 ) (appropriate SE) known large sample unknown, unequal unknown, equal σ1 M = z 2 α/2 + σ2 2 s M = z 1 α/2 + s2 2 s M = t 1 α/2,ν + s2 2 1 M = t α/2,ν s pooled ν = min( 1, 1) or ν ν = + 2 where z α/2 and t α/2,ν are same notation as for one-sample case. In energy data above, we can construct a 95% confidence interval for the difference in the true means between obese and lean. = 9, = 13 and X 1 X 2 = We ll use the conservative estimate for ν = min(9 1, 13 1) = 8. SE X1 X 2 = So our M = = Then a (conservative) 95% confidence interval is [0.93,3.53]. Computer output for Welsh s formula gives [1.00,3.46] > mean(obese)-mean(lean) [1] > qt(.9725,df=8) [1] > sqrt(var(obese)/length(obese)+var(lean)/length(lean)) [1] > t.test(obese,lean, conf.level=.95) Welch Two Sample t-test data: obese and lean t = , df = , p-value = alternative hypothesis: true difference in means is not equal to 0 + 1
8 95 percent confidence interval: sample estimates: mean of x mean of y Hypothesis Tests We will generally have some hypotheses about certain parameters of the population (or populations) from which our data arose, and we will be interested in using our data to see whether these hypotheses are consistent with what we have observed. To do this, we have already calculated confidence intervals for them, now we will be conducting hypothesis tests about the populations parameters of interest. We will discuss these two statistical procedures, in general they are built on the idea that if some theory about the population parameters is true, the observed data should follow, admittedly random, but generally predictable patterns. Thus, if the data do not fall within the likely outcomes under our supposed ideas about the population, we will tend to disbelieve these ideas, as the data do not strongly support them. We will initially be interested in using our data to make inferences about µ, the population mean. To do this, we will use our estimate of location from the data; namely, the sample mean (average) (since it is mathematically nicer than the median). We will do this in the framework of several different data structures, starting with the most basic, the one-sample situation. How can we decide if a given set of data, and in particular its sample mean, is close enough to a hypothesized value for µ for us to believe that the data are consistent with this value? In order to answer such a question, we need to know how a statistic like the sample average behaves, i.e. its distribution. Now we have already studied the distribution of the sample average and the sample proportions, when the sample size is large enough, they follow Normal distributions, centered at the expected value and with a spread of the order the relevant SE. INFERENCE FOR A SINGLE SAMPLE: Z-DISTRIBUTION Standard Error of the Sample Mean (σ known) Example: Testing whether the birthweights of the secher babies have above average mean. Variance of the original population σ=700 Known. We would like to test whether µ = 2500, versus the alternative µ > We have a sample of n = 107 observations. mean(bwt) gives that X = 2739, we would like to use this data to test µ > We have a sample of size 107, we know that X will be normal with variance σ2 = 7002 = If it is true that µ=2500 (this is called the null hypothesis), then under the central limit theorem, X N (2500, /107) = N (2500, ) and under the null hypothesis P ( X 2739) = P ( X µ σ n ) = P (Z 3.53) 67.7 What is the probability that a standard normal Z score is as big as 3.53? P (Z > 3.53) = 1 P (Z 3.53) = 1 Φ(3.53) =
9 using the R command pnorm(3.53) which returns [1] This is indeed very small, too small to be true. We reject the null hypothesis. Let X 1,..., X n be a sample of n i.i.d. random variables from a distribution having unknown mean µ, and known standard deviation σ. Assume n is large, say n > 30. Suppose interest centers on testing the hypothesis H 0 : µ = µ 0, where µ 0 is some fixed, pre-specified value. This will be our null hypothesis, notice that it is a simple one, i.e. it postulates a single hypothesized value for µ. The hypothesis against which the null hypothesis is to be compared, the alternative hypothesis, can take one of three basic forms: 1. H A : µ µ 0 2. H A : µ > µ 0 3. H A : µ < µ 0 The idea, as we have said, is to assess whether the data supports the null hypothesis (H 0 ) or whether it suggests the relevant alternative (H A ). To begin, we assert that the null hypothesis is true (i.e. that the true value of µ is actually µ 0 ). Under this assumption, the Central Limit Theorem implies that the test statistic Z = X µ 0 σ/ n, has a standard normal (N(0, 1)) distribution (notice that the test statistic is just the standardized version of X under the assumption that the true mean is actually equal to µ 0 ). The usual convention applies that if σ is unknown, and n is large then the sample standard deviation, s, is used in place of σ in forming the test statistic. The null hypothesis is supported if the observed value of the test statistic is small (i.e. X is close enough to µ 0, the hypothesized value, so that I would believe that the true mean is µ 0 ). On the other hand, if I observe a large value of the test statistic, this suggests that X is far from µ 0, which tends to discredit the null hypothesis in favor of the alternative hypothesis H A : µ µ 0. The real issue is how large is large? (or small is small?). For example, if I observe a Z value of 1, say, can we conclude in favor of H 0 over H A, or should we prefer H A over H 0. What about a Z value of 2? The answer to these question lies in considering what the test statistic actually measures. In words, the observed value of Z is just the number of standard errors the observed sample mean is from the hypothesized population mean; i.e. Z obs = number of standard errors X is away from µ0 This is determined by how rare a rare event should be to make us think soemthing else than H 0 is going on. This determines what we call the significance level α, most often α is taken to be 5%, sometimes 10%, and sometimes even.1 % (1/1000). We compute the P-value which the probability of observing a value as extreme as this. The P-value computation either takes P ( Z > Z obs ),P (Z > Z obs ) or P (Z < Z obs ) depending on what the alternative H A was. 9
Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.
Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,
More informationInference for Single Proportions and Means T.Scofield
Inference for Single Proportions and Means TScofield Confidence Intervals for Single Proportions and Means A CI gives upper and lower bounds between which we hope to capture the (fixed) population parameter
More informationChapter 23. Inference About Means
Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that
More informationStat 427/527: Advanced Data Analysis I
Stat 427/527: Advanced Data Analysis I Review of Chapters 1-4 Sep, 2017 1 / 18 Concepts you need to know/interpret Numerical summaries: measures of center (mean, median, mode) measures of spread (sample
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More informationNull Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017
Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2017 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test statistic f (x H 0
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationLab #12: Exam 3 Review Key
Psychological Statistics Practice Lab#1 Dr. M. Plonsky Page 1 of 7 Lab #1: Exam 3 Review Key 1) a. Probability - Refers to the likelihood that an event will occur. Ranges from 0 to 1. b. Sampling Distribution
More informationSwarthmore Honors Exam 2012: Statistics
Swarthmore Honors Exam 2012: Statistics 1 Swarthmore Honors Exam 2012: Statistics John W. Emerson, Yale University NAME: Instructions: This is a closed-book three-hour exam having six questions. You may
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationCh. 7. One sample hypothesis tests for µ and σ
Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math
More informationProbability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!
Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive
More informationThe Student s t Distribution
The Student s t Distribution What do we do if (a) we don t know σ and (b) n is small? If the population of interest is normally distributed, we can use the Student s t-distribution in place of the standard
More informationChapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.
Chapter 24 Comparing Means Copyright 2010 Pearson Education, Inc. Plot the Data The natural display for comparing two groups is boxplots of the data for the two groups, placed side-by-side. For example:
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More informationAdvanced Experimental Design
Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals
More informationQuestions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.
Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized
More informationGov 2000: 6. Hypothesis Testing
Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.
More informationECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12
ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 Winter 2012 Lecture 13 (Winter 2011) Estimation Lecture 13 1 / 33 Review of Main Concepts Sampling Distribution of Sample Mean
More informationNull Hypothesis Significance Testing p-values, significance level, power, t-tests
Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2014 January 1, 2017 1 /22 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationSTAT 135 Lab 5 Bootstrapping and Hypothesis Testing
STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,
More informationOne-sample categorical data: approximate inference
One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution
More informationGov Univariate Inference II: Interval Estimation and Testing
Gov 2000-5. Univariate Inference II: Interval Estimation and Testing Matthew Blackwell October 13, 2015 1 / 68 Large Sample Confidence Intervals Confidence Intervals Example Hypothesis Tests Hypothesis
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationChapter 23: Inferences About Means
Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population
More informationRegression with a Single Regressor: Hypothesis Tests and Confidence Intervals
Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression
More informationCh. 7: Estimates and Sample Sizes
Ch. 7: Estimates and Sample Sizes Section Title Notes Pages Introduction to the Chapter 2 2 Estimating p in the Binomial Distribution 2 5 3 Estimating a Population Mean: Sigma Known 6 9 4 Estimating a
More informationThe t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18
and t-tests Patrick Breheny October 15 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 Introduction The t-test As we discussed previously, W.S. Gossett derived the t-distribution as a way of
More information1 Statistical inference for a population mean
1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known
More informationStatistical Inference
Statistical Inference Bernhard Klingenberg Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Outline Estimation: Review of concepts
More informationSTA Module 10 Comparing Two Proportions
STA 2023 Module 10 Comparing Two Proportions Learning Objectives Upon completing this module, you should be able to: 1. Perform large-sample inferences (hypothesis test and confidence intervals) to compare
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationEstimating a population mean
Introductory Statistics Lectures Estimating a population mean Confidence intervals for means Department of Mathematics Pima Community College Redistribution of this material is prohibited without written
More informationLAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2
LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2 Data Analysis: The mean egg masses (g) of the two different types of eggs may be exactly the same, in which case you may be tempted to accept
More informationSampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =
2. The distribution of t values that would be obtained if a value of t were calculated for each sample mean for all possible random of a given size from a population _ t ratio: (X - µ hyp ) t s x The result
More informationChapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups
Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups Sections 10-1 & 10-2 Independent Groups It is common to compare two groups, and do a hypothesis
More informationPurposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions
Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution
More informationInterval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean
Interval estimation October 3, 2018 STAT 151 Class 7 Slide 1 Pandemic data Treatment outcome, X, from n = 100 patients in a pandemic: 1 = recovered and 0 = not recovered 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0 0
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10. Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning
More informationCHAPTER 10 Comparing Two Populations or Groups
CHAPTER 10 Comparing Two Populations or Groups 10.2 Comparing Two Means The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers Comparing Two Means Learning
More informationThe Components of a Statistical Hypothesis Testing Problem
Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one
More informationTopic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing. Road Map Sampling Distributions, Confidence Intervals & Hypothesis Testing
Topic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing ECO22Y5Y: Quantitative Methods in Economics Dr. Nick Zammit University of Toronto Department of Economics Room KN3272 n.zammit
More informationTwo Sample Problems. Two sample problems
Two Sample Problems Two sample problems The goal of inference is to compare the responses in two groups. Each group is a sample from a different population. The responses in each group are independent
More informationComparing Means from Two-Sample
Comparing Means from Two-Sample Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu April 3, 2015 Kwonsang Lee STAT111 April 3, 2015 1 / 22 Inference from One-Sample We have two options to
More informationSTAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)
STAT 515 -- Chapter 9: Two-Sample Problems Paired Differences (Section 9.3) Examples of Paired Differences studies: Similar subjects are paired off and one of two treatments is given to each subject in
More informationNote that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).
Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something
More informationIntroduction to Econometrics. Review of Probability & Statistics
1 Introduction to Econometrics Review of Probability & Statistics Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com Introduction 2 What is Econometrics? Econometrics consists of the application of mathematical
More informationStatistics for IT Managers
Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample
More informationBusiness Statistics. Lecture 5: Confidence Intervals
Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean
More informationMAT2377. Rafa l Kulik. Version 2015/November/23. Rafa l Kulik
MAT2377 Rafa l Kulik Version 2015/November/23 Rafa l Kulik Rafa l Kulik 1 Rafa l Kulik 2 Rafa l Kulik 3 Rafa l Kulik 4 The Z-test Test on the mean of a normal distribution, σ known Suppose X 1,..., X n
More informationLecture 18: Simple Linear Regression
Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength
More informationTwo sample Hypothesis tests in R.
Example. (Dependent samples) Two sample Hypothesis tests in R. A Calculus professor gives their students a 10 question algebra pretest on the first day of class, and a similar test towards the end of the
More informationHYPOTHESIS TESTING. Hypothesis Testing
MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.
More informationMathematics for Economics MA course
Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between
More informationTwo-sample inference: Continuous Data
Two-sample inference: Continuous Data November 5 Diarrhea Diarrhea is a major health problem for babies, especially in underdeveloped countries Diarrhea leads to dehydration, which results in millions
More informationFirst we look at some terms to be used in this section.
8 Hypothesis Testing 8.1 Introduction MATH1015 Biostatistics Week 8 In Chapter 7, we ve studied the estimation of parameters, point or interval estimates. The construction of CI relies on the sampling
More informationEpidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval
Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in
More informationM(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1
Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)
More informationData analysis and Geostatistics - lecture VI
Data analysis and Geostatistics - lecture VI Statistical testing with population distributions Statistical testing - the steps 1. Define a hypothesis to test in statistics only a hypothesis rejection is
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor
More informationTwo-sample inference: Continuous data
Two-sample inference: Continuous data Patrick Breheny April 6 Patrick Breheny University of Iowa to Biostatistics (BIOS 4120) 1 / 36 Our next several lectures will deal with two-sample inference for continuous
More informationStatistical Foundations:
Statistical Foundations: t distributions, t-tests tests Psychology 790 Lecture #12 10/03/2006 Today sclass The t-distribution t ib ti in its full glory. Why we use it for nearly everything. Confidence
More informationOn Assumptions. On Assumptions
On Assumptions An overview Normality Independence Detection Stem-and-leaf plot Study design Normal scores plot Correction Transformation More complex models Nonparametric procedure e.g. time series Robustness
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationStat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment
Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem Reading: 2.4 2.6. Motivation: Designing a new silver coins experiment Sample size calculations Margin of error for the pooled two sample
More informationInference for Distributions Inference for the Mean of a Population
Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationLecture 10A: Chapter 8, Section 1 Sampling Distributions: Proportions
Lecture 10A: Chapter 8, Section 1 Sampling Distributions: Proportions Typical Inference Problem Definition of Sampling Distribution 3 Approaches to Understanding Sampling Dist. Applying 68-95-99.7 Rule
More informationChapter 7: Statistical Inference (Two Samples)
Chapter 7: Statistical Inference (Two Samples) Shiwen Shen University of South Carolina 2016 Fall Section 003 1 / 41 Motivation of Inference on Two Samples Until now we have been mainly interested in a
More informationHypothesis Testing One Sample Tests
STATISTICS Lecture no. 13 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 12. 1. 2010 Tests on Mean of a Normal distribution Tests on Variance of a Normal
More informationComparing two independent samples
In many applications it is necessary to compare two competing methods (for example, to compare treatment effects of a standard drug and an experimental drug). To compare two methods from statistical point
More information7 Estimation. 7.1 Population and Sample (P.91-92)
7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for
More informationLast week: Sample, population and sampling distributions finished with estimation & confidence intervals
Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last week: Sample, population and sampling
More informationConfidence Intervals. - simply, an interval for which we have a certain confidence.
Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something
More informationSTA Module 11 Inferences for Two Population Means
STA 2023 Module 11 Inferences for Two Population Means Learning Objectives Upon completing this module, you should be able to: 1. Perform inferences based on independent simple random samples to compare
More informationSTA Rev. F Learning Objectives. Two Population Means. Module 11 Inferences for Two Population Means
STA 2023 Module 11 Inferences for Two Population Means Learning Objectives Upon completing this module, you should be able to: 1. Perform inferences based on independent simple random samples to compare
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationBINF 702 SPRING Chapter 8 Hypothesis Testing: Two-Sample Inference. BINF702 SPRING 2014 Chapter 8 Hypothesis Testing: Two- Sample Inference 1
BINF 702 SPRING 2014 Chapter 8 Hypothesis Testing: Two-Sample Inference Two- Sample Inference 1 A Poster Child for two-sample hypothesis testing Ex 8.1 Obstetrics In the birthweight data in Example 7.2,
More informationHypothesis Testing in Action: t-tests
Hypothesis Testing in Action: t-tests Mark Muldoon School of Mathematics, University of Manchester Mark Muldoon, January 30, 2007 t-testing - p. 1/31 Overview large Computing t for two : reprise Today
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS 1a) The model is cw i = β 0 + β 1 el i + ɛ i, where cw i is the weight of the ith chick, el i the length of the egg from which it hatched, and ɛ i
More informationLast two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals
Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last two weeks: Sample, population and sampling
More informationLecture 12: Small Sample Intervals Based on a Normal Population Distribution
Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)
More informationStatistical inference provides methods for drawing conclusions about a population from sample data.
Introduction to inference Confidence Intervals Statistical inference provides methods for drawing conclusions about a population from sample data. 10.1 Estimating with confidence SAT σ = 100 n = 500 µ
More informationMath 143: Introduction to Biostatistics
Math 143: Introduction to Biostatistics R Pruim Spring 2012 0.2 Last Modified: April 2, 2012 Math 143 : Spring 2012 : Pruim Contents 10 More About Random Variables 1 10.1 The Mean of a Random Variable....................................
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationOutline. Unit 3: Inferential Statistics for Continuous Data. Outline. Inferential statistics for continuous data. Inferential statistics Preliminaries
Unit 3: Inferential Statistics for Continuous Data Statistics for Linguists with R A SIGIL Course Designed by Marco Baroni 1 and Stefan Evert 1 Center for Mind/Brain Sciences (CIMeC) University of Trento,
More informationT test for two Independent Samples. Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016
T test for two Independent Samples Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016 Q1. The mean serum creatinine level is measured in 36 patients after they received
More informationChapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions
Chapter 9 Inferences from Two Samples 9. Inferences About Two Proportions 9.3 Inferences About Two s (Independent) 9.4 Inferences About Two s (Matched Pairs) 9.5 Comparing Variation in Two Samples Objective
More informationSTAT 4385 Topic 01: Introduction & Review
STAT 4385 Topic 01: Introduction & Review Xiaogang Su, Ph.D. Department of Mathematical Science University of Texas at El Paso xsu@utep.edu Spring, 2016 Outline Welcome What is Regression Analysis? Basics
More informationMathematical statistics
November 15 th, 2018 Lecture 21: The two-sample t-test Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation
More informationChapter 24. Comparing Means
Chapter 4 Comparing Means!1 /34 Homework p579, 5, 7, 8, 10, 11, 17, 31, 3! /34 !3 /34 Objective Students test null and alternate hypothesis about two!4 /34 Plot the Data The intuitive display for comparing
More informationChapter 8: Confidence Intervals
Chapter 8: Confidence Intervals Introduction Suppose you are trying to determine the mean rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write
More informationHypothesis Testing in Action
Hypothesis Testing in Action Jonathan Bagley School of Mathematics, University of Manchester Jonathan Bagley, September 23, 2005 The t-tests - p. 1/23 Overview Today we ll examine three data sets and use
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS 1a. Under the null hypothesis X has the binomial (100,.5) distribution with E(X) = 50 and SE(X) = 5. So P ( X 50 > 10) is (approximately) two tails
More information