Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal.

Size: px
Start display at page:

Download "Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal."

Transcription

1 What we will do today s David Meredith Department of Mathematics San Francisco State University October 22, 2009 s 1 2 s 3 What is a? Decision support Political decisions s s Goal of statistics: optimize decisions with partial information Example: political party wants to appeal to voters Cannot ask every voter what they think So ask a sample Statistics tells us how closely sample might equal population

2 Medical innovations Using sample statistics to estimate population parameters s s Example: medical researchers think they have effective drug Conduct careful test with randomization of patients, placebos, etc. Statistics helps plan test, decide number of subjects, etc. Statistics tells us if test provides good evidence of drug s effectiveness Population parameter: percentage of all voters who favor a single-payer health-care system. Sample statistic: percentage of sample of voters who favor a single-payer health-care system. The sample percentage is a point estimate for the population percentage Any sample statistic is a point estimate for the corresponding population parameter Example point estimates The big question s If we measure heights, the mean height of a sample is a point estimate for the mean height of the population The standard deviation of the sample heights is a point estimate for the standard deviation of the heights in the population The proportion of people favoring single-payer in a sample of voters is a point estimate for the proportion of people favoring single-payer in the population s The important questions is: how accurate is a point estimate? The rest of this course will study the question: how accurate? The slope of the regression line through a sample of heights and weights is a point estimate for the slope of the regression line through the heights and weights of the entire population

3 Three answers for quantitative variable Three answers for categorical variable s s Question: how effective is this medicine? Population: sick people; variable: blood pressure, present or absent (categorical) 1 The medicine lowers blood pressure by about 20 points on average (point estimate) 2 I m highly confident that the medicine lowers blood pressure by between 15 and 25 points on average (confidence interval) 3 I m almost certain the medicine lowers blood pressure (hypothesis test) Question: how effective is this medicine? Population: sick people; variable: disease, present or absent (categorical) 1 The medicine cures about 80% of cases (point estimate) 2 I m highly confident that the medicine cures between 70% and 90% of cases (confidence interval) 3 I m almost certain the medicine cures some people (hypothesis test) Confidence intervals Confidence intervals s Suppose you want to estimate some parameter for a population, like height, percentage type A blood, percentage of Republicans, average lifespan, etc. A sample will give you a point estimate p for the parameter. We do not expect p to be exactly correct. A confidence interval is an interval (a, b) that you are somewhat confident contains the true population parameter. We are somewhat confident that a < π < b where π is the true parameter s Your point estimate p is right in the middle of (a, b). p = a + b 2 a = p e b = p + e You need to learn how to compute the margin of error e for different levels of confidence.

4 Levels of confidence The key idea s What does it mean to be % confident, 95% confident You are % confident if you think there is a % chance that you are correct (not good). You are 95% confident if you think there is a 95% chance that you are correct (pretty high). In real life you are seldom certain Statistics is a rigorous way of dealing with uncertainty s To calculate a 95% confidence interval for a sample with mean m, standard deviation s and size n: Pretend your sample represents the population perfectly Let x be the sampling distribution for your variable and sample size Sample size is n. Mean of x is m. Standard deviation of x is s n. Find the central range that contains 95% of all possible samples Solve P( x < a) = and P( x < b) = Then P(a < x < b) = P( x < b) P( x < a) = = 0.95 (a, b) is your 95% confidence interval Example 1 Example 2 s Suppose we wanted a 95% confidence interval for male heights, and we took a random sample of men. The sample average height was 69.2" with a sample standard deviation of 3.1". Related problem: population average is 69.2" and population standard deviation is 3.1". Let x be sampling distribution for heights with sample size. Mean of x is 69.2, standard deviation of x is 3.1. We will find a and b such that P(H < a) = and P(H < b) = Then P(a < H < b) = P(H < b) P(H < a) = = 0.95 a = qnorm(0.025, 69.2, 3.1/sqrt()) = b = qnorm(0.975, 69.2, 3.1/sqrt()) = 70.06) 95% confidence interval is (68.34, 70.06). s Suppose we wanted a 90% confidence interval for male heights, and we took a random sample of men. The sample average height was 69.2" with a sample standard deviation of 3.1". Related problem: population average is 69.2" and population standard deviation is 3.1". Let x be sampling distribution for heights with sample size. Mean of x is 69.2, standard deviation of x is 3.1. We will find a and b such that P(H < a) = 0.05 and P(H < b) = Then P(a < H < b) = P(H < b) P(H < a) = = 0.90 a = qnorm(c(0.05,0.95), 69.2, 3.1/sqrt()) = 68.47, % confidence interval is (68.47, 69.92).

5 s Compare confidence intervals Two confidence intervals for heights; 95% confidence interval: (68.34, 70.06) 90% confidence interval: (68.47, 69.92) Sample average 69.2" is right in the middle of both. 95% margin of error: e = = = 0.86 s Convenient approximation for 95% CI If your sample has mean m and standard deviation s, an approximation frequently used for the 95% confidence interval is (m 2 s n, m + 2 s n ) 90% margin of error: e = = = % interval is wider, because we are more confident of a less precise statement. We are less confident of a more precise statement. In ( previous example, that would ) be , = (68.32, 70.08) Actual answer was (68.34, 70.06). Sample sizes Sample sizes s Picking sample size is an important question for researchers at the beginning of a project Too small a sample, and your research might not be significant. Too big a sample, and your research might be too expensive. s Suppose you wanted to estimate a quantitative variable like men s heights with a margin of error of e = 0.1 with 95% confidence. How big a sample do you need? Let n be the sample size and s standard deviation of the sample we will measure. We have to guess or estimate s to find n. We guess s = 3. Sometimes researchers do a small study to estimate s with a small (cheap) sample. Let solve P(Z < z) = z = qnorm(0.975,0,1) = 1.96 ( sz ) 2 Then n = = e Minimal sample size is 3458.

6 Sample sizes, the sequel The key idea s Suppose you wanted to estimate a quantitative variable like men s heights with a margin of error of e = 0.2 with 90% confidence. How big a sample do you need? Let n be the sample size and s standard deviation of the sample we will measure. Assume s = 3. Let solve P(Z < z) = 0.95 z = qnorm(0.95,0,1) = 1.64 ( sz ) 2 Then n = = e Minimal sample size is 609. s To calculate a 95% confidence interval for a sample with proportion p and size n: Pretend your sample represents the population perfectly Let x be the sampling distribution for your variable and sample size Sample size is n. Proportion of x is p. p(1 p) Standard deviation of x is. n Find the central range that contains 95% of all possible samples Solve P( x < a) = and P( x < b) = T (a, b) is your 95% confidence interval s Example 1 Suppose we wanted a 95% confidence interval for voters preferences, and we took a random sample of voters. 61% wanted congress to pass a health plan (CBS radio news, Sunday, October 18, 2009). Related problem: population proportion is Let ˆp be sampling distribution for voter preferences with sample size. Mean of ˆp is 0.61, standard deviation of ˆp is = We will find a and b such that P(ˆp < a) = and P(ˆp < b) = Then P(a < ˆp < b) = P(ˆp < b) P(ˆp < a) = = 0.95 a = qnorm(0.025, 0.61, sqrt(.61*.39/)) = 0.47 b = qnorm(0.975, 0.61, sqrt(.61*.39/)) = 0.75) 95% confidence interval is (0.47, 0.75). s Example 2 Suppose we wanted a 90% confidence interval for voters preferences, and we took a random sample of voters. 61% wanted congress to pass a health plan. Related problem: population proportion is Let ˆp be sampling distribution for voter preferences with sample size. Mean of ˆp is 0.61, standard deviation of ˆp is = We will find a and b such that P(ˆp < a) = 0.05 and P(ˆp < b) = Then P(a < ˆp < b) = P(ˆp < b) P(ˆp < a) = = 0.90 a = qnorm(0.05, 0.61, sqrt(.61*.39/)) = 0. b = qnorm(0.95, 0.61, sqrt(.61*.39/)) = 0.72) 95% confidence interval is (0., 0.72).

7 s Compare confidence intervals Two confidence intervals for heights; 95% confidence interval: (0.47, 0.75) 90% confidence interval: (0., 0.72) Sample proportion 0.61 is right in the middle of both. 95% margin of error: e = = = 0.14 s Convenient approximation for 95% CI If your sample has proportion p an approximation frequently used for the 95% confidence interval is ( ) p(1 p p(1 p p 2, p + 2 n n 90% margin of error: e = = = % interval is wider, because we are more confident of a less precise statement. We are less confident of a more precise statement. In ( previous example, that would be , (0.47, 0.75) Actual answer was the same ) = Sample sizes Sample sizes s Suppose you wanted to estimate a categorical variable like voter preferences with a margin of error of e = 0.03 with 95% confidence. How big a sample do you need? Let n be the sample size Let solve P(Z < z) = z = qnorm(0.975,0,1) = 1.96 ( z ) 2 Then n = = e Minimal sample size is Statisticians often replace 1.96 with 2 and calculate the sample size for a 95% CI with margin of error e to be n = 1 e 2. s Suppose you wanted to estimate a categorical variable like voter preferences with a margin of error of e = 0.03 with 90% confidence. How big a sample do you need? Let n be the sample size Let solve P(Z < z) = 0.95 z = qnorm(0.95,0,1) = 1.64 ( z ) 2 Then n = = e Minimal sample size is 748. Margin of error 3% requires sample size =

8 Why lecture better than textbook s Text often assumes you know standard deviation of population, but only mean of sample. Lecture used both mean and standard deviation derived from sample. More realistic.

One-sample categorical data: approximate inference

One-sample categorical data: approximate inference One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Lecture #16 Thursday, October 13, 2016 Textbook: Sections 9.3, 9.4, 10.1, 10.2

Lecture #16 Thursday, October 13, 2016 Textbook: Sections 9.3, 9.4, 10.1, 10.2 STATISTICS 200 Lecture #16 Thursday, October 13, 2016 Textbook: Sections 9.3, 9.4, 10.1, 10.2 Objectives: Define standard error, relate it to both standard deviation and sampling distribution ideas. Describe

More information

Inferences About Two Population Proportions

Inferences About Two Population Proportions Inferences About Two Population Proportions MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2018 Background Recall: for a single population the sampling proportion

More information

10.1. Comparing Two Proportions. Section 10.1

10.1. Comparing Two Proportions. Section 10.1 /6/04 0. Comparing Two Proportions Sectio0. Comparing Two Proportions After this section, you should be able to DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET

More information

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12 Winter 2012 Lecture 13 (Winter 2011) Estimation Lecture 13 1 / 33 Review of Main Concepts Sampling Distribution of Sample Mean

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 26 (MWF) Tests and CI based on two proportions Suhasini Subba Rao Comparing proportions in

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

Difference Between Pair Differences v. 2 Samples

Difference Between Pair Differences v. 2 Samples 1 Sectio1.1 Comparing Two Proportions Learning Objectives After this section, you should be able to DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence

More information

Lecture 11 - Tests of Proportions

Lecture 11 - Tests of Proportions Lecture 11 - Tests of Proportions Statistics 102 Colin Rundel February 27, 2013 Research Project Research Project Proposal - Due Friday March 29th at 5 pm Introduction, Data Plan Data Project - Due Friday,

More information

Lecture 7: Confidence interval and Normal approximation

Lecture 7: Confidence interval and Normal approximation Lecture 7: Confidence interval and Normal approximation 26th of November 2015 Confidence interval 26th of November 2015 1 / 23 Random sample and uncertainty Example: we aim at estimating the average height

More information

Categorical Data Analysis 1

Categorical Data Analysis 1 Categorical Data Analysis 1 STA 312: Fall 2012 1 See last slide for copyright information. 1 / 1 Variables and Cases There are n cases (people, rats, factories, wolf packs) in a data set. A variable is

More information

Lecture 10: Comparing two populations: proportions

Lecture 10: Comparing two populations: proportions Lecture 10: Comparing two populations: proportions Problem: Compare two sets of sample data: e.g. is the proportion of As in this semester 152 the same as last Fall? Methods: Extend the methods introduced

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.

where Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc. Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter

More information

Bayesian decision theory. Nuno Vasconcelos ECE Department, UCSD

Bayesian decision theory. Nuno Vasconcelos ECE Department, UCSD Bayesian decision theory Nuno Vasconcelos ECE Department, UCSD Notation the notation in DHS is quite sloppy e.g. show that ( error = ( error z ( z dz really not clear what this means we will use the following

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Lecture 8 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. 5.1 Models of Random Behavior Outcome: Result or answer

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

Last few slides from last time

Last few slides from last time Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an

More information

MA : Introductory Probability

MA : Introductory Probability MA 320-001: Introductory Probability David Murrugarra Department of Mathematics, University of Kentucky http://www.math.uky.edu/~dmu228/ma320/ Spring 2017 David Murrugarra (University of Kentucky) MA 320:

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

STA Module 10 Comparing Two Proportions

STA Module 10 Comparing Two Proportions STA 2023 Module 10 Comparing Two Proportions Learning Objectives Upon completing this module, you should be able to: 1. Perform large-sample inferences (hypothesis test and confidence intervals) to compare

More information

ST 305: Final Exam ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) ( ) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ Y. σ X. σ n.

ST 305: Final Exam ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) ( ) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ Y. σ X. σ n. ST 305: Final Exam By handing in this completed exam, I state that I have neither given nor received assistance from another person during the exam period. I have not copied from another person s paper.

More information

DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence interval to compare two proportions.

DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence interval to compare two proportions. Section 0. Comparing Two Proportions Learning Objectives After this section, you should be able to DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence

More information

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module

More information

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions ACMS 20340 Statistics for Life Sciences Chapter 13: Sampling Distributions Sampling We use information from a sample to infer something about a population. When using random samples and randomized experiments,

More information

Comparing p s Dr. Don Edwards notes (slightly edited and augmented) The Odds for Success

Comparing p s Dr. Don Edwards notes (slightly edited and augmented) The Odds for Success Comparing p s Dr. Don Edwards notes (slightly edited and augmented) The Odds for Success When the experiment consists of a series of n independent trials, and each trial may end in either success or failure,

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics 5. Models of Random Behavior Math 40 Introductory Statistics Professor Silvia Fernández Chapter 5 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Outcome: Result or answer

More information

Stat 101: Lecture 12. Summer 2006

Stat 101: Lecture 12. Summer 2006 Stat 101: Lecture 12 Summer 2006 Outline Answer Questions More on the CLT The Finite Population Correction Factor Confidence Intervals Problems More on the CLT Recall the Central Limit Theorem for averages:

More information

Chapter 10: Comparing Two Populations or Groups

Chapter 10: Comparing Two Populations or Groups Chapter 10: Comparing Two Populations or Groups Sectio0.1 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 10 Comparing Two Populations or Groups 10.1 10.2 Comparing Two Means

More information

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1

More information

a Sample By:Dr.Hoseyn Falahzadeh 1

a Sample By:Dr.Hoseyn Falahzadeh 1 In the name of God Determining ee the esize eof a Sample By:Dr.Hoseyn Falahzadeh 1 Sample Accuracy Sample accuracy: refers to how close a random sample s statistic is to the true population s value it

More information

Midterm 1 and 2 results

Midterm 1 and 2 results Midterm 1 and 2 results Midterm 1 Midterm 2 ------------------------------ Min. :40.00 Min. : 20.0 1st Qu.:60.00 1st Qu.:60.00 Median :75.00 Median :70.0 Mean :71.97 Mean :69.77 3rd Qu.:85.00 3rd Qu.:85.0

More information

Chapter 10 Regression Analysis

Chapter 10 Regression Analysis Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017 Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2017 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test statistic f (x H 0

More information

ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12,

ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12, ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12, 12.7-12.9 Winter 2012 Lecture 15 (Winter 2011) Estimation Lecture 15 1 / 25 Linking Two Approaches to Hypothesis Testing

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 4 Problems with small populations 9 II. Why Random Sampling is Important 10 A myth,

More information

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems

More information

Are data normally normally distributed?

Are data normally normally distributed? Standard Normal Image source Are data normally normally distributed? Sample mean: 66.78 Sample standard deviation: 3.37 (66.78-1 x 3.37, 66.78 + 1 x 3.37) (66.78-2 x 3.37, 66.78 + 2 x 3.37) (66.78-3 x

More information

Inference for Single Proportions and Means T.Scofield

Inference for Single Proportions and Means T.Scofield Inference for Single Proportions and Means TScofield Confidence Intervals for Single Proportions and Means A CI gives upper and lower bounds between which we hope to capture the (fixed) population parameter

More information

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies

More information

Estimation and Confidence Intervals

Estimation and Confidence Intervals Estimation and Confidence Intervals Sections 7.1-7.3 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 17-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis

More information

POLI 443 Applied Political Research

POLI 443 Applied Political Research POLI 443 Applied Political Research Session 6: Tests of Hypotheses Contingency Analysis Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College

More information

Inference for Proportions

Inference for Proportions Inference for Proportions Marc H. Mehlman marcmehlman@yahoo.com University of New Haven Based on Rare Event Rule: rare events happen but not to me. (University of New Haven) Inference for Proportions 1

More information

Probability and Statistics

Probability and Statistics Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT

More information

MATH220 Test 2 Fall Name. Section

MATH220 Test 2 Fall Name. Section MATH220 Test 2 Fall 2014 Name Section This test has problems which are worth 100 points Show your steps in each problem to receive full or partial credit Note only writing down the final answer without

More information

Inference for Proportions

Inference for Proportions Inference for Proportions Marc H. Mehlman marcmehlman@yahoo.com University of New Haven Based on Rare Event Rule: rare events happen but not to me. Marc Mehlman (University of New Haven) Inference for

More information

Confidence Intervals for the Mean of Non-normal Data Class 23, Jeremy Orloff and Jonathan Bloom

Confidence Intervals for the Mean of Non-normal Data Class 23, Jeremy Orloff and Jonathan Bloom Confidence Intervals for the Mean of Non-normal Data Class 23, 8.05 Jeremy Orloff and Jonathan Bloom Learning Goals. Be able to derive the formula for conservative normal confidence intervals for the proportion

More information

Chapter 10: Comparing Two Populations or Groups

Chapter 10: Comparing Two Populations or Groups Chapter 10: Comparing Two Populations or Groups Sectio0.1 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 10 Comparing Two Populations or Groups 10.1 10.2 Comparing Two Means

More information

Categorical Data Analysis. The data are often just counts of how many things each category has.

Categorical Data Analysis. The data are often just counts of how many things each category has. Categorical Data Analysis So far we ve been looking at continuous data arranged into one or two groups, where each group has more than one observation. E.g., a series of measurements on one or two things.

More information

Chapters 3.2 Discrete distributions

Chapters 3.2 Discrete distributions Chapters 3.2 Discrete distributions In this section we study several discrete distributions and their properties. Here are a few, classified by their support S X. There are of course many, many more. For

More information

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology

More information

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li)

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Department of Educational Psychology University of Illinois at Urbana-Champaign 1 Inferential

More information

Chapter 10: Comparing Two Populations or Groups

Chapter 10: Comparing Two Populations or Groups Chapter 10: Comparing Two Populations or Groups Sectio0.1 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 10 Comparing Two Populations or Groups 10.1 10. Comparing Two Means

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Gov 2000: 6. Hypothesis Testing

Gov 2000: 6. Hypothesis Testing Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.

More information

Central Limit Theorem Confidence Intervals Worked example #6. July 24, 2017

Central Limit Theorem Confidence Intervals Worked example #6. July 24, 2017 Central Limit Theorem Confidence Intervals Worked example #6 July 24, 2017 10 8 Raw scores 6 4 Mean=71.4% 2 0 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80 80-90 90+ Scaling is to add 3.6% to bring mean

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Confidence Intervals with σ unknown

Confidence Intervals with σ unknown STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence

More information

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation

More information

Occupy movement - Duke edition. Lecture 14: Large sample inference for proportions. Exploratory analysis. Another poll on the movement

Occupy movement - Duke edition. Lecture 14: Large sample inference for proportions. Exploratory analysis. Another poll on the movement Occupy movement - Duke edition Lecture 14: Large sample inference for proportions Statistics 101 Mine Çetinkaya-Rundel October 20, 2011 On Tuesday we asked you about how closely you re following the news

More information

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 and t-tests Patrick Breheny October 15 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 Introduction The t-test As we discussed previously, W.S. Gossett derived the t-distribution as a way of

More information

Probability and Discrete Distributions

Probability and Discrete Distributions AMS 7L LAB #3 Fall, 2007 Objectives: Probability and Discrete Distributions 1. To explore relative frequency and the Law of Large Numbers 2. To practice the basic rules of probability 3. To work with the

More information

Math 143: Introduction to Biostatistics

Math 143: Introduction to Biostatistics Math 143: Introduction to Biostatistics R Pruim Spring 2012 0.2 Last Modified: April 2, 2012 Math 143 : Spring 2012 : Pruim Contents 10 More About Random Variables 1 10.1 The Mean of a Random Variable....................................

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculations A major responsibility of a statistician: sample size calculation. Hypothesis Testing: compare treatment 1 (new treatment) to treatment 2 (standard treatment); Assume continuous

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 26 (MWF) Tests and CI based on two proportions Suhasini Subba Rao Comparing proportions in

More information

Point Estimation and Confidence Interval

Point Estimation and Confidence Interval Chapter 8 Point Estimation and Confidence Interval 8.1 Point estimator The purpose of point estimation is to use a function of the sample data to estimate the unknown parameter. Definition 8.1 A parameter

More information

Stat 135 Fall 2013 FINAL EXAM December 18, 2013

Stat 135 Fall 2013 FINAL EXAM December 18, 2013 Stat 135 Fall 2013 FINAL EXAM December 18, 2013 Name: Person on right SID: Person on left There will be one, double sided, handwritten, 8.5in x 11in page of notes allowed during the exam. The exam is closed

More information

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression

Acknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 31 (MWF) Review of test for independence and starting with linear regression Suhasini Subba

More information

Null Hypothesis Significance Testing p-values, significance level, power, t-tests

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2014 January 1, 2017 1 /22 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test

More information

Hypothesis for Means and Proportions

Hypothesis for Means and Proportions November 14, 2012 Hypothesis Tests - Basic Ideas Often we are interested not in estimating an unknown parameter but in testing some claim or hypothesis concerning a population. For example we may wish

More information

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!!  (preferred!)!! Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000. Two sided, two sample t-tests. I. Brief review: 1) We are interested in how a sample compares to some pre-conceived notion. For example: a) IQ = 100 b) Average height for men = 5 10. c) Average number

More information

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH)

DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 2003 MOCK EXAMINATIONS IOP 201-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH) DE CHAZAL DU MEE BUSINESS SCHOOL AUGUST 003 MOCK EXAMINATIONS IOP 01-Q (INDUSTRIAL PSYCHOLOGICAL RESEARCH) Time: hours READ THE INSTRUCTIONS BELOW VERY CAREFULLY. Do not open this question paper until

More information

Sample Size Determination

Sample Size Determination Sample Size Determination 018 The number of subjects in a clinical study should always be large enough to provide a reliable answer to the question(s addressed. The sample size is usually determined by

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

Statistical inference

Statistical inference Statistical inference Contents 1. Main definitions 2. Estimation 3. Testing L. Trapani MSc Induction - Statistical inference 1 1 Introduction: definition and preliminary theory In this chapter, we shall

More information

Statistics for Business and Economics

Statistics for Business and Economics Statistics for Business and Economics Chapter 6 Sampling and Sampling Distributions Ch. 6-1 6.1 Tools of Business Statistics n Descriptive statistics n Collecting, presenting, and describing data n Inferential

More information

1 Matched pair comparison(p430-)

1 Matched pair comparison(p430-) [1] ST301(AKI) LEC 25 2010/11/30 ST 301 (AKI) LECTURE #25 1 Matched pair comparison(p430-) This has a quite different assumption (matched pair) from the other three methods. Remember LEC 32 page 1 example:

More information

Confidence intervals CE 311S

Confidence intervals CE 311S CE 311S PREVIEW OF STATISTICS The first part of the class was about probability. P(H) = 0.5 P(T) = 0.5 HTTHHTTTTHHTHTHH If we know how a random process works, what will we see in the field? Preview of

More information

Quantitative Analysis and Empirical Methods

Quantitative Analysis and Empirical Methods Hypothesis testing Sciences Po, Paris, CEE / LIEPP Introduction Hypotheses Procedure of hypothesis testing Two-tailed and one-tailed tests Statistical tests with categorical variables A hypothesis A testable

More information

Pig organ transplants within 5 years

Pig organ transplants within 5 years www.breaking News English.com Ready-to-use ESL / EFL Lessons Pig organ transplants within 5 years URL: http://www.breakingnewsenglish.com/0509/050911-xenotransplant.html Today s contents The Article 2

More information

Probability, Statistics, and Bayes Theorem Session 3

Probability, Statistics, and Bayes Theorem Session 3 Probability, Statistics, and Bayes Theorem Session 3 1 Introduction Now that we know what Bayes Theorem is, we want to explore some of the ways that it can be used in real-life situations. Often the results

More information

Significance Tests. Review Confidence Intervals. The Gauss Model. Genetics

Significance Tests. Review Confidence Intervals. The Gauss Model. Genetics 15.0 Significance Tests Review Confidence Intervals The Gauss Model Genetics Significance Tests 1 15.1 CI Review The general formula for a two-sided C% confidence interval is: L, U = pe ± se cv (1 C)/2

More information

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm

Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam. June 8 th, 2016: 9am to 1pm Name: Biostatistics 1 st year Comprehensive Examination: Applied in-class exam June 8 th, 2016: 9am to 1pm Instructions: 1. This is exam is to be completed independently. Do not discuss your work with

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

and the Sample Mean Random Sample

and the Sample Mean Random Sample MATH 183 Random Samples and the Sample Mean Dr. Neal, WKU Henceforth, we shall assume that we are studying a particular measurement X from a population! for which the mean µ and standard deviation! are

More information

Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11

Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11 Part III Practice Problems Problems Pages 1-4 Answers Page 5 Solutions Pages 6-11 1. In estimating population mean or proportion what is the width of an interval? 2. If 25 college students out of 80 graduate

More information

You may use your calculator and a single page of notes. The room is crowded. Please be careful to look only at your own exam.

You may use your calculator and a single page of notes. The room is crowded. Please be careful to look only at your own exam. LAST NAME (Please Print): FIRST NAME (Please Print): HONOR PLEDGE (Please Sign): Statistics 111 Midterm 1 This is a closed book exam. You may use your calculator and a single page of notes. The room is

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

Probability and Statistics. Terms and concepts

Probability and Statistics. Terms and concepts Probability and Statistics Joyeeta Dutta Moscato June 30, 2014 Terms and concepts Sample vs population Central tendency: Mean, median, mode Variance, standard deviation Normal distribution Cumulative distribution

More information

Math 10 - Compilation of Sample Exam Questions + Answers

Math 10 - Compilation of Sample Exam Questions + Answers Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the

More information

Interactions and Factorial ANOVA

Interactions and Factorial ANOVA Interactions and Factorial ANOVA STA442/2101 F 2017 See last slide for copyright information 1 Interactions Interaction between explanatory variables means It depends. Relationship between one explanatory

More information

Why should you care?? Intellectual curiosity. Gambling. Mathematically the same as the ESP decision problem we discussed in Week 4.

Why should you care?? Intellectual curiosity. Gambling. Mathematically the same as the ESP decision problem we discussed in Week 4. I. Probability basics (Sections 4.1 and 4.2) Flip a fair (probability of HEADS is 1/2) coin ten times. What is the probability of getting exactly 5 HEADS? What is the probability of getting exactly 10

More information