Understanding Inference: Confidence Intervals I. Questions about the Assignment. The Big Picture. Statistic vs. Parameter. Statistic vs.

Size: px
Start display at page:

Download "Understanding Inference: Confidence Intervals I. Questions about the Assignment. The Big Picture. Statistic vs. Parameter. Statistic vs."

Transcription

1 Questions about the Assignment If your answer is wrong, but you show your work you can get more partial credit. Understanding Inference: Confidence Intervals I parameter versus sample statistic Uncertainty in estimates Sampling distribution Confidence interval The Big Picture Statistic vs. Parameter Statistical Inference Sampling Sample A sample statistic is a number computed from sample data. (e.g., sample mean: mean income of the people in the sample) A population parameter is a number that describes some aspect of a population. (e.g., population mean: mean income of the entire population) We usually have a sample statistic and want to make inferences about the population parameter. The Big Picture Statistic vs. Parameter Parameter Statistical Inference Sampling Sample Statistic Mean Sample Statistics Parameters μ (mu) Proportion p Std. Deviation s (sigma) Correlation r ρ (rho) Slope b β (beta) 1

2 Obama s Approval Rating Gallup surveyed 1,500 Americans between June 9 th -11 th 2012 and 49% of these people approved of the job Barack Obama is doing as president. What is the population? ~330million (All Americans) What is the sample size? 1,500 Is this categorical or quantitative variable? Categorical For categorical variables, what sample statistic are we interested in? Sample proportion Sample statistic: (sample proportion) =.49 Based on this sample statistic, what do you think is the true proportion of Americans who approve of the job Barack Obama is doing as president? parameter: (population proportion) p =? Point and Interval Estimates The sample statistic gives a point estimate (a single number) for the population parameter. Usually, it is more useful to provide an interval estimate which gives a range of plausible values for the population parameter: interval estimate = point estimate margin of error How do we determine the margin of error??? Obama s Approval Rating Point Estimate: =.49 Interval Estimate: point estimate margin of error = (0.46, 0.52) Between 46% and 52% of Americans currently approve of the job Obama is doing as president. Important Points The population parameter is a fixed value. Sample statistics vary from sample to sample. They will not match the population parameter exactly. Reese s Pieces What proportion of Reese s pieces are orange? For a given sample statistic, what are plausible values for the population parameter? How much uncertainty surrounds the sample statistic? It depends on how much the sample statistic varies from sample to sample! 2

3 Let s Run Our Own Study When conducting a study, we need to select a sample size. Typically, we take only one sample, but because we re interested in knowing how much our sample statistic varies from sample to sample, we ll take multiple samples. Each person take a random sample of 10 Reese s pieces. Let s Run Our Own Study What is our population? 1,500 What is our sample size? 10 How many samples did we take? 6 What is your sample distribution (i.e. orange vs. not orange)? What is your sample proportion? What is the range of plausible values for the population proportion? What is the mean proportion of the sampling distribution? The sampling distribution will be centered around the true population parameter. Sampling Distribution A sampling distribution is the distribution of sample statistics computed from different samples of the same size taken from the same population. In the Reese s pieces sampling distribution, what does each dot represent? A. One Reese s piece B. One sample statistic The sampling distribution shows us how the sample statistic varies from sample to sample. Sampling Distribution: Shape and Center If samples are randomly selected and the sample size is large enough, the sampling distribution will be symmetric and bell-shaped. centered at the value of the population parameter. The sampling distribution is different from the sample distribution. The sample distribution is the distribution of values for variable x collected from one sample. The sampling distribution is the distribution of sample statistics collected from multiple samples. Sampling Distribution: Spread To assess the accuracy of our point estimate, we need to know how much the sample statistic varies from sample to sample. (i.e., we need to know the spread of the sampling distribution.) In the Reese s pieces sampling distribution we generated, what is the range of plausible values for the population proportion? We use the spread of the sampling distribution to determine the margin of error for a statistic. What is a standardized way to measure the spread of a distribution? Sampling Distribution: Standard Deviation Calculate the standard deviation of the sample statistics in the sampling distribution. s n x 2 i x i 1 n 1 n = the number of samples taken x i = the sample statistic for sample i = mean value for all of the sample statistics As the standard deviation (i.e., spread) of the sampling distribution decreases, the margin of error will decrease. As the variability (i.e., spread) in the sampling distribution decreases, the uncertainty in the estimate decreases. 3

4 The Importance of Sample Size 3 Sampling Distributions n = 1,000 n = 200 n = 50 Each dot represents a sample statistic. The number of samples taken to generate these sampling distributions is the same. What varies for each sampling distribution is the size of the sample taken to calculate the sample statistic. The sample size does not affect the shape of the sampling distribution. The sample size does not affect the center of the sampling distribution. The sample size does affect the spread of the sampling distribution. As the sample size increases, the spread decreases. Sample Size The sample size influences the spread of the sampling distribution (i.e., the variation in sample statistics), which influences the margin of error for our estimate of the population parameter. If we increased the sample size to 100, the standard deviation of the sampling distribution will... A. increase For each sample, the sample statistic (i.e., the proportion of orange pieces) would be closer B. decrease to the proportion of the population and thus C. remain the same closer to each other. and the margin of error for our point estimate will A. increase B. decrease C. remains the same Hypothesis Increasing the sample size will cause the standard deviation of the sampling distribution to decrease. Let s Test Our Hypothesis Random Samples If you take random samples, the sampling distribution will be centered around the true population parameter. If sampling bias exists (if you do not take random samples), the sampling distribution may provide inaccurate information about the true population parameter. Confidence Intervals Confidence Intervals A confidence interval for a population parameter estimate is an interval computed from sample data that will contain the true population parameter for a specified proportion of all samples. The confidence level is the proportion of samples whose intervals contain the true population parameter. The confidence level indicates how confident we are that our interval contains the population parameter. A 95% confidence interval will contain the true population parameter for 95% of all samples. We are 95% confident that the true population parameters falls within this range. Sampling Distribution Parameter % Proportion Confidence Interval p Sample Statistic >2 SDs *The standard deviation used to calculate the confidence interval is the standard deviation of the sampling distribution (not the sample distribution). The population parameter ( ) is fixed. It is typically not known. The sample statistic (x i ) is random. It depends on the sample. The confidence interval (x i 2SD)* is random. It depends on the sample statistic. The sampling distribution is comprised of the sample statistics and is centered on the population parameter. 95% of the sample statistics will fall within 2 standard deviations of the population parameter. 95% of the sample intervals will contain the population parameter. 4

5 Confidence Intervals A 95% confidence interval can be created by: sample statistic 2 standard deviations point estimate margin of error Standard Error: The Standard Deviation of the Sampling Distribution The standard deviation of the sampling distribution (i.e., the distribution of sample statistics) is called the standard error (SE). This is done to clearly distinguish it from the standard deviation of the sample distribution. The point estimate is calculated from our sample. The margin of error is calculated from the sampling distribution. Summary To create a plausible range of values for a parameter: 1. Take many random samples from the population, and compute the sample statistic for each sample. 2. Compute the standard error as the standard deviation of all these statistics. 3. Use: sample statistic ± 2 standard error One small problem Often we only have one sample! How can we calculate the variation in sample statistics, if we only have one sample? Part I: Graded Problems 3.12, 3.16, 3.24, and 3.54 Assignment Part II: (Type up this assignment in a Word document) Goto Find 3 quantitative variables and for each variable find another quantitative variable that you think is associated with it. Conduct a correlation test to see how correlated they are. For each pair of variables provide the following information: Variable names Question related to the variable Explain in your own words what this variable is measuring The unit used to measure the variable (e.g., years, dollars, inches, etc.) Min, Max, Mean, Median, Standard Deviation (Std Dev) The correlation score An interpretation of the correlation score Calculating Correlations from the GSS Calculating Correlations from the GSS Under the Analysis tab, click on the Correlation matrix tab. Enter the names of two quantitative variables here. Click on this button and the correlation statistics will open up in a new window. This is what will pop up in the new window. This is the correlation (r) score for the two variables 5

6 A recent survey of 1,502 Americans in found that 86% consider the economy a top priority for the president and congress this year. The standard error for this statistic is What is the 95% confidence interval for the true proportion of all Americans that consider the economy a top priority for the president and congress this year? A. (0.85, 0.87) B. (0.84, 0.88) C. (0.82, 0.90) Economy Calculating the Standard Error The standard error of a sample statistic is the same thing as the standard deviation of the sampling distribution (i.e., distribution of sample statistics). In order to calculate the standard deviation of the sampling distribution, we need the sample statistic for multiple samples. However, in reality we typically only have one sample! How do we know how much sample statistics vary, if we only have one sample? Terms Standard Deviation: Measures the spread of the distribution of values. (e.g., the distribution of sample values for variable x). Standard Error: Measures the standard deviation of the sampling distribution (i.e., the distribution of sample statistics). Margin of Error: The amount added and subtracted to a point estimate to calculate a confidence interval for a population parameter. 6

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies

More information

Lecture 2. Estimating Single Population Parameters 8-1

Lecture 2. Estimating Single Population Parameters 8-1 Lecture 2 Estimating Single Population Parameters 8-1 8.1 Point and Confidence Interval Estimates for a Population Mean Point Estimate A single statistic, determined from a sample, that is used to estimate

More information

Estimation and Confidence Intervals

Estimation and Confidence Intervals Estimation and Confidence Intervals Sections 7.1-7.3 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 17-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Statistics, continued

Statistics, continued Statistics, continued Visual Displays of Data Since numbers often do not resonate with people, giving visual representations of data is often uses to make the data more meaningful. We will talk about a

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS MEASURES OF CENTRAL TENDENCY. In an experiment, these are applied to the dependent variable (DV)

MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS MEASURES OF CENTRAL TENDENCY. In an experiment, these are applied to the dependent variable (DV) MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS Descriptive statistics Inferential statistics MEASURES OF CENTRAL TENDENCY In an experiment, these are applied to the dependent variable (DV) E.g., MEASURES

More information

Passing-Bablok Regression for Method Comparison

Passing-Bablok Regression for Method Comparison Chapter 313 Passing-Bablok Regression for Method Comparison Introduction Passing-Bablok regression for method comparison is a robust, nonparametric method for fitting a straight line to two-dimensional

More information

Review of the Normal Distribution

Review of the Normal Distribution Sampling and s Normal Distribution Aims of Sampling Basic Principles of Probability Types of Random Samples s of the Mean Standard Error of the Mean The Central Limit Theorem Review of the Normal Distribution

More information

Sampling Distribution Models. Chapter 17

Sampling Distribution Models. Chapter 17 Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling

More information

Intro to Confidence Intervals: A estimate is a single statistic based on sample data to estimate a population parameter Simplest approach But not always very precise due to variation in the sampling distribution

More information

Normal Curve in standard form: Answer each of the following questions

Normal Curve in standard form: Answer each of the following questions Basic Statistics Normal Curve in standard form: Answer each of the following questions What percent of the normal distribution lies between one and two standard deviations above the mean? What percent

More information

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing 1. Purpose of statistical inference Statistical inference provides a means of generalizing

More information

Exam #2 Results (as percentages)

Exam #2 Results (as percentages) Oct. 30 Assignment: Read Chapter 19 Try exercises 1, 2, and 4 on p. 424 Exam #2 Results (as percentages) Mean: 71.4 Median: 73.3 Soda attitudes 2015 In a Gallup poll conducted Jul. 8 12, 2015, 1009 adult

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Chapter 2: Tools for Exploring Univariate Data

Chapter 2: Tools for Exploring Univariate Data Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 7 Inferences Based on Two Samples: Confidence Intervals & Tests of Hypotheses Content 1. Identifying the Target Parameter 2. Comparing Two Population Means:

More information

Advanced Experimental Design

Advanced Experimental Design Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals

More information

Linear Correlation and Regression Analysis

Linear Correlation and Regression Analysis Linear Correlation and Regression Analysis Set Up the Calculator 2 nd CATALOG D arrow down DiagnosticOn ENTER ENTER SCATTER DIAGRAM Positive Linear Correlation Positive Correlation Variables will tend

More information

Interactietechnologie

Interactietechnologie Interactietechnologie Statistical Evaluation Remco Veltkamp www.scalable learning.com Select " SIGN IN OR SIGN UP" Select "Use your School/University Account" search "Utrecht" and select "Utrecht University"

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves)

Algebra 2. Outliers. Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Outliers Measures of Central Tendency (Mean, Median, Mode) Standard Deviation Normal Distribution (Bell Curves) Algebra 2 Notes #1 Chp 12 Outliers In a set of numbers, sometimes there will be

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Introduction to Econometrics. Review of Probability & Statistics

Introduction to Econometrics. Review of Probability & Statistics 1 Introduction to Econometrics Review of Probability & Statistics Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com Introduction 2 What is Econometrics? Econometrics consists of the application of mathematical

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee

Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 04 Basic Statistics Part-1 (Refer Slide Time: 00:33)

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Unit 2: Lesson 10 Measures of Spread Name:

Unit 2: Lesson 10 Measures of Spread Name: Unit 2: Lesson 10 Measures of Spread Name: Part 1: Mean Absolute Deviation (nice, but not the measure of spread we ll use ) The heights, in inches, of the players on a basketball team are given below.

More information

CRP 272 Introduction To Regression Analysis

CRP 272 Introduction To Regression Analysis CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous

More information

L6: Regression II. JJ Chen. July 2, 2015

L6: Regression II. JJ Chen. July 2, 2015 L6: Regression II JJ Chen July 2, 2015 Today s Plan Review basic inference based on Sample average Difference in sample average Extrapolate the knowledge to sample regression coefficients Standard error,

More information

Determining the Spread of a Distribution

Determining the Spread of a Distribution Determining the Spread of a Distribution 1.3-1.5 Cathy Poliak, Ph.D. cathy@math.uh.edu Department of Mathematics University of Houston Lecture 3-2311 Lecture 3-2311 1 / 58 Outline 1 Describing Quantitative

More information

Sociology 593 Exam 1 February 17, 1995

Sociology 593 Exam 1 February 17, 1995 Sociology 593 Exam 1 February 17, 1995 I. True-False. (25 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher regressed Y on. When he plotted

More information

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis

4.1 Introduction. 4.2 The Scatter Diagram. Chapter 4 Linear Correlation and Regression Analysis 4.1 Introduction Correlation is a technique that measures the strength (or the degree) of the relationship between two variables. For example, we could measure how strong the relationship is between people

More information

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence

More information

Chapter 18: Sampling Distributions

Chapter 18: Sampling Distributions Chapter 18: Sampling Distributions All random variables have probability distributions, and as statistics are random variables, they too have distributions. The random phenomenon that produces the statistics

More information

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career. Introduction to Data and Analysis Wildlife Management is a very quantitative field of study Results from studies will be used throughout this course and throughout your career. Sampling design influences

More information

Wilcoxon Test and Calculating Sample Sizes

Wilcoxon Test and Calculating Sample Sizes Wilcoxon Test and Calculating Sample Sizes Dan Spencer UC Santa Cruz Dan Spencer (UC Santa Cruz) Wilcoxon Test and Calculating Sample Sizes 1 / 33 Differences in the Means of Two Independent Groups When

More information

8.1 Absolute Value Functions

8.1 Absolute Value Functions 8.1 Absolute Value Functions We will explore one final function family this year known as piecewise functions. Piecewise functions are functions that are defined a piece at a time. In other words, for

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information

Basic Statistics Exercises 66

Basic Statistics Exercises 66 Basic Statistics Exercises 66 42. Suppose we are interested in predicting a person's height from the person's length of stride (distance between footprints). The following data is recorded for a random

More information

STA Module 10 Comparing Two Proportions

STA Module 10 Comparing Two Proportions STA 2023 Module 10 Comparing Two Proportions Learning Objectives Upon completing this module, you should be able to: 1. Perform large-sample inferences (hypothesis test and confidence intervals) to compare

More information

Math 2311 Sections 4.1, 4.2 and 4.3

Math 2311 Sections 4.1, 4.2 and 4.3 Math 2311 Sections 4.1, 4.2 and 4.3 4.1 - Density Curves What do we know about density curves? Example: Suppose we have a density curve defined for defined by the line y = x. Sketch: What percent of observations

More information

Name Date Block. Go to thesciencequeen.net click on student resources and then chemistry matters to find the links to answer the question.

Name Date Block. Go to thesciencequeen.net click on student resources and then chemistry matters to find the links to answer the question. Name Date Block Chemistry Matters Go to thesciencequeen.net click on student resources and then chemistry matters to find the links to answer the question. SITE #1: Chemical Equations 1. What three things

More information

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc. Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,

More information

Week 11 Sample Means, CLT, Correlation

Week 11 Sample Means, CLT, Correlation Week 11 Sample Means, CLT, Correlation Slides by Suraj Rampure Fall 2017 Administrative Notes Complete the mid semester survey on Piazza by Nov. 8! If 85% of the class fills it out, everyone will get a

More information

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent

More information

Survey on Population Mean

Survey on Population Mean MATH 203 Survey on Population Mean Dr. Neal, Spring 2009 The first part of this project is on the analysis of a population mean. You will obtain data on a specific measurement X by performing a random

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N Probability Distributions Normal Probability Distribution N Chapter 6 Inference It was reported that the 2008 Super Bowl was watched by 97.5 million people. But how does anyone know that? They certainly

More information

are the objects described by a set of data. They may be people, animals or things.

are the objects described by a set of data. They may be people, animals or things. ( c ) E p s t e i n, C a r t e r a n d B o l l i n g e r 2016 C h a p t e r 5 : E x p l o r i n g D a t a : D i s t r i b u t i o n s P a g e 1 CHAPTER 5: EXPLORING DATA DISTRIBUTIONS 5.1 Creating Histograms

More information

Lecture 8: Chapter 4, Section 4 Quantitative Variables (Normal)

Lecture 8: Chapter 4, Section 4 Quantitative Variables (Normal) Lecture 8: Chapter 4, Section 4 Quantitative Variables (Normal) 68-95-99.7 Rule Normal Curve z-scores Cengage Learning Elementary Statistics: Looking at the Big Picture 1 Looking Back: Review 4 Stages

More information

Experiment 1: The Same or Not The Same?

Experiment 1: The Same or Not The Same? Experiment 1: The Same or Not The Same? Learning Goals After you finish this lab, you will be able to: 1. Use Logger Pro to collect data and calculate statistics (mean and standard deviation). 2. Explain

More information

Prentice Hall Stats: Modeling the World 2004 (Bock) Correlated to: National Advanced Placement (AP) Statistics Course Outline (Grades 9-12)

Prentice Hall Stats: Modeling the World 2004 (Bock) Correlated to: National Advanced Placement (AP) Statistics Course Outline (Grades 9-12) National Advanced Placement (AP) Statistics Course Outline (Grades 9-12) Following is an outline of the major topics covered by the AP Statistics Examination. The ordering here is intended to define the

More information

Unit 22: Sampling Distributions

Unit 22: Sampling Distributions Unit 22: Sampling Distributions Summary of Video If we know an entire population, then we can compute population parameters such as the population mean or standard deviation. However, we generally don

More information

AP Final Review II Exploring Data (20% 30%)

AP Final Review II Exploring Data (20% 30%) AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

Inferences for Correlation

Inferences for Correlation Inferences for Correlation Quantitative Methods II Plan for Today Recall: correlation coefficient Bivariate normal distributions Hypotheses testing for population correlation Confidence intervals for population

More information

Univariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data?

Univariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data? Univariate analysis Example - linear regression equation: y = ax + c Least squares criteria ( yobs ycalc ) = yobs ( ax + c) = minimum Simple and + = xa xc xy xa + nc = y Solve for a and c Univariate analysis

More information

One-factor analysis of variance (ANOVA)

One-factor analysis of variance (ANOVA) One-factor analysis of variance (ANOVA) March 1, 2017 psych10.stanford.edu Announcements / Action Items Schedule update: final R lab moved to Week 10 Optional Survey 5 coming soon, due on Saturday Last

More information

Lecture 11 - Tests of Proportions

Lecture 11 - Tests of Proportions Lecture 11 - Tests of Proportions Statistics 102 Colin Rundel February 27, 2013 Research Project Research Project Proposal - Due Friday March 29th at 5 pm Introduction, Data Plan Data Project - Due Friday,

More information

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included:

Percentile: Formula: To find the percentile rank of a score, x, out of a set of n scores, where x is included: AP Statistics Chapter 2 Notes 2.1 Describing Location in a Distribution Percentile: The pth percentile of a distribution is the value with p percent of the observations (If your test score places you in

More information

) The cumulative probability distribution shows the probability

) The cumulative probability distribution shows the probability Problem Set ECONOMETRICS Prof Òscar Jordà Due Date: Thursday, April 0 STUDENT S NAME: Multiple Choice Questions [0 pts] Please provide your answers to this section below: 3 4 5 6 7 8 9 0 ) The cumulative

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

Comparing Means from Two-Sample

Comparing Means from Two-Sample Comparing Means from Two-Sample Kwonsang Lee University of Pennsylvania kwonlee@wharton.upenn.edu April 3, 2015 Kwonsang Lee STAT111 April 3, 2015 1 / 22 Inference from One-Sample We have two options to

More information

Sampling Distributions in Regression. Mini-Review: Inference for a Mean. For data (x 1, y 1 ),, (x n, y n ) generated with the SRM,

Sampling Distributions in Regression. Mini-Review: Inference for a Mean. For data (x 1, y 1 ),, (x n, y n ) generated with the SRM, Department of Statistics The Wharton School University of Pennsylvania Statistics 61 Fall 3 Module 3 Inference about the SRM Mini-Review: Inference for a Mean An ideal setup for inference about a mean

More information

Daniel Boduszek University of Huddersfield

Daniel Boduszek University of Huddersfield Daniel Boduszek University of Huddersfield d.boduszek@hud.ac.uk Introduction to moderator effects Hierarchical Regression analysis with continuous moderator Hierarchical Regression analysis with categorical

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

ACTIVITY 6 Creating Wave Functions

ACTIVITY 6 Creating Wave Functions Name: WAVES of matter Class: Visual Quantum Mechanics ACTIVITY 6 Creating Wave Functions Goal To make wave functions useful we must be able to create them for physical situations. We will start with electrons

More information

Conservation of Momentum

Conservation of Momentum Learning Goals Conservation of Momentum After you finish this lab, you will be able to: 1. Use Logger Pro to analyze video and calculate position, velocity, and acceleration. 2. Use the equations for 2-dimensional

More information

Introduction to Estimation. Martina Litschmannová K210

Introduction to Estimation. Martina Litschmannová K210 Introduction to Estimation Martina Litschmannová martina.litschmannova@vsb.cz K210 Populations vs. Sample A population includes each element from the set of observations that can be made. A sample consists

More information

Lecture 22/Chapter 19 Part 4. Statistical Inference Ch. 19 Diversity of Sample Proportions

Lecture 22/Chapter 19 Part 4. Statistical Inference Ch. 19 Diversity of Sample Proportions Lecture 22/Chapter 19 Part 4. Statistical Inference Ch. 19 Diversity of Sample Proportions Probability versus Inference Behavior of Sample Proportions: Example Behavior of Sample Proportions: Conditions

More information

1. Create a scatterplot of this data. 2. Find the correlation coefficient.

1. Create a scatterplot of this data. 2. Find the correlation coefficient. How Fast Foods Compare Company Entree Total Calories Fat (grams) McDonald s Big Mac 540 29 Filet o Fish 380 18 Burger King Whopper 670 40 Big Fish Sandwich 640 32 Wendy s Single Burger 470 21 1. Create

More information

Vocabulary: Samples and Populations

Vocabulary: Samples and Populations Vocabulary: Samples and Populations Concept Different types of data Categorical data results when the question asked in a survey or sample can be answered with a nonnumerical answer. For example if we

More information

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and

More information

STATS DOESN T SUCK! ~ CHAPTER 16

STATS DOESN T SUCK! ~ CHAPTER 16 SIMPLE LINEAR REGRESSION: STATS DOESN T SUCK! ~ CHAPTER 6 The HR manager at ACME food services wants to examine the relationship between a workers income and their years of experience on the job. He randomly

More information

Unit 6 - Simple linear regression

Unit 6 - Simple linear regression Sta 101: Data Analysis and Statistical Inference Dr. Çetinkaya-Rundel Unit 6 - Simple linear regression LO 1. Define the explanatory variable as the independent variable (predictor), and the response variable

More information

Statistical Inference

Statistical Inference Statistical Inference Bernhard Klingenberg Institute of Statistics Graz University of Technology Steyrergasse 17/IV, 8010 Graz www.statistics.tugraz.at February 12, 2008 Outline Estimation: Review of concepts

More information

Lab 1. EXCEL plus some basic concepts such as scientific notation, order of magnitude, logarithms, and unit conversions

Lab 1. EXCEL plus some basic concepts such as scientific notation, order of magnitude, logarithms, and unit conversions COMPUTER LAB 1 EARTH SYSTEMS SCIENCE I PG250 Fall 2010 Hunter College Lab 1. EXCEL plus some basic concepts such as scientific notation, order of magnitude, logarithms, and unit conversions Low Impact

More information

THE SAMPLING DISTRIBUTION OF THE MEAN

THE SAMPLING DISTRIBUTION OF THE MEAN THE SAMPLING DISTRIBUTION OF THE MEAN COGS 14B JANUARY 26, 2017 TODAY Sampling Distributions Sampling Distribution of the Mean Central Limit Theorem INFERENTIAL STATISTICS Inferential statistics: allows

More information

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation).

Basic Statistics. 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). Basic Statistics There are three types of error: 1. Gross error analyst makes a gross mistake (misread balance or entered wrong value into calculation). 2. Systematic error - always too high or too low

More information

Chapter 6 The Normal Distribution

Chapter 6 The Normal Distribution Chapter 6 The Normal PSY 395 Oswald Outline s and area The normal distribution The standard normal distribution Setting probable limits on a score/observation Measures related to 2 s and Area The idea

More information

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248) AIM HIGH SCHOOL Curriculum Map 2923 W. 12 Mile Road Farmington Hills, MI 48334 (248) 702-6922 www.aimhighschool.com COURSE TITLE: Statistics DESCRIPTION OF COURSE: PREREQUISITES: Algebra 2 Students will

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES

BIOL Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES BIOL 458 - Biometry LAB 6 - SINGLE FACTOR ANOVA and MULTIPLE COMPARISON PROCEDURES PART 1: INTRODUCTION TO ANOVA Purpose of ANOVA Analysis of Variance (ANOVA) is an extremely useful statistical method

More information

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models Copyright 2010, 2007, 2004 Pearson Education, Inc. Normal Model When we talk about one data value and the Normal model we used the notation: N(μ, σ) Copyright 2010,

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Measurement Error PHYS Introduction

Measurement Error PHYS Introduction PHYS 1301 Measurement Error Introduction We have confidence that a particular physics theory is telling us something interesting about the physical universe because we are able to test quantitatively its

More information

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives STA 2023 Module 6 The Sampling Distributions Module Objectives In this module, we will learn the following: 1. Define sampling error and explain the need for sampling distributions. 2. Recognize that sampling

More information

Measurement Error PHYS Introduction

Measurement Error PHYS Introduction PHYS 1301 Measurement Error Introduction We have confidence that a particular physics theory is telling us something interesting about the physical universe because we are able to test quantitatively its

More information

Hotelling s One- Sample T2

Hotelling s One- Sample T2 Chapter 405 Hotelling s One- Sample T2 Introduction The one-sample Hotelling s T2 is the multivariate extension of the common one-sample or paired Student s t-test. In a one-sample t-test, the mean response

More information

Chapter 8. Inferences Based on a Two Samples Confidence Intervals and Tests of Hypothesis

Chapter 8. Inferences Based on a Two Samples Confidence Intervals and Tests of Hypothesis Chapter 8 Inferences Based on a Two Samples Confidence Intervals and Tests of Hypothesis Copyright 2018, 2014, and 2011 Pearson Education, Inc. Slide - 1 Content 1. Identifying the Target Parameter 2.

More information

Chapter 12 - Lecture 2 Inferences about regression coefficient

Chapter 12 - Lecture 2 Inferences about regression coefficient Chapter 12 - Lecture 2 Inferences about regression coefficient April 19th, 2010 Facts about slope Test Statistic Confidence interval Hypothesis testing Test using ANOVA Table Facts about slope In previous

More information

Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap

Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap Overview In this lab you will think critically about the functionality of spatial interpolation, improve your kriging skills, and learn how to use several

More information

Tests for Two Coefficient Alphas

Tests for Two Coefficient Alphas Chapter 80 Tests for Two Coefficient Alphas Introduction Coefficient alpha, or Cronbach s alpha, is a popular measure of the reliability of a scale consisting of k parts. The k parts often represent k

More information

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals (SW Chapter 5) Outline. The standard error of ˆ. Hypothesis tests concerning β 3. Confidence intervals for β 4. Regression

More information