Study and research skills 2009 Duncan Golicher. and Adrian Newton. Last draft 11/24/2008

Size: px
Start display at page:

Download "Study and research skills 2009 Duncan Golicher. and Adrian Newton. Last draft 11/24/2008"

Transcription

1 Study and research skills and Adrian Newton. Last draft 11/24/2008

2 Inference about the mean: What you will learn Why we need to draw inferences from samples The difference between a population and a sample An intuitive understanding of the properties of a sample The sampling distribution of the mean The standard error of the mean Confidence intervals Large sample confidence intervals Small sample confidence intervals

3 Infering the mean Inference Statisticians try to make certain statements about uncertain quantities. We can t know everything about the world. Some properties that we are interested in must be inferred. This is a very difficult concept both to use and accept. Probability is non-intuitive and often misunderstood.

4 Infering the mean Sometimes we just have to do our best! I am not going to give you a number for it because it s not my business to do intelligent work. What I told him was not necessarily accurate. It might also not have been inaccurate, but I m disinclined to mislead anyone. Donald Rumsfeld when -asked to estimate the number of Iraqi insurgents while testifying before Congress

5 Infering the mean Making inferences about a mean If we want to know a mean why don t we just calculate it? Didn t we see that last time? It is not quite so simple. The problem arises when we want to know the mean of a population. We usually only have a single sample from that population. If we draw a different sample we get a different mean. So although we know what the mean of a sample is we can only estimate the mean of a population. It is a known unknown

6 Populations What is a population There is another tricky issue to resolve. What do we mean by a population? This can become a philosophical question A pragmatic way of looking at is that a population is anything we really want to know about by drawing a sample. A population could be finite or infinite For example The population might be all the pine trees in a 5 ha wood in the New Forest. We want to estimate the mean diameter of the trees growing at that site. Or... the population might be Pinus sylvestris. We might be interested in the mean needle length for the species. In the first case we could measure every single tree if we had time In the second case we can only ever get a sample. We can t measure all the members of an infinite population!

7 An example An example Imagine we have 100 trees in a forest. Their basal areas (area in cross section in cm 2 ) are taken from a theoretical normal distributed with mean= 50 and sd= 20 The population could be either the 100 trees (there are no more) or it could be treated as effectively infinite (Notice that the second assumption is not practical in this case) Put that to one side for a moment and look at the trees.

8 An example One hundred trees [

9 If only... If only we knew the truth! If we could really know that the tree basal areas were taken from a normal distribution with mean = 50 and sd = 20 we wouldn t need to measure any of them. About 68% of the values would lie between 30 and 70 About 95% of the values would lie between 10 and 90 We know the mean itself. It is 50. No uncertainty at all.

10 If only... If only we knew the truth! If there are only 100 trees (finite population) we could also become certain about this particular population of trees. We could measure every single tree with extremely precise instruments. We could then calculate the mean and sd for the population of 100 trees.

11 If only... The theoretical infinite population (red line) and the empirical finite population (histogram) pdf 2

12 If only... The finite population parameters The mean is The standard deviation is Not far from our theoretical vales (50 and 20)

13 Simulating sampling Sampling It takes about three minutes to measure a tree s diameter accurately plus walking time between each What if we only have one morning available? We might only manage to measure thirty of the hundred trees. What can we then say about the mean of the hundred trees?

14 Simulating sampling A representative sample It would not be a good idea to measure only the trees on the edge of the forest (they get more light and might be bigger) We should aim for a representative sample This could be obtained by randomly selecting trees

15 Simulating sampling A random sample

16 Simulating sampling A representative sample? pdf 2

17 Simulating sampling In fact it is completely representative We can never expect the histogram of a relatively small sample to look classically normal We can implement a test to find if it could have been drawn from a normal distribution Most small samples could. This was. Small samples rarely look that normal

18 Sample properties The sample properties The mean is The standard deviation is But what happens if we send someone else back to draw another random sample?

19 Sample properties Another random sample

20 Sample properties A different sample s properties The mean is The standard deviation is And what happens if we send another person back to draw another random sample?

21 Sample properties Another random sample

22 Sample properties Another sample s properties The mean is The standard deviation is And what happens if we send yet another person back to draw another random sample?

23 Sample properties Another random sample

24 Sample properties Another sample s properties The mean is The standard deviation is And so on and so on...

25 Sample properties Another random sample

26 Sample properties Another random sample

27 Sample properties Where is this taking us? We never really would take repeated samples from the same population in this way However frequentist statistical theory is based on this idea. There is something very interesting about the properties of repeated samples Let s get the computer to do this 1000 times and look at the result.

28 Sample properties The sampling distribution of the mean Histogram of the population basal areas Frequency Histogram of the mean values of samples of 30 trees Frequency

29 Sample properties The standard error of the mean The means of our repeated sampling experiments form a much tighter distribution than the data. We can find the mean of the means if we want. It is The mean of the finite population of one hundred trees is They are close. We can also find the standard deviation of our hypothetical set of sampling experiments. It is The standard deviation of the hundred trees is They are not close. The standard deviation of the mean is always less than the standard deviation and it decreases with sample size.

30 Sample properties 1000 samples size 2 Frequency replicate(10000, mean(sample(d$a, 2)))

31 Sample properties 1000 samples size 4 Frequency replicate(1000, mean(sample(d$a, 4)))

32 Sample properties 1000 samples size 10 Frequency replicate(1000, mean(sample(d$a, 10)))

33 Sample properties 1000 samples size 20 Frequency replicate(1000, mean(sample(d$a, 20)))

34 Sample properties 1000 samples size 50 Frequency replicate(1000, mean(sample(d$a, 50)))

35 Sample properties 1000 samples size 90 Frequency replicate(1000, mean(sample(d$a, 90)))

36 Sample properties So what use is this? No one would be stupid enough to waste time this way. But the idea is still very useful. It turns out that if we know the standard deviation of a population we know what the standard deviation of the mean will be for any sample size. SD x = σ n Where SD x represents the true standard deviation for the means and σ is the population standard deviation and n is the sample size.

37 Standard error The standard error But, that is no use to us! We don t know the population standard deviation unless we measure all the trees! We re back where we started. Fortunately we do have an estimate of it. We can get one every time we take a sample. It is the sample standard deviation. It won t be quite right and it also varies between samples. In the case of small samples it might even be quite hopeless, but if we only measure some of the trees once, its all we ve got. We can call the standard deviation of the mean calculated from the sample standard deviation the standard error. SE x = s n

38 Large samples Inference from large samples If the sample is large (n>30) we might safely assume that our sample standard deviation s is more or less equal to σ We can also assume that our standard error is pretty close to actually being the standard deviation of the mean. Now, this is beginning to look more useful. We already know all about standard deviations from last time.

39 Large samples 68 percent of observations lie within 1 sd of the mean x

40 Large samples 95 percent of observations lie within 2 sds of the mean x

41 Large samples Confidence intervals So, we can imagine what a histogram of the repeated sampling experiments would look like without having to do them. Their means would form a normal distribution with standard deviation= standard error. A 95% confidence interval can be calculated from the sample mean plus or minus two standard errors (or 1.96 standard errors if we want to be really fussy) x±2.se x It will include the true population mean 95% of the time.

42 Large samples Calculating a confidence interval: A random sample to try it out on BA

43 Large samples Get the computer to do the work Sample mean = Sample standard deviation = Standard error = % confidence interval for mean = ±7.44 What was the true population mean? It falls inside the interval! And it should do so 19 times out of 20.

44 Large samples What can make us more confident? The smaller the confidence interval the more precise is our estimate. Remember that if the sample was biased it could still be inaccurate. We can improve precision by Measuring something with intrinsically low variability In ecology most variability is naturally part of the system In this case taking very precise measurements of each element in a sample could be a bit of a waste of time Taking a large sample. But remember the denominator is the square root of the sample size Decreasing the uncertainty in your mean value estimate by a factor of two needs four times as many samples. Decreasing uncertainty by a factor of ten requires a hundred times as many samples.

45 Small samples Small sample inference Remember that the sample standard deviation varies between samples It varies more for small samples So as sample size decreases the estimate becomes worse The use of the t statistic is designed to control for this

46 Small samples What is the t distribution A t distribution with a large number of degrees of freedom (large sample) is the same as a normal distribution. So large sample inference using the t distribution is the same as using the z values of a normal distribution (±2SEs for 95% confidence intervals) However as the sample size gets smaller the t distribution gets longer tails

47 Small samples T distribution with 30 degrees of freedom function(x) dt(df = n 1, x) (x) x

48 Small samples T distribution with 20 degrees of freedom function(x) dt(df = n 1, x) (x) x

49 Small samples T distribution with 10 degrees of freedom function(x) dt(df = n 1, x) (x) x

50 Small samples T distribution with 5 degrees of freedom function(x) dt(df = n 1, x) (x) x

51 Small samples T distribution with 2 degrees of freedom function(x) dt(df = n 1, x) (x) x

52 Small samples So how do we use this? Look up the number that corresponds to the 97.5 percentile of the t-distribution (A) for the number of degrees of freedom you have (n-1) Use this instead of the previous rule of thumb number (2) to calculate your confidence intervals.

53 Small samples Values of the 97.5 percentile for a range of degrees of freedom df qt

54 Small samples Calculating a confidence interval using the t distribution: A small sample to try it out on BA

55 Small samples Get the computer to do the work again Sample mean = Sample standard deviation = 7.37 Standard error = percentile of t distribution for 4 degrees of freedom = % confidence interval for mean = ±9.15 What was the true population mean? It falls inside the interval! And it also should do so 19 times out of 20.

56 Small samples Assumptions for calculating confidence intervals Both small sample and large sample confidence intervals assume that the data are drawn from a normally distributed population. We will look more carefully at this assumption later Perhaps more importantly the sample must be representative of the population from which it has been drawn.

57 Small samples What have we covered The fundamental basis of statistics Understand this class and you ve passed the course! We ve learnt how to guess a defensible range of values for a number we don t know with certainty We use what we do know to make statements about what we don t. Don Runsfeldt would approve.

58 Small samples What you need to remember You must remember all the formulas from class 2. The additional concept is the standard error of the mean. This is the sample standard error divided by the square root of the number of observations in the sample. Multiply this by two to get a 95% confidence interval for sample size >30 Multiply it by a number greater than two (that you get from the t-distribution) for sample sizes <30

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc. Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,

More information

You separate binary numbers into columns in a similar fashion. 2 5 = 32

You separate binary numbers into columns in a similar fashion. 2 5 = 32 RSA Encryption 2 At the end of Part I of this article, we stated that RSA encryption works because it s impractical to factor n, which determines P 1 and P 2, which determines our private key, d, which

More information

Chapter 18. Sampling Distribution Models /51

Chapter 18. Sampling Distribution Models /51 Chapter 18 Sampling Distribution Models 1 /51 Homework p432 2, 4, 6, 8, 10, 16, 17, 20, 30, 36, 41 2 /51 3 /51 Objective Students calculate values of central 4 /51 The Central Limit Theorem for Sample

More information

Chapter 1 Review of Equations and Inequalities

Chapter 1 Review of Equations and Inequalities Chapter 1 Review of Equations and Inequalities Part I Review of Basic Equations Recall that an equation is an expression with an equal sign in the middle. Also recall that, if a question asks you to solve

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

Physics 509: Bootstrap and Robust Parameter Estimation

Physics 509: Bootstrap and Robust Parameter Estimation Physics 509: Bootstrap and Robust Parameter Estimation Scott Oser Lecture #20 Physics 509 1 Nonparametric parameter estimation Question: what error estimate should you assign to the slope and intercept

More information

Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi

Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi Communication Engineering Prof. Surendra Prasad Department of Electrical Engineering Indian Institute of Technology, Delhi Lecture - 41 Pulse Code Modulation (PCM) So, if you remember we have been talking

More information

Descriptive statistics

Descriptive statistics Patrick Breheny February 6 Patrick Breheny to Biostatistics (171:161) 1/25 Tables and figures Human beings are not good at sifting through large streams of data; we understand data much better when it

More information

Confidence Intervals. - simply, an interval for which we have a certain confidence.

Confidence Intervals. - simply, an interval for which we have a certain confidence. Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something

More information

Lecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown

Lecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown Lecture 17: Small-Sample Inferences for Normal Populations Confidence intervals for µ when σ is unknown If the population distribution is normal, then X µ σ/ n has a standard normal distribution. If σ

More information

( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of

( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of Factoring Review for Algebra II The saddest thing about not doing well in Algebra II is that almost any math teacher can tell you going into it what s going to trip you up. One of the first things they

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

Error Analysis in Experimental Physical Science Mini-Version

Error Analysis in Experimental Physical Science Mini-Version Error Analysis in Experimental Physical Science Mini-Version by David Harrison and Jason Harlow Last updated July 13, 2012 by Jason Harlow. Original version written by David M. Harrison, Department of

More information

Solution to Proof Questions from September 1st

Solution to Proof Questions from September 1st Solution to Proof Questions from September 1st Olena Bormashenko September 4, 2011 What is a proof? A proof is an airtight logical argument that proves a certain statement in general. In a sense, it s

More information

Getting Started with Communications Engineering

Getting Started with Communications Engineering 1 Linear algebra is the algebra of linear equations: the term linear being used in the same sense as in linear functions, such as: which is the equation of a straight line. y ax c (0.1) Of course, if we

More information

Quantitative Understanding in Biology 1.7 Bayesian Methods

Quantitative Understanding in Biology 1.7 Bayesian Methods Quantitative Understanding in Biology 1.7 Bayesian Methods Jason Banfelder October 25th, 2018 1 Introduction So far, most of the methods we ve looked at fall under the heading of classical, or frequentist

More information

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives STA 2023 Module 6 The Sampling Distributions Module Objectives In this module, we will learn the following: 1. Define sampling error and explain the need for sampling distributions. 2. Recognize that sampling

More information

Physics 6A Lab Experiment 6

Physics 6A Lab Experiment 6 Biceps Muscle Model Physics 6A Lab Experiment 6 Introduction This lab will begin with some warm-up exercises to familiarize yourself with the theory, as well as the experimental setup. Then you ll move

More information

Quadratic Equations Part I

Quadratic Equations Part I Quadratic Equations Part I Before proceeding with this section we should note that the topic of solving quadratic equations will be covered in two sections. This is done for the benefit of those viewing

More information

DIRECTED NUMBERS ADDING AND SUBTRACTING DIRECTED NUMBERS

DIRECTED NUMBERS ADDING AND SUBTRACTING DIRECTED NUMBERS DIRECTED NUMBERS POSITIVE NUMBERS These are numbers such as: 3 which can be written as +3 46 which can be written as +46 14.67 which can be written as +14.67 a which can be written as +a RULE Any number

More information

Structural Analysis II Prof. P. Banerjee Department of Civil Engineering Indian Institute of Technology, Bombay Lecture 38

Structural Analysis II Prof. P. Banerjee Department of Civil Engineering Indian Institute of Technology, Bombay Lecture 38 Structural Analysis II Prof. P. Banerjee Department of Civil Engineering Indian Institute of Technology, Bombay Lecture 38 Good morning. We have been looking at influence lines for the last couple of lectures

More information

- a value calculated or derived from the data.

- a value calculated or derived from the data. Descriptive statistics: Note: I'm assuming you know some basics. If you don't, please read chapter 1 on your own. It's pretty easy material, and it gives you a good background as to why we need statistics.

More information

Ch18 links / ch18 pdf links Ch18 image t-dist table

Ch18 links / ch18 pdf links Ch18 image t-dist table Ch18 links / ch18 pdf links Ch18 image t-dist table ch18 (inference about population mean) exercises: 18.3, 18.5, 18.7, 18.9, 18.15, 18.17, 18.19, 18.27 CHAPTER 18: Inference about a Population Mean The

More information

Uni- and Bivariate Power

Uni- and Bivariate Power Uni- and Bivariate Power Copyright 2002, 2014, J. Toby Mordkoff Note that the relationship between risk and power is unidirectional. Power depends on risk, but risk is completely independent of power.

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

Descriptive Statistics (And a little bit on rounding and significant digits)

Descriptive Statistics (And a little bit on rounding and significant digits) Descriptive Statistics (And a little bit on rounding and significant digits) Now that we know what our data look like, we d like to be able to describe it numerically. In other words, how can we represent

More information

Sampling Distribution Models. Chapter 17

Sampling Distribution Models. Chapter 17 Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling

More information

Fitting a Straight Line to Data

Fitting a Straight Line to Data Fitting a Straight Line to Data Thanks for your patience. Finally we ll take a shot at real data! The data set in question is baryonic Tully-Fisher data from http://astroweb.cwru.edu/sparc/btfr Lelli2016a.mrt,

More information

MITOCW MIT18_01SCF10Rec_24_300k

MITOCW MIT18_01SCF10Rec_24_300k MITOCW MIT18_01SCF10Rec_24_300k JOEL LEWIS: Hi. Welcome back to recitation. In lecture, you've been doing related rates problems. I've got another example for you, here. So this one's a really tricky one.

More information

Algebra 8.6 Simple Equations

Algebra 8.6 Simple Equations Algebra 8.6 Simple Equations 1. Introduction Let s talk about the truth: 2 = 2 This is a true statement What else can we say about 2 that is true? Eample 1 2 = 2 1+ 1= 2 2 1= 2 4 1 = 2 2 4 2 = 2 4 = 4

More information

Chapter 23. Inference About Means

Chapter 23. Inference About Means Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that

More information

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module

More information

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table. MA 1125 Lecture 15 - The Standard Normal Distribution Friday, October 6, 2017. Objectives: Introduce the standard normal distribution and table. 1. The Standard Normal Distribution We ve been looking at

More information

Chapter 26: Comparing Counts (Chi Square)

Chapter 26: Comparing Counts (Chi Square) Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces

More information

Chapter 5 Simplifying Formulas and Solving Equations

Chapter 5 Simplifying Formulas and Solving Equations Chapter 5 Simplifying Formulas and Solving Equations Look at the geometry formula for Perimeter of a rectangle P = L W L W. Can this formula be written in a simpler way? If it is true, that we can simplify

More information

P (E) = P (A 1 )P (A 2 )... P (A n ).

P (E) = P (A 1 )P (A 2 )... P (A n ). Lecture 9: Conditional probability II: breaking complex events into smaller events, methods to solve probability problems, Bayes rule, law of total probability, Bayes theorem Discrete Structures II (Summer

More information

Notes 11: OLS Theorems ECO 231W - Undergraduate Econometrics

Notes 11: OLS Theorems ECO 231W - Undergraduate Econometrics Notes 11: OLS Theorems ECO 231W - Undergraduate Econometrics Prof. Carolina Caetano For a while we talked about the regression method. Then we talked about the linear model. There were many details, but

More information

Continuity and One-Sided Limits

Continuity and One-Sided Limits Continuity and One-Sided Limits 1. Welcome to continuity and one-sided limits. My name is Tuesday Johnson and I m a lecturer at the University of Texas El Paso. 2. With each lecture I present, I will start

More information

LECTURE 15: SIMPLE LINEAR REGRESSION I

LECTURE 15: SIMPLE LINEAR REGRESSION I David Youngberg BSAD 20 Montgomery College LECTURE 5: SIMPLE LINEAR REGRESSION I I. From Correlation to Regression a. Recall last class when we discussed two basic types of correlation (positive and negative).

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Two-sample Categorical data: Testing

Two-sample Categorical data: Testing Two-sample Categorical data: Testing Patrick Breheny April 1 Patrick Breheny Introduction to Biostatistics (171:161) 1/28 Separate vs. paired samples Despite the fact that paired samples usually offer

More information

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models Copyright 2010, 2007, 2004 Pearson Education, Inc. Normal Model When we talk about one data value and the Normal model we used the notation: N(μ, σ) Copyright 2010,

More information

COMP6053 lecture: Sampling and the central limit theorem. Markus Brede,

COMP6053 lecture: Sampling and the central limit theorem. Markus Brede, COMP6053 lecture: Sampling and the central limit theorem Markus Brede, mb8@ecs.soton.ac.uk Populations: long-run distributions Two kinds of distributions: populations and samples. A population is the set

More information

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math. Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if

More information

STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables

STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences Random Variables Christopher Adolph Department of Political Science and Center for Statistics and the Social Sciences University

More information

CHAPTER 9: HYPOTHESIS TESTING

CHAPTER 9: HYPOTHESIS TESTING CHAPTER 9: HYPOTHESIS TESTING THE SECOND LAST EXAMPLE CLEARLY ILLUSTRATES THAT THERE IS ONE IMPORTANT ISSUE WE NEED TO EXPLORE: IS THERE (IN OUR TWO SAMPLES) SUFFICIENT STATISTICAL EVIDENCE TO CONCLUDE

More information

MIT BLOSSOMS INITIATIVE

MIT BLOSSOMS INITIATIVE MIT BLOSSOMS INITIATIVE The Broken Stick Problem Taught by Professor Richard C. Larson Mitsui Professor of Engineering Systems and of Civil and Environmental Engineering Segment 1 Hi! My name is Dick Larson

More information

Conceptual Explanations: Simultaneous Equations Distance, rate, and time

Conceptual Explanations: Simultaneous Equations Distance, rate, and time Conceptual Explanations: Simultaneous Equations Distance, rate, and time If you travel 30 miles per hour for 4 hours, how far do you go? A little common sense will tell you that the answer is 120 miles.

More information

Algebra. Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed.

Algebra. Here are a couple of warnings to my students who may be here to get a copy of what happened on a day that you missed. This document was written and copyrighted by Paul Dawkins. Use of this document and its online version is governed by the Terms and Conditions of Use located at. The online version of this document is

More information

1.20 Formulas, Equations, Expressions and Identities

1.20 Formulas, Equations, Expressions and Identities 1.0 Formulas, Equations, Expressions and Identities Collecting terms is equivalent to noting that 4 + 4 + 4 + 4 + 4 + 4 can be written as 6 4; i.e., that multiplication is repeated addition. It s wise

More information

Calculus II. Calculus II tends to be a very difficult course for many students. There are many reasons for this.

Calculus II. Calculus II tends to be a very difficult course for many students. There are many reasons for this. Preface Here are my online notes for my Calculus II course that I teach here at Lamar University. Despite the fact that these are my class notes they should be accessible to anyone wanting to learn Calculus

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

CHAPTER 1. Introduction

CHAPTER 1. Introduction CHAPTER 1 Introduction A typical Modern Geometry course will focus on some variation of a set of axioms for Euclidean geometry due to Hilbert. At the end of such a course, non-euclidean geometries (always

More information

MITOCW watch?v=ruz33p1icrs

MITOCW watch?v=ruz33p1icrs MITOCW watch?v=ruz33p1icrs The following content is provided under a Creative Commons license. Your support will help MIT OpenCourseWare continue to offer high quality educational resources for free. To

More information

Ask. Don t Tell. Annotated Examples

Ask. Don t Tell. Annotated Examples Ask. Don t Tell. Annotated Examples Alfonso Gracia-Saz (alfonso@math.toronto.edu) The three principles: 1. Where is the student? 2. Help minimally. 3. Ask. Don t Tell. Ask. Don t Tell. 1 BRETT 1 Brett

More information

Making Measurements. On a piece of scrap paper, write down an appropriate reading for the length of the blue rectangle shown below: (then continue )

Making Measurements. On a piece of scrap paper, write down an appropriate reading for the length of the blue rectangle shown below: (then continue ) On a piece of scrap paper, write down an appropriate reading for the length of the blue rectangle shown below: (then continue ) 0 1 2 3 4 5 cm If the measurement you made was 3.7 cm (or 3.6 cm or 3.8 cm),

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

Fourier and Stats / Astro Stats and Measurement : Stats Notes

Fourier and Stats / Astro Stats and Measurement : Stats Notes Fourier and Stats / Astro Stats and Measurement : Stats Notes Andy Lawrence, University of Edinburgh Autumn 2013 1 Probabilities, distributions, and errors Laplace once said Probability theory is nothing

More information

Physics 6A Lab Experiment 6

Physics 6A Lab Experiment 6 Rewritten Biceps Lab Introduction This lab will be different from the others you ve done so far. First, we ll have some warmup exercises to familiarize yourself with some of the theory, as well as the

More information

Chapter 10 Regression Analysis

Chapter 10 Regression Analysis Chapter 10 Regression Analysis Goal: To become familiar with how to use Excel 2007/2010 for Correlation and Regression. Instructions: You will be using CORREL, FORECAST and Regression. CORREL and FORECAST

More information

Do students sleep the recommended 8 hours a night on average?

Do students sleep the recommended 8 hours a night on average? BIEB100. Professor Rifkin. Notes on Section 2.2, lecture of 27 January 2014. Do students sleep the recommended 8 hours a night on average? We first set up our null and alternative hypotheses: H0: μ= 8

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

COMP6053 lecture: Sampling and the central limit theorem. Jason Noble,

COMP6053 lecture: Sampling and the central limit theorem. Jason Noble, COMP6053 lecture: Sampling and the central limit theorem Jason Noble, jn2@ecs.soton.ac.uk Populations: long-run distributions Two kinds of distributions: populations and samples. A population is the set

More information

Introductory Quantum Chemistry Prof. K. L. Sebastian Department of Inorganic and Physical Chemistry Indian Institute of Science, Bangalore

Introductory Quantum Chemistry Prof. K. L. Sebastian Department of Inorganic and Physical Chemistry Indian Institute of Science, Bangalore Introductory Quantum Chemistry Prof. K. L. Sebastian Department of Inorganic and Physical Chemistry Indian Institute of Science, Bangalore Lecture - 4 Postulates Part 1 (Refer Slide Time: 00:59) So, I

More information

CH 24 IDENTITIES. [Each product is 35] Ch 24 Identities. Introduction

CH 24 IDENTITIES. [Each product is 35] Ch 24 Identities. Introduction 139 CH 4 IDENTITIES Introduction First we need to recall that there are many ways to indicate multiplication; for eample the product of 5 and 7 can be written in a variety of ways: 5 7 5 7 5(7) (5)7 (5)(7)

More information

5.2 Infinite Series Brian E. Veitch

5.2 Infinite Series Brian E. Veitch 5. Infinite Series Since many quantities show up that cannot be computed exactly, we need some way of representing it (or approximating it). One way is to sum an infinite series. Recall that a n is the

More information

Basic Probability Reference Sheet

Basic Probability Reference Sheet February 27, 2001 Basic Probability Reference Sheet 17.846, 2001 This is intended to be used in addition to, not as a substitute for, a textbook. X is a random variable. This means that X is a variable

More information

Solving with Absolute Value

Solving with Absolute Value Solving with Absolute Value Who knew two little lines could cause so much trouble? Ask someone to solve the equation 3x 2 = 7 and they ll say No problem! Add just two little lines, and ask them to solve

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 6 Patrick Breheny University of Iowa to Biostatistics (BIOS 4120) 1 / 36 Our next several lectures will deal with two-sample inference for continuous

More information

MITOCW ocw f99-lec30_300k

MITOCW ocw f99-lec30_300k MITOCW ocw-18.06-f99-lec30_300k OK, this is the lecture on linear transformations. Actually, linear algebra courses used to begin with this lecture, so you could say I'm beginning this course again by

More information

CS1800: Strong Induction. Professor Kevin Gold

CS1800: Strong Induction. Professor Kevin Gold CS1800: Strong Induction Professor Kevin Gold Mini-Primer/Refresher on Unrelated Topic: Limits This is meant to be a problem about reasoning about quantifiers, with a little practice of other skills, too

More information

Exam #2 Results (as percentages)

Exam #2 Results (as percentages) Oct. 30 Assignment: Read Chapter 19 Try exercises 1, 2, and 4 on p. 424 Exam #2 Results (as percentages) Mean: 71.4 Median: 73.3 Soda attitudes 2015 In a Gallup poll conducted Jul. 8 12, 2015, 1009 adult

More information

Intermediate Algebra. Gregg Waterman Oregon Institute of Technology

Intermediate Algebra. Gregg Waterman Oregon Institute of Technology Intermediate Algebra Gregg Waterman Oregon Institute of Technology c August 2013 Gregg Waterman This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

More information

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size.

- measures the center of our distribution. In the case of a sample, it s given by: y i. y = where n = sample size. Descriptive Statistics: One of the most important things we can do is to describe our data. Some of this can be done graphically (you should be familiar with histograms, boxplots, scatter plots and so

More information

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge).

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge). Probability One of the most useful and intriguing aspects of quantum mechanics is the Heisenberg Uncertainty Principle. Before I get to it however, we need some initial comments on probability. Let s first

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimating with Confidence Section 8.3 The Practice of Statistics, 4 th edition For AP* STARNES, YATES, MOORE Chapter 8 Estimating with Confidence n 8.1 Confidence Intervals: The Basics n 8.2

More information

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics Originality in the Arts and Sciences: Lecture 2: Probability and Statistics Let s face it. Statistics has a really bad reputation. Why? 1. It is boring. 2. It doesn t make a lot of sense. Actually, the

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Introduction to Estimation. Martina Litschmannová K210

Introduction to Estimation. Martina Litschmannová K210 Introduction to Estimation Martina Litschmannová martina.litschmannova@vsb.cz K210 Populations vs. Sample A population includes each element from the set of observations that can be made. A sample consists

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 3: Inferences About Means Sample of Means: number of observations in one sample the population mean (theoretical mean) sample mean (observed mean) is the theoretical standard deviation of the population

More information

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests: One sided tests So far all of our tests have been two sided. While this may be a bit easier to understand, this is often not the best way to do a hypothesis test. One simple thing that we can do to get

More information

MITOCW ocw f99-lec01_300k

MITOCW ocw f99-lec01_300k MITOCW ocw-18.06-f99-lec01_300k Hi. This is the first lecture in MIT's course 18.06, linear algebra, and I'm Gilbert Strang. The text for the course is this book, Introduction to Linear Algebra. And the

More information

THE SAMPLING DISTRIBUTION OF THE MEAN

THE SAMPLING DISTRIBUTION OF THE MEAN THE SAMPLING DISTRIBUTION OF THE MEAN COGS 14B JANUARY 26, 2017 TODAY Sampling Distributions Sampling Distribution of the Mean Central Limit Theorem INFERENTIAL STATISTICS Inferential statistics: allows

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER /2018

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER /2018 ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 2017/2018 DR. ANTHONY BROWN 1. Arithmetic and Algebra 1.1. Arithmetic of Numbers. While we have calculators and computers

More information

(1) If Bush had not won the last election, then Nader would have won it.

(1) If Bush had not won the last election, then Nader would have won it. 24.221 Metaphysics Counterfactuals When the truth functional material conditional (or ) is introduced, it is normally glossed with the English expression If..., then.... However, if this is the correct

More information

STA Module 4 Probability Concepts. Rev.F08 1

STA Module 4 Probability Concepts. Rev.F08 1 STA 2023 Module 4 Probability Concepts Rev.F08 1 Learning Objectives Upon completing this module, you should be able to: 1. Compute probabilities for experiments having equally likely outcomes. 2. Interpret

More information

CS1800: Mathematical Induction. Professor Kevin Gold

CS1800: Mathematical Induction. Professor Kevin Gold CS1800: Mathematical Induction Professor Kevin Gold Induction: Used to Prove Patterns Just Keep Going For an algorithm, we may want to prove that it just keeps working, no matter how big the input size

More information

22. The Quadratic Sieve and Elliptic Curves. 22.a The Quadratic Sieve

22. The Quadratic Sieve and Elliptic Curves. 22.a The Quadratic Sieve 22. The Quadratic Sieve and Elliptic Curves 22.a The Quadratic Sieve Sieve methods for finding primes or for finding factors of numbers are methods by which you take a set P of prime numbers one by one,

More information

Section 5.4. Ken Ueda

Section 5.4. Ken Ueda Section 5.4 Ken Ueda Students seem to think that being graded on a curve is a positive thing. I took lasers 101 at Cornell and got a 92 on the exam. The average was a 93. I ended up with a C on the test.

More information

Multiple Regression Theory 2006 Samuel L. Baker

Multiple Regression Theory 2006 Samuel L. Baker MULTIPLE REGRESSION THEORY 1 Multiple Regression Theory 2006 Samuel L. Baker Multiple regression is regression with two or more independent variables on the right-hand side of the equation. Use multiple

More information

STARTING WITH CONFIDENCE

STARTING WITH CONFIDENCE STARTING WITH CONFIDENCE A- Level Maths at Budmouth Name: This booklet has been designed to help you to bridge the gap between GCSE Maths and AS Maths. Good mathematics is not about how many answers you

More information

Conservation of Momentum

Conservation of Momentum Learning Goals Conservation of Momentum After you finish this lab, you will be able to: 1. Use Logger Pro to analyze video and calculate position, velocity, and acceleration. 2. Use the equations for 2-dimensional

More information

PHYSICS 15a, Fall 2006 SPEED OF SOUND LAB Due: Tuesday, November 14

PHYSICS 15a, Fall 2006 SPEED OF SOUND LAB Due: Tuesday, November 14 PHYSICS 15a, Fall 2006 SPEED OF SOUND LAB Due: Tuesday, November 14 GENERAL INFO The goal of this lab is to determine the speed of sound in air, by making measurements and taking into consideration the

More information

Constant linear models

Constant linear models Constant linear models A constant linear model is a type of model that provides us with tools for drawing statistical inferences about means of random variables. Means of random variables are theoretical

More information

CH 59 SQUARE ROOTS. Every positive number has two square roots. Ch 59 Square Roots. Introduction

CH 59 SQUARE ROOTS. Every positive number has two square roots. Ch 59 Square Roots. Introduction 59 CH 59 SQUARE ROOTS Introduction W e saw square roots when we studied the Pythagorean Theorem. They may have been hidden, but when the end of a right-triangle problem resulted in an equation like c =

More information

Sequential Decision Problems

Sequential Decision Problems Sequential Decision Problems Michael A. Goodrich November 10, 2006 If I make changes to these notes after they are posted and if these changes are important (beyond cosmetic), the changes will highlighted

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Patrick Breheny March 1 Patrick Breheny STA 580: Biostatistics I 1/23 Kerrich s experiment A South African mathematician named John Kerrich was visiting Copenhagen in 1940 when

More information

(Refer Slide Time: 00:10)

(Refer Slide Time: 00:10) Chemical Reaction Engineering 1 (Homogeneous Reactors) Professor R. Krishnaiah Department of Chemical Engineering Indian Institute of Technology Madras Lecture No 10 Design of Batch Reactors Part 1 (Refer

More information