Confidence Intervals 1
|
|
- Reynard Wheeler
- 5 years ago
- Views:
Transcription
1 Confidence Intervals 1 November 1, HMS, 2017, v1.1
2 Chapter References Diez: Chapter 4.2 Navidi, Chapter 5.0, 5.1, (Self read, 5.2), 5.3, 5.4, 5.6, not 5.7, 5.8 Chapter References 2
3 Terminology Point Estimate A sample statistic used to estimate the value of a population parameter. 1. Provides a single value: Based on observations from 1 sample, there is no sampling distribution. 2. Information: Gives no information about how close the value is to the unknown population parameter. 3. Example: Sample mean x is the point estimate of the unknown population mean. Confidence Intervals 3
4 Terminology Confidence interval (interval estimate) A range of values defined by the confidence level within which the population parameter is estimated to fall. 1. Provides a range of values. 2. Information: Gives information about closeness to unknown population parameter 3. Example: Unknown population mean lies between 50 and 70 with 95% confidence Confidence Level The likelihood, expressed as a percentage or a probability, that a specified interval will contain the population parameter. Confidence Intervals 4
5 Interval Estimation A probability that the population parameter falls somewhere within the interval. µ x = X ± error Confidence Intervals 5
6 Error Bars We ve all seen errors bars: One standard deviation from the mean of nine independently evolved populations. Designing and engineering evolutionary robust genetic circuits Journal of Biological Engineering 2010, 4:12 Confidence Intervals 6
7 Error Bars Error Bar Type Description Formula Range Descriptive Distance between extremes Highest and of data lowest points Standard Deviation Descriptive Average deviation SD from the mean Standard Error Inferential Variability of the mean SD/ n Confidence Limit Inferential Range of values you can be 95% confidence contains the true mean To be determined Confidence Intervals 7
8 Confidence Limits Recall what the standard error means: The standard error gives the standard deviation of the distribution of the mean. We are 68% confident that the mean is within the limits ±SE. But we could do better, we could widen the limits to 95%. Confidence Intervals 8
9 Confidence Limits The standard error of the mean can be interpreted as: If we were to take another sample from the population and compute its mean, there is a 68% chance that the mean of the sample will lie within 1 standard error of the population mean. But 68% is not that big, better to use a wider range - a given range is referred to as the confidence level. Confidence Intervals 9
10 Confidence Limits Confidence Level: The likelihood, expressed as a percentage or a probability, that a specified interval will contain the population parameter. 95% confidence level there is a 0.95 probability that a specified interval does contain the population mean. In other words, there are 5 chances out of 100 (or 1 chance out of 20) that the interval does not contain the population mean. Two σ 99% confidence level there is 1 chance out of 100 that the interval does not contains the population mean. Three σ Confidence Intervals 10
11 Confidence Limits Confidence Intervals 11
12 Confidence Limits How far out is 95% of the area on a normal curve? This means there is a 2.5% area on both sides of the normal curve. Let α = 0.05 i.e 5% Confidence Intervals 12
13 Confidence Limits To find the z-value we look up 97.5 in the standard normal cumulative probability table and we get 1.96 Confidence Intervals 13
14 Confidence Limits z Value Percentage Area % % % % Confidence Intervals 14
15 Confidence Limits In other words, to 1.96 on the z scale represents 95% of the area: or (and this is the critical point) 95% is bounded by ±1.96 SE Confidence Intervals 15
16 Confidence Limits We can therefore state that for 95% of the time, the mean will be bounded by: µ ± 1.96 σ n However, we don t actually have the population mean, µ, or the population standard deviation, σ. Instead we use the sample mean and standard deviation as proxies: x ± 1.96 s n Hence the limits are actually only approximate. Moreover the approximation get worse as the sample size gets smaller. Confidence Intervals 16
17 Confidence Limits You ll also see the confidence level expressed as: z α/2 for 95% confidence, 1.96 z α/3 for 99.7% confidence, 3 Note: The 99.0% level is actually bounded by The z score changes rapidly at the limits of the normal curve. Confidence Intervals 17
18 Exercise Find the z limits for a 90% confidence level. Confidence Intervals 18
19 Example The mean birth weights for 200 babies is 3.28 Kg grams with a population standard deviation of 0.85 Kg. Compute the 95% confidence limits for the mean birth weight. x = 3.28 ± = 3.28 ± 0.12 Kg Confidence Intervals 19
20 Example The mean concentration for a sample of 100 insulin vials is 15 grams/vial with a population standard deviation of 3.4 grams. Compute the 90% confidence limits for the mean concentration of insulin. x = 15 ± = 15 ± 0.56 grams/vial Confidence Intervals 20
21 Summary Large sample confidence interval for a population mean: General Formula: x ± (z critical value) σ n Levels of confidence and corresponding z critical value: 99% % % Since n is large the unknown σ can be replaced by the sample value s: x ± (z critical value) s n Confidence Intervals 21
22 Problems in Paradise You may have noticed in the previous examples that the samples were relatively large. This was to ensure that the means and standard deviations were reasonable representatives of the population measures. In fact the problems stated that the standard deviation was in fact the population standard deviation. For small samples 30 we have to make a slight modification to the procedure. Confidence Intervals 22
23 Student s t Distribution For large samples we assume that the means are distributed as: X N(µ, σ 2 /n) and that the standardized distribution (X µ)/(σ/ n) has a normal distribution with mean 0 and variance 1. However, when the sample size is small, there is significant error in the estimate for the population standard deviation, σ, because we will often estimate the population standard deviation from the sample standard deviation, s. In 1908 Gossett proposed that the quantity (X µ)/(s/ n) was in fact distributed via a different distribution which he called the Student s t distribution. Confidence Intervals 23
24 Student s t Distribution Definition: Let X 1,... be a small sample (n 30) from a normal population with mean µ. Then the quantity: X µ s/ n has a Student s t distribution with n 1 degrees of freedom, denoted, t n 1 Note: The t distribution is a function of the sample size minus 1. Confidence Intervals 24
25 Confidence Limits For small sample sizes, use the t distribution instead of the standard normal distribution. The t distribution is a symmetrical distribution whose probability density function is defined by a single parameter known as the degrees of freedom (df). Example, if the sample size if 19, then the degrees of freedom will be 18. Confidence Intervals 25
26 Confidence Limits A larger portion of the probability area is in the tails compare to the standard normal distribution. This in turn means the confidence limits computed using the t distribution will be larger. Confidence Intervals 26
27 Confidence Limits for Small Samples The confidence limits for a small sample is given by: s X ± t n 1,α/2 n where n is the sample size, n 1 the degrees of freedom, α/2 the confidence level (eg 0.05/2 = 95%) Just as there are z tables there are also t tables Confidence Intervals 27
28 t Tables Confidence Intervals 28
29 t Tables Confidence Intervals 29
30 t Tables: One Tailed and Two Tailed Tables Confidence Intervals 30
31 t Tables: How to use the t Table Confidence Intervals 31
32 t Tables: Example Six vials of penicillin were randomly selected and the concentration of penicillin was determined in each vial in mg/ml to be: 8.6, 9.7, 13.4, 11.4, 10.2, 12.3 Find the 95% confidence limits for the true mean concentration of penicillin. Confidence Intervals 32
33 t Tables: Example Since we are dealing with less than a sample size of 30, we will use the t-statistic to determine the confidence limits. n = 6 x = Sample standard deviation = 1.77 X is the random variable that represents the mean. s X ± t n 1,α/2 n Confidence Intervals 33
34 t Tables: Example n 1 = 6 1 = 5 α/2 = 0.05/2 = Note: α is the area outside the critical bounds = 0.05 In the t table we will look for row 5, and the column marked Note that 0l025 is the areas of a single tail therefore we will use the single tailed table. Confidence Intervals 34
35 t Tables: Example Therefore: t 5,α/2=0.025 = µ = ± = ± 1.86 For the z-statistic the critical value would be 1.96, therefore the range has widened with the t-statistic. Confidence Intervals 35
36 Confidence Intervals for Difference between two Means Consider two samples from two different populations. What is the confidence limit for the difference in the two means? i.e µ X µ Y From previous lectures on combining means and variances we know that: X Y N(µ X µ Y, σ 2 X + σ 2 Y ) That is the difference is also normally distributed but with different mean and variance. Confidence Intervals 36
37 Confidence Intervals for Difference between two Means Consider a mean X and Y that have standard errors: σ X n1 and σ Y n2 where n 1 and n 2 are the sizes of the corresponding samples. Then the difference X Y : Mean = µ X µ Y and variance σ 2 X n1 + σ2 Y n2 Given the new variance and mean we can compute the 95% confidence limit as: σx 2 µ X µ Y ± σ2 Y n1 n2 Confidence Intervals 37
38 Confidence Intervals for Difference between two Means If the samples are small then the confidence limits for the difference in two means requires the use of the t distribution as before. The main complication is the calculation of the degrees of freedom but its difficult to do. I refer you to section 5.6 in Navidi for details. Confidence Intervals 38
39 Using Simulation to Estimate Confidence Limits All the methods so far assume the sample is obtained from a population that is normal or near normal. What happens if the population is not normal? Confidence Intervals 39
40 Using Simulation to Estimate Confidence Limits Consider a series of gene expression rates measures from seven cultures of E. coli. The data are as follows: 7.69, 4.97, 4.56, 6.49, 4.34, 6.24, 4.45 Using the following code a norm Q-Q or probability plot was made: import numpy as np import pylab import scipy.stats as stats measurements = [7.69, 4.97, 4.56, 6.49, 4.34, 6.24, 4.45 ] stats.probplot(measurements, dist="norm", plot=pylab) pylab.show() Confidence Intervals 40
41 Using Simulation to Estimate Confidence Limits The data does not appear to be normally distributed. Confidence Intervals 41
42 Using Simulation to Estimate Confidence Limits To find 95% confidence intervals for this data we must create synthetic data sets using a Bootstrap. We will create a new sample by drawing at random (and with replacement) values from the measured sample. For example the following could be a new synthetic sample: 6.49, 4.97, 4.34, 6.24, 7.69, 6.49, 4.34 Because of replacement it is possible we could pick the same value multiple times. We do this 100,000 times in order to create 100,000 synthetic data sets. We compute the mean for each synthetic data set to generate a population of means. We can use this population to work out the distribution and hence the confidence limits on the mean. Confidence Intervals 42
43 Using Simulation to Estimate Confidence Limits Sort the sample means from low to high Find the 2.5% and 97.5% percentiles. The interval will contain 95% of the data. These calculations can be easily done in Python. Confidence Intervals 43
44 Using Simulation to Estimate Confidence Limits import numpy as np import scipy.stats as stats import random measurements = [7.69, 4.97, 4.56, 6.49, 4.34, 6.24, 4.45 ] ensemble = [] for i in range (100000): sample = [] for j in range (7): sample.append (measurements [random.randint (0,6)]) ensemble.append (np.mean (sample)) np.sort (ensemble) p1 = np.percentile(ensemble, 2.5) p2 = np.percentile(ensemble, 97.5) print p1, p2 Confidence Intervals 44
45 Using Simulation to Estimate Confidence Limits Running the python script yields the values: That is the 95% confidence limits on the gene expression is: 4.72 to 6.46 Confidence Intervals 45
Review. One-way ANOVA, I. What s coming up. Multiple comparisons
Review One-way ANOVA, I 9.07 /15/00 Earlier in this class, we talked about twosample z- and t-tests for the difference between two conditions of an independent variable Does a trial drug work better than
More informationSTAT 135 Lab 5 Bootstrapping and Hypothesis Testing
STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,
More informationTwo-Sample Inferential Statistics
The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is
More informationThe t-statistic. Student s t Test
The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationRegression Analysis: Basic Concepts
The simple linear model Regression Analysis: Basic Concepts Allin Cottrell Represents the dependent variable, y i, as a linear function of one independent variable, x i, subject to a random disturbance
More informationCOSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan
COSC 341 Human Computer Interaction Dr. Bowen Hui University of British Columbia Okanagan 1 Last Topic Distribution of means When it is needed How to build one (from scratch) Determining the characteristics
More informationCS 5014: Research Methods in Computer Science. Bernoulli Distribution. Binomial Distribution. Poisson Distribution. Clifford A. Shaffer.
Department of Computer Science Virginia Tech Blacksburg, Virginia Copyright c 2015 by Clifford A. Shaffer Computer Science Title page Computer Science Clifford A. Shaffer Fall 2015 Clifford A. Shaffer
More informationQuestions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.
Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized
More informationBusiness Statistics. Lecture 5: Confidence Intervals
Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean
More informationProbability and Statistics Notes
Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline
More informationACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability
ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER 2 2017/2018 DR. ANTHONY BROWN 5.1. Introduction to Probability. 5. Probability You are probably familiar with the elementary
More informationDistribution-Free Procedures (Devore Chapter Fifteen)
Distribution-Free Procedures (Devore Chapter Fifteen) MATH-5-01: Probability and Statistics II Spring 018 Contents 1 Nonparametric Hypothesis Tests 1 1.1 The Wilcoxon Rank Sum Test........... 1 1. Normal
More informationHypothesis Testing: Chi-Square Test 1
Hypothesis Testing: Chi-Square Test 1 November 9, 2017 1 HMS, 2017, v1.0 Chapter References Diez: Chapter 6.3 Navidi, Chapter 6.10 Chapter References 2 Chi-square Distributions Let X 1, X 2,... X n be
More information7 Estimation. 7.1 Population and Sample (P.91-92)
7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for
More informationSingle Sample Means. SOCY601 Alan Neustadtl
Single Sample Means SOCY601 Alan Neustadtl The Central Limit Theorem If we have a population measured by a variable with a mean µ and a standard deviation σ, and if all possible random samples of size
More informationIV. The Normal Distribution
IV. The Normal Distribution The normal distribution (a.k.a., the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact
More informationSampling and estimation theories
Chapter 66 Sampling and estimation theories 66.1 Introduction The concepts of elementary sampling theory and estimation theories introduced in this chapter will provide the basis for a more detailed study
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationStatistical Inference for Means
Statistical Inference for Means Jamie Monogan University of Georgia February 18, 2011 Jamie Monogan (UGA) Statistical Inference for Means February 18, 2011 1 / 19 Objectives By the end of this meeting,
More informationLecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3
More informationSTAT 200 Chapter 1 Looking at Data - Distributions
STAT 200 Chapter 1 Looking at Data - Distributions What is Statistics? Statistics is a science that involves the design of studies, data collection, summarizing and analyzing the data, interpreting the
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationThis does not cover everything on the final. Look at the posted practice problems for other topics.
Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry
More informationThe independent-means t-test:
The independent-means t-test: Answers the question: is there a "real" difference between the two conditions in my experiment? Or is the difference due to chance? Previous lecture: (a) Dependent-means t-test:
More informationConfidence Intervals. - simply, an interval for which we have a certain confidence.
Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something
More informationIV. The Normal Distribution
IV. The Normal Distribution The normal distribution (a.k.a., a the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact
More informationConfidence intervals
Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More information(right tailed) or minus Z α. (left-tailed). For a two-tailed test the critical Z value is going to be.
More Power Stuff What is the statistical power of a hypothesis test? Statistical power is the probability of rejecting the null conditional on the null being false. In mathematical terms it is ( reject
More informationAn inferential procedure to use sample data to understand a population Procedures
Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors
More informationBusiness Statistics: A Decision-Making Approach, 6e. Chapter Goals
Chapter 4 Student Lecture Notes 4-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 4 Using Probability and Probability Distributions Fundamentals of Business Statistics Murali Shanker
More informationa table or a graph or an equation.
Topic (8) POPULATION DISTRIBUTIONS 8-1 So far: Topic (8) POPULATION DISTRIBUTIONS We ve seen some ways to summarize a set of data, including numerical summaries. We ve heard a little about how to sample
More informationMultiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions
Introduction to Analysis of Variance 1 Experiments with More than 2 Conditions Often the research that psychologists perform has more conditions than just the control and experimental conditions You might
More informationMATH Chapter 21 Notes Two Sample Problems
MATH 1070 - Chapter 21 Notes Two Sample Problems Recall: So far, we have dealt with inference (confidence intervals and hypothesis testing) pertaining to: Single sample of data. A matched pairs design
More informationPSYC 331 STATISTICS FOR PSYCHOLOGISTS
PSYC 331 STATISTICS FOR PSYCHOLOGISTS Session 4 A PARAMETRIC STATISTICAL TEST FOR MORE THAN TWO POPULATIONS Lecturer: Dr. Paul Narh Doku, Dept of Psychology, UG Contact Information: pndoku@ug.edu.gh College
More informationContent by Week Week of October 14 27
Content by Week Week of October 14 27 Learning objectives By the end of this week, you should be able to: Understand the purpose and interpretation of confidence intervals for the mean, Calculate confidence
More informationSupporting Australian Mathematics Project. A guide for teachers Years 11 and 12. Probability and statistics: Module 25. Inference for means
1 Supporting Australian Mathematics Project 2 3 4 6 7 8 9 1 11 12 A guide for teachers Years 11 and 12 Probability and statistics: Module 2 Inference for means Inference for means A guide for teachers
More informationLast two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals
Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last two weeks: Sample, population and sampling
More informationIntroduction to Statistical Data Analysis Lecture 5: Confidence Intervals
Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1
More informationStatistical Intervals (One sample) (Chs )
7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and
More informationSection 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples.
Section 9 2B:!! Using Confidence Intervals to Estimate the Difference ( µ 1 µ 2 ) in Two Population Means using Two Independent Samples Requirements 1.A random sample of each population is taken. The sample
More informationConfidence intervals CE 311S
CE 311S PREVIEW OF STATISTICS The first part of the class was about probability. P(H) = 0.5 P(T) = 0.5 HTTHHTTTTHHTHTHH If we know how a random process works, what will we see in the field? Preview of
More informationStudent s t-distribution. The t-distribution, t-tests, & Measures of Effect Size
Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling
More informationChapter 4. Displaying and Summarizing. Quantitative Data
STAT 141 Introduction to Statistics Chapter 4 Displaying and Summarizing Quantitative Data Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 31 4.1 Histograms 1 We divide the range
More informationSamples and Populations Confidence Intervals Hypotheses One-sided vs. two-sided Statistical Significance Error Types. Statistiek I.
Statistiek I Sampling John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/41 Overview 1 Samples and Populations 2 Confidence Intervals 3 Hypotheses
More informationLinear Regression. In this lecture we will study a particular type of regression model: the linear regression model
1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor
More informationStatistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.
Statistics 1 Mathematical Model A mathematical model is a simplification of a real world problem. 1. A real world problem is observed. 2. A mathematical model is thought up. 3. The model is used to make
More informationStatistics 224 Solution key to EXAM 2 FALL 2007 Friday 11/2/07 Professor Michael Iltis (Lecture 2)
NOTE : For the purpose of review, I have added some additional parts not found on the original exam. These parts are indicated with a ** beside them Statistics 224 Solution key to EXAM 2 FALL 2007 Friday
More informationMath 10 - Compilation of Sample Exam Questions + Answers
Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the
More informationZ score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound
1 EDUR 8131 Chat 3 Notes 2 Normal Distribution and Standard Scores Questions Standard Scores: Z score Z = (X M) / SD Z = deviation score divided by standard deviation Z score indicates how far a raw score
More informationStatistical Inference
Chapter 14 Confidence Intervals: The Basic Statistical Inference Situation: We are interested in estimating some parameter (population mean, μ) that is unknown. We take a random sample from this population.
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationLast week: Sample, population and sampling distributions finished with estimation & confidence intervals
Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last week: Sample, population and sampling
More informationSurvey of Smoking Behavior. Samples and Elements. Survey of Smoking Behavior. Samples and Elements
s and Elements Units are Same as Elementary Units Frame Elements Analyzed as a binomial variable 9 Persons from, Frame Elements N =, N =, n = 9 Analyzed as a binomial variable HIV+ HIV- % population smokes
More informationDescriptive Statistics
Descriptive Statistics Once an experiment is carried out and the results are measured, the researcher has to decide whether the results of the treatments are different. This would be easy if the results
More informationMultiple Regression Analysis
Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators
More informationToday - SPSS and standard error - End of Midterm 1 exam material - T-scores
Today - SPSS and standard error - End of Midterm 1 exam material - T-scores Previously, on StatsClass: The standard error is a measure of the typical amount that that a sample mean will be off from the
More informationChapter 4: An Introduction to Probability and Statistics
Chapter 4: An Introduction to Probability and Statistics 4. Probability The simplest kinds of probabilities to understand are reflected in everyday ideas like these: (i) if you toss a coin, the probability
More informationCHAPTER 6 SOME CONTINUOUS PROBABILITY DISTRIBUTIONS. 6.2 Normal Distribution. 6.1 Continuous Uniform Distribution
CHAPTER 6 SOME CONTINUOUS PROBABILITY DISTRIBUTIONS Recall that a continuous random variable X is a random variable that takes all values in an interval or a set of intervals. The distribution of a continuous
More informationEcon 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 12th Class 6/23/10
Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis 12th Class 6/23/10 In God we trust, all others must use data. --Edward Deming hand out review sheet, answer, point to old test,
More informationSurvey of Smoking Behavior. Survey of Smoking Behavior. Survey of Smoking Behavior
Sample HH from Frame HH One-Stage Cluster Survey Population Frame Sample Elements N =, N =, n = population smokes Sample HH from Frame HH Elementary units are different from sampling units Sampled HH but
More informationNotes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing
Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing 1. Purpose of statistical inference Statistical inference provides a means of generalizing
More informationInference for Single Proportions and Means T.Scofield
Inference for Single Proportions and Means TScofield Confidence Intervals for Single Proportions and Means A CI gives upper and lower bounds between which we hope to capture the (fixed) population parameter
More informationChapter 8 - Statistical intervals for a single sample
Chapter 8 - Statistical intervals for a single sample 8-1 Introduction In statistics, no quantity estimated from data is known for certain. All estimated quantities have probability distributions of their
More informationFREQUENCY DISTRIBUTIONS AND PERCENTILES
FREQUENCY DISTRIBUTIONS AND PERCENTILES New Statistical Notation Frequency (f): the number of times a score occurs N: sample size Simple Frequency Distributions Raw Scores The scores that we have directly
More informationCorrelation 1. December 4, HMS, 2017, v1.1
Correlation 1 December 4, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 7 Navidi, Chapter 7 I don t expect you to learn the proofs what will follow. Chapter References 2 Correlation The sample
More informationLecture 12: Small Sample Intervals Based on a Normal Population Distribution
Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)
More informationBusiness Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee
Business Analytics and Data Mining Modeling Using R Prof. Gaurav Dixit Department of Management Studies Indian Institute of Technology, Roorkee Lecture - 04 Basic Statistics Part-1 (Refer Slide Time: 00:33)
More informationConfidence Intervals with σ unknown
STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence
More informationUNIVERSITY OF TORONTO MISSISSAUGA. SOC222 Measuring Society In-Class Test. November 11, 2011 Duration 11:15a.m. 13 :00p.m.
UNIVERSITY OF TORONTO MISSISSAUGA SOC222 Measuring Society In-Class Test November 11, 2011 Duration 11:15a.m. 13 :00p.m. Location: DV2074 Aids Allowed You may be charged with an academic offence for possessing
More informationNote that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).
Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something
More informationII. The Normal Distribution
II. The Normal Distribution The normal distribution (a.k.a., a the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact
More informationLecture 30. DATA 8 Summer Regression Inference
DATA 8 Summer 2018 Lecture 30 Regression Inference Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Contributions by Fahad Kamran (fhdkmrn@berkeley.edu) and
More informationChapter 2: Tools for Exploring Univariate Data
Stats 11 (Fall 2004) Lecture Note Introduction to Statistical Methods for Business and Economics Instructor: Hongquan Xu Chapter 2: Tools for Exploring Univariate Data Section 2.1: Introduction What is
More informationStat 427/527: Advanced Data Analysis I
Stat 427/527: Advanced Data Analysis I Review of Chapters 1-4 Sep, 2017 1 / 18 Concepts you need to know/interpret Numerical summaries: measures of center (mean, median, mode) measures of spread (sample
More informationSTATISTICS INDEX NUMBER
NAME SCHOOL INDEX NUMBER DATE STATISTICS KCSE 1989 2012 Form 4 Mathematics Answer all the questions 1. 1989 Q12 P1 The table below shows the defective bolts from 40 samples No. of detective 0 1 2 3 4 5
More informationThe Normal Distribution (Pt. 2)
Chapter 5 The Normal Distribution (Pt 2) 51 Finding Normal Percentiles Recall that the Nth percentile of a distribution is the value that marks off the bottom N% of the distribution For review, remember
More informationLecture 11. Data Description Estimation
Lecture 11 Data Description Estimation Measures of Central Tendency (continued, see last lecture) Sample mean, population mean Sample mean for frequency distributions The median The mode The midrange 3-22
More informationCIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8
CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval
More informationDiscrete Distributions: Poisson Distribution 1
Discrete Distributions: Poisson Distribution 1 November 6, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 3.3, 3.4 (not 3.4.2), 3.5.2 Navidi, Chapter 4.1, 4.2, 4.3 Chapter References 2 Poisson
More information11: Comparing Group Variances. Review of Variance
11: Comparing Group Variances Review of Variance Parametric measures of variability are often based on sum of squares (SS) around e mean: (1) For e data set {3, 4, 5, 8}, = 5 and SS = (3 5) + (4 5) + (5
More informationReview of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More informationHYPOTHESIS TESTING. Hypothesis Testing
MBA 605 Business Analytics Don Conant, PhD. HYPOTHESIS TESTING Hypothesis testing involves making inferences about the nature of the population on the basis of observations of a sample drawn from the population.
More informationMAT2377. Rafa l Kulik. Version 2015/November/23. Rafa l Kulik
MAT2377 Rafa l Kulik Version 2015/November/23 Rafa l Kulik Rafa l Kulik 1 Rafa l Kulik 2 Rafa l Kulik 3 Rafa l Kulik 4 The Z-test Test on the mean of a normal distribution, σ known Suppose X 1,..., X n
More informationUsing the z-table: Given an Area, Find z ID1050 Quantitative & Qualitative Reasoning
Using the -Table: Given an, Find ID1050 Quantitative & Qualitative Reasoning between mean and beyond 0.0 0.000 0.500 0.1 0.040 0.460 0.2 0.079 0.421 0.3 0.118 0.382 0.4 0.155 0.345 0.5 0.192 0.309 0.6
More informationDART_LAB Tutorial Section 5: Adaptive Inflation
DART_LAB Tutorial Section 5: Adaptive Inflation UCAR 14 The National Center for Atmospheric Research is sponsored by the National Science Foundation. Any opinions, findings and conclusions or recommendations
More informationLecture on Null Hypothesis Testing & Temporal Correlation
Lecture on Null Hypothesis Testing & Temporal Correlation CS 590.21 Analysis and Modeling of Brain Networks Department of Computer Science University of Crete Acknowledgement Resources used in the slides
More informationFin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals
Fin285a:Computer Simulations and Risk Assessment Section 2.3.2:Hypothesis testing, and Confidence Intervals Overview Hypothesis testing terms Testing a die Testing issues Estimating means Confidence intervals
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More informationPart 3: Parametric Models
Part 3: Parametric Models Matthew Sperrin and Juhyun Park August 19, 2008 1 Introduction There are three main objectives to this section: 1. To introduce the concepts of probability and random variables.
More informationCounting principles, including permutations and combinations.
1 Counting principles, including permutations and combinations. The binomial theorem: expansion of a + b n, n ε N. THE PRODUCT RULE If there are m different ways of performing an operation and for each
More informationLecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown
Lecture 17: Small-Sample Inferences for Normal Populations Confidence intervals for µ when σ is unknown If the population distribution is normal, then X µ σ/ n has a standard normal distribution. If σ
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationPOLI 443 Applied Political Research
POLI 443 Applied Political Research Session 4 Tests of Hypotheses The Normal Curve Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College
More informationChapter 5 Confidence Intervals
Chapter 5 Confidence Intervals Confidence Intervals about a Population Mean, σ, Known Abbas Motamedi Tennessee Tech University A point estimate: a single number, calculated from a set of data, that is
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationy = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output
12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear
More information10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing
& z-test Lecture Set 11 We have a coin and are trying to determine if it is biased or unbiased What should we assume? Why? Flip coin n = 100 times E(Heads) = 50 Why? Assume we count 53 Heads... What could
More informationp = q ˆ = 1 -ˆp = sample proportion of failures in a sample size of n x n Chapter 7 Estimates and Sample Sizes
Chapter 7 Estimates and Sample Sizes 7-1 Overview 7-2 Estimating a Population Proportion 7-3 Estimating a Population Mean: σ Known 7-4 Estimating a Population Mean: σ Not Known 7-5 Estimating a Population
More information