COMPARING GROUPS PART 1CONTINUOUS DATA
|
|
- Valentine Terry
- 5 years ago
- Views:
Transcription
1 COMPARING GROUPS PART 1CONTINUOUS DATA Min Chen, Ph.D. Assistant Professor Quantitative Biomedical Research Center Department of Clinical Sciences Bioinformatics Shared Resource Simmons Comprehensive Cancer Center Lecture 4 July 9, 2013 Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 1 / 38 OUTLINE 1 REVIEW 2 INTRODUCTION 3 COMPARISON OF TWO GROUPS Parametric tests Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 2 / 38
2 REVIEW: (1 α)% CONFIDENCE INTERVAL OF THE MEAN Lower Limit : L = X z α/2 s n Standard Normal Distribution: µ = 0,σ = 1 Upper Limit : U = X + z α/2 s n Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 3 / 38 REVIEW OF CONFIDENCE INTERVAL FROM SMALL SAMPLE As a rule of thumb, if sample size, N < 30, use the formula below. (1 α)% Confidence Interval: X ± t α/2,n 1 s n where t α/2 is the (α/2)th quantile of the t-distribution with (n -1) degrees of freedom. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 4 / 38
3 REVIEW: INTERPRETATION OF CI The CI: Pr(L(X) θ U(X)) = 1 α. It is temping to state the probability that the θ lies between two numbers, L and U, is (1 α). Wrong because θ is a fixed number; L(X) and U(X) are random variables, not numbers. On average 95% times the calculated intervals will contain the true population parameter θ. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 5 / 38 RELATIONSHIP BETWEEN TYPE I ERROR (α) AND POWER Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 6 / 38
4 PARAMETRIC VS NON-PARAMETRIC Parametric tests Assume data follow some known distribution E.g., Normal, t-distribution, chi-square, Binomial distribution etc. Compare means, variances Non-parametric tests Don t assume a form of distribution Compare other measures of central tendency (e.g., median, or location shift) Useful for skewed data, small samples, ordinal data Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 7 / 38 NOTATION Population parameter Sample value Mean µ X Standard deviation σ s Variance σ 2 s 2 Sample Size n Sample Value x i Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 8 / 38
5 ONE SAMPLE t TEST Recall one - sample t-test: t = X µ 0 s/ n Test statistic for comparing the mean of one group against a fixed value. General form of a t-statistic is t = T-statistic follows a t-distribution! difference of means. standard error Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 9 / 38 STUDENT S t-distribution Here is how to generate a Student s t random variable: T ν = Z, V/ν where Z is a standard normal distribution; V has a chi-squared distribution with ν degrees of freedom (df), i.e., V = ν Zi 2 i=1 where Z i are iid standard normal r.v. s. (Recall E[Zi 2 ]=1. So E[V]=ν.) Z and V are independent. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 10 / 38
6 t A FAMILY OF DISTRIBUTIONS IDENTIFIED BY df Recall t = X µ 0 s/ n = ( X µ 0 )/ σ n. s 2 /σ 2 Approaches Normal distribution as df increases. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 11 / 38 SMALL SAMPLE VS. LARGE SAMPLE Recall in CI, as a rule of thumb, if sample size n < 30, use the t statistic for the (1 α)% confidence Interval: while for large samples we have X ± t α/2,n 1 s n X ± z α/2 s n. The reason is when sample size is large, t α/2,n 1 z α/2. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 12 / 38
7 OUTLINE 1 REVIEW 2 INTRODUCTION 3 COMPARISON OF TWO GROUPS Parametric tests Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 13 / 38 COMPARING MEANS OF PAIRED SAMPLES In paired samples each data point in one sample is matched to another data point in the second sample. Same subject Measured at 2 time points Before and after intervention Two eyes (Left, Right) Two organs (Heart, Liver) Matched subjects Experimental animal, Pair-fed Match Male, Female Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 14 / 38
8 COMPARING MEANS OF TWO INDEPENDENT SAMPLES Two independent samples Subjects are unrelated in two separate groups; Sample sizes may be different in each group, (n 1,n 2 ) Variances in each group may be Equal, σ 2 1 = σ 2 2 Unequal, σ 2 1 = σ 2 2 Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 15 / 38 EXAMPLE 1 In a hypertension research study, subjects are given dietary counseling to restrict their sodium intake. Data on urinary sodium from 8 subjects at Baseline (Week 0), and Week 1, are shown. Subject Week 0 Week 1 Change X s Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 16 / 38
9 EXAMPLE 1(CONTD.) Subject Week 0 Week 1 Change X s Q1:Paired samples or two independent samples? Q2: Is there a change in mean levels of urinary sodium after 1 week? Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 17 / 38 PAIRED t-test Example 1 has paired sample data (since same subject was measured at two time points). Compute the mean and standard deviations of differences. H 0 : µ 1 µ 2 = c vs. H a : µ 1 µ 2 = c X t = d c s d / n, which follows a t-distribution with (n 1) degreesoffreedom. If t > tn 1 (1 α/2), reject H 0. Here tn 1 (1 α/2) is the (1 α/2) quantile of T n 1. P value = Pr(T n 1 > t ). Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 18 / 38
10 REJECTION REGIONS Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 19 / 38 PAIRED T-TEST USING EXCEL EXAMPLE 1 Values shown in bold red have been modified from original data. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 20 / 38
11 EXAMPLE 2 A study was performed to compare the mean ERG (electroretinogram) amplitude of patients with different genetic types of retinitis pigmentosa (RP), a genetic eye disease that often results in blindness. Data was collected in patients of age years with different genetic types. Genetic type Mean ± SD N Dominant 0.85 ± Recessive 0.38 ± Table shows values for natural log of ERG. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 21 / 38 EXAMPLE 2(CONTD.) Q1:Paired samples or two independent samples? Q2: Is there a difference in mean log(erg) amplitude between patients with dominant RP versus those with the recessive form? Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 22 / 38
12 TWO-SAMPLE t-test WITH EQUAL VARIANCES Example 2 has two independent samples. H 0 : µ 1 = µ 2 vs. H a : µ 1 = µ 2 X t = 1 X 2, s 1 p n n 2 which follows a t-distribution with (n 1 + n 2 2) degreesoffreedom,where is the pooled variance. s 2 p = (n 1 1)s 2 1 +(n 2 1)s 2 2 n 1 + n 2 2 If t > t n 1 +n 2 2 (1 α/2), reject H 0. P value = Pr(T n 1 > t ). Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 23 / 38 TWO-SAMPLE t-test FOR EQUAL VARIANCES USING EXCEL EXAMPLE 2 Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 24 / 38
13 COMPARING VARIANCES In Example 2, the two-sample t-test for independent samples assumed that variances were equal Variance of Group 1 = Variance of Group 2 Note that σ 2 1 = = and σ 2 2 = = Is equal variance assumption true? Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 25 / 38 COMPARING VARIANCES To compare variances, we conduct a hypothesis test to exam if the ratio of variances is equal to 1. H 0 : σ 2 1 σ 2 2 which follows an F-distribution. = 1 vs. H a : σ 2 1 σ 2 2 Test statistic: f = s2 1 s 2, 2 = 1 Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 26 / 38
14 F-DISTRIBUTION Here is how to generate a F random variable: where F ν1,ν 2 = V 1/ν 1 V 2 /ν 2, V 1 and V 2 have chi-squared distributions with ν 1 and ν 2 degrees of freedom (df), respectively. V 1 and V 2 are independent. Recall E[V]=ν. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 27 / 38 F-DISTRIBUTION F-distribution is a family of distributions that are identified by numerator and denominator degrees of freedom (df). F-distribution are always right-skewed; Have numerator and denominator df. Recall f = s2 1 s 2 = s2 1 /σ s 2 2 /σ 2 2. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 28 / 38
15 REJECTION REGIONS FOR THE F-TEST Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 29 / 38 F-TEST FOR COMPARING VARIANCES H 0 : σ 2 1 σ 2 2 = 1 vs. H a : σ 2 1 σ 2 2 Test statistic: f = s2 1 s 2, 2 = 1 which follows an F-distribution with (n 1 1,n 2 1) degreesoffreedom. If f > F n1 1,n 2 1(1 α/2) or f < F n1 1,n 2 1(α/2), Reject H 0. If f 1, then P value = 2 Pr(F n1 1,n 2 1 > f ); If f < 1, then P value = 2 Pr(F n1 1,n 2 1 < f ). Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 30 / 38
16 F-TEST FOR EQUALITY OF VARIANCES USING EXCEL EXAMPLE 2 Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 31 / 38 TWO-SAMPLE t-test WITH UNEQUAL VARIANCES H 0 : µ 1 = µ 2 vs. H a : µ 1 = µ 2 X t = 1 X 2, s 2 1 n 1 + s2 2 n 2 which follows a t-distribution with d degrees of freedom, where s d 2 = 1 /n 1 + s 2 2 /n 2 2. s 2 1 /n 1 n s2 2 /n 2 n 2 1 Round d down to nearest integer and call it d. If t > t d (1 α/2), reject H 0. P value = Pr(T d > t ). Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 32 / 38
17 EXAMPLE 3 Aresearchstudyaimedtoassessthefamilialaggregationofcholesterol levels by collecting data on children of age 2- to 14-years. Cholesterol levels (mg/dl) were collected in one group of children (say, cases ) whose father died from heart disease. Data were also collected in historical control group of children of same age. Group Mean ± SD N Cases ± Historical Control ± Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 33 / 38 EXAMPLE 3(CONTD.) Paired sample or two independent samples? Is there a difference in mean cholesterol levels between Cases and Historical Control group? Which statistical test should we use? Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 34 / 38
18 F-TEST FOR EQUALITY OF VARIANCES USING EXCEL EXAMPLE 3 Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 35 / 38 TWO-SAMPLE t-test FOR UNEQUAL VARIANCES USING EXCEL EXAMPLE 3 Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 36 / 38
19 ADVANTAGES OF PAIRED SAMPLES Suppose we want to test H 0 : µ 1 = µ 2 vs. H a : µ 1 = µ 2 Test statistic is related to X 1 X 2. The variance is: Var( X 1 X 2 )=Var( X 1 )+Var( X 2 ) 2ρ 12 Var( X 1 ) Var( X 2 ) The positive correlation ρ 12 in paired-samples reduces the variance of the difference, yielding more powerful test than the independent sample design. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 37 / 38 REFERENCES I Rafia Bhore. Lecture notes. Berman, Nancy (2007). Comparison of Means. In Methods in Molecular Biology, Vol 404: Topics in Biostatistics, edited by W. T. Ambrosius. Humana Press Inc., Totowa, NJ, USA. Rosner, Bernard (2000). Fundamentals of Biostatistics, 5th edition. Duxbury Press, California, USA. Min Chen (QBRC/CCBSR) Comparing groups Continuous Data 1 Lec 4 38 / 38
BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings
BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall 2016 1/30 Previous Two types of statistical
More informationComparison of Two Population Means
Comparison of Two Population Means Esra Akdeniz March 15, 2015 Independent versus Dependent (paired) Samples We have independent samples if we perform an experiment in two unrelated populations. We have
More informationEpidemiology Principles of Biostatistics Chapter 10 - Inferences about two populations. John Koval
Epidemiology 9509 Principles of Biostatistics Chapter 10 - Inferences about John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. differences in
More informationCh. 7. One sample hypothesis tests for µ and σ
Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationCBA4 is live in practice mode this week exam mode from Saturday!
Announcements CBA4 is live in practice mode this week exam mode from Saturday! Material covered: Confidence intervals (both cases) 1 sample hypothesis tests (both cases) Hypothesis tests for 2 means as
More informationStatistics in medicine
Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial
More informationChapter 8 - Statistical intervals for a single sample
Chapter 8 - Statistical intervals for a single sample 8-1 Introduction In statistics, no quantity estimated from data is known for certain. All estimated quantities have probability distributions of their
More informationLecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2
Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2 Fall, 2013 Page 1 Random Variable and Probability Distribution Discrete random variable Y : Finite possible values {y
More informationComparing two independent samples
In many applications it is necessary to compare two competing methods (for example, to compare treatment effects of a standard drug and an experimental drug). To compare two methods from statistical point
More informationLinear Regression Model. Badr Missaoui
Linear Regression Model Badr Missaoui Introduction What is this course about? It is a course on applied statistics. It comprises 2 hours lectures each week and 1 hour lab sessions/tutorials. We will focus
More informationChapter 9. Hypothesis testing. 9.1 Introduction
Chapter 9 Hypothesis testing 9.1 Introduction Confidence intervals are one of the two most common types of statistical inference. Use them when our goal is to estimate a population parameter. The second
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More informationMathematical statistics
November 15 th, 2018 Lecture 21: The two-sample t-test Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 14 Probability reviews Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationHYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă
HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and
More informationStat 427/527: Advanced Data Analysis I
Stat 427/527: Advanced Data Analysis I Review of Chapters 1-4 Sep, 2017 1 / 18 Concepts you need to know/interpret Numerical summaries: measures of center (mean, median, mode) measures of spread (sample
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More information1 Statistical inference for a population mean
1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known
More informationPubH 5450 Biostatistics I Prof. Carlin. Lecture 13
PubH 5450 Biostatistics I Prof. Carlin Lecture 13 Outline Outline Sample Size Counts, Rates and Proportions Part I Sample Size Type I Error and Power Type I error rate: probability of rejecting the null
More informationInference for Regression
Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More informationBE540W Estimation Page 1 of 72. Topic 6 Estimation
BE540W Estimation Page 1 of 7 Topic 6 Estimation Topics 1. Introduction......... Goals of Estimation.... 3. Some notation and definitions,..... 4. How to Interpret a Confidence Interval..... 5. Normal:
More informationLecture 9 Two-Sample Test. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Lecture 9 Two-Sample Test Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Computer exam 1 18 Histogram 14 Frequency 9 5 0 75 83.33333333
More informationThe t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18
and t-tests Patrick Breheny October 15 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 Introduction The t-test As we discussed previously, W.S. Gossett derived the t-distribution as a way of
More informationINTERVAL ESTIMATION OF THE DIFFERENCE BETWEEN TWO POPULATION PARAMETERS
INTERVAL ESTIMATION OF THE DIFFERENCE BETWEEN TWO POPULATION PARAMETERS Estimating the difference of two means: μ 1 μ Suppose there are two population groups: DLSU SHS Grade 11 Male (Group 1) and Female
More informationInference for Distributions Inference for the Mean of a Population
Inference for Distributions Inference for the Mean of a Population PBS Chapter 7.1 009 W.H Freeman and Company Objectives (PBS Chapter 7.1) Inference for the mean of a population The t distributions The
More information7 Estimation. 7.1 Population and Sample (P.91-92)
7 Estimation MATH1015 Biostatistics Week 7 7.1 Population and Sample (P.91-92) Suppose that we wish to study a particular health problem in Australia, for example, the average serum cholesterol level for
More informationBIOS 6222: Biostatistics II. Outline. Course Presentation. Course Presentation. Review of Basic Concepts. Why Nonparametrics.
BIOS 6222: Biostatistics II Instructors: Qingzhao Yu Don Mercante Cruz Velasco 1 Outline Course Presentation Review of Basic Concepts Why Nonparametrics The sign test 2 Course Presentation Contents Justification
More informationTwo sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.
Two sided, two sample t-tests. I. Brief review: 1) We are interested in how a sample compares to some pre-conceived notion. For example: a) IQ = 100 b) Average height for men = 5 10. c) Average number
More informationTwo sample hypothesis testing
Statistics February 26, 2014 Debdeep Pati Two sample hypothesis testing 1. Suppose we want to study the relationship between use of oral contraceptives (OC) and level of blood pressure (BP) in women. 2.
More informationInferences About the Difference Between Two Means
7 Inferences About the Difference Between Two Means Chapter Outline 7.1 New Concepts 7.1.1 Independent Versus Dependent Samples 7.1. Hypotheses 7. Inferences About Two Independent Means 7..1 Independent
More informationPaired comparisons. We assume that
To compare to methods, A and B, one can collect a sample of n pairs of observations. Pair i provides two measurements, Y Ai and Y Bi, one for each method: If we want to compare a reaction of patients to
More informationAnalysis of repeated measurements (KLMED8008)
Analysis of repeated measurements (KLMED8008) Eirik Skogvoll, MD PhD Professor and Consultant Institute of Circulation and Medical Imaging Dept. of Anaesthesiology and Intensive Care 1 Repeated measurements
More informationChapter 7 Comparison of two independent samples
Chapter 7 Comparison of two independent samples 7.1 Introduction Population 1 µ σ 1 1 N 1 Sample 1 y s 1 1 n 1 Population µ σ N Sample y s n 1, : population means 1, : population standard deviations N
More informationChapter 10: Inferences based on two samples
November 16 th, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence
More informationChapter 7: Statistical Inference (Two Samples)
Chapter 7: Statistical Inference (Two Samples) Shiwen Shen University of South Carolina 2016 Fall Section 003 1 / 41 Motivation of Inference on Two Samples Until now we have been mainly interested in a
More informationGEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs
STATISTICS 4 Summary Notes. Geometric and Exponential Distributions GEOMETRIC -discrete A discrete random variable R counts number of times needed before an event occurs P(X = x) = ( p) x p x =,, 3,...
More informationAnalysis of Variance. Contents. 1 Analysis of Variance. 1.1 Review. Anthony Tanbakuchi Department of Mathematics Pima Community College
Introductory Statistics Lectures Analysis of Variance 1-Way ANOVA: Many sample test of means Department of Mathematics Pima Community College Redistribution of this material is prohibited without written
More information* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.
Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course
More informationECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests
ECON4150 - Introductory Econometrics Lecture 5: OLS with One Regressor: Hypothesis Tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 5 Lecture outline 2 Testing Hypotheses about one
More informationThis does not cover everything on the final. Look at the posted practice problems for other topics.
Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry
More informationGROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION
FOR SAMPLE OF RAW DATA (E.G. 4, 1, 7, 5, 11, 6, 9, 7, 11, 5, 4, 7) BE ABLE TO COMPUTE MEAN G / STANDARD DEVIATION MEDIAN AND QUARTILES Σ ( Σ) / 1 GROUPED DATA E.G. AGE FREQ. 0-9 53 10-19 4...... 80-89
More informationMAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics
MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationTA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM
STAT 301, Fall 2011 Name Lec 4: Ismor Fischer Discussion Section: Please circle one! TA: Sheng Zhgang... 341 (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan... 345 (W 1:20) / 346 (Th
More informationChapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups
Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups Sections 10-1 & 10-2 Independent Groups It is common to compare two groups, and do a hypothesis
More informationWelcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT)
. Welcome! Webinar Biostatistics: sample size & power Thursday, April 26, 12:30 1:30 pm (NDT) Get started now: Please check if your speakers are working and mute your audio. Please use the chat box to
More informationDover- Sherborn High School Mathematics Curriculum Probability and Statistics
Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and
More informationStudy Ch. 10.3, 67 70all, (no CI), 81, 83
GOALS: 1. Compare 2 sample means when the population standard deviations are not equal. 2. Use the distribution of the difference between the means to evaluate the samples. 3. Arrive at a conclusion: are
More informationSample Size Estimation for Studies of High-Dimensional Data
Sample Size Estimation for Studies of High-Dimensional Data James J. Chen, Ph.D. National Center for Toxicological Research Food and Drug Administration June 3, 2009 China Medical University Taichung,
More informationIndependent Samples t tests. Background for Independent Samples t test
Independent Samples t tests Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Background for Independent Samples t
More informationCorrelation and Simple Linear Regression
Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your
More informationStat 6640 Solution to Midterm #2
Stat 6640 Solution to Midterm #2 1. A study was conducted to examine how three statistical software packages used in a statistical course affect the statistical competence a student achieves. At the end
More informationStatistics in medicine
Statistics in medicine Lecture 1- part 1: Describing variation, and graphical presentation Outline Sources of variation Types of variables Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease
More informationPubHlth 540 Estimation Page 1 of 69. Unit 6 Estimation
PubHlth 540 Estimation Page 1 of 69 Unit 6 Estimation Topic 1. Introduction........ a. Goals of Estimation. b. Notation and Definitions. c. How to Interpret a Confidence Interval. Preliminaries: Some Useful
More informationUnit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users
BIOSTATS 640 Spring 2017 Review of Introductory Biostatistics STATA solutions Page 1 of 16 Unit 1 Review of BIOSTATS 540 Practice Problems SOLUTIONS - Stata Users #1. The following table lists length of
More informationStatistics: revision
NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationTHE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 004 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER II STATISTICAL METHODS The Society provides these solutions to assist candidates preparing for the examinations in future
More informationConfidence Intervals, Testing and ANOVA Summary
Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0
More informationANOVA: Comparing More Than Two Means
ANOVA: Comparing More Than Two Means Chapter 11 Cathy Poliak, Ph.D. cathy@math.uh.edu Office Fleming 11c Department of Mathematics University of Houston Lecture 25-3339 Cathy Poliak, Ph.D. cathy@math.uh.edu
More informationStatistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017
Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 I. χ 2 or chi-square test Objectives: Compare how close an experimentally derived value agrees with an expected value. One method to
More informationSample size re-estimation in clinical trials. Dealing with those unknowns. Chris Jennison. University of Kyoto, January 2018
Sample Size Re-estimation in Clinical Trials: Dealing with those unknowns Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj University of Kyoto,
More informationConfidence Intervals with σ unknown
STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence
More informationThe Components of a Statistical Hypothesis Testing Problem
Statistical Inference: Recall from chapter 5 that statistical inference is the use of a subset of a population (the sample) to draw conclusions about the entire population. In chapter 5 we studied one
More informationStat 704 Data Analysis I Probability Review
1 / 39 Stat 704 Data Analysis I Probability Review Dr. Yen-Yi Ho Department of Statistics, University of South Carolina A.3 Random Variables 2 / 39 def n: A random variable is defined as a function that
More informationComparison of two samples
Comparison of two samples Pierre Legendre, Université de Montréal August 009 - Introduction This lecture will describe how to compare two groups of observations (samples) to determine if they may possibly
More information1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College
1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative
More informationStatistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018
Statistical inference (estimation, hypothesis tests, confidence intervals) Oct 2018 Sampling A trait is measured on each member of a population. f(y) = propn of individuals in the popn with measurement
More informationLECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING
LECTURE 1 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING INTERVAL ESTIMATION Point estimation of : The inference is a guess of a single value as the value of. No accuracy associated with it. Interval estimation
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationStat 5102 Final Exam May 14, 2015
Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions
More informationBios 6649: Clinical Trials - Statistical Design and Monitoring
Bios 6649: Clinical Trials - Statistical Design and Monitoring Spring Semester 2015 John M. Kittelson Department of Biostatistics & Informatics Colorado School of Public Health University of Colorado Denver
More information+ Specify 1 tail / 2 tail
Week 2: Null hypothesis Aeroplane seat designer wonders how wide to make the plane seats. He assumes population average hip size μ = 43.2cm Sample size n = 50 Question : Is the assumption μ = 43.2cm reasonable?
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More informationChapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides
Chapter 7 Inference for Distributions Introduction to the Practice of STATISTICS SEVENTH EDITION Moore / McCabe / Craig Lecture Presentation Slides Chapter 7 Inference for Distributions 7.1 Inference for
More informationContent by Week Week of October 14 27
Content by Week Week of October 14 27 Learning objectives By the end of this week, you should be able to: Understand the purpose and interpretation of confidence intervals for the mean, Calculate confidence
More informationLinear models and their mathematical foundations: Simple linear regression
Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction
More informationSampling Distributions of Statistics Corresponds to Chapter 5 of Tamhane and Dunlop
Sampling Distributions of Statistics Corresponds to Chapter 5 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT), with some slides by Jacqueline Telford (Johns Hopkins University) 1 Sampling
More informationAnswer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text)
Answer keys for Assignment 10: Measurement of study variables (The correct answer is underlined in bold text) 1. A quick and easy indicator of dispersion is a. Arithmetic mean b. Variance c. Standard deviation
More informationSTAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)
STAT 515 -- Chapter 9: Two-Sample Problems Paired Differences (Section 9.3) Examples of Paired Differences studies: Similar subjects are paired off and one of two treatments is given to each subject in
More information7.1 Basic Properties of Confidence Intervals
7.1 Basic Properties of Confidence Intervals What s Missing in a Point Just a single estimate What we need: how reliable it is Estimate? No idea how reliable this estimate is some measure of the variability
More informationLecture 12: Small Sample Intervals Based on a Normal Population Distribution
Lecture 12: Small Sample Intervals Based on a Normal Population MSU-STT-351-Sum-17B (P. Vellaisamy: MSU-STT-351-Sum-17B) Probability & Statistics for Engineers 1 / 24 In this lecture, we will discuss (i)
More informationGeneral Linear Model (Chapter 4)
General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients
More informationMVE055/MSG Lecture 8
MVE055/MSG810 2017 Lecture 8 Petter Mostad Chalmers September 23, 2017 The Central Limit Theorem (CLT) Assume X 1,..., X n is a random sample from a distribution with expectation µ and variance σ 2. Then,
More informationRelax and good luck! STP 231 Example EXAM #2. Instructor: Ela Jackiewicz
STP 31 Example EXAM # Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.
More informationNormal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT):
Lecture Three Normal theory null distributions Normal (Gaussian) distribution The normal distribution is often relevant because of the Central Limit Theorem (CLT): A random variable which is a sum of many
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationLecture 5: ANOVA and Correlation
Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions
More informationUnit 14: Nonparametric Statistical Methods
Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based
More informationComparison of Two Samples
2 Comparison of Two Samples 2.1 Introduction Problems of comparing two samples arise frequently in medicine, sociology, agriculture, engineering, and marketing. The data may have been generated by observation
More informationANOVA: Analysis of Variation
ANOVA: Analysis of Variation The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative variables depend on which group (given by categorical
More informationPower and Sample Size Bios 662
Power and Sample Size Bios 662 Michael G. Hudgens, Ph.D. mhudgens@bios.unc.edu http://www.bios.unc.edu/ mhudgens 2008-10-31 14:06 BIOS 662 1 Power and Sample Size Outline Introduction One sample: continuous
More informationT.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS
ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationAs an example, consider the Bond Strength data in Table 2.1, atop page 26 of y1 y 1j/ n , S 1 (y1j y 1) 0.
INSY 7300 6 F01 Reference: Chapter of Montgomery s 8 th Edition Point Estimation As an example, consider the Bond Strength data in Table.1, atop page 6 of By S. Maghsoodloo Montgomery s 8 th edition, on
More information