Lecture 4: Statistical Hypothesis Testing
|
|
- Hugo Damon Paul
- 5 years ago
- Views:
Transcription
1 EAS31136/B9036: Statistics in Earth & Atmospheric Sciences Lecture 4: Statistical Hypothesis Testing Instructor: Prof. Johnny Luo
2 Dates Topic Reading (Based on the 2 nd Edition of Wilks book) Other Activity Aug 31 Introduction; Review of probability Wilks, Chap 2 Pre-test Sep 7 Matlab tutorial (optional) Sep 14 Review of probability; Probability Distribution 1 Wilks, Chap 2, 3 Sep 21 Probability Distribution 2 Wilks, Chap 3, 4 Sep 28 Hypothesis testing Wilks, Chap 5 Oct 5 Linear regression I Wilks Chap 6; von Storch 8-9 Oct 12 Linear regression II Wilks Chap 6; von Storch 8-9 Oct 19 Time series analysis I Wilks 8; von Storch Oct 26 Midterm; discussion of final project Project 1-page abstract due Nov 2 Time series analysis II Wilks 8; von Storch Nov 9 Principal Component Analysis & Empirical orthogonal functions I Wilks 11; von Storch 13 Nov 16 Principal Component Analysis & Empirical orthogonal functions II Wilks 11; von Storch 13 Project progress report due Nov 30 Cluster analysis Wilks 14 Dec 7 Final project presentation
3 Outline 1. How to test hypotheses in statistics? 2. Commonly-used test 1: t test 3. Commonly-used test 2: χ 2 test
4 Definition of Terms Ø Statistic (singular): a single measure of some attribute of a sample (e.g., the mean value of 100 readings of January temperatures) Ø Sampling distribution: probability distribution of a statistic based on a batch of samples. For example, the sample mean of Jan temperature (in Ithaca) has a probability distribution. The true mean is just a single number (which we don t know and can be estimated only through observations). Ø Statistical hypotheses Testing (aka, significance testing): test a hypothesis concerning the sampled statistic versus the modeled distribution (e.g., a parametric distribution).
5 Example 1. Suppose that advertisements for a tourist resort in the sunny desert southwest claim that, on average, six days out of seven are cloudless during winter. To verify this claim, we would need to observe the sky conditions in the area on a number of winter days (sampled statistic), and then compare the fraction observed to be cloudless with the claimed proportion of 6/7= Assume that we could arrange to take observations on 25 independent occasions. If cloudless skies are observed on 15 of those 25 days, is this observation consistent with the claim? How do we make a statistical statement to test to what extent the hypothesis made by the resort is true?
6 Procedure concerning a hypothesis testing Ø Identify a test statistic (e.g., the sample mean) Ø Define a null hypothesis (denoted H 0 ). Oftentimes the null hypothesis will be something we hope to reject. Ø Define an alternative hypothesis (denoted H A ). Oftentimes H A is the complement of H 0 (i.e., H 0 is not true). Ø Obtain the null distribution (i.e., the sample distribution of the test statistic), given that H 0 is true. Ø Compare the observed test statistic to the null distribution. If it falls within an improbable region of the null distribution, we reject H 0.
7 Procedure concerning a hypothesis testing Ø Identify a test statistic (e.g., the sample mean) Ø Define a null hypothesis (denoted H 0 ). Oftentimes the null hypothesis will be something we hope to reject. Ø Define an alternative hypothesis (denoted H A ). Oftentimes H A is the complement of H 0 (i.e., H 0 is not true). Ø Obtain the null distribution (i.e., the sample distribution of the test statistic), given that H 0 is true. Ø Compare the observed test statistic to the null distribution. If it falls within an improbable region of the null distribution, we reject H 0.
8 Example 1. Suppose that advertisements for a tourist resort in the sunny desert southwest claim that, on average, six days out of seven are cloudless during winter (i.e., the claimed cloudless proportion: 6/7 = 0.857). Observations of 25 independent days shows that cloudless skies are observed on 15 of those 25 days. This fits a binomial distribution (it s like flipping a tricky coin 25 times and we get 15 heads). The test statistic = number of cloudless days. H 0 : resort s ads is correct that p = (we want to shoot it down) H A : p < (we hope to prove this) Null distribution: a binomial with N=25 and p=0.857 (as claimed by the ads). Test the probability of the observed statistic (X=15): ( ) ( ) =
9 plot(0:25,binopdf(0:25,25,0.857), o- ) ( ) ( ) = N = 25, p = # of cloudless days
10 Test Level and p Value Ø Test level: in the null distribution (assuming null hypothesis is true), we define a region or a level where probability is sufficiently small. For example, 5% level basically means only 5% or less chance of occurrence for null hypothesis to hold. Ø The p value: the probability of the observed value of the test statistic will occur in the null distribution. Ø The null hypothesis is rejected if p test level.
11 Example 1. Suppose that advertisements for a tourist resort in the sunny desert southwest claim that, on average, six days out of seven are cloudless during winter (i.e., the claimed cloudless proportion: 6/7 = 0.857). Observations of 25 independent days shows that cloudless skies are observed on 15 of those 25 days. H 0 : resort s ads is correct that cloudless probability = H A : p < (we hope to prove this) Null distribution: a binomial with N=25 and p= Test the probability of the observed statistic: ( ) ( ) = This is our P-value
12 Some commonly-used parametric tests Ø One sample t test Ø Two sample t test: test for differences b/w two mean Ø χ-square test
13 One sample t test H 0 : the observed sample mean centers at some specified or assumed value, μ 0. Test statistic: The probability distribution for t is practically the same as that of the standardized Gaussian distribution (so we can use Table B.1), except that the variance in the denominator is n times smaller than the sample variance. where s 2 is the sample variance
14 One sample t test H 0 : the observed sample mean centers at some specified or assumed value, μ 0. Test statistic: where s 2 is the sample variance T-P-S: Why would the sample mean have smaller variance?
15 where s 2 is the sample variance Variance of the sample mean Vs variance of the raw sample data
16 Two sample t test: test for differences b/w two means (assuming independence) H 0 : the difference between the two means is zero Test statistic: where So,
17 Two-sided tests For Gaussian distribution, we usually adopt two-sided tests. For example, the test level of 5% means we check if the test statistic is smaller than the 2.5% of the left tail or greater than 1-2.5%=97.5% of the right tail.
18 Example 2. Test the hypothesis that the average max temperature at Ithaca and Canandaigua are significantly different H 0 : = 0 = 7.71^2/ ^2/31 = 3.91 Sqrt(3.91) = 1.98
19 Example 2. Test the hypothesis that the average max temperature at Ithaca and Canandaigua are significantly different z = ( )/1.98 = H 0 : = 0 = 7.71^2/ ^2/31 = 3.91 Sqrt(3.91) = 1.98
20 This is the p-value: 16.8%. If we set the test level at 5%, this means the p-value didn t pass the test. In other words, H 0 is not rejected and the max temp. at the two locations are NOT significantly different from zero.
21 Two sample t test: test for differences b/w two means for paired samples (assuming some dependence) For paired samples with some dependence (e.g., Ithaca temperature is probably correlated with temperature of Canandaigua), the variance is smaller than when the two samples are independent. So, p-value < 5% or even 1% test level
22 Some commonly-used parametric tests Ø One sample t test Ø Two sample t test: test for differences b/w two mean Ø χ-square test
23 χ 2 test: test goodness-of-fit in parametric distribution H 0 : the observed data fit well with the hypothesized distribution # here is the occurrence frequency of each class. Basically, we chop data into discrete (and MECE) trunks.
24 χ 2 test: test goodness-of-fit in parametric distribution H 0 : the observed data fit well with the hypothesized distribution If χ 2 is large, it means the difference b/w observed data and hypothesized distribution is large not a good fit!
25 The statistic χ 2 follow the χ 2 distribution, which is the sum of the squares of k independent standard normal random variables. PDF If the fit is good, #Observed should be very similar to #Expected, so that χ 2 is small.
26 PDF The statistic χ 2 follow the χ 2 distribution, which is the sum of the squares of k independent standard normal random variables. CDF If the fit is good, #Observed should be very similar to #Expected, so that χ 2 is small.
27 χ 2 test: test goodness-of-fit in parametric distribution H 0 : the observed data fit well with the hypothesized distribution Test of goodness-of-fit using χ 2 distribution is different from the t-test in that we wish to prove the H 0 (i.e., the fit is good).
28 Procedure for χ 2 test Ø Identify a test statistic (χ 2 ) Ø Define a null hypothesis (denoted H 0 ; good fit or small χ 2 ). Here, we wish to accept H 0. Ø Define an alternative hypothesis (denoted H A ): not a good fit or χ 2 is large Ø Obtain the null distribution : the standard χ 2 distribution. Ø Compare the observed test statistic (χ 2 value calculated) to the null distribution (the standard χ 2 distribution).
29
30
31 Χ 2 Distribution Table Degree of freedom ν = for the example here. 6 refers to six classes; 2 is the number of parameters for each distribution.
32 Χ 2 Distribution Table Degree of freedom ν = for the example here. 6 refers to six classes; 2 is the number of parameters for each distribution.
Lecture 5: Linear Regression
EAS31136/B9036: Statistics in Earth & Atmospheric Sciences Lecture 5: Linear Regression Instructor: Prof. Johnny Luo www.sci.ccny.cuny.edu/~luo Dates Topic Reading (Based on the 2 nd Edition of Wilks book)
More informationLecture 1: Review of Probability
EAS31136/B9036: Statistics in Earth & Atmospheric Sciences Lecture 1: Review of Probability Instructor: Prof. Johnny Luo www.sci.ccny.cuny.edu/~luo Dates Topic Reading (Based on the 2 nd Edition of Wilks
More informationLecture 2: Probability Distributions
EAS31136/B9036: Statistics in Earth & Atmospheric Sciences Lecture 2: Probability Distributions Instructor: Prof. Johnny Luo www.sci.ccny.cuny.edu/~luo Dates Topic Reading (Based on the 2 nd Edition of
More informationHypothesis Tests Solutions COR1-GB.1305 Statistics and Data Analysis
Hypothesis Tests Solutions COR1-GB.1305 Statistics and Data Analysis Introduction 1. An analyst claims to have a reliable model for Twitter s quarterly revenues. His model predicted that the most recent
More informationTUTORIAL 8 SOLUTIONS #
TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level
More informationPurposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions
Part 1: Probability Distributions Purposes of Data Analysis True Distributions or Relationships in the Earths System Probability Distribution Normal Distribution Student-t Distribution Chi Square Distribution
More informationStatistics for IT Managers
Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample
More informationStats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16
Stats Review Chapter 14 Revised 8/16 Note: This review is meant to highlight basic concepts from the course. It does not cover all concepts presented by your instructor. Refer back to your notes, unit
More informationCourtesy of Jes Jørgensen
Courtesy of Jes Jørgensen Testing Models 3 May 2016 Science is all about models Use physical mechanisms to predict outcomes Test the outcomes in order to test our understanding of the physics Science is
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More informationLecture 9 Two-Sample Test. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Lecture 9 Two-Sample Test Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Computer exam 1 18 Histogram 14 Frequency 9 5 0 75 83.33333333
More informationPhysics 403. Segev BenZvi. Classical Hypothesis Testing: The Likelihood Ratio Test. Department of Physics and Astronomy University of Rochester
Physics 403 Classical Hypothesis Testing: The Likelihood Ratio Test Segev BenZvi Department of Physics and Astronomy University of Rochester Table of Contents 1 Bayesian Hypothesis Testing Posterior Odds
More informationChapter 7: Hypothesis Testing
Chapter 7: Hypothesis Testing *Mathematical statistics with applications; Elsevier Academic Press, 2009 The elements of a statistical hypothesis 1. The null hypothesis, denoted by H 0, is usually the nullification
More informationexp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1
4 Hypothesis testing 4. Simple hypotheses A computer tries to distinguish between two sources of signals. Both sources emit independent signals with normally distributed intensity, the signals of the first
More informationLab #12: Exam 3 Review Key
Psychological Statistics Practice Lab#1 Dr. M. Plonsky Page 1 of 7 Lab #1: Exam 3 Review Key 1) a. Probability - Refers to the likelihood that an event will occur. Ranges from 0 to 1. b. Sampling Distribution
More informationThe goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.
The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining
More informationProbability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur
Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation
More informationTables Table A Table B Table C Table D Table E 675
BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values
More informationStatistical Preliminaries. Stony Brook University CSE545, Fall 2016
Statistical Preliminaries Stony Brook University CSE545, Fall 2016 Random Variables X: A mapping from Ω to R that describes the question we care about in practice. 2 Random Variables X: A mapping from
More information14.30 Introduction to Statistical Methods in Economics Spring 2009
MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationLing 289 Contingency Table Statistics
Ling 289 Contingency Table Statistics Roger Levy and Christopher Manning This is a summary of the material that we ve covered on contingency tables. Contingency tables: introduction Odds ratios Counting,
More informationSection 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1
Section 9.5 Testing the Difference Between Two Variances Bluman, Chapter 9 1 This the last day the class meets before spring break starts. Please make sure to be present for the test or make appropriate
More informationChapte The McGraw-Hill Companies, Inc. All rights reserved.
er15 Chapte Chi-Square Tests d Chi-Square Tests for -Fit Uniform Goodness- Poisson Goodness- Goodness- ECDF Tests (Optional) Contingency Tables A contingency table is a cross-tabulation of n paired observations
More information41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes
Tests Concerning a Single Sample 41.2 Introduction This Section introduces you to the basic ideas of hypothesis testing in a non-mathematical way by using a problem solving approach to highlight the concepts
More informationVisual interpretation with normal approximation
Visual interpretation with normal approximation H 0 is true: H 1 is true: p =0.06 25 33 Reject H 0 α =0.05 (Type I error rate) Fail to reject H 0 β =0.6468 (Type II error rate) 30 Accept H 1 Visual interpretation
More informationHypothesis Testing One Sample Tests
STATISTICS Lecture no. 13 Department of Econometrics FEM UO Brno office 69a, tel. 973 442029 email:jiri.neubauer@unob.cz 12. 1. 2010 Tests on Mean of a Normal distribution Tests on Variance of a Normal
More informationINTERVAL ESTIMATION AND HYPOTHESES TESTING
INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,
More informationPractical Statistics
Practical Statistics Lecture 1 (Nov. 9): - Correlation - Hypothesis Testing Lecture 2 (Nov. 16): - Error Estimation - Bayesian Analysis - Rejecting Outliers Lecture 3 (Nov. 18) - Monte Carlo Modeling -
More information1; (f) H 0 : = 55 db, H 1 : < 55.
Reference: Chapter 8 of J. L. Devore s 8 th Edition By S. Maghsoodloo TESTING a STATISTICAL HYPOTHESIS A statistical hypothesis is an assumption about the frequency function(s) (i.e., pmf or pdf) of one
More informationStatistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017
Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 I. χ 2 or chi-square test Objectives: Compare how close an experimentally derived value agrees with an expected value. One method to
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationLecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series
Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 9 Inferences from Two Samples 9-1 Overview 9-2 Inferences About Two Proportions 9-3
More informationInference for Proportions, Variance and Standard Deviation
Inference for Proportions, Variance and Standard Deviation Sections 7.10 & 7.6 Cathy Poliak, Ph.D. cathy@math.uh.edu Office Fleming 11c Department of Mathematics University of Houston Lecture 12 Cathy
More informationLecture 10: Generalized likelihood ratio test
Stat 200: Introduction to Statistical Inference Autumn 2018/19 Lecture 10: Generalized likelihood ratio test Lecturer: Art B. Owen October 25 Disclaimer: These notes have not been subjected to the usual
More informationDover- Sherborn High School Mathematics Curriculum Probability and Statistics
Mathematics Curriculum A. DESCRIPTION This is a full year courses designed to introduce students to the basic elements of statistics and probability. Emphasis is placed on understanding terminology and
More informationChapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics
Section 15.1: An Overview of Nonparametric Statistics Understand Difference between Parametric and Nonparametric Statistical Procedures Parametric statistical procedures inferential procedures that rely
More informationPHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1
PHP2510: Principles of Biostatistics & Data Analysis Lecture X: Hypothesis testing PHP 2510 Lec 10: Hypothesis testing 1 In previous lectures we have encountered problems of estimating an unknown population
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationClassroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats
Classroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats Materials Needed: Bags of popcorn, watch with second hand or microwave with digital timer. Instructions: Follow the instructions on the
More informationThe Chi-Square Distributions
MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness
More informationIntroduction to Statistics
MTH4106 Introduction to Statistics Notes 15 Spring 2013 Testing hypotheses about the mean Earlier, we saw how to test hypotheses about a proportion, using properties of the Binomial distribution It is
More informationCBA4 is live in practice mode this week exam mode from Saturday!
Announcements CBA4 is live in practice mode this week exam mode from Saturday! Material covered: Confidence intervals (both cases) 1 sample hypothesis tests (both cases) Hypothesis tests for 2 means as
More informationIV. The Normal Distribution
IV. The Normal Distribution The normal distribution (a.k.a., the Gaussian distribution or bell curve ) is the by far the best known random distribution. It s discovery has had such a far-reaching impact
More informationDiscrete distribution. Fitting probability models to frequency data. Hypotheses for! 2 test. ! 2 Goodness-of-fit test
Discrete distribution Fitting probability models to frequency data A probability distribution describing a discrete numerical random variable For example,! Number of heads from 10 flips of a coin! Number
More informationNull Hypothesis Significance Testing p-values, significance level, power, t-tests
Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2014 January 1, 2017 1 /22 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test
More informationCh. 7. One sample hypothesis tests for µ and σ
Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math
More informationStatistical Data Analysis Stat 3: p-values, parameter estimation
Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,
More information10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing
& z-test Lecture Set 11 We have a coin and are trying to determine if it is biased or unbiased What should we assume? Why? Flip coin n = 100 times E(Heads) = 50 Why? Assume we count 53 Heads... What could
More informationGAMINGRE 8/1/ of 7
FYE 09/30/92 JULY 92 0.00 254,550.00 0.00 0 0 0 0 0 0 0 0 0 254,550.00 0.00 0.00 0.00 0.00 254,550.00 AUG 10,616,710.31 5,299.95 845,656.83 84,565.68 61,084.86 23,480.82 339,734.73 135,893.89 67,946.95
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS
ACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS TABLE OF CONTENTS INTRODUCTORY NOTE NOTES AND PROBLEM SETS Section 1 - Point Estimation 1 Problem Set 1 15 Section 2 - Confidence Intervals and
More informationProbability and Statistics Notes
Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline
More informationTests and Their Power
Tests and Their Power Ling Kiong Doong Department of Mathematics National University of Singapore 1. Introduction In Statistical Inference, the two main areas of study are estimation and testing of hypotheses.
More informationAMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015
AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking
More informationReview of Statistics
Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and
More informationSTA 2101/442 Assignment 2 1
STA 2101/442 Assignment 2 1 These questions are practice for the midterm and final exam, and are not to be handed in. 1. A polling firm plans to ask a random sample of registered voters in Quebec whether
More informationMultivariate Statistical Analysis
Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Addressing ourliers 1 Addressing ourliers 2 Outliers in Multivariate samples (1) For
More information* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.
Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course
More informationLinear Models: Comparing Variables. Stony Brook University CSE545, Fall 2017
Linear Models: Comparing Variables Stony Brook University CSE545, Fall 2017 Statistical Preliminaries Random Variables Random Variables X: A mapping from Ω to ℝ that describes the question we care about
More informationSYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions
SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu
More informationProbability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!
Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive
More informationPartitioning the Parameter Space. Topic 18 Composite Hypotheses
Topic 18 Composite Hypotheses Partitioning the Parameter Space 1 / 10 Outline Partitioning the Parameter Space 2 / 10 Partitioning the Parameter Space Simple hypotheses limit us to a decision between one
More informationRecall the Basics of Hypothesis Testing
Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE
More informationCOURSE INTRODUCTION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception
COURSE INTRODUCTION COMPUTATIONAL MODELING OF VISUAL PERCEPTION 2 The goal of this course is to provide a framework and computational tools for modeling visual inference, motivated by interesting examples
More information(x t. x t +1. TIME SERIES (Chapter 8 of Wilks)
45 TIME SERIES (Chapter 8 of Wilks) In meteorology, the order of a time series matters! We will assume stationarity of the statistics of the time series. If there is non-stationarity (e.g., there is a
More informationHypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.
Hypothesis Tests and Estimation for Population Variances 11-1 Learning Outcomes Outcome 1. Formulate and carry out hypothesis tests for a single population variance. Outcome 2. Develop and interpret confidence
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests 1999 Prentice-Hall, Inc. Chap. 8-1 Chapter Topics Hypothesis Testing Methodology Z Test
More informationElementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing
(Section 8-2) Hypotheses testing is not all that different from confidence intervals, so let s do a quick review of the theory behind the latter. If it s our goal to estimate the mean of a population,
More informationBias Variance Trade-off
Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]
More informationStat 609: Mathematical Statistics I (Fall Semester, 2016) Introduction
Stat 609: Mathematical Statistics I (Fall Semester, 2016) Introduction Course information Instructor Professor Jun Shao TA Mr. Han Chen Office 1235A MSC 1335 MSC Phone 608-262-7938 608-263-5948 Email shao@stat.wisc.edu
More informationChapter 8 of Devore , H 1 :
Chapter 8 of Devore TESTING A STATISTICAL HYPOTHESIS Maghsoodloo A statistical hypothesis is an assumption about the frequency function(s) (i.e., PDF or pdf) of one or more random variables. Stated in
More informationSTAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015
STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis
More informationdetermine whether or not this relationship is.
Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations
More informationhp calculators HP 50g Probability distributions The MTH (MATH) menu Probability distributions
The MTH (MATH) menu Probability distributions Practice solving problems involving probability distributions The MTH (MATH) menu The Math menu is accessed from the WHITE shifted function of the Pkey by
More informationLecture on Null Hypothesis Testing & Temporal Correlation
Lecture on Null Hypothesis Testing & Temporal Correlation CS 590.21 Analysis and Modeling of Brain Networks Department of Computer Science University of Crete Acknowledgement Resources used in the slides
More informationTopic 22 Analysis of Variance
Topic 22 Analysis of Variance Comparing Multiple Populations 1 / 14 Outline Overview One Way Analysis of Variance Sample Means Sums of Squares The F Statistic Confidence Intervals 2 / 14 Overview Two-sample
More information+ Specify 1 tail / 2 tail
Week 2: Null hypothesis Aeroplane seat designer wonders how wide to make the plane seats. He assumes population average hip size μ = 43.2cm Sample size n = 50 Question : Is the assumption μ = 43.2cm reasonable?
More informationIntroduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution
Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis
More informationEcon 325: Introduction to Empirical Economics
Econ 325: Introduction to Empirical Economics Chapter 9 Hypothesis Testing: Single Population Ch. 9-1 9.1 What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population
More informationCHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC
CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests
More informationCH.9 Tests of Hypotheses for a Single Sample
CH.9 Tests of Hypotheses for a Single Sample Hypotheses testing Tests on the mean of a normal distributionvariance known Tests on the mean of a normal distributionvariance unknown Tests on the variance
More informationThe One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)
The One-Way Repeated-Measures ANOVA (For Within-Subjects Designs) Logic of the Repeated-Measures ANOVA The repeated-measures ANOVA extends the analysis of variance to research situations using repeated-measures
More informationReview: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.
1 Review: Let X 1, X,..., X n denote n independent random variables sampled from some distribution might not be normal!) with mean µ) and standard deviation σ). Then X µ σ n In other words, X is approximately
More informationSmoking Habits. Moderate Smokers Heavy Smokers Total. Hypertension No Hypertension Total
Math 3070. Treibergs Final Exam Name: December 7, 00. In an experiment to see how hypertension is related to smoking habits, the following data was taken on individuals. Test the hypothesis that the proportions
More informationThe t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18
and t-tests Patrick Breheny October 15 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18 Introduction The t-test As we discussed previously, W.S. Gossett derived the t-distribution as a way of
More informationLecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1
Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by Mario F. Triola Copyright 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. 9.1-1 Chapter 9 Inferences
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationIntroduction to Statistical Inference
Introduction to Statistical Inference Dr. Fatima Sanchez-Cabo f.sanchezcabo@tugraz.at http://www.genome.tugraz.at Institute for Genomics and Bioinformatics, Graz University of Technology, Austria Introduction
More informationLECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test.
Economics 52 Econometrics Professor N.M. Kiefer LECTURE 1: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING NEYMAN-PEARSON LEMMA: Lesson: Good tests are based on the likelihood ratio. The proof is easy in the
More informationHypothesis testing:power, test statistic CMS:
Hypothesis testing:power, test statistic The more sensitive the test, the better it can discriminate between the null and the alternative hypothesis, quantitatively, maximal power In order to achieve this
More informationChapter 5: HYPOTHESIS TESTING
MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate
More informationHypothesis tests
6.1 6.4 Hypothesis tests Prof. Tesler Math 186 February 26, 2014 Prof. Tesler 6.1 6.4 Hypothesis tests Math 186 / February 26, 2014 1 / 41 6.1 6.2 Intro to hypothesis tests and decision rules Hypothesis
More informationStatistical concepts in climate research
Statistical concepts in climate research (some misuses of statistics in climatology) Slava Kharin Canadian Centre for Climate Modelling and Analysis, Environment Canada, Victoria, British Columbia, Canada
More informationLecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationAdvanced Herd Management Probabilities and distributions
Advanced Herd Management Probabilities and distributions Anders Ringgaard Kristensen Slide 1 Outline Probabilities Conditional probabilities Bayes theorem Distributions Discrete Continuous Distribution
More informationChapter Three. Hypothesis Testing
3.1 Introduction The final phase of analyzing data is to make a decision concerning a set of choices or options. Should I invest in stocks or bonds? Should a new product be marketed? Are my products being
More informationPolitical Science 236 Hypothesis Testing: Review and Bootstrapping
Political Science 236 Hypothesis Testing: Review and Bootstrapping Rocío Titiunik Fall 2007 1 Hypothesis Testing Definition 1.1 Hypothesis. A hypothesis is a statement about a population parameter The
More informationChapter 3 ANALYSIS OF RESPONSE PROFILES
Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over
More informationINDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.
INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY Lecture -27 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc. Summary of the previous lecture Frequency factors Normal distribution
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor
More information