Lecture 4: Parameter Es/ma/on and Confidence Intervals. GENOME 560, Spring 2015 Doug Fowler, GS
|
|
- Beverley Hudson
- 5 years ago
- Views:
Transcription
1 Lecture 4: Parameter Es/ma/on and Confidence Intervals GENOME 560, Spring 2015 Doug Fowler, GS 1
2 Review: Probability DistribuIons Discrete: Binomial distribuion Hypergeometric distribuion Poisson distribuion 2
3 Review: Probability DistribuIons Discrete: Binomial distribuion Hypergeometric distribuion Poisson distribuion Con/nuous: Uniform distribuion ExponenIal distribuion Gamma distribuion Normal distribuion 3
4 Review: Probability DistribuIons Discrete: Binomial distribuion Hypergeometric distribuion Poisson distribuion Con/nuous: Uniform distribuion ExponenIal distribuion Gamma distribuion Normal distribuion The sums or means of samples drawn from any dist n are normally distributed 4
5 MulIvariate Hypergeometric Dist n The HGD can be generalized to picking a sample of size n where there are exactly (k 1, k 2 k c ) items from each of c classes from a populaion of N items of c classes where there are K i items of of class i pmf: Q c i=1 N n K i k i Example: There are 5 black, 10 white and 15 red balls in an urn. If you draw six without replacement, what is the probability that you pick 2 of each color? =0.08 5
6 Goals Basic concepts of parameter esimaion Confidence intervals Intro to hypothesis tesing 6
7 What Is Parameter? 7
8 What Is Parameter? Variables vs. Parameters According to Bard & Yonathan (1974) * Usually a probabilisic model is designed to explain the relaionships that exist among quaniies which can be measured independently in an experiment; these are the variables of the model. To formulate these relaionships, however, one frequently introduces "constants" which stand for inherent properies of nature. These are the parameters. We oeen denote by θ * Bard, Yonathan (1974). Nonlinear Parameter Estimation. New York 8
9 Which are parameters, variables? Binomial distribuion (coin tossing) X: number of Heads aeer n coin tosses P n k n k { X = k} = p (1 p) k variable parameter θ = p Poisson distribuion X: number of e- mails within a week P variable { X = k} = e k λ λ k! parameter θ = λ 9
10 Parameters Determine DistribuIons When sampling is from a populaion described by a pdf or pmf f(x θ), knowledge of θ yields knowledge of the enire populaion 10
11 Parameters Determine DistribuIons When sampling is from a populaion described by a pdf or pmf f(x θ), knowledge of θ yields knowledge of the enire populaion This is why parameter esimaion is useful: if we are tossing a coin we would like to esimate the parameter p If we are couning the number of s per week, we would like to esimate λ 11
12 Central Dogma of StaIsIcs 12
13 Parameter EsImaIon Es/mator: StaIsIc whose calculated value is used to esimate a parameter, θ Es/mate: A paricular realizaion of an esimator, θ Types of esimates: Point es/mate: single number that can be regarded as the most plausible value of θ Interval es/mate: a range of numbers, called a confidence interval, that informs us about the quality of our esimate 13
14 Simple Example EsImators Suppose we take a sample from a binomial distribuion whose parameters are unknown. We get m successes from n samples. How can we esimate the parameter π (the populaion p)? 14
15 Simple Example EsImators Suppose we take a sample from a binomial distribuion whose parameters are unknown. We get m successes from n samples. How can we esimate the parameter π (the populaion p)? Method 1: We could just use m and n 15
16 Simple Example EsImators Suppose we take a sample from a binomial distribuion whose parameters are unknown. We get m successes from n samples. How can we esimate the parameter π (the populaion p)? Method 1: We could just use m and n Method 2: Alternately, we could just look in the literature for similar experiments. We could ignore our data and set 16
17 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Would our example esimators be consistent? 17
18 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Would our example esimators be consistent? EsImator 1, yes (m/n will approach π, law of large numbers) EsImator 2, no (our data doesn t maner) 18
19 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Unbiased: 19
20 What do we mean by unbiased? A biased esimator diverges systemaically from the true parameter value 20
21 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Unbiased: the expected value of is equal to θ 21
22 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Unbiased: the expected value of is equal to θ Are our example esimators biased? 22
23 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Unbiased: the expected value of is equal to θ Are our example esimators biased? EsImator 1 turns out to be unbiased EsImator 2 is has a bias 23
24 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Unbiased: Precise: 24
25 What do we mean by precise? An imprecise esimator is subject to random variability 25
26 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Unbiased: Precise: the variance of EsImator 1 should be minimal EsImator 2? 26
27 Good EsImators Are: Consistent: as sample size increases, gets closer to θ Unbiased: Precise: the variance of EsImator 1 should be minimal EsImator 2 has zero variance Bias and variance are intertwined, and oeen you will have to chose to minimize one or the other 27
28 EsImators for normally distributed data Since we know that much experimental data is normally distributed, let s start here General methods for esimaing parameters (MLE, Bayesian) will be covered later. 28
29 EsImators for normally distributed data F(x) x 29
30 EsImators for normally distributed data F(x) What two parameters define a normal distribuion? x 30
31 EsImators for normally distributed data F(x) What two parameters define a normal distribuion? mean = μ standard deviaion = σ x 31
32 EsImators for normally distributed data F(x) μ σ What two parameters define a normal distribuion? mean = μ standard deviaion = σ x 32
33 EsImators for normally distributed data Given a sample from a normally distributed populaion, what esimators would you use for μ,σ? ˆµ = ˆ = 33
34 EsImators for normally distributed data Given a sample from a normally distributed populaion, what esimators would you use for μ,σ? ˆµ = ˆ = 34
35 Confidence intervals: how good is my parameter esimate? 35
36 Back to our fluorescent yeast Let s say we measure the fluorescence of 25 yeast cells and find x = 89.1; s = How good is our esimate ˆµ = 89.1? 36
37 Back to our fluorescent yeast Let s say we measure the fluorescence of 25 yeast cells and find x = 89.1; s = How good is our esimate ˆµ = 89.1? On what will the goodness of the esimate depend? 37
38 Back to our fluorescent yeast Let s say we measure the fluorescence of 25 yeast cells and find x = 89.1; s = How good is our esimate ˆµ = 89.1? On what will the goodness of the esimate depend? Sample size Variability 38
39 A simple staring point What is the probability that the mean fluorescence of all ~2E8 cells in the culture is within 79.1 and 99.1? 39
40 A simple staring point What is the probability that the mean fluorescence of all ~2E8 cells in the culture is within 79.1 and 99.1? P (µ is within 10 of x) 40
41 A simple staring point What is the probability that the mean fluorescence of all ~2E8 cells in the culture is within 79.1 and 99.1? P (µ is within 10 of x) Recall that x is a RV with its own sampling distribuion ˆ = µ x The sampling distribution of the sample mean is: Normal (by central limit theorem) Has µ x = µ = x Has x = p = p s n n 41
42 Standard Error of the Mean SEM is the standard deviaion of the sampling distribuion of the mean Oeen confused with standard deviaion of a sample in the literature. The standard deviaion is a descripive staisic, but SEM describes the spread of the sampling distribuion. SD of the sample is the degree to which individuals within a sample differ from the sample mean SEM is an esimate of how far the sample mean is likely to be from the populaion mean 42
43 Standard Error of the Mean are independent obs from a pop. with mean μ and stdev σ 43
44 Standard Error of the Mean are independent obs from a pop. with mean μ and stdev σ 44
45 Standard Error of the Mean are independent obs from a pop. with mean μ and stdev σ Is a property of RV 45
46 Standard Error of the Mean are independent obs from a pop. with mean μ and stdev σ Is a property of RV 46
47 Standard Error of the Mean are independent obs from a pop. with mean μ and stdev σ Is a property of RV 47
48 Standard Error of the Mean are independent obs from a pop. with mean μ and stdev σ Is a property of RV 48
49 Standard Error of the Mean are independent obs from a pop. with mean μ and stdev σ Is a property of RV 49
50 A simple staring point What is the probability that the mean fluorescence of all ~2E8 cells in the culture is within 79.1 and 99.1? P (µ is within 10 of x) Recall that x is a RV with a sampling distribuion ˆ = µ x The sampling distribution of the sample mean is: Normal Has µ x = µ = x Has x = p = p s n n We just need to find the area under the sampling distribution of the sample mean corresponding to the mean +/
51 A simple staring point What is the probability that the mean fluorescence of all ~2E8 cells in the culture is within 79.1 and 99.1? P (µ is within 10 of x) Recall that x is a RV with a sampling distribuion ˆ = µ x The sampling distribution of the sample mean is: Normal Has Has µ x = µ = x x = p n = s p n = 24.25/5 =
52 A simple staring point What is the probability that the mean fluorescence of all ~2E8 cells in the culture is within 79.1 and 99.1? P (µ is within 10 of x) Recall that x is a RV with a sampling distribuion ˆ = µ x The sampling distribution of the sample mean is: Normal Has Has µ x = µ = x x = p n = s p n = 24.25/5 =4.85 > pnorm(c(79.1, 99.1), mean = 89.1, sd = 4.85) [1] > [1]
53 GeneralizaIon We want to set a confidence interval such that 95% of samples from the distribuion are within the interval Given that we can esimate the mean and standard deviaion of the sampling distribuion of the sample mean, how do we do this? 53
54 GeneralizaIon We find the number of standard deviaions (z) we must move away from the mean to encompass 95% of the sampling distribuion of the sample mean 5% of total area z µ x 54
55 GeneralizaIon Since the distribuion is symmetric, we can just use the CDF to accomplish this 97.5% of total area z To find z such that CI 95% = µ x ± s p n z We can use the normal cumulative distribution function µ x 55
56 GeneralizaIon Now we can set a 95% CI for our fluorescence data 97.5% of total area µ x z To find z such that CI 95% = µ x ± s p n z We can use the normal cumulative distribution function > min(which(pnorm(seq(-3,3,0.01))>=0.975)) [1] 497 > seq(-3,3,0.01)[497] [1] 1.96 > * 1.96 [1] > * 1.96 [1]
57 A pracical note When sample sizes are greater than ~30, the sampling distribuion of the sample mean is normal and x = p s n is a good esimate When sample sizes are smaller than ~30, is an underesimate x = s p n Thus, in pracice we use the t- distribuion as opposed to the normal distribuion (more on this later) 57
58 InterpretaIon of confidence intervals If you repeatedly sample the same populaion, the CI (which differs for each sample) would contain the true populaion parameter X% of the Ime NOT the probability that this paricular CI from this paricular sample actually contains the populaion parameter NOT that there is an X% probability of a sample mean from a repeat experiment falling within the interval 58
59 IntroducIon to hypothesis tesing 59
60 What is hypothesis tesing? A staisical test examines a set of sample data and, on the basis of an expected distribuion of the data, leads to a decision about whether to accept the hypothesis underlying the expected distribuion or reject that hypothesis and accept an alternaive one. Over the next several weeks, we will come at this basic problem from many angles, but the general idea will remain the same 60
61 What is hypothesis tesing? A staisical test examines a set of sample data and, on the basis of an expected distribuion of the data, leads to a decision about whether to accept the hypothesis underlying the expected distribuion or reject that hypothesis and accept an alternaive one. StaIsIcal tests rely on compuing a test staisic (i.e. a staisic we will compare between samples, etc) 61
62 What is hypothesis tesing? A staisical test examines a set of sample data and, on the basis of an expected distribuion of the data, leads to a decision about whether to accept the hypothesis underlying the expected distribuion or reject that hypothesis and accept an alternaive one. To generate the expected distribuion (oeen called the null distribuion) of the test staisic we can use parametric distribuions or, through randomizaion, generate our own based on the data 62
63 What is hypothesis tesing? A staisical test examines a set of sample data and, on the basis of an expected distribuion of the data, leads to a decision about whether to accept the hypothesis underlying the expected distribuion or reject that hypothesis and accept an alternaive one. Then, we can see what the likelihood of obtaining the test staisic value we calculated from the sample. If it is highly unlikely, we might reject the null hypothesis. 63
64 What is hypothesis tesing? A staisical test examines a set of sample data and, on the basis of an expected distribuion of the data, leads to a decision about whether to accept the hypothesis underlying the expected distribuion or reject that hypothesis and accept an alternaive one. Then, we can see what the likelihood of obtaining the test staisic value we calculated from the sample. If it is highly unlikely, we might reject the null hypothesis
65 Hypothesis TesIng Formally examine two opposing conjectures (hypotheses), H 0 and H 1 These two hypotheses are mutually exclusive and exhausive so that one is true to the exclusion of the other We accumulate evidence collect and analyze sample informaion for the purpose of determining which of the two hypotheses is true and which of the two hypotheses is false 65
66 Example Consider a genome- wide associaion study (GWAS) for T2D and you measure the blood glucose level of the case/control groups The null hypothesis, H 0 : There is no difference between the case/control groups in the mean blood glucose levels H 0 : μ 1 - μ 2 = 0 The alterna/ve hypothesis, H A : The mean blood glucose levels in the case/control groups are different H A : μ 1 - μ
67 The Null and AlternaIve Hypothesis The null hypothesis, H 0 : States the assumpion (numerical to be tested) Begin with the assumpion that the null hypothesis is TRUE Always contains the = sign The alterna/ve hypothesis, H A : Is the opposite of the null hypothesis Challenges the status quo Never contains just the = sign Is generally the hypothesis that is believed to be true by the researcher 67
68 One and Two Sided Tests Hypothesis tests can be one or two sided (tailed) One tailed tests are direcional: H 0 : μ 1 - μ 2 0 H A : μ 1 - μ 2 > 0 Two tailed tests are not direcional: H 0 : μ 1 - μ 2 = 0 H A : μ 1 - μ
69 P- values Calculate a test sta/s/c from the sample data that is relevant to the hypothesis being tested e.g. In our GWAS example, the test staisic can be determined based on μ 1, μ 2 and σ 1, σ 2 computed from the GWAS data Aeer calculaing a test staisic we convert this to a P- value by comparing its value to distribuion of test staisic s under the null hypothesis Null distribuion Test staisic value Area = P- value (probability of obtaining a test staisic value at least as extreme as the one we observed) 69
70 P- values Calculate a test sta/s/c in the sample data that is relevant to the hypothesis being tested e.g. In our GWAS example, the test staisic can be determined based on μ 1, μ 2 and σ 1, σ 2 computed from the GWAS data Aeer calculaing a test staisic we convert this to a P- value by comparing its value to distribuion of test staisic s under the null hypothesis Null distribuion Test staisic value Area = P- value (probability of obtaining a test staisic value at least as extreme as the one we observed) 70
71 When To Reject H 0 Level of significance, α: Specified before an experiment to define rejecion region Rejec/on region: set of all test staisic values for which H 0 will be rejected One sided α = 0.05 Two sided α = 0.05 The test staisic value required to reject is oeen called the criical value 71
72 When To Reject H 0 Level of significance, α: Specified before an experiment to define rejecion region Rejec/on region: set of all test staisic values for which H 0 will be rejected One sided α = 0.05 CriIcal value RejecIon region 72
73 When To Reject H 0 Level of significance, α: Specified before an experiment to define rejecion region Rejec/on region: set of all test staisic values for which H 0 will be rejected One sided α = 0.05 When we obtain a test staisic in the rejecion region we can conclude that either the null is true and a highly improbable event has occurred, or the null is false 73
74 Some NotaIon In general, criical values for an α level test denoted as: One sided test: X α Two sided test: X α/2 where X indicates the distribuion of the test staisic For example, if X ~ N(0,1): One sided test: z α (i.e., z 0.05 = 1.64) Two sided test: z α/2 (i.e., z 0.05/2 = z 0.05/2 = ) 74
75 Errors in Hypothesis TesIng Level of significance, α: Specified before an experiment to define rejecion region Rejec/on region: set of all test staisic values for which H 0 will be rejected One sided α = 0.05 Given α= 0.05, what is the chance we will falsely reject the null hypothesis? 75
76 Errors in Hypothesis TesIng Level of significance, α: Specified before an experiment to define rejecion region Rejec/on region: set of all test staisic values for which H 0 will be rejected One sided α = 0.05 Given α= 0.05, what is the chance we will falsely reject the null hypothesis? 5% of the Ime 76
77 Errors in Hypothesis TesIng Actual Situation Truth Decision Do Not Reject H 0 Reject H 0 H 0 True H 0 False Correct Decision 1- α Incorrect Decision Type I Error α Incorrect Decision Type II Error Β Correct Decision 1- β 77
78 Type I and II Errors Actual Situation Truth Decision Do Not Reject H 0 Reject H 0 H 0 True H 0 False Correct Decision 1- α Incorrect Decision Type I Error α Incorrect Decision Type II Error Β Correct Decision 1- β α = P(Type I Error) β = P(Type II Error) Power = 1 - β 78
79 Parametric and Non- Parametric Tests Parametric Tests: Relies on theoreical distribuions of the test staisic under the null hypothesis and assumpions about the distribuion of the sample data (i.e., normality) Non- Parametric Tests: Referred to as DistribuIon Free as they do not assume that data are drawn from any paricular distribuion 79
80 R Session Goals Confidence interval calculaions User- defined funcions 80
Lecture 7: Hypothesis Testing and ANOVA
Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis
More informationORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing
ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing Robert Vanderbei Fall 2014 Slides last edited on November 24, 2014 http://www.princeton.edu/ rvdb Coin Tossing Example Consider two coins.
More informationMath 494: Mathematical Statistics
Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/
More informationF79SM STATISTICAL METHODS
F79SM STATISTICAL METHODS SUMMARY NOTES 9 Hypothesis testing 9.1 Introduction As before we have a random sample x of size n of a population r.v. X with pdf/pf f(x;θ). The distribution we assign to X is
More informationIntroductory Econometrics. Review of statistics (Part II: Inference)
Introductory Econometrics Review of statistics (Part II: Inference) Jun Ma School of Economics Renmin University of China October 1, 2018 1/16 Null and alternative hypotheses Usually, we have two competing
More informationME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV
Theory of Engineering Experimentation Chapter IV. Decision Making for a Single Sample Chapter IV 1 4 1 Statistical Inference The field of statistical inference consists of those methods used to make decisions
More informationPractice Problems Section Problems
Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,
More informationBasic Concepts of Inference
Basic Concepts of Inference Corresponds to Chapter 6 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT) with some slides by Jacqueline Telford (Johns Hopkins University) and Roy Welsch (MIT).
More informationThe t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary
Patrick Breheny October 13 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction Introduction What s wrong with z-tests? So far we ve (thoroughly!) discussed how to carry out hypothesis
More informationBusiness Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing
Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing Agenda Introduction to Estimation Point estimation Interval estimation Introduction to Hypothesis Testing Concepts en terminology
More informationReview. December 4 th, Review
December 4 th, 2017 Att. Final exam: Course evaluation Friday, 12/14/2018, 10:30am 12:30pm Gore Hall 115 Overview Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 6: Statistics and Sampling Distributions Chapter
More informationIntroduction to Statistical Inference
Introduction to Statistical Inference Dr. Fatima Sanchez-Cabo f.sanchezcabo@tugraz.at http://www.genome.tugraz.at Institute for Genomics and Bioinformatics, Graz University of Technology, Austria Introduction
More informationBias Variance Trade-off
Bias Variance Trade-off The mean squared error of an estimator MSE(ˆθ) = E([ˆθ θ] 2 ) Can be re-expressed MSE(ˆθ) = Var(ˆθ) + (B(ˆθ) 2 ) MSE = VAR + BIAS 2 Proof MSE(ˆθ) = E((ˆθ θ) 2 ) = E(([ˆθ E(ˆθ)]
More informationStatistical Data Analysis Stat 3: p-values, parameter estimation
Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,
More informationMTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL
MTH U481 : SPRING 2009: PRACTICE PROBLEMS FOR FINAL 1). Two urns are provided as follows: urn 1 contains 2 white chips and 4 red chips, while urn 2 contains 5 white chips and 3 red chips. One chip is chosen
More informationNull Hypothesis Significance Testing p-values, significance level, power, t-tests
Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2014 January 1, 2017 1 /22 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test
More informationStatistical Preliminaries. Stony Brook University CSE545, Fall 2016
Statistical Preliminaries Stony Brook University CSE545, Fall 2016 Random Variables X: A mapping from Ω to R that describes the question we care about in practice. 2 Random Variables X: A mapping from
More informationPerformance Evaluation and Comparison
Outline Hong Chang Institute of Computing Technology, Chinese Academy of Sciences Machine Learning Methods (Fall 2012) Outline Outline I 1 Introduction 2 Cross Validation and Resampling 3 Interval Estimation
More informationNull Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017
Null Hypothesis Significance Testing p-values, significance level, power, t-tests 18.05 Spring 2017 Understand this figure f(x H 0 ) x reject H 0 don t reject H 0 reject H 0 x = test statistic f (x H 0
More informationexp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1
4 Hypothesis testing 4. Simple hypotheses A computer tries to distinguish between two sources of signals. Both sources emit independent signals with normally distributed intensity, the signals of the first
More informationWooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics
Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests
Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests 1999 Prentice-Hall, Inc. Chap. 8-1 Chapter Topics Hypothesis Testing Methodology Z Test
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More informationStatistical Inference
Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will
More informationThe Central Limit Theorem
The Central Limit Theorem Patrick Breheny September 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 31 Kerrich s experiment Introduction 10,000 coin flips Expectation and
More informationMAT 271E Probability and Statistics
MAT 71E Probability and Statistics Spring 013 Instructor : Class Meets : Office Hours : Textbook : Supp. Text : İlker Bayram EEB 1103 ibayram@itu.edu.tr 13.30 1.30, Wednesday EEB 5303 10.00 1.00, Wednesday
More informationCS 543 Page 1 John E. Boon, Jr.
CS 543 Machine Learning Spring 2010 Lecture 05 Evaluating Hypotheses I. Overview A. Given observed accuracy of a hypothesis over a limited sample of data, how well does this estimate its accuracy over
More informationz and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests
z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests Chapters 3.5.1 3.5.2, 3.3.2 Prof. Tesler Math 283 Fall 2018 Prof. Tesler z and t tests for mean Math
More informationPsychology 282 Lecture #4 Outline Inferences in SLR
Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations
More informationRandom variables. DS GA 1002 Probability and Statistics for Data Science.
Random variables DS GA 1002 Probability and Statistics for Data Science http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall17 Carlos Fernandez-Granda Motivation Random variables model numerical quantities
More informationTest Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics
Test Code: STA/STB (Short Answer Type) 2013 Junior Research Fellowship for Research Course in Statistics The candidates for the research course in Statistics will have to take two shortanswer type tests
More information18.05 Practice Final Exam
No calculators. 18.05 Practice Final Exam Number of problems 16 concept questions, 16 problems. Simplifying expressions Unless asked to explicitly, you don t need to simplify complicated expressions. For
More informationLecture 10: Probability distributions TUESDAY, FEBRUARY 19, 2019
Lecture 10: Probability distributions DANIEL WELLER TUESDAY, FEBRUARY 19, 2019 Agenda What is probability? (again) Describing probabilities (distributions) Understanding probabilities (expectation) Partial
More informationChapter 5: HYPOTHESIS TESTING
MATH411: Applied Statistics Dr. YU, Chi Wai Chapter 5: HYPOTHESIS TESTING 1 WHAT IS HYPOTHESIS TESTING? As its name indicates, it is about a test of hypothesis. To be more precise, we would first translate
More informationFrequentist Statistics and Hypothesis Testing Spring
Frequentist Statistics and Hypothesis Testing 18.05 Spring 2018 http://xkcd.com/539/ Agenda Introduction to the frequentist way of life. What is a statistic? NHST ingredients; rejection regions Simple
More informationProbability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!
Probability theory and inference statistics Dr. Paola Grosso SNE research group p.grosso@uva.nl paola.grosso@os3.nl (preferred) Roadmap Lecture 1: Monday Sep. 22nd Collecting data Presenting data Descriptive
More informationParameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn
Parameter estimation and forecasting Cristiano Porciani AIfA, Uni-Bonn Questions? C. Porciani Estimation & forecasting 2 Temperature fluctuations Variance at multipole l (angle ~180o/l) C. Porciani Estimation
More informationChapter Three. Hypothesis Testing
3.1 Introduction The final phase of analyzing data is to make a decision concerning a set of choices or options. Should I invest in stocks or bonds? Should a new product be marketed? Are my products being
More information18.05 Final Exam. Good luck! Name. No calculators. Number of problems 16 concept questions, 16 problems, 21 pages
Name No calculators. 18.05 Final Exam Number of problems 16 concept questions, 16 problems, 21 pages Extra paper If you need more space we will provide some blank paper. Indicate clearly that your solution
More informationSYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions
SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu
More informationLecture 4: Random Variables and Distributions
Lecture 4: Random Variables and Distributions Goals Random Variables Overview of discrete and continuous distributions important in genetics/genomics Working with distributions in R Random Variables A
More informationSTAT 135 Lab 5 Bootstrapping and Hypothesis Testing
STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,
More informationWhy I think Bayesian Analysis is the greatest thing since NHST (bread)
Why I think Bayesian Analysis is the greatest thing since NHST (bread) Data Binge Nate Powell May 12, 2017 NHST = Null Hypothesis Significance TesIng Textbook Doing Bayesian Data Analysis by John K. Kruschke
More informationHypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006
Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)
More informationStatistical inference
Statistical inference Contents 1. Main definitions 2. Estimation 3. Testing L. Trapani MSc Induction - Statistical inference 1 1 Introduction: definition and preliminary theory In this chapter, we shall
More informationPreliminary Statistics Lecture 5: Hypothesis Testing (Outline)
1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 5: Hypothesis Testing (Outline) Gujarati D. Basic Econometrics, Appendix A.8 Barrow M. Statistics
More informationIntroductory Econometrics
Session 4 - Testing hypotheses Roland Sciences Po July 2011 Motivation After estimation, delivering information involves testing hypotheses Did this drug had any effect on the survival rate? Is this drug
More informationHypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3
Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest
More informationStatistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation
Statistics - Lecture One Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Outline 1. Basic ideas about estimation 2. Method of Moments 3. Maximum Likelihood 4. Confidence
More informationIEOR 165 Lecture 13 Maximum Likelihood Estimation
IEOR 165 Lecture 13 Maximum Likelihood Estimation 1 Motivating Problem Suppose we are working for a grocery store, and we have decided to model service time of an individual using the express lane (for
More informationBusiness Statistics. Lecture 10: Course Review
Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,
More information2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008
MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.
More informationCSE 312 Final Review: Section AA
CSE 312 TAs December 8, 2011 General Information General Information Comprehensive Midterm General Information Comprehensive Midterm Heavily weighted toward material after the midterm Pre-Midterm Material
More informationBayesian inference. Fredrik Ronquist and Peter Beerli. October 3, 2007
Bayesian inference Fredrik Ronquist and Peter Beerli October 3, 2007 1 Introduction The last few decades has seen a growing interest in Bayesian inference, an alternative approach to statistical inference.
More informationMA : Introductory Probability
MA 320-001: Introductory Probability David Murrugarra Department of Mathematics, University of Kentucky http://www.math.uky.edu/~dmu228/ma320/ Spring 2017 David Murrugarra (University of Kentucky) MA 320:
More informationMATH 3670 First Midterm February 17, No books or notes. No cellphone or wireless devices. Write clearly and show your work for every answer.
No books or notes. No cellphone or wireless devices. Write clearly and show your work for every answer. Name: Question: 1 2 3 4 Total Points: 30 20 20 40 110 Score: 1. The following numbers x i, i = 1,...,
More information10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing
& z-test Lecture Set 11 We have a coin and are trying to determine if it is biased or unbiased What should we assume? Why? Flip coin n = 100 times E(Heads) = 50 Why? Assume we count 53 Heads... What could
More informationHypothesis testing: theory and methods
Statistical Methods Warsaw School of Economics November 3, 2017 Statistical hypothesis is the name of any conjecture about unknown parameters of a population distribution. The hypothesis should be verifiable
More informationHypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes
Neyman-Pearson paradigm. Suppose that a researcher is interested in whether the new drug works. The process of determining whether the outcome of the experiment points to yes or no is called hypothesis
More informationDefinition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution.
Hypothesis Testing Definition 3.1 A statistical hypothesis is a statement about the unknown values of the parameters of the population distribution. Suppose the family of population distributions is indexed
More information1; (f) H 0 : = 55 db, H 1 : < 55.
Reference: Chapter 8 of J. L. Devore s 8 th Edition By S. Maghsoodloo TESTING a STATISTICAL HYPOTHESIS A statistical hypothesis is an assumption about the frequency function(s) (i.e., pmf or pdf) of one
More informationLecture 1: August 28
36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 1: August 28 Our broad goal for the first few lectures is to try to understand the behaviour of sums of independent random
More informationLecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece
More informationStat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS
Stat 135, Fall 2006 A. Adhikari HOMEWORK 6 SOLUTIONS 1a. Under the null hypothesis X has the binomial (100,.5) distribution with E(X) = 50 and SE(X) = 5. So P ( X 50 > 10) is (approximately) two tails
More informationProbability and Statistics. Joyeeta Dutta-Moscato June 29, 2015
Probability and Statistics Joyeeta Dutta-Moscato June 29, 2015 Terms and concepts Sample vs population Central tendency: Mean, median, mode Variance, standard deviation Normal distribution Cumulative distribution
More informationStatistical Inference: Estimation and Confidence Intervals Hypothesis Testing
Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire
More informationA.I. in health informatics lecture 2 clinical reasoning & probabilistic inference, I. kevin small & byron wallace
A.I. in health informatics lecture 2 clinical reasoning & probabilistic inference, I kevin small & byron wallace today a review of probability random variables, maximum likelihood, etc. crucial for clinical
More informationStatistics: Learning models from data
DS-GA 1002 Lecture notes 5 October 19, 2015 Statistics: Learning models from data Learning models from data that are assumed to be generated probabilistically from a certain unknown distribution is a crucial
More informationStatistical Inference. Hypothesis Testing
Statistical Inference Hypothesis Testing Previously, we introduced the point and interval estimation of an unknown parameter(s), say µ and σ 2. However, in practice, the problem confronting the scientist
More informationIntroduction. Probability and distributions
Introduction. Probability and distributions Joe Felsenstein Genome 560, Spring 2011 Introduction. Probability and distributions p.1/18 Probabilities We will assume you know that Probabilities of mutually
More informationProbability and Probability Distributions. Dr. Mohammed Alahmed
Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about
More informationChapter 8: An Introduction to Probability and Statistics
Course S3, 200 07 Chapter 8: An Introduction to Probability and Statistics This material is covered in the book: Erwin Kreyszig, Advanced Engineering Mathematics (9th edition) Chapter 24 (not including
More informationProbability Theory and Statistics. Peter Jochumzen
Probability Theory and Statistics Peter Jochumzen April 18, 2016 Contents 1 Probability Theory And Statistics 3 1.1 Experiment, Outcome and Event................................ 3 1.2 Probability............................................
More informationSampling Distributions: Central Limit Theorem
Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)
More informationChapter 7: Hypothesis Testing
Chapter 7: Hypothesis Testing *Mathematical statistics with applications; Elsevier Academic Press, 2009 The elements of a statistical hypothesis 1. The null hypothesis, denoted by H 0, is usually the nullification
More informationA3. Statistical Inference Hypothesis Testing for General Population Parameters
Appendix / A3. Statistical Inference / General Parameters- A3. Statistical Inference Hypothesis Testing for General Population Parameters POPULATION H 0 : θ = θ 0 θ is a generic parameter of interest (e.g.,
More informationGov 2000: 6. Hypothesis Testing
Gov 2000: 6. Hypothesis Testing Matthew Blackwell October 11, 2016 1 / 55 1. Hypothesis Testing Examples 2. Hypothesis Test Nomenclature 3. Conducting Hypothesis Tests 4. p-values 5. Power Analyses 6.
More informationSmoking Habits. Moderate Smokers Heavy Smokers Total. Hypertension No Hypertension Total
Math 3070. Treibergs Final Exam Name: December 7, 00. In an experiment to see how hypertension is related to smoking habits, the following data was taken on individuals. Test the hypothesis that the proportions
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More informationRecursive Estimation
Recursive Estimation Raffaello D Andrea Spring 08 Problem Set : Bayes Theorem and Bayesian Tracking Last updated: March, 08 Notes: Notation: Unless otherwise noted, x, y, and z denote random variables,
More informationCIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8
CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval
More informationDesign of Engineering Experiments
Design of Engineering Experiments Hussam Alshraideh Chapter 2: Some Basic Statistical Concepts October 4, 2015 Hussam Alshraideh (JUST) Basic Stats October 4, 2015 1 / 29 Overview 1 Introduction Basic
More informationCS 6140: Machine Learning Spring What We Learned Last Week. Survey 2/26/16. VS. Model
Logis@cs CS 6140: Machine Learning Spring 2016 Instructor: Lu Wang College of Computer and Informa@on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Assignment
More informationMaximum-Likelihood Estimation: Basic Ideas
Sociology 740 John Fox Lecture Notes Maximum-Likelihood Estimation: Basic Ideas Copyright 2014 by John Fox Maximum-Likelihood Estimation: Basic Ideas 1 I The method of maximum likelihood provides estimators
More informationCS 6140: Machine Learning Spring 2016
CS 6140: Machine Learning Spring 2016 Instructor: Lu Wang College of Computer and Informa?on Science Northeastern University Webpage: www.ccs.neu.edu/home/luwang Email: luwang@ccs.neu.edu Logis?cs Assignment
More informationBasic Data Analysis. Stephen Turnbull Business Administration and Public Policy Lecture 5: May 9, Abstract
Basic Data Analysis Stephen Turnbull Business Administration and Public Policy Lecture 5: May 9, 2013 Abstract We discuss random variables and probability distributions. We introduce statistical inference,
More informationFirst we look at some terms to be used in this section.
8 Hypothesis Testing 8.1 Introduction MATH1015 Biostatistics Week 8 In Chapter 7, we ve studied the estimation of parameters, point or interval estimates. The construction of CI relies on the sampling
More informationMATH 3510: PROBABILITY AND STATS July 1, 2011 FINAL EXAM
MATH 3510: PROBABILITY AND STATS July 1, 2011 FINAL EXAM YOUR NAME: KEY: Answers in blue Show all your work. Answers out of the blue and without any supporting work may receive no credit even if they are
More informationMath 494: Mathematical Statistics
Math 494: Mathematical Statistics Instructor: Jimin Ding jmding@wustl.edu Department of Mathematics Washington University in St. Louis Class materials are available on course website (www.math.wustl.edu/
More informationStat 535 C - Statistical Computing & Monte Carlo Methods. Arnaud Doucet.
Stat 535 C - Statistical Computing & Monte Carlo Methods Arnaud Doucet Email: arnaud@cs.ubc.ca 1 CS students: don t forget to re-register in CS-535D. Even if you just audit this course, please do register.
More informationProbability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008
Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008 1 Review We saw some basic metrics that helped us characterize
More informationELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables
Department of Electrical Engineering University of Arkansas ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables Dr. Jingxian Wu wuj@uark.edu OUTLINE 2 Random Variable Discrete Random
More informationLecture Testing Hypotheses: The Neyman-Pearson Paradigm
Math 408 - Mathematical Statistics Lecture 29-30. Testing Hypotheses: The Neyman-Pearson Paradigm April 12-15, 2013 Konstantin Zuev (USC) Math 408, Lecture 29-30 April 12-15, 2013 1 / 12 Agenda Example:
More informationQuantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing
Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu October
More informationMath Review Sheet, Fall 2008
1 Descriptive Statistics Math 3070-5 Review Sheet, Fall 2008 First we need to know about the relationship among Population Samples Objects The distribution of the population can be given in one of the
More informationQuantitative Methods for Economics, Finance and Management (A86050 F86050)
Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge
More informationExam 2 Practice Questions, 18.05, Spring 2014
Exam 2 Practice Questions, 18.05, Spring 2014 Note: This is a set of practice problems for exam 2. The actual exam will be much shorter. Within each section we ve arranged the problems roughly in order
More informationLecture 18: Bayesian Inference
Lecture 18: Bayesian Inference Hyang-Won Lee Dept. of Internet & Multimedia Eng. Konkuk University Lecture 18 Probability and Statistics, Spring 2014 1 / 10 Bayesian Statistical Inference Statiscal inference
More informationTUTORIAL 8 SOLUTIONS #
TUTORIAL 8 SOLUTIONS #9.11.21 Suppose that a single observation X is taken from a uniform density on [0,θ], and consider testing H 0 : θ = 1 versus H 1 : θ =2. (a) Find a test that has significance level
More information2.3 Analysis of Categorical Data
90 CHAPTER 2. ESTIMATION AND HYPOTHESIS TESTING 2.3 Analysis of Categorical Data 2.3.1 The Multinomial Probability Distribution A mulinomial random variable is a generalization of the binomial rv. It results
More information