Pearson s meta-analysis revisited

Size: px
Start display at page:

Download "Pearson s meta-analysis revisited"

Transcription

1 Pearson s meta-analysis revisited 1 Pearson s meta-analysis revisited in a microarray context Art B. Owen Department of Statistics Stanford University

2 Pearson s meta-analysis revisited 2 Long story short 1) A microarray analysis needed a meta-analysis that accounts for directionality of effects 2) Pearson (1934) already had the same idea 3) And Birnbaum (1954) showed inadmissibility 4) But Birnbaum misread Pearson 5) The method is admissible & competitive vs Fisher (where we need it) 6) and the proof leads to something new that may be better

3 Pearson s meta-analysis revisited 3 Karl Pearson quote Stigler (2008) recounting Karl Pearson s amazing productivity includes this from Stouffer (1958): You Americans would not understand, but I never answer a telephone or attend a committee meeting. Pearson was born in 1857

4 Pearson s meta-analysis revisited 4 Two example problems Work with NIA and Kim lab AGEMAP Zahn et al. PLOS Is gene i correlated with age in tissue j of the mouse? For 8932 genes and 16 tissues We get a matrix of p-values fmri Benjamini & Heller Is brain location i activated in task j? Similar problems

5 Pearson s meta-analysis revisited 5 AGEMAP goals Which genes are age related generically? They should show age relationship in multiple tissues Ideally the sign should be common too Too much to suppose that the slope is exactly the same Two tasks 1) Combine 16 p values into one decision per gene 2) Adjust for having tested 8932 genes Here We look at task 1) understanding that it is for screening For this talk: pretend tests are independent & ignore gene groups

6 Pearson s meta-analysis revisited 6 Given a collection of p-values: We have n null hypotheses H 01,..., H 0n Multiple hypothesis testing We get n p-values p 1,..., p n p i for H 0i Decide which to reject, controlling false discoveries Meta-analysis We have 1 hypothesis H 0 We have m tests and m p-values for H 0 Combine p 1,..., p m into one decision Or combine m underlying test statistics

7 Pearson s meta-analysis revisited 7 An age related gene 1) should have a statistically significant regression slope 2) in multiple tissues (not necessarily all) 3) predominantly of one sign 4) not necessarily a common slope The underlying model Regress expression for gene i and tissue j on age adjusting for sex. Y ijk = β 0ij + β 1ij Age k + β 1ij Sex k + ε ijk There were 40 animals... so 37 degrees of freedom responses (apart from some missing values)

8 Pearson s meta-analysis revisited 8 ( m ) Refer 2 log j=1 p j to χ 2 (2m) Choose 1 tailed or 2 tailed p values Run Fisher vs β j < 0 run again vs β j > 0 use whichever one tailed test is most extreme Fisher s test K. Pearson s test What we get 1) Strong preference for concordant alternatives 2) We don t have to know the direction a priori 3) Still have some power if one test is discordant Pearson gets better power vs concordant alternatives and less power vs discordant.

9 Pearson s meta-analysis revisited 9 Notation for 1 gene Parameters: β 1 β m Estimates: ˆβ1 ˆβm Obs. Values: ˆβobs 1 ˆβobs m Null hypothesis H 0,j : β j = 0 Alternative H L,j : β j < 0 H R,j : β j > 0 H C,j : β j 0 p value Pr( ˆβ j Pr( ˆβ j Pr( ˆβ j ˆβ obs j β j = 0 ) p j ˆβ obs j β j = 0 ) 1 p j ˆβ obs j β j = 0 ) p j = 2 min( p j, 1 p j )

10 Pearson s meta-analysis revisited 10 Hypotheses on β = (β 1,..., β m ) Null H 0 : β = 0 Left orthant H L : β (, 0] m {0} Right orthant H R : β [0, ) m {0} Any H A : β 0 For > 0 In screening, we don t know whether to use H L or H R We prefer β = ±(,,..., ) to most β = (±, ±,..., ± ) But β = (,,...,, ) or (,,...,, 0) is also interesting So we use H A and a test with more power in H L and H R than elsewhere

11 Pearson s meta-analysis revisited 11 Test statistics Fisher s test, 3 ways ( m Q L = 2 log j=1 p j ) ( m ) Q R = 2 log (1 p j ) j=1 ( m ) Q C = 2 log p j j=1 For m = 1 Q U = Q C but not for m > 1 Pearson s test Q U max(q L, Q R ) Mnemonic: U for undirected

12 Pearson s meta-analysis revisited 12 Null distributions Q L, Q R, Q C χ 2 (2m) Via associated random variables, we find Pr ( Q U > x ) = Pr ( Q L > x ) + Pr ( Q R > x ) Pr ( Q L > x & Q R > x ) 2 Pr ( Q L > x ) Pr ( Q L > x ) 2 So Bonferroni is quite sharp for small α α Pr ( Q U χ 2,1 α/2 ) α 2 (2m) α 4 For α =.01, the level is in [ , 0.01]

13 Pearson s meta-analysis revisited 13 Stouffer et al (1949) test statistics Under H 0 Z j = Φ 1 ( p j ) N(0, 1) Reject H 0 for large S S L = 1 m m j=1 S R = 1 m m j=1 S C = 1 m m j=1 Φ 1 (1 p j ) Φ 1 ( p j ) S U = max(s L, S R ) Φ 1 ( p j ) Stouffer test is mostly a straw man Though S U advocated by Whitlock (2005)

14 Pearson s meta-analysis revisited 14 Meta-analysis refresher Key ref: Hedges and Olkin (1985) We have 1 hypothesis H 0 p values p 1,..., p m indep U(0, 1) under H 0 There is no unique best way to combine them (Birnbaum 1954) Condition 1 If H 0 is rejected for any given (p 1,..., p m ) then it will also be rejected for all (p 1,..., p m) such that p j p j for j = 1,..., m. Birnbaum shows that any combination method which satisfies Condition 1 is admissible.

15 Pearson s meta-analysis revisited 15 Meta-analysis geometry min(p 1, p 2 ) max(p 1, p 2 ) Fisher Stouffer x axis is p 1 y axis is p 2 Blue for α = 0.1 rejection region They all satisfy Condition 1 min is due to Tippett 1931 max is due to Wilkinson 1951

16 Pearson s meta-analysis revisited 16 Geometry again min(p 1, p 2 ) max(p 1, p 2 ) Fisher Stouffer Top row coords (p 1, p 2 ) bottom row coords ( p 1, p 2 )

17 Pearson s meta-analysis revisited 17 Undirected tests Fisher Q U Stouffer S U Rejection regions in one tailed ( p 1, p 2 ) coords Thicker rejection region for coordinated alternatives Stouffer allows one p j to veto the others

18 Pearson s meta-analysis revisited 18 A more stringent admissibility Tippet and Wilkinson are optimal at some alternatives hence admissible Some alternatives are far fetched For ˆβ j in exponential families Birnbaum Condition 2: Admissibility convex acceptance region for ( ˆβ 1,..., ˆβ m ) In a world of Gaussian data ˆβ j N (β j, σ 2 /n j ) p j = Φ( n j ˆβj /σ) ˆβ j = Φ 1 ( p j ) σ/ n j regions in p j regions in ˆβ j

19 Pearson s meta-analysis revisited 19 Birnbaum s result Reject for small Q B Get non convex acceptance regions Hence inadmissible test Quite right, but not Pearson s proposal ( m ) Q B = 2 log (1 p j ) j=1 What went wrong χ 2 (2m) Birnbaum 1954 misread Egon Pearson (1938) describing Karl Pearson (1934) Two problems 1) 1 vs 2 tailed p values mixed up 2) the word or misinterpreted

20 Pearson s meta-analysis revisited 20 Acceptance regions Q C Q U Q L Q B x axis is ˆβ 1 & y axis is ˆβ 2 Blue curve = rejection boundary Dot (origin) is in acceptance region for H 0 Admissible = dot in convex region Pearson s Q U region looks convex Of course it is! Intersect Q L and Q R regions

21 Pearson s meta-analysis revisited 21 Theorem 1 For ˆβ 1,..., ˆβ m R m let ( Q U = max 2 log Admissibility of Q U m j=1 Φ( ˆβ j ), 2 log m j=1 ) Φ( ˆβ j ). Then {( ˆβ 1,..., ˆβ m ) Q U < q} is convex so that Pearson s test is admissible in the exponential family context, for Gaussian data. 1) ϕ(t) is log concave Ideas of proof 2) so therefore are Φ(t) and Φ( t) Boyd and Vandenberge 3) log(log concave) is convex 4) sum of convex is convex 5) max of convex is convex these steps apply in other settings too

22 Pearson s meta-analysis revisited 22 Marden (1985) For Z j = Φ 1 ( p j ) Likelihood ratio tests Left, right, and center versions Λ L = Λ R = Λ C = m max(0, Z j ) 2 j=1 m max(0, Z j ) 2 j=1 m j=1 Z 2 j New one Λ U = max(λ L, Λ R ) Admissible, favors concordant alternatives, Bonferroni fairly tight

23 Pearson s meta-analysis revisited 23 Undirected LRT vs Fisher in ( p 1, p 2 ) Λ U Q U Λ U will catch more discordant tests Q U has more power for concordant tests

24 Pearson s meta-analysis revisited 24 More acceptance regions Two Gaussian variables: Und. Likelihood ratio Λ U Und. Fisher Q U Stouffer S U

25 Pearson s meta-analysis revisited 25 Alternatives of interest Most β j either zero or of common sign (β 1,..., β m ) R m Simpler special cases: each β j {0, } > 0

26 Pearson s meta-analysis revisited 26 Power of tests k nonzero {}}{ β = ±(,...,, 0,..., 0) H }{{} A R m ˆβ N (β, Im ) m k zero Power Delta m = 16 k {2, 4, 8, 16} Q U Λ U Λ C = m j=1 ˆβ 2 j

27 Pearson s meta-analysis revisited 27 Scale to k k nonzero {}}{ β = ±( k,..., k, 0,..., 0) H }{{} A R m ˆβ N (β, Im ) m k zero Choose k so j ˆβ 2 j has power 0.8 at α = 0.01 Power Number nonzero Q U Λ U S U S C

28 Pearson s meta-analysis revisited 28 One negative k 1 nonzero {}}{ β = ±( k, k,..., k, 0,..., 0) H }{{} A R m ˆβ N (β, Im ) m k zero Choose k so j ˆβ 2 j has power 0.8 at α = 0.01 Power Number nonzero Q U Λ U S U S C

29 Pearson s meta-analysis revisited 29 Computing the power e.g. Q L = m log ( Φ( p j ) ) j=1 A sum of independent random variables, distns F j under H A Get distribution by convolution (FFT) Monahan (2001) convolves characteristic functions New (?) alternative Get Discrete CDFs F j F j F + j (stochastic inequality) Support on grid {0, η, 2η,..., (N 1)η, + } η > 0 When convolving upper bounds, round overflow up to + When convolving lower bounds, round overflow down to (N 1)η After convolution m j=1 F j L(Q L ) m j=1 F + j We get 100% confidence, finite width

30 Pearson s meta-analysis revisited 30 Recommendations All j same sign = S U = j ˆβ j recommended Most Many j same sign = Q U = max(q L, Q R ) recommended j same sign = Λ U = max(λ L, Λ R ) recommended

31 Pearson s meta-analysis revisited 31 Extensive simulation Fisher-Pearson Q U has better precision-recall than S U or ˆβ2 j for finding truly age related genes in a simulation where we know which ones are related with β = (,...,, 0,..., 0) and resampled residuals No free lunch Increased power for concordant comes with decreased power for discordant If we wanted to We could design a test that preferred discordant results or concordant within subgroups

32 Pearson s meta-analysis revisited 32 Some results, for 9 tissues Pool via QC at level Num. of neg coef at 0.05 Num. of pos coef at Pool via QU at level Num. of neg coef at 0.05 Num. of pos coef at 0.05 Left shows genes found via Q C right via Q U each circle is one gene (Expect genes by chance) x axis is # tissues with p j < y axis is # tissues with p j > Q U pulls up more unanimous genes (269 vs 216), fewer split decisions, fewer total

33 Pearson s meta-analysis revisited 33 1) Pick a prior on β A more principled approach 2) Quantify the relative value of split decisions vs unanimous findings 3) Find a test to optimize expected value of discoveries Steps 1 and 2 look harder than 3

34 Pearson s meta-analysis revisited 34 Simes test regions p = min 1 j m m j p (j) U(0, 1) Under H 0 p = min(2p (1), p (2) ) for m = 2 C L T x axis is ˆβ 1 y axis is ˆβ 2 95% regions

35 Pearson s meta-analysis revisited 35 Partial conjunction hypotheses Benjamini and Heller (2007) Alt. is only interesting if r or more of β j 0 Null and alternative H 0r : m 1 βj 0 < r H Cr : j=1 m 1 βj 0 r j=1 NB: the null is composite for r > 1, e.g {0} and the axes when r = 2 Ignore the most significant r 1 p values combine the rest Test statistics

36 Pearson s meta-analysis revisited 36 Partial conjunction test statistics p (1) p (2) p (m) indep of p (1) p (2) p (m) Fisher style ( m 2 log j=r p (j) ) ( m 2 log j=r p (r) ) (m r+1 ) 2 log (1 p (r) ) j=1

37 Pearson s meta-analysis revisited 37 Partial conjunction test statistics p (1) p (2) p (m) indep of p (1) p (2) p (m) Fisher style ( m 2 log j=r p (j) ) ( m 2 log j=r p (r) ) Stouffer style (m r+1 ) 2 log (1 p (r) ) j=1 m Φ 1 (p (j) ) m Φ 1 ( p (j) ) m r+1 Φ 1 (1 p (j) ) j=r j=r j=1

38 Pearson s meta-analysis revisited 38 Partial conjunction test statistics p (1) p (2) p (m) indep of p (1) p (2) p (m) Fisher style ( m 2 log j=r p (j) ) ( m 2 log j=r p (r) ) Stouffer style (m r+1 ) 2 log (1 p (r) ) j=1 m Φ 1 (p (j) ) m Φ 1 ( p (j) ) m r+1 Φ 1 (1 p (j) ) j=r j=r j=1 Simes style min r j m m r + 1 j r + 1 p (j) min r j m m r + 1 j r + 1 p (j) min r j m m r + 1 j r + 1 (1 p (m j+1)) worth considering LRT and undirected versions

39 Pearson s meta-analysis revisited 39 Partial conjunction regions C L U For m = 2 and r = 2 need both significant Simes/Fisher/Stouffer collapse into one p (r) p (m) is just p (2) { } (β 1, β 0 ) β 1 = 0 or β 2 = 0 Null is

40 Pearson s meta-analysis revisited 40 Next steps Partial conjunction tests have nonconvex acceptance regions So they re not suited to a point null They were not motivated by that null either So how to pick good tests for this setting? Or rule out bad ones?

41 Pearson s meta-analysis revisited 41 Acknowledgments Stuart Kim and Jacob Zahn for many discussions about testing Ingram Olkin and John Marden for comments on meta-analysis NSF for support Nancy Zhang, Ed George, Adam Greenberg

42 Pearson s meta-analysis revisited 42 Quotes Given time, here s the history of the mixup. More details in paper Karl Pearson s Meta-Analysis Revisited Annals of Statistics, (2009)

43 Pearson s meta-analysis revisited 43 Birnbaum (1954) p 562 Quote Karl Pearson s method: reject H 0 if and only if (1 u 1 )(1 u 2 ) (1 u k ) c, where c is a predetermined constant corresponding to the desired significance level. In applications, c can be computed by a direct adaptation of the method used to calculate the c used in Fisher s method. Upshot In our notation (1 u 1 )(1 u 2 ) (1 u k ) is m j=1 (1 p j). It is clear from his Figure 4 that it does not mean m j=1 (1 p j). Birnbaum does not cite any of Karl Pearson s papers. Instead he cites Egon Pearson

44 Pearson s meta-analysis revisited 44 E. Pearson (1938) p 136 Quote Following what may be described as the intuitional line of approach, K. Pearson (1933) suggested as suitable test criterion one or other of the products Q 1 = y 1 y 2 y n, or Q 1 = (1 y 1 )(1 y 2 ) (1 y n ). Upshot In our notation Q 1 = m j=1 p j and Q 1 = m j=1 (1 p j). E. Pearson cites K. Pearson s 1933 paper, although it appears that he should have cited the 1934 paper instead, because the former has only Q 1, while the latter has Q 1 and Q 1. or or or K. Pearson s or meant try them both and take the more extreme. A. Birnbaum s or meant try either of them one at a time. He also used two-tailed p j where Pearson had one-tailed p j.

45 Pearson s meta-analysis revisited 45 Hedges & Olkin (1985) Several other functions for combining p-values have been proposed. In 1933 Karl Pearson suggested combining p-values via the product (1 p 1 )(1 p 2 ) (1 p k ). Other functions of the statistics p i = Min{p i, 1 p i }, i = 1,..., k, were suggested by David(1934) for the combination of two-sided test statistic, which treat large and small values of the p i symmetrically. Neither of these procedures has a convex acceptance region, so these procedures are not admissible for combining test statistics from the one-parameter exponential family. Upshot The complaint vs Q U may be stuck in the literature for a while. Birnbaum points out that finding something inadmissible does not mean it will be easy to find the thing that beats it.

By Art B. Owen 1 Stanford University

By Art B. Owen 1 Stanford University The Annals of Statistics 2009, Vol. 37, No. 6B, 3867 3892 DOI: 10.1214/09-AOS697 c Institute of Mathematical Statistics, 2009 KARL PEARSON S META-ANALYSIS REVISITED arxiv:0911.3531v1 [math.st] 18 Nov 2009

More information

Adaptive Filtering Multiple Testing Procedures for Partial Conjunction Hypotheses

Adaptive Filtering Multiple Testing Procedures for Partial Conjunction Hypotheses Adaptive Filtering Multiple Testing Procedures for Partial Conjunction Hypotheses arxiv:1610.03330v1 [stat.me] 11 Oct 2016 Jingshu Wang, Chiara Sabatti, Art B. Owen Department of Statistics, Stanford University

More information

Topic 3: Hypothesis Testing

Topic 3: Hypothesis Testing CS 8850: Advanced Machine Learning Fall 07 Topic 3: Hypothesis Testing Instructor: Daniel L. Pimentel-Alarcón c Copyright 07 3. Introduction One of the simplest inference problems is that of deciding between

More information

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between 7.2 One-Sample Correlation ( = a) Introduction Correlation analysis measures the strength and direction of association between variables. In this chapter we will test whether the population correlation

More information

Parameter Estimation, Sampling Distributions & Hypothesis Testing

Parameter Estimation, Sampling Distributions & Hypothesis Testing Parameter Estimation, Sampling Distributions & Hypothesis Testing Parameter Estimation & Hypothesis Testing In doing research, we are usually interested in some feature of a population distribution (which

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

Chapter 2. Binary and M-ary Hypothesis Testing 2.1 Introduction (Levy 2.1)

Chapter 2. Binary and M-ary Hypothesis Testing 2.1 Introduction (Levy 2.1) Chapter 2. Binary and M-ary Hypothesis Testing 2.1 Introduction (Levy 2.1) Detection problems can usually be casted as binary or M-ary hypothesis testing problems. Applications: This chapter: Simple hypothesis

More information

Statistical Applications in Genetics and Molecular Biology

Statistical Applications in Genetics and Molecular Biology Statistical Applications in Genetics and Molecular Biology Volume 5, Issue 1 2006 Article 28 A Two-Step Multiple Comparison Procedure for a Large Number of Tests and Multiple Treatments Hongmei Jiang Rebecca

More information

Journal Club: Higher Criticism

Journal Club: Higher Criticism Journal Club: Higher Criticism David Donoho (2002): Higher Criticism for Heterogeneous Mixtures, Technical Report No. 2002-12, Dept. of Statistics, Stanford University. Introduction John Tukey (1976):

More information

Hypothesis testing (cont d)

Hypothesis testing (cont d) Hypothesis testing (cont d) Ulrich Heintz Brown University 4/12/2016 Ulrich Heintz - PHYS 1560 Lecture 11 1 Hypothesis testing Is our hypothesis about the fundamental physics correct? We will not be able

More information

Lecture 10: Generalized likelihood ratio test

Lecture 10: Generalized likelihood ratio test Stat 200: Introduction to Statistical Inference Autumn 2018/19 Lecture 10: Generalized likelihood ratio test Lecturer: Art B. Owen October 25 Disclaimer: These notes have not been subjected to the usual

More information

Analysis of Variance

Analysis of Variance Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Data Mining. CS57300 Purdue University. March 22, 2018

Data Mining. CS57300 Purdue University. March 22, 2018 Data Mining CS57300 Purdue University March 22, 2018 1 Hypothesis Testing Select 50% users to see headline A Unlimited Clean Energy: Cold Fusion has Arrived Select 50% users to see headline B Wedding War

More information

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006

Hypothesis Testing. Part I. James J. Heckman University of Chicago. Econ 312 This draft, April 20, 2006 Hypothesis Testing Part I James J. Heckman University of Chicago Econ 312 This draft, April 20, 2006 1 1 A Brief Review of Hypothesis Testing and Its Uses values and pure significance tests (R.A. Fisher)

More information

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B Simple Linear Regression 35 Problems 1 Consider a set of data (x i, y i ), i =1, 2,,n, and the following two regression models: y i = β 0 + β 1 x i + ε, (i =1, 2,,n), Model A y i = γ 0 + γ 1 x i + γ 2

More information

Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. T=number of type 2 errors

Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. Table of Outcomes. T=number of type 2 errors The Multiple Testing Problem Multiple Testing Methods for the Analysis of Microarray Data 3/9/2009 Copyright 2009 Dan Nettleton Suppose one test of interest has been conducted for each of m genes in a

More information

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing

Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing Statistical Inference: Estimation and Confidence Intervals Hypothesis Testing 1 In most statistics problems, we assume that the data have been generated from some unknown probability distribution. We desire

More information

Chapter 7: Hypothesis testing

Chapter 7: Hypothesis testing Chapter 7: Hypothesis testing Hypothesis testing is typically done based on the cumulative hazard function. Here we ll use the Nelson-Aalen estimate of the cumulative hazard. The survival function is used

More information

A. Motivation To motivate the analysis of variance framework, we consider the following example.

A. Motivation To motivate the analysis of variance framework, we consider the following example. 9.07 ntroduction to Statistics for Brain and Cognitive Sciences Emery N. Brown Lecture 14: Analysis of Variance. Objectives Understand analysis of variance as a special case of the linear model. Understand

More information

Mathematical Statistics

Mathematical Statistics Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics

More information

Stat 206: Estimation and testing for a mean vector,

Stat 206: Estimation and testing for a mean vector, Stat 206: Estimation and testing for a mean vector, Part II James Johndrow 2016-12-03 Comparing components of the mean vector In the last part, we talked about testing the hypothesis H 0 : µ 1 = µ 2 where

More information

Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing Statistics Journal Club, 36-825 Beau Dabbs and Philipp Burckhardt 9-19-2014 1 Paper

More information

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Quantitative Methods for Economics, Finance and Management (A86050 F86050) Quantitative Methods for Economics, Finance and Management (A86050 F86050) Matteo Manera matteo.manera@unimib.it Marzio Galeotti marzio.galeotti@unimi.it 1 This material is taken and adapted from Guy Judge

More information

Zhiguang Huo 1, Chi Song 2, George Tseng 3. July 30, 2018

Zhiguang Huo 1, Chi Song 2, George Tseng 3. July 30, 2018 Bayesian latent hierarchical model for transcriptomic meta-analysis to detect biomarkers with clustered meta-patterns of differential expression signals BayesMP Zhiguang Huo 1, Chi Song 2, George Tseng

More information

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is

3. (a) (8 points) There is more than one way to correctly express the null hypothesis in matrix form. One way to state the null hypothesis is Stat 501 Solutions and Comments on Exam 1 Spring 005-4 0-4 1. (a) (5 points) Y ~ N, -1-4 34 (b) (5 points) X (X,X ) = (5,8) ~ N ( 11.5, 0.9375 ) 3 1 (c) (10 points, for each part) (i), (ii), and (v) are

More information

Composite Hypotheses and Generalized Likelihood Ratio Tests

Composite Hypotheses and Generalized Likelihood Ratio Tests Composite Hypotheses and Generalized Likelihood Ratio Tests Rebecca Willett, 06 In many real world problems, it is difficult to precisely specify probability distributions. Our models for data may involve

More information

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs) The One-Way Independent-Samples ANOVA (For Between-Subjects Designs) Computations for the ANOVA In computing the terms required for the F-statistic, we won t explicitly compute any sample variances or

More information

Post-Selection Inference

Post-Selection Inference Classical Inference start end start Post-Selection Inference selected end model data inference data selection model data inference Post-Selection Inference Todd Kuffner Washington University in St. Louis

More information

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018 High-Throughput Sequencing Course Multiple Testing Biostatistics and Bioinformatics Summer 2018 Introduction You have previously considered the significance of a single gene Introduction You have previously

More information

INTERVAL ESTIMATION AND HYPOTHESES TESTING

INTERVAL ESTIMATION AND HYPOTHESES TESTING INTERVAL ESTIMATION AND HYPOTHESES TESTING 1. IDEA An interval rather than a point estimate is often of interest. Confidence intervals are thus important in empirical work. To construct interval estimates,

More information

Controlling Bayes Directional False Discovery Rate in Random Effects Model 1

Controlling Bayes Directional False Discovery Rate in Random Effects Model 1 Controlling Bayes Directional False Discovery Rate in Random Effects Model 1 Sanat K. Sarkar a, Tianhui Zhou b a Temple University, Philadelphia, PA 19122, USA b Wyeth Pharmaceuticals, Collegeville, PA

More information

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests: One sided tests So far all of our tests have been two sided. While this may be a bit easier to understand, this is often not the best way to do a hypothesis test. One simple thing that we can do to get

More information

Many natural processes can be fit to a Poisson distribution

Many natural processes can be fit to a Poisson distribution BE.104 Spring Biostatistics: Poisson Analyses and Power J. L. Sherley Outline 1) Poisson analyses 2) Power What is a Poisson process? Rare events Values are observational (yes or no) Random distributed

More information

Ling 289 Contingency Table Statistics

Ling 289 Contingency Table Statistics Ling 289 Contingency Table Statistics Roger Levy and Christopher Manning This is a summary of the material that we ve covered on contingency tables. Contingency tables: introduction Odds ratios Counting,

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Statistics for Particle Physics. Kyle Cranmer. New York University. Kyle Cranmer (NYU) CERN Academic Training, Feb 2-5, 2009

Statistics for Particle Physics. Kyle Cranmer. New York University. Kyle Cranmer (NYU) CERN Academic Training, Feb 2-5, 2009 Statistics for Particle Physics Kyle Cranmer New York University 91 Remaining Lectures Lecture 3:! Compound hypotheses, nuisance parameters, & similar tests! The Neyman-Construction (illustrated)! Inverted

More information

Statistical Data Analysis Stat 3: p-values, parameter estimation

Statistical Data Analysis Stat 3: p-values, parameter estimation Statistical Data Analysis Stat 3: p-values, parameter estimation London Postgraduate Lectures on Particle Physics; University of London MSci course PH4515 Glen Cowan Physics Department Royal Holloway,

More information

Eco517 Fall 2004 C. Sims MIDTERM EXAM

Eco517 Fall 2004 C. Sims MIDTERM EXAM Eco517 Fall 2004 C. Sims MIDTERM EXAM Answer all four questions. Each is worth 23 points. Do not devote disproportionate time to any one question unless you have answered all the others. (1) We are considering

More information

Statistical inference

Statistical inference Statistical inference Contents 1. Main definitions 2. Estimation 3. Testing L. Trapani MSc Induction - Statistical inference 1 1 Introduction: definition and preliminary theory In this chapter, we shall

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Scatter plot of data from the study. Linear Regression

Scatter plot of data from the study. Linear Regression 1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n = Hypothesis testing I I. What is hypothesis testing? [Note we re temporarily bouncing around in the book a lot! Things will settle down again in a week or so] - Exactly what it says. We develop a hypothesis,

More information

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing STAT 135 Lab 5 Bootstrapping and Hypothesis Testing Rebecca Barter March 2, 2015 The Bootstrap Bootstrap Suppose that we are interested in estimating a parameter θ from some population with members x 1,...,

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

Inference in Regression Model

Inference in Regression Model Inference in Regression Model Christopher Taber Department of Economics University of Wisconsin-Madison March 25, 2009 Outline 1 Final Step of Classical Linear Regression Model 2 Confidence Intervals 3

More information

DATA IN SERIES AND TIME I. Several different techniques depending on data and what one wants to do

DATA IN SERIES AND TIME I. Several different techniques depending on data and what one wants to do DATA IN SERIES AND TIME I Several different techniques depending on data and what one wants to do Data can be a series of events scaled to time or not scaled to time (scaled to space or just occurrence)

More information

http://www.math.uah.edu/stat/hypothesis/.xhtml 1 of 5 7/29/2009 3:14 PM Virtual Laboratories > 9. Hy pothesis Testing > 1 2 3 4 5 6 7 1. The Basic Statistical Model As usual, our starting point is a random

More information

Probability and Statistics Notes

Probability and Statistics Notes Probability and Statistics Notes Chapter Seven Jesse Crawford Department of Mathematics Tarleton State University Spring 2011 (Tarleton State University) Chapter Seven Notes Spring 2011 1 / 42 Outline

More information

Previous lecture. Single variant association. Use genome-wide SNPs to account for confounding (population substructure)

Previous lecture. Single variant association. Use genome-wide SNPs to account for confounding (population substructure) Previous lecture Single variant association Use genome-wide SNPs to account for confounding (population substructure) Estimation of effect size and winner s curse Meta-Analysis Today s outline P-value

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

Scatter plot of data from the study. Linear Regression

Scatter plot of data from the study. Linear Regression 1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25

More information

Modified Simes Critical Values Under Positive Dependence

Modified Simes Critical Values Under Positive Dependence Modified Simes Critical Values Under Positive Dependence Gengqian Cai, Sanat K. Sarkar Clinical Pharmacology Statistics & Programming, BDS, GlaxoSmithKline Statistics Department, Temple University, Philadelphia

More information

FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES

FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES Sanat K. Sarkar a a Department of Statistics, Temple University, Speakman Hall (006-00), Philadelphia, PA 19122, USA Abstract The concept

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Part 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2

Part 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2 Problem.) I will break this into two parts: () Proving w (m) = p( x (m) X i = x i, X j = x j, p ij = p i p j ). In other words, the probability of a specific table in T x given the row and column counts

More information

Chapter 1 Review of Equations and Inequalities

Chapter 1 Review of Equations and Inequalities Chapter 1 Review of Equations and Inequalities Part I Review of Basic Equations Recall that an equation is an expression with an equal sign in the middle. Also recall that, if a question asks you to solve

More information

Statistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing

Statistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing Statistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing So, What is Statistics? Theory and techniques for learning from data How to collect How to analyze How to interpret

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur Lecture No. # 36 Sampling Distribution and Parameter Estimation

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

Analysis of Variance

Analysis of Variance Analysis of Variance Blood coagulation time T avg A 62 60 63 59 61 B 63 67 71 64 65 66 66 C 68 66 71 67 68 68 68 D 56 62 60 61 63 64 63 59 61 64 Blood coagulation time A B C D Combined 56 57 58 59 60 61

More information

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015 STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots March 8, 2015 The duality between CI and hypothesis testing The duality between CI and hypothesis

More information

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1

exp{ (x i) 2 i=1 n i=1 (x i a) 2 (x i ) 2 = exp{ i=1 n i=1 n 2ax i a 2 i=1 4 Hypothesis testing 4. Simple hypotheses A computer tries to distinguish between two sources of signals. Both sources emit independent signals with normally distributed intensity, the signals of the first

More information

Unit 14: Nonparametric Statistical Methods

Unit 14: Nonparametric Statistical Methods Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based

More information

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided

Let us first identify some classes of hypotheses. simple versus simple. H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided Let us first identify some classes of hypotheses. simple versus simple H 0 : θ = θ 0 versus H 1 : θ = θ 1. (1) one-sided H 0 : θ θ 0 versus H 1 : θ > θ 0. (2) two-sided; null on extremes H 0 : θ θ 1 or

More information

A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data

A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data Faming Liang, Chuanhai Liu, and Naisyin Wang Texas A&M University Multiple Hypothesis Testing Introduction

More information

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs The Analysis of Variance (ANOVA) The analysis of variance (ANOVA) is a statistical technique

More information

Testing Research and Statistical Hypotheses

Testing Research and Statistical Hypotheses Testing Research and Statistical Hypotheses Introduction In the last lab we analyzed metric artifact attributes such as thickness or width/thickness ratio. Those were continuous variables, which as you

More information

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST

TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Econometrics Working Paper EWP0402 ISSN 1485-6441 Department of Economics TESTING FOR NORMALITY IN THE LINEAR REGRESSION MODEL: AN EMPIRICAL LIKELIHOOD RATIO TEST Lauren Bin Dong & David E. A. Giles Department

More information

Hypothesis Testing. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis.

Hypothesis Testing. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis. Hypothesis Testing Today, we are going to begin talking about the idea of hypothesis testing how we can use statistics to show that our causal models are valid or invalid. We normally talk about two types

More information

MATH Notebook 3 Spring 2018

MATH Notebook 3 Spring 2018 MATH448001 Notebook 3 Spring 2018 prepared by Professor Jenny Baglivo c Copyright 2010 2018 by Jenny A. Baglivo. All Rights Reserved. 3 MATH448001 Notebook 3 3 3.1 One Way Layout........................................

More information

UCLA STAT 251. Statistical Methods for the Life and Health Sciences. Hypothesis Testing. Instructor: Ivo Dinov,

UCLA STAT 251. Statistical Methods for the Life and Health Sciences. Hypothesis Testing. Instructor: Ivo Dinov, UCLA STAT 251 Statistical Methods for the Life and Health Sciences Instructor: Ivo Dinov, Asst. Prof. In Statistics and Neurology University of California, Los Angeles, Winter 22 http://www.stat.ucla.edu/~dinov/

More information

ECO375 Tutorial 4 Introduction to Statistical Inference

ECO375 Tutorial 4 Introduction to Statistical Inference ECO375 Tutorial 4 Introduction to Statistical Inference Matt Tudball University of Toronto Mississauga October 19, 2017 Matt Tudball (University of Toronto) ECO375H5 October 19, 2017 1 / 26 Statistical

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Looking at the Other Side of Bonferroni

Looking at the Other Side of Bonferroni Department of Biostatistics University of Washington 24 May 2012 Multiple Testing: Control the Type I Error Rate When analyzing genetic data, one will commonly perform over 1 million (and growing) hypothesis

More information

ST495: Survival Analysis: Hypothesis testing and confidence intervals

ST495: Survival Analysis: Hypothesis testing and confidence intervals ST495: Survival Analysis: Hypothesis testing and confidence intervals Eric B. Laber Department of Statistics, North Carolina State University April 3, 2014 I remember that one fateful day when Coach took

More information

Linear Regression. Chapter 3

Linear Regression. Chapter 3 Chapter 3 Linear Regression Once we ve acquired data with multiple variables, one very important question is how the variables are related. For example, we could ask for the relationship between people

More information

Statistics for IT Managers

Statistics for IT Managers Statistics for IT Managers 95-796, Fall 2012 Module 2: Hypothesis Testing and Statistical Inference (5 lectures) Reading: Statistics for Business and Economics, Ch. 5-7 Confidence intervals Given the sample

More information

14.30 Introduction to Statistical Methods in Economics Spring 2009

14.30 Introduction to Statistical Methods in Economics Spring 2009 MIT OpenCourseWare http://ocw.mit.edu 4.0 Introduction to Statistical Methods in Economics Spring 009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

1 Least Squares Estimation - multiple regression.

1 Least Squares Estimation - multiple regression. Introduction to multiple regression. Fall 2010 1 Least Squares Estimation - multiple regression. Let y = {y 1,, y n } be a n 1 vector of dependent variable observations. Let β = {β 0, β 1 } be the 2 1

More information

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6. Chapter 7 Reading 7.1, 7.2 Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.112 Introduction In Chapter 5 and 6, we emphasized

More information

Probability. Lecture Notes. Adolfo J. Rumbos

Probability. Lecture Notes. Adolfo J. Rumbos Probability Lecture Notes Adolfo J. Rumbos October 20, 204 2 Contents Introduction 5. An example from statistical inference................ 5 2 Probability Spaces 9 2. Sample Spaces and σ fields.....................

More information

Fundamental Probability and Statistics

Fundamental Probability and Statistics Fundamental Probability and Statistics "There are known knowns. These are things we know that we know. There are known unknowns. That is to say, there are things that we know we don't know. But there are

More information

Lecture 21: October 19

Lecture 21: October 19 36-705: Intermediate Statistics Fall 2017 Lecturer: Siva Balakrishnan Lecture 21: October 19 21.1 Likelihood Ratio Test (LRT) To test composite versus composite hypotheses the general method is to use

More information

You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question.

You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question. Data 8 Fall 2017 Foundations of Data Science Final INSTRUCTIONS You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question. The exam is closed

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

HAPPY BIRTHDAY CHARLES

HAPPY BIRTHDAY CHARLES HAPPY BIRTHDAY CHARLES MY TALK IS TITLED: Charles Stein's Research Involving Fixed Sample Optimality, Apart from Multivariate Normal Minimax Shrinkage [aka: Everything Else that Charles Wrote] Lawrence

More information

Search for b Ø bz. CDF note Adam Scott, David Stuart UCSB. 1 Exotics Meeting. Blessing

Search for b Ø bz. CDF note Adam Scott, David Stuart UCSB. 1 Exotics Meeting. Blessing Search for b Ø bz CDF note 8465 Adam Scott, David Stuart UCSB Exotics Meeting Blessing 1 Exotics Meeting Analysis in a Nutshell Looking for new particles decaying to Z+jets Select Z s in the dielectron

More information

Multiple samples: Modeling and ANOVA

Multiple samples: Modeling and ANOVA Multiple samples: Modeling and Patrick Breheny April 29 Patrick Breheny Introduction to Biostatistics (171:161) 1/23 Multiple group studies In the latter half of this course, we have discussed the analysis

More information

determine whether or not this relationship is.

determine whether or not this relationship is. Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population

More information

Statistical Tests. Matthieu de Lapparent

Statistical Tests. Matthieu de Lapparent Statistical Tests Matthieu de Lapparent matthieu.delapparent@epfl.ch Transport and Mobility Laboratory, School of Architecture, Civil and Environmental Engineering, Ecole Polytechnique Fédérale de Lausanne

More information

Advanced Statistical Methods: Beyond Linear Regression

Advanced Statistical Methods: Beyond Linear Regression Advanced Statistical Methods: Beyond Linear Regression John R. Stevens Utah State University Notes 3. Statistical Methods II Mathematics Educators Worshop 28 March 2009 1 http://www.stat.usu.edu/~jrstevens/pcmi

More information

Charles Geyer University of Minnesota. joint work with. Glen Meeden University of Minnesota.

Charles Geyer University of Minnesota. joint work with. Glen Meeden University of Minnesota. Fuzzy Confidence Intervals and P -values Charles Geyer University of Minnesota joint work with Glen Meeden University of Minnesota http://www.stat.umn.edu/geyer/fuzz 1 Ordinary Confidence Intervals OK

More information

10. Composite Hypothesis Testing. ECE 830, Spring 2014

10. Composite Hypothesis Testing. ECE 830, Spring 2014 10. Composite Hypothesis Testing ECE 830, Spring 2014 1 / 25 In many real world problems, it is difficult to precisely specify probability distributions. Our models for data may involve unknown parameters

More information

HYPOTHESIS TESTING: FREQUENTIST APPROACH.

HYPOTHESIS TESTING: FREQUENTIST APPROACH. HYPOTHESIS TESTING: FREQUENTIST APPROACH. These notes summarize the lectures on (the frequentist approach to) hypothesis testing. You should be familiar with the standard hypothesis testing from previous

More information