Statistical Hypothesis Testing. STAT 536: Genetic Statistics. Statistical Hypothesis Testing - Terminology. Hardy-Weinberg Disequilibrium

Size: px
Start display at page:

Download "Statistical Hypothesis Testing. STAT 536: Genetic Statistics. Statistical Hypothesis Testing - Terminology. Hardy-Weinberg Disequilibrium"

Transcription

1 Statistical Hypothesis Testig STAT 536: Geetic Statistics Kari S. Dorma Departmet of Statistics Iowa State Uiversity September 7, 006 Idetify a hypothesis, a idea you wat to test for its applicability to your data set. Hardy-Weiberg equilibrium applies to this data set. The two loci I am studyig are idepedet of each other. Idetify ad calculate a test statistic. Ideally, the statistic should: summarize ad accetuate ay deviatios of the data from what is expected uder the hypothesis, have a kow samplig distributio uder the ull hypothesis. Compute or estimate the probability of the observed test statistic uder the assumptio of the hypothesis. Reject the hypothesis if this probability is small. Statistical Hypothesis Testig - Termiology Null hypothesis (H 0 ): the hypothesis you wish to test. Alterative hypothesis (H a ): whe you reject the ull hypothesis, you coclude the alterative hypothesis or hypotheses. Type I error: the test statistic causes you to (erroeously) reject the hypothesis whe it is true. size, or sigificace level (α): the probability of a type I error Type II error: you accept the hypothesis whe it is false β: the probability of a type II error power (1 β): probability that you reject the hypothesis whe it is false. Procedure: classically, oe decides o the size of the test before collectig the data, the selects the most powerful test for the desired size. Hypothesis Accepted Hypothesis Rejected Hypothesis True 1 α Type I (size, α) Hypothesis False Type II (β) power (1 β) Hardy-Weiberg Disequilibrium Let u ad v be alleles at a sigle locus. The, HWE implies P uu = pu P uv = p u p v wheever u v where P uv is the populatio geotype frequecy ad p u are the populatio allele frequecies. Recall that if two radom variables X ad Y are idepedet, the P(X ad Y ) = P(X)P(Y ). I Eglish, kowig X tells you othig about Y ad vice versa. HWE equatios imply idepedece (or o associatio) amog the alleles at a locus.

2 Hardy-Weiberg Disequilibrium (cot) These equatios may ot be satisfied i a populatio where ay oe (or more) of the Hardy-Weiberg assumptios is (are) violated. Whe the equatios are ot satisfied, a Hardy-Weiberg disequilibrium applies. P uu p u P uv p u p v wheever u v Oe could mathematically quatitate this disequilibrium i multiple ways. We will cosider two for ow: Suppose there is covariatio amog alleles. Write the disequilibrium i terms of this covariatio. Cosider subtractive disequilibrium: P uu p u ad P uv p u p v. Cosider multiplicative disequilibrium oce agai ad covert to log-liear model. Covariatio Betwee > Alleles What do you do with f whe there are more tha alleles? Say, for example, there are 3 alleles u, v, ad w. There ca be correlatio betwee u ad v or u ad w, etc. We ca subscript f. Let f uv be the correlatio betwee alleles u ad v, where u ca equal v. The, P uu = pu + p u (1 p u ) f uu P uv = p u p v (1 f uv ). But, we are ot doe. There are relatioships amog these parameters. If there are differet alleles, there are 1 free allele frequecies p u ad (+1) correlatio coefficiets f uv, for a total of +3 parameters. However, there are oly (+1) 1 free geotype couts i the data. How may relatioships are there amog the parameters? Covariatio Betwee Alleles Let x j be a idicator variable idicatig whether the jth allele of a radom idividual is allele 1. Recall the model What is the meaig of f? Var `x j Cov `x i, x j Corr `x i, x j P 11 = p 1 + p 1(1 p 1 )f P 1 = p 1 (1 p 1 )(1 f ) = p 1 (1 p 1 ) = E `x i x j E (xi ) E `x j = P11 p1 Cov `x i, x j = qvar (x i ) Var `x j = P 11 p 1 p 1 (1 p 1 ) = p 1 + p 1 (1 p 1 ) f p 1 p 1 (1 p 1 ) = f. Covariatio Betwee > Alleles (cot) There are d.f. i parameters d.f. i data = + 3 ( + 1) + 1 = relatioships amog the model parameters. To fid the relatioships amog the parameters, recall that p u = P uu + 1 P uv. There are such relatioships, ad they will cosume all the extra degrees of freedom. Substitute i the expressios for the geotype frequecies to observe f uu = p v f uv. 1 p u v u v u

3 Covariatio Betwee > Alleles (d approach) Suppose that associatios betwee alleles are ot a cosequece of specific iteractios amog alleles. Suppose istead that there is a geeral associatio betwee alleles regardless of allele idetity. (We will discuss how this ca arise later.) The, there is oe correlatio that applies to all pairs of alleles u ad v ad the applicable equatios are agai P uu = p u + p u (1 p u ) f P uv = p u p v (1 f ) where u v. There are 1 free allele frequecies p u ad 1 free correlatio f, leadig to free parameters. There are plety of degrees of freedom (+1) 1 i the data to cover these parameters. Problems with Correlatios f uv But all is ot perfect. Notice, f appears as a multiplier of allele frequecies. We model disequilibrium as a multiplicative factor that expads/shriks geotype couts away from HWE expectatio. We derived some estimates for correlatio f i previous lecture(s). I geeral, estimatio of multiplicative factors ivolves ratios of statistics. Ratios are otoriously difficult statistically. A easier approach, statistically, is to look for additive disequilibrium. Defie D uu = P uu pu D uv = P uv p u p v. Let D uv = D uv, to write more coveietly P uu = pu + D uu P uv = p u p v D uv wheever u v. d.f. for Additive Disequilibrium Agai, there must be relatioships amog the parameters to accout for the differece i degrees of freedom. p u = P uu + 1 P uv v u = pu + D uu + v<u = p u v u p v + D uu v<u = p u + D uu v<u D uv (p u p v D uv ) D uv Rage of Additive Disequilibrium Because 0 P uu, P uv 1, we also recogize that the additive disequilibrium ca t be just aythig. I fact, the total list of costraits is p u D uu 1 p u p u p v 1 D uv p u p v D uu = X v<u D uv I the case of just two alleles at a locus 1 ad, the D 11 = D 1 = D, so P 11 = p 1 + D 1 P 1 = p 1 p D 1 leadig to D uu = v<u D uv. ad P = p + D 1 max pu D 1 p 1 p. u {1,}

4 Testig D 1 = 0 (HWE) Estimatig D 1 Testig for HWE is equivalet to testig the ull hypothesis H 0 : D 1 = 0. Here, we have restricted to the two allele case. We eed two thigs: A estimate ˆD A. Is it close to 0? A samplig distributio for the estimate ˆD 1 to determie whether it is farther from 0 tha we would expect by chace. Two free parameters p 1, D 1 ad two free pieces of data 11, 1 suggests Bailey s method: 11 = ( p 1 + D 1 ) ca be solved to produce 1 = (p 1 p D 1 ) ˆp 1 = = p 1 ˆD 1 = 11 p 1 = P 11 p 1. ˆD 1 Bias ˆD 1 Samplig Variace Is our estimate ˆD 1 ubiased? E ˆD1 ) = E ( P11 E ( p 1 ) = P 11 p1 1 ( p1 + P 11 p 1) = D 1 1 [p 1 (1 p 1 ) + D 1 ]. Sice E ˆD1 D 1 we coclude that the estimator is biased. However, we are ecouraged to ote that as, the bias goes to 0. Usig Fisher s approximatio (it applies because ˆD 1 is a fuctio of proportios), we obtai a approximate samplig variace for ˆD 1 Var ˆD1 1 [ ] p1 (1 p 1 ) + (1 p 1 ) D 1 D1. To estimate the samplig variace, we substitute i our estimates for p 1 ad D 1 Var ˆD1 ˆ= 1 [ ˆp 1 (1 ˆp 1 ) + (1 ˆp 1 ) ˆD1 ˆD ] 1. Sice ˆD 1 is the MLE, we have for large samples that [ ] ˆD 1 N E ˆD1, Var ˆD1.

5 Testig H 0 : D 1 = 0 Usig z-values Computig z Compute the stadard ormal variate z ˆD 1 E ˆD1 z = Var ˆD1 Uder the ull hypothesis H 0, z approximately follows the stadard ormal distributio. Compare z agaist stadard ormal distributio. The key is that if ˆD 1 is very positive or very egative, the z will ted to be far from 0 ad your statistic will fall i the tails of the samplig distributio, where it is ot expected to fall if the ull hypothesis of HWE is true. Uder the ull hypothesis, we kow E ˆD1 0 Var ˆD1 1 so z = [ ˆp 1 (1 ˆp 1 ) ]. ˆD1 ˆp 1 (1 ˆp 1 ) Note, we have assumed is sufficietly large that the bias term is egligible. Relevat Alterative Hypotheses Relevat Alterative Hypotheses (cot) P 11 = p 1 + D 1 P 1 = p 1 p D 1 P = p + D 1 Depedig o your purpose, there may be differet alterative hypotheses you cosider. Suppose z = 1.5, the H A : D 1 > 0 or D 1 < 0. You have o a priori feelig for whether heterozygotes will be over- or uder-represeted. Use a two-tailed test. > *porm(q=-1.5) [1] > *(1-porm(q=1.5)) [1] Oe-side hypotheses are appropriate whe you suspected heterozygotes would either be uder- or over-represeted before you collected the data. H A : D 1 > 0. You suspect that heterozygotes will be uder-represeted. Use a oe-tailed (right tail) test. > (1-porm(q=1.5)) [1] H A : D 1 < 0. You suspect that heterozygotes will be over-represeted. Use a oe-tailed (left tail) test. > porm(q=1.5) [1]

6 Chi-Square Chi-Square Goodess-of-Fit A equivalet test is the Chi-Square test for HWE. It depeds o comparig z agaist its samplig distributio, which uder the ull, is a chi-square distributio with 1 degree of freedom. z = X 1 = ˆD 1 ˆp 1 (1 ˆp 1). However, ote that both positive ad egative values of z give the same z statistic. It is ot so easy to cosider oe-sided alterative hypotheses. Sice the tests are equivalet, use the z statistic for oe-sided tests. Test Assumptio: The sample size is large so both ormality (or chi-square) applies ad bias ca be igored. Geotype 11 1 Observed (O) 11 1 Expected (E) Observed - Expected ˆp 1 ˆD 1 ˆp 1 (1 ˆp 1 ) ˆD 1 (1 ˆp 1 ) ˆD 1 Here, we have made the assumptio that is sufficietly large that the bias terms are 0. The goodess-of-fit chi-square statistic is defied as X 1 = = geotypes ( ˆD ) 1 ˆp 1 (O E) E ( ˆD 1 ) ( ˆD 1 ) + ˆp 1 (1 ˆp 1 ) + (1 ˆp 1 ). Stadard Cautios About Chi-Square Tests Likelihood Ratio Suppose your ull hypothesis is Apply oly whe expected couts E 5. Because the expected couts E appear i the deomiator, small variatio whe they are small results i huge chages i X 1. Apply Yates correctio to accout for discrete ature of data. Because the observed data are discrete, but the samplig distributio (ormal or chi-square) is cotiuous, the Yates correctio is recommeded: X1 ( O E 0.5) = E geotypes H 0 : φ = φ 0 for some parameter φ. Let the maximum likelihood value uder H 0 be L 0 ad the maximum likelihood value without the restrictio o φ be L 1. The, L 0 will always be smaller tha L 1 sice φ 0 may ot be the maximum likelihood value of φ. However, if the ull is true, ˆφ should be very close to φ 0 ad L 0 will be very close to L 1. Defie the likelihood ratio as λ = L 0 L 1. Whe the ull hypothesis is true ad the size of φ is s, the l λ χ (s).

7 Likelihood Ratio Test for HWE Uder the ucostraied model (the alterative hypothesis), the parameters are p 11, p 1, p ad the data are 11, 1,. There are two degrees of freedom i the model ad the data, so Bailey s method applies to yield ˆp 11 = 11 ˆp 1 = 1. The maximum likelihood uder the ucostraied model is L 1 =! 11! 1!! ( 11 ) 11 ( 1 Uder the costraied model, ˆP 11 = ˆp 1 ˆP 1 = ˆp 1 (1 ˆp 1 ) with ˆp 1 = 1. ) 1 ( A Multiplicative Model That Works Let us cosider aother multiplicative model P 11 = MM 1 M 11 P 1 = MM 1 M M 1 P = MM M Here M is the mea effect, M 11, M 1, M represet associatios betwee allele frequecies, ad M 1, M represet the allele frequecy cotributios. Takig logarithms puts this model back i the additive space l P 11 = l M + l M 1 + l M 11 ) l P 1 = l M + l + l M 1 + l M + l M 1 l P = l M + l M + l M Likelihood Ratio Test for HWE (cot) The maximum likelihood uder the costraied model is L 0 =! ( 1 11! 1!! ) 11 ( 1 () ) 1 ( ). The test statistic is therefore [ ] 11 1 ( 1 ) 1 l λ = l () Hadlig Overparameterizatio There are, as usual, more parameters tha observatios. Ad this time there are multiple ways to deal with the overparameterizatio. Oe way is to set the M M 1 = 1 M M = 1, P 11 = MM 1 M 11 P 1 = MM 1 P = M. There is still a extra degree of freedom, but summig all three equatios yields 1 = M M1 M 11 + M or 1 M = 1 + M 1 + M1 M. 11

8 Estimatig Parameters M 1 ad M 11 Agai, Bailey s method applies ad the maximum likelihood estimates are ˆM 1 = 1 Mˆ 11 = with the tag-alog ˆM =. Substitutig these MLEs back ito the origial multiplicative equatios produces the same likelihood uder H A. ˆP 11 = 11 ˆP 1 = 1 ˆP = L 1 =! 11! 1!! Log Likelihood Test for Multiplicate Model HWE implies o iteractio term, i.e. M 11 = 1. Uder this costrait, we agai apply Bailey s method to fid ) ( 1 ˆP 11 = ˆP 1 = 1 ( ) ˆP =. It turs out that λ has the same form uder this log-liear model as the additive disequilibrium model. So the log-liear model for testig HWE is equivalet to the additive model. Exact Tests Exact Tests for HWE If the probability of the observed sample uder the ull hypothesis ad all less likely samples is small, the the evidece suggests the data is ulikely to have arise uder the ull hypothesis. If oe ca compute the probability of all possible samples, the obtaiig a exact probability (o approximatio) is possible. are useful whe all the possible observed data ca be eumerated practically. This occurs geerally whe the expected couts are small i some categories (i.e. whe the previous tests fail). The probability of the observed data 11, 1, is give by the multiomial distributio P ( 11, 1, ) =! 11! 1!! p p 1 1 p. Whe HWE applies, the P ( 11, 1, ) =! 11! 1!! p 11 1 (p 1 p ) 1 p. I additio, the allele couts 1 ad are biomially distributed P ( 1, ) = ()! 1!! p 1 1 p. The coditioal probability, where we coditio o the observed allele couts is P ( 11, 1, 1, ) = P ( 11, 1,, 1, ) P ( 1, )! 1!! 1 = 11! 1!!()!. = P ( 11, 1, ) P ( 1, )

9 Exact Test for HWE (Example) Summary of Tests for HWE Suppose we observe 11 = 10, 1 = 1, =. Use a exact test to calculate a p-value for rejectig the ull hypothesis of HWE Probability Cumul. Prob The p-value is p = sice there is o dataset more extreme tha the observed. Normal approximatio for MLEs uses the z statistic. Chi-square test uses the X 1 = z statistic ad is equivalet to the above test. The chi-square goodess-of-fit test is idetical to the above chi-square test, but highlights the eed for substatial data i each category. The likelihood ratio test is widely applicable ad flexible whe a likelihood fuctio is available. The log-liear model uses a multiplicative model ad leads to a test equivalet to the likelihood ratio test. The exact test is useful whe the data set is small ad particularly whe couts i some categories are small. Tests for Multiple Alleles Testig Complete HWE Whe there are more tha two alleles at a locus, the geeral equatios P uu = p u + D uu P uv = p u p v D uv wheever u v Therefore l λ for H 0 : D uv = 0 for all u v approximately follows a chi-square distributio with k(k 1) degrees of freedom. If you are more comfortable with a goodess-of-fit test, that statistic apply, with relatioship D uu = X v<u D uv X T = u v ( uv E ( uv ) 0.5) E ( uv ) ad MLEs obtaied by Bailey s method (verify this; it is ot hard) with ˆp u = p u ˆD uv = p u p v 1 P uv. Uder complete HWE, D uv = 0 for all u v, that is there are k(k 1) costraits applied (oe for each heterozygote) whe there are k alleles. E ( uu ) = p u E ( uv ) = p u p v. follows the same samplig distributio.

10 Testig Partial HWE z-test for Partial HWE If you wat to test oly certai combiatios of alleles, the tests are more complicated. Test a sigle D uv. Example: If H 0 : D 1 = 0, the likelihood ratio statistic l λ follows a chi-square with 1 degree of freedom. But Bailey s method does ot apply uless k = so L 0 is difficult to compute. Iterative methods are required. Or, oe could apply the z-test or the chi-square test. Details for these tests follow o the ext slide. Test multiple, but ot all D uv. Example: If k = 4 ad H 0 : D 1, D 34 = 0, l λ follows a chi-square with degrees of freedom. Iterative methods are still required. This time, there are o easy alteratives. Complex hypotheses cause difficulties for the z-test ad related chi-square. The likelihood ratio test hadles complex hypotheses quite aturally. Here, you oly eed the MLEs uder the full alterative model (where Bailey s method applies). For large samples, the MLE ˆD uv is approximately ormally distributed uder H 0 with mea 0 ad variace Var ˆDuv. z uv = ˆD uv r Var ˆDuv We ca agai use Fisher s approximatio to compute the variace. 8 1 < Var ˆDuv = : p up v [(1 p u )(1 p v ) + p u p v ] h (1 p u p v ) (p u p v ) i D uv 9 + X = (pud vw + pv D uw ) Duv ; w u,v Uder the H 0, the variace is obtaied by assumig D uv = 0. Exact Tests for Multiple Alleles Approximate Exact Tests The exact test geeralizes to multiple alleles. The formula is GeePOP iput: Test loc1 Pop a1, 0101 a, 0101 a3, 0101 a4, 00 a5, 003 P ({ uv } { u }) =!H Q u u! ()! Q u,v uv!. GUO, S. ad THOMPSON, E Performig the exact test of Hardy Weiberg proportio for multiple alleles. Biometrics 48 pp LAZZERONI, L. C. ad LANGE, K Markov chais for Mote Carlo tests of geetic equilibrium i multidimesioal cotigecy tables. A. Statist. 5 pp Whe it is impossible to eumerate all the possible datasets with the same allele frequecies, approximate methods are eeded. Oe of the simplest uses Mote Carlo. Calculate F = P ({ uv } { u }) for the observed data. Set S = 0 ad put all your geotypes (13, 66, 31, 16,...) i a big vector of legth Permute all alleles ad clump successive alleles ito geotypes. (36)(16)(36)(13) Compute F = P ({ uv } { u }) for the permuted dataset. If F F, icremet S by 1. Repeat M times. Estimate the p-value as S M.

11 Example Power Calculatios Cosider the followig table of geotype couts for a locus with four alleles. A 1 0 A 3 1 A A A 1 A A 3 A 4 Method Estimate Exact χ MC itegratio (M = 1700) Recall that the power of a statistical test is the probability that you reject the ull hypothesis whe it is false. Before you begi collectig data, you may wish to estimate whether you will be able to detect a disequilibrium of a give size, say D 1 = To detect it meas you reject the ull that D 1 = 0. The test statistic X1 = ˆD 1 ˆp 1 (1 ˆp 1) follows a differet distributio depedig o whether H 0 is true or ot. H 0 true X1 χ (1) H 0 false X1 χ (1,ν) where χ (1,ν) is the ocetral chi-square distributio with ocetrality parameter ν. The Nocetrality Parameter Size of D 1 or The ocetral chi-square distributio is a approximate distributio uder H A. The ocetrality parameter is give by ν = D 1 p 1 (1 p 1) ν is bigger whe D 1 is farther from 0. But ote, the approximatio is oly valid whe ν is small, say of order 1. If we take the stadard sigificace level α = 0.05, the we will reject H 0 if X 1 > With this kowledge oe ca ask a couple of questios How big does D 1 eed to be i order to have 90% chace of gettig X 1 > 3.84 ad therefore rejectig H 0 ad detectig the disequilibrium? How big does my sample eed to be i order to detect a disequilibrium D 1 = 0.1 with 90% probability? r ν D 1 = p 1 (1 p 1 ) Whe ν = 10.5, the X 1 will exceed 3.84 with 90% probability. (These kids of results are tabulated or available i statistics packages.) So, for alterative sample sizes you ca see how large a disequilibrium D 1 you will be likely (90% probabilitity) to detect. Also, further rearragemet yields = νp A(1 p A ) D A which ca be used to compute the size of the sample we ll eed to detect a specified disequilibrium D A. Note, for both of these applicatios, you eed to kow p A to use the formulas.

12 Power Calculatios for Exact Tests Power Calculatios for Exact Tests (cot) Recall, you eed to calculate P ({ uv } { u }) = P ({ uv}) P ({ u }) Uder the alterative hypothesis, the geotype frequecies are give by the formulas P uu = p u + D uu so you ca compute the umerator P uv = p u p v D uv wheever u v,! Y P ({ uv }) = Q u v uv! u v P uv uv But, of course, u = uu + 1 uv. so if you kow P ({ uv }), you ca compute P ({ u }) by summig over the former. u<v but you ca o loger assume the biomial distributio for allele frequecies.

STAT 536: Genetic Statistics

STAT 536: Genetic Statistics STAT 536: Genetic Statistics Tests for Hardy Weinberg Equilibrium Karin S. Dorman Department of Statistics Iowa State University September 7, 2006 Statistical Hypothesis Testing Identify a hypothesis,

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

General IxJ Contingency Tables

General IxJ Contingency Tables page1 Geeral x Cotigecy Tables We ow geeralize our previous results from the prospective, retrospective ad cross-sectioal studies ad the Poisso samplig case to x cotigecy tables. For such tables, the test

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab

Table 12.1: Contingency table. Feature b. 1 N 11 N 12 N 1b 2 N 21 N 22 N 2b. ... a N a1 N a2 N ab Sectio 12 Tests of idepedece ad homogeeity I this lecture we will cosider a situatio whe our observatios are classified by two differet features ad we would like to test if these features are idepedet

More information

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Chapter 13: Tests of Hypothesis Section 13.1 Introduction Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

1 Models for Matched Pairs

1 Models for Matched Pairs 1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor

More information

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,

More information

UCLA STAT 110B Applied Statistics for Engineering and the Sciences

UCLA STAT 110B Applied Statistics for Engineering and the Sciences UCLA STAT 110B Applied Statistics for Egieerig ad the Scieces Istructor: Ivo Diov, Asst. Prof. I Statistics ad Neurology Teachig Assistats: Bria Ng, UCLA Statistics Uiversity of Califoria, Los Ageles,

More information

Topic 18: Composite Hypotheses

Topic 18: Composite Hypotheses Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Sampling Distributions, Z-Tests, Power

Sampling Distributions, Z-Tests, Power Samplig Distributios, Z-Tests, Power We draw ifereces about populatio parameters from sample statistics Sample proportio approximates populatio proportio Sample mea approximates populatio mea Sample variace

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

Economics Spring 2015

Economics Spring 2015 1 Ecoomics 400 -- Sprig 015 /17/015 pp. 30-38; Ch. 7.1.4-7. New Stata Assigmet ad ew MyStatlab assigmet, both due Feb 4th Midterm Exam Thursday Feb 6th, Chapters 1-7 of Groeber text ad all relevat lectures

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments: Recall: STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Commets:. So far we have estimates of the parameters! 0 ad!, but have o idea how good these estimates are. Assumptio: E(Y x)! 0 +! x (liear coditioal

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise)

Lecture 22: Review for Exam 2. 1 Basic Model Assumptions (without Gaussian Noise) Lecture 22: Review for Exam 2 Basic Model Assumptios (without Gaussia Noise) We model oe cotiuous respose variable Y, as a liear fuctio of p umerical predictors, plus oise: Y = β 0 + β X +... β p X p +

More information

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph

Correlation. Two variables: Which test? Relationship Between Two Numerical Variables. Two variables: Which test? Contingency table Grouped bar graph Correlatio Y Two variables: Which test? X Explaatory variable Respose variable Categorical Numerical Categorical Cotigecy table Cotigecy Logistic Grouped bar graph aalysis regressio Mosaic plot Numerical

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is: PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Chapter 12 Correlation

Chapter 12 Correlation Chapter Correlatio Correlatio is very similar to regressio with oe very importat differece. Regressio is used to explore the relatioship betwee a idepedet variable ad a depedet variable, whereas correlatio

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),

More information

11 Correlation and Regression

11 Correlation and Regression 11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 7 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 013 by D.B. Rowe 1 Ageda: Skip Recap Chapter 10.5 ad 10.6 Lecture Chapter 11.1-11. Review Chapters 9 ad 10

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution. Importat Formulas Chapter 3 Data Descriptio Mea for idividual data: X = _ ΣX Mea for grouped data: X= _ Σf X m Stadard deviatio for a sample: _ s = Σ(X _ X ) or s = 1 (Σ X ) (Σ X ) ( 1) Stadard deviatio

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5 CS434a/54a: Patter Recogitio Prof. Olga Veksler Lecture 5 Today Itroductio to parameter estimatio Two methods for parameter estimatio Maimum Likelihood Estimatio Bayesia Estimatio Itroducto Bayesia Decisio

More information

Chapter 5: Hypothesis testing

Chapter 5: Hypothesis testing Slide 5. Chapter 5: Hypothesis testig Hypothesis testig is about makig decisios Is a hypothesis true or false? Are wome paid less, o average, tha me? Barrow, Statistics for Ecoomics, Accoutig ad Busiess

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M. MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1 EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

Correlation Regression

Correlation Regression Correlatio Regressio While correlatio methods measure the stregth of a liear relatioship betwee two variables, we might wish to go a little further: How much does oe variable chage for a give chage i aother

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS

NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS NANYANG TECHNOLOGICAL UNIVERSITY SYLLABUS FOR ENTRANCE EXAMINATION FOR INTERNATIONAL STUDENTS AO-LEVEL MATHEMATICS STRUCTURE OF EXAMINATION PAPER. There will be oe 2-hour paper cosistig of 4 questios.

More information

Chapter 22: What is a Test of Significance?

Chapter 22: What is a Test of Significance? Chapter 22: What is a Test of Sigificace? Thought Questio Assume that the statemet If it s Saturday, the it s the weeked is true. followig statemets will also be true? Which of the If it s the weeked,

More information

1 Constructing and Interpreting a Confidence Interval

1 Constructing and Interpreting a Confidence Interval Itroductory Applied Ecoometrics EEP/IAS 118 Sprig 2014 WARM UP: Match the terms i the table with the correct formula: Adrew Crae-Droesch Sectio #6 5 March 2014 ˆ Let X be a radom variable with mea µ ad

More information

POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES*

POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES* Kobe Uiversity Ecoomic Review 50(2004) 3 POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES* By HISASHI TANIZAKI There are various kids of oparametric

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

Efficient GMM LECTURE 12 GMM II

Efficient GMM LECTURE 12 GMM II DECEMBER 1 010 LECTURE 1 II Efficiet The estimator depeds o the choice of the weight matrix A. The efficiet estimator is the oe that has the smallest asymptotic variace amog all estimators defied by differet

More information

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N. 3/3/04 CDS M Phil Old Least Squares (OLS) Vijayamohaa Pillai N CDS M Phil Vijayamoha CDS M Phil Vijayamoha Types of Relatioships Oly oe idepedet variable, Relatioship betwee ad is Liear relatioships Curviliear

More information

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 7: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review How ca we set a cofidece iterval o a proportio? 2 Review How ca we set a cofidece iterval

More information

STAC51: Categorical data Analysis

STAC51: Categorical data Analysis STAC51: Categorical data Aalysis Mahida Samarakoo Jauary 28, 2016 Mahida Samarakoo STAC51: Categorical data Aalysis 1 / 35 Table of cotets Iferece for Proportios 1 Iferece for Proportios Mahida Samarakoo

More information

1036: Probability & Statistics

1036: Probability & Statistics 036: Probability & Statistics Lecture 0 Oe- ad Two-Sample Tests of Hypotheses 0- Statistical Hypotheses Decisio based o experimetal evidece whether Coffee drikig icreases the risk of cacer i humas. A perso

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

University of California, Los Angeles Department of Statistics. Hypothesis testing

University of California, Los Angeles Department of Statistics. Hypothesis testing Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Elemets of a hypothesis test: Hypothesis testig Istructor: Nicolas Christou 1. Null hypothesis, H 0 (claim about µ, p, σ 2, µ

More information

Lecture 33: Bootstrap

Lecture 33: Bootstrap Lecture 33: ootstrap Motivatio To evaluate ad compare differet estimators, we eed cosistet estimators of variaces or asymptotic variaces of estimators. This is also importat for hypothesis testig ad cofidece

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information