Lecture 12: Hypothesis Testing

Size: px
Start display at page:

Download "Lecture 12: Hypothesis Testing"

Transcription

1 9.07 Itroductio to Statistics for Brai ad Cogitive Scieces Emery N. Brow Lecture : Hypothesis Testig I. Objectives. Uderstad the hypothesis testig paradigm.. Uderstad how hypothesis testig procedures are costructed. 3. Uderstad how to do sample size calculatios. 4. Uderstad the relatio betwee hypothesis testig, cofidece itervals, likelihood ad Bayesia methods ad their uses for iferece purposes. II. The Hypothesis Testig Paradigm ad Oe-Sample Tests A. Oe-Sample Tests To motivate the hypothesis testig paradigm we review first two problems. I both cases there is a sigle sample of data. Example 3. (cotiued) Aalysis of MEG Sesor Bias. Is there a oise bias i this SQUID sesor? Here we have a sample,..., x where we model each x i : (, ) We assume that x N µ. µ is ukow ad is kow. If there is bias, the µ 0, ad µ > 0 would suggest a positive bias where µ < 0 would suggest a egative bias. Example. (cotiued) Behavioral Learig. Has the aimal leared the task? We have data x,..., x ad we recall that our model is x i : B(, p) where p is ukow. We defie learig as performace greater tha expected by chace. I particular, chace suggests p = (biary choice), learig suggests p >, whereas impaired learig suggests p <. To defie the hypothesis testig paradigm we state a series of defiitios. A hypothesis is the statemet of a scietific questio i terms of a proposed value of a parameter i a probability model. Hypothesis testig is a process of establishig proof by falsificatio. It has two essetial compoets: a ull hypothesis ad a alterative hypothesis. The ull hypothesis is a stated value of the parameter which defies the hypothesis we wat to falsify. It is usually stated as a sigle value although it ca be composite. We deote it as H 0. The alterative hypothesis is the hypothesis whose veracity we wish to establish. It is usually defied by a value or set of values of the parameter that are differet from the oe specified i the ull hypothesis. We deote it as H A.

2 page : 9.07 Lecture : Hypothesis Testig To establish the result, we carry out a test i a attempt to reject the ull hypothesis. The test is a procedure based o the observed data that allows us to choose betwee the ull ad alterative hypothesis. Example 3. (cotiued) Aalysis of MEG Sesor Bias. For this example the ull hypothesis could be H 0 : µ = 0 ad alterative hypotheses could be H A : µ > 0 or H A : µ < 0. Example. (cotiued) Behavioral Learig. Here the ull hypothesis is H : p = ad the 0 alterative hypotheses could be oe-sided as either or H A : p > H A : p < or two-sided H A : p. To ivestigate the hypothesis we require a test statistic. A test statistic is a statistic whose values will allow us to distiguish betwee the ull ad the alterative hypotheses. Example 3. (cotiued) Aalysis of MEG Sesor Bias. For this example we recall that x : N ( µ, ). Hece, we ca choose x, because of its distributioal properties. I geeral, choosig the statistic is a importat issue. Here there are two cases: Case i): If x >> 0 or x << 0 we coclude µ 0. That is, we would be willig to coclude µ 0 if x is sufficietly large. Ideed, the larger x is i absolute value, the more likely we are to coclude µ 0. If our data allow us to reach this coclusio, we say we reject the ull hypothesis H 0. Case ii) If x is close to 0, we say we fail to reject the ull hypothesis H 0. We do ot say we accept the ull hypothesis because, i this case, we do ot reach a coclusio. There are two types of errors we ca commit i hypothesis testig. If we reject H 0 whe it is true, this is a error. It is called a Type error. The probability of the error is deoted as α. We write

3 page 3: 9.07 Lecture : Hypothesis Testig I Example 3. we have α = Pr(rejectig H0 H0 is true). α = Pr( x > c α µ = 0). We choose α as small as possible. Typical values are α = 0.0 ad α = What is c α? To determie it, we compute Pr(Type I error) = Pr(rejectig H H true) = α. 0 0 We wat α to be small α = Pr( x > c α µ = µ 0 = 0) ( x µ ) 0 ( c α µ 0 ) α = Pr( > ) x µ = Pr( 0 > z α ) where z α is a quatile of the stadard Gaussia. Hece, we have c α = µ 0 + z α. Some values of z α for the correspodig values of α are α z α The area to the right of z α or c α is the critical regio. It has probability cotet of α. The value c α is the cut-off value. This test based o the ormal distributio is the z-test. If H 0 is ot true, i.e. H A is true ad we fail to reject H 0, the this is also a error. It is termed a Type II error or β error ad is defied as Pr(Type II error) = Pr(fail to reject H H is true) 0 A Assume H A is true, i.e. µ = µ A > 0. The we have β = Pr( x < c α µ = µ A) ( x µ A ) ( cα µ A) = Pr( < ) = Pr( ( x µ A ) < z β )

4 page 4: 9.07 Lecture : Hypothesis Testig We do ot talk i terms of β but β which is the power of the test. The power of the test is the probability of rejectig the ull hypothesis whe it is false. We compute it as power = β = Pr(rejectig H H is true) 0 A = Pr( x > c α µ = µ A ). Remark.. You should ot carry out a test if the power is ot at least > 0.80 for importat H A. Remark.. Never report a egative result (failig to reject H 0 ) without reportig the power. Remark.3. A statistical test is a iformatio assay. As such, it is oly as useful as it is powerful agaist a importat alterative hypothesis. If we reject H 0 we report the p-value which is the smallest value of α for which we ca reject H 0. The p-value is also the probability of all evets that are at least as rare as the observed statistic. It is the probability that we are makig a mistake i rejectig the ull hypothesis whe the ull hypothesis is true. A observed value of the test statistic has statistical sigificace if p < α. Statistical sigificace does ot imply scietific sigificace. Example 3. (cotiued) Aalysis of MEG Sesor Bias. I this example assume we have 500 observatios,..., x ad the stadard deviatio is =. 0 x f Tesla. Suppose we wish to test H 0 : µ = 0 agaist the alterative hypothesis H A : µ > 0 with α = If we have x = 0. 0 f Tesla the ( x ) (500) (0.) z = = =.5. From the Table of a stadard ormal distributio we have z = z =.645 α 0.95 or α = 0.05 ad we see p = 0.0. Therefore, we reject ad coclude that there is a positive bias i the magetic field aroud this recordig sesor. Example. (cotiued) Behavioral Learig. For this experimet if our hypotheses are H 0 : p = H A : p > k We have = 40 trials ad observe k = ad pˆ =. Based o our Cetral Limit Theorem 40 results i Lecture 7, we ca aalyze this questio usig the Gaussia approximatio to the biomial provided p > 5 ad ( p) > 5. Because we have

5 page 5: 9.07 Lecture : Hypothesis Testig p( p) = 40 = 0 > 5 we ca use the Gaussia approximatio to the biomial to test H 0. Notice that if p( p) > 5 the it must be that p > 5 ad ( p) > 5. If we take α = 0.05, our test statistic is ( pˆ p ) (40) ( ) 6.3(0.05) z = = = = 0.63 [ p ( p)] 4 z =.645, hece z< z ad we fail to reject H 0. We see that α α z.645 α c p [ p ( p)] = + α = + = What is the power of this test if the true probability of a correct respose is p A = 0.7? power = Pr(rejectig H 0 H A = 0.7) = Pr( pˆ > 0.63 p A = 0.7) ( pˆ p A ) ( cα p A) = Pr( > p A = 0.7) [ p A ( p A )] [ p A ( p A )] [ (40) ( ) = Φ [(0.8)(0.7)] (6.3)(0.09) = Φ [ ] (.36) = Φ [.58] = = Therefore, if the true propesity to respod correctly were p A = 0.7 the there was a probability of of rejectig the ull hypothesis i this case. B. Oe-Sample Test, Power ad Sample Size Calculatio Give a ull ad a oe-sided alterative hypothesis we compute the power as H 0 : µ = µ 0 H A : µ = µ A > µ 0

6 page 6: 9.07 Lecture : Hypothesis Testig because z α the it is easy to show that Power = Pr(rejectig H H 0 A is true) ( x = Pr( µ 0 ) > z α µ = µ A ) z α = Pr( x > µ 0 + µ = µ A ) = Pr( x µ z α A > µ 0 µ A + µ = µ ( x µ ) (µ 0 µ A) = Pr( A > + z α µ = µ A ) (µ µ ) 0 A (µ 0 µ A) Φ( + z α ) = Φ( z α ) = z. Similarly, if we have α I geeral for a oe-sided alterative H 0 : µ = µ 0 H A : µ = µ A < µ 0 A ) (µ A µ 0 ) = Φ(z α + ) (µ 0 µ A ) power = Φ[ + z α ] µ 0 µ A power = Φ[ + z α ] We use these formulae to derive expressios for sample size calculatios. Notice that µ 0 µ A power = Φ( + z α ) µ 0 µ A power = β = Φ( + z ) α If we apply Φ to the left ad right had sides of the equatio above we get µ A µ 0 Φ ( β ) = Φ [Φ( + z α )] µ A µ 0 z β = + z z z = β α µ A µ 0 α

7 page 7: 9.07 Lecture : Hypothesis Testig Or sice z α = z α where Δ = µ 0 µ A. we obtai the sample size formula (z β + z α ) = Example 3. (cotiued) Aalysis of MEG Sesor Bias. How may measuremets should Steve Stufflebeam make daily to be at least 80% sure that if there is a positive drift of 0. 0 f Tesla he ca detect it with α = 0.05? To aswer this questio, we apply our sample size formula with.645, z = 0.84, = f Tesla ad we obtai z = Δ (.) ( ) = (0.). 6.8 = 0.0 = 748 Therefore, i our problem studied above, we should have take aroud 750 measuremets istead of 500. Remark.4. The ratio is like the iverse of a sigal-to-oise ratio. As we wat a smaller Δ Type I error (z α ), ad/or more power icreases. Similarly, icreases with ad decreases with Δ. III. Oe-Sample Two-Sided Tests If the alterative hypothesis is two-sided the we eed to costruct a two-sided test. A. Two-Sided Tests Example 3. (cotiued) Aalysis of MEG Sesor Bias. Suppose our ull ad alterative hypotheses for this problem are respectively H 0 : µ = 0 H A : µ 0. This alterative hypothesis implies that we would reject H 0 for either a positive bias or egative bias. Uder H 0 we have x : N ( µ, ). We would therefore reject H 0 if x >> µ 0 = 0 or if x << µ 0 = 0. We take as our test statistic x ad we will reject H 0 if x >> 0. Pick α ad take α α = α + α. Sice we do ot favor µ > 0 more tha µ < 0, we take α = α =. To reject H0, we cosider

8 page 8: 9.07 Lecture : Hypothesis Testig Pr( x > c α ) = Pr( x > c α or x < cα ) ( x µ ) x µ ) ( α µ ) = Pr( 0 ) ( cα > µ 0 ( c 0 0 or < ) = Pr(z > z α or z < z α ) = Pr( z > z α ) This is a two-sided test because we reject H 0 for either very large positive or egative values of the test statistic. We reject H 0 for z > z α or equivaletly, we reject H 0 if Examples of α ad z α are x > c α = µ + z α 0 α z α Example 3. (cotiued) Aalysis of MEG Sesor Bias. We cosider H 0 : µ = 0 H A : µ 0 Suppose we pick α = 0.05, we assume =. ad we compute x = 0., the we have ( x ) (500) 0. z = = =.5. ad z α = z =.96. Because.5 <.96, we have z < z α ad hece, we reject H Example. (cotiued) Behavioral Learig. We ca perform a similar aalysis for the learig example H 0 : p 0 = H A : p This alterative implies either impaired learig or learig ad would lead us to reject if pˆ >> or pˆ <<. We have that uder the Gaussia approximatio

9 page 9: 9.07 Lecture : Hypothesis Testig ( pˆ p 0 ) z = [ p ( p ] 0 0 ) Hece, give α we reject H 0 if or equivaletly if z> z α or z< zα pˆ > c α or pˆ < c α p 0 ( p 0 ) where cα = p +[ ] 0 z. Give α = 0.0, we obtai z = z =.645 ad sice α α 0.95 k pˆ = =, we have ( pˆ p 0 ) (40) ( ) 6.3(0.05) z = = = = 0.63 [ p ( p )] 0 0 ( ) 4 Because z< z0.95 we fail to reject H0. B. Power for the Two-Sided Alterative It is straightforward to show that for the mea of a Gaussia distributio with kow variace if the ull hypothesis is H 0 : µ = µ 0 versus the two-sided alterative H A : µ µ 0 the power of the two-sided test is defied as ( x µ 0 ) ( x µ 0 ) ( x µ 0 ) Pr( > z α H A is true) = Pr( > z α or < z H α A is true) This simplifies to (µ 0 µ A ) (µ A µ 0 ) power = Φ [ z α + ] + Φ [ z α + ]. The correspodig sample size formula is where Δ = µ A µ 0. = (z α + z β ). Δ Example 3. (cotiued) Aalysis of MEG Sesor Bias. If Steve wated to worry about both positive ad egative drift, the the previous sample size calculatio becomes with z α / = z =.96,

10 page 0: 9.07 Lecture : Hypothesis Testig (.) ( ) = (0.).(.8).(7.84) = = = 949. Remark.5. Notice that (z z 0.8 ) 8. Hece, 8 SNR Remark.6. I Homework Assigmet 9 we will explore a similar formula for the biomial distributio. C. Adjustmets for the Gaussia Assumptios. Oe-Sample t- Test for Ukow. The z test allows us to test hypotheses about the mea of a Gaussia distributio uder the assumptio that the variace is kow. The t- test allows us to test the same hypotheses whe the sample size is ot large ad the variace is ukow ad must be estimated from the sample. The t- test was developed by Gossett i 908 while he was workig i the Guiess Brewery. Gossett wrote uder the pseudoym of Studet. For this reaso it is still referred to as Studet s t-test. The distributio was worked out later by R.A. Fisher. Suppose x,..., x is a radom sample from a Gaussia probability model N( µ, ) ad we wish to test the ull hypothesis H 0 : µ = µ 0 agaist the alterative H A : µ µ 0. Assume is ot kow ad is ot large, say 5 < <0. Therefore, as discussed i Lecture 8, we costruct a t- test by estimatig with a ubiased estimate, ad istead of a z statistic we costruct a t- statistic as ( x t = µ 0 ) s where s = ( ) (x i x) i= t : t, a t- distributio o degrees of freedom. Recall that we showed i the practice problems for the secod I Class Examiatio that s is a ubiased estimate of. Give α, to test H 0 : µ = µ 0 agaist H A : µ µ 0 we reject H 0 for t > t, α

11 page : 9.07 Lecture : Hypothesis Testig or equivaletly if either s s x > µ 0 + t, α or x < µ t 0, α Example. Reactio Time Measuremets. I a learig experimet, alog with the correct ad icorrect resposes, we record the reactio times which are the times it takes the aimal to execute the task. I a previous study, oce it had bee determied that a aimal had leared a task, it was foud that the average reactio time was 0 secods. O the 4 trials after the aimal leared the task by the behavioral criteria, the average reactio time was 8.5 secods. The sample stadard deviatio was.. Is this aimal s performace differet from that previously reported? We have H 0 : µ = 0.0 H A : µ 0.0 ( x µ) (4) (8.5 0) (3.74)(.75) t = = = =.6 s (.) (.) Now it follows from the Table of the t-distributio that t 3,0.975 =.6. Because t > t 3,0.975 we reject H 0.. Biomial Exact Method If p 0( p 0 ) < 5, we caot use the Gaussia approximatio to the biomial to tests hypotheses about the biomial proportio. I this case, we base the test o the exact biomial probabilities. k We have x : B (, p ) ad we observe k successes, ad we take pˆ =. The p-value depeds o whether pˆ p0 or pˆ > p 0. If pˆ p 0, the If pˆ > p, the 0 p value = Pr( k successes i trials H0 ) k j j = ( ) p ( p ) j 0 0 j=0 p value = Pr( k successes i trials H0 ) j j = ( )p ( p ) j 0 0 j= k Example.. A New Learig Experimet. Assume that we execute the learig experimet with 0 trials ad there is a probability of a correct respose by chace. Suppose k = ad 3 3 pˆ = =. We wat to test H 0 : p = agaist H A : p. We see that

12 page : 9.07 Lecture : Hypothesis Testig 40 4 p 0( p 0 ) = 0 = = 4 < We have > ad hece pˆ > p 0 ad we compute the p-value as j 0 j p 0 = Pr( x ) = ( j ) 3 3 j= = or equivaletly p = (0.086) = Therefore, we reject H 0 ad coclude that the aimal is ot performig at chace ad most likely has leared. Remark.7. A importat topic that we have ot cosidered is o-parametric tests. Each of the mai parametric tests we cosidered, i.e., the z-test ad t-test has a o-parametric aalog. It is importat to use these oparametric tests whe the sample size is small ad the Gaussia assumptio o which most of the tests are based is ot valid (Roser, 006). Remark.8. The requiremet of the Gaussia assumptio for most the stadard hypothesis testig paradigms is very limitig. For this reaso, we have spet more time i the course learig about priciples of moder statistical aalysis such as likelihood methods, the bootstrap ad other Mote Carlo methods, Bayesia methods ad, as we shall see i Lecture 6, the geeralized liear model. D. The Relatio Amog Cofidece Itervals, Hypothesis Tests ad p- Values The three statistics we have cosidered thus far for two-sided hypothesis tests ca be used to costruct 00 ( α) cofidece itervals. These are z- test (Gaussia mea ad variace kow) x ± z α z- test (Biomial proportio) pˆ ± pˆ( pˆ) z α t- test (Gaussia mea ad variace ukow) s x ± t, α

13 page 3: 9.07 Lecture : Hypothesis Testig We ca reject H 0 with α = 0.05 if the value of the parameter uder the ull hypothesis is ot i the 00%( α) cofidece iterval. I this way, a cofidece iterval provides a hypothesis test. A similar costructio of a cofidece boud ca be used to carry out a oe-sided test. Most importatly, the cofidece iterval tells us the reasoable rage for the parameter that ca be iferred from the data aalysis. Cofidece itervals report the aalysis results o the physical scale o which the problem takes place. The p- value oly tells us how likely the observed statistic would be uder H 0. I this way, the hypothesis test simply provides a mechaism for makig a decisio. Cofidece itervals are always more iformative tha p-values. Realizig this i the early 80s, the New Eglad Joural of Medicie set out a recommedatio obligig that all statistical results be reported i terms of cofidece itervals. This is ow the stadard for publicatio of research articles i that joural. Ideed, p- values aloe mea othig! Example 3. (cotiued) Aalysis of MEG Sesor Bias. To illustrate this poit, we ote that i this problem, the 95% cofidece iterval is. 0. ±.96 (500) 0. ± [0.04, 0.06] ad we reject H 0 because 0 is ot i the cofidece iterval. The p- value was < 0.05 which essetially tells us othig about the magitude of the bias i the sesor. Example. (cotiued) Behavioral Learig. I this problem, the 95% cofidece iterval is 4 pˆ ± ± ± 0.58 [0.39, 0.708]. Hece, the p- value is > We fail to reject the ull hypothesis. The cofidece iterval ot oly makes it clear that the ull hypothesis is ot rejected, it also shows what the reasoable rage of ucertaity is for the aimal s propesity to lear. Remark.9. Oe of the true values of the hypothesis-testig paradigm is sample size calculatios.

14 page 4: 9.07 Lecture : Hypothesis Testig IV. Two-Sample Tests A. Two-Sample t Test Example.3. Reactio Time Aalysis i a Learig Experimet. Suppose we have the reactio times o trial 50 of the 9 rats from the treatmet group ad 3 rats from the cotrol group we studied i Homework Assigmet 8-9. The mea reactio time for the treatmet group was 5.5 sec with a stadard deviatio of 3.5 sec ad the mea reactio time for the cotrol group was.5 sec with a stadard deviatio of 3. sec. What ca be said about the uderlyig mea differeces i reactio times betwee the two groups? Assume we have T x i : N (µt, T ) i =,..., T c x j : N(µ c, c ) j =,..., c Take H 0 : µ T = µ c H A : µ T µ c We have x = 5.5sec T s = 3.5sec T x c =.5sec s c = 3.sec Uder we have E x ) = E x ) ( ) = µ ad hece, H 0 ( T x c ( T E x c T µ c = 0 T T x : N ( µ, ) T T c c x : N ( µ, ) c c where T = 9 ad c = 3. Now uder the idepedece assumptio of the two samples T c Var( x T x c ) = Var ( x T ) +Var ( x c ) = + Hece, uder ad the assumptio that = =, H 0 T c T c

15 page 5: 9.07 Lecture : Hypothesis Testig If were kow, the we would have x T x c : N(0, ( + )). T c x T x c ( + ) T c : N(0,) ad we could base our hypothesis test o this z- statistic. Sice is ukow, let us cosider the estimate of defied by where ( T ) st + ( c )sc s = T + c T T s T = ( T ) (x j x T ) j= c c s c = ( c ) (x i x c ). i= T c Notice that if we assume that = = the ( T ) E( s T ) + ( c ) E( s c ) ( T ) + ( c ) Es ( )= = =, + + ad s is a ubiased estimate of. T c T c Give α we ca test H 0 with the followig test statistic x T xc t = s( + ) T termed the two-sample t- statistic with equal variace with = T + c degrees of freedom. We reject H 0 at level α c t > t, α Example.3 (cotiued). For this problem we have

16 page 6: 9.07 Lecture : Hypothesis Testig ad hece, ( T ) s T + (c )s s = c T + c 8(3.5) + ()(3.) = 0 8(0.56) +(9.6) = = = t = = =.9 [0 ( + )] (.88) 9 3 From the Table of the t-distributio we have t =.086. Sice t t we reject H 0 ad > 0, ,0.975 coclude that the mea reactio times of the two groups are differet. The loger average reactio time for the treatmet group suggests that learig may be impaired i that group. B. Cofidece Iterval for the True Differece i the Meas Because our test statistic follows a t- distributio, we ca costruct a 00%( α) cofidece iterval for the true differece i the measure as follows x T x c ± t, α s +. Example.3 (cotiued) Reactio Time Aalysis i a Behavioral Experimet. If we apply this formula to the data from Example.3, we obtai with α = 0.05 a 95% cofidece iterval for the true mea differece of T C 4 ±.086 [0 + ] ±.086 (.37) 4 ±.86 which is [ ]. The iterval does ot cotai zero as expected based o our hypothesis test. More importatly, we see that ot oly is it ulikely that the true mea differece is 0, but that the differece could be as small as.4 sec or as large as 6.86 sec.

17 page 7: 9.07 Lecture : Hypothesis Testig Remark.0. If we caot assume that the variaces i the two samples are equal but ukow, the we have to devise a alterative t- statistic that takes accout of the ukow ad estimated variaces. It ca be show that a appropriate t- statistic i this case is where the umber of degrees of freedom is x T xc t = s s T c + T c T sc s ( + ) T c d ' = s s T ( T ) + c ( c ) T c This statistic is the Satterthwaite approximatio to the degrees of freedom for a t- statistic with ukow ad uequal variace (Roser, 006). C. Two-Sample Test for Biomial Proportios Example. (cotiued) Behavioral Learig Experimet. Let us assume that o Day, the rat had k = correct resposes ad o Day we had k = 5 correct resposes. Day is out of 40 trials ad Day is out of 0 trials. What ca we say about the differece i performace betwee the two days? We could treat the results from Day as truth ad compare Day to Day. A more appropriate way to proceed is to treat the data from both days as if they were observed with ucertaity. For this we assume Take x j : B(, p ) j =,..., x j : B(, p ) j =..,,.. H 0 : p = p H A : p p Our test statistic ca be derived from the estimates of p ad p k pˆ = pˆ = k

18 page 8: 9.07 Lecture : Hypothesis Testig Uder H 0 ad usig the Gaussia approximatio to the biomial p( p) pˆ N( p, ) p( p) pˆ N( p, ) ad if we assume the samples are idepedet we have the approximate z- statistic ( pˆ pˆ ) z = N(0,) [ pˆ ( pˆ ) + ] where, sice p is ukow, we estimate it as pˆ defied as pˆ ˆp k k ˆ + + p = = + + Hece, give a level α we have the approximate z- statistic is pˆ pˆ z = [ pˆ ( pˆ) + ] We reject H 0 : if z > z α ad the approximate p- value is p = [ Φ ( z )]. A alterative form of z that icludes the cotiuity correctio (Lecture 8) to make the Gaussia approximatio to the biomial more accurate is defied as pˆ pˆ + z =. [ pˆ ( pˆ) + ] Example. (cotiued) Behavioral Learig Experimet. For this problem, we have or k + k pˆ = = = = pˆ =

19 page 9: 9.07 Lecture : Hypothesis Testig z = [(0.667)(0.3833) + ] = = = = [ ] (0.0773) Because z α = z =.96 we have z > z so we fail to reject the ull hypothesis of o differece i performace. Here the p- value is Similarly, the 95% cofidece iterval is p = [ Φ (.50)] = [0.0668] = or pˆ pˆ ± z [ pˆ ( pˆ) + ] 0.0 ±.96(0.33) 0.0 ± 0.6 [-0.46, 0.06]. As expected, the 95% cofidece iterval icludes zero which explais why we fail to reject H 0. The cofidece iterval suggests that there may be evidece of improved performace o the secod day relative to the first sice most of the iterval icludes egative values of the differece. The value of the cotiuity correctio is + = + = This chages z from to z =.465 ad the corrected p-value is Remark.. The two sample biomial data ca also be aalyzed as a cotigecy table Correct Icorrect Total Day k k Day k k k + k + ( k + k ) + A cotigecy table is a table cosistig of two rows cross-classified by two colums. The cotigecy table aalysis uses a test statistic based o a chi-squared distributio with oe degree of freedom ad gives the same result as the z- test. This is to be expected sice we showed i Lecture 4 that the square of a stadard Gaussia radom variable is a chi-squared radom variable with oe degree of freedom. We will study this i Homework Assigmet 9. Remark.. Use of the Gaussia approximatio to the biomial to costruct this z- test is valid provided p 5, p 5, ( p ) 5 ad ( p ) 5. This correspods to the coditio

20 page 0: 9.07 Lecture : Hypothesis Testig that the aalysis of the cotigecy table usig a chi-squared statistic is valid if the expected umber of observatios per cell is at least 5. Remark.3. This problem could easily be aalyzed usig a Bayesia aalysis i which p ad p had uiform priors. We could the use Algorithm 0. to compare the posterior desities of p ad p. Remark.4. We ca perform a likelihood aalysis ad compare the overlap i the likelihoods. We could alteratively costruct α cofidece itervals for p ad p separately usig the likelihood theory ad see if they overlap. Remark.5. All the tests we have discussed here ca be derived from the likelihood theory we preseted by the likelihood ratio procedure. A detailed discussio of this approach is beyod the scope of this course. (See DeGroot ad Schervish, 00; Rice, 007). V. Summary Hypothesis testig is a key part of classical statistics. It emphasizes procedures based primarily o the Gaussia distributio ad Gaussia approximatios. The hypothesis testig paradigm is very useful for prospective plaig of studies usig sample size formulae. Cofidece itervals are always more iformative tha p-values. Cofidece itervals report the aalysis results o the physical scale o which the problem takes place. May of the classical problems i hypothesis testig ca ow be carried out i a more iformative way usig more moder approaches such as Mote Carlo methods, bootstrappig, ad Bayesia methods. Ackowledgmets I am grateful to Julie Scott for techical assistace i preparig this lecture ad to Jim Mutch for careful proofreadig ad commets. Refereces DeGroot MH, Schervish MJ. Probability ad Statistics, 3rd editio. Bosto, MA: Addiso Wesley, 00. Rice JA. Mathematical Statistics ad Data Aalysis, 3 rd editio. Bosto, MA, 007. Roser B. Fudametals of Biostatistics, 6 th editio. Bosto, MA: Duxbury Press, 006.

21 MIT OpeCourseWare Statistics for Brai ad Cogitive Sciece Fall 06 For iformatio about citig these materials or our Terms of Use, visit:

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Lecture 6 Simple alternatives and the Neyman-Pearson lemma STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

4. Understand the key properties of maximum likelihood estimation. 6. Understand how to compute the uncertainty in maximum likelihood estimates.

4. Understand the key properties of maximum likelihood estimation. 6. Understand how to compute the uncertainty in maximum likelihood estimates. 9.07 Itroductio to Statistics for Brai ad Cogitive Scieces Emery N. Brow Lecture 9: Likelihood Methods I. Objectives 1. Review the cocept of the joit distributio of radom variables.. Uderstad the cocept

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Topic 18: Composite Hypotheses

Topic 18: Composite Hypotheses Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:

More information

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Lecture Notes 15 Hypothesis Testing (Chapter 10)

Lecture Notes 15 Hypothesis Testing (Chapter 10) 1 Itroductio Lecture Notes 15 Hypothesis Testig Chapter 10) Let X 1,..., X p θ x). Suppose we we wat to kow if θ = θ 0 or ot, where θ 0 is a specific value of θ. For example, if we are flippig a coi, we

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics 8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These

More information

Introductory statistics

Introductory statistics CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

STAT431 Review. X = n. n )

STAT431 Review. X = n. n ) STAT43 Review I. Results related to ormal distributio Expected value ad variace. (a) E(aXbY) = aex bey, Var(aXbY) = a VarX b VarY provided X ad Y are idepedet. Normal distributios: (a) Z N(, ) (b) X N(µ,

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes. Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem

More information

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3 Itroductio to Ecoometrics (3 rd Updated Editio) by James H. Stock ad Mark W. Watso Solutios to Odd- Numbered Ed- of- Chapter Exercises: Chapter 3 (This versio August 17, 014) 015 Pearso Educatio, Ic. Stock/Watso

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples. Chapter 9 & : Comparig Two Treatmets: This chapter focuses o two eperimetal desigs that are crucial to comparative studies: () idepedet samples ad () matched pair samples Idepedet Radom amples from Two

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

A statistical method to determine sample size to estimate characteristic value of soil parameters

A statistical method to determine sample size to estimate characteristic value of soil parameters A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig

More information

STATS 200: Introduction to Statistical Inference. Lecture 1: Course introduction and polling

STATS 200: Introduction to Statistical Inference. Lecture 1: Course introduction and polling STATS 200: Itroductio to Statistical Iferece Lecture 1: Course itroductio ad pollig U.S. presidetial electio projectios by state (Source: fivethirtyeight.com, 25 September 2016) Pollig Let s try to uderstad

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

MA238 Assignment 4 Solutions (part a)

MA238 Assignment 4 Solutions (part a) (i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative

More information

Computing Confidence Intervals for Sample Data

Computing Confidence Intervals for Sample Data Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

GUIDELINES ON REPRESENTATIVE SAMPLING

GUIDELINES ON REPRESENTATIVE SAMPLING DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue

More information

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if LECTURE 14 NOTES 1. Asymptotic power of tests. Defiitio 1.1. A sequece of -level tests {ϕ x)} is cosistet if β θ) := E θ [ ϕ x) ] 1 as, for ay θ Θ 1. Just like cosistecy of a sequece of estimators, Defiitio

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Notes on Hypothesis Testing, Type I and Type II Errors

Notes on Hypothesis Testing, Type I and Type II Errors Joatha Hore PA 818 Fall 6 Notes o Hypothesis Testig, Type I ad Type II Errors Part 1. Hypothesis Testig Suppose that a medical firm develops a ew medicie that it claims will lead to a higher mea cure rate.

More information

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators.

Open book and notes. 120 minutes. Cover page and six pages of exam. No calculators. IE 330 Seat # Ope book ad otes 120 miutes Cover page ad six pages of exam No calculators Score Fial Exam (example) Schmeiser Ope book ad otes No calculator 120 miutes 1 True or false (for each, 2 poits

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

Economics Spring 2015

Economics Spring 2015 1 Ecoomics 400 -- Sprig 015 /17/015 pp. 30-38; Ch. 7.1.4-7. New Stata Assigmet ad ew MyStatlab assigmet, both due Feb 4th Midterm Exam Thursday Feb 6th, Chapters 1-7 of Groeber text ad all relevat lectures

More information

Statistical Inference

Statistical Inference Statistical Iferece Statistical Iferece Statistical iferece provides us with the process of testig hypotheses uder ivestigatio from data. Procedures of statistical iferece are give for i parameter estimates

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Biostatistics for Med Students. Lecture 2

Biostatistics for Med Students. Lecture 2 Biostatistics for Med Studets Lecture 2 Joh J. Che, Ph.D. Professor & Director of Biostatistics Core UH JABSOM JABSOM MD7 February 22, 2017 Lecture Objectives To uderstad basic research desig priciples

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Chapter 13: Tests of Hypothesis Section 13.1 Introduction Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed

More information

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 7: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review How ca we set a cofidece iterval o a proportio? 2 Review How ca we set a cofidece iterval

More information

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give

More information

Chapter 5: Hypothesis testing

Chapter 5: Hypothesis testing Slide 5. Chapter 5: Hypothesis testig Hypothesis testig is about makig decisios Is a hypothesis true or false? Are wome paid less, o average, tha me? Barrow, Statistics for Ecoomics, Accoutig ad Busiess

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

4. Partial Sums and the Central Limit Theorem

4. Partial Sums and the Central Limit Theorem 1 of 10 7/16/2009 6:05 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 4. Partial Sums ad the Cetral Limit Theorem The cetral limit theorem ad the law of large umbers are the two fudametal theorems

More information

5. Likelihood Ratio Tests

5. Likelihood Ratio Tests 1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion 1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Chapter 13, Part A Analysis of Variance and Experimental Design

Chapter 13, Part A Analysis of Variance and Experimental Design Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

Statistical Intervals for a Single Sample

Statistical Intervals for a Single Sample 3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea

More information

Chapter two: Hypothesis testing

Chapter two: Hypothesis testing : Hypothesis testig - Some basic cocepts: - Data: The raw material of statistics is data. For our purposes we may defie data as umbers. The two kids of umbers that we use i statistics are umbers that result

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

University of California, Los Angeles Department of Statistics. Hypothesis testing

University of California, Los Angeles Department of Statistics. Hypothesis testing Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Elemets of a hypothesis test: Hypothesis testig Istructor: Nicolas Christou 1. Null hypothesis, H 0 (claim about µ, p, σ 2, µ

More information

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed

More information