CONFIDENCE INTERVALS

Size: px
Start display at page:

Download "CONFIDENCE INTERVALS"

Transcription

1 CONFIDENCE INTERVALS CONFIDENCE INTERVALS Documets prepared for use i course B , New York Uiversity, Ster School of Busiess The otio of statistical iferece page 3 This sectio describes the tasks of statistical iferece. Simple estimatio is oe form of iferece, ad cofidece itervals are aother. The derivatio of the cofidece iterval page 5 This shows how we get the iterval for the populatio mea, assumig a ormal populatio with kow stadard deviatio. This situatio is ot realistic, but it does a ice job of layig out the algebra. Discussio of cofidece itervals ad examples page 7 This gives some basic backgroud ad the uses illustratios of cofidece itervals for a ormal populatio mea ad for a biomial proportio. Some examples page 13 Here are illustratios of itervals for a ormal populatio mea ad for a biomial proportio. Cofidece itervals obtaied through Miitab page 14 Miitab ca prepare a cofidece iterval for ay colum of a worksheet (spreadsheet). Miitab also has a special provisio for computig cofidece itervals directly from x ad s or, i the biomial case, from p. More details o biomial cofidece itervals page 16 revisio date 18 NOV 005 Gary Simo, 005 Cover photo: IBM 79 tape drive, Computer Museum, Moutai View, Califoria. 1

2 CONFIDENCE INTERVALS

3 THE NOTION OF STATISTICAL INFERENCE A statistical iferece is a quatifiable statemet about either a populatio parameter or a future radom variable. There are may varieties of statistical iferece, but we will focus o just four of them: parameter estimatio, cofidece itervals, hypothesis tests, ad predictios. Parameter estimatio is coceptually the simplest. Estimatio is doe by givig a sigle umber which represets a guess at a ukow populatio parameter. If X 1, X,, X is a sample of values from a populatio with ukow mea µ, the we might cosider usig X as a estimate of µ. We would write µ = X. This is ot the oly estimate of µ, but it makes a lot of sese. A cofidece iterval is a iterval which has a specified probability of cotaiig a ukow populatio parameter. If X 1, X,, X is a sample of values from a populatio which is assumed to be ormal ad which has a ukow mea µ, the a 1 - α cofidece iterval for µ s is X ± t α/; 1. Here t α/;-1 is a poit from the t table. Oce the data leads to actual umbers, you ll make a statemet of the form I m 95% cofidet that the value of µ lies betwee ad A hypothesis test is a yes-o decisio about a ukow populatio parameter. There is cosiderable formalism, itese otatio, ad jargo associated with hypothesis testig. If X 1, X,, X is a sample of values from a populatio which is assumed to be ormal ad which has a ukow mea µ, we might cosider the ull hypothesis H 0 : µ = µ 0 versus alterative H 1 : µ µ 0. The symbol µ 0 is a specified compariso value, ad it will be a umber i ay applicatio. Based o data, we will decide either to accept H 0 or to reject H 0. We work with a level of sigificace (usually oted as α ad early always 0.05) such that the probability of rejectig H 0 whe it is really true is limited to the level of sigificace. I the situatio illustrated here, suppose that = 4, H 0 : µ = 310, H 1 : µ 310, ad α = The H 0 will be rejected if ad oly if t t 0.05;3 =.069. The X 310 symbol t refers to the t statistic, ad t =. s 3

4 THE NOTION OF STATISTICAL INFERENCE Predictios are guesses about values of future radom variables. We ca subdivide this otio ito poit predictios ad iterval predictios, but poit predictios are usually obvious. If X 1, X,, X is a sample of values from a populatio which is assumed to be ormal ad which has a ukow mea µ, we might wish to predict the ext value X +1. Implicit i this discussio is that we have observed X 1 through X, but we have ot yet observed X +1. The poit predictio is certaily X ; i fact, we would write X +1 = X. The 1 - α predictio iterval works out to be X ± t /; 1 s α

5 THE DERIVATION OF THE CONFIDENCE INTERVAL The ormal table gives us the fact that P[ < Z < 1.96 ] = With a sample of values from a populatio with mea µ ad stadard deviatio σ, the Cetral Limit theorem gives us the result that Z = X µ = X µ is approximately σ σ ormally distributed with mea 0 ad with stadard deviatio 1. If the populatio is assumed to be exactly ormal to start with, the X µ σ automatically ormally distributed. (This is ot a use of the Cetral Limit theorem.) is If oe does ot make the assumptio that the populatio is exactly ormal to start with, the X µ is approximately ormal, provided is large eough. (This σ is precisely the Cetral Limit theorem.) The official stadard is that should be at least 30. The result works well for as small as 10, provided that oe is ot workig with probabilities like or which are too close to zero or oe. Start from P[ < Z < 1.96 ] = 0.95 ad the substitute for Z the expressio X µ σ. This will give us X µ P 1.96 < < 1.96 σ = 0.95 We ca rewrite this as σ σ P 1.96 < X µ < 1.96 = 0.95 Now subtract X from all items to get σ σ P X 1.96 < µ < X = 0.95 Multiply by -1 (which requires reversig iequality directio) to obtai σ σ P X > µ > X 1.96 =

6 THE DERIVATION OF THE CONFIDENCE INTERVAL This is i the form large > medium > small. Rewrite as small < medium < large to get σ σ P X 1.96 < µ < X = 0.95 This is ow a probability statemet i which µ is i the middle! If we kew σ ad if we observe X, the this is a special kid of probability statemet about µ. We call this a cofidece iterval. We are 95% cofidet that µ is i this iterval. This is a very useful statistical iferetial statemet. We ofte say simply, we are 95% cofidet that σ µ is i the iterval X ± Most cofidece itervals are used with 95% cofidece. To make thigs more geeral, we use the z γ otatio. Specifically, z γ is the upper gamma poit from the stadard ormal distributio. This meas that z γ has γ of the probability to its right; we ca write this as P[ Z > z γ ] = γ. The value 1.96 is the upper.5% poit, i that we would write P[ Z > z 0.05 ] = The most commoly ecoutered symbol is z α/. Here are some simple examples. α α/ z α/ Whe we wated 95% cofidece, we put.5% of the probability i each of the upper ad lower tails. I geeral, if we wat 1 - α cofidece, we put α of probability i each of the upper ad lower tails. The statemet that creates the 1 - α cofidece iterval is the σ P X z < µ < X + z α/ α/ σ = 1 - α The 1 - α cofidece iterval is the X ± z σ α/. The statemet is hard to use i practice, because it is very rare to be able to claim kowledge of σ but o kowledge of µ. There is a simple ext step to cover this shortcomig. 6

7 DISCUSSION OF CONFIDENCE INTERVALS AND EXAMPLES A cofidece iterval is a statemet used to trap ukow populatio parameters. A example... We are 95% cofidet that the September mea sales per store is betwee 46 thousad dollars ad 8 thousad dollars. The speaker is tryig to estimate µ, the true-but-ukow populatio mea sales per store. Presumably he or she has used a sample to make the statemet. The statemet is either true or false (ad we do t kow which), but it would be reasoable to give 95-to-5 bettig odds that the statemet is true. The simplest approximate cofidece iterval statemet is this: We are approximately 95% cofidet that the ukow parameter is i the iterval [estimate] ± [stadard error]. This is approximate for a umber of reasos. First of all, we are tryig to make a all-purpose statemet that really might ot apply perfectly to everythig. Also, the should really be 1.96, which has the property that P[ Z 1.96 ] = 0.95 for a stadard ormal radom variable Z. Oe ca refie the statistical theory a bit, tuig up ay or all of choice for the estimate techique for obtaiig the stadard error refiig the multiplier Let s give ext what may be the most-commoly used cofidece iterval. Suppose that you have a sample X 1, X, X 3, X 4,, X. As usual, we will use the upper case letters whe we thik of the X i s as radom variables ad x 1, x, x 3, x 4,, x whe we thik of these as observed umerical values. I practice, this distictio will be hard to eforce. The procedure which we will use depeds o the assumptio that the X i values costitute a sample from a populatio which follows the ormal distributio. As usual, statig a assumptio does ot make it true. Noetheless, it is importat to state a assumptio that you are goig to exploit, eve if you do ot completely believe the assumptio. It is importat to ivoke the coditios o which the work depeds. The procedure which follows, by the way, happes to work rather well eve if the sampled populatio does ot follow the ormal distributio. Moreover, whe the sample size exceeds 30, the ormal distributio assumptio is ot critical. Let s use µ for the mea of the populatio ad use σ for the stadard deviatio of the populatio. I all realistic problems, the values of µ ad σ are both ukow. The statistical work will be based o the sample mea X ad the sample stadard deviatio s. 7

8 DISCUSSION OF CONFIDENCE INTERVALS AND EXAMPLES Clearly X, the sample mea, will be used as the estimate for µ. Sice SD( X ) = σ, s we ll use SE( X ) = for the stadard error. As a cosequece of the fact that X µ s = X µ follows the t distributio with - 1 degrees of freedom, we ca give the s 95% cofidece iterval for µ as X ± t s 0.05; 1. The value t 0.05;-1 refers to the poit from the t-table. The t-table is Table 4 of Hildebrad, Ott, ad Gray, ad it appears o page 79 ad o the iside back cover. The 0.05 idicates the colum to be used. The - 1 refers to the degrees of freedom. Thus, if = 17, the - 1 = 16 ad we look up t 0.05;16 =.10. The 0.05 i the subscript refers to probability excluded to the right. If T -1 refers to the t radom variable with - 1 degrees of freedom, the the mathematical descriptio of this is P[ T -1 > t 0.05;-1 ] = Said aother way, t 0.05;-1 is the upper.5 percet poit for the t distributio with - 1 degrees of freedom. We use the 0.05 for 95% cofidece itervals because we exclude the most extreme 5% of the distributio, meaig.5% i each tail. By established custom, the t table gives probabilities for oe tail. The use of 95% cofidece itervals is most commo, but there are occasios whe we eed 90% or 99% itervals. For a 99% iterval, oe would use t 0.005;-1. I geeral, if oe seeks cofidece level 1 - α, the correspodig poit from the t table is t α/; -1. Some people like to express the cofidece as a percet, rather tha as a decimal. That is, they d rather talk about a 100(1 - α)% cofidece iterval tha a 1 - α cofidece iterval. This is a exceedigly petty issue. Hildebrad, Ott, ad Gray use the symbol t α/ where we have used t α/; -1. Apparetly they believe that the degrees of freedom umber is obvious i most cases. They are correct, but we ll use the more detailed subscript ayhow. Sice 95% cofidece itervals are the most commo, we usually use the 0.05 colum i Hildebrad, Ott, ad Gray s Table 4. It should be oted that oce gets to be as big as about 15, the the values i this colum are rather close to. This reflects the empirical rule that most quatities are withi two stadard deviatios of their expected values with probability about 95%. 8

9 DISCUSSION OF CONFIDENCE INTERVALS AND EXAMPLES EXAMPLE: A sample of 0 five-poud potato bags, sampled from a huge shipmet of such bags, had a mea weight of 5.1 pouds, with a stadard deviatio of 0.14 poud. Give a 95% cofidece iterval for the true mea weight of all the bags i the shipmet. SOLUTION: Assume that the distributio of bag weights is ormal. The the iterval is s X ± t0.05 ; 1 which is ±. 0 ad umerically this becomes 5.1 ± This ca also be give as (5.054, 5.186). The value.093 is foud i the row for 19 degrees of freedom i the t table, usig the colum for two-sided 95% itervals (the oe headed 0.05). I describig a series of steps leadig to a cofidece iterval, try to avoid usig the equals sig. That is, do NOT write somethig like ±. = 5.1 ± = (5.054, 5.186). 0 This iterval is foud as defiitio 7.9 o page 78 of Hildebrad, Ott, ad Gray. This box otes the hierarchy of assumptios that are ivolved. ad if you are willig to assume that the sample comes from a ormal populatio ad if you are ot willig to assume that the sample comes from a ormal populatio If is small ( < 30) use defiitio 7.9 o page 78 you caot give a cofidece iterval of this variety If is large ( 30) use defiitio 7.9 o page 78 use defiitio 7.9 o page 78 Cosider ow a biomial problem. Suppose that X is the umber of successes i idepedet trials, each with success probability p. Let p = X be the sample proportio of successes. (May other authors use π for the success probability ad use π = X for the sample proportio.) The stadard error of p, meaig the estimate of the stadard 9

10 DISCUSSION OF CONFIDENCE INTERVALS AND EXAMPLES deviatio of p is SE( p ) = ) iterval for π ca be give the as. The covetioal, or Wald, 95% cofidece p ± 1.96 ) If you wished to use somethig other that 95% as the cofidece value, say you wat the cofidece level to be 1 - α, the iterval would be give as p ± z α / ) The value z α/ represets the upper oe-sided α poit from the ormal distributio. For istace, if 1 - α = 90%, the α = 10%, α = 5%, ad we use z0.05 = To make the correspodece to the 95% iterval, thik of 1 - α = 95%, α = 5%, α =.5% ad we use z0.05 = This is the most commo form of the iterval, but it is ot recommeded. The forms below are better. The otio of better is that they are more likely to come close to the target 1 - α coverage. It would be better to use the slightly loger cotiuity-corrected iterval p ± 1.96 ( ) The use of the fractio 1 is called a cotiuity correctio, the cosequece of usig a (cotiuous) ormal distributio approximatio o a (discrete) biomial radom variable. Alas, most texts do ot use this correctio. The iterval is certaily simpler without this correctio, ad perhaps that is the reaso that most 1 texts avoid it. However, omittig this cheats o the cofidece, i that a claimed 95% iterval might really oly be a 9% iterval. The best (simple) optio, ad the oe recommeded for most cases, is the Agresti-Coull x + iterval. It s based o p =. The correspodig 1 - α cofidece iterval is the

11 DISCUSSION OF CONFIDENCE INTERVALS AND EXAMPLES p ± z α / p p ) + 4 This is computatioally idetical to ivetig four additioal trials, of which two are success. EXAMPLE: I a sample of 00 purchasers of Megabrew Beer there were 84 who purchased the beer specifically for cosumptio durig televised sportig evets. Give a 95% cofidece iterval for the ukow populatio proportio. SOLUTION: Note that p = ( ) = 0.4. The stadard error associated with this estimate is SE( p) = = = The 00 covetioal 95% cofidece iterval is the 0.4 ± or 0.4 ± This ca be writte as (0.3516, ). This is urealistically short. Miitab will produce this iterval. Use Stat Basic Statistics 1 Proportio... Provide the values 00 ad 84 i the Summarized Data area. Click o Optios ad the check off Use test ad iterval based o ormal distributio. If you ucheck Use test ad iterval based o ormal distributio you will get a elaborate calculatio based o the exact distributio. The method is idetified as Clopper-Pearso. Details are i Distributios i Statistics: Discrete Distributios, by Johso ad Kotz, pages The iterval foud by Miitab will be (0.3507, ). L The cotiuity-corrected form is 0.4 ± NM or 0.4 ± , which may be writte as ( , ). O QP The Agresti-Coull form based o p begis by otig that p = The the 95% iterval is =

12 DISCUSSION OF CONFIDENCE INTERVALS AND EXAMPLES ± or ± This is (0.3538, ). This is the form that we d recommed. EXAMPLE: A examiatio of 5 fiished televisio sets resulted i 18 beig classified as OK, meaig ready for shipmet to stores. Give a 95% estimate for the correspodig populatio fractio. SOLUTION: First ote that p = 18 = 0.7. With this small, we should really use the 5 iterval based o p. We have p = , ad the 95% iterval is ± or ± This is (0.50, 0.859). This is the preferred solutio. By way of compariso, the covetioal iterval is 0.7 ± which is 0.7 ± , meaig (0.5436, ). The cotiuity-corrected iterval is L NM 0.7 ± or 0.7 ± , ad this is (0.536, ). The exact distributio versio give by Miitab is (0.5061, 0.879). O Q P Oe-sided cofidece itervals are occasioally used i accoutig frameworks i which oe eeds a oe-sided statemet about a value estimated by samplig. Examples of such situatios iclude auditig eviromets i which you eed statemets like We are 95% cofidet that the percetage of ivoices i error is at most 8.0%. 1

13 SOME EXAMPLES EXAMPLE: A sample of 8 subjects from a cosumer pael reported weekly orage juice cosumptio. These values had a average of 0.4 oz with a stadard deviatio of 1. oz. Give a 95% cofidece iterval for the populatio mea. SOLUTION: To get started...be sure that you ca idetify µ, σ, x, s,. Now assume that the populatio is at least approximately ormal. ASIDE: We kow that the populatio ca t be ormal. The sample mea 0.4 is oly 1.67 estimated stadard deviatios away from zero. There are o data values two stadard deviatios below the mea! With a sample size of 8, we could probably justify use of the Cetral Limit theorem, though 8 is just a bit short of the recommeded 30. The iterval is 0.4 ±.05 1., which is 0.4 ± We re 95% cofidet that the 8 value of µ is i the iterval 0.4 ± 4.73, which is (15.67, 5.13). The value.05 is t 0.05;-1, the oe-sided.5% poit (two-sided 5% poit) from the t distributio with - 1 = 7 degrees of freedom. EXAMPLE: Paelists were asked if they added a liquid bleach to their washig. Of the 8, there were 17 who used liquid bleach. Give a 95% cofidece iterval for the populatio proportio. SOLUTION: Before startig...what are p, p, σ,? The Agresti-Coull p versio is recommeded. Note that p = p p ) The 95% iterval is p ± zα /, which i this case is + 4 = 19 3 = ± This is ± This ca be reasoably give as 0.59 ± 0.17, or (0.4, 0.76). This is much more likely to hit the 95% target cofidece. Just for compariso, the iterval based o p ± is about (0.4, 0.79). z α / ) is ± This 13

14 CONFIDENCE INTERVALS OBTAINED THROUGH MINITAB oooooooooooooooooooooooooooooooooooo The program Miitab ca do the computatioal work of obtaiig cofidece itervals. Suppose that i a weight loss study, the subjects iitial percets of body fat were oted. Suppose that we d like a 95% cofidece iterval for the mea of the populatio from which these subjects are claimed to represet a sample. I Miitab, just do Stat Basic Statistics 1-Sample t. If we have = 80, x = , ad s = 5.676, we ca use the Summarized data area to eter the iput. The default cofidece is 95%, but you ca chage it if you wat. The output looks like this: Oe-Sample T N Mea StDev SE Mea 95% CI ( , ) s The figure represets, as may be easily checked. The 95% cofidece iterval is give as ( , ). This meas that we re 95% sure that the ukow populatio mea µ for the body fat percets is i this iterval. (It should be oted that these subjects were specially recruited by virtue of beig overweight.) If the data were give i a colum of a Miitab worksheet, the that colum would be amed (istead of usig the Summarized data area). You ca also get Miitab to give a cofidece iterval for the differece betwee two meas. I this case, there are two subject groups, idetified as 1 ad accordig to the treatmet regime which they are give. Let s give a 95% cofidece iterval for µ 1 - µ, where the µ s represet the populatio mea drops i body fat over four weeks. For this situatio, give Stat Basic Statistics -Sample t. Geerally you will mark the radio butto for Samples i oe colum. The for Samples: idicate the colum umber (or ame) for the variable to be aalyzed. (The variable ame is drop4 for this example.) Next to Subscripts: give the colum umber (or ame) for the variable which idetifies the groups. (I this example, the actual ame Group is used.) Miitab also allows for the possibility that your two samples appear i two differet colums. For most applicatios, this teds to be a very icoveiet data layout, ad you should probably avoid it. There is a box Assume equal variaces which is uchecked as a default. You should however be very willig to check this box. (More about this below.) The Miitab output is this: o 14

15 CONFIDENCE INTERVALS OBTAINED THROUGH MINITAB oooooooooooooooooooooooooooooooooooo Two Sample T-Test ad Cofidece Iterval Two sample T for drop4 Group N 33 Mea 0.5 StDev 4.08 SE Mea % CI for mu () - mu (1): ( -.13, 1.65) T-Test mu () = mu (1) (vs ot =): T= -0.6 P=0.80 DF= 6 Both use Pooled StDev = 3.78 This listig first shows iformatio idividually for the two groups. (There are fewer tha 80 subjects because some did ot make the four-week evaluatio.) You might be amused to kow that the value 0.5 does ot represet 5% i this study; it represets 0.5%. The subjects did ot lose a lot of weight. We have the 95% cofidece iterval for µ - µ 1 as (-.13, 1.65). Of course, the iterval for µ 1 - µ would be (-1.65,.13). The fact that the iterval icludes zero should covice you that the two groups do ot materially differ from each other. You might woder what would have happeed had you ot check off the box Assume equal variaces. Here is that output: Two Sample T-Test ad Cofidece Iterval Two sample T for drop4 Group N Mea StDev SE Mea % CI for mu () - mu (1): ( -.1, 1.64) T-Test mu () = mu (1) (vs ot =): T= -0.6 P=0.80 DF= 61 The cofidece iterval is ow give as (-.1, 1.64), obviously ot very differet. The first of these rus, the oe with the iterval (-.13, 1.65), assumed that the two groups were samples from ormal populatios with the same stadard deviatio σ. The estimate of σ is called s p ad here its value is Whe the stadard deviatio is assumed equal for the two groups, the iterval is give as x x 1 ± t s α /; 1+ p Whe the Assume equal variaces box is ot checked, the cofidece iterval is s1 s x x 1 ± z α/ + 1 For most situatios, the two versios of the cofidece iterval are very close. o 15

16 MORE DETAILS ON BINOMIAL CONFIDENCE INTERVALS T T T T T T T T T T T T T T T T T T T T T T T T T T T T T Cosider a biomial experimet, resultig i X evets i trails. Let x be the observed value of the radom variable X. We seek here a 1 - α cofidece iterval for the ukow populatio parameter p. We use ˆp = x as the sample proportio. We have E ˆp = p ad SD( ˆp ) = p(1 p). Sice ˆp is the obvious estimate of p, we have SE( ˆp ) = ) The importat cocept is that sample quatities acquire a probability law of their ow. The stadard error of the mea is critical here. The covetioal (Wald) cofidece iterval for the biomial proportio is ˆp ± z α / ) [Wald; ot recommeded] This covetioal iterval is give i most textbooks, but it is uacceptably overreachig. That is, its coverage probability teds to be less that the claimed 1 - α. A repair that you ll sometimes see replaces z α/ with t α/;-1, sice t α/;-1 > z α/ gives a slightly loger iterval ad thus gets closer to the claimed 1 - α. This is misguided because we have lack the statistical theory to use the t distributio. A improved aswer: ˆp ± z α / ( ) [cotiuity-corrected] This uses the biomial-to-ormal cotiuity correctio. This is a little wider that the covetioal iterval, ad it is more hoest i terms of the cofidece. This procedure is OK, but we ca do better. The itervals oted above ca have some aoyig problems. If x = 0, the Wald iterval will be 0 ± 0. If x =, the Wald iterval will be 1 ± 0. These do ot make sese. Either of these itervals ca sometimes have left eds below 0 or right eds above 1. There is a strategy more precise tha usig this cotiuity correctio. The cofidece iterval is based o the approximatio that p p p) T 16

17 MORE DETAILS ON BINOMIAL CONFIDENCE INTERVALS T T T T T T T T T T T T T T T T T T T T T T T T T T T T T is approximately ormally distributed. This gives the approximate probability statemet p P z < < z p p) α / α / = 1 - α It s importat to realize that ˆp represets the (radom) quatity observed i the data, while p deotes the ukow parameter value. I the covetioal derivatio of the biomial cofidece iterval, the ukow deomiator p ( 1 p) sample estimate ) is replaced by the. This time we ll do somethig differet. Sice the statemet { a < x < a } is equivalet to { x < a }, the iterval above ca be recast as p P < z p p) α / = 1 - α A little rearragemet gives ( p) ) P < zα p p / = 1 - α Oe more step provides ( ˆ ˆ ) α / ( P p pp+ p < z p 1 p = 1 - α We ll collect the terms accordig to powers of p. This gets us to ) ( ) ( ˆ α/ α/) P z p p z p < 0 = 1 - α The expressio i [ ] is a quadratic iequality i p. The correspodig equality ( ) ( ˆ α/ α/) z p p z p = [roots give score iterval] has two roots, call them p lo ad p hi. The iequality holds betwee these roots, ad we would the give the 1 - α cofidece iterval as (p lo, p hi ) [score iterval] I practice, we ca collect umeric values for, iterval umerically. z α /, ad ˆp ad the solve the score T 17

18 MORE DETAILS ON BINOMIAL CONFIDENCE INTERVALS T T T T T T T T T T T T T T T T T T T T T T T T T T T T T If you wat to see the score iterval directly, it s this: 1 z α / z + + α/ + z α/ ± z ( ˆ) 1 p 1 z + α / α / + zα/ 4 + zα/ The ceter of this iterval, the expressio before the ±, is a weighted average of p ad 1. This ceter ca also be writte as 1 x + zα /, which looks like + zα / umber of successes 1 after creatig / umber of trials z 1 α fake successes ad z α / fake failures. This leads to the Agresti-Coull iterval, oted below. The half-width of the iterval, the expressio after the ±, is a slight adjustmet from z α / ), the half-width of the Wald iterval. You will also see a style based o creatig four artificial observatios, two successes (1s) ad two failures (0s). Let p = x The use the covetioal iterval based o p ad the sample size + 4. The iterval is p ± z α / p p ) + 4 [Agresti-Coull; recommeded] This is the Agresti-Coull iterval. If you use cofidece 95% i the score iterval above, you ll use z α/ = z 0.05 = 1.96 ad this produces almost exactly the Agresti-Coull iterval. There are helpful discussios of these itervals i Aalyzig Categorical Data, by Jeffrey Simooff, Spriger Publicatios, 003. See especially pages 57 ad Here s a example. The umeric results are summarized i the chart at the ed of this sectio. Suppose that we have a sample of = 400 ad that we observe x = 190 successes. This leads immediately to ˆp = = T 18

19 MORE DETAILS ON BINOMIAL CONFIDENCE INTERVALS T T T T T T T T T T T T T T T T T T T T T T T T T T T T T The covetioal (Wald) 95% cofidece iterval is ˆp ± z 0.05 ) ad this computes to ± 1.96, or about ± This is to The 1 cotiuity-corrected form is ± , or about ± This is to We could also use the Agresti-Coull p form. Note that p = The compute = p ± z α / p p ) This is ± 1.96, which is about ± This iterval is to We ca also give the score iterval based o the quadratic iequality: ( ) ( ˆ α/ α/) + z p p+ z p + = 0 Usig z α/ = 1.96, = 400, ˆp = 0.475, we get this as p p = 0 The roots of this quadratic are ± These are about ad ; thus to is the iterval. T 19

20 MORE DETAILS ON BINOMIAL CONFIDENCE INTERVALS T T T T T T T T T T T T T T T T T T T T T T T T T T T T T What does Miitab do for this problem? Use Stat Basic Statistics 1 Proportio. If the data appear i a worksheet colum, just eter the colum ame or umber. If you have just x ad, you ca use the Summarized data box. Select Optios, ad click the radio butto for Use test ad iterval based o ormal approximatio. This will produce exactly the covetioal cofidece iterval. For these data, the Miitab versio usig the ormal approximatio gives the iterval to If you ucheck the ormal approximatio butto, you get the Clopper-Pearso iterval based o the exact biomial distributio. This souds like a good idea, but it s a procedure also fraught with cotroversy. For these data, it produces the iterval to These data were x = 190, = 400. At p = , we fid exactly P[ X 190 ] = At p = , we fid exactly P[ X 190 ] = Cosider the oe-sided hypothesis test problem H 0 : p = p 0 versus H 1 : p > p 0, ad suppose that sigificace level 1 α = 0.05 is used. With = 400 ad x = 190, the use of a large value for p 0 would lead to acceptig H 0. The smallest value of p 0 at which H 0 could be accepted is , the lower ed of the iterval. As for the other ed, cosider the oe-sided hypothesis test problem H 0 : p = p 0 versus H 1 : p < p 0, ad suppose agai that sigificace level 1 α = 0.05 is used. With = 400 ad x = 190, the use of a small value for p 0 would lead to acceptig H 0. The largest value of p 0 at which H 0 could be accepted is , the upper ed of the iterval. Thus, the Clopper-Pearso method for a 1 - α cofidece iterval gives the ed poits which are the limits for p 0 at which separate oe-sided ull hypotheses (each at level 1 α) would be accepted. Miitab uses a iterestig modificatio of the Clopper-Pearso method if x = 0 or x =. Suppose that we wat a 95% iterval for p, ad the data give us = 0 ad x = 0. The lower ed of the iterval should certaily be Cosider the hypothesis testig problem at level 1 α = 0.05 of H 0: p = p 0 versus H 1 : p > p 0 i which a very, very small value of p 0 (say ) appears. There is o way to take advatage of a sigificace level of 0.05, sice for obvious rejectio rule { X 1 } we have P[ X 1 ] = 1 P[ X = 0 ] = 1 (1 p 0 ) j = 1 ( p0 ) = 1 j= 0 j 1 p0 + p0 p T 0

21 MORE DETAILS ON BINOMIAL CONFIDENCE INTERVALS T T T T T T T T T T T T T T T T T T T T T T T T T T T T T 0 p 0. This uses the biomial theorem ad the igores miuscule terms ivolvig p, p, ad so o. Miitab the ivokes 0.00 as the lower ed of the iterval, ad uses the etire α = 0.05 o the upper ed. The iterval is the give as ( , ), ad the correspodig probability statemet is this: At p = , we fid exactly P[ X 0 ] = P[ X = 0 ] = Now, let s check this agai for a similar result, obtaied with a smaller sample size. Suppose that we have a sample of = 40 ad that we observe x = 19 successes. This leads to ˆp = = This is exactly the same ˆp that we had i the larger problem. The covetioal 95% cofidece iterval is ˆp ± z 0.05 ) ad this computes to ± 1.96, or about ± This is to The cotiuity-corrected form is ± , or about ± This is to We could also use the Agresti-Coull p form. Note that p = The we compute p ± z α / p p ) + 4. This is = ± 1.96, which is about ± This 44 iterval is to T 1

22 MORE DETAILS ON BINOMIAL CONFIDENCE INTERVALS T T T T T T T T T T T T T T T T T T T T T T T T T T T T T We could also give the score iterval based o the quadratic iequality. Usig z α/ = 1.96, = 40, ˆp = 0.475, we get this as p p = 0. The roots are about ad ; thus to is the iterval. The Miitab versio, usig the ormal approximatio is to Without the ormal approximatio it is to Here is a summary table: Method = 400, x = 190 = 40, x = 19 Low Ed High Ed Low Ed High Ed Covetioal (Wald) Miitab with ormal approximatio correctio Agresti-Coull p method Score iterval Miitab without ormal approximatio For the = 400 problem with ˆp = 0.475, the differeces amog the methods are trivial. For the much smaller problem with = 40 ad ˆp = 0.475, the differeces are material. Please avoid the covetioal ad Miitab with ormal approximatio methods. T

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

STAT-UB.0103 NOTES for Wednesday 2012.APR.25. Here s a rehash on the p-value notion:

STAT-UB.0103 NOTES for Wednesday 2012.APR.25. Here s a rehash on the p-value notion: STAT-UB.3 NOTES for Wedesday 22.APR.25 Here s a rehash o the -value otio: The -value is the smallest α at which H would have bee rejected, with these data. The -value is a measure of SHOCK i the data.

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Chapter 5: Hypothesis testing

Chapter 5: Hypothesis testing Slide 5. Chapter 5: Hypothesis testig Hypothesis testig is about makig decisios Is a hypothesis true or false? Are wome paid less, o average, tha me? Barrow, Statistics for Ecoomics, Accoutig ad Busiess

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics 8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

STAT431 Review. X = n. n )

STAT431 Review. X = n. n ) STAT43 Review I. Results related to ormal distributio Expected value ad variace. (a) E(aXbY) = aex bey, Var(aXbY) = a VarX b VarY provided X ad Y are idepedet. Normal distributios: (a) Z N(, ) (b) X N(µ,

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700 Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible

More information

Homework 5 Solutions

Homework 5 Solutions Homework 5 Solutios p329 # 12 No. To estimate the chace you eed the expected value ad stadard error. To do get the expected value you eed the average of the box ad to get the stadard error you eed the

More information

Sampling Distributions, Z-Tests, Power

Sampling Distributions, Z-Tests, Power Samplig Distributios, Z-Tests, Power We draw ifereces about populatio parameters from sample statistics Sample proportio approximates populatio proportio Sample mea approximates populatio mea Sample variace

More information

Some Properties of the Exact and Score Methods for Binomial Proportion and Sample Size Calculation

Some Properties of the Exact and Score Methods for Binomial Proportion and Sample Size Calculation Some Properties of the Exact ad Score Methods for Biomial Proportio ad Sample Size Calculatio K. KRISHNAMOORTHY AND JIE PENG Departmet of Mathematics, Uiversity of Louisiaa at Lafayette Lafayette, LA 70504-1010,

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

Confidence Intervals for the Population Proportion p

Confidence Intervals for the Population Proportion p Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9 Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I

More information

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M. MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests

More information

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

Stat 200 -Testing Summary Page 1

Stat 200 -Testing Summary Page 1 Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Estimation of a population proportion March 23,

Estimation of a population proportion March 23, 1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

STAC51: Categorical data Analysis

STAC51: Categorical data Analysis STAC51: Categorical data Aalysis Mahida Samarakoo Jauary 28, 2016 Mahida Samarakoo STAC51: Categorical data Aalysis 1 / 35 Table of cotets Iferece for Proportios 1 Iferece for Proportios Mahida Samarakoo

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion 1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Statistical Intervals for a Single Sample

Statistical Intervals for a Single Sample 3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

(7 One- and Two-Sample Estimation Problem )

(7 One- and Two-Sample Estimation Problem ) 34 Stat Lecture Notes (7 Oe- ad Two-Sample Estimatio Problem ) ( Book*: Chapter 8,pg65) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye Estimatio 1 ) ( ˆ S P i i Poit estimate:

More information

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3 Itroductio to Ecoometrics (3 rd Updated Editio) by James H. Stock ad Mark W. Watso Solutios to Odd- Numbered Ed- of- Chapter Exercises: Chapter 3 (This versio August 17, 014) 015 Pearso Educatio, Ic. Stock/Watso

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +

More information

1 Constructing and Interpreting a Confidence Interval

1 Constructing and Interpreting a Confidence Interval Itroductory Applied Ecoometrics EEP/IAS 118 Sprig 2014 WARM UP: Match the terms i the table with the correct formula: Adrew Crae-Droesch Sectio #6 5 March 2014 ˆ Let X be a radom variable with mea µ ad

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters? CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter

More information

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Lecture 6 Simple alternatives and the Neyman-Pearson lemma STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull

More information

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1 PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the z-axis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured

More information

Stat 225 Lecture Notes Week 7, Chapter 8 and 11

Stat 225 Lecture Notes Week 7, Chapter 8 and 11 Normal Distributio Stat 5 Lecture Notes Week 7, Chapter 8 ad Please also prit out the ormal radom variable table from the Stat 5 homepage. The ormal distributio is by far the most importat distributio

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACATÓLICA Quatitative Methods Miguel Gouveia Mauel Leite Moteiro Faculdade de Ciêcias Ecoómicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACatólica 006/07 Métodos Quatitativos

More information

Kinetics of Complex Reactions

Kinetics of Complex Reactions Kietics of Complex Reactios by Flick Colema Departmet of Chemistry Wellesley College Wellesley MA 28 wcolema@wellesley.edu Copyright Flick Colema 996. All rights reserved. You are welcome to use this documet

More information

NUMERICAL METHODS FOR SOLVING EQUATIONS

NUMERICAL METHODS FOR SOLVING EQUATIONS Mathematics Revisio Guides Numerical Methods for Solvig Equatios Page 1 of 11 M.K. HOME TUITION Mathematics Revisio Guides Level: GCSE Higher Tier NUMERICAL METHODS FOR SOLVING EQUATIONS Versio:. Date:

More information

Median and IQR The median is the value which divides the ordered data values in half.

Median and IQR The median is the value which divides the ordered data values in half. STA 666 Fall 2007 Web-based Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5-umber summary mea ad stadard deviatio Media

More information

MA238 Assignment 4 Solutions (part a)

MA238 Assignment 4 Solutions (part a) (i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative

More information

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:

[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is: PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,

More information

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ), Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We

More information

Topic 18: Composite Hypotheses

Topic 18: Composite Hypotheses Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:

More information

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES

SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES SECTION 1.5 : SUMMATION NOTATION + WORK WITH SEQUENCES Read Sectio 1.5 (pages 5 9) Overview I Sectio 1.5 we lear to work with summatio otatio ad formulas. We will also itroduce a brief overview of sequeces,

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

Chapter 6. Sampling and Estimation

Chapter 6. Sampling and Estimation Samplig ad Estimatio - 34 Chapter 6. Samplig ad Estimatio 6.. Itroductio Frequetly the egieer is uable to completely characterize the etire populatio. She/he must be satisfied with examiig some subset

More information

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence PSet ----- Stats, Cocepts I Statistics Cofidece Iterval Guesswork with Cofidece VII. CONFIDENCE INTERVAL 7.1. Sigificace Level ad Cofidece Iterval (CI) The Sigificace Level The sigificace level, ofte deoted

More information

Statistics 300: Elementary Statistics

Statistics 300: Elementary Statistics Statistics 300: Elemetary Statistics Sectios 7-, 7-3, 7-4, 7-5 Parameter Estimatio Poit Estimate Best sigle value to use Questio What is the probability this estimate is the correct value? Parameter Estimatio

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

NCSS Statistical Software. Tolerance Intervals

NCSS Statistical Software. Tolerance Intervals Chapter 585 Itroductio This procedure calculates oe-, ad two-, sided tolerace itervals based o either a distributio-free (oparametric) method or a method based o a ormality assumptio (parametric). A two-sided

More information

Problem Set 4 Due Oct, 12

Problem Set 4 Due Oct, 12 EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios

More information

Last Lecture. Wald Test

Last Lecture. Wald Test Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008 Chapter 6 Part 5 Cofidece Itervals t distributio chi square distributio October 23, 2008 The will be o help sessio o Moday, October 27. Goal: To clearly uderstad the lik betwee probability ad cofidece

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Stat 319 Theory of Statistics (2) Exercises

Stat 319 Theory of Statistics (2) Exercises Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.

More information

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion Poit Estimatio Poit estimatio is the rather simplistic (ad obvious) process of usig the kow value of a sample statistic as a approximatio to the ukow value of a populatio parameter. So we could for example

More information

Understanding Dissimilarity Among Samples

Understanding Dissimilarity Among Samples Aoucemets: Midterm is Wed. Review sheet is o class webpage (i the list of lectures) ad will be covered i discussio o Moday. Two sheets of otes are allowed, same rules as for the oe sheet last time. Office

More information