PubHlth 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval

Size: px
Start display at page:

Download "PubHlth 540 Fall Estimation Page 1 of 72. Unit 6. Estimation. Use at least twelve observations in constructing a confidence interval"

Transcription

1 PubHlth 540 Fall Estimatio Page 1 of 7 Uit 6. Estimatio Use at least twelve observatios i costructig a cofidece iterval - Gerald va Belle What is the mea of the blood pressures of all the studets at the Amherst Regioal High School? It s too much work to measure the blood pressure of every idividual studet i this populatio. So we will make a guess based o the blood pressures of a sample of studets. This is estimatio. Estimatio ivolves usig a statistic calculated for our sample as our estimate of the populatio quatity of iterest. I this example, where the populatio mea is of iterest, possible choices might be (1) the sample mea blood pressure; () the sample media blood pressure; (3) the average of the smallest ad largest blood pressure; ad so o. The poit is, there is o oe choice for estimatio. What we mea by a good choice of estimator is oe focus of this uit. The other focus of this uit is cofidece iterval estimatio. A cofidece iterval is a sigle estimate together with a safety et, which ca also be thought of as a margi of error

2 PubHlth 540 Fall Estimatio Page of 7 Table of Cotets Topic 1. Uit Roadmap.. Learig Objectives 3. Itroductio a. Goals of Estimatio. b. Notatio ad Defiitios. c. How to Iterpret a Cofidece Iterval 4. Prelimiaries: Some Useful Probability Distributios. a. Itroductio to the Studet t- Distributio. b. Itroductio to the Chi Square Distributio.. c. Itroductio to the F-Distributio. d. Sums ad Differeces of Idepedet Normal Radom Vars.. 5. Normal Distributio: Oe Group a. Cofidece Iterval for µ, Kow.... b. Cofidece Iterval for µ, Ukow c. Cofidece Iterval for. 6. Normal Distributio: Paired.. a. Cofidece Iterval for µ DIFFERENCE... b. Cofidece Iterval for DIFFERENCE. 7. Normal Distributio: Two Idepedet Groups:. a. Cofidece Iterval for [µ 1 - µ ] b. Cofidece Iterval for Biomial Distributio: Oe Group.. a. Cofidece Iterval for π Biomial Distributio: Two Idepedet Groups.. a. Cofidece Iterval for [ π1 π ]... Appedices i. Derivatio of Cofidece Iterval for µ Sigle Normal with Kow.. ii. Derivatio of Cofidece Iterval for Sigle Normal. iii. SE of a Biomial Proportio

3 PubHlth 540 Fall Estimatio Page 3 of 7 1. Uit Roadmap Uit 6. Estimatio / Populatios Relatioships Recall that umbers that are calculated from the etirety of a populatio are called populatio parameters. They are represeted by Greek letters such as µ (populatio mea) ad (populatio variace). Ofte, it is ot feasible to calculate the value of a populatio parameter. Numbers that we calculate from a sample are called statistics. They are represeted by Roma letters such as X (sample mea) ad S (sample variace). I our itroductio to samplig distributios, we leared that a sample statistic such as X is a radom variable i its ow right. This ca be uderstood by imagiig that there are ifiitely may replicatios of our study so that there are ifiitely may X. Puttig this together, if a sample statistic such as X is to be used as a estimate of a populatio parameter such as µ, the icorporatio of a measure of its variability (i this case the stadard error of X ) allows us to costruct a margi of error about X as our estimate of µ. The result is a cofidece iterval estimate.

4 PubHlth 540 Fall Estimatio Page 4 of 7. Learig Objectives Whe you have fiished this uit, you should be able to:! Explai that there is more tha oe way to estimate a populatio parameter based o data i a sample.! Explai the criteria of ubiased ad miimum variace i the selectio of a good estimator.! Defie the Studet t, chi square, ad F probability distributio models.! Explai that the sum ad differece of idepedet radom variables that are distributed ormal are also distributed Normal.! Iterpret a cofidece iterval.! Calculate poit ad cofidece iterval estimates of the mea ad variace of a sigle Normal distributio.! Calculate poit ad cofidece iterval estimates of the mea ad variace of a sigle Normal distributio i the paired data settig.! Calculate poit ad cofidece iterval estimates of the differece betwee the meas of two idepedet Normal distributios.! Calculate poit ad cofidece iterval estimates of the ratio of the variaces of two idepedet Normal distributios..! Calculate poit ad cofidece iterval estimates of the π (evet probability) parameter of a sigle biomial distributio.! Calculate poit ad cofidece iterval estimates of the differece betwee the π (evet probability) parameters of two idepedet biomial distributios.

5 PubHlth 540 Fall Estimatio Page 5 of 7 3. Itroductio Recall Biostatistics is the applicatio of probability models ad associated tools to observed pheomea for the purposes of learig about a populatio ad gaugig the relative plausibility of alterative explaatios. Descriptio - Iformatio i a sample is used to summarize the sample itself. It is also used to make guesses of the characteristics of the source populatio. Hypothesis Testig Iformatio i a sample is used to help us compare differet explaatios for what we have observed. Uit 6 is about usig iformatio i a sample to make estimates of the characteristics (parameters) of the populatio that gave rise to the sample. Recall the distictio betwee statistics ad parameters: statistics are estimators of populatio parameters mea X µ variace S

6 PubHlth 540 Fall Estimatio Page 6 of 7 What does it mea to say we kow X from a sample but we do t kow the populatio mea µ? Suppose we have a simple radom sample of observatios X 1 X from some populatio. We have calculated the sample average X. What populatio gave rise to our sample? I theory, there are ifiitely may possible populatios. For simplicity here, suppose there are just 3 possibilities, schematically show below: µ I µ II µ III X Suppose this is the locatio of our X.

7 PubHlth 540 Fall Estimatio Page 7 of 7 Okay, sorry. Here, I m imagiig four possibilities istead of three. Look at this page from the bottom up. Aroud our X, I ve costructed a cofidece iterval. Notice the dashed lies extedig upwards ito the 4 ormal distributios. µ I ad µ IV are outside the iterval aroud X. µ II ad µ III are iside the iterval. I. µ I II. µ II III. µ III IV. µ IV X Cofidece Iterval

8 PubHlth 540 Fall Estimatio Page 8 of 7 We are cofidet that µ could be either µ II or µ III. 3a. Goals of Estimatio Whether a estimator is good or ot good depeds o what criteria we use to defie good. There are potetially lots of criteria. Here, we ll use oe set of two criteria: ubiased ad miimum variace. Covetioal Criteria for a Good Estimator 1. I the log ru, correct (ubiased). I the short ru, i error by as little as possible (miimum variace) 1. Ubiased - I the Log Ru Correct - Tip: Recall the itroductio to statistical expectatio ad the meaig of ubiased (See Uit 4 Beroulli & Biomial pp I the log ru correct. Imagie replicatig the study over ad over agai, ifiitely may times. Each time, calculate your statistic of iterest so as to produce the samplig distributio of that statistic of iterest. Now calculate the mea of the samplig distributio for your statistic of iterest. Is it the same as the populatio parameter value that you are tryig to estimate? If so, the that statistic is a ubiased estimate of the populatio parameter that is beig estimated. Example Uder ormality ad simple radom samplig, S as a ubiased estimate of. I the log ru correct meas that the statistical expectatio of S, computed over the samplig distributio of S, is equal to its target. S i = # samples i samplig dist all possiblesamples"i" Recall that we use the otatio E [ ] to refer to statistical expectatio. Here it is E [ S ] =.

9 PubHlth 540 Fall Estimatio Page 9 of 7. Miimum Variace I Error by as Little as Possible I error by as little as possible. We would like that our estimates ot vary wildly from sample to sample; i fact, we d like these to vary as little as possible. This is the idea of precisio. Whe the estimates vary by as little as possible, we have miimum variace. Puttig together the two criteria ( log ru correct ad i error by as little as possible ) Suppose we wat to idetify the miimum variace ubiased estimator of µ i the settig of a simple radom sample from a ormal distributio. ~ X as estimators of the Cadidate estimators might iclude the sample mea X or the sample media populatio mea µ. Which would be a better choice accordig to the criteria i the log ru correct ad i the short ru i error by as little as possible? Step 1 First, idetify the ubiased estimators Step From amog the pool of ubiased estimators, choose the oe with miimum variace. Illustratio for data from a ormal distributio 1. The ubiased estimators are the sample mea X ~ ad media X ~. variace [ X ] < variace [ X ] Choose the sample mea X. It is the miimum variace ubiased estimator. For a radom sample of data from a ormal probability distributio, X is the miimum variace ubiased estimator of the populatio mea µ. Take home message: Here, we will be usig the criteria of miimum variace ubiased. However, other criteria are possible.

10 PubHlth 540 Fall Estimatio Page 10 of 7 3b. Notatio ad Defiitios Estimatio, Estimator, Estimate - Estimatio is the computatio of a statistic from sample data, ofte yieldig a value that is a approximatio (guess) of its target, a ukow true populatio parameter value. The statistic itself is called a estimator ad ca be of two types - poit or iterval. The value or values that the estimator assumes are called estimates. Poit versus Iterval Estimators - A estimator that represets a "sigle best guess" is called a poit estimator. Whe the estimate is of the form of a "rage of plausible values", it is called a iterval estimator. Thus, A poit estimate is of the form: [ Value ], A iterval estimate is of the form: [ lower limit, upper limit ] Example - The sample mea X, calculated usig data i a sample of size, is a poit estimator of the populatio mea µ. If X mea µ. = 10, the value 10 is called a poit estimate of the populatio

11 PubHlth 540 Fall Estimatio Page 11 of 7 Samplig Distributio Recall the idea of a samplig distributio. It is a theoretically obtaied etity obtaied by imagiig that we repeat, over ad over ifiitely may times, the drawig of a simple radom sample ad the calculatio of somethig from that sample, such as the sample mea X based o a sample size draw of size equal to. The resultig collectio of all possible sample meas is what we call the samplig distributio of X. Recall. The samplig distributio of X plays a fudametal role i the cetral limit theorem. Ubiased Estimator A statistic is said to be a ubiased estimator of the correspodig populatio parameter if its mea or expected value, take over its samplig distributio, is equal to the populatio parameter value. Ituitively, this is sayig that the "log ru" average of the statistic, take over all the possibilities i the samplig distributio, has value equal to the value of its target populatio parameter.

12 PubHlth 540 Fall Estimatio Page 1 of 7 Cofidece Iterval, Cofidece Coefficiet A cofidece iterval is a particular type of iterval estimator. Iterval estimates defied as cofidece itervals provide ot oly several poit estimates, but also a feelig for the precisio of the estimates. This is because they are costructed usig two igrediets: 1) a poit estimate, ad ) the stadard error of the poit estimate. May Cofidece Iterval Estimators are of a Specific Form: lower limit = (poit estimate) - (cofidece coefficiet multiplier)(stadard error) upper limit = (poit estimate) + (cofidece coefficiet multiplier)(stadard error) The "multiple" i these expressios is related to the precisio of the iterval estimate; the multiple has a special ame - cofidece coefficiet. A wide iterval suggests imprecisio of estimatio. Narrow cofidece iterval widths reflects large sample size or low variability or both. Exceptios to this geeric structure of a cofidece iterval are those for a variace parameter ad those for a ratio of variace parameters Take care whe computig ad iterpretig a cofidece iterval!! A commo mistake is to calculate a cofidece iterval but the use it icorrectly by focusig oly o its midpoit.

13 PubHlth 540 Fall Estimatio Page 13 of 7 3c. How to Iterpret a Cofidece Iterval A cofidece iterval is a safety et. Tip: I this sectio, the focus is o the idea of a cofidece iterval. For ow, do t worry about the details. Example Suppose we wat to estimate the average icome from wages for a populatio of 5000 workers, X The average icome that we wat to estimate is the populatio mea µ.,..., X X i µ = i= For purposes of this illustratio, suppose we actually kow the populatio = $1,573. I real life, we would t have such luxury! Suppose the ukow µ = $19,987. Note I m oly tellig you this so that we ca see how well this illustratio performs!. We ll costruct two cofidece iterval estimates of µ to illustrate the importace of sample size i cofidece iterval estimatio: (1) from a sample size of =10, versus () from a sample size of =100

14 PubHlth 540 Fall Estimatio Page 14 of 7 (1) Carol uses a sample size =10 Carol s data are X 1,, X 10 X = 10 = 19,887 = 1,573 SE X = 10 = = 3, () Ed uses a sample size =100 Ed s data are X 1,, X 100 X = 100 = 19,813 = 1,573 SE X = 100 = = 1,57 100

15 PubHlth 540 Fall Estimatio Page 15 of 7 Compare the two SE, oe based o =10 ad the other based o =100 The variability of a average of 100 is less tha the variability of a average of 10. It seems reasoable that, all other thigs beig equal, we should have more cofidece (smaller safety et) i our sample mea as a guess of the populatio mea whe it is based o a larger sample size (100 versus 10).. Takig this oe step further we ought to have complete (100%) cofidece (o safety et required at all) if we iterviewed the etire populatio!. This makes sese sice we would obtai the correct aswer of $19,987 every time. Defiitio Cofidece Iterval (Iformal): A cofidece iterval is a guess (poit estimate) together with a safety et (iterval) of guesses of a populatio characteristic. I most istaces, it is easy to see the 3 compoets of a cofidece iterval: 1) A poit estimate (e.g. the sample mea X ) ) The stadard error of the poit estimate ( e.g. SEX = ) 3) A cofidece coefficiet (cof. coeff) I most istaces (meas, differeces of meas, regressio parameters, etc), the structure of a cofidece iterval is calculated as follows: Lower limit = (poit estimate) (cofidece coefficiet)(se) Upper limit = (poit estimate) + (cofidece coefficiet)(se) I other istaces (as you ll see i the ext pages), the structure of a cofidece iterval looks differet, as for cofidece itervals for Populatio variace Populatio stadard deviatio Ratio of two populatio variaces relative risk Odds ratio

16 PubHlth 540 Fall Estimatio Page 16 of 7 Example: Carol samples = 10 workers. mea X = $19,887 Stadard error of sample mea, SE X = = $3,976 for =10 Cofidece coefficiet for 95% cofidece iterval = 1.96 Lower limit = (poit estimate) (cofidece coefficiet)(se) = $19,887 (1.96)($3976) = $1,094 Upper limit = (poit estimate) + (cofidece coefficiet)(se) = $19,887 + (1.96)($3976) = $7,680 Width = ($7,680 - $1,094) = $15,586 Example: Ed samples = 100 workers. mea X = $19,813 Stadard error of sample mea, SE X = = $1,57 for =100 Cofidece coefficiet for 95% cofidece iterval = 1.96 Lower limit = (poit estimate) (cofidece coefficiet)(se) = $19,813 (1.96)($157) = $17,349 Upper limit = (poit estimate) + (cofidece coefficiet)(se) =$19,813 + (1.96)($157) = $,77 Width = ($,77 - $17,349) = $4,98 Estimate 95% Cofidece Iterval Carol 10 $19,887 ($1,094, $7,680) Wide Ed 100 $19,813 ($17,349, $,77) Narrow Truth 5000 $19,987 $19,987 No safety et Defiitio 95% Cofidece Iterval If all possible radom samples (a ifiite umber) of a give sample size (e.g. 10 or 100) were obtaied ad if each were used to obtai its ow cofidece iterval, The 95% of all such cofidece itervals would cotai the ukow; the remaiig 5% would ot.

17 PubHlth 540 Fall Estimatio Page 17 of 7 But Carol ad Ed Each Have Oly ONE Iterval: So ow what?! The defiitio above does t seem to help us. What ca we say? Carol says: With 95% cofidece, the iterval $1,094 to $7,680 cotais the ukow true mea µ. Ed says: With 95% cofidece, the iterval $17,349 to $,77 cotais the ukow true mea µ. Cautio o the use of Cofidece Itervals: 1) It is icorrect to say The probability that a give 95% cofidece iterval cotais µ is 95% A give iterval either cotais µ or it does ot. ) The cofidece coefficiet (recall this is the multiplier we attach to the SE) for a 95% cofidece iterval is the umber eeded to esure 95% coverage i the log ru (i probability).

18 PubHlth 540 Fall Estimatio Page 18 of 7 Here is a picture to get a feel for the ideas of cofidece iterval, safety et, ad precisio: For each samplig pla (=10, or =100, or =ifiity), the figure (admittedly ot the faciest of art) gives a feel for the collectio of all possible cofidece itervals. Notice (1) Ay oe cofidece iterval either cotais µ or it does ot. This illustrates that it is icorrect to say There is a 95% probability that the cofidece iterval cotais µ () For a give sample size (10 or 100 or ), the width of all the cofidece itervals is the same. (3) Cofidece itervals based o larger sample sizes are more arrow (more precise) (4) Whe is equal to the size of the populatio, µ is i the iterval every time.

19 PubHlth 540 Fall Estimatio Page 19 of 7 Some additioal remarks o the iterpretatio of a cofidece iterval might be helpful Each sample gives rise to its ow poit estimate ad cofidece iterval estimate built aroud the poit estimate. The idea is to costruct our itervals so that: IF all possible samples of a give sample size (a ifiite #!) were draw from the uderlyig distributio ad each sample gave rise to its ow iterval estimate, THEN 95% of all such cofidece itervals would iclude the ukow µ while 5% would ot Aother Illustratio of - It is NOT CORRECT to say: The probability that the iterval (1.3, 9.5) cotais µ is Why? Because either µ is i (1.3, 9.5) or it is ot. For example, if µ=5.3 the µ is i (1.3, 9.5) with probability = 1. If µ=1.0 the µ is i (1.3, 9.5) with probability=0. I toss a fair coi, but do t look at the result. The probability of heads is 1/. I am 50% cofidet that the result of the toss is heads. I other words, I will guess heads with 50% cofidece. Either the coi shows heads or it shows tails. I am either right or wrog o this particular toss. I the log ru, if I were to do this, I should be right about 50% of the time hece 50% cofidece. But for this particular toss, I m either right or wrog. I most experimets or research studies we ca t look to see if we are right or wrog but we defie a cofidece iterval i a way that we kow i the log ru 95% of such itervals will get it right.

20 PubHlth 540 Fall Estimatio Page 0 of 7 4. Prelimiaries: Some Useful Probability Distributios 4a. Itroductio to the Studet t-distributio Lookig ahead. Percetiles of the studet t-distributio are used i cofidece itervals for meas whe the populatio variace is NOT kow. There are a variety of defiitios of a studet t radom variable. A particularly useful oe for us here is the followig. It appeals to our uderstadig of the z-score. A Defiitio of a Studet s t Radom Variable Cosider a simple radom sample X usual way:... X from a Normal(µ, ) distributio. Calculate X ad S i the 1 X = i=1 X i ad S = i=1 ( X i X) -1 A studet s t distributed radom variable results if we costruct a t-score istead of a z-score. t - score = t = X - DF=-1 s / µ is distributed Studet s t with degrees of freedom = (-1) Note The abbreviatio df is ofte used to refer to degrees of freedom

21 PubHlth 540 Fall Estimatio Page 1 of 7 The features of the Studet s t-distributio are similar, but ot idetical, to those of a Normal Distributio x Bell Shaped Symmetric about zero Flatter tha the Normal (0,1). This meas (i) The variability of a studet t variable greater tha that of a stadard ormal (0,1) (ii) Thus, there is more area uder the tails ad less at ceter (iii) Because variability is greater, resultig cofidece itervals will be wider. The relative greater variability of a Studet s t- distributio (compared to a Normal) makes sese. We have added ucertaity i our cofidece iterval because we are usig a estimate of the stadard error rather tha the actual value of the stadard error.

22 PubHlth 540 Fall Estimatio Page of 7 Each degree of freedom (df) defies a separate studet s t-distributio. As the degrees of freedom gets larger, the studet s t-distributio looks more ad more like the stadard ormal distributio with mea=0 ad variace=1. Normal (0,1) Studet s t DF=5 Studet s t DF=5 Degrees of freedom=5 Degrees of freedom=5

23 PubHlth 540 Fall Estimatio Page 3 of 7 How to Use the Studet t Distributio Calculator Provided by Texas A & M Source: I the box degrees of freedom, eter the degrees of freedom (df) you wat. At lower left you ca choose left tail or right tail areas You ca either (a) solve for a probability (area) from a t-statistic value; or (b) solve for a t-statistic value from a probability (area) Click Compute!. Example - Area to left of t=3.078 is =.90 Area to right of t=3.078 is =.10

24 PubHlth 540 Fall Estimatio Page 4 of 7 4b. Itroductio to the Chi Square Distributio Lookig ahead. Percetiles of the chi square distributio are used i cofidece itervals for a sigle populatio variace or sigle populatio stadard deviatio. Suppose we have a simple radom sample from a Normal distributio. We wat to calculate a cofidece iterval estimate of the ormal distributio variace parameter,. To do this, we work with a ew radom variable Y that is defied as follows: (-1)S Y=, I this formula, S is the sample variace. Uder simple radom samplig from a Normal(µ, ) (-1)S Y= is distributed Chi Square with degrees of freedom = (-1) Mathematical Defiitio Chi Square Distributio The above ca be stated more formally. (1) If the radom variable X follows a ormal probability distributio with mea µ ad variace, The the radom variable V defied: V= ( X-µ ) is distributed chi square distributio with degree of freedom = 1. () If each of the radom variables V 1,..., V k is distributed chi square with degree of freedom = 1, ad if these are idepedet, The their sum, defied: V V k is distributed chi square distributio with degrees of freedom = k

25 PubHlth 540 Fall Estimatio Page 5 of 7 So ow we have defiitios of a Chi Square Distributio. What do we do with that?! The two defiitios o the previous page are cosistet because it is possible (with a little algebra) to re-write (-1)S Y= as the sum of (-1) idepedet chi square radom variables V, each with degrees of freedom = 1. NOTE: For this course, it is ot ecessary to kow the probability desity fuctio for the chi square distributio. Features of the Chi Square Distributio: (1) Whe data are a radom sample of idepedet observatios from a ormal probability distributio ad iterest is i the behavior of the radom variable defied as the sample variace S, the assumptios of the chi square probability distributio hold. () The first mathematical defiitio of the chi square distributio says that it is defied as the square of a stadard ormal radom variable. (3) A chi square radom variable caot be egative. Because the chi square distributio is obtaied by the squarig of a radom variable, this meas that a chi square radom variable ca assume oly o-egative values. That is, the probability desity fuctio has domai [0, ) ad is ot defied for outcome values less tha zero. Thus, the chi square distributio is NOT symmetric. Here is a picture.

26 PubHlth 540 Fall Estimatio Page 6 of 7 Features of the Chi Square Distributio - cotiued: (4) The fact that the chi square distributio is NOT symmetric about zero meas that for Y=y where y>0: Pr[Y > y] is NOT EQUAL to Pr[Y < -y] However, because the total area uder a probability distributio is 1, it is still true that 1 = Pr[Y < y] + Pr[Y > y] (5) The chi square distributio is less skewed as the umber of degrees of freedom icreases. See below. Degrees of freedom=1 Degrees of freedom=5 Degrees of freedom=0 Source: (6) Like the degrees of freedom for the Studet's t-distributio, the degrees of freedom associated with a chi square distributio is a idex of the extet of idepedet iformatio available for estimatig populatio parameter values. Thus, the chi square distributios with small associated degrees of freedom are relatively flat to reflect the imprecisio of estimates based o small sample sizes. Similarly, chi square distributios with relatively large degrees of freedom are more cocetrated ear their expected value.

27 PubHlth 540 Fall Estimatio Page 7 of 7 How to Use the Chi Square Distributio Calculator Provided by Texas A & M Source: I the box degrees of freedom, eter the degrees of freedom (df) you wat. Note This calculator provides ONLY right tail areas. Therefore, if you wat a left tail area you have to solve it as left tail area = 1 - [right tail area] You ca either (a) solve for a right tail probability (area) from a chi square statistic value; or (b) solve for a chi square value from a right tail probability (area). Click Compute!. Example - What is the 95th percetile value of a chi square distributio with degrees of freedom equal to 1? Aswer = Tip The area uder the curve to the right of the 95 th percetile is Calculator used:

28 PubHlth 540 Fall Estimatio Page 8 of 7 4c. Itroductio to the F Distributio Lookig ahead. Percetiles of the F distributio are used i cofidece itervals for the ratio of two idepedet variaces. Suppose we are Iterested i Comparig Two Idepedet Variaces Ulike the approach used to compare two meas i the cotiuous variable settig (where we will look at their differece), the compariso of two variaces is accomplished by lookig at their ratio. Ratio values close to oe are evidece of similarity. Of iterest will be a cofidece iterval estimate of the ratio of two variaces i the settig where data are comprised of two idepedet samples of data, each from a separate Normal distributio. Examples - I have a ew measuremet procedure. Are the results more variable tha those obtaied usig the stadard procedure? I am doig a prelimiary aalysis to determie whether or ot it is appropriate to compute a pooled variace estimate or ot, whe the goal is comparig the mea levels of two groups. Whe comparig two idepedet variaces, we will use a RATIO rather tha a differece. Specifically, we will look at the ratios of variaces of the form: s x /sy If the value of the ratio is close to 1, this suggests that the populatio variaces are similar. If the value of the ratio is very differet from 1, this suggests that the populatio variaces are ot the same. We use percetiles from the F distributio to costruct a cofidece iterval for X Y

29 PubHlth 540 Fall Estimatio Page 9 of 7 A Defiitio of the F-Distributio Suppose X 1,..., X x is a simple radom sample from a ormal distributio with mea µ X ad variace X. Suppose further that Y 1,..., Y y is a simple radom sample from a ormal distributio with mea µ Y ad variace Y. If the two sample variaces are calculated i the usual way S The = x i=1 X ( x x) i -1 x ad S = Y i=1 Y ( y y) i -1 Y F S X x x 1,y-1 SY y = is distributed F with two degree of freedom specificatios Numerator degrees of freedom = x -1 Deomiator degrees of freedom = y -1 For the advaced reader This ca be skipped if you are usig a olie calculator There is a relatioship betwee the values of percetiles for pairs of F Distributios that is defied as follows: F d,d ; α / 1 = F d,d ;(1 α )/ 1 1 Notice that (1) the degrees of freedom are i opposite order, ad () the solutio for a left tail percetile is expressed i terms of a right tail percetile. This is useful whe the published table does ot list the required percetile value; usually the missig percetiles are the oes i the left tail.

30 PubHlth 540 Fall Estimatio Page 30 of 7 How to Use the F Distributio Calculator Provided by Texas A & M Source: df stads for degrees of freedom. Eter umerator df ad deomiator df. If you wat a right tail probability, type i F-value i box Area right of. If you wat the value of a percetile, type i the right tail area i the box = Click Compute!. Example - What is the 95th percetile value of a F distributio radom variable with umerator degrees of freedom equal to 4 ad deomiator degrees of freedom equal to 3? Aswer: The 95 th percetile of a F distributio with umerator degrees of freedom = 4 ad deomiator degrees of freedom = 3 is equal to 8.639

31 PubHlth 540 Fall Estimatio Page 31 of 7 4d. Sums ad Differeces of Idepedet Normal Radom Variables Lookig ahead. We will be calculatig cofidece itervals of such thigs as the differece betwee two idepedet meas (eg cotrol versus itervetio i a radomized cotrolled trial) Suppose we have to idepedet radom samples, from two idepedet ormal distributios. eg radomized cotrolled trial of placebo versus treatmet groups). We suppose we wat to compute a cofidece iterval estimates of the differece of the meas. Poit Estimator: How do we obtai a poit estimate of the differece [ µ Group 1 - µ Group ]? A good poit estimator of the differece betwee populatio meas is the differece betwee sample meas, X X ] [ Group 1 Group Stadard Error of the Poit Estimator: We eed the stadard error of [ XGroup 1 XGroup ] Defiitios IF THEN! (for group 1): X 11, X 1, X 11 is a simple radom sample from a Normal (µ 1, 1 )! (for group ): X 1, X, X is a simple radom sample from a Normal (µ, )! This is great! We already kow the samplig distributio of each sample mea X is distributed Normal (µ 1, / ) Group 1 Group 1 1 / X is distributed Normal (µ, ) [XGroup 1 X Group ] is also distributed Normal with [ µ µ ] Mea = Group1 Group Variace = 1 + 1

32 PubHlth 540 Fall Estimatio Page 3 of 7 Be careful!! The stadard error of the differece is NOT the sum of the two separate stadard errors. Notice You must first sum the variace ad the take the square root of the sum. SE X X = + Group 1 Group 1 1 A Geeral Result Hady! If radom variables X ad Y are idepedet with E [X] = µ X ad Var [X] = E [Y] = µ Y ad Var [Y] = X Y The E [ ax + by ] = aµ X + bµ Y Var [ ax + by ] = a X + b Y ad Var [ ax - by ] = a X + b Y NOTE: This result ALSO says that, whe X ad Y are idepedet, the variace of their differece is equal to the variace of their sum. This makes sese if it is recalled that variace is defied usig squared deviatios which are always positive.

33 PubHlth 540 Fall Estimatio Page 33 of 7 5. Normal Distributio: Oe Group 5a. Cofidece Iterval for µ ( Kow) Itroductio ad where we are goig Hopefully, you will see that the logic ad mechaics of cofidece iterval costructio are very similar across a variety of settigs. I this uit, we cosider the settig of data from a ormal distributio (or two ormal distributios) ad the settig of data from a biomial distributio (or two biomial distributios). We wat to compute a cofidece iterval estimate of µ for a populatio distributio that is ormal with kow. Available are data from a radom sample of size=. These pages show you how to costruct a cofidece iterval. Appedix 1 gives the statistical theory uderlyig this calculatio 1. The Poit Estimate of µ is the Mea X Recall that, for a sample of size=, the sample mea is calculated as Features: X X = i= 1 i 1. Uder simple radom samplig, the sample mea (X) is a ubiased estimator of the populatio mea parameter µ, regardless of the uderlyig probability distributio.. Whe the uderlyig probability distributio is ormal, the sample mea X also satisfies the criterio of beig miimum variace ubiased (See page 5).

34 PubHlth 540 Fall Estimatio Page 34 of 7. The Stadard Error of X is The precisio of X as a estimate of the ukow populatio mea parameter µ is reflected i its stadard error. Recall SE(X ) = variace(x ) = SE is smaller for smaller (measuremet error) SE is smaller for larger (study desig) 3. The Cofidece Coefficiet The cofidece coefficiet for a 95% cofidece iterval is the umber eeded to isure 95% coverage i the log ru (i probability). See agai, page 18. For a 95% cofidece iterval, this umber will be the 97.5 th percetile of the Normal (0,1) distributio. For a (1-α)100% cofidece iterval, this umber will be the (1-α/)100 th percetile of the Normal (0,1) distributio. The table below gives some of these values i the settig of costructig a cofidece iterval estimate of µ whe data are from a Normal distributio with kow. Cofidece Level Percetile Cofidece Coefficiet = Percetile Value from Normal (0,1) th th th th th th.576 (1-α ) (1-α/)100 th - Example - For a 50% CI, th.50 = (1- α ) says α=.50 ad says (1-α/)=.75. Thus, use 75 percetile of N(0,1)=0.674

35 PubHlth 540 Fall Estimatio Page 35 of 7 Example - The followig data are the weights (micrograms) of drug iside each of 30 capsules, after subtractig the capsule weight. Suppose it is kow that = 0.5. Uder the assumptio of ormality, calculate a 95% cofidece iterval estimate of µ The data are simple radom sample of size =30 from a Normal distributio with mea = µ ad variace =. The populatio variace is kow ad has value = 0.5 Remark I real life, we will rarely kow! This example is for illustratio oly. The solutio for the cofidece iterval is poit estimate + safety et: Lower limit = ( poit estimate ) - ( multiple ) (SE of poit estimate) Upper limit = ( poit estimate ) + ( multiple ) (SE of poit estimate) Poit Estimate of µ is the Mea =30 X =30 X Xi = i= 1 = 051. = 30 The Stadard Error of X is SE(X 0. 5 ) = variace(x ) = = = =30 =30

36 PubHlth 540 Fall Estimatio Page 36 of 7 The Cofidece Coefficiet For a 95% cofidece iterval, this umber will be the 97.5 th percetile of the Normal (0,1) distributio. From the table o page 34 (or a Normal(0,1) calculator o the web), obtai the value Desired Cofidece Level Value of Cofidece Coefficiet th.95 = (1- α ) says α=.05 ad says (1-α/)=.975. Thus, use 97.5 percetile of N(0,1)=1.96 For a 95% CI, Puttig this all together Lower limit = ( poit estimate ) - ( multiple ) (SE of poit estimate) = ( 1.96 ) ( ) = 0.33 Upper limit = ( poit estimate ) + ( multiple ) (SE of poit estimate) = ( 1.96 ) ( ) = 0.69 Thus, we have the followig geeral formula for a (1 - α)100% cofidece iterval - ( ) th X ± [1-α100 percetile of Normal(0,1)] SE(X )

37 PubHlth 540 Fall Estimatio Page 37 of 7 How to Calculate the Proportio of Meas i a Give Iterval (Use the idea of stadardizatio) We leared how to do this i Uit 5. The Normal Distributio. Example A sample of size =100 from a ormal distributio with ukow mea yields a sample mea X =100 = The populatio variace of the ormal distributio is kow to be equal to = 36, What proportio of meas of size=100 will lie i the iterval [00,300] if it is kow that µ = 50 Aswer = 99.1% Solutio: Step 1 - The radom variable that we eed to stadardize is X =100. Mea = 50 SE = 100 = 36, = Step - Probability [ 00 < X =100 < 300] by the stadardizatio formula is X µ / = Pr < < = Pr[ < Z <+.608] =

38 PubHlth 540 Fall Estimatio Page 38 of 7 5b. Cofidece Iterval for µ ( NOT kow) - I sectio 5a, we assumed that is kow ad obtaied a cofidece iterval for µ of the form ( ) ( ) lower limit = X - z (1-α /)100 / upper limit = X + z (1-α /)100 / - The required cofidece coefficiet (z 1-α/ ) was obtaied as a percetile from the stadard ormal, N(0,1), distributio. (e.g. for a 95% CI, we used the 97.5 th percetile) - More realistically, however, will ot be kow. Now what? Reasoably, we might replace with s. Recall that s is the sample stadard deviatio ad we get it as follows: s= s where s - So far so good. But there is a problem. Whereas = i=1 X µ ( x x) i -1 IS distributed Normal (0,1) X µ is NOT distributed Normal (0,1). s - We have to modify our machiery (specifically the SE piece of our machiery) to accommodate the ukow-ess of. Whereas we previously used whe was kow z-score Percetile from Normal(0,1) With ukow we ow use t-score Percetile from Studet s t Uder simple radom samplig from a ormal distributio, the cofidece iterval for a ukow mea µ, the cofidece iterval will be of the followig form ( ) ( ) lower limit = X - t DF; (1-α /)100 s / upper limit = X + t DF; (1-α /)100 s /

39 PubHlth 540 Fall Estimatio Page 39 of 7 Whe is ot kow, the computatio of a cofidece iterval for the mea µ is ot altered much. We simply replace the cofidece coefficiet from the N(0,1) with oe from the appropriate Studet s t-distributio (the oe with df = -1) We replace the (ow ukow) stadard error with its estimate. The latter looks early idetical except that it utilizes s i place of Recall Thus, s= ( Xi X) i=1 ( 1) Cofidece Iterval for µ i two settigs of a sample from a Normal Distributio is KNOWN is NOT Kow X ± (z 1 α/ )(/ ) X ± (t 1;1 α/ )(s/ )

40 PubHlth 540 Fall Estimatio Page 40 of 7 Example A radom sample of size =0 duratios (miutes) of cardiac bypass surgeries has a mea duratio of X = 67 miutes, ad variace s = 36,700 miutes. Assumig the uderlyig distributio is ormal with ukow variace, costruct a 90% CI estimate of the ukow true mea, µ. Aswer: (193., 340.8) miutes Solutio: Step 1 - Poit Estimate of µ is the Mea X X i i= 1 X=0 = = 67 =0 miutes. Step The Estimated Stadard Error of X is s S 36,700 SE(X ˆ ˆ =0) = variace(x =0) = = = 4.7 miutes 0 Step 3 - The Cofidece Coefficiet For a 90% cofidece iterval, this umber will be the 95 th percetile of the Studet s t-distributio that has degrees of freedom = (-1) = 19. This value is Puttig this all together Lower limit = ( poit estimate ) - ( cofidece coefficiet. ) (SE of poit estimate) = 67 - ( 1.79 ) (4.7 ) = Upper limit = ( poit estimate ) + ( cofidece coefficiet ) (SE of poit estimate) = 67 + ( 1.79 ) (4.7 ) = Thus, a 90% cofidece iterval for the true mea duratio of surgery is (193., 340.8) miutes.

41 PubHlth 540 Fall Estimatio Page 41 of 7 5c. Cofidece Iterval for A cofidece iterval for is calculated usig percetiles from the chi square distributio. The followig are some settigs where our iterest lies i estimatio of the variace, - Stadardizatio of equipmet repeated measuremet of a stadard should have small variability - Evaluatio of techicias are the results from a particular techicia too variable - Compariso of measuremet techiques are the measuremets obtaied usig a ew techique too variable compared to the precisio of the old techique? We have a poit estimator of. It is S. How do we get a cofidece iterval? The aswer will utilize a ew stadardized variable, based o the way i which S is computed. It is a chi square radom variable. The defiitio of the chi square distributio gives us what we eed to costruct a cofidece iterval estimate of whe data are a simple radom sample from a ormal probability distributio. The approach here is similar to that for estimatig the mea µ. The table below shows how to costruct a cofidece iterval. For the iterested reader, Appedix is the derivatio behid the calculatio. (1-α)100% Cofidece Iterval for Settig Normal Distributio Lower limit = (-1)S χ 1- α/ Upper limit = (-1)S χ α /

42 PubHlth 540 Fall Estimatio Page 4 of 7 Example A precisio istrumet is guarateed to read accurately to withi + uits. A sample of 4 readigs o the same object yield 353, 351, 351, ad 355. Fid a 95% cofidece iterval estimate of the populatio variace ad also for the populatio stadard deviatio. Aswer: (1.18, 51.0) uits squared Solutio: 1. Obtai the poit estimate of. It is the sample variace S To get the sample variace S, we will eed to compute the sample mea first. X i i=1 X= = 35.5 ad ( ) Xi X i= 1 S = = Determie the correct chi square distributio to use. It has degrees of freedom, df = (4-1) = Obtai the correct multipliers. Because the desired cofidece level is 0.95, we set 0.95 = (1-α). Thus α =.05 For a 95% cofidece level, the percetiles we wat are (i) (α/)100 th =.5 th percetile (ii) (1 α/)100 th = 97.5 th percetile Obtai percetiles for chi square distributio with degrees of freedom = 3 (i) (ii) χ = = df 3, df 3, χ = =

43 PubHlth 540 Fall Estimatio Page 43 of 7 4. Thus, (i) Lower limit = (ii) Upper limit = (-1)S χ 1- α/ (-1)S χ α / (3)(3.67) = = (3)(3.67) = = Obtai a Cofidece Iterval for the Populatio Stadard Deviatio Aswer: (1.09, 7.14) uits Solutio: Step 1 Obtai a cofidece iterval for (1.178, 51.0) Step The associated cofidece iterval for is obtaied by takig the square root of each of the lower ad upper limits - 95% Cofidece Iterval = ( 1.178, 51.0) = (1.09,7.14) - Poit estimate = 3.67 = 1.9 Remarks o the Cofidece Iterval for It is NOT symmetric about the poit estimate; the safety et o each side of the poit estimate is of differet legths. These itervals ted to be wide. Thus, large sample sizes are required to obtai reasoably arrow cofidece iterval estimates for the variace ad stadard deviatio parameters.

44 PubHlth 540 Fall Estimatio Page 44 of 7 Itroductio to Paired 6. Normal Distributio: Paired Paired data occur whe each idividual (more specifically, each uit of measuremet) i a sample is measured twice. Paired data are familiar: "pre/post, "before/after", right/left,, paret/child, etc. Examples - 1) Blood pressure prior to ad followig treatmet, ) Number of cigarettes smoked per week measured prior to ad followig participatio i a smokig cessatio program, 3) Number of sex parters i the moth prior to ad i the moth followig a HIV educatio campaig. I each of these examples that the two occasios of measuremet are liked by virtue of the two measuremets beig made o the same idividual. We are iterested i comparig the two paired outcomes. Whe the paired data are cotiuous, the compariso focuses o the differece betwee the two paired measuremets. Note We ll see later that whe the data are discrete, a aalysis of paired data might focus o the ratio (eg. relative risk) of the two measuremets rather tha o the differece. Examples: 1) Blood pressure prior to ad followig treatmet. Iterest is d = pre-post. Large differeces are evidece of blood pressure lowerig associated w treatmet. ) Number of cigarettes smoked per week measured prior to ad followig participatio i a smokig cessatio program. Iterest is d=pre-post. Large differeces d are evidece of smokig reductio. 3) Number of sex parters i the moth prior to ad i the moth followig a HIV educatio campaig. Iterest is d=pre post. Large differeces are evidece of safer sex behaviors.

45 PubHlth 540 Fall Estimatio Page 45 of 7 6a. Cofidece Iterval for µ DIFFERENCE Suppose two paired measuremets are made of the same pheomeo (eg. blood pressure, # cigarettes/week, etc) o each idividual i a sample. Call them X ad Y. If X ad Y are each ormally distributed, the their differece is also distributed ormal. (see agai, sectio 4d) Thus, the settig is our focus o the differece D ad the followig assumptios (1) D = (X - Y) is distributed Normal with () Mea of D = µ differece. Let s write this as µ d (3) Variace of D = Let s write this as DIFFERENCE d I this settig, estimatio for paired data is a special case of selected methods already preseted. Attetio is restricted to the sigle radom variable defied as the differece betwee the two measuremets. The methods already preseted that we ca use here are (1) Cofidece Iterval for µ d - Normal Distributio d ukow () Cofidece Iterval for d - Normal Distributio Example source: Aderso TW ad Sclove SL. Itroductory Statistical Aalysis. Bosto: Houghto Miffli, page 339 A researcher is iterested assessig the improvemet i readig skills upo completio of the secod grade (Y) i compariso to those prior to the secod grade (X). For each child, his or her improvemet is measured usig the differece d which is defied d = Y X. A sample of =30 childre are studied. The data are show o the ext page.

46 PubHlth 540 Fall Estimatio Page 46 of 7 ID PRE(X) POST(Y) d=(y-x) Calculate (1) A 99% cofidece iterval for µ d () A 80% cofidece Iterval for d

47 PubHlth 540 Fall Estimatio Page 47 of 7 Solutio for a 99% Cofidece Iterval for µ d Step 1 Poit Estimate of µ d is the Mea =30 d i i= 1 d=30 = = 0.51 =30 d Step The Estimated Stadard Error of d is S d Calculate the sample variace of the idividual differeces: ( ) di d i= 1 Sd = = The estimated variace of the sample mea of the differeces is therefore: Thus, ˆ Sd variace(d ˆ =30) = = 30 d SE(d =30) = variace(d =30) = = = Step 3 The Cofidece Coefficiet ˆ S For a 99% cofidece iterval, this umber will be the 99.5 th percetile of the Studet s t-distributio that has degrees of freedom = (-1) = 9. This value is.756. Step 4 Substitute ito the formula for a cofidece iterval Lower limit = ( poit estimate ) - ( cofidece coefficiet ) (SE of poit estimate) = (.756 ) ( ) = 0.67 Upper limit = ( poit estimate ) + ( cofidece coefficiet. ) (SE of poit estimate) = (.756 ) ( ) =

48 PubHlth 540 Fall Estimatio Page 48 of 7 6b. Cofidece Iterval for DIFFERENCE Solutio for a 80% Cofidece Iterval for d. Step 1 - Obtai the poit estimate of d. ( ) di d i= 1 Sd = = Step - Determie the correct chi square distributio to use. It has df = (30-1) = 9. Step 3 - Obtai the correct multipliers. Because the desired cofidece level is 0.80, set 0.80 = (1-α). Thus α =.0 For a 80% cofidece level, α =.0 ad α/ =.10 so we wat: (i) (α/)100 th = 10 th percetile (ii) (1 α/)100 th = 90 th percetile Usig a chi square distributio calculator, set degrees of freedom, df = 9 (i) (ii) χ = = df 9, df 9, χ = = Step 4 Substitute ito the formula for the cofidece iterval (i) Lower limit = (ii) Upper limit = (-1)S χ 1- α/ (-1)S χ α / d d (9)(0.416) = = (9)(0.416) = =

49 PubHlth 540 Fall Estimatio Page 49 of 7 7. Normal Distributio: Two Idepedet Groups Illustratio of the Settig of Two Idepedet Groups Example - A researcher performs a drug trial ivolvig two idepedet groups. o A cotrol group is treated with a placebo while, separately; o A idepedet itervetio group is treated with a active aget. o Iterest is i a compariso of the mea cotrol respose with the mea itervetio respose uder the assumptio that the resposes are idepedet. o The tools of cofidece iterval costructio described for paired data are ot appropriate. 7a. Cofidece Iterval for [ µ GROUP1 - µ GROUP ] Suppose we wat to compare the mea respose i oe group with the mea respose i a separate group. Suppose further that two groups are idepedet. Examples - 1) Is mea blood pressure the same for males ad females? ) Is body mass idex (BMI) similar for breast cacer cases versus o-cacer patiets? 3) Is legth of stay (LOS) for patiets i hospital A the same as that for similar patiets i hospital B? For cotiuous data, the compariso of two idepedet groups focuses o the differece betwee the meas of the two groups.! Similarity of the two groups is reflected i a differece betwee meas that is ear zero.! Focus is o [ µ Group 1 - µ Group ]

50 PubHlth 540 Fall Estimatio Page 50 of 7 Poit Estimator: We wat a poit estimate of the differece [ µ Group 1 - µ Group ] Our poit estimator will be the differece betwee sample meas, [ XGroup 1 XGroup ] Stadard Error of the Poit Estimator: We eed the value (or estimate of ) the stadard error of [ X X ]. We ca use what we leared i sectio 4d (see pp 31-3) Group 1 Group! Sice X 11, X 1, X 11 is a simple radom sample from a Normal (µ 1, )! Ad sice X 1, X, X is a simple radom sample from a Normal (µ, ) 1! We have X is distributed Normal (µ 1, / ) Group 1 Group 1 1 / X is distributed Normal (µ, )! Ad thus, [XGroup 1 X Group ] is also distributed Normal with [ µ µ ] Mea = Group1 Group Variace = 1 + 1

51 PubHlth 540 Fall Estimatio Page 51 of 7 How to Estimate the Stadard Error The correct solutio depeds o ad 1. Solutio 1 - ad 1 are both kow SE X X = + Group 1 Group 1 1 Solutio - ad 1 are both NOT kow but are assumed EQUAL SEˆ X S S X = + Group 1 Group pool pool 1 S pool is a weighted average of the two separate sample variaces, with weights equal to the associated degrees of freedom cotributios. S ( 1)S + ( 1)S 1 1 pool = (1 1) + ( 1) Solutio 3 - ad 1 are both NOT kow ad NOT EQUAL SEˆ X S S X = + Group 1 Group 1 1

52 PubHlth 540 Fall Estimatio Page 5 of 7 Cofidece Coefficiet ( Multiplier) Agai, the correct solutio depeds o ad 1. Solutio 1 - ad 1 are both kow Use percetile of Normal(0,1) Solutio - ad 1 are both NOT kow but are assumed EQUAL Use percetile of Studet s t Degrees of freedom = ( 1 1) + ( 1) Solutio 3 - ad 1 are both NOT kow ad NOT EQUAL Use percetile of Studet s t Degrees of freedom = f where f is give by formula (Satterthwaite) S1 S + 1 f= S 1 S Horrible, is t it!

53 PubHlth 540 Fall Estimatio Page 53 of 7 Summary Normal Distributio: Cofidece Iterval for [ µ 1 - µ ] (Two Idepedet Groups) Sceario " Estimate 1 ad kow X CI = [poit estimate] are both ± (cof.coeff)se[poit estimate] 1 ad are both NOT kow but are assumed EQUAL 1 ad are both NOT kow ad NOT Equal Group 1 XGroup XGroup 1 XGroup XGroup 1 XGroup SE to use 1 SE XGroup 1 XGroup = + 1 SEˆ X S S X = + Group 1 Group where you already have obtaied: pool pool 1 SEˆ X S S X = + Group 1 Group 1 1 S ( 1)S + ( 1)S 1 1 pool = (1 1) + ( 1) Cofidece Coefficiet Use Percetiles from Normal Studet s t Studet s t Degrees freedom Not applicable ( 1 1) + ( 1) S1 S + 1 f= S 1 S

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Topic 10: Introduction to Estimation

Topic 10: Introduction to Estimation Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio

More information

Unit 6 Estimation Week #10 - Practice Problems SOLUTIONS

Unit 6 Estimation Week #10 - Practice Problems SOLUTIONS PubHlth 540 Itroductory Biostatistics Page of 7 Uit 6 Estimatio Week #0 - Practice Problems SOLUTIONS. A etomologist samples a field for egg masses of a harmful isect by placig a yardsquare frame at radom

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

BIOSTATS 640 Intermediate Biostatistics Frequently Asked Questions Topic 1 FAQ 1 Review of BIOSTATS 540 Introductory Biostatistics

BIOSTATS 640 Intermediate Biostatistics Frequently Asked Questions Topic 1 FAQ 1 Review of BIOSTATS 540 Introductory Biostatistics BIOTAT 640 Itermediate Biostatistics Frequetly Asked Questios Topic FAQ Review of BIOTAT 540 Itroductory Biostatistics. I m cofused about the jargo ad otatio, especially populatio versus sample. Could

More information

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

Computing Confidence Intervals for Sample Data

Computing Confidence Intervals for Sample Data Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

STAT 155 Introductory Statistics Chapter 6: Introduction to Inference. Lecture 18: Estimation with Confidence

STAT 155 Introductory Statistics Chapter 6: Introduction to Inference. Lecture 18: Estimation with Confidence The UNIVERSITY of NORTH CAROLINA at CHAPEL HILL STAT 155 Itroductory Statistics Chapter 6: Itroductio to Iferece Lecture 18: Estimatio with Cofidece 11/14/06 Lecture 18 1 Itroductio Statistical Iferece

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece

More information

Confidence Intervals for the Population Proportion p

Confidence Intervals for the Population Proportion p Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:

More information

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests

More information

Statistical Intervals for a Single Sample

Statistical Intervals for a Single Sample 3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea

More information

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2

Anna Janicka Mathematical Statistics 2018/2019 Lecture 1, Parts 1 & 2 Aa Jaicka Mathematical Statistics 18/19 Lecture 1, Parts 1 & 1. Descriptive Statistics By the term descriptive statistics we will mea the tools used for quatitative descriptio of the properties of a sample

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y. Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACATÓLICA Quatitative Methods Miguel Gouveia Mauel Leite Moteiro Faculdade de Ciêcias Ecoómicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACatólica 006/07 Métodos Quatitativos

More information

PubHlth 540 Estimation Page 1 of 69. Unit 6 Estimation

PubHlth 540 Estimation Page 1 of 69. Unit 6 Estimation PubHlth 540 Estimation Page 1 of 69 Unit 6 Estimation Topic 1. Introduction........ a. Goals of Estimation. b. Notation and Definitions. c. How to Interpret a Confidence Interval. Preliminaries: Some Useful

More information

MATH/STAT 352: Lecture 15

MATH/STAT 352: Lecture 15 MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised Questio 1. (Topics 1-3) A populatio cosists of all the members of a group about which you wat to draw a coclusio (Greek letters (μ, σ, Ν) are used) A sample is the portio of the populatio selected for

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

(7 One- and Two-Sample Estimation Problem )

(7 One- and Two-Sample Estimation Problem ) 34 Stat Lecture Notes (7 Oe- ad Two-Sample Estimatio Problem ) ( Book*: Chapter 8,pg65) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye Estimatio 1 ) ( ˆ S P i i Poit estimate:

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ), Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We

More information

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log

More information

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions

Chapter 11: Asking and Answering Questions About the Difference of Two Proportions Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,

More information

Analysis of Experimental Data

Analysis of Experimental Data Aalysis of Experimetal Data 6544597.0479 ± 0.000005 g Quatitative Ucertaity Accuracy vs. Precisio Whe we make a measuremet i the laboratory, we eed to kow how good it is. We wat our measuremets to be both

More information

AP Statistics Review Ch. 8

AP Statistics Review Ch. 8 AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.

More information

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008 Chapter 6 Part 5 Cofidece Itervals t distributio chi square distributio October 23, 2008 The will be o help sessio o Moday, October 27. Goal: To clearly uderstad the lik betwee probability ad cofidece

More information

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced

More information

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

6 Sample Size Calculations

6 Sample Size Calculations 6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Stat 139 Homework 7 Solutions, Fall 2015

Stat 139 Homework 7 Solutions, Fall 2015 Stat 139 Homework 7 Solutios, Fall 2015 Problem 1. I class we leared that the classical simple liear regressio model assumes the followig distributio of resposes: Y i = β 0 + β 1 X i + ɛ i, i = 1,...,,

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

MEASURES OF DISPERSION (VARIABILITY)

MEASURES OF DISPERSION (VARIABILITY) POLI 300 Hadout #7 N. R. Miller MEASURES OF DISPERSION (VARIABILITY) While measures of cetral tedecy idicate what value of a variable is (i oe sese or other, e.g., mode, media, mea), average or cetral

More information

Stat 225 Lecture Notes Week 7, Chapter 8 and 11

Stat 225 Lecture Notes Week 7, Chapter 8 and 11 Normal Distributio Stat 5 Lecture Notes Week 7, Chapter 8 ad Please also prit out the ormal radom variable table from the Stat 5 homepage. The ormal distributio is by far the most importat distributio

More information

Estimation of a population proportion March 23,

Estimation of a population proportion March 23, 1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1

PH 425 Quantum Measurement and Spin Winter SPINS Lab 1 PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the z-axis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured

More information

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234 STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Locatio CB 234 STA 291 - Lecture 19 1 Exam II Covers Chapter 9 10.1; 10.2; 10.3; 10.4; 10.6

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

Chapter 6. Sampling and Estimation

Chapter 6. Sampling and Estimation Samplig ad Estimatio - 34 Chapter 6. Samplig ad Estimatio 6.. Itroductio Frequetly the egieer is uable to completely characterize the etire populatio. She/he must be satisfied with examiig some subset

More information

Median and IQR The median is the value which divides the ordered data values in half.

Median and IQR The median is the value which divides the ordered data values in half. STA 666 Fall 2007 Web-based Course Notes 4: Describig Distributios Numerically Numerical summaries for quatitative variables media ad iterquartile rage (IQR) 5-umber summary mea ad stadard deviatio Media

More information

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples

More information

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D. ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally

More information

Final Examination Solutions 17/6/2010

Final Examination Solutions 17/6/2010 The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:

More information

Data Description. Measure of Central Tendency. Data Description. Chapter x i

Data Description. Measure of Central Tendency. Data Description. Chapter x i Data Descriptio Describe Distributio with Numbers Example: Birth weights (i lb) of 5 babies bor from two groups of wome uder differet care programs. Group : 7, 6, 8, 7, 7 Group : 3, 4, 8, 9, Chapter 3

More information

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion Poit Estimatio Poit estimatio is the rather simplistic (ad obvious) process of usig the kow value of a sample statistic as a approximatio to the ukow value of a populatio parameter. So we could for example

More information

GG313 GEOLOGICAL DATA ANALYSIS

GG313 GEOLOGICAL DATA ANALYSIS GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data

More information

Economics Spring 2015

Economics Spring 2015 1 Ecoomics 400 -- Sprig 015 /17/015 pp. 30-38; Ch. 7.1.4-7. New Stata Assigmet ad ew MyStatlab assigmet, both due Feb 4th Midterm Exam Thursday Feb 6th, Chapters 1-7 of Groeber text ad all relevat lectures

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

Understanding Dissimilarity Among Samples

Understanding Dissimilarity Among Samples Aoucemets: Midterm is Wed. Review sheet is o class webpage (i the list of lectures) ad will be covered i discussio o Moday. Two sheets of otes are allowed, same rules as for the oe sheet last time. Office

More information

(6) Fundamental Sampling Distribution and Data Discription

(6) Fundamental Sampling Distribution and Data Discription 34 Stat Lecture Notes (6) Fudametal Samplig Distributio ad Data Discriptio ( Book*: Chapter 8,pg5) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye 8.1 Radom Samplig: Populatio:

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA, 016 MODULE : Statistical Iferece Time allowed: Three hours Cadidates should aswer FIVE questios. All questios carry equal marks. The umber

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

2 Definition of Variance and the obvious guess

2 Definition of Variance and the obvious guess 1 Estimatig Variace Statistics - Math 410, 11/7/011 Oe of the mai themes of this course is to estimate the mea µ of some variable X of a populatio. We typically do this by collectig a sample of idividuals

More information

ANALYSIS OF EXPERIMENTAL ERRORS

ANALYSIS OF EXPERIMENTAL ERRORS ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder

More information