Chapter 6. Sampling and Estimation

Size: px
Start display at page:

Download "Chapter 6. Sampling and Estimation"

Transcription

1 Samplig ad Estimatio - 34 Chapter 6. Samplig ad Estimatio 6.. Itroductio Frequetly the egieer is uable to completely characterize the etire populatio. She/he must be satisfied with examiig some subset of the populatio, or several subsets of the populatio, i order to ifer iformatio about the etire populatio. Such subsets are called samples. A populatio is the etirety of observatios ad a sample is a subset of the populatio. A sample that gives correct ifereces about the populatio is a radom sample, otherwise it is biased. Statistics are give differet symbols tha the expectatio values because statistics are approximatios of the expectatio value. The statistic called the mea is a approximatio to the expectatio value of the mea. The statistic mea is the mea of the sample ad the expectatio value mea is the mea of the etire populatio. I order to calculate a expectatio, oe requires kowledge of the PDF. I practice, the motivatio i calculatig a statistic is that oe has o kowledge of the uderlyig PDF. 6.. Statistics Ay fuctio of the radom variables costitutig a radom sample is called a statistic. Example 6..: Mea The mea is a statistic of a radom sample of size ad is defied as X X i i (6.) Example 6..: Media The media is a statistic of a radom sample of size, which represets the middle value of the sample ad, for a samplig arraged i icreasig order of magitude, is defied as 34

2 Samplig ad Estimatio - 35 ~ X X ~ X X ( + / ) / + X ( + ) / for odd for eve (6.) The media of the sample space {,,3} is. The media of the sample space {3,,} is. The media of the sample space {,,3,4} is.5. Example 6.3.: Mode The mode is a statistic of a radom sample of size, which represets the most frequetly appearig value i the sample. The mode may ot exist ad, if it does, it may ot be uique. The mode of the sample space {,,,3} is. The mode of the sample space {,,,3,4,4} is ad 4. (bimodal) The mode of the sample space {,,3} does ot exist sice each etry occurs oly oce. Example 6.4.: Rage The rage is a statistic of a radom sample of size, which represets the spa of the sample ad, for a samplig arraged i icreasig order of magitude, is defied as rage(x) X -X (6.3) The rage of {,,3,4,5} is 5-4. Example 6.5.: Variace The variace is a statistic of a radom sample of size, which represets the spread of the sample ad is defied as S ( X X ) ( X i ) i i i i ( ) X i (6.4) The reaso for usig (-) i the deomiator rather tha is give later. Example 6.6.: Stadard Deviatio The stadard deviatio, s, is a statistic of a radom sample of size, which represets the spread of the sample ad is defied as the positive square root of the variace. S S (6.5) 35

3 Samplig ad Estimatio Samplig Distributios We have ow stated the defiitios of the statistics we are iterested i. Now, we eed to kow the distributio of the statistics to determie how good these samplig approximatios are to the true expectatio values of the populatio. Statistic. Mea whe the variace is kow: Samplig Distributio If X is the mea of a radom sample of size take from a populatio with mea µ ad variace, the the limitig form of the distributio of X µ Z (6.6) / as, is the stadard ormal distributio (z;0,). This is kow as the Cetral Limit Theorem. What this says is that, give a collectio of radom samples, each of size, yieldig a mea X, the distributio of X approximates a ormal distributio, ad becomes exactly a ormal distributio as the sample size goes to ifiity. The distributio of X does ot have to be ormal. Geerally, the ormal approximatio for X is good if > 30. We provide a derivatio i Appedix V provig that the distributio of the sample mea is give by the ormal distributio. Example 6.7.: distributio of the mea, variace kow I a reactor iteded to grow crystals i solutio, a seed is used to ecourage ucleatio. Idividual crystals are radomly sampled from the effluet of each reactor of sizes 0. The populatio has variace i crystal size of. 0 µm. (We must kow this from previous research.) The samples yield mea crystal sizes of x 5. 0 µm. What is the likelihood that the true populatio mea, µ, is actually less tha 4.0 µm? Z x µ / / 0 ( < 4 ) P( z > 3.6) P µ We have the chage i sig because as µ icreases, z decreases. The evaluatio of the cumulative ormal probability distributio ca be performed several ways. First, whe the pioeers were crossig the plais i their covered wagos ad they wated to evaluate probabilities from the ormal distributio, they used Tables of the cumulative ormal PDF, such as those provided i the back of the statistics textbook. These tables are also available olie. For example wikipedia has a table of cumulative stadard umeral PDFs at 36

4 Samplig ad Estimatio Usig the table, we fid ( < 4 ) P( z > 3.6) P( z < 3.6) P µ Secod, we ca use a moder computatioal tool like MATLAB to evaluate the probability. The problem ca be worked i terms of the stadard ormal PDF (µ 0 ad ), which for P µ < 4 P z > 3.6 P z < 3.6 is ( ) ( ) ( ) >> p - cdf('ormal',3.6,0,) p e-04 Alteratively, the problem ca be worked i terms of the o-stadard ormal PDF ( x 5 ad / / 0 P µ < 4 ), which for ( ) >> p cdf('ormal',4,5,/sqrt(0)) p e-04 The differece i these results is due to the roud-off i 3.6, used as a argumet i the fuctio call for the stadard ormal distributio. Based o our samplig data, the probability that the true sample mea is less tha 4.0 µm is 0.078%. Statistic. differece of meas whe the variace is kow: Samplig Distributio It is useful to kow the samplig differece of two meas whe you wat to determie whether there is a sigificat differece betwee two populatios. This situatio applies whe you takes two radom samples of size ad from two differet populatios, with meas µ ad µ ad variaces ad, respectively. The the samplig distributio of the differece of meas, X X X X, is approximately ormal, distributed with mea µ µ µ ad variace XX 37

5 Samplig ad Estimatio - 38 Hece, Z ( X X ) ( µ µ ) + (6.7) is approximately a stadard ormal variable. Example 6.8.: distributio of the differece of meas, variaces kow I a reactor iteded to grow crystals, two differet types of seeds are used to ecourage ucleatio. Idividual crystals are radomly sampled from the effluet of each reactor of sizes 0 ad 0. The populatios have variaces i crystal size of. 0 µm ad.0 µm. (We must kow this from previous research.) The samples yield mea crystal sizes of X 5. 0 µm ad X 0. 0 µm. How cofidet ca we be that the true differece i populatio meas, µ µ, is actually 4.0 µm or greater? Usig equatio (6.7) we have: Z ( X X ) ( µ µ ) ( 5 0) ( 4) ( µ > 4.0) P( z.36) P µ < We have the chage i sig because as µ icreases, z decreases. The probability that µ µ is greater 4.0 µm is the give by P(Z<.36). How do we kow that we wat P(Z<.36) ad ot P(Z>.36)? We just have to sit dow ad thik what the problem physically meas. Sice we wat the probability that µ µ is greater 4.0 µm, we kow we eed to iclude the area due to higher values of µ µ. Higher values of µ µ yield lower values of Z. Therefore, we eed the less tha sig. The evaluatio of the cumulative ormal probability distributio ca agai be performed two ways. First, usig a stadard ormal table, we have P(Z <. 4 ) Secod, usig MATLAB we have >> p cdf('ormal',.36,0,) 38

6 Samplig ad Estimatio - 39 p We expect 98.73% of the differeces i crystal size of the two populatios to be at least 4.0 µm. Statistic 3. Mea whe the variace is ukow: Samplig Distributio Of course, usually we do t kow the populatio variace. I that case, we have to use some other statistic to get a hadle o the distributio of the mea. If X is the mea of a radom sample of size take from a populatio with mea µ ad ukow variace, the the limitig form of the distributio of T X µ (6.8) S / as, is the t distributio f T ( t; v). The T-statistic has a t-distributio with v- degrees of freedom. The t-distributio is just aother cotiuous PDF, like the others we leared about i the previous sectio. The t distributio is give by Γ f ( t) Γ [( v + ) / ) ] ( v / ) πv t + v As a remider, the t distributio is plotted agai i Figure 6.. v+ Example 6.9.: distributio of the mea, variace ukow I a reactor iteded to grow crystals, a seed is used to ecourage ucleatio. Idividual crystals are radomly sampled from the effluet of each reactor of sizes 0. The populatio has ukow variace i crystal size. The samples yield mea crystal sizes of x 5. 0 µm ad a sample variace of s. 0µm. f(t) for < t < t Figure 6.. The t distributio as a fuctio of the degrees of freedom ad the ormal distributio. 39 ormal

7 Samplig ad Estimatio - 40 What is the likelihood that the true populatio mea, µ, is actually less tha 4.0 µm? t x µ s / / 0 ( < 4 ) P( t > 3.6) P µ We have the chage i sig because as µ icreases, t decreases. The parameter v - 9. The evaluatio of the cumulative t probability distributio ca agai be performed two ways. First, we ca use a table of critical values of the t-distributio. It is crucial to ote that such a table does ot provide cumulative PDFs, rather it provides oe mius the cumulative PDF. I other words, where as the stadard ormal table provides the probability less tha z (the cumulative PDF), the t-distributio table provides the probability greater tha t (oe mius the cumulative PDF). We the have ( < 4) P( t > 3.6) P µ Secod, usig MATLAB we have P ( µ < 4 ) P( t > 3.6) P( t < 3.6) >> p - cdf('t',3.6,9) p Based o our samplig data, the probability that the true sample mea is less tha 4.0 µm is 0.57%. We should poit out that our percetage here is substatially greater tha for our percetage whe we kew the populatio variace (0.078%). That is because kowig the populatio variace reduces our ucertaity. Approximatig the populatio variace with the samplig variace adds to the ucertaity ad results i a larger percetage of our populatio deviatig farther from the sample mea. Example 6.0.: distributio of the mea, variace ukow A egieer claims that the populatio mea yield of a batch process is 500 g/ml of raw material. To verify this, she samples 5 batches each moth. Oe moth the sample has a mea X 58 g ad a stadard deviatio of s40 g. Does this sample support his claim that µ 500 g? The first step i solvig this problem is to compute the T statistic. T X µ S / 40 / 5 40

8 Samplig ad Estimatio - 4 Secod, usig MATLAB we have P ( µ > 58) P( t <.5) >> p cdf('t',-.5,4) p (Or usig a Table, we fid that whe v4 ad T.5, 0.0). This meas there is oly a.6% probability that a populatio with µ 500 would yield a sample with X 58 or higher. Therefore, it is ulikely that 500 is the populatio mea. Statistic 4. differece of meas whe the variace is ukow: Samplig Distributio It is useful to kow the samplig differece of two meas whe you wat to determie whether there is a sigificat differece betwee two populatios. Sometimes you wat to do this whe you do t kow the populatio variaces. This situatio applies whe you takes two radom samples of size ad from two differet populatios, with meas µ ad µ ad ukow variaces. The the samplig distributio of the differece of meas, X X, follows the t- distributio. trasformatio: T symmetry: t t, ( X X ) ( µ µ ) s s + (6.9) parameters: v + if s s + parameters: v if s ( ) + ( ) s Sice we do t kow either populatio variace i this case, we ca t assume they are equal uless we are told they are equal. Example 6..: distributio of the differece of meas, variaces ukow I a reactor iteded to grow crystals, two differet types of seeds are used to ecourage ucleatio. Idividual crystals are radomly sampled from the effluet of each reactor of sizes 0 ad 0. The populatios have ukow variaces i crystal size. The samples yield 4

9 Samplig ad Estimatio - 4 mea crystal sizes of X 5. 0 µm ad 0. X 0 µm ad sample variaces of s. 0 µm ad s.0 µm. What percetage of true populatio differeces yieldig these samplig results would have a true differece i populatio meas, µ µ, of 4.0 µm or greater? T ( X X ) ( µ µ ) ( 5 0) ( 4) s s The degree of freedom parameter is give by: v s ( ) + ( ) ( 0 ) + ( 0 ) s s + s ( µ > 4.0) P( t <.36) P( t.36) P µ > The evaluatio of the cumulative ormal probability distributio ca agai be performed two ways. First, usig a table of critical values of the t-distributio, we have ( µ > 4.0) P( t <.36) P( t >.36) P µ Secod, usig MATLAB we have for P ( µ µ > 4.0) P( t.36) >> p cdf('t',.36,8) p < We expect 98.3% of the differeces i crystal size of the two populatios to be at least 4.0 µm. Statistic 5. Variace: Samplig Distributio We ow wish to kow the samplig distributio of the sample variace, S. If S is the variace of a radom sample of size take from a populatio with mea µ ad variace, the the statistic χ ( ) S ( X i X ) (6.0) i 4

10 Samplig ad Estimatio - 43 has a chi-squared distributio with v- degrees of freedom, f ( χ ; ). The chi-squared distributio is defied as f χ ( x; v) v / x Γ( v / ) 0 v/- e -x/ for x > 0 elsewhere χ It is a special case of the Gamma Distributio, whe v/ ad β, where v is called the degrees of freedom ad is a positive iteger. As a remider, we provide a plot of the chisquared distributio i Figure 6.. f(χ ) Example 6..: distributio of 0.06 the variace 0.04 I a reactor iteded to grow crystals, a seed is used to 0.0 ecourage ucleatio. Idividual 0 crystals are radomly sampled χ from the effluet of each reactor Figure 6.. The chi-squared distributio for various values of sizes 0. The samples of v. yield mea crystal sizes of x 5.0 µm ad a sample variace of s. 0µm. What is the likelihood that the true populatio variace,, is actually less tha 0.5 µm? ( ) S χ P (0 ) ( < 0.5) P( χ > 8) 9. We have the chage i sig because as icreases, χ decreases. The parameter v - The evaluatio of the cumulative χ probability distributio ca agai be performed two ways. First, we ca use a table of critical values of the χ -distributio. It is crucial to ote that such a table does ot provide cumulative PDFs, rather it provides oe mius the cumulative PDF. We the have 43

11 Samplig ad Estimatio - 44 P ( < 0.5) P( χ > 8) Secod, usig MATLAB we have P ( < 0.5) P( χ > 8) P( χ < 8) >> p - cdf('chi',8,9) p Based o our samplig data, the probability that the true variace is less tha 0.5 µm is 3.5%. Statistic 6. the ratio of Variaces: Samplig Distributio (F-distributio) Just as we studied the distributio of two sample meas, so too are we iterested i the distributio of two variaces. I the case of the mea, it was a differece. I the case of the variace, the ratio is more useful. Now cosider samplig two radom samples of size ad from two differet populatios, with meas ad, respectively. The statistic, F, S / S F (6.) S / S provides a distributio of the ratio of two variaces. This distributio is called the F-distributio with v ad v degrees of freedom. The f-distributio is defied as h ( f ; v, v ) f v + v v Γ v v v Γ Γ 0 v v + v f v + v v f for f > 0 elsewhere As a remider, the f-distributio is plotted i Figure 6.3. Example 6.3.: ratio of the variaces I a reactor iteded to grow crystals, two differet types of seeds are used to ecourage ucleatio. Idividual crystals are radomly sampled from the effluet of each reactor of sizes 0 ad 0. The populatios have ukow variaces i crystal size. The samples yield 44

12 Samplig ad Estimatio - 45 mea crystal sizes of X 5. 0 µm ad X 0. 0 µm ad sample variaces of s. 0 µm ad s.0 µm. What is the probability that the ratio of variaces,, is less tha 0.5? h(f ) v 0, v0 v0, v0 v 0, v5 v5, v0 v5, v0 v5, v5 S F S 0.5 P < 0.5 > P F ( ) f Figure 6.3. The F distributio for various values of v ad v. We have the chage i sig because as icreases, F decreases. The parameters are v 9 ad v 9. The evaluatio of the cumulative F probability distributio ca agai be performed i oe way. We caot use tables because there are o tables for arbitrary values of the probability. There are oly tables for two values of the probability, 0.0 ad Therefore, usig MATLAB we have P < 0.5 ( > ) ( < ) P F P F >> p - cdf('f',,9,9) p Based o our samplig data, the probability that the ratio of variaces is less tha 0.5 is 9.7% Cofidece Itervals I the previous sectio we showed what types of distributios describe various statistics of a radom sample. I this sectio, we discuss estimatig the populatio mea ad variace from the sample mea ad variace. I additio, we itroduce cofidece itervals to quatify the goodess of these estimates. 45

13 Samplig ad Estimatio - 46 A cofidece iterval is some subset of radom variable space with which someoe ca say somethig like, I am 95% sure that the true populatio mea is betwee µ low ad µ hi. I this sectio, we discuss how a cofidece iterval is defied ad calculated. The cofidece iterval is defied by a percet. This percet is called (-). So if 0.05, the you would have a 90% cofidece iterval. The cocept of a cofidece iterval is illustrated i graphical terms i Figure 6.4. Figure 6.4. A schematic illustratig a cofidece iterval. The trick the is to fid µ low z ad µ hi z so that you ca say for a give, I am ( )% cofidet that µ low < µ < µ hi. Statistic. mea, kow: cofidece iterval We ow kow that the sample mea is distributed with the stadard ormal distributio. For a symmetric PDF, cetered aroud zero, like the stadard ormal, µ low µ hi. We ca the make the statemet: P ( z < Z < z ) Now the ormal distributio is symmetric about the y-axis so we ca write z z so P ( z < Z < z P z < Z < z ) ( ) 46

14 where Samplig ad Estimatio - 47 Z X µ. / We ca rearrage this to equatio to read P ( X + z < µ < X z ) (6.) where we ow have µ low ad µ hi explicitly. Example 6.4.: cofidece iterval o mea, variace kow Samples of dioxi cotamiatio i 36 frot yards i St. Louis show a cocetratio of 6 ppm. Fid the 95% cofidece iterval for the populatio mea. Assume that the stadard deviatio is.0 ppm. To solve this, first calculate, z, z z z z z.96 The z value came from a stadard ormal table. Alteratively, we ca compute this value from MATLAB, >> z icdf('ormal',0.05,0,) z Here we used the iverse cumulative distributio fuctio (icdf) commad. Sice we have the stadard ormal PDF, the mea is 0 ad the variace is. The value of 0.05 correspods to alpha, the probability. To get the value of the other limit, we either rely o symmetry, or compute it directly, >> z icdf('ormal',0.975,0,) z Note that these values of z are idepedet of all aspects of the problem except the value of the cofidece iterval. 47

15 Samplig ad Estimatio - 48 Therefore, by equatio (6.) P ( 6 + (.96) < µ < X (.96) so the 95% cofidece iterval for the mea is < µ < Statistic. mea, ukow: cofidece iterval Now usually, we do t kow the variace. We have to use our estimate of the variace, s, for. I that case, estimatig the mea requires the T-distributio. (See previous sectio.) Let me stress that we do everythig exactly as we did before but we use s for ad use the t-distributio istead of the ormal distributio. Remember the t-distributio is also symmetric about the origi, so t t. (this meas you oly have to compute the t probability oce. Remember, v-. where P ( t < T < t ) P( t < T < t ) T X µ. s / Just as before, we ca rearrage this to equatio to read s s P ( X + t < µ < X t ) (6.3) where we ow have µ low ad µ hi explicitly. Example 6.5.: cofidece iterval o mea, variace ukow Samples of dioxi cotamiatio i 36 frot yards i St. Louis show a cocetratio of 6 ppm. Fid the 95% cofidece iterval for the populatio mea. The sample stadard deviatio, s, was measured to be.0. To solve this, first calculate, t, t for v t t t t

16 Samplig ad Estimatio - 49 The t value came from a table of t-distributio values. Alteratively, we ca compute this value usig MATLAB, >> t icdf('t',0.05,35) t ad for the upper limit >> t icdf('t',0.975,35) t , which ca also be obtaied by symmetry. Note that these values of t are idepedet of all aspects of the problem except the value of the cofidece iterval ad the umber of sample poits,. Therefore, by equatio (6.3) P ( 6 (.03) < µ < X + (.03) so the 95% cofidece iterval for the mea is 5.66 < µ < You should ote that we are a little less cofidet about the mea whe we use the sample variace as the estimate for the populatio variace, for which the 95% cofidece iterval for the mea was < µ < Statistic 3. differece of meas, kow: cofidece iterval The exact same derivatio that we used above for a sigle mea ca be used for the differece of meas. Whe we the variaces of the two samples are kow, we have: P ( X X ) + z + < ( µ µ ) < ( X X ) z + (6.4) where z is a radom variable obeyig the stadard ormal PDF. Example 6.6.: cofidece iterval o the differece of meas, variaces kow Samples of dioxi cotamiatio i 36 frot yards i Times Beach, a suburb of St. Louis, show a cocetratio of 6 ppm with a populatio variace of.0 ppm. Samples of dioxi cotamiatio i 6 frot yards i Quail Ru, aother suburb of St. Louis, show a cocetratio of 8 ppm with a populatio variace of 3.0 ppm. Fid the 95% cofidece iterval for the differece of populatio meas.. 49

17 Samplig ad Estimatio - 50 To solve this, first calculate, z, z z z z z.96 The z value came from a table of stadard ormal PDF values. Alteratively, we ca compute this value from MATLAB, >> z icdf('ormal',0.05,0,) z Therefore, by equatio (6.6) P P ( 8).96 + < ( µ µ ) < ( 6 8) (0.05) 36 6 [.909 < ( µ µ ) <.09] So the 95% cofidece iterval for the mea is.909 < ( µ µ ) <. 09. If we are determiig which site is more cotamiated, the we are 95% sure that site (Quail Ru) is more cotamiated by to 3 ppm tha site, (Times Beach). Statistic 4. differece of meas, ukow: cofidece iterval Whe we the variaces of the two samples are ukow, we have: s s s s P ( X X ) + t + < ( µ µ ) < ( X X ) + t + (6.5) where the umber of degrees of freedom for the t-distributio is v + if 50

18 Samplig ad Estimatio ( ) ( ) + + s s s s v if Example 6.6.: cofidece iterval o the differece of meas, variaces ukow Samples of dioxi cotamiatio i 36 frot yards i Times Beach, a suburb of St. Louis, show a cocetratio of 6 ppm with a sample variace of.0 ppm. Samples of dioxi cotamiatio i 6 frot yards i Quail Ru, aother suburb of St. Louis, show a cocetratio of 8 ppm with a sample variace of 3.0 ppm. Fid the 95% cofidece iterval for the differece of populatio meas.. To solve this, first calculate,, t t. ( ) ( ) ( ) ( ) s s s s v t t t t The t value came from a table of t-pdf values. Alteratively, we ca compute this value usig MATLAB, >> t icdf('t',0.05,0) t Therefore, substitutig ito equatio (6.5) yields ( ) ( ) ( ) ) ( < < + µ µ P ( ) [ ] < < µ µ P

19 So the 95% cofidece iterval for the mea is.97 < ( µ ) <. 03 µ. Samplig ad Estimatio - 5 If we are determiig which site is more cotamiated, the we are 95% sure that site (Quail Ru) is more cotamiated by to 3 ppm tha site, (Times Beach). Statistic 5. variace: cofidece iterval The cofidece iterval of the variace ca be estimated i a precisely aalogous way, kowig that the statistic ( ) S ( X i X ) χ i has a chi-squared distributio with v- degrees of freedom, f ( χ ; ). So χ ( ) ( ) P < < (6.6) χ χ Perversely, the tables of the critical values for the χ distributio, have defied to be -, so the idices have to be switched whe usig the table. ( ) ( ) P < < whe usig the χ critical values table oly! χ χ If you get cofused, just remember that the upper limit must be greater tha the lower limit. Remember also that the f ( χ ; ) is ot symmetric about the origi, so we caot use the χ symmetry argumets used for the cofidece itervals for fuctios of the mea. Example 6.7.: variace Samples of dioxi cotamiatio i 6 frot yards i St. Louis show a cocetratio of 6 ppm. Fid the 95% cofidece iterval for the populatio mea. The sample stadard deviatio, s, was measured to be.0. To solve this, first calculate χ, χ. For v 5, we have, χ χ 0.05 χ χ

20 Samplig ad Estimatio - 53 The t value came from a table of value usig MATLAB, >> chi icdf('chi',0.05,5) chi ad >> chi icdf('chi',0.975,5) chi Therefore, substitutig ito equatio (6.6) yields χ -distributio values. Alteratively, we ca compute this ( 6 ).0 P < (6 ).0 < (0.05) 6.6 P ( < <.395) So the 95% cofidece iterval for the mea is < < Statistic 6. ratio of variaces: cofidece iterval (p. 53) The ratio of two populatio variaces ca be estimated i a precisely aalogous way, kowig that the statistic S / F S / S S follows the F-distributio with v ad v degrees of freedom. Remember, the F- distributio has a symmetry, f / ( v, v). This symmetry relatio is essetial if oe f / ( v, v ) is to use tables for the critical value of the F-distributio. It is ot essetial if oe uses MATLAB commads. If oe is computig the cumulative PDF for the f distributio, the oe simply, rearrages this equatio for 53

21 Samplig ad Estimatio - 54 S F S F S S S S P < < (6.7) S f ( v, v ) S f ( v, v ) Oe otes that the order of the limits has chaged here, sice as goes up, F goes dow. I ay case, the lower limit must be smaller tha the upper limit. If oe chooses to use tables of critical values, oe must take ito accout two idiosycrasies of the procedure. First, as was the case with the t ad chi-squared distributios, the table provide the probability that f is greater tha a value, ot the cumulative PDF, which is the probability that f is less tha a value. Secod, the tables oly provide data for small values of. Therefore, we must elimiate all istaces of -., usig a symmetry relatio. The result is S S P < < f ( v, v ) whe usig the tables oly! S f ( v, v) S Example 6.8.: cofidece iterval o the ratio of variaces Samples of dioxi cotamiatio i 0 frot yards i Times Beach, a suburb of St. Louis, show a cocetratio of 6 ppm with a sample variace of.0 ppm. Samples of dioxi cotamiatio i 6 frot yards i Quail Ru, aother suburb of St. Louis, show a cocetratio of 8 ppm with a sample variace of 3.0 ppm. Fid the 90% cofidece iterval for the differece of populatio meas.. To solve this, first calculate, F, F, with v 9 ad v We ca compute the f probabilities usig MATLAB, >> f icdf('f',0.05,9,5) f

22 Samplig ad Estimatio - 55 ad >> f icdf('f',0.95,9,5) f Substitutig ito equatio (6.6) yields P < < P 0.45 < < (0.05) Alteratively, we ca use the table of critical values F F 0.05 F ( v 0.05 F 5, v 0.05 ( v 9, v 9).3 5) F 0.05 ( v 0, v 5).33 P 3.33 P 0.43 < < <.3 (0.05) 3 < So the 90% cofidece iterval for the mea is.45 < < If we are determiig which site has a greater variace of cotamiatio levels the we are 90% sure that site (Quail Ru) has more variace by a factor of.3 to Problems We ited to purchase a liquid as a raw material for a material we are desigig. Two vedors offer us samples of their product ad a statistic sheet. We ru the samples i our ow labs ad come up with the followig data: 55

23 Samplig ad Estimatio - 56 Vedor Vedor sample # outcome sample # outcome Vedor Specificatio Claims: Vedor : µ. 0 ad 0. 05, 0.36 Vedor : µ. 3 ad 0., Sample statistics, based o the data provided i the table above. 6 i x ( x i x ) 6.80 x i i x ( ) [ ] 0.09 s 53 6 s 0. 0 [ ] x i.8 s 0 i 0 i x i x s 0.78 Problem 6.. Determie a 95% cofidece iterval o the mea of sample. Use the value of the populatio variace give. Is the give populatio mea legitimate? Problem 6.. Determie a 95% cofidece iterval o the differece of meas betwee samples ad. Use the values of the populatio variace give. Is the differece betwee the give populatio meas legitimate? 56

24 Samplig ad Estimatio - 57 Problem 6.3. Determie a 95% cofidece iterval o the mea of sample. Assume the give values of the populatio variaces are suspect ad ot to be trusted. Is the give populatio mea legitimate? Problem 6.4. Determie a 95% cofidece iterval o the differece of meas betwee samples ad. Assume the give values of the populatio variaces are suspect ad ot to be trusted. Is the differece betwee the give populatio meas legitimate? Problem 6.5. Determie a 95% cofidece iterval o the variace of sample. Is the give populatio variace legitimate? Problem 6.6. Determie a 98% cofidece iterval o the ratio of variace of samples &. Is the ratio of the give populatio variaces legitimate? 57

Statistics 511 Additional Materials

Statistics 511 Additional Materials Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability

More information

Stat 421-SP2012 Interval Estimation Section

Stat 421-SP2012 Interval Estimation Section Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible

More information

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Discrete Mathematics for CS Spring 2008 David Wagner Note 22 CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Expectation and Variance of a random variable

Expectation and Variance of a random variable Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio

More information

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be

More information

Module 1 Fundamentals in statistics

Module 1 Fundamentals in statistics Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly

More information

6.3 Testing Series With Positive Terms

6.3 Testing Series With Positive Terms 6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial

More information

Chapter 8: Estimating with Confidence

Chapter 8: Estimating with Confidence Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig

More information

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008

Chapter 6 Part 5. Confidence Intervals t distribution chi square distribution. October 23, 2008 Chapter 6 Part 5 Cofidece Itervals t distributio chi square distributio October 23, 2008 The will be o help sessio o Moday, October 27. Goal: To clearly uderstad the lik betwee probability ad cofidece

More information

Frequentist Inference

Frequentist Inference Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for

More information

Random Variables, Sampling and Estimation

Random Variables, Sampling and Estimation Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig

More information

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Chapter 6 Sampling Distributions

Chapter 6 Sampling Distributions Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to

More information

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based

More information

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ), Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We

More information

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting

Lecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would

More information

Computing Confidence Intervals for Sample Data

Computing Confidence Intervals for Sample Data Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios

More information

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234 STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Locatio CB 234 STA 291 - Lecture 19 1 Exam II Covers Chapter 9 10.1; 10.2; 10.3; 10.4; 10.6

More information

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n. Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator

More information

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample. Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized

More information

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1. Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio

More information

1 Inferential Methods for Correlation and Regression Analysis

1 Inferential Methods for Correlation and Regression Analysis 1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet

More information

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates. 5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece

More information

Power and Type II Error

Power and Type II Error Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error

More information

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to: STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the

More information

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio

More information

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals 7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses

More information

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals

More information

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4 MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.

More information

Properties and Hypothesis Testing

Properties and Hypothesis Testing Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.

More information

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The

More information

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01 ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly

More information

Basis for simulation techniques

Basis for simulation techniques Basis for simulatio techiques M. Veeraraghava, March 7, 004 Estimatio is based o a collectio of experimetal outcomes, x, x,, x, where each experimetal outcome is a value of a radom variable. x i. Defiitios

More information

Estimation of a population proportion March 23,

Estimation of a population proportion March 23, 1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes

More information

1 Approximating Integrals using Taylor Polynomials

1 Approximating Integrals using Taylor Polynomials Seughee Ye Ma 8: Week 7 Nov Week 7 Summary This week, we will lear how we ca approximate itegrals usig Taylor series ad umerical methods. Topics Page Approximatig Itegrals usig Taylor Polyomials. Defiitios................................................

More information

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret

More information

Lecture 2: Monte Carlo Simulation

Lecture 2: Monte Carlo Simulation STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?

More information

Lecture 7: Properties of Random Samples

Lecture 7: Properties of Random Samples Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ

More information

1.010 Uncertainty in Engineering Fall 2008

1.010 Uncertainty in Engineering Fall 2008 MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval

More information

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion

µ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion Poit Estimatio Poit estimatio is the rather simplistic (ad obvious) process of usig the kow value of a sample statistic as a approximatio to the ukow value of a populatio parameter. So we could for example

More information

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1 October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces

More information

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test. Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal

More information

Simulation. Two Rule For Inverting A Distribution Function

Simulation. Two Rule For Inverting A Distribution Function Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump

More information

ANALYSIS OF EXPERIMENTAL ERRORS

ANALYSIS OF EXPERIMENTAL ERRORS ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder

More information

This is an introductory course in Analysis of Variance and Design of Experiments.

This is an introductory course in Analysis of Variance and Design of Experiments. 1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class

More information

(6) Fundamental Sampling Distribution and Data Discription

(6) Fundamental Sampling Distribution and Data Discription 34 Stat Lecture Notes (6) Fudametal Samplig Distributio ad Data Discriptio ( Book*: Chapter 8,pg5) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye 8.1 Radom Samplig: Populatio:

More information

Statistical Intervals for a Single Sample

Statistical Intervals for a Single Sample 3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea

More information

NICK DUFRESNE. 1 1 p(x). To determine some formulas for the generating function of the Schröder numbers, r(x) = a(x) =

NICK DUFRESNE. 1 1 p(x). To determine some formulas for the generating function of the Schröder numbers, r(x) = a(x) = AN INTRODUCTION TO SCHRÖDER AND UNKNOWN NUMBERS NICK DUFRESNE Abstract. I this article we will itroduce two types of lattice paths, Schröder paths ad Ukow paths. We will examie differet properties of each,

More information

Confidence Level We want to estimate the true mean of a random variable X economically and with confidence.

Confidence Level We want to estimate the true mean of a random variable X economically and with confidence. Cofidece Iterval 700 Samples Sample Mea 03 Cofidece Level 095 Margi of Error 0037 We wat to estimate the true mea of a radom variable X ecoomically ad with cofidece True Mea μ from the Etire Populatio

More information

(7 One- and Two-Sample Estimation Problem )

(7 One- and Two-Sample Estimation Problem ) 34 Stat Lecture Notes (7 Oe- ad Two-Sample Estimatio Problem ) ( Book*: Chapter 8,pg65) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye Estimatio 1 ) ( ˆ S P i i Poit estimate:

More information

Estimation for Complete Data

Estimation for Complete Data Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of

More information

1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable

More information

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10 DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set

More information

Chapter 23: Inferences About Means

Chapter 23: Inferences About Means Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For

More information

Statistics 300: Elementary Statistics

Statistics 300: Elementary Statistics Statistics 300: Elemetary Statistics Sectios 7-, 7-3, 7-4, 7-5 Parameter Estimatio Poit Estimate Best sigle value to use Questio What is the probability this estimate is the correct value? Parameter Estimatio

More information

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece

More information

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.

The variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2. SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample

More information

Bayesian Methods: Introduction to Multi-parameter Models

Bayesian Methods: Introduction to Multi-parameter Models Bayesia Methods: Itroductio to Multi-parameter Models Parameter: θ = ( θ, θ) Give Likelihood p(y θ) ad prior p(θ ), the posterior p proportioal to p(y θ) x p(θ ) Margial posterior ( θ, θ y) is Iterested

More information

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f. Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,

More information

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more

More information

Parameter, Statistic and Random Samples

Parameter, Statistic and Random Samples Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,

More information

Data Analysis and Statistical Methods Statistics 651

Data Analysis and Statistical Methods Statistics 651 Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio

More information

Understanding Samples

Understanding Samples 1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We

More information

Common Large/Small Sample Tests 1/55

Common Large/Small Sample Tests 1/55 Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio

More information

Statistical inference: example 1. Inferential Statistics

Statistical inference: example 1. Inferential Statistics Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either

More information

An Introduction to Randomized Algorithms

An Introduction to Randomized Algorithms A Itroductio to Radomized Algorithms The focus of this lecture is to study a radomized algorithm for quick sort, aalyze it usig probabilistic recurrece relatios, ad also provide more geeral tools for aalysis

More information

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech

Final Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x

More information

The Growth of Functions. Theoretical Supplement

The Growth of Functions. Theoretical Supplement The Growth of Fuctios Theoretical Supplemet The Triagle Iequality The triagle iequality is a algebraic tool that is ofte useful i maipulatig absolute values of fuctios. The triagle iequality says that

More information

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight) Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed

More information

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen) Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................

More information

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give

More information

AP Statistics Review Ch. 8

AP Statistics Review Ch. 8 AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.

More information

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible

More information

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo

More information

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters? CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter

More information

Statisticians use the word population to refer the total number of (potential) observations under consideration

Statisticians use the word population to refer the total number of (potential) observations under consideration 6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space

More information

Chapter 2 The Monte Carlo Method

Chapter 2 The Monte Carlo Method Chapter 2 The Mote Carlo Method The Mote Carlo Method stads for a broad class of computatioal algorithms that rely o radom sampligs. It is ofte used i physical ad mathematical problems ad is most useful

More information

The standard deviation of the mean

The standard deviation of the mean Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider

More information

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M. MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval

More information

Infinite Sequences and Series

Infinite Sequences and Series Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet

More information

Read through these prior to coming to the test and follow them when you take your test.

Read through these prior to coming to the test and follow them when you take your test. Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1

More information

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9 BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous

More information

Binomial Distribution

Binomial Distribution 0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible

More information

Output Analysis (2, Chapters 10 &11 Law)

Output Analysis (2, Chapters 10 &11 Law) B. Maddah ENMG 6 Simulatio Output Aalysis (, Chapters 10 &11 Law) Comparig alterative system cofiguratio Sice the output of a simulatio is radom, the comparig differet systems via simulatio should be doe

More information

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22 CS 70 Discrete Mathematics for CS Sprig 2007 Luca Trevisa Lecture 22 Aother Importat Distributio The Geometric Distributio Questio: A biased coi with Heads probability p is tossed repeatedly util the first

More information

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion 1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter

More information

Sample Size Determination (Two or More Samples)

Sample Size Determination (Two or More Samples) Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie

More information

Chapter 2 Descriptive Statistics

Chapter 2 Descriptive Statistics Chapter 2 Descriptive Statistics Statistics Most commoly, statistics refers to umerical data. Statistics may also refer to the process of collectig, orgaizig, presetig, aalyzig ad iterpretig umerical data

More information

Lecture 3. Properties of Summary Statistics: Sampling Distribution

Lecture 3. Properties of Summary Statistics: Sampling Distribution Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary

More information

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process. Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike

More information

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2 Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:

More information

Confidence Intervals

Confidence Intervals Cofidece Itervals Berli Che Deartmet of Comuter Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Referece: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chater 5 & Teachig Material Itroductio

More information

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially

More information

Kinetics of Complex Reactions

Kinetics of Complex Reactions Kietics of Complex Reactios by Flick Colema Departmet of Chemistry Wellesley College Wellesley MA 28 wcolema@wellesley.edu Copyright Flick Colema 996. All rights reserved. You are welcome to use this documet

More information

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios

More information