FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
|
|
- Myrtle Wilkerson
- 5 years ago
- Views:
Transcription
1 FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals for meas 6. Hypothesis testig. Mea ad stadard error of sample data Two differet radom variables ca be measured for the same object: e.g. the height ad weight of a perso. Both these radom variables have distributios, mea values ad variaces. However, taller people are usually heavier tha shorter people so these two variables are depedet. Variables ca also be idepedet: idepedet evets P (A B) = P(A)P(B) idepedet discrete radom variables P (X = x i Y = y j )=P(X=x i )P(Y =y j ) The latter defies a joit distributio for the two radom variables. Ex. A ew plat at a maufacturig site has to be istalled ad the commissioed. The times required for the two steps deped upo differet radom factors, ad ca therefore be regarded as idepedet. Based o past experiece the respective distributios for X (istallatio time) ad Y (commissioig time), both i days, are P (X =)=0., P(X =4)=0.5, P(X =5)=0. P(Y =)=0.4, P(Y =)=0.6 Fid the joit distributio for X ad Y, ad the probability that the total time will ot exceed 6 days. Sice the factors are idepedet P (X = x i Y = y j )=P(X=x i )P(Y =y j ). Thus the joit probability table is Y/X 4 5 Y/X i.e Note that the colum ad row totals give the idividual distributios for X ad Y. For the secod part of the questio P (X + Y 6) = P (X = Y =)+P(X= Y =)+P(X=4 Y =) : o other combiatios are possible. The joit distributios ca the be read from the table givig P (X + Y 6) = = 0.40 It would be easy to calculate P (X +Y = w i )foreachw i, ad the evaluate E(X +Y ) usig the expressio E(X + Y )= w i w i P(X+Y =w i ).
2 However, we ca use a more geeral result. Give that E(X) = x i P(X=x i ) ad that E(Y) satisfies x i a similar formula, the it ca be show that for idepedet variables E(X + Y )=E(X)+E(Y) ad Var(X +Y)=Var(X)+Var(Y). Give some data we usually do ot kow the exact distributio but it would be useful to try to estimate the mea ad variace from the data Def. For a sample {X,X,..., X } of data the sample average is defied by X = X i i= Def. For sample data {X,X,..., X } the sample variace is defied by S X = (X i X). i= [Note that is used i the deomiator i S X because the differeces X i X sum to zero ad so are ot idepedet.] Ex. A die was tossed six times producig the followig results 6,, 4,,, 5. Fid the sample average ad the stadard variace. X = 0 ( )= 6 6 = 0. [ ( S X = 6 0 ) ( + 0 ) ( ) ( + 0 ) ( + 0 ) ( ) ] 6 [ (8 = ) ( + 4 ( ) 4 ( + ) ) 7 ] 5 + = = = S X.97 [For ubiased die it ca be show that the theoretical results for X ad S X are.5 ad.708 respectively. ] for a large umber of tosses. Biomial distributio Specifyig the exact distributio of a radom variable requires a lot of iformatio. Good estimates of mea, variace etc. ca be obtaied from data. Ofte probability distributios ca be determied by formulae usig the estimated values of the parameters. Cosider the simple coi tossig experimet where oly two outcomes are possible success (say ) or failure (say 0). A Beroulli trial is a simple observatio of a radom variable X, say, that ca take the values or0:suppose P(X=)=p, P (X =0)= p. The E(X) =X=(p)+0( p)=p+0=p, ad Var(X) =E(X X) =E(X p) =(0 p) P(X=0)+( p) P(X=) =p ( p)+( p) p=p( p)(p +( p)) = p( p). Now let {X,..., X } deote idepedet Beroulli trials, each with success probability p.the umber of successes Y = X + X X. Suppose Y = k,where 0 k,the k of the X i values equal ad k equal 0. The probability of k s at first with k 0 s followig is p k ( p) k, sice the outcomes are idepedet. The umber of ways of distributig k successes amog trials = hece P (Y = k) = = k p k ( p) k. k! ( k)!k!,
3 This is the geeral form of the biomial distributio ad leads to mea = p, variace = p( p). Ex. If o average i 0 of a certai type of colum fails uder loadig, what is the probability that amog 6 such colums at most will fail? Give P (colum failig) = P (F )= 9 =0.05, P(colum ot failig) = P (F )= 0 0 =0.95, therefore P (0 F )=(0.95) 6 =0.440, P ( F )=(0.05) (0.95) 5 6 P ( F )=(0.05) (0.95) 4 6 P (at most fail) = = =(0.05)(0.469) 6! =(0.05)(0.469)(6) = !! =(0.005)( ) 6! 4!! =(0.005)( )(6)(5) = Normal distributio This occurs very frequetly i practice. Def. A cotiuous radom variable X has a ormal distributio (or Gaussia distributio) with mea µ X ad variace σ X if the probability desity fuctio satisfies f X (x) = exp [ (x µ X) ] π σ X ( <x< ). Write X N (µ X,σ X ) to represet a radom variable with ormal distributio which has mea µ X variace σ X (see figure ). f X 0.4 µ =0, σ = X X ad µ X =0, = figure 6 x The stadard ormal distributio has mea 0 ad variace, ad leads to Def. Stadard ormal cumulative distributio is Φ(z) = z e x / dx. π Φ(z) is usually tabulated, ad a summary of values appears o the Formula Sheet. Note that Φ(z) deotes the area uder the probability distributio curve to the left of x = z. Suppose that X is a ormal variable with mea µ X ad variace σ X the it ca be show that Z = X µ X is also a ormal variable with mea 0 ad variace. (The latter result is very useful i applicatios.)
4 Ex 4. The burig time X of a experimetal rocket is a radom variable havig (approximately) a ormal distributio with mea 600s ad stadard deviatio 5s. Fid the probability that such a rocket will bur for (a) less tha 550s, (b) more tha 67.5s. (a) Give µ X = 600 ad = 5, therefore where Z = X (b) ( X 600 P (X <550) = P < 5 ) = P (Z < ), 5. From symmetry (look at figure ), ad the usig the results o the Formula Sheet, P (Z < ) = P (Z >) = P (Z ) = Φ() = = 0.08 X P (X >67.5) = P > = P (Z >.5) 5 5 = P(Z.5) = Φ(.5) = 0.9 = Whe >0 the it ca be show that the ormal distributio with mea p ad variace p( p) provides a very accurate approximatio to the biomial distributio. 4. Samplig Oe of the major problems i statistics, estimatig the properties of a large populatio from the properties of a sample of idividuals chose from that populatio, is cosidered i this sectio. Select at radom a sample of observatios X,X,..., X take from a populatio. From these observatios you ca calculate the values of a umber of statistical quatities, for example the sample mea X. If you choose aother radom sample of size from the same populatio, a differet value of the statistic will, i geeral, result. I fact, if repeated radom samples are take, you ca regard the statistic itself as a radom variable, ad its distributio is called the samplig distributio of the statistic. For example, cosider the distributio of heights of all adult me i Eglad, which is kow to coform very closely to the ormal curve. Take a large umber of samples of size four, draw at radom from the populatio, ad calculate the mea height of each sample. How will these mea heights be distributed? We fid that they are also ormally distributed about the same mea as the origial distributio. However, a radom sample of four is likely to iclude me both above ad below average height ad so the mea of the sample will deviate from the true mea less tha a sigle observatio will. This importat geeral result ca be stated as follows: If radom samples of size are take from a distributio whose mea is µ X ad whose stadard deviatio is, the the sample meas form a distributio with mea µ X ad stadard deviatio = /. Note that the theorem holds for all distributios of the paret populatio. However, if the paret distributio is ormal the it ca be show that the samplig distributio of the sample mea is also ormal. The stadard deviatio of the sample mea, defied above, is usually called the stadard error of the sample mea. Let us ow preset three worked examples. Ex 5. A radom sample is draw from a populatio with a kow stadard deviatio of.0. Fid the stadard error of the sample mea if the sample is of size (i) 9, (ii) 00. What sample size would give a stadard error equal to 0.5? Usig the result stated earlier (i) stadard error = σx = =0.667, 9 to decimal places, 4
5 (ii) stadard error = σx = 00 =0. If the stadard error equals 0.5, the / =0.5. Squarig the implies that 4/ =0.5 or =6, (i.e. the sample size is 6). Ex 6. The diameters of shafts made by a certai maufacturig process are kow to be ormally distributed with mea.500 cm ad stadard deviatio cm. What is the distributio of the sample mea diameter of ie such shafts selected at radom? Calculate the percetage of such sample meas which ca be expected to exceed.506 cm. Sice the process is ormal we kow that the samplig distributio of the sample mea will also be ormal, with the same mea,.500 cm, but with a stadard error (or stadard deviatio) =0.009/ 9=0.00 cm. I order to calculate the probability that the sample mea is bigger tha.506, i.e. X>.506, we stadardise i the usual way by puttig Z =(X.500)/0.00, ad the X P (X >.506) = P > = P (Z >.0) = P(Z.0) = Φ(.0) =.977 = 0.08, usig the formula sheet Hece,.8% of the sample meas ca be expected to exceed.506 cm. Ex 7. What is the probability that a observed value of a ormally distributed radom variable lies withi oe stadard deviatio from the mea? Give ormally distributed radom variable, X, has mea µ X ad stadard deviatio, i.e. X N(µ X,σ X ). We eed to calculate P (µ X X µ X + ). Defie Z = X µ X,the Z N(0, ). It follows that ( (µx ) µ X P (µ X X µ X + )=P X µ X (µ ) X + ) µ X =P( Z ) = P (0 Z ), by symmetry = (Φ() Φ(0)) = ( ) =(0.4) = It was stated above that whe the paret distributio is ormal the the samplig distributio of the sample mea is also ormal. Whe the paret distributio is ot ormal, the obtai the followig theorem (surprisig result?): Cetral limit theorem If a radom sample of size, ( 0), is take from ANY distributio with mea µ X ad stadard deviatio, the the samplig distributio of X is approximately ormal with mea µ X ad stadard deviatio /, the approximatio improvig as icreases. Ex 8. It is kow that a particular make of light bulb has a average life of 800 hrs with a stadard deviatio of 48 hrs. Fid the probability that a radom sample of 44 bulbs will have a average life of less tha 790 hrs. Sice the umber of bulbs i the sample is large, the sample mea will be ormally distributed with mea = 800 ad stadard error = 48 =4. Put Z= X µ X = (X 800),the 44 4 X P(X<790) = P < 4 4 = P (Z <.5) = P (Z >.5), by symmetry = P(Z.5) = =
6 To coclude this sectio the mai results cocerig the distributio of the sample mea X are summarised. Cosider a paret populatio with mea µ X ad stadard deviatio. From this populatio take a radom sample of size with sample mea X ad stadard error /. Defie Z = X µ X / the (i) for all the distributio of Z is N(0, ) if the distributio of the paret populatio is ormal; (ii) < 0 the distributio of Z is approximately N(0, ) if the distributio of the paret populatio is approximately ormal; (iii) 0 the distributio of Z is a good approximatio to N(0, ) for all distributios of the paret populatio. 5. Cofidece itervals for meas Choose a sample at radom from a populatio, the the mea, X, of the sample is said to provide a poit estimator of the populatio mea µ X. The accuracy of this estimate is measured by the cofidece iterval, which is the iterval withi which you ca be reasoably sure the value of the populatio mea µ X lies. Oe usually calculates k by specifyig that the probability that the iterval (X k, X + k) cotais the populatio mea is 0.95, or For example, if k is calculated so that P (X k µ X X + k) =.95 the the iterval is called the 95% cofidece iterval if the iterval is calculated for very may samples the 95 out of 00 itervals would cotai µ X. O replacig 95 by 99 you obtai the defiitio of a 99% cofidece iterval. To proceed further, assume that the stadard deviatio is kow ad that Z = X µ X / is distributed with the stadard ormal distributio N(0, ) (the coditios for this to hold were stated at the ed of sectio 4). The results o the Formula Sheet show that 95% of the stadard ormal distributio lies betwee.96 ad.96. Hece P (.96 Z.96) = Rearragig the iequalities iside the bracket to obtai coditios o X yields Z = X µ X /.96 X µ X.96 / µ X X.96 / ; Z = X µ X /.96 X µ X.96 / µ X X +.96 /. It follows that the earlier expressio ca be re-writte as P ( X.96 σx µ X X +.96 σx ) =0.95, ad hece the iterval X.96 σx µ X X +.96 σx is the 95% cofidece iterval for µ X. Similarly X.58 σx µ X X +.58 σx is the 99% cofidece iterval for µ X. 6
7 Ex 9. The percetage of copper i a certai chemical is to be estimated by takig a series of measuremets o radomly chose small quatities of the chemical ad usig the sample mea to estimate the true percetage. From previous experiece idividual measuremets of this type are kow to have a stadard deviatio of.0%. How may measuremets must be made so that the stadard error of the estimate is less tha 0.%? If the sample mea w of 45 measuremets is foud to be.9%, give a 95% cofidece iterval for the true percetage, ω. Assume that measuremets are made. The stadard error of the sample mea is (/ )%. For the required precisio require / <0., i.e. >(/0.) =4/0.09 = Sice must be a iteger, at least 45 measuremets are ecessary for required precisio. With a sample of 45 measuremets, you ca use the cetral limit theorem ad take the sample mea percetage W to be distributed ormally with mea ω ad stadard error / 45. Hece, if ω is the true percetage, it follows that Z = W ω / is distributed as N(0, ). Sice 95% of the area uder the stadard 45 ormal curve lies betwee Z =.96 ad Z =.96, ( P.96 W ω ) / =0.95. Re-arragig, we obtai P ( ) W.96 ω W +.96 = Hece, the 95% cofidece iterval for the true percetage is (.9.96(0.98), (0.98)) = (.,.49). To complete this sectio we defie the sample variace. Def. Give a sample of observatios X,X,..., X the sample variace, s,isgiveby s = ( Xi X ),where X deotes the sample mea. i= I our discussio of cofidece itervals for the mea it was assumed that the populatio variace σ X was kow. What happes whe this it is ot kow? For samples of size >0, a good estimate of σ X is obtaied by calculatig the sample variace s ad usig this value. (For small samples, <0, eed to use the t-distributio ot cosidered i this module). 6. Hypothesis testig A assumptio made about a populatio is called a statistical hypothesis. From iformatio cotaied i a radom sample we try to decide whether or ot the hypothesis is true: if evidece from the sample is icosistet with the hypothesis, the hypothesis is rejected; if the evidece is cosistet with the hypothesis, the the hypothesis is accepted. The hypothesis beig tested is called the ull hypothesis (usually deoted by H 0 ) it either specifies a particular value of the populatio parameter or specifies that two or more parameters are equal. A cotrary assumptio is called the alterative hypothesis (usually deoted by H ) ormally specifies a rage of values for the parameter. A commo example of the ull hypothesis is H 0 : µ X = µ 0. The three alterative hypotheses are (i) H : µ X >µ 0, (ii) H : µ X <µ 0, (iii) H : µ X µ 0. Types (i) ad (ii) are said to be oe-sided (or oe-tailed, see figure 6b) type (iii) is two-sided (or two-tailed, see figure 6a). The result of a test is a decisio to choose H 0 or H. This decisio is subject to ucertaity, ad two types of error are possible: 7
8 (i) a type I error occurs whe we reject H 0 o the basis of the test although it happes to be true the probability of this happeig is called the level of sigificace of the test ad this is prescribed before testig most commoly chose values are 5% or %. (ii) a type II error occurs whe you accept the ull hypothesis o the basis of the test although it happes to be false. The above ideas are ow applied to determie whether or ot the mea, X, of a sample is cosistet with a specified populatio mea µ 0. The ull hypothesis is H 0 : µ X = µ 0 ad a suitable statistic to use is Z = X µ 0 /, where is the stadard deviatio of the populatio ad is the size of the sample. Fid the rage of values of Z for which the ull hypothesis would be accepted kow as acceptace for the test depeds o the pre-determied sigificace level ad the choice of H. Correspodig rage of values of Z for which H 0 is rejected (i.e. ot accepted) is called the rejectio. Ex 0. A stadard process produces yar with mea breakig stregth 5.8 kg ad stadard deviatio.9 kg. A modificatio is itroduced ad a sample of 0 legths of yar produced by the ew process is tested to see if the breakig stregth has chaged. The sample mea breakig stregth is 6.5 kg. Assumig the stadard deviatio is uchaged, is it correct to say that there is o chage i the mea breakig stregth? Here H 0 : µ X = µ 0, H : µ X µ 0, where µ 0 =5.8adµ X is the mea breakig stregth for the ew process. If H 0 is true (i.e. µ X = µ 0 ), the Z = X µ 0 / has approximately the N(0, ) distributio, where X is the mea breakig stregth of the 0 sample values ad =0. At the 5% sigificace level there is a rejectio of.5% i each tail, as show i figure 6a (sice, uder H 0, P (Z <.96) = P (Z >.96) = P (Z.96) = Φ(.96) = 0.05, i.e..5%). This is a example of a two-sided test leadig to a two-tailed rejectio. rejectio acceptace rejectio % % Figure 6a The test is therefore: accept H 0 if.96 Z.96, otherwise reject From the data, Z =.9/ =.08. Hece, H 0 is rejected at the 5% sigificace level: i.e. the 0 evidece suggests that there IS a chage i the mea breakig stregth. Let us ow cosider a slightly differetly worded questio. Suppose the modificatio was specifically desiged so as to icrease the stregth of the yar. I this case H 0 : µ X = µ 0, H : µ X >µ 0, ad H 0 is rejected if the value of Z is ureasoably large. I this situatio the test is oe-sided ad acceptace ad (oe-tailed) rejectio s at the 5% sigificace level are show below. 8
9 acceptace 0 Figure 6b rejectio.64 5% At 5% sigificace level, test is accept H 0 if Z.64, otherwise reject. From earlier work Z =.08 ad agai the ull hypothesis is rejected. [Compare the two diagrams above, which illustrate the statemet that the rejectio for a test depeds o the form of both the alterative hypothesis ad the sigificace level.] rec/0ls 9
Expectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationMATH/STAT 352: Lecture 15
MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationThe standard deviation of the mean
Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationSampling Distributions, Z-Tests, Power
Samplig Distributios, Z-Tests, Power We draw ifereces about populatio parameters from sample statistics Sample proportio approximates populatio proportio Sample mea approximates populatio mea Sample variace
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationBinomial Distribution
0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationApril 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE
April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationStat 319 Theory of Statistics (2) Exercises
Kig Saud Uiversity College of Sciece Statistics ad Operatios Research Departmet Stat 39 Theory of Statistics () Exercises Refereces:. Itroductio to Mathematical Statistics, Sixth Editio, by R. Hogg, J.
More informationMath 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency
Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the
More informationThe variance of a sum of independent variables is the sum of their variances, since covariances are zero. Therefore. V (xi )= n n 2 σ2 = σ2.
SAMPLE STATISTICS A radom sample x 1,x,,x from a distributio f(x) is a set of idepedetly ad idetically variables with x i f(x) for all i Their joit pdf is f(x 1,x,,x )=f(x 1 )f(x ) f(x )= f(x i ) The sample
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationCommon Large/Small Sample Tests 1/55
Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio
More informationModule 1 Fundamentals in statistics
Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationThis exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.
Probability ad Statistics FS 07 Secod Sessio Exam 09.0.08 Time Limit: 80 Miutes Name: Studet ID: This exam cotais 9 pages (icludig this cover page) ad 0 questios. A Formulae sheet is provided with the
More informationBecause it tests for differences between multiple pairs of means in one test, it is called an omnibus test.
Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More information[ ] ( ) ( ) [ ] ( ) 1 [ ] [ ] Sums of Random Variables Y = a 1 X 1 + a 2 X 2 + +a n X n The expected value of Y is:
PROBABILITY FUNCTIONS A radom variable X has a probabilit associated with each of its possible values. The probabilit is termed a discrete probabilit if X ca assume ol discrete values, or X = x, x, x 3,,
More informationClass 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2
More information2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2
Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:
More informationCHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics
CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Radom Samplig The basic idea of the statistical iferece is that we are allowed to draw ifereces or coclusios about a populatio based
More informationA statistical method to determine sample size to estimate characteristic value of soil parameters
A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig
More informationLecture 7: Properties of Random Samples
Lecture 7: Properties of Radom Samples 1 Cotiued From Last Class Theorem 1.1. Let X 1, X,...X be a radom sample from a populatio with mea µ ad variace σ
More information6 Sample Size Calculations
6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig
More informationLecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationStatistics 20: Final Exam Solutions Summer Session 2007
1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets
More informationDirection: This test is worth 150 points. You are required to complete this test within 55 minutes.
Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem
More informationRecall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.
Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed
More informationStatisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
More informationAgreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times
Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log
More informationEstimation for Complete Data
Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of
More information1036: Probability & Statistics
036: Probability & Statistics Lecture 0 Oe- ad Two-Sample Tests of Hypotheses 0- Statistical Hypotheses Decisio based o experimetal evidece whether Coffee drikig icreases the risk of cacer i humas. A perso
More informationLecture 6 Simple alternatives and the Neyman-Pearson lemma
STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull
More informationOverview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions
Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More information5. Likelihood Ratio Tests
1 of 5 7/29/2009 3:16 PM Virtual Laboratories > 9. Hy pothesis Testig > 1 2 3 4 5 6 7 5. Likelihood Ratio Tests Prelimiaries As usual, our startig poit is a radom experimet with a uderlyig sample space,
More informationPower and Type II Error
Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error
More informationLecture 4. Random variable and distribution of probability
Itroductio to theory of probability ad statistics Lecture. Radom variable ad distributio of probability dr hab.iż. Katarzya Zarzewsa, prof.agh Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationTopic 18: Composite Hypotheses
Toc 18: November, 211 Simple hypotheses limit us to a decisio betwee oe of two possible states of ature. This limitatio does ot allow us, uder the procedures of hypothesis testig to address the basic questio:
More informationDiscrete probability distributions
Discrete probability distributios I the chapter o probability we used the classical method to calculate the probability of various values of a radom variable. I some cases, however, we may be able to develop
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationUnderstanding Samples
1 Will Moroe CS 109 Samplig ad Bootstrappig Lecture Notes #17 August 2, 2017 Based o a hadout by Chris Piech I this chapter we are goig to talk about statistics calculated o samples from a populatio. We
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More informationLast Lecture. Wald Test
Last Lecture Biostatistics 602 - Statistical Iferece Lecture 22 Hyu Mi Kag April 9th, 2013 Is the exact distributio of LRT statistic typically easy to obtai? How about its asymptotic distributio? For testig
More informationData Analysis and Statistical Methods Statistics 651
Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio
More informationPH 425 Quantum Measurement and Spin Winter SPINS Lab 1
PH 425 Quatum Measuremet ad Spi Witer 23 SPIS Lab Measure the spi projectio S z alog the z-axis This is the experimet that is ready to go whe you start the program, as show below Each atom is measured
More informationSection 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis
Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests
More informationLecture 5. Random variable and distribution of probability
Itroductio to theory of probability ad statistics Lecture 5. Radom variable ad distributio of probability prof. dr hab.iż. Katarzya Zarzewsa Katedra Eletroii, AGH e-mail: za@agh.edu.pl http://home.agh.edu.pl/~za
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationMath 140 Introductory Statistics
8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These
More information32 estimating the cumulative distribution function
32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio
More informationKLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationBig Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.
5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationThis is an introductory course in Analysis of Variance and Design of Experiments.
1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class
More informationChapter 13: Tests of Hypothesis Section 13.1 Introduction
Chapter 13: Tests of Hypothesis Sectio 13.1 Itroductio RECAP: Chapter 1 discussed the Likelihood Ratio Method as a geeral approach to fid good test procedures. Testig for the Normal Mea Example, discussed
More informationIf, for instance, we were required to test whether the population mean μ could be equal to a certain value μ
STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially
More information1 Review of Probability & Statistics
1 Review of Probability & Statistics a. I a group of 000 people, it has bee reported that there are: 61 smokers 670 over 5 960 people who imbibe (drik alcohol) 86 smokers who imbibe 90 imbibers over 5
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationf(x)dx = 1 and f(x) 0 for all x.
OCR Statistics 2 Module Revisio Sheet The S2 exam is 1 hour 30 miutes log. You are allowed a graphics calculator. Before you go ito the exam make sureyou are fully aware of the cotets of theformula booklet
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationSTAT431 Review. X = n. n )
STAT43 Review I. Results related to ormal distributio Expected value ad variace. (a) E(aXbY) = aex bey, Var(aXbY) = a VarX b VarY provided X ad Y are idepedet. Normal distributios: (a) Z N(, ) (b) X N(µ,
More informationLecture 1 Probability and Statistics
Wikipedia: Lecture 1 Probability ad Statistics Bejami Disraeli, British statesma ad literary figure (1804 1881): There are three kids of lies: lies, damed lies, ad statistics. popularized i US by Mark
More informationConfidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.
MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationThomas Whitham Form Centre
Thomas Whitham Form Cetre Statistics Module . The Uiform Distributio (or rectagular distributio) U(a, b) a b Pdf f x a x b a b otherwise cdf E F x a b var, b a x a, b a, by symmetry x a a x b x b (proof
More information5. A formulae page and two tables are provided at the end of Part A of the examination PART A
Istructios: 1. You have bee provided with: (a) this questio paper (Part A ad Part B) (b) a multiple choice aswer sheet (for Part A) (c) Log Aswer Sheet(s) (for Part B) (d) a booklet of tables. (a) I PART
More informationIE 230 Seat # Name < KEY > Please read these directions. Closed book and notes. 60 minutes.
IE 230 Seat # Name < KEY > Please read these directios. Closed book ad otes. 60 miutes. Covers through the ormal distributio, Sectio 4.7 of Motgomery ad Ruger, fourth editio. Cover page ad four pages of
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationPSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9
Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I
More informationMA238 Assignment 4 Solutions (part a)
(i) Sigle sample tests. Questio. MA38 Assigmet 4 Solutios (part a) (a) (b) (c) H 0 : = 50 sq. ft H A : < 50 sq. ft H 0 : = 3 mpg H A : > 3 mpg H 0 : = 5 mm H A : 5mm Questio. (i) What are the ull ad alterative
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationDiscrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 22
CS 70 Discrete Mathematics for CS Sprig 2007 Luca Trevisa Lecture 22 Aother Importat Distributio The Geometric Distributio Questio: A biased coi with Heads probability p is tossed repeatedly util the first
More information- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion
1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter
More informationEE 4TM4: Digital Communications II Probability Theory
1 EE 4TM4: Digital Commuicatios II Probability Theory I. RANDOM VARIABLES A radom variable is a real-valued fuctio defied o the sample space. Example: Suppose that our experimet cosists of tossig two fair
More informationEstimation of a population proportion March 23,
1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes
More information6.3 Testing Series With Positive Terms
6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notatio Math 113 - Itroductio to Applied Statistics Name : Use Word or WordPerfect to recreate the followig documets. Each article is worth 10 poits ad ca be prited ad give to the istructor
More informationStatistical Intervals for a Single Sample
3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea
More informationProblem Set 4 Due Oct, 12
EE226: Radom Processes i Systems Lecturer: Jea C. Walrad Problem Set 4 Due Oct, 12 Fall 06 GSI: Assae Gueye This problem set essetially reviews detectio theory ad hypothesis testig ad some basic otios
More informationHYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018
HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible
More information