Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.
|
|
- Thomas Gray
- 5 years ago
- Views:
Transcription
1 MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey
2 Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval ad problems with it The coservative iterval Examples
3 Pivots ad Cofidece Itervals For a give statistical model, a pivot is a fuctio of the data ad the parameters which always has the same distributio (whatever the values of the parameters). For the Z-test model (data modelled as radom sample size from a populatio with ukow mea µ kow variace σ 2 ad populatio ormal ad/or sample size large) the with X the sample average, the pivot X µ σ/ N(0, 1) (whatever µ is). For the (oe-sample) t-test model (radom sample size from ormal populatio with ukow mea µ ad variace), with X ad S the sample average ad sd (resp.), the pivot X µ S/ t 1 (whatever µ is).
4 For the two-sample t-test model (two radom samples, sizes x, y from ormal populatios with ukow meas µ x, µ y ad ukow but equal variaces, the with X, Ȳ the sample averages, S X, S Y the sample sds, ad S p = ( x 1)S 2 X +(y 1)S2 Y x + y 2 X Ȳ (µ x µ y ) S p q 1 x + 1 y the pooled sample sd, the pivot t x + y 2 (whatever the values of µ x, µ y ). I all cases the pivot is of the form EST PARAM SE where the umerator is the differece betwee a parameter ad a estimate of it, the so-called estimatio error ad SE is the sd (or a estimate thereof) of the estimatio error i the umerator, regarded as a radom variable.
5 Suppose we ca fid a c so that for the pivot i questio, ( ) EST PARAM P c c = SE The we ca say that the radom iterval of the form EST ± c SE cotais PARAM with probability 0.95, that is P (EST c SE PARAM EST + c SE) = The observed value of this radom iterval is called a 95% cofidece iterval. Differet cofidece levels (e.g. 90%, 99%) ca be obtaied by choosig c differetly, such that the right-had side above is 0.90, 0.99, etc.
6 Thus for the Z-test model, if x is the observed value of X, we eed upper percetage poits from N(0, 1) (available o the bottom row of a t-table): For a 100(1 α)% cofidece iterval we eed c such that P(Z c) = α/2, sice the we also have P(Z c) = α/2 ad so P( c Z c) = 1 P(Z c) P(Z c) = 1 α/2 α/2 = 1 α. 95% cofidece iterval is x ± 1.96 σ/ 90% cofidece iterval is x ± σ/ 99% cofidece iterval is x ± σ/
7 Suppose we have a (oe-sample) t-test model with 16 observatios, ad that x ad s are the observed values of the sample average X ad sd S. The pivot ( X µ)/(s/ 16) t 15 so we cosult that row of the t-table: For 95% (corresp. to α = 0.05), sice P(t 15 > 2.131) = (i.e. α/2), we use x ± 2.131(s/4). For 90% (corresp. to α = 0.1), sice P(t 15 > 1.753) = 0.05 (i.e. α/2), we use x ± 1.753(s/4). For 99% (corresp. to α = 0.01), sice P(t 15 > 2.947) = (i.e. α/2), we use x ± 2.947(s/4).
8 Suppose we have a two-sample t-test model with sample sizes 11 ad 15, let x ad ȳ deote the observed sample averages, s x ad s y the observed sample sds ad s p = (10s 2 x + 14s 2 y ) /24 the observed pooled sample sd. The pivot here has a t 24 distributio: For 95% cofidece (corresp. to α = 0.05), the multiplier we eed is (sice P(t 24 > 2.064) = 0.025, i.e. α/2). A 95% cofidece iterval for the populatio mea differece µ x µ y is therefore give by ( x ȳ) ± (s p For 90% cofidece (α = 0.1), sice P(t 24 > 1.711) = 0.05 (i.e. α/2), use For 99% cofidece (α = 0.01), sice P(t 24 > 2.797) = (i.e. α/2), use ).
9 Iterpretatio May have difficulty i properly iterpretig a cofidece iterval. The cofidece level is a property of the procedure you have used. It says how ofte it covers the target i the log-ru. This is thus a property oly realised after may repetitios. If we just compute a sigle cofidece iterval i practice, the we may or may ot have covered the target. We do t kow, ad we possibly ever will kow exactly. However we kow that if we repeated this procedure may times, i 95% (or whatever the cofidece level is) of the time the cofidece iterval would iclude the ukow parameter value.
10 Coservative ad approximate cofidece itervals Our ability to costruct exact cofidece itervals for the 3 models cosidered depeded crucially o the fact that we had a pivot with a kow distributio of the form (est-param)/se. I that case we ca defie a radom iterval with the property that P(iterval icludes parameter) = 0.95 I some models the form of (approximate) pivots ad/or se s make are ot so coveiet.
11 Fallback optios are: a coservative 95% cofidece iterval obtaied by defiig a radom iterval such that P(iterval icludes parameter) 0.95, so that the iterval is possibly wider tha it really eeds to be, but at least still has at least the omial coverage probability; a approximate 95% cofidece iterval whereby P(iterval icludes parameter) Such itervals should be used with cautio. Strictly speakig, itervals i the Z-test model where we are usig a Cetral-Limit-Theorem-approximately-ormal argumet are of this type, although i those cases the approximatio is ofte quite accurate. We shall examie such thigs i oe particular example: cofidece itervals for a biomial p-parameter.
12 Cofidece itervals for a biomial p Suppose we model a cout X as a B(, p) for some kow but ukow p. Example: i a cliical trial, of 100 patiets sufferig from a certai coditio, 68 obtai relief. Modellig this cout as a B(100, p) radom variable, provide a 95% cofidece iterval for p. The estimate is just ˆp = = 0.68, the observed proportio obtaiig relief. A first guess would be to work out the stadard error of the estimate ad the, sice X, ad thus ˆp are approximately ormal, use ˆp ± 1.96SE(ˆp). What is the stadard error of the estimate ˆp i geeral?
13 The radom variable ˆp = X has ( ) ( ) X 1 2 Var = Var(X ) ( ) 1 2 = Var(X ) = p(1 p) = p(1 p) Thus the stadard deviatio of the estimator ˆp = X / is p(1 p)/. However, this depeds o the ukow p, so a computable versio, (i.e. the stadard error) is obtaied by pluggig i the estimate ito this expressio. Thus ˆp(1 ˆp) SE(ˆp) =. So i our example, the estimate is ˆp = 0.68 with stadard error ˆp(1 ˆp)/ Ca we use a iterval of the form ˆp ± c SE(ˆp)?.
14 To do this, we eed ˆp p SE(ˆp) = ˆp p ˆp(1 ˆp) to be a pivot, that is to have a kow distributio, ot depedig o p. Is this a pivot? Is it approximately? If so, with what distributio? If is large eough so that the Cetral Limit applies, we do have a (approximate) pivot here, but ot the ratio above, rather a versio of it with the true p put back ito the deomiator i place of ˆp (i.e. the exact SD istead of its approximatio, the SE): ˆp p SD(ˆp) = ˆp p p(1 p) approx N(0, 1) (ote the differece betwee these two ratios, i particular the deomiators!)
15 Ufortuately the approximate-n(0, 1)-pivot has the ukow p appearig i the deomiator; it ca t be used to costruct cofidece itervals directly; the ±-factor is ot computable. Eve more ufortuately, if we revert back to our first guess ad plug-i ˆp for p i the deomiator, the resultat ratio is i geeral ot-at-all-a-approximate-n(0, 1)-pivot: Dist of ˆp p SE(ˆp) = ˆp p ˆp(1 ˆp) chages sigificatly for differet p s, (particularly for small-to-moderate, say 50). Eve more ufortuately tha that, this is still recommeded i may textbooks as a good idea. As we illustrate below, this iterval ca have a serious problem. More precisely, for certai ulucky choices of ad p, the coverage probability of the ˆp ± 1.96SE (ˆp) is otably below 0.95.
16 We illustrate this pheomeo with a particularly ulucky pair, = 32, p = 0.2: > x=rbiom(10000,32,.2) > phat=x/32 > se=sqrt(phat*(1-phat)/32) > lower=phat-1.96*se > upper=phat+1.96*se > sum((lower<=.2)*(upper>=.2)) [1] 8889 # this couts how may simulated itervals # cover the true value of 0.2 This is sigificatly less tha the expected 9500 (P-value of 1-sided test of H 0 : p = 0.95 versus H 1 : p < 0.95 is pretty small!): > pbiom(8889,10000,.95) [1] e-131
17 Cofidece itervals Table summary 1 lists the smallest Coservative after adwhich approximate the coverage cofidece itervals stays atfor 0.93 a biomial or above p for Examples selected values of p for the stadard iterval ad three alterative itervals. s, J, FIG. 1. Coverage probability of the stadard iterval for p = 0.5 ad = This shows P(p ˆp ± 1.96 ˆp(1 ˆp)/) for p = 0.5 ad ragig from 10 to 100 (from Brow et al. Aals of Statistics 2002).
18 This graph seems to suggest that, at least for p = 0.5, the situatio improves as gets bigger (as we expect is should, because the the SE should be almost perfect at estimatig SD(ˆp) ad so the ratio should be like a N(0, 1) the. Let us cosider = 100, = 1000 ad = 2000 with p = 0.2 agai: Firstly, = 100, p = 0.2: > x=rbiom(10000,100,.2) > phat=x/100 > se=sqrt(phat*(1-phat)/100) > lower=phat-1.96*se > upper=phat+1.96*se > sum((lower<=.2)*(upper>=.2)) [1] 9343 > pbiom(9343,10000,.95) [1] e-12 Agai, the coverage probability is clearly less tha 0.95.
19 Next = 1000, p = 0.2: > x=rbiom(10000,1000,.2) > phat=x/1000 > se=sqrt(phat*(1-phat)/1000) > lower=phat-1.96*se > upper=phat+1.96*se > sum((lower<=.2)*(upper>=.2)) [1] 9429 > pbiom(9429,10000,.95) [1] Eve here with = 1000, the umber of itervals that work is sigificatly less tha the 9500 that oe would expect if the cofidece level really was 95%.
20 Fially = 2000, p = 0.2: > x=rbiom(10000,2000,.2) > phat=x/2000 > se=sqrt(phat*(1-phat)/2000) > lower=phat-1.96*se > upper=phat+1.96*se > sum((lower<=.2)*(upper>=.2)) [1] 9485 > pbiom(9485,10000,.95) [1] Here although less that 9500, it is ot sigificatly less ad so we would be happy believig that the actual cofidece level is 95% here. So oly use this iterval for massive sample sizes (well over 1000).
21 Two sources of error i the approximate cofidece iterval for p There are two approximatios at work with the so-called approximate iterval ˆp ± 1.95 ˆp(1 ˆp)/, where ˆp = X / ad X B(, p): approximatig the SD(ˆp) = p(1 p)/ with the SE = ˆp(1 ˆp)/; approximatig the biomial distributio of X with a ormal. The mai source of error is the first oe; so log as p ad (1 p) are both bigger tha 5 we are happy that the ormal approximatio to the biomial is pretty good.
22 That is to say we are reasoably happy that ˆp p p(1 p) approx N(0, 1) (ote: the true p appears i the deomiator here, ot ˆp) ad so that ( ) p(1 p) P p i ˆp ± is a pretty accurate approximatio. The problem here is that this iterval, while havig a close-to-95% coverage probability, caot be computed!
23 The poor performace of the so-called approximate cofidece iterval is because of the difficulty i accurately p(1 p) approximatig the quatity. Aother approach is to determie a upper boud for this. For 0 p 1, p(1 p) is maximised at p = 0.5 where it equals p(1 p) Thus because 1 2 for all p, the ucomputable iterval p(1 p) ˆp ± 1.96 (which has coverage probability 95%) is always icluded i the coservative iterval ˆp ±
24 Coservative cofidece iterval for biomial p Thus we have that ( P ˆp ) 2 1 p ˆp (approx) 0.95 (Note: it is still techically oly approximately coservative sice we are usig a ormal approximatio to a biomial distributio). We refer to the observed value of the radom iterval ˆp ± as a coservative 95% cofidece iterval for p. It is always the maximum width of ay correspodig approximate iterval for that value of.
25 Summary Thus, we have the followig two optios for providig a 95% 1 cofidece iterval for the biomial p parameter, based o a sigle observatio x modelled as the observed value of a radom variable X B(, p) for kow but p ukow: 1. The approximate 95% C.I. for p: ˆp(1 ˆp) ˆp ± 1.96 which should oly be used for massive (> 1000). 2. The coservative 95% C.I. for p: ˆp ± which ca be (eedlessly) wide, but is (at least approx) valid. 1 Differet cofidece levels are obtaied by replacig 1.96 with the appropriate value from the N(0, 1) table: for 90%, for 98%, for 99%, etc.
26 Examples Cotiuig our earlier example with 100 patiets, 68 of whom experiece relief, our poit estimate of p is ˆp = = 0.68, the stadard error is ˆp(1 ˆp)/ The sample size is too small to use the approximate iterval. The coservative 95% iterval is thus ˆp ± ± [0.582, 0, 778].
27 Left-hadedess A radom sample of 1500 people from a certai populatio was foud to cotai 129 left-haded people. Provide a 95% cofidece iterval for the true proportio of left-haders i the populatio. Our poit estimate is ˆp = 129/ ad its stadard error is Sice our here is i the thousads, we ca perhaps use the approximate iterval. It yields ˆp ± 1.96 SE ± ( ) [0.072, 0.100] It is of iterest to compare this to the coservative iterval: ( ) ± [0.061, 0.111] The coservative iterval is cosiderably wider, which will of course happe wheever ˆp is far from 0.5 as it is here.
28 I light of the last example, we ca do a little simulatio to see how reliable that approximate 95% cofidece iterval is: we simulate from B(1500, 0.08) may times ad see how ofte the iterval covers 0.08: > x=rbiom(10000,1500,.08) > phat=x/1500 > l1=phat-1.96*sqrt(phat*(1-phat)/1500) > u1=phat+1.96*sqrt(phat*(1-phat)/1500) > sum((l1<=.08)*(u1>=.08)) [1] 9494 This is clearly ot sigificatly differet from the ideal 9500! This makes us feel good about the approximate iterval here. How about the coservative 95% iterval? > l2=phat-1.96*sqrt(1/(4*1500)) > u2=phat+1.96*sqrt(1/(4*1500)) > sum((l2<=.08)*(u2>=.08)) [1] 9997 Wow! I all but 3 of the simulatios the coservative iterval covered So although very wide, it will cover the true p at least 95% of the time.
29 Goodess-of-fit tests Our last topic relates to discrete data, e.g. couts or frequecies. Sometimes it is desired to compare a set of observed frequecies to either 1. a give set of expected probabilties/proportios or 2. a family of such sets to see if the set of probs (or oe member of the family of such sets) ca well explai what is observed.
Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationMATH/STAT 352: Lecture 15
MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet
More informationSince X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain
Assigmet 9 Exercise 5.5 Let X biomial, p, where p 0, 1 is ukow. Obtai cofidece itervals for p i two differet ways: a Sice X / p d N0, p1 p], the variace of the limitig distributio depeds oly o p. Use the
More informationENGI 4421 Confidence Intervals (Two Samples) Page 12-01
ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationEstimation of a population proportion March 23,
1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationApril 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE
April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationBig Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.
5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationChapter 8: Estimating with Confidence
Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig
More informationBIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov
Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios
More informationOverview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions
Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationAgreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times
Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationProperties and Hypothesis Testing
Chapter 3 Properties ad Hypothesis Testig 3.1 Types of data The regressio techiques developed i previous chapters ca be applied to three differet kids of data. 1. Cross-sectioal data. 2. Time series data.
More informationSTAT431 Review. X = n. n )
STAT43 Review I. Results related to ormal distributio Expected value ad variace. (a) E(aXbY) = aex bey, Var(aXbY) = a VarX b VarY provided X ad Y are idepedet. Normal distributios: (a) Z N(, ) (b) X N(µ,
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More information1 Review of Probability & Statistics
1 Review of Probability & Statistics a. I a group of 000 people, it has bee reported that there are: 61 smokers 670 over 5 960 people who imbibe (drik alcohol) 86 smokers who imbibe 90 imbibers over 5
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationStatistical Intervals for a Single Sample
3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea
More informationExam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234
STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Locatio CB 234 STA 291 - Lecture 19 1 Exam II Covers Chapter 9 10.1; 10.2; 10.3; 10.4; 10.6
More informationInterval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),
Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We
More informationChapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p
Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret
More information(7 One- and Two-Sample Estimation Problem )
34 Stat Lecture Notes (7 Oe- ad Two-Sample Estimatio Problem ) ( Book*: Chapter 8,pg65) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye Estimatio 1 ) ( ˆ S P i i Poit estimate:
More informationInstructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?
CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More informationHomework 5 Solutions
Homework 5 Solutios p329 # 12 No. To estimate the chace you eed the expected value ad stadard error. To do get the expected value you eed the average of the box ad to get the stadard error you eed the
More informationAnnouncements. Unit 5: Inference for Categorical Data Lecture 1: Inference for a single proportion
Housekeepig Aoucemets Uit 5: Iferece for Categorical Data Lecture 1: Iferece for a sigle proportio Statistics 101 Mie Çetikaya-Rudel PA 4 due Friday at 5pm (exteded) PS 6 due Thursday, Oct 30 October 23,
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationAAEC/ECON 5126 FINAL EXAM: SOLUTIONS
AAEC/ECON 5126 FINAL EXAM: SOLUTIONS SPRING 2015 / INSTRUCTOR: KLAUS MOELTNER This exam is ope-book, ope-otes, but please work strictly o your ow. Please make sure your ame is o every sheet you re hadig
More informationRead through these prior to coming to the test and follow them when you take your test.
Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1
More informationKLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
More informationMath 140 Introductory Statistics
8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These
More informationA sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece,, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet as
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationSampling Distributions, Z-Tests, Power
Samplig Distributios, Z-Tests, Power We draw ifereces about populatio parameters from sample statistics Sample proportio approximates populatio proportio Sample mea approximates populatio mea Sample variace
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics
ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the
More informationLet us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.
Lecture 5 Let us give oe more example of MLE. Example 3. The uiform distributio U[0, ] o the iterval [0, ] has p.d.f. { 1 f(x =, 0 x, 0, otherwise The likelihood fuctio ϕ( = f(x i = 1 I(X 1,..., X [0,
More informationSequences A sequence of numbers is a function whose domain is the positive integers. We can see that the sequence
Sequeces A sequece of umbers is a fuctio whose domai is the positive itegers. We ca see that the sequece 1, 1, 2, 2, 3, 3,... is a fuctio from the positive itegers whe we write the first sequece elemet
More informationThis is an introductory course in Analysis of Variance and Design of Experiments.
1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class
More informationMath 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency
Math 152. Rumbos Fall 2009 1 Solutios to Review Problems for Exam #2 1. I the book Experimetatio ad Measuremet, by W. J. Youde ad published by the by the Natioal Sciece Teachers Associatio i 1962, the
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More informationEstimation for Complete Data
Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of
More information6.3 Testing Series With Positive Terms
6.3. TESTING SERIES WITH POSITIVE TERMS 307 6.3 Testig Series With Positive Terms 6.3. Review of what is kow up to ow I theory, testig a series a i for covergece amouts to fidig the i= sequece of partial
More informationLecture 3. Properties of Summary Statistics: Sampling Distribution
Lecture 3 Properties of Summary Statistics: Samplig Distributio Mai Theme How ca we use math to justify that our umerical summaries from the sample are good summaries of the populatio? Lecture Summary
More informationThe standard deviation of the mean
Physics 6C Fall 20 The stadard deviatio of the mea These otes provide some clarificatio o the distictio betwee the stadard deviatio ad the stadard deviatio of the mea.. The sample mea ad variace Cosider
More informationUnderstanding Dissimilarity Among Samples
Aoucemets: Midterm is Wed. Review sheet is o class webpage (i the list of lectures) ad will be covered i discussio o Moday. Two sheets of otes are allowed, same rules as for the oe sheet last time. Office
More informationSTATISTICAL INFERENCE
STATISTICAL INFERENCE POPULATION AND SAMPLE Populatio = all elemets of iterest Characterized by a distributio F with some parameter θ Sample = the data X 1,..., X, selected subset of the populatio = sample
More informationRecall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.
Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed
More informationChi-Squared Tests Math 6070, Spring 2006
Chi-Squared Tests Math 6070, Sprig 2006 Davar Khoshevisa Uiversity of Utah February XXX, 2006 Cotets MLE for Goodess-of Fit 2 2 The Multiomial Distributio 3 3 Applicatio to Goodess-of-Fit 6 3 Testig for
More informationChapter 6 Principles of Data Reduction
Chapter 6 for BST 695: Special Topics i Statistical Theory. Kui Zhag, 0 Chapter 6 Priciples of Data Reductio Sectio 6. Itroductio Goal: To summarize or reduce the data X, X,, X to get iformatio about a
More informationChapter 11: Asking and Answering Questions About the Difference of Two Proportions
Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,
More informationBinomial Distribution
0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible
More informationEconomics Spring 2015
1 Ecoomics 400 -- Sprig 015 /17/015 pp. 30-38; Ch. 7.1.4-7. New Stata Assigmet ad ew MyStatlab assigmet, both due Feb 4th Midterm Exam Thursday Feb 6th, Chapters 1-7 of Groeber text ad all relevat lectures
More informationConfidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation
Cofidece Iterval for tadard Deviatio of Normal Distributio with Kow Coefficiets of Variatio uparat Niwitpog Departmet of Applied tatistics, Faculty of Applied ciece Kig Mogkut s Uiversity of Techology
More informationStatistical inference: example 1. Inferential Statistics
Statistical iferece: example 1 Iferetial Statistics POPULATION SAMPLE A clothig store chai regularly buys from a supplier large quatities of a certai piece of clothig. Each item ca be classified either
More informationLecture 6 Simple alternatives and the Neyman-Pearson lemma
STATS 00: Itroductio to Statistical Iferece Autum 06 Lecture 6 Simple alteratives ad the Neyma-Pearso lemma Last lecture, we discussed a umber of ways to costruct test statistics for testig a simple ull
More informationEcon 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.
Eco 325/327 Notes o Sample Mea, Sample Proportio, Cetral Limit Theorem, Chi-square Distributio, Studet s t distributio 1 Sample Mea By Hiro Kasahara We cosider a radom sample from a populatio. Defiitio
More informationLecture 19: Convergence
Lecture 19: Covergece Asymptotic approach I statistical aalysis or iferece, a key to the success of fidig a good procedure is beig able to fid some momets ad/or distributios of various statistics. I may
More informationS160 #12. Sampling Distribution of the Proportion, Part 2. JC Wang. February 25, 2016
S160 #12 Samplig Distributio of the Proportio, Part 2 JC Wag February 25, 2016 Outlie 1 Estimatig Proportio Usig Itervals Cofidece Iterval for the Populatio Proportio iclicker Questios 2 JC Wag (WMU) S160
More informationGG313 GEOLOGICAL DATA ANALYSIS
GG313 GEOLOGICAL DATA ANALYSIS 1 Testig Hypothesis GG313 GEOLOGICAL DATA ANALYSIS LECTURE NOTES PAUL WESSEL SECTION TESTING OF HYPOTHESES Much of statistics is cocered with testig hypothesis agaist data
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationPower and Type II Error
Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error
More informationOutput Analysis and Run-Length Control
IEOR E4703: Mote Carlo Simulatio Columbia Uiversity c 2017 by Marti Haugh Output Aalysis ad Ru-Legth Cotrol I these otes we describe how the Cetral Limit Theorem ca be used to costruct approximate (1 α%
More information1 Constructing and Interpreting a Confidence Interval
Itroductory Applied Ecoometrics EEP/IAS 118 Sprig 2014 WARM UP: Match the terms i the table with the correct formula: Adrew Crae-Droesch Sectio #6 5 March 2014 ˆ Let X be a radom variable with mea µ ad
More informationComputing Confidence Intervals for Sample Data
Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios
More informationClass 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 7 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 013 by D.B. Rowe 1 Ageda: Skip Recap Chapter 10.5 ad 10.6 Lecture Chapter 11.1-11. Review Chapters 9 ad 10
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationµ and π p i.e. Point Estimation x And, more generally, the population proportion is approximately equal to a sample proportion
Poit Estimatio Poit estimatio is the rather simplistic (ad obvious) process of usig the kow value of a sample statistic as a approximatio to the ukow value of a populatio parameter. So we could for example
More informationConfidence Intervals for the Population Proportion p
Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationOctober 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1
October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces
More information6 Sample Size Calculations
6 Sample Size Calculatios Oe of the major resposibilities of a cliical trial statisticia is to aid the ivestigators i determiig the sample size required to coduct a study The most commo procedure for determiig
More informationDirection: This test is worth 150 points. You are required to complete this test within 55 minutes.
Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem
More informationLecture 7: Non-parametric Comparison of Location. GENOME 560 Doug Fowler, GS
Lecture 7: No-parametric Compariso of Locatio GENOME 560 Doug Fowler, GS (dfowler@uw.edu) 1 Review How ca we set a cofidece iterval o a proportio? 2 What do we mea by oparametric? 3 Types of Data A Review
More informationLecture 6 Chi Square Distribution (χ 2 ) and Least Squares Fitting
Lecture 6 Chi Square Distributio (χ ) ad Least Squares Fittig Chi Square Distributio (χ ) Suppose: We have a set of measuremets {x 1, x, x }. We kow the true value of each x i (x t1, x t, x t ). We would
More informationUniversity of California, Los Angeles Department of Statistics. Hypothesis testing
Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Elemets of a hypothesis test: Hypothesis testig Istructor: Nicolas Christou 1. Null hypothesis, H 0 (claim about µ, p, σ 2, µ
More informationClass 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2
More informationHYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018
HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible
More informationInfinite Sequences and Series
Chapter 6 Ifiite Sequeces ad Series 6.1 Ifiite Sequeces 6.1.1 Elemetary Cocepts Simply speakig, a sequece is a ordered list of umbers writte: {a 1, a 2, a 3,...a, a +1,...} where the elemets a i represet
More informationLecture 7: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 7: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review How ca we set a cofidece iterval o a proportio? 2 Review How ca we set a cofidece iterval
More information1 Constructing and Interpreting a Confidence Interval
Itroductory Applied Ecoometrics EEP/IAS 118 Sprig 2014 WARM UP: Match the terms i the table with the correct formula: Adrew Crae-Droesch Sectio #6 5 March 2014 ˆ Let X be a radom variable with mea µ ad
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationt distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference
EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The
More information