Binary Data in Epidemiology Epidemiologic Measures of Association with Time to Event Data
|
|
- Pamela Arabella Boyd
- 5 years ago
- Views:
Transcription
1 Leture 3: Measures of Assoiatio tober 3, 23 Biost 536 / Epi 536 Categorial Data Aalysis i Epidemiology Sott S. Emerso, M.D., Ph.D. Professor of Biostatistis Uiversity of Washigto Leture utlie Measures of Assoiatio (Siee) With full time to evet data With biary data Preisio of Iferee (Statistis) Leture 3: Measures of Assoiatio tober 3, 23 2 Biary Data i Epidemiology Epidemiologi Measures of Assoiatio with Time to Evet Data Idetifyig Risk Fators ad Evaluatig Prevetive ad Therapeuti Agets Fous of epidemiologi researh (i broad ategories) Disease surveillae: morbidity, mortality Quatifyig patters of iidee, prevalee Idetifyig risk fators or protetive fators for disease / death Comparig patters of iidee aross exposure groups Evaluatig prevetive strategies ad treatmets Comparig patters of iidee / outomes aross itervetios 3 Sietifi lassifiatio of biary data Cliial outomes Diagosis of disease, ure, disease progressio, death Exposures sometimes dihotomized Geotypes, evirometal exposures, behavioral exposures Idiators of risk status Time, plae, populatio 4 Categorial Data Aalysis, AUT 23
2 Leture 3: Measures of Assoiatio tober 3, 23 Risk Sets Summarizig Disease Epidemiology Most ofte, we reogize that the probability of a evet depeds i some way upo time I may ases, that time depedee is somethig we merely wat to adjust for as we ompare differet groups It is ot as importat to otrast the evet probability over time Examples Comparig rates of aer betwee smokers ad osmokers, we kow aer ours more ofte late i life Comparig exposure groups for premature delivery, we kow that probability of delivery ireases as pregay otiues We thus fid it oveiet to ouh may of our aalyses of biary data i terms that also osider time to evet 5 Three alterative ways of desribig the probability of observig a evet (disease diagosis, progressio, death, ) over time Ay of the first four ompletely speifies all of the others Radom variable T measures time of evet Cumulative dist futio Survivor futio Desity Iidee rate (hazard) Cumul iidee from t F F t PrT t e u du S t PrT t F t e d f t Prt T t t F t dt t Prt T t t t T Prt T t t f ( t Prt T F t t Prt T t t T F t Ft Ft t t u du ) ( t) 6 A Aside: Prevalee Risk Fators for Disease I epidemiology we also osider aother biary variable: prevalee Prevalee is i some sese a omposite of Disease iidee Disease ase fatality rate Mortality rate from all other auses Wheever it is feasible we would geerally prefer to separately study Disease iidee rates Disease ase fatality rates However, i some settigs (e.g., health servies, pharmaeutis) prevalee may ideed be most importat, ad studyig that 7 Summarizig disease D epidemiology i a speified populatio Iidee rate (over time) Cumulative iidee (over time) A exposure E is a risk fator if the disease epidemiology differs i some way aross populatios havig differet levels of E Differs at some time for some levels of E Does ot have to differ at all times Does ot have to differ for all groups A assoiatio betwee the distributio of D ad E My workig defiitio of ausality: It would be a ause if experimetally alterig the value of E ivariably alters the distributio of D i some way (Eve RCT do ot prove that E is the proximal ause, but they ome loser tha observatioal studies a) 8 Categorial Data Aalysis, AUT 23 2
3 Leture 3: Measures of Assoiatio tober 3, 23 Comparig Distributios Cosider the followig three graphs depitig the mortality experiee predited usig US Soial Seurity Data from 29 What measures would you look at desriptively to deide that there is a assoiatio betwee sex ad mortality? I partiular, how a you read from the survival urves Comparisos of survival at a partiular time Comparisos of medias (quatiles) of the distributios Comparisos of meas Comparisos of geometri meas Comparisos of hazards Is there effet modifiatio? Detetig Assoiatios with Full Data No assoiatio Graphs of desities, survival futios, or hazards will show oiidet urves Assoiatio Curves are differet somewhere Assoiatio i some speifi summary measure (e.g., mea, ) Curves differ i a partiular aspet 9 Detetig Effet Modifiatio with Full Data Effet modifiatio depeds o the summary measure used to quatify effet No effet modifiatio usig some speifi summary measure Graphs of desities, survival futios, or hazards will have same appearae with respet to the orrespodig aspet Comparig Desities: Diffiult Mode / Multimodality Age at highest peak Survival probability at t Area to the right of t Media (other quatiles) Area uder urve to the left of the media age will equal ½ Meas Not easily visualized Hazard Desity divided by area uder urve to the right 2 Categorial Data Aalysis, AUT 23 3
4 Leture 3: Measures of Assoiatio tober 3, 23 Comparig Survival Curves: S(t) vs t Comparig Survival Curves: S(t) vs log t Multimodality Chage i ovexity Differee i survival at t Vertial separatio at t Differee i quatiles Horizotal separatio at p Differee i meas Area betwee urves Hazard Slope divided by height of the urve Differee i survival at t Vertial separatio at t Log ratio of quatiles Horizotal separatio at p Log ratio of geometri meas Area betwee urves Hazard Slope divided by height of the urve 3 4 Comparig Survival Curves: log S(t) vs t Comparig CDF : log ( - S(t)) vs t Log ratio of survival at t Vertial separatio at t Log ratio of evet probability at t Vertial separatio at t 5 6 Categorial Data Aalysis, AUT 23 4
5 Leture 3: Measures of Assoiatio tober 3, 23 dds of Death : log [(-S(t))/S(t)] vs t Comparig Hazards: (t) vs t Log odds ratio of evet at t Vertial separatio at t Differee i survival at t Differee i expoetiated area uder urve to left of t o Differee i hazards at t Vertial separatio at t 7 8 Comparig Hazards: log (t) vs t Comparig Hazards: log (t) vs log t Log hazard ratio at t Vertial separatio at t Shape of distributio Weibull is straight lie Expoetial is flat lie Log hazard ratio at t Vertial separatio at t 9 2 Categorial Data Aalysis, AUT 23 5
6 Leture 3: Measures of Assoiatio tober 3, 23 Commo Measures of Assoiatio Derived Measures of Assoiatio Commo epidemiologi measures of assoiatio with full time to evet data Used most ofte: Relative risk Also defied: Risk differee Relative Risk (RR) Risk Differee (RD) RR t RD t t t t t 2 Relative risk differee: Proportioate ireased risk i exposed relative to otrols Attributable risk: Proportio of risk i exposed from exposure Populatio attributable risk: Proportio of disease i populatio attributable to prevalee of exposure p t Relative Risk Differee (RRD) Attributable Risk (AR) PopulatioAttributable Risk (PAR) RRD AR PAR t t t t t RR( t) t t t t RR( t pt t t pt t pt t pt RR t p RR t p t t t ) 22 Epidemiologi Study Desig I Epidemiologi Measures of Assoiatio with Biary Data Idetifyig Risk Fators ad Evaluatig Prevetive ad Therapeuti Agets Cross setioal surveys Populatio measured at a sigle poit i time May be aledar time, evet time, age Ca study prevalee of exposures ad prevalee of disease Ca estimate Pr [ D E ] ; Pr [ E D ] ; Pr [ E ] ; Pr[ D ] Whe iterested i evaluatig risk fators, suh desigs ofuse Disease iidee Disease ase fatality rate Mortality from ompetig risks Categorial Data Aalysis, AUT 23 6
7 Leture 3: Measures of Assoiatio tober 3, 23 Epidemiologi Study Desig II Measures of Assoiatio Cohort study Populatios idetified by levels of exposure at time Followed (prospetively or retrospetively) over speified time Ca study iidee rates ad umulative iidee of disease Ca estimate Pr [ D E ] fte, the time of follow-up is suffiietly short that iidee rate is (early) ostat Cumulative iidee is iidee rate times perso-years of follow-up ther times the use of idetial follow-up periods avoids ofoudig the determiatio of umulative iidee by a timevaryig iidee rate Eah group s umulative iidee will be averaged over the Quatifiatio of equality betwee two umbers Differee is Ratio is Comparig distributios for biary measures (e.g., disease) Differee i proportios Ratio of proportios same part of the iidee rate urve Measures of Assoiatio: Cohort Desigs Simple settig (or withi groups defied by ovariates) Exposed (E) vs o-exposed (E ) Diseased (D) vs o-diseased (D ) Parameter Spae Graph of all possible values for disease iidees Estimated iidee of disease withi eah exposure group Might be viewed as average iidee rate if divided by perso years of exposure Risk differee: RD = Pr [ D E ] Pr [ D E ] Iterpretable as exess proportio of disease Relative risk: RR = Pr [ D E ] / Pr [ D E ] f partiular use whe establishig a risk fator Categorial Data Aalysis, AUT 23 7
8 Leture 3: Measures of Assoiatio tober 3, 23 Cotours: Equal Risk Differee (RD) Graph of all possible values for disease iidees Cotours: Equal Relative Risk (RR) Graph of all possible values for disease iidees 29 3 Lookig Ahead: dds Ratios Limitatios o the rage of values for RD ad RR RD a be betwee - ad But if uexposed group already has high iidee, say,.75, the RD a oly be betwee -.75 ad.25 RR a be betwee ad ifiity But if uexposed group already has high iidee, say,.75, the RR a oly be betwee ad.33 dds Ratio dds of disease i uexposed dds of disease i exposed P P D P D P E D E D E E These fats argue that effet modifiatio might be regularly see whe usig RD or RR to summarize omparisos of risk The odds ratio has o suh limitatio The R a always be ay oegative umber, o matter the iidee i the uexposed populatio dds Ratio R P D E P D E P D P D E E P D, E P D, E P D, E P D, E 3 32 Categorial Data Aalysis, AUT 23 8
9 Leture 3: Measures of Assoiatio tober 3, 23 Cotours: Equal dds Ratio (R) Graph of all possible values for disease iidees Cotours: Equal RR vs Equal R Graph of all possible values for disease iidees Relative Risk dds Ratio Epidemiologi Study Desig III Case-otrol study Populatios idetified by disease status (aordig to eligibility riteria) Retrospetive follow-up to determie exposure status Ca study distributio of exposures withi disease groups Ca estimate Pr [ E D ] Mathig of otrols to same time / spae athmet of ases may avoid some ofoudig Use of odds ratio as a good approximatio for relative risk for rare disease 35 dds of dds of dds Ratio RE D Rare disease : dds Ratio from Case-Cotrol exposure i healthy exposure i disease E D E D R P E D P E D P E D P E D E D E D P P E P E P D E D D E D P D E P D E P D E P D E P D, E P D, E P D, E P D, E P D, E P D, E R RR 36 Categorial Data Aalysis, AUT 23 9
10 Leture 3: Measures of Assoiatio tober 3, 23 Summary Measures Two Sample Iferee with Biary Data Large Sample Iferee (Uesored Data) Comparig distributios of biary variables aross two groups Differee of proportios Most ommo Iferee based o differee of estimated proportios Ratio of proportios f most relevae with low probabilities fte atually use odds ratio dds ratio Large Sample Distributio With totally idepedet data, we use the Cetral Limit Theorem Proportios are meas Sample proportios are sample meas Stadard error estimates for eah group s estimated proportio based o the mea variae relatioship Asymptoti Samplig Dist Comparig two biomial proportios Suppose idepedet X, p X i X ~ B, p i, p Yi Y ~ B, p i We wat to make iferee about p p pˆ ~ B ad Y ~ B X p ( p) ~ N p, i i pˆ Y p ( p) ~ N p, 39 ˆ pˆ pˆ ~ N p p, p ( p ) p( p) 4 Categorial Data Aalysis, AUT 23
11 Leture 3: Measures of Assoiatio tober 3, 23 Asymptoti (Wald) Cofidee Itervals Cofidee iterval for differee betwee two biomial proportios We wat to make iferee about p ( )% ofidee iterval is ˆ z se ˆ z se Estimate / 2 se ˆ ˆ, ˆ pˆ pˆ / 2 ˆ pˆ ( pˆ ˆ ˆ ) p( p) p Asymptoti (Sore) Hypothesis Tests Test statisti for differee betwee two biomial proportios Compute variae usig plug-i omputed uder the ull Suppose we wat to test H Uder the ull hypothesis, Uder H Estimate ˆ : p ˆ the etire distributios are equal so se pˆ( pˆ) pˆ( pˆ) ˆ Z se p ~ N(,) X Y with pˆ 4 42 Elevator Statistis Need for Large Samples Well, early elevator statistis a m b d m N Note that beause the goodess of fit test is relyig o asymptoti properties, it is oly valid i large samples A ommoly used rule of thumb if that the expeted outs be greater tha 5 i the vast majority of the ells 2 ad b N m m 2 Z ad b N m m Categorial Data Aalysis, AUT 23
12 Leture 3: Measures of Assoiatio tober 3, 23 ther Large Sample Tests Likelihood Ratio Test Also has hi squared distributio i large samples But ot the same statisti as hi squared statisti Stata Commads: CI s respvar groupvar, level(#) Both variables must be oded as ad Respose will be alled ases ad oases CI a be foud uder Risk differee Chi squared statisti ad two-sided P value Good large sample properties fte most powerful tabulate respvar groupvar, row ol hi2 lr Row ad olum peretages Chi squared ad likelihood ratio P values Less ommoly used i 2 x 2 tables We will use it more ofte i logisti regressio Caveat: Sad Fat of Life Differet variae estimates are typially used for CI ad hypothesis tests Yates Corretio Historially, a otiuity orretio to the hi squared test to try to avoid its ati-oservatism i small samples We a see disagreemet betwee the olusio reahed by CI ad P value The P value might be less tha.5, but the CI otai The P value might be greater tha.5, but the CI exlude Ivertig a sore or LR test would alleviate some of this problem I kow of o software that does this All that was ahieved was gettig a test that behaves as poorly as the Fisher s exat test I heartily disreommed use of the otiuity orretio whe omparig two samples (There is a otiuity orretio used i oe sample Z tests that is useful, but exat distributios are eve better) Categorial Data Aalysis, AUT 23 2
13 Leture 3: Measures of Assoiatio tober 3, 23 Small Sample Distributio Two Sample Iferee with Biary Data The exat distributio for the differee i two proportios a ot be determied i geeral, beause of the mea variae relatioship Small Sample Iferee (Uesored) We eed to kow the value of the two proportios beig ompared i order to fid the exat distributio of the differee 49 5 Small Sample CI We have o way of obtaiig exat CI for the differee i proportios We ould osider all possible values of the two proportios, ad see whether a test would rejet eah ombiatio But the resultig joit ofidee iterval would ot always give the same deisio for equal differees E.g, it might rejet. ad.2, but ot.4 ad.5 Small Sample Tests We a, however, desribe the exat distributio of the data uder the ull hypothesis oditioal o all the margis of a otigey table A permutatio distributio We imagie radomly assigig observatios betwee the groups 5 52 Categorial Data Aalysis, AUT 23 3
14 Leture 3: Measures of Assoiatio tober 3, 23 Permutatio Idea Radomly permute m positives ad m egatives Call the first group ad the last group Repeat may times ad see how ofte group has a or more positives Group Respose a b d m m N Permutatio Tests I usually objet to permutatio distributios exept as a last resort They test equality of distributios, ot just equality of the populatio parameter Usually they are ot, however, guarateed to detet arbitrary differees betwee distributios eve i ifiite samples Coditio o values of m, m,, Permutatio with Biary Data However, with biary data, distributios are differet if ad oly if the proportios are differet Hee permutatio tests are okay for testig But still, we have o ofidee itervals beause we have ot quatified alteratives Small Sample Tests Coditioig o the margis fte oe margi is fixed by desig Cohort studies sample by exposure Case-otrol studies sample by disease I ay ase, it a be show that oe of the margi totals otribute iformatio about the differee i proportios Categorial Data Aalysis, AUT 23 4
15 Leture 3: Measures of Assoiatio tober 3, 23 Fisher s Exat Test Probability of more extreme otigey tables with the same margial totals Probabilities by hypergeometri distributio (Use a omputer) Stata Commads The Fisher s exat test P values are give by several ommads s respvar groupvar, exat tabulate respvar groupvar, exat e-sided ad two-sided P values are provided Respose Group a k k m b k d k m N Cosider all possible values of k Commets Problem Fisher s exat test does ot tur out to be a exat test i pratie We the fae a dilemma A problem is posed by the disrete ature of the data To ahieve the desired level.5 two-sided test, we would sometimes have to rejet the ull whe both groups had suesses With some results flip a biased oi to deide whether sigifiat Few people are willig to do this The hi squared test (Z test for proportios) may be atioservative i small samples Geerally so log as all ell outs i the otigey table are expeted to be greater tha 5 uder the ull hypothesis, we are K The Fisher s exat test is too oservative 59 6 Categorial Data Aalysis, AUT 23 5
16 Leture 3: Measures of Assoiatio tober 3, 23 Regios of Valid Iferee Use of hi square ad LR tests whe =3 i eah group Valid Use of the Uadjusted Chi Square Valid Use of the Uadjusted LR Nomial Type I Error Nomial Type I Error Alteratives Great improvemets i statistial power obtaied by modifyig either of those tests to ahieve as lose to the omial type I error without exeedig Several statistial pakages provide suh modified tests (e.g., StatExat) Stata provides exat logisti regressio for this settig I have suh futios i R p = p = p p = p = p 6 62 Modifiatios: Basi Idea Critial Regio Modifiatios Use the statisti Do t presume the lassial distributio Do t assume hi squared statisti has hi square distributio Do t assume Fisher s Exat P value has uiform distributio Cosider all possible values of p ommo to both groups, ad use exat distributio The take worst ase Categorial Data Aalysis, AUT 23 6
17 Leture 3: Measures of Assoiatio tober 3, 23 True Type I Error by Commo p Use of adjusted ad uadjusted tests for =3/arm ad =5/arm Gais i Power Power of uadjusted, adjusted level.5 tests Nomial Level.25 Nomial Level.25 Leged Atual Type I Error Atual Type I Error Adj Chi Square Uadj Chi Square Adj Fisher's Uadj Fisher's Adj Lklhd Ratio Uadj Lklhd Ratio p = p = p p = p = p Geeral Commets It is geerally immaterial whether the Fisher s exat test P value or the hi square statisti or likelihood ratio statisti is used as the basis for the exat test I ay ase, the ritial value is depedet upo the sample sizes Usig this approah, substatial improvemet i power is obtaied i low sample sizes I strogly reommed its use whe ofroted with small samples i real life 67 Categorial Data Aalysis, AUT 23 7
Chapter 8 Hypothesis Testing
Chapter 8 for BST 695: Speial Topis i Statistial Theory Kui Zhag, Chapter 8 Hypothesis Testig Setio 8 Itrodutio Defiitio 8 A hypothesis is a statemet about a populatio parameter Defiitio 8 The two omplemetary
More informationSample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.
ample ie Estimatio i the Proportioal Haards Model for K-sample or Regressio ettigs cott. Emerso, M.D., Ph.D. ample ie Formula for a Normally Distributed tatistic uppose a statistic is kow to be ormally
More informationOne way Analysis of Variance (ANOVA)
Oe way Aalysis of Variae (ANOVA) ANOVA Geeral ANOVA Settig"Slide 43-45) Ivestigator otrols oe or more fators of iterest Eah fator otais two or more levels Levels a be umerial or ategorial ifferet levels
More information(Dependent or paired samples) Step (1): State the null and alternate hypotheses: Case1: One-tailed test (Right)
(epedet or paired samples) Step (1): State the ull ad alterate hypotheses: Case1: Oe-tailed test (Right) Upper tail ritial (where u1> u or u1 -u> 0) H0: 0 H1: > 0 Case: Oe-tailed test (Left) Lower tail
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationChapter 8: Estimating with Confidence
Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationFormulas and Tables for Gerstman
Formulas ad Tables for Gerstma Measuremet ad Study Desig Biostatistics is more tha a compilatio of computatioal techiques! Measuremet scales: quatitative, ordial, categorical Iformatio quality is primary
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationWhat is a Hypothesis? Hypothesis is a statement about a population parameter developed for the purpose of testing.
What is a ypothesis? ypothesis is a statemet about a populatio parameter developed for the purpose of testig. What is ypothesis Testig? ypothesis testig is a proedure, based o sample evidee ad probability
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationAgreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times
Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationNonparametric Goodness-of-Fit Tests for Discrete, Grouped or Censored Data 1
Noparametri Goodess-of-Fit Tests for Disrete, Grouped or Cesored Data Boris Yu. Lemeshko, Ekateria V. Chimitova ad Stepa S. Kolesikov Novosibirsk State Tehial Uiversity Departmet of Applied Mathematis
More informationBernoulli Numbers. n(n+1) = n(n+1)(2n+1) = n(n 1) 2
Beroulli Numbers Beroulli umbers are amed after the great Swiss mathematiia Jaob Beroulli5-705 who used these umbers i the power-sum problem. The power-sum problem is to fid a formula for the sum of the
More informationMath Third Midterm Exam November 17, 2010
Math 37 1. Treibergs σιι Third Midterm Exam Name: November 17, 1 1. Suppose that the mahie fillig mii-boxes of Fruitlad Raisis fills boxes so that the weight of the boxes has a populatio mea µ x = 14.1
More information1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationThe beta density, Bayes, Laplace, and Pólya
The beta desity, Bayes, Laplae, ad Pólya Saad Meimeh The beta desity as a ojugate form Suppose that is a biomial radom variable with idex ad parameter p, i.e. ( ) P ( p) p ( p) Applyig Bayes s rule, we
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationANOTHER PROOF FOR FERMAT S LAST THEOREM 1. INTRODUCTION
ANOTHER PROOF FOR FERMAT S LAST THEOREM Mugur B. RĂUŢ Correspodig author: Mugur B. RĂUŢ, E-mail: m_b_raut@yahoo.om Abstrat I this paper we propose aother proof for Fermat s Last Theorem (FLT). We foud
More informationFinal Review. Fall 2013 Prof. Yao Xie, H. Milton Stewart School of Industrial Systems & Engineering Georgia Tech
Fial Review Fall 2013 Prof. Yao Xie, yao.xie@isye.gatech.edu H. Milto Stewart School of Idustrial Systems & Egieerig Georgia Tech 1 Radom samplig model radom samples populatio radom samples: x 1,..., x
More informationChapter 1 (Definitions)
FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More informationProbability & Statistics Chapter 8
I. Estimatig with Large Samples Probability & Statistis Poit Estimate of a parameter is a estimate of a populatio parameter give by a sigle umber. Use x (the sample mea) as a poit estimate for (the populatio
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationBig Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.
5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece
More informationSTK4011 and STK9011 Autumn 2016
STK4 ad STK9 Autum 6 ypothesis testig Covers (most of) the followig material from hapter 8: Setio 8. Setios 8.. ad 8..3 Setio 8.3. Setio 8.3. (util defiitio 8.3.6) Ørulf Borga Departmet of Mathematis Uiversity
More information1 Models for Matched Pairs
1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationInferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.
Iferetial Statistics ad Probability a Holistic Approach Iferece Process Chapter 8 Poit Estimatio ad Cofidece Itervals This Course Material by Maurice Geraghty is licesed uder a Creative Commos Attributio-ShareAlike
More informationMATH/STAT 352: Lecture 15
MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationExpectation and Variance of a random variable
Chapter 11 Expectatio ad Variace of a radom variable The aim of this lecture is to defie ad itroduce mathematical Expectatio ad variace of a fuctio of discrete & cotiuous radom variables ad the distributio
More information10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random
Part III. Areal Data Aalysis 0. Comparative Tests amog Spatial Regressio Models While the otio of relative likelihood values for differet models is somewhat difficult to iterpret directly (as metioed above),
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationPower and Type II Error
Statistical Methods I (EXST 7005) Page 57 Power ad Type II Error Sice we do't actually kow the value of the true mea (or we would't be hypothesizig somethig else), we caot kow i practice the type II error
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationConfidence Intervals for the Population Proportion p
Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:
More informationObserver Design with Reduced Measurement Information
Observer Desig with Redued Measuremet Iformatio I pratie all the states aot be measured so that SVF aot be used Istead oly a redued set of measuremets give by y = x + Du p is available where y( R We assume
More informationData Analysis and Statistical Methods Statistics 651
Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationImportant Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.
Importat Formulas Chapter 3 Data Descriptio Mea for idividual data: X = _ ΣX Mea for grouped data: X= _ Σf X m Stadard deviatio for a sample: _ s = Σ(X _ X ) or s = 1 (Σ X ) (Σ X ) ( 1) Stadard deviatio
More informationCommon Large/Small Sample Tests 1/55
Commo Large/Small Sample Tests 1/55 Test of Hypothesis for the Mea (σ Kow) Covert sample result ( x) to a z value Hypothesis Tests for µ Cosider the test H :μ = μ H 1 :μ > μ σ Kow (Assume the populatio
More information2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2
Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:
More informationLecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.
STT 315, Summer 006 Lecture 5 Materials Covered: Chapter 6 Suggested Exercises: 67, 69, 617, 60, 61, 641, 649, 65, 653, 66, 663 1 Defiitios Cofidece Iterval: A cofidece iterval is a iterval believed to
More informationUniversity of California, Los Angeles Department of Statistics. Hypothesis testing
Uiversity of Califoria, Los Ageles Departmet of Statistics Statistics 100B Elemets of a hypothesis test: Hypothesis testig Istructor: Nicolas Christou 1. Null hypothesis, H 0 (claim about µ, p, σ 2, µ
More informationBasic Probability/Statistical Theory I
Basi Probability/Statistial Theory I Epetatio The epetatio or epeted values of a disrete radom variable X is the arithmeti mea of the radom variable s distributio. E[ X ] p( X ) all Epetatio by oditioig
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationChapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationSequences. Notation. Convergence of a Sequence
Sequeces A sequece is essetially just a list. Defiitio (Sequece of Real Numbers). A sequece of real umbers is a fuctio Z (, ) R for some real umber. Do t let the descriptio of the domai cofuse you; it
More informationModule 1 Fundamentals in statistics
Normal Distributio Repeated observatios that differ because of experimetal error ofte vary about some cetral value i a roughly symmetrical distributio i which small deviatios occur much more frequetly
More informationEECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1
EECS564 Estimatio, Filterig, ad Detectio Hwk 2 Sols. Witer 25 4. Let Z be a sigle observatio havig desity fuctio where. p (z) = (2z + ), z (a) Assumig that is a oradom parameter, fid ad plot the maximum
More information11 Correlation and Regression
11 Correlatio Regressio 11.1 Multivariate Data Ofte we look at data where several variables are recorded for the same idividuals or samplig uits. For example, at a coastal weather statio, we might record
More informationChapter 13, Part A Analysis of Variance and Experimental Design
Slides Prepared by JOHN S. LOUCKS St. Edward s Uiversity Slide 1 Chapter 13, Part A Aalysis of Variace ad Eperimetal Desig Itroductio to Aalysis of Variace Aalysis of Variace: Testig for the Equality of
More informationRead through these prior to coming to the test and follow them when you take your test.
Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationSTA6938-Logistic Regression Model
Dr. Yig Zhag STA6938-Logistic Regressio Model Topic -Simple (Uivariate) Logistic Regressio Model Outlies:. Itroductio. A Example-Does the liear regressio model always work? 3. Maximum Likelihood Curve
More informationApril 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE
April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationEstimation for Complete Data
Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More information32 estimating the cumulative distribution function
32 estimatig the cumulative distributio fuctio 4.6 types of cofidece itervals/bads Let F be a class of distributio fuctios F ad let θ be some quatity of iterest, such as the mea of F or the whole fuctio
More informationSTAT 515 fa 2016 Lec Sampling distribution of the mean, part 2 (central limit theorem)
STAT 515 fa 2016 Lec 15-16 Samplig distributio of the mea, part 2 cetral limit theorem Karl B. Gregory Moday, Sep 26th Cotets 1 The cetral limit theorem 1 1.1 The most importat theorem i statistics.............
More information- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion
1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter
More informationChapter 11: Asking and Answering Questions About the Difference of Two Proportions
Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationChapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p
Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret
More informationCEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering
CEE 5 Autum 005 Ucertaity Cocepts for Geotechical Egieerig Basic Termiology Set A set is a collectio of (mutually exclusive) objects or evets. The sample space is the (collectively exhaustive) collectio
More informationε > 0 N N n N a n < ε. Now notice that a n = a n.
4 Sequees.5. Null sequees..5.. Defiitio. A ull sequee is a sequee (a ) N that overges to 0. Hee, by defiitio of (a ) N overges to 0, a sequee (a ) N is a ull sequee if ad oly if ( ) ε > 0 N N N a < ε..5..
More informationSummary. Recap ... Last Lecture. Summary. Theorem
Last Lecture Biostatistics 602 - Statistical Iferece Lecture 23 Hyu Mi Kag April 11th, 2013 What is p-value? What is the advatage of p-value compared to hypothesis testig procedure with size α? How ca
More informationCastiel, Supernatural, Season 6, Episode 18
13 Differetial Equatios the aswer to your questio ca best be epressed as a series of partial differetial equatios... Castiel, Superatural, Seaso 6, Episode 18 A differetial equatio is a mathematical equatio
More information1.010 Uncertainty in Engineering Fall 2008
MIT OpeCourseWare http://ocw.mit.edu.00 Ucertaity i Egieerig Fall 2008 For iformatio about citig these materials or our Terms of Use, visit: http://ocw.mit.edu.terms. .00 - Brief Notes # 9 Poit ad Iterval
More informationSection 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis
Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests
More informationLinear Regression Models
Liear Regressio Models Dr. Joh Mellor-Crummey Departmet of Computer Sciece Rice Uiversity johmc@cs.rice.edu COMP 528 Lecture 9 15 February 2005 Goals for Today Uderstad how to Use scatter diagrams to ispect
More informationResponse Variable denoted by y it is the variable that is to be predicted measure of the outcome of an experiment also called the dependent variable
Statistics Chapter 4 Correlatio ad Regressio If we have two (or more) variables we are usually iterested i the relatioship betwee the variables. Associatio betwee Variables Two variables are associated
More informationPackage diffdepprop. February 19, 2015
Type Pakage Pakage diffdepprop Feruary 19, 2015 Title Calulates Cofidee Itervals for two Depedet Proportios Versio 0.1-9 Date 2013-05-03 Author Maitaier Daiela Wezel The pakage iludes
More informationDirection: This test is worth 150 points. You are required to complete this test within 55 minutes.
Term Test 3 (Part A) November 1, 004 Name Math 6 Studet Number Directio: This test is worth 10 poits. You are required to complete this test withi miutes. I order to receive full credit, aswer each problem
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More informationComparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading
Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual
More information(7 One- and Two-Sample Estimation Problem )
34 Stat Lecture Notes (7 Oe- ad Two-Sample Estimatio Problem ) ( Book*: Chapter 8,pg65) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye Estimatio 1 ) ( ˆ S P i i Poit estimate:
More informationProbability and statistics: basic terms
Probability ad statistics: basic terms M. Veeraraghava August 203 A radom variable is a rule that assigs a umerical value to each possible outcome of a experimet. Outcomes of a experimet form the sample
More informationCOMP26120: Introducing Complexity Analysis (2018/19) Lucas Cordeiro
COMP60: Itroduig Complexity Aalysis (08/9) Luas Cordeiro luas.ordeiro@mahester.a.uk Itroduig Complexity Aalysis Textbook: Algorithm Desig ad Appliatios, Goodrih, Mihael T. ad Roberto Tamassia (hapter )
More informationPSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9
Hypothesis testig PSYCHOLOGICAL RESEARCH (PYC 34-C Lecture 9 Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS
PART of UNIVERSITY OF TORONTO Faculty of Arts ad Sciece APRIL/MAY 009 EAMINATIONS ECO0YY PART OF () The sample media is greater tha the sample mea whe there is. (B) () A radom variable is ormally distributed
More informationSTAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)
STAT 350 Hadout 9 Samplig Distributio, Cetral Limit Theorem (6.6) A radom sample is a sequece of radom variables X, X 2,, X that are idepedet ad idetically distributed. o This property is ofte abbreviated
More informationLecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 5: Parametric Hypothesis Testig: Comparig Meas GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review from last week What is a cofidece iterval? 2 Review from last week What is a cofidece
More informationLecture 8: Non-parametric Comparison of Location. GENOME 560, Spring 2016 Doug Fowler, GS
Lecture 8: No-parametric Compariso of Locatio GENOME 560, Sprig 2016 Doug Fowler, GS (dfowler@uw.edu) 1 Review What do we mea by oparametric? What is a desirable locatio statistic for ordial data? What
More informationStatistical Fundamentals and Control Charts
Statistical Fudametals ad Cotrol Charts 1. Statistical Process Cotrol Basics Chace causes of variatio uavoidable causes of variatios Assigable causes of variatio large variatios related to machies, materials,
More informationIntroductory statistics
CM9S: Machie Learig for Bioiformatics Lecture - 03/3/06 Itroductory statistics Lecturer: Sriram Sakararama Scribe: Sriram Sakararama We will provide a overview of statistical iferece focussig o the key
More informationComputing Confidence Intervals for Sample Data
Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios
More information