Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning

Size: px
Start display at page:

Download "Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning"

Transcription

1 Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK Advanced Statistical Methods in Epidemiology February 4-6,

2 Overview 1. Cohort Studies with Similar Observation Time 2. Cohort Studies with Individual, Different Observation Time 3. Case-Control Studies: Unmatched Situation 4. Case-Control Studies: Matched Situation 2

3 1. Cohort Studies with Similar Observation Time Situation in the population: Case Non-Case Exposed p 1 1-p 1 Nonexposed p 0 1-p 0 interest in: RR = p p 1 0 3

4 Situation in the sample: Case Non-Case At Risk Exposed Y 1 n 1 - Y 1 n 1 Nonexposed Y 0 n 0 - Y 0 n 0 p = p : Interest in estimating RR 1 0 RR ^ = Y 1/n 1 Y 0 /n 0 4

5 Example: Radiation Exposure and Cancer Occurrence Case Non-Case At Risk Exposed Nonexposed RR ^ = 52/2872 6/5049 = =

6 Tests and Confidence Intervals Estimated Variance of log(rr^ ): ^ Var ( log RR^ ) = 1/Y1-1/n 1 + 1/Y 0-1/n 0 Estimated Standard Error of log(rr^ ): SE^ (log RR^ ) = 1/Y 1-1/n 1 + 1/Y 0-1/n 0 For the above example: ^ Var ( log RR^ ) = 1/52-1/ /6-1/5049 = SE^ (log RR^ ) =

7 H 0 : RR= 1 or log(rr) = 0 H 1 : H 0 is false Testing Statistic used for testing: Z = log( RR ^ )/ SE^ (log RR ^ ) Z is approx. standard normally distributed if H 0 true Test with Significance level 5%: reject H 0 if Z > 1.96 accept H 0 if Z 1.96 For the example: Z = log(15.24)/ =

8 Confidence Interval 95%-CI covers with 95% confidence the true log (RR): log( RR ^ ) ± 1.96 SE^ (log RR ^ ) For the example: log(15.24) ± = (1.8801, ) and back to the relative risk scale: (exp(1.8801), exp(3.5677) ) = (6.55, 35.43) 8

9 In STATA Exposed Unexposed Total Cases Noncases Total Risk Point estimate [95% Conf. Interval] Risk difference Risk ratio Attr. frac. ex Attr. frac. pop chi2(1) = Pr>chi2 =

10 Potential Confounding and Stratification with Respect to the Confounder Situation: Explanation? Exposed Non-Exposed Stratum Case Non- Case Case Non-Case RR Total

11 A more realistic example: Drinking Coffee and CHD Exposed (coffee) Non-Exposed Stratum Case Non- Case Case Non-Case RR Smoker Non-S Total

12 How to diagnose confounding? Stratify! Situation: Exposed Non-Exposed Stratum Case Non-Case Case Non-Case RR 1 (1) Y 1 n (1) (1) 1 - Y 1 (1) Y 0 n (1) (1) 0 - Y 0 2 (2) Y 1 n (2) (2) 1 - Y 1 (2) Y 0 n (2) (2) 1 - Y 0 k (k) Y 1 n (k) (k) 1 - Y 1 (k) Y 0 n (k) (k) 1 - Y 0 RR (1) RR (2) RR (k) Total Y 1 n 1 - Y 1 Y 0 n 1 - Y 0 RR 12

13 How should the RR be estimated? Use an average of stratum-specific weights: RR ^ = w 1 RR ^ (1) + + w k RR ^ (k) /(w 1 + +w k ) Which weights? 13

14 with n (i) = n 0 (i) + n 1 (i). Good Properties! Mantel-Haenszel Approach RR ^ MH= Y 1 (1) n (1) 0 /n (1) + + Y (k) 1 n (k) 0 /n (k) Y (1) 0 n (1) 1 /n (1) + + Y (k) 0 n (k) 1 /n (k) Mantel-Haenszel Weight: w i = Y 0 (i) n 1 (i) /n (i) w 1 RR ^ (1) + + w k RR^ (k) /(w 1 + +w k ) = RR^ MH 14

15 Illustration of the MH-weights Exposed Non-Exposed Stratum Case Non- Case Case Non-Case w i *150/ *1010/

16 Stratum Case Exposure obs In STATA Stratum RR [95% Conf. Interval] M-H Weight Crude M-H combined Test of homogeneity (M-H) chi2(1) = Pr>chi2 =

17 Illustration: Coffee-CHD-Data Case Exposure Smoking freque~y

18 Smoking RR [95% Conf. Interval] M-H Weight Crude M-H combined Test of homogeneity (M-H) chi2(1) = Pr>chi2 =

19 Inflation, Masking and Effect Modification Inflation (Confounding): Crude RR is larger (in absolute value) than stratified RR Masking (Confounding): Crude RR is smaller (in absolute value) than stratified RR Effect Modification: Crude Rate is in between stratified RR 19

20 How can these situations be diagnosed? Use heterogeneity or homogeneity test: Homogeneity Hypothesis H 0 : RR (1) = RR (2) = =RR (k) H 1 : H 0 is wrong Teststatistic: k i i 2 2 χ( k 1) i= 1 () () = (logrr log RR ) /Var(log MH RR ) 20

21 Illustration of the Heterogeneity Test for CHD-Coffee Exposed Non-Exposed Stratum Case Non- Case Case Non-Case χ 2 Smoke Non Smoke Total

22 Test of homogeneity (M-H) chi2(1) = Pr>chi2 = Smoking RR [95% Conf. Interval] M-H Weight Crude M-H combined

23 Situation: Cohort Studies with Individual, different Observation Time Event-Risk Person-Time At Risk Exposed p 1 T 1 n 1 Nonexposed p 0 T 0 n 0 Definition: Person-Time is the time that n persons spend under risk in the study period 23

24 Interest in: RR = p 1 /p 0 Situation: Events Person-Time At Risk Exposed Y 1 T 1 n 1 Nonexposed Y 0 T 0 n 0 RR ^ = Y 1/T 1 Y 0 /T 0 Y/T is also called the incidence density (ID)! 24

25 Example: Smoking Exposure and CHD Occurrence Events Person-Time ID (Events per 10,000 PYs) Exposed Nonexposed RR ^ = 206/ /5710 = =

26 Tests and Confidence Intervals Estimated Variance of log(rr ^ ^ ^ ) = log( ID 1 / ID0 ): ^ Var ( log RR^ ) = 1/Y1 + 1/Y 0 Estimated Standard Error of log(rr^ ): SE^ (log RR^ ) = 1/Y 1 + 1/Y 0 For the above example: ^ Var ( log RR^ ) = 1/206 +1/28 = SE^ (log RR^ ) =

27 Testing H 0 : RR= 1 or log(rr) = 0 H 1 : H 0 is false Statistic used for testing: Z = log( RR ^ )/ SE^ (log RR ^ ) Z is approx. normally distributed if H 0 true: Test with Significance level 5%: reject H 0 if Z > 1.96 accept H 0 if Z 1.96 For the example: Z = log(1.47)/ =

28 Confidence Interval 95%-CI covers with 95% confidence the true log (RR): log( RR ^ ) ± 1.96 SE^ (log RR ^ ) For the example: log(1.47) ± = ( , ) and back to the relative risk scale: (exp( ),exp(0.7798) ) = (0.99, 2.18) 28

29 In STATA Exposed Unexposed Total Cases Person-time Incidence Rate Point estimate [95% Conf. Interval] Inc. rate diff Inc. rate ratio (exact) Attr. frac. ex (exact) Attr. frac. pop (midp) Pr(k>=206) = (exact) (midp) 2*Pr(k>=206) = (exact) 29

30 Stratification with Respect to a Potential Confounder Example: energy intake (as surrogate measure for physical inactivity) and Ischaemic Heart Disease Exposed (<2750 kcal) Non-Exposed ( 2750 kcal) Stratum Cases P-Time Cases P-Time RR Total

31 Situation: Exposed Non-Exposed Stratum Cases P-Time Cases P-Time RR 1 (1) Y 1 (1) T 1 (1) Y 0 (1) T 0 2 (2) Y 1 (2) T 1 (2) Y 0 (2) T 0 k (k) Y 1 (k) T 1 (k) Y 0 (k) T 0 RR (1) RR (2) RR (k) Total Y 1 T 1 Y 0 T 0 RR 31

32 How should the RR be estimated? Use an average of stratum-specific weights: Which weights? with T (i) = T 0 (i) + T 1 (i). RR ^ = w (1) w k RR ^ (k) /(w 1 + +w k ) Mantel-Haenszel Approach RR ^ MH= Y 1 (1) T (1) 0 /T (1) + + Y (k) 1 T (k) 0 /T (k) Y (1) 0 T (1) 1 /T (1) + + Y (k) 0 T (k) 1 /T (k) Mantel-Haensel Weight: w i = Y 0 (i) T 1 (i) /T (i) w 1 RR ^ (1) + + w k RR^ (k) /(w 1 + +w k ) = RR^ MH 32

33 In STATA Stratum Exposure number~e Person~e Stratum IRR [95% Conf. Interval] M-H Weight (exact) (exact) (exact) Crude (exact) M-H combined Test of homogeneity (M-H) chi2(2) = 1.57 Pr>chi2 =

34 2. Case-Control Studies: Unmatched Situation Situation: Case Controls Exposed q 1 q 0 Nonexposed 1-q 1 1-q 0 Interest is in: RR = p 1 /p 0 which is not estimable not in RR e = q 1 /q 0 34

35 Illustration with a Hypo-Population: Bladder-Ca Healthy Smoking , ,000 Non-smoke , , ,000 1,000,000 RR = p 1 /p 0 = 4 = 2.504= 5/ /9990 =q 1/q 0 = RR e 35

36 However, consider the (disease) Odds Ratio defined as OR = p 1/(1-p 1 ) p 0 /(1-p 0 ) Pr(D/E) = p 1, Pr(D/NE) = p 0, Pr(E/D) = q 1, Pr(E/ND) = q 0, p = Pr(D) 36

37 p 1 = P(D/E) using Bayes Theorem p 0 = P(D/NE) = = Pr(E/D)Pr(D) Pr(E/D)Pr(D)+ Pr(E/ND)Pr(ND) = Pr(NE/D)Pr(D) Pr(NE/D)Pr(D)+ Pr(NE/ND)Pr(ND) = q 1 p q 1 p + q 0 (1-p) (1-q 1 ) p (1-q 1 ) p + (1 q 0 )(1-p) p 1 /(1-p 1 ) = q 1 p/q 0 (1-p) und p 0 /(1-p 0 ) = [(1-q 1 )p]/[(1-q 0 )(1-p)]. it follows that OR = p 1/(1-p 1 ) p 0 /(1-p 0 ) = q 1 /q 0 (1-q 1 )/(1-q 0 ) = q 1/(1-q 1,) q 0 /(1-q 0 ) = OR e Disease Odds Ratio = Exposure Odds Ratio 37

38 Illustration with a Hypo-Population: Bladder-Ca Healthy Smoking , ,000 Non-smoke , , ,000 1,000,000 OR = (500/199,500)/(500/799,500) = (500/500)/(199,500/799,500) = OR e = Also, if disease occurrence is low (low prevalence), OR RR 38

39 Estimation of OR Situation: Case Controls Exposed X 1 X 0 Nonexposed m 1 -X 1 m 0 -X 0 m 1 m 0 OR ^ = q ^ ^ 1 /(1-q1 ) ^ ^ /(1-q0 ) q 0 = X 1/(m 1 -X 1 ) X 0 /(m 0 -X 0 ) = X 1(m 0 -X 0 ) X 0 (m 1 -X 1 ) 39

40 Example: Sun Exposure and Lip Cancer Occurrence in Population of year old men Case Controls Exposed Nonexposed OR ^ = =

41 Tests and Confidence Intervals Estimated Variance of log(or ^ ): ^ Var ^ 1 ( log OR ) = X m 1 - X X m 0 - X 0 Estimated Standard Error of log(or ^ ): SE^ (log OR ^ ) = 1 X m 1 - X X + 0 m 0 - X 0 For the above example: ^ ^ Var ( log OR ) = 1/66 + 1/27 +1/14 + 1/15 = SE^ (log OR ^ ) =

42 Testing H 0 : OR= 1 or log(or) = 0 H 1 : H 0 is false Statistic used for testing: Z = log(or ^ )/ SE^ (log OR ^ ) Z is approx. normally distributed if H 0 true: Test with Significance level 5%: reject H 0 if Z > 1.96 accept H 0 if Z 1.96 For the example: Z = log(2.619)/ =

43 Confidence Interval 95%-CI covers with 95% confidence the true log (RR): log(or ^ ) ± 1.96 SE^ (log OR ^ ) For the example: log(2.619) ± = (0.1078, ) and back to the relative risk scale: (exp(0.1078),exp(1.8177) ) = (1.11, 6.16) 43

44 In STATA. Proportion Exposed Unexposed Total Exposed Cases Controls Total Point estimate [95% Conf. Interval] Odds ratio (Woolf) Attr. frac. ex (Woolf) Attr. frac. pop chi2(1) = 4.22 Pr>chi2 = Exercise: A case-control study investigates if a keeping a pet bird is a risk factor: Cases: 98 Bird Owners, 141 None, Controls: 101 Bird Owners, 328 None 44

45 Potential Confounding and Stratification with Respect to the Confounder Situation: Lip-Cancer Sun- Exposure Smoking 45

46 Lip-Cancer and Sun Exposure with Smoking as Potential Confounder Cases Controls Stratum Exposed Non- Exp. Exp. Non- Exp. OR Smoke Non- Smoke Total Explanation? 46

47 How to diagnose confounding? Stratify! Situation: Cases Controls Cases Stratuposeposed Ex- Non-Exp. Ex- Non-Exp. OR 1 (1) X 1 m (1) (1) 1 - X 1 (1) X 0 m (1) (1) 0 - X 0 OR (1) 2 (2) X 1 m (2) (2) 1 - X 1 (2) X 0 m (2) (2) 1 - X 0 OR (2) k (k) X 1 m (k) (k) 1 - X 1 (k) X 0 m (k) (k) 1 - X 0 OR (k) Total X 1 m 1 - X 1 X 0 m 1 - X 0 OR How should the OR based upon stratification be estimated? 47

48 Use an average of stratum-specific weights: Which weights? OR ^ = w 1 OR ^ (1) + + w k OR ^ (k) /(w 1 + +w k ) Mantel-Haenszel Weight: w i = X 0 (i) (m 1 (i) -X 1 (i) )/ m (i) Mantel-Haenszel Approach OR ^ MH= X 1 (1) (m 0 (1) -X 0 (1) ) /m (1) + + X 1 (k) (m 0 (k) -X 0 (k) )/m (1) X 0 (1) (m 1 (1) -X 1 (1) )/ m (1) + + X 1 (1) (m 0 (1) -X 0 (1) )/ m (1) with m (i) = m 0 (i) + m 1 (i). w 1 OR ^ (1) + + w k OR ^ (k) /(w 1 + +w k ) = OR ^ MH 48

49 Illustration of the MH-weights Cases Controls Stratum Exposed Non- Exp. Exp. Non- Exp. w i Smoke *24/91 Non- Smoke *3/31 49

50 In STATA Case Exposure Smoke Pop cc Case Control [freq=pop], by(smoke) Smoke OR [95% Conf. Interval] M-H Weight 50

51 (exact) (exact) Crude (exact) M-H combined Test of homogeneity (M-H) chi2(1) = 0.01 Pr>chi2 = Test that combined OR = 1: Mantel-Haenszel chi2(1) = 6.96 Pr>chi2 = Note that freq=pop is optional, e.g. raw data can be used with this analysis 51

52 Inflation, Masking and Effect Modification Inflation (Confounding): Crude OR is larger (in absolute value) than stratified OR Masking (Confounding): Crude OR is smaller (in absolute value) than stratified OR Effect Modification: Crude Rate is in between stratified OR How can these situations be diagnosed? Use heterogeneity or homogeneity test: Homogeneity Hypothesis H 0 : OR (1) = OR (2) = =OR (k) H 1 : H 0 is wrong k i i 2 2 χ( k 1) i= 1 () () = (logor log OR ) / Var (log MH OR ) 52

53 Illustration of the Heterogeneity Test for Lip Cancer -Sun Exposure Cases Controls Stratum Exposed Non- Exp. Exp. Non- Exp. χ 2 Smoke Non Smoke Total

54 D E stratum freq

55 stratum OR [95% Conf. Interval] M-H Weight (exact) (exact) Crude (exact) M-H combined Test of homogeneity (M-H) chi2(1) = 0.01 Pr>chi2 = Test that combined OR = 1: Mantel-Haenszel chi2(1) = 6.96 Pr>chi2 =

56 3. Case-Control Studies: Matched Situation Given a case is sampled, a comparable control is sampled: comparable w.r.t. matching criteria Examples of matching criteria are age, gender, SES, etc. Matched pairs sampling is more elaborate: to be effective often a two stage sampling of controls is done: first stage, controls are sampled as in the unmatched case; second stage, from the sample of controls. strata are built according to the matching criteria from which the matched controls are sampled Result: data consist of pairs: (Case,Control) 56

57 Because of the design the case-control study the data are no longer two independent samples of the diseased and the healthy population, but rather one independent sample of the diseased population, and a stratified sample of the healthy population, stratified by the matching variable as realized for the case Case 1 (40 ys, man) Control 1 (40 ys, man) Case 2 (33 ys, wom) Control 2 (33 ys, wom). Because of the design of the matched case-control study, stratified analysis is most appropriate with each pair defining a stratum What is the principal structure of a pair? 57

58 Four Situations a) Case Control exposed 1 1 non-exposed 2 b) Case Control exposed 1 non-exposed

59 c) Case Control exposed 1 non-exposed 1 2 d) Case Control exposed non-exposed

60 How many pairs of each type? Four frequencies a pairs of type a) Case Control exposed 1 1 non-exposed 2 b pairs of type b) Case Control exposed 1 non-exposed

61 c pairs of type c) Case Control exposed 1 non-exposed 1 d pairs of type d) Case Control exposed non-exposed

62 OR ^ MH= X 1 (1) (m 0 (1) -X 0 (1) ) /m (1) + + X 1 (k) (m 0 (k) -X 0 (k) )/m (1) X 0 (1) (m 1 (1) -X 1 (1) )/ m (1) + + X 1 (1) (m 0 (1) -X 0 (1) )/ m (1) = a 1 0 /2 + b 1 1 /2 +c 0 0 /2 + d 0 1 /2 a 0 1 /2 + b 0 0 /2 +c 1 1 /2 + d 1 0 /2 = = b/c # pairs with case exposed and control unexposed # pairs with case unexposed and controlexposed In a matched case-control study, the Mantel-Haenszel odds ratio is estimated by the ratio of the frequency of pairs with case exposed and control unexposed to the frequency of pairs with case unexposed and control exposed: 62

63 (typical presentation of paired studies) Case Control exposed unexposed exposed a b a+b unexposed c d c+d a+c b+d OR ^ (conventional, unadjusted) = (a+b)(b+d) (a+c)(c+d) OR ^ MH = b/c (ratio of discordant pairs) 63

64 Example: Reye-Syndrome and Aspirin Intake Case Control exposed unexposed exposed unexposed OR ^ (conventional, unadjusted) = (a+b)(b+d) (a+c)(c+d) = = 7.90 OR ^ MH = b/c (ratio of discordant pairs) = 57/5 = 11.4 Cleary, for the inference only discordant pairs are required! Therefore, inference is done conditional upon discordant pairs 64

65 What is the probability that a pair is of type (Case exposed, Control unexposed) given it is discordant? π = Pr ( Case E, Control NE pair is discordant) = P(Case E, Control NE) / P(pair is discordant) = P(Case E, Control NE) / P(Case E, Control NE or Case NE, Control E) = q 1 (1-q 0 )/[ q 1 (1-q 0 ) + (1-q 1 )q 0 ] = q 1(1-q 0 ) (1-q 1 )q /( q 1(1-q 0 ) 0 (1-q 1 )q +1 ) = OR/ (OR+1) 0 65

66 How can I estimate π? π^ = frequency of pairs: Case E; Control NE frequency of all discordant pairs = b/(b+c) now, π = OR/(OR+1) or OR = π/(1-π) How can I estimate OR? OR ^ = π^ /(1-π^ ) = (b/(b+c) / (1- b/(b+c)) = b/c which corresponds to the Mantel-Haenszel-estimate used before! 66

67 Testing and CI Estimation H 0 : OR = 1 or π = OR/(OR+1) = ½ H 1 : H 0 is false since π^ is a proportion estimator its estimated standard error is: SE of π^ : π (1-π)/m = Null-Hpyothesis = ½ 1/m where m=b+c (number of discordant pairs) 67

68 Teststatistic: Z = (π^ - ½ )/ (½ 1/m ) = b+c (2 b/(b+c) 1) = (b-c)/ b+c and χ 2 = Z 2 = (b-c) 2 /(b+c) is McNemar s Chi-Square test statistic! In the example: χ 2 = (57-5) 2 /62 =

69 Confidence Interval (again using π) π^ ± 1.96 SE^ (π^ ) = π^ ± 1.96 π^ (1-π^ )/m and, to get Odds Ratios, use transform. OR = π/(1-π): ^ π^ ± 1.96 π (1-π^ )/m 1- π^ ± 1.96 π^ (1-π^ )/m to provide a 95% CI for the Odds Ratio! 69

70 In the Example, π^ = 57/62 = , π^ ± 1.96 π^ (1-π^ )/m = ± = (0.8516, ) leading to the 95%-CI for the Odds Ratio: [0.8516/( ), /( ) ] = [5.7375, ] 70

71 In Stata: Controls Cases Exposed Unexposed Total. Exposed Unexposed Total McNemar's chi2(1) = Prob > chi2 = Exact McNemar significance probability = Proportion with factor Cases.945 Controls.685 [95% Conf. Interval] difference ratio rel. diff odds ratio (exact) 71

Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio

Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK March 3-5,

More information

Lecture 5: Poisson and logistic regression

Lecture 5: Poisson and logistic regression Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 3-5 March 2014 introduction to Poisson regression application to the BELCAP study introduction

More information

Suppose that we are concerned about the effects of smoking. How could we deal with this?

Suppose that we are concerned about the effects of smoking. How could we deal with this? Suppose that we want to study the relationship between coffee drinking and heart attacks in adult males under 55. In particular, we want to know if there is an association between coffee drinking and heart

More information

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time

More information

Statistical Modelling with Stata: Binary Outcomes

Statistical Modelling with Stata: Binary Outcomes Statistical Modelling with Stata: Binary Outcomes Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 21/11/2017 Cross-tabulation Exposed Unexposed Total Cases a b a + b Controls

More information

Tests for Two Correlated Proportions in a Matched Case- Control Design

Tests for Two Correlated Proportions in a Matched Case- Control Design Chapter 155 Tests for Two Correlated Proportions in a Matched Case- Control Design Introduction A 2-by-M case-control study investigates a risk factor relevant to the development of a disease. A population

More information

Lecture 2: Poisson and logistic regression

Lecture 2: Poisson and logistic regression Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 11-12 December 2014 introduction to Poisson regression application to the BELCAP study introduction

More information

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval Epidemiology 9509 Wonders of Biostatistics Chapter 13 - Effect Measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. risk factors 2. risk

More information

Part IV Statistics in Epidemiology

Part IV Statistics in Epidemiology Part IV Statistics in Epidemiology There are many good statistical textbooks on the market, and we refer readers to some of these textbooks when they need statistical techniques to analyze data or to interpret

More information

Lecture 12: Effect modification, and confounding in logistic regression

Lecture 12: Effect modification, and confounding in logistic regression Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression

More information

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval

Epidemiology Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures. John Koval Epidemiology 9509 Principle of Biostatistics Chapter 14 - Dependent Samples and effect measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered

More information

Case-control studies

Case-control studies Matched and nested case-control studies Bendix Carstensen Steno Diabetes Center, Gentofte, Denmark b@bxc.dk http://bendixcarstensen.com Department of Biostatistics, University of Copenhagen, 8 November

More information

Statistical Methods in Epidemiologic Research. EP 521 Spring 2006 Course Notes Vol I (Part 1 of 5)

Statistical Methods in Epidemiologic Research. EP 521 Spring 2006 Course Notes Vol I (Part 1 of 5) EP 521, Spring 2006, Vol I, Part 1 Statistical Methods in Epidemiologic Research 1 EP 521 Spring 2006 Course Notes Vol I (Part 1 of 5) A. Russell Localio*, and Jesse A Berlin ( The Great Master ) *Department

More information

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis 2014 Maternal and Child Health Epidemiology Training Pre-Training Webinar: Friday, May 16 2-4pm Eastern Kristin Rankin,

More information

Journal of Biostatistics and Epidemiology

Journal of Biostatistics and Epidemiology Journal of Biostatistics and Epidemiology Methodology Marginal versus conditional causal effects Kazem Mohammad 1, Seyed Saeed Hashemi-Nazari 2, Nasrin Mansournia 3, Mohammad Ali Mansournia 1* 1 Department

More information

Meta-analysis of epidemiological dose-response studies

Meta-analysis of epidemiological dose-response studies Meta-analysis of epidemiological dose-response studies Nicola Orsini 2nd Italian Stata Users Group meeting October 10-11, 2005 Institute Environmental Medicine, Karolinska Institutet Rino Bellocco Dept.

More information

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies.

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies. Matched and nested case-control studies Bendix Carstensen Steno Diabetes Center, Gentofte, Denmark http://staff.pubhealth.ku.dk/~bxc/ Department of Biostatistics, University of Copengen 11 November 2011

More information

Modelling Rates. Mark Lunt. Arthritis Research UK Epidemiology Unit University of Manchester

Modelling Rates. Mark Lunt. Arthritis Research UK Epidemiology Unit University of Manchester Modelling Rates Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 05/12/2017 Modelling Rates Can model prevalence (proportion) with logistic regression Cannot model incidence in

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial

More information

Chapter Six: Two Independent Samples Methods 1/51

Chapter Six: Two Independent Samples Methods 1/51 Chapter Six: Two Independent Samples Methods 1/51 6.3 Methods Related To Differences Between Proportions 2/51 Test For A Difference Between Proportions:Introduction Suppose a sampling distribution were

More information

Tests for the Odds Ratio in a Matched Case-Control Design with a Quantitative X

Tests for the Odds Ratio in a Matched Case-Control Design with a Quantitative X Chapter 157 Tests for the Odds Ratio in a Matched Case-Control Design with a Quantitative X Introduction This procedure calculates the power and sample size necessary in a matched case-control study designed

More information

Correlation and regression

Correlation and regression 1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,

More information

PB HLTH 240A: Advanced Categorical Data Analysis Fall 2007

PB HLTH 240A: Advanced Categorical Data Analysis Fall 2007 Cohort study s formulations PB HLTH 240A: Advanced Categorical Data Analysis Fall 2007 Srine Dudoit Division of Biostatistics Department of Statistics University of California, Berkeley www.stat.berkeley.edu/~srine

More information

Online supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population

Online supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population Online supplement Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in Breathlessness in the General Population Table S1. Comparison between patients who were excluded or included

More information

One-stage dose-response meta-analysis

One-stage dose-response meta-analysis One-stage dose-response meta-analysis Nicola Orsini, Alessio Crippa Biostatistics Team Department of Public Health Sciences Karolinska Institutet http://ki.se/en/phs/biostatistics-team 2017 Nordic and

More information

STA6938-Logistic Regression Model

STA6938-Logistic Regression Model Dr. Ying Zhang STA6938-Logistic Regression Model Topic 2-Multiple Logistic Regression Model Outlines:. Model Fitting 2. Statistical Inference for Multiple Logistic Regression Model 3. Interpretation of

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Effect measures ) GY Zou gzou@robarts.ca We have discussed inference procedures for 2 2 tables in the context of comparing two groups. Yes No Group 1 a b n 1 Group 2

More information

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies

Section 9c. Propensity scores. Controlling for bias & confounding in observational studies Section 9c Propensity scores Controlling for bias & confounding in observational studies 1 Logistic regression and propensity scores Consider comparing an outcome in two treatment groups: A vs B. In a

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

Sociology 362 Data Exercise 6 Logistic Regression 2

Sociology 362 Data Exercise 6 Logistic Regression 2 Sociology 362 Data Exercise 6 Logistic Regression 2 The questions below refer to the data and output beginning on the next page. Although the raw data are given there, you do not have to do any Stata runs

More information

1 Comparing two binomials

1 Comparing two binomials BST 140.652 Review notes 1 Comparing two binomials 1. Let X Binomial(n 1,p 1 ) and ˆp 1 = X/n 1 2. Let Y Binomial(n 2,p 2 ) and ˆp 2 = Y/n 2 3. We also use the following notation: n 11 = X n 12 = n 1 X

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

10: Crosstabs & Independent Proportions

10: Crosstabs & Independent Proportions 10: Crosstabs & Independent Proportions p. 10.1 P Background < Two independent groups < Binary outcome < Compare binomial proportions P Illustrative example ( oswege.sav ) < Food poisoning following church

More information

Statistical and psychometric methods for measurement: G Theory, DIF, & Linking

Statistical and psychometric methods for measurement: G Theory, DIF, & Linking Statistical and psychometric methods for measurement: G Theory, DIF, & Linking Andrew Ho, Harvard Graduate School of Education The World Bank, Psychometrics Mini Course 2 Washington, DC. June 27, 2018

More information

Case-control studies C&H 16

Case-control studies C&H 16 Case-control studies C&H 6 Bendix Carstensen Steno Diabetes Center & Department of Biostatistics, University of Copenhagen bxc@steno.dk http://bendixcarstensen.com PhD-course in Epidemiology, Department

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

Lecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

Lecture 25. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University Lecture 25 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University November 24, 2015 1 2 3 4 5 6 7 8 9 10 11 1 Hypothesis s of homgeneity 2 Estimating risk

More information

Additive and multiplicative models for the joint effect of two risk factors

Additive and multiplicative models for the joint effect of two risk factors Biostatistics (2005), 6, 1,pp. 1 9 doi: 10.1093/biostatistics/kxh024 Additive and multiplicative models for the joint effect of two risk factors A. BERRINGTON DE GONZÁLEZ Cancer Research UK Epidemiology

More information

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) B.H. Robbins Scholars Series June 23, 2010 1 / 29 Outline Z-test χ 2 -test Confidence Interval Sample size and power Relative effect

More information

Cohen s s Kappa and Log-linear Models

Cohen s s Kappa and Log-linear Models Cohen s s Kappa and Log-linear Models HRP 261 03/03/03 10-11 11 am 1. Cohen s Kappa Actual agreement = sum of the proportions found on the diagonals. π ii Cohen: Compare the actual agreement with the chance

More information

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression Section IX Introduction to Logistic Regression for binary outcomes Poisson regression 0 Sec 9 - Logistic regression In linear regression, we studied models where Y is a continuous variable. What about

More information

Lecture 24. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

Lecture 24. Ingo Ruczinski. November 24, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University November 24, 2015 1 2 3 4 5 1 Odds ratios for retrospective studies 2 Odds ratios approximating the

More information

Logistic Regression. Building, Interpreting and Assessing the Goodness-of-fit for a logistic regression model

Logistic Regression. Building, Interpreting and Assessing the Goodness-of-fit for a logistic regression model Logistic Regression In previous lectures, we have seen how to use linear regression analysis when the outcome/response/dependent variable is measured on a continuous scale. In this lecture, we will assume

More information

Lecture 8: Summary Measures

Lecture 8: Summary Measures Lecture 8: Summary Measures Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 8:

More information

Lecture 25: Models for Matched Pairs

Lecture 25: Models for Matched Pairs Lecture 25: Models for Matched Pairs Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture

More information

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION

More information

Lecture 23. November 15, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Lecture 23. November 15, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Lecture 4: Generalized Linear Mixed Models

Lecture 4: Generalized Linear Mixed Models Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 11-12 December 2014 An example with one random effect An example with two nested random effects

More information

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs STAT 5500/6500 Conditional Logistic Regression for Matched Pairs Motivating Example: The data we will be using comes from a subset of data taken from the Los Angeles Study of the Endometrial Cancer Data

More information

Lab 8. Matched Case Control Studies

Lab 8. Matched Case Control Studies Lab 8 Matched Case Control Studies Control of Confounding Technique for the control of confounding: At the design stage: Matching During the analysis of the results: Post-stratification analysis Advantage

More information

Marginal, crude and conditional odds ratios

Marginal, crude and conditional odds ratios Marginal, crude and conditional odds ratios Denitions and estimation Travis Loux Gradute student, UC Davis Department of Statistics March 31, 2010 Parameter Denitions When measuring the eect of a binary

More information

More Statistics tutorial at Logistic Regression and the new:

More Statistics tutorial at  Logistic Regression and the new: Logistic Regression and the new: Residual Logistic Regression 1 Outline 1. Logistic Regression 2. Confounding Variables 3. Controlling for Confounding Variables 4. Residual Linear Regression 5. Residual

More information

Three-Way Contingency Tables

Three-Way Contingency Tables Newsom PSY 50/60 Categorical Data Analysis, Fall 06 Three-Way Contingency Tables Three-way contingency tables involve three binary or categorical variables. I will stick mostly to the binary case to keep

More information

Review. More Review. Things to know about Probability: Let Ω be the sample space for a probability measure P.

Review. More Review. Things to know about Probability: Let Ω be the sample space for a probability measure P. 1 2 Review Data for assessing the sensitivity and specificity of a test are usually of the form disease category test result diseased (+) nondiseased ( ) + A B C D Sensitivity: is the proportion of diseased

More information

Lab #11. Variable B. Variable A Y a b a+b N c d c+d a+c b+d N = a+b+c+d

Lab #11. Variable B. Variable A Y a b a+b N c d c+d a+c b+d N = a+b+c+d BIOS 4120: Introduction to Biostatistics Breheny Lab #11 We will explore observational studies in today s lab and review how to make inferences on contingency tables. We will only use 2x2 tables for today

More information

Unit 3. Discrete Distributions

Unit 3. Discrete Distributions BIOSTATS 640 - Spring 2016 3. Discrete Distributions Page 1 of 52 Unit 3. Discrete Distributions Chance favors only those who know how to court her - Charles Nicolle In many research settings, the outcome

More information

STAT 705: Analysis of Contingency Tables

STAT 705: Analysis of Contingency Tables STAT 705: Analysis of Contingency Tables Timothy Hanson Department of Statistics, University of South Carolina Stat 705: Analysis of Contingency Tables 1 / 45 Outline of Part I: models and parameters Basic

More information

Dependent Nondifferential Misclassification of Exposure

Dependent Nondifferential Misclassification of Exposure Dependent Nondifferential Misclassification of Exposure DISCLAIMER: I am REALLY not an expert in data simulations or misclassification Outline Relevant definitions Review of implications of dependent nondifferential

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Agenda Quoted in Breslow 1996.

Agenda Quoted in Breslow 1996. EPIB 634 Survival Analysis & Related Topics: Regression models for "case-control" and matched studies 1 Agenda Quoted in Breslow 1996. History, case-control methods.. up to "modern" times Unmatched c-c

More information

Asymptotic equivalence of paired Hotelling test and conditional logistic regression

Asymptotic equivalence of paired Hotelling test and conditional logistic regression Asymptotic equivalence of paired Hotelling test and conditional logistic regression Félix Balazard 1,2 arxiv:1610.06774v1 [math.st] 21 Oct 2016 Abstract 1 Sorbonne Universités, UPMC Univ Paris 06, CNRS

More information

2 Describing Contingency Tables

2 Describing Contingency Tables 2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random

More information

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH The First Step: SAMPLE SIZE DETERMINATION THE ULTIMATE GOAL The most important, ultimate step of any of clinical research is to do draw inferences;

More information

Analysis of categorical data S4. Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands

Analysis of categorical data S4. Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands Analysis of categorical data S4 Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands m.hauptmann@nki.nl 1 Categorical data One-way contingency table = frequency table Frequency (%)

More information

BIOS 625 Fall 2015 Homework Set 3 Solutions

BIOS 625 Fall 2015 Homework Set 3 Solutions BIOS 65 Fall 015 Homework Set 3 Solutions 1. Agresti.0 Table.1 is from an early study on the death penalty in Florida. Analyze these data and show that Simpson s Paradox occurs. Death Penalty Victim's

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

Measures of Association and Variance Estimation

Measures of Association and Variance Estimation Measures of Association and Variance Estimation Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 35

More information

The sophisticated use and understanding of case-control studies is the most important methodologic development of Unmatched c-c study

The sophisticated use and understanding of case-control studies is the most important methodologic development of Unmatched c-c study EPIB 681: Regression models for "case-control" and matched studies 1 Agenda Quoted in Breslow 1996. History, case-control methods.. up to "modern" times The sophisticated use and understanding of case-control

More information

Statistik for MPH: september Analyse af kohortestudier, rater, standardisering (Silva: 70-80,

Statistik for MPH: september Analyse af kohortestudier, rater, standardisering (Silva: 70-80, Statistik for MPH: 3 21. september 2017 www.biostat.ku.dk/~pka/mph17 Analyse af kohortestudier, rater, standardisering (Silva: 70-80, 110-133.) Per Kragh Andersen 1 Fra den. 2. uges statistikundervisning:

More information

Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations)

Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations) Lecture 15 (Part 2): Logistic Regression & Common Odds Ratio, (With Simulations) Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology

More information

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs

STAT 5500/6500 Conditional Logistic Regression for Matched Pairs STAT 5500/6500 Conditional Logistic Regression for Matched Pairs The data for the tutorial came from support.sas.com, The LOGISTIC Procedure: Conditional Logistic Regression for Matched Pairs Data :: SAS/STAT(R)

More information

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings Yujin Chung October 4th, 2016 Fall 2016 Yujin Chung Lec6: Statistical hypothesis testings Fall 2016 1/30 Previous Two types of statistical

More information

Multivariate Dose-Response Meta-Analysis: the dosresmeta R Package

Multivariate Dose-Response Meta-Analysis: the dosresmeta R Package Multivariate Dose-Response Meta-Analysis: the dosresmeta R Package Alessio Crippa Karolinska Institutet Nicola Orsini Karolinska Institutet Abstract An increasing number of quantitative reviews of epidemiological

More information

Lecture 5: ANOVA and Correlation

Lecture 5: ANOVA and Correlation Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions

More information

Epidemiology Wonders of Biostatistics Chapter 11 (continued) - probability in a single population. John Koval

Epidemiology Wonders of Biostatistics Chapter 11 (continued) - probability in a single population. John Koval Epidemiology 9509 Wonders of Biostatistics Chapter 11 (continued) - probability in a single population John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being

More information

Introduction to the Analysis of Tabular Data

Introduction to the Analysis of Tabular Data Introduction to the Analysis of Tabular Data Anthropological Sciences 192/292 Data Analysis in the Anthropological Sciences James Holland Jones & Ian G. Robertson March 15, 2006 1 Tabular Data Is there

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

4.1 Example: Exercise and Glucose

4.1 Example: Exercise and Glucose 4 Linear Regression Post-menopausal women who exercise less tend to have lower bone mineral density (BMD), putting them at increased risk for fractures. But they also tend to be older, frailer, and heavier,

More information

Chapter 10: Inferences based on two samples

Chapter 10: Inferences based on two samples November 16 th, 2017 Overview Week 1 Week 2 Week 4 Week 7 Week 10 Week 12 Chapter 1: Descriptive statistics Chapter 6: Statistics and Sampling Distributions Chapter 7: Point Estimation Chapter 8: Confidence

More information

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE ORIGINAL ARTICLE Marginal Structural Models as a Tool for Standardization Tosiya Sato and Yutaka Matsuyama Abstract: In this article, we show the general relation between standardization methods and marginal

More information

Modelling Binary Outcomes 21/11/2017

Modelling Binary Outcomes 21/11/2017 Modelling Binary Outcomes 21/11/2017 Contents 1 Modelling Binary Outcomes 5 1.1 Cross-tabulation.................................... 5 1.1.1 Measures of Effect............................... 6 1.1.2 Limitations

More information

Sampling approaches in animal health studies. Al Ain November 2017

Sampling approaches in animal health studies. Al Ain November 2017 Sampling approaches in animal health studies Al Ain 13-14 November 2017 Definitions Populations and Samples From a statistical point of view: Population is a set of measurements, that may be finite or

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software MMMMMM YYYY, Volume VV, Code Snippet II. http://www.jstatsoft.org/ Multivariate Dose-Response Meta-Analysis: the dosresmeta R Package Alessio Crippa Karolinska Institutet

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence ST3241 Categorical Data Analysis I Two-way Contingency Tables Odds Ratio and Tests of Independence 1 Inference For Odds Ratio (p. 24) For small to moderate sample size, the distribution of sample odds

More information

Chapter 11: Analysis of matched pairs

Chapter 11: Analysis of matched pairs Chapter 11: Analysis of matched pairs Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 42 Chapter 11: Models for Matched Pairs Example: Prime

More information

Logistic Regression. Fitting the Logistic Regression Model BAL040-A.A.-10-MAJ

Logistic Regression. Fitting the Logistic Regression Model BAL040-A.A.-10-MAJ Logistic Regression The goal of a logistic regression analysis is to find the best fitting and most parsimonious, yet biologically reasonable, model to describe the relationship between an outcome (dependent

More information

Does low participation in cohort studies induce bias? Additional material

Does low participation in cohort studies induce bias? Additional material Does low participation in cohort studies induce bias? Additional material Content: Page 1: A heuristic proof of the formula for the asymptotic standard error Page 2-3: A description of the simulation study

More information

Missing covariate data in matched case-control studies: Do the usual paradigms apply?

Missing covariate data in matched case-control studies: Do the usual paradigms apply? Missing covariate data in matched case-control studies: Do the usual paradigms apply? Bryan Langholz USC Department of Preventive Medicine Joint work with Mulugeta Gebregziabher Larry Goldstein Mark Huberman

More information

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

EC2001 Econometrics 1 Dr. Jose Olmo Room D309 EC2001 Econometrics 1 Dr. Jose Olmo Room D309 J.Olmo@City.ac.uk 1 Revision of Statistical Inference 1.1 Sample, observations, population A sample is a number of observations drawn from a population. Population:

More information

Introduction to Inferential Statistics. Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University

Introduction to Inferential Statistics. Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University Introduction to Inferential Statistics Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University 1 Data & Variables 2 Types of Data QUALITATIVE Data expressed by type Data that has been

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Unit 5 Logistic Regression

Unit 5 Logistic Regression BIOSTATS 640 - Spring 2018 5. Logistic Regression Page 1 of 66 Unit 5 Logistic Regression To all the ladies present and some of those absent - Jerzy Neyman What behaviors influence the chances of developing

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

Sampling Techniques. Esra Akdeniz. February 9th, 2016

Sampling Techniques. Esra Akdeniz. February 9th, 2016 Sampling Techniques Esra Akdeniz February 9th, 2016 HOW TO DO RESEARCH? Question. Literature research. Hypothesis. Collect data. Analyze data. Interpret and present results. HOW TO DO RESEARCH? Collect

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

STA6938-Logistic Regression Model

STA6938-Logistic Regression Model Dr. Ying Zhang STA6938-Logistic Regression Model Topic 6-Logistic Regression for Case-Control Studies Outlines: 1. Biomedical Designs 2. Logistic Regression Models for Case-Control Studies 3. Logistic

More information

Lecture 8 Stat D. Gillen

Lecture 8 Stat D. Gillen Statistics 255 - Survival Analysis Presented February 23, 2016 Dan Gillen Department of Statistics University of California, Irvine 8.1 Example of two ways to stratify Suppose a confounder C has 3 levels

More information

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<= A frequency distribution is a kind of probability distribution. It gives the frequency or relative frequency at which given values have been observed among the data collected. For example, for age, Frequency

More information

Regression Models for Risks(Proportions) and Rates. Proportions. E.g. [Changes in] Sex Ratio: Canadian Births in last 60 years

Regression Models for Risks(Proportions) and Rates. Proportions. E.g. [Changes in] Sex Ratio: Canadian Births in last 60 years Regression Models for Risks(Proportions) and Rates Proportions E.g. [Changes in] Sex Ratio: Canadian Births in last 60 years Parameter of Interest: (male)... just above 0.5 or (male)/ (female)... "ODDS"...

More information