Statistik for MPH: september Analyse af kohortestudier, rater, standardisering (Silva: 70-80,

Size: px
Start display at page:

Download "Statistik for MPH: september Analyse af kohortestudier, rater, standardisering (Silva: 70-80,"

Transcription

1 Statistik for MPH: september Analyse af kohortestudier, rater, standardisering (Silva: 70-80, ) Per Kragh Andersen 1

2 Fra den. 2. uges statistikundervisning: skulle jeg gerne 1. forstå at den relative risiko er et fornuftigt mål for association mellem determinant og udfald i kohorteundersøgelser 2. kunne beregne relativ risiko med tilhørende sikkerhedsgrænser ud fra en given tabel 3. kunne beregne chi-square teststørrelsen for hypotesen: relativ risiko=1 ud fra en given tabel 4. så småt begynde at vænne mig til begreberne: nulhypotese, teststørrelse, fordeling af teststørrelse, P -værdi, signifikansniveau, accept/forkastelse af hypotese 5. forstå at der er en tæt sammenhæng mellem sikkerhedsgrænser og hypotesetest 2

3 Fra den. 2. uges statistikundervisning: behøver jeg derimod ikke nødvendigvis 1. at have forstået, hvordan formlen for SD(ln R) fremkommer 2. at have forstået, hvordan standard afvigelsen i nævneren af teststørrelsen fremkommer 3. at have forstået, hvorfor teststørrelsens værdi netop skal slås op i χ 2 -tabellen med 1. frihedsgrad 4. at have forstået alle begreberne nævnt under 4. ovenfor til bunds 3

4 Cohort studies Classical example, The Evans County Study: 609 white men, aged 40-76, followed for 7 years from CHD CHL>260mg/100mL Yes No Total Yes No Total Summarizing data in this way, it has been assumed that everyone has been followed for 7 years - this is more an exception than the rule: People leave the study due to loss of follow-up People may enter the study at different time points We wish to take the varying times at risk into account. 4

5 The 7-year CHD-risk ( cumulative incidence ) expresses the proportion of the cohort that has a CHD-event during the 7 years of follow-up. It is, therefore, the 5-year risk and the 10-year risk Alternative frequency measure: the rate, r(t) expressing risk per time unit at time t ( speed at t). For h > 0 (small): r(t) Prob(Disease before time t + h given no disease time t)/h. We estimate a constant rate as (Silva, p.63): r = number of (CHD-) events obs. total person-time at risk = a y. Alternative names: incidence rate, incidence density, hazard function, hazard rate. 5

6 Calculation of person-years Calculation of person-years of observation for six individuals (see also Silva, p.62). Date of birth Date observation Date observation Death began ended (0=no, 1=yes) Jan. 4, 1897 July 11, 1954 Dec. 31, Sept. 5, 1884 Aug. 3, 1954 Nov. 25, Dec. 16, 1904 Oct. 25, 1954 Dec 31, Jan. 16, 1899 Nov. 1, 1954 Dec. 31, Apr. 9, 1912 Feb. 19, 1957 Dec. 31, Feb. 22, 1910 Dec. 8, 1957 Aug. 18, Years Months Days Years

7 r = years = per year = 51 per 1000 years Note that while the risk is between 0 and 1 (it is a probability), the rate has no upper limit (the value depends on the time unit used). Interpretation of a rate of r = per year: If we observe a cohort where the rate has this value and somehow collect 1000 person years of observation, then we expect to see 51 events. Some times: approximations are used for person-years when dates are unavailable (see, e.g. Silva p.71). What does the fraction 2 6 tell us? Not much! If it is to be thought of as a risk estimate then it is quite unclear to which follow-up period it corresponds. 7

8 Confidence interval for a rate Better (and more complicated) method: 1. Calculate l = ln(r) 2. Calculate L 2 = l a Calculate L 1 = l a 3. The desired 95% confidence limits are: from exp(l 1 ) to exp(l 2 ) Example: l = ln(0.051) = 2.98 L 2 = = 1.59 L 1 = = % confidence limits from exp( 4.36) = per year to exp( 1.59) = per year. 8

9 Relation between rate and risk If the rate is (constant) = r (per year) what is then the T -year risk (if there are no other competing events)? h = T N h = T N h = T N 0 T Probability of no event before T = (1 r h) (1 r h) (1 r h) = (1 r h) N = ( 1 r T N ) N exp( r T ). That is: T -year risk = 1 exp( rt ). If the cumulative rate rt is small then T -year risk rt (Silva, p.77). 9

10 Example Assume that r = per year and consider a follow-up period of T = 5 years. The T -year risk is then 1 exp( rt ) = 1 exp( ) = 1 exp( 0.255) = The corresponding cumulative rate is r T = = Note that: 1 exp( rt ) = < rt = Note also that it is quite smart that, given a (constant) rate, it is possible to estimate the risk for any chosen follow-up period, T. 10

11 Corresponding measure of association RATE RATIO, some times called Relative risk which is bad terminology, but not totally silly if the disease is rare, because then: Risk 1 Rate 1 T Risk 2 Rate 2 T = Rate 1. Rate 2 95% confidence limits for rate ratio, RR: 1. Calculate ln(rr) 2. Calculate L 2 = ln(rr) L 1 = ln(rr) The 95% confidence limits are: from exp(l 1 ) to exp(l 2 ) 1 a a 2 1 a a 2 11

12 Example: Nurses health study (p. 179) BC cases Person-years Age Group OC Non-OC OC Non-OC a 1 = 204 a 2 = 240 y 1 = y 2 = Table 8.12: Cohort study of breast cancer (BC) and oral contraceptive (OC) use. OC users Non-OC users r O = r N = = per year l O =ln( ) l N =ln( ) = = L 2 = L 2 = = L 1 = L 1 = exp(l 1 )= per year exp(l 2 )= per year exp(l 1 )= per year exp(l 2 )= per year 12

13 Example (contd.) RR = = ln(rr) = L 2 = = L 1 = = exp(l 1 ) = 0.964, exp(l 2 ) =

14 Alternative measure of association: rate difference The rate difference is r 2 r 1 and is estimated by a 2 y 2 a 1 y 1 with SD(r 2 r 1 ) = a1 y a 2 y2 2. In the example this is r O r N = per year (or 30 per years) with 95% confidence interval from 204 r O r N = = (per year) to 204 r O r N = = (per year). 14

15 Exercise: 1. Calculate the rates for stomach cancer (SC) in Cali and Birmingham (with 95% confidence limits) based on the table in Silva, p Calculate the rate ratio with 95% confidence limits 3. Calculate the chi-square test (to be defined in the following) for comparing the two rates. 15

16 Solution (1) SC cases Person-years Cali Birmingham Cali Birmingham r C = = per year r B = ln(r C ) = ln(r B ) = L 2 = L = = L 1 = L = = exp(l 1 )= per year exp(l 2 )= per year exp(l 1 )= per year exp(l 2 )= per year = per year 16

17 Solution (2) RR = = 0.59, ln(rr) = L 2 = = 0.447, 1 L 1 = = 0.618, exp(l 1 ) = 0.54, exp(l 2 ) =

18 Test for comparing two rates Nurses health study: Events (BC) Person-years OC-users 204 (a 1 ) (y 1 ) Non-OC-users 240 (a 2 ) (y 2 ) 444 (a) (y) Here: a = a 1 + a 2, y = y 1 + y 2. 18

19 Mantel-Haenszel idea Compare: OBServed = 204 = a 1 with : EXPected = = y 1 a y = 187.6, SD(a 1 ) = = a y1 y 2 y y = Test statistic = ( OBS EXP SD ) 2 ( = ) = The chi-square table (1 d.f.) gives 0.10 < P <

20 Solution to exercise (3) OBS = EXP = = SD = = 27.0 ( ) 2 OBS EXP = chi-square (1 d.f.) SD P <

21 And now for something completely different 21

22 How can we adjust/standardize for age when comparing two groups? Two types of standardization exist: DIRECT cf. Silva INDIRECT p Direct : Compare weighted averages of the age-specific rates in the two groups using some standard age-distribution as weights Indirect : In each group, compare the observed number of events with what one would have expected, had the group been exposed to some standard age-specific rates. Other ways of adjustment (also for other factors than age): STRATIFIED REGRESSION ANALYSIS 22

23 Example: 1970 US mortality data California (a) Maine (b) Age Pop.in No.of Rate per Pop.in No.of Rate per 1000 deaths 1000 ys deaths 1000 ys. < Total (Kahn & Sempos, 1989) The total rates are often denoted crude rates, and the values are: (a) California: = 8.3 per 1000 ys. (b) Maine: = 11.1 per 1000 ys. leading to RR =

24 The age distributions in California and Maine differ so the apparent difference may be (partly) ascribed to this. California Maine Age Pop. in 1000 % Pop. in 1000 % < Total

25 Example: 1970 US mortality data United States Age Pop.in Weight No.of Rate per 1000 deaths 1000 ys. < Total Directly standardized rates: DSR a = = 8.8 per 1000 ys. DSR b = = 9.9 per 1000 ys. leading to RR =

26 Example from Silva, p.71 Male stomach cancer cases: Cali Birmingham Age Popu- Person- No.of Rate per Popu- Person- No.of Rate per lation years cancers ys. lation years cancers ys Total Age distributions (%): Age Cali Birmingham Standard Total

27 The age distributions in Cali and Birhingham differ so the difference between the crude rates may be (partly) ascribed to this. Directly standardized rates: DSR C = = DSR B = = (both per ys.) 27

28 Indirect standardization (a) California: Obs. number of deaths = Expected number of deaths = = Standardized mortality ratio: SMR a = = = (b) Maine: Obs. = Exp. = SMR b = = =

29 If one wants to express the results as rates then indirectly standardized rates may be computed: ISR a = SMR a = 8.8 per 1000 ys., ISR b = SMR b = 9.9 per 1000 ys., where per year=9.4 per 1000 ys. is the overall crude rate in the standard population. Silva (p. 74) illustrates indirect standardization by computing the number of cancer cases expected in Cali if the age-specific rates from Birmingham apply. 29

30 Discussion Standardization is a classical technique for age-adjustment. For comparing groups statistically, stratified (and regression) analysis (to be discussed later) is superior. The average calculated in direct standardization frequently provides an over-simplification for analysis (but it may be OK for graphical displays etc.). Indirect standardization (SMR) may be used for comparing a sample with standard rates (for this, we need a confidence interval, see below) but for comparing two (or more) groups (samples) stratified (and regression) analysis is better. 30

31 Confidence interval for SMR Best method (using ln!): 1. Calculate ln(sm R) 2. Calculate ln(smr) OBS = L 2 ln(smr) OBS = L 1 3. The desired 95% confidence limits are from exp(l 1 ) to exp(l 2 ) Example: Maine compared to US: 1. ln(1.050) = = L 2 = 0.068, = L 1 = exp(l 1 ) = 1.030, exp(l 2 ) = Exercise: California! 31

32 Solution SM R for California: 1. ln(smr) = = L 1 = = L 2 = = exp(l 1 ) = exp(l 2 ) =

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time

More information

2011/04 LEUKAEMIA IN WALES Welsh Cancer Intelligence and Surveillance Unit

2011/04 LEUKAEMIA IN WALES Welsh Cancer Intelligence and Surveillance Unit 2011/04 LEUKAEMIA IN WALES 1994-2008 Welsh Cancer Intelligence and Surveillance Unit Table of Contents 1 Definitions and Statistical Methods... 2 2 Results 7 2.1 Leukaemia....... 7 2.2 Acute Lymphoblastic

More information

GAMINGRE 8/1/ of 7

GAMINGRE 8/1/ of 7 FYE 09/30/92 JULY 92 0.00 254,550.00 0.00 0 0 0 0 0 0 0 0 0 254,550.00 0.00 0.00 0.00 0.00 254,550.00 AUG 10,616,710.31 5,299.95 845,656.83 84,565.68 61,084.86 23,480.82 339,734.73 135,893.89 67,946.95

More information

Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning

Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK Advanced Statistical

More information

Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio

Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio Lecture 3: Measures of effect: Risk Difference Attributable Fraction Risk Ratio and Odds Ratio Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK March 3-5,

More information

Subject CT4 Models. October 2015 Examination INDICATIVE SOLUTION

Subject CT4 Models. October 2015 Examination INDICATIVE SOLUTION Institute of Actuaries of India Subject CT4 Models October 2015 Examination INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the aim of helping candidates.

More information

Where Was Mars At Your Birth?

Where Was Mars At Your Birth? Where Was Mars At Your Birth? This chart will make it easy for you to determine your Mars sign. We ve listed each of the dates that Mars enters a new sign. If you were born after June 11, 1950, when Mars

More information

E509A: Principle of Biostatistics. GY Zou

E509A: Principle of Biostatistics. GY Zou E509A: Principle of Biostatistics (Effect measures ) GY Zou gzou@robarts.ca We have discussed inference procedures for 2 2 tables in the context of comparing two groups. Yes No Group 1 a b n 1 Group 2

More information

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES Cox s regression analysis Time dependent explanatory variables Henrik Ravn Bandim Health Project, Statens Serum Institut 4 November 2011 1 / 53

More information

Suppose that we are concerned about the effects of smoking. How could we deal with this?

Suppose that we are concerned about the effects of smoking. How could we deal with this? Suppose that we want to study the relationship between coffee drinking and heart attacks in adult males under 55. In particular, we want to know if there is an association between coffee drinking and heart

More information

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) B.H. Robbins Scholars Series June 23, 2010 1 / 29 Outline Z-test χ 2 -test Confidence Interval Sample size and power Relative effect

More information

2 Describing Contingency Tables

2 Describing Contingency Tables 2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random

More information

Confidence Intervals. Contents. Technical Guide

Confidence Intervals. Contents. Technical Guide Technical Guide Confidence Intervals Contents Introduction Software options Directory of methods 3 Appendix 1 Byar s method 6 Appendix χ exact method 7 Appendix 3 Wilson Score method 8 Appendix 4 Dobson

More information

Part IV Statistics in Epidemiology

Part IV Statistics in Epidemiology Part IV Statistics in Epidemiology There are many good statistical textbooks on the market, and we refer readers to some of these textbooks when they need statistical techniques to analyze data or to interpret

More information

Means or "expected" counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 giv

Means or expected counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 giv Measures of Association References: ffl ffl ffl Summarize strength of associations Quantify relative risk Types of measures odds ratio correlation Pearson statistic ediction concordance/discordance Goodman,

More information

REPORT ON LABOUR FORECASTING FOR CONSTRUCTION

REPORT ON LABOUR FORECASTING FOR CONSTRUCTION REPORT ON LABOUR FORECASTING FOR CONSTRUCTION For: Project: XYZ Local Authority New Sample Project Contact us: Construction Skills & Whole Life Consultants Limited Dundee University Incubator James Lindsay

More information

Modelling Rates. Mark Lunt. Arthritis Research UK Epidemiology Unit University of Manchester

Modelling Rates. Mark Lunt. Arthritis Research UK Epidemiology Unit University of Manchester Modelling Rates Mark Lunt Arthritis Research UK Epidemiology Unit University of Manchester 05/12/2017 Modelling Rates Can model prevalence (proportion) with logistic regression Cannot model incidence in

More information

Describing Stratified Multiple Responses for Sparse Data

Describing Stratified Multiple Responses for Sparse Data Describing Stratified Multiple Responses for Sparse Data Ivy Liu School of Mathematical and Computing Sciences Victoria University Wellington, New Zealand June 28, 2004 SUMMARY Surveys often contain qualitative

More information

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression Section IX Introduction to Logistic Regression for binary outcomes Poisson regression 0 Sec 9 - Logistic regression In linear regression, we studied models where Y is a continuous variable. What about

More information

= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero.

= observed volume on day l for bin j = base volume in jth bin, and = residual error, assumed independent with mean zero. QB research September 4, 06 Page -Minute Bin Volume Forecast Model Overview In response to strong client demand, Quantitative Brokers (QB) has developed a new algorithm called Closer that specifically

More information

THE LIGHT SIDE OF TRIGONOMETRY

THE LIGHT SIDE OF TRIGONOMETRY MATHEMATICAL METHODS: UNIT 2 APPLICATION TASK THE LIGHT SIDE OF TRIGONOMETRY The earth s movement around the sun is an example of periodic motion. The earth s tilt on its axis and corresponding movement

More information

Logistic regression analysis. Birthe Lykke Thomsen H. Lundbeck A/S

Logistic regression analysis. Birthe Lykke Thomsen H. Lundbeck A/S Logistic regression analysis Birthe Lykke Thomsen H. Lundbeck A/S 1 Response with only two categories Example Odds ratio and risk ratio Quantitative explanatory variable More than one variable Logistic

More information

UNCLASSIFIED. Environment, Safety and Health (ESH) Report

UNCLASSIFIED. Environment, Safety and Health (ESH) Report Environment, Safety and Health (ESH) Report Perfect Days On a Perfect Day nobody is hurt, we receive no community complaints, we have no security breaches, we do not breach our environmental permit conditions,

More information

DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR

DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR DAILY QUESTIONS 28 TH JUNE 18 REASONING - CALENDAR LEAP AND NON-LEAP YEAR *A non-leap year has 365 days whereas a leap year has 366 days. (as February has 29 days). *Every year which is divisible by 4

More information

Chapter Six: Two Independent Samples Methods 1/51

Chapter Six: Two Independent Samples Methods 1/51 Chapter Six: Two Independent Samples Methods 1/51 6.3 Methods Related To Differences Between Proportions 2/51 Test For A Difference Between Proportions:Introduction Suppose a sampling distribution were

More information

WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and Rainfall For Selected Arizona Cities

WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and Rainfall For Selected Arizona Cities WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and 2001-2002 Rainfall For Selected Arizona Cities Phoenix Tucson Flagstaff Avg. 2001-2002 Avg. 2001-2002 Avg. 2001-2002 October 0.7 0.0

More information

Significant Rainfall and Peak Sustained Wind Estimates For Downtown San Francisco

Significant Rainfall and Peak Sustained Wind Estimates For Downtown San Francisco Significant Rainfall and Peak Sustained Wind Estimates For Downtown San Francisco Report Prepared by John P. Monteverdi, PhD, CCM July 30, 1998 Mayacamas Weather Consultants 1. Impact of Location The location

More information

Visual Display of Information

Visual Display of Information Visual Display of Information XKCD Edward Tufte Charles Joseph Minard s dramatic account of Napoleon's Russian campaign of 1812 (drawn in 1861) 1, men arrived in Moscow 422, men started the journey to

More information

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation Patrick J. Heagerty PhD Department of Biostatistics University of Washington 166 ISCB 2010 Session Four Outline Examples

More information

Changes in spatial distribution of chub mackerel under climate change: the case study using Japanese purse seine fisheries data in the East China Sea

Changes in spatial distribution of chub mackerel under climate change: the case study using Japanese purse seine fisheries data in the East China Sea Changes in spatial distribution of chub mackerel under climate change: the case study using Japanese purse seine fisheries data in the East China Sea Tohya Yasuda, Ryuji Yukami, Seiji Ohshimo Seikai National

More information

Duration data analysis - basic concepts

Duration data analysis - basic concepts Duration data analysis - basic concepts Rasmus Waagepetersen Department of Mathematics Aalborg University Denmark September 7, 2018 1 / 25 Course topics (tentative) duration data - censoring and likelihoods

More information

Survival Analysis with Time- Dependent Covariates: A Practical Example. October 28, 2016 SAS Health Users Group Maria Eberg

Survival Analysis with Time- Dependent Covariates: A Practical Example. October 28, 2016 SAS Health Users Group Maria Eberg Survival Analysis with Time- Dependent Covariates: A Practical Example October 28, 2016 SAS Health Users Group Maria Eberg Outline Why use time-dependent covariates? Things to consider in definition of

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval Epidemiology 9509 Wonders of Biostatistics Chapter 13 - Effect Measures John Koval Department of Epidemiology and Biostatistics University of Western Ontario What is being covered 1. risk factors 2. risk

More information

Cohen s s Kappa and Log-linear Models

Cohen s s Kappa and Log-linear Models Cohen s s Kappa and Log-linear Models HRP 261 03/03/03 10-11 11 am 1. Cohen s Kappa Actual agreement = sum of the proportions found on the diagonals. π ii Cohen: Compare the actual agreement with the chance

More information

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies.

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies. Matched and nested case-control studies Bendix Carstensen Steno Diabetes Center, Gentofte, Denmark http://staff.pubhealth.ku.dk/~bxc/ Department of Biostatistics, University of Copengen 11 November 2011

More information

Computing & Telecommunications Services

Computing & Telecommunications Services Computing & Telecommunications Services Monthly Report September 214 CaTS Help Desk (937) 775-4827 1-888-775-4827 25 Library Annex helpdesk@wright.edu www.wright.edu/cats/ Table of Contents HEAT Ticket

More information

STATISTICS Relationships between variables: Correlation

STATISTICS Relationships between variables: Correlation STATISTICS 16 Relationships between variables: Correlation The gentleman pictured above is Sir Francis Galton. Galton invented the statistical concept of correlation and the use of the regression line.

More information

Pump failure data. Pump Failures Time

Pump failure data. Pump Failures Time Outline 1. Poisson distribution 2. Tests of hypothesis for a single Poisson mean 3. Comparing multiple Poisson means 4. Likelihood equivalence with exponential model Pump failure data Pump 1 2 3 4 5 Failures

More information

(Specification B) 40352H (JUN H01) General Certificate of Secondary Education Higher Tier June 2011 TOTAL. Time allowed 1 hour

(Specification B) 40352H (JUN H01) General Certificate of Secondary Education Higher Tier June 2011 TOTAL. Time allowed 1 hour Centre Number Surname Candidate Number For Examiner s Use Other Names Candidate Signature Examiner s Initials Geography (Specification B) Unit 2 Hostile World Friday 17 June 2011 For this paper you must

More information

CLIMATE OVERVIEW. Thunder Bay Climate Overview Page 1 of 5

CLIMATE OVERVIEW. Thunder Bay Climate Overview Page 1 of 5 CLIMATE OVERVIEW The climate in the Thunder Bay area is typical of a mid-latitude inland location with a Great Lake Moderating influence. The moderating effect of Lake Superior results in cooler summer

More information

Tables Table A Table B Table C Table D Table E 675

Tables Table A Table B Table C Table D Table E 675 BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values

More information

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 Department of Statistics North Carolina State University Presented by: Butch Tsiatis, Department of Statistics, NCSU

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

The 2017 Statistical Model for U.S. Annual Lightning Fatalities

The 2017 Statistical Model for U.S. Annual Lightning Fatalities The 2017 Statistical Model for U.S. Annual Lightning Fatalities William P. Roeder Private Meteorologist Rockledge, FL, USA 1. Introduction The annual lightning fatalities in the U.S. have been generally

More information

Session 2 Functions, Linear Functions, Graphs, Polynomial Equations, and Percents

Session 2 Functions, Linear Functions, Graphs, Polynomial Equations, and Percents Session 2 Functions, Linear Functions, Graphs, Polynomial Equations, and Percents 1 Functions A function is a rule or procedure such that when you give it a number, it gives you back one and only one number.

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Jackson County 2013 Weather Data

Jackson County 2013 Weather Data Jackson County 2013 Weather Data 61 Years of Weather Data Recorded at the UF/IFAS Marianna North Florida Research and Education Center Doug Mayo Jackson County Extension Director 1952-2008 Rainfall Data

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

STA6938-Logistic Regression Model

STA6938-Logistic Regression Model Dr. Ying Zhang STA6938-Logistic Regression Model Topic 6-Logistic Regression for Case-Control Studies Outlines: 1. Biomedical Designs 2. Logistic Regression Models for Case-Control Studies 3. Logistic

More information

Analysing The Temporal Pattern Of Crime Hotspots In South Yorkshire, United Kingdom

Analysing The Temporal Pattern Of Crime Hotspots In South Yorkshire, United Kingdom Advances in Social Sciences Research Journal Vol.5, No.6 Publication Date: June. 25, 218 DoI:1.14738/assrj.56.4822. Abdullahi, H., & Abbas, I. I. (218). Analysing The Temporal Pattern Of Crime Hotspots

More information

(b) Equation for a parabola: c) Direction of Opening (1) If a is positive, it opens (2) If a is negative, it opens

(b) Equation for a parabola: c) Direction of Opening (1) If a is positive, it opens (2) If a is negative, it opens Section.1 Graphing Quadratics Objectives: 1. Graph Quadratic Functions. Find the ais of symmetry and coordinates of the verte of a parabola.. Model data using a quadratic function. y = 5 I. Think and Discuss

More information

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE ORIGINAL ARTICLE Marginal Structural Models as a Tool for Standardization Tosiya Sato and Yutaka Matsuyama Abstract: In this article, we show the general relation between standardization methods and marginal

More information

A multistate additive relative survival semi-markov model

A multistate additive relative survival semi-markov model 46 e Journées de Statistique de la SFdS A multistate additive semi-markov Florence Gillaizeau,, & Etienne Dantan & Magali Giral, & Yohann Foucher, E-mail: florence.gillaizeau@univ-nantes.fr EA 4275 - SPHERE

More information

Stat 587: Key points and formulae Week 15

Stat 587: Key points and formulae Week 15 Odds ratios to compare two proportions: Difference, p 1 p 2, has issues when applied to many populations Vit. C: P[cold Placebo] = 0.82, P[cold Vit. C] = 0.74, Estimated diff. is 8% What if a year or place

More information

TABLE -I RAINFALL RECORDED AT PORT BLAIR (MM) FROM 1949 TO 2009

TABLE -I RAINFALL RECORDED AT PORT BLAIR (MM) FROM 1949 TO 2009 A. RAINFALL TABLE -I RAINFALL RECORDED AT PORT BLAIR (MM) FROM 1949 TO 2009 MONTH/YEAR 1949 1950 1951 1952 1953 1954 1955 1956 1957 1958 JANUARY 0.0 0.8 82.5 0.0 26.9 37.3 71.4 46.2 10.2 28.7 FEBRUARY

More information

Statistical Models for Rainfall with Applications to Index Insura

Statistical Models for Rainfall with Applications to Index Insura Statistical Models for Rainfall with Applications to April 21, 2008 Overview The idea: Insure farmers against the risk of crop failure, like drought, instead of crop failure itself. It reduces moral hazard

More information

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis 2014 Maternal and Child Health Epidemiology Training Pre-Training Webinar: Friday, May 16 2-4pm Eastern Kristin Rankin,

More information

The science of learning from data.

The science of learning from data. STATISTICS (PART 1) The science of learning from data. Numerical facts Collection of methods for planning experiments, obtaining data and organizing, analyzing, interpreting and drawing the conclusions

More information

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics

MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems

More information

Case-control studies

Case-control studies Matched and nested case-control studies Bendix Carstensen Steno Diabetes Center, Gentofte, Denmark b@bxc.dk http://bendixcarstensen.com Department of Biostatistics, University of Copenhagen, 8 November

More information

One-stage dose-response meta-analysis

One-stage dose-response meta-analysis One-stage dose-response meta-analysis Nicola Orsini, Alessio Crippa Biostatistics Team Department of Public Health Sciences Karolinska Institutet http://ki.se/en/phs/biostatistics-team 2017 Nordic and

More information

Chapter 8. The analysis of count data. This is page 236 Printer: Opaque this

Chapter 8. The analysis of count data. This is page 236 Printer: Opaque this Chapter 8 The analysis of count data This is page 236 Printer: Opaque this For the most part, this book concerns itself with measurement data and the corresponding analyses based on normal distributions.

More information

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t) PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population

More information

Swine Enteric Coronavirus Disease (SECD) Situation Report Sept 17, 2015

Swine Enteric Coronavirus Disease (SECD) Situation Report Sept 17, 2015 Animal and Plant Health Inspection Service Veterinary Services Swine Enteric Coronavirus Disease (SECD) Situation Report Sept 17, 2015 Information current as of 12:00 pm MDT, 09/16/2015 This report provides

More information

Meta-analysis. 21 May Per Kragh Andersen, Biostatistics, Dept. Public Health

Meta-analysis. 21 May Per Kragh Andersen, Biostatistics, Dept. Public Health Meta-analysis 21 May 2014 www.biostat.ku.dk/~pka Per Kragh Andersen, Biostatistics, Dept. Public Health pka@biostat.ku.dk 1 Meta-analysis Background: each single study cannot stand alone. This leads to

More information

Long-term Water Quality Monitoring in Estero Bay

Long-term Water Quality Monitoring in Estero Bay Long-term Water Quality Monitoring in Estero Bay Keith Kibbey Laboratory Director Lee County Environmental Laboratory Division of Natural Resource Management Estero Bay Monitoring Programs Three significant

More information

Multi-state models: prediction

Multi-state models: prediction Department of Medical Statistics and Bioinformatics Leiden University Medical Center Course on advanced survival analysis, Copenhagen Outline Prediction Theory Aalen-Johansen Computational aspects Applications

More information

Section II: Assessing Chart Performance. (Jim Benneyan)

Section II: Assessing Chart Performance. (Jim Benneyan) Section II: Assessing Chart Performance (Jim Benneyan) 1 Learning Objectives Understand concepts of chart performance Two types of errors o Type 1: Call an in-control process out-of-control o Type 2: Call

More information

Measures of Central Tendency

Measures of Central Tendency Statistics It is the science of assembling, analyzing, characterizing, and interpreting the collection of data. The general characterized of data: 1. Data shows a tendency to concentrate at certain values:

More information

Analysis of categorical data S4. Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands

Analysis of categorical data S4. Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands Analysis of categorical data S4 Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands m.hauptmann@nki.nl 1 Categorical data One-way contingency table = frequency table Frequency (%)

More information

Computing & Telecommunications Services Monthly Report January CaTS Help Desk. Wright State University (937)

Computing & Telecommunications Services Monthly Report January CaTS Help Desk. Wright State University (937) January 215 Monthly Report Computing & Telecommunications Services Monthly Report January 215 CaTS Help Desk (937) 775-4827 1-888-775-4827 25 Library Annex helpdesk@wright.edu www.wright.edu/cats/ Last

More information

CIMA Professional

CIMA Professional CIMA Professional 201819 Birmingham Interactive Timetable Version 3.1 Information last updated 12/10/18 Please note: Information and dates in this timetable are subject to change. A better way of learning

More information

Dynamic Scheduling of the Upcoming Exam in Cancer Screening

Dynamic Scheduling of the Upcoming Exam in Cancer Screening Dynamic Scheduling of the Upcoming Exam in Cancer Screening Dongfeng 1 and Karen Kafadar 2 1 Department of Bioinformatics and Biostatistics University of Louisville 2 Department of Statistics University

More information

Principles of Mathematics 12: Explained!

Principles of Mathematics 12: Explained! www.math12.com 18 Part I Ferris Wheels One of the most common application questions for graphing trigonometric functions involves Ferris wheels, since the up and down motion of a rider follows the shape

More information

Activity Sheet Counting M&Ms

Activity Sheet Counting M&Ms Counting M&Ms Pour a half-pound bag of M&Ms onto a paper plate so that the candies are one layer thick. You will need to spread the M&Ms to the edges of the plate. Remove all the M&Ms that have the M showing

More information

Chapter 11. Correlation and Regression

Chapter 11. Correlation and Regression Chapter 11. Correlation and Regression The word correlation is used in everyday life to denote some form of association. We might say that we have noticed a correlation between foggy days and attacks of

More information

STA6938-Logistic Regression Model

STA6938-Logistic Regression Model Dr. Ying Zhang STA6938-Logistic Regression Model Topic 2-Multiple Logistic Regression Model Outlines:. Model Fitting 2. Statistical Inference for Multiple Logistic Regression Model 3. Interpretation of

More information

First Aid Kit for Survival. Hypoxia cohort. Goal. DFS=Clinical + Marker 1/21/2015. Two analyses to exemplify some concepts of survival techniques

First Aid Kit for Survival. Hypoxia cohort. Goal. DFS=Clinical + Marker 1/21/2015. Two analyses to exemplify some concepts of survival techniques First Aid Kit for Survival Melania Pintilie pintilie@uhnres.utoronto.ca Two analyses to exemplify some concepts of survival techniques Checking linearity Checking proportionality of hazards Predicted curves:

More information

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH The First Step: SAMPLE SIZE DETERMINATION THE ULTIMATE GOAL The most important, ultimate step of any of clinical research is to do draw inferences;

More information

Thursday 4 June 2015 Afternoon

Thursday 4 June 2015 Afternoon Oxford Cambridge and RSA Thursday 4 June 2015 Afternoon AS GCE PHYSICS B (ADVANCING PHYSICS) G492/01 Understanding Processes, Experimentation and Data Handling INSERT *5000035423* Duration: 2 hours INSTRUCTIONS

More information

Annual Average NYMEX Strip Comparison 7/03/2017

Annual Average NYMEX Strip Comparison 7/03/2017 Annual Average NYMEX Strip Comparison 7/03/2017 To Year to Year Oil Price Deck ($/bbl) change Year change 7/3/2017 6/1/2017 5/1/2017 4/3/2017 3/1/2017 2/1/2017-2.7% 2017 Average -10.4% 47.52 48.84 49.58

More information

Bayesian Hierarchical Models

Bayesian Hierarchical Models Bayesian Hierarchical Models Gavin Shaddick, Millie Green, Matthew Thomas University of Bath 6 th - 9 th December 2016 1/ 34 APPLICATIONS OF BAYESIAN HIERARCHICAL MODELS 2/ 34 OUTLINE Spatial epidemiology

More information

Comparison of Particulate Monitoring Methods at Fort Air Partnership Monitoring Stations

Comparison of Particulate Monitoring Methods at Fort Air Partnership Monitoring Stations Comparison of Particulate Monitoring Methods at Fort Air Partnership Monitoring Stations Melanie Larsen Harry Benders RS Environmental (Tom Dann) March 13, 2014 Executive Summary Historically FAP has acquired

More information

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE

CONTENTS OF DAY 2. II. Why Random Sampling is Important 10 A myth, an urban legend, and the real reason NOTES FOR SUMMER STATISTICS INSTITUTE COURSE 1 2 CONTENTS OF DAY 2 I. More Precise Definition of Simple Random Sample 3 Connection with independent random variables 4 Problems with small populations 9 II. Why Random Sampling is Important 10 A myth,

More information

1 and 2 proportions starts here

1 and 2 proportions starts here 1 and 2 proportions starts here Example: Would You Pay Higher Prices to Protect the Environment? In 2000, the GSS asked: Are you willing to pay much higher prices in order to protect the environment? Of

More information

Class Biology

Class Biology Class - 10+1 Biology April & May Chapter 1 Chapter 2 Chapter 3 The Living World Biological Classification Plant Kingdom July Chapter 4 Chapter 5 Animal kingdom Morphology of flowering plants August Chapter

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

ECE 510 Lecture 6 Confidence Limits. Scott Johnson Glenn Shirley

ECE 510 Lecture 6 Confidence Limits. Scott Johnson Glenn Shirley ECE 510 Lecture 6 Confidence Limits Scott Johnson Glenn Shirley Concepts 28 Jan 2013 S.C.Johnson, C.G.Shirley 2 Statistical Inference Population True ( population ) value = parameter Sample Sample value

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial

More information

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May 5-7 2008 Peter Schlattmann Institut für Biometrie und Klinische Epidemiologie

More information

Statistical tables are attached Two Hours UNIVERSITY OF MANCHESTER. May 2007 Final Draft

Statistical tables are attached Two Hours UNIVERSITY OF MANCHESTER. May 2007 Final Draft Statistical tables are attached wo Hours UNIVERSIY OF MNHESER Ma 7 Final Draft Medical Statistics M377 Electronic calculators ma be used provided that the cannot store tet nswer LL si questions in SEION

More information

CPSC 531 Systems Modeling and Simulation FINAL EXAM

CPSC 531 Systems Modeling and Simulation FINAL EXAM CPSC 531 Systems Modeling and Simulation FINAL EXAM Department of Computer Science University of Calgary Professor: Carey Williamson December 21, 2017 This is a CLOSED BOOK exam. Textbooks, notes, laptops,

More information

Regression M&M 2.3 and 10. Uses Curve fitting Summarization ('model') Description Prediction Explanation Adjustment for 'confounding' variables

Regression M&M 2.3 and 10. Uses Curve fitting Summarization ('model') Description Prediction Explanation Adjustment for 'confounding' variables Uses Curve fitting Summarization ('model') Description Prediction Explanation Adjustment for 'confounding' variables MALES FEMALES Age. Tot. %-ile; weight,g Tot. %-ile; weight,g wk N. 0th 50th 90th No.

More information

Public Library Use and Economic Hard Times: Analysis of Recent Data

Public Library Use and Economic Hard Times: Analysis of Recent Data Public Library Use and Economic Hard Times: Analysis of Recent Data A Report Prepared for The American Library Association by The Library Research Center University of Illinois at Urbana Champaign April

More information

Introduction to the Analysis of Tabular Data

Introduction to the Analysis of Tabular Data Introduction to the Analysis of Tabular Data Anthropological Sciences 192/292 Data Analysis in the Anthropological Sciences James Holland Jones & Ian G. Robertson March 15, 2006 1 Tabular Data Is there

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information