Introduction to Inferential Statistics. Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University

Size: px
Start display at page:

Download "Introduction to Inferential Statistics. Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University"

Transcription

1 Introduction to Inferential Statistics Jaranit Kaewkungwal, Ph.D. Faculty of Tropical Medicine Mahidol University 1

2 Data & Variables 2

3 Types of Data QUALITATIVE Data expressed by type Data that has been described QUANTITATIVE Data classified by numeric value Data that has been measured or counted QUALITITATIVE and QUANTITATIVE data are not mutually exclusive Adapted from: Dr. Craig Jackson, University of Central England

4 Types of Data: Qualitative (Categorical) Data NOMINAL DATA values that the data may have do not have specific order values act as labels with no real meaning Binomial: two possible values (categories, states) Multinomial: more than two possible values (categories, states) e.g. Health status healthy =1 sick=2 e.g. Treatment new regimen = 1 standard regimen = 2 e.g. hair colour brown =1 blond =2 black =100 ORDINAL DATA values with some kind of ordering data that has been measured or counted e.g. social class: upper=1 middle = 2 working = 3 e.g. glioblastoma tumor grade: e.g. position in a race: Adapted from: Dr. Craig Jackson, University of Central England

5 Types of Data: Quantitative Data DISCRETE distinct or separate parts, with no finite detail e.g children in family CONTINUOUS between any two values, there would be a third e.g between meters there are centimetres INTERVAL equal intervals between values and an arbitrary zero on the scale e.g temperature gradient RATIO equal intervals between values and an absolute zero e.g body mass index Adapted from: Dr. Craig Jackson, University of Central England

6 Temperature Levels of Variables White Hot Dangerous 80 o C Unpleasant 60 o C Unsafe Red Hot Uncomfortable 40 o C Tolerable Comfortable 20 o C Safe Cold Cold 10 o C Adapted from: Dr. Craig Jackson, University of Central England

7 Examples of Data Coding Nominal/Cat. Var Exclude from Analysis? Ordinal/Cat. Var 7

8 Examples of Data Coding Cont. Cont

9 Example of Descriptive Statistics 9

10 Constant vs. Variable Variables are the specific properties that have the ability to take different values. Constants are the specific properties that cannot vary or won t be made to vary. 10

11 Terminology - Variables INDEPENDENT (syn: treatment, experimental, predictor, input, exposure, explanatory variable) is a stimulus or activity that is identified or manipulated to predict the dependent variable; they are considered as the causal factors, or that you may manipulate. e.g. new drug, working hours, exposure, worker attitudes, policies ies DEPENDENT (syn: Effect, criterion, criterion measure, outcome, output variable) is a response that the researcher wanted to predict; they are considered as the outcomes of the treatments or the responses to changes in the independent variables. e.g. Symptomotology,, productivity, accident rates, attitudes, health status, performance on neuropsychological test Adapted from: Dr. Craig Jackson, University of Central England

12 Terminology - Variables CONTROLLED Extraneous variable is a variable that has a potential to distorts the relationship between dependent and independent variables. Controlled extraneous variables are recognized before the study is initiated and are controlled in the design and selection criteria. Uncontrolled extraneous variables are recognized before the study is initiated or, sometimes, even if recognized cannot be controlled in the design and selection phase. Usually an attempt is made to assess and adjust them through sophisticated statistical tools. e.g., Working hours, temperatures, extraneous exposure, diet, class, income, Ambient noise and temperature in testing room Adapted from: Dr. Craig Jackson, University of Central England

13 Study Variables Independent Variables & Dependent variables X Y X (independent) Y (dependent) Extraneous variable X X (independent) Y (dependent) X2 (independent) 13

14 Study Variables Confounding Variable - When the effects of two or more variables cannot be separated. 14

15 Study Variables Confounding Variable - When the effects of two or more variables cannot be separated. BUT... Condom STD rate Yes 55/95 (61%) Use No 45/105 (43%) Condom Use increases the risk of STD STD rate # Partners < 5 Condom Yes 5/15 (33%) Use No 30/82 (37%) # Partners > 5 Condom Yes 50/80 (62%) Use No 15/23 (65%) Explanation: Individuals with more partners are more likely to use condoms. But individuals with more partners are also more likely to get STD. 15

16 Example of Study Variables Dependent Var: Infant//Child Growth Indepependent Var: Adult Fatness Extraneous Var: - Confouding Var (Adj,/Controlled Var) : Child Age, Adult Age, Socio-economic, Smoking, Physical Acitivity, etc. - Uncontrolled Var: Sex (male & female) 16

17 Bias & Chance 17

18 Measuring Outcomes: Observed vs. Truth Possible Explanations of Outcome Measured Bias Chance Truth Observed = Truth + Error Systematic error + Random error (Bias) (Chance) 18

19 Bias vs. Chance 19

20 Bias vs.chance Bias: A process at any stage of inference tending to produce results that depart systematically from the true values. Chance: The divergence of an observation on a sample from the true population value in either direction. The divergence due to chance alone is called random variation Bias and chance- are not mutually exclusive. 20

21 Bias vs.chance A well designed, carefully executed study usually gives results that are obvious without a formal analysis and if there are substantial flaws in design or execution a formal analysis will not help. Johnson AF. Beneath the technological fix. J Chron Dis 1985 (38),

22 Free kick Chance 2.5 % 50% 50% 68% 95% 2.5 % 22 Probability of being hit

23 Free kick Chance 2.5 % 50% 50% 68% 95% 2.5 % 23 Probability of getting goal

24 Normal Distribution in Descriptive Statistics Standard Score Raw Score X = 30; SD = 5 24

25 Types of Statistical Methods 25

26 Types of Statistics By Level of Generalization Descriptive Statistics Inferential Statistics Parameter Estimation Hypothesis Testing Comparison Association Multivariable data analysis Sampling Techniques Generalization/ Inferential Statistics By Level of Underlying Distribution Parametric Statistics Non-parametric Statistics 26

27 Descriptive Statistics 27

28 Descriptive Statistics Measure of Location (Categorical Vars) Frequency ( f ) Female 240 Male Count 210 Female Male Gender Measure of Location (Continuous Vars) Mean Average Median Mid-point Mode x = i = The Most Frequent n x i 1 or μ = n x i = 1 N X 1 X 2 X 3 X 4 X 5 X 6 X 7 X 8 X 9 n i X 1 X 2 X 3 X 4 X 5 X 6 X 7 X 8 X 9 ( ) 28

29 Descriptive Statistics Measure of Spread Range Max - Min Standard Deviation / Variance X i s deviate from Mean S or = n ( x i X ) i = 1 n - 1 σ = n ( x i μ ) i = 1 N x 29

30 Descriptive Statistics Measure of Shape Normal Distribution Skewed Distribution Positively skewed Negatively skewed 30

31 Inferential Statistics 31

32 Inferential Statistics Purpose of Inferential Statistics Generalisabiliy of research results from Sample Statistics to Population Parameters Types of Inferential Statistical Methods Parameter Estimation - to estimate the range of values that is likely to include the true value in population X μ proportion Π Hypothesis Testing - to ask whether an effect (difference) is present or not among different groups Ho: X 1 = X 2 Ho: μ 1 = μ 2 Ho: r xy = 0 Ho: ρ xy = 0 32

33 Sampling Distribution =26 =30 =25 SD = 9.1 SD = 11.3 SD = 12.2 μ = Inferential Statistics Inferential Statistics Sampling Method Sampling Method x 1 x 2 x 3

34 Sampling Distribution (Normal Distribution) in Parameter (μ)( Estimation Confidence Limits of μ : X ± Z α/2,ν S X Standard error 95% CI of μ : X ± (1.96 * (SD/ n)) 25 ± (1.96 * (12.2/ 100)) μ = 25 (22.6 to 27.4, 95% CI) n = 100 =26 =30 =25 SD = 9.1 SD = 11.3 SD = 12.2 μ = 27 N = 10,000 34

35 Parameter Estimation 35

36 Point Estimates: Single values (Mean, Variance, Correlation, treatment effect, relative risk, etc.) representing characteristics in the whole population Parameter Estimation Interval Estimates: Ranges of values, usually centered around point estimates, indicating bounds within which we expect the true values for the whole population to lie (stability of the estimate) SD = 0.75 Average: Example Confidence Limits of μ : X ± Z S α / 2, υ X 36

37 Parameter Estimation Point estimates & Confidence Intervals Point estimates and confidence intervals are used to characterise the statistical precision of any rate (incidence, prevalence), comparisons of rates (e.g., relative risk), and other statistics. US adults have used unconventional therapy = 34% (31% - 37%, 95%CI) Sensitivity of clinical examination of splenomegaly = 27% (19-36%, 95%CI) Relative risk of lung cancer of smoker vs. non-smoker = 5.6 ( , 95%CI) Relative risk of HIV infected of male vs. female = 2.1 ( , 95%CI) 37

38 Parameter Estimation (Normal Distribution) Confidence Limits of μ : X ± Z α/2,ν S X Standard error 95% CI of μ : X ± (1.96 * (SD/ n)) 25 ± (1.96 * (12.2/ 100)) μ = 25 (22.6 to 27.4, 95% CI) n = 100 =26 =30 =25 SD = 9.1 SD = 11.3 SD = 12.2 μ = 27 N = 10,000 38

39 Parameter Estimation Consideration in confidence level of the estimate Select a cut-point for CI Confidence Interval, usually set at 95% CI, can be interpreted such that - if the study is unbiased and repeated 100 times, there is 95% chance that the true value is included in these intervals of the 100 samples 95% CI (from Sample 1) True Value (in population) 95% CI (from Sample 2) 95% CI (from Sample 3) 39

40 Hypothesis Testing - Comparisons 40

41 Hypothesis Testing Hypothesis & Tail of the test One-sided vs. Two-sided Test Two-sided test: Ex Ho: Outcome 1 = Outcome 2 Ha: Outcome 1 Outcome 2 One-sided test: Ex Ho: Outcome 1 Outcome 2 Ha: Outcome 1 > Outcome 2 Ho: Outcome 1 Outcome 2 Ha: Outcome 1 < Outcome 2 O1<O2 O1=O2 O1>O2 2.5% 95% 2.5% O1<O2 O1 >= O2 5% 95% 41

42 Hypothesis Testing Basic steps in hypothesis testing 42

43 Hypothesis Testing Ho: μ 1 = μ 2 Ho: μ 1 μ 2 = 0 Ha: μ 1 μ 2 = 0 Not Reject Ho!! μ 1 = μ 2 μ 1 μ 2 43

44 Hypothesis Testing Ho: μ 1 μ 2 = 0 Ha: μ 1 μ 2 = 0 Reject Ho!! μ 1 < μ 2 μ 1 μ 2 44

45 Hypothesis Testing H 0 : μ 1 μ 2 = 0 H a : μ 1 μ 2 = 0 given n = very large at α = 0.05 Reject H 0!! μ 1 > μ 2 at α = 0.01 Not Reject H 0!! μ 1 = μ 2 α / 2 = p-value = 0.04 α / 2 =

46 Type I & Type II errors H 0 : G 1 = G 2 Hypothesis Testing Reality/Truth H 0 True (G 1 =G 2 ) H 0 False (G 1 <>G 2 ) Decision Accept H 0 (G1=G2) Reject H 0 (G1<>G2) Correct Type II Error Confidence : 1 - α A B 0.99, , 0.20 Type I Error α C D β Correct Power : 1 - β 0.01, ,

47 Choosing the Right Statistical Procedure 47

48 Choosing the Right Data Analysis Procedure Basic questions that you should have answers before you can choose the correct test are the following: What is the purpose of the analysis? - describe the data; Or - compare groups of data to make decisions; or - examine the association between variables for prediction or forecasting? Is the distribution of the data approximately normal? Is the sample size large enough that the Central Limit Theorem will allow a normality assumption? 48

49 Selecting a Statistical Method Goal Type of Outcome Data of the Continuous Categorical Binomial Survival analysis (from Gaussian Continuous Time Population) (Non-Gaussian) Describe Value of Data (1 Group) Mean, SD Median, Interquartile range Proportion (Percent) Kaplan- Meier survival curve Compare Value of Data vs. Hypothetical Value Onesample t- test Wilcoxon s test Chi-square (χ 2 ) or Binomial/ Runs test (1 Group) 49

50 Selecting a Statistical Method Goal Type of Outcome Data of the Continuous Categorical Binomial Survival analysis (from Gaussian Continuous Time Population) (Non-Gaussian) Compare Values 2 Grps of Indept >2 Grps Grps. Unpaired t- test One-way ANOVA Mann- Whitney test Kruskal- Wallis test χ 2 test, Fisher s Exact, χ 2 test Log-rank / Mantel- Haenszel Cox Prop Haz.Reg. Compare Values 2 Grps of Paired >2 Grps Grps/Vars. Paired t-test Repeated measures ANOVA Wilcoxon s test Friedman s test McNemar s χ 2 test Cochrane s Q test Condt n Prop Haz.Reg. Condt n Prop Haz.Reg. 50

51 Selecting a Statistical Method Goal Type of Outcome Data of the Continuous Categorical Binomial Survival analysis (from Gaussian Continuous Time Population) (Non-Gaussian) Quantify Association Values of Two variables Pearson s Correlation Spearman s Correlation Contingency coefficient, Crude Odds Ratio, Relat v Risk Predict Value of Outcome Var: from 1 Var (Simple Reg. ) from > 2 Vars (Multiple Reg.) Linear or Non-linear Regression. Nonparametric Regression Logistic Regression Cox s Proportional Haz. Reg. 51

52 Selecting a Statistical Method Goal Type of Outcome Data of the Continuous Categorical Binomial Survival analysis (from Gaussian Continuous Time Population) (Non-Gaussian) Measures of Agreement Values from Two Raters/Methods Pearson s Correlation Weighted Kappa (κ) Weighted Kappa (κ) Agreement rate Cohen s κ ICC Measures of Validity Values from Two Raters/Methods ANOVA Factor Analysis Nonparametric ANOVA Factor Analysis χ 2 Sensitivity Specificity ROC curve 52

53 Example of Inferential Statistical Methods 53

54 Example of Parameter Estimates & 95%CI 54

55 Example of Parameter Estimates & 95%CI 55

56 Example of Comparisons 56

57 Example of Comparisons 57

58 58

59 Example of Comparisons 59

60 Example of Comparisons 60

61 Example of Association 61

62 Example of Association 62

63 Example of Association 63

64 Example of Association 64

65 Example of Association 65

66 Example of Association 66

67 Dose-Response 67

68 68

69 Example of Parameter Estimates 69

70 Example of Parameter Estimates Figure 2. Survival from time of human immunodeficiency virus (HIV) infection of 194 CSWs. A, Overall. B, By serum virus load (HIV type 1 RNA copies/ml). Each curve is truncated when <10 women remain in that group <10, , ,000 Survival (%) A Survival (%) >100,000 B Months Months 70

71 Example of Association Table 2. Survival from time of infection of 194 HIV-infected CSWs Charactersitcs No. of No. of Patients 7-Year survival, Rate ratio patients who died % % (95% CI)* (95% CI)** Age at Infection, years <= (29.5) 72.3 ( ) Referent >= (39.3) 63.3 ( ) 1.50 ( ) Sex work Brothel (34.0) 69.6 ( ) 1.34 ( ) Nonbrothel (34.3) 62.9 ( ) Referent Oral contraceptive use Yes (32.1) 69.6 ( ) 0.83 ( ) No (36.6) 67.1 ( ) Referent Depot medroxyprogesterone use Yes (32.7) 70.9 ( ) 0.78 ( ) No (34.5) 67.8 ( ) Referent Infection status Seroconverted 34 7 (20.6) *** 1.42 ( ) Seropositive at enrl (36.9) 69.6 ( ) Referent Viral load, HIV-1 RNA copies/ml. >1000, (70.6) 34.5 ( ) ( ) 10, , (33.6) 70.3 ( ) 4.63 ( ) <10, (8.5) 92.5 ( ) Referent Total (34.0) 68.7 ( ) * Survival analysis ** Cox proportional hazard model *** Insufficient follow-up time to this m ore recent converted group; 5-year survival = 77.8 ( ) % 71

72 Statistical Methods: Multi-variable Data Analysis 72

73 Multi-variable Data Analysis Causes/Exposures vs. Outcomes Example: Causes of Tuberculosis Malnutrition Vaccination crowding Genetic Susceptible Host Exposure to Mycobacterium Infection Tissue Invasion and Reaction Tuberculosis Risk factors for tuberculosis (Distant from Outcome) Mechanism of Tuberculosis (Proximal to Outcome) 73

74 Multi-variable Data Analysis Causes/Exposures vs. Outcomes Example: Relationship between risk factors and disease : hypertension ( BP) and congestive heart failure (CHF). Hypertension causes many diseases, including congestive heart failure, and congestive heart failure has many causes, including hypertension. 74

75 Example of MDA Risk (never) = 7/47 =.149 Risk (sporadic) = 9/35 =.257 RR (sporadic vs.never) = 25.7/14.9 =

76 Example of MDA 76

77 Example of MDA Table 3. Mortality from time of first CD4 T lymphocyte count of 157 HIV-infected CSWs (125 women were HIV seropositive at study enrollment and 32 seroconverted during study) Charactersitcs No. of No. of Pts 5-Year survival, Rate ratio Adjusted rate ratio patients who died % % (95% CI)* (95% CI)** (95% CI)*** Initial CD4 lymphocyte, cells/μl < (93.3) ( ) 15.5 ( ) (38.6) 63.4 ( ) 2.46 ( ) 1.42 ( ) > (18.5) 84.7 ( ) Referent Referent Viral load, HIV-1 RNA copies/ml. >1000, (76.7) 26.7 ( ) 13.9 ( ) 12.5 ( ) 10, , (34.8) 65.0 ( ) 3.87 ( ) 3.42 ( ) <10, (10.5) 96.7 ( ) Referent Referent Total (36.9) 64.6 ( ) * Survival analysis ** Cox proportional hazard model *** Cox proporational hazard model adjusted for initial CD4 lymphocyte count and virus load 77

78 The End of Inferential Statistics 78

79 Bias vs.chance: Observed & Truth in Statistical Methods Observed = Truth + Error Example of Statistical Analysis: Chi-Square test: χ 2 =Σ(Ο Ε) 2 /Ε Kappa Statistics: Observed agreement - Expected Agreement κ = Obs Agreemt - Expct Agreemt 1 - Expected Agreement Systematic error + Random error OBSERVED exposed EXPECTED exposed Yes No Yes No Outcome Yes Yes No Outcome No

80 Bias vs.chance: Observed & Truth in Statistical Methods Observed = Truth + Error Systematic error + Random error Example of Statistical Analysis: 190 ε Analysis of variance: systolic μ τ ε τ F = σ 2 (τ+ε) /σ2 ε <=20 3 agegrp >= 41 age group

81 Bias vs.chance: Observed & Truth in Statistical Methods Observed = Truth + Error Example of Statistical Analysis: Regression Analysis: Y = Y + ε Y = α + β 1 X 1 Systematic error + Random error CHD Mortality per 10, ε Y=13, X =6 (Y = 11, X = 6) Cigarette Consumption per Adult per Day 81

Small n, σ known or unknown, underlying nongaussian

Small n, σ known or unknown, underlying nongaussian READY GUIDE Summary Tables SUMMARY-1: Methods to compute some confidence intervals Parameter of Interest Conditions 95% CI Proportion (π) Large n, p 0 and p 1 Equation 12.11 Small n, any p Figure 12-4

More information

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages: Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the

More information

Turning a research question into a statistical question.

Turning a research question into a statistical question. Turning a research question into a statistical question. IGINAL QUESTION: Concept Concept Concept ABOUT ONE CONCEPT ABOUT RELATIONSHIPS BETWEEN CONCEPTS TYPE OF QUESTION: DESCRIBE what s going on? DECIDE

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics DETAILED CONTENTS About the Author Preface to the Instructor To the Student How to Use SPSS With This Book PART I INTRODUCTION AND DESCRIPTIVE STATISTICS 1. Introduction to Statistics 1.1 Descriptive and

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 3: Bivariate association : Categorical variables Proportion in one group One group is measured one time: z test Use the z distribution as an approximation to the binomial

More information

Lecture 7: Hypothesis Testing and ANOVA

Lecture 7: Hypothesis Testing and ANOVA Lecture 7: Hypothesis Testing and ANOVA Goals Overview of key elements of hypothesis testing Review of common one and two sample tests Introduction to ANOVA Hypothesis Testing The intent of hypothesis

More information

Analysis of categorical data S4. Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands

Analysis of categorical data S4. Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands Analysis of categorical data S4 Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands m.hauptmann@nki.nl 1 Categorical data One-way contingency table = frequency table Frequency (%)

More information

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007) FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter

More information

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă HYPOTHESIS TESTING II TESTS ON MEANS Sorana D. Bolboacă OBJECTIVES Significance value vs p value Parametric vs non parametric tests Tests on means: 1 Dec 14 2 SIGNIFICANCE LEVEL VS. p VALUE Materials and

More information

Readings Howitt & Cramer (2014) Overview

Readings Howitt & Cramer (2014) Overview Readings Howitt & Cramer (4) Ch 7: Relationships between two or more variables: Diagrams and tables Ch 8: Correlation coefficients: Pearson correlation and Spearman s rho Ch : Statistical significance

More information

CDA Chapter 3 part II

CDA Chapter 3 part II CDA Chapter 3 part II Two-way tables with ordered classfications Let u 1 u 2... u I denote scores for the row variable X, and let ν 1 ν 2... ν J denote column Y scores. Consider the hypothesis H 0 : X

More information

Readings Howitt & Cramer (2014)

Readings Howitt & Cramer (2014) Readings Howitt & Cramer (014) Ch 7: Relationships between two or more variables: Diagrams and tables Ch 8: Correlation coefficients: Pearson correlation and Spearman s rho Ch 11: Statistical significance

More information

Intuitive Biostatistics: Choosing a statistical test

Intuitive Biostatistics: Choosing a statistical test pagina 1 van 5 < BACK Intuitive Biostatistics: Choosing a statistical This is chapter 37 of Intuitive Biostatistics (ISBN 0-19-508607-4) by Harvey Motulsky. Copyright 1995 by Oxfd University Press Inc.

More information

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke

BIOL 51A - Biostatistics 1 1. Lecture 1: Intro to Biostatistics. Smoking: hazardous? FEV (l) Smoke BIOL 51A - Biostatistics 1 1 Lecture 1: Intro to Biostatistics Smoking: hazardous? FEV (l) 1 2 3 4 5 No Yes Smoke BIOL 51A - Biostatistics 1 2 Box Plot a.k.a box-and-whisker diagram or candlestick chart

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous

More information

Textbook Examples of. SPSS Procedure

Textbook Examples of. SPSS Procedure Textbook s of IBM SPSS Procedures Each SPSS procedure listed below has its own section in the textbook. These sections include a purpose statement that describes the statistical test, identification of

More information

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014 Nemours Biomedical Research Statistics Course Li Xie Nemours Biostatistics Core October 14, 2014 Outline Recap Introduction to Logistic Regression Recap Descriptive statistics Variable type Example of

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) (b) (c) (d) (e) In 2 2 tables, statistical independence is equivalent

More information

Contents. Acknowledgments. xix

Contents. Acknowledgments. xix Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables

More information

Lecture 5: ANOVA and Correlation

Lecture 5: ANOVA and Correlation Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

An introduction to biostatistics: part 1

An introduction to biostatistics: part 1 An introduction to biostatistics: part 1 Cavan Reilly September 6, 2017 Table of contents Introduction to data analysis Uncertainty Probability Conditional probability Random variables Discrete random

More information

NON-PARAMETRIC STATISTICS * (http://www.statsoft.com)

NON-PARAMETRIC STATISTICS * (http://www.statsoft.com) NON-PARAMETRIC STATISTICS * (http://www.statsoft.com) 1. GENERAL PURPOSE 1.1 Brief review of the idea of significance testing To understand the idea of non-parametric statistics (the term non-parametric

More information

Statistics: revision

Statistics: revision NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers

More information

APPENDIX B Sample-Size Calculation Methods: Classical Design

APPENDIX B Sample-Size Calculation Methods: Classical Design APPENDIX B Sample-Size Calculation Methods: Classical Design One/Paired - Sample Hypothesis Test for the Mean Sign test for median difference for a paired sample Wilcoxon signed - rank test for one or

More information

Non-parametric methods

Non-parametric methods Eastern Mediterranean University Faculty of Medicine Biostatistics course Non-parametric methods March 4&7, 2016 Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr) Learning Objectives 1. Distinguish

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Unit 14: Nonparametric Statistical Methods

Unit 14: Nonparametric Statistical Methods Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based

More information

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health

Nonparametric statistic methods. Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Nonparametric statistic methods Waraphon Phimpraphai DVM, PhD Department of Veterinary Public Health Measurement What are the 4 levels of measurement discussed? 1. Nominal or Classificatory Scale Gender,

More information

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data

Person-Time Data. Incidence. Cumulative Incidence: Example. Cumulative Incidence. Person-Time Data. Person-Time Data Person-Time Data CF Jeff Lin, MD., PhD. Incidence 1. Cumulative incidence (incidence proportion) 2. Incidence density (incidence rate) December 14, 2005 c Jeff Lin, MD., PhD. c Jeff Lin, MD., PhD. Person-Time

More information

Basic Statistical Analysis

Basic Statistical Analysis indexerrt.qxd 8/21/2002 9:47 AM Page 1 Corrected index pages for Sprinthall Basic Statistical Analysis Seventh Edition indexerrt.qxd 8/21/2002 9:47 AM Page 656 Index Abscissa, 24 AB-STAT, vii ADD-OR rule,

More information

Glossary for the Triola Statistics Series

Glossary for the Triola Statistics Series Glossary for the Triola Statistics Series Absolute deviation The measure of variation equal to the sum of the deviations of each value from the mean, divided by the number of values Acceptance sampling

More information

Analysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013

Analysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013 Analysis of Categorical Data Nick Jackson University of Southern California Department of Psychology 10/11/2013 1 Overview Data Types Contingency Tables Logit Models Binomial Ordinal Nominal 2 Things not

More information

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression Section IX Introduction to Logistic Regression for binary outcomes Poisson regression 0 Sec 9 - Logistic regression In linear regression, we studied models where Y is a continuous variable. What about

More information

Basics on t-tests Independent Sample t-tests Single-Sample t-tests Summary of t-tests Multiple Tests, Effect Size Proportions. Statistiek I.

Basics on t-tests Independent Sample t-tests Single-Sample t-tests Summary of t-tests Multiple Tests, Effect Size Proportions. Statistiek I. Statistiek I t-tests John Nerbonne CLCG, Rijksuniversiteit Groningen http://www.let.rug.nl/nerbonne/teach/statistiek-i/ John Nerbonne 1/46 Overview 1 Basics on t-tests 2 Independent Sample t-tests 3 Single-Sample

More information

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2) B.H. Robbins Scholars Series June 23, 2010 1 / 29 Outline Z-test χ 2 -test Confidence Interval Sample size and power Relative effect

More information

Nonparametric Statistics

Nonparametric Statistics Nonparametric Statistics Nonparametric or Distribution-free statistics: used when data are ordinal (i.e., rankings) used when ratio/interval data are not normally distributed (data are converted to ranks)

More information

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics

Last Lecture. Distinguish Populations from Samples. Knowing different Sampling Techniques. Distinguish Parameters from Statistics Last Lecture Distinguish Populations from Samples Importance of identifying a population and well chosen sample Knowing different Sampling Techniques Distinguish Parameters from Statistics Knowing different

More information

Welcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT)

Welcome! Webinar Biostatistics: sample size & power. Thursday, April 26, 12:30 1:30 pm (NDT) . Welcome! Webinar Biostatistics: sample size & power Thursday, April 26, 12:30 1:30 pm (NDT) Get started now: Please check if your speakers are working and mute your audio. Please use the chat box to

More information

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA)

Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) BSTT523 Pagano & Gauvreau Chapter 13 1 Nonparametric Statistics Data are sometimes not compatible with the assumptions of parametric statistical tests (i.e. t-test, regression, ANOVA) In particular, data

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlation and Simple Linear Regression Sasivimol Rattanasiri, Ph.D Section for Clinical Epidemiology and Biostatistics Ramathibodi Hospital, Mahidol University E-mail: sasivimol.rat@mahidol.ac.th 1 Outline

More information

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou.

E509A: Principle of Biostatistics. (Week 11(2): Introduction to non-parametric. methods ) GY Zou. E509A: Principle of Biostatistics (Week 11(2): Introduction to non-parametric methods ) GY Zou gzou@robarts.ca Sign test for two dependent samples Ex 12.1 subj 1 2 3 4 5 6 7 8 9 10 baseline 166 135 189

More information

Non-parametric tests, part A:

Non-parametric tests, part A: Two types of statistical test: Non-parametric tests, part A: Parametric tests: Based on assumption that the data have certain characteristics or "parameters": Results are only valid if (a) the data are

More information

WORKSHOP 3 Measuring Association

WORKSHOP 3 Measuring Association WORKSHOP 3 Measuring Association Concepts Analysing Categorical Data o Testing of Proportions o Contingency Tables & Tests o Odds Ratios Linear Association Measures o Correlation o Simple Linear Regression

More information

Computational Systems Biology: Biology X

Computational Systems Biology: Biology X Bud Mishra Room 1002, 715 Broadway, Courant Institute, NYU, New York, USA L#7:(Mar-23-2010) Genome Wide Association Studies 1 The law of causality... is a relic of a bygone age, surviving, like the monarchy,

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty.

What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. What is Statistics? Statistics is the science of understanding data and of making decisions in the face of variability and uncertainty. Statistics is a field of study concerned with the data collection,

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

Review. More Review. Things to know about Probability: Let Ω be the sample space for a probability measure P.

Review. More Review. Things to know about Probability: Let Ω be the sample space for a probability measure P. 1 2 Review Data for assessing the sensitivity and specificity of a test are usually of the form disease category test result diseased (+) nondiseased ( ) + A B C D Sensitivity: is the proportion of diseased

More information

Correlation and Regression Bangkok, 14-18, Sept. 2015

Correlation and Regression Bangkok, 14-18, Sept. 2015 Analysing and Understanding Learning Assessment for Evidence-based Policy Making Correlation and Regression Bangkok, 14-18, Sept. 2015 Australian Council for Educational Research Correlation The strength

More information

More Statistics tutorial at Logistic Regression and the new:

More Statistics tutorial at  Logistic Regression and the new: Logistic Regression and the new: Residual Logistic Regression 1 Outline 1. Logistic Regression 2. Confounding Variables 3. Controlling for Confounding Variables 4. Residual Linear Regression 5. Residual

More information

Introduction to inferential statistics. Alissa Melinger IGK summer school 2006 Edinburgh

Introduction to inferential statistics. Alissa Melinger IGK summer school 2006 Edinburgh Introduction to inferential statistics Alissa Melinger IGK summer school 2006 Edinburgh Short description Prereqs: I assume no prior knowledge of stats This half day tutorial on statistical analysis will

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 1- part 1: Describing variation, and graphical presentation Outline Sources of variation Types of variables Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease

More information

Means or "expected" counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 giv

Means or expected counts: j = 1 j = 2 i = 1 m11 m12 i = 2 m21 m22 True proportions: The odds that a sampled unit is in category 1 for variable 1 giv Measures of Association References: ffl ffl ffl Summarize strength of associations Quantify relative risk Types of measures odds ratio correlation Pearson statistic ediction concordance/discordance Goodman,

More information

Bio 183 Statistics in Research. B. Cleaning up your data: getting rid of problems

Bio 183 Statistics in Research. B. Cleaning up your data: getting rid of problems Bio 183 Statistics in Research A. Research designs B. Cleaning up your data: getting rid of problems C. Basic descriptive statistics D. What test should you use? What is science?: Science is a way of knowing.(anon.?)

More information

= 1 i. normal approximation to χ 2 df > df

= 1 i. normal approximation to χ 2 df > df χ tests 1) 1 categorical variable χ test for goodness-of-fit ) categorical variables χ test for independence (association, contingency) 3) categorical variables McNemar's test for change χ df k (O i 1

More information

10: Crosstabs & Independent Proportions

10: Crosstabs & Independent Proportions 10: Crosstabs & Independent Proportions p. 10.1 P Background < Two independent groups < Binary outcome < Compare binomial proportions P Illustrative example ( oswege.sav ) < Food poisoning following church

More information

Statistical. Psychology

Statistical. Psychology SEVENTH у *i km m it* & П SB Й EDITION Statistical M e t h o d s for Psychology D a v i d C. Howell University of Vermont ; \ WADSWORTH f% CENGAGE Learning* Australia Biaall apan Korea Меяко Singapore

More information

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations

Basics of Experimental Design. Review of Statistics. Basic Study. Experimental Design. When an Experiment is Not Possible. Studying Relations Basics of Experimental Design Review of Statistics And Experimental Design Scientists study relation between variables In the context of experiments these variables are called independent and dependent

More information

Testing Independence

Testing Independence Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1

More information

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and

More information

STATISTICS ( CODE NO. 08 ) PAPER I PART - I

STATISTICS ( CODE NO. 08 ) PAPER I PART - I STATISTICS ( CODE NO. 08 ) PAPER I PART - I 1. Descriptive Statistics Types of data - Concepts of a Statistical population and sample from a population ; qualitative and quantitative data ; nominal and

More information

psychological statistics

psychological statistics psychological statistics B Sc. Counselling Psychology 011 Admission onwards III SEMESTER COMPLEMENTARY COURSE UNIVERSITY OF CALICUT SCHOOL OF DISTANCE EDUCATION CALICUT UNIVERSITY.P.O., MALAPPURAM, KERALA,

More information

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career.

9/2/2010. Wildlife Management is a very quantitative field of study. throughout this course and throughout your career. Introduction to Data and Analysis Wildlife Management is a very quantitative field of study Results from studies will be used throughout this course and throughout your career. Sampling design influences

More information

Sample Size/Power Calculation by Software/Online Calculators

Sample Size/Power Calculation by Software/Online Calculators Sample Size/Power Calculation by Software/Online Calculators May 24, 2018 Li Zhang, Ph.D. li.zhang@ucsf.edu Associate Professor Department of Epidemiology and Biostatistics Division of Hematology and Oncology

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

Logistic regression: Miscellaneous topics

Logistic regression: Miscellaneous topics Logistic regression: Miscellaneous topics April 11 Introduction We have covered two approaches to inference for GLMs: the Wald approach and the likelihood ratio approach I claimed that the likelihood ratio

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<= A frequency distribution is a kind of probability distribution. It gives the frequency or relative frequency at which given values have been observed among the data collected. For example, for age, Frequency

More information

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination

McGill University. Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II. Final Examination McGill University Faculty of Science MATH 204 PRINCIPLES OF STATISTICS II Final Examination Date: 20th April 2009 Time: 9am-2pm Examiner: Dr David A Stephens Associate Examiner: Dr Russell Steele Please

More information

Chapter Six: Two Independent Samples Methods 1/51

Chapter Six: Two Independent Samples Methods 1/51 Chapter Six: Two Independent Samples Methods 1/51 6.3 Methods Related To Differences Between Proportions 2/51 Test For A Difference Between Proportions:Introduction Suppose a sampling distribution were

More information

Biostatistics for biomedical profession. BIMM34 Karin Källen & Linda Hartman November-December 2015

Biostatistics for biomedical profession. BIMM34 Karin Källen & Linda Hartman November-December 2015 Biostatistics for biomedical profession BIMM34 Karin Källen & Linda Hartman November-December 2015 12015-11-02 Who needs a course in biostatistics? - Anyone who uses quntitative methods to interpret biological

More information

Psych 230. Psychological Measurement and Statistics

Psych 230. Psychological Measurement and Statistics Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,

More information

Suppose that we are concerned about the effects of smoking. How could we deal with this?

Suppose that we are concerned about the effects of smoking. How could we deal with this? Suppose that we want to study the relationship between coffee drinking and heart attacks in adult males under 55. In particular, we want to know if there is an association between coffee drinking and heart

More information

Introduction to Statistics with GraphPad Prism 7

Introduction to Statistics with GraphPad Prism 7 Introduction to Statistics with GraphPad Prism 7 Outline of the course Power analysis with G*Power Basic structure of a GraphPad Prism project Analysis of qualitative data Chi-square test Analysis of quantitative

More information

Intro to Parametric & Nonparametric Statistics

Intro to Parametric & Nonparametric Statistics Kinds of variable The classics & some others Intro to Parametric & Nonparametric Statistics Kinds of variables & why we care Kinds & definitions of nonparametric statistics Where parametric stats come

More information

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests

More information

Statistics 262: Intermediate Biostatistics Regression & Survival Analysis

Statistics 262: Intermediate Biostatistics Regression & Survival Analysis Statistics 262: Intermediate Biostatistics Regression & Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Introduction This course is an applied course,

More information

Exploring, summarizing and presenting data. Berghold, IMI, MUG

Exploring, summarizing and presenting data. Berghold, IMI, MUG Exploring, summarizing and presenting data Example Patient Nr Gender Age Weight Height PAVK-Grade W alking Distance Physical Functioning Scale Total Cholesterol Triglycerides 01 m 65 90 185 II b 200 70

More information

What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone

What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone What Are Nonparametric Statistics and When Do You Use Them? Jennifer Catrambone First, a bit about Parametric Statistics Data are expected to be randomly drawn from a normal population Minimum sample size

More information

Online supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population

Online supplement. Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in. Breathlessness in the General Population Online supplement Absolute Value of Lung Function (FEV 1 or FVC) Explains the Sex Difference in Breathlessness in the General Population Table S1. Comparison between patients who were excluded or included

More information

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur

Fundamentals to Biostatistics. Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Fundamentals to Biostatistics Prof. Chandan Chakraborty Associate Professor School of Medical Science & Technology IIT Kharagpur Statistics collection, analysis, interpretation of data development of new

More information

Cohen s s Kappa and Log-linear Models

Cohen s s Kappa and Log-linear Models Cohen s s Kappa and Log-linear Models HRP 261 03/03/03 10-11 11 am 1. Cohen s Kappa Actual agreement = sum of the proportions found on the diagonals. π ii Cohen: Compare the actual agreement with the chance

More information

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES Cox s regression analysis Time dependent explanatory variables Henrik Ravn Bandim Health Project, Statens Serum Institut 4 November 2011 1 / 53

More information

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F. Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 13 Nonparametric Statistics 13-1 Overview 13-2 Sign Test 13-3 Wilcoxon Signed-Ranks

More information

2 Describing Contingency Tables

2 Describing Contingency Tables 2 Describing Contingency Tables I. Probability structure of a 2-way contingency table I.1 Contingency Tables X, Y : cat. var. Y usually random (except in a case-control study), response; X can be random

More information

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?)

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?) 12. Comparing Groups: Analysis of Variance (ANOVA) Methods Response y Explanatory x var s Method Categorical Categorical Contingency tables (Ch. 8) (chi-squared, etc.) Quantitative Quantitative Regression

More information

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM

TA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM STAT 301, Fall 2011 Name Lec 4: Ismor Fischer Discussion Section: Please circle one! TA: Sheng Zhgang... 341 (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan... 345 (W 1:20) / 346 (Th

More information

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization. Statistical Tools in Evaluation HPS 41 Dr. Joe G. Schmalfeldt Types of Scores Continuous Scores scores with a potentially infinite number of values. Discrete Scores scores limited to a specific number

More information

Statistics Handbook. All statistical tables were computed by the author.

Statistics Handbook. All statistical tables were computed by the author. Statistics Handbook Contents Page Wilcoxon rank-sum test (Mann-Whitney equivalent) Wilcoxon matched-pairs test 3 Normal Distribution 4 Z-test Related samples t-test 5 Unrelated samples t-test 6 Variance

More information

Statistical Inference Theory Lesson 46 Non-parametric Statistics

Statistical Inference Theory Lesson 46 Non-parametric Statistics 46.1-The Sign Test Statistical Inference Theory Lesson 46 Non-parametric Statistics 46.1 - Problem 1: (a). Let p equal the proportion of supermarkets that charge less than $2.15 a pound. H o : p 0.50 H

More information

Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning

Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity. Dankmar Böhning Confounding and effect modification: Mantel-Haenszel estimation, testing effect homogeneity Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK Advanced Statistical

More information

Lecture 01: Introduction

Lecture 01: Introduction Lecture 01: Introduction Dipankar Bandyopadhyay, Ph.D. BMTRY 711: Analysis of Categorical Data Spring 2011 Division of Biostatistics and Epidemiology Medical University of South Carolina Lecture 01: Introduction

More information