C. Statistics. X = n geometric the n th root of the product of numerical data ln X GM = or ln GM = X 2. X n X 1

Similar documents
Lecture Notes Types of economic variables

Summary of the lecture in Biostatistics

Simple Linear Regression

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Chapter 13 Student Lecture Notes 13-1

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

STA 105-M BASIC STATISTICS (This is a multiple choice paper.)

ENGI 3423 Simple Linear Regression Page 12-01

Statistics MINITAB - Lab 5

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

MEASURES OF DISPERSION

Probability and. Lecture 13: and Correlation

Continuous Distributions

Special Instructions / Useful Data

Simple Linear Regression

Lecture 3. Sampling, sampling distributions, and parameter estimation

Chapter Statistics Background of Regression Analysis

CHAPTER VI Statistical Analysis of Experimental Data

Previous lecture. Lecture 8. Learning outcomes of this lecture. Today. Statistical test and Scales of measurement. Correlation

Chapter 14 Logistic Regression Models

Bootstrap Method for Testing of Equality of Several Coefficients of Variation

hp calculators HP 30S Statistics Averages and Standard Deviations Average and Standard Deviation Practice Finding Averages and Standard Deviations

Lecture 1 Review of Fundamental Statistical Concepts

Chapter 8. Inferences about More Than Two Population Central Values

Statistics: Unlocking the Power of Data Lock 5

residual. (Note that usually in descriptions of regression analysis, upper-case

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

Analysis of Variance with Weibull Data

THE ROYAL STATISTICAL SOCIETY GRADUATE DIPLOMA

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

STA302/1001-Fall 2008 Midterm Test October 21, 2008

Multiple Choice Test. Chapter Adequacy of Models for Regression

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Parameter, Statistic and Random Samples

Chapter 3 Sampling For Proportions and Percentages

Lecture 8: Linear Regression

Chapter 11 The Analysis of Variance

ESS Line Fitting

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Functions of Random Variables

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

22 Nonparametric Methods.

Uncertainty, Data, and Judgment

Module 7: Probability and Statistics

Objectives of Multiple Regression

Lesson 3. Group and individual indexes. Design and Data Analysis in Psychology I English group (A) School of Psychology Dpt. Experimental Psychology

Random Variables and Probability Distributions

CLASS NOTES. for. PBAF 528: Quantitative Methods II SPRING Instructor: Jean Swanson. Daniel J. Evans School of Public Affairs

ENGI 4421 Joint Probability Distributions Page Joint Probability Distributions [Navidi sections 2.5 and 2.6; Devore sections

A New Family of Transformations for Lifetime Data

: At least two means differ SST

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

is the score of the 1 st student, x

Improving coverage probabilities of confidence intervals in random effects meta-analysis with publication bias

Econometric Methods. Review of Estimation

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

LECTURE - 4 SIMPLE RANDOM SAMPLING DR. SHALABH DEPARTMENT OF MATHEMATICS AND STATISTICS INDIAN INSTITUTE OF TECHNOLOGY KANPUR


Correlation and Simple Linear Regression

Lecture notes on epidemiology

Measures of Dispersion

IFYMB002 Mathematics Business Appendix C Formula Booklet

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

Likelihood Ratio, Wald, and Lagrange Multiplier (Score) Tests. Soccer Goals in European Premier Leagues

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

Multiple Linear Regression Analysis

Chapter -2 Simple Random Sampling

= 1. UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences. Parameters and Statistics. Measures of Centrality

Module 7. Lecture 7: Statistical parameter estimation

4. Standard Regression Model and Spatial Dependence Tests

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.

Chapter 8: Statistical Analysis of Simulated Data

THE ROYAL STATISTICAL SOCIETY 2016 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 5

Chapter 5 Properties of a Random Sample

Handout #1. Title: Foundations of Econometrics. POPULATION vs. SAMPLE

Comparison of Dual to Ratio-Cum-Product Estimators of Population Mean

Lecture 3 Probability review (cont d)

Chapter 4 Multiple Random Variables

Logistic regression (continued)

Permutation Tests for More Than Two Samples

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

Chapter -2 Simple Random Sampling

Chapter Two. An Introduction to Regression ( )

X ε ) = 0, or equivalently, lim

GOALS The Samples Why Sample the Population? What is a Probability Sample? Four Most Commonly Used Probability Sampling Methods

Fitting models to data.

Chapter 4 Multiple Random Variables

Arithmetic Mean Suppose there is only a finite number N of items in the system of interest. Then the population arithmetic mean is

Outline. treatments. Introduction

STATISTICAL INFERENCE

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #1

Third handout: On the Gini Index

BAYESIAN INFERENCES FOR TWO PARAMETER WEIBULL DISTRIBUTION

Simple Linear Regression - Scalar Form

Bayesian Inferences for Two Parameter Weibull Distribution Kipkoech W. Cheruiyot 1, Abel Ouko 2, Emily Kirimi 3

Transcription:

C. Statstcs a. Descrbe the stages the desg of a clcal tral, takg to accout the: research questos ad hypothess, lterature revew, statstcal advce, choce of study protocol, ethcal ssues, data collecto ad processg. b. Expla cocepts statstcs such as: dstrbuto of data ad frequecy dstrbutos, measures of cetral tedecy ad dsperso of data ad the approprate selecto ad applcato of o-parametrc ad parametrc tests statstcal ferece. data types omal a lst of possble results e.g. death/dscharge/trasfer to aother sttuto ordal a ordered groupg of results o a scale wth dscrete pots e.g. ASA status, Duke s stagg umercal terval equal tervals betwee values but o absolute zero e.g. temperature C rato a lear scale from a absolute zero e.g. mea arteral pressure parametrc data whch are dstrbuted ormally varable a measuremet of a sample parameter a measuremet of the populato measuremet of cetral tedecy mea arthmetc X the average of umercal data: X = geometrc the th root of the product of umercal data l X GM = X 2 X or l GM = ot applcable to omal or ordal data affected by outlers meda the mddle result rak order mode the most commo result data o a dscrete scale dstrbutos may have more tha oe mode measuremet of varablty rage the dfferece betwee largest ad smallest values terquartle (25%-75%) or 5%-95% rages are sometmes quoted sutable for ordal or terval data varace (X X ) 2 var = Statstcs 2.C.1 James Mtchell (December 24, 2003)

sutable for terval data oly stadard devato (X X ) 2 s = 1 s used for descrptos of a sample ad -1 for descrptos of a populato derved from a sample both varace ad stadard devato are a fucto of the populato ad do ot chage wth sample sze comparg varaces of samples s a test to see f they are from the same populato: F test coeffcet of varato the stadard devato as a percetage of the mea allows comparso of the degree of varace betwee measuremets of dfferet quattes depedet of sample sze ad mea value CV = s X F-test varaces of data sets ca also be compared usg the F-test larger varace s dvded by smaller varace tables provde a cofdece lmt for the rato obtaed that the data sets are from the same populato stadard error of the mea descrbes the relatoshp betwee the sample mea ad the populato mea SEM = s for a ormally dstrbuted data set, the populato mea has a probablty of 96% of fallg wth 2xSEM of the sample mea ths s used to provde cofdece tervals from sample data cofdece tervals are ow preferred to p values SEM falls wth creasg sample sze hypothess testg ull hypothess (H 0 ) there s o dfferece betwee groups studed alterate hypothess (H 1 ) there s a dfferece betwee groups studed type I error false cocluso that H 0 s false based o sample (spurous sgfcat result) probablty of a type I error a gve study s called α probablty that a type I error has occurred s called p type II error false cocluso that H 0 s true based o sample (mssg a real dfferece) probablty of a type II error a gve study s called ß ß vares versely wth α, depedg o study desg sgfcace a arbtrary decso as to the maxmum p value acceptable as evdece that H 0 s false typcally 0.05 for bologcal studes power probablty of fdg H 0 s false gve that t really s false (fdg a real dfferece) power = 1 - ß creases wth sample sze, α, parametrc (>o-parametrc) aalyss dstrbutos bomal Statstcs 2.C.2 James Mtchell (December 24, 2003)

descrbes the probablty dstrbuto for a fxed umber of depedet evets wth two possble outcomes of costat probablty probablty of x succeses each of probablty π from trals s P(x;, π) = C x π x (1 - π) -x for large values of, ths approaches ormal dstrbuto posso descrbes dstrbuto of results where a evet occurs wth a kow frequecy (λ) at radom tervals ormal a symmetrcal dstrbuto represetg the lmt of the bomal dstrbuto as approaches parametrc tests requre ormal dstrbuto smlar varace betwee data sets (F-test) depedet data sets Studet s t-test compares two groups from the same populato to detect a dfferece meas at a specfed level of sgfcace t = X 2 SE 2 + SE X 2 2 or t = X 2 var p ( 1 1 + 1 2 ) where var p = var 1 + var 2 ( 1 1) +( 2 1) prcple, t s a mea dvded by a SEM t s a expresso of the spread of the SEM dstrbuto for a gve sample sze (degrees of freedom). There s a 95% probablty that the populato mea les wth t x SEM of the sample mea. t approaches 1.96 as sample sze approaches, ad s approxmated to 1.96 for >30. the result t s compared wth tables gvg mmum values of t for specfed sgfcace ad degrees of freedom ca be performed o pared (depedet) or upared data, oe- or two-taled depedg o whether the alteratve hypothess postulates a drecto for chage of the mea ANOVA (aalyss of varace) compares the varace of multple groups agast the depedet varable (oe-way ANOVA) or agast each other (Multple ANOVA) by calculatg a F-rato = betwee-groups varace wth-groups varace ths detfes the presece of a dfferece but ot whch groups cause t multple comparso tests used to detfy dfferet groups from ANOVA based o t-test but modfed to dmsh the rsk of type I error may epoymous varetes Boferro, Newma-Keuls, Duca, Duett, Du, Tukey, Scheffe, Least sgfcat dfferece etc. o-parametrc tests sutable for ordal data ad data whch s ot ormally dstrbuted less power tha parametrc tests Wlcoxo sged raks test pared t-test Ma-Whtey u test 2-taled t-test two data sets to compare all data are raked 1 to 1 + 2 rak sums R 1 ad R 2 Statstcs 2.C.3 James Mtchell (December 24, 2003)

U = 1 2 + 1 / 2 ( 1 ( 1 + 1)) - R 1 U s compared wth values for a specfed α ad degrees of freedom Kruskall Walls test ANOVA Fredma s test Multple ANOVA Spearma rak order r χ 2 test used for omal data compares rates of depedet evets for sgfcat dfferece requres expected rates of more tha 5 evets 80% of cells (otherwse use Fsher s Exact Test) χ 2 (O E) 2 = where E s the expected umber ad O the observed umber of E evets tables of χ 2 values for levels of sgfcace ad degrees of freedom degrees of freedom = (tervetos - 1) x (outcomes - 1) Vomtg No Vomtg Total Atemetc 10 90 100 Placebo 30 70 100 Total 40 160 200 E vomtg = 20 each group, E o vomtg = 80 each group χ 2 = 5 + 5 + 1.25 + 1.25 = 12.5, df = 1 Yates correcto for 2x2 matrces wth fewer tha 40 trals reduces the magtude of (O - E) by 0.5 for each term χ 2 Fsher s exact test for small sample szes 2x2 matrces calculates p regardless of df usg the bomal dstrbuto p = R 1!R 2!C 1!C 2!! 11! 12! 21! 22! for the example p=0.000248 oerous to calculate for large McNemar s test s used for matched data. odds rato the rato of the cdece of a outcome a exposed group versus a cotrol group a case-cotrol study called a rsk rato a prospectve cohort study a cofdece terval assocated wth a odds rato does ot spa 1 f the correlato s sgfcat odds rato descrbes the stregth of assocato as well as ts presece regresso ad correlato regresso s a mathematcal descrpto of the assocato betwee two varables the depedet varable s plotted o the Y axs lear regresso produces a relatoshp y=ax+b such that (Y - ax +b) 2 for the set of data pots s mmzed correlato s a descrpto of the tghtess wth whch a data set matches a regresso (X a = X )(Y Y ) (X X ) 2 b = Y ax the coeffcet of determato, r 2 expresses stregth of correlato (from 0 to 1) (ax r 2 = + b Y ) 2 (Y Y ) 2 the correlato coeffcet s r (varyg from -1 to +1) r 2 expresses the proporto of the value of the depedet varable attrbutable to the depedet varable Regresso assumes a costat varace over the rage of data. If varace s ot Statstcs 2.C.4 James Mtchell (December 24, 2003)

costat, a correcto factor s added to the least squares calculato. Where the ature of the relatoshp betwee depedet ad depedet varables s kow to be of a o-lear ature (e.g. dose-respose), a trasform ca be appled to the data to reder the relatoshp lear (e.g. Hll plot or Leweaver-Burke plot). sequetal aalyss sutable for pared subjects gvg omal results e.g. drug vs placebo: vomtg or o vomtg a curve ca be draw o a graph of umber of uted pars versus excess outcomes, the boudares of whch mark statstcal sgfcace at a specfed p value ad H 0 demostrated: H 1 demostrated excess results umatched pars H 0 demostrated Rato Rate allows termato of a tral as soo as sgfcace s acheved largely obsolete techque, replaced by term aalyss (wth a low p) the umber of oe of two possble results dvded by that of the other used for comparso betwee data sets the umber of oe result dvded by the total umber of data pots applcable to ordal ad omatve data c. Expla the prcples of errors of statstcal ferece ad descrbe techques to mmze such errors through good study desg. d. Descrbe the features of a dagostc test, cludg the cocepts: sestvty, specfcty, postve ad egatve predctve value ad how these are flueces by the prevalece of the dsease questo. sestvty a property of a test, regardless of the populato tested the proporto of subjects havg the attrbute beg tested for who show a postve result for the test e.g. proporto of hepatts B fected patets testg postve for hepatts B surface atge (HBsAg) = true postves (true postves + false egatves) specfcty the proporto of subjects ot havg the attrbute beg tested for who show a egatve result for the test e.g. proporto of patets wthout hepatts B testg egatve for HBsAg = true egatves (true egatves + false postves) Statstcs 2.C.5 James Mtchell (December 24, 2003)