RACE 618: Meta-analysis for genetic association studies

Size: px
Start display at page:

Download "RACE 618: Meta-analysis for genetic association studies"

Transcription

1 RACE 618: Meta-analyss for genetc assocaton studes Prof. Ammarn Thakknstan, Ph.D. Tel: , e-mal:

2 Outlne of talk Introducton & background Data analyss Checkng HWE Poolng allele prevalence Checkng heterogenety of genetc effects Determnaton of gene effects One-stage approach: Regresson analyss Two-stage approach: Multvarate meta-analyss Determnng the best genetc model Publcaton bas

3 Background Wth the rse of genetc and molecular epdemology, populaton-based molecular assocaton studes have become a popular research desgn Atta J, Thakknstan A, D'Este C. Meta-analyses of molecular assocaton studes: methodologc lessons for genetc epdemology. Journal of clncal epdemology 2003;56(4): ground

4 Background Current paradgm Multple genes wll contrbute to a complex dsease Each gene wll have a small effect sze Typcal studes wll not have enough power to detect these small effects Therefore, there wll be an even greater need for meta-analyss

5 Meta-analyss n genetc assocaton studes Increasngly used for poolng data n tradtonal epdemology Increasng work on applyng ths to genetc lnkage studes BUT, lttle attenton has been focused on usng ths method n populatonbased genetc ssocaton studes

6 Methodologc ssues ) Adherence to tradtonal qualty for systematc revew & meta-analyss Locate studes Databases/search terms & strateges Selecton of studes Incluson/excluson Data extracton Two revewers Poolng data Fxed vs random-effect model Heterogenety test, Publcaton bas

7 ) Handlng of partcular genetc ssues Hardy-Wenberg Equlbrum (HWE) Two alleles: A/a wth frequences p, q Genotype frequences AA, Aa, aa: p 2, 2pq, q 2 HW dsequlbrum Alleles are not segregatng ndependently Non-random matng Genotypng error Populaton stratfcaton Selecton bas

8 Genetc models used n poolng Usual meta-analyses pool 2 groups Exp vs non-exp, Rx vs Placebo Gene: 2 alleles: A, a 3 genotypes: AA, Aa, aa Determnaton of genetc effects Per-allele approach Comparng allele frequences (A, a) Per-genotype approach Parwse comparson of genotypes AA vs aa, Aa vs aa, AA vs Aa Assumng a genetc model

9 Effect sze Domnant effect of A AA Aa aa

10 Effect sze Recessve effect of A AA Aa aa

11 Effect sze Codomnant/addtve effect of A AA Aa aa

12 Recommendatons Observe gudelnes for tradtonal epdemologcal meta-analyss Partcular attenton to heterogenety Check for HWE Need to decde how to pool data n a way that s senstve to genetc models wthout multple comparsons J Cln Epdemol Apr;56(4): ; Stat Med May 15;24(9):

13 Applyng basc meta-analyss One par each of the tme Multple comparsons Increase type I error Number of genotypes = 3 Number of comparsons 3C2 = 3 Total type I error = 1-{1-0.05} 3 = Regresson analyss Logstc regresson Lnear regresson

14 Data analyss 1. Checkng HWE For dchotomous outcomes, Case-controls, or cohorts, or cross-sectonal studes HWE should be assessed only n the control or non-dsease group for case-control, use data for the whole cohort/cross-sectonal study

15 For contnuous outcomes, Study desgns are usually cohort or crosssectonal studes, HWE should be checked consderng the subjects as one large group The ch-square goodness of ft s used to test devaton from HWE 2 d. f ( o e e k 11 ) 2

16 Table 2. Assessng HWE for CFH polymorphsm Genotype Observe frequency Expected frequency P value TT TC CC CFH, complement factor H

17 0.141 p - value 1 d.f , ) ( ) ( ) ( ) / ( ) 40 ( x xn p e x x x xn p p e x xn p e p p p x n n a aa a A Aa AA A a A allele A

18 genhw Genotype Observed Expected AA Aa aa total Allele Observed Frequency Std. Err A a total Estmated dsequlbrum coeffcent (D) = Hardy-Wenberg Equlbrum Test: Pearson ch2 (1) = Pr= lkelhood-rato ch2 (1) = Pr= Exact sgnfcance prob =

19 Handlng studes that do not comply wth HWE Senstvty analyses ncludng and excludng studes not n HWE. Include all studes, but those ~HWE wll be adjusted for the degree of dsequlbrum usng the nbreedng coeffcent (F) suggested by Trkalnos et al (Am J Epdemol, 163: 300, 2006)

20 2. Poolng prevalence A requrement of Human Genome Epdemology Network (HuGENet) Poolng prevalence of mnor allele n control group How often the gene s dentfed n general populaton Pool by ethncty

21 Poolng prevalence Thakknstan et al; Am J Epdemol 2005:16; 201 Anothasntawee et al; Cln Nephrol 2009;71: 244

22 var( p ) pq n

23 3. Checkng heterogenety of genetc effects Table 3. Data notatons for a dchotomous outcome Genotype Case Control OR AA n 11 n 10 OR 1 Aa n 21 n 20 OR 2 aa n 31 n 30 1 *

24 OR OR 1 2 n n n n n n n n

25 ^ k k ^ ^ p p k p n n n n ) OR ( ) OR ( w n n n n OR θ w OR w OR θ θ θ w Q ^ ^ ln var ln var 1 ) ln( ln ˆ ln ln ˆ ) ˆ ˆ (

26 Contnuous data Table 4. Data notaton for contnuous outcome Genotype n Mean SD AA n 1 Mean 1 SD 1 Aa n 2 Mean 2 SD 2 aa n 3 Mean 3 SD 3

27 k Q w (d D) ˆ 2 k w d ˆ 1 D k w 1 w 1 var( d )

28 SMD ( x x ) d 1 3 sd ( n 1) sd2 ( n 1) sd2 sd n n n d2 var( d ) n n (Cohen's 1 3 2( n 2) method) USMD d ( x x ) 1 3 sd2 2 var( d ) 1 sd 3 n n 1 3 If the Q test s sgnfcant OR I 2 25%, between -study varaton should be accounted n poolng

29 4. Determnaton of gene effects A) One-stage approach by regresson analyss Dchotomous outcome Logstc regresson Bnary regresson Posson regresson Contnuous outcome Lnear regresson

30 Fxed effect model for dchotomous outcome πj(x) ln (1 πj(x)) α 2 j1 β j (genotype j ) ε j

31 Random effect model π j (x) 2 ln α (1 π (x)) j 1 j β j ( genotype j ) j j Lkelhood rato test s appled by comparng - loglkelhoo(ll) of the null model wth the model contaned genotype varable.if effects s next explored. t s sgnfcant, the mode of gene

32 Fxed effect model Contnuous outcome 2 Y j α j1 β j ( genotype j ) j

33 Random effect model 2 Y j α j1 β j ( genotype j ) j j F test s used to assess gene effects by comparng the full model wth the model wthout genotype varable.

34 B) Two-stage approach usng multvarate (mv) meta-analyss The relatve gene effects (e.g., d 1 and d 2 for contnuous outcome; OR 1 and OR 2 for dchotomous outcome) are smultaneously pooled at once Model Y ~ N(, V ~ N( X ), )

35 Y s a vector of genotype effects estmated from each study, V s a varance-covarance matrx of genotype effects (.e., OR 1 & OR 2 ), µ s study-specfc mean vector, X s study-genotype matrx. The mvmeta ams to pool gene effects,.e., β coeffcents and between-study varancecovarance matrx. Degree of heterogenety I 2 s also estmated for each comparson.

36 C) Two-stage approach usng IPD meta-analyss Perform two-stage nverse-varance ndvdual patent data meta-analyss The same concept as mvmeta,.e., estmate coeffcent & SE, then pool coeffcents across studes Type of model use depends on type of outcome, e.g., logt, regress

37 5. Determnng the best genetc model OR Mean dfference Mode of effects OR OR 1and OR OR OR 1and OR D D 0 and D D D 0 and D Recessve effect domnant effect codomnant (addtve) effect OR OR 1and OR OR 1 D D 0 and D D Or or OR OR 1 2 1and OR OR D 1 D 2 0 and D 1 D 3 0

38 Table 5. Multple comparsons of genotype effects and possble modes of nhertance *D s pooled mean dfference, OR s pooled odds rato Mode of nhertance AA versus aa Aa versus aa (D 1 or OR 1 ) * (D 2 or OR 2 ) AA versus Aa (D 3 or OR 3 ) Recessve Domnant Complete overdomnant Codomnant

39 By Mnell C et al (Int J Epdemol, 34: 1319, 2005): Bayesan approach ln ln OR OR Aa AA λ Gene effects 0 Recessve effect 1 Domnant effect 0.5 Addtve effect

40 Example 5. Assess assocaton between CFH Y402H polymorphsm & AMD The Y402H polymorphsm (OMIM# ) s defned by a substtuton of tyrosne to hstdne at codon 402 of the CFH gene A regon of the proten that bnds heparn and C- reactve proten Has an ant-nflammatory role TC, 3 genotypes TT, TC, CC

41 Genotype frequences between case and control groups Authors AMD Control N Genotype N Genotype TT TC CC TT TC CC Conley YP Edwards AO Hageman GS Klen RJ Magnusson KP Rvera A* Soued EH Zarepars S

42 Step of analyss Checkng HWE n control groups Pool mnor allele prevalence Checkng heterogenety of ORs Assess overall gene effects Assess mode of genetc effects Am J Epdemol Jun 15;173(12): ; Stat Med May 15;24(9):

43 Table7. Allelc data and estmaton of the pooled prevalence of the CFH Y402H allele Authors HWE N T C P-value Frequency Allele Frequency Allele Var(p) Conley YP Edwards AO Hageman GS Klen RJ Magnusson KP Rvera A* Soued EH Zarepars S Pooled prevalence *Not ncluded n the man poolng

44 Heterogenety OR1 Ch-square = (d.f. = 6) p = I-square = 69.0% OR2 Ch-square = (d.f. = 6) p = I-square = 58.7%

45 Gene effects Apply a mxed-effect logstc regresson π j (x) 2 ln α (1 π (x)) j 1 j β j ( genotype j ) j j

46 Modfy APD to ndvdual patent data (IPD) Step 1: reshape genotype from wde-wde to wde-long ren (TT_case TC_case CC_case) (gene11 gene12 gene13) ren (TT_cont TC_cont CC_cont) (gene01 gene02 gene03) reshape long gene0 gene1, (Study) j(genotype) Step 2: reshape group from wde-long to long-long reshape long gene, (Study genotype) j(gr) tab genotype gr [ freq=gene] Expand summary data to IPD expand gene bysort Study: tab genotype gr May need to drop record were cell = 0

47 The mxed-effect logt model wth panel data approach xtlogt gr.genotype f Study ~=8, (Study) nolog lncom 3.genotype-2.genotype pwcompare genotype, or est store A Test overall gene effect xtlogt gr f Study ~=8, (Study) lrtest A

48 Mult-level regresson Use melogt, meqrlogt, or xtmelogt command melogt gr.genotype f Study ~=8 Study: melogt, or pwcompare genotype, eform Unadjusted exp(b) Std. Err. [95% Conf. Interval] gr genotype 2 vs vs vs

49 Forrest plot tab genotype, gen(genotype) set more off xtmelogt gr genotype3 genotype2 f Study ~=8 /// Study:genotype3, nolog or pdforest genotype3, label(author Year) or qu xtmelogt gr genotype2 genotype3 f Study ~=8 Study: /// genotype2, nolog pdforest genotype2, label(author Year) or

50 Two-stage usng IPD meta DerSmonan-Lad method pdmetan, study(author) random or: /// logt gr genotype3 genotype2 f Study~=8 pdmetan, study(author) random or: /// logt gr genotype2 genotype3 f Study~=8

51 OR3 (CC vs TT) OR2 (TC vs TT) Author Odds Rato (95% CI) % Weght Author Odds Rato (95% CI) % Weght Conley YP Edwards AO et al (5.32, 24.39) 4.54 (2.70, 7.65)??10.99??15.40 Conley YP Edwards AO et al 4.97 (2.66, 9.28) 2.14 (1.43, 3.18)????9.10??15.14 Hageman GS et al 5.44 (3.82, 7.77)??18.98 Hageman GS et al 2.53 (1.93, 3.31)??20.24 Klen at al 8.50 (2.80, 25.79)????6.84 Klen at al 1.89 (0.80, 4.51)????5.58 Magnusson et al Soued et al Zarepars S et al Overall (I-squared = 69.0%, p = 0.004) 4.13 (3.28, 5.20) 6.84 (3.07, 15.21) (7.05, 19.14) 6.44 (4.57, 9.08)??21.51??10.42?? Magnusson et al Soued et al Zarepars S et al Overall (I-squared = 58.7%, p = 0.024) 1.87 (1.55, 2.25) 2.99 (1.61, 5.57) 3.03 (2.15, 4.28) 2.52 (2.00, 3.16)??23.67????9.13?? NOTE: Weghts are from Random-effects; DerSmonan-Lard estmator NOTE: Weghts are from Random-effects; DerSmonan-Lard estmator

52 Two-stage usng mvmeta vs networkmeta Requre b V b = lnorj V = varance-covarance matrx These two parameter-matrxes can be prepared usng mvmeta_make command, agan requre IPD

53 Requred data: long format for genotype & d n Study d n genotype ren (TT_case TC_case CC_case) (gene11 gene12 gene13) ren (TT_cont TC_cont CC_cont) (gene01 gene02 gene03) reshape long gene0 gene1, (Study) j(genotype) lst Study genotype gene1 gene0 n 1/ Study genotype gene1 gene

54 gen ngenotype =gene1+gene0 ren gene1 event1 lst Study genotype event1 gene0 ngenotype n 1/ Study genotype event1 gene0 ngenot~e Setup data as network data network setup event1 ngenotype, or studyvar(study) trtvar(genotype) armvars(drop)

55 Droppng arm-level varables: Treatments used A (reference): 1 B: 2 C: 3 Measure Studes ID varable: Number used: 8 IDs wth zero cells: Network nformaton Components: D.f. for nconsstency: 0 D.f. for heterogenety: 14 Current data Data format: Desgn varable: Estmate varables: Varance varables: Command to lst the data: gene0 Log odds rato Study [none] 1 (connected) augmented _desgn _y* _S* lst Study _y* _S*, noo sepby(_desgn)

56 . lst Study _desgn _y_b _y_c _S_B_B _S_B_C _S_C_C Study _desgn _y_b _y_c _S_B_B _S_B_C _S_C_C A B C A B C A B C A B C A B C A B C A B C A B C

57 network meta consstency f Study~=8, eform Command s: mvmeta _y _S f Study~=8, eform bscovarance(exch 0.5) longparm suppress(uv mm) vars(_y_b _y_c) Note: usng method reml Note: usng varables _y_b _y_c Note: 7 observatons on 2 varables Note: varance-covarance matrx s proportonal to.5*i(2)+.5*j(2,2,1) ntal: log lkelhood = Iteraton 4: log lkelhood = Multvarate meta-analyss Varance-covarance matrx = proportonal.5*i(2)+.5*j(2,2,1) Method = reml Number of dmensons = 2 Restrcted log lkelhood = Number of observatons = exp(coef) Std. Err. z P> z [95% Conf. Interval] _y_b _cons _y_c _cons

58 Forest plot network forest f Study~=8, eform xlne(1) xlne( ) xlab( ) Study 1 Study 2 Study 3 Study 4 Study 5 Study 6 Study 7 All studes Study 1 Study 2 Study 3 Study 4 Study 5 Study 6 Study 7 All studes Study 1 Study 2 Study 3 Study 4 Study 5 Study 6 Study 7 All studes 2 vs. 1 3 vs. 1 3 vs Odds rato Studes Pooled overall Graphs by column

59 Contnuous outcome: BsmI & BMD BsmI BB = 1 Bb = 2 Bb = 3 Outcome: mean BMD Two-stage approach usng mvmetaanalyss va network meta command Requre long data for n mean sd

60 lst study n1 mean1 sd1 n2 mean2 sd2 n3 mean3 sd3 n 1/3 study n1 mean1 sd1 n2 mean2 sd2 n3 mean3 sd reshape long n mean sd, (study) j(genotype) lst study n mean sd genotype n 1/ study n mean sd genotype

61 . network setup mean sd n, studyvar(study) trtvar(genotype) Treatments used A: (reference): 1 B: 2 C 3 Measure Standard devaton poolng: Studes ID varable: Mean dfference off Number used: 13 Network nformaton Components: D.f. for nconsstency: 0 D.f. for heterogenety: 24 Current data Data format: Desgn varable: Estmate varables: Varance varables: Command to lst the data: study 1 (connected) augmented _desgn _y* _S* lst study _y* _S*, noo sepby(_desgn)

62 lst study _desgn _y* _S_B_B _S_C_C _S_B_C study _desgn _y_b _y_c _S_B_B _S_C_C _S_B_C A B C A B C A B C

63 network meta c Multvarate meta-analyss Varance-covarance matrx = proportonal.5*i(2)+.5*j(2,2,1) Method = reml Number of dmensons = 2 Restrcted log lkelhood = Number of observatons = Coef. Std. Err. z P> z [95% Conf. Interval] _y_b _cons _y_c _cons lncom _b[_y_c: _cons] - _b[_y_b: _cons] ( 1) - [_y_b]_cons + [_y_c]_cons = Coef. Std. Err. z P> z [95% Conf. Interval] (1)

64 network forest Study 1 Study 10 Study 16 Study 20 Study 30 Study 37 Study 43 Study 48 Study 57 Study 59 Study 63 Study 68 Study 70 All studes Study 1 Study 10 Study 16 Study 20 Study 30 Study 37 Study 43 Study 48 Study 57 Study 59 Study 63 Study 68 Study 70 All studes 2 vs. 1 3 vs. 2 Study 1 Study 10 Study 16 Study 20 Study 30 Study 37 Study 43 Study 48 Study 57 Study 59 Study 63 Study 68 Study 70 All studes 3 vs Mean dfference Studes Pooled overall Graphs by column

65 BB 0.02 (-0.00,0.04) 0.03 (0.01,0.05) (-0.04,0.00) Bb 0.01 (-0.01,0.03) (-0.05,-0.01) (-0.03,0.01) bb Multple comparsons: netleague, label(bb Bb bb) sort(bb Bb bb ) export("c:\data\bsm.xlsx") Above dagonal cell, parwse comparson between genotype lne on column vs genotype lnes on the left row Below dagonal, parwse comparson between genotype n column vs genotype n the rgh row

MODULE III ADVANCED META-ANALYSIS

MODULE III ADVANCED META-ANALYSIS Academc Year 2014 Doctor of Phlosophy Program n Clncal Epdemology (Internatonal Program) Secton for Clncal Epdemology and Bostatstcs Faculty of Medcne Ramathbod Hosptal Mahdol Unversty RACE 618 MODULE

More information

Recall that quantitative genetics is based on the extension of Mendelian principles to polygenic traits.

Recall that quantitative genetics is based on the extension of Mendelian principles to polygenic traits. BIOSTT/STT551, Statstcal enetcs II: Quanttatve Trats Wnter 004 Sources of varaton for multlocus trats and Handout Readng: Chapter 5 and 6. Extensons to Multlocus trats Recall that quanttatve genetcs s

More information

Lab 4: Two-level Random Intercept Model

Lab 4: Two-level Random Intercept Model BIO 656 Lab4 009 Lab 4: Two-level Random Intercept Model Data: Peak expratory flow rate (pefr) measured twce, usng two dfferent nstruments, for 17 subjects. (from Chapter 1 of Multlevel and Longtudnal

More information

Jon Deeks and Julian Higgins. on Behalf of the Statistical Methods Group of The Cochrane Collaboration. April 2005

Jon Deeks and Julian Higgins. on Behalf of the Statistical Methods Group of The Cochrane Collaboration. April 2005 Standard statstcal algorthms n Cochrane revews Verson 5 Jon Deeks and Julan Hggns on Behalf of the Statstcal Methods Group of The Cochrane Collaboraton Aprl 005 Data structure Consder a meta-analyss of

More information

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout

More information

Scientific Question Determine whether the breastfeeding of Nepalese children varies with child age and/or sex of child.

Scientific Question Determine whether the breastfeeding of Nepalese children varies with child age and/or sex of child. Longtudnal Logstc Regresson: Breastfeedng of Nepalese Chldren PART II GEE models (margnal, populaton average) covered last lab Random Intercept models (subject specfc) Transton models Scentfc Queston Determne

More information

DrPH Seminar Session 3. Quantitative Synthesis. Qualitative Synthesis e.g., GRADE

DrPH Seminar Session 3. Quantitative Synthesis. Qualitative Synthesis e.g., GRADE DrPH Semnar Sesson 3 Quanttatve Synthess Focusng on Heterogenety Qualtatve Synthess e.g., GRADE Me Chung, PhD, MPH Research Assstant Professor Nutrton/Infecton Unt, Department of Publc Health and Communty

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Quantitative Genetic Models Least Squares Genetic Model. Hardy-Weinberg (1908) Principle. change of allele & genotype frequency over generations

Quantitative Genetic Models Least Squares Genetic Model. Hardy-Weinberg (1908) Principle. change of allele & genotype frequency over generations Quanttatve Genetc Models Least Squares Genetc Model Hardy-Wenberg (1908) Prncple partton of effects P = G + E + G E P s phenotypc effect G s genetc effect E s envronmental effect G E s nteracton effect

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Data Abstraction Form for population PK, PD publications

Data Abstraction Form for population PK, PD publications Data Abstracton Form for populaton PK/PD publcatons Brendel K. 1*, Dartos C. 2*, Comets E. 1, Lemenuel-Dot A. 3, Laffont C.M. 3, Lavelle C. 4, Grard P. 2, Mentré F. 1 1 INSERM U738, Pars, France 2 EA3738,

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Topic 23 - Randomized Complete Block Designs (RCBD)

Topic 23 - Randomized Complete Block Designs (RCBD) Topc 3 ANOVA (III) 3-1 Topc 3 - Randomzed Complete Block Desgns (RCBD) Defn: A Randomzed Complete Block Desgn s a varant of the completely randomzed desgn (CRD) that we recently learned. In ths desgn,

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Heterogeneous Treatment Effect Analysis

Heterogeneous Treatment Effect Analysis Heterogeneous Treatment Effect Analyss Ben Jann ETH Zurch In cooperaton wth Jenne E. Brand (UCLA) and Yu Xe (Unversty of Mchgan) German Stata Users Group Meetng Berln, June 25, 2010 Ben Jann (ETH Zurch)

More information

Statistical tables are provided Two Hours UNIVERSITY OF MANCHESTER. Date: Wednesday 4 th June 2008 Time: 1400 to 1600

Statistical tables are provided Two Hours UNIVERSITY OF MANCHESTER. Date: Wednesday 4 th June 2008 Time: 1400 to 1600 Statstcal tables are provded Two Hours UNIVERSITY OF MNCHESTER Medcal Statstcs Date: Wednesday 4 th June 008 Tme: 1400 to 1600 MT3807 Electronc calculators may be used provded that they conform to Unversty

More information

Chapter 14: Logit and Probit Models for Categorical Response Variables

Chapter 14: Logit and Probit Models for Categorical Response Variables Chapter 4: Logt and Probt Models for Categorcal Response Varables Sect 4. Models for Dchotomous Data We wll dscuss only ths secton of Chap 4, whch s manly about Logstc Regresson, a specal case of the famly

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Regression Analysis. Regression Analysis

Regression Analysis. Regression Analysis Regresson Analyss Smple Regresson Multvarate Regresson Stepwse Regresson Replcaton and Predcton Error 1 Regresson Analyss In general, we "ft" a model by mnmzng a metrc that represents the error. n mn (y

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Phase I Monitoring of Nonlinear Profiles

Phase I Monitoring of Nonlinear Profiles Phase I Montorng of Nonlnear Profles James D. Wllams Wllam H. Woodall Jeffrey B. Brch May, 003 J.D. Wllams, Bll Woodall, Jeff Brch, Vrgna Tech 003 Qualty & Productvty Research Conference, Yorktown Heghts,

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Decision Analysis (part 2 of 2) Review Linear Regression

Decision Analysis (part 2 of 2) Review Linear Regression Harvard-MIT Dvson of Health Scences and Technology HST.951J: Medcal Decson Support, Fall 2005 Instructors: Professor Lucla Ohno-Machado and Professor Staal Vnterbo 6.873/HST.951 Medcal Decson Support Fall

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Checking Pairwise Relationships. Lecture 19 Biostatistics 666

Checking Pairwise Relationships. Lecture 19 Biostatistics 666 Checkng Parwse Relatonshps Lecture 19 Bostatstcs 666 Last Lecture: Markov Model for Multpont Analyss X X X 1 3 X M P X 1 I P X I P X 3 I P X M I 1 3 M I 1 I I 3 I M P I I P I 3 I P... 1 IBD states along

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

University of California at Berkeley Fall Introductory Applied Econometrics Final examination

University of California at Berkeley Fall Introductory Applied Econometrics Final examination SID: EEP 118 / IAS 118 Elsabeth Sadoulet and Daley Kutzman Unversty of Calforna at Berkeley Fall 01 Introductory Appled Econometrcs Fnal examnaton Scores add up to 10 ponts Your name: SID: 1. (15 ponts)

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

Multinomial logit regression

Multinomial logit regression 07/0/6 Multnomal logt regresson Introducton We now turn our attenton to regresson models for the analyss of categorcal dependent varables wth more than two response categores: Y car owned (many possble

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

Introduction to Analysis of Variance (ANOVA) Part 1

Introduction to Analysis of Variance (ANOVA) Part 1 Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned b regresson

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Interpreting Slope Coefficients in Multiple Linear Regression Models: An Example

Interpreting Slope Coefficients in Multiple Linear Regression Models: An Example CONOMICS 5* -- Introducton to NOT CON 5* -- Introducton to NOT : Multple Lnear Regresson Models Interpretng Slope Coeffcents n Multple Lnear Regresson Models: An xample Consder the followng smple lnear

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Systematic Review & Systematic review

Systematic Review & Systematic review Systematc Revew & Meta-analyssanalyss Ammarn Thakknstan, Ph.D. Secton for Clncal Epdemology and Bostatstcs Faculty of Medcne, Ramathbod Hosptal Tel: 0-0-69, 0-0-76 Fax: 0-084 e-mal: raatk@mahdol.ac.th

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor Reduced sldes Introducton to Analss of Varance (ANOVA) Part 1 Sngle factor 1 The logc of Analss of Varance Is the varance explaned b the model >> than the resdual varance In regresson models Varance explaned

More information

Lecture 2: Prelude to the big shrink

Lecture 2: Prelude to the big shrink Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

ECON 351* -- Note 23: Tests for Coefficient Differences: Examples Introduction. Sample data: A random sample of 534 paid employees.

ECON 351* -- Note 23: Tests for Coefficient Differences: Examples Introduction. Sample data: A random sample of 534 paid employees. Model and Data ECON 35* -- NOTE 3 Tests for Coeffcent Dfferences: Examples. Introducton Sample data: A random sample of 534 pad employees. Varable defntons: W hourly wage rate of employee ; lnw the natural

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Population Design in Nonlinear Mixed Effects Multiple Response Models: extension of PFIM and evaluation by simulation with NONMEM and MONOLIX

Population Design in Nonlinear Mixed Effects Multiple Response Models: extension of PFIM and evaluation by simulation with NONMEM and MONOLIX Populaton Desgn n Nonlnear Mxed Effects Multple Response Models: extenson of PFIM and evaluaton by smulaton wth NONMEM and MONOLIX May 4th 007 Carolne Bazzol, Sylve Retout, France Mentré Inserm U738 Unversty

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

T E C O L O T E R E S E A R C H, I N C.

T E C O L O T E R E S E A R C H, I N C. T E C O L O T E R E S E A R C H, I N C. B rdg n g En g neern g a nd Econo mcs S nce 1973 THE MINIMUM-UNBIASED-PERCENTAGE ERROR (MUPE) METHOD IN CER DEVELOPMENT Thrd Jont Annual ISPA/SCEA Internatonal Conference

More information

Indirect Evidence: Indirect Treatment Comparisons in Meta-analysis

Indirect Evidence: Indirect Treatment Comparisons in Meta-analysis Evdence: Treatment Comparsons n Meta-analyss analyss George Wells, Shagufta Sultan, L Chen, Doug Coyle Current Issues for Health Technology Assessment n Canada An Invtatonal Symposum for HTA Researchers

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

Two-factor model. Statistical Models. Least Squares estimation in LM two-factor model. Rats

Two-factor model. Statistical Models. Least Squares estimation in LM two-factor model. Rats tatstcal Models Lecture nalyss of Varance wo-factor model Overall mean Man effect of factor at level Man effect of factor at level Y µ + α + β + γ + ε Eε f (, ( l, Cov( ε, ε ) lmr f (, nteracton effect

More information

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model A Monte Carlo Study for Swamy s Estmate of Random Coeffcent Panel Data Model Aman Mousa, Ahmed H. Youssef and Mohamed R. Abonazel Department of Appled Statstcs and Econometrcs, Instute of Statstcal Studes

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

Statistical pattern recognition

Statistical pattern recognition Statstcal pattern recognton Bayes theorem Problem: decdng f a patent has a partcular condton based on a partcular test However, the test s mperfect Someone wth the condton may go undetected (false negatve

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Systems of Equations (SUR, GMM, and 3SLS)

Systems of Equations (SUR, GMM, and 3SLS) Lecture otes on Advanced Econometrcs Takash Yamano Fall Semester 4 Lecture 4: Sstems of Equatons (SUR, MM, and 3SLS) Seemngl Unrelated Regresson (SUR) Model Consder a set of lnear equatons: $ + ɛ $ + ɛ

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors ECONOMICS 5* -- Introducton to Dummy Varable Regressors ECON 5* -- Introducton to NOTE Introducton to Dummy Varable Regressors. An Example of Dummy Varable Regressors A model of North Amercan car prces

More information

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1 Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to

More information

Overview. Multiple Treatment Meta Analysis II

Overview. Multiple Treatment Meta Analysis II Multple Treatment Meta Analyss II Sofa Das Unversty of Brstol s.das@brstol.ac.uk SMG Tranng Course, March 010, Cardff Wth thanks to: Georga Salant, Ncky Welton, Tony Ades, Debb Caldwell, Alex Sutton Overvew

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Ordnary Least Squares (OLS): Smple Lnear Regresson (SLR) Analytcs The SLR Setup Sample Statstcs Ordnary Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals) wth OLS

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

F statistic = s2 1 s 2 ( F for Fisher )

F statistic = s2 1 s 2 ( F for Fisher ) Stat 4 ANOVA Analyss of Varance /6/04 Comparng Two varances: F dstrbuton Typcal Data Sets One way analyss of varance : example Notaton for one way ANOVA Comparng Two varances: F dstrbuton We saw that the

More information

Continuous vs. Discrete Goods

Continuous vs. Discrete Goods CE 651 Transportaton Economcs Charsma Choudhury Lecture 3-4 Analyss of Demand Contnuous vs. Dscrete Goods Contnuous Goods Dscrete Goods x auto 1 Indfference u curves 3 u u 1 x 1 0 1 bus Outlne Data Modelng

More information

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes 25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton

More information

Homework 9 STAT 530/J530 November 22 nd, 2005

Homework 9 STAT 530/J530 November 22 nd, 2005 Homework 9 STAT 530/J530 November 22 nd, 2005 Instructor: Bran Habng 1) Dstrbuton Q-Q plot Boxplot Heavy Taled Lght Taled Normal Skewed Rght Department of Statstcs LeConte 203 ch-square dstrbuton, Telephone:

More information

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function

On Outlier Robust Small Area Mean Estimate Based on Prediction of Empirical Distribution Function On Outler Robust Small Area Mean Estmate Based on Predcton of Emprcal Dstrbuton Functon Payam Mokhtaran Natonal Insttute of Appled Statstcs Research Australa Unversty of Wollongong Small Area Estmaton

More information

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information