Combined high resolution linkage and association mapping of quantitative trait loci

Size: px
Start display at page:

Download "Combined high resolution linkage and association mapping of quantitative trait loci"

Transcription

1 European Journal of Human Genetcs (), 5 37 ª Nature Publsng Group ll rgts reserved 8 483/ $5 wwwnaturecom/ejg RTICLE Combned g resoluton lnkage and assocaton mappng of quanttatve trat loc Ruzong Fan*,, and Momao Xong 3 Department of Statstcs, Te Texas &M Unversty, 447 locker uldng, College Staton, Texas , US; Insttute of Medcal ometry, Informatcs and Epdemology, Unversty of onn, Sgmund Freud Strasse 5, D-535 onn, Germany; 3 Human Genetcs Center, Unversty of Texas Houston, PO ox 334, Houston, Texas 775, US In ts paper, we nvestgate varance component models of bot lnkage analyss and g resoluton lnkage dsequlbrum (LD) mappng for quanttatve trat loc (QTL) Te models are based on bot famly pedgree and populaton data We consder lkeloods wc utlze flankng marker nformaton, and carry out an analyss of model buldng and parameter estmatons Te lkeloods jontly nclude recombnaton fractons, LD coeffcents, te average allele substtuton effect and allele domnant effect as parameters Hence, te model smultaneously takes care of te lnkage, LD or assocaton and te effects of te putatve trat locus Te models clearly demonstrate tat lnkage analyss and LD mappng are complementary, not exclusve, metods for QTL mappng y power calculatons and comparsons, we sow te advantages of te proposed metod: () populaton data can provde nformaton for LD mappng, and famly pedgree data can provde nformaton for bot lnkage analyss and LD mappng; () usng famly pedgree data and a sparse marker map, one may nvestgate te pror suggestve lnkage between trat locus and markers to obtan low resoluton of te trat loc, because lnkage analyss can locate a broad canddate regon; (3) wt te pror knowledge of suggestve lnkage from lnkage analyss, bot populaton and famly pedgree data can be used smultaneously n g resoluton LD mappng based on a dense marker map, snce LD mappng can ncrease te resoluton for canddate regons; (4) models of g resoluton LD mappngs usng two flankng markers ave ger power tan tat of models of usng only one marker n te analyss; (5) excludng te domnant varance from te analyss wen t does exst would lose power; (6) by performng lnkage nterval mappngs, one may get ger power tan by usng only one marker n te analyss European Journal of Human Genetcs (), 5 37 do:38/sjejg594 Keywords: varance component models; lnkage nterval mappng; lnkage dsequlbrum mappng; QTL *Correspondence: Dr Ruzong Fan; Department of Statstcs, Te Texas &M Unversty, 447 locker uldng, College Staton, Texas , US Tel: ; Fax: ; E-mal: rfanstattamuedu Receved 4 June ; revsed 4 October ; accepted November Introducton Twenty years ago, varatons n uman DN were recognzed as genetc markers n lnkage study fter tat, te advances n molecular bology and computatonal tecnology ave led to mappng several uman nerted dsease genes Usng restrcton fragment lengt polymorpsm (RFLP) markers and polymorpc mcrosatellte loc, lnkage analyss and postonal clonng ave been used successfully n mappng te cromosome locatons of Mendelan dsease genes Te success manly depends on one premse tat te dsease genes of Mendelan trats ave a large effect on te penotypes In fact, tere s usually a one-to-one correspondence between dsease gene genotypes and te dsease penotype for Mendelan trats Moreover, te correlatons

2 6 Lnkage and assocaton mappng of QTL R Fan and M Xong between genotypes and penotype of Mendelan trats are strong Gven suffcent famly data, Mendelan trats can be mapped wt g probablty by lnkage analyss Wt te encouragement of successful mappng Mendelan trat genes, tere as been growng nterests and endeavors n te study of complex trats suc as astma and dabetes For complex dseases, te nertance patterns and penotype defntons as wt genetc etology are muc more complex Te trat/affecton status s usually a contnuous varable 3 Te mappng of complex dsease genes s muc arder Novel statstcal metods suc as bot lnkage analyss and lnkage dsequlbrum (LD) mappng or assocaton study are needed n dssectng complex trats s very dense marker maps suc as sngle nucleotde polymorpsm (SNP) are avalable, 4 bot lnkage analyss and assocaton study are utlzed smultaneously for mappng complex dsease loc 5,6 lmasy et al 7 and Fulker et al 8 proposed to use combned lnkage and assocaton analyss for quanttatve trat loc (QTL) Sam et al 9 studed te power of lnkage versus assocaton analyss of quanttatve trats by analytcally calculatng non-centralty parameters of test statstcs becass et al proposed test statstcs of assocaton studes for quanttatve trats n nuclear famles, general pedgrees, and selected samples Cardon 3 studed a sbpar regresson model of LD for quanttatve trats ll tese researces concentrated on famly data wc nclude sb-pars, and used only one marker n analyss In Fan and Xong, 4 we proposed a lnear regresson metod of g resoluton mappng of quanttatve trat loc by LD mappng analyss Te metod s based on populaton data Usng two flankng markers, te regresson models ave ger power tan tat of models usng only one marker 4 It s well-known tat famly pedgree data can be used n bot lnkage analyss and assocaton study, and populaton data can be used n assocaton study Hence, t s necessary to consder a metod to combne bot populaton data and famly pedgree data n te analyss In ts paper, we propose to perform bot lnkage analyss and g resoluton LD mappng for QTL based on combned famly and populaton data Lnkage nterval mappng s based on famly data, and LD mappng s based on bot famly pedgree and populaton data ased on varance component models, we construct lkelood to analyse famly and populaton data n Secton of Models Ten, we dscuss te parameter estmatons and regresson coeffcents Te lnkage nformaton, e, recombnaton fractons, s contaned n te varance-covarance matrx, and te assocaton nformaton, e, te LD coeffcents, s contaned n te mean parameters or te regresson coeffcents We calculate te non-centralty parameters for assocaton study and lnkage analyss, respectvely Usng te noncentralty parameters, we perform power calculatons and comparsons Te tecncal detals to calculate te regresson coeffcents, parameters, non-centralty parameters are left n te ppendxes Models Consder a quanttatve trat locus Q wc as two alleles Q and Q Suppose tat te allele frequences of Q and Q are q and q, respectvely ssume tat two markers and flank te trat locus Q n an order of Q Marker as two alleles and a wt frequences P and P a, respectvely Marker as two alleles and b wt frequences P and P b, respectvely For a nuclear famly of k cldren and two parents, let us denote ter quanttatve trats by a vector y=(y f, y m, y,, y k ) t, genotypes at marker by a vector ( f, m,,, k ) t, and genotypes at marker by a vector ( f, m,,, k ) t Here y f s te trat value of te fater, f s te genotype of te fater at marker, and f s te genotype of te fater at marker Oter notatons are defned, smlarly, for te moter wt subscrpt m and for te -t cld wt subscrpt Te log-lkelood s defned by L ¼ k logðpþ logjj ðy XmÞt ðy XmÞ Te notatons of te log-lkelood are defned as follows For te mean component Xm, we consder te followng regresson equaton suc as model () n Fan and Xong 4 y ¼ b! g x a x a z z G e ; were b s overall mean, w s a row vector of covarates suc as gender and age, g s a column vector of regresson coeffcents for te covarates w, G s polygenc effect, e s error term ssume tat G s normal Nð; s G Þ, and e s normal Nð; s e Þ Moreover, G and e are ndependent x, x,z and z are dummy varables defned by 8 8 < P a f ¼ >< Pa f ¼ x ¼ P a P f ¼ a ; z : ¼ P a P f ¼ a ; >: P f ¼ aa P f ¼ aa 8 8 < P b f ¼ >< Pb f ¼ x ¼ P b P f ¼ b; z : ¼ P b P f ¼ b: >: P f ¼ bb P f ¼ bb a, a, d, and d are regresson coeffcents of te dummy varables x,x,z and z Te model matrx X s defned by 8 9 w f x f x f z f z f w m x m x m z m z m >< >= X ¼ w x x z z ¼ >: >; w k x k x k z k z k and m=(b, g t, a, a, d, d ) t s a vector of regresson coeffcents S s a (k+)6(k+) X t f X t m X t X t k ðþ ; C European Journal of Human Genetcs

3 Lnkage and assocaton mappng of QTL R Fan and M Xong 7 varance-covarance matrx defned as r r r r r r r r r r k ¼ r r r r s ; were s ¼ k C r r r k r k s g s G s e ; s g s varance explaned by te putatve QTL Q, s G s polygenc varance, and s e s error varance Te genetc varances s g ¼ s ga s gd and s G ¼ s Ga s Gd are decomposed nto addtve and domnant components r ¼ðs ga s Ga Þ=ðs Þ s correlaton between parents and cldren, r j ¼ðp jq s ga jqs gd s Ga = s Gd =4Þ=s s correlaton between te -t cld and te j-t cld, p jq s te proporton of alleles sared dentcal by descent (ID) at QTL Q by te -t cld and te j-t cld, and jq s te probablty tat bot alleles at QTL Q sared by te -t cld and te j-t cld are ID For populaton data, an ntutve ratonale of regresson model () s gven n Fan and Xong 4 In general, one can construct a varance-covarance matrx for any type of pedgree n a smlar way as above ssume tat tere are two ndependent sub-samples of data: () populaton data: n ndependent ndvduals; () famly data: I n (I4n) ndependent famles Let us lst te log-lkelood of te n ndependent ndvduals by L,, L n, and te lkelood of te I n famles by L n+,, L I Ten te overall log-lkelood s L ¼ P I ¼ L Te unknown parameters are m ¼ ðb; g; a ; a ; ; Þ t ; s ga ; s gd ; s Ga ; s Gd ; and s e Usng te lkelood rato tests, one may test statstcal sgnfcance of te parameters of nterest Parameter estmatons and regresson coeffcents Regresson coeffcents Let m j be te effect of genotype Q Q j,,j=,, m =m Denote te populaton effect mean by m ¼ m q m q q m q and defne a Q =q m +(q 7q )m 7q m, d Q =m 7m 7m Ifm =a, m =d, and m = a as n te tradtonal quanttatve genetcs, 5 a Q = a+(q q )d s te average allele substtuton effect, and d Q =d caracterzes te domnant effect In general, one may defne a=m 7(m +m )/ and d=m 7(m +m )/ It s well known tat te addtve varance s ga ¼ q q a Q and te domnant varance s gd ¼ðq q Þ Q true random effect model descrbng te trat value s y =b+w g+g +G +e, were 8 < m for genotype Q Q g ¼ m for genotype Q Q : : m for genotype Q Q Denote LD coeffcent between trat locus Q and marker by D Q =P(Q ) q P, LD coeffcent between trat locus Q and marker by D Q =P(Q ) q P, and LD coeffcent between marker and marker by D =P() P P Let te addtve and domnant varance covarance matrces be V ¼ P ap D D P b P ; and V D ¼ P a P D D Pb P : ðþ Moreover, let us denote tree ratos D =ðp ap P b P Þ¼ R ; D Q =ðp ap q q Þ¼R Q ; and D Q =ðq q P b P Þ¼R Q s n ppendx, 4 we can sow tat te coeffcents of regresson equaton () are gven by a ¼ V D Q a Q ¼ a D Q ¼ VD D Q D Q! Q ¼ R Q R pffffffffffffff R Q P P a R Q R R Q p ffffffffffffff P P b R Q R R Q P P a R Q R R Q P P b pffffffffffffffffffff q q a Q R ; ð3þ C q q Q R 4 : ð4þ Parameters of varance covarances Denote te recombnaton fracton between trat locus Q and marker by y Q, te recombnaton fracton between trat locus Q and marker by y Q, and te recombnaton fracton between marker and marker by y Fulker and Cardon 6 proposed to estmate te proporton p jq of allele ID at putatve QTL Q for a sb-par and j by ^p jq ¼ Eðp jq jp j ; p j Þ¼a p b p p j b p p j were p j and p j are te ID proportons of alleles sared at te marker and marker, respectvely Te coeffcents a p, b p and b p are gven by b p ¼ ð y QÞ ð y Þ ð y Q Þ ð y Þ 4 ; b p ¼ ð y QÞ ð y Þ ð y Q Þ ð y Þ 4 ; a p ¼ b p b p : Let j, j be te probablty of sarng two alleles ID at markers and for a par of sbs, respectvely In Fan, 7 we proposed to estmate jq by equaton ^ jq ¼ a b p j b p j r j r j Under te assumpton of no nterference, te coeffcents are as follows (Fan 7 ): r ¼ ð y QÞ 4 ð y Þ 4 ð y Þ 4 ð y Þ 8 ; r ¼ ð y QÞ 4 ð y Q Þ 4 ð y Þ 4 ð y Þ 8 ; b ¼ b p r ; b ¼ b p r ; ð c a ¼ Þ ð c Þ ½c c ð c Þð c ÞŠ ; were c ¼ y Q ð y QÞ and c ¼ y Q ð y QÞ : ssumng tat te postons of marker and marker are known, y can be calculated troug Haldane s map functon Ten only one of y Q and y Q s unknown snce te European Journal of Human Genetcs

4 8 Lnkage and assocaton mappng of QTL R Fan and M Xong oter can be calculated troug Trow s formula 8 For general relatves and j, lmasy and langero 9 proposed an algortm to calculate te proporton p jq of allele ID at putatve QTL Q, and te expected probablty jq tat bot alleles at QTL Q are ID In Fan, 7 we derved formulas to calculate te covarances of trat values for a few types of relatves drectly wtout performng matrx operatons ssocaton and lnkage studes From equatons (3) and (4), we can see tat te coeffcents of LD (e, D Q and D Q ) and gene effects (e, a Q and d Q ) are contaned n te regresson coeffcents Moreover, we sow n te above paragrap tat te lnkage parameters (e, recombnaton fractons y Q, y Q and y ) are contaned n te varancecovarance matrx ssume tat markers and are n LD wt te trat locus Q, e, D Q 6¼,D Q 6¼ We may smultaneously test LD of marker and marker wt trat locus Q, te gene substtuton and domnant effects by testng a =a =d =d = From equaton (3), we may test LD of markers and wt te trat locus Q and te gene substtuton effect a Q by testng a =a = From equaton (4), we may test LD of markers and wt te trat locus Q and te domnant effect by testng d =d = To test lnkage, one may use te lkelood rato test of te log-lkelood L Under te null ypotess of no lnkage between te major trat locus Q and te markers, y Q = y Q =/ Under te alternatve ypotess of lnkage, y Q 6¼/ ory Q 6¼/ y comparng te dfference of maxmum log-lkeloods under te alternatve and null ypoteses, we may use w statstc to test te lnkage We wll derve analytcal formulas to explore te lnkage nterval mappng by te nuclear famles n a smlar way to Sam et al 9 accordng to statstcal teory of lkelood rato tests Non-centralty parameters of assocaton study ssume tat tere are no covarates Ten m=(b, a, a, d, d ) t Consder te overall log-lkelood L ¼ P I ¼ L, were L s te log-lkelood of trat values y of te -t famly or ndvdual Let S be te varance-covarance matrx of y, and X be ts model matrx Denote te total trat values y ¼ðy t ; ; y^t Þ t, te total varance covarance matrx by S=dag(S,,S I ), and te model matrx X ¼ðX t ; ; Xt I Þt Let b; ^a ; ^a ; ^ ; ^ ; ^ ; ^ be te maxmum lkelood estmators of b, a, a, d, d,s, S Te estmate of m s ^m ¼ ½X t ^ XŠ X t ^ ~y ¼½ P I ¼ Xt ^ X Š P I ¼ Xt ^ ~y Let H be a q65 test matrx of rank q Suppose tat te total number of ndvduals s N y Graybll, Capter 6, te test statstc of a ypotess Hm= s non-central F(q, N 5) defned by F ¼ ðh^mþ t ½HðX t ^ XÞ H t Š ðh^mþ Y t ½^ ^ XðX t ^ XÞ X t ^ ŠY N 5 : q Te non-centralty parameter of te test statstc F can be calculated by l ¼ðHmÞ t ½H½X t XŠ H t Š Hm If te data are composed of n ndvduals of a populaton, Fan and Xong 4 worked out te non-centralty parameters to test f tere are allele substtuton and/or domnant effects and LDs between te markers and te major gene locus In te followng, we dscuss a stuaton tat te data are composed of bot ndvdual populaton data and famly data Suppose tat tere are n ndvduals of a populaton, and n s suffcently large For eac y of te n ndvduals, S =s and X =( x x z z ), =,,, n From formulas n Fan and Xong, 4 ppendx and ppendx, we may sow tat X n X t n X ¼ X n ns X t X s dagð; V ; V D Þ; ¼ ¼ were V and V D are addtve and domnant varance-covarance matrces gven n () Secondly, suppose tat tere are m tro famles, and m s suffcently large tro famly s composed of bot parents and a sngle cld Notce tat te means of x,x,z and z are Let K f =(x f x f z f z f ) and K m =(x m x m z m z m ) We sow n ppendx tat te covarance matrx between parents and ter offsprng s E Kf t K ¼ E Km t K ¼ V = O ; ð6þ O O were K =(x x z z ) and O s zero 6 matrx For eac of te tro famles, te varance covarance S s a 363 matrx wose nverse s ð5þ r r r ¼ ð r r Þs r r : ð7þ r r Usng equatons (5), (6), and (7), we sow n ppendx X t m X ð r ¼n Þs 3 4r ð3 r r ÞV C : ð3 r ÞV D Xnm Trdly, suppose tat tere are k nuclear famles eac of tem as bot parents and two offsprng, and te correlaton of te two offsprng s r ssume tat k s suffcently large For eac famly, te varance covarance S s a 464 matrx wose nverse s ¼ s Cr Cr C C Cr Cr C C C C C C Cð r Þ r ð r Þ Cðr r Þ r ð r Þ Cðr r Þ r ð r Þ Cð r Þ r ð r Þ ; C were C ¼ r ð r Þ=½ð r Þ ðr r Þ Š In ppendx C, we sow tat te covarance matrx between two offsprng s ð8þ ð9þ European Journal of Human Genetcs

5 Lnkage and assocaton mappng of QTL R Fan and M Xong 9 Eðx x z z Þ t ðx x z z Þ¼ V = O : ðþ O V D =4 Usng equatons (5), (6), (9) and (), we sow n ppendx D tat k nmk X ¼nm X t X dagðd ; d V ; d 44 V D Þ=s ; ðþ were te constants are gven by d ¼ ½ 4Cr 4C C=r Š; d ¼ 4Cðr Þ Cð r r Þ=½r ð r ÞŠ; d 44 ¼ ð Cr ÞC½4ð r Þ ðr r ÞŠ=½r ð r ÞŠ: Combne te n ndvduals, m tro famles, and k famles wt two offsprng Defne a ¼ n mð r Þ ð3 4r Þkd ; a ¼ n mð r Þ ð3 r r Þ kd ; a 3 ¼ n mð r Þ ð3 r Þkd 44 Ten equatons (5), (8) and () lead to nmk X ¼ X t X dagða ; a V ; a 3 V D Þ=s : ðþ To test f tere are addtve and domnant effects, we may test te ypotess H,ad : a =a =d =d = Ten te test matrx H s defned by H ¼ C : Let us denote te correspondng F-test statstc by F,ad, and te non-centralty parameter by l,ad Ten we ave from (3), (4), and () tat l ;ad s a ða a ÞV a a a 3 ð ÞV D ¼ s a a Q ½P bp D Q D QD D Q P a P D Q Š= ðp a P P b P D Þa 3 Q ½P b P D4 Q D Q D D Q P a P D4 Q Š=ðP a P P b P D4 Þ ¼ s a s ga ½R Q R QR R Q R Q Šð R Þ a 3 s gd ½R4 Q R Q R R Q R4 Q Š=ð R4 Þ : ssume tat te two markers and are n lnkage equlbrum, ten D = Moreover, assume tat te trat locus Q s n LD wt marker but not wt marker, ten D Q = and D Q 6¼ Ten l ;ad ½=s Š a s ga R Q a 3s gd R4 Q, wc only nvolves marker and can be wrtten as l,ad Correspondngly, we denote te F-test statstc by F,ad Smlarly, l ;a ½a =s Šs ga R Q s te non-centralty parameter of a test statstc F,a To test te oter ypoteses, we may get te non-centralty parameters n a smlar way by takng approprate test matrces H To test f tere s domnant effect, we may test te ypotess H,d : d =d = Te non-centralty parameter s l ;d a3 s s gd ½R4 Q R Q R R Q R4 Q Š=ð R4 Þ To test f tere s an addtve or substtuton effect, we may test te ypotess H ;a : a ¼ a ¼ Te non-centralty parameter s l ;a a s s ga ½R Q R QR R Q R Q Š=ð R Þ Te correspondng F-test statstc s denoted by F,a Non-centralty parameters of lnkage studes Consder a nuclear famly wt k cldren and bot parents Under te null ypotess of no lnkage between te trat locus and markers, te correlaton of eac sb-par s r N ¼ s ga s s gd 4s s Ga s s Gd 4s : Te expected log-lkelood s EðL Null Þ¼ ðkþ logðps Þ log ð r Þðk Þðr N r Þ ð r N Þ k : Under te alternatve ypotess of lnkage between te trat locus and marker, te correlaton between a sb-par s C ¼ Covðy ; y jp ¼ =Þ=s ¼ðs ga s gd ÞPðp Q ¼jp ¼ =Þ= s s ga Pðp Q ¼=jp ¼=Þ=s ½s Ga = s Gd =4Š=s ; ¼; ; From Haseman and Elston, Table IV, we ave C ¼ ðs ga s gd Þc s ga c ð c Þs Ga = s Gd =4 =s C ¼ ðs ga s gd Þc ð c Þs ga ½ c ð c ÞŠ= s Ga = s Gd =4 =s C ¼ ðs ga s gd Þð c Þ s ga c ð c Þs Ga =s Gd =4 =s : ð3þ Te expected log-lkelood under te alternatve ypotess of lnkage s EðL random; Þ¼ ðkþ½logðps ÞŠ X X Pðp ÞPðp k ;k Þ: p p k ;k r r r r r r r r C p C pk log det r r C p C pk ; C r r C pk C pk were P(p j =)=P(p j =)=/4 and P(p j =/)=/ From Stuart and Ord, te non-centralty parameter for lnkage of te nuclear famly s equal to l lnkage, =E(L random, ) E(L Null ) If k=, t can be sown tat l lnkage; ¼ log½ 4r r N 4r r NŠ P ¼ Pðp ¼ =Þlog½ 4r C 4r C Š European Journal of Human Genetcs

6 3 Lnkage and assocaton mappng of QTL R Fan and M Xong Under te alternatve ypotess of lnkage between te trat locus and markers and, te correlaton between a sb-par s gven by for, j=,, C j ¼ Covðy ; y jp ¼ =; p ¼ j=þ=s ¼ ðs ga s gd ÞPðp Q ¼ jp ¼ =; p ¼ j=þ s ga Pðp Q ¼ =jp ¼ =; p ¼ j=þ s Ga = s Gd =s =4 : ð4þ To calculate C j, we need to calculate te jont dstrbuton of p, p Q and p of a sb-par under te alternatve ypotess of lnkage ssume tat tere s no nterference for dsjont regons of te cromosome Ten we ave Pðp ¼ ; p Q ¼ Q ; p ¼ Þ ¼ Pðp ¼ ; p Q ¼ Q ÞPðp ¼ jp ¼ ; p Q ¼ Q Þ ¼ Pðp ¼ jp Q ¼ Q ÞPðp Q ¼ Q ÞPðp ¼ jp Q ¼ Q Þ: ð5þ Haldane s map functon y =[ exp( l )]/ Smlarly, we may calculate te recombnaton fractons y Q and y Q by te map dstances l Q and l Q ssume tat marker and marker are n lnkage equlbrum, e, D =, te genetc dstances l =5 cm, l Q =l Q =5 cm, and te ertablty =5 Suppose we ave a sample wt n= ndvduals, m=3 tro famles, and k= nuclear famles wt two offsprng ssume tat te ID proportons sared by te two offsprng n te k= famles at bot markers and are p = p =5, and te probablty of sarng two alleles ID at markers and are = =5 Fgure sows te power of te test statstcs F,ad, F,a, F,ad, and F,a aganst te dsequlbrum coeffcent D Q wen D Q =5 for a mode of domnant nertance wt a=d=, and a mode of recessve nertance wt a=, d= 5, respectvely Several features are nterestng n te two graps of Fgure Frst, te power of F,ad and F,a are ger tan tat of F,ad and F,a Hence, te regresson mappng wc uses two markers and as ts advantage From Haseman and Elston, Table IV, we may construct te jont dstrbuton of p Q, p and p by relaton (5), and te results are presented n Table 3 of Fan 7 ased on te results, we can calculate C j,,j=,,, wc are gven n ppendx D of Fan 7 Te expected log-lkelood under te alternatve ypotess of lnkage s E(L random, )= (k+)[log(ps )+] S p S p S pk,k S pk7,k P(p )P(p ) P(p k,k ) P(p k,k ) r r r r r r r r C p;p C pk ;p k log det r r C p;p C pk ; ;p k C r r C pk ;p k C pk ;p k were P(p j =)=P(p j =)=/4 and P(p j =/)=/ suc as tose for marker From Stuart and Ord, te non-centralty parameter for lnkage of te nuclear famly s equal to l lnkage, =E(L random, ) E(L Null ) If k=, t can be sown tat l lnkage; ¼ log½ 4r r N 4r r NŠ P ;j¼ Pðp ¼ =ÞPðp ¼ j=þlog½ 4r C j 4r C jš Power calculaton and comparson Let us denote ertablty by wc s defned by ¼ s ga =s In te power calculatons, we take te addtve polygenc varance s Ga ¼ :, polygenc domnant varance s Gd ¼ :5, te equal allele frequences P =q =P =5 at te two markers and, and te QTL Q Moreover, suppose tat m =a, m =m =d and m = a Suppose tat te map dstance l between marker and marker s known Under te assumpton of no nterference, we may calculate te recombnaton fracton by Fgure Power of test statstcs F,ad, F,a, F,ad, and F,a aganst dsequlbrum coeffcent D Q at sgnfcant level, wen q =P =P =5, D =, D Q =5, =5, n=, m=3, k=, p =p = = =5, l =5 cm, l Q =l Q =5 cm, Ga ¼ :; Gd ¼ :5 for a mode of domnant nertance a=d=, and a mode of recessve nertance a=, d= 5, respectvely European Journal of Human Genetcs

7 Lnkage and assocaton mappng of QTL R Fan and M Xong 3 over te one marker mappng wc only uses one marker or Second, te statstc F,ad as ger power tan tat of F,a, and te statstc F,ad as ger power tan tat of F,a Tus, excludng te domnant varance from te analyss wen t does exst would lose power Trd, as expected, wen D Q = te power to detect LD usng only marker s mnmal More nterestngly, wen D Q =5 te power s stll ger usng te flankng two markers tan usng only marker Fgure sows te power of te test statstcs F,ad, F,a, F,ad, and F,a aganst te ertablty wen D = and D Q =D Q =5 for a mode of domnant nertance wt a=d=, and a mode of recessve nertance wt a=, d= 5, respectvely Te oter parameters are te same as tose of Fgure mong te features observed n Fgure, te power s reasonably g wen te ertablty s bgger tan 5 To compare te power of populaton based and famly based metods, Fgure 3 sows te power of te test statstcs F,ad and F,a for a mode of domnant nertance wt a=d=, and a mode of recessve nertance wt a=, d= 5, respectvely For Fgure 3, populaton data contan n=5 ndvduals, but no famly data (m=k=) For domnant nertance of Fgure 3, te data contan m=84 tro famles (n=k=) For recessve nertance of Fgure 3, te data contan k=63 nuclear famles eac as two offsprng (n=m=) Notce tat m=84 or k=63 famly data contan 5 ndvduals, and tus te number of ndvduals s te same as tat of te populaton data We can see tat populaton based metod s more powerful tan te famly based metod for te same number of ndvduals In a populaton, te LD can exst due to mutatons at te trat locus In te absence of tgt lnkage between te trat locus and a marker, te recombnaton between te marker locus and te trat locus can rapdly dsspate te dsequlbrum from generaton to generaton Denote te frequency of aplotype Q at te generaton wen te mutatons occur by P(Q)() Ten LD coeffcent s D Q ()= P(Q)() q P for te generaton wen te mutatons occur For te followng generatons, te dsequlbrum Fgure Power of test statstcs F,ad, F,a, F,ad, and F,a aganst ertablty at sgnfcant level, wen q =P =P = 5, D =, D Q =D Q =5, n=, m=3, k=, p =p = = =5, l =5 cm, l Q =l Q =5 cm, Ga ¼ :; Gd ¼ :5 for a mode of domnant nertance a=d=, and a mode of recessve nertance a=, d= 5, respectvely Fgure 3 Power of test statstcs F,ad and F,a aganst ertablty at sgnfcant level for a mode of domnant nertance a=d=, and a mode of recessve nertance a=, d= 5, respectvely For populaton data n=5, m=k=; for domnant famly data n=k=, m=84; for recessve famly data n=m=, k=63 Oter parameters are te same as tose of Fgure European Journal of Human Genetcs

8 3 Lnkage and assocaton mappng of QTL R Fan and M Xong coeffcent s reduced by a factor 7y Q n eac generaton 3 Suppose tat te mutaton s already T generaton old Ten te LD coeffcent s D Q (T)=D Q ()( y Q ) T Smlarly, te oter LD coeffcents are D (T)=D ()( y ) T and D Q (T)=D Q ()( y Q ) T ssume tat te map dstance between marker and marker s l =5 cm, and te oter parameters are gven by D ()=,D Q ()=D Q ()=5, =5, l =5 cm, n=, m=3, k=, T=3, p = p =5, = =5 Fgure 4 sows te power of te test statstcs F,ad, F a, F ad, and F,a aganst te recombnaton fracton y Q for a mode of domnant nertance wt a=d=, and a mode of recessve nertance wt a=, d= 5, respectvely We can see tat te power curves of F,ad and F,a are very g, altoug te power curves of F ad and F,a decrease very rapdly as te recombnaton fracton y Q ncreases Hence, g resoluton LD mappngs ave advantage to do fne gene mappngs, and approprate for te dense marker maps suc as sngle nucleotde polymorpsms on uman genome To nvestgate te effect of te age of te mutaton on te power, Fgure 5 sows te power curves aganst te poston of markers In te Fgure, te QTL locates at 5 cm wc s flanked by two markers and One marker s one te rgt-and sde of te QTL, and te oter s on te left-and sde wt equal dstance to te QTL Te power decreases quckly wen te age of te mutaton ncreases For a mutaton wc s 3 generatons old, one sould expect very low power f te markers locate 5 cm away from te QTL To explore te lnkage nterval mappng, we take a sample of k=5 nuclear famles eac as two offsprng Multplyng l lnkage, and l lnkage, by k, we may calculate te non-centralty parameters for te lnkage mappng usng marker and te lnkage nterval mappng usng markers and Moreover, assume tat te genetc dstances are l =3 cm, and l Q =l Q =5 cm, e, te QTL Q s rgt n te mddle between markers and Fgure 6 gves power curves of lnkage nterval mappng by markers and, and lnkage mappng by marker aganst ertablty for a mode of domnant nertance wt Fgure 4 Power curves of te test statstcs F,ad, F,a, F,ad, and F,a aganst te recombnaton fracton y Q at sgnfcant level, wen q =P =P =5, D ()=, D Q ()=D Q ()= 5, =5, l =5 cm, T=3, n=, m=3, k=, p =p =5, = =5, Ga ¼ :; Gd ¼ :5 for a mode of domnant nertance a=d=, and a mode of recessve nertance a=, d= 5, respectvely Fgure 5 Power curves of te test statstcs F,ad aganst te poston of markers at sgnfcant level for a mode of domnant nertance a=d=, and a mode of recessve nertance a=, d= 5, respectvely Te QTL locates at 5 cm wc s flanked by two markers and Here te mutaton age T=, 3, 4, 6, and te oter parameters are te same as tose n Fgure 4 European Journal of Human Genetcs

9 Lnkage and assocaton mappng of QTL R Fan and M Xong 33 Fgure 6 Power curves of te lnkage nterval mappng by markers and, and lnkage mappng by marker aganst te ertablty, wen q =P =P =5, l =3 cm, l Q =l Q = 5 cm, k=5, Ga ¼ :; Gd ¼ :5, at 5 sgnfcant level for a mode of domnant nertance a=d=, and a mode of recessve nertance a=, d= 5, respectvely a=d=, and a mode of recessve nertance wt a=, d= 5, respectvely It s clear tat te power of nterval lnkage mappng usng bot markers and s ger tan tat of lnkage mappng usng only one marker Dscusson In ts paper, we nvestgate varance component models of bot g resoluton LD mappng and lnkage analyss for QTL Te models are based on famly pedgree and populaton data We consder lkeloods wc utlzes flankng marker nformaton Te lkeloods jontly nclude recombnaton fractons, LD coeffcents, te average allele substtuton effect and allele domnant effect as parameters Te lnkage parameters are contaned n te varance-covarance matrx Te parameters of LD and gene effects are contaned n te regresson coeffcents 8,9,, Te model smultaneously takes care of te lnkage, LD and te effects of te putatve trat locus Q, and ence clearly demonstrates tat lnkage analyss and LD mappng are complmentary, not exclusve, metods for QTL mappng Te famly data wc ave at least two offsprng contan nformaton for bot lnkage and assocaton, and populaton data and tro famly data wc ave two parents and only one offsprng contan nformaton for assocaton y combnng te famly and populaton data n te analyss, one may expect to get better results tan tat by analysng tem separately Lnkage analyss can localze genetc trat loc n broad cromosome regons of a few cm (5 cm), and s less senstve to populaton admxture tan LD mappng In practce, one may carry out lnkage analyss as a frst step to obtan pror suggestve lnkage based on a sparse marker map y performng lnkage nterval mappngs, one may get ger power tan tat of usng only one marker Wt pror lnkage n and, LD mappng can be used to get g resoluton of te genetc trat loc based on a dense marker map We ave sown tat models of g resoluton LD mappngs usng two flankng markers ave ger power tan tat of models of usng only one marker Hence, g resoluton LD mappngs ave te advantages to do fne gene mappngs, and approprate for te dense marker maps suc as SNPs on uman genome Performng bot LD mappng and lnkage analyss as potental to avod false postves due to populaton story or envronmental effects In te meantme, t takes te advantage of g resoluton of LD mappng Te power of assocaton study depends on te exstence of LD between trat locus and markers In te absence of LD, te power of LD mappngs s very low To ncrease te probablty of detectng LD, one may need to carry out sutable desgn for a genetc study 4 It s well known tat te level of LD s eavly affected by populaton stratfcaton On te one and, te famly based metods are less lkely nfluenced by populaton stratfcaton tan tose of populaton data based metods On te oter and, a famly based assocaton study s less powerful tan tat of populaton based study for te same number of ndvduals Combnng te famly and populaton data, one may expect more nformaton, and take te advantage of populaton data and famly data More nvestgaton s needed to explore te populaton stratfcaton effect on g resoluton LD mappng of QTL, and to develop robust metods to dentfy assocaton between multple markers and QTL n te presence of populaton stratfcaton To our knowledge, tere s not muc researc on statstcal analyss about g resoluton LD mappng of QTL Usng only one b-allelc marker, te statstcal analyss of LD mappng as been studed by a few colleagues 8 3 Relatvely, multpont lnkage mappng as been studed more ntensvely 6,9,5 It s our ope tat te current researc may sed more lgt on te g resoluton assocaton study, and stmulate more nterests to utlze te advantage of LD mappng n fne resoluton of genetc studes In te Secton of power calculaton and comparson, we manly explore a set of scenaros of LD mappng For several sets of parameters, we compare te power of four test statstcs European Journal of Human Genetcs

10 34 Lnkage and assocaton mappng of QTL R Fan and M Xong for LD mappng Moreover, we compare te power of LD mappng of usng populaton data and famly data We also nvestgate te effect of mutaton age on te power For lnkage mappng, we only nclude one fgure to make power comparson of lnkage nterval mappng usng two markers wt lnkage mappng usng only one marker 9 Ts reflects te need for more researc on g resoluton LD mappng of QTL, snce te researc on lnkage nterval/multpont mappng s more mature In ts paper, we treat LD as a fxed effect snce only two markers are consdered In general, nference about te LD structure n te populaton are desrable, and LD sould be modeled as a random effect wen multple markers/aplotypes are used n analyss, wc would need more nvestgaton We assume tat te data of all famly members are avalable For some late-onset dseases, te data for te parents or former famly members may no longer be avalable In prncple, one can use smlar metods as te ones proposed n ts paper to perform g resoluton LD mappng for sb-par data of late-onset dseases Ts s an area wc s of mportance and needs more researc Due to te lengt of ts paper, we do not pursue tese ssues n dept, and tey wll be explored n oter projects cknowledgements We tank two revewers, and Dr Gert-Jan van Ommen for ter elpful comments to mprove te paper R Fan was supported partally by a researc fellowsp from te lexander von Humboldt Foundaton, Germany, and an Internatonal Researc Travel ssstance Grant of te Texas &M Unversty M Xong was supported by NIH grant R-GM5655, and MH5958 Ms JS Jung kndly wrtes te SS mcro, wc s avalable from R Fan upon request References otsten D, Wte RL, Skolnck MH, Davs RW: Constructon of a genetc lnkage map n man usng restrcton fragment lengt polymorpsms m J Hum Genet 98; 3: Morton NE: Sequental tests for te detecton of lnkage m J Hum Genet 955; 7: Morton NE: Sgnfcance levels n complex nertance m J Hum Genet 998; 6: Te Internatonal SNP Map Workng Group: map of uman genome sequence varaton contanng 4 mllon sngle nucleotde polymorpsms Nature ; 49: Rsc N, Merkangas K: Te future of genetc studes of complex uman dseases Scence 996; 73: becass GR, Cerny SS, Cookson WOC, Cardon LR: Merln rapd analyss of dense genetc maps usng sparse gene flow tress Nature Genetcs ; 3: 97 7 lmasy L, Wllams JT, Dyer TD, langero J: Quanttatve trat locus detecton usng combned lnkage/dsequlbrum analyss Genetc Epdemology 999; 7(Suppl ): S3 S36 8 Fulker DW, Cerny SS, Sam PC, Hewtt JK: Combned lnkage and assocaton sb-par analyss for quanttatve trats m J Hum Genet 999; 64: Sam PC, Cerny SS, Purcell S, Hewtt JK: Power of lnkage versus assocaton analyss of quanttatve trats, by use of varancecomponents models, for sbsp data m J Hum Genet ; 66: becass GR, Cardon LR, Cookson WOC: general test of assocaton for quanttatve trats n nuclear famles m J Hum Genet ; 66: 79 9 becass GR, Cookson WOC, Cardon LR: Pedgree tests of lnkage dsequlbrum Eur J Hum Genet ; 8: becass GR, Cookson WOC, Cardon LR: Te power to detect lnkage dsequlbrum wt quanttatve trats n selected samples m J Hum Genet ; 68: Cardon LR: sb-par regresson model of lnkage dsequlbrum for quanttatve trats Hum Hered ; 5: Fan R, Xong M: Hg resoluton mappng of quanttatve trat loc by lnkage dsequlbrum analyss Eur J Hum Gen ; : Falconer DS, Mackay TFC: Introducton to Quanttatve Genetcs London: Longman, 996, 4t edn 6 Fulker DW, Cardon LR: sb-par approac to nterval mappng of quanttatve trat loc m J Hum Genet 994; 54: Fan R: Interval mappng of quanttatve trat loc ttp:// stattamuedu/*rfan/papertml/nterval_mappngpdf 8 Lange K: Matematcal and Statstcal Metods for Genetc nalyss New York: Sprnger-Verlag, lmasy L, langero J: Multpont quanttatve trat lnkage analyss n general pedgrees m J Hum Genet 998; 6: 98 Stuart, Ord JK: Kendall s dvanced Teory of Statstcs, Vol : Classcal Inference and Relatonsps Oxford, 99, 5t edn Graybll F: Teory and pplcaton of te Lnear Model Calforna: Pacfc Grove, 976 Haseman JK, Elston RC: Te nvestgaton of lnkage between a quanttatve trat and a marker locus eavor Genetcs 97; : Hartl DL, Clark G: Prncples of Populaton Genetcs nd edn Snauer, oenke M, Langefeld CD: Genetc assocaton mappng based on dscordant sb pars: te dscordant-alleles test m J Hum Genet 998; 6: Pratt SC, Daly M, Kruglyak: Exact multpont quanttatve-trat lnkage analyss n pedgrees by varance components m J Hum Genet ; 66: European Journal of Human Genetcs

11 Lnkage and assocaton mappng of QTL R Fan and M Xong 35 ppendx In ts ppendx, we sow equaton (6) ctually, we ave E½x f x Š¼P a E½x ; f ¼ ŠðP a P ÞE½x ; f ¼ aš P E½x ; f ¼ aaš ¼ P a P a P ðp a P ÞP a P P P P a ðp a P ÞP P P a P a ðp a P Þ P a P ðp a P ÞðP a P Þ P P a ðp P a =Þ ¼ P a P a P ðp a P ÞðP a P ÞP a P P ð P Pa Þ¼P ap E½x f x Š¼P a E½x ; f ¼ ŠðP a P ÞE½x ; f ¼ aš P E½x ; f ¼ aaš ¼ P a P b PðÞP P ðp b P Þ PðÞP P b PðbÞP P P PðbÞP P b ðp a P Þ P b PðÞP a P PðaÞP P ðp b P Þ PðÞP a P b PðbÞP a P PðaÞP P b PðabÞP P P PðbÞP a P b PðabÞP P b P P b PðaÞP a P ðp b P Þ PðaÞP a P b PðabÞP a P P PðabÞP a P b ¼ P a ½PðÞP P b PðbÞP P Š ðp a P Þ½PðÞP a P b PðaÞP P b PðbÞP a P PðabÞP P Š P ½PðaÞP a P b PðabÞP a P Š ¼ PðÞP a P b PðaÞP P b PðbÞP a P PðabÞP P ¼ D E½x f z Š¼P a E½z ; f ¼ ŠðP a P ÞE½z ; f ¼ aš P E½z ; f ¼ aaš ¼ P a Pa P ðp a P ÞP a P P P Pa P ap P P P a Pa ðp a P Þ Pa P P a P ðp a P Þ P P a P P a = ¼ E½x f z Š¼P a E½z ; f ¼ ŠðP a P ÞE½z ; f ¼ aš P E½z ; f ¼ aaš ¼ P a P b PðÞP P P b P PðÞP P b PðbÞP P P PðbÞP P b ðp a P Þ Pb PðÞP a P PðaÞP P P b P PðÞP a P b PðbÞP a P PðaÞP P b PðabÞP P P PðbÞP a P b PðabÞP P b P Pb PðaÞP a P P b P PðaÞP a P b PðabÞP a P P PðabÞP a P b ¼ : Smlarly, we may sow te oter terms n equaton (6) ppendx y equatons (6), (7), and large number teory, we can sow te approxmaton (8) For nstance, te approxmaton for element on te second row and te second column s m Xnm ¼m ðx f x m x Þ ðx f x m x Þ t Xnm ¼ ð ð r r Þs m Þx f r x m r x x f ¼n r x f ð r Þx m r x x m r x f r x m x x ð r ð r Þs ÞP ap 4r P a P P a P ¼ P ap ð3 r r Þ ð r Þs : For te element on te fort row and te fort column, we ave m Xnm ¼m ðz f z m z Þ z f z m z ¼ ð r Þs m Xnm ¼n ð r Þz f r z m r z z f r z f ð r Þz m r z z m r z f r z m z z P a P ð3 r Þ ð r : Þs t European Journal of Human Genetcs

12 36 Lnkage and assocaton mappng of QTL R Fan and M Xong ppendx C To prove equaton (), we frst ave Eðx x Þ¼P 4 E½x x j f ¼ ; m ¼ Š P ðp P a ÞE½x x j f ¼ ; m ¼ aš P P a E½x x j f ¼ ; m ¼ aaš ðp P a Þ E½x x j f ¼ a; m ¼ aš Pa ðp P a ÞE½x x j f ¼ a; m ¼ aaš Pa 4 E½x x j f ¼ aa; m ¼ aaš ¼ P 4 ð4p a Þ4P3 P a P a = ðp a P Þ= P P a ½P a P Š ðp P a Þ P a =4 ðp a P Þ= P =4 4Pa 3 P ðp a P Þ= P = P 4 a ð4p Þ ¼ P P a Eðx z Þ¼P 4 E½x z j f ¼ ; m ¼ Š P ðp P a ÞE½x z j f ¼ ; m ¼ aš P P a E½x z j f ¼ ; m ¼ aaš ðp P a Þ E½x z j f ¼ a; m ¼ aš Pa ðp P a ÞE½x z j f ¼ a; m ¼ aaš Pa 4 E½x z j f ¼ aa; m ¼ aaš ¼ P 4 ð P3 a Þ4P3 P a½p a = ðp a P Þ=Š ½ Pa = P ap =ŠP P a ½P a P ŠP a P ðp P a Þ P a =4 ðp a P Þ= P =4 P a =4 P ap = P =4 4P 3 a P P 4 a ðp3 Þ¼ ðp a P Þ= P = P a P = P = Eðz z Þ¼P 4 E½z z j f ¼ ; m ¼ Š P ðp P a ÞE½z z j f ¼ ; m ¼ aš P P a E½z z j f ¼ ; m ¼ aaš ðp P a Þ E½z z j f ¼ a; m ¼ aš Pa ðp P a ÞE½z z j f ¼ a; m ¼ aaš Pa 4 E½z z j f ¼ aa; m ¼ aaš ¼ P 4 ð P a Þ 4P 3 P a Pa = P ap = P P a ½P ap Š ðp P a Þ Pa =4 P ap = P =4 4Pa 3 P P a P = P = P 4 a ð P Þ ¼ P a P =4: Smlarly, we may sow tat Eðx x Þ¼P b P ; E ðx z Þ¼ ; Eðz z Þ¼Pb P =4: To sow te oter terms of (), we frst calculate te jont probabltes P(, ), n wc te frst offsprng s genotype s at marker and te second offsprng s genotype s at marker, {,a, aa}, {, b, bb}we need to consder nne possble pases {, a, aa}6{, b, bb} for eac parent t te frst glance, one needs to consder 969 possble matngs to calculate P(, ) However, many matngs do not lead to specfc genotypes (, ) of a sb par Ts elmnates many terms and reduces te amount of calculatons For nstance, a matng of ( f =, f =)6( m =, m =) only results offsprng wt genotype (,) Ten, we ave Pð ¼ ; ¼ Þ ¼ X f ; f Pð f ; f Þ X m; m Pð m ; m ÞP½ ¼ ; ¼ jð f ; f Þ; ð m ; m ÞŠ ¼ PðÞ PðÞ PðÞPðbÞ= PðÞPðaÞ= ½PðÞPðabÞ PðbÞPðaÞŠ=4 PðÞPðbÞ PðÞPðbÞ=4 PðÞPðaÞ=4 ½PðÞPðabÞ PðbÞPðaÞŠ = =4 PðÞPðaÞ PðÞPðaÞ=4 ½PðÞPðabÞ PðbÞPðaÞŠ = =4 ½PðÞPðabÞPðbÞPðaÞŠ =4 =4 ¼ðPðÞ P P Þ =4 P P PðÞ ¼ =4 P P PðÞ: ð6þ Symmetrcally, we may get te followng tree terms Pð ¼ ; ¼ bbþ ¼ =4 P P b PðbÞ; Pð ¼ aa; ¼ Þ ¼ =4 P ap PðaÞ; Pð ¼ aa; ¼ bbþ ¼ =4 P ap b PðabÞ: ð7þ Note tat P( =, =b)=p( =)7P( =, = or bb) Hence, Pð ¼ ; ¼ bþ ¼ = PðÞP P b PðbÞP P : ð8þ Smlarly, we may calculate te followng tree terms Pð ¼ aa; ¼ bþ ¼ = PðaÞP ap b PðabÞP a P Pð ¼ a; ¼ Þ ¼ = PðÞP ap PðaÞP P Pð ¼ a; ¼ bbþ ¼ = PðbÞP ap b PðabÞP P b : ð9þ Fnally, we can calculate te followng term usng equaton S S P(, )= Pð ¼ a; ¼ bþ ¼ PðÞP ap b PðbÞP a P PðaÞP P b PðabÞP P : ðþ European Journal of Human Genetcs

13 Lnkage and assocaton mappng of QTL R Fan and M Xong 37 Usng equatons (6), (7), (8), (9) and (), we may calculate E½x x Š¼P a P b P½ ¼ ; ¼ Š ðp b P ÞP½ ¼ ; ¼ bš P P½ ¼ ; ¼ bbš ðp a P Þ P b P½ ¼ a; ¼ Š ðp b P ÞP½ ¼ a; ¼ bš P P½ ¼ a; ¼ bbš P P b P½ ¼ aa; ¼ Š ðp b P ÞP½ ¼ aa; ¼ bš P P½ ¼ aa; ¼ bbš ¼ D : Smlarly, we may get E½x z Š¼; E½z z Š¼D =4 y symmetrc property, we may calculate te remanng terms n () ppendx D Let P be te matrx gven by (9) To sow te approxmaton of (), we notce tat d can be calculated by d ¼ s ð Þ P ð Þ t Te element on te second row and te second column of approxmaton () can be calculated by k nmk X ¼nm ¼ s k ðx f x m x x Þ ðx f x m x x Þ t nmk X ¼nm ð Cr Þx f Cr x m Cx Cx x f Cr x f ð Cr Þx m Cx Cx x m Cx f Cx m Cð r Þ r ð r Þ x Cðr r Þ r ð r Þ x x Cx f Cx m Cðr r Þ r ð r Þ x Cð r Þ r ð r Þ x x ð Cr s ÞP a P CP a P CP a P ðcr ÞP a P CP a P CP a P CP a P CP a P Cð r Þ r ð r Þ P ap Cðr r Þ r ð r Þ P ap CP a P CP a P Cðr r Þ r ð r Þ P ap Cð r Þ r ð r Þ P ap ¼ P a P d =s : Smlarly, te element on te fort row and te fort column of approxmaton () s k nmk X ¼nm ¼ s k ðz f z m z z Þ ðz f z m z z Þ t nmk X ¼nm ð Cr Þz f Cr z m Cz Cz z f Cr z f ð Cr Þz m Cz Cz z m Cz f Cz m Cð r Þ r ð r Þ z Cðr r Þ r ð r Þ z z Cz f Cz m Cðr r Þ r ð r Þ z Cð r Þ r ð r Þ z z s ð Cr ÞP a P Cð r Þ r ð r Þ P a P Cðr r Þ r ð r Þ P a P =4 ¼ P a P d 44=s : Te oter terms of approxmaton () can be calculated n a smlar manner European Journal of Human Genetcs

Recall that quantitative genetics is based on the extension of Mendelian principles to polygenic traits.

Recall that quantitative genetics is based on the extension of Mendelian principles to polygenic traits. BIOSTT/STT551, Statstcal enetcs II: Quanttatve Trats Wnter 004 Sources of varaton for multlocus trats and Handout Readng: Chapter 5 and 6. Extensons to Multlocus trats Recall that quanttatve genetcs s

More information

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling

Multivariate Ratio Estimator of the Population Total under Stratified Random Sampling Open Journal of Statstcs, 0,, 300-304 ttp://dx.do.org/0.436/ojs.0.3036 Publsed Onlne July 0 (ttp://www.scrp.org/journal/ojs) Multvarate Rato Estmator of te Populaton Total under Stratfed Random Samplng

More information

Quantitative Genetic Models Least Squares Genetic Model. Hardy-Weinberg (1908) Principle. change of allele & genotype frequency over generations

Quantitative Genetic Models Least Squares Genetic Model. Hardy-Weinberg (1908) Principle. change of allele & genotype frequency over generations Quanttatve Genetc Models Least Squares Genetc Model Hardy-Wenberg (1908) Prncple partton of effects P = G + E + G E P s phenotypc effect G s genetc effect E s envronmental effect G E s nteracton effect

More information

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol

Using the estimated penetrances to determine the range of the underlying genetic model in casecontrol Georgetown Unversty From the SelectedWorks of Mark J Meyer 8 Usng the estmated penetrances to determne the range of the underlyng genetc model n casecontrol desgn Mark J Meyer Neal Jeffres Gang Zheng Avalable

More information

Checking Pairwise Relationships. Lecture 19 Biostatistics 666

Checking Pairwise Relationships. Lecture 19 Biostatistics 666 Checkng Parwse Relatonshps Lecture 19 Bostatstcs 666 Last Lecture: Markov Model for Multpont Analyss X X X 1 3 X M P X 1 I P X I P X 3 I P X M I 1 3 M I 1 I I 3 I M P I I P I 3 I P... 1 IBD states along

More information

Title: Bounds and normalization of the composite linkage disequilibrium coefficient.

Title: Bounds and normalization of the composite linkage disequilibrium coefficient. Ttle: ounds and normalzaton of the composte lnkage dsequlbrum coeffcent. (Genetc Epdemology 004 7:5-57 uthor: Dmtr V. Zaykn * * GlaxoSmthKlne Inc., Research Trangle Park, NC Contact detals: Dmtr Zaykn

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Joint Statistical Meetings - Biopharmaceutical Section

Joint Statistical Meetings - Biopharmaceutical Section Iteratve Ch-Square Test for Equvalence of Multple Treatment Groups Te-Hua Ng*, U.S. Food and Drug Admnstraton 1401 Rockvlle Pke, #200S, HFM-217, Rockvlle, MD 20852-1448 Key Words: Equvalence Testng; Actve

More information

Stanford University CS254: Computational Complexity Notes 7 Luca Trevisan January 29, Notes for Lecture 7

Stanford University CS254: Computational Complexity Notes 7 Luca Trevisan January 29, Notes for Lecture 7 Stanford Unversty CS54: Computatonal Complexty Notes 7 Luca Trevsan January 9, 014 Notes for Lecture 7 1 Approxmate Countng wt an N oracle We complete te proof of te followng result: Teorem 1 For every

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Kinship Coefficients. Biostatistics 666

Kinship Coefficients. Biostatistics 666 Knshp Coeffcents Bostatstcs 666 Today s Lecture Genetc analyses requre relatonshps to be specfed Msspecfed relatonshps lead to tests of napproprate sze Inflated Type I error Decreased power Knshp Coeffcents

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II PubH 7405: REGRESSION ANALSIS SLR: INFERENCES, Part II We cover te topc of nference n two sessons; te frst sesson focused on nferences concernng te slope and te ntercept; ts s a contnuaton on estmatng

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Multipoint Analysis for Sibling Pairs. Biostatistics 666 Lecture 18

Multipoint Analysis for Sibling Pairs. Biostatistics 666 Lecture 18 Multpont Analyss for Sblng ars Bostatstcs 666 Lecture 8 revously Lnkage analyss wth pars of ndvduals Non-paraetrc BS Methods Maxu Lkelhood BD Based Method ossble Trangle Constrant AS Methods Covered So

More information

Adaptive Kernel Estimation of the Conditional Quantiles

Adaptive Kernel Estimation of the Conditional Quantiles Internatonal Journal of Statstcs and Probablty; Vol. 5, No. ; 206 ISSN 927-7032 E-ISSN 927-7040 Publsed by Canadan Center of Scence and Educaton Adaptve Kernel Estmaton of te Condtonal Quantles Rad B.

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Not-for-Publication Appendix to Optimal Asymptotic Least Aquares Estimation in a Singular Set-up

Not-for-Publication Appendix to Optimal Asymptotic Least Aquares Estimation in a Singular Set-up Not-for-Publcaton Aendx to Otmal Asymtotc Least Aquares Estmaton n a Sngular Set-u Antono Dez de los Ros Bank of Canada dezbankofcanada.ca December 214 A Proof of Proostons A.1 Proof of Prooston 1 Ts roof

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Chapter 12 Analysis of Covariance

Chapter 12 Analysis of Covariance Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

TR/95 February Splines G. H. BEHFOROOZ* & N. PAPAMICHAEL

TR/95 February Splines G. H. BEHFOROOZ* & N. PAPAMICHAEL TR/9 February 980 End Condtons for Interpolatory Quntc Splnes by G. H. BEHFOROOZ* & N. PAPAMICHAEL *Present address: Dept of Matematcs Unversty of Tabrz Tabrz Iran. W9609 A B S T R A C T Accurate end condtons

More information

Bayesian predictive Configural Frequency Analysis

Bayesian predictive Configural Frequency Analysis Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse

More information

Lecture 4. Heritability. Heritability: An Intuitive Approach First Definition

Lecture 4. Heritability. Heritability: An Intuitive Approach First Definition Lecture Hertablty Hertablty: n Intutve pproac Frst enton Broa Sense: Proporton o te penotypc varaton ue to genetc causes H G Y Narro Sense: Proporton o te penotypc varaton ue to atve genetc eects Y Useul

More information

Chat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980

Chat eld, C. and A.J.Collins, Introduction to multivariate analysis. Chapman & Hall, 1980 MT07: Multvarate Statstcal Methods Mke Tso: emal mke.tso@manchester.ac.uk Webpage for notes: http://www.maths.manchester.ac.uk/~mkt/new_teachng.htm. Introducton to multvarate data. Books Chat eld, C. and

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Statistical methods for mapping quantitative traits using high density SNPs in family samples

Statistical methods for mapping quantitative traits using high density SNPs in family samples Statstcal methods for mappng quanttatve trats usng hgh densty SNPs n famly samples Josée Dupus Department of Bostatstcs Boston Unversty School of Publc Health Outlne Motvatng dataset: Framngham Heart Study

More information

Structure and Drive Paul A. Jensen Copyright July 20, 2003

Structure and Drive Paul A. Jensen Copyright July 20, 2003 Structure and Drve Paul A. Jensen Copyrght July 20, 2003 A system s made up of several operatons wth flow passng between them. The structure of the system descrbes the flow paths from nputs to outputs.

More information

Shuai Dong. Isaac Newton. Gottfried Leibniz

Shuai Dong. Isaac Newton. Gottfried Leibniz Computatonal pyscs Sua Dong Isaac Newton Gottred Lebnz Numercal calculus poston dervatve ntegral v velocty dervatve ntegral a acceleraton Numercal calculus Numercal derentaton Numercal ntegraton Roots

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Problem Set 4: Sketch of Solutions

Problem Set 4: Sketch of Solutions Problem Set 4: Sketc of Solutons Informaton Economcs (Ec 55) George Georgads Due n class or by e-mal to quel@bu.edu at :30, Monday, December 8 Problem. Screenng A monopolst can produce a good n dfferent

More information

One-sided finite-difference approximations suitable for use with Richardson extrapolation

One-sided finite-difference approximations suitable for use with Richardson extrapolation Journal of Computatonal Physcs 219 (2006) 13 20 Short note One-sded fnte-dfference approxmatons sutable for use wth Rchardson extrapolaton Kumar Rahul, S.N. Bhattacharyya * Department of Mechancal Engneerng,

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables LINEAR REGRESSION ANALYSIS MODULE VIII Lecture - 7 Indcator Varables Dr. Shalabh Department of Maematcs and Statstcs Indan Insttute of Technology Kanpur Indcator varables versus quanttatve explanatory

More information

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION

DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION Internatonal Worshop ADVANCES IN STATISTICAL HYDROLOGY May 3-5, Taormna, Italy DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION by Sooyoung

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Markov Chain Monte Carlo Lecture 6

Markov Chain Monte Carlo Lecture 6 where (x 1,..., x N ) X N, N s called the populaton sze, f(x) f (x) for at least one {1, 2,..., N}, and those dfferent from f(x) are called the tral dstrbutons n terms of mportance samplng. Dfferent ways

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Estimation: Part 2. Chapter GREG estimation

Estimation: Part 2. Chapter GREG estimation Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

The Finite Element Method: A Short Introduction

The Finite Element Method: A Short Introduction Te Fnte Element Metod: A Sort ntroducton Wat s FEM? Te Fnte Element Metod (FEM) ntroduced by engneers n late 50 s and 60 s s a numercal tecnque for solvng problems wc are descrbed by Ordnary Dfferental

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS Avalable onlne at http://sck.org J. Math. Comput. Sc. 3 (3), No., 6-3 ISSN: 97-537 COMPARISON OF SOME RELIABILITY CHARACTERISTICS BETWEEN REDUNDANT SYSTEMS REQUIRING SUPPORTING UNITS FOR THEIR OPERATIONS

More information

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k

Number of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels

More information

Uncertainty in measurements of power and energy on power networks

Uncertainty in measurements of power and energy on power networks Uncertanty n measurements of power and energy on power networks E. Manov, N. Kolev Department of Measurement and Instrumentaton, Techncal Unversty Sofa, bul. Klment Ohrdsk No8, bl., 000 Sofa, Bulgara Tel./fax:

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

An Application of Fuzzy Hypotheses Testing in Radar Detection

An Application of Fuzzy Hypotheses Testing in Radar Detection Proceedngs of the th WSES Internatonal Conference on FUZZY SYSEMS n pplcaton of Fuy Hypotheses estng n Radar Detecton.K.ELSHERIF, F.M.BBDY, G.M.BDELHMID Department of Mathematcs Mltary echncal Collage

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Efficient nonresponse weighting adjustment using estimated response probability

Efficient nonresponse weighting adjustment using estimated response probability Effcent nonresponse weghtng adjustment usng estmated response probablty Jae Kwang Km Department of Appled Statstcs, Yonse Unversty, Seoul, 120-749, KOREA Key Words: Regresson estmator, Propensty score,

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

A FORMULA FOR COMPUTING INTEGER POWERS FOR ONE TYPE OF TRIDIAGONAL MATRIX

A FORMULA FOR COMPUTING INTEGER POWERS FOR ONE TYPE OF TRIDIAGONAL MATRIX Hacettepe Journal of Mathematcs and Statstcs Volume 393 0 35 33 FORMUL FOR COMPUTING INTEGER POWERS FOR ONE TYPE OF TRIDIGONL MTRIX H Kıyak I Gürses F Yılmaz and D Bozkurt Receved :08 :009 : ccepted 5

More information

338 A^VÇÚO 1n ò Lke n Mancn (211), we make te followng assumpton to control te beavour of small jumps. Assumpton 1.1 L s symmetrc α-stable, were α (,

338 A^VÇÚO 1n ò Lke n Mancn (211), we make te followng assumpton to control te beavour of small jumps. Assumpton 1.1 L s symmetrc α-stable, were α (, A^VÇÚO 1n ò 1oÏ 215c8 Cnese Journal of Appled Probablty and Statstcs Vol.31 No.4 Aug. 215 Te Speed of Convergence of te Tresold Verson of Bpower Varaton for Semmartngales Xao Xaoyong Yn Hongwe (Department

More information

Two-factor model. Statistical Models. Least Squares estimation in LM two-factor model. Rats

Two-factor model. Statistical Models. Least Squares estimation in LM two-factor model. Rats tatstcal Models Lecture nalyss of Varance wo-factor model Overall mean Man effect of factor at level Man effect of factor at level Y µ + α + β + γ + ε Eε f (, ( l, Cov( ε, ε ) lmr f (, nteracton effect

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent

More information

Lecture 6 More on Complete Randomized Block Design (RBD)

Lecture 6 More on Complete Randomized Block Design (RBD) Lecture 6 More on Complete Randomzed Block Desgn (RBD) Multple test Multple test The multple comparsons or multple testng problem occurs when one consders a set of statstcal nferences smultaneously. For

More information

Convexity preserving interpolation by splines of arbitrary degree

Convexity preserving interpolation by splines of arbitrary degree Computer Scence Journal of Moldova, vol.18, no.1(52), 2010 Convexty preservng nterpolaton by splnes of arbtrary degree Igor Verlan Abstract In the present paper an algorthm of C 2 nterpolaton of dscrete

More information

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE) June 7, 016 15:30 Frst famly name: Name: DNI/ID: Moble: Second famly Name: GECO/GADE: Instructor: E-mal: Queston 1 A B C Blank Queston A B C Blank Queston

More information

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE

USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE STATISTICA, anno LXXV, n. 4, 015 USE OF DOUBLE SAMPLING SCHEME IN ESTIMATING THE MEAN OF STRATIFIED POPULATION UNDER NON-RESPONSE Manoj K. Chaudhary 1 Department of Statstcs, Banaras Hndu Unversty, Varanas,

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Reports of the Institute of Biostatistics

Reports of the Institute of Biostatistics Reports of the Insttute of Bostatstcs No 0 / 2007 Lebnz Unversty of Hannover Natural Scences Faculty Ttel: IUT for multple endponts Authors: Maro Hasler Introducton Some of the focus n new drug development

More information

F statistic = s2 1 s 2 ( F for Fisher )

F statistic = s2 1 s 2 ( F for Fisher ) Stat 4 ANOVA Analyss of Varance /6/04 Comparng Two varances: F dstrbuton Typcal Data Sets One way analyss of varance : example Notaton for one way ANOVA Comparng Two varances: F dstrbuton We saw that the

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Copyright 2010 Cengage Learning, Inc. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part.

Copyright 2010 Cengage Learning, Inc. All Rights Reserved. May not be copied, scanned, or duplicated, in whole or in part. yes to (3) two-sample problem? no to (4) underlyng dstrbuton normal or can centrallmt theorem be assumed to hold? and yes to (5) underlyng dstrbuton bnomal? We now refer to the flowchart at the end of

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

ANOVA. The Observations y ij

ANOVA. The Observations y ij ANOVA Stands for ANalyss Of VArance But t s a test of dfferences n means The dea: The Observatons y j Treatment group = 1 = 2 = k y 11 y 21 y k,1 y 12 y 22 y k,2 y 1, n1 y 2, n2 y k, nk means: m 1 m 2

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information