10. Generalized linear models

Size: px
Start display at page:

Download "10. Generalized linear models"

Transcription

1 10. Generalzed lnear models 10.1 Homogeneous models Exponental famles of dstrbutons, lnk functons, lkelhood estmaton 10.2 Example: Tort flngs 10.3 Margnal models and GEE 10.4 Random effects models 10.5 Fxed effects models Maxmum lkelhood, condonal lkelhood, Posson data 10.6 Baesan Inference Appendx 10A Exponental famles of dstrbutons

2 10.1 Homogeneous models Secton Outlne Exponental famles of dstrbutons Lnk functons Lkelhood estmaton In ths secton, we consder onl ndependent responses. No seral correlaton. No random effects that would nduce seral correlaton.

3 Exponental famles of dstrbutons The basc one parameter exponental faml s θ b( θ ) p(, θ, φ) = exp + S(, φ) φ Here, s a response and θ s the parameter of nterest. The parameter φ s a scale parameter that we often wll assume s known. The term b(θ) depends onl on the parameter θ, not the responses. S(, φ ) depends onl on the responses and the scale parameter, not the parameter θ. The response ma be dscrete or contnuous. Some straghtforward calculatons show that E = b (θ) and Var = b (θ) φ.

4 Specal cases of the basc exponental faml Normal The probabl dens functon s f( πσ 2 1 ( µ ) ( µ µ / 2) 1, µ, σ ) = exp = exp ln πσ 2σ σ 2σ 2 Take µ = θ, σ 2 = φ, b(θ) = θ 2 /2 and S(, φ ) = - 2 / (2φ) - ln(2 π φ))/2. Note that E = b (θ) = θ =µand Var = b (µ) σ 2 = σ 2. Bnomal, n trals and prob π of success The probabl mass functon s n n π f(, π ) = π (1 π ) = exp ln + n ln(1 π ) (1 π ) ( 2 ) n + ln Take ln (π/(1-π))= log (π) = θ, 1 = φ, b(θ) = n ln (1 + e θ ) and S(, φ ) = ln((n choose )). Note that E = b (θ) = n e θ /(1 + e θ ) = n π and Var = b (θ) (1) = n e θ /(1 + e θ ) 2 = n π(1-π), as antcpated.

5 Another specal case of the basc exponental faml Posson The probabl mass functon s ( λ) λ exp f(, λ) = = exp λ! Take ln (λ) = θ, 1 = φ, b(θ) = e θ and S(, φ ) = -ln(!)). Note that E = b (θ) = e θ = λ and Var = b (θ) (1) = e θ = λ, as antcpated. ( ln λ ln!)

6 Dstrbuton General Table 10A.1 Selected Dstrbutons of the One-Parameter Exponental Faml Parameters Dens Components E Var or Mass Functon θ, φ θ b( θ ) θ φ b(θ) S(,φ) b (θ) b (θ) φ exp + S(, φ) φ 2 1 ( µ ) µ σ θ ln(2πφ) θ = µ φ = σ 2 exp 2 σ 2π 2σ 2 + 2φ 2 Normal µ, σ 2 Bnomal π n n π π ( 1 π ) ln 1 π Posson λ exp( λ) Gamma α, β α β Γ( α)! 1 θ n ln ( 1 + e ) θ n e ln n θ 1 + e = nπ n e θ θ ( 1 + e ) 2 = n π ( 1 π ) λ ln λ 1 e θ -ln (!) e θ = λ e θ = λ α 1 exp ( β ) β α 1 -ln (-θ) 1 φ lnφ α ln ( 1 Γ φ ) + ( φ ( ) 1 1) ln 1 α θ = φ α 2 = 2 β θ β

7 Lnk functons To lnk up the unvarate exponental faml wh regresson problems, we defne the sstematc component of to be η = x β. The dea s to now choose a lnk between the sstematc component and the mean of, sa µ, of the form: η = g(µ ). g(.) s the lnk functon. Lnear combnatons of explanator varables, η = x β, ma var between negatve and posve nfn. However, means ma be restrcted to smaller range. For example, Posson means var between zero and nfn. The lnk functon serves to map the doman of the mean functon onto the whole real lne.

8 Bernoull llustraton of lnks Bernoull means var between 0 and 1, although lnear combnatons of explanator varables ma var between negatve and posve nfn. Here are three mportant examples of lnk functons for the Bernoull dstrbuton: Log: η = g(µ) = log(µ) = ln (µ/(1 µ)). Prob: η = g(µ) = Φ -1 (µ), where Φ -1 s the nverse of the standard normal dstrbuton functon. Complementar log-log: η = g(µ) = ln ( -ln(1 µ) ). Each functon maps the un nterval (0,1) onto the whole real lne.

9 Canoncal lnks As we have seen wh the Bernoull, there are several lnk functons that ma be suable for a partcular dstrbuton. When the sstematc component equals the parameter of nterest (η = θ ), ths s an ntuvel appealng case. That s, the parameter of nterest, θ, equals a lnear combnaton of explanator varables, η. Recall that η = g(µ) and µ = b (θ). Thus, f g -1 = b, then η = g(b (θ)) = θ. The choce of g, such that g -1 = b, s called a canoncal lnk. Examples: Normal: g(θ) = θ, Bnomal: g(θ) = log(θ), Posson: g(θ) = ln θ.

10 Estmaton Begn wh lkelhood estmaton for canoncal lnks Consder responses, wh mean µ, sstematc component η = g(µ ) = x β and canoncal lnk so that η = θ. Assume the responses are ndependent. Then, the log-lkelhood s ln p( ) = θ φ b ( θ ) + S(, φ ) = ( x β ) b( x β ) + S( φ, φ )

11 The log-lkelhood s MLEs - Canoncal lnks ln p( ) = ( x β) b( x β) Takng the partal dervatve wh respect to β elds the score equatons: 0= because µ = b (θ ) = b (x β ). Thus, we can solve for the mle s of β through: + S( φ 0 = Σ x ( - µ )., φ ) ( b ( xβ) ) x ( ) = µ x φ Ths s a specal case of the method of moments. φ

12 MLEs - general lnks For general lnks, we no longer assume the relaton θ = x β. We assume that β s related to θ through µ = b (θ ) and η = x β = g(µ ). Recall that the log-lkelhood s ( ) θ b θ ln p( ) = + S(, φ) φ Further, E = µ and Var = b (θ ) / φ. The jth element of the score functon s θ µ ln p( ) = β j β j φ because b (θ ) = µ

13 MLEs - more on general lnks To elmnate θ, we use the chan rule to get Thus, µ β Ths elds j = b ( θ β θ β j j ln p( ) ) = = = b ( θ µ β j β j β j Ths s called the generalzed estmatng equatons form. θ ) β φ Var µ j = Var φ θ β 1 ( Var ) ( µ ) j

14 Overdsperson When ftng models to data wh bnar or count dependent varables, s common to observe that the varance exceeds that antcpated b the f of the mean parameters. Ths phenomenon s known as overdsperson. A probablstc models ma be avalable to explan ths phenomenon. In man suatons, analsts are content to postulate an approxmate model through the relaton Var = σ 2 φ b (x β) / w. The scale parameter φ s specfed through the choce of the dstrbuton The scale parameter σ 2 allows for extra varabl. When the addonal scale parameter σ 2 s ncluded, s customar to estmate b Pearson s ch-square statstc dvded b the error degrees of freedom. That s, ( b ( x b MLE ) ˆ σ = w N K φ b ( x b ) MLE

15 10.2 Example: Tort flngs Table 10.2 State Characterstcs Dependent Varable NUMFILE Number of flngs of tort actons aganst nsurance companes. State Legal Characterstcs JSLIAB An ndcator of jont and several labl reform. COLLRULE An ndcator of collateral source reform. CAPS An ndcator of caps on non-economc reform. PUNITIVE An ndcator of lms of punve damage State Economc and Demographc Characterstcs POP The state populaton, n mllons. POPLAWYR The populaton per lawer. VEHCMILE Number of automobles mles per mle of road, n thousands. POPDENSY Number of people per ten square mles of land. WCMPMAX Maxmum workers compensaton weekl benef. URBAN Percentage of populaton lvng n urban areas. UNEMPLOY State unemploment rate, n percentages. Source: An Emprcal Stud of the Effects of Tort Reforms on the Rate of Tort Flngs, unpublshed Ph.D. Dssertaton, Han-Duck Lee, Unvers of Wsconsn (1994).

16 Table 10.3 Averages wh Explanator Indcator Varables Explanator Varable JSLIAB COLLRULE CAPS PUNITIVE Average Explanator Varable Average NUMFILE When Explanator Varable = 0 15,530 20,727 24,682 17,693 When Explanator Varable = 1 25,967 20,027 6,727 26,469 Table Summar Statstcs for Other Varables Varable Mean Medan Mnmum Maxmum Standard Correlaton wh NUMFILE devaton NUMFILE POP POPLAWYR VEHCMILE POPDENSY WCMPMAX URBAN UNEMPLOY

17 Offsets We assume that s Posson dstrbuton wh parameter POP exp(x β), where POP s the populaton of the h state at tme t. In GLM termnolog, a varable wh a known coeffcent equal to 1 s known as an offset. Usng logarhmc populaton, our Posson parameter for s ( ) ( ) ( β) exp, ln POP + x, 1β1 + L + x K β K = exp ln POP + xβ = POP exp x An alternatve approach s to use the average number of tort flngs as the response and assume approxmate normal. Note that n the Posson model above the expectaton of the average response s E ( / POP ) = exp( x β) whereas the varance s Var / POP = exp x β / POP ( ) ( )

18 Tort flngs Purpose: to understand was n whch state legal, economc and demographc characterstcs affect the number of flngs. Table 10.3 suggests more flngs under JSLIAB and PUNITIVE but less under CAPS Table 10.5 All varables under the homogenous model are statstcall sgnfcant However, estmated scale parameter seems mportant Here, onl JSLIAB s (posvel) statstcall sgnfcant Tme (categorcal) varable seems mportant

19 Table 10.5 Tort Flngs Model Coeffcent Estmates Based on N = 112 observatons from n = 19 states and T = 6 ears. Logarhmc populaton s used as an offset. Homogeneous Posson model Model wh estmated scale parameter Model wh scale parameter and tme categorcal varable Varable Parameter p-values Parameter p-values Parameter p-values estmate estmate estmate Intercept < <.0001 POPLAWYR/ < VEHCMILE/ < < <.0001 POPDENSY/ < WCMPMAX/ < URBAN/ < UNEMPLOY < JSLIAB < COLLRULE < CAPS < PUNITIVE < Scale Devance 118, , ,496.4 Pearson Ch-Square 129, , ,073.9

20 10.3 Margnal models Ths approach reduces the relance on the dstrbutonal assumptons b focusng on the frst two moments. We frst assume that the varance s a known functon of the mean up to a scale parameter, that s, Var = v(µ ) φ. Ths s a consequence of the exponental faml, although now s a basc assumpton. That s, n the GLM settng, we have Var = b (θ ) φ and µ = b (θ ). Because b(.) and φ are assumed known, Var s a known functon of µ. We also assume that the correlaton between two observatons whn the same subject s a known functon of ther means, up to a vector of parameters τ. That s corr( r, s ) = ρ(µ r, µ s, τ), for ρ(.) known.

21 Margnal model Ths framework ncorporates the lnear model ncel; we smpl use a GLM wh a normal dstrbuton. However, for nonlnear suatons, a correlaton s not alwas the best wa to capture dependences among observatons. Here s some notaton to help see the estmaton procedures. Defne µ = (µ 1, µ 2,..., µ T ) to be the vector of means for the h subject. To express the varance-covarance matrx, we defne a dagonal matrx of varances V = dag(v(µ 1 ),..., v( µ T ) ) and the matrx of correlatons R (τ) to be a matrx wh ρ(µ r, µ s, τ) n the rth row and sth column. Thus, Var = V 1/2 R (τ) V 1/2.

22 Generalzed estmatng equatons These assumptons are suable for a method of moments estmaton procedure called generalzed estmatng equatons (GEE) n bostatstcs, also known as the generalzed method of moments (GMM) n econometrcs. GEE wh known correlaton parameter Assumng τ s known, the jth row of the GEE s n 0 = K G µ = 1 Here, the matrx 1 ( b) V ( b)( µ ( b)) µ G β = µ 1 T µ (, τ ) L β β s T x K*. For lnear models wh µ = z α + x β, ths s the GLS estmator ntroduced n Secton 3.3.

23 Consstenc of GEEs The soluton, b EE, s asmptotcall normal wh covarance matrx Var n = 1 b EE G µ ( b) V ( b) G µ = 1 ( b) 1 Because ths s a functon of the means, µ, can be consstentl estmated.

24 Robust estmaton of standard errors emprcal standard errors ma be calculated usng the followng estmator of the asmptotc varance of b EE ( )( ) = = = n n n µ µ µ µ µ µ G V G G V µ µ V G G V G

25 GEE - correlaton parameter estmaton For GEEs wh unknown correlaton parameters, Prentce (1988) suggests usng a second estmatng equaton of the form: * E ( ) 1 * * W E = τ 0 where * 2 2 L L L = ( 2 ) Dggle, Lang and Zeger (1994) suggest usng the dent matrx for most dscrete data. However, for bnar responses, the note that the last T observatons are redundant because = 2 and should be gnored. the recommend usng 1 T 1 2 T ( Var( ) Var( )) W = dag 2 L, 1 1 T T

26 Tort flngs Assume an ndependent workng correlaton Ths elds at the same parameter estmators as n Table 10.5, under the homogenous Posson model wh an estmated scale parameter. JSLIAB s (posvel) statstcall sgnfcant, usng both model-based and robust standard errors. To test the robustness of ths model f, we f the same model wh an AR (1) workng correlaton. Agan, JSLIAB s (posvel) statstcall sgnfcant. Interestng that CAPS s now borderlne but n the oppose drecton suggested b Table 10.3

27 Parameter Estmate Emprcal Standard Error Table 10.6 Comparson of GEE Estmators. All models use an estmated scale parameter. Logarhmc populaton s used as an offset. Independent Workng Correlaton AR(1) Workng Correlaton Model-Based Standard Error Estmate Emprcal Standard Error Model-Based Standard Error Intercept * * POPLAWYR/ VEHCMILE/ * * POPDENSY/ * * WCMPMAX/ URBAN/ UNEMPLOY 0.087* * JSLIAB 0.177* * COLLRULE CAPS * PUNITIVE Scale AR(1) Coeffcent The astersk (*) ndcates that the estmate s more than twce the emprcal standard error, n absolute value.

28 10.4 Random effects models The motvaton and samplng ssues regardng random effects were ntroduced n Chapter 3. The model s easest to ntroduce and nterpret n the followng herarchcal fashon: 1. Subject effects {α } are a random sample from a dstrbuton that s known up to a vector of parameters τ. 2. Condonal on {α }, the responses { 1, 2,..., T } are a random sample from a GLM wh sstematc component η = z α + x β.

29 Random effects models Ths model s a generalzaton of: 1. The lnear random effects model n Chapter 3 - use a normal dstrbuton. 2. The bnar dependent varables random effects model of Secton usng a Bernoull dstrbuton. (In Secton 9.2, we focused on the case z =1.) Because we are samplng from a known dstrbuton wh a fne/small number of parameters, the maxmum lkelhood method of estmaton s readl avalable. We wll use ths method, assumng normall dstrbuted random effects. Also avalable n the lerature s the EM (for expectatonmaxmzaton) algorhm for estmaton - See Dggle, Lang and Zeger (1994).

30 Random effects lkelhood Condonal on α, the lkelhood for the h subject at the tth observaton s θ b( θ ) exp + S(, φ) φ where b (θ ) = E ( α ) and η = z α + x β = g(e ( α ) ). Condonal on α, the lkelhood for the h subject s: exp t θ b φ ( θ ) + S(, φ) We take expectatons over α to get the (uncondonal) lkelhood. To see ths explcl, let s use the canoncal lnk so that θ = η. The (uncondonal) lkelhood for the h subject s z a + x β b z a + x β l = exp S(, φ) exp d G( a) t t φ Hence, the total log-lkelhood s Σ ln l. The constant Σ S(, φ ) s unmportant for determnng mle s. Although evaluatng, and maxmzng, the lkelhood requres numercal ntegraton, s eas to do on the computer. ( ) ( )

31 Random effects and seral correlaton We saw n Chapter 3 that permtng subject-specfc effects, α, to be random nduced seral correlaton n the responses. Ths s because the varance-covarance matrx of s no longer dagonal. Ths s also true for the nonlnear GLM models. To see ths, let s use a canoncal lnk and recall that E ( α ) ) = b (θ ) = b (η ) = b (α + x β).

32 Covarance calculatons The covarance between two responses, 1 and 2, s Cov( 1, 2 ) = E 1 2 -E 1 E 2 = E {b (α +x 1 β) b (α +x 2 β)} -E b (α +x 1 β) E b (α +x 2 β) To see ths, usng the law of erated expectatons, E 1 2 = E E ( 1 2 α ) = E {E ( 1 α ) E( 2 α )} = E {b (α + x 1 β) b (α + x 2 β)}

33 More covarance calculatons Normal For the normal dstrbuton we have b (a) = a. Thus, Cov( 1, 2 ) = E {(α + x 1 β) (α + x 2 β)} - E (α + x 1 β) E (α + x 2 β) = E α 2 + (x 1 β) (x 2 β)-(x 1 β) (x 2 β)= Varα. For the Posson, we have b (a) = e a. Thus, E = E b (α + x β) = E exp(α + x β) = exp(x β) E exp(α ) and Cov( 1, 2 ) = E {exp(α + x 1 β) exp(α + x 2 β)} - exp((x 1 +x 2 ) β) {E exp(α )} 2 = exp((x 1 +x 2 ) β) {E exp(2α) - (E exp(α)) 2 } = exp((x 1 +x 2 ) β) Var exp(α).

34 Random effects lkelhood Recall, from Secton 10.2, that the (uncondonal) lkelhood for the h subject s ( ) ( ) z a + x β b z a + x β l = exp S(, φ) exp d G( a) t t φ Here, we use z = 1,φ = 1, and g(a) s the dens of α. For the Posson, we have b(a) = e a, and S(, φ) = -ln(!), so the lkelhood s l = exp ln(!) exp ( ( a + x β ) exp( a + x β )) g( a) da t t exp ln(!) exp = x β exp β t t t ( a exp( a + x )) g( a) da As before, evaluatng and maxmzng the lkelhood requres numercal ntegraton, et s eas to do on the computer.

35 Table 10.7 Tort Flngs Model Coeffcent Estmates Random Effects Logarhmc populaton s used as an offset. Homogeneous Model wh Random Effects Model estmated scale parameter Varable Parameter p-values Parameter estmate p-values estmate Intercept < <.0001 POPLAWYR/ <.0001 VEHCMILE/ < POPDENSY/ <.0001 WCMPMAX/ <.0001 URBAN/ <.0001 UNEMPLOY <.0001 JSLIAB COLLRULE CAPS PUNITIVE State Varance Log Lkelhood 119,576 15,623

36 10.5 Fxed effects models Consder responses, wh mean µ, sstematc component η = g(µ ) = z α + x β and canoncal lnk so that η = θ. Assume the responses are ndependent. Then, the log-lkelhood s ln p( ) = = ( θ ) + S(, φ) Thus, the responses depend on the parameters through onl summar statstcs. θ φ ( z α + x β ) b( z α + x β ) + S( φ That s, the statstcs Σ t z are suffcent for α. The statstcs Σ x are suffcent for β. Ths s a convenent propert of the canoncal lnks. It s not avalable for other choces of lnks. b, φ )

37 The log-lkelhood s ln p( ) MLEs - Canoncal lnks = ( z α + x β) b( z α + x β) + S( φ Takng the partal dervatve wh respect to α elds: z b ( z α + x β) z µ 0 = = t φ t φ ( ) ( ) because µ = b (θ ) = b (z α + x β ). Takng the partal dervatve wh respect to β elds: ( ( )) ( ), φ ) x b zα + xβ x µ 0 = = φ φ Thus, we can solve for the mle s of α and β through: 0 = Σ t z ( - µ ), and 0 = Σ x ( - µ ). Ths s a specal case of the method of moments. Ths ma produce nconsstent estmates of β, as we have seen n Chapter 9.

38 Table 10.8 Tort Flngs Model Coeffcent Estmates Fxed Effects All models have an estmated scale parameter. Logarhmc populaton s used as an offset. Homogeneous Model Model wh state categorcal varable Model wh state and tme categorcal varables Varable Parameter p-values Parameter p-values Parameter p-values estmate estmate estmate Intercept <.0001 POPLAWYR/ VEHCMILE/ < POPDENSY/ WCMPMAX/ URBAN/ UNEMPLOY JSLIAB COLLRULE CAPS PUNITIVE Scale Devance 118, , ,834.2 Pearson Ch-Square 129, , ,763.0

39 Condonal lkelhood estmaton Assume the canoncal lnk so that θ = η = z α + x β. Defne the lkelhood for a sngle observaton to be ( z α + x β) b( z α + x β) p(, α, β) = exp + S(, φ) φ Let S be the random vector representng Σ t z and let sum be the realzaton of Σ t z. Recall that Σ t z are suffcent for α. The condonal lkelhood of the data set s n p( 1, α, β)p( 2, α, β) Lp( T, α, β) = = 1 Prob( S sum ) Ths lkelhood does not depend on {α }, onl on β. Maxmzng wh respect to β elds root-n consstent estmates. The dstrbuton of S s mess and s dffcult to compute.

40 Posson dstrbuton The Posson s the most wdel used dstrbuton for counted responses. Examples nclude the number of mgrants from state to state and the number of tort flngs whn a state. A feature of the fxed effects verson of the model s that the mean equals the varance. To llustrate the applcaton of Posson panel data models, let s use the canoncal lnk and z = 1, so that ln E ( α ) = g(e ( α ) ) = θ = η = α + x β. Through the log functon, lnks the mean to a lnear combnaton of explanator varables. It s the bass of the so-called log-lnear model.

41 Condonal lkelhood estmaton We frst examne the fxed effects model and thus assume that {α } are fxed parameters. Thus, E = exp (α + x β). The dstrbuton s ( E ) exp( - E ) p(, α, β) =! From Secton 10.1, Σ t s a suffcent statstc for α. The dstrbuton of Σ t turns out to be Posson, wh mean exp(α ) Σ t exp(x β). Note that the rato of means, E exp( x β) π = = E exp( x β) does not depend on α. t t

42 Condonal lkelhood detals Thus, as n Secton 10.1, the condonal lkelhood for the h subject s ) Pr ob( ),, p( ),, )p(,, p( 2 1 = t T S β α β α β α L ( ) ( ) ( ) ( )! E - exp E!! E - exp E E = t t t T T T t T L L L

43 Condonal lkelhood detals where = Ths s a multnomal dstrbuton. = 1! t! L! T ( ) ( ) 1 E L E 1 t E t! t π 1! L T! t E exp ( x β) π = = E exp ( x β) t t T T

44 Multnomal dstrbuton Thus, the jont dstrbuton of 1,..., T gven Σ t has a multnomal dstrbuton. The condonal lkelhood s: Takng partal dervatves elds: ln β L = n t = 1 1! L T! where π ( β ) =. exp ( x β ) t Thus, the condonal MLE, b, s the soluton of: L ln π β =! ( β ) = x x π ( β ) exp ( x β ) x xπ = t ( b) 0 t t π

9. Binary Dependent Variables

9. Binary Dependent Variables 9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

Limited Dependent Variables and Panel Data. Tibor Hanappi

Limited Dependent Variables and Panel Data. Tibor Hanappi Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range

More information

Systems of Equations (SUR, GMM, and 3SLS)

Systems of Equations (SUR, GMM, and 3SLS) Lecture otes on Advanced Econometrcs Takash Yamano Fall Semester 4 Lecture 4: Sstems of Equatons (SUR, MM, and 3SLS) Seemngl Unrelated Regresson (SUR) Model Consder a set of lnear equatons: $ + ɛ $ + ɛ

More information

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Introduction to Generalized Linear Models

Introduction to Generalized Linear Models INTRODUCTION TO STATISTICAL MODELLING TRINITY 00 Introducton to Generalzed Lnear Models I. Motvaton In ths lecture we extend the deas of lnear regresson to the more general dea of a generalzed lnear model

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,

More information

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1 Random varables Measure of central tendences and varablty (means and varances) Jont densty functons and ndependence Measures of assocaton (covarance and correlaton) Interestng result Condtonal dstrbutons

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Chapter 14: Logit and Probit Models for Categorical Response Variables

Chapter 14: Logit and Probit Models for Categorical Response Variables Chapter 4: Logt and Probt Models for Categorcal Response Varables Sect 4. Models for Dchotomous Data We wll dscuss only ths secton of Chap 4, whch s manly about Logstc Regresson, a specal case of the famly

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

MLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012

MLE and Bayesian Estimation. Jie Tang Department of Computer Science & Technology Tsinghua University 2012 MLE and Bayesan Estmaton Je Tang Department of Computer Scence & Technology Tsnghua Unversty 01 1 Lnear Regresson? As the frst step, we need to decde how we re gong to represent the functon f. One example:

More information

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected. ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency

More information

QUASI-LIKELIHOOD APPROACH TO RATER AGREEMENT PLUS LINEAR BY LINEAR ASSOCIATION MODEL FOR ORDINAL CONTINGENCY TABLES

QUASI-LIKELIHOOD APPROACH TO RATER AGREEMENT PLUS LINEAR BY LINEAR ASSOCIATION MODEL FOR ORDINAL CONTINGENCY TABLES Journal of Statstcs: Advances n Theory and Applcatons Volume 6, Number, 26, Pages -5 Avalable at http://scentfcadvances.co.n DOI: http://dx.do.org/.8642/jsata_72683 QUASI-LIKELIHOOD APPROACH TO RATER AGREEMENT

More information

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics

A New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics A New Method for Estmatng Overdsperson Davd Fletcher and Peter Green Department of Mathematcs and Statstcs Byron Morgan Insttute of Mathematcs, Statstcs and Actuaral Scence Unversty of Kent, England Overvew

More information

The EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X

The EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X The EM Algorthm (Dempster, Lard, Rubn 1977 The mssng data or ncomplete data settng: An Observed Data Lkelhood (ODL that s a mxture or ntegral of Complete Data Lkelhoods (CDL. (1a ODL(;Y = [Y;] = [Y,][

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Hidden Markov Models & The Multivariate Gaussian (10/26/04)

Hidden Markov Models & The Multivariate Gaussian (10/26/04) CS281A/Stat241A: Statstcal Learnng Theory Hdden Markov Models & The Multvarate Gaussan (10/26/04) Lecturer: Mchael I. Jordan Scrbes: Jonathan W. Hu 1 Hdden Markov Models As a bref revew, hdden Markov models

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Tests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions

Tests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions ECONOMICS 35* -- NOTE ECON 35* -- NOTE Tests of Sngle Lnear Coeffcent Restrctons: t-tests and -tests Basc Rules Tests of a sngle lnear coeffcent restrcton can be performed usng ether a two-taled t-test

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

8 : Learning in Fully Observed Markov Networks. 1 Why We Need to Learn Undirected Graphical Models. 2 Structural Learning for Completely Observed MRF

8 : Learning in Fully Observed Markov Networks. 1 Why We Need to Learn Undirected Graphical Models. 2 Structural Learning for Completely Observed MRF 10-708: Probablstc Graphcal Models 10-708, Sprng 2014 8 : Learnng n Fully Observed Markov Networks Lecturer: Erc P. Xng Scrbes: Meng Song, L Zhou 1 Why We Need to Learn Undrected Graphcal Models In the

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

a. (All your answers should be in the letter!

a. (All your answers should be in the letter! Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal

More information

Computing MLE Bias Empirically

Computing MLE Bias Empirically Computng MLE Bas Emprcally Kar Wa Lm Australan atonal Unversty January 3, 27 Abstract Ths note studes the bas arses from the MLE estmate of the rate parameter and the mean parameter of an exponental dstrbuton.

More information

Lecture 3: Probability Distributions

Lecture 3: Probability Distributions Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the

More information

Conjugacy and the Exponential Family

Conjugacy and the Exponential Family CS281B/Stat241B: Advanced Topcs n Learnng & Decson Makng Conjugacy and the Exponental Famly Lecturer: Mchael I. Jordan Scrbes: Bran Mlch 1 Conjugacy In the prevous lecture, we saw conjugate prors for the

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Hydrological statistics. Hydrological statistics and extremes

Hydrological statistics. Hydrological statistics and extremes 5--0 Stochastc Hydrology Hydrologcal statstcs and extremes Marc F.P. Berkens Professor of Hydrology Faculty of Geoscences Hydrologcal statstcs Mostly concernes wth the statstcal analyss of hydrologcal

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Generative classification models

Generative classification models CS 675 Intro to Machne Learnng Lecture Generatve classfcaton models Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square Data: D { d, d,.., dn} d, Classfcaton represents a dscrete class value Goal: learn

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

Limited Dependent Variables

Limited Dependent Variables Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Primer on High-Order Moment Estimators

Primer on High-Order Moment Estimators Prmer on Hgh-Order Moment Estmators Ton M. Whted July 2007 The Errors-n-Varables Model We wll start wth the classcal EIV for one msmeasured regressor. The general case s n Erckson and Whted Econometrc

More information

Diagnostics in Poisson Regression. Models - Residual Analysis

Diagnostics in Poisson Regression. Models - Residual Analysis Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent

More information

1 Binary Response Models

1 Binary Response Models Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models

More information

The Geometry of Logit and Probit

The Geometry of Logit and Probit The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.

More information

Applications of GEE Methodology Using the SAS System

Applications of GEE Methodology Using the SAS System Applcatons of GEE Methodology Usng the SAS System Gordon Johnston Maura Stokes SAS Insttute Inc, Cary, NC Abstract The analyss of correlated data arsng from repeated measurements when the measurements

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements

CS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements CS 750 Machne Learnng Lecture 5 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square CS 750 Machne Learnng Announcements Homework Due on Wednesday before the class Reports: hand n before

More information

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Maxmum Lkelhood Estmaton INFO-2301: Quanttatve Reasonng 2 Mchael Paul and Jordan Boyd-Graber MARCH 7, 2017 INFO-2301: Quanttatve Reasonng 2 Paul and Boyd-Graber Maxmum Lkelhood Estmaton 1 of 9 Why MLE?

More information

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model A Monte Carlo Study for Swamy s Estmate of Random Coeffcent Panel Data Model Aman Mousa, Ahmed H. Youssef and Mohamed R. Abonazel Department of Appled Statstcs and Econometrcs, Instute of Statstcal Studes

More information

Lecture 12: Discrete Laplacian

Lecture 12: Discrete Laplacian Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly

More information

Analyzing Longitudinal Data Using Gee-Smoothing Spline

Analyzing Longitudinal Data Using Gee-Smoothing Spline Proceedngs of the 8th WSEAS Internatonal Conference on Appled Computer and Appled Computatonal Scence Analyzng Longtudnal Data Usng Gee-Smoothng Splne NOOR AKMA IBRAHIM Unverst Putra Malaysa Insttute for

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

Andreas C. Drichoutis Agriculural University of Athens. Abstract

Andreas C. Drichoutis Agriculural University of Athens. Abstract Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty

More information

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting. The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator

More information

However, since P is a symmetric idempotent matrix, of P are either 0 or 1 [Eigen-values

However, since P is a symmetric idempotent matrix, of P are either 0 or 1 [Eigen-values Fall 007 Soluton to Mdterm Examnaton STAT 7 Dr. Goel. [0 ponts] For the general lnear model = X + ε, wth uncorrelated errors havng mean zero and varance σ, suppose that the desgn matrx X s not necessarly

More information

How its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013

How its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013 Andrew Lawson MUSC INLA INLA s a relatvely new tool that can be used to approxmate posteror dstrbutons n Bayesan models INLA stands for ntegrated Nested Laplace Approxmaton The approxmaton has been known

More information

MAXIMUM LIKELIHOOD FOR GENERALIZED LINEAR MODEL AND GENERALIZED ESTIMATING EQUATIONS

MAXIMUM LIKELIHOOD FOR GENERALIZED LINEAR MODEL AND GENERALIZED ESTIMATING EQUATIONS www.arpapress.com/volumes/vol18issue1/ijrras_18_1_08.pdf MAXIMUM LIKELIHOOD FOR GENERALIZED LINEAR MODEL AND GENERALIZED ESTIMATING EQUATIONS A. Lanan MAM Laboratory. Department of Mathematcs. Unversty

More information

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family

Using T.O.M to Estimate Parameter of distributions that have not Single Exponential Family IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)

MATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2) 1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li Bostatstcs Chapter 11 Smple Lnear Correlaton and Regresson Jng L jng.l@sjtu.edu.cn http://cbb.sjtu.edu.cn/~jngl/courses/2018fall/b372/ Dept of Bonformatcs & Bostatstcs, SJTU Recall eat chocolate Cell 175,

More information

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact Multcollnearty multcollnearty Ragnar Frsch (934 perfect exact collnearty multcollnearty K exact λ λ λ K K x+ x+ + x 0 0.. λ, λ, λk 0 0.. x perfect ntercorrelated λ λ λ x+ x+ + KxK + v 0 0.. v 3 y β + β

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan

More information

RELIABILITY ASSESSMENT

RELIABILITY ASSESSMENT CHAPTER Rsk Analyss n Engneerng and Economcs RELIABILITY ASSESSMENT A. J. Clark School of Engneerng Department of Cvl and Envronmental Engneerng 4a CHAPMAN HALL/CRC Rsk Analyss for Engneerng Department

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

WORKSHOP ON THE ANALYSIS OF NEURAL DATA 2001 MARINE BIOLOGICAL LABORATORY WOODS HOLE, MASSACHUSETTS

WORKSHOP ON THE ANALYSIS OF NEURAL DATA 2001 MARINE BIOLOGICAL LABORATORY WOODS HOLE, MASSACHUSETTS Emery. Brown, M.D., Ph.D. WORKSHOP O THE AALYSIS OF EURAL DATA 00 MARIE BIOLOGICAL LABORATORY WOODS HOLE, MASSACHUSETTS A REVIEW OF STATISTICS PART 3: REGRESSIO AALYSIS AD THE GEERALIZED LIEAR MODEL EMERY.

More information

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Statistical analysis using matlab. HY 439 Presented by: George Fortetsanakis

Statistical analysis using matlab. HY 439 Presented by: George Fortetsanakis Statstcal analyss usng matlab HY 439 Presented by: George Fortetsanaks Roadmap Probablty dstrbutons Statstcal estmaton Fttng data to probablty dstrbutons Contnuous dstrbutons Contnuous random varable X

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients ECON 5 -- NOE 15 Margnal Effects n Probt Models: Interpretaton and estng hs note ntroduces you to the two types of margnal effects n probt models: margnal ndex effects, and margnal probablty effects. It

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β Revsed: v3 Ordnar Least Squares (OLS): Smple Lnear Regresson (SLR) Analtcs The SLR Setup Sample Statstcs Ordnar Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals)

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrcs of Panel Data Jakub Mućk Meetng # 8 Jakub Mućk Econometrcs of Panel Data Meetng # 8 1 / 17 Outlne 1 Heterogenety n the slope coeffcents 2 Seemngly Unrelated Regresson (SUR) 3 Swamy s random

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria ECOOMETRICS II ECO 40S Unversty of Toronto Department of Economcs Wnter 07 Instructor: Vctor Agurregabra SOLUTIO TO FIAL EXAM Tuesday, Aprl 8, 07 From :00pm-5:00pm 3 hours ISTRUCTIOS: - Ths s a closed-book

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Empirical Methods for Corporate Finance. Identification

Empirical Methods for Corporate Finance. Identification mprcal Methods for Corporate Fnance Identfcaton Causalt Ultmate goal of emprcal research n fnance s to establsh a causal relatonshp between varables.g. What s the mpact of tangblt on leverage?.g. What

More information

Lab 4: Two-level Random Intercept Model

Lab 4: Two-level Random Intercept Model BIO 656 Lab4 009 Lab 4: Two-level Random Intercept Model Data: Peak expratory flow rate (pefr) measured twce, usng two dfferent nstruments, for 17 subjects. (from Chapter 1 of Multlevel and Longtudnal

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3. Outlne 3. Multple Regresson Analyss: Estmaton I. Motvaton II. Mechancs and Interpretaton of OLS Read Wooldrdge (013), Chapter 3. III. Expected Values of the OLS IV. Varances of the OLS V. The Gauss Markov

More information

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of Chapter 7 Generalzed and Weghted Least Squares Estmaton The usual lnear regresson model assumes that all the random error components are dentcally and ndependently dstrbuted wth constant varance. When

More information