The Occurrence and Timing of Events: The Application of Event History Models in Accounting and Finance Research
|
|
- Ruth Roberts
- 6 years ago
- Views:
Transcription
1 The Occurrence and Tmng of Events: The Applcaton of Event Hstory Models n Accountng and Fnance Research Marc J. LeClere Assstant Professor Department of Accountng School of Busness Admnstraton Loyola Unversty Chcago (O) (F) mlecle@luc.edu September 1999 I wsh to thank Felca LeClere for comments on an earler verson of ths paper.
2 1.0 INTRODUCTION Survval analyss s a statstcal method concerned wth the study of events and the substantve processes that govern the occurrence and tmng of events [Allson, 1984; Blossfeld and Rohwer, 1995]. 1 An event s a qualtatve change that occurs at a gven pont n tme to an ndvdual, organzaton, entty, or socety [hereafter referred to collectvely as ndvdual(s) ]. The substantve process s characterzed by unts changng from one dscrete state to another dscrete state at any pont n tme as a result of tme-constant or tme-dependent covarates [Coleman, 1981]. Researchers n bomedcne, socology, demography, crmnology, engneerng, economcs, and management use the method to study a wde range of problems such as dsease, patent survval, brth, death, marrage, dvorce, mgraton, revolutons, crme, arrests, recdvsm, manufacturng durablty, equpment falure, strkes, unemployment, labor force partcpaton, job changes, promotons, retrements, and mergers. Most accountng research studes nterested n events use qualtatve response models such as probt or logstc regresson. Probt and logstc regresson use a dchotomous dependent varable to dstngush between the occurrence and non-occurrence of an event of nterest ['0-1' or 'yes - no']. These statstcal methods typcally examne the relaton between the probablty of an event and a set of covarates for a sngle year. As an example, consder accountng research n the area of bankruptcy [e.g., see Ohlson, 1980]. Bankruptcy research typcally uses cross-sectonal observatons and the standard research desgn nvolves selectng a sample of bankrupt and non- 2
3 bankrupt frms for a gven year and dstngushng between these frms on the bass of ndependent varables that are thought to nfluence frm bankruptcy. Although bankruptcy studes may focus on the year of bankruptcy, one year pror to bankruptcy, or n -years pror to bankruptcy, the analyss s cross-sectonal and nvolves examnng the relaton between the probablty of bankruptcy and a set of covarates for a specfc pont n tme. Survval analyss, on the other hand, uses longtudnal rather than crosssectonal data. The analyss requres data for ndvduals for a tme perod pror to the occurrence of a gven event. Because survval analyss focuses on the duraton of tme precedng an event, the researcher must stpulate a tme orgn, a tme scale, and an exact defnton of the event that termnates the duraton [Kefer, 1988]. Assume that a researcher was nterested n the duraton of tme precedng the bankruptcy of startup frms. The tme orgn would be the date at whch the frm began operatons. In a large sample of frms, each frm would have a dfferent tme orgn as they most lkely began busness on dfferent dates. The tme scale would be calendar months. The duraton would end when the frm experenced the event termed 'bankruptcy'. Whereas a cross-sectonal study analyzes the relaton between the probablty of bankruptcy and a set of covarates for a specfc year, survval analyss uses longtudnal data from the tme perod precedng bankruptcy because the focus s on the process that governs bankruptcy rather than the observaton of the bankrupt/non-bankrupt state. Survval analyss examnes the effect of changes n the covarates on the duraton of tme precedng the event as well as the probablty that the event wll 3
4 occur. If an underlyng process generates the event, the choce of crosssectonal rather than longtudnal data reles on the assumpton that the process s n statstcal equlbrum [Coleman, 1981]. 1.1 Benefts of Survval Analyss The prmary advantage of survval analyss, relatve to regresson and qualtatve response models such as logstc regresson and probt, les n the manner n whch t handles censored observatons and tme-varyng covarates. Censorng occurs when ncomplete nformaton s avalable on the occurrence of a specfc event. In event hstory studes, ndvduals are n orgn states and are observed for the occurrence of a specfc event such as marrage, death, dvorce, or job termnaton. For example, n studes modelng the tmng of frst brth, women who have not gven brth to a chld are n the orgn state termed nullparous. When a woman gves brth to the frst chld, the event ( frst brth ) has occurred, and the woman s no longer n the orgn state. Tme spent n the orgn state s defned as the duraton of tme precedng the brth of the chld and tme orgn. Tme orgn mght be defned as 0, age of menarche, age of marrage, or date of last contraceptve use. 1.2 Censorng Censorng occurs when knowledge of the tme that the ndvdual spends n the orgn state s ncomplete and the exact duraton of tme ('lfetme') s known for only a porton of a sample. Survval analyss models use methods of 4
5 estmaton (generally ether maxmum or partal lkelhood) that ncorporate nformaton from censored and uncensored observatons to provde consstent parameter estmates [Allson, 1995]. In contrast, statstcal methods such as regresson analyss cannot ncorporate nformaton from censored observatons. The llustraton n Fgure 1 provdes an overvew of the common forms of censorng. Assume that the numbered lnes represent ndvduals [ndvduals 1 through 7]. Observatons are made on hstorcal tme and the perod of observaton begns at t and ends at 1 t 2. A sold lne represents a perod of tme n whch the ndvdual s observed and a dotted lne represents a tme perod where the ndvdual s not observed. The end of each sold lne, f not followed by a dotted lne, represents the occurrence of an event. The end of each dotted lne does not necessarly sgnfy the occurrence of an event. In Fgure 1, ndvdual 1 s not censored. Indvdual 1 entered the orgn state after the start of the observaton perod and experenced the event durng the perod. Indvdual 2 s censored on the rght or rght censored and represents the most common form of censorng. In ths stuaton, the event dd not occur durng the observaton perod and may or may not occur after the observaton perod ends. Observatons are rght censored at L when the knowledge of an ndvdual's lfe s that t s equal to or greater than the value L but the exact value of the observaton s unknown [Lawless, 1982]. T, the tme of event occurrence, s known to be greater than some value L, but the exact occurrence of the event s unknown. As an example, assume that a study examnes a sample of frms to model the process of bankruptcy. Some frms wll experence 5
6 bankruptcy durng the observaton perod. Frms that have not experenced bankruptcy by the end of the observaton perod may or may not experence bankruptcy after the observaton perod. Rght censorng may take one of two common forms. Type I rght censorng (tme censorng) refers to censorng that occurs because observaton s termnated at a preassgned tme pont or date and the ndvdual dd not experence the event up to that pont n tme. Type II rght censorng occurs when the observaton perod s termnated after the occurrence of a specfc number of events. Type II censorng s sometmes referred to as order censorng snce the study s termnated after observng certan order statstcs [Kalbflesch and Prentce, 1980]. Ths form of censorng s qute rare n socal scence research but common n bomedcal research [Allson, 1995]. In terms of estmaton, rght censorng provdes no specal problems n survval analyss snce the termnaton date s determned ndependently of the process under study [Blossfeld and Rohwer, 1995; Kalbflesch and Prentce, 1980]. The ablty of event hstory analyss to handle rght censored observatons s t major advantage over tradtonal statstcal methods [Yamaguch, 1991]. Indvdual 3 s rght censored and ths may be the result of random or nonrandom censorng. Indvdual 3 left the study because some event other than the event of nterest occurred and ndvdual 3 dd not reach the maxmum observaton tme. Indvdual 3 may or may not experence the event after censorng. Subjects may de, move, or leave a research study for any number of unspecfed reasons. 2 The salent ssue s whether the event that caused the 6
7 ndvdual to leave the study s ndependent of the event of nterest or the event s a case of non-ndependent censorng. If ths censorng s random t provdes no estmaton problems for survval analyss methods. However, f the censorng s not random, problems arse due to selectvty bas snce the characterstcs of the ndvduals lost to non-ndependent censorng are often related to the process under study [Blossfeld and Rohwer, 1995; Kalbflesch and Prentce, 1980]. In a study of bankruptcy, frms that merge durng the observaton perod represent ths type of censorng. If mergers are random, no problem exsts. But f fnancally dstressed frms demonstrate a propensty for mergers (as one of a number of alternatves to deal wth fnancal dstress), selectvty bas arses snce the observaton contans nformaton related to the process of bankruptcy. Indvdual 4 s fully censored on the left. The ndvdual entered and left the orgn state pror to the perod of observaton and the only knowledge s that the event occurred before a gven pont n tme. T, the tme of event occurrence, s known to be less than some value L, but the exact occurrence of the event s unknown. Fully censored observatons are qute dffcult to ncorporate nto survval analyss because the duraton of tme n the orgn state s unknown [Blossfeld and Rohwer, 1995]. Sample selecton bas s dffcult to correct because the bas s a functon of the duraton of tme n the orgn state and the event that occurred to generate the mssng data s unknown [Yamaguch, 1991]. Fully left censored observatons result n based parameter estmates. Consder frms that experence bankruptcy before the observaton perod and therefore are not part of the sample. If these frms demonstrate a 7
8 predsposton for bankruptcy (e.g., startup frms that fal early and frequently), then the nformaton these observatons provde about the process s lost and the parameter estmates are bas. Indvdual 5 s partally censored on the left. The ndvdual has already spent tme n the orgn state at the start of the observaton perod and the event occurs durng the observaton perod. If there s no nformaton as to the start of the orgn state, the value of duraton at t 1 s unknown. Survval analyss cannot properly use nformaton from ths observaton snce the begnnng of the observaton perod cannot be substtuted for the begnnng of the orgn state [Yamaguch, 1991]. Sometmes t s possble to observe the duraton of tme spent n the orgn state at t 1 by ether examnng retrospectve nformaton or reconstructng data. But ths ntroduces a sample selecton bas because nformaton pror to t 1 s known for some observatons (ndvdual 5) but not other observatons (ndvdual 4) and early entry nto the orgn state and a short duraton untl the occurrence of the event ncrease the lkelhood that the observaton wll not appear n the study perod [Blossfeld and Rohwer, 1995]. Survval analyss can address ths type of censorng by evaluatng only that part of the duraton n the observaton perod [Blossfeld and Rohwer, 1995; Yamaguch, 1991]. The analyss examnes the survval of the ndvdual from t 1 to the occurrence of the event condtoned on the fact that the ndvdual survved untl t 1. Indvdual 6 s completely rght censored. The entrance and ext from the orgn state occur after the observaton perod. The duraton of tme untl the 8
9 occurrence of the event s unknown for ths ndvdual. Unless the duraton of the orgn state s ndependent of the start of the orgn state, survval analyss must take nto account the effect of the start of the orgn state on duraton n order to elmnate sample selecton bas [Yamaguch, 1991]. Fnally, ndvdual 7 represents an ndvdual that s left and rght censored. The ndvdual s n the orgn state at the start and end of the observaton perod but there s no nformaton about the start or end of tme spent n the orgn state. 1.3 Tme-varyng Covarates The second major ssue whch survval analyss addresses concerns the value of covarates over the observaton perod. Covarates can be tme-nvarant or tme-varyng (tme-dependent). Tme-nvarant covarates do not change durng the perod that precedes the occurrence of an event. For nstance, n the case of ndvduals, some covarates such as sex and blood-type never change over tme. Other covarates mght change over tme, but the change s so nsgnfcant that the covarate may be regarded as tme-nvarant. As an example, f a survval analyss model used the nflaton rate as a covarate, although the rate mght ncrease or decrease over tme, the change mght be so slght that the nflaton rate could be regarded as constant. On the other hand, tme-varyng covarates change durng the course of the observaton perod. Covarates such as ncome, job status, educaton, famly status, and wealth generally do change over tme. 9
10 When modelng the duraton of tme that precedes the occurrence of an event, the value of a covarate along the tme path effects the probablty of event occurrence. The major contrbuton of survval analyss methods n ths area s that the estmaton procedures consder changes n the value of covarates over tme. Cross-sectonal studes only examne the level of a varable at a gven pont n tme. Cross-sectonal analyss uses a snap-shot methodology because t only vews the ndvdual at a snap-shot n tme. Survval analyss, relyng on longtudnal data rather than cross-sectonal data, ncorporates changes n the covarates over tme n the estmaton process. Tme-dependent covarates can be classfed as nternal and external [Blossfeld and Rohwer, 1995; Kalbflesch and Prentce, 1980]. External covarates are not drectly nvolved wth mechansm that causes falure. They may be fxed or vary wth tme but the value of an external covarate s not a functon of the tme pror to the event although t may affect the duraton. Internal tme-dependent covarates are the result of a stochastc process generated by an ndvdual and provde nformaton about the survval tme of the ndvdual [Blossfeld, et al., 1989; Kalbflesch and Prentce, 1980]. 2.0 SURVIVAL DISTRIBUTIONS Survval analyss models the probablty of a change n a dependent varable Y t from an orgn state j to a destnaton state k as a result of causal factors [Blossfeld and Rohwer, 1995]. The duraton of tme between states s referred to as event (falure) tme. Event tme s represented by a non-negatve 10
11 random varable T that represents the duraton of tme untl the dependent varable at tme t 0 (Y t0 ) changes from state j to state k. 3 Dfferent survval analyss models assume dfferent probablty dstrbutons for T. Regardless of the probablty dstrbuton of T, the probablty dstrbuton can be specfed as a cumulatve dstrbuton functon, a survvor functon, a probablty densty functon, or a hazard functon. The cumulatve dstrbuton functon s t F( t) = P( T t) = f ( x) dx. [1] 0 It represents the probablty that T s less than or equal to a value t and denotes the probablty that the event occurs before some tme t. F( t) s also referred to as the lfetme dstrbuton or falure dstrbuton [Elandt-Johnson and Johnson, 1980]. If T represents the frst occurrence of an event (e.g., age at onset of dsease or age at frst marrage) then F( t ) represents the dstrbuton of event or falure tme. The survval functon (sometmes referred to as the relablty functon, cumulatve survval rate, or survvorshp functon) s the complementary functon of F( t) and s represented as S( t) = P( T > t). [2] It represents the probablty that the event tme s greater than a value t. The survval functon ndcates that survval tme s longer than t (the event has not occurred at tme t ) or that the ndvdual survves untl tme t. The survval functon s a monotone non-ncreasng left contnuous functon of tme t wth S ( 0 ) = 1 (snce event tme cannot be negatve) and S( ) lm F( t) = = 0. As tme t 11
12 elapses, the functon approaches 0, snce the event (e.g., death) wll occur for all ndvduals. In event hstores t s more common to use the survval functon rather than the cumulatve dstrbuton functon because t s more ntutve to thnk of ndvduals survvng an event to a certan pont n tme rather than not survvng the event [Blossfeld and Rohwer, 1995]. The probablty densty functon s defned as f ( t) ( t T < t + t) df( t) P S( t) lm = = [3] t 0 t dt dt = and t represents the uncondtonal nstantaneous probablty that falure occurs n the perod of tme from t to t + t per unt wdth t. Before takng the lmt, P( t T < t + t) represents the probablty that the event occurs n the tme perod between t and t and f (t) s proportonal to ths probablty as the nterval becomes very small. The densty functon s also known as the uncondtonal falure rate or the curve of deaths. The hazard functon s represented as ( t T < t + t T t) f ( t) = t F( t) P λ ( t) = lm [4] t 0 1 and t defnes the nstantaneous rate of falure at T = t condtonal upon survvng to tme t. 4 The hazard functon quantfes the probablty of falure for ndvduals that have survved untl tme t and effectvely removes ndvduals who have experenced the event pror to t from consderaton. The hazard functon s sometmes referred to as a hazard rate because t s a dmensonal quantty that has the form number of events per nterval of tme [Allson, 1995]. It provdes a 12
13 local, tme-related descrpton of the behavor of the process over tme by provdng a measure of the rsk of falure per unt of tme and represents the propensty of the rsk set at tme t to change from the orgn state to the event state ( Y j Y k) t = = [Blossfeld and Rohwer, 1995]. Because tme s contnuous, t the probablty that an event wll occur exactly at tme t s 0 so the hazard functon s expressed as the probablty that an event wll occur n the small nterval between t and t [Allson, 1995]. The hazard functon s not a condtonal probablty because t can be greater than 1. The best approxmaton of the condtonal probablty P( t T < t T t) s λ( t) t for very small values of t [Blossfeld, et al., 1989]. The hazard functon provdes nformaton concernng future events f the ndvdual survves to tme t n that the recprocal of the hazard functon 1/ λ( t ) denotes the expected length of tme untl the event occurs [Allson, 1995]. The hazard functon may ncrease, decrease, or reman constant over tme dependng upon the underlyng process. Because the cumulatve dstrbuton functon, survvor functon, probablty densty functon, and hazard functon all descrbe a contnuous probablty dstrbuton, each can be defned n terms of the other [Allson, 1995; Kalbflesch and Prentce, 1980; Lee, 1980]. If you know F( t ), then equatons [1] and [2] provde ( ) ( ) S t = 1 F t [5] because survval and non-survval probabltes add to 1 and equaton [4] provdes 13
14 ( ) λ t = ( ) ( ) f t S t. [6] If S( t ) s known, f ( t) can be determned snce and ( ) ( ) f t = d dt F t [7] ( 1 S t ) S ( t) ( ) ( ) f t d = = dt [8] and equaton [6] provdes λ( t ). If λ( t) s known, then substtutng equaton [8] nto equaton [6] provdes ( ) λ t ( ) ( ) S t d = = log S t S t dt e ( ) [9] and ntegraton provdes ( ) ( ) S t t = exp λ x dx 0 [10] and equatons [6] and [10] provde that t f ( t) = ( t) λ exp λ( x) dx 0. [11] 3.0 NONPARAMETRIC METHODS OF ESTIMATION The most basc approach to descrbe the dstrbuton of survval tmes conssts of nonparametrc descrptve methods. Nonparametrc methods make no assumpton about the dstrbuton of event tmes (T ) but nstead focus on provdng descrptve nformaton of the survvor functon of event tmes [Lee, 14
15 1980]. Nonparametrc or dstrbuton-free methods for analyzng survval data have been favored by bostatstcans [Allson, 1995]. Although nonparametrc methods of estmaton are used less frequently than parametrc and semparametrc methods, the methods are approprate when a theoretcal dstrbuton s not known. Pror to fttng a theoretcal dstrbuton, nonparametrc methods are useful for prelmnary examnaton of data, suggestng functonal form, and assessng homogenety [Allson, 1995; Kefer, 1988]. The two common technques for nonparametrc estmaton of the survvorshp functon are the lfe table method and the Kaplan-Meer estmator. 3.1 Lfe Tables The lfe table method s among the most common methods to analyze survval data. It s used n demographc and nsurance studes and s sometmes referred to as the actuaral method [Allson, 1995]. A lfe table s bascally a frequency table desgned to deal wth censored data [Lawless, 1982]. Lfe tables provde estmates of the survval experence of a cohort and when the cohort s a random sample from the populaton, the lfe table provdes estmates of survval propertes of the populaton [Lawless, 1982]. The prmary functon of a lfe table s to summarze survval data grouped nto ntervals and to provde estmates of the survvor functon, the densty functon, and the hazard rate. It s desgned for stuatons where only the nterval n whch falure or censorng occurred s known as the actual falure or censorng tme s unknown [Kalbflesch and Prentce, 1980]. 15
16 The constructon of a lfe table begns wth the observaton of lfetme t or censorng tme L for each member of a random sample. Because the exact occurrence of t or L s not known, the observatons are grouped nto ntervals. 5 The tme axs of a lfe table s dvded nto q +1 ntervals such that [ ) I = a, a, k = 1,..., q + 1 a = 0, a = [12] k k 1 k 0 q+ 1 and a q s the last observable tme pont [Blossfeld, et al., 1989]. Assumng that the dstrbuton of lfetmes have a survvor functon, S( t ), then we can defne ( ) ( ) P = S a = P T a [13] k k k as the probablty that an ndvdual survves beyond a specfc nterval I k, and ( ) p = P T I T I = k k k 1 Pk P k 1 [14] as the probablty that an ndvdual survves beyond a specfc nterval condtoned on the probablty on survvng untl that nterval. The hazard functon of the kth nterval s ( [ a a ) T a ) λ [15] k = P T k 1, k k 1 and t s the probablty of event occurrence n the kth tme nterval condtoned upon reachng that nterval [Blossfeld, et al., 1989]. Defnng equaton [14] n terms of equaton [15], p k = 1 λ [16] k allows P k to be rewrtten as ( ) ( ) ( ) P = P T a T a... P T a T a P T a = p... p [17] k k k k 1 16
17 Restated, the probablty of survvng past a gven nterval I k s the product of the condtonal probabltes of survvng past a gven nterval gven survval to the start of the nterval [Blossfeld, et al., 1989; Lawless, 1982]. Estmaton of a lfe table nvolves gatherng a random sample or grouped data from some populaton and ether observng event T durng lfetme t or censorng tme L. The observatons are grouped and placed n the kth nterval correspondng to the occurrence of the event. As a result of ths, the data conssts of n ndvduals that begn the study, d k events n the kth nterval, and w k censored observatons n the kth nterval. The rsk set, R k, conssts of all ndvduals who are at rsk for the occurrence of the event at each pont n tme (the begnnng of the k th nterval) such that R = 1 n and R = R d w for k k 1 k 1 k 1 k = 2, K, q + 1 [Blossfeld, et al., 1989]. If there s no censorng n the kth nterval, the estmate of the hazard rate λ^ k s smply d k R k. It s the number of events occurrng n an nterval dvded by the rsk set for that partcular nterval and t represents the condtonal probablty of the event occurrng n a specfc nterval. However, f censorng s present n a specfc nterval ( w k > 0 ), ths estmate underestmates the actual hazard because some of the censored observatons mght have experenced the event durng the nterval [Blossfeld, et al., 1989]. An arbtrary adjustment s made by assumng the censored observatons are at rsk for half of the nterval and the estmator s λ^ = d k Rk wk 2. [18] 17
18 Once an estmate of the hazard functon s obtaned, estmates of p k and P k are obtaned from p k ^ ^ ^ ^ ^ = 1 λ k and P = pk K p 1. The estmated probablty of an event occurrng n the k th nterval s ( [, )) for the k th nterval s ^ ^ ^ P = T ak 1 ak = P k 1 P k and the densty ^ ^ ^ ^ ^ P k 1 Pk P k 1 λ k f k = = [19] hk hk where hk = ak ak 1 characterzes the length of the k th nterval [Blossfeld, et al., 1989]. Standard errors can be approxmated for ^ λ, ^ P, and ^ p and wth large samples, the values of ^ λ, ^ P, and ^ p dvded by the standard errors are assumed to be normally dstrbuted [Lee, 1980; Yamaguch, 1991]. By dvdng the sample nto subsets, ^ λ, ^ P, and ^ p can be estmated for each subset and tests can be made to determne f the estmates for the subsets are smlar or dfferent. Although the popularty of estmatng survval curves wth lfe table based methods has fallen n the last twenty years, the curves are useful for prelmnary evaluaton of data and evaluatng the ft of regresson models [Allson, 1995; Teachman, 1983]. Lfe tables for populaton subgroups provde survvorshp functons that allow an assessment of exogenous varables to nclude n more complex analyses [Teachman, 1983]. However, the prmary crtcsm of the lfe table method s that the choce of ntervals s arbtrary. The arbtrarness of the ntervals leads to arbtrary results and uncertanty over nterval choce [Allson, 1995]. Another crtcsm s that the relablty of the method depends upon a large number of epsodes [Blossfeld and Rohwer, 1995]. 18
19 3.2 Kaplan-Meer Estmator In a lfe table, the exact occurrence of t or L s not known, but ths s not always the case. Sometmes t or L s known wth certanty and observed for each of all N observatons and the product-lmt or Kaplan-Meer method can be used for nonparametrc estmaton of the survvorshp functon. For the perod under observaton, assume tme begns at t 0, ends at t e, and the perod ( 0,t e ] s dvded nto M ntervals ( 0, t 1 ], ( t 1, t 2 ],, ( t m 1, te ] K where the ntervals are so small that the probablty of more than one event n an nterval s almost nonexstent [Elandt-Johnson and Johnson, 1980]. The event tmes are ordered such that t 1 < t 2 <K t e where e n [Blossfeld, et al., 1989]. The rsk set at any gven pont s R and t represents the number of ndvduals or frms that survved untl tme t (or n actualty, survved untl the moment of tme just before t ) [Elandt- Johnson and Johnson, 1980]. Defnng and 1 ( t, t ] 1 f death occurs n 1 φ = [20] 0 otherwse q as the condtonal probablty of death n ( t, ] s alve at 1 φ φ 1 t, then L ( q ) ( ) p R 1 provdes 1 t gven that the ndvdual ( t, t ] 1 ^ f event occurs n 1 f λ 1 = R [21] otherwse 0 as the unbased maxmum lkelhood estmator of the hazard functon [Elandt- Johnson and Johnson, 1980]. The probablty of survvng beyond the current perod s 19
20 ^ p 1 R 1 ^ f event occurs at t λ = R, [22] otherwse 1 = 1 1 and the survval functon s ^ where S( ) ^ ( ) ^ ^ ^ ^ ^ S t = P = p p K p, [23] = P0 = 1 [Elandt-Johnson and Johnson, 1980]. Because an event may not occur n some tme nterval t, p = 1 1 and tme dvsons wthout an event do not enter nto the estmate of the survvor functon. The nterest s therefore only on the ordered tme perods where an event occurs, t < t K 2 < K< t j < t K, where K sgnfes the number of events at a dstnct tme pont [Elandt-Johnson and Johnson, 1980]. Defnng R j as the rsk set at t j, then the product-lmt estmator or Kaplan-Meer estmator of S( t ) s 1 for t < t 1 ^ R j 1 S( t) = for t < 1 t t+ 1, = 1,2, K K 1. [24] j= 1 R j for t t k K R j 1 j= 1 R j Equaton [24] states that for any tme perod t before an event occurs, the probablty of survvng to t 1 s equal to one. For any other tme perod except where t t k, the survvor functon s equal to the product of the condtonal probabltes of survvng to R j gven survval to R j 1. If the tme perod s greater than or equal to t k, the survvor functon depends on the nature of censored observatons. If there s no censorng (no events occur n or after k t ) then S( t) 20
21 equals 0. But f there are rght censored observatons, S( t) s undefned. As wth the lfe table method, t s possble to compare survvor functons for sub-groups. Lke the lfe table method, estmates of the standard error of the survvor functon allow the calculaton of confdence ntervals. Addtonally, several test statstcs exst for comparng survvor functons generated by the product-lmt estmator [e.g., see Lee, 1980 and Blossfeld and Rohwer, 1995]. 4.0 PARAMETRIC METHODS OF ESTIMATION Although nonparametrc methods of estmaton provde nformaton on the survval functon, the prmary dsadvantage of nonparametrc estmaton s that nformaton from these models reles a large part on comparsons among subgroups of sample data. As an ncreasng number of comparsons results n an ncreasng number of subgroups, problems arse wth subgroups contanng a small number of cases, complex nterpretatons among subgroups, and loss of nformaton due to the groupng of subgroups accordng to qualtatve characterstcs [Blossfeld and Rohwer, 1995]. Consequently, survval analyss n the last twenty years has shfted ts focus from non-parametrc to parametrc methods of analyss. 4.1 Falure Tme Dstrbutons The prmary purpose of survval analyss s to descrbe the occurrence and tmng of events. Consequently, hazard functons are used to descrbe the probablty dstrbuton of the tme of event occurrence. However, because 21
22 dfferent processes generate unque patterns of survval data, many dstrbutons are used to model the dstrbuton of falure tme. As a result, f the decson has been made to use a parametrc method of estmaton, one of the key decsons that needs to be made s to choose a probablty dstrbuton that best descrbes the underlyng process that generates the observed duraton data. Ths secton provdes a revew of the exponental, Webull, and log-normal probablty dstrbutons. These are the more common probablty dstrbutons that form the bass for regresson models Exponental Dstrbuton The exponental dstrbuton s the smplest and most wdely used dstrbuton n survval studes [Lee, 1980]. The exponental dstrbuton s characterzed by a sngle parameter and t has a constant hazard functon over T. The falure rate s ndependent of t and the chance of falure over any gven tme nterval s the same. Ths s referred to as the memoryless or lack of memory property n that tme does not affect survval [Kalbflesch and Prentce, 1980; Lee, 1980]. Large (small) values of λ ndcate hgh (low) rsk and short (long) survval [Lee, 1980]. Because tme does not affect falure, the occurrence of the event s a random event. The relevant functons for the exponental dstrbuton are ( ) λ t = λ λ > 0, t 0, [25] ( ) ( ) S t = exp λt t 0, [26] 22
23 ( ) ( ) f t = λ exp λt t 0, λ 0, and [27] 0 t < 0 ( ) ( ) F t = 1 exp λ t t 0. [28] Due to smplcty, the exponental dstrbuton was the frst wdely used dstrbuton model but the restrctveness of a constant hazard functon has led to cautous use n recent years [Lawless, 1982]. Fgures 2, 3, and 4 show the hazard functon, the survvorshp functon, and the probablty densty functon for an exponental dstrbuton wth a constant hazard rate of 1. The fgures show a constant hazard rate ( λ = 1), a survvorshp functon that approaches 0 as tme elapses, and a probablty densty functon that ndcates an uncondtonal probablty of falure that also approaches 0 as tme elapses. 4.3 Webull Dstrbuton The Webull dstrbuton s a generalzaton of the exponental dstrbuton that allows for a power dependence of the hazard on tme. It has broader applcatons than the exponental dstrbuton because t does not assume a constant hazard rate [Lee, 1980]. It s probably the most wdely used dstrbuton and s often used to test the lfe of manufactured tems [Blossfeld, et al., 1989; Lawless, 1982]. The relevant functons for the Webull dstrbuton are γ 1 ( ) = λγ ( λt) t 0, λ, γ > 0, λ t [29] ( t ) ( ) ( ) S t = exp λ γ, [30] γ ( ) γ ( ) = λγ ( λ ) 1 exp ( λ ) f t t t, and [31] 23
24 F t ( t ) ( ) ( ) = 1 exp λ γ. [32] The hazard functon s monotone decreasng (ncreasng) for values of γ <1 ( γ > 1 ) and reduces to the exponental dstrbuton for γ = 1 [Kalbflesch and Prentce, 1980]. Consequently, the dstrbuton can be used to model survval dstrbutons of populatons wth ncreasng, decreasng, and constant rsk [Lee, 1980]. The Webull dstrbuton s characterzed by the two parameters, λ and γ. The value of γ s called the shape parameter because t determnes the shape of the dstrbuton [Lawless, 1982; Lee, 1980]. Typcal values of γ range from 1 to 3 [Lawless, 1982]. The value of λ determnes the scale of the dstrbuton and t s called the scale parameter [Lee, 1980]. Fgures 2, 3, and 4 show the hazard functon, the survvorshp functon, and the probablty densty functon for a dstrbuton wth λ = 2 and alternatve values of γ =.5, γ =1, and γ = 3. When γ =1 the functon reduces to the exponental functon. When γ =.5 the survval dstrbuton has decreasng rsk and when γ = 3 t has ncreasng rsk. 4.4 Log-normal Dstrbuton The log-normal dstrbuton s a dstrbuton of a varable whose logarthm follows the normal dstrbuton [Lee, 1980]. The log-normal dstrbuton s often used to model the dstrbuton of survval tmes of dseases such as Hodgkn s, leukema, and lung cancer [Lawless, 1982; Lee, 1980]. The dstrbuton assumes that f Y = log T s normally dstrbuted wth mean µ and varance σ 2, then 24
25 lfetme T s log-normally dstrbuted wth scale parameter µ and shape parameter σ 2 2 (wrtten as T = Λ ( µ, σ ) 1 ( ) 2π exp σ ). As the probablty densty functon of Y s y µ σ the probablty densty functon of ( ) ( ) f t T = exp Y s 1 = 1 2 exp ( 2π ) σ t The other relevant densty functons are and < y <, [33] log t µ σ 2, t > 0. [34] log t µ F( t) = Φ, [35] σ log t µ S( t) = 1 Φ, [36] σ h( t) = f ( t) S( t). [37] Fgures 2, 3, and 4 show the dstrbutons for a log-normal dstrbuton where T = Λ( 0,. 3). 7 Note that the probablty densty functon s postvely skewed and that the hazard functon has a value of 0 at t = 0, ncreases to a maxmum value, and then approaches 0 as t becomes large. Greater values of σ 2 would ncrease the skewness and dfferent values of µ would change the scale on the x axs. 25
26 5.0 REGRESSION MODELS The survval dstrbutons descrbe earler, such as the exponental and Webull, are useful for modelng the survval experence of a homogeneous populaton but they do not consder the effect of covarates on the survval dstrbuton. It s possble to use these dstrbutons n a bvarate model to perform survval analyss when the only nformaton used s the tme of the event. Although such dstrbutons may adequately model the process of nterest, a large amount of research focuses on heterogeneous populatons. Because the populaton s heterogeneous, covarates exst for each ndvdual or subject n the populaton. As a result, t becomes an nterestng queston to estmate causal models that determne the effect of such covarates, as well as tme, on the hazard rate [Blossfeld, et al., 1989; Kalbflesch and Prentce, 1980; Lawless, 1982]. Ths s accomplshed through the use of regresson models n whch the hazard rate or tme to falure s the fundamental dependent varable. When nonparametrc methods of estmaton are used, data gatherng nvolves obtanng falure or duraton data for each subject. Expandng the method of estmaton to a parametrc method requres that covarates be gathered for each subject n the sample. Upon completon of data gatherng, there s a vector of covarates where = ( x..., ) x 1, x s for a process wth falure tme T > 0. The basc ssue s to specfy a model for the dstrbuton of t gven x and ths can be accomplshed wth parametrc or sem-parametrc models. Parametrc models use dstrbutons such as the exponental and Webull whle sem-parametrc models make no assumptons about the underlyng dstrbuton. 26
27 5.1 Maxmum Lkelhood Estmaton Parametrc survval models are estmated usng the maxmum lkelhood method. Maxmum lkelhood estmaton s used because t produces estmators that are consstent, asymptotcally effcent, and asymptotcally normal. If data s gathered for a sample of n ndvduals ( 1, K, n) =, the data wll consst of t, the tme of the event (or f the observaton s censored, the tme of censorng), an ndcator varable, δ, representng the presence ( δ = 0) or absence ( = 1) δ of censorng, and a vector of covarates, x = x K 1 x k. In the absence of censored observatons, the probablty of observng the entre data s the product of the probabltes of observng the data for each specfc ndvdual. Representng the probablty of each observaton by ts probablty densty functon provdes the lkelhood functon L = f ( t ) n = 1, where L represents the probablty of the entre data. If censorng s presence, then the lkelhood functon becomes n δ [ ( )] ( ) [ ] L = f t S t = 1 1 δ. The lkelhood functon effectvely combnes uncensored and censored observatons n that f an ndvdual s not censored, the probablty of the event s f ( t ), and f the ndvdual s censored at t, the probablty of the event s S ( t ), the survvorshp functon evaluated at t. Takng the natural log of L, the objectve s to maxmze the expresson log n ( L) = δ ln f ( t ) + ( 1 δ ) ln S( t ) = 1 n = 1. Once the approprate dstrbuton has been specfed, the process reduces to usng a numeratve method such as the Newton-Raphson algorthm to solve for the parameters. 27
28 5.2 Parametrc Models Parametrc models of survvor functons are referred to as accelerated falure tme (AFT) models. Gven an underlyng process that generates event tmes T > 0, accelerated falure tme models specfy the relaton between event tme, T, or the hazard rate, ( t) λ, and a vector of covarates, x = x, K 1, x. The k dependence of falure tme or the hazard rate on explanatory varables s accomplshed by specfyng parameters of probablty dstrbutons as a parametrc functon of the covarates [Namboodr and Suchndran, 1987]. The general form of the parametrc models s log T = β + β x + K + β x + σω [38] or * * * ( t) β 0 + β1 x 1 + β k xk k k log λ = K +. [39] Equaton [38] uses survval tme as the dependent varable and equaton [39] uses the hazard rate. Both models make use of a log transformaton n order that event tme and the hazard rate have a postve value. For each ndvdual, T s event tme, λ s the hazard rate, kth covarate. Estmated parameters are σ and ω s a random dsturbance term, and x k s the β k [Allson, 1995]. Alternate parametrc models change the value of σ and dfferent values of σ mply alternate dstrbutons of ω as well as T. Equaton [39] does not have a dsturbance term because of random varaton between λ ( t) and T. If two ndvduals had the same values for each of the covarates and, therefore, the same hazard rate, they would not have the same event tme [Allson, 1995]. 28
29 5.3 Exponental Regresson Model The exponental regresson model s a generalzaton of the exponental dstrbuton [see equatons [25], [26], [27], and [28]] where the falure rate s a functon of covarates rather than a constant parameter λ. As such, t s approprate when subjects have constant hazard functons that can be expressed as a functon of covarates [Blossfeld, et al., 1989; Lee, 1980]. Recall that the exponental dstrbuton specfes that λ ( t ) = λ model, the hazard functon s wrtten as λ ( t x) = λ( x). Generalzng to a regresson,. The hazard rate s stll a constant characterzed by an exponental dstrbuton but the hazard depends on covarates x. Specfyng the hazard rate as the functon of the covarates and a set of parameters where λ( t x) = λc( xβ ),, the functonal form x chosen, and the hazard rate s represented as λ ( t x) = λe x c = e s generally ; [Kalbflesch and Prentce, 1980]. Takng the log of each sde of the equaton, the exponental regresson model takes the form of equaton [39] whereby ( t) = β 0 + β1 x 1 + K β k xk log λ +, [40] log λ = β 0 0, and the log of the hazard rate s a seen as a lnear functon of the covarates. Alternatvely the exponental regresson can model event tme as a functon of the covarates. Gven that falure tme follows an exponental dstrbuton wth the probablty densty functon n equaton [27], the probablty dstrbuton of Y log( T ) = follows an extreme value dstrbuton wth the probablty 29
30 densty functon gven by ( y) ω ω e f = exp where ω θ = y and λ = exp( θ ) [Namboodr and Suchndran, 1987]. Rewrtng reparameterzng θ as xβ provdes the model ω = y θ as y = θ + ω and * * * logt β + β x + K + β x + ω. 8 [41] = k k The regresson model n ths form states that the log of event tme s a lnear functon of the covarates. The relaton between the coeffcents n equatons [40] and [41] s β = [Allson, 1995]. * β 5.4 Webull Regresson Model The prmary shortfall of the exponental regresson s the restrctve assumpton of a constant hazard rate. Hazard rates generally rse or fall over tme dependng on the event of nterest but they are rarely constant for long duratons. Consequently, accelerated falure tme models often assume an underlyng dstrbuton other than the exponental dstrbuton. One popular choce s the Webull regresson model. The Webull dstrbuton s selected because ts flexblty allows t to descrbe many patterns of survval data [Namboodr and Suchndran, 1987]. Gven the hazard functon n equaton [29], the functon can be rewrtten wth ether the shape or scale parameter specfed as a functon of covarates although the general conventon transforms only the scale parameter [Elandt-Johnson and Johnson, 1980; Namboodr and Suchndran, 1987]. The regresson model agan can take two forms although the more common form of the model s stated n equaton [38]. The model specfes a standard extreme-value dstrbuton for ω but allows 0 < σ. Ths mples a 30
31 Webull dstrbuton for T and allows the hazard to ncrease or decrease over tme. 5.5 Sem-Parametrc Models Although parametrc models are an mprovement over lfe tables and the Kaplan-Mejer estmator, they stll have lmtatons. Foremost among these problems are the necessty to specfy the behavor of the hazard functon over tme, fndng a model wth an approprate shape f the hazard functon s nonmonotonc, and a cumbersome estmaton process when the covarates change over tme [Allson, 1984]. 5.6 Cox's Proportonal Hazards Model The dffcultes encountered wth the parametrc models are resolved wth the proportonal hazards models. 9 The proportonal hazards model s represented as λ ( ) ( ) t xβ = λ 0 t e. [42] The model states that the hazard rate for any ndvdual s the product of an arbtrary unspecfed baselne hazard rate ( λ 0 ( t) ) and an exponentated set of covarates. It s ths lack of specfcty of a base-lne hazard functon that makes the model sem-parametrc or dstrbuton-free. If a specfc form were specfed for λ( t ), a parametrc model would result. ( ) λ 0 t may be thought of as the hazard functon for an ndvdual that has a value of 0 for each of the covarates and for xβ whom e = The regresson model s wrtten as 31
32 where ( t) λ ( t) ( t) = α( t) + β1x 1 + K β k xk log λ +. [43] α = log 0 [Allson, 1984]. The model s called the proportonal hazards model because t has the property that dfferent unts have hazard functons that are proportonal [Lawless, 1982]. Ths means that the rato of the hazard functon for two unts wth ndependent covarates does not vary wth t. For two ndvduals, and j, equaton [53] can be expressed as the rato of two hazard functons such that λ λ ( t) ( t) j = e ( x x j ) + + βk ( xk x jk ) β1 1 1 K. [44] The hazard for any ndvdual s a fxed proporton of the hazard of any other ndvdual at any pont n tme. The unqueness of the proportonal hazards model s the manner n whch the β parameters are estmated n the absence of knowledge of λ( t ). Cox [1972] referred to ths estmaton procedure as the method of partal lkelhood. The method of partal lkelhood begns by assumng that there s a group of ( ) ndvduals, R t( ), that are at rsk of falure just before the occurrence of t ( ). If only one falure occurs at t ( ), the condtonal probablty that the falure occurs to ndvdual, gven that ndvdual has a vector of covarates x, s represented by λ( t( ) x( )) λ( t( ) xl ) = λ e l R( t( ) ) l R t( ) x β x β 0 = xl β xl β λ0e e ( ) l R( t( ) ) e. [45] Equaton [45] s the hazard functon for ndvdual at a specfc pont n tme, t ( ), dvded by the sum of the hazard functons for all ndvduals n the rsk set just 32
33 before the occurrence of tme t ( ). Because λ 0 s common to every term n the equaton t s elmnated. The partal lkelhood functon s obtaned by takng the product of equaton [45] over all k ponts n tme such that L ( β ) = k e e = 1 l R xβ xl β δ. [46] Equaton [46] does not depend on λ o ( t) and can be maxmzed to provde an estmate of β^ that s consstent and asymptotcally normally dstrbuted [Kalbflesch and Prentce, 1980; Lawless, 1982; Namboodr and Suchndran, 1987]. Although the proportonal hazards model does not requre the specfcaton of a hazard functon, t does not provde for tests about the shape of the hazard functon [Allson, 1995]. Ths lmtaton s overcome wth the use of a pecewse exponental model. The dea behnd the pecewse exponental model s that the tme scale s dvded nto ntervals. Wthn each nterval, the hazard s constant but the hazard s allowed to vary across tme ntervals. The tme scale has J ntervals and the cutponts are defned as a, 0, a1, K aj wth 0 = 0 a and a J =. Each ndvdual has a hazard functon of the form X ( t) = λe for a j 1 t < a j λ [47] or where α j j ( ) = α j + log λ t X [48] = log λ [Allson, 1995]. Ths allows the ntercept to vary across ntervals. 33
34 5.7 Other Issues The models dscussed up to ths pont assumed that the process of nterest contaned only a sngle, nonrepeatable event and two states (orgn and destnaton). But t s not necessary that only one destnaton state exst. Indvduals mght transton to multple end states. In ths stuaton, the model s called an event hstory model wth competng rsks. In a stuaton wth competng rsks, a state-specfc hazard rate s defned for each state and s represented as ( t T < t + t, J = j T t) P λ t ( t) = lm, j = 1, 2, K, k. [49] t t A comparson of equaton [49] to equaton [4] shows that the only dfference s the ncluson of J = j. Ths represents the condtonal probablty that the event occurs between t and t and the event s type j, gven that the event had not occur before tme t [Allson, 1984; Blossfeld, et al., 1989]. The overall hazard rate s ( t) λ ( t) λ = j [50] j and t s the sum of all the state-specfc hazards [Allson, 1984; Blossfeld, et al., 1989]. In a competng rsks model, the ndvdual s at rsk to experence one of several events and the nterest s on the relatve rate assocated wth each event as well as the effect of the covarates on these rates. The earler models also assumed that the destnaton state was absorbng. Ths s certanly the case when the event of nterest s death. Indvduals de and enter the state known as death. The death state s absorbng because there s no returnng from the state. But not all states are absorbng. For ths stuaton, mult-epsode or repeatable event models are approprate. The most common 34
35 case would be where one specfc event occurred repeatedly. Assume that an ndvdual experences an event and the tme that the event s experenced s represented by nonnegatve random varables 0 < T < T < K < T. The duraton 1 2 k between events or the length of an epsode s represented by V k = Tk Tk 1 for k = 1,2,K. For each epsode there s a vector of covarates for the ndvdual. Followng Blossfeld, Hamerle, and Mayer's exposton [Blossfeld, et al., 1989], the hazard rate for the frst epsode s ( t T < t + t T t, x ) 1 P λ ( t x1 ) = lm. [51] t 0 t The hazard rate of the second epsode s ( t T < t + t T t, H ) 2 P 2 2 x, 1 λ ( t x 2, H1 ) = lm 2 [52] t 0 t and may depend on H1 whch contans the prehstory of the process. The hazard rate of the kth epsode s ( t T < t + t T t,, H ) k P k k x k 1 λ ( t x k, H k 1 ) = lm k. [53] t 0 t At the occurrence of the kth epsode, the prehstory s { t x,, t, } H K. =, k k 1 x k-1 The model states that f an ndvdual experences multple epsodes of a sngle event, then the duraton between events s a functon of the duraton of prevous duraton between events. All parametrc regressons are easly adaptable to mult-perod cases [Blossfeld, et al., 1989]. A hazard model wth competng rsk assumes that there are multple possble event states and the mult-epsode model assumes that an ndvdual could experence a sngle event more than once. The combnaton of both of 35
36 these concepts s the mult-epsode and mult-state event hstory model. Usng the same transton tmes and duraton of epsodes from the mult-epsode model, a state varable s defned as Y k : k = 1, 2, K, K. For the kth epsode, there exsts an epsode at k 1 that termnated n state y k 1. From state y k 1, the set of attanable states s represented as M ( y k 1 ). For Y k = j, the hazard rate for the kth epsode s defned as ( t T < t + t, Y = j T t,, H ) k P k k k x k 1 λ j ( t x k, H k 1 ) = lm k, [54] t 0 t where { t y, t, y x,, t y, } H K contans the prehstory of the k 1 = 0, 0 1 1, 1 k 1, k 1 x k-1 process untl tme t k 1 [Blossfeld, et al., 1989]. The total hazard rate s m k k λ ( t x k, H k 1 ) = λ j ( t x k, H k 1 ). [55] j= 1 It s the average of the transton specfc hazard rates and represents the hazard of a transton from state yk 1 n the kth epsode to any other state. As wth the prevous model, all parametrc regressons are easly adaptable to ths case. 6.0 SURVIVAL ANALYSIS IN ACCOUNTING AND FINANCE RESEARCH Relatve to other busness dscplnes such as economcs and management, the use of survval analyss n accountng and fnance research s lmted. There are probably two explanatons for the lmted use of ths research method. The frst reason s the manner n whch econometrcs s presented to graduate students n accountng and fnance doctoral programs. Graduate educaton typcally provdes a two or three sequence econometrc course 36
37 centered on regresson analyss. The standard texts used n these courses are Judge and Hll [1985], Kmenta [1986], and Maddala [1983]. Such textbooks devote very lttle coverage to survval analyss, and, to the extent that the texts provde coverage, t s cursory. Because graduate students are not exposed to survval analyss n ther educaton, ther knowledge of the technque s lmted, and consequently, they do not use the models. The second reason that accounts for the lmted use s the nature of the research problems. A glance at any of the top research journals n accountng or fnance makes t clear that the domnant research area s market-based research. Market-based research questons do not lend themselves to models of survval analyss. To the extent that survval analyss has been used n the accountng or fnance lterature, t s used to address tradtonal research questons. Usng survval analyss ntroduces event tme as a concept of nterest to research areas that have nterested researchers for a number of years. Instead of assumng that the underlyng process s statc, researchers use survval analyss to examne the process that generates the duraton of tme untl an event of nterest occurs. The common applcatons of survval analyss snce the md-1980s are n the areas of fnancal dstress, occupatonal success, and frm survval. 6.1 Fnancal Dstress The ablty of accountng nformaton to predct fnancal dstress generates consderable research n accountng and fnance. Early works by Altman [1968] and Beaver [1966; 1968a; 1968b] represent the emergence of a large body of 37
Chapter 20 Duration Analysis
Chapter 20 Duraton Analyss Duraton: tme elapsed untl a certan event occurs (weeks unemployed, months spent on welfare). Survval analyss: duraton of nterest s survval tme of a subject, begn n an ntal state
More informationSee Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)
Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes
More informationPredictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.
More informationEcon107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)
I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes
More informationBasically, if you have a dummy dependent variable you will be estimating a probability.
ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More informationAndreas C. Drichoutis Agriculural University of Athens. Abstract
Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty
More informationLimited Dependent Variables
Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages
More informationLimited Dependent Variables and Panel Data. Tibor Hanappi
Lmted Dependent Varables and Panel Data Tbor Hanapp 30.06.2010 Lmted Dependent Varables Dscrete: Varables that can take onl a countable number of values Censored/Truncated: Data ponts n some specfc range
More informationRELIABILITY ASSESSMENT
CHAPTER Rsk Analyss n Engneerng and Economcs RELIABILITY ASSESSMENT A. J. Clark School of Engneerng Department of Cvl and Envronmental Engneerng 4a CHAPMAN HALL/CRC Rsk Analyss for Engneerng Department
More information3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X
Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number
More informationNon-Mixture Cure Model for Interval Censored Data: Simulation Study ABSTRACT
Malaysan Journal of Mathematcal Scences 8(S): 37-44 (2014) Specal Issue: Internatonal Conference on Mathematcal Scences and Statstcs 2013 (ICMSS2013) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal
More informationChapter 5 Multilevel Models
Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level
More informationLOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi
LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent
More informationANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)
Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of
More informationx i1 =1 for all i (the constant ).
Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationChapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationCHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE
CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More informationJanuary Examinations 2015
24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)
More informationLecture 3 Stat102, Spring 2007
Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture
More informationComposite Hypotheses testing
Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter
More informationStructure and Drive Paul A. Jensen Copyright July 20, 2003
Structure and Drve Paul A. Jensen Copyrght July 20, 2003 A system s made up of several operatons wth flow passng between them. The structure of the system descrbes the flow paths from nputs to outputs.
More informationSTK4080/9080 Survival and event history analysis
SK48/98 Survval and event hstory analyss Lecture 7: Regresson modellng Relatve rsk regresson Regresson models Assume that we have a sample of n ndvduals, and let N (t) count the observed occurrences of
More informationNegative Binomial Regression
STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationStatistics Chapter 4
Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment
More information2016 Wiley. Study Session 2: Ethical and Professional Standards Application
6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More informationDO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes
25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton
More informationLecture 6: Introduction to Linear Regression
Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More informationx = , so that calculated
Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to
More informationLectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix
Lectures - Week 4 Matrx norms, Condtonng, Vector Spaces, Lnear Independence, Spannng sets and Bass, Null space and Range of a Matrx Matrx Norms Now we turn to assocatng a number to each matrx. We could
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study
More informationSTAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression
STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,
More information4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA
4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected
More informationHere is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)
Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,
More informationLINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity
LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have
More informationPsychology 282 Lecture #24 Outline Regression Diagnostics: Outliers
Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.
More information3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. CDS Mphil Econometrics Vijayamohan. 3-Mar-14. CDS M Phil Econometrics.
Dummy varable Models an Plla N Dummy X-varables Dummy Y-varables Dummy X-varables Dummy X-varables Dummy varable: varable assumng values 0 and to ndcate some attrbutes To classfy data nto mutually exclusve
More informationFREQUENCY DISTRIBUTIONS Page 1 of The idea of a frequency distribution for sets of observations will be introduced,
FREQUENCY DISTRIBUTIONS Page 1 of 6 I. Introducton 1. The dea of a frequency dstrbuton for sets of observatons wll be ntroduced, together wth some of the mechancs for constructng dstrbutons of data. Then
More informationStatistics for Business and Economics
Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear
More informationsince [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation
Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson
More informationChapter 8 Indicator Variables
Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n
More informationChapter 3 Describing Data Using Numerical Measures
Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The
More informationAn Introduction to Censoring, Truncation and Sample Selection Problems
An Introducton to Censorng, Truncaton and Sample Selecton Problems Thomas Crossley SPIDA June 2003 1 A. Introducton A.1 Basc Ideas Most of the statstcal technques we study are for estmatng (populaton)
More informationChapter 14: Logit and Probit Models for Categorical Response Variables
Chapter 4: Logt and Probt Models for Categorcal Response Varables Sect 4. Models for Dchotomous Data We wll dscuss only ths secton of Chap 4, whch s manly about Logstc Regresson, a specal case of the famly
More informationHeterogeneous Treatment Effect Analysis
Heterogeneous Treatment Effect Analyss Ben Jann ETH Zurch In cooperaton wth Jenne E. Brand (UCLA) and Yu Xe (Unversty of Mchgan) German Stata Users Group Meetng Berln, June 25, 2010 Ben Jann (ETH Zurch)
More informationLecture 3: Probability Distributions
Lecture 3: Probablty Dstrbutons Random Varables Let us begn by defnng a sample space as a set of outcomes from an experment. We denote ths by S. A random varable s a functon whch maps outcomes nto the
More informationIII. Econometric Methodology Regression Analysis
Page Econ07 Appled Econometrcs Topc : An Overvew of Regresson Analyss (Studenmund, Chapter ) I. The Nature and Scope of Econometrcs. Lot s of defntons of econometrcs. Nobel Prze Commttee Paul Samuelson,
More informationThe Geometry of Logit and Probit
The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.
More informationEconomics 130. Lecture 4 Simple Linear Regression Continued
Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do
More informationCredit Card Pricing and Impact of Adverse Selection
Credt Card Prcng and Impact of Adverse Selecton Bo Huang and Lyn C. Thomas Unversty of Southampton Contents Background Aucton model of credt card solctaton - Errors n probablty of beng Good - Errors n
More informationA Robust Method for Calculating the Correlation Coefficient
A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal
More informationProperties of Least Squares
Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More informationWinter 2008 CS567 Stochastic Linear/Integer Programming Guest Lecturer: Xu, Huan
Wnter 2008 CS567 Stochastc Lnear/Integer Programmng Guest Lecturer: Xu, Huan Class 2: More Modelng Examples 1 Capacty Expanson Capacty expanson models optmal choces of the tmng and levels of nvestments
More informationHomework Assignment 3 Due in class, Thursday October 15
Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed
More informationModule 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur
Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:
More informationIntroduction to Vapor/Liquid Equilibrium, part 2. Raoult s Law:
CE304, Sprng 2004 Lecture 4 Introducton to Vapor/Lqud Equlbrum, part 2 Raoult s Law: The smplest model that allows us do VLE calculatons s obtaned when we assume that the vapor phase s an deal gas, and
More informationUncertainty and auto-correlation in. Measurement
Uncertanty and auto-correlaton n arxv:1707.03276v2 [physcs.data-an] 30 Dec 2017 Measurement Markus Schebl Federal Offce of Metrology and Surveyng (BEV), 1160 Venna, Austra E-mal: markus.schebl@bev.gv.at
More informationLecture 17 : Stochastic Processes II
: Stochastc Processes II 1 Contnuous-tme stochastc process So far we have studed dscrete-tme stochastc processes. We studed the concept of Makov chans and martngales, tme seres analyss, and regresson analyss
More informationDETERMINATION OF UNCERTAINTY ASSOCIATED WITH QUANTIZATION ERRORS USING THE BAYESIAN APPROACH
Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata Proceedngs, XVII IMEKO World Congress, June 7, 3, Dubrovn, Croata TC XVII IMEKO World Congress Metrology n the 3rd Mllennum June 7, 3,
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationWeek3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity
Week3, Chapter 4 Moton n Two Dmensons Lecture Quz A partcle confned to moton along the x axs moves wth constant acceleraton from x =.0 m to x = 8.0 m durng a 1-s tme nterval. The velocty of the partcle
More information/ n ) are compared. The logic is: if the two
STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence
More information9. Binary Dependent Variables
9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons
More informationMore metrics on cartesian products
More metrcs on cartesan products If (X, d ) are metrc spaces for 1 n, then n Secton II4 of the lecture notes we defned three metrcs on X whose underlyng topologes are the product topology The purpose of
More informationGlobal Sensitivity. Tuesday 20 th February, 2018
Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values
More informationA Generalized Multiple Durations Proportional Hazard Model With an Application to Activity Behavior During the Evening Work-to-Home Commute
A Generalzed Multple Duratons Proportonal Hazard Model Wth an Applcaton to Actvty Behavor Durng the Evenng Work-to-Home Commute Chandra R. Bhat Unversty of Massachusetts at Amherst Abstract The model developed
More informationELASTIC WAVE PROPAGATION IN A CONTINUOUS MEDIUM
ELASTIC WAVE PROPAGATION IN A CONTINUOUS MEDIUM An elastc wave s a deformaton of the body that travels throughout the body n all drectons. We can examne the deformaton over a perod of tme by fxng our look
More informationNotes on Frequency Estimation in Data Streams
Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to
More informationComparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method
Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method
More informationJAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger
JAB Chan Long-tal clams development ASTIN - September 2005 B.Verder A. Klnger Outlne Chan Ladder : comments A frst soluton: Munch Chan Ladder JAB Chan Chan Ladder: Comments Black lne: average pad to ncurred
More informationA Comparative Study for Estimation Parameters in Panel Data Model
A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for
More informationNEW ASTERISKS IN VERSION 2.0 OF ACTIVEPI
NEW ASTERISKS IN VERSION 2.0 OF ACTIVEPI ASTERISK ADDED ON LESSON PAGE 3-1 after the second sentence under Clncal Trals Effcacy versus Effectveness versus Effcency The apprasal of a new or exstng healthcare
More informationEcon107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)
I. Defnton and Problems Econ7 Appled Econometrcs Topc 9: Heteroskedastcty (Studenmund, Chapter ) We now relax another classcal assumpton. Ths s a problem that arses often wth cross sectons of ndvduals,
More informationRockefeller College University at Albany
Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n
More informationFeature Selection: Part 1
CSE 546: Machne Learnng Lecture 5 Feature Selecton: Part 1 Instructor: Sham Kakade 1 Regresson n the hgh dmensonal settng How do we learn when the number of features d s greater than the sample sze n?
More informationChapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the
More informationKernel Methods and SVMs Extension
Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general
More informationStat 642, Lecture notes for 01/27/ d i = 1 t. n i t nj. n j
Stat 642, Lecture notes for 01/27/05 18 Rate Standardzaton Contnued: Note that f T n t where T s the cumulatve follow-up tme and n s the number of subjects at rsk at the mdpont or nterval, and d s the
More informationChapter 6. Supplemental Text Material
Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.
More informationStat260: Bayesian Modeling and Inference Lecture Date: February 22, Reference Priors
Stat60: Bayesan Modelng and Inference Lecture Date: February, 00 Reference Prors Lecturer: Mchael I. Jordan Scrbe: Steven Troxler and Wayne Lee In ths lecture, we assume that θ R; n hgher-dmensons, reference
More informationNUMERICAL DIFFERENTIATION
NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the
More informationAn (almost) unbiased estimator for the S-Gini index
An (almost unbased estmator for the S-Gn ndex Thomas Demuynck February 25, 2009 Abstract Ths note provdes an unbased estmator for the absolute S-Gn and an almost unbased estmator for the relatve S-Gn for
More informationBIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data
Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout
More information1 Binary Response Models
Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models
More informationEstimation of the Mean of Truncated Exponential Distribution
Journal of Mathematcs and Statstcs 4 (4): 84-88, 008 ISSN 549-644 008 Scence Publcatons Estmaton of the Mean of Truncated Exponental Dstrbuton Fars Muslm Al-Athar Department of Mathematcs, Faculty of Scence,
More informationEcon Statistical Properties of the OLS estimator. Sanjaya DeSilva
Econ 39 - Statstcal Propertes of the OLS estmator Sanjaya DeSlva September, 008 1 Overvew Recall that the true regresson model s Y = β 0 + β 1 X + u (1) Applyng the OLS method to a sample of data, we estmate
More informationP R. Lecture 4. Theory and Applications of Pattern Recognition. Dept. of Electrical and Computer Engineering /
Theory and Applcatons of Pattern Recognton 003, Rob Polkar, Rowan Unversty, Glassboro, NJ Lecture 4 Bayes Classfcaton Rule Dept. of Electrcal and Computer Engneerng 0909.40.0 / 0909.504.04 Theory & Applcatons
More informationBayesian predictive Configural Frequency Analysis
Psychologcal Test and Assessment Modelng, Volume 54, 2012 (3), 285-292 Bayesan predctve Confgural Frequency Analyss Eduardo Gutérrez-Peña 1 Abstract Confgural Frequency Analyss s a method for cell-wse
More informationSection 8.3 Polar Form of Complex Numbers
80 Chapter 8 Secton 8 Polar Form of Complex Numbers From prevous classes, you may have encountered magnary numbers the square roots of negatve numbers and, more generally, complex numbers whch are the
More information