STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression
|
|
- Elwin Gibbs
- 6 years ago
- Views:
Transcription
1 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout, we wll dscuss the regresson problem when the response varable s bnary, ndcatng ether the presence or absence of some characterstc. EXAMPLE: The data n the fle CHD.csv were taken from the text Appled Logstc Regresson by Hosmer and Lemeshow. Researchers are nterested n the relatonshp between age and the presence (or absence) of evdence of coronary heart dsease (CHD). A porton of ths data set s shown below. Suppose we wanted to predct the presence of CHD based on Age. After readng the data nto R, you should attach the data set so that you can reference varables by name. > attach(chd.data) Then, we can examne the relatonshp between the presence of CHD and Age. > scatter.smooth(age,chd)
2 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Note that we could ft a lnear regresson model wth Age as the predctor and CHD as the response. Before we do ths n R, consder our lnear regresson model: CHD Age η η Age e, where CHD Age n the absenc eof CHD, n the presenc eof CHD. Note that the mean functon s gven by E(CHD Age ) η η Age. Bernoull Random Varables Now, let θ(age) denote the probablty of havng CHD for a gven Age. Note that s a Bernoull random varable wth the followng probablty dstrbuton: CHD Age CHD Age P(CHD Age) - θ(age) θ(age) We can fnd the expected value of the Bernoull random varable as follows: θ(age ) θ(age ) θ(age ) E(CHD Age ). Why s ths mportant? Ths shows that E(CHD Age ) η η Age θ(age ) P(CHD Age ). That s, the regresson lne gves an estmate of the probablty of havng CHD for a gven Age. > chd.lm<-lm(chd~age) > plot(age,chd) > ablne(chd.lm) 2
3 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Some Problems that Arse Usng ths Model. Non-normalty of the error terms. Only two dfferent error terms are possble for each Age : - θ(age) f the response s, and - θ(age) f the response s. 2. Non-constant varance of the error terms. Snce CHD Age s a Bernoull random varable, we know that Var( CHD Age ) = θ(age) [- θ(age)]. Ths then mples that Var( CHD Age ) = [ ηage wth Age and s NOT constant. η ] [- η η Age ]. That s, the varance functon vares 3. Constrants on the response functon. A lnear representaton permts estmates or predctons outsde the range to, whch s not correct when modelng probabltes. For example, what s our estmate of θ(age=2) f we use a lnear regresson model? Comment: The constrant that the mean functon fall between and frequently rules out a lnear response functon. For our CHD example, the use of a lnear response functon mght requre us to assume a probablty of for the mean response for all ndvduals beneath a certan age and a probablty of for all ndvduals over a certan age (see below). Such a model s often consdered unreasonable, however. Ideally, we d lke to fnd a model where the probabltes and are reached asymptotcally. One such model s the logstc regresson model. The Smple Logstc Mean Functon We parameterze ths model as follows: exp(η ηx ) E(y x ) θ(x ). exp(η η x ) 3
4 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Some examples of smple logstc mean functons are shown below: Wth η = Wth η = - Comments:. The logstc mean functon s always between and. 2. As η ncreases, the functon becomes more S-shaped; therefore, the functon changes more rapdly n the center. 3. When η s postve, the functon s monotone ncreasng; when η s negatve, the functon s monotone decreasng. 4. Changng η shfts the functon horzontally. 5. The logstc functon possesses the property of symmetry. If the response varable s recoded by changng s to s and s to s, the sgns of all coeffcents wll be reversed. To ft the logstc regresson model n R, you can use the followng programmng statements: > chd.glm <- glm(chd~age,famly="bnomal") > summary(chd.glm) Call: glm(formula = CHD ~ Age, famly = "bnomal") Devance Resduals: Mn Q Medan 3Q Max Coeffcents: Estmate Std. Error z value Pr(> z ) (Intercept) e-6 *** Age e-6 *** --- Sgnf. codes: ***. **. *.5.. (Dsperson parameter for bnomal famly taken to be ) Null devance: on 99 degrees of freedom Resdual devance: 7.35 on 98 degrees of freedom AIC:.35 Number of Fsher Scorng teratons: 4 4
5 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson To see a plot of the predcted probabltes based on age, use the followng commands. > prob.chd <- ftted(chd.glm) > plot(age,prob.chd,ylab="p(chd Age)") Ths curve s a plot of: θ(age ˆ ) P(CHD Age ˆ ) exp(η ηage ) exp(η η Age ) To ft the logstc regresson model n SAS, you can use the followng programmng statements: ods html; ods graphcs on; proc logstc descendng plots=effect; model CHD = age / lnk=logt; run; ods graphcs off; ods html close; 5
6 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Questons:. Based on the plot of predcted probabltes, fnd θ ˆ (4) P(CHD ˆ Age 4). 2. Based on the plot of predcted probabltes, fnd θ ˆ (6) P(CHD ˆ Age 6). Interpretng the Model Parameters exp(η ηage ) Mean functon: E(CHD Age ) θ(age ). exp(η ηage ) exp(η η Age ) Ftted Model Equaton (or Ftted Probabltes): E(CHD Age ˆ ) θ(age ˆ ˆ ˆ ) exp(η ˆ ηˆ Age ) Note that n the mean functon, the probabltes θ(age) are nonlnear functons of η and η. However, a smple transformaton results n a lnear model. That s, we can show the followng: θ(age ) ln η ηage θ(age ) Proof of the prevous clam: 6
7 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Much of the nterpretaton of the logstc regresson model centers on ths rato: θ(age ). θ(age ) Ths rato compares the probablty of havng CHD for a gven age to the probablty of NOT havng CHD for a gven age; n other words, ths s the odds of CHD for age. The natural logarthm of the odds s referred to as the log odds, or the logt. Note that the smple logstc regresson model assumes a lnear model for the logt θ(age ) ln η θ(age ) η Age. Ths representaton of the model shows that the regresson coeffcents do n fact represent changes n the log odds (or the logt). To see how ths works, let s go back to our example. > chd.glm <- glm(chd~age,famly="bnomal") > summary(chd.glm) Call: glm(formula = CHD ~ Age, famly = "bnomal") Devance Resduals: Mn Q Medan 3Q Max Coeffcents: Estmate Std. Error z value Pr(> z ) (Intercept) e-6 *** Age e-6 *** --- Sgnf. codes: ***. **. *.5.. (Dsperson parameter for bnomal famly taken to be ) Null devance: on 99 degrees of freedom Resdual devance: 7.35 on 98 degrees of freedom AIC:.35 Number of Fsher Scorng teratons: 4 From ths output, fnd the followng: ηˆ : ηˆ : exp(η η Age ) θ(age ˆ ˆ ˆ ), where Age = 4: exp(η ˆ ηˆ Age ) The estmated odds of CHD when Age = 4: 7
8 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Note that θˆ (Age ) = 4 s the estmated probablty that a 4-year old wll have CHD. Recall that the estmated probabltes n R can be obtaned as follows: > prob.chd <- ftted(chd.glm) > cbnd(age,prob.chd) Age prob.chd These estmated probabltes can also be obtaned from SAS: proc logstc descendng; model CHD = age / lnk=logt; output out=get_values predcted=predcted_probabltes; run; proc prnt data=get_values; run; 8
9 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Fnd the estmated odds of CHD when Age = 45: θ(age ˆ ) = θ(age ˆ ) Fnd the estmated odds of CHD when Age = 5: θ(age ˆ ) = θ(age ˆ ) Based on the prevous two answers, fnd an odds rato for a 5 year ncrease n Age: Fndng the Odds Rato for a t-year Increase n Age Recall that the rato θ(age ˆ ) s the estmated odds of CHD when Age= Age. θ(age ˆ ) θ(age ˆ j) Frst, let Age = Agej. Then we know that ln ηˆ ηˆ Agej. θ(age ˆ j) θ(age ˆ j t) Next, let Age = Agej + t. Agan, we have ln ηˆ ηˆ (Agej t). θ(age ˆ j t) Now, consder ther dfference: θ(age ˆ j t) θ(age ˆ j) ln - ln ηˆ θ(age ˆ j t) θ(age ˆ j) ηˆ (Age t) j ηˆ ηˆ Agej t ηˆ 9
10 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson θ(age ˆ j t) θ(age ˆ j t) Note that ln(or assocated wth a t-year ncrease n age) = ln t ηˆ. θ(age ˆ j) θ(age ˆ j) t η e ˆ. Therefore, the odds rato assocated wth a t-year ncrease n age s gven by Back to our example:. Dscuss the odds rato assocated wth a one-year ncrease n age. 2. Dscuss the odds rato assocated wth a fve-year ncrease n age. 3. Is t reasonable to assume that a t-year ncrease n a contnuous predctor s constant, regardless of the startng pont? For example, does the rsk assocated wth a 5-year ncrease n age reman constant throughout one s lfe?
11 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Estmaton of the Model Parameters In a lnear regresson analyss, the regresson coeffcents are estmated based on the least squares method. That s, the estmates are obtaned by mnmzng the sum of the squared resduals. In a logstc regresson analyss, the model parameters are estmated through a process called the maxmum lkelhood method. The basc prncple of maxmum lkelhood s to choose as estmates those parameter values whch, f true, would maxmze the probablty of observng what we have actually observed. Ths nvolves:. Fndng an expresson (.e., the lkelhood functon) for the probablty of the data as a functon of the unknown parameters. For the logstc model, the bnary response varable s assumed to follow a bnomal dstrbuton wth a sngle tral (n=) and probablty of success equal to θ(x). Therefore, for the th observed par (x,y), the contrbuton to the lkelhood s θ(x ) y ( θ(x )) where y βo βx e θ(x ) and βo βx e y Then, snce we assume ndependence across observatons, the lkelhood functon s gven by L β L β ~ n o,β y θ(x ) (θ(x )) 2. Fndng the values of the unknown parameters whch make the value of ths expresson as large as possble. For computatonal purposes t s usually easer to maxmze the logarthm of the lkelhood functon rather than the lkelhood functon tself. Ths works because the logarthm s a monotonc ncreasng functon; therefore, the maxmzng parameters are the same for the lkelhood and log-lkelhood functons. The log-lkelhood functon s gven by y lnl(β,β ) o n ( y )ln θ(x ) y ln θ(x ) To fnd the parameter estmates, we solve smultaneously the equatons gven by settng the partal dervatves wth respect to each parameter equal to : β o lnl(βo,β) β lnl(β,β ) o
12 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Several dfferent nonlnear optmzaton routnes are used to fnd solutons to such systems. Ths process gets ncreasngly computatonally ntensve as the number of terms n the model ncreases. Statstcal Inference for the Logstc Regresson Model Frst, consder the followng output from PROC LOGISTIC: proc logstc descendng; model CHD = age / lnk=logt; output out=get_values predcted=predcted_probabltes; run; All of these statstcs are testng the same null hypothess: Ho: all explanatory varables n the model have coeffcents of zero. Ha: at least one explanatory varable n the model has a coeffcent dfferent from zero. The Lkelhood Rato test compares the log-lkelhood for the ftted model wth the lkelhood for a model wth NO explanatory varables. PROC LOGISTIC reports -2 log-lkelhood for each of these models, and the ch-square test statstc s the dfference of these two numbers. Note that the df = corresponds to the one ndependent varable n the model. The Score statstc s a functon of the frst and second dervatves of the log-lkelhood functon under the null hypothess. There s some evdence that ths test does not perform as well as the lkelhood rato test for small samples. The Wald statstc s an approxmaton that s more accurate wth larger sample szes. 2
13 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Carryng out the Lkelhood Rato Test n R: > summary(chd.glm) Call: glm(formula = CHD ~ Age, famly = "bnomal") Devance Resduals: Mn Q Medan 3Q Max Coeffcents: Estmate Std. Error z value Pr(> z ) (Intercept) e-6 *** Age e-6 *** --- Sgnf. codes: ***. **. *.5.. (Dsperson parameter for bnomal famly taken to be ) Null devance: on 99 degrees of freedom Resdual devance: 7.35 on 98 degrees of freedom AIC:.35 Number of Fsher Scorng teratons: 4 > -pchsq( ,) [] e-8 Hypothess Testng For Indvdual Coeffcents H H o a : η : η When the sample sze s large, the test for sgnfcance of the slope parameter (η ) can be calculated as follows: ηˆ z = SE(η ˆ ) χ 2 = z 2 = 3
14 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Confdence Intervals for Coeffcents and Correspondng Odds Ratos A ( α)% confdence nterval for η can be calculated as follows: ηˆ z α/2 SE(η ˆ ) A ( α)% confdence nterval for the odds rato assocated wth η s calculated as follows: exp( ηˆ z α/2 SE(η ˆ )) These ntervals can be calculated n SAS PROC LOGISTIC as follows: proc logstc descendng; model CHD = age / lnk=logt clparm=wald; run; To compute the confdence ntervals for the parameter estmates n R, enter the followng: > confnt.default(chd.glm) 2.5 % 97.5 % (Intercept) Age To compute the odds rato and the assocated confdence ntervals n R, you can use the followng commands. > exp(coef(chd.glm)) (Intercept) Age > exp(confnt.default(chd.glm)) 2.5 % 97.5 % (Intercept) Age
15 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson If η corresponds to a contnuous predctor and we wsh to examne the odds rato assocated wth a t unt ncrease n that predctor, the confdence nterval for the odds rato becomes exp(t ηˆ z α/2 t SE(η ˆ )) Example: Fnd the odds rato for CHD assocated wth a year ncrease n age, and gve a 95% confdence nterval based on ths estmate. proc logstc descendng; model CHD = age / lnk=logt clparm=pl clodds=pl; unts age=; run; In R: > exp(*coef(chd.glm)) (Intercept) Age e e+ > exp(*confnt.default(chd.glm)) 2.5 % 97.5 % (Intercept).9573e e-4 Age.89225e e+ The ntervals shown above from R are all known as Wald ntervals (based on normal-theory methods). These may not be approprate for small samples; therefore, you may want to consder another method called the Profle Lkelhood method. Ths nvolves an teratve evaluaton of the lkelhood functon and produces ntervals that may not be symmetrc around the estmate. proc logstc descendng; model CHD = age / lnk=logt clparm=pl; run; 5
16 STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Usng the profle lkelhood method n R: > confnt(chd.glm) Watng for proflng to be done % 97.5 % (Intercept) Age Questons:. How do these compare to the Wald confdence ntervals? 2. How would you fnd the profle lkelhood confdence nterval for the odds rato for a one-year ncrease n age? 6
Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More informationChapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationSTAT 3008 Applied Regression Analysis
STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,
More informationNegative Binomial Regression
STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...
More informationsince [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation
Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson
More informationBasic Business Statistics, 10/e
Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson
More informationStatistics for Business and Economics
Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear
More information[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.
PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More informationLecture 6: Introduction to Linear Regression
Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6
More informationANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)
Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More informationSTATISTICS QUESTIONS. Step by Step Solutions.
STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to
More informationCorrelation and Regression. Correlation 9.1. Correlation. Chapter 9
Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,
More informationLecture 4 Hypothesis Testing
Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More information4.3 Poisson Regression
of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)
More information/ n ) are compared. The logic is: if the two
STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence
More informationIntroduction to Regression
Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes
More informationRegression Analysis. Regression Analysis
Regresson Analyss Smple Regresson Multvarate Regresson Stepwse Regresson Replcaton and Predcton Error 1 Regresson Analyss In general, we "ft" a model by mnmzng a metrc that represents the error. n mn (y
More informationChapter 9: Statistical Inference and the Relationship between Two Variables
Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,
More informationx i1 =1 for all i (the constant ).
Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by
More informationDiagnostics in Poisson Regression. Models - Residual Analysis
Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent
More informationY = β 0 + β 1 X 1 + β 2 X β k X k + ε
Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for
More informationDO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes
25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton
More informationHere is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)
Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,
More informationLecture Notes on Linear Regression
Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume
More informationECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics
ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott
More informationEconomics 130. Lecture 4 Simple Linear Regression Continued
Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed
More informationSee Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)
Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes
More informationLimited Dependent Variables
Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationResource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis
Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques
More informatione i is a random error
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown
More informationLOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi
LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study
More informationFirst Year Examination Department of Statistics, University of Florida
Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve
More informationSTAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).
(out of 15 ponts) STAT 3340 Assgnment 1 solutons (10) (10) 1. Fnd the equaton of the lne whch passes through the ponts (1,1) and (4,5). β 1 = (5 1)/(4 1) = 4/3 equaton for the lne s y y 0 = β 1 (x x 0
More informationChapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the
More informationLecture 2: Prelude to the big shrink
Lecture 2: Prelude to the bg shrnk Last tme A slght detour wth vsualzaton tools (hey, t was the frst day... why not start out wth somethng pretty to look at?) Then, we consdered a smple 120a-style regresson
More information28. SIMPLE LINEAR REGRESSION III
8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted
More informationJanuary Examinations 2015
24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)
More informationChapter 14 Simple Linear Regression
Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng
More information18. SIMPLE LINEAR REGRESSION III
8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.
More informationRockefeller College University at Albany
Rockefeller College Unverst at Alban PAD 705 Handout: Maxmum Lkelhood Estmaton Orgnal b Davd A. Wse John F. Kenned School of Government, Harvard Unverst Modfcatons b R. Karl Rethemeer Up to ths pont n
More informationRegression with limited dependent variables. Professor Bernard Fingleton
Regresson wth lmted dependent varables Professor Bernard Fngleton Regresson wth lmted dependent varables Whether a mortgage applcaton s accepted or dened Decson to go on to hgher educaton Whether or not
More informationx = , so that calculated
Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to
More informationBiostatistics 360 F&t Tests and Intervals in Regression 1
Bostatstcs 360 F&t Tests and Intervals n Regresson ORIGIN Model: Y = X + Corrected Sums of Squares: X X bar where: s the y ntercept of the regresson lne (translaton) s the slope of the regresson lne (scalng
More informationa. (All your answers should be in the letter!
Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal
More information2016 Wiley. Study Session 2: Ethical and Professional Standards Application
6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton
More informationHydrological statistics. Hydrological statistics and extremes
5--0 Stochastc Hydrology Hydrologcal statstcs and extremes Marc F.P. Berkens Professor of Hydrology Faculty of Geoscences Hydrologcal statstcs Mostly concernes wth the statstcal analyss of hydrologcal
More informationEcon107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)
I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes
More informationInterval Estimation in the Classical Normal Linear Regression Model. 1. Introduction
ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model
More informationβ0 + β1xi. You are interested in estimating the unknown parameters β
Ordnary Least Squares (OLS): Smple Lnear Regresson (SLR) Analytcs The SLR Setup Sample Statstcs Ordnary Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals) wth OLS
More informationx yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.
The Practce of Statstcs, nd ed. Chapter 14 Inference for Regresson Introducton In chapter 3 we used a least-squares regresson lne (LSRL) to represent a lnear relatonshp etween two quanttatve explanator
More informationPubH 7405: REGRESSION ANALYSIS INTRODUCTION TO POISSON REGRESSION
PubH 7405: REGRESSION ANALYSIS INTRODUCTION TO POISSON REGRESSION PROTOTYPE EXAMPLE #1 We have data for 44 physcans workng n emergency medcne at a major hosptal system. The concern s about the number of
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationβ0 + β1xi. You are interested in estimating the unknown parameters β
Revsed: v3 Ordnar Least Squares (OLS): Smple Lnear Regresson (SLR) Analtcs The SLR Setup Sample Statstcs Ordnar Least Squares (OLS): FOCs and SOCs Back to OLS and Sample Statstcs Predctons (and Resduals)
More informationLecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management
Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =
More information8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore
8/5/17 Data Modelng Patrce Koehl Department of Bologcal Scences atonal Unversty of Sngapore http://www.cs.ucdavs.edu/~koehl/teachng/bl59 koehl@cs.ucdavs.edu Data Modelng Ø Data Modelng: least squares Ø
More informationwhere I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).
11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e
More informationStatistics MINITAB - Lab 2
Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that
More informationLearning Objectives for Chapter 11
Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method
More informationTHE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE
THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for
More informationTopic 7: Analysis of Variance
Topc 7: Analyss of Varance Outlne Parttonng sums of squares Breakdown the degrees of freedom Expected mean squares (EMS) F test ANOVA table General lnear test Pearson Correlaton / R 2 Analyss of Varance
More informationChapter 15 - Multiple Regression
Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term
More informationPolynomial Regression Models
LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables
More informationStatistics II Final Exam 26/6/18
Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the
More informationChapter 14: Logit and Probit Models for Categorical Response Variables
Chapter 4: Logt and Probt Models for Categorcal Response Varables Sect 4. Models for Dchotomous Data We wll dscuss only ths secton of Chap 4, whch s manly about Logstc Regresson, a specal case of the famly
More informationLaboratory 1c: Method of Least Squares
Lab 1c, Least Squares Laboratory 1c: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan
More informationBIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data
Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout
More informationIntroduction to Generalized Linear Models
INTRODUCTION TO STATISTICAL MODELLING TRINITY 00 Introducton to Generalzed Lnear Models I. Motvaton In ths lecture we extend the deas of lnear regresson to the more general dea of a generalzed lnear model
More informationCHAPTER 8. Exercise Solutions
CHAPTER 8 Exercse Solutons 77 Chapter 8, Exercse Solutons, Prncples of Econometrcs, 3e 78 EXERCISE 8. When = N N N ( x x) ( x x) ( x x) = = = N = = = N N N ( x ) ( ) ( ) ( x x ) x x x x x = = = = Chapter
More informationLaboratory 3: Method of Least Squares
Laboratory 3: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly they are correlated wth
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More informationThe Geometry of Logit and Probit
The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.
More informationTests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions
ECONOMICS 35* -- NOTE ECON 35* -- NOTE Tests of Sngle Lnear Coeffcent Restrctons: t-tests and -tests Basc Rules Tests of a sngle lnear coeffcent restrcton can be performed usng ether a two-taled t-test
More informationReminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1
Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to
More information1 Binary Response Models
Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models
More informationStatistics Chapter 4
Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment
More informationChapter 15 Student Lecture Notes 15-1
Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons
More informationChapter 5 Multilevel Models
Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level
More informationUsing T.O.M to Estimate Parameter of distributions that have not Single Exponential Family
IOSR Journal of Mathematcs IOSR-JM) ISSN: 2278-5728. Volume 3, Issue 3 Sep-Oct. 202), PP 44-48 www.osrjournals.org Usng T.O.M to Estmate Parameter of dstrbutons that have not Sngle Exponental Famly Jubran
More informationMarginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients
ECON 5 -- NOE 15 Margnal Effects n Probt Models: Interpretaton and estng hs note ntroduces you to the two types of margnal effects n probt models: margnal ndex effects, and margnal probablty effects. It
More informationMidterm Examination. Regression and Forecasting Models
IOMS Department Regresson and Forecastng Models Professor Wllam Greene Phone: 22.998.0876 Offce: KMC 7-90 Home page: people.stern.nyu.edu/wgreene Emal: wgreene@stern.nyu.edu Course web page: people.stern.nyu.edu/wgreene/regresson/outlne.htm
More informationProperties of Least Squares
Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures
More informationThe Ordinary Least Squares (OLS) Estimator
The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal
More informationCIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M
CIS56: achne Learnng Lecture 3 (Sept 6, 003) Preparaton help: Xaoyng Huang Lnear Regresson Lnear regresson can be represented by a functonal form: f(; θ) = θ 0 0 +θ + + θ = θ = 0 ote: 0 s a dummy attrbute
More informationMaximum Likelihood Estimation
Maxmum Lkelhood Estmaton INFO-2301: Quanttatve Reasonng 2 Mchael Paul and Jordan Boyd-Graber MARCH 7, 2017 INFO-2301: Quanttatve Reasonng 2 Paul and Boyd-Graber Maxmum Lkelhood Estmaton 1 of 9 Why MLE?
More informationChapter 12 Analysis of Covariance
Chapter Analyss of Covarance Any scentfc experment s performed to know somethng that s unknown about a group of treatments and to test certan hypothess about the correspondng treatment effect When varablty
More informationANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.
ANSWERS CHAPTER 9 THINK IT OVER thnk t over TIO 9.: χ 2 k = ( f e ) = 0 e Breakng the equaton down: the test statstc for the ch-squared dstrbuton s equal to the sum over all categores of the expected frequency
More information