Designing a Pseudo R-Squared Goodness-of-Fit Measure in Generalized Linear Models
|
|
- Anthony Richard
- 5 years ago
- Views:
Transcription
1 Desgnng a Pseudo R-Squared Goodness-of-Ft Measure n Generalzed Lnear Models H. I. Mbachu Dept. of Mathematcs/Statstcs, Unversty of Port Harcourt, Port Harcourt E. C. Nduka Dept. of Mathematcs/Statstcs, Unversty of Port Harcourt, Port Harcourt M. E. Nja (Correspondng author) Dept. of Mathematcs/Statstcs, Cross Rver Unversty of Technology, Calabar Receved: December 19, 2011 Accepted: January 4, 2012 Publshed: Aprl 1, 2012 do: /jmr.v4n2p148 Abstract URL: The coeffcent of determnaton s a functon of resduals n the General Lnear Models. The devance, logt, standardzed and the studentzed resduals were examned n generalzed lnear models n order to determne the behavour of resduals n ths class of models and thereby desgn a new pseudo R-squared goodness-of-ft measure. The Newton-Raphson estmaton procedure was adopted. It was observed that these resduals exhbt patterns that are unque to the subpopulatons defned by levels of categorcal predctors. Resduals block on the bass of sgns, where postve sgns ndcate success responses and negatve sgns falure responses. It was also observed that the devance s a close approxmaton of the studentzed resdual. The logt resdual s two tmes the sze of the standardzed resduals. Borrowng from the Nagelkerke s mprovement of Cox and Snell s goodness-of-ft measure n generalzed lnear models and the coeffcent of determnaton counterpart of the general lnear model, a new pseudo R squared goodness-of-ft test whch uses predcted probabltes and a monotonc lnk functon s here proposed to serve both the lnear and Generalzed Lnear Models. Keywords: Devance, Normalzed resduals, Logt, Standardzed resduals, Loglkelhood functon, Response probablty 1. Introducton A generalzed lnear model s one n whch each component of the response varable Y has a dstrbuton n the exponental famly, takng the form yθ b(θ) f y (y, θ, ϕ) = exp{ + c(y, ϕ)} a(θ) for some specfc functon a( ), b( ) and c(, ) (McCullagh & Nelder, 1990). The functons a and c are such that a(ϕ) = ϕ/w and c = c(y, ϕ/w), where w s a known weght for each observaton. The model can be stated as z = Σ p j=1 x jβ j + e h (µ) = Σ j=1 x j β + e (y µ )h (µ), = 1, 2, 3,..., n (1) where z s the adjusted dependent varate, x j s the (, j)th element of the desgn matrx, h(µ ) s the lnk functon and e s the resdual error. The lnk between y and z s n the expresson. Where y s a bnomal random response varable. From (1), a resdual n generalzed lnear model can be defned as h = h(µ ) (2) e, so defned s called Pearson resdual. e = z x j β j h (µ) Standard theory for ths type of dstrbuton expresses the mean and varance of the response y as: E(y) = b (θ) and var(y) = b (θ)ϕ w = V(µ)ϕ w (3) 148 ISSN E-ISSN
2 where V s the varance functon. The log-lkelhood functon, a goodness-of-ft measure s defned for the followng exponental famly models: Generally, the log-lkelhood functon s of the form wth ndvdual contrbuton for the bnomal functon as 2. The Newton-Raphson Method The Newton-Raphson estmaton scheme s gven as where H, the Hessan matrx s gven as wth and L(y, µ, ϕ) = Σ log( f (y, µ, ϕ)) l = [r log(p ) + (n r )log(1 p )] β k+1 = β k H 1 g 2 l H = { } rs β r β s 2 l = Σ[(y µ) {W dη β r β s β s dµ x r} + W dη dµ x r (y µ)] β s 2 l = Σ[(y µ) {W dη β j dµ x j} + W dη dµ x j (y µ)] β j β 2 j l = [ y µ β j a(ϕ) 1 dµ v dη x j] = l, the loglkelhood for a bnary response varable can be wrtten as η = β 0 + Σx j β j s the lnear predctor. W dη (y µ) a(ϕ) dµ x j l = l(β; y) = Σ Σ j y x j β j Σm log(1 + expσσx j β j ) W, the weght matrx s gven as W = dag{m ( dµ dη ) 2 /µ (1 µ )}. m s row subtotal n the cross tabulaton table. The gradent vector g s gven as g = ( l β 0, l β 1,..., where the response or ftted probablty µ s defned as l ) = l = Σ y m µ dµ = Σ(y m µ )x r β n β r µ (1 µ ) dη µ = expσx jβ j 1 + expσx j β j An alternatve estmaton procedure s the Iteratve Weghted Least Squares method whch often adopted n order to avod the computatonal tedum assocated wth the Hessan matrx. 3. Resduals n Generalzed Lnear Models The coeffcent of determnaton R 2, s a functon of the resdual. It was orgnally developed for the normal-theory model. Cameron and Wndmejer (1996) desgned an R 2 for the Posson and related count data after observng that t was rarely used for count data. Nagelkerke (1991) generalzed the defnton of R 2 n what s called the generalzed R 2. The generalzed R 2 s consstent wth the classcal R 2 and s also maxmzed by the maxmum lkelhood estmaton of a model. The generalzed coeffcent of determnaton s gven as follows: R 2 = 1 ( L(0) L(θ) ) 2 n where L(0) s the lkelhood of the model wth only ntercept. L(θ) s the lkelhood of the estmated model and n s the sample sze. Resduals n a logstc model can be defned as the dfference between y and the predcted probablty θ for Publshed by Canadan Center of Scence and Educaton 149
3 y. We defne the predcted probablty n a cross-classfed data as the probablty that an object or a person selected from a subgroup s a success (Stroke et al., 1997). θ = exp{β 0 + Σβ x j } 1 + exp{β 0 + Σβ x j } The monotonc lnk functon relates the predcted probablty to the set of lnear predctors. For the logstc regresson where the underlyng dstrbuton s bnomal, the lnk functon s a logt. The devance, Pearson χ 2, standardzed, logt and studentzed resduals are the resduals normally assocated wth generalzed lnear models. The analyss of resduals made n ths paper shows that the logt resdual s approxmately twce the sze of standandzed resduals. The standardzed resdual s approxmately equal to the devance resdual. Ths can be seen n the appendx. 4. Goodness of Ft Measures n Generalzed Lnear Models The devance and the generalzed Pearson χ 2 statstc are two measures of goodness of ft n generalzed lnear models. Both the devance and the generalzed Pearson χ 2 have exact χ 2 dstrbutons for Normal-theory lnear models f the models are true (McCullagh & Nelder, 1990). The devance uses the log of the rato of lkelhoods. Cox and Snell R squared, another measure of goodness of ft n generalzed lnear models s a psudo R squared and a modfcaton of the devance whch confgures the test nterval to le between 0 and 1 (excludng 1) such that a smaller rato mples a greater mprovement. The devance for the set of dstrbutons n generalzed lnear models s gven as follows: for the normal dstrbuton, t s stated as D = Σw (y µ ) 2 For the posson, bnomal and gamma we have and 2 w [y log( y ) (y µ )], µ 2 w m [y log( y ) + (1 y )(log 1 y )] µ 1 µ 2 w [ log( y ) + y µ ] µ µ respectvely. For the nverse-gaussan, multnomal and negatve bnomal, we have and respectvely. Cox and Snell R 2 s defned as w (y µ ) 2 µ 2 y y j w y j log( ) p j m j 2 w [ylog(y/µ) (1 + 1/k)log( y + 1/k µ + 1/k )] R 2 = 1 { L(m nt) L(m f ull ) }2/N where L(m nt ) s the condtonal probablty of the dependent varable for the ntercept model. If L(m f ull ) s 1 then R 2 < 1. The Nagelkerke/Gragg & Uhler s modfcaton s R 2 = 1 { L(m nt) L(m f ull ) }2/N /1 L(m nt ) 2/N In ths paper a new goodness of ft test that makes use of ftted probabltes, a monotomc lnk functon and the Nagelkerke range of possble values s proposed. The test s desgned to serve both the general lnear and the generalzed lnear models. It s gven as follows: R 2 G&G = 1 [h (θ)] 1 (y θ) 2 (y h(θ)) 150 ISSN E-ISSN
4 RG&G 2, desgned for the generalzed lnear models can be adapted for use as a goodness of ft measure n the general lnear model by replacng the ftted probabltes and the lnk functon values wth ftted y values and the mean of y respectvely. The value of RG&G 2 range from 0 to 1, wth hgher values mplyng better fts. 5. Illustratve Example The hypothetcal data below s used for the llustraton of resdual analyss n generalzed lnear models: <Table 1> The probablty that a person from the th sex level and the jth locaton status s nfected wth a certan vrus. The model Let y j be a bnomal random response varable correspondng to the th sex status and the jth locaton whch assumes the value 0 or 1. The probablty θ j ; that a person of the hth sex and jth locaton s nfected by the vrus s modeled as where = 1, 2, j = 1, 2, β 0 = overall mean sex() = effect of th sex level =β 1 locaton(j) = effect of jth locaton status = β 2 θ j = exp exp[β 0 + sex() + locaton( j)] 1 + exp[β 0 + sex() + locaton( j)] e j = random error assocated wth observaton. The Newton-Raphson estmates of the llustratve example are as follows: Soluton β 0 = , β 1 = s the effect of the th sex level. β 2 = s the effect of the jth locaton status. The pseudo-r squared goodness of ft test reveals the followng results: Cox and Snell R 2 = Nagelkerke/Gragg & Uhler s R 2 = The proposed R 2 G&G = The outlned resduals assocated wth ths example are shown n the appendx. It s observed that resduals exhbt unque patterns n accordance wth subpopulatons defned by levels of the categorcal varables. Resduals form blocks on the bass of sgns, where postve sgns ndcate success and negatve sgns ndcate falure responses. The devance and the studentzed resduals exhbt very close resdual patterns. Stat Computng (2011) gave three nterpretatons of R 2 as follows: () R 2 as explaned varablty: The denomnator of the rato ndcates total varaton n the dependent varable whle the numerator s the varablty n the dependent varable that s not predcted by the model. The rato s the proporton of the total varablty explaned by the model whch agrees wth R 2 n Ordnary Lnear Models (Koutsoyanns, 1983). Thus a hgher rato mples a better model. () R 2 as mprovement from null model to ftted model: A smaller rato mples a greater mprovement. () R 2 as the square of the correlaton: correlaton between predcted values and the actual values. A hgher R 2 mples a greater mprovement of ft. It can be seen that the proposed R2 goodness-of-ft measure compares favourably wth the Nagelkerke/Gragg & Uhler s R 2 (0.180 aganst 0.187). 6. Concluson The Nagelkerke/Gragg & Uhler s Improvement of Cox and Snell R 2 s applcable n Generalzed Lnear models only. The exstng R squared goodness of ft measure n General Lnear models s not applcable n Generalzed Lnear model. Ths s because the model estmates from Generalzed Lnear models are maxmum lkelhood estmates whch are obtaned by teratve procedures. They are not calculated to mnmze varance; so the Ordnary Least Squares approach to goodness of ft does not apply. To evaluate goodness of ft n generalzed lnear models a pseudo R 2 s requred. Ths paper ntroduces a new pseudo R squared goodness of ft measure whch has the advantage of assessng goodness of ft n both lnear and generalzed lnear models. The result shows that the new pseudo-r squared method desgned n ths paper compares favourably wth the exstng Nagelkerke/Gragg & Uhler s desgn. Publshed by Canadan Center of Scence and Educaton 151
5 References Cameron, A. C., & Wndmeyer F. A. G. (1996). R-Squared Measures for Count Data Regresson Models wth Applcatons to Health-Care Utlzaton. Journal of Busness and Economc Statstc. Koutsoyanns, A. (1983). Theory of Econometrcs: An Introductory Exposton of Econometrc Methods. 2 Ed. The Macmllan Press Ltd, London. McCullagh, P., & Nelder, J. A. (1990). Generalzed Lnear Models. Chapman and Hall. Madras. Nagelkerke, N. (1991). A note on a General Defnton of the Coeffcent of Determnaton. Bometrka, 78 (3), pp Nja, M. E., & Bamduro, T. A. (2006). Relatve performance of Optmzaton Methods In Solutons of Generalzed Lnear Models. An unpublshed Ph. D thess, Unversty of Ibadan, Ngera. Stoke, M. E., Davs, C. S., & Koch, G. G. (1997). Categorcal Data Analyss usng the SAS system, SAS Insttute Inc., Cary, NC, USA. Table 1. Hypothetcal data I Sex x 1 Locaton x 2 Infected y Not nfected Total m 1 Female Urban Female Rural Male Urban Male Rural Appendx: Resduals PRE 1 COO 1 LEV 1 RES 1 LRE 1 SRE 1 ZRE 1 DEV 1 DFB0 1 DFB1 1 DFB2 152 ISSN E-ISSN
6 PRE 1 COO 1 LEV 1 RES 1 LRE 1 S RE 1 ZRE 1 DEV 1 DFB0 1 DFB1 1 DFB2 PRE 1 COO 1 LEV 1 RES 1 LRE 1 S RE 1 ZRE 1 DEV 1 DFB0 1 DFB1 1 DFB2 Publshed by Canadan Center of Scence and Educaton 153
7 PRE 1 COO 1 LEV 1 RES 1 LRE 1 S RE 1 ZRE 1 DEV 1 DFB0 1 DFB1 1 DFB2 Key: PRE 1 COO 1 LEV 1 RES 1 LRE 1 S RE 1 ZRE 1 DEV 1 DFB0 1 DFB1 1 DFB2 1 Predcted probablty Analog of Cook s nfluence statstcs Leverage value Dfference between observed and predcted probabltes Logt Resdual Standard Resdual Normalzed Resdual Devance value DFBeta for constant DFBeta for VAR00002(1) DFBeta for VAR00003(1) 154 ISSN E-ISSN
Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.
More informationNegative Binomial Regression
STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More informationChapter 14: Logit and Probit Models for Categorical Response Variables
Chapter 4: Logt and Probt Models for Categorcal Response Varables Sect 4. Models for Dchotomous Data We wll dscuss only ths secton of Chap 4, whch s manly about Logstc Regresson, a specal case of the famly
More informationDiagnostics in Poisson Regression. Models - Residual Analysis
Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent
More informationSTAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression
STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,
More informationChapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationIntroduction to Generalized Linear Models
INTRODUCTION TO STATISTICAL MODELLING TRINITY 00 Introducton to Generalzed Lnear Models I. Motvaton In ths lecture we extend the deas of lnear regresson to the more general dea of a generalzed lnear model
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationRegression with limited dependent variables. Professor Bernard Fingleton
Regresson wth lmted dependent varables Professor Bernard Fngleton Regresson wth lmted dependent varables Whether a mortgage applcaton s accepted or dened Decson to go on to hgher educaton Whether or not
More information1 Binary Response Models
Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More information8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore
8/5/17 Data Modelng Patrce Koehl Department of Bologcal Scences atonal Unversty of Sngapore http://www.cs.ucdavs.edu/~koehl/teachng/bl59 koehl@cs.ucdavs.edu Data Modelng Ø Data Modelng: least squares Ø
More informationPsychology 282 Lecture #24 Outline Regression Diagnostics: Outliers
Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.
More informationStatistics for Business and Economics
Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear
More informationIntroduction to the R Statistical Computing Environment R Programming
Introducton to the R Statstcal Computng Envronment R Programmng John Fox McMaster Unversty ICPSR 2018 John Fox (McMaster Unversty) R Programmng ICPSR 2018 1 / 14 Programmng Bascs Topcs Functon defnton
More informationChapter 15 - Multiple Regression
Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More informationLogistic regression models 1/12
Logstc regresson models 1/12 2/12 Example 1: dogs look lke ther owners? Some people beleve that dogs look lke ther owners. Is ths true? To test the above hypothess, The New York Tmes conducted a quz onlne.
More informationFirst Year Examination Department of Statistics, University of Florida
Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve
More informationLECTURE 9 CANONICAL CORRELATION ANALYSIS
LECURE 9 CANONICAL CORRELAION ANALYSIS Introducton he concept of canoncal correlaton arses when we want to quantfy the assocatons between two sets of varables. For example, suppose that the frst set of
More informationA New Method for Estimating Overdispersion. David Fletcher and Peter Green Department of Mathematics and Statistics
A New Method for Estmatng Overdsperson Davd Fletcher and Peter Green Department of Mathematcs and Statstcs Byron Morgan Insttute of Mathematcs, Statstcs and Actuaral Scence Unversty of Kent, England Overvew
More informationECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics
ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott
More informationThe Geometry of Logit and Probit
The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.
More informationLecture Notes on Linear Regression
Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume
More informationMATH 829: Introduction to Data Mining and Analysis The EM algorithm (part 2)
1/16 MATH 829: Introducton to Data Mnng and Analyss The EM algorthm (part 2) Domnque Gullot Departments of Mathematcal Scences Unversty of Delaware Aprl 20, 2016 Recall 2/16 We are gven ndependent observatons
More informationLecture 6: Introduction to Linear Regression
Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More information4.3 Poisson Regression
of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)
More informationSTAT 3008 Applied Regression Analysis
STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,
More informationOptimization. August 30, 2016
Optmzaton August 30, 2016 Optmzaton problem 1/31 An optmzaton problem s the problem of fndng the best soluton for an objectve functon. Optmzaton method plays an mportant role n statstcs, for example, to
More informationChapter 5 Multilevel Models
Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level
More information3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. CDS Mphil Econometrics Vijayamohan. 3-Mar-14. CDS M Phil Econometrics.
Dummy varable Models an Plla N Dummy X-varables Dummy Y-varables Dummy X-varables Dummy X-varables Dummy varable: varable assumng values 0 and to ndcate some attrbutes To classfy data nto mutually exclusve
More informationBasic R Programming: Exercises
Basc R Programmng: Exercses RProgrammng John Fox ICPSR, Summer 2009 1. Logstc Regresson: Iterated weghted least squares (IWLS) s a standard method of fttng generalzed lnear models to data. As descrbed
More informationx i1 =1 for all i (the constant ).
Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by
More informationSimulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests
Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth
More informationBasic Business Statistics, 10/e
Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Exerments-I MODULE III LECTURE - 2 EXPERIMENTAL DESIGN MODELS Dr. Shalabh Deartment of Mathematcs and Statstcs Indan Insttute of Technology Kanur 2 We consder the models
More informationa. (All your answers should be in the letter!
Econ 301 Blkent Unversty Taskn Econometrcs Department of Economcs Md Term Exam I November 8, 015 Name For each hypothess testng n the exam complete the followng steps: Indcate the test statstc, ts crtcal
More informationParametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010
Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed
More informationStatistics MINITAB - Lab 2
Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that
More informationProperties of Least Squares
Week 3 3.1 Smple Lnear Regresson Model 3. Propertes of Least Squares Estmators Y Y β 1 + β X + u weekly famly expendtures X weekly famly ncome For a gven level of x, the expected level of food expendtures
More informationLecture 16 Statistical Analysis in Biomaterials Research (Part II)
3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan
More informationMaximum Likelihood Estimation
Maxmum Lkelhood Estmaton INFO-2301: Quanttatve Reasonng 2 Mchael Paul and Jordan Boyd-Graber MARCH 7, 2017 INFO-2301: Quanttatve Reasonng 2 Paul and Boyd-Graber Maxmum Lkelhood Estmaton 1 of 9 Why MLE?
More informationSTAT 511 FINAL EXAM NAME Spring 2001
STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationLOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi
LOGIT ANALYSIS A.K. VASISHT Indan Agrcultural Statstcs Research Insttute, Lbrary Avenue, New Delh-0 02 amtvassht@asr.res.n. Introducton In dummy regresson varable models, t s assumed mplctly that the dependent
More informationBasically, if you have a dummy dependent variable you will be estimating a probability.
ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy
More informationMarginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients
ECON 5 -- NOE 15 Margnal Effects n Probt Models: Interpretaton and estng hs note ntroduces you to the two types of margnal effects n probt models: margnal ndex effects, and margnal probablty effects. It
More informationLinear Approximation with Regularization and Moving Least Squares
Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...
More informationModule Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1
UNIVERSITY OF EAST ANGLIA School of Economcs Man Seres PG Examnaton 016-17 ECONOMETRIC METHODS ECO-7000A Tme allowed: hours Answer ALL FOUR Questons. Queston 1 carres a weght of 5%; Queston carres 0%;
More informationGoodness of fit and Wilks theorem
DRAFT 0.0 Glen Cowan 3 June, 2013 Goodness of ft and Wlks theorem Suppose we model data y wth a lkelhood L(µ) that depends on a set of N parameters µ = (µ 1,...,µ N ). Defne the statstc t µ ln L(µ) L(ˆµ),
More informationLogistic Regression Maximum Likelihood Estimation
Harvard-MIT Dvson of Health Scences and Technology HST.951J: Medcal Decson Support, Fall 2005 Instructors: Professor Lucla Ohno-Machado and Professor Staal Vnterbo 6.873/HST.951 Medcal Decson Support Fall
More informationAndreas C. Drichoutis Agriculural University of Athens. Abstract
Heteroskedastcty, the sngle crossng property and ordered response models Andreas C. Drchouts Agrculural Unversty of Athens Panagots Lazards Agrculural Unversty of Athens Rodolfo M. Nayga, Jr. Texas AMUnversty
More informationSTK4080/9080 Survival and event history analysis
SK48/98 Survval and event hstory analyss Lecture 7: Regresson modellng Relatve rsk regresson Regresson models Assume that we have a sample of n ndvduals, and let N (t) count the observed occurrences of
More informationThe EM Algorithm (Dempster, Laird, Rubin 1977) The missing data or incomplete data setting: ODL(φ;Y ) = [Y;φ] = [Y X,φ][X φ] = X
The EM Algorthm (Dempster, Lard, Rubn 1977 The mssng data or ncomplete data settng: An Observed Data Lkelhood (ODL that s a mxture or ntegral of Complete Data Lkelhoods (CDL. (1a ODL(;Y = [Y;] = [Y,][
More informationLINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity
LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have
More informationQUASI-LIKELIHOOD APPROACH TO RATER AGREEMENT PLUS LINEAR BY LINEAR ASSOCIATION MODEL FOR ORDINAL CONTINGENCY TABLES
Journal of Statstcs: Advances n Theory and Applcatons Volume 6, Number, 26, Pages -5 Avalable at http://scentfcadvances.co.n DOI: http://dx.do.org/.8642/jsata_72683 QUASI-LIKELIHOOD APPROACH TO RATER AGREEMENT
More informationChapter 15 Student Lecture Notes 15-1
Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons
More informationOptimization. September 4, 2018
Optmzaton September 4, 2018 Optmzaton problem 1/34 An optmzaton problem s the problem of fndng the best soluton for an objectve functon. Optmzaton method plays an mportant role n statstcs, for example,
More informationLinear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the
Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.
More informationSTATISTICS QUESTIONS. Step by Step Solutions.
STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an
More informationChapter 9: Statistical Inference and the Relationship between Two Variables
Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,
More informationComputation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models
Computaton of Hgher Order Moments from Two Multnomal Overdsperson Lkelhood Models BY J. T. NEWCOMER, N. K. NEERCHAL Department of Mathematcs and Statstcs, Unversty of Maryland, Baltmore County, Baltmore,
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study
More informationInfluence Diagnostics on Competing Risks Using Cox s Model with Censored Data. Jalan Gombak, 53100, Kuala Lumpur, Malaysia.
Proceedngs of the 8th WSEAS Internatonal Conference on APPLIED MAHEMAICS, enerfe, Span, December 16-18, 5 (pp14-138) Influence Dagnostcs on Competng Rsks Usng Cox s Model wth Censored Data F. A. M. Elfak
More informationsince [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation
Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson
More informationEcon107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)
I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes
More informationA Robust Method for Calculating the Correlation Coefficient
A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal
More informationDO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes
25/6 Canddates Only January Examnatons 26 Student Number: Desk Number:...... DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR Department Module Code Module Ttle Exam Duraton
More informationAn R implementation of bootstrap procedures for mixed models
The R User Conference 2009 July 8-10, Agrocampus-Ouest, Rennes, France An R mplementaton of bootstrap procedures for mxed models José A. Sánchez-Espgares Unverstat Poltècnca de Catalunya Jord Ocaña Unverstat
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analyss of Varance and Desgn of Experment-I MODULE VIII LECTURE - 34 ANALYSIS OF VARIANCE IN RANDOM-EFFECTS MODEL AND MIXED-EFFECTS EFFECTS MODEL Dr Shalabh Department of Mathematcs and Statstcs Indan
More informationComparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method
Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method
More informationMACHINE APPLIED MACHINE LEARNING LEARNING. Gaussian Mixture Regression
11 MACHINE APPLIED MACHINE LEARNING LEARNING MACHINE LEARNING Gaussan Mture Regresson 22 MACHINE APPLIED MACHINE LEARNING LEARNING Bref summary of last week s lecture 33 MACHINE APPLIED MACHINE LEARNING
More informationSee Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)
Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes
More information9. Binary Dependent Variables
9. Bnar Dependent Varables 9. Homogeneous models Log, prob models Inference Tax preparers 9.2 Random effects models 9.3 Fxed effects models 9.4 Margnal models and GEE Appendx 9A - Lkelhood calculatons
More informationAdvanced Statistical Methods: Beyond Linear Regression
Advanced Statstcal Methods: Beyond Lnear Regresson John R. Stevens Utah State Unversty Notes 2. Statstcal Methods I Mathematcs Educators Workshop 28 March 2009 1 http://www.stat.usu.edu/~rstevens/pcm 2
More informationHow its computed. y outcome data λ parameters hyperparameters. where P denotes the Laplace approximation. k i k k. Andrew B Lawson 2013
Andrew Lawson MUSC INLA INLA s a relatvely new tool that can be used to approxmate posteror dstrbutons n Bayesan models INLA stands for ntegrated Nested Laplace Approxmaton The approxmaton has been known
More informationLINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity
LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased
More informationx = , so that calculated
Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to
More information2016 Wiley. Study Session 2: Ethical and Professional Standards Application
6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton
More informationP R. Lecture 4. Theory and Applications of Pattern Recognition. Dept. of Electrical and Computer Engineering /
Theory and Applcatons of Pattern Recognton 003, Rob Polkar, Rowan Unversty, Glassboro, NJ Lecture 4 Bayes Classfcaton Rule Dept. of Electrcal and Computer Engneerng 0909.40.0 / 0909.504.04 Theory & Applcatons
More informationComposite Hypotheses testing
Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter
More informationECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2017 Instructor: Victor Aguirregabiria
ECOOMETRICS II ECO 40S Unversty of Toronto Department of Economcs Wnter 07 Instructor: Vctor Agurregabra SOLUTIO TO FIAL EXAM Tuesday, Aprl 8, 07 From :00pm-5:00pm 3 hours ISTRUCTIOS: - Ths s a closed-book
More informationJanuary Examinations 2015
24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)
More informationChapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the
More information/ n ) are compared. The logic is: if the two
STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence
More informationDERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION
Internatonal Worshop ADVANCES IN STATISTICAL HYDROLOGY May 3-5, Taormna, Italy DERIVATION OF THE PROBABILITY PLOT CORRELATION COEFFICIENT TEST STATISTICS FOR THE GENERALIZED LOGISTIC DISTRIBUTION by Sooyoung
More informationANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)
Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of
More informationDurban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications
Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department
More informationLearning Objectives for Chapter 11
Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method
More informationOutlier Detection in Logistic Regression: A Quest for Reliable Knowledge from Predictive Modeling and Classification
Outler Detecton n Logstc egresson: A Quest for elable Knowledge from Predctve Modelng and Classfcaton Abdul Nurunnab, Geoff West Department of Spatal Scences, Curtn Unversty, Perth, Australa CC for Spatal
More informationANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE
P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes
More informationInterval Estimation in the Classical Normal Linear Regression Model. 1. Introduction
ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model
More informationLimited Dependent Variables
Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages
More information