Latent Class Regression
|
|
- Lionel Wiggins
- 6 years ago
- Views:
Transcription
1 Latent Class Regresson Karen Bandeen-Roche October 8, 06
2 Objectves For you to leave here knowng What s the LCR model and ts underlyng assumptons? How are LCR parameters nterpreted? How does one check the assumptons of an LCR model? Latent class regresson analogs to LCA for fttng and dentfablty
3 Motvatng Eample: Fralty Latent trat (IRT) assumes t s contnuous. Latent class model assumes t s dscrete Densty % Robust 80 Intermedate Fral 5 Fral Fralty 3
4 Motvatng Eample: Fralty Is fralty assocated wth age, educaton, and/or dsease burden? Do age, educaton, and/or dsease burden predct heghtened rsk for membershp n some fralty classes as opposed to others? 4
5 Part I: Model
6 Latent Class Regresson (LCR) Recall the standard latent class model : Dscrete latent varables & dscrete ndcator varables Indcators measure dscrete subpopulatons rather than underlyng contnuous scores Patterns of responses are thought to contan nformaton above and beyond aggregaton of responses The predcton goal s clusterng ndvduals rather than contnuous response varables We add structural pece to model where covarates eplan class membershp 6
7 Latent Class Regresson Model Structural Model 7
8 Analyss of underlyng subpopulatons Latent class regresson POPULATION POPULATION U X P P (X) P J P J (X) M J JM M J JM Y Y M Y Y Y M Y M Y Y M Goodman, 974 Dayton & Macready, 988
9 Latent Varable Models Latent Class Regresson (LCR) Model Model: Structural model: f Y A latent polytomous logstc regresson J ( y ) P (, ) π ( π j j M m y mj m y m [ U ] Pr{ U j } P (, ) mj ) ( ) ep J + ep k j j ( ) k Measurement model: π Pr Y U π s MJ { j} mj m [ ] Y U condtonal probabltes 9
10 Structural Model Wth two classes, the latent varable (class membershp) s dchotomous 0
11 Parameter Interpretaton Consder smplest case: classes ( vs. ) Pr( C ) log Pr( C ) or equvalently, + 0 Pr( C log Pr( C s a log odds rato. ) ) + 0 Ths s a latent logstc regresson.
12 Parameter Interpretaton Consder smplest case: classes ( vs. ) log Pr( C Pr( C ) ) + 0 where s a log odds rato. Eample: ep( ) and f female, 0 f male Women have twce the odds of beng n class versus class than men, holdng all else constant
13 3 Solvng for P j ( ) Pr(C j ) 0 ), Pr( ), Pr( log ) ( ) ( log C C p p + + Usng the fact: p ( ) + p ( ), we obtan ), Pr( ) ( ), Pr( ) ( e C p e e C p
14 Parameter Interpretaton General Case (J-)*(p+) s where p number of covarates J-: one class s reference class so all of ts coeffcents are techncally zero p+: for each class (ecept the reference), there s one for each covarate plus another for the ntercept. 4
15 Parameter Interpretaton General Case Need more than one equaton Choose a reference class (e.g. class ) log Pr( C Pr( C,, ) ) e e 3 Pr( C log Pr( C e / e e 3 3 3,, ) ) OR for class versus class for females versus males OR for class 3 versus class for females versus males 3 OR for class 3 versus class for 3 females versus males 5
16 6 Solvng for P j ( ) Pr(C ) 0 ), Pr( ), Pr( log ) ( ) ( log C C P P Where we assume that ), Pr( ) ( j j j j e e e e e C P (Snce class s the reference class)
17 Assumptons Condtonal Independence: gven an ndvdual s class, hs symptoms are ndependent Pr(y m, y r C ) Pr(y m C )* Pr(y r C ) Non-dfferental Measurement: gven an ndvdual s class, covarates are not assocated wth symptoms Pr(y m, C ) Pr(y m C ) 7
18 Condtonal Dependence 8
19 Dfferental Measurement 9
20 Part II: Fttng
21 Model Buldng Step : Get the measurement part rght f y Ft standard latent class model frst. Works: margnalzaton property J [ ] M ( ) y J M ( ) y y y m π π P * π π ( y ) P ( ) dg( ) m m j mj mj j j m j m Bandeen-Roche et al., J Am Statst Assoc., 997 Step : Model buldng as n multple logstc regresson m mj mj
22 Mamum lkelhood estmator Latent Class Regresson Lkelhood Pr( Y y j j m EM Algorthm E-step as before Model Estmaton ) Pr( Y J y,..., ( y M-step polytomous logstc regresson wth posteror probabltes as outcomes, M Y ym p ( ) π mj ( π where p j ( ) y J e j Y mj j e M ) j m y ) M ) Bandeen-Roche et al., J Am Statst Assoc., 997
23 Eample: Fralty of older adults Step. Measurement model Crteron -Class Model 3-Class Model CLASS CLASS NON-FRAIL FRAIL CLASS ROBUST CLASS INTERMEDIATE CLASS 3 FRAIL Weght Loss Weakness Slowness Low Physcal Actvty Ehauston Class Prevalence (%) (Bandeen-Roche et al. 006) 3
24 Step. Structural Model To evaluate the assocatons between fralty and age, educaton, and dsease burden Run latent class regresson whle fng the number of latent classes derved from the LCA Assumng condtonal ndependence and non-dfferental measurement 4
25 Fralty: Structural Model Reference group Non-fral ep(coeff) Varable Coeff OR Coeff SE Coeff Z Coeff+/-.96SE Coeff CI ep(coeff CI) OR CI Age, (0.04,0.33) (.04,.6) Educaton, (-0.466,-0.58) (0.63,0.77) Dseases (0.6,.58) (.86,3.9) Intercept (-.80,-.478) (0.06,0.3) 4 Centered at means Years 3 Number (count) 4 Odds (among the non-fral; rather than odds ratos) 5
26 Fralty: Structural Model Reference group Non-fral Varable Coeff OR Coeff SE Coeff Z Coeff CI OR CI Age, (0.04,0.33) (.04,.6) Educaton, (-0.466,-0.58) (0.63,0.77) Dseases (0.6,.58) (.86,3.9) Intercept (-.80,-.478) (0.06,0.3) 4 Prob odds/(+odds) 0.8/ estmated fral prevalence among those wth mean age and educaton & no dseases 6
27 Fralty: Structural Model Reference group Non-fral Varable Coeff OR Coeff SE Coeff Z Coeff CI OR CI Age, (0.04,0.33) (.04,.6) Educaton, (-0.466,-0.58) (0.63,0.77) Dseases (0.6,.58) (.86,3.9) Intercept (-.80,-.478) (0.06,0.3) 4 we estmate that the odds of beng fral (vs. nonfral) ncreases.4-fold wth each added dsease 7
28 Part III: Evaluatng Ft
29 Model Checkng Analog resdual checkng n lnear regresson IT S CRITICALLY IMPORTANT! Can gve msleadng fndngs f measurement model assumptons are unwarranted Phlosophcal opnon: we learn prmarly by specfyng how smple models fal to ft, not by observng that comple models happen to ft Two types of checkng Whether the model fts (e.g. observed vs. epected) How a model may fal to ft (ASSUMPTIONS) 9
30 Checkng Whether the Model Fts Means ) Do Y s aggregate as epected gven the model? Ø Check the measurement model (LCA) Ø Check whether the measurement model s comparable wthout (LCA) and wth (LCR) covarates 30
31 Checkng Whether the Model Fts ) Do Y s relate to the X s as epected gven the model Idea: focus on one tem at a tme J M ym ( y y,..., YM ym ) Pj ( ) π mj ( mj ) j m P( Y π m ) v If nterested n tem m, gnore ( margnalze over ) other tems: ym P( Y y ) P ( ) π ( π m m J j j mj mj ) ( y m ) 3
32 Comparng Ftted to Observed v Construct the predcted curve by plottng ths probablty versus any gven Add a smooth splne (e.g. use lowess n STATA) to reveal systematc trend (sold lne) v Supermpose t wth an observed tem response curve by Plot tem response (0 or ) by Add smooth splne to reveal systematc trend (dashed lne) 3
33 Checkng How the Model Fals to Ft Check Assumptons condtonal ndependence non-dfferental measurement 33
34 Checkng How the Model Fals to Ft Basc deas: Suppose the model s true If we knew persons latent class membershps, we would check drectly: v Stratfy nto classes, then, wthn classes: v Check correlatons or parwse odds ratos among the tem responses (Condtonal Independence) v Regress tem responses on covarates (non-dfferental measurement) v Regress class membershps on covarates, hope for v Smlar fndngs re regresson coeffcents v No strong effects of outlers v Identfy strongly nonlnear covarates effects 34
35 Checkng How the Model Fals to Ft But n realty, we don t know the true latent class membershp! Latent class membershps must be estmated Randomze people nto pseudo classes usng ther posteror probabltes or assgn to most lkely class correspondng to the hghest posteror probablty Posteror probablty s defned as Pr( C Pr( y C j) Pr( C j ) j, y ) J Pr( y C j) Pr( C j ) j Analyze as descrbed before, ecept usng pseudo class membershp rather than true ones Bandeen-Roche et al., J Am Statst Assoc.,
36 Utlty of Model Checkng May modfy nterpretaton to ncorporate lack of ft/volaton of assumpton May help elucdate a transformaton that that would be more approprate (e.g. log(age) versus age) May suggest how to mprove measurement (e.g. better survey nstrument) May lead to beleve that LCR s not approprate 36
37 Part IV: Identfablty / Estmablty
38 Identfablty Latent class (bnary Y) regresson Latent class analyss (measurement only) ~ multnomal, dm Y ~ ( p ) ( p) M Unconstraned J-class model: M(J-) parameters Need M M( J ) Latent class regresson As above + full-rank X Bandeen-Roche et al, JASA, 997; Huang & Bandeen-Roche, Psychometrka, 004
39 Identfablty To best assure dentfcaton Incorporate a pror theory as much as possble v Set πs to 0 or where t makes sense to do so v Set πs equal to each other If program fals to converge v Run the program longer v Re-ntlze n very dfferent places v Add constrants (e.g. set πs to 0 or where sensble) v Stop (attemptng to do too much wth one s data) 39
40 Objectves For you to leave here knowng What s the LCR model and ts underlyng assumptons? How are LCR parameters nterpreted? How does one check the assumptons of an LCR model? Latent class regresson analogs to LCA for fttng and dentfablty
41 Append Mplus codes 4
42 MPLUS fttng of LCA TITLE: Latent Class Analyss of Fralty Components Usng Combned WHAS I and II Data Age DATA: FILE IS "h:\teachng\40.658\007\lcr.dat"; VARIABLE: NAMES ARE based shrnk weak slow ehaust kcal sweght age educ dsease; USEVARIABLES ARE shrnk weak slow ehaust kcal; MISSING ARE ALL (999999); CATEGORICAL ARE shrnk-kcal; CLASSES fralty(); ANALYSIS: TYPE IS MIXTURE; MODEL: %OVERALL% Declare mssng value code Assgn label fralty to the latent class varable and specfy number of classes %fralty#% [shrnk$* weak$* slow$* ehaust$* kcal$*]; %fralty#% [shrnk$*- weak$*- slow$*- ehaust$*- kcal$*-]; OUTPUT: TECH0 TECH; SAVEDATA: FILE IS "h:\teachng\40.658\00\lcasave.out"; SAVECPROB; Assgn startng values for thresholds (optonal) TECH0: output of observed vs. estmated frequences of response patterns; TECH, TECH4: results of Lo-Mendell-Rubn test and bootstrapped lkelhood rato test for comparng models wth k vs. k- classes Save posteror class probabltes 4
43 Step. Structural Model TECHNICAL OUTPUT VUONG-LO-MENDELL-RUBIN LIKELIHOOD RATIO TEST FOR (H0) VERSUS 3 CLASSES H0 Loglkelhood Value Tmes the Loglkelhood Dfference.48 Dfference n the Number of Parameters 6 Mean Standard Devaton 6.56 P-Value 0.44 LO-MENDELL-RUBIN ADJUSTED LRT TEST Value.78 P-Value 0.50 TECHNICAL 4 OUTPUT PARAMETRIC BOOTSTRAPPED LIKELIHOOD RATIO TEST FOR (H0) VERSUS 3 CLASSES H0 Loglkelhood Value Tmes the Loglkelhood Dfference.48 Dfference n the Number of Parameters 6 Appromate P-Value Successful Bootstrap Draws 67 43
44 MPLUS fttng of LCR TITLE: Latent Class Regresson Analyss of Fralty Components Usng Combned WHAS I and II Data Age DATA: FILE IS "h:\whas\fral\paper\lcr.dat"; VARIABLE: NAMES ARE based shrnk weak slow ehaust kcal sweght age educ dsease; USEVARIABLES ARE shrnk weak slow ehaust kcal age educ dsease; CENTERING GRANDMEAN(age educ); MISSING ARE ALL (999999); CATEGORICAL ARE shrnk-kcal; CLASSES fralty(); Centerng predctors age and educaton for meanngful nterpretaton of ntercept ANALYSIS: TYPE IS MIXTURE; MODEL: %OVERALL% fralty# ON age educ dsease; %fralty#% [shrnk$*3 weak$*3 slow$*3 ehaust$*3 kcal$*3]; %fralty#% [shrnk$*- weak$*- slow$*- ehaust$*- kcal$*-]; Structural regresson model usng Class as the reference group Why have startng values? 44
45 MPLUS Output Categorcal Latent Varables Estmates S.E. Est./S.E. FRAILTY# ON AGE EDUC DISEASE Intercepts FRAILTY# Usng Class as the reference group ALTERNATIVE PARAMETERIZATIONS FOR THE CATEGORICAL LATENT VARIABLE REGRESSION Parameterzaton usng Reference Class FRAILTY# ON AGE EDUC DISEASE Intercepts FRAILTY#
46 Checkng Whether the Model Fts ) Do Y s aggregate as epected gven the model? v Compare observed pattern frequences to predcted pattern frequences 46
47 Fralty Eample: Observed versus Epected Response Patterns: Ignorng Covarates 5 most frequently observed patterns among the non-fral Crteron Weght loss Weak Slow Ehauston Low Actvty Pattern Frequences Observed Epected -class -class 3-class N N N N N N N Y N N N N N N Y N Y N N N N N Y N Y most frequently observed patterns among the fral (Bandeen-Roche et al. 006) N Y Y N Y N Y Y Y Y N N Y Y Y Y Y Y Y Y Y Y Y N Y Latent Class Model Ft statstcs Pearson Ch-Square 568 (p<.000) 4.4 (p.) 3. (p.5) AIC BIC
48 MPLUS Input TITLE: Weghted Latent Class Analyss of Fralty Components Usng Combned WHAS I and II Data Age DATA: FILE IS "C:\teachng\ \lcr.dat"; VARIABLE: NAMES ARE based shrnk weak slow ehaust kcal sweght age educ dsease; USEVARIABLES ARE shrnk weak slow ehaust kcal; MISSING ARE ALL (999999); CATEGORICAL ARE shrnk-kcal; CLASSES fralty(); ANALYSIS: TYPE IS MIXTURE; MODEL: %OVERALL% %fralty#% [shrnk$*- weak$*- slow$*0 ehaust$*- kcal$*-]; %fralty#% [shrnk$* weak$* slow$* ehaust$* kcal$*]; OUTPUT: TECH0 SAVEDATA: FILE IS "C:\teachng\ \lcasave.out"; SAVECPROB; Contans observed vs. epected frequences of response patterns Save estmated posteror probabltes of class membershp 48
49 MPLUS Output TECHNICAL 0 OUTPUT MODEL FIT INFORMATION FOR THE LATENT CLASS INDICATOR MODEL PART RESPONSE PATTERNS No. Pattern No. Pattern No. Pattern No. Pattern Order of response patterns? RESPONSE PATTERN FREQUENCIES AND CHI-SQUARE CONTRIBUTIONS Response Frequency Standardzed Ch-square Contrbuton Pattern Observed Estmated Resdual Pearson Loglkelhood Deleted (z-score) Recall: for a bvarate table, standardzed resdual (O E) / [(E) / *(-E/N) / ] 49
50 Emprcal Testng of Identfablty Must run model more than once usng dfferent startng values to check dentfablty! Mplus nput: ANALYSIS: TYPE IS MIXTURE; STARTS ; STITERATIONS0; Number of ntal stage random sets of startng values and the number of fnal stage optmzatons to use Mamum number of teraton allowng n the ntal stage 50
51 Parameter Constrants: Mplus Eample TITLE: ths s an eample of a LCA wth bnary latent class ndcators and parameter constrants DATA: FILE IS e7.3.dat; VARIABLE: NAMES ARE u-u4; CLASSES c (); CATEGORICAL u-u4; ANALYSIS: TYPE MIXTURE; MODEL: %OVERALL% %c#% [u$*-]; [u$-u3$*-] (); [u4$*-] (p); %c#% [u$@-5]; [u$-u3$*] (); [u4$*] (p); MODEL CONSTRAINT: p - p; OUTPUT: TECH TECH8; u and u3 have same π n class u has π equal to n class u and u3 have same π n class The threshold of u4 n class s equal to -*threshold of u4 n class (.e. same error rate) MPLUS User s Gude p. 53 5
Lecture 6: Introduction to Linear Regression
Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an
More informationLecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding
Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study
More informationIntroduction to Regression
Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes
More informationChapter 11: Simple Linear Regression and Correlation
Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests
More informationPredictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.
More information1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands
Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Maxmum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models
More informationCIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M
CIS56: achne Learnng Lecture 3 (Sept 6, 003) Preparaton help: Xaoyng Huang Lnear Regresson Lnear regresson can be represented by a functonal form: f(; θ) = θ 0 0 +θ + + θ = θ = 0 ote: 0 s a dummy attrbute
More informationMaximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models
ECO 452 -- OE 4: Probt and Logt Models ECO 452 -- OE 4 Mamum Lkelhood Estmaton of Bnary Dependent Varables Models: Probt and Logt hs note demonstrates how to formulate bnary dependent varables models for
More informationChapter 9: Statistical Inference and the Relationship between Two Variables
Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,
More informationMultinomial logit regression
07/0/6 Multnomal logt regresson Introducton We now turn our attenton to regresson models for the analyss of categorcal dependent varables wth more than two response categores: Y car owned (many possble
More informationChapter 13: Multiple Regression
Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to
More informationDiagnostics in Poisson Regression. Models - Residual Analysis
Dagnostcs n Posson Regresson Models - Resdual Analyss 1 Outlne Dagnostcs n Posson Regresson Models - Resdual Analyss Example 3: Recall of Stressful Events contnued 2 Resdual Analyss Resduals represent
More informationsince [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation
Econ 388 R. Butler 204 revsons Lecture 4 Dummy Dependent Varables I. Lnear Probablty Model: the Regresson model wth a dummy varables as the dependent varable assumpton, mplcaton regular multple regresson
More informationBIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data
Lab : TWO-LEVEL NORMAL MODELS wth school chldren popularty data Purpose: Introduce basc two-level models for normally dstrbuted responses usng STATA. In partcular, we dscuss Random ntercept models wthout
More informationComparison of Regression Lines
STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence
More informationNegative Binomial Regression
STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...
More informationLimited Dependent Variables
Lmted Dependent Varables. What f the left-hand sde varable s not a contnuous thng spread from mnus nfnty to plus nfnty? That s, gven a model = f (, β, ε, where a. s bounded below at zero, such as wages
More informationLab 4: Two-level Random Intercept Model
BIO 656 Lab4 009 Lab 4: Two-level Random Intercept Model Data: Peak expratory flow rate (pefr) measured twce, usng two dfferent nstruments, for 17 subjects. (from Chapter 1 of Multlevel and Longtudnal
More informationStatistics for Economics & Business
Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable
More informationSTAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression
STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,
More information28. SIMPLE LINEAR REGRESSION III
8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted
More information1 Binary Response Models
Bnary and Ordered Multnomal Response Models Dscrete qualtatve response models deal wth dscrete dependent varables. bnary: yes/no, partcpaton/non-partcpaton lnear probablty model LPM, probt or logt models
More informationLogistic Regression Maximum Likelihood Estimation
Harvard-MIT Dvson of Health Scences and Technology HST.951J: Medcal Decson Support, Fall 2005 Instructors: Professor Lucla Ohno-Machado and Professor Staal Vnterbo 6.873/HST.951 Medcal Decson Support Fall
More informationLecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management
Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =
More information18. SIMPLE LINEAR REGRESSION III
8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.
More informationReminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1
Lecture 9: Interactons, Quadratc terms and Splnes An Manchakul amancha@jhsph.edu 3 Aprl 7 Remnder: Nested models Parent model contans one set of varables Extended model adds one or more new varables to
More informationLecture 3 Stat102, Spring 2007
Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture
More informationCorrelation and Regression. Correlation 9.1. Correlation. Chapter 9
Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationStatistics MINITAB - Lab 2
Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that
More informationLogistic Regression. CAP 5610: Machine Learning Instructor: Guo-Jun QI
Logstc Regresson CAP 561: achne Learnng Instructor: Guo-Jun QI Bayes Classfer: A Generatve model odel the posteror dstrbuton P(Y X) Estmate class-condtonal dstrbuton P(X Y) for each Y Estmate pror dstrbuton
More informationMarginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients
ECON 5 -- NOE 15 Margnal Effects n Probt Models: Interpretaton and estng hs note ntroduces you to the two types of margnal effects n probt models: margnal ndex effects, and margnal probablty effects. It
More informationLearning Objectives for Chapter 11
Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method
More information3/3/2014. CDS M Phil Econometrics. Vijayamohanan Pillai N. CDS Mphil Econometrics Vijayamohan. 3-Mar-14. CDS M Phil Econometrics.
Dummy varable Models an Plla N Dummy X-varables Dummy Y-varables Dummy X-varables Dummy X-varables Dummy varable: varable assumng values 0 and to ndcate some attrbutes To classfy data nto mutually exclusve
More informationChapter 14 Simple Linear Regression
Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng
More informationLecture Nov
Lecture 18 Nov 07 2008 Revew Clusterng Groupng smlar obects nto clusters Herarchcal clusterng Agglomeratve approach (HAC: teratvely merge smlar clusters Dfferent lnkage algorthms for computng dstances
More information/ n ) are compared. The logic is: if the two
STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence
More informationLinear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the
Chapter 11 Student Lecture Notes 11-1 Lnear regresson Wenl lu Dept. Health statstcs School of publc health Tanjn medcal unversty 1 Regresson Models 1. Answer What Is the Relatonshp Between the Varables?.
More informationExpectation Maximization Mixture Models HMMs
-755 Machne Learnng for Sgnal Processng Mture Models HMMs Class 9. 2 Sep 200 Learnng Dstrbutons for Data Problem: Gven a collecton of eamples from some data, estmate ts dstrbuton Basc deas of Mamum Lelhood
More informationJanuary Examinations 2015
24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)
More informationBasic Business Statistics, 10/e
Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson
More informationThe Geometry of Logit and Probit
The Geometry of Logt and Probt Ths short note s meant as a supplement to Chapters and 3 of Spatal Models of Parlamentary Votng and the notaton and reference to fgures n the text below s to those two chapters.
More informationSee Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)
Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes
More informationCS 2750 Machine Learning. Lecture 5. Density estimation. CS 2750 Machine Learning. Announcements
CS 750 Machne Learnng Lecture 5 Densty estmaton Mlos Hauskrecht mlos@cs.ptt.edu 539 Sennott Square CS 750 Machne Learnng Announcements Homework Due on Wednesday before the class Reports: hand n before
More informationMixture o f of Gaussian Gaussian clustering Nov
Mture of Gaussan clusterng Nov 11 2009 Soft vs hard lusterng Kmeans performs Hard clusterng: Data pont s determnstcally assgned to one and only one cluster But n realty clusters may overlap Soft-clusterng:
More informationLaboratory 3: Method of Least Squares
Laboratory 3: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly they are correlated wth
More informationLaboratory 1c: Method of Least Squares
Lab 1c, Least Squares Laboratory 1c: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly
More informationChapter 15 - Multiple Regression
Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term
More informationSpace of ML Problems. CSE 473: Artificial Intelligence. Parameter Estimation and Bayesian Networks. Learning Topics
/7/7 CSE 73: Artfcal Intellgence Bayesan - Learnng Deter Fox Sldes adapted from Dan Weld, Jack Breese, Dan Klen, Daphne Koller, Stuart Russell, Andrew Moore & Luke Zettlemoyer What s Beng Learned? Space
More informationWeb Appendix B Estimation. We base our sampling procedure on the method of data augmentation (e.g., Tanner and Wong,
Web Appendx B Estmaton Lkelhood and Data Augmentaton We base our samplng procedure on the method of data augmentaton (eg anner and Wong 987) here e treat the unobserved ndvdual choces as parameters Specfcally
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables
More informationStatistics for Business and Economics
Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear
More informatione i is a random error
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown
More informationQiong (Joan) Wu Harvard Center for Population and Development Studies. INDEPTH-SAGE WORKSHOP April 20, 2010
Qong Joan Wu Harvard Center for Populaton and Development Studes INDEPTH-SAGE WORKSHOP Aprl 20, 2010 1 IRT vs Classcal test theory CTT CTT: focuses test scores observed score = true score + error O=T+E
More informationChapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.
Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the
More informationINF 5860 Machine learning for image classification. Lecture 3 : Image classification and regression part II Anne Solberg January 31, 2018
INF 5860 Machne learnng for mage classfcaton Lecture 3 : Image classfcaton and regresson part II Anne Solberg January 3, 08 Today s topcs Multclass logstc regresson and softma Regularzaton Image classfcaton
More informationx i1 =1 for all i (the constant ).
Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by
More informationChapter 8 Indicator Variables
Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n
More informationLinear Regression Analysis: Terminology and Notation
ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented
More informationThe Relationship between Factor Analytic and Item Response Models
The Relatonshp between Factor Analytc and Item Response Models Akhto Kamata Department of Educaton Polcy and Leadershp Department of Psychology Center on Research and Evaluaton Southern Methodst Unversty
More informationPolynomial Regression Models
LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance
More informationStatistics II Final Exam 26/6/18
Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the
More informationComposite Hypotheses testing
Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter
More informationHomework Assignment 3 Due in class, Thursday October 15
Homework Assgnment 3 Due n class, Thursday October 15 SDS 383C Statstcal Modelng I 1 Rdge regresson and Lasso 1. Get the Prostrate cancer data from http://statweb.stanford.edu/~tbs/elemstatlearn/ datasets/prostate.data.
More informationStatistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation
Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear
More informationAdvanced Statistical Methods: Beyond Linear Regression
Advanced Statstcal Methods: Beyond Lnear Regresson John R. Stevens Utah State Unversty Notes 2. Statstcal Methods I Mathematcs Educators Workshop 28 March 2009 1 http://www.stat.usu.edu/~rstevens/pcm 2
More informationResource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis
Resource Allocaton and Decson Analss (ECON 800) Sprng 04 Foundatons of Regresson Analss Readng: Regresson Analss (ECON 800 Coursepak, Page 3) Defntons and Concepts: Regresson Analss statstcal technques
More informationLecture Notes on Linear Regression
Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume
More informationUnit 10: Simple Linear Regression and Correlation
Unt 10: Smple Lnear Regresson and Correlaton Statstcs 571: Statstcal Methods Ramón V. León 6/28/2004 Unt 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regresson analyss s a method for studyng the
More informationY = β 0 + β 1 X 1 + β 2 X β k X k + ε
Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty
More informationExercise 1 The General Linear Model : Answers
Eercse The General Lnear Model Answers. Gven the followng nformaton on 67 pars of values on and -.6 - - - 9 a fnd the OLS coeffcent estmate from a regresson of on. Usng b 9 So. 9 b Suppose that now also
More informationScientific Question Determine whether the breastfeeding of Nepalese children varies with child age and/or sex of child.
Longtudnal Logstc Regresson: Breastfeedng of Nepalese Chldren PART II GEE models (margnal, populaton average) covered last lab Random Intercept models (subject specfc) Transton models Scentfc Queston Determne
More informationSome basic statistics and curve fitting techniques
Some basc statstcs and curve fttng technques Statstcs s the dscplne concerned wth the study of varablty, wth the study of uncertanty, and wth the study of decsonmakng n the face of uncertanty (Lndsay et
More informationLinear Feature Engineering 11
Lnear Feature Engneerng 11 2 Least-Squares 2.1 Smple least-squares Consder the followng dataset. We have a bunch of nputs x and correspondng outputs y. The partcular values n ths dataset are x y 0.23 0.19
More informationDummy variables in multiple variable regression model
WESS Econometrcs (Handout ) Dummy varables n multple varable regresson model. Addtve dummy varables In the prevous handout we consdered the followng regresson model: y x 2x2 k xk,, 2,, n and we nterpreted
More informationThe Ordinary Least Squares (OLS) Estimator
The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal
More informationSinger & Willett, 2003 October 13, 2003
Snger & Wllett, October, Dong Data Analyss n n the the Multlevel Model for for Change Judy Snger & John Wllett Harvard Unversty Graduate School of Educaton What What we we wll wll cover? cover? Composte
More informationPubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II
PubH 7405: REGRESSION ANALSIS SLR: INFERENCES, Part II We cover te topc of nference n two sessons; te frst sesson focused on nferences concernng te slope and te ntercept; ts s a contnuaton on estmatng
More informationHomework 9 STAT 530/J530 November 22 nd, 2005
Homework 9 STAT 530/J530 November 22 nd, 2005 Instructor: Bran Habng 1) Dstrbuton Q-Q plot Boxplot Heavy Taled Lght Taled Normal Skewed Rght Department of Statstcs LeConte 203 ch-square dstrbuton, Telephone:
More informationHere is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)
Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,
More informationDurban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications
Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department
More information4.1. Lecture 4: Fitting distributions: goodness of fit. Goodness of fit: the underlying principle
Lecture 4: Fttng dstrbutons: goodness of ft Goodness of ft Testng goodness of ft Testng normalty An mportant note on testng normalty! L4.1 Goodness of ft measures the extent to whch some emprcal dstrbuton
More informationSTAT 3008 Applied Regression Analysis
STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,
More information[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.
PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton
More informationChapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2
Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters
More informationSIMPLE LINEAR REGRESSION
Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two
More information8/25/17. Data Modeling. Data Modeling. Data Modeling. Patrice Koehl Department of Biological Sciences National University of Singapore
8/5/17 Data Modelng Patrce Koehl Department of Bologcal Scences atonal Unversty of Sngapore http://www.cs.ucdavs.edu/~koehl/teachng/bl59 koehl@cs.ucdavs.edu Data Modelng Ø Data Modelng: least squares Ø
More informationCredit Card Pricing and Impact of Adverse Selection
Credt Card Prcng and Impact of Adverse Selecton Bo Huang and Lyn C. Thomas Unversty of Southampton Contents Background Aucton model of credt card solctaton - Errors n probablty of beng Good - Errors n
More informationGlobal Sensitivity. Tuesday 20 th February, 2018
Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values
More informationNumber of cases Number of factors Number of covariates Number of levels of factor i. Value of the dependent variable for case k
ANOVA Model and Matrx Computatons Notaton The followng notaton s used throughout ths chapter unless otherwse stated: N F CN Y Z j w W Number of cases Number of factors Number of covarates Number of levels
More informationRegression with limited dependent variables. Professor Bernard Fingleton
Regresson wth lmted dependent varables Professor Bernard Fngleton Regresson wth lmted dependent varables Whether a mortgage applcaton s accepted or dened Decson to go on to hgher educaton Whether or not
More informationDepartment of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6
Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.
More informationSTATISTICS QUESTIONS. Step by Step Solutions.
STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to
More informationBasically, if you have a dummy dependent variable you will be estimating a probability.
ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours
UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x
More informationLECTURE 9 CANONICAL CORRELATION ANALYSIS
LECURE 9 CANONICAL CORRELAION ANALYSIS Introducton he concept of canoncal correlaton arses when we want to quantfy the assocatons between two sets of varables. For example, suppose that the frst set of
More informationLatent Class Analysis. October 9 and 11, 2006 Statistics for Psychosocial Research
Latent Class Analss October 9 and 11, 2006 Statstcs for Pschosocal Research Introducton: Tpes of Latent Varable Models Latent Varable Observed Varables Contnuous Contnuous Factor analss Categorcal Latent
More informationThis column is a continuation of our previous column
Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard
More information