Introducing Generalized Linear Models: Logistic Regression

Size: px
Start display at page:

Download "Introducing Generalized Linear Models: Logistic Regression"

Transcription

1 Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and extension of the linear model for investigating outcomes that are categorical (e.g., dichotomous, ordinal, multinomial, counts). The model has three components that are important to consider: A random component that specifies the variance in terms of the mean ( ), or expected value, of Y for individual i in group j; A link function, often represented as g(), which converts the expected values ( ) of Y to transformed predicted values of [ f ( ) ]. The Greek letter (eta) is typically used to denote the transformed linear predictor. The link function therefore provides the relationship between the linear predictor and the mean of the distribution function; and A structural component that relates the transformed predictor of Y to a set of predictors. One type of GLM that can be applied in various situations is the logistic regression model. It can be applied to dichotomous dependent variables (e.g., pass/fail, stay/leave) and can also be extended to consider a dependent variable with several categories (e.g., manager, clerical, custodian). The categories may be ordered (as in ordinal regression) or unordered. In this latter case (called multinomial logistic regression), each category is compared to a reference category (i.e., the last category is the default in SPSS). Logistic regression can also readily accommodate interactions among predictors (e.g., gender x ethnicity). It is important to note that extremely high correlations between predictors may lead to multicollinearity. One fix is to eliminate one of the variables in multicollinear pairs from the model. The Model Ordinary least squares regression cannot be correctly applied to data where the outcome is categorical (e.g., two categories coded 0 and 1) or ordinal (i.e., with a restricted range). In the case of a dichotomous dependent variable (Y), for example, linear models with nonzero slopes will eventually predict values of Y greater than 1 or less than 0. The problem is that when an outcome can only be a 0 or a 1, it will not be well described by a line since there is a limit on values that describe someone s outcomes (0 = dropout, 1 = persist). Predictions will break down Summer 2012 Quantitative Methods Series at Portland State University

2 Introducing Generalized Linear Models: Logistic Regression 2 at the boundaries (0 or 1). Let s suppose we use students socioeconomic status (SES) to predict their likelihood to persist versus drop out. Any linear model with a nonzero slope for the effect of SES on probability of persisting will eventually predict values of Y greater than 1 or less than 0 (e.g., 1.5 or -0.5), yet these are impossible predictions since by definition the values for Y can only be 0 or 1. Moreover, for dichotomous outcomes, the distribution of the errors from predicting whether someone persists or not will be nonrandom (since they can only have two values) and, therefore, they cannot normally distributed (i.e., violating a basic assumption of the linear model). Contrast this with a linear relationship, for example, predicting someone s math test score, from their family income, where one family income might be $25,000 while another has a family income of $250,000. The regression line will extend along the Y axis and X axis well beyond the limits of 0 and 1, and the prediction errors will also tend to be random and normally distributed. Because of these problems, we need a nonlinear model that can transform predicted values of Y to lie within the boundaries of 0 and 1 (Hamilton, 1992). Logistic regression emphasizes the probability of a particular outcome occurring, given an individual s pattern of responses to a set of independent variables. This relationship between independent variables and categories is nonlinear (i.e., not constant across levels of the independent variable) with respect to the probabilities of an outcome being either 0 or 1. The logic transformation is typically a type of S- curve. We might standardize income (X) such as in the following graph. Figure 1. S-curve graph

3 Introducing Generalized Linear Models: Logistic Regression 3 Logit Coefficient While the logistic regression model is nonlinear for probabilities (due to properties of its variation between 0 and 1), it is linear with respect to the logit coefficients. The probability of an event occurring (e.g., π = 1 versus not occurring) can then be written as follows: PY ( 1) 1 By taking the natural logarithm of the odds of the event occurring versus not occurring, we obtain a logit coefficient (η). The logit coefficient then can be predicted as a linear function of the following: η = log(π/(1-π)) = β 0 + β 1 X 1 + β 2 X β 2 X p If we take the natural log of the probability π = 1/(1- π), we can interpret the effect of a predictor as the change in the log odds of η associated with a one-unit change in the independent variable. So if we had a simple model predicting the log odds of persisting using only female, the equation might look like this: η = (female) The coefficient for female is 0.76, which suggests that as gender changes from 0 to 1 (i.e., male coded 0 to female coded 1), the log odds of persisting increase by 0.76 units. The intercept can be interpreted as the value for Y when all the X predictors are equal to zero. In this case, for a man, the log odds of persisting would then be 0.06 [ (0)]. Notice also that there is no error term for a logistic model. This is because in the binomial probability distribution the variance is related to the mean proportion, so it cannot be estimated separately. We can use the inverse of the logit link function to obtain the probability of persisting 1. 1 e In this case, the probability of persisting for a man would be: 1/[1 + e -(0.06) ] = 0.51, where e is value of the natural log, which is approximately

4 Introducing Generalized Linear Models: Logistic Regression 4 Since the estimate for a male s probability of persisting is just about 0.50 (or half the time), our knowledge that an individual is male is not much better than flipping a coin in terms of knowing whether or not a particular male individual will persist. If the estimate were considerably below 0.5, then we would predict that the individual would be unlikely to persist. For a female, however, the log odds of persisting are much better. The estimated log odds would be 0.82 [ (1)]. The probability of persisting would then be 1/[1 + e -(0.82) ] = 0.69 So, since the estimate is considerably above 0.5, we would predict that a given female is likely to persist. Odds Ratio People generally interpret the odds ratio (i.e., the increase or decrease in the odds of being in one outcome category when the predictor increases by one unit). Consider the case where we flip a coin. The ratio of the odds of it coming up heads or tails is the same = 1.50 Therefore, when the events are equally likely to occur we say the ratio of their odds is equal to 1 (or equally likely). The odds ratio (e β ) can be expressed as follows: P( ) P(1 ) = 0 1 X 1 2 X 2... qxq e This ought to look somewhat similar to the log odds equation. The odds ratio for a particular predictor variable is defined as e β, where β is the logit coefficient estimate for the predictor and e is the natural log. If β is zero, the odds ratio will equal 1 (i.e., since any number to the 0 power is 1), which leaves the odds unchanged. If β is positive, the odds ratio will be greater than 1, which means the odds are increased. If β is negative, the coefficient will be less than 1, which means the odds are decreased. Consider the case previous case where the relationship of female (coded male = 0, female = 1) to persisting to graduation is examined. If the odds ratio [also abbreviated as Exp(β)] is 2.5, it means as gender changes from male to female (i.e., a unit change in the independent variable),

5 Introducing Generalized Linear Models: Logistic Regression 5 the odds of persisting versus dropping out are multiplied by a factor of 2.5. In contrast, consider the case where the odds ratio is Suppose we reversed the coding of gender (females = 0, males = 1), then it would imply that the odds of persisting diminish for males by a factor of 0.40 compared with females. These two odds ratios can be shown to be equivalent by division (1/0.40 = 2.5 and 1/2.5 = 0.40). Model Fitting Logistic models are commonly estimated with maximum likelihood rather than ordinary least squares (as in multiple regression). Estimates are sought that yield the highest possible values for the likelihood function. The equations are nonlinear in that the parameters cannot be solved directly (as in OLS regression). Instead, an iterative procedure is used, where successively better approximates are found that satisfy the log likelihood function (Hamilton, 1992). Nested Models Successive logistic regression models can be evaluated against each other. Successive models (called nested models when all of the elements of the smaller model are also in the bigger model) can be evaluated in logistic regression by comparing them against a baseline model using the change in their log likelihoods. We compare the deviance (or -2*log likelihood) of each model. 2 The difference in models is distributed as with degrees of freedom equal to the difference in the number of parameters estimated between the two models. For example, if one is evaluating a model with three predictors and chooses to add another predictor, the change in model fit can be assessed. For one degree of freedom, a significant improvement (at p =.05) in model fit would result from a chi-square change of The likelihood ratio test (G 2 ) is defined as the following: G 2 = 2[(log-likelihood for bigger model)- (log likelihood for smaller model)] So for example, if the log-likelihood of the smaller (3 predictors) is and for the bigger model (4 predictors) is , the equation looks like this: G 2 = 2[(-8.470) - ( )] = 2.71 Because the required chi square for 1degree of freedom (df) is 3.84, we would generally conclude that the model with three predictors is preferred over the other model, since adding the fourth predictor did not improve the model s fit significantly.

6 Introducing Generalized Linear Models: Logistic Regression 6 Common Problems A number of problems can occur if there are too few cases relative to the number of predictors in the model. It is likely that the model will produce some very large parameter estimates and large standard errors. This can be a result of having combinations of variables that produce too many cells with no cases in the cells. The model may also fail to converge on a solution. Sometimes this can be fixed by collapsing the categories (i.e., making fewer categories) or by eliminating independent variables that are not important. A Two-Level GLM Model The single-level model can easily be extended to a mixed model (GLMM). We start here by introducing the basic specification for a two-level model. For a two-level model, we include subscript i to represent an individual nested within a level-2 unit designated by subscript j. The level-1 model for individual i nested in group j is of the general form: x, where x is a (p + 1) x 1 vector of predictors for the linear predictor of Y and is a vector of corresponding regression coefficients. Notice there is no error term at level 1. An appropriate link function is then used to link the expected value of Y to. In this case, we will use the logistic link function. log X X X 1 0 j q q At level 2, the level-1 coefficients qj can become an outcome variable. Following Raudenbush et al. (2004), a generic structural model can be denoted as follows: W W W u, qj q0 q1 1j q2 2 j qs Sqj qj where qs (q = 0,1,,S q ) are the level-2 coefficients, W q q Sj. are level-2 predictors, and uqj are level-2 random effects. You can see that the level-2 model (and successive levels) are specified the same as a model with continuous outcomes. It is only the level-1 model that is different.

7 Introducing Generalized Linear Models: Logistic Regression 7 We use the GENLIN MIXED program (starting in SPSS Version 19) to build an example twolevel model to examine students proficiency in reading. In samples where individuals are clustered in higher order social groupings (e.g., a department, a school, or some other type of organization), simple random sampling does not hold because individuals clustered in groups will tend to be similar in various ways. For example, they attend schools with particular student compositions, expectations for student academic achievement, and curricular organization and instructional strategies. If the clustering of students is ignored, it is likely that bias will be introduced in estimating model parameters. As has been noted previously, where there are clustered data, it is likely that there is a distribution of both intercepts and slopes around their respective average fixed effects. In this situation, we might wish to investigate the random variability in intercepts and slopes across the sample of higher level units in the study. Once we determine that variation exists in the parameters of interest across groups, we can build lelvel-2 models to explain this variation. In some cases, we may have a specific theoretical model in mind that we wish to test, while in other cases, we might be trying to explore possible new mechanisms that explain this observed variation in parameters across groups. We have a data set with 7,009 high school students in 988 schools. We wish to determine what might affect their likelihood to be proficient in math. Within schools we have background variables associated with student SES, grade point average, and whether they were in a primarily college prep curricular program or a more advanced high school program. Between schools, we have a student composition variable and a variable describing the academic focus of the school. No Predictors Model We can begin with a no predictors model. At level 1, the unconditional model to relate the transformed predicted values to an intercept parameter is defined as follows: log 1 0 j. We note again there is no separate level-1 residual variance term for models with categorical outcomes. The level-2 model will simply be the following: u. 0 j 00 0 j

8 Introducing Generalized Linear Models: Logistic Regression 8 Through substitution, the combined single equation is the following: u, 00 0 j which suggests there are two parameters to be estimated (the intercept and random level-2 effect). Here we can see the estimated log odds of being proficient is Table 1: Fixed Effects 95% Confidence Interval for Exp(Coefficient) Model Term Coefficients Std. Error T Sig. Exp(Coefficient) Lower Upper Intercept Probability distribution: Binomial; Link function: Logit Notice the look of the tables is a bit different in the GENLIN MIXED program (i.e., it requires using the computer mouse to open up each part of the output and then converting heat maps to tables). I have used table template I created to present the relevant output. Table 2: mathprof2 We can see in the above table that most students are likely to be proficient in math. The percentage of proficient students at the individual level is about 66.4%. We can calculate the odds of being proficient from the fixed effects table above as the following: 1 1 = 1 e (.684) 1 e The resulting proficiency level averaged across schools is about 0.67 [1/( ) = 0.67]. This is a little different since it is the average unit estimated proficiency level, rather than the average for the population. If we look at the odds ratio (1.98), we can say students are about 2:1 more

9 Introducing Generalized Linear Models: Logistic Regression 9 likely to be proficient than not proficient. If we made this a proportion, it would be something like 0.667/0.333 = 2. Variance Components The level 2 variability suggests that the probability varies significantly across schools (Z = 8.144, p <.01). Table 3: Variance Components Random and Residual Effects Estimate Std. Error Z-test Significance 95% Confidence Interval for Exp(Coefficient) Lower Upper Random Var(Intercept) a Residual b 1.00 a Covariance Structure: Variance components; Subject specification: schcode b Covariance Structure: Scaled Identity; Subject Specification: none We can notice also that the variability at level 1 (Residual) is scaled to 1.0. This is because the variance at level 1 is tied to the population proportion of individuals who are proficient, so it cannot be estimated separately from the mean. Instead, it is simply scaled to 1.0 to provide a metric for the log odds scale. Despite the scaling to 1.0, an intraclass correlation can be estimated describing the proportion of variance that lies between units ( ) relative to the total variance (i.e., ). Between The variance of a logistic distribution with scale factor 1.0 is can calculate an ICC ( ) as follows: Between Within 2 /3 3.29(Hox, 2002), so we /( 3.29 ). 2 2 Between Between Within In this case, it will be 0.386/( ), or 0.386/3.676, which is This suggests about 10.5% of the variance in math proficiency lies between schools. Individual Predictors We might decide to go ahead and build a multilevel model. We can interpret the intercept as the log odds of being proficient when all the other variables are 0. In this case, SES and GPA are

10 Introducing Generalized Linear Models: Logistic Regression 10 standardized (Mean = 0, SD = 1). The intercept is therefore the log odds of being proficient when the student is not in the more advanced curricular program (coded 0). Such a person is about 2.02 times more likely to be proficient as not proficient (Odds Ratio = 2.02, p <.01). Table 4: Fixed Effects Model Term Coefficient Std. Error T Sig. Exp(Coefficient) 95% Confidence Interval for Exp(Coefficient) Lower Upper Intercept ses gpa acprog= acprog=0 0 a Probability distribution: Binomial; Link function: Logit a This coefficient is set to zero because it is redundant. Increasing student SES by 1-SD (since SES is standardized with mean = 0 and standard deviation =1) would result in an increase in predicted log odds of being proficient of 0.527, other variables being held constant. Alternatively, we can say that the odds of such an individual being proficient increase by a factor of compared with individuals at the mean for SES. We can see some support for the view that being in the stronger academic program also increases the log odds of being proficient in math (0.137, p <.07).

11 Introducing Generalized Linear Models: Logistic Regression 11 Adding School Predictors We can see in the following table that when we add the two school-level predictors, only student composition is significantly related to likelihood to be proficient (log odds = 0.541, p <.01). Increasing student SES composition by 1-SD would increase the log odds of being proficient by units. Table 5: Fixed Effects Model Term Coefficient Std. Error T Sig. Exp(Coefficient) 95% Confidence Interval for Exp(Coefficient) Lower Upper Intercept Ses Gpa acprog= acprog=0 0 a acadfocus studentcomp Probability distribution: Binomial; Link function: Logit a This coefficient is set to zero because it is redundant. In terms of odds ratios, increasing composition by 1-SD (since it is also standardized) would increase the odds of being proficient by a factor of We should note that because odds ratios are multiplicative rather than additive, if we increased student composition by 2-SD, the resulting odds of persisting would increase by a factor of To obtain this new estimated odds ratio, we can first add the log odds (which are the exponents) and then exponentiate the new log odds: e = e = 2.95 We can also obtain this result by multiplying the odds ratios (1.717*1.717 = 2.95). The odds ratios should not be added, however ( = 3.434). So at 2-SD above the grand mean, the odds of being proficient would be increased by a factor of 2.95, or approximately 3 to 1. We could, of course, extend our analysis to examine a random slope such as the SES-proficiency slope, or perhaps the impact of being in the more advanced academic program might vary across schools. We can also extend the basic two-level model to three-level models. Similarly, we could extend this basic dichotomous cross-sectional model to represent a longitudinal model looking at the likelihood of being proficient at different points in time (e.g., from 9 th through 12 th grades). We cover a number of other type of models with categorical outcomes in our new workbook on categorical models using IBM SPSS.

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012

Ronald Heck Week 14 1 EDEP 768E: Seminar in Categorical Data Modeling (F2012) Nov. 17, 2012 Ronald Heck Week 14 1 From Single Level to Multilevel Categorical Models This week we develop a two-level model to examine the event probability for an ordinal response variable with three categories (persist

More information

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While

More information

Ron Heck, Fall Week 3: Notes Building a Two-Level Model

Ron Heck, Fall Week 3: Notes Building a Two-Level Model Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level

More information

Additional Notes: Investigating a Random Slope. When we have fixed level-1 predictors at level 2 we show them like this:

Additional Notes: Investigating a Random Slope. When we have fixed level-1 predictors at level 2 we show them like this: Ron Heck, Summer 01 Seminars 1 Multilevel Regression Models and Their Applications Seminar Additional Notes: Investigating a Random Slope We can begin with Model 3 and add a Random slope parameter. If

More information

Investigating Models with Two or Three Categories

Investigating Models with Two or Three Categories Ronald H. Heck and Lynn N. Tabata 1 Investigating Models with Two or Three Categories For the past few weeks we have been working with discriminant analysis. Let s now see what the same sort of model might

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review

More information

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!

Hierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter

More information

Single-level Models for Binary Responses

Single-level Models for Binary Responses Single-level Models for Binary Responses Distribution of Binary Data y i response for individual i (i = 1,..., n), coded 0 or 1 Denote by r the number in the sample with y = 1 Mean and variance E(y) =

More information

Review of Multiple Regression

Review of Multiple Regression Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate

More information

Class Notes. Examining Repeated Measures Data on Individuals

Class Notes. Examining Repeated Measures Data on Individuals Ronald Heck Week 12: Class Notes 1 Class Notes Examining Repeated Measures Data on Individuals Generalized linear mixed models (GLMM) also provide a means of incorporang longitudinal designs with categorical

More information

2/26/2017. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

2/26/2017. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 When and why do we use logistic regression? Binary Multinomial Theory behind logistic regression Assessing the model Assessing predictors

More information

Generalized Linear Models for Non-Normal Data

Generalized Linear Models for Non-Normal Data Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture

More information

Generalized logit models for nominal multinomial responses. Local odds ratios

Generalized logit models for nominal multinomial responses. Local odds ratios Generalized logit models for nominal multinomial responses Categorical Data Analysis, Summer 2015 1/17 Local odds ratios Y 1 2 3 4 1 π 11 π 12 π 13 π 14 π 1+ X 2 π 21 π 22 π 23 π 24 π 2+ 3 π 31 π 32 π

More information

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised ) Ronald H. Heck 1 University of Hawai i at Mānoa Handout #20 Specifying Latent Curve and Other Growth Models Using Mplus (Revised 12-1-2014) The SEM approach offers a contrasting framework for use in analyzing

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

Logistic Regression: Regression with a Binary Dependent Variable

Logistic Regression: Regression with a Binary Dependent Variable Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs) 36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)

More information

Final Exam. Name: Solution:

Final Exam. Name: Solution: Final Exam. Name: Instructions. Answer all questions on the exam. Open books, open notes, but no electronic devices. The first 13 problems are worth 5 points each. The rest are worth 1 point each. HW1.

More information

Generalized linear models

Generalized linear models Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models

More information

Regression in R. Seth Margolis GradQuant May 31,

Regression in R. Seth Margolis GradQuant May 31, Regression in R Seth Margolis GradQuant May 31, 2018 1 GPA What is Regression Good For? Assessing relationships between variables This probably covers most of what you do 4 3.8 3.6 3.4 Person Intelligence

More information

Estimating a Piecewise Growth Model with Longitudinal Data that Contains Individual Mobility across Clusters

Estimating a Piecewise Growth Model with Longitudinal Data that Contains Individual Mobility across Clusters Estimating a Piecewise Growth Model with Longitudinal Data that Contains Individual Mobility across Clusters Audrey J. Leroux Georgia State University Piecewise Growth Model (PGM) PGMs are beneficial for

More information

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a Chapter 9 Regression with a Binary Dependent Variable Multiple Choice ) The binary dependent variable model is an example of a a. regression model, which has as a regressor, among others, a binary variable.

More information

Chapter 19: Logistic regression

Chapter 19: Logistic regression Chapter 19: Logistic regression Self-test answers SELF-TEST Rerun this analysis using a stepwise method (Forward: LR) entry method of analysis. The main analysis To open the main Logistic Regression dialog

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations

Spring RMC Professional Development Series January 14, Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations Spring RMC Professional Development Series January 14, 2016 Generalized Linear Mixed Models (GLMMs): Concepts and some Demonstrations Ann A. O Connell, Ed.D. Professor, Educational Studies (QREM) Director,

More information

Advanced Quantitative Data Analysis

Advanced Quantitative Data Analysis Chapter 24 Advanced Quantitative Data Analysis Daniel Muijs Doing Regression Analysis in SPSS When we want to do regression analysis in SPSS, we have to go through the following steps: 1 As usual, we choose

More information

B. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i

B. Weaver (24-Mar-2005) Multiple Regression Chapter 5: Multiple Regression Y ) (5.1) Deviation score = (Y i B. Weaver (24-Mar-2005) Multiple Regression... 1 Chapter 5: Multiple Regression 5.1 Partial and semi-partial correlation Before starting on multiple regression per se, we need to consider the concepts

More information

Lecture 12: Effect modification, and confounding in logistic regression

Lecture 12: Effect modification, and confounding in logistic regression Lecture 12: Effect modification, and confounding in logistic regression Ani Manichaikul amanicha@jhsph.edu 4 May 2007 Today Categorical predictor create dummy variables just like for linear regression

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable

More information

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3 STA 303 H1S / 1002 HS Winter 2011 Test March 7, 2011 LAST NAME: FIRST NAME: STUDENT NUMBER: ENROLLED IN: (circle one) STA 303 STA 1002 INSTRUCTIONS: Time: 90 minutes Aids allowed: calculator. Some formulae

More information

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Logistic Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Logistic Regression 1 / 38 Logistic Regression 1 Introduction

More information

In Class Review Exercises Vartanian: SW 540

In Class Review Exercises Vartanian: SW 540 In Class Review Exercises Vartanian: SW 540 1. Given the following output from an OLS model looking at income, what is the slope and intercept for those who are black and those who are not black? b SE

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Time-Invariant Predictors in Longitudinal Models

Time-Invariant Predictors in Longitudinal Models Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors

More information

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person

More information

Truck prices - linear model? Truck prices - log transform of the response variable. Interpreting models with log transformation

Truck prices - linear model? Truck prices - log transform of the response variable. Interpreting models with log transformation Background Regression so far... Lecture 23 - Sta 111 Colin Rundel June 17, 2014 At this point we have covered: Simple linear regression Relationship between numerical response and a numerical or categorical

More information

Multilevel/Mixed Models and Longitudinal Analysis Using Stata

Multilevel/Mixed Models and Longitudinal Analysis Using Stata Multilevel/Mixed Models and Longitudinal Analysis Using Stata Isaac J. Washburn PhD Research Associate Oregon Social Learning Center Summer Workshop Series July 2010 Longitudinal Analysis 1 Longitudinal

More information

Generalized Linear Models 1

Generalized Linear Models 1 Generalized Linear Models 1 STA 2101/442: Fall 2012 1 See last slide for copyright information. 1 / 24 Suggested Reading: Davison s Statistical models Exponential families of distributions Sec. 5.2 Chapter

More information

Review of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models

Review of Multinomial Distribution If n trials are performed: in each trial there are J > 2 possible outcomes (categories) Multicategory Logit Models Chapter 6 Multicategory Logit Models Response Y has J > 2 categories. Extensions of logistic regression for nominal and ordinal Y assume a multinomial distribution for Y. 6.1 Logit Models for Nominal Responses

More information

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses ST3241 Categorical Data Analysis I Multicategory Logit Models Logit Models For Nominal Responses 1 Models For Nominal Responses Y is nominal with J categories. Let {π 1,, π J } denote the response probabilities

More information

Two-Way ANOVA. Chapter 15

Two-Way ANOVA. Chapter 15 Two-Way ANOVA Chapter 15 Interaction Defined An interaction is present when the effects of one IV depend upon a second IV Interaction effect : The effect of each IV across the levels of the other IV When

More information

Regression so far... Lecture 21 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102

Regression so far... Lecture 21 - Logistic Regression. Odds. Recap of what you should know how to do... At this point we have covered: Sta102 / BME102 Background Regression so far... Lecture 21 - Sta102 / BME102 Colin Rundel November 18, 2014 At this point we have covered: Simple linear regression Relationship between numerical response and a numerical

More information

Random Intercept Models

Random Intercept Models Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept

More information

Logistic Regression. Continued Psy 524 Ainsworth

Logistic Regression. Continued Psy 524 Ainsworth Logistic Regression Continued Psy 524 Ainsworth Equations Regression Equation Y e = 1 + A+ B X + B X + B X 1 1 2 2 3 3 i A+ B X + B X + B X e 1 1 2 2 3 3 Equations The linear part of the logistic regression

More information

Longitudinal Data Analysis of Health Outcomes

Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Review of the General Linear Model

Review of the General Linear Model Review of the General Linear Model EPSY 905: Multivariate Analysis Online Lecture #2 Learning Objectives Types of distributions: Ø Conditional distributions The General Linear Model Ø Regression Ø Analysis

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Longitudinal Modeling with Logistic Regression

Longitudinal Modeling with Logistic Regression Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to

More information

Generalized Models: Part 1

Generalized Models: Part 1 Generalized Models: Part 1 Topics: Introduction to generalized models Introduction to maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical outcomes

More information

Assessing the relation between language comprehension and performance in general chemistry. Appendices

Assessing the relation between language comprehension and performance in general chemistry. Appendices Assessing the relation between language comprehension and performance in general chemistry Daniel T. Pyburn a, Samuel Pazicni* a, Victor A. Benassi b, and Elizabeth E. Tappin c a Department of Chemistry,

More information

22s:152 Applied Linear Regression. Example: Study on lead levels in children. Ch. 14 (sec. 1) and Ch. 15 (sec. 1 & 4): Logistic Regression

22s:152 Applied Linear Regression. Example: Study on lead levels in children. Ch. 14 (sec. 1) and Ch. 15 (sec. 1 & 4): Logistic Regression 22s:52 Applied Linear Regression Ch. 4 (sec. and Ch. 5 (sec. & 4: Logistic Regression Logistic Regression When the response variable is a binary variable, such as 0 or live or die fail or succeed then

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

Ch 6: Multicategory Logit Models

Ch 6: Multicategory Logit Models 293 Ch 6: Multicategory Logit Models Y has J categories, J>2. Extensions of logistic regression for nominal and ordinal Y assume a multinomial distribution for Y. In R, we will fit these models using the

More information

Chapter 20: Logistic regression for binary response variables

Chapter 20: Logistic regression for binary response variables Chapter 20: Logistic regression for binary response variables In 1846, the Donner and Reed families left Illinois for California by covered wagon (87 people, 20 wagons). They attempted a new and untried

More information

SOS3003 Applied data analysis for social science Lecture note Erling Berge Department of sociology and political science NTNU.

SOS3003 Applied data analysis for social science Lecture note Erling Berge Department of sociology and political science NTNU. SOS3003 Applied data analysis for social science Lecture note 08-00 Erling Berge Department of sociology and political science NTNU Erling Berge 00 Literature Logistic regression II Hamilton Ch 7 p7-4

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

Comparing IRT with Other Models

Comparing IRT with Other Models Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

12 Generalized linear models

12 Generalized linear models 12 Generalized linear models In this chapter, we combine regression models with other parametric probability models like the binomial and Poisson distributions. Binary responses In many situations, we

More information

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION. ST3241 Categorical Data Analysis. (Semester II: ) April/May, 2011 Time Allowed : 2 Hours

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION. ST3241 Categorical Data Analysis. (Semester II: ) April/May, 2011 Time Allowed : 2 Hours NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3 4 5 6 Full marks

More information

Review of CLDP 944: Multilevel Models for Longitudinal Data

Review of CLDP 944: Multilevel Models for Longitudinal Data Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance

More information

Lecture (chapter 13): Association between variables measured at the interval-ratio level

Lecture (chapter 13): Association between variables measured at the interval-ratio level Lecture (chapter 13): Association between variables measured at the interval-ratio level Ernesto F. L. Amaral April 9 11, 2018 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015.

More information

Linear Regression Models P8111

Linear Regression Models P8111 Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3

TABLE OF CONTENTS INTRODUCTION TO MIXED-EFFECTS MODELS...3 Table of contents TABLE OF CONTENTS...1 1 INTRODUCTION TO MIXED-EFFECTS MODELS...3 Fixed-effects regression ignoring data clustering...5 Fixed-effects regression including data clustering...1 Fixed-effects

More information

Generalized Linear Models

Generalized Linear Models Generalized Linear Models Methods@Manchester Summer School Manchester University July 2 6, 2018 Generalized Linear Models: a generic approach to statistical modelling www.research-training.net/manchester2018

More information

Correlation and regression

Correlation and regression 1 Correlation and regression Yongjua Laosiritaworn Introductory on Field Epidemiology 6 July 2015, Thailand Data 2 Illustrative data (Doll, 1955) 3 Scatter plot 4 Doll, 1955 5 6 Correlation coefficient,

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Lecture 24: Partial correlation, multiple regression, and correlation

Lecture 24: Partial correlation, multiple regression, and correlation Lecture 24: Partial correlation, multiple regression, and correlation Ernesto F. L. Amaral November 21, 2017 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015. Statistics: A

More information

WU Weiterbildung. Linear Mixed Models

WU Weiterbildung. Linear Mixed Models Linear Mixed Effects Models WU Weiterbildung SLIDE 1 Outline 1 Estimation: ML vs. REML 2 Special Models On Two Levels Mixed ANOVA Or Random ANOVA Random Intercept Model Random Coefficients Model Intercept-and-Slopes-as-Outcomes

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

Statistics 572 Semester Review

Statistics 572 Semester Review Statistics 572 Semester Review Final Exam Information: The final exam is Friday, May 16, 10:05-12:05, in Social Science 6104. The format will be 8 True/False and explains questions (3 pts. each/ 24 pts.

More information

Introduction to Generalized Models

Introduction to Generalized Models Introduction to Generalized Models Today s topics: The big picture of generalized models Review of maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical

More information

Turning a research question into a statistical question.

Turning a research question into a statistical question. Turning a research question into a statistical question. IGINAL QUESTION: Concept Concept Concept ABOUT ONE CONCEPT ABOUT RELATIONSHIPS BETWEEN CONCEPTS TYPE OF QUESTION: DESCRIBE what s going on? DECIDE

More information

Immigration attitudes (opposes immigration or supports it) it may seriously misestimate the magnitude of the effects of IVs

Immigration attitudes (opposes immigration or supports it) it may seriously misestimate the magnitude of the effects of IVs Logistic Regression, Part I: Problems with the Linear Probability Model (LPM) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 22, 2015 This handout steals

More information

STAT 7030: Categorical Data Analysis

STAT 7030: Categorical Data Analysis STAT 7030: Categorical Data Analysis 5. Logistic Regression Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2012 Peng Zeng (Auburn University) STAT 7030 Lecture Notes Fall 2012

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Unit 6 - Introduction to linear regression

Unit 6 - Introduction to linear regression Unit 6 - Introduction to linear regression Suggested reading: OpenIntro Statistics, Chapter 7 Suggested exercises: Part 1 - Relationship between two numerical variables: 7.7, 7.9, 7.11, 7.13, 7.15, 7.25,

More information

Time Invariant Predictors in Longitudinal Models

Time Invariant Predictors in Longitudinal Models Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section

More information

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio. Answers to Items from Problem Set 1 Item 1 Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.) a. response latency

More information

Linear Regression With Special Variables

Linear Regression With Special Variables Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

16.400/453J Human Factors Engineering. Design of Experiments II

16.400/453J Human Factors Engineering. Design of Experiments II J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential

More information

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: )

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: ) NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3

More information

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers Nominal Data Greg C Elvers 1 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain

More information

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F). STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis 1. Indicate whether each of the following is true (T) or false (F). (a) T In 2 2 tables, statistical independence is equivalent to a population

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

Generalized linear models

Generalized linear models Generalized linear models Outline for today What is a generalized linear model Linear predictors and link functions Example: estimate a proportion Analysis of deviance Example: fit dose- response data

More information

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,

More information

Regression Methods for Survey Data

Regression Methods for Survey Data Regression Methods for Survey Data Professor Ron Fricker! Naval Postgraduate School! Monterey, California! 3/26/13 Reading:! Lohr chapter 11! 1 Goals for this Lecture! Linear regression! Review of linear

More information

Neural networks (not in book)

Neural networks (not in book) (not in book) Another approach to classification is neural networks. were developed in the 1980s as a way to model how learning occurs in the brain. There was therefore wide interest in neural networks

More information

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form:

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form: Outline for today What is a generalized linear model Linear predictors and link functions Example: fit a constant (the proportion) Analysis of deviance table Example: fit dose-response data using logistic

More information