Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne

Size: px
Start display at page:

Download "Applied Statistics. J. Blanchet and J. Wadsworth. Institute of Mathematics, Analysis, and Applications EPF Lausanne"

Transcription

1 Applied Statistics J. Blanchet and J. Wadsworth Institute of Mathematics, Analysis, and Applications EPF Lausanne An MSc Course for Applied Mathematicians, Fall 2012

2 Outline 1 Motivation: Why Applied Statistics? 2 Descriptive Statistical Measures and Distributions 3 Mean Regression 4 Logistic Regression as an example of Generalized Linear Model

3 Part I Motivation: Why Applied Statistics?

4 Data are Everywhere! Science magazine in Feb. 2011: Large amount of data, Uncertainty/variability of these data, We want to learn something about these data, but what?

5 Why Should I Learn Applied Statistics? To develop statistical methods in medicine Example: Construction of Probabilistic Brain Atlas to detect specific patterns of anatomic alterations due to some disease. PBA of Alzeimher s disease: Source:

6 Why Should I Learn Applied Statistics? To nowcast and forecast the economy Nowcasting: to assess the state of an economy. The Economist in April 2011 Cycle indicator for the US P T P T P T P T P T P T P T P T P T P T Source: de Carvalho et al, 2011 Time

7 Part II Descriptive Statistical Measures and Distributions

8 Basic notions We have some data x 1,..., x n. We want to learn about the corresponding random variable X. We want to learn about the cumulative distribution function (CDF): F X (x) = Pr(X x), which gives Pr(a < X b) = F X (b) F X (a). If X is a continuous random variable, the CDF can be defined in terms of its probability density function (PDF) f X as follows: F X (x) = x f X (t) dt.

9 Example: the Normal (or Gaussian) distribution R: dnorm{base}+pnrom{base} The PDF of X N(µ, σ 2 ) is given by: f X (x) = 1 σ (x µ) 2 2π e 2σ 2, x R. There is no closed form for the CDF. PDF CDF µ: mean, σ 2 : variance, σ: standard deviation. N(0, 1): standard Normal distribution.

10 Moments R: moment + raw2central{moments} In practice, f x is unknown. Some statistics can be useful to describe the data. The l-th moment of X m l = E(X l ) = x l f X (x) dx; m 1 : mean. The l-th central moment of X m l = E(X m 1 ) l = (x m 1 ) l f X (x) dx. In practice approximations are needed; e.g. we use m 1 x = 1 n n i=1 x i to estimate m 1. R R

11 Features of a Distribution Described using Functions of Moments R: mean{base} + sd{stats} + (skewness + kurtosis){moments} Measure Feature Function of moment Mean Location µ = m 1 Standard deviation Scale σ = m 2 Skewness Asymmetry γ = m 3 /σ3 (Excess) Kurtosis Tailweight κ = m 4 /σ4 3

12 Skewness and kurtosis R: skewness{moments} + kurtosis{moments} Skewness: γ < 0 vs. γ > 0 Kurtosis κ > 0 vs. κ = 0 vs. κ < 0:

13 The previous statistics are useful as a first description of the data but they do not fully describe the data. For this, we would need to estimate the PDF f X. If we don t have any knowledge: nonparametric estimation, see lecture in two weeks. In the next slides, we are concerned with problems where we want to learn about of a certain output variable y i by using data from another (input) variable x i. For example, it may be of interest to understand in a class of n students how does the output variable study performance, evolves on average as a function of input variable, time spent on studying. This involves extensions of the concepts of mean, etc., where instead of being interested in learning simply about Y, we are interested in Y X = x.

14 Part III Mean Regression

15 Mean Regression: Introducing the linear model Suppose that we collect a random sample {(x i, y i ) : i = 1,..., n}. Assume that a linear specification gives a good approximation for describing the link between covariates x T i = (x 1,..., x k ) and the response variable y i, i.e. y i = x T i β + ε i, where ε i N(0, σ 2 ε ), for i = 1,..., n. Suppose for instance that we want to use the stature of parents (x i ) to predict the stature of their offspring (y i ). A simple linear regression model for modeling this data assumes y i = β 1 + β 2 x i + ε i, i = 1,..., n.

16 Mean Regression Regression towards mediocrity in hereditary stature (Galton, 1886) R: data(galton){usingr} Figure: Sir Francis Galton (left); scatterplot and regression line for an hereditary stature study (right). The conception of the model is however older having its roots in Theoria Motus Corporum Coelestium, a work due to the prominent mathematician Carl Gauss.

17 Mean Regression: What s Fixed? What s Random? In the next slides we follow a frequentist approach, and hence the parameter β is assumed to be an unknown but fixed quantity; later we will introduce the Bayesian approach, and there we replace this by the assumption that β is a random quantity. We prefer to assume that covariates x i are random, but there are other approaches where these are assumed to be fixed. We think that this assumption is preferable in many applied fields, and in particular it may be used to justify the idea that there may be some measurement errors in the covariates.

18 OLS (Ordinary Least Squares) Estimator The OLS estimator ( β 1, β 2 ) T defines the line such that the sum of the Euclidean distances between responses (y i ) and the ordinates of that line (β 1 + β 2 x i ), is minimal across all possible lines, i.e. β ( β1 β 2 ) = arg min (β 1,β 2 ) R 2 ( n (y i β 1 β 2 x i ) 2 = i=1 y β 2 x ni=1 (x i x)(y i y) ni=1 (x i x) 2 ). Here x and y denote the sample means of x i and y i. This is the same as the maximum likelihood estimator (more next week) since we assume that y i x i N(β 1 + β 2 x i, σ 2 ɛ )

19 OLS (Ordinary Least Squares) Estimator The predicted response is then given by ŷ i = β 1 + β 2 x i. This is the expected value of y i x i, since E(ɛ i ) = 0 The particular case where β 2 = 0 is instructive as the predicted response is simply the sample mean, i.e., ŷ i = y.

20 OLS (Ordinary Least Squares) Estimator Example In the hereditary stature data we have ( β = y β ) ( 1 x ni=1 (x i x)(y i y) ni=1 (x i x) / ) ( = ). The residuals of the fit, are the empirical counterparts of the error of the model, and are defined as e i = y i ŷ i. The assumption of normality of the errors can be tested using for instance the Jarque Bera statistic (R: jarque.bera.test{tseries}).

21 Fitting a Linear Model for Hereditary Stature Data R: lm{stats}= # Data are from the UsingR package # Uncomment the line below to install it # install.packages("usingr") # Load package, data, and attach data library(usingr) data(father.son) attach(father.son) # Mean regression fit <- lm(sheight~fheight) > fit Call: lm(formula = sheight ~ fheight) Coefficients: (Intercept) fheight The fitted model is thus son height i = father height i, i = 1,..., 1078.

22 Plotting the Hereditary Stature Data Against the Regression Line plot(sheight~fheight,bty = "l",pch = 20,xlab = "Father s Height", ylab="son s Height",cex.lab=1.4) abline(lm(sheight~fheight),lty=1,lwd=3,col="red") Son's Height Father's Height

23 Estimation Uncertainty Although we assume the parameters β = (β 1, β 2 ) are fixed, their estimators, β 1, β 2, are functions of the sample y x, which is random If we observed another sample of data from y x, then the parameter estimates would be different To quantify how different they might be, we look at the distribution of the estimators n 1/2 ( β β) N 2 (0, Σ). where Σ is a variance-covariance matrix depending on unknown quantities In practice therefore we estimate Σ

24 Estimation Uncertainty The variance covariance matrix of the OLS estimator is Σ β varcov( β) = σε 2 n i=1 x i 2 x i=1 (x i x) 2 n n n i=1 (x i x) 2 x 1 n i=1 (x i x) 2 n i=1 (x i x) 2 In practice to consistently estimate Σ β we plug in a consistent estimator of σ 2 ε in the expression above..

25 Estimation Uncertainty To make inference on the uncertainty in parameter estimates we use their asymptotic distribution t (κ) = β κ β κ N(0, 1), κ = 1, 2. var( β κ) This relies on var( β κ ) which is often unknown. The appropriate distribution is then t(κ) = β κ β κ t n 2, κ = 1, 2. var( β κ) (For large n, this is approximately Normal.)

26 Hypothesis testing The distributions of the estimators of β help us understand which range of values would be plausible if we were to observe another sample We can use the distributions to test hypotheses about the model Often of interest is the null hypothesis: H 0 : β 2 = 0 since this is equivalently the hypothesis that the covariate x does not influence the response y

27 Coefficient of Determination A central question is: does the model explain the variability in the response Y well? The coefficient of determination provides a simple diagnostic to the model fit r 2 = n i=1 (ŷ i y) 2 n i=1 (y i y) 2 = 1 n i=1 e2 i n i=1 (y i y) 2 = 1 error variation total variation in y. The smaller the error variation, relative to the total variation in y, the better the fit.

28 Back to the Hereditary Stature Data Example In the hereditary stature data we get since σ ε = ( Σ β = (n 2) 1 n i=1 e2 i ). Hypothesis testing: If our null hypothesis H 0 is that β 2 = 0, then t(2) = > Q (t 1076 ) = 1.96, and hence we reject H 0. The decision of rejecting the null hypothesis could also have been done by noticing that the p-value is smaller than 0.05, i.e. p-value = 2(1 F t1076 ( ) < 0.05 Model checking: r 2 = ni=1 (ŷ i y) 2 ni=1 (y i y) 2 = =

29 Assessing the Fitted Model in R > fit <- lm(sheight~fheight) > summary(fit) Call: lm(formula = sheight ~ fheight) Residuals: Min 1Q Median 3Q Max Coefficients: Estimate Std. Error t value Pr(> t ) (Intercept) <2e-16 *** fheight <2e-16 *** --- Signif. codes: 0 *** ** 0.01 * Residual standard error: on 1076 degrees of freedom Multiple R-squared: ,Adjusted R-squared: F-statistic: on 1 and 1076 DF, p-value: < 2.2e-16

30 Generalizing the Model: More Covariates Often we believe that the more than one covariate may be important, but the model can be easily generalized as y i = β 1 x i1 + + β k x ik + ε i, ε i N(0, σ 2 ). Inference can still be made in a similar way, but the number of degrees of freedom in the asymptotic distribution t(κ) = β κ β κ t n k, κ = 1,..., k. var( β κ ) In this more general setting we can take advantage of a matrix representation of the model.

31 Matrix Formulation Consider the following matrices where we store the data and the errors of the model y = y 1, X = x 11. x 1k., ε = ε 1.. y n The model can then be rewritten as x n1 x nk y = Xβ + ε, and to estimate the regression parameter β = (β 1,..., β k ) T we use the OLS estimator now defined as β = arg min β y Xβ 2 = (X T X) 1 X T y, where denotes the L 2 norm. Here the variance covariance matrix is Σ β varcov( β) = σ 2 ε(x T X) 1.. ε n

32 Mean Regression Theoretical reasons supporting the use of mean regression-based models The model is conceptually simple and computationally easy to implement. The case of heterocedasdicity (i.e. if σ 2 cannot be assumed to be constant over individuals) can lead to some complex problems, but overall the OLS estimator is unbiased, i.e., E( β X) = β. Under mild conditions it can be shown to be (strongly) consistent, i.e. β a.s. β, n. The Gauss Markov theorem provides strong theoretical support for using the OLS estimator in a linear regression model (Knight, 2000, p. 417). But...

33 Mean Regression Average effects Something to think about: Our linear specification and normality assumption imply that E(y i x i ) = x T i β. This assumes that y i can take any real value, and the errors conditionally upon knowledge of x T i β are continuous and normally distributed But: Not all data can take any value Not all data are continuous We need to generalize the framework for modelling different types of response data

34 Part IV Logistic Regression as an example of Generalized Linear Model

35 Introduction In many contexts the response variable of interest is categorical (e.g. Yes, No). Example: we ask n students how many hours they studied and whether they passed the exam. We want to learn the relationships between the output: passing the exam (y i s) and the input: hours of study (x i s). The application of the discussed linear regression models becomes inadequate. An alternative is provided by logistic regression which offers the possibility to model the probabilities of T classes, as functions of covariates, while obeying the constraint that these probabilities need to be in [0, 1], and sum to one. For simplicity, we focus here on the case of a single covariate and T = 2, but this can be adapted to a more general setting (Christensen, 1990, 4; Hastie et al, 4).

36 Logistic Regression Let p i denote a probability of failure in the ith trial. We wish to predict p i given the information available on a certain covariate x i. The model is based on the following log-linear specification ( ) pi logit(p i ) log = β 1 + β 2 x i, 1 p i where β 1 and β 2 respectively denote an intercept and a slope parameter. The quantity p i /(1 p i ) is the so-called odds ratio. logit( ) is the link function.

37 Maximum likelihood estimation Inference is typically done by maximum likelihood. Using the fact that the loglikelihood can be written as p i = exp(β 1 + β 2 x i ) 1 + exp(β 1 + β 2 x i ), l = n n log{p i I(failure i )} + log{(1 p i ) I(success i )}. i=1 i=1 In practice this likelihood can be optimized using a Newton Raphson algorithm (Hastie et al, 2001, p. 99), or a Monte Carlo stochastic optimization method (de Carvalho, 2011). A simple possibility is to code the function l in R, and for the optimization use the instruction optim{stats}.

38 Example: South African Heart Disease Data R: glm{stats} #install.packages(elemstatlearn) library(elemstatlearn)?saheart (...) Format: (...) A data frame with 462 observations on the following 10 variables. sbp systolic blood pressure tobacco cumulative tobacco (kg) ldl low density lipoprotein cholesterol famhist family history of heart disease, a factor with levels Absent Present obesity a numeric vector alcohol current alcohol consumption age age at onset chd response, coronary heart disease

39 Fitting a Logistic Model on South African Heart Disease Data R: glm{stats} #load and attach the data > data(saheart) > attach(saheart) # Logistic regression fit <- glm(chd~sbp+tobacco+ldl+famhist+obesity +alcohol+age, family=binomial) > summary(fit) Call: glm(formula = chd ~ sbp + tobacco + ldl + famhist + obesity + alcohol + age, family = binomial) Deviance Residuals: Min 1Q Median 3Q Max Coefficients: Estimate Std. Error z value Pr(> z ) (Intercept) e-05 *** sbp tobacco ** ldl ** famhistpresent e-05 *** obesity alcohol age e-05 *** --- Signif. codes: 0 *** ** 0.01 * (Dispersion parameter for binomial family taken to be 1)

40 Linear regression vs. logistic regression R: glm{stats} Linear regression: y i = x T i β + ε i, ε iid i N(0, σ 2 ). This is equivalent to Y i X = x. N(µ i (x), σ 2 ), µ i (x) = x T i β. Logistic regression: Y i X = x Bin(n i, p i (x)), logit(p i (x)) = x T i β. All are special case of so-called generalized linear model: Y i X = x F, g(e F (Y X = x)) = x T i β, for a given distribution F (in the exponential family) and a given bijective function g (the link function).

41 Generalized Linear Model R: glm{stats} Example (Some examples of generalized linear models) Y i X = x F, g(e F (Y X = x)) = x T i β. Name Distribution F Link function g Domain of Y Linear regression Gaussian identity continuous Logistic regression binomial logit discrete 0, 1,..., T Probit model binomial probit discrete 0, 1,..., T Poisson regression Poisson logarithm discrete 0, 1, 2,... Neg. binomial regression Neg. binomial logarithm discrete 0, 1, 2,... Exponential regression Exponential inverse continuous 0 Gamma regression Gamma inverse continuous 0 Inv. Gaussian regression Inv. Gaussian inverse squared continuous >

42 Take-Home Message Different regression models for different problems. What could be a suitable model for my problem? What are interesting explanatory variables? (more about this in the next week...) How much confidence can I put in my analysis? Robustness, power,... Are the underlying hypothesis of my model fulfilled? Maximum likelihood is a powerful principle of estimation and inference (more about this in the next week...).

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Logistic Regression and Generalized Linear Models

Logistic Regression and Generalized Linear Models Logistic Regression and Generalized Linear Models Sridhar Mahadevan mahadeva@cs.umass.edu University of Massachusetts Sridhar Mahadevan: CMPSCI 689 p. 1/2 Topics Generative vs. Discriminative models In

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 9: Logistic regression (v2) Ramesh Johari ramesh.johari@stanford.edu 1 / 28 Regression methods for binary outcomes 2 / 28 Binary outcomes For the duration of this lecture suppose

More information

Administration. Homework 1 on web page, due Feb 11 NSERC summer undergraduate award applications due Feb 5 Some helpful books

Administration. Homework 1 on web page, due Feb 11 NSERC summer undergraduate award applications due Feb 5 Some helpful books STA 44/04 Jan 6, 00 / 5 Administration Homework on web page, due Feb NSERC summer undergraduate award applications due Feb 5 Some helpful books STA 44/04 Jan 6, 00... administration / 5 STA 44/04 Jan 6,

More information

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University

Logistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University Logistic Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Logistic Regression 1 / 38 Logistic Regression 1 Introduction

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 12: Logistic regression (v1) Ramesh Johari ramesh.johari@stanford.edu Fall 2015 1 / 30 Regression methods for binary outcomes 2 / 30 Binary outcomes For the duration of this

More information

Statistics 203 Introduction to Regression Models and ANOVA Practice Exam

Statistics 203 Introduction to Regression Models and ANOVA Practice Exam Statistics 203 Introduction to Regression Models and ANOVA Practice Exam Prof. J. Taylor You may use your 4 single-sided pages of notes This exam is 7 pages long. There are 4 questions, first 3 worth 10

More information

Classification. Chapter Introduction. 6.2 The Bayes classifier

Classification. Chapter Introduction. 6.2 The Bayes classifier Chapter 6 Classification 6.1 Introduction Often encountered in applications is the situation where the response variable Y takes values in a finite set of labels. For example, the response Y could encode

More information

Review of Statistics

Review of Statistics Review of Statistics Topics Descriptive Statistics Mean, Variance Probability Union event, joint event Random Variables Discrete and Continuous Distributions, Moments Two Random Variables Covariance and

More information

STAT5044: Regression and Anova. Inyoung Kim

STAT5044: Regression and Anova. Inyoung Kim STAT5044: Regression and Anova Inyoung Kim 2 / 47 Outline 1 Regression 2 Simple Linear regression 3 Basic concepts in regression 4 How to estimate unknown parameters 5 Properties of Least Squares Estimators:

More information

Linear Regression Models P8111

Linear Regression Models P8111 Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started

More information

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models

SCHOOL OF MATHEMATICS AND STATISTICS. Linear and Generalised Linear Models SCHOOL OF MATHEMATICS AND STATISTICS Linear and Generalised Linear Models Autumn Semester 2017 18 2 hours Attempt all the questions. The allocation of marks is shown in brackets. RESTRICTED OPEN BOOK EXAMINATION

More information

Exam Applied Statistical Regression. Good Luck!

Exam Applied Statistical Regression. Good Luck! Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.

More information

STA 450/4000 S: January

STA 450/4000 S: January STA 450/4000 S: January 6 005 Notes Friday tutorial on R programming reminder office hours on - F; -4 R The book Modern Applied Statistics with S by Venables and Ripley is very useful. Make sure you have

More information

Generalized linear models for binary data. A better graphical exploratory data analysis. The simple linear logistic regression model

Generalized linear models for binary data. A better graphical exploratory data analysis. The simple linear logistic regression model Stat 3302 (Spring 2017) Peter F. Craigmile Simple linear logistic regression (part 1) [Dobson and Barnett, 2008, Sections 7.1 7.3] Generalized linear models for binary data Beetles dose-response example

More information

Statistics for Engineers Lecture 9 Linear Regression

Statistics for Engineers Lecture 9 Linear Regression Statistics for Engineers Lecture 9 Linear Regression Chong Ma Department of Statistics University of South Carolina chongm@email.sc.edu April 17, 2017 Chong Ma (Statistics, USC) STAT 509 Spring 2017 April

More information

MATH 644: Regression Analysis Methods

MATH 644: Regression Analysis Methods MATH 644: Regression Analysis Methods FINAL EXAM Fall, 2012 INSTRUCTIONS TO STUDENTS: 1. This test contains SIX questions. It comprises ELEVEN printed pages. 2. Answer ALL questions for a total of 100

More information

STAC51: Categorical data Analysis

STAC51: Categorical data Analysis STAC51: Categorical data Analysis Mahinda Samarakoon April 6, 2016 Mahinda Samarakoon STAC51: Categorical data Analysis 1 / 25 Table of contents 1 Building and applying logistic regression models (Chap

More information

ST430 Exam 1 with Answers

ST430 Exam 1 with Answers ST430 Exam 1 with Answers Date: October 5, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textook are permitted but you may use a calculator.

More information

Lecture 4 Multiple linear regression

Lecture 4 Multiple linear regression Lecture 4 Multiple linear regression BIOST 515 January 15, 2004 Outline 1 Motivation for the multiple regression model Multiple regression in matrix notation Least squares estimation of model parameters

More information

Chapter 1. Linear Regression with One Predictor Variable

Chapter 1. Linear Regression with One Predictor Variable Chapter 1. Linear Regression with One Predictor Variable 1.1 Statistical Relation Between Two Variables To motivate statistical relationships, let us consider a mathematical relation between two mathematical

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida First Year Examination Department of Statistics, University of Florida August 20, 2009, 8:00 am - 2:00 noon Instructions:. You have four hours to answer questions in this examination. 2. You must show

More information

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam ECLT 5810 Linear Regression and Logistic Regression for Classification Prof. Wai Lam Linear Regression Models Least Squares Input vectors is an attribute / feature / predictor (independent variable) The

More information

R Hints for Chapter 10

R Hints for Chapter 10 R Hints for Chapter 10 The multiple logistic regression model assumes that the success probability p for a binomial random variable depends on independent variables or design variables x 1, x 2,, x k.

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Y i = η + ɛ i, i = 1,...,n.

Y i = η + ɛ i, i = 1,...,n. Nonparametric tests If data do not come from a normal population (and if the sample is not large), we cannot use a t-test. One useful approach to creating test statistics is through the use of rank statistics.

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Stat 5102 Final Exam May 14, 2015

Stat 5102 Final Exam May 14, 2015 Stat 5102 Final Exam May 14, 2015 Name Student ID The exam is closed book and closed notes. You may use three 8 1 11 2 sheets of paper with formulas, etc. You may also use the handouts on brand name distributions

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

Lecture 14: Introduction to Poisson Regression

Lecture 14: Introduction to Poisson Regression Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why

More information

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

Generalized Linear Models Introduction

Generalized Linear Models Introduction Generalized Linear Models Introduction Statistics 135 Autumn 2005 Copyright c 2005 by Mark E. Irwin Generalized Linear Models For many problems, standard linear regression approaches don t work. Sometimes,

More information

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model

Linear Regression. In this lecture we will study a particular type of regression model: the linear regression model 1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor

More information

Data Mining Stat 588

Data Mining Stat 588 Data Mining Stat 588 Lecture 9: Basis Expansions Department of Statistics & Biostatistics Rutgers University Nov 01, 2011 Regression and Classification Linear Regression. E(Y X) = f(x) We want to learn

More information

Note on Bivariate Regression: Connecting Practice and Theory. Konstantin Kashin

Note on Bivariate Regression: Connecting Practice and Theory. Konstantin Kashin Note on Bivariate Regression: Connecting Practice and Theory Konstantin Kashin Fall 2012 1 This note will explain - in less theoretical terms - the basics of a bivariate linear regression, including testing

More information

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification,

Normal distribution We have a random sample from N(m, υ). The sample mean is Ȳ and the corrected sum of squares is S yy. After some simplification, Likelihood Let P (D H) be the probability an experiment produces data D, given hypothesis H. Usually H is regarded as fixed and D variable. Before the experiment, the data D are unknown, and the probability

More information

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam ECLT 5810 Linear Regression and Logistic Regression for Classification Prof. Wai Lam Linear Regression Models Least Squares Input vectors is an attribute / feature / predictor (independent variable) The

More information

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7

EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review

More information

Linear Regression Model. Badr Missaoui

Linear Regression Model. Badr Missaoui Linear Regression Model Badr Missaoui Introduction What is this course about? It is a course on applied statistics. It comprises 2 hours lectures each week and 1 hour lab sessions/tutorials. We will focus

More information

Subject CS1 Actuarial Statistics 1 Core Principles

Subject CS1 Actuarial Statistics 1 Core Principles Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression ST 430/514 Recall: a regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates).

More information

SF2930: REGRESION ANALYSIS LECTURE 1 SIMPLE LINEAR REGRESSION.

SF2930: REGRESION ANALYSIS LECTURE 1 SIMPLE LINEAR REGRESSION. SF2930: REGRESION ANALYSIS LECTURE 1 SIMPLE LINEAR REGRESSION. Tatjana Pavlenko 17 January 2018 WHAT IS REGRESSION? INTRODUCTION Regression analysis is a statistical technique for investigating and modeling

More information

Lecture 4: Newton s method and gradient descent

Lecture 4: Newton s method and gradient descent Lecture 4: Newton s method and gradient descent Newton s method Functional iteration Fitting linear regression Fitting logistic regression Prof. Yao Xie, ISyE 6416, Computational Statistics, Georgia Tech

More information

Logistic Regressions. Stat 430

Logistic Regressions. Stat 430 Logistic Regressions Stat 430 Final Project Final Project is, again, team based You will decide on a project - only constraint is: you are supposed to use techniques for a solution that are related to

More information

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION. ST3241 Categorical Data Analysis. (Semester II: ) April/May, 2011 Time Allowed : 2 Hours

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION. ST3241 Categorical Data Analysis. (Semester II: ) April/May, 2011 Time Allowed : 2 Hours NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3 4 5 6 Full marks

More information

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,

More information

Oct Simple linear regression. Minimum mean square error prediction. Univariate. regression. Calculating intercept and slope

Oct Simple linear regression. Minimum mean square error prediction. Univariate. regression. Calculating intercept and slope Oct 2017 1 / 28 Minimum MSE Y is the response variable, X the predictor variable, E(X) = E(Y) = 0. BLUP of Y minimizes average discrepancy var (Y ux) = C YY 2u C XY + u 2 C XX This is minimized when u

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

Lectures on Simple Linear Regression Stat 431, Summer 2012

Lectures on Simple Linear Regression Stat 431, Summer 2012 Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population

More information

Statistics - Lecture Three. Linear Models. Charlotte Wickham 1.

Statistics - Lecture Three. Linear Models. Charlotte Wickham   1. Statistics - Lecture Three Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Linear Models 1. The Theory 2. Practical Use 3. How to do it in R 4. An example 5. Extensions

More information

Regression Analysis. y t = β 1 x t1 + β 2 x t2 + β k x tk + ϵ t, t = 1,..., T,

Regression Analysis. y t = β 1 x t1 + β 2 x t2 + β k x tk + ϵ t, t = 1,..., T, Regression Analysis The multiple linear regression model with k explanatory variables assumes that the tth observation of the dependent or endogenous variable y t is described by the linear relationship

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008 MIT OpenCourseWare http://ocw.mit.edu 2.830J / 6.780J / ESD.63J Control of Processes (SMA 6303) Spring 2008 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.

More information

Business Statistics. Tommaso Proietti. Linear Regression. DEF - Università di Roma 'Tor Vergata'

Business Statistics. Tommaso Proietti. Linear Regression. DEF - Università di Roma 'Tor Vergata' Business Statistics Tommaso Proietti DEF - Università di Roma 'Tor Vergata' Linear Regression Specication Let Y be a univariate quantitative response variable. We model Y as follows: Y = f(x) + ε where

More information

Ch 3: Multiple Linear Regression

Ch 3: Multiple Linear Regression Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery

More information

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables

UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables UQ, Semester 1, 2017, Companion to STAT2201/CIVL2530 Exam Formulae and Tables To be provided to students with STAT2201 or CIVIL-2530 (Probability and Statistics) Exam Main exam date: Tuesday, 20 June 1

More information

MATH 829: Introduction to Data Mining and Analysis Logistic regression

MATH 829: Introduction to Data Mining and Analysis Logistic regression 1/11 MATH 829: Introduction to Data Mining and Analysis Logistic regression Dominique Guillot Departments of Mathematical Sciences University of Delaware March 7, 2016 Logistic regression 2/11 Suppose

More information

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model

Topic 17 - Single Factor Analysis of Variance. Outline. One-way ANOVA. The Data / Notation. One way ANOVA Cell means model Factor effects model Topic 17 - Single Factor Analysis of Variance - Fall 2013 One way ANOVA Cell means model Factor effects model Outline Topic 17 2 One-way ANOVA Response variable Y is continuous Explanatory variable is

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

Two Hours. Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER. 26 May :00 16:00

Two Hours. Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER. 26 May :00 16:00 Two Hours MATH38052 Mathematical formula books and statistical tables are to be provided THE UNIVERSITY OF MANCHESTER GENERALISED LINEAR MODELS 26 May 2016 14:00 16:00 Answer ALL TWO questions in Section

More information

8 Nominal and Ordinal Logistic Regression

8 Nominal and Ordinal Logistic Regression 8 Nominal and Ordinal Logistic Regression 8.1 Introduction If the response variable is categorical, with more then two categories, then there are two options for generalized linear models. One relies on

More information

Recap. HW due Thursday by 5 pm Next HW coming on Thursday Logistic regression: Pr(G = k X) linear on the logit scale Linear discriminant analysis:

Recap. HW due Thursday by 5 pm Next HW coming on Thursday Logistic regression: Pr(G = k X) linear on the logit scale Linear discriminant analysis: 1 / 23 Recap HW due Thursday by 5 pm Next HW coming on Thursday Logistic regression: Pr(G = k X) linear on the logit scale Linear discriminant analysis: Pr(G = k X) Pr(X G = k)pr(g = k) Theory: LDA more

More information

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form:

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form: Outline for today What is a generalized linear model Linear predictors and link functions Example: fit a constant (the proportion) Analysis of deviance table Example: fit dose-response data using logistic

More information

Chapter 12: Linear regression II

Chapter 12: Linear regression II Chapter 12: Linear regression II Timothy Hanson Department of Statistics, University of South Carolina Stat 205: Elementary Statistics for the Biological and Life Sciences 1 / 14 12.4 The regression model

More information

Monte Carlo Studies. The response in a Monte Carlo study is a random variable.

Monte Carlo Studies. The response in a Monte Carlo study is a random variable. Monte Carlo Studies The response in a Monte Carlo study is a random variable. The response in a Monte Carlo study has a variance that comes from the variance of the stochastic elements in the data-generating

More information

Generalized Linear Models. Last time: Background & motivation for moving beyond linear

Generalized Linear Models. Last time: Background & motivation for moving beyond linear Generalized Linear Models Last time: Background & motivation for moving beyond linear regression - non-normal/non-linear cases, binary, categorical data Today s class: 1. Examples of count and ordered

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.

More information

Probability and Probability Distributions. Dr. Mohammed Alahmed

Probability and Probability Distributions. Dr. Mohammed Alahmed Probability and Probability Distributions 1 Probability and Probability Distributions Usually we want to do more with data than just describing them! We might want to test certain specific inferences about

More information

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: )

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: ) NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) Categorical Data Analysis (Semester II: 2010 2011) April/May, 2011 Time Allowed : 2 Hours Matriculation No: Seat No: Grade Table Question 1 2 3

More information

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course. Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course

More information

Section 4.6 Simple Linear Regression

Section 4.6 Simple Linear Regression Section 4.6 Simple Linear Regression Objectives ˆ Basic philosophy of SLR and the regression assumptions ˆ Point & interval estimation of the model parameters, and how to make predictions ˆ Point and interval

More information

Scatter plot of data from the study. Linear Regression

Scatter plot of data from the study. Linear Regression 1 2 Linear Regression Scatter plot of data from the study. Consider a study to relate birthweight to the estriol level of pregnant women. The data is below. i Weight (g / 100) i Weight (g / 100) 1 7 25

More information

AMS-207: Bayesian Statistics

AMS-207: Bayesian Statistics Linear Regression How does a quantity y, vary as a function of another quantity, or vector of quantities x? We are interested in p(y θ, x) under a model in which n observations (x i, y i ) are exchangeable.

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

Clinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto.

Clinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto. Introduction to Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca September 18, 2014 38-1 : a review 38-2 Evidence Ideal: to advance the knowledge-base of clinical medicine,

More information

Chapter 8: Simple Linear Regression

Chapter 8: Simple Linear Regression Chapter 8: Simple Linear Regression Shiwen Shen University of South Carolina 2017 Summer 1 / 70 Introduction A problem that arises in engineering, economics, medicine, and other areas is that of investigating

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

Generalized Linear Models and Exponential Families

Generalized Linear Models and Exponential Families Generalized Linear Models and Exponential Families David M. Blei COS424 Princeton University April 12, 2012 Generalized Linear Models x n y n β Linear regression and logistic regression are both linear

More information

Financial Econometrics and Quantitative Risk Managenent Return Properties

Financial Econometrics and Quantitative Risk Managenent Return Properties Financial Econometrics and Quantitative Risk Managenent Return Properties Eric Zivot Updated: April 1, 2013 Lecture Outline Course introduction Return definitions Empirical properties of returns Reading

More information

Logistic Regression - problem 6.14

Logistic Regression - problem 6.14 Logistic Regression - problem 6.14 Let x 1, x 2,, x m be given values of an input variable x and let Y 1,, Y m be independent binomial random variables whose distributions depend on the corresponding values

More information

Introduction to Simple Linear Regression

Introduction to Simple Linear Regression Introduction to Simple Linear Regression Yang Feng http://www.stat.columbia.edu/~yangfeng Yang Feng (Columbia University) Introduction to Simple Linear Regression 1 / 68 About me Faculty in the Department

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

UNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017

UNIVERSITY OF MASSACHUSETTS. Department of Mathematics and Statistics. Basic Exam - Applied Statistics. Tuesday, January 17, 2017 UNIVERSITY OF MASSACHUSETTS Department of Mathematics and Statistics Basic Exam - Applied Statistics Tuesday, January 17, 2017 Work all problems 60 points are needed to pass at the Masters Level and 75

More information

Generalized linear models

Generalized linear models Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models

More information

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis

7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression

More information

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018

Statistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical

More information

Practical Statistics

Practical Statistics Practical Statistics Lecture 1 (Nov. 9): - Correlation - Hypothesis Testing Lecture 2 (Nov. 16): - Error Estimation - Bayesian Analysis - Rejecting Outliers Lecture 3 (Nov. 18) - Monte Carlo Modeling -

More information

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Review. DS GA 1002 Statistical and Mathematical Models.   Carlos Fernandez-Granda Review DS GA 1002 Statistical and Mathematical Models http://www.cims.nyu.edu/~cfgranda/pages/dsga1002_fall16 Carlos Fernandez-Granda Probability and statistics Probability: Framework for dealing with

More information

Introduction to General and Generalized Linear Models

Introduction to General and Generalized Linear Models Introduction to General and Generalized Linear Models Generalized Linear Models - part III Henrik Madsen Poul Thyregod Informatics and Mathematical Modelling Technical University of Denmark DK-2800 Kgs.

More information

Linear Regression. Data Model. β, σ 2. Process Model. ,V β. ,s 2. s 1. Parameter Model

Linear Regression. Data Model. β, σ 2. Process Model. ,V β. ,s 2. s 1. Parameter Model Regression: Part II Linear Regression y~n X, 2 X Y Data Model β, σ 2 Process Model Β 0,V β s 1,s 2 Parameter Model Assumptions of Linear Model Homoskedasticity No error in X variables Error in Y variables

More information

Association studies and regression

Association studies and regression Association studies and regression CM226: Machine Learning for Bioinformatics. Fall 2016 Sriram Sankararaman Acknowledgments: Fei Sha, Ameet Talwalkar Association studies and regression 1 / 104 Administration

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Advanced Statistics I : Gaussian Linear Model (and beyond)

Advanced Statistics I : Gaussian Linear Model (and beyond) Advanced Statistics I : Gaussian Linear Model (and beyond) Aurélien Garivier CNRS / Telecom ParisTech Centrale Outline One and Two-Sample Statistics Linear Gaussian Model Model Reduction and model Selection

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Density Temp vs Ratio. temp

Density Temp vs Ratio. temp Temp Ratio Density 0.00 0.02 0.04 0.06 0.08 0.10 0.12 Density 0.0 0.2 0.4 0.6 0.8 1.0 1. (a) 170 175 180 185 temp 1.0 1.5 2.0 2.5 3.0 ratio The histogram shows that the temperature measures have two peaks,

More information

22s:152 Applied Linear Regression. Example: Study on lead levels in children. Ch. 14 (sec. 1) and Ch. 15 (sec. 1 & 4): Logistic Regression

22s:152 Applied Linear Regression. Example: Study on lead levels in children. Ch. 14 (sec. 1) and Ch. 15 (sec. 1 & 4): Logistic Regression 22s:52 Applied Linear Regression Ch. 4 (sec. and Ch. 5 (sec. & 4: Logistic Regression Logistic Regression When the response variable is a binary variable, such as 0 or live or die fail or succeed then

More information

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples

ST3241 Categorical Data Analysis I Generalized Linear Models. Introduction and Some Examples ST3241 Categorical Data Analysis I Generalized Linear Models Introduction and Some Examples 1 Introduction We have discussed methods for analyzing associations in two-way and three-way tables. Now we will

More information