Statistics II Logistic Regression. So far... Two-way repeated measures ANOVA: an example. RM-ANOVA example: the data after log transform
|
|
- Hollie Higgins
- 5 years ago
- Views:
Transcription
1 Statistics II Logistic Regression Çağrı Çöltekin Exam date & time: June 21, 10:00 13:00 (The same day/time lanned at the beginning of the semester) University of Groningen, Det of Information Science May 29, 2013 More on the exam next week Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 So far Two-way reeated measures ANOVA: an examle Simle regresion Single (tyically) numeric redictor, numeric resonse Multile regression Multile redictors, numeric resonse ANOVA Single categorical redictor, numeric resonse Factorail ANOVA Multile categorical redictor, numeric resonse Reated-measures ANOVA Like single/factorial ANOVA with deendent observations This week: logistic regression We return to last week s roduction exeriment examle We want to analyze the auses after three different hrase tyes: IP, PP and PAR In addition, we are also interested in effect of length of the hrase Particularly, whether it is single word or multi word We have a 3 2 reeated measures ANOVA We use 5 different examles for each exerimental condition, resulting in utterances recorded for each of our 30 articiants Note: we need to randomize the order of the utterances for each subject Why? Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 RM-ANOVA examle: the data RM-ANOVA examle: the data after log transform Pause (s) log(pause) IP PAR PP Phrase Tye multi single Length IP PAR PP Phrase Tye multi single Phrase Length Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 RM-ANOVA examle: assumtions RM-ANOVA examle: main ANOVA results Check each (3 2) grou for normality Check for shericity: Mauchly s Test for Shericity Effect W <05 2 hrase tye hrase tye : length Why don t we have a shericity test for length? What if shericity assumtion was violated? R outut: > summary ( aov ( log ( ause ) ~ hrase tye * length + Error ( subj ))) Error : subj Df Sum Sq Mean Sq F value Pr(>F) Residuals Error : Within Df Sum Sq Mean Sq F value Pr(>F) hrase tye <2e -16 *** length hrase tye : length * Residuals Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37
2 RM-ANOVA examle: interaction RM-ANOVA: a summary and further directions mean of log(ause) IP PAR PP RM-ANOVA is an efficient method for exerimentation ANOVA design can be mixed, where on can analyze multile between- as well as within-subject factors RM-ANOVA is difficult to lease, excet in carefully constructed exerimental conditions In our examle today, we ignored inter-hrase reeated measures This is a common roblem known as language-as-a-fixed-effect fallacy in sycholinguistics A newer alternative is to use linear mixed-effect models (or multilevel models) multi single Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Logistic regression: motivation Logistic regression: some examles For all methods discussed in this class, the resonse variable is numeric Sometimes we want to analyze/redict categorical resonses: Alive or dead Grammatical or ungrammatical Correct or incorrect Pass or fail Win or loose Present or absent Logistic regression is an extension of regression for categorical (tyically binary) resonse variables survival after a surgery deending on age, length of surgery, whether urchase occurs deending on age, income, website characteristics, whether seech errors occur deending on alcohol level when linguistic rules aly (final [t] in Dutch) deending on seed of utterance, stress, social grou, whether one votes to a olitical arty (or not) deending on age, income, ethnicity, Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Simle regression: a refresher Regression assumtions y i = a + bx i + e i y is the resonse (or outcome, or deendent) variable The index i reresent each unit observation/measurement (sometimes called a case ) x is the redictor (or exlanatory, or indeendent) variable a is the intercet b is the sloe of the regression line a + bx is the deterministic art of the model (we sometimes use ^y) e is the residual, error, or the variation that is not accounted for by the model Assumed to be (aroximately) normally distributed with 0 mean (e i are assumed to be iid) indeendence cases (observations) should be indeendent linearity the relation between y vs x should be linear normality residuals should be normally distributed with 0 mean constant variance residual variance should be constant Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Multile regression Multile regression: examle y i = a + b 1 x i,1 +b 2 x 2,i + + b k x }{{ k,i +e } i ^y Multile regression is a generalization of the simle regression, where we redict the outcome using multile redictors Multicollinearity causes roblems in estimation and interretation of multile-regression models Model selection (finding a model that fits the data well, but not more comlex than necessary) is imortant We will try to determine reaction time differences in a lexical decision task in two exerimental conditions C 1 and C 2, and also investigate the effect of age Here is how our data looks like, Reaction time (ms) condition age 563 C C C C 2 31 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37
3 Multile regression: examle (2) Running multile regression (R outut): Call : lm( formula = log (rt) ~ age + condition ) Estimate Std Error t value Pr ( > t ) ( Intercet ) < 2e -16 *** age conditionc e -14 *** --- Residual standard error : on 37 degrees of freedom Multile R- squared : 07927, Adjusted R- squared : F- statistic : 7074 on 2 and 37 DF, - value : 2276e -13 Logistic Regression Logistic regression is a secial case of regression, where a binary outcome is redicted based on a number of redictors The main trick is to redict robability of an event, rather than the outcome directly This allows converting a categorical variable to a numeric variable (robabilities) One needs to overcome difficulties due to regression assumtions Probabilities are strictly bounded between 0 and 1 Error (residuals) with binary (or roortion) resonse is not normally distributed Note: you should do model diagnostics before interreting the model Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 An examle for logistic regression We go back to our lexical decision task examle In similar exeriments tyically we also whether the reaction was correct or incorrect Our data actually looks like this: Reaction time (ms) condition age correct 563 C C C C Our aim is to redict robability of correct decision Trying ordinary regression Model the outcome directly: lm( formula = correct ~ age + condition ) Estimate Std Error t value Pr ( > t ) ( Intercet ) age * conditionc * P(correct) = age 0210 condition Estimated robability of correct resonse to condition 1 from a 40-year old is: = 1102 Resonse variable (robability) is not bounded between 0 and 1 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Transforming the resonse variable The logit function Instead of redicting the robability,, we can redicts odds Odds in favor of an event (eg, FC Groningen winning against Ajax) is defined based on, the robability of the event occurring, as odds = 1 odds are bounded between 0 and infinity, 0 odds further if we take the logarithm of odds, log 1 = logit() Bounds of logit is logit() + logit() logit() = log Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 But, what about the name logistic? Logistic function is the inverse of the logit function: logistic(x) logistic(x) = 1 1 e x x Trying ordinary regression with logit transform lm( formula = logit ( correct ) ~ age + condition ) Estimate Std Error t value Pr ( > t ) ( Intercet ) age * conditionc * Residual standard error : 2182 on 37 degrees of freedom Multile R- squared : 02501, Adjusted R- squared : F- statistic : 6169 on 2 and 37 DF, - value : Now our estimate is: logit(p(correct)) = age 1538 condition The model is more correct, but interretation is difficult Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37
4 Logistic regression: are we there yet? Let s look at the residual distribution: Regression for binary resonse: roblems so far Frequency Histogram of residuals Binary data is strictly bounded, robabilities smaller than 0 and larger than 1 are meaningless We solve this by transforming the resonse variable using logit function (other transformations are also ossible) When resonse is binary, the residuals are not normally distributed with 0 mean We estimate the coefficients without assuming normally distributed error The cost of is also giving u the familiar (and uniquely determined) least-squares regression Estimation is done with more comlicated rocedures (tyically maximum likelihood) More recisely, logistic regression is an instance of generalized linear models (GLMs) Residuals Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Logistic regression Maximum likelihood estimation Logistic regression is similar to (multile) regression, differences are: Categorical resonse variable Non-normal errors Relationshi is non-linear (linear after the transformation) Estimation is (tyically) done using maximum likelihood estimation Least-squares regression is not alicable logit( i ) = a + b 1 x 1,i + + b k x k,i + e i Maximum likelihood estimation tries to find the set of model arameters, or coefficients, a, b 1, b k, which make the data most likely (or minimizes the error) MLE is an iterative search for the otimum arameter values There is no exact solution In some cases, MLE may fail to find a solution If errors are normally distributed, MLE is equivalent to least-squares estimation Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Logistic regression: examle We return to the same examle, this time fitting a correct model: glm ( formula = correct ~ age + condition, family = binomial ) Deviance Estimate Std Error z value Pr ( > z ) ( Intercet ) age conditionc ( Disersion arameter for binomial family taken to be 1) Null deviance : on 39 degrees of freedom Residual deviance : on 37 degrees of freedom AIC : Number of Fisher Scoring iterations : 19 Estimated equation is: logit(p(correct)) = age condition Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Logistic regression: interreting the coefficients log P(correct) = age condition 1 P(correct) Sloe of a variable indicates the change in log odds for unit-change in the redictor (while other redictors ket constant) For examle, one unit change in age, say from 40 to 41, means log P(correct41) P(correct40) 1 P(correct 41) log 1 P(correct 40) = e log P(correct 41 ) 1 P(correct 41 ) log P(correct 40 ) 1 P(correct 40 ) = e} {{} ex(b) odds(correct 41) odds(correct 40) = When age is increased form 40 to 41, odds of correct resonse increase 12 times Note: the change is non-linear Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Logistic regression: inference and model fit Logistic regression: where things may go wrong In logistic regression the significance testing for coefficients are carried out using z-statistics (also known as Wald statistic or Wald s z ) We do not have r 2 as measure of fit (aroximations exists, but none have all the roerties of r 2 2LL ( 2 times Log Likelihood, smaller better) is the measure of fit in maximum likelihood estimation Log likelihood is not bounded like r 2, comarisons are valid only on the same data set 2LL is aroximately χ 2 distributed, a model can be tested against null model (similar to F-test for least-squares regression) Overdisersion is the case when variance in the data is higher than exected Logistic regression requires resonse to follow binomial distribution Variance of binomial distribution is redictable from it s mean In case of comlete searation, ie, when redictors erfectly searate cases of success and failure, logistic regression arameters cannot be estimated Logistic regression may also fail to find a solution, esecially when data is not evenly distributed Logistic regression is multile regression, other roblems, such as collinearity is also roblems for logistic regression Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37
5 Logistic regression: another examle We would like to guess whether a child would develo dyslexia or not based on a test alied to re-verbal children Here is a simlified roblem: We test children when they are less than 2 years of age The hyothesis is that the test may be relevant in diagnosing dyslexia We observe the same children when they are in the school age, and note whether they are diagnosed with dyslexia or not Our data looks like the following: Test score Dyslexia Examle: the analysis glm ( formula = dys ~ score, family = binomial, data = d) Deviance Estimate Std Error z value Pr ( > z ) ( Intercet ) ** score ** --- ( Disersion arameter for binomial family taken to be 1) Null deviance : on 39 degrees of freedom Residual deviance : on 38 degrees of freedom AIC : Number of Fisher Scoring iterations : 5 logit((dyslexia)) = score * The research question is from the longitudinal study by Ben Maasen and his colleagues Data is fake as usual Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Examle: interretation Visualizing the logistic regression logit((dyslexia)) = score score logit = log 1 odds = (dyslexia) Test score Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Summary Next week Logistic regression is alicable when resonseis binomial Two major differences from ordinary least-squares regression: Resonse variable is strictly bounded logit transformation make sure regression outut is maed to range [0, 1] Errors are not normal GLM framework allows non-normal errors However, we use maximum likelihood estimation instead of least squares Problems to watch out for: overdiserion comlete searation, or insufficient data for estimation like in multile regression: multicollinearity Extensions Multinomial logistic regression, when there are more than two categories of the resonse variable Genralized linear mixed-effect models when observations are not indeendent A summary of all methods When to use which analysis What to do when assumtions fail Beyond hyothesis testing: effect sizes Time for your questions Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37 Ç Çöltekin / RuG Statistics II: Logistic Regression May 29, / 37
STK4900/ Lecture 7. Program
STK4900/9900 - Lecture 7 Program 1. Logistic regression with one redictor 2. Maximum likelihood estimation 3. Logistic regression with several redictors 4. Deviance and likelihood ratio tests 5. A comment
More informationGeneral Linear Model Introduction, Classes of Linear models and Estimation
Stat 740 General Linear Model Introduction, Classes of Linear models and Estimation An aim of scientific enquiry: To describe or to discover relationshis among events (variables) in the controlled (laboratory)
More informationA Comparison between Biased and Unbiased Estimators in Ordinary Least Squares Regression
Journal of Modern Alied Statistical Methods Volume Issue Article 7 --03 A Comarison between Biased and Unbiased Estimators in Ordinary Least Squares Regression Ghadban Khalaf King Khalid University, Saudi
More informationMorten Frydenberg Section for Biostatistics Version :Friday, 05 September 2014
Morten Frydenberg Section for Biostatistics Version :Friday, 05 Setember 204 All models are aroximations! The best model does not exist! Comlicated models needs a lot of data. lower your ambitions or get
More informationDr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur
Analysis of Variance and Design of Exeriment-I MODULE II LECTURE -4 GENERAL LINEAR HPOTHESIS AND ANALSIS OF VARIANCE Dr. Shalabh Deartment of Mathematics and Statistics Indian Institute of Technology Kanur
More informationCHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit
Chater 5 Statistical Inference 69 CHAPTER 5 STATISTICAL INFERENCE.0 Hyothesis Testing.0 Decision Errors 3.0 How a Hyothesis is Tested 4.0 Test for Goodness of Fit 5.0 Inferences about Two Means It ain't
More informationOne-way ANOVA Inference for one-way ANOVA
One-way ANOVA Inference for one-way ANOVA IPS Chater 12.1 2009 W.H. Freeman and Comany Objectives (IPS Chater 12.1) Inference for one-way ANOVA Comaring means The two-samle t statistic An overview of ANOVA
More information4. Score normalization technical details We now discuss the technical details of the score normalization method.
SMT SCORING SYSTEM This document describes the scoring system for the Stanford Math Tournament We begin by giving an overview of the changes to scoring and a non-technical descrition of the scoring rules
More informationBiostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Response) Logistic Regression
Biostat Methods STAT 5500/6500 Handout #12: Methods and Issues in (Binary Resonse) Logistic Regression Recall general χ 2 test setu: Y 0 1 Trt 0 a b Trt 1 c d I. Basic logistic regression Previously (Handout
More informationFE FORMULATIONS FOR PLASTICITY
G These slides are designed based on the book: Finite Elements in Plasticity Theory and Practice, D.R.J. Owen and E. Hinton, 1970, Pineridge Press Ltd., Swansea, UK. 1 Course Content: A INTRODUCTION AND
More informationBiostat Methods STAT 5820/6910 Handout #5a: Misc. Issues in Logistic Regression
Biostat Methods STAT 5820/6910 Handout #5a: Misc. Issues in Logistic Regression Recall general χ 2 test setu: Y 0 1 Trt 0 a b Trt 1 c d I. Basic logistic regression Previously (Handout 4a): χ 2 test of
More informationHotelling s Two- Sample T 2
Chater 600 Hotelling s Two- Samle T Introduction This module calculates ower for the Hotelling s two-grou, T-squared (T) test statistic. Hotelling s T is an extension of the univariate two-samle t-test
More informationLOGISTIC REGRESSION. VINAYANAND KANDALA M.Sc. (Agricultural Statistics), Roll No I.A.S.R.I, Library Avenue, New Delhi
LOGISTIC REGRESSION VINAANAND KANDALA M.Sc. (Agricultural Statistics), Roll No. 444 I.A.S.R.I, Library Avenue, New Delhi- Chairerson: Dr. Ranjana Agarwal Abstract: Logistic regression is widely used when
More informationUse of Transformations and the Repeated Statement in PROC GLM in SAS Ed Stanek
Use of Transformations and the Reeated Statement in PROC GLM in SAS Ed Stanek Introduction We describe how the Reeated Statement in PROC GLM in SAS transforms the data to rovide tests of hyotheses of interest.
More informationTests for Two Proportions in a Stratified Design (Cochran/Mantel-Haenszel Test)
Chater 225 Tests for Two Proortions in a Stratified Design (Cochran/Mantel-Haenszel Test) Introduction In a stratified design, the subects are selected from two or more strata which are formed from imortant
More informationAI*IA 2003 Fusion of Multiple Pattern Classifiers PART III
AI*IA 23 Fusion of Multile Pattern Classifiers PART III AI*IA 23 Tutorial on Fusion of Multile Pattern Classifiers by F. Roli 49 Methods for fusing multile classifiers Methods for fusing multile classifiers
More information7.2 Inference for comparing means of two populations where the samples are independent
Objectives 7.2 Inference for comaring means of two oulations where the samles are indeendent Two-samle t significance test (we give three examles) Two-samle t confidence interval htt://onlinestatbook.com/2/tests_of_means/difference_means.ht
More informationModels of Regression type: Logistic Regression Model for Binary Response Variable
Models of Regression tye: Logistic Regression Model for Binary Resonse Variable Gebrenegus Ghilagaber March 7, 2008 Introduction to Logistic Regression Let Y be a binary (0, ) variable de ned as 8 < if
More informationute measures of uncertainty called standard errors for these b j estimates and the resulting forecasts if certain conditions are satis- ed. Note the e
Regression with Time Series Errors David A. Dickey, North Carolina State University Abstract: The basic assumtions of regression are reviewed. Grahical and statistical methods for checking the assumtions
More informationThe Poisson Regression Model
The Poisson Regression Model The Poisson regression model aims at modeling a counting variable Y, counting the number of times that a certain event occurs during a given time eriod. We observe a samle
More informationGeneralized linear models for binary data. A better graphical exploratory data analysis. The simple linear logistic regression model
Stat 3302 (Spring 2017) Peter F. Craigmile Simple linear logistic regression (part 1) [Dobson and Barnett, 2008, Sections 7.1 7.3] Generalized linear models for binary data Beetles dose-response example
More informationReview: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form:
Outline for today What is a generalized linear model Linear predictors and link functions Example: fit a constant (the proportion) Analysis of deviance table Example: fit dose-response data using logistic
More informationPretest (Optional) Use as an additional pacing tool to guide instruction. August 21
Trimester 1 Pretest (Otional) Use as an additional acing tool to guide instruction. August 21 Beyond the Basic Facts In Trimester 1, Grade 8 focus on multilication. Daily Unit 1: Rational vs. Irrational
More informationMATHEMATICAL MODELLING OF THE WIRELESS COMMUNICATION NETWORK
Comuter Modelling and ew Technologies, 5, Vol.9, o., 3-39 Transort and Telecommunication Institute, Lomonosov, LV-9, Riga, Latvia MATHEMATICAL MODELLIG OF THE WIRELESS COMMUICATIO ETWORK M. KOPEETSK Deartment
More informationNotes on Instrumental Variables Methods
Notes on Instrumental Variables Methods Michele Pellizzari IGIER-Bocconi, IZA and frdb 1 The Instrumental Variable Estimator Instrumental variable estimation is the classical solution to the roblem of
More informationRon Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)
Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October
More informationSAS for Bayesian Mediation Analysis
Paer 1569-2014 SAS for Bayesian Mediation Analysis Miočević Milica, Arizona State University; David P. MacKinnon, Arizona State University ABSTRACT Recent statistical mediation analysis research focuses
More informationFeedback-error control
Chater 4 Feedback-error control 4.1 Introduction This chater exlains the feedback-error (FBE) control scheme originally described by Kawato [, 87, 8]. FBE is a widely used neural network based controller
More informationA New Asymmetric Interaction Ridge (AIR) Regression Method
A New Asymmetric Interaction Ridge (AIR) Regression Method by Kristofer Månsson, Ghazi Shukur, and Pär Sölander The Swedish Retail Institute, HUI Research, Stockholm, Sweden. Deartment of Economics and
More informationThe Longest Run of Heads
The Longest Run of Heads Review by Amarioarei Alexandru This aer is a review of older and recent results concerning the distribution of the longest head run in a coin tossing sequence, roblem that arise
More informationSlides Prepared by JOHN S. LOUCKS St. Edward s s University Thomson/South-Western. Slide
s Preared by JOHN S. LOUCKS St. Edward s s University 1 Chater 11 Comarisons Involving Proortions and a Test of Indeendence Inferences About the Difference Between Two Poulation Proortions Hyothesis Test
More informationGeneralized linear models
Generalized linear models Outline for today What is a generalized linear model Linear predictors and link functions Example: estimate a proportion Analysis of deviance Example: fit dose- response data
More informationMODELING THE RELIABILITY OF C4ISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL
Technical Sciences and Alied Mathematics MODELING THE RELIABILITY OF CISR SYSTEMS HARDWARE/SOFTWARE COMPONENTS USING AN IMPROVED MARKOV MODEL Cezar VASILESCU Regional Deartment of Defense Resources Management
More informationIntroducing Generalized Linear Models: Logistic Regression
Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and
More informationSystem Reliability Estimation and Confidence Regions from Subsystem and Full System Tests
009 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June 0-, 009 FrB4. System Reliability Estimation and Confidence Regions from Subsystem and Full System Tests James C. Sall Abstract
More informationGeneralized linear models
Generalized linear models Douglas Bates November 01, 2010 Contents 1 Definition 1 2 Links 2 3 Estimating parameters 5 4 Example 6 5 Model building 8 6 Conclusions 8 7 Summary 9 1 Generalized Linear Models
More informationWhy Proofs? Proof Techniques. Theorems. Other True Things. Proper Proof Technique. How To Construct A Proof. By Chuck Cusack
Proof Techniques By Chuck Cusack Why Proofs? Writing roofs is not most student s favorite activity. To make matters worse, most students do not understand why it is imortant to rove things. Here are just
More information16.2. Infinite Series. Introduction. Prerequisites. Learning Outcomes
Infinite Series 6.2 Introduction We extend the concet of a finite series, met in Section 6., to the situation in which the number of terms increase without bound. We define what is meant by an infinite
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More informationLogistic Regressions. Stat 430
Logistic Regressions Stat 430 Final Project Final Project is, again, team based You will decide on a project - only constraint is: you are supposed to use techniques for a solution that are related to
More informationModel Estimation Example
Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions
More informationIntroduction to Probability and Statistics
Introduction to Probability and Statistics Chater 8 Ammar M. Sarhan, asarhan@mathstat.dal.ca Deartment of Mathematics and Statistics, Dalhousie University Fall Semester 28 Chater 8 Tests of Hyotheses Based
More informationLinear Regression Models P8111
Linear Regression Models P8111 Lecture 25 Jeff Goldsmith April 26, 2016 1 of 37 Today s Lecture Logistic regression / GLMs Model framework Interpretation Estimation 2 of 37 Linear regression Course started
More informationEstimating Time-Series Models
Estimating ime-series Models he Box-Jenkins methodology for tting a model to a scalar time series fx t g consists of ve stes:. Decide on the order of di erencing d that is needed to roduce a stationary
More informationMODULE 6 LOGISTIC REGRESSION. Module Objectives:
MODULE 6 LOGISTIC REGRESSION Module Objectives: 1. 147 6.1. LOGIT TRANSFORMATION MODULE 6. LOGISTIC REGRESSION Logistic regression models are used when a researcher is investigating the relationship between
More informationPublished: 14 October 2013
Electronic Journal of Alied Statistical Analysis EJASA, Electron. J. A. Stat. Anal. htt://siba-ese.unisalento.it/index.h/ejasa/index e-issn: 27-5948 DOI: 1.1285/i275948v6n213 Estimation of Parameters of
More informationBackground. GLM with clustered data. The problem. Solutions. A fixed effects approach
Background GLM with clustered data A fixed effects aroach Göran Broström Poisson or Binomial data with the following roerties A large data set, artitioned into many relatively small grous, and where members
More informationStatics and dynamics: some elementary concepts
1 Statics and dynamics: some elementary concets Dynamics is the study of the movement through time of variables such as heartbeat, temerature, secies oulation, voltage, roduction, emloyment, rices and
More informationAn Improved Calibration Method for a Chopped Pyrgeometer
96 JOURNAL OF ATMOSPHERIC AND OCEANIC TECHNOLOGY VOLUME 17 An Imroved Calibration Method for a Choed Pyrgeometer FRIEDRICH FERGG OtoLab, Ingenieurbüro, Munich, Germany PETER WENDLING Deutsches Forschungszentrum
More informationElements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley
Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the
More informationKeywords: pile, liquefaction, lateral spreading, analysis ABSTRACT
Key arameters in seudo-static analysis of iles in liquefying sand Misko Cubrinovski Deartment of Civil Engineering, University of Canterbury, Christchurch 814, New Zealand Keywords: ile, liquefaction,
More informationCombining Logistic Regression with Kriging for Mapping the Risk of Occurrence of Unexploded Ordnance (UXO)
Combining Logistic Regression with Kriging for Maing the Risk of Occurrence of Unexloded Ordnance (UXO) H. Saito (), P. Goovaerts (), S. A. McKenna (2) Environmental and Water Resources Engineering, Deartment
More informationA Generalized Linear Model for Binomial Response Data. Copyright c 2017 Dan Nettleton (Iowa State University) Statistics / 46
A Generalized Linear Model for Binomial Response Data Copyright c 2017 Dan Nettleton (Iowa State University) Statistics 510 1 / 46 Now suppose that instead of a Bernoulli response, we have a binomial response
More informationPretest (Optional) Use as an additional pacing tool to guide instruction. August 21
Trimester 1 Pretest (Otional) Use as an additional acing tool to guide instruction. August 21 Beyond the Basic Facts In Trimester 1, Grade 7 focus on multilication. Daily Unit 1: The Number System Part
More informationStatistics 203 Introduction to Regression Models and ANOVA Practice Exam
Statistics 203 Introduction to Regression Models and ANOVA Practice Exam Prof. J. Taylor You may use your 4 single-sided pages of notes This exam is 7 pages long. There are 4 questions, first 3 worth 10
More informationOutline for today. Maximum likelihood estimation. Computation with multivariate normal distributions. Multivariate normal distribution
Outline for today Maximum likelihood estimation Rasmus Waageetersen Deartment of Mathematics Aalborg University Denmark October 30, 2007 the multivariate normal distribution linear and linear mixed models
More informationExam Applied Statistical Regression. Good Luck!
Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.
More informationKEY ISSUES IN THE ANALYSIS OF PILES IN LIQUEFYING SOILS
4 th International Conference on Earthquake Geotechnical Engineering June 2-28, 27 KEY ISSUES IN THE ANALYSIS OF PILES IN LIQUEFYING SOILS Misko CUBRINOVSKI 1, Hayden BOWEN 1 ABSTRACT Two methods for analysis
More informationResearch Note REGRESSION ANALYSIS IN MARKOV CHAIN * A. Y. ALAMUTI AND M. R. MESHKANI **
Iranian Journal of Science & Technology, Transaction A, Vol 3, No A3 Printed in The Islamic Reublic of Iran, 26 Shiraz University Research Note REGRESSION ANALYSIS IN MARKOV HAIN * A Y ALAMUTI AND M R
More informationLecture: Condorcet s Theorem
Social Networs and Social Choice Lecture Date: August 3, 00 Lecture: Condorcet s Theorem Lecturer: Elchanan Mossel Scribes: J. Neeman, N. Truong, and S. Troxler Condorcet s theorem, the most basic jury
More informationLower Confidence Bound for Process-Yield Index S pk with Autocorrelated Process Data
Quality Technology & Quantitative Management Vol. 1, No.,. 51-65, 15 QTQM IAQM 15 Lower onfidence Bound for Process-Yield Index with Autocorrelated Process Data Fu-Kwun Wang * and Yeneneh Tamirat Deartment
More informationSolved Problems. (a) (b) (c) Figure P4.1 Simple Classification Problems First we draw a line between each set of dark and light data points.
Solved Problems Solved Problems P Solve the three simle classification roblems shown in Figure P by drawing a decision boundary Find weight and bias values that result in single-neuron ercetrons with the
More information2/26/2017. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 When and why do we use logistic regression? Binary Multinomial Theory behind logistic regression Assessing the model Assessing predictors
More informationStatistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018
Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical
More informationShadow Computing: An Energy-Aware Fault Tolerant Computing Model
Shadow Comuting: An Energy-Aware Fault Tolerant Comuting Model Bryan Mills, Taieb Znati, Rami Melhem Deartment of Comuter Science University of Pittsburgh (bmills, znati, melhem)@cs.itt.edu Index Terms
More informationFinite Mixture EFA in Mplus
Finite Mixture EFA in Mlus November 16, 2007 In this document we describe the Mixture EFA model estimated in Mlus. Four tyes of deendent variables are ossible in this model: normally distributed, ordered
More informationBayesian Spatially Varying Coefficient Models in the Presence of Collinearity
Bayesian Satially Varying Coefficient Models in the Presence of Collinearity David C. Wheeler 1, Catherine A. Calder 1 he Ohio State University 1 Abstract he belief that relationshis between exlanatory
More informationMATH 2710: NOTES FOR ANALYSIS
MATH 270: NOTES FOR ANALYSIS The main ideas we will learn from analysis center around the idea of a limit. Limits occurs in several settings. We will start with finite limits of sequences, then cover infinite
More informationarxiv: v3 [physics.data-an] 23 May 2011
Date: October, 8 arxiv:.7v [hysics.data-an] May -values for Model Evaluation F. Beaujean, A. Caldwell, D. Kollár, K. Kröninger Max-Planck-Institut für Physik, München, Germany CERN, Geneva, Switzerland
More informationCryptanalysis of Pseudorandom Generators
CSE 206A: Lattice Algorithms and Alications Fall 2017 Crytanalysis of Pseudorandom Generators Instructor: Daniele Micciancio UCSD CSE As a motivating alication for the study of lattice in crytograhy we
More informationCompletely Randomized Design
CHAPTER 4 Comletely Randomized Design 4.1 Descrition of the Design Chaters 1 to 3 introduced some basic concets and statistical tools that are used in exerimental design. In this and the following chaters,
More informationChapter 3. GMM: Selected Topics
Chater 3. GMM: Selected oics Contents Otimal Instruments. he issue of interest..............................2 Otimal Instruments under the i:i:d: assumtion..............2. he basic result............................2.2
More informationA Qualitative Event-based Approach to Multiple Fault Diagnosis in Continuous Systems using Structural Model Decomposition
A Qualitative Event-based Aroach to Multile Fault Diagnosis in Continuous Systems using Structural Model Decomosition Matthew J. Daigle a,,, Anibal Bregon b,, Xenofon Koutsoukos c, Gautam Biswas c, Belarmino
More informationChecking model assumptions with regression diagnostics
@graemeleehickey www.glhickey.com graeme.hickey@liverpool.ac.uk Checking model assumptions with regression diagnostics Graeme L. Hickey University of Liverpool Conflicts of interest None Assistant Editor
More informationEstimating function analysis for a class of Tweedie regression models
Title Estimating function analysis for a class of Tweedie regression models Author Wagner Hugo Bonat Deartamento de Estatística - DEST, Laboratório de Estatística e Geoinformação - LEG, Universidade Federal
More information7/28/15. Review Homework. Overview. Lecture 6: Logistic Regression Analysis
Lecture 6: Logistic Regression Analysis Christopher S. Hollenbeak, PhD Jane R. Schubart, PhD The Outcomes Research Toolbox Review Homework 2 Overview Logistic regression model conceptually Logistic regression
More informationdn i where we have used the Gibbs equation for the Gibbs energy and the definition of chemical potential
Chem 467 Sulement to Lectures 33 Phase Equilibrium Chemical Potential Revisited We introduced the chemical otential as the conjugate variable to amount. Briefly reviewing, the total Gibbs energy of a system
More informationLogistic Regression 21/05
Logistic Regression 21/05 Recall that we are trying to solve a classification problem in which features x i can be continuous or discrete (coded as 0/1) and the response y is discrete (0/1). Logistic regression
More informationMonte Carlo Studies. Monte Carlo Studies. Sampling Distribution
Monte Carlo Studies Do not let yourself be intimidated by the material in this lecture This lecture involves more theory but is meant to imrove your understanding of: Samling distributions and tests of
More informationANALYSIS ON PROBLEMS AND MOTIVATION OF POST 90S UNDERGRADUATES' ETIQUETTE
ANALYSIS ON PROBLEMS AND MOTIVATION OF POST 90S UNDERGRADUATES' ETIQUETTE WEIHONG KONG Deartment of Social Sciences, Handan College, Handan 056005, Hebei, China ABSTRACT At resent, the ost 90s has become
More informationHidden Predictors: A Factor Analysis Primer
Hidden Predictors: A Factor Analysis Primer Ryan C Sanchez Western Washington University Factor Analysis is a owerful statistical method in the modern research sychologist s toolbag When used roerly, factor
More informationPlotting the Wilson distribution
, Survey of English Usage, University College London Setember 018 1 1. Introduction We have discussed the Wilson score interval at length elsewhere (Wallis 013a, b). Given an observed Binomial roortion
More informationASYMPTOTIC RESULTS OF A HIGH DIMENSIONAL MANOVA TEST AND POWER COMPARISON WHEN THE DIMENSION IS LARGE COMPARED TO THE SAMPLE SIZE
J Jaan Statist Soc Vol 34 No 2004 9 26 ASYMPTOTIC RESULTS OF A HIGH DIMENSIONAL MANOVA TEST AND POWER COMPARISON WHEN THE DIMENSION IS LARGE COMPARED TO THE SAMPLE SIZE Yasunori Fujikoshi*, Tetsuto Himeno
More informationOn Using FASTEM2 for the Special Sensor Microwave Imager (SSM/I) March 15, Godelieve Deblonde Meteorological Service of Canada
On Using FASTEM2 for the Secial Sensor Microwave Imager (SSM/I) March 15, 2001 Godelieve Deblonde Meteorological Service of Canada 1 1. Introduction Fastem2 is a fast model (multile-linear regression model)
More informationpp physics, RWTH, WS 2003/04, T.Hebbeker
1. PP TH 03/04 Accelerators and Detectors 1 hysics, RWTH, WS 2003/04, T.Hebbeker 2003-12-03 1. Accelerators and Detectors In the following, we concentrate on the three machines SPS, Tevatron and LHC with
More informationChapter 13 Variable Selection and Model Building
Chater 3 Variable Selection and Model Building The comlete regsion analysis deends on the exlanatory variables ent in the model. It is understood in the regsion analysis that only correct and imortant
More informationExtensions of the Penalized Spline Propensity Prediction Method of Imputation
Extensions of the Penalized Sline Proensity Prediction Method of Imutation by Guangyu Zhang A dissertation submitted in artial fulfillment of the requirements for the degree of Doctor of Philosohy (Biostatistics)
More informationLogistic Regression - problem 6.14
Logistic Regression - problem 6.14 Let x 1, x 2,, x m be given values of an input variable x and let Y 1,, Y m be independent binomial random variables whose distributions depend on the corresponding values
More informationChapter 7 Sampling and Sampling Distributions. Introduction. Selecting a Sample. Introduction. Sampling from a Finite Population
Chater 7 and s Selecting a Samle Point Estimation Introduction to s of Proerties of Point Estimators Other Methods Introduction An element is the entity on which data are collected. A oulation is a collection
More informationSAMARIS. Draft report
SAMARIS Work Package 5 - Performance based secifications Selection and evaluation of models for rediction of ermanent deformations of unbound granular materials in road avements Draft reort Pierre Hornych
More informationSTA 250: Statistics. Notes 7. Bayesian Approach to Statistics. Book chapters: 7.2
STA 25: Statistics Notes 7. Bayesian Aroach to Statistics Book chaters: 7.2 1 From calibrating a rocedure to quantifying uncertainty We saw that the central idea of classical testing is to rovide a rigorous
More information9 Generalized Linear Models
9 Generalized Linear Models The Generalized Linear Model (GLM) is a model which has been built to include a wide range of different models you already know, e.g. ANOVA and multiple linear regression models
More informationElements of Asymptotic Theory. James L. Powell Department of Economics University of California, Berkeley
Elements of Asymtotic Theory James L. Powell Deartment of Economics University of California, Berkeley Objectives of Asymtotic Theory While exact results are available for, say, the distribution of the
More informationMatched Pair Data. Stat 557 Heike Hofmann
Matched Pair Data Stat 557 Heike Hofmann Outline Marginal Homogeneity - review Binary Response with covariates Ordinal response Symmetric Models Subject-specific vs Marginal Model conditional logistic
More information2. Sample representativeness. That means some type of probability/random sampling.
1 Neuendorf Cluster Analysis Assumes: 1. Actually, any level of measurement (nominal, ordinal, interval/ratio) is accetable for certain tyes of clustering. The tyical methods, though, require metric (I/R)
More informationObjectives. 6.1, 7.1 Estimating with confidence (CIS: Chapter 10) CI)
Objectives 6.1, 7.1 Estimating with confidence (CIS: Chater 10) Statistical confidence (CIS gives a good exlanation of a 95% CI) Confidence intervals. Further reading htt://onlinestatbook.com/2/estimation/confidence.html
More informationWeek 7 Multiple factors. Ch , Some miscellaneous parts
Week 7 Multiple factors Ch. 18-19, Some miscellaneous parts Multiple Factors Most experiments will involve multiple factors, some of which will be nuisance variables Dealing with these factors requires
More informationOn split sample and randomized confidence intervals for binomial proportions
On slit samle and randomized confidence intervals for binomial roortions Måns Thulin Deartment of Mathematics, Usala University arxiv:1402.6536v1 [stat.me] 26 Feb 2014 Abstract Slit samle methods have
More informationLogistic Regression. James H. Steiger. Department of Psychology and Human Development Vanderbilt University
Logistic Regression James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Logistic Regression 1 / 38 Logistic Regression 1 Introduction
More informationChapter 10. Supplemental Text Material
Chater 1. Sulemental Tet Material S1-1. The Covariance Matri of the Regression Coefficients In Section 1-3 of the tetbook, we show that the least squares estimator of β in the linear regression model y=
More information