Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
|
|
- Meghan Hopkins
- 5 years ago
- Views:
Transcription
1 Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 - Introduction
2 Today s Class Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance and Covariance Review of the General Linear Model EPSY 905: Lecture 1 - Introduction 2
3 COURSE OVERVIEW EPSY 905: Lecture 1 - Introduction 3
4 Guiding Principles for PRE 905 #1 of 3: Blocks #1. If you understand the building blocks of a model, you can build anything! EPSY 905: Lecture 1 - Introduction 4
5 4 (or 5*) Model Building Blocks 1. Linear models (for effects of predictors) 2. Link functions (for anything not normal) 3a*. Random effects (for describing dependency) 3b*. Latent variables (for measurement models) 4. Estimation (e.g., Maximum Likelihood, Bayesian) * These are really the same thing. EPSY 905: Lecture 1 - Introduction 5
6 Principles #2 of 3 - The Journey is Part of the Destination Not just blocks; Not just a journey in 905 you will learn: Ø Missing data (impute ) Ø Path models Ø Mediation and moderation Ø Testing complex hypotheses involving observed variables Ø Bayesian Ø Likelihood based methods EPSY 905: Lecture 1 - Introduction 6
7 Guiding Principles for PRE 905 the Bridge: #3 of 3 A bridge between what you know now and advanced statistical methods EPSY 905: Lecture 1 - Introduction 7
8 Motivation for Course Content The goal of this course is to provide you with a fundamental understanding of the underpinnings of the most commonly used contemporary statistical models The course is a combination of topics, picked to make your experience more extendable beyond coursework Some topics are math/statistics heavy Ø Mathematical statistics for the social sciences Upon completion of the course, you will be able to understand the communalities that link methods EPSY 905: Lecture 1 - Introduction 8
9 Course Structure (from the syllabus) Course format is all lecture based Ø Office hours held in labs Homework assignments Ø About one week to complete (Thursday-Thursday, usually) Ø Questions: data analysis, interpretation, some question-and-answer, some from the readings Ø Late penalty: 1 percent per calendar day Ø Revisions are allowed (minus any late penalties) Course Project Ø Three rounds of submissions Ø Iterative process EPSY 905: Lecture 1 - Introduction 9
10 Lecture Format Mix of theory and examples with data and syntax Ø Software: R package EPSY 905: Lecture 1 - Introduction 10
11 REVIEW: BASIC STATISTICAL TOPICS EPSY 905: Lecture 1 - Introduction 11
12 Data for Today s Lecture To help demonstrate the concepts of today s lecture, we will be using a data set with three variables Ø Female (Gender): Male (=0) or Female (=1) Ø Height in inches Ø Weight in pounds The end point of our lecture will be to build a linear model that predicts a person s weight Ø Linear model: a statistical model for an outcome that uses a linear combination (a weighted sum; weighted by a slope) of one or more predictor variables EPSY 905: Lecture 1 - Introduction 12
13 Visualizing the Data EPSY 905: Lecture 1 - Introduction 13
14 Histograms of Height and Weight The weight variable seems to be bimodal should that bother you? (hint: it shouldn t yet) EPSY 905: Lecture 1 - Introduction 14
15 Descriptive Statistics We can summarize each variable marginally through a set of descriptive statistics Ø Marginal: one variable by itself Common marginal descriptive statistics: Ø Central tendency: Mean, Median, Mode Ø Variability: Standard deviation (variance), range We can also summarize the joint (bivariate) distribution of two variables through a set of descriptive statistics: Ø Joint distribution: more than one variable simultaneously Common bivariate descriptive statistics: Ø Correlation and covariance EPSY 905: Lecture 1 - Introduction 15
16 Descriptive Statistics for Height/Weight Data Variable Mean SD Variance Height Weight , Female Diagonal: Variance Above Diagonal: Covariance Correlation /Covariance Height Weight Female Height Weight.798 3, Female Below Diagonal: Correlation EPSY 905: Lecture 1 - Introduction 16
17 Re-examining the Concept of Variance Variability is a central concept in advanced statistics Ø In multivariate statistics, covariance is also central Two formulas for the variance (about the same when N is large):. S $ "# = 1 N 1 ) Y $ +, Y- +,/+. S $ "# = 1 N ) Y $ +, Y- +,/+ Here: p = person; 1 = variable number one Unbiased or sample Biased/ML or population EPSY 905: Lecture 1 - Introduction 17
18 Interpretation of Variance The variance describes the spread of a variable in squared units $ (which come from the Y +, Y- + term in the equation) Variance: the average squared distance of an observation from the mean Ø Variance of Height: inches squared Ø Variance of Weight: 3, pounds squared Ø Variance of Female not applicable in the same way! Because squared units are difficult to work with, we typically use the standard deviation which is reported in units Standard deviation: the average distance of an observation from the mean Ø SD of Height: 7.44 inches Ø SD of Weight: pounds EPSY 905: Lecture 1 - Introduction 18
19 Variance/SD as a More General Statistical Concept Variance (and the standard deviation) is a concept that is applied across statistics not just for data Ø Statistical parameters have variance w e.g. The sample mean Y- + has a standard error (SE) of S " - = 1 2. The standard error is another name for standard deviation Ø So standard error of the mean is equivalent to standard deviation of the mean Ø Usually error refers to parameters; deviation refers to data Ø Variance of the mean would be S " - $ = More generally, variance = error Ø You can think about the SE of the mean as telling you how far off the mean is for describing the data EPSY 905: Lecture 1 - Introduction 19
20 Correlation of Variables Moving from marginal summaries of each variable to joint (bivariate) summaries, the Pearson correlation is often used to describe the association between a pair of variables: 1 r "#," 3 = N 1.,/+ Y +, Y- + Y $, Y- $ S "# S "3 The correlation is unitless as it ranges from -1 to 1 for continuous variables, regardless of their variances Ø Pearson correlation of binary/categorical variables with continuous variables is called a point-biserial (same formula) Ø Pearson correlation of binary/categorical variables with other binary/categorical variables has bounds within -1 and 1 EPSY 905: Lecture 1 - Introduction 20
21 More on the Correlation Coefficient The Pearson correlation is a biased estimator Ø Biased estimator: the expected value differs from the true value for a statistic w Other biased estimators: Variance/SD when + is used. The unbiased correlation estimate would be: r 7 "#," 3 = r "#," r " #," 3 2N Ø As N gets large bias goes away; Bias is largest when r "#," 3 = 0 Ø Pearson is an underestimate of true correlation If it is biased, then why does everyone use it anyway? Ø Answer: forthcoming when we talk about (ML) estimation $ EPSY 905: Lecture 1 - Introduction 21
22 Covariance of Variables: Association with Units The numerator of the correlation coefficient is the covariance of a pair of variables:. S "#," 3 = 1 N 1 ) Y +, Y- + Y $, Y- $,/+. S "#," 3 = 1 N ) Y +, Y- + Y $, Y- $,/+ The covariance uses the units of the original variables (but now they are multiples): Ø Covariance of height and weight: inch-pounds The covariance of a variable with itself is the variance The covariance is often used in multivariate analyses because it ties directly into multivariate distributions Ø But covariance and correlation are easy to switch between EPSY 905: Lecture 1 - Introduction Unbiased or sample Biased/ML or population 22
23 Going from Covariance to Correlation If you have the covariance matrix (variances and covariances): r "#," 3 = S " #," 3 S "# S "3 If you have the correlation matrix and the standard deviations: S "#," 3 = r "#," 3 S "# S "3 EPSY 905: Lecture 1 - Introduction 23
24 THE GENERAL LINEAR MODEL EPSY 905: Lecture 1 - Introduction 24
25 The General Linear Model The general linear model incorporates many different labels of analyses under one unifying umbrella: Categorical X s Continuous X s Both Types of X s Univariate Y ANOVA Regression ANCOVA Multivariate Y s MANOVA Multivariate Regression MANCOVA The typical assumption is that error is normally distributed meaning that the data are conditionally normally distributed Models for non-normal outcomes (e.g., dichotomous, categorical, count) fall under the Generalized Linear Model, of which the GLM is a special case (i.e., for when model residuals can be assumed to be normally distributed) EPSY 905: Lecture 1 - Introduction 25
26 General Linear Models: Conditional Normality Y, = β < + β + X, + β $ Z, + β? X, Z, + e, Model for the Means (Predicted Values): Each person s expected (predicted) outcome is a function of his/her values on x and z (and their interaction) y, x, and z are each measured only once per person (p subscript) Model for the Variance: e, N 0, σ D $ à ONE residual (unexplained) deviation e, has a mean of 0 with some estimated constant variance σ D $, is normally distributed, is unrelated to x and z, and is unrelated across people (across all observations, just people here) We will return to the normal distribution in a few weeks but for now know that it is described by two terms: a mean and a variance EPSY 905: Lecture 1 - Introduction 26
27 Building a Linear Model for Predicting a Person s Weight We will now build a linear model for predicting a person s weight, using height and gender as predictors Several models we will build are done for didactic reasons to show how regression and ANOVA work under the GLM Ø You wouldn t necessarily run these models in this sequence Our beginning model is that of an empty model no predictors for weight (an unconditional model) Our ending model is one with both predictors and their interaction (a conditional model) EPSY 905: Lecture 1 - Introduction 27
28 Model 1: The Empty Model Linear model: Weight, = β < + e, where e, N 0, σ D $ Estimated Parameters: [ESTIMATE (STANDARD ERROR)] Ø β < = (12.61) w Overall intercept the grand mean of weight across all people Just the mean of weight w SE for β < is standard error of the mean for weight 1 RSTUVW. Ø σ $ D = 3,179.1 (SE not given) w The (unbiased) variance of weight: e, = Weight, β < = Weight, Weight,. S $ D = 1 N 1 ) Weight $, Weight,,/+ w From Mean Square Error of F-table EPSY 905: Lecture 1 - Introduction 28
29 Model 2: Predicting Weight from Height ( Regression ) Linear model: Weight, = β < + β + Height, + e, where e, N 0, σ D $ Estimated Parameters: [ESTIMATE (STANDARD ERROR)] Ø β < = (73.483) w Predicted value of Weight for a person with Height = 0 w Nonsensical but we could have centered Height Ø β + = (1.076) w Change in predicted value of Weight for every one-unit increase in height (weight goes up pounds per inch) Ø σ $ D = 1,218 (SE not given) w The residual variance of weight w Height explains?,+\].+^+,$+_ = 61.7% of variance of weight?,+\].+ EPSY 905: Lecture 1 - Introduction 29
30 Model 2a: Predicting Weight from Mean-Centered Height Linear model: W, = β < + β + H, Ha + e, where e, N 0, σ D $ Estimated Parameters: [ESTIMATE (STANDARD ERROR)] Ø β < = (7.804) w Predicted value of Weight for a person with Height = Mean Height w Is the Mean Weight (regression line goes through means) Ø β + = (1.076) w Change in predicted value of Weight for every one-unit increase in height (weight goes up pounds per inch) w Same as previous Ø σ D $ = 1,218 (SE not given) w The residual variance of weight w Height explains?,+\].+^+,$+_?,+\].+ w Same as previous = 61.7% of variance of weight EPSY 905: Lecture 1 - Introduction 30
31 Plotting Model 2a EPSY 905: Lecture 1 - Introduction 31
32 Hypothesis Tests for Parameters To determine if the regression slope is significantly different from zero, we must use a hypothesis test: H < : β + = 0 H + : β + 0 We have two options for this test (both are same in this case) Ø Use ANOVA table: sums of squares F-test Ø Use Wald test for parameter: t = d # ed(d # ) Ø Here t $ = F Wald test: t = d # = g.<h_ ed(d # ) +.<\g = 5.62; p <.001 Conclusion: reject null H < ; slope is significant EPSY 905: Lecture 1 - Introduction 32
33 Model 3: Predicting Weight from Gender ( ANOVA ) Linear Model: Weight, = β < + β $ Female, + e, where e, N 0, σ D $ Note: because gender is a categorical predictor, we must first code it into a number before entering it into the model (typically done automatically in software) Ø Here we use Female = 1 for females; Female = 0 for males Estimated Parameters: [ESTIMATE (STANDARD ERROR)] Ø β < = (5.415) w Predicted value of Weight for a person with Female=0 (males) w Mean weight of males Ø β $ = w t = +<o \.go_ = 13.71; p <.001 w Change in predicted value of Weight for every one unit increase in female w In this case, the difference between the mean for males and the mean for females Ø σ $ D = 293 (SE not given) w The residual variance of weight w Gender explains?,+\].+^$]? = 90.8% of variance of weight?,+\].+ EPSY 905: Lecture 1 - Introduction 33
34 Model 3: More on Categorical Predictors Gender was coded using what is called reference or dummy coding: Ø Intercept becomes mean of the reference group (the 0 group) Ø Slopes become the difference in the means between reference and nonreference groups Ø For C categories, C-1 predictors are created All coding choices can be recovered from the model: Ø Predicted Weight for Females (mean weight for females): W, = β < + β $ = = Ø Predicted Weight for Males: W, = β < = What would β < and β $ be if we coded Male = 1? Ø Super cool idea: what if you could do this in software all at once? EPSY 905: Lecture 1 - Introduction 34
35 Model 3: Predictions and Plots EPSY 905: Lecture 1 - Introduction 35
36 Model 4: Predicting Weight from Height and Gender (w/o Interaction); ( ANCOVA ) Linear Model: W, = β < + β + H, Ha + β $ F, + e, where e, N 0, σ D $ Estimated Parameters: [ESTIMATE (STANDARD ERROR)] Ø β < = (1.439) w Predicted value of Weight for a person with Female=0 (males) and has Height = Mean Height H, Ha = 0 Ø β + = w t = $.\<_ = 17.52;p <.001 <.+oo w Change in predicted value of Weight for every one-unit increase in height (holding gender constant) Ø β $ = w t = _+.\+$ = 36.46;p <.001 $.$h+ w Change in predicted value of Weight for every one-unit increase in female (holding height constant) w In this case, the difference between the mean for males and the mean for females holding height constant Ø σ D $ = 16 (SE not given) w The residual variance of weight EPSY 905: Lecture 1 - Introduction 36
37 Model 4: By-Gender Regression Lines Model 4 assumes identical regression slopes for both genders but has different intercepts Ø This assumption is tested statistically by model 5 Predicted Weight for Females: W, = H, Ha F, = H, Ha Predicted Weight for Males: W, = H, Ha F, = H, Ha EPSY 905: Lecture 1 - Introduction 37
38 Model 4: Predicted Value Regression Lines EPSY 905: Lecture 1 - Introduction 38
39 Model 5: Predicting Weight from Height and Gender (with Interaction); ( ANCOVAish ) Linear Model: W, = β < + β + H, Ha + β $ F, + β? H, Ha F, + e, where e, N 0, σ D $ Estimated Parameters: [ESTIMATE (STANDARD ERROR)] Ø β < = (0.838) w Predicted value of Weight for a person with Female=0 (males) and has Height = Mean Height H, Ha = 0 Ø β + = w t =?.+]< = 28.65;p <.001 <.+++ w Simple main effect of height: Change in predicted value of Weight for every oneunit increase in height (for males only) w A conditional main effect: when interacting variable (gender) = 0 EPSY 905: Lecture 1 - Introduction 39
40 Model 5: Estimated Parameters Estimated Parameters: Ø β $ = (1.211) w t = _$.$\$ = 67.93; p < $++ w Simple main effect of gender: Change in predicted value of Weight for every one unit increase in female, for height = mean height w Gender difference at 67.9 inches Ø β? = w t = +.<]h = 6.52; p <.001 <.+g_ w Gender-by-Height Interaction: Additional change in predicted value of weight for change in either gender or height w Difference in slope for height for females vs. males w Because Female = 1, it modifies the slope for height for females (here the height slope is less positive than for females than for males) Ø σ D $ = 5 (SE not given) EPSY 905: Lecture 1 - Introduction 40
41 Model 5: By-Gender Regression Lines Model 5 does not assume identical regression slopes for both genders Ø Because β? was significantly different from zero, the data supports different slopes for the genders Predicted Weight for Females: W, = H, Ha F, H, Ha F, = H, Ha Predicted Weight for Males: W, = H, Ha F, H, Ha F, = H, Ha EPSY 905: Lecture 1 - Introduction 41
42 Model 5: Predicted Value Regression Lines EPSY 905: Lecture 1 - Introduction 42
43 Comparing Across Models Typically, the empty model and model #5 would be the only models run Ø The trick is to describe the impact of all and each of the predictors typically using variance accounted for (explained) All predictors: Ø Baseline: empty model #1; σ D $ = 3, Ø Comparison: model #5; σ D $ = Ø All predictors (gender, height, interaction)explained?,+\].<]o^h.\?+?,+\].<]o of variance in weight w R $ hall of fame worthy = 99.9% EPSY 905: Lecture 1 - Introduction 43
44 Comparing Across Models The total effect of height (main effect and interaction): Ø Baseline: model #3 (gender only); σ D $ = Ø Comparison: model #5 (all predictors); σ D $ = Ø Height explained $]?.$++^h.\?+ gender $]?.$++ w 98.4% of the % = 9.2% left after gender w True variance accounted for is 98.4%*9.2% = 9.1% = 98.4% of variance in weight remaining after The total effect of gender (main effect and interaction): Ø Baseline: model #2a (height only); σ D $ = 1, Ø Comparison: model #5 (all predictors); σ D $ = Ø Gender explained +,$+\.]\?^h.\?+ +,$+\.]\? after height w 99.6% of the % = 38.3% left after height w True variance accounted for is 99.6%*38.3% = 38.1% = 99.6% of variance in weight remaining EPSY 905: Lecture 1 - Introduction 44
45 About Weight The distribution of weight was bimodal (shown in the beginning of the class) Ø However, the analysis only called for the residuals to be normally distributed not the actual data Ø This is the same as saying the conditional distribution of the data given the predictors must be normal Residual: e, = Weight, Weight q, = Weight, β < + β + H, Ha + β $ F, + β? H, Ha F, EPSY 905: Lecture 1 - Introduction 45
46 CONCLUDING REMARKS EPSY 905: Lecture 1 - Introduction 46
47 Wrapping Up The general linear model forms the basis for many multivariate statistical techniques Ø Certain features of the model change, but many of the same interpretations remain We will continue to use these terms in more advanced models throughout the rest of the semester Ø Extra practice for linear model terms The trick of linear models is to construct one model that answers all of your research questions EPSY 905: Lecture 1 - Introduction 47
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012
More informationReview of the General Linear Model
Review of the General Linear Model EPSY 905: Multivariate Analysis Online Lecture #2 Learning Objectives Types of distributions: Ø Conditional distributions The General Linear Model Ø Regression Ø Analysis
More informationA Re-Introduction to General Linear Models (GLM)
A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More informationRepeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models
Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models EPSY 905: Multivariate Analysis Spring 2016 Lecture #12 April 20, 2016 EPSY 905: RM ANOVA, MANOVA, and Mixed Models
More informationIntroduction to Matrix Algebra and the Multivariate Normal Distribution
Introduction to Matrix Algebra and the Multivariate Normal Distribution Introduction to Structural Equation Modeling Lecture #2 January 18, 2012 ERSH 8750: Lecture 2 Motivation for Learning the Multivariate
More informationA Re-Introduction to General Linear Models
A Re-Introduction to General Linear Models Today s Class: Big picture overview Why we are using restricted maximum likelihood within MIXED instead of least squares within GLM Linear model interpretation
More informationSimple, Marginal, and Interaction Effects in General Linear Models
Simple, Marginal, and Interaction Effects in General Linear Models PRE 905: Multivariate Analysis Lecture 3 Today s Class Centering and Coding Predictors Interpreting Parameters in the Model for the Means
More informationA Introduction to Matrix Algebra and the Multivariate Normal Distribution
A Introduction to Matrix Algebra and the Multivariate Normal Distribution PRE 905: Multivariate Analysis Spring 2014 Lecture 6 PRE 905: Lecture 7 Matrix Algebra and the MVN Distribution Today s Class An
More informationMultilevel Models in Matrix Form. Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2
Multilevel Models in Matrix Form Lecture 7 July 27, 2011 Advanced Multivariate Statistical Methods ICPSR Summer Session #2 Today s Lecture Linear models from a matrix perspective An example of how to do
More informationInteractions among Continuous Predictors
Interactions among Continuous Predictors Today s Class: Simple main effects within two-way interactions Conquering TEST/ESTIMATE/LINCOM statements Regions of significance Three-way interactions (and beyond
More informationUnivariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation
Univariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation PRE 905: Multivariate Analysis Spring 2014 Lecture 4 Today s Class The building blocks: The basics of mathematical
More informationSTA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.
STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory
More informationReview of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More informationRon Heck, Fall Week 3: Notes Building a Two-Level Model
Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level
More informationAn Introduction to Path Analysis
An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving
More informationSimple, Marginal, and Interaction Effects in General Linear Models: Part 1
Simple, Marginal, and Interaction Effects in General Linear Models: Part 1 PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 2: August 24, 2012 PSYC 943: Lecture 2 Today s Class Centering and
More informationIntroduction to Within-Person Analysis and RM ANOVA
Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides
More informationPath Analysis. PRE 906: Structural Equation Modeling Lecture #5 February 18, PRE 906, SEM: Lecture 5 - Path Analysis
Path Analysis PRE 906: Structural Equation Modeling Lecture #5 February 18, 2015 PRE 906, SEM: Lecture 5 - Path Analysis Key Questions for Today s Lecture What distinguishes path models from multivariate
More informationMaximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood
Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood PRE 906: Structural Equation Modeling Lecture #3 February 4, 2015 PRE 906, SEM: Estimation Today s Class An
More informationAn Introduction to Mplus and Path Analysis
An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationClass Introduction and Overview; Review of ANOVA, Regression, and Psychological Measurement
Class Introduction and Overview; Review of ANOVA, Regression, and Psychological Measurement Introduction to Structural Equation Modeling Lecture #1 January 11, 2012 ERSH 8750: Lecture 1 Today s Class Introduction
More informationReview of CLDP 944: Multilevel Models for Longitudinal Data
Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance
More information36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)
36-309/749 Experimental Design for Behavioral and Social Sciences Dec 1, 2015 Lecture 11: Mixed Models (HLMs) Independent Errors Assumption An error is the deviation of an individual observed outcome (DV)
More informationUnivariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation
Univariate Normal Distribution; GLM with the Univariate Normal; Least Squares Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture 3: February 10, 2016 Today s Class The building blocks: The
More informationAn Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012
An Introduction to Multilevel Models PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012 Today s Class Concepts in Longitudinal Modeling Between-Person vs. +Within-Person
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your
More informationHierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!
Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter
More informationExample. Multiple Regression. Review of ANOVA & Simple Regression /749 Experimental Design for Behavioral and Social Sciences
36-309/749 Experimental Design for Behavioral and Social Sciences Sep. 29, 2015 Lecture 5: Multiple Regression Review of ANOVA & Simple Regression Both Quantitative outcome Independent, Gaussian errors
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building
More informationIntroduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016
Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An
More informationCorrelation. A statistics method to measure the relationship between two variables. Three characteristics
Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction
More informationGeneralized Models: Part 1
Generalized Models: Part 1 Topics: Introduction to generalized models Introduction to maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical outcomes
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies
More informationExperimental Design and Data Analysis for Biologists
Experimental Design and Data Analysis for Biologists Gerry P. Quinn Monash University Michael J. Keough University of Melbourne CAMBRIDGE UNIVERSITY PRESS Contents Preface page xv I I Introduction 1 1.1
More informationBinary Logistic Regression
The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b
More informationModel Estimation Example
Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions
More informationSubject CS1 Actuarial Statistics 1 Core Principles
Institute of Actuaries of India Subject CS1 Actuarial Statistics 1 Core Principles For 2019 Examinations Aim The aim of the Actuarial Statistics 1 subject is to provide a grounding in mathematical and
More informationRon Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)
Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October
More informationData Analysis as a Decision Making Process
Data Analysis as a Decision Making Process I. Levels of Measurement A. NOIR - Nominal Categories with names - Ordinal Categories with names and a logical order - Intervals Numerical Scale with logically
More informationProfile Analysis Multivariate Regression
Lecture 8 October 12, 2005 Analysis Lecture #8-10/12/2005 Slide 1 of 68 Today s Lecture Profile analysis Today s Lecture Schedule : regression review multiple regression is due Thursday, October 27th,
More informationCan you tell the relationship between students SAT scores and their college grades?
Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower
More informationANCOVA. ANCOVA allows the inclusion of a 3rd source of variation into the F-formula (called the covariate) and changes the F-formula
ANCOVA Workings of ANOVA & ANCOVA ANCOVA, Semi-Partial correlations, statistical control Using model plotting to think about ANCOVA & Statistical control You know how ANOVA works the total variation among
More informationMachine Learning Linear Regression. Prof. Matteo Matteucci
Machine Learning Linear Regression Prof. Matteo Matteucci Outline 2 o Simple Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression o Multi Variate Regession Model Least Squares
More informationContents. Acknowledgments. xix
Table of Preface Acknowledgments page xv xix 1 Introduction 1 The Role of the Computer in Data Analysis 1 Statistics: Descriptive and Inferential 2 Variables and Constants 3 The Measurement of Variables
More informationLECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit
LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define
More informationAP Final Review II Exploring Data (20% 30%)
AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure
More informationSimple Linear Regression Using Ordinary Least Squares
Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Topics: Summary of building unconditional models for time Missing predictors in MLM Effects of time-invariant predictors Fixed, systematically varying,
More informationLongitudinal Modeling with Logistic Regression
Newsom 1 Longitudinal Modeling with Logistic Regression Longitudinal designs involve repeated measurements of the same individuals over time There are two general classes of analyses that correspond to
More informationIntroduction to Generalized Models
Introduction to Generalized Models Today s topics: The big picture of generalized models Review of maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical
More informationComparing IRT with Other Models
Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used
More informationAcknowledgements. Outline. Marie Diener-West. ICTR Leadership / Team INTRODUCTION TO CLINICAL RESEARCH. Introduction to Linear Regression
INTRODUCTION TO CLINICAL RESEARCH Introduction to Linear Regression Karen Bandeen-Roche, Ph.D. July 17, 2012 Acknowledgements Marie Diener-West Rick Thompson ICTR Leadership / Team JHU Intro to Clinical
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationLongitudinal Data Analysis of Health Outcomes
Longitudinal Data Analysis of Health Outcomes Longitudinal Data Analysis Workshop Running Example: Days 2 and 3 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development
More informationwhere Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.
Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter
More informationIntroduction to Random Effects of Time and Model Estimation
Introduction to Random Effects of Time and Model Estimation Today s Class: The Big Picture Multilevel model notation Fixed vs. random effects of time Random intercept vs. random slope models How MLM =
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationWU Weiterbildung. Linear Mixed Models
Linear Mixed Effects Models WU Weiterbildung SLIDE 1 Outline 1 Estimation: ML vs. REML 2 Special Models On Two Levels Mixed ANOVA Or Random ANOVA Random Intercept Model Random Coefficients Model Intercept-and-Slopes-as-Outcomes
More informationHypothesis testing, part 2. With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal
Hypothesis testing, part 2 With some material from Howard Seltman, Blase Ur, Bilge Mutlu, Vibha Sazawal 1 CATEGORICAL IV, NUMERIC DV 2 Independent samples, one IV # Conditions Normal/Parametric Non-parametric
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 17, 2010 Instructor: John Parman Final Exam - Solutions You have until 12:30pm to complete this exam. Please remember to put your
More informationSTA441: Spring Multiple Regression. More than one explanatory variable at the same time
STA441: Spring 2016 Multiple Regression More than one explanatory variable at the same time This slide show is a free open source document. See the last slide for copyright information. One Explanatory
More informationPower Analysis. Ben Kite KU CRMDA 2015 Summer Methodology Institute
Power Analysis Ben Kite KU CRMDA 2015 Summer Methodology Institute Created by Terrence D. Jorgensen, 2014 Recall Hypothesis Testing? Null Hypothesis Significance Testing (NHST) is the most common application
More informationRegression With a Categorical Independent Variable: Mean Comparisons
Regression With a Categorical Independent Variable: Mean Lecture 16 March 29, 2005 Applied Regression Analysis Lecture #16-3/29/2005 Slide 1 of 43 Today s Lecture comparisons among means. Today s Lecture
More informationChapter 3 ANALYSIS OF RESPONSE PROFILES
Chapter 3 ANALYSIS OF RESPONSE PROFILES 78 31 Introduction In this chapter we present a method for analysing longitudinal data that imposes minimal structure or restrictions on the mean responses over
More informationCourse Review. Kin 304W Week 14: April 9, 2013
Course Review Kin 304W Week 14: April 9, 2013 1 Today s Outline Format of Kin 304W Final Exam Course Review Hand back marked Project Part II 2 Kin 304W Final Exam Saturday, Thursday, April 18, 3:30-6:30
More informationpsyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests
psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests last lecture: introduction to factorial designs next lecture: factorial between-ps ANOVA II: (effect sizes and follow-up tests) 1 general
More informationCIVL 7012/8012. Simple Linear Regression. Lecture 3
CIVL 7012/8012 Simple Linear Regression Lecture 3 OLS assumptions - 1 Model of population Sample estimation (best-fit line) y = β 0 + β 1 x + ε y = b 0 + b 1 x We want E b 1 = β 1 ---> (1) Meaning we want
More informationChapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing
Chapter Fifteen Frequency Distribution, Cross-Tabulation, and Hypothesis Testing Copyright 2010 Pearson Education, Inc. publishing as Prentice Hall 15-1 Internet Usage Data Table 15.1 Respondent Sex Familiarity
More informationDescribing Change over Time: Adding Linear Trends
Describing Change over Time: Adding Linear Trends Longitudinal Data Analysis Workshop Section 7 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section
More informationDraft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM
1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact
More informationPanel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43
Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression
More informationBig Data Analysis with Apache Spark UC#BERKELEY
Big Data Analysis with Apache Spark UC#BERKELEY This Lecture: Relation between Variables An association A trend» Positive association or Negative association A pattern» Could be any discernible shape»
More informationExploratory Factor Analysis and Principal Component Analysis
Exploratory Factor Analysis and Principal Component Analysis Today s Topics: What are EFA and PCA for? Planning a factor analytic study Analysis steps: Extraction methods How many factors Rotation and
More informationReview of Unconditional Multilevel Models for Longitudinal Data
Review of Unconditional Multilevel Models for Longitudinal Data Topics: Course (and MLM) overview Concepts in longitudinal multilevel modeling Model comparisons and significance testing Describing within-person
More informationChapter 6. Logistic Regression. 6.1 A linear model for the log odds
Chapter 6 Logistic Regression In logistic regression, there is a categorical response variables, often coded 1=Yes and 0=No. Many important phenomena fit this framework. The patient survives the operation,
More informationIntroducing Generalized Linear Models: Logistic Regression
Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationESP 178 Applied Research Methods. 2/23: Quantitative Analysis
ESP 178 Applied Research Methods 2/23: Quantitative Analysis Data Preparation Data coding create codebook that defines each variable, its response scale, how it was coded Data entry for mail surveys and
More informationIntroduction To Logistic Regression
Introduction To Lecture 22 April 28, 2005 Applied Regression Analysis Lecture #22-4/28/2005 Slide 1 of 28 Today s Lecture Logistic regression. Today s Lecture Lecture #22-4/28/2005 Slide 2 of 28 Background
More informationTime Invariant Predictors in Longitudinal Models
Time Invariant Predictors in Longitudinal Models Longitudinal Data Analysis Workshop Section 9 University of Georgia: Institute for Interdisciplinary Research in Education and Human Development Section
More informationLinear Regression. In this lecture we will study a particular type of regression model: the linear regression model
1 Linear Regression 2 Linear Regression In this lecture we will study a particular type of regression model: the linear regression model We will first consider the case of the model with one predictor
More information8/04/2011. last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression)
psyc3010 lecture 7 analysis of covariance (ANCOVA) last lecture: correlation and regression next lecture: standard MR & hierarchical MR (MR = multiple regression) 1 announcements quiz 2 correlation and
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationDesigning Multilevel Models Using SPSS 11.5 Mixed Model. John Painter, Ph.D.
Designing Multilevel Models Using SPSS 11.5 Mixed Model John Painter, Ph.D. Jordan Institute for Families School of Social Work University of North Carolina at Chapel Hill 1 Creating Multilevel Models
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationA (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data
A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data Today s Class: Review of concepts in multivariate data Introduction to random intercepts Crossed random effects models
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationRegression With a Categorical Independent Variable
Regression With a Independent Variable Lecture 10 November 5, 2008 ERSH 8320 Lecture #10-11/5/2008 Slide 1 of 54 Today s Lecture Today s Lecture Chapter 11: Regression with a single categorical independent
More informationSystematic error, of course, can produce either an upward or downward bias.
Brief Overview of LISREL & Related Programs & Techniques (Optional) Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 STRUCTURAL AND MEASUREMENT MODELS:
More informationLecture 14: Introduction to Poisson Regression
Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why
More informationModelling counts. Lecture 14: Introduction to Poisson Regression. Overview
Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week
More informationPS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics
PS2.1 & 2.2: Linear Correlations PS2: Bivariate Statistics LT1: Basics of Correlation LT2: Measuring Correlation and Line of best fit by eye Univariate (one variable) Displays Frequency tables Bar graphs
More informationChapter 19: Logistic regression
Chapter 19: Logistic regression Self-test answers SELF-TEST Rerun this analysis using a stepwise method (Forward: LR) entry method of analysis. The main analysis To open the main Logistic Regression dialog
More informationUni- and Bivariate Power
Uni- and Bivariate Power Copyright 2002, 2014, J. Toby Mordkoff Note that the relationship between risk and power is unidirectional. Power depends on risk, but risk is completely independent of power.
More informationEconometrics Review questions for exam
Econometrics Review questions for exam Nathaniel Higgins nhiggins@jhu.edu, 1. Suppose you have a model: y = β 0 x 1 + u You propose the model above and then estimate the model using OLS to obtain: ŷ =
More informationSTAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression
STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test
More information