Dynamics in Social Networks and Causality
|
|
- George Edwards
- 5 years ago
- Views:
Transcription
1 Web Science & Technologies University of Koblenz Landau, Germany Dynamics in Social Networks and Causality JProf. Dr. University Koblenz Landau GESIS Leibniz Institute for the Social Sciences
2 Last Time: Case Study: the spread of Facebook (Causal) Relationships Correlation Today: (Causal) Relationships Regression Analysis Matching Methods Spreading of culture 2
3 Isolating mechanisms is difficult but important. We should think about mechanisms that may explain causal effects before we measure them 5
4 Is X related with Y? RELATIONS 6
5 Correlation Coefficient Is the normalized version of the covariance: Variance: Frequently same sign of difference: positive correlation Frequently different sign of difference: negative correlation Vary sign of difference: no correlation 7
6 Correlation coefficient 8
7 Can we predict Y from X? LINEAR REGRESSION 9
8 Observations: Can we predict Y from X? Can we formulate a model that expresses Y as a function of X? 10
9 Assume Linear Relationship Y i = b 0 + b 1 X i + ε i iid ε1,., ε n N(0,σ 2 ) Y i N(b 0 + b 1 X i, σ 2 ) Y i = X i i Observed value 6 ε Y Y i = b 0 + b 1 X i + ε i Y 4 2 Predicted value: Y i = b 0 + b 1 X i 0 b X 11 11
10 Bivariate Regression How to estimate b0 and b1? N SSE = (Y i Y i ) 2 i=1 N SSE = (Y i b 0 + b 1 X i ) 2 i=1 6 Y X 12 12
11 Interpreting coefficients Y = b 0 + b 1 X + ε b 1 is the average effect of X on Y One unit increase in X leads to b 1 units of increase in Y b o is the best guess for Y if X=0 13
12 Example How much does the IQ of kid change on average if the mother has a high school degree or not? outcome predictor IQ kid = b 0 + b 1 HC mom + ε error intercept coefficient Binary predictor. b 1 tells us how much IQ kid differs between the 2 groups 14 14
13 Example Continuous Predictor How much does the IQ of a kid change on average if we increase IQ of mother? outcome predictor IQ kid = b 0 + b 1 IQ mom + ε intercept coefficient error IQ kid when IQ mom =0 Difference between IQ kid for 2 groups that differ in one unit of IQ mom 15 15
14 Example IQ kid = IQ mom If we compare children one unit change in IQ of mother leads to 0.6 change in IQ kid 10 points difference in mothers IQ corresponds to 6 point difference in kid s IQ What does the intercept tell us? Kids of mothers with IQ=0 who did not go to college would have a IQ of 26 16
15 Multiple Predictors Continouse predictor Difference between IQ kid for 2 groups that differ in HC mom but have equal IQ mom units outcome coefficient IQ kid = b 0 + b 1 IQ mom + b2 HC mom + ε intercept coefficient IQ kid when IQ mom =0 and HC mom =0 Difference between IQ kid for 2 groups that differ in one unit of IQ mom but have same HC mom value 17 17
16 Correlation versus Linear Regression Both measure the strength of the linear relationship between X and Y. Correlation gives you a bounded measurement of how close X and Y follow a perfect linear relationship Regression coefficient indicates the estimated change in the expected value of Y for a given value of X. It dependents on scale. The Pearson correlation coefficient, is the slope of the regression line when both variables have been standardized first. 18
17 Regression Models Causal Effects? Do students from elite colleges earn more later in life? earn ~ b0 +b1 *college +error Do people who went to an elite college earn on average more later in life? College Salary Time 19
18 Problem with Regressions Being accepted in an elite college correlates with motivation and socio-economic status These factors also correlate with salary socioeconomic status Motivation College Salary Time 20
19 Observational Data Randomization in experiments assures that: T O C T O In observational studies we need to control for covariates C that effect both, the outcome O and the treatment assignment T T O C 21
20 Regression Models Causality Even if we include all covariates that are relevant to ensure that T O C we measure population-wide average effects with regression model We measure the average effect of treatment T on outcome O when controlling for covariates C earn ~ b0 +b1 *college + b2 *motivation+ b3 *socio-econ+ error 22
21 Causal Effect Individual Level Effect wealth Did going to an elite college impact your future earnings? Y i (T) - Y i (C) college time 23
22 Does X cause Y? CAUSAL RELATIONSHIPS 24
23 Solutions Matching Methods Idea: find people that look like twins in pre-treatment covariates 25
24 Matching Methods Goal: treatment assignment T should be conditionally independent of outcome O given observed covariates X T O X Balance the distribution of observed covariates in treated and control group matching == pruning Unobserved covariates that are related with treatment and outcome and are not correlated with observed covariates are still a problem! 26
25 Matching Methods 4 Steps: Define a distance measure Find matches (e.g., use greedy k:1 matching) Assess quality of matches Are covariate distributions similar for treated and untreated group? Analyze effect of treatment on outcome 27
26 Position Does Special Training Help Job Promotion? Treated with elite education Outcome education (in years) 1-dimensional covariate Ho, Daniel, Kosuke Imai, Gary King, and Elizabeth Stuart Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference. Political Analysis 15: Copy at 28
27 Example How to match people based on eductation? Greedy Distance Matching Treated Edu Pos Treated Edu Pos Matches: One really bad match. Optimal solution: 4-1, 7-6 and
28 Greedy Distance Matching Greedy matching may not find the optimal matches! Introduce Caliper = maximal acceptable distance Throw bad matches away Pair-matching (1:1 matching), but also 1:k matching is possible What if our data is multivariate? 30
29 Euclidean distance Euclidean distance doesn t make sense when different dimensions are on different scales. For example: yearly income, age, gender, body weight Problem: distance is dominated by largest values 31
30 Mahalanobis Distance Multivariate distance measure Measures how many std away two points are Rescales variables based on their direction and variance Inverse of variance-covariance matrix 32
31 position Example: Distance Matching Approximates fully blocked experiment Completely randomized: Flip a coin for each subject. Heads -> T, tails -> C. Could get unlucky: all men assigned T education Fully blocked experiment: First pair up similar subjects: e.g. same gender, age, Then flip a coin for each pair. One gets T, one C. Balances the known covariates. Gary King, "Why Propensity Scores Should Not Be Used for Matching, Methods Colloquium,
32 Assess Quality of Matches Was the matching successful? Are covariates balanced between the 2 groups? Standardized difference is the difference in groups for each covariate in units of standard deviation smd < 0.1 is good smd >0.2 indicates serious imbalance 34
33 Matching Methods 4 Steps: Define a distance measure Create matches (e.g., use greedy k:1 matching) Assess quality of matches Are covariate distributions similar for treated and untreated group? Analyze effect of treatment on outcome ATE Regression 35
34 Average Treatment Effect (ATE) Randomization Tests Compute test statistic from observed (matched) data ATE = Outcome Treated - Outcome Control Assume that null hypothesis is true (no treatment effect) ATE = E[Outcome Treated ] E[Outcome Control ]= 0 Randomly permute treatment assignments (shuffle treatment labels) Recompute test-statistic Is our observed test statistic surprising? 36
35 Regression After Matching position (p) pos = b0 + b1*edu + b2*is_treated 1) Preprocessing (matching) 2) Estimation of effects (regression models) education (e) Matching can help to reduce model dependence! 37
36 What if we simply use regression analysis? position change (p) pos = b0 + b1*edu + b2*is_treated binary variable estimated treatment effect education (e) Correcting for education, the treated group has higher positions. 38
37 Quadratic Regression position (p) pos = b0 + b 1 *e + b 2 *edu 2 + γ*is_treated Model Dependence Too much freedom given to analyst. Reason: Imbalance of covariates education (e) Correcting for education, the treated group has lower positions. 39
38 Distance Matching Distance matching works well if we have few covariates (less than 50). What if we have high dimensional data? Many covariates? Which should we include? Idea: project covariates into lower dimensional space compute for each observation (high dimensional vector) one number, the probability to be treated (propensity score). Match based on this number 40
39 Propensity Score Matching Propensity score is the probability of a subject to receive treatment given all covariates we want to control for If a subject has a propensity score=0.3 that means that a subject with these covariate values has a 30% chance of receiving the treatment Idea: subjects are not matched based on the covariates but based on the propensity score 41
40 position Propensity Score Matching Approximates complete randomization Completely randomized: Flip a coin for each subject. Heads -> T, tails -> C. Could get unlucky: all men assigned T education 43
41 Propensity Score Matching Estimate propensity score from observed data Outcome variable: T Independent variable: X Logistic regression: take predicted outcome T as propensity score Idea: achieve balance in covariates by conditioning on propensity score. This works if: P X = x π X = p, T = 1 = P(X = x π X = p, T = 0) Note: In a randomized trail the propensity score (==allocation probability) is known: P T = 1 X = P T = 1 =
42 Propensity Score Matching Matching: greedy, nearest neighbor Match on logit(propensity score) = log odds of propensity score We do that to stretch the values, often treatment probabilities are very small Look at std of transformed propensity scores Calliper: remove bad matches subjects that are more than 0.2 std away from the propensity score of their match Smaller calliper lower bias but more variance 45
43 Assess Quality of Matches Is the distribution of propensity scores similar for treated and control group? 48
44 Instrument Variables Problem: lets assume we cannot observe socio-economic status and motivation What could we do? socioeconomic status Motivation College Salary Time 50
45 Instrumental Variables Instrumental Variables are highly correlated with covariates but not with outcome Example Apply to elite college correlates with motivation Living area correlates with socio-economic status earn ~ b0 +b1 *college + b2*collegeap + b3*living +error We come closer to the causal effect of college choice on earning after controlling for the confounder socioeconomic status 51
46 Summary Matching methods and Instrumental Variables are powerful and help to approximate causality Problems Instrumental variables are often hard to find Researchers have lots of freedom when decide how to match We remove data we need to specify for which group the causal effect holds Compare results from different matching methods, different dimensionality reduction methods, different models Avoid model dependence and method dependence! 52
47 Dissemination of Culture How does culture evolve? 53 53
48 Cultural practices 54
49 Dynamics of Culture What is culture? How does it diffuse? Culture can be seen as an agglomerate of beliefs, opinions, values, behaviour and other things that certain groups of people have agreed on. People learn culture by interacting People are more likely to interact with similar people E.g. if they share a language they are more likely to interact and start sharing other traits over time Why do we see global polarization and local convergence? 55 55
50 56 56
51 Axelrod Model Each agent has a vector of f different features Each feature may allow q different traits Ethnicity Political Orientation Religion
52 Initial Setup 58 58
53 Axelrod Model Dynamic Process: An agent i and one of his neighbours j are randomly selected The overlap w i,j between their cultural vectors is computed With the probability w i,j the interaction takes place If the interaction takes place one feature is selected randomly and the trait of the neighbour j is set to the trait of i for this feature 59
54 Will we converge to one mono-culture? 60
55 Will we converge to one mono-culture? 61
56 Cultural Complexity The more cultural features we have the more likely it is the 2 agents will have something in common and can interact Few features and many traits the probability that agents will not share anything is high 62
57 Any further questions? See you next week 90
Propensity Score Matching
Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In
More informationPROPENSITY SCORE MATCHING. Walter Leite
PROPENSITY SCORE MATCHING Walter Leite 1 EXAMPLE Question: Does having a job that provides or subsidizes child care increate the length that working mothers breastfeed their children? Treatment: Working
More informationUse of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:
Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 27, 2007 Kosuke Imai (Princeton University) Matching
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationGov 2002: 5. Matching
Gov 2002: 5. Matching Matthew Blackwell October 1, 2015 Where are we? Where are we going? Discussed randomized experiments, started talking about observational data. Last week: no unmeasured confounders
More informationLecture Data Science
Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Regression Analysis JProf. Dr. Last Time How to find parameter of a regression model Normal Equation Gradient Decent
More informationESTIMATION OF TREATMENT EFFECTS VIA MATCHING
ESTIMATION OF TREATMENT EFFECTS VIA MATCHING AAEC 56 INSTRUCTOR: KLAUS MOELTNER Textbooks: R scripts: Wooldridge (00), Ch.; Greene (0), Ch.9; Angrist and Pischke (00), Ch. 3 mod5s3 General Approach The
More informationUse of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:
Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 13, 2009 Kosuke Imai (Princeton University) Matching
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More informationHarvard University. Rigorous Research in Engineering Education
Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected
More informationLab 4, modified 2/25/11; see also Rogosa R-session
Lab 4, modified 2/25/11; see also Rogosa R-session Stat 209 Lab: Matched Sets in R Lab prepared by Karen Kapur. 1 Motivation 1. Suppose we are trying to measure the effect of a treatment variable on the
More informationThe Simple Linear Regression Model
The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate
More informationEmpirical approaches in public economics
Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental
More informationECON Introductory Econometrics. Lecture 17: Experiments
ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.
More informationwhere Female = 0 for males, = 1 for females Age is measured in years (22, 23, ) GPA is measured in units on a four-point scale (0, 1.22, 3.45, etc.
Notes on regression analysis 1. Basics in regression analysis key concepts (actual implementation is more complicated) A. Collect data B. Plot data on graph, draw a line through the middle of the scatter
More informationThe Central Limit Theorem
- The Central Limit Theorem Definition Sampling Distribution of the Mean the probability distribution of sample means, with all samples having the same sample size n. (In general, the sampling distribution
More informationOverview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation
Bivariate Regression & Correlation Overview The Scatter Diagram Two Examples: Education & Prestige Correlation Coefficient Bivariate Linear Regression Line SPSS Output Interpretation Covariance ou already
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationBinary Logistic Regression
The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b
More information1 The Classic Bivariate Least Squares Model
Review of Bivariate Linear Regression Contents 1 The Classic Bivariate Least Squares Model 1 1.1 The Setup............................... 1 1.2 An Example Predicting Kids IQ................. 1 2 Evaluating
More informationIntroduction and Single Predictor Regression. Correlation
Introduction and Single Predictor Regression Dr. J. Kyle Roberts Southern Methodist University Simmons School of Education and Human Development Department of Teaching and Learning Correlation A correlation
More informationRon Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)
Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October
More informationMatching for Causal Inference Without Balance Checking
Matching for ausal Inference Without Balance hecking Gary King Institute for Quantitative Social Science Harvard University joint work with Stefano M. Iacus (Univ. of Milan) and Giuseppe Porro (Univ. of
More informationCorrelation and regression. Correlation and regression analysis. Measures of association. Why bother? Positive linear relationship
1 Correlation and regression analsis 12 Januar 2009 Monda, 14.00-16.00 (C1058) Frank Haege Department of Politics and Public Administration Universit of Limerick frank.haege@ul.ie www.frankhaege.eu Correlation
More informationReview of Multiple Regression
Ronald H. Heck 1 Let s begin with a little review of multiple regression this week. Linear models [e.g., correlation, t-tests, analysis of variance (ANOVA), multiple regression, path analysis, multivariate
More informationMatching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14
STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University Frequency 0 2 4 6 8 Quiz 2 Histogram of Quiz2 10 12 14 16 18 20 Quiz2
More informationCausal Inference with General Treatment Regimes: Generalizing the Propensity Score
Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke
More informationRelating Graph to Matlab
There are two related course documents on the web Probability and Statistics Review -should be read by people without statistics background and it is helpful as a review for those with prior statistics
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationappstats27.notebook April 06, 2017
Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More informationSimple Regression Model. January 24, 2011
Simple Regression Model January 24, 2011 Outline Descriptive Analysis Causal Estimation Forecasting Regression Model We are actually going to derive the linear regression model in 3 very different ways
More informationQuantitative Economics for the Evaluation of the European Policy
Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,
More informationLecture 14: Introduction to Poisson Regression
Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu 8 May 2007 1 / 52 Overview Modelling counts Contingency tables Poisson regression models 2 / 52 Modelling counts I Why
More informationModelling counts. Lecture 14: Introduction to Poisson Regression. Overview
Modelling counts I Lecture 14: Introduction to Poisson Regression Ani Manichaikul amanicha@jhsph.edu Why count data? Number of traffic accidents per day Mortality counts in a given neighborhood, per week
More informationRon Heck, Fall Week 3: Notes Building a Two-Level Model
Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level
More informationDraft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM
1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact
More informationMachine Learning Linear Classification. Prof. Matteo Matteucci
Machine Learning Linear Classification Prof. Matteo Matteucci Recall from the first lecture 2 X R p Regression Y R Continuous Output X R p Y {Ω 0, Ω 1,, Ω K } Classification Discrete Output X R p Y (X)
More informationExam Applied Statistical Regression. Good Luck!
Dr. M. Dettling Summer 2011 Exam Applied Statistical Regression Approved: Tables: Note: Any written material, calculator (without communication facility). Attached. All tests have to be done at the 5%-level.
More informationwith the usual assumptions about the error term. The two values of X 1 X 2 0 1
Sample questions 1. A researcher is investigating the effects of two factors, X 1 and X 2, each at 2 levels, on a response variable Y. A balanced two-factor factorial design is used with 1 replicate. The
More informationSociology 593 Exam 2 Answer Key March 28, 2002
Sociology 59 Exam Answer Key March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably
More informationIntroducing Generalized Linear Models: Logistic Regression
Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and
More informationPotential Outcomes Model (POM)
Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics
More informationCRP 272 Introduction To Regression Analysis
CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous
More informationLab #12: Exam 3 Review Key
Psychological Statistics Practice Lab#1 Dr. M. Plonsky Page 1 of 7 Lab #1: Exam 3 Review Key 1) a. Probability - Refers to the likelihood that an event will occur. Ranges from 0 to 1. b. Sampling Distribution
More informationTables and Figures. This draft, July 2, 2007
and Figures This draft, July 2, 2007 1 / 16 Figures 2 / 16 Figure 1: Density of Estimated Propensity Score Pr(D=1) % 50 40 Treated Group Untreated Group 30 f (P) 20 10 0.01~.10.11~.20.21~.30.31~.40.41~.50.51~.60.61~.70.71~.80.81~.90.91~.99
More informationSociology 593 Exam 2 March 28, 2002
Sociology 59 Exam March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably means that
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -
More informationLecture (chapter 13): Association between variables measured at the interval-ratio level
Lecture (chapter 13): Association between variables measured at the interval-ratio level Ernesto F. L. Amaral April 9 11, 2018 Advanced Methods of Social Research (SOCI 420) Source: Healey, Joseph F. 2015.
More informationreview session gov 2000 gov 2000 () review session 1 / 38
review session gov 2000 gov 2000 () review session 1 / 38 Overview Random Variables and Probability Univariate Statistics Bivariate Statistics Multivariate Statistics Causal Inference gov 2000 () review
More information1 Correlation and Inference from Regression
1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is
More information22s:152 Applied Linear Regression. Example: Study on lead levels in children. Ch. 14 (sec. 1) and Ch. 15 (sec. 1 & 4): Logistic Regression
22s:52 Applied Linear Regression Ch. 4 (sec. and Ch. 5 (sec. & 4: Logistic Regression Logistic Regression When the response variable is a binary variable, such as 0 or live or die fail or succeed then
More informationOrdinary Least Squares Regression Explained: Vartanian
Ordinary Least Squares Regression Eplained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent
More informationFEEG6017 lecture: Akaike's information criterion; model reduction. Brendan Neville
FEEG6017 lecture: Akaike's information criterion; model reduction Brendan Neville bjn1c13@ecs.soton.ac.uk Occam's razor William of Occam, 1288-1348. All else being equal, the simplest explanation is the
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012
More informationWed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS.
Wed, June 26, (Lecture 8-2). Nonlinearity. Significance test for correlation R-squared, SSE, and SST. Correlation in SPSS. Last time, we looked at scatterplots, which show the interaction between two variables,
More informationAn Introduction to Causal Analysis on Observational Data using Propensity Scores
An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut
More informationStatistical Analysis of the Item Count Technique
Statistical Analysis of the Item Count Technique Kosuke Imai Department of Politics Princeton University Joint work with Graeme Blair May 4, 2011 Kosuke Imai (Princeton) Item Count Technique UCI (Statistics)
More informationAchieving Optimal Covariate Balance Under General Treatment Regimes
Achieving Under General Treatment Regimes Marc Ratkovic Princeton University May 24, 2012 Motivation For many questions of interest in the social sciences, experiments are not possible Possible bias in
More informationSTAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables
STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences Random Variables Christopher Adolph Department of Political Science and Center for Statistics and the Social Sciences University
More informationPath Analysis. PRE 906: Structural Equation Modeling Lecture #5 February 18, PRE 906, SEM: Lecture 5 - Path Analysis
Path Analysis PRE 906: Structural Equation Modeling Lecture #5 February 18, 2015 PRE 906, SEM: Lecture 5 - Path Analysis Key Questions for Today s Lecture What distinguishes path models from multivariate
More informationPropensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr
Propensity Score Analysis Using teffects in Stata SOC 561 Programming for the Social Sciences Hyungjun Suh Apr. 25. 2016 Overview Motivation Propensity Score Weighting Propensity Score Matching with teffects
More informationCan you tell the relationship between students SAT scores and their college grades?
Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower
More informationVariance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression.
10/3/011 Functional Connectivity Correlation and Regression Variance VAR = Standard deviation Standard deviation SD = Unbiased SD = 1 10/3/011 Standard error Confidence interval SE = CI = = t value for
More informationCovariate Balancing Propensity Score for General Treatment Regimes
Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian
More informationData Analysis 1 LINEAR REGRESSION. Chapter 03
Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative
More informationMatching. Stephen Pettigrew. April 15, Stephen Pettigrew Matching April 15, / 67
Matching Stephen Pettigrew April 15, 2015 Stephen Pettigrew Matching April 15, 2015 1 / 67 Outline 1 Logistics 2 Basics of matching 3 Balance Metrics 4 Matching in R 5 The sample size-imbalance frontier
More informationEducation Production Functions. April 7, 2009
Education Production Functions April 7, 2009 Outline I Production Functions for Education Hanushek Paper Card and Krueger Tennesee Star Experiment Maimonides Rule What do I mean by Production Function?
More informationEco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013
Midterm 3 4/5/2013 Instructions: You may use a calculator, and one sheet of notes. You will never be penalized for showing work, but if what is asked for can be computed directly, points awarded will depend
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationChapter 27 Summary Inferences for Regression
Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test
More informationLinear Models: Comparing Variables. Stony Brook University CSE545, Fall 2017
Linear Models: Comparing Variables Stony Brook University CSE545, Fall 2017 Statistical Preliminaries Random Variables Random Variables X: A mapping from Ω to ℝ that describes the question we care about
More informationTHE PEARSON CORRELATION COEFFICIENT
CORRELATION Two variables are said to have a relation if knowing the value of one variable gives you information about the likely value of the second variable this is known as a bivariate relation There
More informationEconometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017
Econometrics with Observational Data Introduction and Identification Todd Wagner February 1, 2017 Goals for Course To enable researchers to conduct careful quantitative analyses with existing VA (and non-va)
More informationPSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing
Page Title PSY 305 Module 3 Introduction to Hypothesis Testing Z-tests Five steps in hypothesis testing State the research and null hypothesis Determine characteristics of comparison distribution Five
More informationThe t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies
The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit
More informationChs. 15 & 16: Correlation & Regression
Chs. 15 & 16: Correlation & Regression With the shift to correlational analyses, we change the very nature of the question we are asking of our data. Heretofore, we were asking if a difference was likely
More informationAn Introduction to Mplus and Path Analysis
An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression
More informationCorrelation and simple linear regression S5
Basic medical statistics for clinical and eperimental research Correlation and simple linear regression S5 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/41 Introduction Eample: Brain size and
More informationECON3150/4150 Spring 2015
ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2
More informationPredicting the Treatment Status
Predicting the Treatment Status Nikolay Doudchenko 1 Introduction Many studies in social sciences deal with treatment effect models. 1 Usually there is a treatment variable which determines whether a particular
More informationMS&E 226. In-Class Midterm Examination Solutions Small Data October 20, 2015
MS&E 226 In-Class Midterm Examination Solutions Small Data October 20, 2015 PROBLEM 1. Alice uses ordinary least squares to fit a linear regression model on a dataset containing outcome data Y and covariates
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationBackground of Matching
Background of Matching Matching is a method that has gained increased popularity for the assessment of causal inference. This method has often been used in the field of medicine, economics, political science,
More informationGov 2000: 9. Regression with Two Independent Variables
Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Fall 2016 1 / 62 1. Why Add Variables to a Regression? 2. Adding a Binary Covariate 3. Adding a Continuous Covariate 4. OLS Mechanics
More informationChs. 16 & 17: Correlation & Regression
Chs. 16 & 17: Correlation & Regression With the shift to correlational analyses, we change the very nature of the question we are asking of our data. Heretofore, we were asking if a difference was likely
More informationStatistical Analysis of List Experiments
Statistical Analysis of List Experiments Graeme Blair Kosuke Imai Princeton University December 17, 2010 Blair and Imai (Princeton) List Experiments Political Methodology Seminar 1 / 32 Motivation Surveys
More informationLecture on Null Hypothesis Testing & Temporal Correlation
Lecture on Null Hypothesis Testing & Temporal Correlation CS 590.21 Analysis and Modeling of Brain Networks Department of Computer Science University of Crete Acknowledgement Resources used in the slides
More informationStatistical Inference for Means
Statistical Inference for Means Jamie Monogan University of Georgia February 18, 2011 Jamie Monogan (UGA) Statistical Inference for Means February 18, 2011 1 / 19 Objectives By the end of this meeting,
More informationPSC 504: Dynamic Causal Inference
PSC 504: Dynamic Causal Inference Matthew Blackwell 4/8/203 e problem Let s go back to a problem that we faced earlier, which is how to estimate causal effects with treatments that vary over time. We could
More informationVariance Decomposition and Goodness of Fit
Variance Decomposition and Goodness of Fit 1. Example: Monthly Earnings and Years of Education In this tutorial, we will focus on an example that explores the relationship between total monthly earnings
More informationIntroduction to Econometrics. Review of Probability & Statistics
1 Introduction to Econometrics Review of Probability & Statistics Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com Introduction 2 What is Econometrics? Econometrics consists of the application of mathematical
More informationGov 2002: 9. Differences in Differences
Gov 2002: 9. Differences in Differences Matthew Blackwell October 30, 2015 1 / 40 1. Basic differences-in-differences model 2. Conditional DID 3. Standard error issues 4. Other DID approaches 2 / 40 Where
More informationIn Class Review Exercises Vartanian: SW 540
In Class Review Exercises Vartanian: SW 540 1. Given the following output from an OLS model looking at income, what is the slope and intercept for those who are black and those who are not black? b SE
More informationAn Introduction to Path Analysis
An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving
More informationEconometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.
1/17 Research Methods Carlos Noton Term 2-2012 Outline 2/17 1 Econometrics in a nutshell: Variation and Identification 2 Main Assumptions 3/17 Dependent variable or outcome Y is the result of two forces:
More information28. SIMPLE LINEAR REGRESSION III
28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of
More informationReview of probabilities
CS 1675 Introduction to Machine Learning Lecture 5 Density estimation Milos Hauskrecht milos@pitt.edu 5329 Sennott Square Review of probabilities 1 robability theory Studies and describes random processes
More informationECO220Y Simple Regression: Testing the Slope
ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x
More information