For more information about how to cite these materials visit

Size: px
Start display at page:

Download "For more information about how to cite these materials visit"

Transcription

1 Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: We have reviewed this material in accordance with U.S. Copyright Law and have tried to maximize your ability to use, share, and adapt it. The citation key on the following slide provides information about how you may share and adapt this material. Copyright holders of content included in this material should contact with any questions, corrections, or clarification regarding the use of content. For more information about how to cite these materials visit Any medical information in this material is intended to inform and educate and is not a tool for self-diagnosis or a replacement for medical evaluation, advice, diagnosis or treatment by a healthcare professional. Please speak to your physician if you have questions about your medical condition. Viewer discretion is advised: Some medical content is graphic and may not be suitable for all viewers. 1 / 20

2 Review of Pearson Correlation Kerby Shedden Department of Statistics, University of Michigan Monday 11 th March, / 20

3 Data setting Paired data: (X 1, Y 1 ), (X 2, Y 2 ),..., (X n, Y n ), e.g. X i is a person s age and Y i is that same person s income. Note that this is different from having two independent samples: X 1,..., X n Y 1,..., Y n, e.g. where we have the ages of one group of people and the incomes of a different group of people. 3 / 20

4 Standard definition and basic properties Symmetry: n i=1 Ĉor(X, Y ) = (X i X )(Y i Ȳ ) [ n i=1 (X i X ) 2 n i=1 (Y i Ȳ )2 ] 1/2 Affine invariance: Ĉov(X, Y ) = Ĉov(Y, X ). Ĉor(a + bx, c + dy ) = Ĉor(X, Y ), where a, b > 0, c, d > 0 are real constants. 4 / 20

5 Alternative definition 1 Ĉor(X, Y ) = i X i X Y i Ȳ j (X j X ) 2 j (Y j Ȳ ) 2 We can interpret X i X j (X j X ) 2 as a standardized measure of the distance from X i to the mean value X. 5 / 20

6 Interpretation The contribution of case i to alternative definition 1 is: X i X j (X j X Y i Ȳ ) 2 j (Y j Ȳ )2 This is positive if X i and Y i lie on the same side of their mean values (i.e. if X i > X and Y i > Ȳ, or if X i < X and Y i < Ȳ ). The contribution of case i is negative if X i and Y i lie on opposite sides of their mean values. 6 / 20

7 Interpretation of the correlation coefficient Source: Wikimedia commons 7 / 20

8 Anscombe s quartet All plots have the same marginal means and standard deviations, and the same correlation coefficient. Source: Wikimedia commons 8 / 20

9 Relationship to mean, variance and standard deviation Ĉov(X, Y ) = Var(X ) = n (X i X )(Y i Ȳ )/(n 1) i=1 n (X i X ) 2 /(n 1) i=1 ŜD(X ) = Var(X ) Ĉor(X, Y ) = Ĉov(X, Y ) ŜD(X ) ŜD(Y ) 9 / 20

10 Alternative definition 2 Ĉor(X, Y ) = Ĉov( X, Ỹ ) = Ĉor( X, Ỹ ) where X i = (X i X )/ŜD(X ) and Ỹi = (Y i Ȳ )/ŜD(Y ) are the standardized versions of X and Y. 10 / 20

11 Definition in terms of linear algebra Inner product: Y, X = n i=1 Y ix i Norm: X = X, X = X X X 2 n ŜD(Y ) = Y Ȳ / n 1 Ĉov(Y, X ) = Y Ȳ, X X /(n 1) 11 / 20

12 Alternative definition 3: Ĉor(Y, X ) = Y Ȳ, X X Y Ȳ X X. This definition makes sense if you think of the vectors (X 1,..., X n ) and (Y 1,..., Y n ) as points in R n. Properties: The correlation coefficient is zero if and only if X X and Y Ȳ are orthogonal vectors. The Cauchy-Schwarz inequality states that U, V U V. Thus Ĉor(X, Y ) / 20

13 Interpretation in terms of angles If θ is the angle between X X and Y Ȳ, then cos(θ) = Y Ȳ, X X Y Ȳ X X. The correlation coefficient equals 1 if and only if θ = 0, which means that the vectors X X and Y Ȳ are parallel, i.e. X X = c (Y Ȳ ). This is equivalent to X and Y being linearly related for constants a and b > 0. Y i = a + bx i, 13 / 20

14 Relationship to least squares regression Suppose we use simple linear regression, fit using least squares, to identify an approximate linear relationship between X and Y of the form The optimal values of a and b are Y i a + bx i. and b = Ĉov(X, Y ) Var(X ) = Ĉor(X, Y ) ŜD(Y ) ŜD(X ) a = Ȳ b X. If ŜD(X ) = ŜD(Y ), e.g. if X and Y have been standardized, then b = Ĉor(X, Y ). 14 / 20

15 Other connections to least squares regression The fitted values of a regression analysis are The variance of Ŷ is Ŷ = a + bx. Var(Ŷ ) = b 2 Var(X ) = Ĉor(Y, X )2 Var(Y ). 15 / 20

16 Other connections to least squares regression The covariance between Y and Ŷ is Ĉov(Y, Ŷ ) = Ĉov(Y, a + bx ) = b Ĉov(Y, X ) = Ĉor(Y, X ) ŜD(Y )Ĉov(Y, X )/ŜD(X ) = Ĉor(Y, X )2 Var(Y ). 16 / 20

17 Other connections to least squares regression It follows that Ĉor(Y, Ŷ ) = Ĉov(Y, Ŷ ) ŜD(Y ) ŜD(Ŷ ) = = Ĉor(Y, X ) 2 Var(Y ) ŜD(Y ) Ĉor(Y, X ) ŜD(Y ) Ĉor(Y, X ). 17 / 20

18 A variance decomposition The residuals of a regression analysis are Y Ŷ. It is a fact that the fitted values and residuals are orthogonal: and Ŷ, Y Ŷ = 0, Ŷ Ȳ 1, Y Ŷ = / 20

19 A variance decomposition We can use the orthogonality between fitted values and residuals to decompose the variance in a regression: Y Ȳ 2 = Y Ŷ + Ŷ Ȳ 2 = Y Ŷ 2 + Ŷ Ȳ Y Ŷ, Ŷ Ȳ = Y Ŷ 2 + Ŷ Ȳ 2. Thus 1 = Y Ŷ 2 Ŷ Ȳ 2 + Y Ȳ 2 Y Ȳ / 20

20 A variance decomposition The term Ŷ Ȳ 2 Y Ȳ 2 = Var(Ŷ ) Var(Y ) is called the r 2 of the regression. Why is it called an r 2? Since Var(Ŷ ) = Ĉor(Y, X )2 Var(Y ), it follows that Ŷ Ȳ 2 Y Ȳ 2 = Ĉor(Y, X )2. 20 / 20

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

PHYSICS General Physics 1, Fall 2007

PHYSICS General Physics 1, Fall 2007 University of Michigan Deep Blue deepblue.lib.umich.edu 2007-09 PHYSICS 140 - General Physics 1, Fall 2007 Evrard, Gus Evrard, G. (2009, January 26). General Physics 1. Retrieved from Open.Michigan - Educational

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): MELO 3D Project Team, 2012 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share-Alike 3.0 License: http://creativecommons.org/licenses/by-sa/2.0/

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

SI Information Economics, Winter 2007

SI Information Economics, Winter 2007 University of Michigan Deep Blue deepblue.lib.umich.edu 2007-01 SI 646 - Information Economics, Winter 2007 MacKie-Mason, Jeffrey K. MacKie-Mason, J. K. (2007, April 19). Information Economics. Retrieved

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Peter Hitchcock, PH.D., 2009 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Non-commercial Share Alike 3.0 License: http://creativecommons.org/licenses/by-nc-sa/3.0/

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

PHYSICS General Physics 1, Fall 2007

PHYSICS General Physics 1, Fall 2007 University of Michigan Deep Blue deepblue.lib.umich.edu 2007-09 PHYSICS 140 - General Physics 1, Fall 2007 Evrard, Gus Evrard, G. (2009, January 26). General Physics 1. Retrieved from Open.Michigan - Educational

More information

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013 Applied Regression Chapter 2 Simple Linear Regression Hongcheng Li April, 6, 2013 Outline 1 Introduction of simple linear regression 2 Scatter plot 3 Simple linear regression model 4 Test of Hypothesis

More information

PHYSICS General Physics 1, Fall 2007

PHYSICS General Physics 1, Fall 2007 University of Michigan Deep Blue deepblue.lib.umich.edu 2007-09 PHYSICS 140 - General Physics 1, Fall 2007 Evrard, Gus Evrard, G. (2009, January 26). General Physics 1. Retrieved from Open.Michigan - Educational

More information

Copyright 2009, August E. Evrard.

Copyright 2009, August E. Evrard. Unless otherwise noted, the content of this course material is licensed under a Creative Commons BY 3.0 License. http://creativecommons.org/licenses/by/3.0/ Copyright 2009, August E. Evrard. You assume

More information

Principal Components Theory Notes

Principal Components Theory Notes Principal Components Theory Notes Charles J. Geyer August 29, 2007 1 Introduction These are class notes for Stat 5601 (nonparametrics) taught at the University of Minnesota, Spring 2006. This not a theory

More information

2. Matrix Algebra and Random Vectors

2. Matrix Algebra and Random Vectors 2. Matrix Algebra and Random Vectors 2.1 Introduction Multivariate data can be conveniently display as array of numbers. In general, a rectangular array of numbers with, for instance, n rows and p columns

More information

The geometry of least squares

The geometry of least squares The geometry of least squares We can think of a vector as a point in space, where the elements of the vector are the coordinates of the point. Consider for example, the following vector s: t = ( 4, 0),

More information

Bivariate data analysis

Bivariate data analysis Bivariate data analysis Categorical data - creating data set Upload the following data set to R Commander sex female male male male male female female male female female eye black black blue green green

More information

Basic Concepts in Matrix Algebra

Basic Concepts in Matrix Algebra Basic Concepts in Matrix Algebra An column array of p elements is called a vector of dimension p and is written as x p 1 = x 1 x 2. x p. The transpose of the column vector x p 1 is row vector x = [x 1

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2

MA 575 Linear Models: Cedric E. Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 MA 575 Linear Models: Cedric E Ginestet, Boston University Revision: Probability and Linear Algebra Week 1, Lecture 2 1 Revision: Probability Theory 11 Random Variables A real-valued random variable is

More information

PHYSICS General Physics 1, Fall 2007

PHYSICS General Physics 1, Fall 2007 University of Michigan Deep Blue deepblue.lib.umich.edu 2007-09 PHYSICS 140 - General Physics 1, Fall 2007 Evrard, Gus Evrard, G. (2009, January 26). General Physics 1. Retrieved from Open.Michigan - Educational

More information

Lecture 16 - Correlation and Regression

Lecture 16 - Correlation and Regression Lecture 16 - Correlation and Regression Statistics 102 Colin Rundel April 1, 2013 Modeling numerical variables Modeling numerical variables So far we have worked with single numerical and categorical variables,

More information

Econ 2120: Section 2

Econ 2120: Section 2 Econ 2120: Section 2 Part I - Linear Predictor Loose Ends Ashesh Rambachan Fall 2018 Outline Big Picture Matrix Version of the Linear Predictor and Least Squares Fit Linear Predictor Least Squares Omitted

More information

Regression diagnostics

Regression diagnostics Regression diagnostics Kerby Shedden Department of Statistics, University of Michigan November 5, 018 1 / 6 Motivation When working with a linear model with design matrix X, the conventional linear model

More information

Lecture Notes 1: Vector spaces

Lecture Notes 1: Vector spaces Optimization-based data analysis Fall 2017 Lecture Notes 1: Vector spaces In this chapter we review certain basic concepts of linear algebra, highlighting their application to signal processing. 1 Vector

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): MELO 3D Project Team, 2012 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share-Alike 3.0 License: http://creativecommons.org/licenses/by-sa/2.0/

More information

Vectors and Matrices Statistics with Vectors and Matrices

Vectors and Matrices Statistics with Vectors and Matrices Vectors and Matrices Statistics with Vectors and Matrices Lecture 3 September 7, 005 Analysis Lecture #3-9/7/005 Slide 1 of 55 Today s Lecture Vectors and Matrices (Supplement A - augmented with SAS proc

More information

A Short Course in Basic Statistics

A Short Course in Basic Statistics A Short Course in Basic Statistics Ian Schindler November 5, 2017 Creative commons license share and share alike BY: C 1 Descriptive Statistics 1.1 Presenting statistical data Definition 1 A statistical

More information

Specification Errors, Measurement Errors, Confounding

Specification Errors, Measurement Errors, Confounding Specification Errors, Measurement Errors, Confounding Kerby Shedden Department of Statistics, University of Michigan October 10, 2018 1 / 32 An unobserved covariate Suppose we have a data generating model

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

Class 8 Review Problems solutions, 18.05, Spring 2014

Class 8 Review Problems solutions, 18.05, Spring 2014 Class 8 Review Problems solutions, 8.5, Spring 4 Counting and Probability. (a) Create an arrangement in stages and count the number of possibilities at each stage: ( ) Stage : Choose three of the slots

More information

Chapter 4 continued. Chapter 4 sections

Chapter 4 continued. Chapter 4 sections Chapter 4 sections Chapter 4 continued 4.1 Expectation 4.2 Properties of Expectations 4.3 Variance 4.4 Moments 4.5 The Mean and the Median 4.6 Covariance and Correlation 4.7 Conditional Expectation SKIP:

More information

From last time... The equations

From last time... The equations This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Covariance and Correlation

Covariance and Correlation Covariance and Correlation ST 370 The probability distribution of a random variable gives complete information about its behavior, but its mean and variance are useful summaries. Similarly, the joint probability

More information

Correlation and Linear Regression

Correlation and Linear Regression Correlation and Linear Regression Correlation: Relationships between Variables So far, nearly all of our discussion of inferential statistics has focused on testing for differences between group means

More information

Statistics 910, #15 1. Kalman Filter

Statistics 910, #15 1. Kalman Filter Statistics 910, #15 1 Overview 1. Summary of Kalman filter 2. Derivations 3. ARMA likelihoods 4. Recursions for the variance Kalman Filter Summary of Kalman filter Simplifications To make the derivations

More information

Exercises * on Principal Component Analysis

Exercises * on Principal Component Analysis Exercises * on Principal Component Analysis Laurenz Wiskott Institut für Neuroinformatik Ruhr-Universität Bochum, Germany, EU 4 February 207 Contents Intuition 3. Problem statement..........................................

More information

STAT 350: Geometry of Least Squares

STAT 350: Geometry of Least Squares The Geometry of Least Squares Mathematical Basics Inner / dot product: a and b column vectors a b = a T b = a i b i a b a T b = 0 Matrix Product: A is r s B is s t (AB) rt = s A rs B st Partitioned Matrices

More information

Correlation analysis 2: Measures of correlation

Correlation analysis 2: Measures of correlation Correlation analsis 2: Measures of correlation Ran Tibshirani Data Mining: 36-462/36-662 Februar 19 2013 1 Review: correlation Pearson s correlation is a measure of linear association In the population:

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

Correlation 1. December 4, HMS, 2017, v1.1

Correlation 1. December 4, HMS, 2017, v1.1 Correlation 1 December 4, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 7 Navidi, Chapter 7 I don t expect you to learn the proofs what will follow. Chapter References 2 Correlation The sample

More information

Economics 620, Lecture 2: Regression Mechanics (Simple Regression)

Economics 620, Lecture 2: Regression Mechanics (Simple Regression) 1 Economics 620, Lecture 2: Regression Mechanics (Simple Regression) Observed variables: y i ; x i i = 1; :::; n Hypothesized (model): Ey i = + x i or y i = + x i + (y i Ey i ) ; renaming we get: y i =

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

Correlation analysis. Contents

Correlation analysis. Contents Correlation analysis Contents 1 Correlation analysis 2 1.1 Distribution function and independence of random variables.......... 2 1.2 Measures of statistical links between two random variables...........

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Authr(s): MELO 3D Prject Team, 2011 License: This wrk is licensed under the Creative Cmmns Attributin-ShareAlike 3.0 Unprted License. T view a cpy f this license, visit http://creativecmmns.rg/licenses/by-sa/3.0/.

More information

Applied Linear Algebra in Geoscience Using MATLAB

Applied Linear Algebra in Geoscience Using MATLAB Applied Linear Algebra in Geoscience Using MATLAB Contents Getting Started Creating Arrays Mathematical Operations with Arrays Using Script Files and Managing Data Two-Dimensional Plots Programming in

More information

Lecture Notes 4 Vector Detection and Estimation. Vector Detection Reconstruction Problem Detection for Vector AGN Channel

Lecture Notes 4 Vector Detection and Estimation. Vector Detection Reconstruction Problem Detection for Vector AGN Channel Lecture Notes 4 Vector Detection and Estimation Vector Detection Reconstruction Problem Detection for Vector AGN Channel Vector Linear Estimation Linear Innovation Sequence Kalman Filter EE 278B: Random

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

Inner products. Theorem (basic properties): Given vectors u, v, w in an inner product space V, and a scalar k, the following properties hold:

Inner products. Theorem (basic properties): Given vectors u, v, w in an inner product space V, and a scalar k, the following properties hold: Inner products Definition: An inner product on a real vector space V is an operation (function) that assigns to each pair of vectors ( u, v) in V a scalar u, v satisfying the following axioms: 1. u, v

More information

Multiple Regression Analysis. Basic Estimation Techniques. Multiple Regression Analysis. Multiple Regression Analysis

Multiple Regression Analysis. Basic Estimation Techniques. Multiple Regression Analysis. Multiple Regression Analysis Multiple Regression Analysis Basic Estimation Techniques Herbert Stocker herbert.stocker@uibk.ac.at University of Innsbruck & IIS, University of Ramkhamhaeng Regression Analysis: Statistical procedure

More information

Two-Variable Regression Model: The Problem of Estimation

Two-Variable Regression Model: The Problem of Estimation Two-Variable Regression Model: The Problem of Estimation Introducing the Ordinary Least Squares Estimator Jamie Monogan University of Georgia Intermediate Political Methodology Jamie Monogan (UGA) Two-Variable

More information

Day 4: Shrinkage Estimators

Day 4: Shrinkage Estimators Day 4: Shrinkage Estimators Kenneth Benoit Data Mining and Statistical Learning March 9, 2015 n versus p (aka k) Classical regression framework: n > p. Without this inequality, the OLS coefficients have

More information

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y

Regression and correlation. Correlation & Regression, I. Regression & correlation. Regression vs. correlation. Involve bivariate, paired data, X & Y Regression and correlation Correlation & Regression, I 9.07 4/1/004 Involve bivariate, paired data, X & Y Height & weight measured for the same individual IQ & exam scores for each individual Height of

More information

Properties of Summation Operator

Properties of Summation Operator Econ 325 Section 003/004 Notes on Variance, Covariance, and Summation Operator By Hiro Kasahara Properties of Summation Operator For a sequence of the values {x 1, x 2,..., x n, we write the sum of x 1,

More information

If g is also continuous and strictly increasing on J, we may apply the strictly increasing inverse function g 1 to this inequality to get

If g is also continuous and strictly increasing on J, we may apply the strictly increasing inverse function g 1 to this inequality to get 18:2 1/24/2 TOPIC. Inequalities; measures of spread. This lecture explores the implications of Jensen s inequality for g-means in general, and for harmonic, geometric, arithmetic, and related means in

More information

Stat 206: Sampling theory, sample moments, mahalanobis

Stat 206: Sampling theory, sample moments, mahalanobis Stat 206: Sampling theory, sample moments, mahalanobis topology James Johndrow (adapted from Iain Johnstone s notes) 2016-11-02 Notation My notation is different from the book s. This is partly because

More information

This does not cover everything on the final. Look at the posted practice problems for other topics.

This does not cover everything on the final. Look at the posted practice problems for other topics. Class 7: Review Problems for Final Exam 8.5 Spring 7 This does not cover everything on the final. Look at the posted practice problems for other topics. To save time in class: set up, but do not carry

More information

Intro to Linear Regression

Intro to Linear Regression Intro to Linear Regression Introduction to Regression Regression is a statistical procedure for modeling the relationship among variables to predict the value of a dependent variable from one or more predictor

More information

Introduction to bivariate analysis

Introduction to bivariate analysis Introduction to bivariate analysis When one measurement is made on each observation, univariate analysis is applied. If more than one measurement is made on each observation, multivariate analysis is applied.

More information

Vector spaces. DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis.

Vector spaces. DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis. Vector spaces DS-GA 1013 / MATH-GA 2824 Optimization-based Data Analysis http://www.cims.nyu.edu/~cfgranda/pages/obda_fall17/index.html Carlos Fernandez-Granda Vector space Consists of: A set V A scalar

More information

Stable Process. 2. Multivariate Stable Distributions. July, 2006

Stable Process. 2. Multivariate Stable Distributions. July, 2006 Stable Process 2. Multivariate Stable Distributions July, 2006 1. Stable random vectors. 2. Characteristic functions. 3. Strictly stable and symmetric stable random vectors. 4. Sub-Gaussian random vectors.

More information

Covariance and Correlation Class 7, Jeremy Orloff and Jonathan Bloom

Covariance and Correlation Class 7, Jeremy Orloff and Jonathan Bloom 1 Learning Goals Covariance and Correlation Class 7, 18.05 Jerem Orloff and Jonathan Bloom 1. Understand the meaning of covariance and correlation. 2. Be able to compute the covariance and correlation

More information

Introduction to bivariate analysis

Introduction to bivariate analysis Introduction to bivariate analysis When one measurement is made on each observation, univariate analysis is applied. If more than one measurement is made on each observation, multivariate analysis is applied.

More information

Linear Algebra. Alvin Lin. August December 2017

Linear Algebra. Alvin Lin. August December 2017 Linear Algebra Alvin Lin August 207 - December 207 Linear Algebra The study of linear algebra is about two basic things. We study vector spaces and structure preserving maps between vector spaces. A vector

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.

More information

1.2. Functions and Their Properties. Copyright 2011 Pearson, Inc.

1.2. Functions and Their Properties. Copyright 2011 Pearson, Inc. 1.2 Functions and Their Properties Copyright 2011 Pearson, Inc. What you ll learn about Function Definition and Notation Domain and Range Continuity Increasing and Decreasing Functions Boundedness Local

More information

6.241 Dynamic Systems and Control

6.241 Dynamic Systems and Control 6.241 Dynamic Systems and Control Lecture 2: Least Square Estimation Readings: DDV, Chapter 2 Emilio Frazzoli Aeronautics and Astronautics Massachusetts Institute of Technology February 7, 2011 E. Frazzoli

More information

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis.

401 Review. 6. Power analysis for one/two-sample hypothesis tests and for correlation analysis. 401 Review Major topics of the course 1. Univariate analysis 2. Bivariate analysis 3. Simple linear regression 4. Linear algebra 5. Multiple regression analysis Major analysis methods 1. Graphical analysis

More information

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,

More information

Part 1a: Inner product, Orthogonality, Vector/Matrix norm

Part 1a: Inner product, Orthogonality, Vector/Matrix norm Part 1a: Inner product, Orthogonality, Vector/Matrix norm September 19, 2018 Numerical Linear Algebra Part 1a September 19, 2018 1 / 16 1. Inner product on a linear space V over the number field F A map,

More information

Study Sheet. December 10, The course PDF has been updated (6/11). Read the new one.

Study Sheet. December 10, The course PDF has been updated (6/11). Read the new one. Study Sheet December 10, 2017 The course PDF has been updated (6/11). Read the new one. 1 Definitions to know The mode:= the class or center of the class with the highest frequency. The median : Q 2 is

More information

3. For a given dataset and linear model, what do you think is true about least squares estimates? Is Ŷ always unique? Yes. Is ˆβ always unique? No.

3. For a given dataset and linear model, what do you think is true about least squares estimates? Is Ŷ always unique? Yes. Is ˆβ always unique? No. 7. LEAST SQUARES ESTIMATION 1 EXERCISE: Least-Squares Estimation and Uniqueness of Estimates 1. For n real numbers a 1,...,a n, what value of a minimizes the sum of squared distances from a to each of

More information

3 Multiple Linear Regression

3 Multiple Linear Regression 3 Multiple Linear Regression 3.1 The Model Essentially, all models are wrong, but some are useful. Quote by George E.P. Box. Models are supposed to be exact descriptions of the population, but that is

More information

INNER PRODUCT SPACE. Definition 1

INNER PRODUCT SPACE. Definition 1 INNER PRODUCT SPACE Definition 1 Suppose u, v and w are all vectors in vector space V and c is any scalar. An inner product space on the vectors space V is a function that associates with each pair of

More information

Def. The euclidian distance between two points x = (x 1,...,x p ) t and y = (y 1,...,y p ) t in the p-dimensional space R p is defined as

Def. The euclidian distance between two points x = (x 1,...,x p ) t and y = (y 1,...,y p ) t in the p-dimensional space R p is defined as MAHALANOBIS DISTANCE Def. The euclidian distance between two points x = (x 1,...,x p ) t and y = (y 1,...,y p ) t in the p-dimensional space R p is defined as d E (x, y) = (x 1 y 1 ) 2 + +(x p y p ) 2

More information

Gaussian Processes. 1. Basic Notions

Gaussian Processes. 1. Basic Notions Gaussian Processes 1. Basic Notions Let T be a set, and X : {X } T a stochastic process, defined on a suitable probability space (Ω P), that is indexed by T. Definition 1.1. We say that X is a Gaussian

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018 Econometrics I KS Module 1: Bivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: March 12, 2018 Alexander Ahammer (JKU) Module 1: Bivariate

More information

Inner Product and Orthogonality

Inner Product and Orthogonality Inner Product and Orthogonality P. Sam Johnson October 3, 2014 P. Sam Johnson (NITK) Inner Product and Orthogonality October 3, 2014 1 / 37 Overview In the Euclidean space R 2 and R 3 there are two concepts,

More information

Political Science 6000: Beginnings and Mini Math Boot Camp

Political Science 6000: Beginnings and Mini Math Boot Camp Political Science 6000: Beginnings and Mini Math Boot Camp January 20, 2010 First things first Syllabus This is the most important course you will take. 1. You need to understand these concepts in order

More information

STOR Lecture 16. Properties of Expectation - I

STOR Lecture 16. Properties of Expectation - I STOR 435.001 Lecture 16 Properties of Expectation - I Jan Hannig UNC Chapel Hill 1 / 22 Motivation Recall we found joint distributions to be pretty complicated objects. Need various tools from combinatorics

More information

Random Vectors 1. STA442/2101 Fall See last slide for copyright information. 1 / 30

Random Vectors 1. STA442/2101 Fall See last slide for copyright information. 1 / 30 Random Vectors 1 STA442/2101 Fall 2017 1 See last slide for copyright information. 1 / 30 Background Reading: Renscher and Schaalje s Linear models in statistics Chapter 3 on Random Vectors and Matrices

More information

Multivariate Regression

Multivariate Regression Multivariate Regression The so-called supervised learning problem is the following: we want to approximate the random variable Y with an appropriate function of the random variables X 1,..., X p with the

More information

STA441: Spring Multiple Regression. More than one explanatory variable at the same time

STA441: Spring Multiple Regression. More than one explanatory variable at the same time STA441: Spring 2016 Multiple Regression More than one explanatory variable at the same time This slide show is a free open source document. See the last slide for copyright information. One Explanatory

More information

STAT5044: Regression and Anova. Inyoung Kim

STAT5044: Regression and Anova. Inyoung Kim STAT5044: Regression and Anova Inyoung Kim 2 / 51 Outline 1 Matrix Expression 2 Linear and quadratic forms 3 Properties of quadratic form 4 Properties of estimates 5 Distributional properties 3 / 51 Matrix

More information

Math Linear Algebra

Math Linear Algebra Math 220 - Linear Algebra (Summer 208) Solutions to Homework #7 Exercise 6..20 (a) TRUE. u v v u = 0 is equivalent to u v = v u. The latter identity is true due to the commutative property of the inner

More information

Unsupervised Learning: Dimensionality Reduction

Unsupervised Learning: Dimensionality Reduction Unsupervised Learning: Dimensionality Reduction CMPSCI 689 Fall 2015 Sridhar Mahadevan Lecture 3 Outline In this lecture, we set about to solve the problem posed in the previous lecture Given a dataset,

More information

A Primer in Econometric Theory

A Primer in Econometric Theory A Primer in Econometric Theory Lecture 1: Vector Spaces John Stachurski Lectures by Akshay Shanker May 5, 2017 1/104 Overview Linear algebra is an important foundation for mathematics and, in particular,

More information

Sample Size and Power Considerations for Longitudinal Studies

Sample Size and Power Considerations for Longitudinal Studies Sample Size and Power Considerations for Longitudinal Studies Outline Quantities required to determine the sample size in longitudinal studies Review of type I error, type II error, and power For continuous

More information

The Bootstrap 1. STA442/2101 Fall Sampling distributions Bootstrap Distribution-free regression example

The Bootstrap 1. STA442/2101 Fall Sampling distributions Bootstrap Distribution-free regression example The Bootstrap 1 STA442/2101 Fall 2013 1 See last slide for copyright information. 1 / 18 Background Reading Davison s Statistical models has almost nothing. The best we can do for now is the Wikipedia

More information

Chapter 3. Vectors and Coordinate Systems

Chapter 3. Vectors and Coordinate Systems Chapter 3. Vectors and Coordinate Systems Our universe has three dimensions, so some quantities also need a direction for a full description. For example, wind has both a speed and a direction; hence the

More information

Statistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression

Statistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression Model 1 2 Ordinary Least Squares 3 4 Non-linearities 5 of the coefficients and their to the model We saw that econometrics studies E (Y x). More generally, we shall study regression analysis. : The regression

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Markus Haas LMU München Summer term 2011 15. Mai 2011 The Simple Linear Regression Model Considering variables x and y in a specific population (e.g., years of education and wage

More information