Multivariate Methods. Multivariate Methods: Topics of the Day

Size: px
Start display at page:

Download "Multivariate Methods. Multivariate Methods: Topics of the Day"

Transcription

1 Multivariate Methods LIR 832 Multivariate Methods: Topics of the Day A. Isolating Interventions in a multi-causal world B. Multivariate probability Distributions C. The Building Block: covariance D. The Next Step: Correlation 1

2 A Multivariate World Isolating Interventions in a Multi-Causal World A. Example of problem: Evaluate a program to reduce absences from a plant? Is there age discrimination? B. Types of data Experimental Quasi-experimental Non-experimental C. Need multivariate analysis to sort out causal relationships. Bi-Variate Relations: A First Run at Multivariate Methods A. Many of the issues we are interested in are essentially about the relationship between two variables. B. Bi-variate can be generalized to multivariate relationships C. We learn bi-variate formally and make more intuitive reference to multivariate. D. What do we mean by bi-variate relationship? 2

3 Bi-Variate Example Our firm, has formed teams of engineers, accountants and general managers at all plants to work on several issues that are considered important in the firm. The firm has long been committed to gender diversity and we are interested in the distribution of gender among our managerial classifications. We are particularly concerned about the distribution of gender on these teams and particularly among engineers. Consider the distribution of two statistics about these three person teams. a. gender of the team members (X: x = number of men) b. is the engineer a woman (Y: 0 = man, 1 = woman) Bi-Variate Example (cont.) 3

4 Bi-Variate Example (cont.) Bi-Variate Example (cont.) 4

5 Bi-Variate Example (cont.) Bi-Variate Example (cont.) We can also use this information to build We can also use this information to build conditional probabilities: What is the likelihood that the engineer is a woman, given that we have a man on the team? 5

6 Bi-Variate Example (cont.) What is the likelihood that the engineer is a woman, given that we have a man on the team? P(Y = 1 & X = 1 X= 1) = P(Y = 1 & X = 1)/P(X= 1) = (2/8) / (3/8) = 2/3 Note: P(Y= 1 X=2) is: the probability bilit that t Y is equal to 1 given that t X = 2" or the probability that Y = 1 conditional on X = 2" Bi-Variate Example (cont.) What is the likelihood that there is only one man, given the engineer is a woman? P(Y = 1 & X = 1 Y= 1) = P(Y = 1 & X = 1)/P(Y= 1) = (2/8)/(4/8) = 2/4 =1/2 6

7 Bi-Variate Example (cont.) What is the likelihood that the engineer is a woman? P(Y= 1) = 1/2 But if we know that there are two men, we can improve our estimate: P(Y=1 X=2) = P(Y=1 &X X=2 X=2) 2) = P(Y=1 &X=2) / P(X=2) = 1/8 / 3/8 = 1/3 What about calculating the likelihood of two men given the engineer is a woman? Example: Gender Distribution 7

8 Example: Gender Distribution Working with Conditional Probability: P(female) = 50.91% P(female LRHR) = p(female & LRHR)/P(LRHR) = 0.36/0.55 = 65% P(LRHR) = 0.55% P(LRHR Female) = p(lrhr & female)/p(female) =.36/50.91 =.70% Independence Defined Now that we know a bit about bi-variate relationships, we can define what it means, in a statistical sense, for two events to be independent. If events are independent, then Their conditional probability is equal to their Their conditional probability is equal to their unconditional probability The probability of the two independent events occurring is P(X)*P(Y) = P(X,Y). 8

9 Importance of Independence Why is independence important? If events are independent, then we are getting unique information from each data point. If events are not independent, then A practical example on running a survey on employee satisfaction within an establishment. Example: Employee Satisfaction 9

10 Covariance Covariance: Building Block of Multi- variate Analysis All very nice, but what we are looking for is a means of expressing and measuring the strength of association of two variables. How closely do they move together? Is variable A a good predictor of variable B? Move to a slightly more complex world, no more 2 and three category variables Example: Age and Income Data 10

11 Example: Age and Income Data Example: Age and Income Data 11

12 Example: Age and Income Data Descriptive Statistics: age, annual income Variable N Mean Median StDev SE Mean age annual I Variable Minimum Maximum Q1 Q3 age annual I Example: Age and Income Data 12

13 Example: Age and Income Data Adding some info to the graph Covariance and Correlation Defined Define Covariance and Correlation for a random sample of data: Let our data be composed of pairs of data (X i,y i ) where X has mean μ x and Y has mean μ y. Then the covariance, the co-movement around their means, is defined as: 13

14 Example: Covariance We observe the relationship between the number of employees at work at a plant and the output for five days in a row: Attendance Output What is the covariance of attendance and output? Example: Covariance (cont.) The covariance is positive. This suggests that when attendance is above its mean, output is also above its mean. Similarly, when attendance is below its mean, output is below its mean. 14

15 Example: Overtime Hours and Productivity 15

16 Example: Overtime Hours and Productivity Example: Overtime Hours and Productivity Covariances: prod-avg, week prod-avg week prod-avg week

17 Example: Overtime Hours and Productivity (cont.) Example: Overtime Hours and Productivity (cont.) 17

18 Example: Overtime Hours and Productivity (cont.) Covariances: prod-avg, week, week-hours prod-avg week week-hours prod-avg week week-hours Example: Overtime Hours and Productivity (cont.) 18

19 Example: Overtime Hours and Productivity (cont.) Example: Overtime Hours and Productivity (cont.) 19

20 Correlation vs. Covariance A limitation of covariance is that it is difficult to interpret. Its units are not well defined. Thus, we need a measure which is more readily interpreted and tells about the strength of association. Correlation: Population Correlation is Defined as: Correlation =

21 Correlation = 0.94 Correlation =

22 Correlation = Correlation: Previous Examples 22

23 Correlation: Previous Examples 23

24 Correlation: Previous Examples Correlation: Previous Examples Overtime-Productivity: Limit to 5 days, 10 hours: Correlations: prod-avg, week, week-hours prod-avg week week week-hours

25 Correlations: Previous Examples Example: Correlation 25

26 Example: Correlation Example: Correlation What about some real data: Relationship What about some real data: Relationship between age gender and weekly earnings among human resource managers (admin associated occupations)? 26

27 Example: Correlation Descriptive Statistics: Female, age, weekearn Variable N N* Mean Median TrMean StDev Female age weekearn Variable SE Mean Minimum Maximum Q1 Q3 Female age weekearn Example: Correlation Tabulated Statistics: Female Rows: Female weekearn weekearn Mean StDev Male Female All

28 Example: Correlation Tabulated Statistics: Female Rows: Female weekearn age weekearn age Mean Mean StDev StDev male female all Example: Correlation Covariances: age, weekearn, Female age weekearn Female age weekearn Female

29 Example: Correlation Correlations: age, Female, weekearn age Female Female weekearn Example: Correlation 29

30 Example: Non-Linearity Correlation and Covariance So covariance and correlation are measures So covariance and correlation are measures of linear association, but not measures of association in general (or of non-linear association). 30

31 Correlation and Covariance What if we do not have data on individuals but data on distributions? Example, we have plant level data but plants vary widely in employment. We want to give greater weight to plants with more employees. Correlation and Covariance 31

32 Correlation and Covariance Correlation and Covariance 32

33 Correlation and Covariance Correlation and Covariance 33

PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence.

PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence. PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence. Example by eye. On a hot day, nine cars were left in the sun in a car parking lot. The length of time each car was left

More information

ECON 482 / WH Hong Binary or Dummy Variables 1. Qualitative Information

ECON 482 / WH Hong Binary or Dummy Variables 1. Qualitative Information 1. Qualitative Information Qualitative Information Up to now, we assume that all the variables has quantitative meaning. But often in empirical work, we must incorporate qualitative factor into regression

More information

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,

More information

Preliminary Statistics course. Lecture 1: Descriptive Statistics

Preliminary Statistics course. Lecture 1: Descriptive Statistics Preliminary Statistics course Lecture 1: Descriptive Statistics Rory Macqueen (rm43@soas.ac.uk), September 2015 Organisational Sessions: 16-21 Sep. 10.00-13.00, V111 22-23 Sep. 15.00-18.00, V111 24 Sep.

More information

Generative classifiers: The Gaussian classifier. Ata Kaban School of Computer Science University of Birmingham

Generative classifiers: The Gaussian classifier. Ata Kaban School of Computer Science University of Birmingham Generative classifiers: The Gaussian classifier Ata Kaban School of Computer Science University of Birmingham Outline We have already seen how Bayes rule can be turned into a classifier In all our examples

More information

Lecture 2: Categorical Variable. A nice book about categorical variable is An Introduction to Categorical Data Analysis authored by Alan Agresti

Lecture 2: Categorical Variable. A nice book about categorical variable is An Introduction to Categorical Data Analysis authored by Alan Agresti Lecture 2: Categorical Variable A nice book about categorical variable is An Introduction to Categorical Data Analysis authored by Alan Agresti 1 Categorical Variable Categorical variable is qualitative

More information

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad

Supplemental Materials. In the main text, we recommend graphing physiological values for individual dyad 1 Supplemental Materials Graphing Values for Individual Dyad Members over Time In the main text, we recommend graphing physiological values for individual dyad members over time to aid in the decision

More information

1. Regressions and Regression Models. 2. Model Example. EEP/IAS Introductory Applied Econometrics Fall Erin Kelley Section Handout 1

1. Regressions and Regression Models. 2. Model Example. EEP/IAS Introductory Applied Econometrics Fall Erin Kelley Section Handout 1 1. Regressions and Regression Models Simply put, economists use regression models to study the relationship between two variables. If Y and X are two variables, representing some population, we are interested

More information

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011) Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October

More information

The empirical ( ) rule

The empirical ( ) rule The empirical (68-95-99.7) rule With a bell shaped distribution, about 68% of the data fall within a distance of 1 standard deviation from the mean. 95% fall within 2 standard deviations of the mean. 99.7%

More information

Speci cation of Conditional Expectation Functions

Speci cation of Conditional Expectation Functions Speci cation of Conditional Expectation Functions Econometrics Douglas G. Steigerwald UC Santa Barbara D. Steigerwald (UCSB) Specifying Expectation Functions 1 / 24 Overview Reference: B. Hansen Econometrics

More information

Naïve Bayes classification

Naïve Bayes classification Naïve Bayes classification 1 Probability theory Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. Examples: A person s height, the outcome of a coin toss

More information

Econometrics I Lecture 7: Dummy Variables

Econometrics I Lecture 7: Dummy Variables Econometrics I Lecture 7: Dummy Variables Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 27 Introduction Dummy variable: d i is a dummy variable

More information

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables)

(quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) 3. Descriptive Statistics Describing data with tables and graphs (quantitative or categorical variables) Numerical descriptions of center, variability, position (quantitative variables) Bivariate descriptions

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -

More information

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability

Naïve Bayes classification. p ij 11/15/16. Probability theory. Probability theory. Probability theory. X P (X = x i )=1 i. Marginal Probability Probability theory Naïve Bayes classification Random variable: a variable whose possible values are numerical outcomes of a random phenomenon. s: A person s height, the outcome of a coin toss Distinguish

More information

Introducing Generalized Linear Models: Logistic Regression

Introducing Generalized Linear Models: Logistic Regression Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and

More information

Review of the General Linear Model

Review of the General Linear Model Review of the General Linear Model EPSY 905: Multivariate Analysis Online Lecture #2 Learning Objectives Types of distributions: Ø Conditional distributions The General Linear Model Ø Regression Ø Analysis

More information

Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices.

Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices. Quiz 1. Name: Instructions: Closed book, notes, and no electronic devices. 1.(10) What is usually true about a parameter of a model? A. It is a known number B. It is determined by the data C. It is an

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

Generalized logit models for nominal multinomial responses. Local odds ratios

Generalized logit models for nominal multinomial responses. Local odds ratios Generalized logit models for nominal multinomial responses Categorical Data Analysis, Summer 2015 1/17 Local odds ratios Y 1 2 3 4 1 π 11 π 12 π 13 π 14 π 1+ X 2 π 21 π 22 π 23 π 24 π 2+ 3 π 31 π 32 π

More information

a. Write what the survey would look like (Hint: there should be 2 questions and options to select for an answer!).

a. Write what the survey would look like (Hint: there should be 2 questions and options to select for an answer!). HW 13-1 1. Several students at Rufus King High School were debating whether males or females were more involved in afterschool activities. There are three organized activities in the afterschool program

More information

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012

More information

Machine Learning. Regression-Based Classification & Gaussian Discriminant Analysis. Manfred Huber

Machine Learning. Regression-Based Classification & Gaussian Discriminant Analysis. Manfred Huber Machine Learning Regression-Based Classification & Gaussian Discriminant Analysis Manfred Huber 2015 1 Logistic Regression Linear regression provides a nice representation and an efficient solution to

More information

Regression - Modeling a response

Regression - Modeling a response Regression - Modeling a response We often wish to construct a model to Explain the association between two or more variables Predict the outcome of a variable given values of other variables. Regression

More information

MSP Research Note. RDQ Reliability, Validity and Norms

MSP Research Note. RDQ Reliability, Validity and Norms MSP Research Note RDQ Reliability, Validity and Norms Introduction This research note describes the technical properties of the RDQ. Evidence for the reliability and validity of the RDQ is presented against

More information

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models

Outline. Clustering. Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Capturing Unobserved Heterogeneity in the Austrian Labor Market Using Finite Mixtures of Markov Chain Models Collaboration with Rudolf Winter-Ebmer, Department of Economics, Johannes Kepler University

More information

8. f(x)= x 3 + 9x 2 + 6x 56

8. f(x)= x 3 + 9x 2 + 6x 56 ALGEBRA 2 FINAL EXAM STUDY GUIDE Unit: Polynomials 1. (2x 4 7x 3 + 4x 7) + (2x 2 4x + 8) 2. (-4x 3 + 7x 6) (7x 4 + 3x 3 2x 4) 3. (3x 3 + 2x + 7)(x 2 4) 4. x 4 4x 3 3x 2 + 14x 8 (x 3) (Long AND synthetic

More information

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0 Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct

More information

Outline Conditional Probability The Law of Total Probability and Bayes Theorem Independent Events. Week 4 Classical Probability, Part II

Outline Conditional Probability The Law of Total Probability and Bayes Theorem Independent Events. Week 4 Classical Probability, Part II Week 4 Classical Probability, Part II Week 4 Objectives This week we continue covering topics from classical probability. The notion of conditional probability is presented first. Important results/tools

More information

4. Nonlinear regression functions

4. Nonlinear regression functions 4. Nonlinear regression functions Up to now: Population regression function was assumed to be linear The slope(s) of the population regression function is (are) constant The effect on Y of a unit-change

More information

Data Analysis and Uncertainty Part 1: Random Variables

Data Analysis and Uncertainty Part 1: Random Variables Data Analysis and Uncertainty Part 1: Random Variables Instructor: Sargur N. University at Buffalo The State University of New York srihari@cedar.buffalo.edu 1 Topics 1. Why uncertainty exists? 2. Dealing

More information

Sources of Inequality: Additive Decomposition of the Gini Coefficient.

Sources of Inequality: Additive Decomposition of the Gini Coefficient. Sources of Inequality: Additive Decomposition of the Gini Coefficient. Carlos Hurtado Econometrics Seminar Department of Economics University of Illinois at Urbana-Champaign hrtdmrt2@illinois.edu Feb 24th,

More information

NSHE DIVERSITY REPORT

NSHE DIVERSITY REPORT University of Nevada, Las Vegas University of Nevada, Reno NSHE DIVERSITY REPORT 2006-07 Nevada State College at Henderson College of Southern Nevada December 2007 Prepared by the Office of Academic and

More information

Lecture-1: Introduction to Econometrics

Lecture-1: Introduction to Econometrics Lecture-1: Introduction to Econometrics 1 Definition Econometrics may be defined as 2 the science in which the tools of economic theory, mathematics and statistical inference is applied to the analysis

More information

Statistics for Managers Using Microsoft Excel

Statistics for Managers Using Microsoft Excel Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap

More information

Practice Questions for Exam 1

Practice Questions for Exam 1 Practice Questions for Exam 1 1. A used car lot evaluates their cars on a number of features as they arrive in the lot in order to determine their worth. Among the features looked at are miles per gallon

More information

Hypothesis testing. Data to decisions

Hypothesis testing. Data to decisions Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the

More information

Answers Only VI- Counting Principles; Further Probability Topics

Answers Only VI- Counting Principles; Further Probability Topics Answers Only VI- Counting Principles; Further Probability Topics 1) If you are dealt 3 cards from a shuffled deck of 52 cards, find the probability that all 3 cards are clubs. (Type a fraction. Simplify

More information

Binary Logistic Regression

Binary Logistic Regression The coefficients of the multiple regression model are estimated using sample data with k independent variables Estimated (or predicted) value of Y Estimated intercept Estimated slope coefficients Ŷ = b

More information

Machine Learning - MT Classification: Generative Models

Machine Learning - MT Classification: Generative Models Machine Learning - MT 2016 7. Classification: Generative Models Varun Kanade University of Oxford October 31, 2016 Announcements Practical 1 Submission Try to get signed off during session itself Otherwise,

More information

Topic 2 Probability. Basic probability Conditional probability and independence Bayes rule Basic reliability

Topic 2 Probability. Basic probability Conditional probability and independence Bayes rule Basic reliability Topic 2 Probability Basic probability Conditional probability and independence Bayes rule Basic reliability Random process: a process whose outcome can not be predicted with certainty Examples: rolling

More information

Gaussian and Linear Discriminant Analysis; Multiclass Classification

Gaussian and Linear Discriminant Analysis; Multiclass Classification Gaussian and Linear Discriminant Analysis; Multiclass Classification Professor Ameet Talwalkar Slide Credit: Professor Fei Sha Professor Ameet Talwalkar CS260 Machine Learning Algorithms October 13, 2015

More information

Learning Objectives. c D. Poole and A. Mackworth 2010 Artificial Intelligence, Lecture 7.2, Page 1

Learning Objectives. c D. Poole and A. Mackworth 2010 Artificial Intelligence, Lecture 7.2, Page 1 Learning Objectives At the end of the class you should be able to: identify a supervised learning problem characterize how the prediction is a function of the error measure avoid mixing the training and

More information

Robust Monte Carlo Methods for Sequential Planning and Decision Making

Robust Monte Carlo Methods for Sequential Planning and Decision Making Robust Monte Carlo Methods for Sequential Planning and Decision Making Sue Zheng, Jason Pacheco, & John Fisher Sensing, Learning, & Inference Group Computer Science & Artificial Intelligence Laboratory

More information

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 1

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 1 CS434a/541a: Pattern Recognition Prof. Olga Veksler Lecture 1 1 Outline of the lecture Syllabus Introduction to Pattern Recognition Review of Probability/Statistics 2 Syllabus Prerequisite Analysis of

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning Undirected Graphical Models Mark Schmidt University of British Columbia Winter 2016 Admin Assignment 3: 2 late days to hand it in today, Thursday is final day. Assignment 4:

More information

SESSION 5 Descriptive Statistics

SESSION 5 Descriptive Statistics SESSION 5 Descriptive Statistics Descriptive statistics are used to describe the basic features of the data in a study. They provide simple summaries about the sample and the measures. Together with simple

More information

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction Econometrics I Professor William Greene Stern School of Business Department of Economics 1-1/40 http://people.stern.nyu.edu/wgreene/econometrics/econometrics.htm 1-2/40 Overview: This is an intermediate

More information

Labour Supply Responses and the Extensive Margin: The US, UK and France

Labour Supply Responses and the Extensive Margin: The US, UK and France Labour Supply Responses and the Extensive Margin: The US, UK and France Richard Blundell Antoine Bozio Guy Laroque UCL and IFS IFS INSEE-CREST, UCL and IFS January 2011 Blundell, Bozio and Laroque ( )

More information

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56 STAT 391 - Spring Quarter 2017 - Midterm 1 - April 27, 2017 Name: Student ID Number: Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56 Directions. Read directions carefully and show all your

More information

Warner, R. M. (2008). Applied Statistics: From bivariate through multivariate techniques. Thousand Oaks: Sage.

Warner, R. M. (2008). Applied Statistics: From bivariate through multivariate techniques. Thousand Oaks: Sage. Errata for Warner, R. M. (2008). Applied Statistics: From bivariate through multivariate techniques. Thousand Oaks: Sage. Most recent update: March 4, 2009 Please send information about any errors in the

More information

Machine Learning (CS 567) Lecture 2

Machine Learning (CS 567) Lecture 2 Machine Learning (CS 567) Lecture 2 Time: T-Th 5:00pm - 6:20pm Location: GFS118 Instructor: Sofus A. Macskassy (macskass@usc.edu) Office: SAL 216 Office hours: by appointment Teaching assistant: Cheol

More information

Regression with Qualitative Information. Part VI. Regression with Qualitative Information

Regression with Qualitative Information. Part VI. Regression with Qualitative Information Part VI Regression with Qualitative Information As of Oct 17, 2017 1 Regression with Qualitative Information Single Dummy Independent Variable Multiple Categories Ordinal Information Interaction Involving

More information

Discrete Multivariate Statistics

Discrete Multivariate Statistics Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are

More information

Reminders. Thought questions should be submitted on eclass. Please list the section related to the thought question

Reminders. Thought questions should be submitted on eclass. Please list the section related to the thought question Linear regression Reminders Thought questions should be submitted on eclass Please list the section related to the thought question If it is a more general, open-ended question not exactly related to a

More information

Bayesian Decision Theory

Bayesian Decision Theory Bayesian Decision Theory Dr. Shuang LIANG School of Software Engineering TongJi University Fall, 2012 Today s Topics Bayesian Decision Theory Bayesian classification for normal distributions Error Probabilities

More information

The Transactional. Understanding. with Attribute Data. Smita Skrivanek August 26, Copyright 2010 MoreSteam.com

The Transactional. Understanding. with Attribute Data. Smita Skrivanek August 26, Copyright 2010 MoreSteam.com The Transactional Dilemma: Understanding Regression with Attribute Data Smita Skrivanek August 26, 2010 Agenda Welcome Introduction of MBB Webcast Series Larry Goldman, MoreSteam.com The Transactional

More information

Statistics I Exercises Lesson 3 Academic year 2015/16

Statistics I Exercises Lesson 3 Academic year 2015/16 Statistics I Exercises Lesson 3 Academic year 2015/16 1. The following table represents the joint (relative) frequency distribution of two variables: semester grade in Estadística I course and # of hours

More information

Statistics 528: Homework 2 Solutions

Statistics 528: Homework 2 Solutions Statistics 28: Homework 2 Solutions.4 There are several gaps in the data, as can be seen from the histogram. Minitab Result: Min Q Med Q3 Max 8 3278 22 2368 2624 Manual Result: Min Q Med Q3 Max 8 338 22.

More information

REGRESSION ANALYSIS BY EXAMPLE

REGRESSION ANALYSIS BY EXAMPLE REGRESSION ANALYSIS BY EXAMPLE Fifth Edition Samprit Chatterjee Ali S. Hadi A JOHN WILEY & SONS, INC., PUBLICATION CHAPTER 5 QUALITATIVE VARIABLES AS PREDICTORS 5.1 INTRODUCTION Qualitative or categorical

More information

In order to carry out a study on employees wages, a company collects information from its 500 employees 1 as follows:

In order to carry out a study on employees wages, a company collects information from its 500 employees 1 as follows: INTRODUCTORY ECONOMETRICS Dpt of Econometrics & Statistics (EA3) University of the Basque Country UPV/EHU OCW Self Evaluation answers Time: 21/2 hours SURNAME: NAME: ID#: Specific competences to be evaluated

More information

Lecture 9 4.1: Derivative Rules MTH 124

Lecture 9 4.1: Derivative Rules MTH 124 Today we will see that the derivatives of classes of functions behave in similar ways. This is nice because by noticing this general pattern we can develop derivative rules which will make taking derivative

More information

GMAT-Arithmetic-3. Descriptive Statistics and Set theory

GMAT-Arithmetic-3. Descriptive Statistics and Set theory GMAT-Arithmetic-3 Descriptive Statistics and Set theory Descriptive Statistics 1). If S is a set of consecutive integers, what is the standard deviation of S? (1) Set S contains 23 terms (2) The median

More information

Regression Analysis in R

Regression Analysis in R Regression Analysis in R 1 Purpose The purpose of this activity is to provide you with an understanding of regression analysis and to both develop and apply that knowledge to the use of the R statistical

More information

Discrete Choice Modeling

Discrete Choice Modeling [Part 4] 1/43 Discrete Choice Modeling 0 Introduction 1 Summary 2 Binary Choice 3 Panel Data 4 Bivariate Probit 5 Ordered Choice 6 Count Data 7 Multinomial Choice 8 Nested Logit 9 Heterogeneity 10 Latent

More information

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN Introduction Edps/Psych/Stat/ 584 Applied Multivariate Statistics Carolyn J Anderson Department of Educational Psychology I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN c Board of Trustees,

More information

Inference and Regression

Inference and Regression Name Inference and Regression Final Examination, 2015 Department of IOMS This course and this examination are governed by the Stern Honor Code. Instructions Please write your name at the top of this page.

More information

STAT/MA 416 Answers Homework 6 November 15, 2007 Solutions by Mark Daniel Ward PROBLEMS

STAT/MA 416 Answers Homework 6 November 15, 2007 Solutions by Mark Daniel Ward PROBLEMS STAT/MA 4 Answers Homework November 5, 27 Solutions by Mark Daniel Ward PROBLEMS Chapter Problems 2a. The mass p, corresponds to neither of the first two balls being white, so p, 8 7 4/39. The mass p,

More information

Final Exam logistics. Here is what I've found out about the final exam in MyMathLab (running from a week ago to 11:59pm five days after class tonight.

Final Exam logistics. Here is what I've found out about the final exam in MyMathLab (running from a week ago to 11:59pm five days after class tonight. MTH 209 Week 5 Final Exam logistics Here is what I've found out about the final exam in MyMathLab (running from a week ago to 11:59pm five days after class tonight.. Copyright 2009 Pearson Education, Inc.

More information

Module 11: Linear Regression. Rebecca C. Steorts

Module 11: Linear Regression. Rebecca C. Steorts Module 11: Linear Regression Rebecca C. Steorts Announcements Today is the last class Homework 7 has been extended to Thursday, April 20, 11 PM. There will be no lab tomorrow. There will be office hours

More information

Introduction. ECN 102: Analysis of Economic Data Winter, J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 January 4, / 51

Introduction. ECN 102: Analysis of Economic Data Winter, J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 January 4, / 51 Introduction ECN 102: Analysis of Economic Data Winter, 2011 J. Parman (UC-Davis) Analysis of Economic Data, Winter 2011 January 4, 2011 1 / 51 Contact Information Instructor: John Parman Email: jmparman@ucdavis.edu

More information

Learning Objectives. Zeroes. The Real Zeros of a Polynomial Function

Learning Objectives. Zeroes. The Real Zeros of a Polynomial Function The Real Zeros of a Polynomial Function 1 Learning Objectives 1. Use the Remainder and Factor Theorems 2. Use the Rational Zeros Theorem to list the potential rational zeros of a polynomial function 3.

More information

Impact Evaluation of Mindspark Centres

Impact Evaluation of Mindspark Centres Impact Evaluation of Mindspark Centres March 27th, 2014 Executive Summary About Educational Initiatives and Mindspark Educational Initiatives (EI) is a prominent education organization in India with the

More information

Looking at Data Relationships. 2.1 Scatterplots W. H. Freeman and Company

Looking at Data Relationships. 2.1 Scatterplots W. H. Freeman and Company Looking at Data Relationships 2.1 Scatterplots 2012 W. H. Freeman and Company Here, we have two quantitative variables for each of 16 students. 1) How many beers they drank, and 2) Their blood alcohol

More information

Lecture 1: Description of Data. Readings: Sections 1.2,

Lecture 1: Description of Data. Readings: Sections 1.2, Lecture 1: Description of Data Readings: Sections 1.,.1-.3 1 Variable Example 1 a. Write two complete and grammatically correct sentences, explaining your primary reason for taking this course and then

More information

Sociology 593 Exam 2 Answer Key March 28, 2002

Sociology 593 Exam 2 Answer Key March 28, 2002 Sociology 59 Exam Answer Key March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably

More information

Lesson 5.4: The Normal Distribution, page 251

Lesson 5.4: The Normal Distribution, page 251 6. For females: Midpoint Salary ($) Frequency 22 5 92 27 5 52 32 5 9 37 5 42 5 4 47 5 52 5 3 57 5 3 x = $27 39.3 = $724.2 For males: Midpoint Salary ($) Frequency 25 86 35 78 45 28 55 2 65 22 75 85 4 95

More information

Classification Methods II: Linear and Quadratic Discrimminant Analysis

Classification Methods II: Linear and Quadratic Discrimminant Analysis Classification Methods II: Linear and Quadratic Discrimminant Analysis Rebecca C. Steorts, Duke University STA 325, Chapter 4 ISL Agenda Linear Discrimminant Analysis (LDA) Classification Recall that linear

More information

Causality. Pedro A. Ortega. 18th February Computational & Biological Learning Lab University of Cambridge

Causality. Pedro A. Ortega. 18th February Computational & Biological Learning Lab University of Cambridge Causality Pedro A. Ortega Computational & Biological Learning Lab University of Cambridge 18th February 2010 Why is causality important? The future of machine learning is to control (the world). Examples

More information

Machine Learning

Machine Learning Machine Learning 10-601 Tom M. Mitchell Machine Learning Department Carnegie Mellon University August 30, 2017 Today: Decision trees Overfitting The Big Picture Coming soon Probabilistic learning MLE,

More information

Regression techniques provide statistical analysis of relationships. Research designs may be classified as experimental or observational; regression

Regression techniques provide statistical analysis of relationships. Research designs may be classified as experimental or observational; regression LOGISTIC REGRESSION Regression techniques provide statistical analysis of relationships. Research designs may be classified as eperimental or observational; regression analyses are applicable to both types.

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

1. The following two-way frequency table shows information from a survey that asked the gender and the language class taken of a group of students.

1. The following two-way frequency table shows information from a survey that asked the gender and the language class taken of a group of students. Name Algebra Unit 13 Practice Test 1. The following two-way frequency table shows information from a survey that asked the gender and the language class taken of a group of students. Spanish French other

More information

DEPARTMENT OF COMPUTER SCIENCE Autumn Semester MACHINE LEARNING AND ADAPTIVE INTELLIGENCE

DEPARTMENT OF COMPUTER SCIENCE Autumn Semester MACHINE LEARNING AND ADAPTIVE INTELLIGENCE Data Provided: None DEPARTMENT OF COMPUTER SCIENCE Autumn Semester 203 204 MACHINE LEARNING AND ADAPTIVE INTELLIGENCE 2 hours Answer THREE of the four questions. All questions carry equal weight. Figures

More information

3. If 4x = 0, the roots of the equation are (1) 25 and 25 (2) 25, only (3) 5 and 5 (4) 5, only 3

3. If 4x = 0, the roots of the equation are (1) 25 and 25 (2) 25, only (3) 5 and 5 (4) 5, only 3 ALGEBRA 1 Part I Answer all 24 questions in this part. Each correct answer will receive 2 credits. No partial credit will be allowed. For each statement or question, choose the word or expression that,

More information

EEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1

EEL 851: Biometrics. An Overview of Statistical Pattern Recognition EEL 851 1 EEL 851: Biometrics An Overview of Statistical Pattern Recognition EEL 851 1 Outline Introduction Pattern Feature Noise Example Problem Analysis Segmentation Feature Extraction Classification Design Cycle

More information

Gov Multiple Random Variables

Gov Multiple Random Variables Gov 2000-4. Multiple Random Variables Matthew Blackwell September 29, 2015 Where are we? Where are we going? We described a formal way to talk about uncertain outcomes, probability. We ve talked about

More information

REGRESSION RECAP. Josh Angrist. MIT (Fall 2014)

REGRESSION RECAP. Josh Angrist. MIT (Fall 2014) REGRESSION RECAP Josh Angrist MIT 14.387 (Fall 2014) Regression: What You Need to Know We spend our lives running regressions (I should say: "regressions run me"). And yet this basic empirical tool is

More information

MS-A0504 First course in probability and statistics

MS-A0504 First course in probability and statistics MS-A0504 First course in probability and statistics Heikki Seppälä Department of Mathematics and System Analysis School of Science Aalto University Spring 2016 Probability is a field of mathematics, which

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

CIVL 7012/8012. Collection and Analysis of Information

CIVL 7012/8012. Collection and Analysis of Information CIVL 7012/8012 Collection and Analysis of Information Uncertainty in Engineering Statistics deals with the collection and analysis of data to solve real-world problems. Uncertainty is inherent in all real

More information

Example 1: Dear Abby. Stat Camp for the MBA Program

Example 1: Dear Abby. Stat Camp for the MBA Program Stat Camp for the MBA Program Daniel Solow Lecture 4 The Normal Distribution and the Central Limit Theorem 187 Example 1: Dear Abby You wrote that a woman is pregnant for 266 days. Who said so? I carried

More information

Mt. Douglas Secondary

Mt. Douglas Secondary Foundations of Math 11 Calculator Usage 207 HOW TO USE TI-83, TI-83 PLUS, TI-84 PLUS CALCULATORS FOR STATISTICS CALCULATIONS shows it is an actual calculator key to press 1. Using LISTS to Calculate Mean,

More information

Conditional probabilities and graphical models

Conditional probabilities and graphical models Conditional probabilities and graphical models Thomas Mailund Bioinformatics Research Centre (BiRC), Aarhus University Probability theory allows us to describe uncertainty in the processes we model within

More information

Introduction to Machine Learning

Introduction to Machine Learning Outline Introduction to Machine Learning Bayesian Classification Varun Chandola March 8, 017 1. {circular,large,light,smooth,thick}, malignant. {circular,large,light,irregular,thick}, malignant 3. {oval,large,dark,smooth,thin},

More information

Why aquarius women ignore. Why aquarius women ignore

Why aquarius women ignore. Why aquarius women ignore Paieška Paieška Paieška Why aquarius women ignore Why aquarius women ignore > > Jan 6, 2017. Aquarians don't intentionally ignore someone unless they're trying to. Aquarius women love a man who is striving

More information

Multivariate Statistical Analysis

Multivariate Statistical Analysis Multivariate Statistical Analysis Fall 2011 C. L. Williams, Ph.D. Lecture 9 for Applied Multivariate Analysis Outline Two sample T 2 test 1 Two sample T 2 test 2 Analogous to the univariate context, we

More information

Averages How difficult is QM1? What is the average mark? Week 1b, Lecture 2

Averages How difficult is QM1? What is the average mark? Week 1b, Lecture 2 Averages How difficult is QM1? What is the average mark? Week 1b, Lecture 2 Topics: 1. Mean 2. Mode 3. Median 4. Order Statistics 5. Minimum, Maximum, Range 6. Percentiles, Quartiles, Interquartile Range

More information

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays

Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Histograms: Shape, Outliers, Center, Spread Frequency and Relative Histograms Related to other types of graphical displays Sep 9 1:13 PM Shape: Skewed left Bell shaped Symmetric Bi modal Symmetric Skewed

More information