Introduction to statistical modeling
|
|
- Lindsey Cobb
- 6 years ago
- Views:
Transcription
1 Introduction to statistical modeling Illustrated with XLSTAT Jean Paul Maalouf linkedin.com/in/jean-paul-maalouf November 30,
2 PLAN XLSTAT: who are we? Statistics: categories Reminder: statistical testing Principles of statistical modeling Simple linear regression / ANOVA Principles XLSTAT demo & interpretation of outputs: coefficients, p-values, R² Assumptions about residuals and graphical verification Multiple linear regression Principles & warnings: overfitting & multicolinearity XLSTAT demo & interpretation of outputs What statistical modeling method to choose? Appendix: residuals-alternative verification methods Appendix: alternative modeling tools All the data in this webinar were made up unless otherwise specified 2
3 XLSTAT: Who are we? XLSTAT is a user-friendly statistical add-on software for Microsoft Excel 3
4 XLSTAT A growing software and team XLSTAT realizes its first sale on the Internet New version, VBA interface, C++ computations, 7 languages New products, new website, growing and dynamic team Thierry Fahmy develops a user-friendly solution for data analysis: XLSTAT is born 1996 The company Addinsoft is created 2006 New offers adapted to business needs 2015 XLSTAT 365 Cloud version of XLSTAT for Excel 365 XLSTAT Free Free limited Edition 4
5 XLSTAT in a few numbers 200+ statistical features General or field-oriented solutions 50k users Across the world. Companies, education, research 16 employees Always receptive to the needs of users 130k visits/month on the website Easy tutorials available in 5 languages 7 languages 400 downloads/day 5
6 Statistics: 4 categories 6
7 Statistics: 4 categories Recording Recording Recording Description Exploration Tests Modeling I want to summarize I want to easily extract I want to accept / I want to understand small data sets (1-3 information from a reject a very precise the way a phenomenon variables) using large data set hypothesis assuming evolves according to a simple statistics or without necessarily error risks. (t tests, set of parameters. charts (mean, having a precise ANOVA, correlation (regression, ANOVA, standard deviation, boxplots...) question to answer. (PCA, AHC...) tests, chi-square...) ANCOVA...) 7
8 Reminder on statistical testing I want to accept / reject a very precise hypothesis assuming error risks. 8
9 Reminder on statistical testing? Question Are averages A & B the same? The test computes a number called p-value. 0 < p-value < 1 H0 Ha Null Hypothesis Generally implies an idea of equality H0: Average A = Average B Alternative Hypothesis Generally implies an idea of difference Ha: Average A Average B Decision : If p-value < alfa, we reject H0 and accept Ha assuming a risk proportional to p- value of being wrong. 9
10 Principles of Statistical modeling I want to understand the way variables evolves according to other variables. 10
11 Principles of Statistical Modeling Definition A statistical model is a simplified representation of a phenomenon using numbers. It allows to better understand reality and to do predictions. 11
12 A very simple example Somebody asks you: what is the height of French people? First way of answering Recite the whole table, row after row Second way of answering Compute the mean and the standard deviation over the 200 values, and use these two numbers as an answer You have this table that contains height information (cm) of a representative sample of 200 French people. Individual Height Janine 169 Françoise 158 Roger 159 Albert 168 Isabelle 171 Jean-Luc 187 Nicolas 171 Benoît Representing French people height by a mean and a standard deviation is a way to model this height 12
13 Principles of Statistical Modeling Definition A statistical model is a simplified representation of a phenomenon using numbers. It allows to better understand reality and to do predictions. How models work technically A model allows to explain one or several dependent variables using one or several independent variables through mathematical equations that involve parameters. The mean and standard deviation model does not imply explanatory variables 13
14 Simple linear regression Principles, XLSTAT demo, interpretation of outputs, hypotheses on residuals 14
15 Individuals Data set: online shoe selling platform Variables Question: How does invoice amount vary according to time spent on site? 15
16 Example: modeling invoice amount according to time spent on website 16
17 Exemple : modeling invoice amount according to time spent on website We could try simple linear regression (y = a*x + b) Our way to simplify reality: a «straight line» model parameters What we were unable to capture with our model Invoice amount= a*time spent on site + b + residuals Dependent variable Explanatory variable Errors (Residuals) PS: we chose linear modeling, but this was absolutely not mandatory. 17
18 Salary ANOVA may also be perceived as a statistical model (qualitative explanatory variables) model Model One parameter Salary = average(reference level) + distance(average of the considered level) + residuals Two parameters Reference level Earth Pluto Mars Origin Errors (Residuals) ANOVA, linear regression & ANCOVA are linear models 18
19 Modeling parameter estimation. The case of simple linear regression The best parameter values are those that minimize the residuals sum of squares: n S a, b = i=1 y i ax i + b 2 Errors (Residuals) Observed Invoice amount (dots) Predicted invoice amount (line) This is what we call Least Square estimation 19
20 Example: modeling invoice amount according to time spent on website - XLSTAT 20
21 Example: modeling invoice amount according to time spent on website simple linear regression, XLSTAT outputs Parameter estimations (least squares) Confidence intervals around the estimation b a P-values related to: H 0 : parameter = 0 H a : parameter 0 Equation could be used to predict invoice amount according to new values of time spent on website 21
22 Example: modeling invoice amount according to time spent on website simple linear regression, XLSTAT outputs R² reflects goodness-of-fit (prefer Adjusted R²). 0<R²<1 Confidence interval of the model (based on parameter estimations) Confidence interval of the predictions (95% of new predictions will lie inside) 22
23 Linear model Assumptions about residuals A linear model is only reliable under certain conditions associated to residuals 23
24 Linear model: assumptions about residuals Independence No autocorrelation. One measurement per individual. Normality Residuals should follow a normal distribution. Not too many outliers In general, no more than 5% of outliers among residuals. Homoscedasticity Residuals should have a homogeneous variance. 24
25 Graphical examination of the assumptions about residuals Residuals vs explanatory variables chart Dots are homogeneously distributed around the y = 0 line model is reliable 25
26 Normalized residuals Normalized residuals Assumptions about residuals: common patterns of violation Violating the independence assumption ( autocorrelated residuals) Violating the homoscedasticity assumption ( variance heterogeneity) Time Frequently occurs in time series implying periodicity Age Frequently appears when variance is a function of the mean 26
27 Assumptions about residuals: solutions when violated Think about outliers (eliminate them?) Transform y or x data (log, square root, Box-Cox ) Use a more convenient model (non-linear, Poisson ) Autocorrelation: use the Cochrane-Orcutt model (XLSTAT-Forecast) 27
28 Multiple linear regression y = a*x 1 + b*x
29 Multiple linear regression - principles Investigate the linear influence of several explanatory variables on the dependent variable; increase predictive quality 29
30 Multiple linear regression - warnings In addition to the assumptions about residuals: beware of overfitting & multicolinearity 30
31 Adding explanatory variables Multiple linear regression warnings Adding explanatory variables will increase the R² Warning: do not add too many of them To avoid obtaining models that are too fitted on your particular data, and that will consequently be less generalizable. The AIC model quality index builds a compromise between: A good fitting to the data. A low number of parameters. AIC is a relative quality index that should only be used to compare models with each other. The model with the lowest AIC is the best model in the model set. Warning: beware of redundant variables Some correlated explanatory variables may hide each other in terms of effects on the dependent variable. This is called multicolinearity (VIF index > 5). Examples : day temperature & night temperature; weight & height 31
32 Linear modeling of invoice amount according to a set of variables Multiple linear regression Question: which variables (D-G columns) have the strongest linear influence on invoice amount? Can we predict invoice amount of two new clients? 32
33 Linear modeling of invoice amount according to a set of variables Multiple linear regression - XLSTAT 33
34 Linear modeling of invoice amount according to a set of variables Multiple linear regression Examining Multicolinearity High VIF (>5) Redundant variables Solution: exclude one of these 2 variables and re-launch the model 34
35 Linear modeling of invoice amount according to a set of variables Multiple linear regression excluding height Interpretation : Weight as a significant positive effect on Invoice amount 35
36 Linear modeling of invoice amount according to a set of variables Prediction 36
37 According to the type and number of dependent and explanatory variables, several solutions are available What statistical modeling method should you choose? Link: choose an appropriate modeling tool according to your situation 37
38 Conclusion: Let s get back to this question about height... Different models to answer the same question Somebody asks you: what is the height of French people? Height of French people: dependent variable 4 It depends linearly on age and origin ANCOVA Their height has this average and that 1 standard deviation 5 Normal distribution model It depends linearly on age and father s height Multiple linear regression It depends on geographic origin 2 6 One-way ANOVA It depends on origin and gender 2-way ANOVA It depends linearly on age 3 7 Simple linear regression Etc. etc. Quantitative explanatory var. Qualitative explanatory var. 38
39 In summary 39
40 Introduction to statistical modeling - summary Statistical modeling allows to: Investigate how dependent variables evolve according to explanatory variables using a mathematical equation that involves parameters. Predict using this equation Linear models are reliable only under certain assumptions related to residuals: normality, homoscedasticity, absence of autocorrelation & not too many outliers Beware of problems related to the introduction of too many explanatory variables: overfitting & multicollinearity. According to variable types, different models are available. 40
41 Thanks for attending! All the tools we saw are available in all XLSTAT solutions (except XLSTAT-Free) Survey time 41
42 Online recording availability of our webinars Until Dec. 16,
43 Appendix: Alternative modeling tools Tables with a high number of explanatory variables (> nb. Of observations) with potentially important multicollinearity: PLS regression Supervised Machine Learning: KNN, Naïve Bayes, SVM (especially for prediction); decision trees 43
44 Appendix: residuals-alternative verification methods Independence Run a Durbin-Watson test on std. Residuals (XLSTAT-Forecast). Normality Run a normality test on std. Residuals. Not too many outliers Check that not more than 5% of std. residuals are higher than Homoscedasticity Run a heteroscedasticity test (Breusch- Pagan or White) on std. residuals. 44
Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.
Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate
More informationDEMAND ESTIMATION (PART III)
BEC 30325: MANAGERIAL ECONOMICS Session 04 DEMAND ESTIMATION (PART III) Dr. Sumudu Perera Session Outline 2 Multiple Regression Model Test the Goodness of Fit Coefficient of Determination F Statistic t
More informationStatistics for Managers using Microsoft Excel 6 th Edition
Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of
More informationProject Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang
Project Report for STAT7 Statistical Methods Instructor: Dr. Ramon V. Leon Wage Data Analysis Yuanlei Zhang 77--7 November, Part : Introduction Data Set The data set contains a random sample of observations
More informationCHAPTER 6: SPECIFICATION VARIABLES
Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationTrendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues
Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Overfitting Categorical Variables Interaction Terms Non-linear Terms Linear Logarithmic y = a +
More informationChapter 4. Regression Models. Learning Objectives
Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing
More informationRegression Models. Chapter 4. Introduction. Introduction. Introduction
Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager
More informationChapter 13. Multiple Regression and Model Building
Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the
More informationBasic Business Statistics 6 th Edition
Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based
More informationAUTOCORRELATION. Phung Thanh Binh
AUTOCORRELATION Phung Thanh Binh OUTLINE Time series Gauss-Markov conditions The nature of autocorrelation Causes of autocorrelation Consequences of autocorrelation Detecting autocorrelation Remedial measures
More informationMultiple Linear Regression
Multiple Linear Regression University of California, San Diego Instructor: Ery Arias-Castro http://math.ucsd.edu/~eariasca/teaching.html 1 / 42 Passenger car mileage Consider the carmpg dataset taken from
More informationRegression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear
Regression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear relationship between: - one independent variable X and -
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level
More informationRegression Analysis. BUS 735: Business Decision Making and Research
Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn
More informationChapter 14 Student Lecture Notes 14-1
Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this
More informationChapter 4: Regression Models
Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,
More informationFinQuiz Notes
Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable
More informationAssumptions of the error term, assumptions of the independent variables
Petra Petrovics, Renáta Géczi-Papp Assumptions of the error term, assumptions of the independent variables 6 th seminar Multiple linear regression model Linear relationship between x 1, x 2,, x p and y
More informationPsychology Seminar Psych 406 Dr. Jeffrey Leitzel
Psychology Seminar Psych 406 Dr. Jeffrey Leitzel Structural Equation Modeling Topic 1: Correlation / Linear Regression Outline/Overview Correlations (r, pr, sr) Linear regression Multiple regression interpreting
More informationChart types and when to use them
APPENDIX A Chart types and when to use them Pie chart Figure illustration of pie chart 2.3 % 4.5 % Browser Usage for April 2012 18.3 % 38.3 % Internet Explorer Firefox Chrome Safari Opera 35.8 % Pie chart
More informationChapter 3 Multiple Regression Complete Example
Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be
More informationCourse in Data Science
Course in Data Science About the Course: In this course you will get an introduction to the main tools and ideas which are required for Data Scientist/Business Analyst/Data Analyst. The course gives an
More informationMBA Statistics COURSE #4
MBA Statistics 51-651-00 COURSE #4 Simple and multiple linear regression What should be the sales of ice cream? Example: Before beginning building a movie theater, one must estimate the daily number of
More informationMathematics for Economics MA course
Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between
More informationForecasting. BUS 735: Business Decision Making and Research. exercises. Assess what we have learned
Forecasting BUS 735: Business Decision Making and Research 1 1.1 Goals and Agenda Goals and Agenda Learning Objective Learn how to identify regularities in time series data Learn popular univariate time
More informationGlossary. The ISI glossary of statistical terms provides definitions in a number of different languages:
Glossary The ISI glossary of statistical terms provides definitions in a number of different languages: http://isi.cbs.nl/glossary/index.htm Adjusted r 2 Adjusted R squared measures the proportion of the
More informationRegression Analysis By Example
Regression Analysis By Example Third Edition SAMPRIT CHATTERJEE New York University ALI S. HADI Cornell University BERTRAM PRICE Price Associates, Inc. A Wiley-Interscience Publication JOHN WILEY & SONS,
More informationFinding Relationships Among Variables
Finding Relationships Among Variables BUS 230: Business and Economic Research and Communication 1 Goals Specific goals: Re-familiarize ourselves with basic statistics ideas: sampling distributions, hypothesis
More informationRef.: Spring SOS3003 Applied data analysis for social science Lecture note
SOS3003 Applied data analysis for social science Lecture note 05-2010 Erling Berge Department of sociology and political science NTNU Spring 2010 Erling Berge 2010 1 Literature Regression criticism I Hamilton
More informationLINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises
LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on
More informationECON 497: Lecture 4 Page 1 of 1
ECON 497: Lecture 4 Page 1 of 1 Metropolitan State University ECON 497: Research and Forecasting Lecture Notes 4 The Classical Model: Assumptions and Violations Studenmund Chapter 4 Ordinary least squares
More informationThe simple linear regression model discussed in Chapter 13 was written as
1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple
More informationWe like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model.
Statistical Methods in Business Lecture 5. Linear Regression We like to capture and represent the relationship between a set of possible causes and their response, by using a statistical predictive model.
More informationKeller: Stats for Mgmt & Econ, 7th Ed July 17, 2006
Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will
More informationChapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania
Chapter 10 Regression Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Scatter Diagrams A graph in which pairs of points, (x, y), are
More informationLECTURE 11. Introduction to Econometrics. Autocorrelation
LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct
More informationTHE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH. Robert R. SOKAL and F. James ROHLF. State University of New York at Stony Brook
BIOMETRY THE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH THIRD E D I T I O N Robert R. SOKAL and F. James ROHLF State University of New York at Stony Brook W. H. FREEMAN AND COMPANY New
More informationCHAPTER 5 LINEAR REGRESSION AND CORRELATION
CHAPTER 5 LINEAR REGRESSION AND CORRELATION Expected Outcomes Able to use simple and multiple linear regression analysis, and correlation. Able to conduct hypothesis testing for simple and multiple linear
More informationPredictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore
What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the
More informationStatistics Toolbox 6. Apply statistical algorithms and probability models
Statistics Toolbox 6 Apply statistical algorithms and probability models Statistics Toolbox provides engineers, scientists, researchers, financial analysts, and statisticians with a comprehensive set of
More informationSociology 6Z03 Review II
Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability
More informationregression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist
regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist sales $ (y - dependent variable) advertising $ (x - independent variable)
More informationECON 4230 Intermediate Econometric Theory Exam
ECON 4230 Intermediate Econometric Theory Exam Multiple Choice (20 pts). Circle the best answer. 1. The Classical assumption of mean zero errors is satisfied if the regression model a) is linear in the
More informationy response variable x 1, x 2,, x k -- a set of explanatory variables
11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate
More informationEconometrics Part Three
!1 I. Heteroskedasticity A. Definition 1. The variance of the error term is correlated with one of the explanatory variables 2. Example -- the variance of actual spending around the consumption line increases
More informationCorrelation & Simple Regression
Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.
More informationSTAT 212 Business Statistics II 1
STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb
More informationChapter 16. Simple Linear Regression and Correlation
Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will
More informationTaguchi Method and Robust Design: Tutorial and Guideline
Taguchi Method and Robust Design: Tutorial and Guideline CONTENT 1. Introduction 2. Microsoft Excel: graphing 3. Microsoft Excel: Regression 4. Microsoft Excel: Variance analysis 5. Robust Design: An Example
More informationInferences for Regression
Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In
More informationBayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities
Valua%on and pricing (November 5, 2013) LEARNING OBJECTIVES Lecture 7 Decision making (part 3) Regression theory Olivier J. de Jong, LL.M., MM., MBA, CFD, CFFA, AA www.olivierdejong.com 1. List the steps
More informationIntroduction to Regression
Introduction to Regression ιατµηµατικό Πρόγραµµα Μεταπτυχιακών Σπουδών Τεχνο-Οικονοµικά Συστήµατα ηµήτρης Φουσκάκης Introduction Basic idea: Use data to identify relationships among variables and use these
More informationBusiness Statistics. Lecture 9: Simple Regression
Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals
More informationTable of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z).
Table of z values and probabilities for the standard normal distribution. z is the first column plus the top row. Each cell shows P(X z). For example P(X.04) =.8508. For z < 0 subtract the value from,
More informationRegression Diagnostics Procedures
Regression Diagnostics Procedures ASSUMPTIONS UNDERLYING REGRESSION/CORRELATION NORMALITY OF VARIANCE IN Y FOR EACH VALUE OF X For any fixed value of the independent variable X, the distribution of the
More informationThis document contains 3 sets of practice problems.
P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them
More informationINTRODUCTORY REGRESSION ANALYSIS
;»»>? INTRODUCTORY REGRESSION ANALYSIS With Computer Application for Business and Economics Allen Webster Routledge Taylor & Francis Croup NEW YORK AND LONDON TABLE OF CONTENT IN DETAIL INTRODUCTORY REGRESSION
More informationInteractions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept
Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and
More informationStatistics Boot Camp. Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018
Statistics Boot Camp Dr. Stephanie Lane Institute for Defense Analyses DATAWorks 2018 March 21, 2018 Outline of boot camp Summarizing and simplifying data Point and interval estimation Foundations of statistical
More informationREVIEW 8/2/2017 陈芳华东师大英语系
REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p
More informationMultiple Regression. Peerapat Wongchaiwat, Ph.D.
Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model
More informationSix Sigma Black Belt Study Guides
Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships
More informationG. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication
G. S. Maddala Kajal Lahiri WILEY A John Wiley and Sons, Ltd., Publication TEMT Foreword Preface to the Fourth Edition xvii xix Part I Introduction and the Linear Regression Model 1 CHAPTER 1 What is Econometrics?
More informationReview of Statistics 101
Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods
More informationApplied Regression Modeling
Applied Regression Modeling A Business Approach Iain Pardoe University of Oregon Charles H. Lundquist College of Business Eugene, Oregon WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION CONTENTS
More informationx3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators
Multiple Regression Relating a response (dependent, input) y to a set of explanatory (independent, output, predictor) variables x, x 2, x 3,, x q. A technique for modeling the relationship between variables.
More informationCase Study A Parametric Model for the Cost per Flight Hour (CPFH)
CU Alumni and Defence Conference War Museum, Athens, 1st June 2017 Case Study A Parametric Model for the Cost per Flight Hour (CPFH) Michail Bozoudis HAF Engineer General Directorate of Defence Investments
More informationLECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity
LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists
More informationBivariate Relationships Between Variables
Bivariate Relationships Between Variables BUS 735: Business Decision Making and Research 1 Goals Specific goals: Detect relationships between variables. Be able to prescribe appropriate statistical methods
More informationFORECASTING STANDARDS CHECKLIST
FORECASTING STANDARDS CHECKLIST An electronic version of this checklist is available on the Forecasting Principles Web site. PROBLEM 1. Setting Objectives 1.1. Describe decisions that might be affected
More informationModeling Spatial Relationships Using Regression Analysis
Esri International User Conference San Diego, California Technical Workshops July 24, 2012 Modeling Spatial Relationships Using Regression Analysis Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS Answering
More informationStatistics for Managers Using Microsoft Excel
Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap
More informationREED TUTORIALS (Pty) LTD ECS3706 EXAM PACK
REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK 1 ECONOMETRICS STUDY PACK MAY/JUNE 2016 Question 1 (a) (i) Describing economic reality (ii) Testing hypothesis about economic theory (iii) Forecasting future
More informationOne-Way ANOVA. Some examples of when ANOVA would be appropriate include:
One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement
More informationData Analysis and Statistical Methods Statistics 651
y 1 2 3 4 5 6 7 x Data Analysis and Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasini/teaching.html Lecture 32 Suhasini Subba Rao Previous lecture We are interested in whether a dependent
More informationOkun's Law Testing Using Modern Statistical Data. Ekaterina Kabanova, Ilona V. Tregub
Okun's Law Testing Using Modern Statistical Data Ekaterina Kabanova, Ilona V. Tregub The Finance University under the Government of the Russian Federation International Finance Faculty, Moscow, Russia
More informationChapter 7 Student Lecture Notes 7-1
Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model
More informationFRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE
FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers
More information2011 Pearson Education, Inc
Statistics for Business and Economics Chapter 7 Inferences Based on Two Samples: Confidence Intervals & Tests of Hypotheses Content 1. Identifying the Target Parameter 2. Comparing Two Population Means:
More informationFAQ: Linear and Multiple Regression Analysis: Coefficients
Question 1: How do I calculate a least squares regression line? Answer 1: Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables so that one variable
More informationSTA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.
STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory
More informationCorrelation and Regression (Excel 2007)
Correlation and Regression (Excel 2007) (See Also Scatterplots, Regression Lines, and Time Series Charts With Excel 2007 for instructions on making a scatterplot of the data and an alternate method of
More informationAnswer all questions from part I. Answer two question from part II.a, and one question from part II.b.
B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries
More informationGIS Analysis: Spatial Statistics for Public Health: Lauren M. Scott, PhD; Mark V. Janikas, PhD
Some Slides to Go Along with the Demo Hot spot analysis of average age of death Section B DEMO: Mortality Data Analysis 2 Some Slides to Go Along with the Demo Do Economic Factors Alone Explain Early Death?
More information1 The Multiple Regression Model: Freeing Up the Classical Assumptions
1 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions were crucial for many of the derivations of the previous chapters. Derivation of the OLS estimator
More informationModeling Spatial Relationships using Regression Analysis
Esri International User Conference San Diego, CA Technical Workshops July 2011 Modeling Spatial Relationships using Regression Analysis Lauren M. Scott, PhD Lauren Rosenshein, MS Mark V. Janikas, PhD Answering
More informationModeling Spatial Relationships Using Regression Analysis. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS
Modeling Spatial Relationships Using Regression Analysis Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS Workshop Overview Answering why? questions Introduce regression analysis - What it is and why
More informationEco and Bus Forecasting Fall 2016 EXERCISE 2
ECO 5375-701 Prof. Tom Fomby Eco and Bus Forecasting Fall 016 EXERCISE Purpose: To learn how to use the DTDS model to test for the presence or absence of seasonality in time series data and to estimate
More informationThe Ins and Outs of Using Dynamic Regression Models for Forecasting
The Ins and Outs of Using Dynamic Regression Models for Forecasting Presented by Eric Stellwagen Vice President & Cofounder Business Forecast Systems, Inc. estellwagen@forecastpro.com Business Forecast
More informationBasic Business Statistics, 10/e
Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,
More informationVARIANCE ANALYSIS OF WOOL WOVEN FABRICS TENSILE STRENGTH USING ANCOVA MODEL
ANNALS OF THE UNIVERSITY OF ORADEA FASCICLE OF TEXTILES, LEATHERWORK VARIANCE ANALYSIS OF WOOL WOVEN FABRICS TENSILE STRENGTH USING ANCOVA MODEL VÎLCU Adrian 1, HRISTIAN Liliana 2, BORDEIANU Demetra Lăcrămioara
More informationInference with Simple Regression
1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems
More information2 Prediction and Analysis of Variance
2 Prediction and Analysis of Variance Reading: Chapters and 2 of Kennedy A Guide to Econometrics Achen, Christopher H. Interpreting and Using Regression (London: Sage, 982). Chapter 4 of Andy Field, Discovering
More informationDiagnostics of Linear Regression
Diagnostics of Linear Regression Junhui Qian October 7, 14 The Objectives After estimating a model, we should always perform diagnostics on the model. In particular, we should check whether the assumptions
More informationLECTURE 5. Introduction to Econometrics. Hypothesis testing
LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will
More informationThe Multiple Regression Model
Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:
More informationMultiple Regression Methods
Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret
More informationMathematical Notation Math Introduction to Applied Statistics
Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor
More information