Applied Regression Modeling

Similar documents
Applied Regression Modeling

Regression Analysis By Example

Linear Models in Statistics

A Second Course in Statistics: Regression Analysis

Wiley. Methods and Applications of Linear Models. Regression and the Analysis. of Variance. Third Edition. Ishpeming, Michigan RONALD R.

Contents. Acknowledgments. xix

Generalized, Linear, and Mixed Models

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Continuous Univariate Distributions

THE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH. Robert R. SOKAL and F. James ROHLF. State University of New York at Stony Brook

INTRODUCTORY REGRESSION ANALYSIS

Response Surface Methodology

Continuous Univariate Distributions

Handbook of Regression Analysis

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

From Practical Data Analysis with JMP, Second Edition. Full book available for purchase here. About This Book... xiii About The Author...

Statistical Methods in HYDROLOGY CHARLES T. HAAN. The Iowa State University Press / Ames

Contents. Preface to Second Edition Preface to First Edition Abbreviations PART I PRINCIPLES OF STATISTICAL THINKING AND ANALYSIS 1

DESIGN AND ANALYSIS OF EXPERIMENTS Third Edition

Stat 5101 Lecture Notes

FORECASTING. Methods and Applications. Third Edition. Spyros Makridakis. European Institute of Business Administration (INSEAD) Steven C Wheelwright

Course in Data Science

Applied Regression Modeling: A Business Approach Chapter 2: Simple Linear Regression Sections

Forecasting: Methods and Applications

Subjective and Objective Bayesian Statistics

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

INTRODUCTION TO LINEAR REGRESSION ANALYSIS

Regression Model Building

Chapter 13. Multiple Regression and Model Building

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M.

STATISTICS; An Introductory Analysis. 2nd hidition TARO YAMANE NEW YORK UNIVERSITY A HARPER INTERNATIONAL EDITION

Statistical Evaluations in Exploration for Mineral Deposits

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

HANDBOOK OF APPLICABLE MATHEMATICS

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Regression Diagnostics Procedures

Response Surface Methodology:

An Introduction to Multivariate Statistical Analysis

1 Introduction Overview of the Book How to Use this Book Introduction to R 10

Pattern Recognition and Machine Learning

Contents. Preface xxiii About the Authors xxix

Multiple Linear Regression

COPYRIGHTED MATERIAL CONTENTS. Preface Preface to the First Edition

Lecture 18: Simple Linear Regression

STATISTICS 479 Exam II (100 points)

Contents. Set Theory. Functions and its Applications CHAPTER 1 CHAPTER 2. Preface... (v)

A Course in Time Series Analysis

Chapter 4 Multi-factor Treatment Designs with Multiple Error Terms 93

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Basic Business Statistics 6 th Edition

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.

Statistical Methods. for Forecasting

Statistical Methods for Forecasting

Experimental Design and Data Analysis for Biologists

Multicollinearity occurs when two or more predictors in the model are correlated and provide redundant information about the response.

Introduction to Eco n o m et rics

Unit 6 - Simple linear regression

A Guide to Modern Econometric:

Univariate Discrete Distributions

ELECTROCHEMICAL IMPEDANCE SPECTROSCOPY

Subject CS1 Actuarial Statistics 1 Core Principles

Finite Population Sampling and Inference

Data Analysis. with Excel. An introduction for Physical scientists. LesKirkup university of Technology, Sydney CAMBRIDGE UNIVERSITY PRESS

Linear Statistical Models

Psychology Seminar Psych 406 Dr. Jeffrey Leitzel

Linear Regression and Correlation

Elements of Multivariate Time Series Analysis

Generalized Linear Models (GLZ)

TIME SERIES DATA ANALYSIS USING EVIEWS

Christopher Dougherty London School of Economics and Political Science

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Inference for Regression Inference about the Regression Model and Using the Regression Line

Machine Learning Linear Regression. Prof. Matteo Matteucci

Unit 6 - Introduction to linear regression

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).

NUMERICAL METHODS FOR ENGINEERING APPLICATION

TABLE OF CONTENTS INTRODUCTION, APPROXIMATION & ERRORS 1. Chapter Introduction to numerical methods 1 Multiple-choice test 7 Problem set 9

Contents. Part I: Fundamentals of Bayesian Inference 1

Chapter 1 Statistical Inference

PART I INTRODUCTION The meaning of probability Basic definitions for frequentist statistics and Bayesian inference Bayesian inference Combinatorics

COMPREHENSIVE WRITTEN EXAMINATION, PAPER III FRIDAY AUGUST 26, 2005, 9:00 A.M. 1:00 P.M. STATISTICS 174 QUESTION

Unit 10: Simple Linear Regression and Correlation

Deccan Education Society s FERGUSSON COLLEGE, PUNE (AUTONOMOUS) SYLLABUS UNDER AUTOMONY. SECOND YEAR B.Sc. SEMESTER - III

DISCOVERING STATISTICS USING R

Chapter 15 Multiple Regression

Chapter 10 Correlation and Regression

Contents. Part I Statistical Background and Basic Data Handling 5. List of Figures List of Tables xix

Institute of Actuaries of India

The Business Forecasting Deal

Multiple Linear Regression

The simple linear regression model discussed in Chapter 13 was written as

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Contents. I Background 1. Contents... Preface... Acknowledgments... The Blind Men and the Elephant... xxi. History of Impedance Spectroscopy...

Transition Passage to Descriptive Statistics 28

Exam Applied Statistical Regression. Good Luck!

Introduction to Statistical Analysis using IBM SPSS Statistics (v24)

1. 4 2y 1 2 = x = x 1 2 x + 1 = x x + 1 = x = 6. w = 2. 5 x

AN INTRODUCTION TO PROBABILITY AND STATISTICS

FORECASTING METHODS AND APPLICATIONS SPYROS MAKRIDAKIS STEVEN С WHEELWRIGHT. European Institute of Business Administration. Harvard Business School

Transcription:

Applied Regression Modeling A Business Approach Iain Pardoe University of Oregon Charles H. Lundquist College of Business Eugene, Oregon WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION

CONTENTS Preface Acknowledgments xiii xv Introduction xvii 1.1 Statistics in business xvii 1.2 Learning statistics xix 1 Foundations 1 1.1 Identifying and summarizing data 1 1.2 Population distributions 4 1.3 Selecting individuals at random probability 9 1.4 Random sampling 10 1.4.1 Central limit theorem normal version 11 1.4.2 Student's t-distribution 12 1.4.3 Central limit theorem t version 14 1.5 Interval estimation 14 1.6 Hypothesis testing 17 1.6.1 The rejection region method 17 1.6.2 The p-value method 19 1.6.3 Hypothesis test errors 23 1.7 Random errors and prediction 23 vii

VIII CONTENTS 1.8 Chapter summary 26 Problems 27 2 Simple linear regression 31 2.1 Probability model for X and Y 31 2.2 Least Squares criterion 36 2.3 Model evaluation 40 2.3.1 Regression Standard error 41 2.3.2 Coefficient of determination R 2 43 2.3.3 Slope parameter 47 2.4 Model assumptions 54 2.4.1 Checking the model assumptions 54 2.5 Model Interpretation 59 2.6 Estimation and prediction 60 2.6.1 Confidence interval for the population mean, E(y) 61 2.6.2 Prediction interval for an individual F-value 62 2.7 Chapter summary 65 2.7.1 Review example 66 Problems 70 3 Multiple linear regression 73 3.1 Probability model for (Xx, X 2,...) and Y 73 3.2 Least Squares criterion 77 3J Model evaluation 81 3.3.1 Regression Standard error 81 3.3.2 Coefficient of determination R 2 82 3.3.3 Regression parameters global usefulness test 89 3.3.4 Regression parameters nested model test 93 3.3.5 Regression parameters individual tests 97 3.4 Model assumptions 105 3.4.1 Checking the model assumptions 106 3.5 Model interpretation 109 3.6 Estimation and prediction 111 3.6.1 Confidence interval for the population mean, E(7) 111 3.6.2 Prediction interval for an individual F-value 112 3.7 Chapter summary 114 Problems 116 4 Regression model building I 121 4.1 Transformations 122 4.1.1 Natural logarithm transformation for predictors 122

CONTENTS IX 4.1.2 Polynomial transformation for predictors 128 4.1.3 Reciprocal transformation for predictors 130 4.1.4 Natural logarithm transformation for the response 134 4.1.5 Transformations for the response and predictors 137 4.2 Interactions 140 4.3 Qualitative predictors 146 4.3.1 Qualitative predictors with two levels 147 4.3.2 Qualitative predictors with three or more levels 153 4.4 Chapter summary 158 Problems 160 5 Regression model buifding II 165 5.1 Influential points 165 5.1.1 Outliers 165 5.1.2 Leverage 168 5.1.3 Cook's distance 171 5.2 Regression pitfalls 173 5.2.1 Autocorrelation 173 5.2.2 Multicollinearity 175 5.2.3 Excluding important predictor variables 177 5.2.4 Overfitting 180 5.2.5 Extrapolation 181 5.2.6 Missing Data 183 5.3 Model building guidelines 186 5.4 Model Interpretation using graphics 188 5.5 Chapter summary 194 Problems 196 6 Case studies 201 6.1 Homeprices 201 6.1.1 Data description 201 6.1.2 Exploratory data analysis 203 6.1.3 Regression model building 204 6.1.4 Results and conclusions 205 6.1.5 Further questions 210 6.2 Vehicle fuel efficiency 211 6.2.1 Data description 211 6.2.2 Exploratory data analysis 211 6.2.3 Regression model building 213 6.2.4 Results and conclusions 214 6.2.5 Further questions 219

X CONTENTS 7 Extensions 221 7.1 Generalized linear modeis 222 7.1.1 Logistic regression 222 7.1.2 Poisson regression 226 7.2 Discrete choice modeis 229 7.3 Multilevel modeis 232 7.4 Bayesian modeling 234 7.4.1 Frequentist inference 234 7.4.2 Bayesian inference 235 Appendix A: Computer Software help 237 A.l SPSS 238 A. 1.1 Getting started and summarizing univariate data 238 A. 1.2 Simple linear regression 241 A. 1.3 Multiple linear regression 243 A.2 Minitab 245 A.2.1 Getting started and summarizing univariate data 245 A.2.2 Simple linear regression 248 A.2.3 Multiple linear regression 249 A.3 SAS 251 A.3.1 Getting started and summarizing univariate data 252 A.3.2 Simple linear regression 254 A.3.3 Multiple linear regression 255 A.4 R and S-PLUS 257 A.4.1 Getting started and summarizing univariate data 258 A.4.2 Simple linear regression 260 A.4.3 Multiple linear regression 261 A.5 Excel 263 A.5.1 Getting started and summarizing univariate data 263 A.5.2 Simple linear regression 265 A.5.3 Multiple linear regression 265 Problems 267 Appendix B: Critical values for t-distributions 269 Appendix C: Notation and formulas 273 C.l Univariate data 273 C.2 Simple linear regression 274 C.3 Multiple linear regression 275 Appendix D: Mathematics refresher 277

CONTENTS XI D. 1 The natural logarithm and exponential functions 277 D.2 Rounding and accuracy 278 Appendix E: Brief answers to selected problems 279 References 287 Glossary 291 Index 297