Factor analysis and multiple linear regression modeling

Size: px
Start display at page:

Download "Factor analysis and multiple linear regression modeling"

Transcription

1 IieghnalCharacterimàonofWaterQua!ity(Vrooeeàiniso{ths Baltimore Symposium, May 1989). IAHSPubl.no.182,1989. analysis and multiple linear regression modeling Dr.K.S.V.Basivi Reddy Pri ncipal Kakatiya Institute of Technology?< Science, Warangal India Dr. M. Panduranga Rao Pro-fessor o-f Engineering Geology Regional Engineering College, Warangal India ABSTRACT Water quality data obtained from Warangal Urban agglomeration which is a hard rock terrain was subjected to multiple regression analysis. 58 wells were monitored for 5 seasons and each sample was analysed for 13 parameters. analytic studies of the body of the data obtained from the Warangal water quality analysis show that only a few factors adequately represent the traits that define the water quality. Sodium, Chloride, Ionisation, Hardness and Total Dissolved Solids arib grouped under one factor representing salinity, mineralisation of waters and pollution. Another factor is represented by potassium, calcium and magnesium reflecting the calcium magnesium dominant nature o-f Warangal waters. The third factor is covered by sulphates and fluorides indicating permanent hardness. factor is covered by ph and bicarbonates and carbonates The fourth reflecting alkalinity and temporary hardness. The fifth factor is nitrate signifying man made pollution. This analysis has been used to suggest models for predicting water quality. On the whole, it appears, that sodium either independently or in association and bicarbonates are the causal with potassium, magnesium, calcium variables for the determination of total dissolved solids. It is also seen that chlorides, either independently or in association with bicarbonates and magnesium appears to be important, variables -for the contribution of TDS. 31

2 K. S. V. Basivi Ready & M. Panduranga Rao 32 With the help o-f these models it. is possible to predict the water quality in any water given one predictor value? such as specific conductivity which in turn clearly indicates TDS. SIGNIFICANCE The extensive collection of data related to chemical quality of groundwaters of the Warangal urban agglomeration comprises 13 different properties which are mostly inter-correlated making the interpretation complex. analysis is a technique which tries to interpret intercorrelated variable to yield meaningful conclusions. The basic assumption in the factor analysis is that if the test battery is intercorrelated, they can be transformed suitably to yield uncorrelated derived constructs known as factors. It is possible to interpret these factors for meaningful application. The analysis is, therefore, applied to the 58 samples collected in each of the five seasons-june 1981; Oct.1981; Dec.1981; Feb.1982 and May These were duly processed for 13 parameters throughout the factor analysis. THE FACTOR MODEL analysis technique provides a mathematical model which can be used to expedite the computation of multiple regression statistics by deriving a number of variables. The principal objective of the factor analysis is to develop a parsimonious description of the observed variables, and to discover the fundamental or basic traits among them. The technique consists in accounting for the tests and their intercorrelations to determine the minimum number of uncorrelated dimensions to yield factors which convey all the essential information given by the original set of variables. These dimensions or FACTORS, in turn help in identification of basic traits or other general concepts. There are several variations in the method of solving the factors

3 33 analysis and multiple linear regression modelling problem. The method of principal components based on the following model is mostly advocated for data reduction jobs (Cooley, W.W and Lohness,P.R.1971). Any standardised test score 'Z' of an individual 'i ' can be considered as a linear combination of several underlying factors by a model of the type. Zji = aji + fii + aj2 f2i + aj3 f3i + ajp fpi (j = l,2,....p) Where Zji = Standard score of an individual ajp = loading of an individual i on test j on factor p fpi = The amount of uncorrelated trait measured by factor 'p' which is possessed by individual i. This method is based on the contention that 13 variables can be represented in 13 dimensional tests space model and that the loci of uniform frequency density is essentially a hyperel1ipsoid. The axes of these ellipsoids correspond to the principal component thus defines the factor or basic dimension of all the variables which &re correlated with each other. A special feature of the solution is that the first principal component is a linear combination of all variables which extracts the maximum of the total variance, the second principal component which is orthogonal with the first and further extracts the maximum out of the residual variance and so on until all the variance -is extracted. In other words, the sum of the variance of all the principal components is equal to the sum of the variance of the original variables. If it is possible to find out a set of smaller number of linear combinations or components of the original variables which account for most of the variance, then, considerable perismony is achieved. In this work the procedure suggested by Harman is adopted. THE DATA The data that is employed for applying the analysis techniques is obtained from the investigation described earlier

4 K. S. V. Basivi Reddy & M. Panduranga Rao 34 for di-f-ferent seasons. The primary intention for applying the factor analysis is, to study the chemical parameters involved in the above investigation, to coal ice the abstract properties o-f the waters and to identify the basic parameters influencing the water quality. The following are the variables considered. Table 1: List o-f Parameters examined Variable No. Parameter ph Sodi urn Potassi urn Magnesi urn Cal ci urn Chloride Ni trate Bicarbonates + Carbonates Sulphate FIuori de Hardness Sum o-f ions 13. Total Dissolved Solids ANALYSIS A computer programme was prepared utilising the standard subroutines on Eigen vector and Vari-max rotation techniques. The data was -fed to the Integra 1001 system available at Computer Maintenance Corporation Ltd., Hyderabad and results: i) The Intercorrelated Matrix, Means and Standard deviation o-f the 13 variables for the 5 seasons ii) The Eigen values and Vectors -for the 5 seasons

5 35 analysis and multiple linear regression modelling iii) The Matrix -for the 5 seasons, and iv) The Rotated Matrix for the 5 seasons have been tabulated -from computer output. The tabular statements o-f these -four characteristics -for all -five seasons run into 20 tables and are lengthy. Owing to space restrictions these Bre not included in this paper. The coefficients of the Rotated Matrix indicate the correlations of the variables with the respective -factor and provide a basis for identifying their names. Generally the name selected is governed by the largest, correlations with the factors under consideration. The following variables are identified in the case of the data obtained for June I.. Variables 2,6,11,12,13 II.. Variables 3,4,5 III.. Variables 9,10 IV.. Variables 1,8 V.. Variable 7 The traits with significant coefficients for I are: Sodium (.923), Chloride (.939), Hardness (.719), Sum of ions (.926) and Total Dissolved Solids (.934) I Sodium, Chloride, Sum of ions, Hardness and T.D.S. are grouped under this factor. The sodium and Chloride reflects the salinity while the sum of ions, hardness and T.D.S. indicate the extent of mineralisation in the waters. Chloride is also indicative of pollution. II Potassium, Calcium and Magnesium srs grouped under this factor. The calcium and magnesium reflects the Ca-Mg dominant nature of waters. III Sulphate and Fluorides sre grouped under this factor. Sulphate is an indication of permanent hardness of waters.

6 K. S. V. Basivi Reddy & M, Panduranga Rao 36 IV ph and HCO + CO are grouped under this factor reflecting alkalinity and temporary hardness ot waters. V Nitrate has been named in this -factor signifying man made pollution. Similar analysis was done for the seasons October,1981, December,1981, February, 1982 and May,1982. Summary of the results indicate that most of the waters can be considered to have 5 different factors. Generally, I and II indicate the salinity and mineralisation. III indicates the permanent hardness. IV indicates temporary hardness. V indicates man made pollution. Therefore, it can be concluded that the Warangal waters have salinity, hardness as well as man made pollution for all the seasons in almost all the wells. REGRESSION A regression problem considers the frequency distribution of one variable when another is held fixed at each of several levels. A correlation problem considers the joint variation of two measurements, neither of which is restricted by the experiment. Correlation is a process by which the degree of association between samples of two variables X and Y is defined. The correlation coefficient is a mathematical definition of that association. The end product of the process of correlation is the correlation coefficient; it is not an equation. 1.MULTIPLE LINEAR REGRESSION The 'regression' model assumes that, some variable 'Y' responds to changes in other 'X' variables. The 'Y' variable is the quantity under study and is known as 'response' or 'dependent' variable. The X variables are those which exhibit a causal effect on the

7 37 analysis and multiple linear regression modelling value of the Y and Are known as the ' expl anatory ' or 'independent' variables. The model is expressed by Y = bo+blxl+...bkyk Where bo, bi... bk sre the least squares estimators of the unknown parameters, bo is the intercept, while bl... bk are regression coef f i ci ents. They are chosen in such a way as to minimise the squared sum of the residuals or deviations -from the estimated line. The major issues in the development of this model are: a) The identification of those variables which have significant and separte effects on the dependent variables. b) The model must not only provide good statistical fit to the present day date but must also be of a logical and meaningful form. c) The variables must be meaningful in explaining the dependent variable behaviour. With such an equation developed, it is possible to develop future levels of the dependent variable given future predictor indicators. The adequacy of the model can be tested by the Analysis of variance approach. The total sum of squares is decomposed into regression sum of squares and error sum of squares. SST = SSR + SSE The multifile correlation coefficient R. R = SSR/SST This indicates the degree of association between independent variable and the dependent variable. It varies between 0 and 1. Closer to 0 is worse but closer 1 is better. The significance of R is that its square R is approximately the decimal fraction of the variation of 'Y' accounted by independent variables xi, i.e. if R =0.941 then of the total variance in the data is explained by the model.

8 K. S. V. Basivi Reddy & M. Panduranga Rao 'F'TEST The regression sum of squares can be used to give some indication concerning whether or not the model is an adequate explanation of the true situation. One test is the F ratio. SSR/k F = at k, n-k-1 d.f. SSE(n-k-l) From F tables at prescribed confidence level the value of F can be known. F calculated must be more than F tables in which case the variation in Y is explained and is not by chance. 3.'t'TEST The t-statistic indicates the significance (or not) of the regression coefficient of each independent variable. Independent variables which have a 't' value of less than the table 't' value at the degrees of freedom do not have a significant relationship with the dependent variable and therefore, contribute nothing to the equation. If t-3 calculated for a parameters is and t at 907. level at (n-k-1) degrees of freedom is 3.36 from tables, i.e. t3 calculated is less than t tabulated, the coefficient does not significantly differ from zero. Hence, variable a3 can be dropped and other combinations tried. PRECAUTIONS: The following precautions B.re to be taken in the development of linear regression models. a) Independent variables should not be intercorrelated b) All variables should be capable of clear interpretation and measurable c) The size of regression intercept in relation to the mean dependent variable Cy) is to be small. d) Signs must be logical 4. DEVELOPMENT OF LINEAR REGRESSION MODELS In the present investigation the water samples were analysed to

9 39 analysis and multiple linear regression modelling determine 13 different chemical properties. Sodium, Calcium, Potassium, Magnesium, Chlorides, Bicarbonates including Carbonates, Sulfates, Nitrates, Fluorides, hardness, total dissolved solids (TDS) sum of ions and ph. Since TDS is the single parameter which could re+lect the influence o-f all the dissolved constituents it is desirable to develop a model by means of which TDS could be predicted or computed given all or a -few of the twelve independent chemical constituents. In other words it is hypothesised that TDS = f<na,ca,k,mg,cl, (HC03+CD3), SQ4, NO 3, F) There could be several types of -functional -forms but multiple linear regressions is the most powerful and easily explainable model available in the literature. For successful development of such a model the causal variable, must be truly independent of each other as explained earlier. For this purpose - analysis was performed on the test battery and it was found that there are four basic dimensions which 3.re truly uncorrel ated. This was shown in Table 2. It is now proposed to utilise the result in selecting such variables which Are truly uncorrelated with a view to develop a number of regression models for possible selection. All the possible combinations were tried in this case. Regression models were developed with the help of a FORTRAN package and the models 2 were examined for R, F, T and intercept statistics. Those models which do not satisfy any of these statistics are considered as poor and hence rejected. It was interesting to note that none of the models which contained Nitrates, Sulphates, ph were found to be satisfactory. As a result of this experience it was decided to repeat the programme with a new set of independent variables. The tables containing these

10 K. S. V. Basivi Reddy & M. Panduranga Rao 40 Table 2: VARIABLE RECOGNISABLE TO BE ASSOCIATED WITH EACH OF THE FACTORS Seasons Vari abl e recoe qnisabl June, 1981 October, 1981 December, 1981 February, 1982 May 1982 I 2,6,11, 12,13 2,6,10, 11,12,13 2,6,11, 12,13 2,6,11 12,13 2, 6, ,12,13 II 3,4,5 7,9 1,7 4,5 1 III 9,10 4,5 3,4 9,10 8 IV 1,8 1,8 8 3,4,5 V 7 3,7 5 s 4 s 3 s = s 4 s programmes are lengthy and are not included in view of the space restrictions in the paper. The tables exhibiting regression coefficients and the various statistics -for interpretation purposes. The recommended models based on -final Multiple Regression Analysis are as follows; June 1981: (Premonsoon and Summer) l.tds = 4.54 Na say 4.5 Na TDS = 3.-3 Cl say 3 Cl TDS = 4.56 Na K say 4.5 HB l< TDS = 4.52 Na Mg say 4.5 Na + 22 Hg TDS = 4.4 Na Ca say 4.4 Na Ca TDS = 2.97 Cl HC Say 3 CI HC03-75 October 1981: (Postmonsoon) l.tds = 1.86 Na Mg say 1.80 Na + IS Mg + 400

11 41 analysis and multiple linear regression modelling December 1981: (Winter) l.tds = 3.28 Na say 3.3 Na TDS = 2.15 Cl say 2.15 Cl TDS = 3.04 Na Ca say 3 Na Ca TDS = 2.06 Cl HC say 2 Cl HDC TDS = 3.22 Na Mg say 3.2 CI + 14 Mg February 1982 : (Late Winter) l.tds = 3.9 Na TDS = 2.4 Cl TDS = 2.16 Hardness TDS = 3.75 Ca Mg TDS = 3.5 Na Ca May 1982: (Summer) l.tds = Na TDS = 2.5 Cl TDS = 3.75 Na Ca TDS = 3.75 Na Ca TDS = 2.5 Na HC03-60 In order to predict water quality throughout year attempts sre made for conducting regression analysis on the data -from 290 samples collected throughout the year over five seasons and the following models have been found satisfactory. TOTAL ANNUAL DATA: l.tds = 3.25 Na TDS = 2.25 Cl TDS = 3.2 Na K TDS = 3.00 Na Ca TDS = 6.40 Mg Cl TDS = 2.20 Cl HC TDS = 3.15 Na Mg TDS = 6 Mg Cl HC

12 K. S. V. Basivi Reddy & M. Panduranga Rao 42 On the whole, it appears, that sodium either independently or in association with K, Mg, Ca and bicarbonates are the causal variables for the determination o-f TDS variable. Alternatively chlorides, either independently or in association with bicarbonates and Mg also appear to be the important variables -for the contribution o-f TDS in the water with the help o-f these models. It is now possible to predict TDS in any water given one predictor value. One important application o-f this analysis is that the entire water quality can be predicted through a single simple test like speci-fic conductance which in turn clearly indicates the TDS.

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal yuppal@ysu.edu Sampling Distribution of b 1 Expected value of b 1 : Variance of b 1 : E(b 1 ) = 1 Var(b 1 ) = σ 2 /SS x Estimate of

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6 STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf

More information

Simple Linear Regression: One Quantitative IV

Simple Linear Regression: One Quantitative IV Simple Linear Regression: One Quantitative IV Linear regression is frequently used to explain variation observed in a dependent variable (DV) with theoretically linked independent variables (IV). For example,

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index

More information

Figure 1: The fitted line using the shipment route-number of ampules data. STAT5044: Regression and ANOVA The Solution of Homework #2 Inyoung Kim

Figure 1: The fitted line using the shipment route-number of ampules data. STAT5044: Regression and ANOVA The Solution of Homework #2 Inyoung Kim 0.0 1.0 1.5 2.0 2.5 3.0 8 10 12 14 16 18 20 22 y x Figure 1: The fitted line using the shipment route-number of ampules data STAT5044: Regression and ANOVA The Solution of Homework #2 Inyoung Kim Problem#

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

CHAPTER 3 Ionic Compounds. General, Organic, & Biological Chemistry Janice Gorzynski Smith

CHAPTER 3 Ionic Compounds. General, Organic, & Biological Chemistry Janice Gorzynski Smith CHAPTER 3 Ionic Compounds General, Organic, & Biological Chemistry Janice Gorzynski Smith CHAPTER 3: Ionic Compounds Learning Objectives: q Octet Rule & Predicting ionic Charges q Ionic Bonds q Formation

More information

STAT 360-Linear Models

STAT 360-Linear Models STAT 360-Linear Models Instructor: Yogendra P. Chaubey Sample Test Questions Fall 004 Note: The following questions are from previous tests and exams. The final exam will be for three hours and will contain

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Simple and Multiple Linear Regression

Simple and Multiple Linear Regression Sta. 113 Chapter 12 and 13 of Devore March 12, 2010 Table of contents 1 Simple Linear Regression 2 Model Simple Linear Regression A simple linear regression model is given by Y = β 0 + β 1 x + ɛ where

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Lect. 2: Chemical Water Quality

Lect. 2: Chemical Water Quality The Islamic University of Gaza Faculty of Engineering Civil Engineering Department M.Sc. Water Resources Water Quality Management (ENGC 6304) Lect. 2: Chemical Water Quality ١ Chemical water quality parameters

More information

Regression Analysis: Basic Concepts

Regression Analysis: Basic Concepts The simple linear model Regression Analysis: Basic Concepts Allin Cottrell Represents the dependent variable, y i, as a linear function of one independent variable, x i, subject to a random disturbance

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

ANSWERS: Atoms and Ions

ANSWERS: Atoms and Ions ANSWERS: Atoms and Ions 1) Available in April 2014 2) a) Atom Atomic No Electron arrangement of atom Electron arrangement of ion Ion symbol Ca 20 2,8,8,2 2,8,8 Ca 2+ F 9 2,7 2,8 F Cl 17 2,8,7 2,8,8 Cl

More information

STAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow)

STAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow) STAT40 Midterm Exam University of Illinois Urbana-Champaign October 19 (Friday), 018 3:00 4:15p SOLUTIONS (Yellow) Question 1 (15 points) (10 points) 3 (50 points) extra ( points) Total (77 points) Points

More information

Net Ionic Equations. Making Sense of Chemical Reactions

Net Ionic Equations. Making Sense of Chemical Reactions Making Sense of Chemical Reactions Now that you have mastered writing balanced chemical equations it is time to take a deeper look at what is really taking place chemically in each reaction. There are

More information

Chapter 14 Simple Linear Regression (A)

Chapter 14 Simple Linear Regression (A) Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables

More information

Salinity. foot = 0.305m yard = 0.91m. Length. Area m 2 square feet ~0.09m2. Volume m 3 US pint ~ 0.47 L fl. oz. ~0.02 L.

Salinity. foot = 0.305m yard = 0.91m. Length. Area m 2 square feet ~0.09m2. Volume m 3 US pint ~ 0.47 L fl. oz. ~0.02 L. Length m foot = 0.305m yard = 0.91m Area m 2 square feet ~0.09m2 Volume m 3 US pint ~ 0.47 L, L (liters) fl. oz. ~0.02 L Speed m/s mph Acceleration m/s 2 mph/s Weight kg, gram pound ~0.45kg Temperature

More information

Effect of rainfall and temperature on rice yield in Puri district of Odisha in India

Effect of rainfall and temperature on rice yield in Puri district of Odisha in India 2018; 7(4): 899-903 ISSN (E): 2277-7695 ISSN (P): 2349-8242 NAAS Rating: 5.03 TPI 2018; 7(4): 899-903 2018 TPI www.thepharmajournal.com Received: 05-02-2018 Accepted: 08-03-2018 A Baliarsingh A Nanda AKB

More information

(2) (1) (2) The isotopic composition of a sample of sulphur is found using a mass spectrometer.

(2) (1) (2) The isotopic composition of a sample of sulphur is found using a mass spectrometer. 1. (a) State the meaning of the terms relative atomic mass......... mass number...... (iii) isotopes......... The isotopic composition of a sample of sulphur is found using a mass spectromer. Explain how

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there

More information

Variance Decomposition and Goodness of Fit

Variance Decomposition and Goodness of Fit Variance Decomposition and Goodness of Fit 1. Example: Monthly Earnings and Years of Education In this tutorial, we will focus on an example that explores the relationship between total monthly earnings

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

IGCSE Double Award Extended Coordinated Science

IGCSE Double Award Extended Coordinated Science IGCSE Double Award Extended Coordinated Science Chemistry 4.0 - Chemical Formulae and Equations - the chemical symbols for the first 20 elements - And the charges of the ions they form - And use them to

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV Simple linear regression with one qualitative IV variable is essentially identical to linear regression with quantitative variables. The primary difference

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

Applied Regression Analysis

Applied Regression Analysis Applied Regression Analysis Chapter 3 Multiple Linear Regression Hongcheng Li April, 6, 2013 Recall simple linear regression 1 Recall simple linear regression 2 Parameter Estimation 3 Interpretations of

More information

FAQ: Linear and Multiple Regression Analysis: Coefficients

FAQ: Linear and Multiple Regression Analysis: Coefficients Question 1: How do I calculate a least squares regression line? Answer 1: Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables so that one variable

More information

Salinity. See Appendix 1 of textbook x10 3 = See Appendix 1 of textbook

Salinity. See Appendix 1 of textbook x10 3 = See Appendix 1 of textbook Length Area Volume m m m foot = 0.305m yard = 0.91m square feet ~0.09m2 US pint ~ 0.47 L fl. oz. ~0.02 L Speed m/s mph Acceleration m/s mph/s Weight kg, gram pound ~0.45kg Temperature o o See Appendix

More information

Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017

Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 PDF file location: http://www.murraylax.org/rtutorials/regression_anovatable.pdf

More information

Properties of Compounds

Properties of Compounds Chapter 6. Properties of Compounds Comparing properties of elements and compounds Compounds are formed when elements combine together in fixed proportions. The compound formed will often have properties

More information

Dimensionality Reduction Techniques (DRT)

Dimensionality Reduction Techniques (DRT) Dimensionality Reduction Techniques (DRT) Introduction: Sometimes we have lot of variables in the data for analysis which create multidimensional matrix. To simplify calculation and to get appropriate,

More information

Applied Regression Analysis. Section 2: Multiple Linear Regression

Applied Regression Analysis. Section 2: Multiple Linear Regression Applied Regression Analysis Section 2: Multiple Linear Regression 1 The Multiple Regression Model Many problems involve more than one independent variable or factor which affects the dependent or response

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Simple linear regression

Simple linear regression Simple linear regression Prof. Giuseppe Verlato Unit of Epidemiology & Medical Statistics, Dept. of Diagnostics & Public Health, University of Verona Statistics with two variables two nominal variables:

More information

FinQuiz Notes

FinQuiz Notes Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable

More information

SMAM 314 Exam 42 Name

SMAM 314 Exam 42 Name SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.

More information

Chapter 7 Case Studies with Regression. Jorge Luis Romeu IIT Research Institute June 24, 1999

Chapter 7 Case Studies with Regression. Jorge Luis Romeu IIT Research Institute June 24, 1999 Chapter 7 Case Studies with Regression Jorge Luis Romeu IIT Research Institute June 24, 1999 Executive Summary In this chapter we discuss the use of regression models through the development of four case

More information

Lecture 5: Linear Regression

Lecture 5: Linear Regression EAS31136/B9036: Statistics in Earth & Atmospheric Sciences Lecture 5: Linear Regression Instructor: Prof. Johnny Luo www.sci.ccny.cuny.edu/~luo Dates Topic Reading (Based on the 2 nd Edition of Wilks book)

More information

df=degrees of freedom = n - 1

df=degrees of freedom = n - 1 One sample t-test test of the mean Assumptions: Independent, random samples Approximately normal distribution (from intro class: σ is unknown, need to calculate and use s (sample standard deviation)) Hypotheses:

More information

Correlation and Regression

Correlation and Regression Correlation and Regression October 25, 2017 STAT 151 Class 9 Slide 1 Outline of Topics 1 Associations 2 Scatter plot 3 Correlation 4 Regression 5 Testing and estimation 6 Goodness-of-fit STAT 151 Class

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Multiple regression: Model building. Topics. Correlation Matrix. CQMS 202 Business Statistics II Prepared by Moez Hababou

Multiple regression: Model building. Topics. Correlation Matrix. CQMS 202 Business Statistics II Prepared by Moez Hababou Multiple regression: Model building CQMS 202 Business Statistics II Prepared by Moez Hababou Topics Forward versus backward model building approach Using the correlation matrix Testing for multicolinearity

More information

Model Building Chap 5 p251

Model Building Chap 5 p251 Model Building Chap 5 p251 Models with one qualitative variable, 5.7 p277 Example 4 Colours : Blue, Green, Lemon Yellow and white Row Blue Green Lemon Insects trapped 1 0 0 1 45 2 0 0 1 59 3 0 0 1 48 4

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Lecture 9: Linear Regression

Lecture 9: Linear Regression Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression

More information

Chapter 14. Linear least squares

Chapter 14. Linear least squares Serik Sagitov, Chalmers and GU, March 5, 2018 Chapter 14 Linear least squares 1 Simple linear regression model A linear model for the random response Y = Y (x) to an independent variable X = x For a given

More information

Chapter 15 Multiple Regression

Chapter 15 Multiple Regression Multiple Regression Learning Objectives 1. Understand how multiple regression analysis can be used to develop relationships involving one dependent variable and several independent variables. 2. Be able

More information

Seawater and Ocean Chemistry

Seawater and Ocean Chemistry Seawater and Ocean Chemistry Seawater Chemistry Water Seawater Salts in seawater Water Composition Properties Water is a chemical compound (H 2 O) comprising two atoms of hydrogen and one atom of oxygen,

More information

Regression. Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables X and Y.

Regression. Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables X and Y. Regression Bivariate i linear regression: Estimation of the linear function (straight line) describing the linear component of the joint relationship between two variables and. Generally describe as a

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

School of Mathematical Sciences. Question 1

School of Mathematical Sciences. Question 1 School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 8 and Assignment 7 Solutions Question 1 Figure 1: The residual plots do not contradict the model assumptions of normality, constant

More information

Confidence Interval for the mean response

Confidence Interval for the mean response Week 3: Prediction and Confidence Intervals at specified x. Testing lack of fit with replicates at some x's. Inference for the correlation. Introduction to regression with several explanatory variables.

More information

Linear models and their mathematical foundations: Simple linear regression

Linear models and their mathematical foundations: Simple linear regression Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction

More information

Understanding and Interpreting Soil and Plant Tissue Lab Reports

Understanding and Interpreting Soil and Plant Tissue Lab Reports Understanding and Interpreting Soil and Plant Tissue Lab Reports Dirk Holstege Director, UC Davis Analytical Laboratory dmholstege@ucdavis.edu 530-752-0148 224 Hoagland Hall UC Davis Anlab.ucdavis.edu

More information

Page 2. Define the term electron affinity for chlorine (2)

Page 2. Define the term electron affinity for chlorine (2) Q1.(a) Define the term electron affinity for chlorine. (b) Complete this Born Haber cycle for magnesium chloride by giving the missing species on the dotted lines. Include state symbols where appropriate.

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of

More information

Multiple Linear Regression

Multiple Linear Regression 1. Purpose To Model Dependent Variables Multiple Linear Regression Purpose of multiple and simple regression is the same, to model a DV using one or more predictors (IVs) and perhaps also to obtain a prediction

More information

[4+3+3] Q 1. (a) Describe the normal regression model through origin. Show that the least square estimator of the regression parameter is given by

[4+3+3] Q 1. (a) Describe the normal regression model through origin. Show that the least square estimator of the regression parameter is given by Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/1 40 Examination Date Time Pages Final June 2004 3 hours 7 Instructors Course Examiner Marks Y.P. Chaubey

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 24, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013

UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013 UNIVERSITY OF TORONTO SCARBOROUGH Department of Computer and Mathematical Sciences Midterm Test, October 2013 STAC67H3 Regression Analysis Duration: One hour and fifty minutes Last Name: First Name: Student

More information

CS 5014: Research Methods in Computer Science

CS 5014: Research Methods in Computer Science Computer Science Clifford A. Shaffer Department of Computer Science Virginia Tech Blacksburg, Virginia Fall 2010 Copyright c 2010 by Clifford A. Shaffer Computer Science Fall 2010 1 / 207 Correlation and

More information

STA 4210 Practise set 2a

STA 4210 Practise set 2a STA 410 Practise set a For all significance tests, use = 0.05 significance level. S.1. A multiple linear regression model is fit, relating household weekly food expenditures (Y, in $100s) to weekly income

More information

PubH 7405: REGRESSION ANALYSIS. MLR: INFERENCES, Part I

PubH 7405: REGRESSION ANALYSIS. MLR: INFERENCES, Part I PubH 7405: REGRESSION ANALYSIS MLR: INFERENCES, Part I TESTING HYPOTHESES Once we have fitted a multiple linear regression model and obtained estimates for the various parameters of interest, we want to

More information

Lecture 3 questions Temperature, Salinity, Density and Circulation

Lecture 3 questions Temperature, Salinity, Density and Circulation Lecture 3 questions Temperature, Salinity, Density and Circulation (1) These are profiles of mean ocean temperature with depth at various locations in the ocean which in the following (a, b, c) figures

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

Questions for "Reaction Bingo" 1. The starting substances in a chemical reaction.

Questions for Reaction Bingo 1. The starting substances in a chemical reaction. Chemical Reactions Bingo, April 2011 1 Questions for "Reaction Bingo" 1. The starting substances in a chemical reaction. 2. A single compound gets broken apart in this type of reaction. (one of the 5 types

More information

Chemistry 222 Fall 2015 Exam 2: Chapters 5,6,7 80 Points

Chemistry 222 Fall 2015 Exam 2: Chapters 5,6,7 80 Points Chemistry 222 Fall 2015 Exam 2: Chapters 5,6,7 80 Points Name Complete two (2) of problems 1-3, problem 4, and three (3) of problems 5-8. CLEARLY mark the problems you do not want graded. You must show

More information

NEW DIAGRAM USEFUL FOR CLASSIFICATION OF GROUNDWATER QUALITY

NEW DIAGRAM USEFUL FOR CLASSIFICATION OF GROUNDWATER QUALITY NEW DIAGRAM USEFUL FOR CLASSIFICATION OF GROUNDWATER QUALITY Elhag A.B Department of Civil Engineering, College of Engineering, King Khalid University, Saudi ABSTRACT: Due to human and human activities

More information

Density Temp vs Ratio. temp

Density Temp vs Ratio. temp Temp Ratio Density 0.00 0.02 0.04 0.06 0.08 0.10 0.12 Density 0.0 0.2 0.4 0.6 0.8 1.0 1. (a) 170 175 180 185 temp 1.0 1.5 2.0 2.5 3.0 ratio The histogram shows that the temperature measures have two peaks,

More information

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups

One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups One-Way Analysis of Variance: A Guide to Testing Differences Between Multiple Groups In analysis of variance, the main research question is whether the sample means are from different populations. The

More information

Transition Pack for A Level Chemistry

Transition Pack for A Level Chemistry Transition Pack for A Level Chemistry Get ready for A-level! A guide to help you get ready for A-level Chemistry, including everything from topic guides to days out and online learning courses. Commissioned

More information

Half Yearly Exam 2015

Half Yearly Exam 2015 GOZO COLLEGE Secondary School KULLEĠĠ TA GĦAWDEX Skola Sekondarja Half Yearly Exam 015 Year 9 Track 3 CHEMISTRY Time: 1½ hours Name: Class: Useful Data: Atomic numbers and relative atomic masses are given

More information

Biostatistics 380 Multiple Regression 1. Multiple Regression

Biostatistics 380 Multiple Regression 1. Multiple Regression Biostatistics 0 Multiple Regression ORIGIN 0 Multiple Regression Multiple Regression is an extension of the technique of linear regression to describe the relationship between a single dependent (response)

More information

AMS 315/576 Lecture Notes. Chapter 11. Simple Linear Regression

AMS 315/576 Lecture Notes. Chapter 11. Simple Linear Regression AMS 315/576 Lecture Notes Chapter 11. Simple Linear Regression 11.1 Motivation A restaurant opening on a reservations-only basis would like to use the number of advance reservations x to predict the number

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Univariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data?

Univariate analysis. Simple and Multiple Regression. Univariate analysis. Simple Regression How best to summarise the data? Univariate analysis Example - linear regression equation: y = ax + c Least squares criteria ( yobs ycalc ) = yobs ( ax + c) = minimum Simple and + = xa xc xy xa + nc = y Solve for a and c Univariate analysis

More information

ECON 450 Development Economics

ECON 450 Development Economics ECON 450 Development Economics Statistics Background University of Illinois at Urbana-Champaign Summer 2017 Outline 1 Introduction 2 3 4 5 Introduction Regression analysis is one of the most important

More information

****************************************************************************

**************************************************************************** **************************************************************************** To quickly summarize: 1. The solubility of a compound is decreased when an ion which is the same as one of the ions in the compound

More information

Compounds. Part 1: Types of Compounds & Bonding

Compounds. Part 1: Types of Compounds & Bonding Compounds Part 1: Types of Compounds & Bonding Review In their natural state, atoms have no overall charge. 18 Ar Argon 40 Protons = 18 Electrons = 18 This is because the number of protons (+) equals the

More information

Additional Chapter 7 Homework Problems: Due with chapter 7 homework, show your work for full credit!

Additional Chapter 7 Homework Problems: Due with chapter 7 homework, show your work for full credit! Additional Chapter 7 Homework Problems: Due with chapter 7 homework, show your work for full credit! Note: If you are struggling with these, see the chapter 7 worksheet titled: Molarity, Molality, Osmolality,

More information

Lesson on Electrolysis

Lesson on Electrolysis Lesson on Electrolysis This lesson package includes a lesson plan, a worksheet for students, and teachers notes on the worksheet. Activity Lesson 1 (50 min-2 Period lesson) Teacher explains (page 1 to

More information

Exam practice mark scheme C2: Discovering chemistry

Exam practice mark scheme C2: Discovering chemistry Exam practice mark scheme C: Discovering chemistry Foundation Tier (a)(i) Photo chlorine Any one correct for one mark Photo iodine Photo bromine Allow only one line from each photo and to each name Group

More information

Interaction effects for continuous predictors in regression modeling

Interaction effects for continuous predictors in regression modeling Interaction effects for continuous predictors in regression modeling Testing for interactions The linear regression model is undoubtedly the most commonly-used statistical model, and has the advantage

More information

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable,

PART I. (a) Describe all the assumptions for a normal error regression model with one predictor variable, Concordia University Department of Mathematics and Statistics Course Number Section Statistics 360/2 01 Examination Date Time Pages Final December 2002 3 hours 6 Instructors Course Examiner Marks Y.P.

More information

SeCtiOn 7 [STOCK AND CUSTOM] Ion Chromatography Single and Multi-Element Standards

SeCtiOn 7 [STOCK AND CUSTOM] Ion Chromatography Single and Multi-Element Standards SeCtiOn 7 [STOCK AND CUSTOM] Ion Chromatography Single and Multi-Element Standards Your Science is Our Passion. Ion Chromatography Standards As with SPEX CertiPrep s Assurance Standards, every IC Standard

More information

CHAPTER 4 CRITICAL GROWTH SEASONS AND THE CRITICAL INFLOW PERIOD. The numbers of trawl and by bag seine samples collected by year over the study

CHAPTER 4 CRITICAL GROWTH SEASONS AND THE CRITICAL INFLOW PERIOD. The numbers of trawl and by bag seine samples collected by year over the study CHAPTER 4 CRITICAL GROWTH SEASONS AND THE CRITICAL INFLOW PERIOD The numbers of trawl and by bag seine samples collected by year over the study period are shown in table 4. Over the 18-year study period,

More information

AP Chemistry Summer Assignment

AP Chemistry Summer Assignment AP Chemistry Summer Assignment AP Chemistry Students: This summer you are responsible for the following assignments: 1. You need to master the formulas, charges, and names of the common ions. On the second

More information

Chapter 7. Chemical Equations and Reactions

Chapter 7. Chemical Equations and Reactions Chemical Equations and Reactions Chemical and Physical Changes In a physical change, the chemical composition of the substance remains constant. Examples of physical changes are the melting of ice or the

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Unit 7, Lesson 08: The ph of Salt Solutions, Answers

Unit 7, Lesson 08: The ph of Salt Solutions, Answers 1. Complete the following chart: Unit 7, Lesson 08: The ph of Salt Solutions, Answers on NH 4 PO 3 3- Parent Acid or Base s the parent strong or weak? Will this ion hydrolyze? f the ion will hydrolyze

More information

Experimental design. Matti Hotokka Department of Physical Chemistry Åbo Akademi University

Experimental design. Matti Hotokka Department of Physical Chemistry Åbo Akademi University Experimental design Matti Hotokka Department of Physical Chemistry Åbo Akademi University Contents Elementary concepts Regression Validation Hypotesis testing ANOVA PCA, PCR, PLS Clusters, SIMCA Design

More information

Reactants: Products: Definition:

Reactants: Products: Definition: Definition: A chemical reaction is a process in which one or more substances are changed to form new chemical substance(s) with different physical and chemical properties. Definition: A chemical reaction

More information