Considering Numerical Error Propagation in Modeling and Regression of Data

Size: px
Start display at page:

Download "Considering Numerical Error Propagation in Modeling and Regression of Data"

Transcription

1 Sesson 2520 Consderng Numercal Error Propagaton n Modelng and Regresson of Data Nema Brauner, Mordecha Shacham Tel-Avv Unversty/Ben-Guron Unversty of the Negev Abstract The use of user-frendly nteractve regresson software enables undergraduate engneerng students to reach a hgh level of sophstcaton n regresson, correlaton and analyss of data. In order to nterpret correctly the results, the students must be famlar wth potental causes for poor fts n correlatons, should be able to recognze a poor correlaton and mprove t f possble. They should also be aware of the practcal consequences of usng a correlaton whch has no statstcal valdty. In ths paper, the harmful effects of numercal error propagaton (resultng from collnearty among the ndependent varables) are explaned and demonstrated. Smple methods for mnmzng such error propagaton n polynomal regresson are ntroduced. Ths materal can be presented, for example, as part of 3rd year undergraduate mathematcal modelng and numercal methods course. Introducton Realstc modelng and accurate correlaton of expermental data are essental to sound engneerng desgn. Many of the statstcal technques for analyzng the accuracy of the correlatons have been known for several decades (see, for example, Draper and Smth, 1981, Hmmelblau, 1970, Bates and Watts, 1988 and Noggle, 1993). But, untl recently, those technques have not been utlzed n a sgnfcant level n undergraduate engneerng educaton. One of the man reasons for not utlzng those technques was that statstcal tests usually yeld numbers (varance, standard devaton, correlaton coeffcent, etc.). The meanng of these numbers can be easly msnterpreted f the statstcal theory and the assumptons made n developng the tests are not well understood. The emergence of software packages wth nteractve regresson and statstcal analyss capabltes (such as POLYMATH, MATLAB, MATHEMATICA, EXCEL) whch provdes both numercal and graphcal output changes the stuaton. These software packages enable undergraduate engneerng students, wth moderate statstcal background, to carry out rgorous regresson and statstcal analyss of data. They are able to select the most approprate correlaton model and test ts statstcal valdty usng resdual and confdence regon plots. They can analyze the qualty and precson of the laboratory data by plottng one ndependent varable versus the others to detect hdden collnearty that may exst among the varables. Page

2 Shacham et al (1996) had descrbed a set of lectures and exercses that s used to ntroduce freshman engneerng students to the bascs of data modelng and analyss usng nteractve software packages. Ths materal s ncluded n an ntroductory computng course or as part of an ntroductory engneerng course. The ntroducton to data modelng and analyss (descrbed by Shacham et al, 1996) ncludes the followng subjects: 1. Basc statstcal concepts. 2. Dscrmnaton between real expermental data and smoothed nterpolated data. 3. Usng resdual plots and confdence ntervals for selectng the most approprate model. 4. The dangers of extrapolaton, n partcular, when a non-theory based model s used. Ths ntroductory materal s very helpful to the students for modelng and analyzng ther own data. However, they may need more advanced materal when dealng wth, for example, models contanng large numbers of parameters. In ths paper, more advanced materal related to regresson s presented. The dscusson ncludes models whch are comprsed of a sum of functons of the same ndependent varable (as n polynomal regresson). Varous effects of the nterdependency between these functons are descrbed and demonstrated. The materal presented s taught to thrd year, undergraduate Chemcal Engneerng students at the Ben Guron Unversty as part of a mathematcal modelng and numercal methods course. The calculatons nvolved n solvng the examples presented have been carred out usng the POLYMATH 4.0 (Shacham and Cutlp, 1996) and MATLAB (MathWorks, 1992) packages, but other smlar packages can be used for ths purpose. Lnear Regresson wth Models Comprsng of Functons of One Independent Varable Let us assume that there s a set of N data ponts of a dependent varable (measured varable, such as vapor pressure, vscosty, heat capacty, etc.), y versus an ndependent varable (controlled varable, such as temperature,. concentraton, pressure) x, = 1, 2, N. A regresson model comprsed of a lnear combnaton of n dfferent functons of the ndependent varable s consdered. Thus, the regressors are x 1 = f 1 (x), x 2 = f 2 (x) x n = f n (x). For nstance, n polynomals x 1 = x 0, x 2 = x, x n = x n 1. A lnear model ftted to the data s of the form: y = β 0 +β 1 x 1 + β 2 x 2 + βn x n + (1) where β 0, β 1, β n are the parameters of the model and s a measurement error n y. It s assumed that s ndependently and dentcally (..d.) dstrbuted. The vector of estmated parameters ˆ T β = (ˆ β, ˆ ˆ 0 β1κ βn ) s usually calculated usng the least squares error approach, by mnmzng the followng functon: Page

3 S 2 = N [ y ( β0 +β1x1 +β2x2 +βn xn )] = 1 2 Λ (2) If the parameters appear n a lnear expresson (as n eq. (1)), the mnmzaton can be carred out by solvng a set of smultaneous lnear algebrac equatons (the normal equaton): T T X Xβˆ = X y (3) The columns of X are: x 0 = 1, x 1, x 2 x n and X T X = A s the normal matrx. To check the goodness of the ft between the observed y and estmated ŷ values of the dependent varable, they can be plotted versus x (when there s a sngle ndependent varable) or versus, the pont number (when there are several ndependent varables). The dstance between the observed and estmated values can serve as an ndcaton for the qualty of the ft. These dstances can be amplfed usng a resdual plot. In the resdual plot, the model error (resdual) εˆ s plotted usually versus y, where: ε ˆ = y yˆ (4) A random dstrbuton of the resduals around zero ndcates that the model correctly represent the partcular set of data. A defnte trend or pattern n the resdual plot may ndcate ether a lack of ft of the model, or that the assumpton of random error dstrbuton for the dependent varable s ncorrect. In some cases (for example, n cases where the value of the dependent varable changes by several orders of magntude over the range of nterest) the relatve error s dstrbuted normally. The relatve error s defned as: εˆ εˆ r = (5) y The approprate transformaton, whch results n mnmzaton of the relatve error n a regresson, s takng the logarthm of both sdes of the model equaton. It should be emphaszed, however, that when the varables are transformed, the resdual plot must be constructed usng the transformed form of the dependent varable, n order to account for the change n the error dstrbuton ntroduced by the transformaton. A numercal ndcator for the qualty of the ft whch s used most frequently s the square of standard error of the estmate, whch represents the sample varance, and s gven by: s 2 = N = 1 ( y yˆ ) N n 1 2 (6) Page

4 Thus, the sample varance s the sum of squares of errors dvded by the degrees of freedom (where the number of parameters, n+1, s subtracted from the number of data ponts, N) and s a measure for the varablty of the actual y values from the predcted ŷ values. Smaller varance ndcates a better ft of the model to the data. Confdence ntervals (n partcular the 95% confdence nterval) on the parameter values can be very useful ndcators of the ft between the model and the data. A better ft and more precse data lead to narrow confdence ntervals, whle a poor model ft and/or mprecse data result n wde confdence ntervals. Furthermore, confdence ntervals whch are larger (n absolute value) than the respectve parameters themoften ndcate that the model contans too many parameters. Confdence nterval s defned by: j ( ν, α) s a jj β j < βˆ j + t( ν α) s a jj β ˆ t, (7) where t (ν, α) s the statstcal t dstrbuton correspondng to ν degrees of freedom (ν=n (n+1)) and a desred confdence level, α and s s the standard error of the estmate. If the number of data ponts s large enough, the value of t approaches a constant value (for ν > 15, t 2 for the 95% confdence nterval). Therefore, t (ν, α) s often omtted. The term s a jj s called the standard error of the estmate of parameter β. One of the assumptons of the least squares error approach s that there s no error n the ndependent varables. Ths s rarely true, however. Most reports on expermental measurements nclude estmated error n the ndependent varable. If such an estmate s not ncluded, a lower lmt on the error can be estmated from the number of decmal dgts n whch the data are reported. (For example f the temperature s reported wth one dgt after the decmal pont n degrees K, then the error s at least ± 0.05K). Thus, the true value of an ndependent varable can be represented by: x = ~ x + δx (8) where x~ s the expected measured value and δ x s the error (or uncertanty) n ts value. The error n the ndependent varable s, of course, carred over to ts functons. The error n the dfferent functons can be estmated from: f j δx j δx (9) x x= x where δx j s the estmated error n the jth functon of x. The errors n x and ts functons are carred over to the normal matrx and to the X T y term n eq. (3). These errors can propagate durng the soluton process of eq. (3) yeldng naccurate and unstable parameter estmates. Denotng b = X T y, the errors n the calculated parameter values, δ βˆ are bounded by (p. 176 n Dahlqust et al, 1974): Page

5 βˆ δa δ κ( A ) (10) A βˆ +δβˆ where κ (A) s the condton number of the normal matrx and δa s the matrx of errors n A. A smlar equaton relates the error n b, δb, to the error δβ ˆ : δβˆ βˆ κ b ( A) δ b (11) The condton number s the rato of the largest to the smallest egenvalue of A. If the condton number s large, the δa and δb are amplfed consderably n the calculaton of βˆ, yeldng very poor estmate of the true parameter values. Large condton numbers result from lnear dependency (collnearty) between the varous (supposedly) ndependent varables. The normal matrx whch has large condton number s called ll-condtoned. The problems of collnearty has been extensvely dscussed n the lterature (see, for example, Shacham and Brauner (1997), Brauner and Shacham (1998)). In the next three sectons, the practcal results of collnearty, ll-condtonng and numercal error propagaton wll be demonstrated usng a vapor pressure correlaton example. Vapor Pressure Models and Data There are several models whch are beng extensvely used for correlatng vapor pressure versus temperature data. Three types of equatons wll be used n ths work. The frst s a polynomal: P T T 2 T n = β0 + β1 + β2 Λ βn 1 (12) ln + where T s temperature (K), P s pressure (k Pa) and n s the order of the polynomal. The precson and stablty of polynomal regresson can often be ncreased by transformng the data. The followng transformatons (whch were nvestgated by Shacham and Brauner (1997)) wll be used. Normalzaton of the temperature: ι = T /T max (13) where T max s the largest temperature value n the data set. The w transformaton defned by: w = (T T mn )/( T max T mn ) (14) whch yelds values n the rage 0 w 1, and the z transformaton Page

6 z 2T T T T T max mn = (15) max mn whch yelds varable dstrbuton n the range of -1 z 1. The Redel equaton s a four parameter equaton, whch s used n several slghtly dfferent forms. The followng defnton s used n ths work ln β τ 1 2 π = β0 + + β2 ln τ + β3τ (16) where π s the normalzed pressure π = P /P max. When only the frst two terms of the equaton are used, the model reduces to what s known as Clapeyron s equaton. The Wagner (1973) equaton s consdered the most accurate equaton, wth the smallest number of constants, for correlatng vapor pressure data n a wde range of temperatures (between the trple pont and the crtcal temperatures). Whle the number of terms and the exponents of the dfferent terms n Wagner s equaton may change, the most wdely used form of ths equaton s: ln P R = 1 P R [ β ( 1 T ) + β ( 1 T ) + β ( 1 T ) + β ( T ) ] 6 0 R 1 R 2 R 3 1 R (17) where T R = T/T c s the reduced temperature, P R = P/P c s the reduced pressure, T c s the crtcal temperature (K) and P c s the crtcal pressure (k P a ). To enable dscrmnaton between possble dfferent sources of naccuracy n a. regresson model, exact data generated by Wagner s equaton wll be used rather than real expermental data. Ths way, the precson of the data can be controlled by ntroducng randomly dstrbuted error (nose) to the varables. Table 1 shows the crtcal constants and the Wagner equaton coeffcents for Toluene, the substance whch s used n ths study. Three dfferent data sets were generated usng the Wagner equaton wth the constants gven n Table 1. In each set, 21 data ponts were generated n the vcnty of the normal bolng pont. The three sets are coverng temperature ranges of 100K, 50K and 20K. Mnmal and maxmal values of T, P, ι and π for the three data sets are also shown n Table 2. In Fgure 1, the vapor pressure of Toluene n the range of 310K-590K as calculated from Wagner s equaton s dsplayed. It can be seen that the vapor pressure changes by more than four orders of magntude over ths temperature range. The temperature ranges correspondng to the three data sets are also marked on the fgure. These data sets wll be regressed usng the varous vapor pressure models. Page

7 Table 1: Crtcal Propertes and the Wagner Equaton Constants for Toluene (McGarry (1983)). Meltng pont temperature K Normal bolng pont temperature K Crtcal temperature K Crtcal pressure kPa Wagner constants β β β β Reference: Weast (1978) Table 2: Mnmal and Maxmal Values for the Vapor Pressure Data Sets. Data Set 1 Data Set 2 Data Set 3 Range = 100K Range = 50K Range = 20K T max (K) T mn (K) ι max ι mn P max (kpa) P mn (kpa) π max π mn Page

8 Fgure 1: Vapor pressure of toluene n the 310K - 590K temperature range. Fttng Polynomals to the Data An equaton n the form of polynomal s often used for representng the relatonshp between an ndependent varable and a dependent varable. The resultng correlaton s an emprcal model, whch often lacks a theoretcal bass. Some of the ptfalls of usng such a model for correlatng vapor pressure data were dscussed by Shacham et al (1996). Recently, theoretcal aspects of the stablty of parameters estmaton as functon of range and precson of the data and the number of terms n the polynomal have been extensvely studed (Shacham and Brauner, 1997). The vapor pressure data can be used to demonstrate the practcal results predcted by these theoretcal studes. The model of eq. (2) wth data set 1 (range of 100K) were used n all polynomal regresson studes. The calculatons were carred out usng MATLAB (double precson computaton wth approxmately 16 decmal dgts accuracy) where eq. (3) s explctly solved for βˆ. Table 3 shows the coeffcents, varance and condton numbers of the X T X matrx for polynomals up to the 6th orders when the regresson s done on T and P wthout any normalzaton or transformaton. It can be seen that the varance decreases from for a lnear dependency to for 4th order polynomal of T. Increasng the order of the polynomal further causes a sharp ncrease of the varance to for the 6th order polynomal. The standard errors of the parameters (not shown) ndcate a smlar trend. They are smaller by about two orders of magntude then the parameter values for polynomals up to the 4th order. For the 5th order polynomal (and hgher orders), the standard errors become larger then the parameter values. Page

9 Table 3: Coeffcents, Varance and Condton Number n Polynomal Regresson of the Vapor Pressure Data Order cons β E E E E E E+01 β E E E E E E-01 β E E E E E-04 β E E E E-08 β E E E-09 β E E-12 β E-15 s E E E E E E-02 κ(a) E E E E E E+34 The practcal sgnfcance of such varatons n the varances and the standard errors can be seen n Fgures 2 and 3. Fgure 2 shows the relatve error n the calculated pressure values for 1st, 2nd and 6th order polynomals versus the expermental pressure. For the 1st order polynomal, there s a clear trend of the error and t reaches a maxmum value of about 16%. For the 2nd order polynomal, the error s much smaller and t hardly reaches 1%. Errors n the 3rd and 4th order polynomals cannot even be observed n the scale of Fgure 2. For the 6th order polynomal, there s agan a clear trend n the error and the maxmal error reaches 30%. The same trend s even more pronounced when lookng at the errors n the calculated dervatve values (Fgure 3). The maxmal error n the dervatve values usng the straght lne correlaton reaches only about 50%, t reduces to 0.1% for the 4th order polynomal, whereas wth the 6th order polynomal t ncreases up to 65% error. Thus, usng a too hgh order polynomal (often called overcorrelaton) yelds even poorer results n the dependent varable values and ts dervatve values then usng a polynomal of a too low order. Fgure 2: Relatve error n the calculated pressure values (polynomal regresson). Page

10 Fgure 3: Relatve error n calculated pressure dervatves (polynomal regresson). In tryng to determne what causes the poor results obtaned wth hgh order polynomals, the condton number of the normal matrx (shown n Table 3) can provde valuable nformaton. The value of the condton number s for the 1st order polynomal and t ncreases to for the 6th order polynomal. The MATLAB program, whch reles on the condton number n determnng the condtonng of the normal matrx, starts ssung warnng messages Matrx s close to sngular or badly scaled. Results may be naccurate already for the 3rd order polynomal. Does the 4th order polynomal represent the lmt of precson for ths correlaton? To answer ths queston, regressons have been carred out usng the varous transformatons of the temperature as defned n eqs. (13), (14) and (15). The results of these regressons are summarzed n Fg. 4, whch shows the varance as functon of the polynomal order for the varous transformatons. Wth no-transformaton or wth a smple normalzaton, a. steady decrease of the varance up to the 4th order polynomal s obtaned, whereby each addtonal term n the polynomal reduces the varance by about two orders of magntude. Startng at the 5th order polynomal, the normal matrx becomes ll-condtoned, whch causes a sharp ncrease n the varance. Wth the w and z transformatons, however, the same rate of decrease of the varance extends to the 6th order polynomal. For the w transformaton, the varance ncrease starts usng a 7th order polynomal, whle for the z transformaton, the varance reaches a steady mnmal level at the 8th order polynomal. Thus, usng the z-transformaton enables reducng the varance of the best ft to usng the 8th order polynomal from the mnmal value of obtaned wth the 4th order polynomal and the untransformed orgnal data. Page

11 Fgure 4: Varance n polynomal regresson usng varous transformatons of the temperature data. Fgure 5: Condton numbers n polynomal regresson usng varous transformatons of the temperature data. Fgure 5 shows the logarthm of the condton number as functon of the polynomal order for the varous transformaton. The logarthm of the condton number ncreases lnearly as the polynomal order ncreases. (The only excepton s the case of no transformaton where numercal error propagaton prevents obtanng accurate values for the condton numbers for Page

12 hgh order polynomals). The condton numbers are mnmal for the z-transformaton and ncrease n the followng order: w-transformaton, normalzaton and no transformaton. The reducton of the condton number correspondng to a partcular polynomal order when usng the w and z transformaton (nstead of normalzaton) s reflected n the lower varance values acheved wth these transformatons. It s to be noted, however, that the reducton n the condton number when usng normalzed data nstead of the orgnal data, has no sgnfcant effect on the order of the best-ft polynomal and ts varance. Fttng Clapeyron s and Redel s Equatons to the Data Clapeyron s equaton (the frst two terms n eq. (5)) and Redel s equaton are both frequently used for modelng of vapor pressure data. To nvestgate the effects of the range and precson of the ndependent varable data, both equatons were ftted usng the three data sets shown n Table 2. The data was generated usng Wagner s equaton, but for ths study, a random and normally.dstrbuted error was ntroduced nto the temperature data. Three cases were tested wth the followng error levels: δ T = , 0.05 and 0.5K. Usng the hghest precson (δt = K) and the wdest temperature range (100K) data wth Redel s model yelds a very accurate correlaton. The parameter values obtaned are β 0 = , β 1 = , β 2 = , β 4 = , the varance s and the standard errors of the parameters are smaller by three orders of magntude than the parameter values. The resultant maxmal error n the calculated pressure value s kpa. For the same data set, Clapeyron s equaton yelds a much worse ft. The resultant parameter values are β 0 = , β 1 = , the varance s and the maxmal error n the calculated pressure s kpa. Thus, the use of the four parameter Redel s equatons adds approxmately three more decmal dgts to the accuracy of the correlaton. To observe the effects of the range and precson of the temperature data on the accuracy of the correlaton, the varous data sets were regressed usng Clapeyron s and Redel s models. Fgure 6 shows the varance of the ft obtaned wth Clapeyron s equaton as functon of the range wth the error level δt as parameter. It can be seen that for the hgh precson temperature data (δt = K), the varance s for the 100K range. It reduces to for the 50K temperature range and further to for the 20K range. Ths s the trend that could have been expected n case the varance results from a lack of ft of a model (due to a lmted number of parameters). In such a case, the model can better represent the data n a narrower range of temperature and the varance s expected to decrease accordngly. However, the trend s dfferent for the low precson data (δt = 0.5K). For the 100K range, the varance s , t reduces to for the 50K range and ncreases to for the 20K range. For ths low precson data, numercal error propagaton (resultng from collnearty) s ntensfed wth reducng the range and for the range of 20K, t domnates over a better model ft whch could have been acheved n a narrower range. Fgure 7 shows the varance of the Redel correlaton as functon of the temperature range wth the error level δt as parameter. It can be seen that for all the δt s studed, reducng the range from 50K to 20K affects an of the varance. For the low precson data, Redel s model yelds even a larger varance than that of Clapeyron s equaton. Page

13 Fgure 6: Change of the varance as functon of range for Clapeyron s equaton. Fgure 7: Change of the varance as functon of range for Redel s equaton. Ths study clearly demonstrates that the optmal number of terms/parameters n a model s a functon of the range and precson of the ndependent varable data. When reducng the range mproves the accuracy of the correlaton consderably, the use of a dfferent model (conssts of more terms/parameters) should be consdered. But, when range reducton does not mprove the accuracy, addng more terms to the model wll probably lead to over-correlaton. The harmful effects of over-correlaton have been demonstrated n the prevous secton wth reference to the polynomal regresson. Page

14 Conclusons Numercal error propagaton, resultng from collnearty between varous functons of the same ndependent varable, can severely lmt the accuracy of a correlaton. Because of the complex theory nvolved, demonstratng to undergraduate students the causes and practcal sgnfcance of such naccuraces and dentfyng methods to mprove the accuracy can represent a great challenge. In ths paper, the graphcal and nteractve capabltes of two numercal computaton packages (MATLAB and POLYMATH) have been used to demonstrate these topcs. It was shown that n polynomal regresson, the hghest order of polynomal to be used s often lmted by numercal error propagaton. Dsregardng ths lmt can yeld large errors n the calculated values of the dependent varable and even more severe errors n ts dervatves. The condton number of the normal matrx provdes nformaton regardng the rate of the error propagaton, but ths nformaton can be naccurate and sometmes even msleadng. Collnearty n polynomal regresson can be sgnfcantly reduced by usng data transformatons, n partcular the z-transformaton, whch transform the ndependent varable data to the [-1, +1] range. To determne whether t s collnearty that lmts the hghest order polynomnal and the accuracy of the ft, t s advsable to repeat the regresson usng z-transformaton of the ndependent varable data. If the transformed data allows fttng by a hgher order polynomal, collnearty s a lmtng factor, otherwse other reasons for the low accuracy must be sought. Usng the two-parameter Clapeyron equaton and the four-parameter Redel equaton, t was shown that there s not a sngle model whch s superor n representng a partcular type of data. For correlatng low precson and/or narrow range data of vapor pressure, the Clapeyron equaton s more approprate than the Redel equaton. Whereas, for hgh precson and wde range data, Redel s equaton s more approprate. If reducng the range results n a sgnfcant reducton of the varance, addng more terms/parameters to the model can mprove the accuracy of the correlaton. But, addng more terms when range reducton does not lead to a better ft can result n over-correlaton. References 1. Bates DM and Watts DG (1988), Nonlnear Regresson Analyss and ts Applcaton, John Wley & Sons, New York. 2. Brauner N and Shacham M (1998), The Role of Range and Precson of the Independent Varable n Regresson of Data, AIChE J., 44(3), Dahlqust A, Björk A and Anderson N (1974), Numercal Methods, Prentce-Hall, Englewood Clffs, N.J. 4. Draper NR and Smth H (1981), Appled Regresson Analyss, John Wley & Sons, New York. 5. Hmmelblau DM (1970), Process Analyss by Statstcal Methods, John Wley & Sons, New York. 6. Math Works, Inc. (1992), The Student Edton of MATLAB ; Prentce-Hall, Englewood Clffs, N.J. 7. McGarry J (1983), Correlaton and Predcton of the Vapor Pressures of Pure Lquds over Large Pressure Ranges, Ind. Eng. Chem. Process. Des. Dev, 22, Noggle JH (1993), Practcal Curve Fttng and Data Analyss, Prentce-Hall, Englewood Clffs, N.J. Page

15 9. Shacham M and Brauner N (1997), Mnmzng the Effects of Collnearty n Polynomal Regresson, Ind. & Chem. Res. 36, (10), Shacham M, Brauner N and Cutlp MB (1996), Replacng Graph Paper wth Interactve Software n Modelng and Analyss of Expermental Data. Comp. Appl. Eng. Edu. 4(3), Shacham M and Cutlp MB (1996), POLYMATH 4.0 User s Manual, CACHE corporaton, Austn, TX. 12. Wagner W (1973) New Vapor Pressure Measurements for Argon and Ntrogen and a New Method for Establshng Ratonal Vapor Pressure Equatons, Cryogencs 13, Weast RC (Ed) (1975), Handbook of Chemstry and Physcs, 56th Ed, CRC Press, Oho. NEIMA BRAUNER receved her BSc and MSc from the Technon, Israel Insttute of Technology, and her PhD from the Unversty of Tel-Avv. She s currently a Professor n the Flud Mechancs and Heat Transfer Department. She teaches courses n Mass and Heat Transfer and Process Control. Her man research nterests nclude hydrodynamcs and transport phenomena n two-phase systems and appled regresson analyss. MORDECHAI SHACHAM s a Professor n the Chemcal Engneerng Department at the Ben Guron Unversty of the Negev, Beer-Sheva, Israel. He receved hs BSc and DSc from the Technon, Israel Insttute of Technology. Hs research nterests nclude appled numercal and statstcal methods, computer-aded nstructon, chemcal process smulaton, desgn and optmzaton. Page

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors Multple Lnear and Polynomal Regresson wth Statstcal Analyss Gven a set of data of measured (or observed) values of a dependent varable: y versus n ndependent varables x 1, x, x n, multple lnear regresson

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

Chapter 15 Student Lecture Notes 15-1

Chapter 15 Student Lecture Notes 15-1 Chapter 15 Student Lecture Notes 15-1 Basc Busness Statstcs (9 th Edton) Chapter 15 Multple Regresson Model Buldng 004 Prentce-Hall, Inc. Chap 15-1 Chapter Topcs The Quadratc Regresson Model Usng Transformatons

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Chapter 15 - Multiple Regression

Chapter 15 - Multiple Regression Chapter - Multple Regresson Chapter - Multple Regresson Multple Regresson Model The equaton that descrbes how the dependent varable y s related to the ndependent varables x, x,... x p and an error term

More information

Chapter 9: Statistical Inference and the Relationship between Two Variables

Chapter 9: Statistical Inference and the Relationship between Two Variables Chapter 9: Statstcal Inference and the Relatonshp between Two Varables Key Words The Regresson Model The Sample Regresson Equaton The Pearson Correlaton Coeffcent Learnng Outcomes After studyng ths chapter,

More information

One-sided finite-difference approximations suitable for use with Richardson extrapolation

One-sided finite-difference approximations suitable for use with Richardson extrapolation Journal of Computatonal Physcs 219 (2006) 13 20 Short note One-sded fnte-dfference approxmatons sutable for use wth Rchardson extrapolaton Kumar Rahul, S.N. Bhattacharyya * Department of Mechancal Engneerng,

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

This column is a continuation of our previous column

This column is a continuation of our previous column Comparson of Goodness of Ft Statstcs for Lnear Regresson, Part II The authors contnue ther dscusson of the correlaton coeffcent n developng a calbraton for quanttatve analyss. Jerome Workman Jr. and Howard

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

COMMON ERRORS IN NUMERICAL PROBLEM SOLVING HOW CAN THEY BE DETECTED AND PREVENTED

COMMON ERRORS IN NUMERICAL PROBLEM SOLVING HOW CAN THEY BE DETECTED AND PREVENTED COMMON ERRORS IN NUMERICL PROBLEM SOLVING HOW CN THEY BE DETECTED ND PREVENTED Mordecha Shacham, Ben-Guron Unversty, Beer-Sheva, Israel Mchael B. Cutlp, Unversty of Connectcut, Storrs, CT 6269, US Mchael

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE

ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE P a g e ANOMALIES OF THE MAGNITUDE OF THE BIAS OF THE MAXIMUM LIKELIHOOD ESTIMATOR OF THE REGRESSION SLOPE Darmud O Drscoll ¹, Donald E. Ramrez ² ¹ Head of Department of Mathematcs and Computer Studes

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers Psychology 282 Lecture #24 Outlne Regresson Dagnostcs: Outlers In an earler lecture we studed the statstcal assumptons underlyng the regresson model, ncludng the followng ponts: Formal statement of assumptons.

More information

Numerical Heat and Mass Transfer

Numerical Heat and Mass Transfer Master degree n Mechancal Engneerng Numercal Heat and Mass Transfer 06-Fnte-Dfference Method (One-dmensonal, steady state heat conducton) Fausto Arpno f.arpno@uncas.t Introducton Why we use models and

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

SIMPLE LINEAR REGRESSION

SIMPLE LINEAR REGRESSION Smple Lnear Regresson and Correlaton Introducton Prevousl, our attenton has been focused on one varable whch we desgnated b x. Frequentl, t s desrable to learn somethng about the relatonshp between two

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

4.3 Poisson Regression

4.3 Poisson Regression of teratvely reweghted least squares regressons (the IRLS algorthm). We do wthout gvng further detals, but nstead focus on the practcal applcaton. > glm(survval~log(weght)+age, famly="bnomal", data=baby)

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix Lectures - Week 4 Matrx norms, Condtonng, Vector Spaces, Lnear Independence, Spannng sets and Bass, Null space and Range of a Matrx Matrx Norms Now we turn to assocatng a number to each matrx. We could

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1 Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables

More information

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE

CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE CHAPTER 5 NUMERICAL EVALUATION OF DYNAMIC RESPONSE Analytcal soluton s usually not possble when exctaton vares arbtrarly wth tme or f the system s nonlnear. Such problems can be solved by numercal tmesteppng

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2 Chapter 4 Smple Lnear Regresson Page. Introducton to regresson analyss 4- The Regresson Equaton. Lnear Functons 4-4 3. Estmaton and nterpretaton of model parameters 4-6 4. Inference on the model parameters

More information

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 1 Chapters 14, 15 & 16 Professor Ahmad, Ph.D. Department of Management Revsed August 005 Chapter 14 Formulas Smple Lnear Regresson Model: y =

More information

18. SIMPLE LINEAR REGRESSION III

18. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III US Domestc Beers: Calores vs. % Alcohol Ftted Values and Resduals To each observed x, there corresponds a y-value on the ftted lne, y ˆ ˆ = α + x. The are called ftted values.

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

Statistics Chapter 4

Statistics Chapter 4 Statstcs Chapter 4 "There are three knds of les: les, damned les, and statstcs." Benjamn Dsrael, 1895 (Brtsh statesman) Gaussan Dstrbuton, 4-1 If a measurement s repeated many tmes a statstcal treatment

More information

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography

CSci 6974 and ECSE 6966 Math. Tech. for Vision, Graphics and Robotics Lecture 21, April 17, 2006 Estimating A Plane Homography CSc 6974 and ECSE 6966 Math. Tech. for Vson, Graphcs and Robotcs Lecture 21, Aprl 17, 2006 Estmatng A Plane Homography Overvew We contnue wth a dscusson of the major ssues, usng estmaton of plane projectve

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Kernel Methods and SVMs Extension

Kernel Methods and SVMs Extension Kernel Methods and SVMs Extenson The purpose of ths document s to revew materal covered n Machne Learnng 1 Supervsed Learnng regardng support vector machnes (SVMs). Ths document also provdes a general

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

Scatter Plot x

Scatter Plot x Construct a scatter plot usng excel for the gven data. Determne whether there s a postve lnear correlaton, negatve lnear correlaton, or no lnear correlaton. Complete the table and fnd the correlaton coeffcent

More information

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University

PHYS 450 Spring semester Lecture 02: Dealing with Experimental Uncertainties. Ron Reifenberger Birck Nanotechnology Center Purdue University PHYS 45 Sprng semester 7 Lecture : Dealng wth Expermental Uncertantes Ron Refenberger Brck anotechnology Center Purdue Unversty Lecture Introductory Comments Expermental errors (really expermental uncertantes)

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Linear Feature Engineering 11

Linear Feature Engineering 11 Lnear Feature Engneerng 11 2 Least-Squares 2.1 Smple least-squares Consder the followng dataset. We have a bunch of nputs x and correspondng outputs y. The partcular values n ths dataset are x y 0.23 0.19

More information

Lab 2e Thermal System Response and Effective Heat Transfer Coefficient

Lab 2e Thermal System Response and Effective Heat Transfer Coefficient 58:080 Expermental Engneerng 1 OBJECTIVE Lab 2e Thermal System Response and Effectve Heat Transfer Coeffcent Warnng: though the experment has educatonal objectves (to learn about bolng heat transfer, etc.),

More information

Inductance Calculation for Conductors of Arbitrary Shape

Inductance Calculation for Conductors of Arbitrary Shape CRYO/02/028 Aprl 5, 2002 Inductance Calculaton for Conductors of Arbtrary Shape L. Bottura Dstrbuton: Internal Summary In ths note we descrbe a method for the numercal calculaton of nductances among conductors

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 008 Recall: man dea of lnear regresson Lnear regresson can be used to study

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Regularized Discriminant Analysis for Face Recognition

Regularized Discriminant Analysis for Face Recognition 1 Regularzed Dscrmnant Analyss for Face Recognton Itz Pma, Mayer Aladem Department of Electrcal and Computer Engneerng, Ben-Guron Unversty of the Negev P.O.Box 653, Beer-Sheva, 845, Israel. Abstract Ths

More information

Laboratory 1c: Method of Least Squares

Laboratory 1c: Method of Least Squares Lab 1c, Least Squares Laboratory 1c: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly

More information

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding Recall: man dea of lnear regresson Lecture 9: Lnear regresson: centerng, hypothess testng, multple covarates, and confoundng Sandy Eckel seckel@jhsph.edu 6 May 8 Lnear regresson can be used to study an

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

Laboratory 3: Method of Least Squares

Laboratory 3: Method of Least Squares Laboratory 3: Method of Least Squares Introducton Consder the graph of expermental data n Fgure 1. In ths experment x s the ndependent varable and y the dependent varable. Clearly they are correlated wth

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Chapter 3 Secton 3.1 Model Assumptons: Multple Regresson Model Predcton Equaton Std. Devaton of Error Correlaton Matrx Smple Lnear Regresson: 1.) Lnearty.) Constant Varance 3.) Independent Errors 4.) Normalty

More information

Formulas for the Determinant

Formulas for the Determinant page 224 224 CHAPTER 3 Determnants e t te t e 2t 38 A = e t 2te t e 2t e t te t 2e 2t 39 If 123 A = 345, 456 compute the matrx product A adj(a) What can you conclude about det(a)? For Problems 40 43, use

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Chapter 8 Indicator Variables

Chapter 8 Indicator Variables Chapter 8 Indcator Varables In general, e explanatory varables n any regresson analyss are assumed to be quanttatve n nature. For example, e varables lke temperature, dstance, age etc. are quanttatve n

More information

Errors for Linear Systems

Errors for Linear Systems Errors for Lnear Systems When we solve a lnear system Ax b we often do not know A and b exactly, but have only approxmatons  and ˆb avalable. Then the best thng we can do s to solve ˆx ˆb exactly whch

More information

Supplementary Notes for Chapter 9 Mixture Thermodynamics

Supplementary Notes for Chapter 9 Mixture Thermodynamics Supplementary Notes for Chapter 9 Mxture Thermodynamcs Key ponts Nne major topcs of Chapter 9 are revewed below: 1. Notaton and operatonal equatons for mxtures 2. PVTN EOSs for mxtures 3. General effects

More information

Correlation and Regression

Correlation and Regression Correlaton and Regresson otes prepared by Pamela Peterson Drake Index Basc terms and concepts... Smple regresson...5 Multple Regresson...3 Regresson termnology...0 Regresson formulas... Basc terms and

More information

Application of B-Spline to Numerical Solution of a System of Singularly Perturbed Problems

Application of B-Spline to Numerical Solution of a System of Singularly Perturbed Problems Mathematca Aeterna, Vol. 1, 011, no. 06, 405 415 Applcaton of B-Splne to Numercal Soluton of a System of Sngularly Perturbed Problems Yogesh Gupta Department of Mathematcs Unted College of Engneerng &

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Originated from experimental optimization where measurements are very noisy Approximation can be actually more accurate than

Originated from experimental optimization where measurements are very noisy Approximation can be actually more accurate than Surrogate (approxmatons) Orgnated from expermental optmzaton where measurements are ver nos Approxmaton can be actuall more accurate than data! Great nterest now n applng these technques to computer smulatons

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

STATISTICS QUESTIONS. Step by Step Solutions.

STATISTICS QUESTIONS. Step by Step Solutions. STATISTICS QUESTIONS Step by Step Solutons www.mathcracker.com 9//016 Problem 1: A researcher s nterested n the effects of famly sze on delnquency for a group of offenders and examnes famles wth one to

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification

2E Pattern Recognition Solutions to Introduction to Pattern Recognition, Chapter 2: Bayesian pattern classification E395 - Pattern Recognton Solutons to Introducton to Pattern Recognton, Chapter : Bayesan pattern classfcaton Preface Ths document s a soluton manual for selected exercses from Introducton to Pattern Recognton

More information

Week3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity

Week3, Chapter 4. Position and Displacement. Motion in Two Dimensions. Instantaneous Velocity. Average Velocity Week3, Chapter 4 Moton n Two Dmensons Lecture Quz A partcle confned to moton along the x axs moves wth constant acceleraton from x =.0 m to x = 8.0 m durng a 1-s tme nterval. The velocty of the partcle

More information