Chapter 3 Response Surface Approximations

Size: px
Start display at page:

Download "Chapter 3 Response Surface Approximations"

Transcription

1 Chapter 3 Respose Surface Approxmatos Most optmzato algorthms that are use for solvg aaltcal egeerg optmzato problems are sequetal ature. hat s, the objectve fucto ad costrats are evaluated at oe pot at a tme, ad the values at that pot, as well as prevous desg pot cotrbute to a decso o where desg space to move to for the ext evaluato. Whe the objectve fuctos ad (or) the costrats are evaluated b expermets rather Whe the objectve fuctos ad (or) the costrats are evaluated b expermets rather tha b aaltcal evaluatos, there s usuall a cetve to perform the expermets batches rather tha sgl. Oe reaso for batchg expermets s that most requre set-up tme, advace plag ad reservatos of expermetal facltes or techcas. Aother reaso s that expermetal errors make t dffcult to terpret the results of a sgle expermet. Whe a batch of expermets s performed, errors oe or two expermets ted to stad out. Duplcatg expermets for detcal omal codtos permts us to estmate the magtude of expermetal scatter due to errors ad varablt the propertes of the tested desgs. Fall, some of the expermetal scatter ca be averaged out b performg a large umber of expermets. Because of these advatages of rug expermets batches, expermetal optmzato has followed a dfferet route tha aaltcal optmzato. he stadard approach s to use a optmzato strateg that s based o the results of a batch of expermets. O the bass of the expermets, we costruct approxmatos to objectve fuctos ad/or costrats ad perform optmzato o the bass of these approxmatos. I most cases, the optmum obtaed s the tested, ad f satsfactor results are obtaed the desg procedure s termated. I some cases, the optmum s used as the cetral desg pot for a ew batch of expermets, ad the process s repeated oce or twce. hs process s sometmes called sequetal approxmate optmzato. However, because of the cost ad tme assocated wth coductg expermets, t s rare that the process s terated to covergece. Whe aaltcal calculatos were mostl based o closed form solutos or umercal models that requred mmal modelg ad computatos, the dfferece betwee aalss ad expermets was ver clear. However, toda umercal evaluatos of objectve fuctos ad costrats ofte share ma of the propertes of expermetal evaluatos. Frst, umercal models such as fte elemet structural models requre substatal vestmet of tme to set up ad debug. Furthermore, the evaluato of such models ma requre large computatoal resources, so that the cost of umercal smulato ma be comparable to the cost of expermets. Secod, wth aaltcal smulatos based o complex umercal models, ma sources of ose are ofte foud the results of umercal smulatos. hese cludes roud-off errors as well as errors due to complete covergece of teratve processes. Addtoall, 8

2 umercal smulatos are usuall based o dscretzato of cotua, ad the accurac of ths dscretzato depeds of the shape of the domas beg dscretzed. For example, whe stress aalss s performed a elastc bod usg fte-elemet dscretzato, the dscretzato error does ot chage smoothl wth the shape of the bod, because the umber of fte elemets ca var ol teger cremets. hese growg smlartes betwee aaltcal smulatos ad expermets create cetves to ru aaltcal smulatos batches ad use sequetal approxmate optmzato. Addtoall, the growg avalablt of parallel computers also provdes cetves for rug aaltcal smulatos batches. Fall, umercal smulatos are ofte ru wth software packages that are dffcult to coect drectl to optmzato programs. Approxmate sequetal optmzato provdes a mechasm for rug these software packages a stad-aloe mode ad coectg the optmzato programs to the approxmato. 3. Fttg a Approxmato to Gve Data Selectg the pots desg space where expermets are to be performed s possbl the most mportat part of obtag good approxmatos to sstem respose quattes that ma be used computg objectve fuctos ad costrats. However, ths pot selecto turs to be a dffcult optmzato problem tself, ad wll be dscussed the ext chapter. I ths secto we wll address the more modest questo of how to obta the best approxmato to a gve a set of expermets. We deote the respose fucto that we wat to approxmate as, ad assume that t ca be approxmated as a fucto of the desg varable vector x ad a vector of parameters, that s ˆ( x, ) (3..) where ŷ represets the approxmato, ad the error. We have the data from expermets at desg pots x ( x, ) (3..) ad we wat to fd the parameter vector that wll best ft the expermetal evdece. I ma o-desg applcatos, ŷ represets a umercal model such as a fte-elemet model, ad represets a set of phscal parameters such as masses ad stffesses. he process of fdg the values of the phscal parameters that best ft the expermetal results s called sstem detfcato. I desg applcatos, the vector of parameters ofte does ot have a phscal meag. Rather, we select some fuctoal represetato for ŷ, wth represetg some coeffcet to be determed so as to ft the data well. For example, wth a sgle desg varable x, a lear approxmato s of the form x (3..3) whle a ratoal approxmato ma take the form (3..4) x 9

3 he process of fdg to best ft the data s called regresso, ad ŷ s called a respose surface. he most commol used measure of the error the approxmato s the root-mea-square (rms) error e ˆ(, ) rms x. (3..5) Other measures that are ofte used are the average absolute error eav ˆ( x, ), (3..6) ad the maxmum error e max ˆ ( x, ). (3..7) max he populart of erms as a measure of the error s probabl due to the fact that t s ofte easer to estmate the that mmzes e rms tha the that mmzes the other two measures. However, there s also theoretcal justfcato for usg t. If the form of the model ŷ s exact (that s the true fucto s of that form) ad the error s ormall dstrbuted, wth o correlato betwee data pots ad all havg the same stadard devato, the mmzg e rms wll provde the best lear ubased estmate (BLUE) of. he value of that would be foud f we had a fte umber of expermets s sometmes thought of as the `true' value of the parameter vector. Wth a fte umber of expermets we obta ol a estmate of ths 'true' value, whch we deote as b. Estmatg the value of b that mmzes the rms error s eas for cases where ŷ s a lear fucto of, that s ( x), (3..8) Where (x) are gve shape fuctos, usuall moomals. For example, for the lear approxmato, Eq. 3..3, x. (3..9) Let us deote as b a estmate to that mmzes e rms for the gve expermets. he dfferece betwee the model ad the jth expermet, e, s e j j j j b ( x ), (3..0) or vector form, e= - Xb. (3..) Note that the (I,j) compoet of the matrx X s j ( x ). he rms error s the rms average of the compoets of e erms e ee, (3..) So that mmzg e rms s equvalet to mmzgee. Usg Eq. 3.. we have ee ( Xb) ( Xb) Xbb X b X Xb. 30

4 (3..3) o fd the vector b that mmzes e rms we dfferetate eewth respect to the compoets of b ad set the dervatves to zero. It s ot dffcult to check that we get X Xb X. (3..4) Equato 3..4 s a sstem of equatos called the ormal equato, ad we ca wrte ts soluto of ths equato as ( b X X) X. (3..5) However, the ormal equato s ofte ll-codtoed, especall whe s hgh. o avod some of the effects of the ll codtog we ca formulate the problem a slghtl dfferet form. Cosder the sstem Xb. (3..6) If we could solve ths equato exactl, the from Eq we see that the rms error wll be zero. However, ths sstem has equatos for the ukows, wth geeral larger tha, so that geeral, we caot fd a exact soluto. hat s, a vector b wll ot satsf Eq exactl, but stead there wll be a vector of resduals (dffereces betwee the left sde ad the rght sde of the equato). he soluto of the ormal equato s the vector b that mmzes the sum of the squares of the resduals. However, stead of solvg the ormal equatos, there are umercal methods, such as the QR decomposto, that solves for Eq drectl for the least square soluto, ad these are usuall more umercall stable tha umercal solutos of the ormal equatos. o mprove umercal stablt, t s also recommeded to traslate ad scale all the varables so that each chages the rage (-,). he vector of errors left after the least-squares ft s deoted ase r, ad the sum of the squares of the errors remag after the ft s deoted as SS e. Wth a bt of algebra we ca show that SS ee b X. (3..7) e r r 3

5 Example 3.. Ft a straght le bo b x to the followg three measuremets of. (0) = 0, () =, () = 0. Calculate frst a least square ft, ad the compare to a ft that wll mmze the maxmum error, ad to oe that wll mmze the average absolute error. If we wrte the three equatos that we would lke to satsf for the three pots, Eq we If we wrte the three equatos that we would lke to satsf for the three pots, Eq we get (0) b 0 () b 0 0 () b 0 b b 0 (3..8) So that the matrx X ad the vector are gve as 0 0 X, (3..9) 0 So that X X, (3..0) Ad 0 X 0 (3..) 0 We ca ow wrte the ormal equato, X Xb X, 3b0 3b 3b0 5b ad solve t to obta b 0 = /3 ad b = 0. So that our least-square ft s the le = /3. Wth two pots at = 0 ad oe pot at =, ths seems to be a sesble choce. he errors (resduals) at the three pots are e = /3, e = /3, ad e 3 = /3. So the sum of the squares of the errors s /3, ad the rms error s e rms (3..) We ca obta the same result also from Eq sce, b /3,0, b X /3, (3..3) the SS / 3 / 3, e SS / (3..4) e rms e B comparso, the average ad maxmum errors are 3

6 e , av emax (3..5) I geeral, t s dffcult to fd the respose surface that mmzes the maxmum error ad the respose surface that mmzes the average error. However, for ths smple example ths ca be easl doe. We expect that lke the le that mmzes the rms error, the les that mmze the other two errors would be horzotal les of the form c, 0c. o mmze the maxmum error, t s obvous that we must have c = 0.5, whch results a maxmum error e max I ths case ths s also the average error ad the rms error because all three pots have the same 0.5 error. o mmze the average absolute error, we ote that for the rage c [0,] the average absolute error wll be e c ( c) c c av, 3 3 (3..6) so that the mmum average error s acheved for c = 0. I that case the le s gve as = 0, the average error s the smallest at /3, but erms / , ad e max. he three cases are show Fg. 3.. For more complex examples, we should use computer software to perform the process of fttg the respose surface to the data. he followg example shows results for the softdrk-ca problem. Example 3.. Soft-Drk Ca Desg: A soft-drk compa has a ew drk that the pla to market, ad ther prelmar research has determed that the cost to produce ad dstrbute a cldrcal ca s approxmatel gve as C 0.8V 0 0.S, where C s the cost cets, V 0 s the volume flud ouces, ad S s the surface area of the ca square ches. he also determed that the ca sell the drk cas ragg from 5 to 5 ouces, ad the the prce P cets that ca be charged for a ca s estmated as P.5V 0 0.0V 0. Based o ther past experece the wll cosder ol a ca wth dameter D betwee.5 ad 3.5 ches, ad ther market research has show that soft-drk cas have to have a aspect rato of at least.0 to be eas to drk from. hat s, the heght H of the ca has to be at least twce the dameter. We express the volume ad the surface area terms of the dameter ad the heght of the ca V 0.5D /.8, S (0.5D ) DH, 0 H where the.8 factor s used to covert from cubc ches to flud ouces. he proft per ca, p ma be the wrtte as c p PC DH DH D DH c (0.5 ), 33

7 Feld tests are coducted ad resulted the followg 9 data pots, show as (dameter, heght, proft): (.8, 3.6, 5.9), (.4, 4.8, 3.), (3., 6.,.), (.5, 4.6, 4.7), (., 5.8,.0) (.7, 7., 0.9), (.4, 5.6, 4.9), (., 6.8,.5), (.6, 8.,.4) We wll assume that the equatos are more accurate tha the feld test results (how ca that be possble?). From the equatos, the correspodg trads are.8, 3.6, ), (.4, 4.8, 3.074), (3.,6.,.883), (.5, 4.6, 4.748), (., 5.8,.963) (.7, 7., 0.854), (.4, 5.6, 4.9), (., 6.8,.458), (.6, 8.,.38) Ft a quadratc polomal to the proft per ca usg the expermetal 9 data pots, Repeat wth a ft based o the 'true' cost. he quadratc polomal wll be of the form. p c b bd b3h b4d b5dh b6h (3..7) So that from the data we wll get 9 equatos for the sx coeffcets. he frst equato obtaed for the expermetal data s b.8b 3.6b3 3.4b4 6.48b5.96b6 5.9, (3..8) ad the last oe s b.6b 8b3 6.76b4 0.8b5 64b6.4, (3..9) Solvg the ormal equato ad substtutg the coeffcets to Eq we get p c D 0.30H 0.358D.00DH H. (3..30) I comparso, f we use the true values for the proft, we get D 0.675H 0.96D.09DH 0.5H. (3..3) p c 34

8 Note that seve of the e coeffcets are farl close, wth about 0 percet of oe aother. However, the coeffcet of H ad of H are qute dfferet, dcatg that these coeffcets are less mportat for fttg the data. hs does ot mea, however, that these coeffcets ma ot 35

9 affect predctos at other pots besde the data pots. We are thus wared that these coeffcets ma eed specal treatmet. he proft-per-ca cotours based o the expermetal data are show Fg. 3., ad the oes based o the 'true' values are show Fg

10 3. Estmatg the Accurac of the Respose Surface I most applcatos, the respose surface approxmato we costruct o the bass of gve data s teded for predcto of respose at other desg pots, tpcall for mprovg the desg. herefore the ultmate test of the respose surface s how well t predcts the respose at other pots of terest. However, f the respose surface does ot approxmate the respose well eve at the data pots, we caot expect t to approxmate well other pots desg space. he most mmedate measures of the accurac of the ft to the data are the varous errors dscussed earler, the rms error, the average absolute error ad the maxmum error. We deote as ^ the estmate of the respose surface for the respose, that s (see Eq b ( x), (3..) ad use to deote the values of the respose surface at the data pots ad e the error vector at these pots (vector of resduals). It s eas to check that the Xb, e ˆ. (3..) he average ad maxmum errors are the eav e, emax max e. (3..3) he sum of the squares of the remag errors, SSe s gve b Eq. 3..7, ad we ca calculate from t the rms error, erms as SSe erms. (3..4) However, ths calculato of rms error s qute msleadg f we wated to use t to assess the accurac of the respose surface. hs becomes clear f we ote that f the umber of data pots s equal to the umber of coeffcets,, the the respose surface wll pass through the data pots, ad the error wll be zero. We certal do ot expect that the error wll be zero at other pots, ot cluded the data. I fact, fttg a respose surface to a umber of pots equal to the umber of coeffcets (kow as saturated desg) s kow to ofte lead to poor approxmato, especall whe there s ose the data. A mpressve bod of theoretcal work has bee doe for the case where the ose the data s radom wth ormal dstrbuto wth zero mea ad stadard devato of, ad where the ose at oe pot s ucorrelated wth the ose at other data pots (e.g., Mers ad Motgomer, 995). O the bass of these assumptos we ca estmate the stadard devato of the ose the measuremets, whch also serves as a estmate of e rms at all the pots the rego of terest. A ubased estmate ˆ for ths stadard devato s gve as SSe ˆ, (3..5) 37

11 ad s kow as the stadard error. If ths estmate of the rms error s larger tha we ca tolerate for predctg values of the respose at caddate desg pots, we coclude that our respose surface s adequate. I ths case we must chage the form of the respose surface to tr to ft the data better. he smplest approach s to add terms to the polomal approxmato. If we used a lear polomal to start wth, we ma wat to go to a quadratc. If we used a quadratc, we ma wat to use a cubc. Ufortuatel, whle we ca alwas mprove the ft to the data b creasg the umber of terms the respose surface, t s ot clear that these gas wll traslate to gas predctg the respose surface at other pots. As we add coeffcets, we ru the dager of `overfttg' the data. hs dager s partcularl acute whe the data cotas substatal amout of ose. As we crease the umber of coeffcets we ma be fttg the ose rather tha the uderlg respose. hs dager s captured b Eq As we add more terms we expect to decrease the umerator, but the deomator wll also decrease. Aother measure related to the rms error s obtaed b calculatg the varato of the data from ts average. (3..6) he varato from the average s deoted as SS ad s gve as. (3..7) SS Smlarl the varato of the respose surface ŷ from the same average s deoted as SS r so that ˆ SSr. (3..8) he rato of the two, deoted R, measures the fracto of the varato the data s captured b the respose surface SS r SSe R, (3..9) SS SS where the last equalt s left to the reader as a exercse. As we add more coeffcets to the polomal, we reduce crease capabltes of our model R whch s gve as SSe Ra R SS SSe ad therefore R. However, as oted before, ths does ot mea that the predcto mprove. For ths reasos, there s a adjusted form of If the adjusted value,. (3..0) R, decreases as we crease the umber of coeffcets, t s a a warg that we ma be fttg the data better, but losg o predctve capablt. he ol true test to the predctve capabltes of the respose surface s evaluatg t at pots ot used ts costructo. hs s doe ofte b practtoers. However, because addtoal tests or umercal evaluatos of are ofte expesve, t 38

12 s worthwhle to look for was of checkg predctve capablt wthout performg addtoal evaluatos at ew pots. If the umber of pots used for the ft s substatall larger tha the umber of coeffcets, leavg out a few pots wll ot chage much the qualt of the ft. We ca therefore leave out a few pots, ft the respose surface to the remag pots ad check the error at the left-out pots. We ca the replcate the procedure wth other pots, so that each pot s left out oce, a procedure kow as cross valdato. Whe the cost of repeatg the ft s small (as t s for respose surface techques), cross valdato s usuall doe leavg oe pot at a tme. he procedure goes also b the ame of PRESS (for predcto error sum of squares), a ame that comes from usg the sum of the squares of the predcto errors as a measure of the predctve accurac of the ft. We deote the vector of errors as e p. It s possble to obta e p wthout actuall performg all of these fts. Whe ol oe pot s left out, t ca be show that the th compoet of e p s related to the th compoet of e r as er e p, (3..) E where E s the dagoal matrx of the so-called dempotet matrx E X ( X X ) X. (3..) It should be oted that the matrx X X s ofte ll codtoed, especall for large problems ad the the calculato of the matrx E from Eq. 3.. s ot ver relable. However, eve the drect calculato of e p b performg the fts, s usuall less expesve tha carrg out addtoal expermets order to test the accurac of the respose surface. Example 3.. A respose quatt s a fucto of a sgle varable x, ad has evetuall bee detfed to follow the smple relatoshp x. However, the frst set of measuremets that were take are gve the four (x, ) data pars (-,-.5), (-,-.5), (,.5), (,.75). From theoretcal cosderatos ou also kow that ( 0) 0. Ft a lear fucto ad a quadratc to the data, compare the two fts ad see how the model the true fucto. Lear ft: he respose surface s of the form b b x, ad usg the pot (0,0) as the thrd pots we have.5.5 X 0, 0. (3..3).5.75 So that 5 0 X X 0 0, 0 X 9.5. (3..4) So that we ca solve for b as 39

13 X X 0 0., 0 b ( X X ) X 0.95, (3..5) ad the lear respose surface, ad the error vector are ˆ 0.95 x, e 0.35, 0.575,0,0.35, 0.. (3..6) 0. 0 We ow calculate the sums of the squares, ad use Eqs ad 3..0 to obta 4 e a SS 9.5, SS , R , R (3..7) 3 hs lear ft appears to be qute satsfactor, wth the errors beg e 0.337, e 0.7, e (3..8) rms av max he estmate of the stadard devato of the ose (stadard error) the data from Eq s ˆ SSe 0.896, ˆ , (3..9) whch s le wth the actual fucto beg x, sce the errors we troduced the data are 0.5 ad 0.5. b b x b x Quadratc model: he respose surface s of the form 3 ad we get X 0 0, 0, (3..0) ad X X 0 0 0, X 9.5, (3..) We solve for b to obta 0.07 b X X X 0.95, (3..) So that our approxmato s x x, e 0.43,0.5, 0.07, 0.378,0.07, (3..3) We aga calculate the sums of the squares to obta SS 9.5, SS e , 4 R 0. 94, R a , (3..4) We see that there s ol a small mprovemet R, ad that R s actuall poorer, whch s a dcato that we have ot gaed a predctve capabltes b addg the quadratc terms. he error measures for the quadratc model are e 0.35, e 0.9, e, (3..5) rms av max 0.5 a 40

14 hese have ot chaged much compared to the lear model, wth small mprovemets the rms ad maxmum errors, ad small crease the average error. he estmate of the stadard devato of the ose the data (stadard error) from Eq s SS e ˆ 0.643, ˆ 0.54, (3..6) whch s a small crease over the lear case. hs crease s aother dcato that the quadratc approxmato s ot better tha the lear approxmato. he two approxmatos are compared Fg I Example 3.. we compared a lear model to a quadratc model. However, we do ot have to lmt ourselves to models that have all the terms up to a partcular order. For example, we ca cosder a quadratc model wthout the costat term, b x b x. Smlarl, wth two varables, t s commo for people to cosder a model of the form b bx b3 x b4 xx.hs model does ot clude quadratc terms x ad x, but t cludes the `teracto' term x x. We ca stpulate such a partal model o the bass of some kowledge of the behavor of the true fucto. However, most ofte such partal models are created b dscardg terms wth coeffcets whch caot be accuratel estmated o the bass of the data. Such coeffcets do ot have much effect o the accurac of the ft of the gve data, ad leavg them the model ca reduce ts predctve qualt for desg pots where these coeffcets have large effect o the predcto. I trg to ft a respose surface to the proft per ca, Example 3.., we observed that some of the coeffcets were sestve to small dffereces the data. We ca detf these coeffcets wthout chagg the data b estmatg the stadard devato of the coeffcets. Let us frst troduce the covarace matrx hs covarace matrx s defed as b of the vector of coeffcets b. 4

15 ( ) ( ) b be b be b (3..8) hat s, due to ose we get varous vectors b b fttg the data from several sets of expermets ad E(b) s the expected value (or average over ver large umber of expermets) of b ad b-e(b) s the devato of b from ts expected value. he the covarace matrx s the expected value of products of varous compoets of the dfferece. I partcular, the dagoal terms of the matrx are b defto the squares of the stadard devatos of the compoets of b, whle the off-dagoal terms are a measure of the correlato betwee compoets. It s possble to show that X X b, (3..9) so that wth our estmate ˆ of (see Eq. 3..5), we ca estmate the stadard devatos of the dvdual terms. he estmate of the stadard devato of the compoets of b s called the stadard error, se, so that ^ X X, se( b ) (3..30) A useful measure of the accurac of a compoet of b s the estmate of the coeffcet of varato, c, whch s the stadard devato of the compoet dvded b the absolute value of that compoet. he coeffcet of varato of the th compoet c s therefore the stadard error dvded b the compoet c se( b ), (3..3) b I ma respose surface procedures the quatt whch s used to assess the eed for a coeffcet s the verse of c, whch s called the test statstc, or the t-statstc. he coeffcet of varato s used a strateg called backward elmato (Mers ad Motgomer, p. 650). I ths strateg, we elmate the coeffcet wth the largest coeffcet of varato, perform aother regresso wth the remag terms, elmate the coeffcet wth the hghest coeffcet of varato, ad so o. We ca stop oce the coeffcet of varato of the remag terms s small eough, or we ca use a measure such as Ra to dcate to us whe elmatg addtoal terms s hurtg us. hs s llustrated b applg ths strateg to Example 3... Example 3.. Use backward elmato, startg wth the quadratc model of Example 3.., to fd the model wth the hghest value of R a. Cofrm our cocluso b a recalculatg coeffcets for a slght perturbato of the data at x to ( ). 35. For the quadratc model Example 3.., we foud that X X 0 0 0,, ˆ 0.643, (3..3) From ths we calculate 4

16 X X 0 0, b (3..33) he vector coeffcets for the quadratc model was Eq we get b 0.07, 0.95, , so from c 3.35, c 0.76, 3.56, c (3..34) Note that two of the coeffcets of varato are ver large. A value larger tha oe dcates that the stadard devato of the coeffcet s larger tha the value tself, so that we have ver lttle cofdece the value obtaed from the regresso. he coeffcet of varato assocated wth the costat term b s the largest oe, so we elmate t. hs leads to the partal quadratc model b x b x. For ths model we get X 0 0, 0, (3..35) Ad 0 0 X 9.5 X, X (3..36) We solve for b to obta 0.95 b X X X, (3..37) so that the approxmato s ˆ 0.95x x, e 0.6,0.597, 0, ,0.88, (3..38) We calculate the sums of the squares 9.5, SS 0.55, R , R SS e a (3..39) Comparg to the quadratc model Example 3.., we ote a substatal mprovemet Ra whch for the quadratc model was We also calculate a ew ˆ as SSe ˆ = ^ (3..40) 3 whch s also a mprovemet over the quadratc model. Next we calculate the ew covarace matrx 43

17 0 0 X X, 0 34 he ew coeffcets of varatos are the gve as b, (3..4) c 0.47, c 3.33, (3..4) We stll have a large value of c, so to cotue the process of backward elmato, we eed to elmate the b coeffcet, assocated wth the x term. However, we kow that we wll get the lear model back aga, because Example 3.., whe we ftted a lear model, ths model. Furthermore, both ˆ (0.4354) ad elded ˆ 0.95x R a (0.969) for the lear model obtaed Example 3.. are slghtl feror to our complete quadratc model. hs dcates that takg oe more step the backward elmato wll cofrm that the complete quadratc model s the best. Next we check how the coeffcets chage f we chage () =.35. If we repeat the quadratc model, we fd that we get ŷ = x x. We see that the small chage the data chaged the free coeffcet b more tha 30%, the x coeffcet b about 3%, ad the coeffet of x b ol about %. If we repeat the calculato for the complete quadratc model we fd ŷ = 0.935x x, whch s about % of the lear term, ad about 3% dfferet the quadratc term Nolear Regresso he advatage of havg a respose surface whch s a lear fucto of the coeffcets, s that the regresso process requres ol the soluto of a sstem of lear equatos (the ormal equatos). However, ma stuatos such a respose surface wll ot ft the data well. I partcular, most of the work wth lear regresso s doe wth lear ad quadratc polomals, ad these are ver lmted terms of the kd of fuctos that the wll ft well. I olear regresso, we assume that the respose surface s the more geeral form f ( x) ( x, ), (3.3.) We aga have a vector of fucto values ( x ),,..., ad we tr to estmate b a vector b that mmzes the rms error erms r, (3.3.) where r s the resdual at the th data pot r ( x, b) (3.3.3) However, ulke the case of lear regresso, we caot perform the ft b solvg a set of lear equatos, ad stead we have to perform ucostraed mmzato, tpcall requrg a umercal soluto. he followg example demostrates the advatages of olear regresso. 44

18 Example 3.3. Gve the four data pots, () = 0, () = 7, (3) = 5, ad (4) = 4, fd a approprate fucto to ft ad compare to polomal ft. he large value of () dcates that the fucto ma become ubouded ear that value, so that a approxmato usg ratoal fuctos ma be approprate. So we tr b b, (3.3.4) b3 x he resduals at the four data pots are b b r b 0, r b 7, (3.3.5) b b 3 b b r 3 b 5, r 4 b 4, (3.3.6) b3 3 b3 4 Optmzg wth Mcrosoft Excel the sum of the four resduals elded the followg soluto b, b 7, 0. 6 b3 that s 7, (3.3.7) x 0.6 A comparable polomal approxmato wth 3 coeffcets wll be a quadratc polomal, ad performg regresso elds x 3x, (3.3.8) wth a estmated 5, a rather substatal error. he two approxmatos are compared the Fgure 3.5. Whle we caot use lear regresso for the fttg, we ca use t after the fttg s doe to obta a estmate of the errors the coeffcets. hs ca be doe b learzato about the soluto of the olear regresso. We learze Eq about the soluto b* obtaed from the least square ft r ( x, b*) x, b * bj, (3.3.9) j bj hat s, f we ft the resduals r usg as bass fuctos the dervatves b j, we ca get estmates of the stadard devato of the coeffcets. 3 45

19 Example 3.3. We learze the prevous example about the value of the coeffcets obtaed from the olear regresso, obtag b 7 r b b3. (3.3.0) x 0.6 x 0.6 A lear regresso aalss wth Mcrosoft Excel gves b , b , b Note that these cremets should be subtracted from the values we obtaed the prevous example for more accurate ft, because the are a ft to the resdual ŷ. From the lear regresso we get a estmate for the stadard error ˆ 0.0, ad stadard errors for the coeffcets as se( b ) 0.0, se( b ) 0.50, se ( b 3 ) hese dcate hgh cofdece the values of the coeffcets, sce the stadard errors are less tha 0% of the values of the coeffcet here are ma stadard forms of olear regresso that we wll cosder subsequet chapters cludg Krgg, eural etworks, ad support vector regresso. Fall, the process of fdg phscal costats from laborator or feld measuremets ca be vewed as a process of olear regresso, sce the depedece of the measuremets s rarel lear these costats. Ufortuatel, the problem of mmzg the rms errors for fttg phscal expermets ofte has a large umber of local optma, so that choosg the best aswer s ot alwas smple. 46

20 3.4 Oulers Both phscal expermets ad computer smulatos occasoall we have data wth large errors. I computer smulatos these ma reflect falures of the soluto algorthm, software mplemetato, or mstakes b the user of the software. Data pots wth large errors are called outlers. It s mportat to detect them ad ether remove or repar them, because the ca have a large detrmetal effect o the accurac of the respose surface. A stadard tool for detectg outlers s Iteratvel Reweghted Least Squares (IRLS) proce dures. I order to uderstad ther bass, let us cosder frst the weghted least square (WLS) procedure. Weghted least squares procedures mmze a weghted sum of the squares of the resduals. here are ma possble reasos for usg WLS rather tha stadard least squares. We ma have more cofdece some data tha others. We ma wat to weght more heavl pots that are close to the rego where we wll eed to predct the respose tha pots far awa. he ose at some pots ma be kow to be hgher tha other pots. I these cases, stead of usg Eq. 3.. we use ewrms we e We, (3.4.) where, w s the weght assocated wth the th pot, ad W s a dagoal matrx wth the weghts o the dagoal. Mmzg the weghted rms error, ewrms elds a modfed set of ormal equatos X WXb X W, (3.4.) Iteratvel reweghted least square procedures weght pots wth large resduals wth small weghts, wth the weght decreasg wth creasg magtude of the resdual. he the WLS procedure s performed. If the pot s a outler, the respose surface wll move awa from t, so ts resdual wll crease. We wll assg a smaller weght to the pot ad repeat the procedure. Evetuall, outlers are lkel to ed up wth zero or low weght. here are several weghtg schemes, see e.g., Mers ad Motgomer, p. 67. Oe of the smplest s Huber's w f e / ˆ (3.4.3) ˆ / e otherwse Example 3.4. We eed to estmate Youg's modulus E o the bass of four stress-stra measuremets. For values of stras of,, 3, ad 4, mllstras, we measures stress values of 9,, 36, ad 39 ks. Deotg the stress b ad the stra b x we perform a stadard least square ft =Ex. X, (3.4.4)

21 So we have X X 30, X 37, E 0.567Ms (3.4.5) e.567,0.867,4.3,3.67 gvg he vector of resduals s ˆ ee33.846, see ( ) , (3.4.6) he ol resdual greater tha ˆ s the thrd oe, so ts weght w We ow perform a weghted least squares ft, gvg us X WX 7.876, X W 95.5, E 0.46, (3.4.7) he ew vector of resduals se.457,.085,4.68,.830, gvg ˆ e We Aga, the ol resdual larger tha ˆ s the thrd oe, ad ts weght ow reduces to w Oe more terato gves us 3 X WX 6.904, X W , E 0.40, (3.4.8) We ca cotue teratg, or we ca be satsfed that the process has detfed the thrd pot as a outler ad dscard t. If we do that we get E = 9.95 Ms, wth a stadard error se(e) = Exercses. For Example 3.. calculate the rms error from the cross-valdato PRESS procedure ad compare to other estmates of the error.. Check the accurac of the quadratc respose surface Example 3.. the rego.5 D 3.5, 3 H 7. Fd the maxmum error, average error ad rms error (a) for the 9 data pots, (b) for the etre rego (usg the aaltcal expresso). For part (b) ou ma perform the calculato aaltcall, or ou ma cover the doma wth a grd of (0X0) pots ad calculate the error each oe of the 400 pots. (c) Calculate the error usg the PRESS procedure ad compare to the result of part (b) 3. Usg the data Example 3.. ft a quadratc polomal to the proft per ouce, ad perform the same error aalss as requested Problem. 4. Usg the data of Problem, use backward elmato to fd a complete quadratc polomal wth the hghest R. a. he check for the accurac of the ft compared to the a aaltcal proft per ca over the etre rego, ad compare to the results obtaed Problem. 48

Objectives of Multiple Regression

Objectives of Multiple Regression Obectves of Multple Regresso Establsh the lear equato that best predcts values of a depedet varable Y usg more tha oe eplaator varable from a large set of potetal predctors {,,... k }. Fd that subset of

More information

ESS Line Fitting

ESS Line Fitting ESS 5 014 17. Le Fttg A very commo problem data aalyss s lookg for relatoshpetwee dfferet parameters ad fttg les or surfaces to data. The smplest example s fttg a straght le ad we wll dscuss that here

More information

Simple Linear Regression

Simple Linear Regression Statstcal Methods I (EST 75) Page 139 Smple Lear Regresso Smple regresso applcatos are used to ft a model descrbg a lear relatoshp betwee two varables. The aspects of least squares regresso ad correlato

More information

Analyzing Two-Dimensional Data. Analyzing Two-Dimensional Data

Analyzing Two-Dimensional Data. Analyzing Two-Dimensional Data /7/06 Aalzg Two-Dmesoal Data The most commo aaltcal measuremets volve the determato of a ukow cocetrato based o the respose of a aaltcal procedure (usuall strumetal). Such a measuremet requres calbrato,

More information

Econometric Methods. Review of Estimation

Econometric Methods. Review of Estimation Ecoometrc Methods Revew of Estmato Estmatg the populato mea Radom samplg Pot ad terval estmators Lear estmators Ubased estmators Lear Ubased Estmators (LUEs) Effcecy (mmum varace) ad Best Lear Ubased Estmators

More information

Third handout: On the Gini Index

Third handout: On the Gini Index Thrd hadout: O the dex Corrado, a tala statstca, proposed (, 9, 96) to measure absolute equalt va the mea dfferece whch s defed as ( / ) where refers to the total umber of dvduals socet. Assume that. The

More information

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model

12.2 Estimating Model parameters Assumptions: ox and y are related according to the simple linear regression model 1. Estmatg Model parameters Assumptos: ox ad y are related accordg to the smple lear regresso model (The lear regresso model s the model that says that x ad y are related a lear fasho, but the observed

More information

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model

Lecture 7. Confidence Intervals and Hypothesis Tests in the Simple CLR Model Lecture 7. Cofdece Itervals ad Hypothess Tests the Smple CLR Model I lecture 6 we troduced the Classcal Lear Regresso (CLR) model that s the radom expermet of whch the data Y,,, K, are the outcomes. The

More information

Correlation and Simple Linear Regression

Correlation and Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uverst Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

Functions of Random Variables

Functions of Random Variables Fuctos of Radom Varables Chapter Fve Fuctos of Radom Varables 5. Itroducto A geeral egeerg aalyss model s show Fg. 5.. The model output (respose) cotas the performaces of a system or product, such as weght,

More information

Multiple Choice Test. Chapter Adequacy of Models for Regression

Multiple Choice Test. Chapter Adequacy of Models for Regression Multple Choce Test Chapter 06.0 Adequac of Models for Regresso. For a lear regresso model to be cosdered adequate, the percetage of scaled resduals that eed to be the rage [-,] s greater tha or equal to

More information

Simple Linear Regression

Simple Linear Regression Correlato ad Smple Lear Regresso Berl Che Departmet of Computer Scece & Iformato Egeerg Natoal Tawa Normal Uversty Referece:. W. Navd. Statstcs for Egeerg ad Scetsts. Chapter 7 (7.-7.3) & Teachg Materal

More information

ENGI 3423 Simple Linear Regression Page 12-01

ENGI 3423 Simple Linear Regression Page 12-01 ENGI 343 mple Lear Regresso Page - mple Lear Regresso ometmes a expermet s set up where the expermeter has cotrol over the values of oe or more varables X ad measures the resultg values of aother varable

More information

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions.

Ordinary Least Squares Regression. Simple Regression. Algebra and Assumptions. Ordary Least Squares egresso. Smple egresso. Algebra ad Assumptos. I ths part of the course we are gog to study a techque for aalysg the lear relatoshp betwee two varables Y ad X. We have pars of observatos

More information

Summary of the lecture in Biostatistics

Summary of the lecture in Biostatistics Summary of the lecture Bostatstcs Probablty Desty Fucto For a cotuos radom varable, a probablty desty fucto s a fucto such that: 0 dx a b) b a dx A probablty desty fucto provdes a smple descrpto of the

More information

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades

Multiple Regression. More than 2 variables! Grade on Final. Multiple Regression 11/21/2012. Exam 2 Grades. Exam 2 Re-grades STAT 101 Dr. Kar Lock Morga 11/20/12 Exam 2 Grades Multple Regresso SECTIONS 9.2, 10.1, 10.2 Multple explaatory varables (10.1) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (10.2) Trasformatos

More information

Mean is only appropriate for interval or ratio scales, not ordinal or nominal.

Mean is only appropriate for interval or ratio scales, not ordinal or nominal. Mea Same as ordary average Sum all the data values ad dvde by the sample sze. x = ( x + x +... + x Usg summato otato, we wrte ths as x = x = x = = ) x Mea s oly approprate for terval or rato scales, ot

More information

Chapter 5 Transformation and Weighting to Correct Model Inadequacies

Chapter 5 Transformation and Weighting to Correct Model Inadequacies Chapter 5 Trasformato ad Weghtg to Correct Model Iadequaces The graphcal methods help detectg the volato of basc assumptos regresso aalss. Now we cosder the methods ad procedures for buldg the models through

More information

Part 4b Asymptotic Results for MRR2 using PRESS. Recall that the PRESS statistic is a special type of cross validation procedure (see Allen (1971))

Part 4b Asymptotic Results for MRR2 using PRESS. Recall that the PRESS statistic is a special type of cross validation procedure (see Allen (1971)) art 4b Asymptotc Results for MRR usg RESS Recall that the RESS statstc s a specal type of cross valdato procedure (see Alle (97)) partcular to the regresso problem ad volves fdg Y $,, the estmate at the

More information

TESTS BASED ON MAXIMUM LIKELIHOOD

TESTS BASED ON MAXIMUM LIKELIHOOD ESE 5 Toy E. Smth. The Basc Example. TESTS BASED ON MAXIMUM LIKELIHOOD To llustrate the propertes of maxmum lkelhood estmates ad tests, we cosder the smplest possble case of estmatg the mea of the ormal

More information

Lecture Notes Types of economic variables

Lecture Notes Types of economic variables Lecture Notes 3 1. Types of ecoomc varables () Cotuous varable takes o a cotuum the sample space, such as all pots o a le or all real umbers Example: GDP, Polluto cocetrato, etc. () Dscrete varables fte

More information

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best

best estimate (mean) for X uncertainty or error in the measurement (systematic, random or statistical) best Error Aalyss Preamble Wheever a measuremet s made, the result followg from that measuremet s always subject to ucertaty The ucertaty ca be reduced by makg several measuremets of the same quatty or by mprovg

More information

Chapter 14 Logistic Regression Models

Chapter 14 Logistic Regression Models Chapter 4 Logstc Regresso Models I the lear regresso model X β + ε, there are two types of varables explaatory varables X, X,, X k ad study varable y These varables ca be measured o a cotuous scale as

More information

Maximum Likelihood Estimation

Maximum Likelihood Estimation Marquette Uverst Maxmum Lkelhood Estmato Dael B. Rowe, Ph.D. Professor Departmet of Mathematcs, Statstcs, ad Computer Scece Coprght 08 b Marquette Uverst Maxmum Lkelhood Estmato We have bee sag that ~

More information

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 00 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutos to assst caddates preparg for the examatos future years ad for the

More information

Chapter 5 Properties of a Random Sample

Chapter 5 Properties of a Random Sample Lecture 6 o BST 63: Statstcal Theory I Ku Zhag, /0/008 Revew for the prevous lecture Cocepts: t-dstrbuto, F-dstrbuto Theorems: Dstrbutos of sample mea ad sample varace, relatoshp betwee sample mea ad sample

More information

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 2 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uverst Mchael Bar Sprg 5 Mdterm xam, secto Soluto Thursda, Februar 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes exam.. No calculators of a kd are allowed..

More information

Chapter 8. Inferences about More Than Two Population Central Values

Chapter 8. Inferences about More Than Two Population Central Values Chapter 8. Ifereces about More Tha Two Populato Cetral Values Case tudy: Effect of Tmg of the Treatmet of Port-We tas wth Lasers ) To vestgate whether treatmet at a youg age would yeld better results tha

More information

Chapter Two. An Introduction to Regression ( )

Chapter Two. An Introduction to Regression ( ) ubject: A Itroducto to Regresso Frst tage Chapter Two A Itroducto to Regresso (018-019) 1 pg. ubject: A Itroducto to Regresso Frst tage A Itroducto to Regresso Regresso aalss s a statstcal tool for the

More information

PGE 310: Formulation and Solution in Geosystems Engineering. Dr. Balhoff. Interpolation

PGE 310: Formulation and Solution in Geosystems Engineering. Dr. Balhoff. Interpolation PGE 30: Formulato ad Soluto Geosystems Egeerg Dr. Balhoff Iterpolato Numercal Methods wth MATLAB, Recktewald, Chapter 0 ad Numercal Methods for Egeers, Chapra ad Caale, 5 th Ed., Part Fve, Chapter 8 ad

More information

Regresso What s a Model? 1. Ofte Descrbe Relatoshp betwee Varables 2. Types - Determstc Models (o radomess) - Probablstc Models (wth radomess) EPI 809/Sprg 2008 9 Determstc Models 1. Hypothesze

More information

residual. (Note that usually in descriptions of regression analysis, upper-case

residual. (Note that usually in descriptions of regression analysis, upper-case Regresso Aalyss Regresso aalyss fts or derves a model that descres the varato of a respose (or depedet ) varale as a fucto of oe or more predctor (or depedet ) varales. The geeral regresso model s oe of

More information

Feature Selection: Part 2. 1 Greedy Algorithms (continued from the last lecture)

Feature Selection: Part 2. 1 Greedy Algorithms (continued from the last lecture) CSE 546: Mache Learg Lecture 6 Feature Selecto: Part 2 Istructor: Sham Kakade Greedy Algorthms (cotued from the last lecture) There are varety of greedy algorthms ad umerous amg covetos for these algorthms.

More information

Descriptive Statistics

Descriptive Statistics Page Techcal Math II Descrptve Statstcs Descrptve Statstcs Descrptve statstcs s the body of methods used to represet ad summarze sets of data. A descrpto of how a set of measuremets (for eample, people

More information

ENGI 4421 Propagation of Error Page 8-01

ENGI 4421 Propagation of Error Page 8-01 ENGI 441 Propagato of Error Page 8-01 Propagato of Error [Navd Chapter 3; ot Devore] Ay realstc measuremet procedure cotas error. Ay calculatos based o that measuremet wll therefore also cota a error.

More information

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity

ECONOMETRIC THEORY. MODULE VIII Lecture - 26 Heteroskedasticity ECONOMETRIC THEORY MODULE VIII Lecture - 6 Heteroskedastcty Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur . Breusch Paga test Ths test ca be appled whe the replcated data

More information

Lecture Notes 2. The ability to manipulate matrices is critical in economics.

Lecture Notes 2. The ability to manipulate matrices is critical in economics. Lecture Notes. Revew of Matrces he ablt to mapulate matrces s crtcal ecoomcs.. Matr a rectagular arra of umbers, parameters, or varables placed rows ad colums. Matrces are assocated wth lear equatos. lemets

More information

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model

( ) = ( ) ( ) Chapter 13 Asymptotic Theory and Stochastic Regressors. Stochastic regressors model Chapter 3 Asmptotc Theor ad Stochastc Regressors The ature of eplaator varable s assumed to be o-stochastc or fed repeated samples a regresso aalss Such a assumpto s approprate for those epermets whch

More information

0/1 INTEGER PROGRAMMING AND SEMIDEFINTE PROGRAMMING

0/1 INTEGER PROGRAMMING AND SEMIDEFINTE PROGRAMMING CONVEX OPIMIZAION AND INERIOR POIN MEHODS FINAL PROJEC / INEGER PROGRAMMING AND SEMIDEFINE PROGRAMMING b Luca Buch ad Natala Vktorova CONENS:.Itroducto.Formulato.Applcato to Kapsack Problem 4.Cuttg Plaes

More information

Chapter 2 Supplemental Text Material

Chapter 2 Supplemental Text Material -. Models for the Data ad the t-test Chapter upplemetal Text Materal The model preseted the text, equato (-3) s more properl called a meas model. ce the mea s a locato parameter, ths tpe of model s also

More information

4. Standard Regression Model and Spatial Dependence Tests

4. Standard Regression Model and Spatial Dependence Tests 4. Stadard Regresso Model ad Spatal Depedece Tests Stadard regresso aalss fals the presece of spatal effects. I case of spatal depedeces ad/or spatal heterogeet a stadard regresso model wll be msspecfed.

More information

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class)

Assignment 5/MATH 247/Winter Due: Friday, February 19 in class (!) (answers will be posted right after class) Assgmet 5/MATH 7/Wter 00 Due: Frday, February 9 class (!) (aswers wll be posted rght after class) As usual, there are peces of text, before the questos [], [], themselves. Recall: For the quadratc form

More information

L5 Polynomial / Spline Curves

L5 Polynomial / Spline Curves L5 Polyomal / Sple Curves Cotets Coc sectos Polyomal Curves Hermte Curves Bezer Curves B-Sples No-Uform Ratoal B-Sples (NURBS) Mapulato ad Represetato of Curves Types of Curve Equatos Implct: Descrbe a

More information

ε. Therefore, the estimate

ε. Therefore, the estimate Suggested Aswers, Problem Set 3 ECON 333 Da Hugerma. Ths s ot a very good dea. We kow from the secod FOC problem b) that ( ) SSE / = y x x = ( ) Whch ca be reduced to read y x x = ε x = ( ) The OLS model

More information

Lecture Notes Forecasting the process of estimating or predicting unknown situations

Lecture Notes Forecasting the process of estimating or predicting unknown situations Lecture Notes. Ecoomc Forecastg. Forecastg the process of estmatg or predctg ukow stuatos Eample usuall ecoomsts predct future ecoomc varables Forecastg apples to a varet of data () tme seres data predctg

More information

To use adaptive cluster sampling we must first make some definitions of the sampling universe:

To use adaptive cluster sampling we must first make some definitions of the sampling universe: 8.3 ADAPTIVE SAMPLING Most of the methods dscussed samplg theory are lmted to samplg desgs hch the selecto of the samples ca be doe before the survey, so that oe of the decsos about samplg deped ay ay

More information

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ " 1

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. x, where. = y - ˆ  1 STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS Recall Assumpto E(Y x) η 0 + η x (lear codtoal mea fucto) Data (x, y ), (x 2, y 2 ),, (x, y ) Least squares estmator ˆ E (Y x) ˆ " 0 + ˆ " x, where ˆ

More information

Lecture 3. Sampling, sampling distributions, and parameter estimation

Lecture 3. Sampling, sampling distributions, and parameter estimation Lecture 3 Samplg, samplg dstrbutos, ad parameter estmato Samplg Defto Populato s defed as the collecto of all the possble observatos of terest. The collecto of observatos we take from the populato s called

More information

CHAPTER 3 POSTERIOR DISTRIBUTIONS

CHAPTER 3 POSTERIOR DISTRIBUTIONS CHAPTER 3 POSTERIOR DISTRIBUTIONS If scece caot measure the degree of probablt volved, so much the worse for scece. The practcal ma wll stck to hs apprecatve methods utl t does, or wll accept the results

More information

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then

X X X E[ ] E X E X. is the ()m n where the ( i,)th. j element is the mean of the ( i,)th., then Secto 5 Vectors of Radom Varables Whe workg wth several radom varables,,..., to arrage them vector form x, t s ofte coveet We ca the make use of matrx algebra to help us orgaze ad mapulate large umbers

More information

Multivariate Transformation of Variables and Maximum Likelihood Estimation

Multivariate Transformation of Variables and Maximum Likelihood Estimation Marquette Uversty Multvarate Trasformato of Varables ad Maxmum Lkelhood Estmato Dael B. Rowe, Ph.D. Assocate Professor Departmet of Mathematcs, Statstcs, ad Computer Scece Copyrght 03 by Marquette Uversty

More information

Lecture 8: Linear Regression

Lecture 8: Linear Regression Lecture 8: Lear egresso May 4, GENOME 56, Sprg Goals Develop basc cocepts of lear regresso from a probablstc framework Estmatg parameters ad hypothess testg wth lear models Lear regresso Su I Lee, CSE

More information

CHAPTER VI Statistical Analysis of Experimental Data

CHAPTER VI Statistical Analysis of Experimental Data Chapter VI Statstcal Aalyss of Expermetal Data CHAPTER VI Statstcal Aalyss of Expermetal Data Measuremets do ot lead to a uque value. Ths s a result of the multtude of errors (maly radom errors) that ca

More information

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger

Example: Multiple linear regression. Least squares regression. Repetition: Simple linear regression. Tron Anders Moger Example: Multple lear regresso 5000,00 4000,00 Tro Aders Moger 0.0.007 brthweght 3000,00 000,00 000,00 0,00 50,00 00,00 50,00 00,00 50,00 weght pouds Repetto: Smple lear regresso We defe a model Y = β0

More information

PTAS for Bin-Packing

PTAS for Bin-Packing CS 663: Patter Matchg Algorthms Scrbe: Che Jag /9/00. Itroducto PTAS for B-Packg The B-Packg problem s NP-hard. If we use approxmato algorthms, the B-Packg problem could be solved polyomal tme. For example,

More information

Homework Solution (#5)

Homework Solution (#5) Homework Soluto (# Chapter : #6,, 8(b, 3, 4, 44, 49, 3, 9 ad 7 Chapter. Smple Lear Regresso ad Correlato.6 (6 th edto 7, old edto Page 9 Rafall volume ( vs Ruoff volume ( : 9 8 7 6 4 3 : a. Yes, the scatter-plot

More information

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek

Investigation of Partially Conditional RP Model with Response Error. Ed Stanek Partally Codtoal Radom Permutato Model 7- vestgato of Partally Codtoal RP Model wth Respose Error TRODUCTO Ed Staek We explore the predctor that wll result a smple radom sample wth respose error whe a

More information

Evaluating Polynomials

Evaluating Polynomials Uverst of Nebraska - Lcol DgtalCommos@Uverst of Nebraska - Lcol MAT Exam Expostor Papers Math the Mddle Isttute Partershp 7-7 Evaluatg Polomals Thomas J. Harrgto Uverst of Nebraska-Lcol Follow ths ad addtoal

More information

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018

Statistics. Correlational. Dr. Ayman Eldeib. Simple Linear Regression and Correlation. SBE 304: Linear Regression & Correlation 1/3/2018 /3/08 Sstems & Bomedcal Egeerg Departmet SBE 304: Bo-Statstcs Smple Lear Regresso ad Correlato Dr. Ama Eldeb Fall 07 Descrptve Orgasg, summarsg & descrbg data Statstcs Correlatoal Relatoshps Iferetal Geeralsg

More information

Probability and. Lecture 13: and Correlation

Probability and. Lecture 13: and Correlation 933 Probablty ad Statstcs for Software ad Kowledge Egeers Lecture 3: Smple Lear Regresso ad Correlato Mocha Soptkamo, Ph.D. Outle The Smple Lear Regresso Model (.) Fttg the Regresso Le (.) The Aalyss of

More information

Multiple Linear Regression Analysis

Multiple Linear Regression Analysis LINEA EGESSION ANALYSIS MODULE III Lecture - 4 Multple Lear egresso Aalyss Dr. Shalabh Departmet of Mathematcs ad Statstcs Ida Isttute of Techology Kapur Cofdece terval estmato The cofdece tervals multple

More information

Introduction to local (nonparametric) density estimation. methods

Introduction to local (nonparametric) density estimation. methods Itroducto to local (oparametrc) desty estmato methods A slecture by Yu Lu for ECE 66 Sprg 014 1. Itroducto Ths slecture troduces two local desty estmato methods whch are Parze desty estmato ad k-earest

More information

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen.

2.28 The Wall Street Journal is probably referring to the average number of cubes used per glass measured for some population that they have chosen. .5 x 54.5 a. x 7. 786 7 b. The raked observatos are: 7.4, 7.5, 7.7, 7.8, 7.9, 8.0, 8.. Sce the sample sze 7 s odd, the meda s the (+)/ 4 th raked observato, or meda 7.8 c. The cosumer would more lkely

More information

UNIT 2 SOLUTION OF ALGEBRAIC AND TRANSCENDENTAL EQUATIONS

UNIT 2 SOLUTION OF ALGEBRAIC AND TRANSCENDENTAL EQUATIONS Numercal Computg -I UNIT SOLUTION OF ALGEBRAIC AND TRANSCENDENTAL EQUATIONS Structure Page Nos..0 Itroducto 6. Objectves 7. Ital Approxmato to a Root 7. Bsecto Method 8.. Error Aalyss 9.4 Regula Fals Method

More information

9.1 Introduction to the probit and logit models

9.1 Introduction to the probit and logit models EC3000 Ecoometrcs Lecture 9 Probt & Logt Aalss 9. Itroducto to the probt ad logt models 9. The logt model 9.3 The probt model Appedx 9. Itroducto to the probt ad logt models These models are used regressos

More information

Lecture 1: Introduction to Regression

Lecture 1: Introduction to Regression Lecture : Itroducto to Regresso A Eample: Eplag State Homcde Rates What kds of varables mght we use to epla/predct state homcde rates? Let s cosder just oe predctor for ow: povert Igore omtted varables,

More information

The number of observed cases The number of parameters. ith case of the dichotomous dependent variable. the ith case of the jth parameter

The number of observed cases The number of parameters. ith case of the dichotomous dependent variable. the ith case of the jth parameter LOGISTIC REGRESSION Notato Model Logstc regresso regresses a dchotomous depedet varable o a set of depedet varables. Several methods are mplemeted for selectg the depedet varables. The followg otato s

More information

Simulation Output Analysis

Simulation Output Analysis Smulato Output Aalyss Summary Examples Parameter Estmato Sample Mea ad Varace Pot ad Iterval Estmato ermatg ad o-ermatg Smulato Mea Square Errors Example: Sgle Server Queueg System x(t) S 4 S 4 S 3 S 5

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Postpoed exam: ECON430 Statstcs Date of exam: Jauary 0, 0 Tme for exam: 09:00 a.m. :00 oo The problem set covers 5 pages Resources allowed: All wrtte ad prted

More information

Linear Regression Linear Regression with Shrinkage. Some slides are due to Tommi Jaakkola, MIT AI Lab

Linear Regression Linear Regression with Shrinkage. Some slides are due to Tommi Jaakkola, MIT AI Lab Lear Regresso Lear Regresso th Shrkage Some sldes are due to Tomm Jaakkola, MIT AI Lab Itroducto The goal of regresso s to make quattatve real valued predctos o the bass of a vector of features or attrbutes.

More information

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use.

The equation is sometimes presented in form Y = a + b x. This is reasonable, but it s not the notation we use. INTRODUCTORY NOTE ON LINEAR REGREION We have data of the form (x y ) (x y ) (x y ) These wll most ofte be preseted to us as two colum of a spreadsheet As the topc develops we wll see both upper case ad

More information

Statistics MINITAB - Lab 5

Statistics MINITAB - Lab 5 Statstcs 10010 MINITAB - Lab 5 PART I: The Correlato Coeffcet Qute ofte statstcs we are preseted wth data that suggests that a lear relatoshp exsts betwee two varables. For example the plot below s of

More information

Linear Regression with One Regressor

Linear Regression with One Regressor Lear Regresso wth Oe Regressor AIM QA.7. Expla how regresso aalyss ecoometrcs measures the relatoshp betwee depedet ad depedet varables. A regresso aalyss has the goal of measurg how chages oe varable,

More information

Kernel-based Methods and Support Vector Machines

Kernel-based Methods and Support Vector Machines Kerel-based Methods ad Support Vector Maches Larr Holder CptS 570 Mache Learg School of Electrcal Egeerg ad Computer Scece Washgto State Uverst Refereces Muller et al. A Itroducto to Kerel-Based Learg

More information

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn:

Chapter Business Statistics: A First Course Fifth Edition. Learning Objectives. Correlation vs. Regression. In this chapter, you learn: Chapter 3 3- Busess Statstcs: A Frst Course Ffth Edto Chapter 2 Correlato ad Smple Lear Regresso Busess Statstcs: A Frst Course, 5e 29 Pretce-Hall, Ic. Chap 2- Learg Objectves I ths chapter, you lear:

More information

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution:

{ }{ ( )} (, ) = ( ) ( ) ( ) Chapter 14 Exercises in Sampling Theory. Exercise 1 (Simple random sampling): Solution: Chapter 4 Exercses Samplg Theory Exercse (Smple radom samplg: Let there be two correlated radom varables X ad A sample of sze s draw from a populato by smple radom samplg wthout replacemet The observed

More information

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model

ECON 482 / WH Hong The Simple Regression Model 1. Definition of the Simple Regression Model ECON 48 / WH Hog The Smple Regresso Model. Defto of the Smple Regresso Model Smple Regresso Model Expla varable y terms of varable x y = β + β x+ u y : depedet varable, explaed varable, respose varable,

More information

X ε ) = 0, or equivalently, lim

X ε ) = 0, or equivalently, lim Revew for the prevous lecture Cocepts: order statstcs Theorems: Dstrbutos of order statstcs Examples: How to get the dstrbuto of order statstcs Chapter 5 Propertes of a Radom Sample Secto 55 Covergece

More information

Class 13,14 June 17, 19, 2015

Class 13,14 June 17, 19, 2015 Class 3,4 Jue 7, 9, 05 Pla for Class3,4:. Samplg dstrbuto of sample mea. The Cetral Lmt Theorem (CLT). Cofdece terval for ukow mea.. Samplg Dstrbuto for Sample mea. Methods used are based o CLT ( Cetral

More information

STA302/1001-Fall 2008 Midterm Test October 21, 2008

STA302/1001-Fall 2008 Midterm Test October 21, 2008 STA3/-Fall 8 Mdterm Test October, 8 Last Name: Frst Name: Studet Number: Erolled (Crcle oe) STA3 STA INSTRUCTIONS Tme allowed: hour 45 mutes Ads allowed: A o-programmable calculator A table of values from

More information

2SLS Estimates ECON In this case, begin with the assumption that E[ i

2SLS Estimates ECON In this case, begin with the assumption that E[ i SLS Estmates ECON 3033 Bll Evas Fall 05 Two-Stage Least Squares (SLS Cosder a stadard lear bvarate regresso model y 0 x. I ths case, beg wth the assumto that E[ x] 0 whch meas that OLS estmates of wll

More information

Bayes (Naïve or not) Classifiers: Generative Approach

Bayes (Naïve or not) Classifiers: Generative Approach Logstc regresso Bayes (Naïve or ot) Classfers: Geeratve Approach What do we mea by Geeratve approach: Lear p(y), p(x y) ad the apply bayes rule to compute p(y x) for makg predctos Ths s essetally makg

More information

Chapter 9 Jordan Block Matrices

Chapter 9 Jordan Block Matrices Chapter 9 Jorda Block atrces I ths chapter we wll solve the followg problem. Gve a lear operator T fd a bass R of F such that the matrx R (T) s as smple as possble. f course smple s a matter of taste.

More information

Simple Linear Regression and Correlation.

Simple Linear Regression and Correlation. Smple Lear Regresso ad Correlato. Correspods to Chapter 0 Tamhae ad Dulop Sldes prepared b Elzabeth Newto (MIT) wth some sldes b Jacquele Telford (Johs Hopks Uverst) Smple lear regresso aalss estmates

More information

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes

Midterm Exam 1, section 1 (Solution) Thursday, February hour, 15 minutes coometrcs, CON Sa Fracsco State Uversty Mchael Bar Sprg 5 Mdterm am, secto Soluto Thursday, February 6 hour, 5 mutes Name: Istructos. Ths s closed book, closed otes eam.. No calculators of ay kd are allowed..

More information

Simple Linear Regression and Correlation. Applied Statistics and Probability for Engineers. Chapter 11 Simple Linear Regression and Correlation

Simple Linear Regression and Correlation. Applied Statistics and Probability for Engineers. Chapter 11 Simple Linear Regression and Correlation 4//6 Appled Statstcs ad Probablty for Egeers Sth Edto Douglas C. Motgomery George C. Ruger Chapter Smple Lear Regresso ad Correlato CHAPTER OUTLINE Smple Lear Regresso ad Correlato - Emprcal Models -8

More information

Lecture 3 Probability review (cont d)

Lecture 3 Probability review (cont d) STATS 00: Itroducto to Statstcal Iferece Autum 06 Lecture 3 Probablty revew (cot d) 3. Jot dstrbutos If radom varables X,..., X k are depedet, the ther dstrbuto may be specfed by specfyg the dvdual dstrbuto

More information

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y.

b. There appears to be a positive relationship between X and Y; that is, as X increases, so does Y. .46. a. The frst varable (X) s the frst umber the par ad s plotted o the horzotal axs, whle the secod varable (Y) s the secod umber the par ad s plotted o the vertcal axs. The scatterplot s show the fgure

More information

2006 Jamie Trahan, Autar Kaw, Kevin Martin University of South Florida United States of America

2006 Jamie Trahan, Autar Kaw, Kevin Martin University of South Florida United States of America SOLUTION OF SYSTEMS OF SIMULTANEOUS LINEAR EQUATIONS Gauss-Sedel Method 006 Jame Traha, Autar Kaw, Kev Mart Uversty of South Florda Uted States of Amerca kaw@eg.usf.edu Itroducto Ths worksheet demostrates

More information

Bounds on the expected entropy and KL-divergence of sampled multinomial distributions. Brandon C. Roy

Bounds on the expected entropy and KL-divergence of sampled multinomial distributions. Brandon C. Roy Bouds o the expected etropy ad KL-dvergece of sampled multomal dstrbutos Brado C. Roy bcroy@meda.mt.edu Orgal: May 18, 2011 Revsed: Jue 6, 2011 Abstract Iformato theoretc quattes calculated from a sampled

More information

Chapter 13 Student Lecture Notes 13-1

Chapter 13 Student Lecture Notes 13-1 Chapter 3 Studet Lecture Notes 3- Basc Busess Statstcs (9 th Edto) Chapter 3 Smple Lear Regresso 4 Pretce-Hall, Ic. Chap 3- Chapter Topcs Types of Regresso Models Determg the Smple Lear Regresso Equato

More information

STK4011 and STK9011 Autumn 2016

STK4011 and STK9011 Autumn 2016 STK4 ad STK9 Autum 6 Pot estmato Covers (most of the followg materal from chapter 7: Secto 7.: pages 3-3 Secto 7..: pages 3-33 Secto 7..: pages 35-3 Secto 7..3: pages 34-35 Secto 7.3.: pages 33-33 Secto

More information

Lecture 2: Linear Least Squares Regression

Lecture 2: Linear Least Squares Regression Lecture : Lear Least Squares Regresso Dave Armstrog UW Mlwaukee February 8, 016 Is the Relatoshp Lear? lbrary(car) data(davs) d 150) Davs$weght[d]

More information

CS 2750 Machine Learning. Lecture 8. Linear regression. CS 2750 Machine Learning. Linear regression. is a linear combination of input components x

CS 2750 Machine Learning. Lecture 8. Linear regression. CS 2750 Machine Learning. Linear regression. is a linear combination of input components x CS 75 Mache Learg Lecture 8 Lear regresso Mlos Hauskrecht mlos@cs.ptt.edu 539 Seott Square CS 75 Mache Learg Lear regresso Fucto f : X Y s a lear combato of put compoets f + + + K d d K k - parameters

More information

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance

Chapter 13, Part A Analysis of Variance and Experimental Design. Introduction to Analysis of Variance. Introduction to Analysis of Variance Chapter, Part A Aalyss of Varace ad Epermetal Desg Itroducto to Aalyss of Varace Aalyss of Varace: Testg for the Equalty of Populato Meas Multple Comparso Procedures Itroducto to Aalyss of Varace Aalyss

More information

Statistics: Unlocking the Power of Data Lock 5

Statistics: Unlocking the Power of Data Lock 5 STAT 0 Dr. Kar Lock Morga Exam 2 Grades: I- Class Multple Regresso SECTIONS 9.2, 0., 0.2 Multple explaatory varables (0.) Parttog varablty R 2, ANOVA (9.2) Codtos resdual plot (0.2) Exam 2 Re- grades Re-

More information

Lecture Note to Rice Chapter 8

Lecture Note to Rice Chapter 8 ECON 430 HG revsed Nov 06 Lecture Note to Rce Chapter 8 Radom matrces Let Y, =,,, m, =,,, be radom varables (r.v. s). The matrx Y Y Y Y Y Y Y Y Y Y = m m m s called a radom matrx ( wth a ot m-dmesoal dstrbuto,

More information

DISTURBANCE TERMS. is a scalar and x i

DISTURBANCE TERMS. is a scalar and x i DISTURBANCE TERMS I a feld of research desg, we ofte have the qesto abot whether there s a relatoshp betwee a observed varable (sa, ) ad the other observed varables (sa, x ). To aswer the qesto, we ma

More information

Analysis of Variance with Weibull Data

Analysis of Variance with Weibull Data Aalyss of Varace wth Webull Data Lahaa Watthaacheewaul Abstract I statstcal data aalyss by aalyss of varace, the usual basc assumptos are that the model s addtve ad the errors are radomly, depedetly, ad

More information

Introduction to Matrices and Matrix Approach to Simple Linear Regression

Introduction to Matrices and Matrix Approach to Simple Linear Regression Itroducto to Matrces ad Matrx Approach to Smple Lear Regresso Matrces Defto: A matrx s a rectagular array of umbers or symbolc elemets I may applcatos, the rows of a matrx wll represet dvduals cases (people,

More information