Customized Sequential Designs for Random Simulation Experiments van Beers, W.C.M.; Kleijnen, J.P.C.

Size: px
Start display at page:

Download "Customized Sequential Designs for Random Simulation Experiments van Beers, W.C.M.; Kleijnen, J.P.C."

Transcription

1 Tlburg Unversty Customzed Sequental Desgns for Random Smulaton Eperments van Beers, W.C.M.; Klenen, J.P.C. Publcaton date: 25 Lnk to publcaton General rghts Copyrght and moral rghts for the publcatons made accessble n the publc portal are retaned by the authors and/or other copyrght owners and t s a condton of accessng publcatons that users recognse and abde by the legal requrements assocated wth these rghts. - Users may download and prnt one copy of any publcaton from the publc portal for the purpose of prvate study or research - You may not further dstrbute the materal or use t for any proft-makng actvty or commercal gan - You may freely dstrbute the URL dentfyng the publcaton n the publc portal Take down polcy If you beleve that ths document breaches copyrght, please contact us provdng detals, and we wll remove access to the work mmedately and nvestgate your clam. Download date: 14. Aug. 218

2 No CUSTOMIZED SEQUENTIAL DESIGNS FOR RANDOM SIMULATION EXPERIMENTS: KRIGING METAMODELING AND BOOTSTRAPPING By Wm C.M. van Beers, Jack P.C. Klenen March 25 ISSN

3 Customzed Sequental Desgns for Random Smulaton Eperments: Krgng Metamodelng and Bootstrappng Wm C.M. van Beers Department of Informaton Systems and Management Tlburg Unversty (UvT), Postbo 9153, 5 LE Tlburg, The Netherlands Phone: ; Fa: ; E-mal: wvbeers@uvt.nl Jack P.C. Klenen Department of Informaton Systems and Management/ Center for Economc Research (CentER) Tlburg Unversty (UvT), Postbo 9153, 5 LE Tlburg, The Netherlands Phone: ; Fa: ; E-mal: klenen@uvt.nl Ths paper proposes a novel method to select an epermental desgn for nterpolaton n random smulaton, especally dscrete event smulaton. (Though the paper focuses on Krgng, ths desgn approach may also apply to other types of metamodels such as lnear regresson models.) Assumng that smulaton requres much computer tme, t s mportant to select a desgn wth a small number of observatons (or smulaton runs). The proposed method s therefore sequental. Its novelty s that t accounts for the specfc nput/output behavor (or response functon) of the partcular smulaton at hand;.e., the method s customzed or applcaton-drven. A tool for ths customzaton s bootstrappng, whch enables the estmaton of the varances of predctons for nputs not yet smulated. The new method s tested through two classc smulaton models: eample 1 estmates the epected steady-state watng tme of the M/M/1 queueng model; eample 2 estmates the mean costs of a termnatng (s, S) nventory smulaton. For these smulatons the novel desgn ndeed gves better results than Latn Hypercube Samplng (LHS) wth a prefed sample of the same sze. Key words: Smulaton: desgn of eperments, statstcal analyss, Krgng, bootstrappng, regresson, C, C1, C9, C15, C44.

4 1. Introducton In ths paper, we focus on epensve smulatons; that s, we assume that a sngle smulaton run takes much computer tme. Consequently, nterpolaton s needed;.e., from the smulated nput/output (I/O) data, the outputs are predcted for nput combnatons not yet smulated. We devse a method that s meant to mnmze the number of smulaton runs for such nterpolaton. We talor our desgn of eperments (DOE) to the actual smulaton; that s, we do not derve a generc desgn such as a classc desgn (for eample, a 2 k p desgn) or a LHS desgn. The dfferences between customzed and generc desgns are as follows (also see Klenen and Van Beers (24), who focus on determnstc smulaton). A metamodel s a model of the I/O functon (or response functon ) mpled by the underlyng smulaton model. We denote the metamodel by Y () where denotes the k- dmensonal vector of the k nputs (factors) so = ( 1,,,, k ). Classc DOE assumes a smple metamodel. For eample, desgns of resoluton III (ncludng certan 2 k p desgns) assume a frst-order polynomal I/O functon. Composte desgns (CCD) assume a secondorder polynomal. These desgns are dscussed for physcal eperments n (for eample) the well-known tetbook Bo, Hunter, and Hunter (1978) and the recent tetbook Myers and Montgomery (22); for smulaton eperments we refer to Klenen (1987). LHS (much appled n Krgng, descrbed below) assumes that an adequate metamodel s more complcated than a low-order polynomal. LHS, however, does not assume a specfc metamodel. Instead, LHS focuses on the desgn space formed by the k dmensonal unt cube, defned by 1 ( = 1,, k) after standardzng (scalng) the nputs. LHS s one of the space fllng desgns: LHS samples that space accordng to some pror dstrbuton for the nputs, such as ndependent unform dstrbutons on [, 1]; see McKay, Beckman, and Conover (1979), and also Klenen et al. (25), Koehler and Owen (1996), and Santner, Wllams, and Notz (23). Unlke LHS, we eplctly account for the I/O functon. Unlke classc DOE, we assume that a low-order polynomal (estmated through regresson analyss) gves an nadequate appromaton of the I/O functon. In our method we estmate the uncertanty of predcted outputs at unobserved nput combnatons (these combnatons are also called scenaros, desgn ponts, combnatons of factor levels, or smulaton nputs). To estmate the uncertanty of these predctons caused by the nose and the shape of the I/O functon we use bootstrappng;.e., we resample the outputs for each scenaro already smulated (for 1

5 bootstrappng n general see the classc tetbook, Efron and Tbshran 1993; for bootstrappng n the valdaton of regresson metamodels n smulaton see Klenen and Deflandre 25). We make our procedure sequental for the followng two reasons. 1. Sequental statstcal procedures are known to be more effcent ; that s, they requre fewer observatons than fed-sample (one-shot) procedures; see, for eample, the handbook by Ghosh and Sen (1991) and the recent artcle by Park et al. (22). 2. Smulaton eperments proceed sequentally (unless parallel computers are used; our procedure also fts parallel computers). The lterature on determnstc smulaton shows several desgns that lke ours account for the specfc smulaton s I/O functon, and are sequental. For eample, Crary (22) dscusses G-optmal and I-optmal desgns, whch the DOE lterature defnes as follows. G-optmal desgns mnmze the mamum Mean Squared Error (MSE) of the predcted output; I-optmal or Integrated MSE (IMSE) desgns mnmze the average MSE (obvously, the MSE reduces to the varance f the predctor s unbased; see (5) and (6) below). Wllams, Santner, and Notz (2, 22) use a Bayesan approach to derve sequental IMSE desgns. Sasena, Papalambros, and Govaerts (22) derve sequental desgns for the optmsaton of determnstc smulaton models. Klenen and Van Beers (24) derve customzed sequental desgns for determnstc smulatons. We, however, focus on DOE for random smulatons, and we seem to be the frst to apply bootstrappng for ths problem. (Random smulaton ncludes Dscrete Event Dynamc Systems or DEDS smulaton such as M/M/1 smulaton, but also smulaton models consstng of stochastc dfference equatons.) We shall see that our desgns select most of ther nput combnatons n sub-areas that have more nterestng I/O behavor. In our frst eample we spend most of our computer smulaton tme on the challengng eplosve part of the metamodel that estmates the mean steady-state watng tme for varous traffc rates of sngle-server queueng systems wth Markovan (Posson) arrval and servce tmes known as the M/M/1 model. (The reader may take a peek at Fgure 1, dscussed n subsecton 5.1.) In our second eample, we estmate the average total costs n an (s, S) nventory model; there are several varatons on ths model, but we take the specfcaton gven by Law and Kelton (2). Agan, we fnd a concentraton of the nput combnatons n the sub-area where the metamodel shows steep slopes. (See Fgure 7, detaled n subsecton 5.2.) In both eamples, we compare our desgns wth LHS; our desgns gve better predctons. 2

6 The remander of ths paper s organzed as follows. Secton 2 summarzes the bascs of Krgng. Secton 3 summarzes DOE and Krgng. Usng the M/M/1 model, secton 4 eplans our method, whch apples bootstrappng to estmate the varances of the Krgng predctons for canddate nputs not yet smulated and sequentally selects as the net nput to be smulated, the one wth the largest bootstrap varance. Secton 5 demonstrates the procedure through two classc eamples: subsecton 5.1 uses M/M/1 smulatons, and subsecton 5.2 uses an (s, S) nventory model wth two nputs. For both eamples our method gves better results than LHS wth a prefed sample sze. Secton 6 presents conclusons and topcs for further research. 2. Krgng bascs Krgng (named after the South-Afrcan mnng engneer Krge) s an nterpolaton method that predcts unknown values of a random functon or random process; see Journel and Hubregts (1978) and Cresse s (1993) classc Krgng tetbook on spatal (geo)statstcs. Whereas spatal statstcs consders the two-dmensonal locaton as the known nput of ths process, smulaton consders the k dmensonal scenaro as nput; see Sacks et al. s (1989) classc artcle on the Desgn and Analyss of Computer Eperments (DACE) these computer eperments concern determnstc smulaton. Random (stochastc) smulaton ncludng DEDS smulatons s the topc of our paper. More precsely, a Krgng predcton s a weghted lnear combnaton of all output values already observed. The weghts depend on the dstances between the new nput to be predcted and the old nputs already observed. Krgng assumes that the closer the nputs are, the more postvely correlated the outputs are. Mathematcal formulatons follow n equatons (1) through (4). Currently, Krgng s frequently appled n determnstc smulaton, whch s much used n engneerng; agan see Sacks et al. (1989); for an update see Smpson et al. (21). In determnstc smulaton, Krgng has an mportant advantage over regresson analyss: the predcted values at old nputs are eactly equal to the observed (smulated) outputs. In random smulaton, however, ths property dsappears. Now, each scenaro s smulated several tmes wth non-overlappng pseudo-random number (PRN) streams. Van Beers and Klenen (23) show that Krgng nterpolates the average output per scenaro. These averages, however, are stll random, so the property that at scenaros already smulated the Krgng predctons equal the averages, loses ts ntutve appeal. Stll, Krgng may be 3

7 attractve because t may decrease the predcton bas (and hence the MSE) at scenaros close together. Indeed, n the eamples presented by Van Beers and Klenen (23) the Krgng predctons are much better than the regresson predctons (regresson analyss may be useful for other goals such as screenng and valdaton; see Klenen et al. 24). Therefore we do not further dscuss regresson analyss n ths paper. Mathematcally formulated, Krgng assumes the followng metamodel: Y ( ) = µ ( ) + δ ( ) wth δ ( ) ~ IID(, σ 2 ( )) (1) where µ () s the mean of the stochastc process Y (), and δ () s the addtve nose, whch s assumed ndependently and dentcally dstrbuted (IID) wth mean zero and varance σ 2 ( ). Ordnary Krgng to whch we lmt ourselves further assumes a statonary covarance process for Y () n (1);.e., the epected values µ () are a constant µ and the covarances of Y ( + h) and Y () depend only on the Eucldean dstance (lag) h = ( + h) ( ). (The assumpton µ () = µ s standard n Ordnary Krgng, and does not mply a flat response surface; see Sacks et al ) Y ˆ( The Krgng predctor for the unobserved (non-smulated) nput (say) denoted by ) s a weghted lnear combnaton of all the n observed outputs: n / Yˆ( ) = λ Y ( ) = Y (2) = 1 n wth = λ = 1,,, ) 1 = λn ( λ 1 and Y ( y 1,, y ). To select these weghts, Krgng = n derves the Best Lnear Unbased Predctor (BLUP), whch (by defnton) mnmzes the MSE of the predctor: { } 2 { MSE( Yˆ( ))} mn E( Y ( ) Yˆ( ) mn = ). (3) λ λ 4

8 Obvously, ths soluton depends on the output s covarances. It can be proven that the optmal weghts n (2) resultng from (3) are / 1 / / = + / (4) wth the followng symbols: s the vector of covarances between the outputs at the nput to be predcted and at the nputs already observed, so = / ( γ ( 1),, γ ( n )) ; / 1 = ( 1,,1) s the vector wth n ones; s the n n matr whose element (, ) s the (co)varance at the nputs already observed γ ) wth, = 1,, n. ( Note that the weghts n (4) vary wth (nput to be predcted), whereas regresson analyss uses the same estmated metamodel for all nputs. Note further that the lterature on (determnstc) smulaton speaks of covarances and correspondng correlatons, whereas the geostatstcs lterature speaks of the varogram, defned as 2γ ( h) = var( Y( + h) Y ( ) ). Snce we shall use the Matlab Krgng toolbo DACE made avalable free of charge by Lophaven, Nelsen, and Søndergaard (22) we avod the term varogram. (Recent alternatve free software s made avalable va see Santner, Wllams, and Notz 23.) We emphasze that n practce the covarances and n (4) are unknown so they must be estmated. The classcal estmator for ) (h s ˆ( h) ( Y( ) ( )) 2 Y /( 2N( h) ) =, where N ( h) denotes the number of dstnct pars n N ( h ) {(, ) : = h}. N( h) = Consequently, the weghts n (4) become random varables (say) ˆ. These weghts make the Krgng predctor resultng from (2) non-lnear. Ths characterstc s often neglected n the Krgng lterature. In general, non-lnear functons of random varables are hard to analyze a smple computer-ntensve soluton s bootstrappng; see Efron and Tbshran (1993). Ignorng the randomness of the estmated optmal weghts ˆ tends to underestmate the true varance of the Krgng predctor. For eample, n the bvarate normal case ths 2 follows from the formula for the condtonal varance, namely var( Y X ) = (1 ρ ) var( Y ) ; see, for eample, Kreyszg (197, p. 343). To tackle ths problem, Cresse (1993, p. 146) 5

9 proposes cross-valdaton. Cross-valdaton s also used by Klenen and Van Beers (24) for determnstc smulaton. For determnstc smulaton, Den Hertog, Klenen, and Sem (25) apply parametrc bootstrappng assumng normally dstrbuted predcton errors and fnd that gnorng the randomness of the Krgng weghts leads to serous errors. Because random smulaton may have non-normal outputs (for eample, queueng smulatons have dstrbutons wth heavy rght-hand tals), we use dstrbuton-free bootstrappng as we shall eplan n Secton DOE and Krgng By defnton, an epermental desgn s a set of n combnatons of k factor values. These combnatons are usually bounded by bo constrants: a b wth a, b R and = 1,, k. The set of all feasble combnatons s called the epermental regon (say) H. We suppose that H s a k-dmensonal unt cube, after rescalng the orgnal rectangular area (see Secton 1). Our goal s to fnd the best desgn for Krgng predctons wthn H; the Krgng lterature proposed several crtera (see Sacks et al. 1989, p. 414). Most of these crtera are based on the predctor s MSE. Most progress has been made for the IMSE (see Bates et al. 1996): ( Yˆ( ) ) φ( ) d IMSE = MSE (5) H where MSE follows from (3), and φ () s a gven weght functon usually assumed to be a constant. To evaluate a desgn, Sacks et al. (1989, p. 416) compare the predctons wth the known output values of a test set consstng of (say) N nputs. Assumng a constant φ () n (5), the IMSE can then be estmated by the Emprcal IMSE (EIMSE): EIMSE = 1 N N = 1 2 ( ˆ ( ) ( ) ). y y (6) Besdes ths EIMSE, we wll also study the mamum MSE; that s, we also consder rsk-averse users (also see Van Groengen, 2). So IMSE defned n (5) s replaced by 6

10 { MSE( Yˆ( ) )} MaMSE = ma (7) H and EIMSE n (6) by EMaIMSE = ma {1,..., m} 2 {( yˆ ( ) y ( ) ) }. (8) 4. Sequental DOE We devse the followng sequental DOE procedure wth eght steps, whch we llustrate through the M/M/1 model wth epermental regon H = { ρ :.1 ρ.9} where ρ denotes the traffc rate. Step 1. We start wth a small plot desgn wth (say) n nput combnatons; for eample, n = 5. We select the specfc n values such that they are equally spread over the epermental regon. There are varous space fllng desgns; for eample, LHS desgns. In the frst eample n Secton 5 namely the M/M/1 we use a mamn desgn, whch (by defnton) mamzes the mnmum dstance between any two ponts of the desgn; see Koehler and Owen (1996, p. 288). So n ths eample, we select the traffc rates {.1,.3,.5,.7,.9} ( = 1,, 5 ). Step 2: For each nput value, we ntally generate (say) m IID replcates because bootstrappng requres IID observatons; see Efron and Tbshran (1993). To obtan IID observatons n our M/M/1 smulaton eample, we apply renewal (regeneratve) analyss (see, for eample, Klenen and Van Groenendaal 1992, and Law and Kelton 2). As the renewal state, we choose the dle (empty) state. We therefore start the smulaton run n the empty state for each traffc rate. Net we observe m cycles each wth (random) cycle lengths (say) L (the hgher, the hgher L tends to be). Besdes the m cycle lengths ( = 1,, m ) per traffc rate, we observe the sum of the watng tmes over that cycle: L ; L ; sw ; = w ; t t = 1 ; ( = 1,, n ; = 1,, m ). (9) 7

11 To reduce the varance when comparng the (random) outputs for dfferent nputs (.e., to mprove the sgnal/nose rato), we use common random numbers (CRN). Ths s a popular varance reducton technque (VRT). It s well known that n M/M/1 smulaton the varance decreases substantally f the PRN (say) r t are manpulated as follows: successve PRN are used alternatvely to smulate the arrval tme (say) a and the servce tme s ; n other words, a t = ln r2 t 1E( a) and s t = ln r 2t E( s) (t = 1, 2, ). The correlaton coeffcents for the average watng tmes of two neghborng traffc rates turn out to be very hgh, namely roughly.99. To generate the PRN, we use the Matlab command rand. To ntalze the PRN, we set the Matlab generator (rather arbtrarly) to ts ntal state s =. The Matlab web ste further states: The unform random number generator n MATLAB 5 (and above) uses a lagged Fbonacc generator, wth a cache of 32 floatng pont numbers, combned wth a shft regster random nteger generator. The nteger generator uses shfts and eclusve OR s. ; see ( and also Moler (1995). For further detals on CRN, VRT, and PRN we refer to Law and Kelton (2). Step 3. Based on these m bvarate IID outputs ( L ;, sw ; ) ( = 1,, m ) per nput value, we estmate the mean watng tmes through m sw ; = 1 y ( m) = m. (1) L = 1 ; Ths rato estmator s consstent; for references see agan Klenen and Van Groenendaal (1992) and Law and Kelton (2). We do not try to mprove the small-sample performance of ths estmator (for eample, through ackknfng whch s closely related to bootstrappng), because ths estmator suffces for our Krgng metamodel. To estmate the precson of the estmate defned n (1), we use the followng probablty statement that holds asymptotcally per nput value : σˆ σˆ m m P y ( m ) t E( w ) y ( m ) + t = α α α 1 m 1; 1 2 m 1; 1 2 (11) L L 8

12 2 2 m where σ ˆ = vâr( sw ) + y vâr( L ) 2y côv( sw, L ) and L = L m ; agan see Klenen / = 1 ; and Van Groenendaal (1992). Note that ths nterval does not have an asymptotc ont (or epermentwse) probablty (1- α ) over all smulated nput values. Net, we add replcates one-at-a-tme sequental samplng untl the desred halfwdth of the nterval n (11) has reduced to a prefed relatve error (say) δ ; for eample, δ =.15 (agan see Klenen and Van Groenendaal 1992 and Law and Kelton 2). We denote the fnal number of replcates per nput by nput based on m replcates; see (1) wth m replaced by m. Ths gves the average output y m ) per m. ( Step 4. Based on these n average outputs y ( m ) for the n nputs, we compute the Krgng predctors for the epected outputs of a new set of (say) c n canddate nput values c g (g = 1,, c n ). We agan select these canddates n a space-fllng way; n the M/M/1 eample, we choose the canddate nputs halfway between two old neghborng nputs c so we avod etrapolaton: ( + ) 1 2 (wth g = 1,, n 1 ). g = g g + By defnton, the Krgng predctor s a weghted lnear combnaton of all outputs already observed; see (2). So now Krgng weghts the n values already observed n steps 1 through 3: yˆ( c g n ) = λ y( ) (12) = 1 n wth = 1 λ = 1. To estmate the weghts λ n (12), Krgng uses the old data set (, y ( m )) ( = 1,, n ). To estmate the varance of ths non-lnear predctor, we use bootstrappng as follows. Step 5. Per nput, we bootstrap the m bvarate IID outputs ( L ;, sw ; );.e., we resample wth replacement the outputs resultng from steps 1 through 3. We denote these bootstrap observatons by the superscrpt * (as s tradtonal n the bootstrap lterature): * * * * {( sw, L ),, ( sw, L )}. (13) ; 1 ; 1 ; m ; m 9

13 Usng these bootstrapped observatons and (1), we compute the bootstrap averages: m = 1 m = 1 sw * ; * y ( m ) =. (14) * L ; * Usng the bootstrapped I/O data (, y ( m )) ( = 1,, n ) and (12), we compute the bootstrapped Krgng predctor: * c yˆ ( g n * * ) = λ y ( ). (15) = 1 * We agan estmate the bootstrap weghts λ n (15) through the Matlab Toolbo DACE; see Secton 2. Note that DACE ams to obtan the mamum lkelhood estmator (MLE) of the Krgng weghts * λ n (15). For the numercal search that leads to ths MLE, DACE uses startng values. As startng values, we use the MLE for λ based on the orgnal I/O data n (12). Step 6. The resamplng per nput n step 5 s repeated (say) B tmes (ths B s called the bootstrap sample sze). Hence, (13) through (15) gve y ( ) wth b = 1,, B. ˆ * b c g For each of the c n canddate nputs c g, we compute the bootstrap varance of the Krgng predctor y ˆ c* g at c g : B c* 1 c* c* 2 v âr( yˆ g ) = ( yˆ g; b yˆ g ) (16) B 1 b = 1 where * ˆ c; g b y s the predcted value at canddate nput c g based on the bootstrapped I/O data * c* B c* (, y ( m )) ( = 1,, n ) ; b and yˆ = g b = yˆ B 1 g; b. varance (16): Step 7. We determne whch canddate nput has the largest bootstrap predcton 1

14 c* { vâr( yˆ )} v = arg ma g, (17) c g {1,..., n } and we add ths wnnng nput c v to the old desgn. Now, we run the smulaton model wth ths nput c v untl we have m replcates for ths nput. We stll apply CRN (so we ntalze the PRN wth the seed s ). Furthermore, we agan start wth the empty system as the renewal state. We contnue the smulaton untl the confdence nterval reaches the threshold δ ; see (11). Step 8. We repeat the steps 4 through 7 untl we have reached a stoppng crteron. In other words, we bootstrap the old I/O set augmented wth the canddate selected n step 7. We select a new set of canddates. For these canddates, we compute the Krgng predctors and ther bootstrap varances. Alternatve stoppng crtera may be: () the computer budget has been ehausted, () the proect has reached ts deadlne, () the precson of the Krgng metamodel s acceptable. We observe that addng one pont at a tme as we do n our sequental DOE s not necessarly optmal. However, t s a smple albet myopc heurstc; also see Banevc and Swtzer (22), who refer to Ferr and Pccon (1992). 5. Two eamples We test our customzed sequental desgn (CSD) through two classc academc smulaton models, namely the M/M/1 model and an (s, S) model M/M/1 model An M/M/1 has as true I/O functon the hyperbole y = wth < < 1 (18) 1 where y denotes the epected steady-state watng tme assumng a unt servce rate, and denotes the traffc rate. We apply the procedure descrbed n secton 4, selectng the followng parameters. 11

15 Step 1: We select a plot desgn of sze n = 5. Step 2: We obtan m 1 replcates to get ntal estmates of the varances; we select as the = ntal PRN seed s =. Step 3: We eperment wth two values for the precson, namely =.5 and =.15, and two values for the type-i error rate, namely =.1 and.5 so (11) gves four confdence ntervals. For hgher traffc rates (say, >.7), the numbers of cycles and the cycle lengths may be very large. To lmt computer tme, we lmt the number of cycles ( lmt preserves the renewal property, but may decrease the precson. Step 6: We eperment wth the bootstrap sample szes: B = 5 and B = 1. L ; ) to 1. Ths Step 8: We eperment wth a stoppng crteron that specfes that the total desgn sze s ether n = 1 or n = 5. Fgure 1 dsplays smulaton results for both our desgn and a LHS desgn. Ths fgure s based on the confdence ntervals n (11) wth =.5 and =.15. The bootstrap sample sze s only B = 5. The stoppng crteron s that n = 1 traffc rates have been smulated. Ths fgure corresponds wth one scenaro (labeled 1) of the eght scenaros n our eperment; see Table 1b below. LHS turns out to smulate fewer challengng nputs;.e., hgh traffc rates. Insert Fgure 1 To evaluate our procedure, we use a test set wth N = 32 equdstant traffc rates, namely {.1125,.1375,,.8875} (Sacks et al also use test sets to evaluate ther procedure). We compare the Krgng predctons of the two desgns wth the true outputs of the test set, computed from (18). (The two desgns may contan some members of the test set, but we gnore ths phenomenon.) Fgure 2 llustrates the 32 predctons for replcate 1 of scenaro 1. Insert Fgure 2 To compare the predctons of our desgn and LHS, we mght use the EIMSE crteron, defned n (6). However, the fnal numbers of replcates n the two desgns may dffer, so we calculate the corrected EIMSE, denoted by e later on: 12

16 1 e = CEIMSE = C n t n t t t 2 ( yˆ( ) ) y( ) = 1, (19) where C s the rato of the total number of replcates n the LHS desgn and n our desgn, n t s the number of I/O combnatons n the test set (so n = 32 ), and test set. t t s the th nput of the We compute ths crteron for eght scenaros;.e., eght combnatons of values of the type-i error rate α, the relatve error δ, the bootstrap sample sze B, and the fnal desgn sze n. These scenaros are specfed through a k p 2 desgn wth k = 4 and p = 1. Ths desgn s epressed n standardzed values n Table 1a (see Klenen and Van Groenendaal 1992); note that all columns are orthogonal. The orgnal values are dsplayed n Table 1b. Insert Tables 1a and 1b To decrease the randomness of CEIMSE n (19), we replcate each scenaro n Table 1 R = 5 tmes. To ensure that the PRN streams do not overlap, we start Matlab s PRN generator n the ntal state s = (usng the command RAND('state', )) n the frst replcaton of each scenaro. Net we save the generator s state of the scenaro that requres the largest number of smulaton runs; we use that state as the ntal state for each of the eght scenaros n the net replcaton, and so on. Table 2a shows the R = 5 CEIMSEs per scenaro, denoted by e r ( r = 1,, R), for the Customzed Sequental Desgn; Table 2b shows e r for LHS. Insert Tables 2a and 2b We analyze the results n Table 2 as follows. Comparng Tables 2a and 2b shows that our desgns do not have smaller CEIMSE than LHS desgns, n all cases (scenaros and replcates). More precsely, our desgns gve better results only f the desgn sze n s small ; see the scenaros 1, 5, 6, and 7. But t s eactly these cases that we are nterested n, snce (as we stated n Secton 1) we focus on epensve smulatons, whch mply that bg desgn szes are nfeasble. So, we compute the dfferences 13

17 d e e ; r ; r; LHS ; r; CSD = wth = 1,, 8; r = 1,, 5. (2) Lumpng all scenaros together, the Student t test does not gve sgnfcant dfferences at a type-i error rate of 5% (the varaton of the dfferences d ; s large). However, Fgure 3 r suggests that each of the four scenaros wth small n (desgn sze) gves sgnfcantly postve dfferences. We therefore nvestgate whch factors eplan the performance of our desgn relatve to LHS, as follows. Insert Fgure 3 Remember that we have the k = 4 factors correspondng wth α, δ, B, and n. So we estmate the frst-order polynomal, whch has the man effects β : k + β + ε r = 1 d r = β. (21) We wsh to account for varance heterogenety: 2 var( ε ) σ. Moreover we use CRN, so d r and ( = 1,, 8) are not ndependent. Therefore we compute the OLS estmator of the d r parameters n (21) per replcaton: ˆ = ( X X ) X (22) 1 r d r where X s the 8 5 matr followng from (21) and Table 1a. Ths gves the average OLS estmator based on all R = 5 replcatons: R ˆ 1 = ˆ r. (23) R 1 r= Hence the standard error for the th man effect s R ˆ ˆ 2 R s ˆ ( β ; r β ) ( 1) ( β ) r s( ˆ = 1 β ) = =, (24) R R 14

18 VRWKH6WXGHQWVWDWLVWLFZLWK R 1 degrees of freedom s t ; ν ˆ β β =. (25) s( ˆ β ) Ths statstc assumes normalty, whch probably holds because the Central Lmt Theorem may be appled. The classc null-hypothess s that β = ( = 1,, 4) n (21). We dsplay the correspondng t-statstcs defned by (25) n Table 3 for three values of the type-i error rate, namely.1,.5, and.1. Insert Table 3 Table 3 shows that the desgn sze n (factor 4) has a sgnfcant negatve effect on the dfference d (for any of the three type-i error rates);.e., the advantage of our desgn becomes smaller as the desgn sze n ncreases. Further, the bootstrap sample sze B (factor 3) has no sgnfcant effect: our procedure uses the bootstrap only to estmate whch canddate nput has the largest varance of the Krgng predctor; see (17). So n practce the smaller sze, B = 5, may be used. (Most bootstrap applcatons requre the estmaton of the whole dstrbuton functon, so B s much hgher than 5; for eample, B = 1.) Changes n and (factors 1 and 2) affect the number of replcates, but ths effect s ncorporated n CEIMSE va the factor C; see (19). Rsk-averse users may be guded by EMaIMSE, defned n (8). Agan, our desgns outperform LHS desgns for the smaller desgn szes n. Table 4a shows the fve EMaIMSE values for scenaro, denoted by values for LHS; Fgure 4 shows the dfferences, d ma e for our desgn, and Table 4b shows the analogous ;r = e e. ma ma ma ; r ; r; LHS ; r; CSD Insert Tables 4a and 4b, and Fgure 4 Note that m (number of requred cycles) ndeed ncreases wth (traffc rate). For eample, for the precson requrements =.5 and =.15, =.1 requres 489 cycles, 15

19 whereas =.9 requres the mamum number of cycles, namely 1; see Fgure 5. Moreover, a cycle s lkely to be longer as the traffc rate ncreases. For eample, f =.1 then the average cycle length s L = 4. 8 for m 1 replcates; f =.9 then L = =. For a hgh traffc rate, the mamum number of cycles (1) s reached, n ths fgure. For hgher accuracy ( =.5) ths mamum s also reached for moderate traffc rates. Insert Fgure 5 A queston about our desgn mght be: s the concentraton of the smulaton runs n the nput range wth hgh traffc rates caused by the hgh sgnal (E(y)) or the hgh nose (var(y)) (both the mean and the varance of the M/M/1 s steady-state watng tme ncrease wth the traffc rate)? To answer ths queston, we run some Monte Carlo eperments nspred by the M/M/1 model. In these eperments we use the relatve precson δ =.15, the type-i error rate α =.5, and the fnal desgn sze n = 15. We use the same PRN seed for the same macro-replcate of the four eperments. We run s macro-replcates; the results across the s macro-replcates look very much alke, so to save space we do not dsplay the fgures for all macro-replcates; Fgure 6 gves results for one macro-replcate. Insert Fgure 6 (a) Increasng sgnal and constant nose: y = /( 1 ) + r wth.1. 9 and r U ( 1, 1) ; n other words, the sgnal follows (18), but the nose s unformly dstrbuted between 1 and 1, for any nput value. Fgure 6(a) shows that our desgn allocates ts runs to the area wth rapdly changng sgnal as our desgn dd for the M/M/1 n Fgure (b) Constant sgnal and ncreasng nose: y = 5 + 1r. Fgure 6(b) shows that our desgn agan allocates ts runs to the hgh nput values wth hgh nose. (c) Constant sgnal and constant nose: y = 5 + r. Fgure 6(c) shows that now our desgn spreads ts runs unformly across the epermental area. (d) Increasng sgnal and decreasng nose: y = /( 1 ) + r /(1). Fgure 6(d) shows that now our desgn allocates most of ts runs to the mddle of the epermental area. Our eplanaton s that the ncreasng sgnal pulls the runs to the hgh nput values, whereas the decreasng nose pulls them to the low values so that the net result s a compromse. 16

20 5.2. (s, S) nventory model In an (s, S) model (wth s < S) wth random demand D, the nventory I s replenshed to the order up-to level S whenever the nventory decreases to a value smaller than the reorder level s;.e., the order quantty Q s S I f I < s Q = f I s. There are several varatons on ths basc model, but we smulate Law and Kelton (2, p. 6, 651) s eample 12.9 whch has the followng features. Tmes between demands are IID eponental random varables wth a mean of.1 month. If a demand arrves, ts sze s gven by the probablty functon D Pr{D} The nventory s revewed at the begnnng of each month. Law and Kelton defne an aulary varable d = S s to estmate the optmal values for s and S; the (re)order quantty, however, s not a fed quantty (the order quantty Q vares wth the actual nventory poston, defned as stock on hand, mnus customer backorders, plus outstandng suppler orders; see Bashyam and Fu (1998)). The lead-tme of an order s unformly dstrbuted between.5 and 1 month. Demand s satsfed mmedately f the nventory level I s at least as large as the demand sze D. Otherwse, the demand s possbly partly backlogged and delvered as soon as the nventory s replenshed. The backlog costs are $5 per month per tem backlogged. Holdng costs per tem per month are $1. Orderng costs consst of a setup cost of $32 per order plus ncremental costs of $3 per tem. Law and Kelton smulate the system for 12 months, startng wth an ntal nventory I ( ) = 6 ;.e., ths smulaton model s termnatng (eample 1 estmates a steady-state mean of an M/M/1). Law and Kelton obtan fve replcates for each of the 36 combnatons formed by s =, 2, 4, 6, 8, 1 and d =, 2, 4, 6, 8, 1. Based on these 18 I/O data, they ft the followng second-order polynomal regresson (meta) model for the average monthly total costs called R: 17

21 R ˆ( s, d) = s 1.24d +.14sd +.7s.1d. (26) They compare ths model s predctons wth the true E (R) estmated from 1 replcates for each of 42 new and old combnatons formed by s =, 5, 1,, 1, and d = 5, 1, 15,, 1. We, however, replace (26) by a Krgng model, ftted to the same I/O data (mplyng 36 average outputs), and compare our Krgng predctons wth the true outputs. We fnd that our Krgng model gves more accurate predctons than the regresson model (26); see the Append for detals. Net, we change the desgn from Law and Kelton s grd (wth 16 combnatons of the two nputs s and d wth ( s, d) [2, 8] [2, 8] ) nto our desgn (wth the same fnal desgn sze, namely 16); see Fgure 7. Insert Fgure 7 Lke Law and Kelton, we obtan 5 replcatons per nput combnaton. Net, we ft a Krgng model, and predct 81 true outcomes for the test set ( s, d) {1, 2,, 9} {1, 2,, 9} (a subset of Law and Kelton s true set). Agan, we calculate EIMSE and EMaIMSE defned n (6) and (8). To reduce nose, we repeat ths procedure 5 tmes (usng non-overlappng PRN streams) for our desgns and LHS. Our desgns gve substantal better EIMSE and EMaIMSE; see Table 5. Insert Table 5 We conclude that n ths eample, our sequental desgn also gves more accurate Krgng predctons than LHS wth a fed desgn sze. 6. Conclusons and future research In practce, smulaton often requres much computer tme per run (or replcate) so t s desrable to have an effcent epermental desgn for nterpolaton. It s well known n mathematcal statstcs that sequental desgns are more effcent than fed-sample desgns. Our specfc sequental desgns add as the net nput to be smulated, the nput wth the 18

22 mamum estmated varance for the output predcted at specfc canddate nputs. To obtan such predctons, we use Krgng; to estmate the varances of the Krgng predctors, we use bootstrappng. We appled ths procedure to estmate () the epected steady-state watng tme n M/M/1 smulaton, and () the epected cost n termnatng nventory (s, S) smulaton. We compared the Krgng predcton errors of our sequental desgns and those of fed-sample LHS. Our results show that our procedure gves ndeed smaller predcton errors. In future research, (asymptotc) proofs of the performance of our procedure mght be derved. More epermentaton and analyses may be done to derve rules of thumb for our procedure s parameters, such as the ntal desgn sze n and the ntal number of replcates m. Our procedure may be appled to eamples more complcated than the M//M/1 queueng model or the (s, S) nventory model. Stoppng rules based on a measure of accuracy or precson may be nvestgated. Besdes LHS, other desgns wth prefed szes may be eplored; for eample, mn-ma desgns. Besdes Ordnary Krgng, other metamodels may be used to analyze the I/O data. For eample, the optmal weghts n Ordnary Krgng assume that the predctors equal the average outputs at the nputs already observed; droppng ths constrant mples that new Krgng software must be developed. New Krgng weghts may be derved, replacng the IMSE crteron by the mamum squared error crteron. Besdes Krgng, other nterpolaton models may be used; for eample, lnear or nonlnear regresson metamodels. We focus on senstvty analyss; searchng for the optmal nput of the smulaton model requres further research. Append Law & Kelton s (2, p. 651) data set conssts of 5 replcates for each of the 16 nput combnatons formed by s {2, 4, 6, 8} and d {2, 4, 6, 8} (ths set s a subset of the one n the man tet). Based on ths nput set, we fnd the followng estmates ˆ = ( ,.263,.533,.88,.52,.38), whch agrees wth ther values up to two decmals. As a test set (used to compare regresson and Krgng metamodels), we use ther true I/O set, whch conssts of 1 replcates of each of 42 = 21 2 nput combnatons wth 19

23 s {, 5, 1,, 1} and d { 5, 1, 15,, 1}. For the regresson model we fnd an EIMSE of 145.5, whereas for the Krgng model we fnd an EIMSE of So Krgng does result n a smaller EIMSE. Ths EIMSE, however, s stll rather large, because we have to etrapolate the data outsde the regon [ 2,8] [2,8]. In general, we strongly recommend avodng etrapolaton when fttng a metamodel; ndeed, n smulaton t s easy to avod etrapolaton because we can select our own nput combnatons. Law and Kelton also use a data set consstng of 18 I/O combnatons, namely 5 replcates for each of 36 nput combnatons wth s {, 2, 4, 6, 8, 1} and d {, 2, 4, 6, 8, 1}. We use ther computer program (mported from ther web page to generate the output. Agan, we ft both a second-order regresson model and a Krgng model. We compare the two ftted models va the true data set. For the regresson model, we fnd an EIMSE of 152., whereas for the Krgng model we fnd an EIMSE of only 14. (n ths case etrapolaton s ndeed avoded. Acknowledgements We thank Dck den Hertog (Tlburg Unversty) for hs comments on an earler verson, whch lead to the addtonal Monte Carlo eperments reported n Secton 5, and Ruud Brekelmans (Tlburg Unversty) for helpng us to mport Law and Kelton s C-program codes nto our Matlab program. References Banevc, M. and P. Swtzer (22), Bayesan network desgns for varance as a functon of the locaton. Proceedngs of the 22 JSM Conference, Secton on Statstcs and the Envronment, New York, NY Bashyam, S. and M.C. Fu (1998), Optmzaton of (s, S) nventory systems wth random lead tmes and a servce level constrant. Management Scence. 44, no. 12, pp Bates, R.A., R.J. Buck, E. Rccomagno and H.P. Wynn (1996), Epermental desgn and observaton for large systems. Royal Statstcal Socety. 58, no. 1, pp Bo, G.E.P., W.G. Hunter and J.S. Hunter (1978), Statstcs for epermenters: an ntroducton to desgn, data analyss and model buldng. John Wley & Sons, Inc., New York 2

24 Crary, S.B. (22), Desgn of computer eperments for metamodel generaton, Analog Integrated Crcuts and Sgnal Processng, 32, pp Cresse, N.A.C. (1993), Statstcs for spatal data. John Wley & Sons, Inc., New York Efron, B. and R.J. Tbshran (1993). An ntroducton to the bootstrap. Chapman & Hall, New York Ferr, M. and M. Pccon (1992), Optmal selecton of statstcal unts. Computatonal Statstcs & Data Analyss, 13, pp Ghosh, B.K. and P.K. Sen (edtors), 1991, Handbook of sequental analyss. Marcel Dekker, Inc., New York Den Hertog, D., J.P.C. Klenen, and A.Y.D. Sem (25), The correct Krgng varance estmated by bootstrappng. Journal of the Operatonal Research Socety (accepted; preprnt: Journel, A.G. and C.J. Hubregts (1978), Mnng geostatstcs, Academc Press, London Klenen, J.P.C. (1987), Statstcal tools for smulaton practtoners. Marcel Dekker, Inc., New York Klenen, J.P.C. and D. Deflandre (25), Valdaton of regresson metamodels n smulaton: Bootstrap approach. European Journal of Operatonal Research (n press) Klenen, J.P.C., S.M. Sanchez, T.W. Lucas and T.M. Coppa (25), A user s gude to the brave new world of desgnng smulaton eperments. INFORMS Journal on Computng (accepted as State-of-the-Art Revew) Klenen, J.P.C. and W.C.M. van Beers (24), Applcaton-drven sequental desgns for smulaton eperments: Krgng metamodelng. Journal of the Operatonal Research Socety, no. 55, pp Klenen, J.P.C. and W. van Groenendaal (1992), Smulaton: a statstcal perspectve. John Wley, Chchester (England) Koehler, J.R. and A.B. Owen (1996), Computer eperments. Handbook of statstcs, by S. Ghosh and C.R. Rao, vol. 13, pp Kreyszg, E. (197), Introductory mathematcal statstcs: prncples and methods. John Wley & Sons, Inc., New York Law, A.M. and W.D. Kelton (2), Smulaton modelng and analyss, thrd edton, McGraw-Hll, Boston Lophaven, S.N., H.B. Nelsen and J. Søndergaard (22), A Matlab Krgng toolbo. Techncal report IMM-TR-22-12, Techncal Unversty of Denmark 21

25 McKay, M.D., R.J. Beckman and W.J. Conover (1979), A comparson of three methods for selectng values of nput varables n the analyss of output from a computer code. Technometrcs, 21, no. 2, pp (reprnted n 2: Technometrcs, 42, no. 1, pp Moler, C. (1995), Random thoughts. MATLAB News & Notes, pp Myers, R.H. and D.C. Montgomery (22). Response surface methodology: process and product optmzaton usng desgned eperments; second edton. Wley, New York Park, S., J.W. Fowler, G.T. Mackulak, J.B. Keats, and W.M. Carlyle (22), D-optmal sequental eperments for generatng a smulaton-based cycle tme-throughput curve. Operatons Research, 5, no. 6, pp Sacks, J., W.J. Welch, T.J. Mtchell and H.P. Wynn (1989), Desgn and analyss of computer eperments. Statstcal Scence, 4, no. 4, pp Santner, T.J., B.J. Wllams, and W.I. Notz (23), The desgn and analyss of computer eperments. Sprnger-Verlag, New York Sasena, M.J, P. Papalambros, and P. Goovaerts (22), Eploraton of metamodelng samplng crtera for constraned global optmzaton. Engneerng Optmzaton 34, no.3, pp Smpson, T.W., T.M. Mauery, J.J. Korte, and F. Mstree (21), Krgng metamodels for global appromaton n smulaton-based multdscplnary desgn optmzaton. AIAA Journal, 39, no. 12, 21, pp Van Beers, W. and J.P.C. Klenen (23), Krgng for nterpolaton n random smulaton. Journal of the Operatonal Research Socety, no. 54, pp Van Groengen, J.W. (2), The nfluence of varogram parameters on optmal samplng schemes for mappng by Krgng. Geoderma, no. 97, pp Wllams, B.J., T.J. Santner, and W.I. Notz (2), Sequental desgn of computer eperments to mnmze ntegrated response functons, Statstca Snca, 1, Wllams, B.J., T.J. Santner, and W.I. Notz (22), Sequental desgn of computer eperments for constraned optmzaton of ntegrated response functons, Workng Paper. Oho State Unversty 22

26 ---: True I/O functon * : Smulated output for Customzed Sequental Desgn O: Smulated output for Latn Hypercube Desgn y Fgure 1: Two desgns for M/M/1 wth 1 traffc rates and average smulaton outputs y 23

27 ---: True I/O functon * : Customzed Desgn predcton O: LHS predcton ŷ t Fgure 2: Predctons ŷ for the test set for M/M/1, for two desgns n replcate 1 of scenaro 1 24

28 .4.3 d scenaro Fgure 3: Dfferences d ; r e ; r; LHS e ; r; CSD = for scenaro = 1,, 8 and replcate r = 1,, d scenaro Fgure 4: Dfferences d = e e for scenaro = 1,, 8 and replcate r = 1,, 5 ma ma ma ; r ; r; LHS ; r; CSD 25

29 m Fgure 5: Number of cycles m per traffc rate for M/M/1, gven DQG 26

30 (a) constant nose (b) ncreasng nose y, yˆ y, ˆ y y (c) constant nose (d) decreasng nose y, yˆ y, yˆ y y Fgure 6: Monte Carlo eperments wth four combnatons of sgnal and nose functons; --- denotes sgnal and *** denotes I/O of Customzed Sequental Desgn 27

31 R Fgure 7: I/O smulaton data for (s, S) nventory model wth 16 scenaros denoted by { 28

32 Table 1a: A desgn epressed n standardzed factor values factor B n scenaro = Table 1b: Eght scenaros or combnatons of type-i error rate α, relatve error δ, bootstrap sample sze B, and fnal desgn sze n scenaro B n

33 Table 2a: CEIMSE e r for Customzed Sequental Desgns n 8 scenaros replcated 5 tmes, computed from test set wth 32 values scenaro e 1 e 2 e 3 e 4 e Table 2b: CEIMSE e r for LHS desgns n 8 scenaros replcated 5 tmes, computed from test set wth 32 values scenaro e 1 e 2 e 3 e 4 e

34 Table 3: Sgnfcance of estmated man effects t-statstc two-sded sgnfcance level t α =. 1 α =. 5 α =. 1 ;ν β sgnfcant sgnfcant not sgnf. β sgnfcant sgnfcant not sgnf. β not sgnf. not sgnf. not sgnf. β sgnfcant sgnfcant sgnfcant 4 βˆ 31

35 ma Table 4a: EMaIMSE e r for Customzed Sequental Desgns n 8 scenaros replcated 5 tmes, computed from test set wth 32 values EMaIMSE e for Customze Sequental Desgns scenaro e 1 e 2 e 3 e 4 e ma Table 4b: EMaIMSE e r for LHS desgns n 8 scenaros replcated 5 tmes, computed from test set wth 32 values EMaIMSE e for LHS scenaro e 1 e 2 e 3 e 4 e

36 Table 5: EIMSE and EMaIMSE for CSD and LHS for (s, S) nventory smulaton, based on test set wth 81 true values CSD LHS replcate EIMSE EMaIMSE EIMSE EMaIMSE

No CUSTOMIZED SEQUENTIAL DESIGNS FOR RANDOM SIMULATION EXPERIMENTS: KRIGING METAMODELING AND BOOTSTRAPPING

No CUSTOMIZED SEQUENTIAL DESIGNS FOR RANDOM SIMULATION EXPERIMENTS: KRIGING METAMODELING AND BOOTSTRAPPING No. 24 63 CUSTOMIZED SEQUENTIAL DESIGNS FOR RANDOM SIMULATION EXPERIMENTS: KRIGING METAMODELING AND BOOTSTRAPPING By W.C.M. van Beers, J.P.C. Klejnen July 24 ISSN 924-7815 Seq Desgn_Boot_7.doc; wrtten:

More information

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T.

Assignment 5. Simulation for Logistics. Monti, N.E. Yunita, T. Assgnment 5 Smulaton for Logstcs Mont, N.E. Yunta, T. November 26, 2007 1. Smulaton Desgn The frst objectve of ths assgnment s to derve a 90% two-sded Confdence Interval (CI) for the average watng tme

More information

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4) I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes

More information

Negative Binomial Regression

Negative Binomial Regression STATGRAPHICS Rev. 9/16/2013 Negatve Bnomal Regresson Summary... 1 Data Input... 3 Statstcal Model... 3 Analyss Summary... 4 Analyss Optons... 7 Plot of Ftted Model... 8 Observed Versus Predcted... 10 Predctons...

More information

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands Content. Inference on Regresson Parameters a. Fndng Mean, s.d and covarance amongst estmates.. Confdence Intervals and Workng Hotellng Bands 3. Cochran s Theorem 4. General Lnear Testng 5. Measures of

More information

x i1 =1 for all i (the constant ).

x i1 =1 for all i (the constant ). Chapter 5 The Multple Regresson Model Consder an economc model where the dependent varable s a functon of K explanatory varables. The economc model has the form: y = f ( x,x,..., ) xk Approxmate ths by

More information

Comparison of Regression Lines

Comparison of Regression Lines STATGRAPHICS Rev. 9/13/2013 Comparson of Regresson Lnes Summary... 1 Data Input... 3 Analyss Summary... 4 Plot of Ftted Model... 6 Condtonal Sums of Squares... 6 Analyss Optons... 7 Forecasts... 8 Confdence

More information

Economics 130. Lecture 4 Simple Linear Regression Continued

Economics 130. Lecture 4 Simple Linear Regression Continued Economcs 130 Lecture 4 Contnued Readngs for Week 4 Text, Chapter and 3. We contnue wth addressng our second ssue + add n how we evaluate these relatonshps: Where do we get data to do ths analyss? How do

More information

Chapter 11: Simple Linear Regression and Correlation

Chapter 11: Simple Linear Regression and Correlation Chapter 11: Smple Lnear Regresson and Correlaton 11-1 Emprcal Models 11-2 Smple Lnear Regresson 11-3 Propertes of the Least Squares Estmators 11-4 Hypothess Test n Smple Lnear Regresson 11-4.1 Use of t-tests

More information

Statistics for Economics & Business

Statistics for Economics & Business Statstcs for Economcs & Busness Smple Lnear Regresson Learnng Objectves In ths chapter, you learn: How to use regresson analyss to predct the value of a dependent varable based on an ndependent varable

More information

Uncertainty as the Overlap of Alternate Conditional Distributions

Uncertainty as the Overlap of Alternate Conditional Distributions Uncertanty as the Overlap of Alternate Condtonal Dstrbutons Olena Babak and Clayton V. Deutsch Centre for Computatonal Geostatstcs Department of Cvl & Envronmental Engneerng Unversty of Alberta An mportant

More information

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9 Chapter 9 Correlaton and Regresson 9. Correlaton Correlaton A correlaton s a relatonshp between two varables. The data can be represented b the ordered pars (, ) where s the ndependent (or eplanator) varable,

More information

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also

More information

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests

Simulated Power of the Discrete Cramér-von Mises Goodness-of-Fit Tests Smulated of the Cramér-von Mses Goodness-of-Ft Tests Steele, M., Chaselng, J. and 3 Hurst, C. School of Mathematcal and Physcal Scences, James Cook Unversty, Australan School of Envronmental Studes, Grffth

More information

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS M. Krshna Reddy, B. Naveen Kumar and Y. Ramu Department of Statstcs, Osmana Unversty, Hyderabad -500 007, Inda. nanbyrozu@gmal.com, ramu0@gmal.com

More information

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise. Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where y + = β + β e for =,..., y and are observable varables e s a random error How can an estmaton rule be constructed for the

More information

The Ordinary Least Squares (OLS) Estimator

The Ordinary Least Squares (OLS) Estimator The Ordnary Least Squares (OLS) Estmator 1 Regresson Analyss Regresson Analyss: a statstcal technque for nvestgatng and modelng the relatonshp between varables. Applcatons: Engneerng, the physcal and chemcal

More information

e i is a random error

e i is a random error Chapter - The Smple Lnear Regresson Model The lnear regresson equaton s: where + β + β e for,..., and are observable varables e s a random error How can an estmaton rule be constructed for the unknown

More information

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6 Department of Quanttatve Methods & Informaton Systems Tme Seres and Ther Components QMIS 30 Chapter 6 Fall 00 Dr. Mohammad Zanal These sldes were modfed from ther orgnal source for educatonal purpose only.

More information

A Robust Method for Calculating the Correlation Coefficient

A Robust Method for Calculating the Correlation Coefficient A Robust Method for Calculatng the Correlaton Coeffcent E.B. Nven and C. V. Deutsch Relatonshps between prmary and secondary data are frequently quantfed usng the correlaton coeffcent; however, the tradtonal

More information

x = , so that calculated

x = , so that calculated Stat 4, secton Sngle Factor ANOVA notes by Tm Plachowsk n chapter 8 we conducted hypothess tests n whch we compared a sngle sample s mean or proporton to some hypotheszed value Chapter 9 expanded ths to

More information

STAT 3008 Applied Regression Analysis

STAT 3008 Applied Regression Analysis STAT 3008 Appled Regresson Analyss Tutoral : Smple Lnear Regresson LAI Chun He Department of Statstcs, The Chnese Unversty of Hong Kong 1 Model Assumpton To quantfy the relatonshp between two factors,

More information

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics ECOOMICS 35*-A Md-Term Exam -- Fall Term 000 Page of 3 pages QUEE'S UIVERSITY AT KIGSTO Department of Economcs ECOOMICS 35* - Secton A Introductory Econometrcs Fall Term 000 MID-TERM EAM ASWERS MG Abbott

More information

Chapter 13: Multiple Regression

Chapter 13: Multiple Regression Chapter 13: Multple Regresson 13.1 Developng the multple-regresson Model The general model can be descrbed as: It smplfes for two ndependent varables: The sample ft parameter b 0, b 1, and b are used to

More information

Polynomial Regression Models

Polynomial Regression Models LINEAR REGRESSION ANALYSIS MODULE XII Lecture - 6 Polynomal Regresson Models Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Test of sgnfcance To test the sgnfcance

More information

Bayesian Planning of Hit-Miss Inspection Tests

Bayesian Planning of Hit-Miss Inspection Tests Bayesan Plannng of Ht-Mss Inspecton Tests Yew-Meng Koh a and Wllam Q Meeker a a Center for Nondestructve Evaluaton, Department of Statstcs, Iowa State Unversty, Ames, Iowa 5000 Abstract Although some useful

More information

Lecture 3 Stat102, Spring 2007

Lecture 3 Stat102, Spring 2007 Lecture 3 Stat0, Sprng 007 Chapter 3. 3.: Introducton to regresson analyss Lnear regresson as a descrptve technque The least-squares equatons Chapter 3.3 Samplng dstrbuton of b 0, b. Contnued n net lecture

More information

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications Durban Watson for Testng the Lack-of-Ft of Polynomal Regresson Models wthout Replcatons Ruba A. Alyaf, Maha A. Omar, Abdullah A. Al-Shha ralyaf@ksu.edu.sa, maomar@ksu.edu.sa, aalshha@ksu.edu.sa Department

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 13 13-1 Basc Busness Statstcs 11 th Edton Chapter 13 Smple Lnear Regresson Basc Busness Statstcs, 11e 009 Prentce-Hall, Inc. Chap 13-1 Learnng Objectves In ths chapter, you learn: How to use regresson

More information

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Answers Problem Set 2 Chem 314A Williamsen Spring 2000 Answers Problem Set Chem 314A Wllamsen Sprng 000 1) Gve me the followng crtcal values from the statstcal tables. a) z-statstc,-sded test, 99.7% confdence lmt ±3 b) t-statstc (Case I), 1-sded test, 95%

More information

Lecture 4 Hypothesis Testing

Lecture 4 Hypothesis Testing Lecture 4 Hypothess Testng We may wsh to test pror hypotheses about the coeffcents we estmate. We can use the estmates to test whether the data rejects our hypothess. An example mght be that we wsh to

More information

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U) Econ 413 Exam 13 H ANSWERS Settet er nndelt 9 deloppgaver, A,B,C, som alle anbefales å telle lkt for å gøre det ltt lettere å stå. Svar er gtt . Unfortunately, there s a prntng error n the hnt of

More information

More metrics on cartesian products

More metrics on cartesian products More metrcs on cartesan products If (X, d ) are metrc spaces for 1 n, then n Secton II4 of the lecture notes we defned three metrcs on X whose underlyng topologes are the product topology The purpose of

More information

Linear Approximation with Regularization and Moving Least Squares

Linear Approximation with Regularization and Moving Least Squares Lnear Approxmaton wth Regularzaton and Movng Least Squares Igor Grešovn May 007 Revson 4.6 (Revson : March 004). 5 4 3 0.5 3 3.5 4 Contents: Lnear Fttng...4. Weghted Least Squares n Functon Approxmaton...

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 31 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 6. Rdge regresson The OLSE s the best lnear unbased

More information

Research Article Green s Theorem for Sign Data

Research Article Green s Theorem for Sign Data Internatonal Scholarly Research Network ISRN Appled Mathematcs Volume 2012, Artcle ID 539359, 10 pages do:10.5402/2012/539359 Research Artcle Green s Theorem for Sgn Data Lous M. Houston The Unversty of

More information

Tracking with Kalman Filter

Tracking with Kalman Filter Trackng wth Kalman Flter Scott T. Acton Vrgna Image and Vdeo Analyss (VIVA), Charles L. Brown Department of Electrcal and Computer Engneerng Department of Bomedcal Engneerng Unversty of Vrgna, Charlottesvlle,

More information

Composite Hypotheses testing

Composite Hypotheses testing Composte ypotheses testng In many hypothess testng problems there are many possble dstrbutons that can occur under each of the hypotheses. The output of the source s a set of parameters (ponts n a parameter

More information

Statistics for Business and Economics

Statistics for Business and Economics Statstcs for Busness and Economcs Chapter 11 Smple Regresson Copyrght 010 Pearson Educaton, Inc. Publshng as Prentce Hall Ch. 11-1 11.1 Overvew of Lnear Models n An equaton can be ft to show the best lnear

More information

CIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M

CIS526: Machine Learning Lecture 3 (Sept 16, 2003) Linear Regression. Preparation help: Xiaoying Huang. x 1 θ 1 output... θ M x M CIS56: achne Learnng Lecture 3 (Sept 6, 003) Preparaton help: Xaoyng Huang Lnear Regresson Lnear regresson can be represented by a functonal form: f(; θ) = θ 0 0 +θ + + θ = θ = 0 ote: 0 s a dummy attrbute

More information

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X

3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number

More information

Statistics II Final Exam 26/6/18

Statistics II Final Exam 26/6/18 Statstcs II Fnal Exam 26/6/18 Academc Year 2017/18 Solutons Exam duraton: 2 h 30 mn 1. (3 ponts) A town hall s conductng a study to determne the amount of leftover food produced by the restaurants n the

More information

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution Department of Statstcs Unversty of Toronto STA35HS / HS Desgn and Analyss of Experments Term Test - Wnter - Soluton February, Last Name: Frst Name: Student Number: Instructons: Tme: hours. Ads: a non-programmable

More information

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Lecture 16 Statistical Analysis in Biomaterials Research (Part II) 3.051J/0.340J 1 Lecture 16 Statstcal Analyss n Bomaterals Research (Part II) C. F Dstrbuton Allows comparson of varablty of behavor between populatons usng test of hypothess: σ x = σ x amed for Brtsh statstcan

More information

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA 4 Analyss of Varance (ANOVA) 5 ANOVA 51 Introducton ANOVA ANOVA s a way to estmate and test the means of multple populatons We wll start wth one-way ANOVA If the populatons ncluded n the study are selected

More information

Chapter 5 Multilevel Models

Chapter 5 Multilevel Models Chapter 5 Multlevel Models 5.1 Cross-sectonal multlevel models 5.1.1 Two-level models 5.1.2 Multple level models 5.1.3 Multple level modelng n other felds 5.2 Longtudnal multlevel models 5.2.1 Two-level

More information

β0 + β1xi and want to estimate the unknown

β0 + β1xi and want to estimate the unknown SLR Models Estmaton Those OLS Estmates Estmators (e ante) v. estmates (e post) The Smple Lnear Regresson (SLR) Condtons -4 An Asde: The Populaton Regresson Functon B and B are Lnear Estmators (condtonal

More information

Statistical Hypothesis Testing for Returns to Scale Using Data Envelopment Analysis

Statistical Hypothesis Testing for Returns to Scale Using Data Envelopment Analysis Statstcal Hypothess Testng for Returns to Scale Usng Data nvelopment nalyss M. ukushge a and I. Myara b a Graduate School of conomcs, Osaka Unversty, Osaka 560-0043, apan (mfuku@econ.osaka-u.ac.p) b Graduate

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours UNIVERSITY OF TORONTO Faculty of Arts and Scence December 005 Examnatons STA47HF/STA005HF Duraton - hours AIDS ALLOWED: (to be suppled by the student) Non-programmable calculator One handwrtten 8.5'' x

More information

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition) Count Data Models See Book Chapter 11 2 nd Edton (Chapter 10 1 st Edton) Count data consst of non-negatve nteger values Examples: number of drver route changes per week, the number of trp departure changes

More information

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010

Parametric fractional imputation for missing data analysis. Jae Kwang Kim Survey Working Group Seminar March 29, 2010 Parametrc fractonal mputaton for mssng data analyss Jae Kwang Km Survey Workng Group Semnar March 29, 2010 1 Outlne Introducton Proposed method Fractonal mputaton Approxmaton Varance estmaton Multple mputaton

More information

Lecture Notes on Linear Regression

Lecture Notes on Linear Regression Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume

More information

T E C O L O T E R E S E A R C H, I N C.

T E C O L O T E R E S E A R C H, I N C. T E C O L O T E R E S E A R C H, I N C. B rdg n g En g neern g a nd Econo mcs S nce 1973 THE MINIMUM-UNBIASED-PERCENTAGE ERROR (MUPE) METHOD IN CER DEVELOPMENT Thrd Jont Annual ISPA/SCEA Internatonal Conference

More information

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students. PPOL 59-3 Problem Set Exercses n Smple Regresson Due n class /8/7 In ths problem set, you are asked to compute varous statstcs by hand to gve you a better sense of the mechancs of the Pearson correlaton

More information

Basic Statistical Analysis and Yield Calculations

Basic Statistical Analysis and Yield Calculations October 17, 007 Basc Statstcal Analyss and Yeld Calculatons Dr. José Ernesto Rayas Sánchez 1 Outlne Sources of desgn-performance uncertanty Desgn and development processes Desgn for manufacturablty A general

More information

Numerical Heat and Mass Transfer

Numerical Heat and Mass Transfer Master degree n Mechancal Engneerng Numercal Heat and Mass Transfer 06-Fnte-Dfference Method (One-dmensonal, steady state heat conducton) Fausto Arpno f.arpno@uncas.t Introducton Why we use models and

More information

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE THE ROYAL STATISTICAL SOCIETY 6 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE PAPER I STATISTICAL THEORY The Socety provdes these solutons to assst canddates preparng for the eamnatons n future years and for

More information

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Analyss of Varance and Desgn of Experment-I MODULE VII LECTURE - 3 ANALYSIS OF COVARIANCE Dr Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur Any scentfc experment s performed

More information

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur

Module 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:

More information

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y) Secton 1.5 Correlaton In the prevous sectons, we looked at regresson and the value r was a measurement of how much of the varaton n y can be attrbuted to the lnear relatonshp between y and x. In ths secton,

More information

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method

Comparison of the Population Variance Estimators. of 2-Parameter Exponential Distribution Based on. Multiple Criteria Decision Making Method Appled Mathematcal Scences, Vol. 7, 0, no. 47, 07-0 HIARI Ltd, www.m-hkar.com Comparson of the Populaton Varance Estmators of -Parameter Exponental Dstrbuton Based on Multple Crtera Decson Makng Method

More information

/ n ) are compared. The logic is: if the two

/ n ) are compared. The logic is: if the two STAT C141, Sprng 2005 Lecture 13 Two sample tests One sample tests: examples of goodness of ft tests, where we are testng whether our data supports predctons. Two sample tests: called as tests of ndependence

More information

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression STAT 45 BIOSTATISTICS (Fall 26) Handout 5 Introducton to Logstc Regresson Ths handout covers materal found n Secton 3.7 of your text. You may also want to revew regresson technques n Chapter. In ths handout,

More information

Originated from experimental optimization where measurements are very noisy Approximation can be actually more accurate than

Originated from experimental optimization where measurements are very noisy Approximation can be actually more accurate than Surrogate (approxmatons) Orgnated from expermental optmzaton where measurements are ver nos Approxmaton can be actuall more accurate than data! Great nterest now n applng these technques to computer smulatons

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 8. SIMPLE LINEAR REGRESSION III Ftted Values and Resduals US Domestc Beers: Calores vs. % Alcohol To each observed x, there corresponds a y-value on the ftted lne, y ˆ = βˆ + βˆ x. The are called ftted

More information

Chapter 6. Supplemental Text Material

Chapter 6. Supplemental Text Material Chapter 6. Supplemental Text Materal S6-. actor Effect Estmates are Least Squares Estmates We have gven heurstc or ntutve explanatons of how the estmates of the factor effects are obtaned n the textboo.

More information

4DVAR, according to the name, is a four-dimensional variational method.

4DVAR, according to the name, is a four-dimensional variational method. 4D-Varatonal Data Assmlaton (4D-Var) 4DVAR, accordng to the name, s a four-dmensonal varatonal method. 4D-Var s actually a drect generalzaton of 3D-Var to handle observatons that are dstrbuted n tme. The

More information

Learning Objectives for Chapter 11

Learning Objectives for Chapter 11 Chapter : Lnear Regresson and Correlaton Methods Hldebrand, Ott and Gray Basc Statstcal Ideas for Managers Second Edton Learnng Objectves for Chapter Usng the scatterplot n regresson analyss Usng the method

More information

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 13 The Smple Lnear Regresson Model and Correlaton 1999 Prentce-Hall, Inc. Chap. 13-1 Chapter Topcs Types of Regresson Models Determnng the Smple Lnear

More information

Basically, if you have a dummy dependent variable you will be estimating a probability.

Basically, if you have a dummy dependent variable you will be estimating a probability. ECON 497: Lecture Notes 13 Page 1 of 1 Metropoltan State Unversty ECON 497: Research and Forecastng Lecture Notes 13 Dummy Dependent Varable Technques Studenmund Chapter 13 Bascally, f you have a dummy

More information

Global Sensitivity. Tuesday 20 th February, 2018

Global Sensitivity. Tuesday 20 th February, 2018 Global Senstvty Tuesday 2 th February, 28 ) Local Senstvty Most senstvty analyses [] are based on local estmates of senstvty, typcally by expandng the response n a Taylor seres about some specfc values

More information

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X). 11.4.1 Estmaton of Multple Regresson Coeffcents In multple lnear regresson, we essentally solve n equatons for the p unnown parameters. hus n must e equal to or greater than p and n practce n should e

More information

Uncertainty on Fatigue Damage Accumulation for Composite Materials Toft, Henrik Stensgaard; Sørensen, John Dalsgaard

Uncertainty on Fatigue Damage Accumulation for Composite Materials Toft, Henrik Stensgaard; Sørensen, John Dalsgaard Aalborg Unverstet Uncertanty on Fatgue Damage Accumulaton for Composte Materals Toft, Henrk Stensgaard; Sørensen, John Dalsgaard Publshed n: Proceedngs of the Twenty Second Nordc Semnar on Computatonal

More information

Statistics MINITAB - Lab 2

Statistics MINITAB - Lab 2 Statstcs 20080 MINITAB - Lab 2 1. Smple Lnear Regresson In smple lnear regresson we attempt to model a lnear relatonshp between two varables wth a straght lne and make statstcal nferences concernng that

More information

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity LINEAR REGRESSION ANALYSIS MODULE IX Lecture - 30 Multcollnearty Dr. Shalabh Department of Mathematcs and Statstcs Indan Insttute of Technology Kanpur 2 Remedes for multcollnearty Varous technques have

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida Frst Year Examnaton Department of Statstcs, Unversty of Florda May 7, 010, 8:00 am - 1:00 noon Instructons: 1. You have four hours to answer questons n ths examnaton.. You must show your work to receve

More information

Generalized Linear Methods

Generalized Linear Methods Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set

More information

Chapter 3 Describing Data Using Numerical Measures

Chapter 3 Describing Data Using Numerical Measures Chapter 3 Student Lecture Notes 3-1 Chapter 3 Descrbng Data Usng Numercal Measures Fall 2006 Fundamentals of Busness Statstcs 1 Chapter Goals To establsh the usefulness of summary measures of data. The

More information

NUMERICAL DIFFERENTIATION

NUMERICAL DIFFERENTIATION NUMERICAL DIFFERENTIATION 1 Introducton Dfferentaton s a method to compute the rate at whch a dependent output y changes wth respect to the change n the ndependent nput x. Ths rate of change s called the

More information

Testing for seasonal unit roots in heterogeneous panels

Testing for seasonal unit roots in heterogeneous panels Testng for seasonal unt roots n heterogeneous panels Jesus Otero * Facultad de Economía Unversdad del Rosaro, Colomba Jeremy Smth Department of Economcs Unversty of arwck Monca Gulett Aston Busness School

More information

A Simple Inventory System

A Simple Inventory System A Smple Inventory System Lawrence M. Leems and Stephen K. Park, Dscrete-Event Smulaton: A Frst Course, Prentce Hall, 2006 Hu Chen Computer Scence Vrgna State Unversty Petersburg, Vrgna February 8, 2017

More information

A Comparative Study for Estimation Parameters in Panel Data Model

A Comparative Study for Estimation Parameters in Panel Data Model A Comparatve Study for Estmaton Parameters n Panel Data Model Ahmed H. Youssef and Mohamed R. Abonazel hs paper examnes the panel data models when the regresson coeffcents are fxed random and mxed and

More information

Introduction to Regression

Introduction to Regression Introducton to Regresson Dr Tom Ilvento Department of Food and Resource Economcs Overvew The last part of the course wll focus on Regresson Analyss Ths s one of the more powerful statstcal technques Provdes

More information

January Examinations 2015

January Examinations 2015 24/5 Canddates Only January Examnatons 25 DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR STUDENT CANDIDATE NO.. Department Module Code Module Ttle Exam Duraton (n words)

More information

Lecture 6: Introduction to Linear Regression

Lecture 6: Introduction to Linear Regression Lecture 6: Introducton to Lnear Regresson An Manchakul amancha@jhsph.edu 24 Aprl 27 Lnear regresson: man dea Lnear regresson can be used to study an outcome as a lnear functon of a predctor Example: 6

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore Sesson Outlne Introducton to classfcaton problems and dscrete choce models. Introducton to Logstcs Regresson. Logstc functon and Logt functon. Maxmum Lkelhood Estmator (MLE) for estmaton of LR parameters.

More information

STAT 511 FINAL EXAM NAME Spring 2001

STAT 511 FINAL EXAM NAME Spring 2001 STAT 5 FINAL EXAM NAME Sprng Instructons: Ths s a closed book exam. No notes or books are allowed. ou may use a calculator but you are not allowed to store notes or formulas n the calculator. Please wrte

More information

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

2016 Wiley. Study Session 2: Ethical and Professional Standards Application 6 Wley Study Sesson : Ethcal and Professonal Standards Applcaton LESSON : CORRECTION ANALYSIS Readng 9: Correlaton and Regresson LOS 9a: Calculate and nterpret a sample covarance and a sample correlaton

More information

Lossy Compression. Compromise accuracy of reconstruction for increased compression.

Lossy Compression. Compromise accuracy of reconstruction for increased compression. Lossy Compresson Compromse accuracy of reconstructon for ncreased compresson. The reconstructon s usually vsbly ndstngushable from the orgnal mage. Typcally, one can get up to 0:1 compresson wth almost

More information

Statistical analysis using matlab. HY 439 Presented by: George Fortetsanakis

Statistical analysis using matlab. HY 439 Presented by: George Fortetsanakis Statstcal analyss usng matlab HY 439 Presented by: George Fortetsanaks Roadmap Probablty dstrbutons Statstcal estmaton Fttng data to probablty dstrbutons Contnuous dstrbutons Contnuous random varable X

More information

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE Expected Outcomes Able to test the goodness of ft for categorcal data. Able to test whether the categorcal data ft to the certan dstrbuton such as Bnomal,

More information

The Gaussian classifier. Nuno Vasconcelos ECE Department, UCSD

The Gaussian classifier. Nuno Vasconcelos ECE Department, UCSD he Gaussan classfer Nuno Vasconcelos ECE Department, UCSD Bayesan decson theory recall that we have state of the world X observatons g decson functon L[g,y] loss of predctng y wth g Bayes decson rule s

More information

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA Sngle classfcaton analyss of varance (ANOVA) When to use ANOVA ANOVA models and parttonng sums of squares ANOVA: hypothess testng ANOVA: assumptons A non-parametrc alternatve: Kruskal-Walls ANOVA Power

More information

Linear Regression Analysis: Terminology and Notation

Linear Regression Analysis: Terminology and Notation ECON 35* -- Secton : Basc Concepts of Regresson Analyss (Page ) Lnear Regresson Analyss: Termnology and Notaton Consder the generc verson of the smple (two-varable) lnear regresson model. It s represented

More information

Distributions /06. G.Serazzi 05/06 Dimensionamento degli Impianti Informatici distrib - 1

Distributions /06. G.Serazzi 05/06 Dimensionamento degli Impianti Informatici distrib - 1 Dstrbutons 8/03/06 /06 G.Serazz 05/06 Dmensonamento degl Impant Informatc dstrb - outlne densty, dstrbuton, moments unform dstrbuton Posson process, eponental dstrbuton Pareto functon densty and dstrbuton

More information

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems

Chapter 5. Solution of System of Linear Equations. Module No. 6. Solution of Inconsistent and Ill Conditioned Systems Numercal Analyss by Dr. Anta Pal Assstant Professor Department of Mathematcs Natonal Insttute of Technology Durgapur Durgapur-713209 emal: anta.bue@gmal.com 1 . Chapter 5 Soluton of System of Lnear Equatons

More information

Convergence of random processes

Convergence of random processes DS-GA 12 Lecture notes 6 Fall 216 Convergence of random processes 1 Introducton In these notes we study convergence of dscrete random processes. Ths allows to characterze phenomena such as the law of large

More information

Chapter 14 Simple Linear Regression

Chapter 14 Simple Linear Regression Chapter 4 Smple Lnear Regresson Chapter 4 - Smple Lnear Regresson Manageral decsons often are based on the relatonshp between two or more varables. Regresson analss can be used to develop an equaton showng

More information

Modeling and Simulation NETW 707

Modeling and Simulation NETW 707 Modelng and Smulaton NETW 707 Lecture 5 Tests for Random Numbers Course Instructor: Dr.-Ing. Magge Mashaly magge.ezzat@guc.edu.eg C3.220 1 Propertes of Random Numbers Random Number Generators (RNGs) must

More information

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction ECONOMICS 35* -- NOTE 7 ECON 35* -- NOTE 7 Interval Estmaton n the Classcal Normal Lnear Regresson Model Ths note outlnes the basc elements of nterval estmaton n the Classcal Normal Lnear Regresson Model

More information